Google Applies Large-Scale Machine Learning To Drug Discovery


Google ($GOOG) has given the world a peek at one of the ways in which it thinks algorithms and huge datasets could reshape drug discovery. The work involves trying to make virtual drug screening more efficient using the same ethos Google applies to most problems: More data, more computing power.

A team at Google Research worked with a group from Stanford University on the project, which aimed to go beyond typical virtual drug screening models by pulling in data on multiple diseases. The desire to increase the breadth of data fed into the model was driven, in part, by the idea that machine learning is more effective when multiple problems are tackled at once. Such multi-task learning has shown promise in multiple fields but requires considerable computing power. Fortunately for Google, this is one area in which it is well equipped.

The search giant applied its large-scale neural network training system to the work. Google built the system to train networks of tens of thousands of CPU cores to perform a task. In the drug discovery project, the training entailed equipping the network to comb through 37.8 million data points covering more than 200 different biological processes. After running the system for more than 50 million CPU hours, Google has concluded the inclusion of data from multiple sources allows it to make more accurate predictions of the efficacy of a drug across different diseases.

Even greater scales are in Google’s sights. At the time of writing a paper on the project, Google had scaled the system up to 239 tasks and the upward efficiency trend was yet to plateau. Similarly, the addition of more data was found to increase efficiency, too. The researchers have cast lustful looks at the “vast private stores of experimental measurements” locked away at Big Pharma companies as they try to figure out the next steps for the model. More data and more tasks are the near-term goals for the project.

Whether the efficiency gains touted by Google will have an effect on drug discovery remains to be seen. Even the paper’s authors accept that the complexity of drug discovery could limit the impact of the approach, but overall are as optimistic as one would expect Google staffers to be about the potential for data and algorithms deliver improvements.

– read the paper (PDF)
– check out Google’s blog post
– and VentureBeat‘s take

Related Articles:
Google Ventures splashes into life sciences with its $425M purse
Google steps up Genomics pitch with $25-a-genome storage service

Source: FierceBiotechIT

Previous Blog Posts: Machine Learning & Medicine, Super Intelligence & Medicine, and Companion Dx & Drug Design

Facebooktwitterredditpinterestlinkedinmailby feather

John Macey

I was born, and principally educated, in the fields of biochemistry, and business management in the northeastern USA. However, my world-wide professional career has greatly expanded upon that US base to involve the many different segments of the Biotech / Life Science fields globally. – I have been dressed as a surgeon to view many, many surgical proceedures - The major players in the health care fields that I have worked for include: Johnson & Johnson, DuPont, Abbott Laboratories, and F. Hoffmann-La Roche. – Positions have included selling nuclear materials for both in-vivo & in-vitro (radio-pharmaceutical & radio-immunoassay) medical diagnostic purposes, in the four countries in Scandinavia, based south of Stockholm, to managing Ph. Ds at a global Swiss headquarters location. – At one time I held a position of Strategy Manager for Europe, Middle-East & Africa (EME&A) for a Chicago based company, but living in Germany. – I do speak fluent colloquial German. – In addition to having lived in multiple European countries, my professional career took me to Asia for well over a decade. There I had management control of Oceania, the Pacific Rim, Northern Asia, Japan, out to India. – Occasionally, management assignments have taken me to all of Latin America, and most of South America. – I am extremely culturally aware, a skilled negotiator, and a seasoned manager of men and science. – My one abiding passion has always been computing, data, and analysis. As such, my main computer operating system is Linux, and open-source computer applications. – I do also run Microsoft 7, and Mac OS X (all 3 operating systems on the same H-P Ultrabook). I hope you enjoy your time on the Blog, and should you have any comments / feedback please feel free to email me @, or visit my Linux Web Site @ (always evolving) – John J. Macey – AKA Adler, which in German means Eagle – Wildwood, New Jersey - Together, we can expand your global markets - with our partnerships. The partnerships are global utlizing multiple Law, Regulatory, Seasoned Management, Employment, M&A and buidlers of Business. Contact use.