25
HPC at NIBR Nick Holway, NIBR Scientific Computing Group Speedup 2016 September 15, 2016 Twitter: @nickholway LinkedIn: https://ch.linkedin.com/in/nickholway Novartis Institutes for Biomedical Research (NIBR)

Accelerating Drug Discovery using HPC

  • Upload
    dothuan

  • View
    232

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Accelerating Drug Discovery using HPC

HPC at NIBRNick Holway, NIBR Scientific Computing GroupSpeedup 2016September 15, 2016Twitter: @nickholwayLinkedIn: https://ch.linkedin.com/in/nickholway

Novartis Institutes for Biomedical Research(NIBR)

Page 2: Accelerating Drug Discovery using HPC

Novartis Institutes for Biomedical Research

Today’s talk

1. HPC at NIBR – a quick introduction

2. HPC in the cloud

3. Accelerating a “compound search engine” using HPC

4. Expediting drug discovery with GPUs

Public2

Page 3: Accelerating Drug Discovery using HPC

Novartis Institutes for Biomedical Research

HPC at NIBR - Hardware

• x86 servers– Intel Xeon– 128-768GB RAM– FDR Infiniband– 10GigE

• Nvidia GPUs on some servers• Isilon storage

– CIFS/NFS– 10GigE to Arista switches

• Lustre– Scratch

Public3

Page 4: Accelerating Drug Discovery using HPC

Novartis Institutes for Biomedical Research

HPC at NIBR - Software

• RHEL 6.x

• Univa Grid Engine for scheduling

• Software compilation & configuration– Easybuild– Modules– GCC, Intel, Nvidia compilers

• Languages: C++, Fortran, CUDA, Python, R, Matlab

Public4

Page 5: Accelerating Drug Discovery using HPC

Novartis Institutes for Biomedical Research

HPC at NIBR - Humans

• Global team (Europe, USA, Asia)

• Complementary backgrounds and skills– Sysadmins– Mathematicians– Scientists

• HPCWire award winners in 2014

• NB: HPC exists elsewhere in the Company for Clinical Trial analysis, CFD etc.

Public5

Page 6: Accelerating Drug Discovery using HPC

HPC in the cloud

Page 7: Accelerating Drug Discovery using HPC

Novartis Institutes for Biomedical Research

HPC in the cloud

• NIBR have used Amazon EC2 for compute workloads– Cycle computing

• ISVs eg DNANexus– Bioinformatics NGS

Public7

Page 8: Accelerating Drug Discovery using HPC

Novartis Institutes for Biomedical Research

Docking at scale in the cloud

• Ligand-protein docking is “to predict the position and orientation of a ligand (a small molecule) when it is bound to a protein receptor or enzyme” (Wikipedia)

• Embarrassingly parallel - compute-heavy / data-light

• We used the cloud to screen 10 million molecules against a cancer target

Public8

Page 9: Accelerating Drug Discovery using HPC

Novartis Institutes for Biomedical Research

How we did it

• Cycle computing’s software (Cycle server, Cyclecloud)

• Over 10,000 EC2 spot instances– Extensive benchmarking to select instance type

• Licence files (licence servers cannot cope with the load)

• Proprietary compounds run in NIBR’s VPC, others in “public”

• See http://opensource.nibr.com/videos/aws-litster/ and http://cyclecomputing.com/novartis-taps-cloud-hpc-for-faster-drug-discovery-better-science/

Public9

Page 10: Accelerating Drug Discovery using HPC

Novartis Institutes for Biomedical Research

Where we’re going in the cloud

• “Cloud by default” for many non-HPC applications

• Clinical data (subject to “informed consent”)

• HPC where appropriate– IB etc for tightly-coupled parallel jobs usually unavailable– Data locality challenging

Public10

Page 11: Accelerating Drug Discovery using HPC

Accelerating a compound target search engineSlides courtesy Douglas Selenger

Page 12: Accelerating Drug Discovery using HPC

Novartis Institutes for Biomedical Research

Introduction

• There are many disparate public and private sources of information which is hard for experts to query and almost impossible for “normal” scientists

• Scientists would like to ask questions like “What is the target and Mechanism of Action (MoA) of my compound?”

• MOA Central is a web-based tool

Public12

Page 13: Accelerating Drug Discovery using HPC

Novartis Institutes for Biomedical Research

”Flow” of a query

Public13

Page 14: Accelerating Drug Discovery using HPC

Novartis Institutes for Biomedical Research

Diagrammatic network

Public14

Page 15: Accelerating Drug Discovery using HPC

Novartis Institutes for Biomedical Research

Example output

Glivec (Imatinib)

Public15

Page 16: Accelerating Drug Discovery using HPC

Novartis Institutes for Biomedical Research

Impact of HPC

• Our scientists developed a tool, MOA Central, using graph analysis techniques using well known workflow software

• MOA Central worked so well that the server couldn’t keep up with demand

• We helped port it to Python (Pandas, SciKit etc)– Large queries and data preparation can now run on the cluster– Version control!

• Moving from CSV files, database queries & web services to HDF5 will improve scalability

Public16

Page 17: Accelerating Drug Discovery using HPC

Novartis Institutes for Biomedical Research

Want to know more about MOA Central?• Look for more information at

https://www.researchgate.net/profile/Douglas_Selinger

Public17

Page 18: Accelerating Drug Discovery using HPC

Accelerating Motor Neuron Disease drug discovery with GPUsSlides courtesy Imtiaz Hossain

Page 19: Accelerating Drug Discovery using HPC

Novartis Institutes for Biomedical Research

In-vitro model for neuromuscular junctions• Faulty junctions between motor neurons and muscle

cells are implicated in MND

• We’d like to create a drug which corrects this

• Motor neurons & myotube (muscle fibre) cells were “co-cultured” in a “plate” to which drug candidates are added

• Cells were imaged in real time to measure their contractility

• This is very hard to see by eye and also hard to segment using computers

Public19

Page 20: Accelerating Drug Discovery using HPC

Novartis Institutes for Biomedical Research

What do the cells look like?

Public20

Page 21: Accelerating Drug Discovery using HPC

Novartis Institutes for Biomedical Research

Motion estimated with Optic Flow

Public21

Different contracting regions

Total area under contraction

Page 22: Accelerating Drug Discovery using HPC

Novartis Institutes for Biomedical Research

Impact of HPC

• A good joint project between bench scientists, lab automation experts & informaticians

• 80x increase of throughput compared to CPU

• NIBR scientists have access to new method of monitoring myotube contractility

Public22

Page 23: Accelerating Drug Discovery using HPC

Novartis Institutes for Biomedical Research

The future

• GPUs– Deep learning– Cryo-EM

• Real time collection & processing of data from clinical trials

• Integration of “big data” technologies such as Apache Spark into HPC

Public23

Page 24: Accelerating Drug Discovery using HPC

Thank you

Page 25: Accelerating Drug Discovery using HPC

Novartis Institutes for Biomedical Research

Backup: MOA Central predicting a side effect

Public25