41
NIH Bioassay Research Database (BARD) AJIT JADHAV NIH/NCATS SEPTEMBER 25, 2013

NIH Bioassay Research Database (BARD) - ChemAxon Bioassay Research Database (BARD) AJIT JADHAV NIH/NCATS SEPTEMBER 25, 2013 Presentation Highlights • Introduction to NCATS ‘Open

Embed Size (px)

Citation preview

Page 1: NIH Bioassay Research Database (BARD) - ChemAxon Bioassay Research Database (BARD) AJIT JADHAV NIH/NCATS SEPTEMBER 25, 2013 Presentation Highlights • Introduction to NCATS ‘Open

NIH Bioassay Research Database

(BARD)

AJIT JADHAV

NIH/NCATS

SEPTEMBER 25, 2013

Page 2: NIH Bioassay Research Database (BARD) - ChemAxon Bioassay Research Database (BARD) AJIT JADHAV NIH/NCATS SEPTEMBER 25, 2013 Presentation Highlights • Introduction to NCATS ‘Open

Presentation Highlights

• Introduction to NCATS

‘Open Source’ Drug Discovery

• Motivations for BARD

• Interacting with BARD

• Extensibility & Community

Page 3: NIH Bioassay Research Database (BARD) - ChemAxon Bioassay Research Database (BARD) AJIT JADHAV NIH/NCATS SEPTEMBER 25, 2013 Presentation Highlights • Introduction to NCATS ‘Open

The Best of Times, The Worst of Times

• Poor transition of basic or

clinical observations into to

interventions that tangibly

improve human health

• Drug/device/diagnostic

development system in crisis

• Clinical trials system in crisis

• Poor adoption of demonstrably

useful interventions

Fundamental science unprecedentedly advanced, but:

People unhealthier and funders of biomedical research

enterprise (public and private) impatient

Page 4: NIH Bioassay Research Database (BARD) - ChemAxon Bioassay Research Database (BARD) AJIT JADHAV NIH/NCATS SEPTEMBER 25, 2013 Presentation Highlights • Introduction to NCATS ‘Open

NCATS Mission

To catalyze the generation of innovative methods and

technologies that will enhance the development,

testing and implementation of diagnostics and

therapeutics across a wide range of human diseases

and conditions.

Page 5: NIH Bioassay Research Database (BARD) - ChemAxon Bioassay Research Database (BARD) AJIT JADHAV NIH/NCATS SEPTEMBER 25, 2013 Presentation Highlights • Introduction to NCATS ‘Open

Catalyzing Collaborations Within NIH

NEI

NCI NHLBI NIAID

NIDCR

NIDDK NIAMS

NIDA

CIT

NIEHS

NIMH

NINDS

NCATS

NCCAM

NIMHD

NIDCD

NIGMS NINR

NIAAA

NICHD

NLM

CC

OD

NIA

NHGRI

FIC

NIBIB

CSR

Page 6: NIH Bioassay Research Database (BARD) - ChemAxon Bioassay Research Database (BARD) AJIT JADHAV NIH/NCATS SEPTEMBER 25, 2013 Presentation Highlights • Introduction to NCATS ‘Open

Catalyzing Collaborations Outside NIH

• Complements ― does not

compete ― with the work of

others

• Revolutionizes the process of

translation by promoting

innovative research

• Galvanizes and supports new

partnerships

• Supports and augments

regulatory science and its

application

• Expands the precompetitive

space

NCATS

Biotech

FDA

Academia

Pharma Non-

Profits

Advocacy

Groups

Page 7: NIH Bioassay Research Database (BARD) - ChemAxon Bioassay Research Database (BARD) AJIT JADHAV NIH/NCATS SEPTEMBER 25, 2013 Presentation Highlights • Introduction to NCATS ‘Open

NCATS Programs and Initiatives

Clinical and Translational Science Activities

• Clinical and Translational Science Awards

Rare Diseases Research and Therapeutics

• Therapeutics for Rare and Neglected Diseases

• Bridging Interventional Development Gaps

• Office of Rare Diseases Research

Re-engineering Translational Sciences

• NIH Chemical Genomics Center

• Toxicology in the 21st Century

Page 8: NIH Bioassay Research Database (BARD) - ChemAxon Bioassay Research Database (BARD) AJIT JADHAV NIH/NCATS SEPTEMBER 25, 2013 Presentation Highlights • Introduction to NCATS ‘Open

KO mice genome-wide (KOMP)

Creating a Human Genome Translation Toolbox

Transcriptome

Reference Sets

(MRT Project)

Small molecules

(MLP)

siRNAs

cDNA collection (MGC)

ENCyclopedia Of

DNA Elements

Page 9: NIH Bioassay Research Database (BARD) - ChemAxon Bioassay Research Database (BARD) AJIT JADHAV NIH/NCATS SEPTEMBER 25, 2013 Presentation Highlights • Introduction to NCATS ‘Open

The “Non-Druggable” Genome Problem

GPCRs

Enzymes

Ion Channels

Othe

r Unknown

46%

Drews, J. (2000) Science 287:1962 Venter et al., (2001) Science 291:1304

(n=483)

Receptor (1543, 5%)

Nuclear Receptors

Drug Target Classes Human Genome

Page 10: NIH Bioassay Research Database (BARD) - ChemAxon Bioassay Research Database (BARD) AJIT JADHAV NIH/NCATS SEPTEMBER 25, 2013 Presentation Highlights • Introduction to NCATS ‘Open

Molecular Libraries Program Enabled by

Convergent Developments

HTOS

Compound Libraries

Robotic Technology

Human Genome Project

Availability of targets

Availability of screening, ADME

Availability of compounds

“Open Source” Chemical Biology and Drug Discovery

Assay Technology

Page 11: NIH Bioassay Research Database (BARD) - ChemAxon Bioassay Research Database (BARD) AJIT JADHAV NIH/NCATS SEPTEMBER 25, 2013 Presentation Highlights • Introduction to NCATS ‘Open

NIH Chemical Genomics Center • Obligatory collaboration

model

• Currently > 230 collaborations with investigators worldwide

• Assay development, HTS, chemical informatics, medicinal chemistry: “target to lead”

• Focus is unprecedented targets, rare/neglected diseases

• Mission Chemical and siRNA

probes/leads

New technologies/paradigms to improve efficiency and success rates of target-to-lead stage of drug development

Chemical genomics: general principles of siRNA action, small molecule – target interactions

Page 12: NIH Bioassay Research Database (BARD) - ChemAxon Bioassay Research Database (BARD) AJIT JADHAV NIH/NCATS SEPTEMBER 25, 2013 Presentation Highlights • Introduction to NCATS ‘Open

NCATS’ Quantitative HTS (qHTS)

1536-well plates, inter-plate

dilution series

Assay volumes 5 μL

Assay concentration ranges

over 4 logs (high:~ 60 μM) Automated curve fitting and classification

Automated concentration-response

data collection

A

B

C

D

Page 13: NIH Bioassay Research Database (BARD) - ChemAxon Bioassay Research Database (BARD) AJIT JADHAV NIH/NCATS SEPTEMBER 25, 2013 Presentation Highlights • Introduction to NCATS ‘Open

Comprehensive Drug Repurposing Library

~3500 drugs

• Malaria profiling, Yuan, et al. Science. 2011 Aug 5;333(6043):724-9.

• Chronic Lymphocytic Leukemia – IND filing 2011 • Niemann-Pick C – IND filing 2012 • Other rare disease repurposing projects in pipeline • Collaboration with Lilly Open Innovation Drug

Discovery

Page 14: NIH Bioassay Research Database (BARD) - ChemAxon Bioassay Research Database (BARD) AJIT JADHAV NIH/NCATS SEPTEMBER 25, 2013 Presentation Highlights • Introduction to NCATS ‘Open

Drug Repurposing Library – “NPC Browser”

http://tripod.nih.gov/npc/

Page 15: NIH Bioassay Research Database (BARD) - ChemAxon Bioassay Research Database (BARD) AJIT JADHAV NIH/NCATS SEPTEMBER 25, 2013 Presentation Highlights • Introduction to NCATS ‘Open

Example Probes from NCATS • ML165/RUC2

Collaborator: Barry Coller Lab (Rockefeller University)

Mechanism: potent inhibitor of platelet IIb 3 receptor

Indication: myocardial infarction

Status: IND filing on advanced derivative expected by early 2014

Publication: Zhu, et al. Science Translational Medicine. 2012 Mar

14;4(125):125ra32.

• ML290 Collaborator: Alexander Agoulnik (Florida International University)

Mechanism: potent and selective agonist of relaxin receptor RXFP1

Indication: chronic heart failure

Status: Will be developed in collaboration with a pharma company

Publication: Xiao, et al. Nature Communications. 2013;4:1953.

ML165/RUC2 Key Personnel: C Thomas, M Shen, W Huang, J McCoy NRF2 Key Personnel: M Boxer, M Shen, J Rohde, J Li, M Ferrer, Y Zhang

Page 16: NIH Bioassay Research Database (BARD) - ChemAxon Bioassay Research Database (BARD) AJIT JADHAV NIH/NCATS SEPTEMBER 25, 2013 Presentation Highlights • Introduction to NCATS ‘Open

Example Probes from NCATS • ML365 and ML285

Collaborator: Lew Cantley (Harvard) and Matt Vander Heiden (MIT)

Mechanism: potent activators of PKM2

Indication: various cancers

Status: collaborating with public and private organizations

Publications: Anastasiou, et al. Science. 2011 Dec 2;334(6060):1278-83.

Anastasiou, et al. Nature Chemical Biology. 2012 Oct;8(10):839-47.

• ML323 Collaborator: Zhihao Zhuang(University of Delaware)

Mechanism: potent and selective inhibitor of USP1/UAF1

Indication: various cancers

Status: in vivo POC stage, to be further developed

Publication: Nature Chemical Biology, under review.

ML165/RUC2 Key Personnel: C Thomas, M Shen, W Huang, J McCoy NRF2 Key Personnel: M Boxer, M Shen, J Rohde, J Li, M Ferrer, Y Zhang

Structure released Dec. 2013

ML323

Page 17: NIH Bioassay Research Database (BARD) - ChemAxon Bioassay Research Database (BARD) AJIT JADHAV NIH/NCATS SEPTEMBER 25, 2013 Presentation Highlights • Introduction to NCATS ‘Open

The BioAssay Research Database

• Originated from the NIH Molecular Libraries

Program

• Motivated to make the bioassay data generated by

the MLP more accessible and amenable to

exploration and hypothesis generation

• Joint effort between NCATS, UNM, U. Miami,

Broad, Vanderbilt, Burnham, Scripps

Page 18: NIH Bioassay Research Database (BARD) - ChemAxon Bioassay Research Database (BARD) AJIT JADHAV NIH/NCATS SEPTEMBER 25, 2013 Presentation Highlights • Introduction to NCATS ‘Open

NIH Molecular Libraries Program – Glenn McFadden, Ajay Pillai

NIH Chemical Genomics Center – Chris Austin (PI), John Braisted,

Rajarshi Guha, Ajit Jadhav, Trung Nguyen, Tyler Peryea, Noel Southall,

Cordelle Tanega

Broad Institute – Benjamin Alexander, Jacob Asiedu, Kay Aubrey, Joshua

Bittker, Steve Brudz, Simon Chatwin, Paul Clemons, Vlado Dancik, Siva

Dandapani, Andrea de Souza, Dan Durkin, David Lahr, Jeri Levine, Judy

McGloughlin, Phil Montgomery, Jose Perez, Stuart Schreiber (PI), Gil

Walzer, Xiaorong Xiang

University of New Mexico – Cristian Bologa, Steve Mathias, Tudor Oprea,

Larry Sklar (PI), Oleg Ursu, Anna Waller, Jeremy Yang

University of Miami – Saminda Abeyruwan, Hande Küküc, Vance

Lemmon, Ahsan Mir, Magdalena Przydzial, Kunie Sakurai, Stephan

Schürer, Uma Vempati, Ubbo Visser

Vanderbilt University – Eric Dawson, Bill Graham, Craig Lindsley (PI),

Shaun Stauffer

Sanford-Burnham Medical Research Institute – “T.C.” Chung, Jena

Diwan, Michael Hedrick, Gavin Magnuson, Siobhan Malany, Ian Pass,

Anthony Pinkerton, Derek Stonich, John Reed (PI)

Scripps Research Institute – Yasel Cruz, Mark Southern,

Hugh Rosen (PI)

BARD Collaborators

Page 19: NIH Bioassay Research Database (BARD) - ChemAxon Bioassay Research Database (BARD) AJIT JADHAV NIH/NCATS SEPTEMBER 25, 2013 Presentation Highlights • Introduction to NCATS ‘Open

Goals of BARD

BARD’s mission is to enable novice and expert scientists to

effectively utilize MLP and other public data to generate new

hypotheses

• Developed as an open-source, industrial-strength platform

to support public translational research

• Foster new methods to interpret & analyze chemical biology data

• Develop and adopt an Assay Data Standard

• Enable co-location of data and methods

Page 20: NIH Bioassay Research Database (BARD) - ChemAxon Bioassay Research Database (BARD) AJIT JADHAV NIH/NCATS SEPTEMBER 25, 2013 Presentation Highlights • Introduction to NCATS ‘Open

Components of the BARD Platform

CAP, Data Dictionary,

and Results

Deposition Data

model created &

populated

CAP UI with View and

basic editing

Dictionary defined as

OWL using Protégé

Warehouse loaded

with all PubChem

AIDs and results

Warehouse loaded

with GO terms, KEGG

terms, and DrugBank

annotations

Page 21: NIH Bioassay Research Database (BARD) - ChemAxon Bioassay Research Database (BARD) AJIT JADHAV NIH/NCATS SEPTEMBER 25, 2013 Presentation Highlights • Introduction to NCATS ‘Open

Interacting with the BARD

Page 22: NIH Bioassay Research Database (BARD) - ChemAxon Bioassay Research Database (BARD) AJIT JADHAV NIH/NCATS SEPTEMBER 25, 2013 Presentation Highlights • Introduction to NCATS ‘Open

Searching the BARD

• Full text search via Lucene/Solr

Key entry point for new users

Search code runs queries in parallel

Fast faceting, auto suggest, filters

• Entities are linked manually via Solr schema

Allows us to pick up entities related to the query

Supply matching context

• Custom NCATS code for fast structure

searches

Page 23: NIH Bioassay Research Database (BARD) - ChemAxon Bioassay Research Database (BARD) AJIT JADHAV NIH/NCATS SEPTEMBER 25, 2013 Presentation Highlights • Introduction to NCATS ‘Open

The BARD API

• A RESTful application programming interface

• Access to individual & collections of entities

• Documented via the wiki

• Versioned

• Hit a URL, get a JSON response

Every language supports parsing JSON documents

Easy to inspect via the browser or REST clients

http://bard.nih.gov/api/v18/

Page 24: NIH Bioassay Research Database (BARD) - ChemAxon Bioassay Research Database (BARD) AJIT JADHAV NIH/NCATS SEPTEMBER 25, 2013 Presentation Highlights • Introduction to NCATS ‘Open

API Architecture

• Java, read-only, deployed on Glassfish cluster

• Different functionality

hosted in different

containers

Maintenance, security

Stability

Performance

API

Text

Search

Struct

Search

Data Warehouse

Plugins

Page 25: NIH Bioassay Research Database (BARD) - ChemAxon Bioassay Research Database (BARD) AJIT JADHAV NIH/NCATS SEPTEMBER 25, 2013 Presentation Highlights • Introduction to NCATS ‘Open

Open Source as Far as Possible

ETL Database Text Search Engine Structure Search Engine

Caching Layer

http://bard.nih.gov/api

Jersey Webapps

deployed on HA

Application

Server Cluster

Page 26: NIH Bioassay Research Database (BARD) - ChemAxon Bioassay Research Database (BARD) AJIT JADHAV NIH/NCATS SEPTEMBER 25, 2013 Presentation Highlights • Introduction to NCATS ‘Open

API Resources

• Covers many data types

• Each resource supports a

variety of sub-resources

Usually linked to other

resources

• Use /_info to see what

sub-resources are available

/assays/_info

Page 27: NIH Bioassay Research Database (BARD) - ChemAxon Bioassay Research Database (BARD) AJIT JADHAV NIH/NCATS SEPTEMBER 25, 2013 Presentation Highlights • Introduction to NCATS ‘Open

API Resources

Entity Count /assays 990

/biology 1080

/cap 1942

/compounds 42,572,799

/documents 10,499

/experiments 1314

/exptdata 32,754,242

/projects 144

/substances 113,751,456

Page 28: NIH Bioassay Research Database (BARD) - ChemAxon Bioassay Research Database (BARD) AJIT JADHAV NIH/NCATS SEPTEMBER 25, 2013 Presentation Highlights • Introduction to NCATS ‘Open

Data Warning

We’re still in the process of data curation and QC so while you can access all BARD data, keep in mind that much of it will be undergoing review and curation and so may change

Page 29: NIH Bioassay Research Database (BARD) - ChemAxon Bioassay Research Database (BARD) AJIT JADHAV NIH/NCATS SEPTEMBER 25, 2013 Presentation Highlights • Introduction to NCATS ‘Open

Summary Resources

• A number of entities have a /summary sub-resource

(Compounds, Projects)

• Aggregates information, suitable for dashboards

Page 30: NIH Bioassay Research Database (BARD) - ChemAxon Bioassay Research Database (BARD) AJIT JADHAV NIH/NCATS SEPTEMBER 25, 2013 Presentation Highlights • Introduction to NCATS ‘Open

JSON Responses

• All responses are

currently JSON

• Entities can include

other entities (recursively)

• JSON Schema is available

via /_schema

/assays/_schema

Page 31: NIH Bioassay Research Database (BARD) - ChemAxon Bioassay Research Database (BARD) AJIT JADHAV NIH/NCATS SEPTEMBER 25, 2013 Presentation Highlights • Introduction to NCATS ‘Open

Extending the API

• Concept of plugins

Expands the resource hierarchy

Has to be written in Java

• What can a plugin accept?

Anything

• What can a plugin provide?

Anything – plain text, XML, JSON, HTML, Flash

• Plugin manifest – describe available

resources, argument types

Page 32: NIH Bioassay Research Database (BARD) - ChemAxon Bioassay Research Database (BARD) AJIT JADHAV NIH/NCATS SEPTEMBER 25, 2013 Presentation Highlights • Introduction to NCATS ‘Open

What Can a Plugin Expect?

• Direct access to the database via JDBC

• Faster access to REST API via co-localization

• No local storage in the BARD warehouse

But the plugin can use its own storage (such as an

embedded database)

• Plugins have access to system JARs (e.g., XOM)

but should bundle their own required

dependencies

Page 33: NIH Bioassay Research Database (BARD) - ChemAxon Bioassay Research Database (BARD) AJIT JADHAV NIH/NCATS SEPTEMBER 25, 2013 Presentation Highlights • Introduction to NCATS ‘Open

Plugin Validation

• Run a series of checks on a plugin before

deployment

Catches manifest/resource errors

Doesn’t check for correctness (not our job)

Examines plugin Java class

Examines final plugin package

• Run as a command line tool or from your code

• Could be made into an Ant/Maven plugin

Page 34: NIH Bioassay Research Database (BARD) - ChemAxon Bioassay Research Database (BARD) AJIT JADHAV NIH/NCATS SEPTEMBER 25, 2013 Presentation Highlights • Introduction to NCATS ‘Open

What Plugins are Available?

• The plugin registry provides

a list of deployed plugins

Path to the plugin

Version

Availability

• Long term goal is to

have a plugin store

/plugins/registry/list

Page 35: NIH Bioassay Research Database (BARD) - ChemAxon Bioassay Research Database (BARD) AJIT JADHAV NIH/NCATS SEPTEMBER 25, 2013 Presentation Highlights • Introduction to NCATS ‘Open

Exemplar Plugins

• Look at the plugin repository on Github

• Ranges from

trivial calls to external service

Incorporate external tools/programs

• Current set of plugins highlight plugin

development features

• Plugins let us move non-essential functionality

out of the core API

Page 36: NIH Bioassay Research Database (BARD) - ChemAxon Bioassay Research Database (BARD) AJIT JADHAV NIH/NCATS SEPTEMBER 25, 2013 Presentation Highlights • Introduction to NCATS ‘Open

BARD- SMARTCyp

Based on Lars Olsen lab’s work. http://www.farma.ku.dk/smartcyp/

Page 37: NIH Bioassay Research Database (BARD) - ChemAxon Bioassay Research Database (BARD) AJIT JADHAV NIH/NCATS SEPTEMBER 25, 2013 Presentation Highlights • Introduction to NCATS ‘Open

More Than Just Data

BARD is not just a data store – it’s a platform

• Seamlessly interact with users’ preferred tools

• Allows the community to tailor it to their needs

• Serve as a meeting ground for experimental and

computational methods

• Enhance collaboration opportunities

Page 38: NIH Bioassay Research Database (BARD) - ChemAxon Bioassay Research Database (BARD) AJIT JADHAV NIH/NCATS SEPTEMBER 25, 2013 Presentation Highlights • Introduction to NCATS ‘Open

BARD API Source Repository

https://github.com/ncats/

Page 39: NIH Bioassay Research Database (BARD) - ChemAxon Bioassay Research Database (BARD) AJIT JADHAV NIH/NCATS SEPTEMBER 25, 2013 Presentation Highlights • Introduction to NCATS ‘Open

BARD Plugin Source Code Repository

https://github.com/ncats/bardplugins

Page 41: NIH Bioassay Research Database (BARD) - ChemAxon Bioassay Research Database (BARD) AJIT JADHAV NIH/NCATS SEPTEMBER 25, 2013 Presentation Highlights • Introduction to NCATS ‘Open

Other NCATS Cheminformatics Resources

Free tools and code at http://tripod.nih.gov/