43
Why not use data intensive science to build models of disease within a commons? Power of a Compute Space/ Beyond Current Reward Structures Five Pilots- Missing Ingredients

Stephen Friend Nature Genetics Colloquium 2012-03-24

Embed Size (px)

DESCRIPTION

Stephen Friend, Mar 24, 2012. Nature Genetics Colloquium, Boston, MA

Citation preview

Page 1: Stephen Friend Nature Genetics Colloquium 2012-03-24

Why not use data intensive science to build models of disease

within a commons?

Power of a Compute Space/ Beyond Current Reward Structures

Five Pilots- Missing Ingredients

Page 2: Stephen Friend Nature Genetics Colloquium 2012-03-24

What  is  the  problem?  

     Most  approved  therapies  assumed  indica2ons  would  represent  homogenous  popula2ons  

 Our  exis2ng  disease  models  o8en  assume  pathway  knowledge  would  be  sufficient  to  infer  correct  therapies  

Page 3: Stephen Friend Nature Genetics Colloquium 2012-03-24

Personalized Medicine 101: Capturing Single bases pair mutations = ID of responders

Page 4: Stephen Friend Nature Genetics Colloquium 2012-03-24

Reality: Overlapping Pathways

Page 5: Stephen Friend Nature Genetics Colloquium 2012-03-24
Page 6: Stephen Friend Nature Genetics Colloquium 2012-03-24
Page 7: Stephen Friend Nature Genetics Colloquium 2012-03-24

WHY  NOT  USE    “DATA  INTENSIVE”  SCIENCE  

TO  BUILD  BETTER  DISEASE  MAPS?  

Page 8: Stephen Friend Nature Genetics Colloquium 2012-03-24

what will it take to understand disease?

                   DNA    RNA  PROTEIN  (dark  maGer)    

MOVING  BEYOND  ALTERED  COMPONENT  LISTS  

Page 9: Stephen Friend Nature Genetics Colloquium 2012-03-24

Preliminary Probabalistic Models- Rosetta

Gene symbol Gene name Variance of OFPM explained by gene expression*

Mouse model

Source

Zfp90 Zinc finger protein 90 68% tg Constructed using BAC transgenics Gas7 Growth arrest specific 7 68% tg Constructed using BAC transgenics Gpx3 Glutathione peroxidase 3 61% tg Provided by Prof. Oleg

Mirochnitchenko (University of Medicine and Dentistry at New Jersey, NJ) [12]

Lactb Lactamase beta 52% tg Constructed using BAC transgenics Me1 Malic enzyme 1 52% ko Naturally occurring KO Gyk Glycerol kinase 46% ko Provided by Dr. Katrina Dipple

(UCLA) [13] Lpl Lipoprotein lipase 46% ko Provided by Dr. Ira Goldberg

(Columbia University, NY) [11] C3ar1 Complement component

3a receptor 1 46% ko Purchased from Deltagen, CA

Tgfbr2 Transforming growth factor beta receptor 2

39% ko Purchased from Deltagen, CA

Networks facilitate direct identification of genes that are

causal for disease Evolutionarily tolerated weak spots

Nat Genet (2005) 205:370

Page 10: Stephen Friend Nature Genetics Colloquium 2012-03-24

"Genetics of gene expression surveyed in maize, mouse and man." Nature. (2003)

"Variations in DNA elucidate molecular networks that cause disease." Nature. (2008)

"Genetics of gene expression and its effect on disease." Nature. (2008)

"Validation of candidate causal genes for obesity that affect..." Nat Genet. (2009) ….. Plus 10 additional papers in Genome Research, PLoS Genetics, PLoS Comp.Biology, etc

"Identification of pathways for atherosclerosis." Circ Res. (2007)

"Mapping the genetic architecture of gene expression in human liver." PLoS Biol. (2008)

…… Plus 5 additional papers in Genome Res., Genomics, Mamm.Genome

"Integrating genotypic and expression data …for bone traits…" Nat Genet. (2005)

“..approach to identify candidate genes regulating BMD…" J Bone Miner Res. (2009)

"An integrative genomics approach to infer causal associations ...” Nat Genet. (2005)

"Increasing the power to detect causal associations… “PLoS Comput Biol. (2007)

"Integrating large-scale functional genomic data ..." Nat Genet. (2008)

…… Plus 3 additional papers in PLoS Genet., BMC Genet.

d

Metabolic Disease

CVD

Bone

Methods

Extensive Publications now Substantiating Scientific Approach Probabilistic Causal Bionetwork Models

• >80 Publications from Rosetta Genetics

Page 11: Stephen Friend Nature Genetics Colloquium 2012-03-24

  50 network papers   http://sagebase.org/research/resources.php

List of Influential Papers in Network Modeling

Page 12: Stephen Friend Nature Genetics Colloquium 2012-03-24
Page 13: Stephen Friend Nature Genetics Colloquium 2012-03-24

Equipment capable of generating massive amounts of data A-

“Data Intensive” Science- Fourth Scientific Paradigm Score Card for Medical Sciences

Open Information System D-

IT Interoperability D

Host evolving computational models in a “Compute Space F

Page 14: Stephen Friend Nature Genetics Colloquium 2012-03-24

.

We still consider much clinical research as if we were “hunter gathers”- not sharing

Page 15: Stephen Friend Nature Genetics Colloquium 2012-03-24

Sage Mission

Sage Bionetworks is a non-profit organization with a vision to create a “commons” where integrative bionetworks are evolved by

contributor scientists with a shared vision to accelerate the elimination of human disease

Sagebase.org

Data Repository

Discovery Platform

Building Disease Maps

Commons Pilots

Page 16: Stephen Friend Nature Genetics Colloquium 2012-03-24

Sage Bionetworks Collaborators

  Pharma Partners   Merck, Pfizer, Takeda, Astra Zeneca,Roche Amgen, Johnson &Johnson

16

  Foundations   Kauffman CHDI, Gates Foundation

  Government   NIH, LSDF

  Academic   Levy (Framingham)   Rosengren (Lund)   Krauss (CHORI)

  Federation   Ideker, Califarno, Butte, Schadt

Page 17: Stephen Friend Nature Genetics Colloquium 2012-03-24

Knowledge Network

Information Commons

Page 18: Stephen Friend Nature Genetics Colloquium 2012-03-24
Page 19: Stephen Friend Nature Genetics Colloquium 2012-03-24

Why not share clinical /genomic data and model building in the ways currently used by the software industry (power of tracking workflows and versioning

Page 20: Stephen Friend Nature Genetics Colloquium 2012-03-24

Leveraging Existing Technologies

Taverna

Addama

tranSMART

Page 21: Stephen Friend Nature Genetics Colloquium 2012-03-24

INTEROPERABILITY  

INTEROPERABILITY

Genome Pattern CYTOSCAPE tranSMART I2B2

SYNAPSE  

Page 22: Stephen Friend Nature Genetics Colloquium 2012-03-24

Watch What I Do, Not What I Say

sage bionetworks synapse project

Page 23: Stephen Friend Nature Genetics Colloquium 2012-03-24

Reduce, Reuse, Recycle

sage bionetworks synapse project

Page 24: Stephen Friend Nature Genetics Colloquium 2012-03-24

Most of the People You Need to Work with Don’t Work with You

sage bionetworks synapse project

Page 25: Stephen Friend Nature Genetics Colloquium 2012-03-24

My Other Computer is “The Cloud”

sage bionetworks synapse project

Page 26: Stephen Friend Nature Genetics Colloquium 2012-03-24

CTCAP  The  FederaKon  Portable  Legal  Consent  Sage  Congress  Project  BRIDGE  

Five  Pilots  around  Sharing  –Leveraging  the  Work  of  others  

RULES GOVERN

PLAT

FORM

NEW

MAP

S

Page 27: Stephen Friend Nature Genetics Colloquium 2012-03-24

Clinical Trial Comparator Arm Partnership (CTCAP)

  Description: Collate, Annotate, Curate and Host Clinical Trial Data with Genomic Information from the Comparator Arms of Industry and Foundation Sponsored Clinical Trials: Building a Site for Sharing Data and Models to evolve better Disease Maps.

  Public-Private Partnership of leading pharmaceutical companies, clinical trial groups and researchers.

  Neutral Conveners: Sage Bionetworks and Genetic Alliance [nonprofits].

  Initiative to share existing trial data (molecular and clinical) from non-proprietary comparator and placebo arms to create powerful new tool for drug development.

Started Sept 2010

Page 28: Stephen Friend Nature Genetics Colloquium 2012-03-24

Shared clinical/genomic data sharing and analysis will maximize clinical impact and enable discovery

•  Graphic  of  curated  to  qced  to  models  

Page 29: Stephen Friend Nature Genetics Colloquium 2012-03-24

The  FederaKon  

Page 30: Stephen Friend Nature Genetics Colloquium 2012-03-24

(Nolan)  

Page 31: Stephen Friend Nature Genetics Colloquium 2012-03-24

sage federation: model of biological age

Faster Aging

Slower Aging

Clinical Association -  Gender -  BMI -  Disease Genotype Association Gene Pathway Expression Pr

edicted  Age  (liver  expression)  

Chronological  Age  (years)  

Age Differential

Page 32: Stephen Friend Nature Genetics Colloquium 2012-03-24

Reproducible  science==shareable  science  

Sweave: combines programmatic analysis with narrative

Sweave.Friedrich Leisch. Sweave: Dynamic generation of statistical reports using literate data analysis. In Wolfgang Härdle and Bernd Rönz,editors, Compstat 2002 –

Proceedings in Computational Statistics,pages 575-580. Physica Verlag, Heidelberg, 2002. ISBN 3-7908-1517-9

Dynamic generation of statistical reports using literate data analysis

Page 33: Stephen Friend Nature Genetics Colloquium 2012-03-24

Federated  Aging  Project  :    Combining  analysis  +  narraKve    

=Sweave Vignette Sage Lab

Califano Lab Ideker Lab

Shared  Data  Repository  

JIRA:  Source  code  repository  &  wiki  

R code + narrative

PDF(plots + text + code snippets)

Data objects

HTML

Submitted Paper

Page 34: Stephen Friend Nature Genetics Colloquium 2012-03-24

Portable  Legal  Consent  

(AcKvaKng  PaKents)  

John  Wilbanks  

Page 35: Stephen Friend Nature Genetics Colloquium 2012-03-24
Page 36: Stephen Friend Nature Genetics Colloquium 2012-03-24

Sage  Congress  Project  April  20  2012  

RealNames  Parkinson’s  Project  RevisiKng  Breast  Cancer  Prognosis  

Fanconi’s  Anemia  

(Responders  CompeKKons-­‐  IBM-­‐DREAM)  

Page 37: Stephen Friend Nature Genetics Colloquium 2012-03-24

BRIDGE  

DemocraKzaKon  of  Medical  Research  

(Social  Value  Chain)  ASHOKA  

Page 38: Stephen Friend Nature Genetics Colloquium 2012-03-24
Page 39: Stephen Friend Nature Genetics Colloquium 2012-03-24
Page 40: Stephen Friend Nature Genetics Colloquium 2012-03-24
Page 41: Stephen Friend Nature Genetics Colloquium 2012-03-24
Page 42: Stephen Friend Nature Genetics Colloquium 2012-03-24

Networking  Disease  Model  Building  

Page 43: Stephen Friend Nature Genetics Colloquium 2012-03-24

Why not use data intensive science to build models of disease

within a commons?

Power of a Compute Space/ Beyond Current Reward Structures

Five Pilots- Missing Ingredients