DataScience@NIH: Pivoting to the Future. Michael F. Huerta, Ph.D., Coordinator of Data Science & Open Science, Associate Director for Program Development, NLM, NIH. Food and Drug Administration BioCompute, March 16, 2017

DataScience@NIH: Pivoting to the Future

Michael F. Huerta, Ph.D.

Coordinator of Data Science & Open Science

Associate Director for Program Development, NLM, NIH

Food and Drug Administration BioCompute – March 16, 2017

Biomedical Science

NLM’s Science & Approach

Biomedical Science

Literature Data Software

Reagents Workflows

Information Science

Biomedical Science

Curate

Information Science

Biomedical Science

Acquisition Classification Metadata

Selection Preservation

Information Science

Informatics

Biomedical Science

Compute in context

Information Science

Informatics

Biomedical Science

Computation

Software tools

Ontologies

Algorithms

Data Science

Information Science

Informatics

Biomedical Science

Extract insight from data

Data Science

Information Science

Informatics

Biomedical Science

Statistics

Visualization

Probabilistics

Artificial intelligence

Data Science

Information Science

Open Science Informatics

Biomedical Science

Open Science

Findable Accessible

Re-usable Interoperable

Attributable

Sustainable

Unified Medical Language System

BioProject

Data Science at NIH

NIH Designates NLM as the Lead

• Data Science - NLM should be the intellectual and programmatic epicenter for data science at NIH and stimulate its advancement throughout biomedical research and application.

• Open Science - NLM should lead efforts to support and catalyze open science, data sharing, and research reproducibility, striving to promote the concept that biomedical information and its transparent analysis are public goods.

NIH Support for Informatics & Data Science

Research, Tools & Workforce Development

• US Human Brain Project → Neuroinformatics
• Biomedical Informatics Research Network (BIRN)

NIH Support for Informatics & Data Science

Research, Tools & Workforce Development

• US Human Brain Project
• Biomedical Informatics Research Network (BIRN)
• Biomedical Informatics Science & Tech Initiative
• NIH Blueprint for Neuroscience
• Big Data to Knowledge Initiative
• The BRAIN Initiative
• Many other IC-specific initiatives & programs

NIH Support for Data-Centric & Open Science

Large Scale Data

TOPMed

The Human Connectome Project

DataScience@NIH

Build on NIH-Wide Opportunities

• Findable – PubMed
  • Finding literature
  • Finding data via PubMed Central data deposit (10/17)

• Accessible - Holdren Memo to increase access
  • NIH plan for publications – PubMed Central
  • NIH plan for data – Peer-reviewed DMP for all research
  • Many repositories open for data deposit and withdrawal

• Interoperable - Standards
  • NLM – UMLS, SNOMED-CT, LOINC, RxNorm, etc.
  • Repository & Initiative-related standards across NIH
  • NIH Clinical Common Data Element Task Force

• Big Data to Knowledge Initiative
  • Data science research & tools
  • Commons & cloud pilots
  • Workforce development
  • Open science prize

DataScience@NIH: Pivot to the Future

Strategic Engagement Across & Beyond NIH

• Sustainability solutions – urgent to address

Sustainable

More expensive are the opportunity costs of not addressing sustainability

Sustainable

Perhaps we can even bend the cost curve

DataScience@NIH: Pivot to the Future

Strategic Engagement Across & Beyond NIH

• Sustainability solutions
  • Enterprise-wide approaches (balance with IC needs)
    • Solve common problems once
    • Converge on data-related standards
    • Lessons learned & best practices
  • Value assessment & criteria for investment in policy changes, infrastructure, data acquisition, preservation, etc.
    • Cost vs benefit
    • Develop and use evidence base & models
      • Econometric & other studies
      • Innovate in curation (e.g., RFA-LM-17-001)
  • Range of storage options
    • Minimal function repository → full guided services

DataScience@NIH: Pivot to the Future

Strategic Engagement Across & Beyond NIH

• Engage and partner across sectors & around the world
• Grow a talented workforce intra- & extramural
  • Data science experts
  • Train across bio, info, & data science
  • NIH staff – research, technical, program, review & policy
• Promote open science & citizen science
  • Evidence-based changes in policies & practices
  • Strategic incentive structure across full research enterprise
  • Empower research participants, patients, & citizens
• Continue research & innovation in data science
  • Analytics, statistics, artificial intelligence
  • Innovations in storage and access
  • Use of provenance, PUIDs, & blockchain to form imputed or generated metadata in a more machine-driven & adaptive ecosystem