12
1 Big Data in Health: The Global Burden of Disease Study Peter Speyer @peterspeyer March 29, 2014

Big Data in Global Health: Steps to get data to audiences

Embed Size (px)

DESCRIPTION

 

Citation preview

Page 1: Big Data in Global Health: Steps to get data to audiences

1

Big Data in Health:The Global Burden of Disease Study

Peter Speyer@peterspeyerMarch 29, 2014

Page 2: Big Data in Global Health: Steps to get data to audiences

Big data in health

2

• Surveys

• Censuses

• Disease registries

• Vital registration

• Verbal autopsy

• Mortuaries / burial sites

• Police records

Variety Volume Velocity

• Hospital / ambulatory / primary care records

• Claims data

• Surveillance systems

• Administrative data

• Literature reviews

• Sensor data

• Social media

• Quantified self

Page 3: Big Data in Global Health: Steps to get data to audiences

IHME input data cataloged in the GHDx

3

Page 4: Big Data in Global Health: Steps to get data to audiences

From data to impact

4

Big data

Audiences

1. Data access

2. Data preparation

3. Data analysis

4. Data translation

Getting relevant data

in useful formats

to the right audiences

Page 5: Big Data in Global Health: Steps to get data to audiences

Example: Global Burden of Disease Study

• A systematic scientific effort

to quantify the comparative magnitude of

health loss due to diseases, injuries and risk factors

• GBD 2010 results published in The Lancet in 2012– 291 causes, 67 risk factors

– 187 countries

– 1990-2010

– By age and sex

• GBD 2013 Update in process

5

Page 6: Big Data in Global Health: Steps to get data to audiences

1. Accessing the data

• Systematic identification of all relevant data sources– Data Indexers

– Lit reviews

• Challenges– Data on paper, PDF, proprietary & obsolete

formats

– Patient / participant consent

– Confidentiality / de-identification

– Cost

6Tristan Schmurr / Flickr

CC Chapman / Flickr

Page 7: Big Data in Global Health: Steps to get data to audiences

2. Preparing data for analysis

• Data extraction (databases, tables, papers)

• Analysis of microdata

• Correction for bias

• Data quality issues, e.g. garbage codes

• Cross-walks, e.g. between ICDs

7

Page 8: Big Data in Global Health: Steps to get data to audiences

Demo: Tobacco Viz

8

Page 9: Big Data in Global Health: Steps to get data to audiences

3. Analyzing data

• Use all available data– Use covariates: indicators related to quantity of interest

• Test the modeling approach, e.g. predictive validity testing (CODEm)

• Apply appropriate corrections, e.g. causes of death to match all-cause mortality

• Quantify uncertainty

• Review: 1000+ experts, peer-reviewed publication

Page 10: Big Data in Global Health: Steps to get data to audiences

4. Data translation

10

• Academic papers

• Policy reports

• Data search engine

• Data visualizations– Input data

– Comprehensive results

– Key insights

Page 12: Big Data in Global Health: Steps to get data to audiences

Parting thoughts

• Grab relevant data and get started– Prep data with Data Wrangler, MS Power BI

– Visualize with Tableau Public, Google Fusion

– Learn to code: R, Python (analysis), JavaScript (viz)

• Check out IHME data sources and GHDx

Find resources & contact me at

[email protected]

@peterspeyer

http://healthdatainnovation.org

12