31
Data Integration & Data Mining Tool Donald Dunbar BHF CoRE Bioinformatics Team Edinburgh Bioinformatics Meeting April 2013

Data Integration & Data Mining Tool Donald Dunbar BHF CoRE Bioinformatics Team Edinburgh Bioinformatics Meeting April 2013

Embed Size (px)

DESCRIPTION

Data Integration and our Data Mining Tool Our strategy is to help biologists make the most of their ‘-omics’ data We analyse array and sequence data using current methods Biologists mine their results in a custom built, secure web based platform We help integrate other relevant data from biologist’s lab and the literature

Citation preview

Page 1: Data Integration & Data Mining Tool Donald Dunbar BHF CoRE Bioinformatics Team Edinburgh Bioinformatics Meeting April 2013

Data Integration & Data Mining Tool

Donald DunbarBHF CoRE Bioinformatics Team

Edinburgh Bioinformatics MeetingApril 2013

Page 2: Data Integration & Data Mining Tool Donald Dunbar BHF CoRE Bioinformatics Team Edinburgh Bioinformatics Meeting April 2013

BHF CoRE Bioinformatics

Page 3: Data Integration & Data Mining Tool Donald Dunbar BHF CoRE Bioinformatics Team Edinburgh Bioinformatics Meeting April 2013

Data Integration and our Data Mining Tool

Our strategy is to help biologists make the most of their ‘-omics’ data

We analyse array and sequence data using current methods

Biologists mine their results in a custom built, secure web based platform

We help integrate other relevant data from biologist’s lab and the literature

Page 4: Data Integration & Data Mining Tool Donald Dunbar BHF CoRE Bioinformatics Team Edinburgh Bioinformatics Meeting April 2013

Data Mining Tool

• Wish list– Web accessible– Secure– Complex queries across datasets– Technology agnostic– Query cross species– Annotation, statistics and graphs– Links to external databases– Include downstream tools

Page 5: Data Integration & Data Mining Tool Donald Dunbar BHF CoRE Bioinformatics Team Edinburgh Bioinformatics Meeting April 2013

Data Mining Tool

• Login via EASE or htaccess (+ vpn)• Built in PHP with mySQL back end• Generic database structure for statistics

– counts, intensity, fold change, p-value• Separate annotation tables• Includes experiment details and QC info• Query builder type interface• Output as tables with links• Gene set enrichment, heat-map and literature

Page 6: Data Integration & Data Mining Tool Donald Dunbar BHF CoRE Bioinformatics Team Edinburgh Bioinformatics Meeting April 2013
Page 7: Data Integration & Data Mining Tool Donald Dunbar BHF CoRE Bioinformatics Team Edinburgh Bioinformatics Meeting April 2013
Page 8: Data Integration & Data Mining Tool Donald Dunbar BHF CoRE Bioinformatics Team Edinburgh Bioinformatics Meeting April 2013
Page 9: Data Integration & Data Mining Tool Donald Dunbar BHF CoRE Bioinformatics Team Edinburgh Bioinformatics Meeting April 2013
Page 10: Data Integration & Data Mining Tool Donald Dunbar BHF CoRE Bioinformatics Team Edinburgh Bioinformatics Meeting April 2013
Page 11: Data Integration & Data Mining Tool Donald Dunbar BHF CoRE Bioinformatics Team Edinburgh Bioinformatics Meeting April 2013
Page 12: Data Integration & Data Mining Tool Donald Dunbar BHF CoRE Bioinformatics Team Edinburgh Bioinformatics Meeting April 2013
Page 13: Data Integration & Data Mining Tool Donald Dunbar BHF CoRE Bioinformatics Team Edinburgh Bioinformatics Meeting April 2013
Page 14: Data Integration & Data Mining Tool Donald Dunbar BHF CoRE Bioinformatics Team Edinburgh Bioinformatics Meeting April 2013
Page 15: Data Integration & Data Mining Tool Donald Dunbar BHF CoRE Bioinformatics Team Edinburgh Bioinformatics Meeting April 2013
Page 16: Data Integration & Data Mining Tool Donald Dunbar BHF CoRE Bioinformatics Team Edinburgh Bioinformatics Meeting April 2013
Page 17: Data Integration & Data Mining Tool Donald Dunbar BHF CoRE Bioinformatics Team Edinburgh Bioinformatics Meeting April 2013
Page 18: Data Integration & Data Mining Tool Donald Dunbar BHF CoRE Bioinformatics Team Edinburgh Bioinformatics Meeting April 2013
Page 19: Data Integration & Data Mining Tool Donald Dunbar BHF CoRE Bioinformatics Team Edinburgh Bioinformatics Meeting April 2013
Page 20: Data Integration & Data Mining Tool Donald Dunbar BHF CoRE Bioinformatics Team Edinburgh Bioinformatics Meeting April 2013
Page 21: Data Integration & Data Mining Tool Donald Dunbar BHF CoRE Bioinformatics Team Edinburgh Bioinformatics Meeting April 2013
Page 22: Data Integration & Data Mining Tool Donald Dunbar BHF CoRE Bioinformatics Team Edinburgh Bioinformatics Meeting April 2013
Page 23: Data Integration & Data Mining Tool Donald Dunbar BHF CoRE Bioinformatics Team Edinburgh Bioinformatics Meeting April 2013
Page 24: Data Integration & Data Mining Tool Donald Dunbar BHF CoRE Bioinformatics Team Edinburgh Bioinformatics Meeting April 2013
Page 25: Data Integration & Data Mining Tool Donald Dunbar BHF CoRE Bioinformatics Team Edinburgh Bioinformatics Meeting April 2013
Page 26: Data Integration & Data Mining Tool Donald Dunbar BHF CoRE Bioinformatics Team Edinburgh Bioinformatics Meeting April 2013

Data Integration

• Across technologies– array, sequencing– gene expression, methylation, proteomics, genetics

• Across species– Human, mouse, rat, fly, fish

• At the gene level– Probe level for within array– Entrez gene within species– orthologous groups across species

Page 27: Data Integration & Data Mining Tool Donald Dunbar BHF CoRE Bioinformatics Team Edinburgh Bioinformatics Meeting April 2013
Page 28: Data Integration & Data Mining Tool Donald Dunbar BHF CoRE Bioinformatics Team Edinburgh Bioinformatics Meeting April 2013
Page 29: Data Integration & Data Mining Tool Donald Dunbar BHF CoRE Bioinformatics Team Edinburgh Bioinformatics Meeting April 2013

Development• New platform: Drupal

– Some nice features– New look and feel

• Web services– interactions, diseases, TF binding sites, miRNA…

• More use of literature data– Top 10 co-cited on gene detail page– Better visualisation– Better text mining

• Correlation data (expression profiles)– searchable with other stats

• Cross experiment gene sets

Page 30: Data Integration & Data Mining Tool Donald Dunbar BHF CoRE Bioinformatics Team Edinburgh Bioinformatics Meeting April 2013

Thanks• Jon Manning• John Mullins• Our collaborators• British Heart Foundation

Page 31: Data Integration & Data Mining Tool Donald Dunbar BHF CoRE Bioinformatics Team Edinburgh Bioinformatics Meeting April 2013

[email protected] | www.bioinf.mvm.ed.ac.uk

“Providing bioinformatics services to biology teamsthroughout the research process”