21
BioPivot: Applying BioPivot: Applying Microsoft Live Microsoft Live Labs’ Pivot to Labs’ Pivot to Problems in Problems in Bioinformatics Bioinformatics Stephen Taylor, CBRG Stephen Taylor, CBRG GMOD Europe 2010 GMOD Europe 2010

BioPivot: Applying Microsoft Live Labs’ Pivot to Problems in Bioinformatics Stephen Taylor, CBRG GMOD Europe 2010

  • View
    217

  • Download
    1

Embed Size (px)

Citation preview

Page 1: BioPivot: Applying Microsoft Live Labs’ Pivot to Problems in Bioinformatics Stephen Taylor, CBRG GMOD Europe 2010

BioPivot: Applying BioPivot: Applying Microsoft Live Labs’ Microsoft Live Labs’ Pivot to Problems in Pivot to Problems in

Bioinformatics Bioinformatics

Stephen Taylor, CBRGStephen Taylor, CBRG

GMOD Europe 2010GMOD Europe 2010

Page 2: BioPivot: Applying Microsoft Live Labs’ Pivot to Problems in Bioinformatics Stephen Taylor, CBRG GMOD Europe 2010

IntroductionIntroduction

Visualization of large numbers of genome Visualization of large numbers of genome regions regions

Querying and filtering properties of Querying and filtering properties of genome regionsgenome regions

Pivot and BioPivot toolsPivot and BioPivot tools

Open discussion of other applications of Open discussion of other applications of technology technology

Page 3: BioPivot: Applying Microsoft Live Labs’ Pivot to Problems in Bioinformatics Stephen Taylor, CBRG GMOD Europe 2010

CBRGCBRG

Page 4: BioPivot: Applying Microsoft Live Labs’ Pivot to Problems in Bioinformatics Stephen Taylor, CBRG GMOD Europe 2010

Over 50 different GBrowse Over 50 different GBrowse databasesdatabases

Many labs started wanting GBrowseMany labs started wanting GBrowseHuman Human MouseMouseBacterial Bacterial

Time seriesTime seriesArraysArraysChIP-SeqChIP-SeqRNA-SeqRNA-Seq

Page 5: BioPivot: Applying Microsoft Live Labs’ Pivot to Problems in Bioinformatics Stephen Taylor, CBRG GMOD Europe 2010

Next Generation SequencingNext Generation Sequencing

Histone modification DataHistone modification Data

ChIP-SeqChIP-Seq

Interaction cis/trans dataInteraction cis/trans data

PCR amplified regionsPCR amplified regions

RNA-SeqRNA-Seq

Exome Sequencing/SNP detectionExome Sequencing/SNP detection

Page 6: BioPivot: Applying Microsoft Live Labs’ Pivot to Problems in Bioinformatics Stephen Taylor, CBRG GMOD Europe 2010

ChIP-Seq exampleChIP-Seq example

NGSreads

Map

Peak pick

Extract sequences from featuresMotif extract

Weblogo

Page 7: BioPivot: Applying Microsoft Live Labs’ Pivot to Problems in Bioinformatics Stephen Taylor, CBRG GMOD Europe 2010

ChIP-SeqChIP-Seq

NGSreadsNGSreads

Map

Peakfind

Page 8: BioPivot: Applying Microsoft Live Labs’ Pivot to Problems in Bioinformatics Stephen Taylor, CBRG GMOD Europe 2010

ProblemsProblems

Experimental conditionsExperimental conditions

AntibodyAntibody

Peak finders give false positivesPeak finders give false positives Lots of parametersLots of parameters Must choose a suitable cut-offMust choose a suitable cut-off Eyeballing lots of peaksEyeballing lots of peaks

Page 9: BioPivot: Applying Microsoft Live Labs’ Pivot to Problems in Bioinformatics Stephen Taylor, CBRG GMOD Europe 2010

Further AnalysisFurther Analysis

Which of my peaks overlap with:Which of my peaks overlap with:

Genes Genes

ExonsExons

PromotersPromoters

CpG IslandsCpG Islands

Areas of conservationAreas of conservation

etcetc

Page 10: BioPivot: Applying Microsoft Live Labs’ Pivot to Problems in Bioinformatics Stephen Taylor, CBRG GMOD Europe 2010

TraditionallyTraditionally

Make spreadsheet of data with links to Make spreadsheet of data with links to Gbrowse/UCSC regions of interestGbrowse/UCSC regions of interest

Click/Filter various parametersClick/Filter various parameters

Add data to spreadsheet each new Add data to spreadsheet each new analysisanalysis

Page 11: BioPivot: Applying Microsoft Live Labs’ Pivot to Problems in Bioinformatics Stephen Taylor, CBRG GMOD Europe 2010

Deep Zoom TechDeep Zoom Tech

Blaise Aguera y Arcas (TED 2007)Blaise Aguera y Arcas (TED 2007)

Seadragon/PhotosynthSeadragon/Photosynth http://www.seadragon.com/showcasehttp://www.seadragon.com/showcase/

Microsoft Live Labs’ PivotMicrosoft Live Labs’ Pivot http://www.getpivot.com/http://www.getpivot.com/

Page 12: BioPivot: Applying Microsoft Live Labs’ Pivot to Problems in Bioinformatics Stephen Taylor, CBRG GMOD Europe 2010

Wouldn’t it be cool...Wouldn’t it be cool...

To use this in bioinformatics...?To use this in bioinformatics...?

Take thousands of regions of interest of Take thousands of regions of interest of genomegenome

View View andand Filter seamlessly on metadata Filter seamlessly on metadata

Page 13: BioPivot: Applying Microsoft Live Labs’ Pivot to Problems in Bioinformatics Stephen Taylor, CBRG GMOD Europe 2010

BioPivot ToolsBioPivot Tools

GFF3 of ROIGFF3 of ROI

Ninth column contains ‘facets’Ninth column contains ‘facets’

Choose your GBrowse or UCSC Browser Choose your GBrowse or UCSC Browser ViewView

Run the command:Run the command:

gff32pivot my.gff3 –dzi –generateimages –conf mytypes.cfg gff32pivot my.gff3 –dzi –generateimages –conf mytypes.cfg -o my.cxml –browser gbrowse2-o my.cxml –browser gbrowse2

Page 14: BioPivot: Applying Microsoft Live Labs’ Pivot to Problems in Bioinformatics Stephen Taylor, CBRG GMOD Europe 2010

BioPivot ToolsBioPivot Tools

Parsers for peakfindersParsers for peakfinders

Annotate a GFF3 fileAnnotate a GFF3 file nearest genenearest gene exons, introns, intergenic, intragenicexons, introns, intergenic, intragenic TSS/TES up and down stream regionsTSS/TES up and down stream regions

Overlaps of GFF3/BED featuresOverlaps of GFF3/BED features

Page 15: BioPivot: Applying Microsoft Live Labs’ Pivot to Problems in Bioinformatics Stephen Taylor, CBRG GMOD Europe 2010

Open SourceOpen Source

Zoomable User Interfaces (ZUIs)Zoomable User Interfaces (ZUIs)

OpenZoom OpenZoom http://openzoom.org/http://openzoom.org/

SDK for Flash, Flex & AIRSDK for Flash, Flex & AIR

APIsAPIs

Page 16: BioPivot: Applying Microsoft Live Labs’ Pivot to Problems in Bioinformatics Stephen Taylor, CBRG GMOD Europe 2010

Deep Zoom ImageDeep Zoom Image

Page 17: BioPivot: Applying Microsoft Live Labs’ Pivot to Problems in Bioinformatics Stephen Taylor, CBRG GMOD Europe 2010

Deep Zoom CollectionDeep Zoom Collection

Groups of tiled imagesGroups of tiled images

Page 18: BioPivot: Applying Microsoft Live Labs’ Pivot to Problems in Bioinformatics Stephen Taylor, CBRG GMOD Europe 2010

CXML FileCXML File

Page 19: BioPivot: Applying Microsoft Live Labs’ Pivot to Problems in Bioinformatics Stephen Taylor, CBRG GMOD Europe 2010

To DoTo Do

Installation scriptsInstallation scripts

Deploy in a web browser using SilverlightDeploy in a web browser using Silverlight

RNA-Seq parsers e.g. cufflinks, DESeqRNA-Seq parsers e.g. cufflinks, DESeq

Get feedback from the communityGet feedback from the community

What else can we do with this tech?What else can we do with this tech?

http://www.cbrg.ox.ac.uk/data/biopivothttp://www.cbrg.ox.ac.uk/data/biopivot

Page 20: BioPivot: Applying Microsoft Live Labs’ Pivot to Problems in Bioinformatics Stephen Taylor, CBRG GMOD Europe 2010

Acknowledgements - CodeAcknowledgements - Code

OpenZoom OpenZoom http://openzoom.org/http://openzoom.org/

CisgenomeCisgenome http://www.biostat.jhsph.edu/~hji/cisgenome/http://www.biostat.jhsph.edu/~hji/cisgenome/

BEDtoolsBEDtools http://code.google.com/p/bedtools/http://code.google.com/p/bedtools/

Page 21: BioPivot: Applying Microsoft Live Labs’ Pivot to Problems in Bioinformatics Stephen Taylor, CBRG GMOD Europe 2010

Acknowledgements - PeopleAcknowledgements - People

Jim HughesJim Hughes

CBRG TeamCBRG Team

GMOD TeamGMOD Team