GMOD/GBrowse_syn Sheldon McKay iPlant Collaborative DNA Learning Center Cold Spring Harbor Laboratory

Page 1: GMOD/GBrowse_syn

GMOD/GBrowse_syn

Sheldon McKay

iPlant Collaborative

DNA Learning Center

Cold Spring Harbor Laboratory

Page 2: GMOD/GBrowse_syn

Outline

A few words on whole genome alignment

A brief survey of synteny browsers

A few challenges of rendering comparative data

Comparative genome browsing with GBrowse_syn

Page 3: GMOD/GBrowse_syn
Page 4: GMOD/GBrowse_syn

Hierarchical Genome Alignment Strategy

Mask repeats (RepeatMasker, Tandem Repeats Finder, nmerge, etc.)

Identify orthologous regions (ENREDO, MERCATOR, orthocluster, etc.)

Nucleotide-level alignment (PECAN, MAVID, etc.)

Further processing

GBrowse_syn

GBrowse_syn

GBrowse

Raw genomic sequences

Page 5: GMOD/GBrowse_syn
Page 6: GMOD/GBrowse_syn

A Few Use Cases

Multiple sequence alignment data from whole genomes

Synteny or co-linearity data without alignments

Gene orthology assignments based on proteins

Self vs. self comparison of duplications, homeologous regions, etc.

Others

Page 7: GMOD/GBrowse_syn

What is a Synteny Browser?

- Has display elements in common with genome browsers

- Uses sequence alignments, orthology or co-linearity data to highlight different genomes, strains, etc.

- Usually displays co-linearity relative to a reference genome.

Page 8: GMOD/GBrowse_syn

A Brief Survey of GMOD-friendly Synteny Browsers*

Page 9: GMOD/GBrowse_syn

Wang H, Su Y, Mackey AJ, Kraemer ET and Kissinger JC. 2006. SynView: a GBrowse-compatible approach to visualizing comparative genome data. Bioinformatics 22: 2308-2309

Page 10: GMOD/GBrowse_syn

Pan, X., Stein, L. and Brendel, V. 2005. SynBrowse: a Synteny Browser for Comparative Sequence Analysis. Bioinformatics 21: 3461-3468

Page 11: GMOD/GBrowse_syn

Crabtree, J., Angiuoli, S. V., Wortman, J. R. and White, O. R. 2007. Sybil: methods and software for multiple genome comparison and visualization. Methods Mol Biol. 408: 93-108

Page 12: GMOD/GBrowse_syn

+ others...

Youens-Clark K, Faga B, Yap IV, Stein LD, Ware, D. 2009. CMap 1.01: A comparative mapping application for the Internet. doi:10.1093

Page 13: GMOD/GBrowse_syn

GBrowse_syn

+ others…

McKay SJ, Vergara IA and Stajich, J. 2010. "Using the Generic Synteny Browser (GBrowse_syn)" in Current Protocols in Bioinformatics (Wiley Interscience) doi: 10.1002/0471250953.bi0912s31

Page 14: GMOD/GBrowse_syn

GMOD Browser branding/nomenclature issues…

Page 15: GMOD/GBrowse_syn

Other non-GMOD Browsers

http://mkweb.bcgsc.ca/circos/ http://www.mizbee.org

Page 16: GMOD/GBrowse_syn

Other non-GMOD Browsers

http://synteny.cnr.berkeley.edu/CoGe/

Page 17: GMOD/GBrowse_syn

Apologies to others not listed

Page 18: GMOD/GBrowse_syn

How is GBrowse_syn different?

• Does not rely on perfect co-linearity across the entire displayed region (no orphan alignments)

• Offers “on the fly” alignment chaining

• No upper limit on the number of species

• Uses grid lines to trace fine-scale indels (sequence insertions/deletions)

• Integration with GBrowse data sources

• Ongoing support and development
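The “on the fly” chaining mentioned above can be pictured as merging neighbouring alignment blocks at display time. The following is only a minimal sketch of that idea, not GBrowse_syn's actual implementation: the `Block` fields, the `chain_blocks` name and the `max_gap` threshold are all hypothetical, and the sketch looks only at gaps on the reference sequence.

```python
from dataclasses import dataclass, replace
from typing import List


@dataclass
class Block:
    """One aligned region (hypothetical record layout, for illustration only)."""
    ref_start: int
    ref_end: int
    tgt_seqid: str
    tgt_start: int
    tgt_end: int
    strand: str  # "+" or "-"


def chain_blocks(blocks: List[Block], max_gap: int = 10_000) -> List[Block]:
    """Merge neighbouring blocks that share a target sequence and strand.

    Two blocks are chained when the gap between them on the reference
    is at most `max_gap` bases (an assumed, tunable threshold).
    """
    chained: List[Block] = []
    for block in sorted(blocks, key=lambda b: b.ref_start):
        last = chained[-1] if chained else None
        if (
            last is not None
            and last.tgt_seqid == block.tgt_seqid
            and last.strand == block.strand
            and block.ref_start - last.ref_end <= max_gap
        ):
            # Extend the previous chain to cover this block as well.
            last.ref_end = max(last.ref_end, block.ref_end)
            last.tgt_start = min(last.tgt_start, block.tgt_start)
            last.tgt_end = max(last.tgt_end, block.tgt_end)
        else:
            # Start a new chain with a copy so the input is not modified.
            chained.append(replace(block))
    return chained


example = [
    Block(100, 200, "chrI", 1000, 1100, "+"),
    Block(250, 400, "chrI", 1150, 1300, "+"),
    Block(50_000, 51_000, "chrI", 9000, 10_000, "+"),
    Block(51_100, 51_200, "chrII", 1, 100, "+"),
]
chains = chain_blocks(example, max_gap=1000)
```

Here the first two blocks merge into one chain, while the third (too far away) and the fourth (different target sequence) stay separate. A fuller version would presumably also check that the target coordinates are co-linear before merging.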

Page 19: GMOD/GBrowse_syn

GBrowse-like interface

Page 20: GMOD/GBrowse_syn

GBrowse Databases*

*.syn or *.conf

*.synconf

GBrowse_syn alignment database

GBrowse_syn

Species config. Master config.

Page 21: GMOD/GBrowse_syn

GBrowse_syn Architecture

[GBrowse] [GBrowse] [GBrowse] [GBrowse]

Page 22: GMOD/GBrowse_syn

Getting Data into GBrowse_syn

CLUSTALW, PECAN, MSF, ad hoc tab-delimited, FASTA, STOCKHOLM, GFF3, etc…

Loading scripts

Page 23: GMOD/GBrowse_syn
Page 24: GMOD/GBrowse_syn
Page 25: GMOD/GBrowse_syn
Page 26: GMOD/GBrowse_syn

Optional “All in one” view

Page 27: GMOD/GBrowse_syn

Adding markup to the annotations

Page 28: GMOD/GBrowse_syn

Problem: How to use insertion/deletion data

Page 29: GMOD/GBrowse_syn

Tracking Indels with grid lines
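One way to picture grid lines is as pairs of matching coordinates sampled at regular intervals along a gapped alignment; an indel then shows up as a shift between consecutive pairs. The sketch below is illustrative only and rests on several assumptions: the function name `grid_anchors` is hypothetical, both sequences are on the forward strand, coordinates are 1-based, and `-` is the gap character.

```python
from typing import List, Tuple


def grid_anchors(
    ref_aln: str,
    tgt_aln: str,
    ref_start: int,
    tgt_start: int,
    step: int = 10,
) -> List[Tuple[int, int]]:
    """Return (reference, target) coordinate pairs every `step` reference bases.

    A pair is only emitted where both sequences have a base, so reference
    positions that fall inside a target deletion produce no grid line.
    """
    if len(ref_aln) != len(tgt_aln):
        raise ValueError("aligned sequences must have the same length")

    anchors: List[Tuple[int, int]] = []
    ref_pos = ref_start - 1
    tgt_pos = tgt_start - 1
    for ref_char, tgt_char in zip(ref_aln, tgt_aln):
        # Advance each coordinate only when that sequence has a real base.
        if ref_char != "-":
            ref_pos += 1
        if tgt_char != "-":
            tgt_pos += 1
        if ref_char != "-" and tgt_char != "-" and ref_pos % step == 0:
            anchors.append((ref_pos, tgt_pos))
    return anchors


# The target lacks two bases present in the reference (columns 3 and 4).
anchors = grid_anchors("ACGTACGTAC", "AC--ACGTAC", ref_start=1, tgt_start=1, step=5)
print(anchors)  # [(5, 3), (10, 8)]
```

In this toy alignment the two-base deletion in the target appears as a constant offset of 2 between the reference and target coordinates of each anchor.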

Page 30: GMOD/GBrowse_syn

Evolution of Gene Structure

Page 31: GMOD/GBrowse_syn

Putative gene gain or loss

Page 32: GMOD/GBrowse_syn

Comparing gene models

Page 33: GMOD/GBrowse_syn

Comparing assemblies

Not bad

Needs work

Page 34: GMOD/GBrowse_syn

Example Mercator Alignment

Page 35: GMOD/GBrowse_syn

Getting the most out of small aligned regions or orthology-only data

Page 36: GMOD/GBrowse_syn

Gene Orthology

Chained Orthologs

Page 37: GMOD/GBrowse_syn

2 panels merged

Inversion + translocation?

Page 38: GMOD/GBrowse_syn

What about synteny blocks that fall off the ends of the displayed reference sequence?

Page 39: GMOD/GBrowse_syn

Solution 1: With multiple sequence alignment data, calculate many anchor points (done anyway for grid lines)

Solution 2: For orthology-based synteny blocks, use individual start and end coordinates of orthologs as anchor points.

Solution 3: If all else fails, guess the end of the target block based on the overall length ratio.

displayed target length = (target length / reference length) × displayed reference length
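The length-ratio guess in Solution 3 amounts to a single line of arithmetic. A minimal sketch follows; the function and parameter names are hypothetical, and rounding to whole bases is an added assumption.

```python
def displayed_target_length(
    target_length: int,
    reference_length: int,
    displayed_reference_length: int,
) -> int:
    """Estimate how much of the target block lies within the displayed region.

    Scales the displayed part of the reference block by the overall
    target/reference length ratio of the synteny block.
    """
    if reference_length <= 0:
        raise ValueError("reference_length must be positive")
    ratio = target_length / reference_length
    return round(ratio * displayed_reference_length)


# A 10,000 bp reference block matches an 8,000 bp target block,
# but only 2,500 bp of the reference block is on screen.
estimate = displayed_target_length(8_000, 10_000, 2_500)
print(estimate)  # 2000
```

The estimate assumes that indels are spread evenly along the block, which is why the slide presents it as a last resort after anchor-point approaches.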

Page 40: GMOD/GBrowse_syn

What if the aligned DNA sequences are too distant?

!=

Page 41: GMOD/GBrowse_syn

Pecan alignments

Protein orthology based Synteny blocks

Page 42: GMOD/GBrowse_syn

What about segmental duplications?

Page 43: GMOD/GBrowse_syn
Page 44: GMOD/GBrowse_syn

The Future of GBrowse_syn

• Full Integration with GBrowse 2

• “On the fly” sequence alignment view

• AJAX-based user interface and navigation (JBrowse_syn)

• Suggestions?

Page 45: GMOD/GBrowse_syn

Acknowledgments

Lincoln Stein
Dave Clements
Scott Cain
Jason Stajich
Bonnie Hurwitz
Eva Huala
Cynthia Lee
Jack Chen
Ismael Vergara
Michael Paulini
WormBase Curators
Richard Hayes
Rob Buels

Projects Funding