2009-02_JohnInnesRoadshow_GO_jdeegan.ppt

Preview:

Citation preview

13. April 2023

EBI is an Outstation of the European Molecular Biology Laboratory.

The Gene Ontology Project

An Introduction

13. April 20232

There is a lotof biological research output.

13. April 20233

Search on mesodermdevelopment…

13. April 20234

You get 6752results!

How will you ever find whatyou want?

Anotherexample…

13. April 20235

Selected Gene Tree: pearson lw n3d ...Branch color classification:Set_LW_n3d_5p_...

Colored by: Copy of Copy of C5_RMA (Defa...Gene List: all genes (14010)

attacked

time

controlSelected Gene Tree: pearson lw n3d ...Branch color classification:Set_LW_n3d_5p_...

Colored by: Copy of Copy of C5_RMA (Defa...Gene List: all genes (14010)

Bregje Wertheim at the Centre for Evolutionary Genomics, Department of Biology, UCL and Eugene Schuster Group, EBI.

Microarray datashows changed expression ofthousands of genes.

How will you spot the patterns?

13. April 20236

Scientists work hard.

http://www.kilbot.com.au/wp-content/shop/careful-scientist.gifhttp://www.teamtechnology.co.uk/f-scientist.jpg

13. April 20237

There are lots of papers to read.

http://www.kilbot.com.au/wp-content/shop/careful-scientist.gifhttp://www.teamtechnology.co.uk/f-scientist.jpg

13. April 20238

more everyweek.

http://www.kilbot.com.au/wp-content/shop/careful-scientist.gifhttp://www.teamtechnology.co.uk/f-scientist.jpg

13. April 20239

andmore…

http://www.kilbot.com.au/wp-content/shop/careful-scientist.gifhttp://www.teamtechnology.co.uk/f-scientist.jpg

13. April 202310

more andmoreand more!

http://www.kilbot.com.au/wp-content/shop/careful-scientist.gifhttp://www.teamtechnology.co.uk/f-scientist.jpg

13. April 202311

Help!Help!Help!Help!more and

moreand more!

http://www.kilbot.com.au/wp-content/shop/careful-scientist.gifhttp://www.teamtechnology.co.uk/f-scientist.jpg

13. April 202312

Ontology is a way to capture knowledge in a written and computable form.

Computable means that the computer finds patterns so we don’t have to.

ComputableComputable

13. April 202313

Ebay search (keyword ‘lead’)

v. Pubmed search (keyword ‘flower’)

Demo and practical work

13. April 202314

The Gene Ontology

13. April 202315

This is ourbrowser.

13. April 202316

Search onmesoderm development.

13. April 202317

Here is mesoderm development.

13. April 202318

Definition of mesodermdevelopment.

Gene productsinvolved in mesodermdevelopment.

13. April 202319

There are many gene products involved in mesoderm development.

You can read papers describingwhat is knownabout them.

But fewer gene products than papers.

13. April 202320

13. April 202321

Selected Gene Tree: pearson lw n3d ...Branch color classification:Set_LW_n3d_5p_...

Colored by: Copy of Copy of C5_RMA (Defa...Gene List: all genes (14010)

attacked

time

controlSelected Gene Tree: pearson lw n3d ...Branch color classification:Set_LW_n3d_5p_...

Colored by: Copy of Copy of C5_RMA (Defa...Gene List: all genes (14010)

Bregje Wertheim at the Centre for Evolutionary Genomics, Department of Biology, UCL and Eugene Schuster Group, EBI.

13. April 202322Selected Gene Tree: pearson lw n3d ...Branch color classification:Set_LW_n3d_5p_...

Colored by: Copy of Copy of C5_RMA (Defa...Gene List: all genes (14010)

attacked

time

control

Puparial adhesionMolting cyclehemocyanin

Defense responseImmune responseResponse to stimulusToll regulated genesJAK-STAT regulated genes

Immune responseToll regulated genes

Amino acid catabolismLipid metobolism

Peptidase activityProtein catabloismImmune response

Selected Gene Tree: pearson lw n3d ...Branch color classification:Set_LW_n3d_5p_...

Colored by: Copy of Copy of C5_RMA (Defa...Gene List: all genes (14010)

See which processes are upregulated or downregulated.

Bregje Wertheim at the Centre for Evolutionary Genomics, Department of Biology, UCL and Eugene Schuster Group, EBI.

13. April 202323

Practical work: Search AmiGO

Did you find your favouritegene product or process?

13. April 202324

How does theGene Ontology work?

13. April 202325

13. April 202326

The Gene Ontologyis like a dictionary

term: transcription initiation

definition: Processes involved in the assembly of the RNA polymerase complex at the promoter region of a DNA template resulting in the subsequent synthesis of RNA from that promoter.

id: GO:0006352

13. April 202327

Clark et al., 2005

part_of

is_a

The whole system.

13. April 202328

Mitochondrial P450 (CC24 PR01238; MITP450CC24)

An example…

13. April 202329

GO cellular component term:GO:0005743

Where is it?

Mitochondrial p450

mitochondrial inner membrane

13. April 202330

GO molecular function term:GO:0004497

What does it do?

substrate + O2 = CO2 +H20 product

monooxygenase activity

13. April 202331

http://ntri.tamuk.edu/cell/mitochondrion/krebpic.html

GO biological process term: GO:0006118

Which process is this?

electron transport

13. April 202332

Clark et al., 2005

part_of

is_a

The whole system.

13. April 202333

The Gene Ontology is for all species

and that meanswe have to

*bridge*

some language barriers.

13. April 202334

http://www.darknessandlight.co.uk/cambridge_photographs.html

http://www.lockeheemstra.com/italy/bridge-of-sighs-venice.html

Ponte dei Sospiri,Venice.

Bridge of Sighs,Cambridge.

Same name, same thing?

13. April 202335

TactitionTactile sense

Taction

?In biology…

13. April 202336

Tactition Tactile senseTaction

perception of touch ; GO:0050975

13. April 202337

Bud initiation?

13. April 202338

= tooth bud initiation

= reproductive bud initiation

= branch bud initiation

13. April 202339

Demo: Writing an ontology

The car ontology

13. April 202340

• Demo: The gene ontology

13. April 202341

Categorization of gene productsusing GO is called annotation.

So how does that happen?

13. April 202342

P05147

Choose your favourite gene.

13. April 202343

P05147

PMID: 2976880

Find a paperabout it.

13. April 202344

P05147PMID: 2976880

GO:0047519

Find the GO term describing itsfunction, process or location of action.

13. April 202345

P05147

PMID: 2976880

IDA

GO:0047519

What evidencedo theyshow?

13. April 202346

P05147

PMID: 2976880

GO:0047519

IDA

P05147 GO:0047519 IDA PMID:2976880

Write these down…

13. April 202347

Send to the GO Consortium .

13. April 202348

Finding annotations in a paper

In this study, we report the isolation and molecular characterization of the B. napus PERK1 cDNA, that is predicted to encode a novel receptor-like kinase. We have shown that like other plant RLKs, the kinase domain of PERK1 has serine/threonine kinase activity, In addition, the location of a PERK1-GTP fusion protein to the plasma membrane supports the prediction that PERK1 is an integral membrane protein…these kinases have been implicated in early stages of wound response…

Process: response to wounding GO:0009611

wound response

serine/threonine kinase activity,

Function: protein serine/threonine kinase activity GO:0004674

integral membrane protein

Component: integral to plasma membrane GO:0005887

…for B. napus PERK1 protein (Q9ARH1)

PubMed ID: 12374299

13. April 2023

EBI is an Outstation of the European Molecular Biology Laboratory.

Annotation details

13. April 202350

13. April 202351

13. April 202352

Where to get annotations?

• Non-redundant species database• Contains all GO annotations for given species + other

information.• http://www.arabidopsis.org/

• Multispecies database - GOA• Contains all GO annotations.• http://beta.uniprot.org/

13. April 202353

Evidence codes

13. April 202354

IDA - inferred from direct assay

Enzyme assaysIn vitro reconstitution (e.g. transcription)Immunofluorescence (for cellular component)Cell fractionation (for cellular component)Physical interaction/binding

IEP - inferred from expression pattern

Transcript levels (e.g. Northerns, microarray data)Protein levels (e.g. Western blots)

IGC - inferred from genomic context

Operon structureSyntenic regionsPathway analysisGenome-scale analysis of processes

13. April 202355

IGI - inferred from genetic interaction

"Traditional" genetic interactions such as suppressors, synthetic lethals, etc.Functional complementationRescue experimentsInference about one gene drawn from the phenotype of a mutation in a different gene.

IMP - inferred from mutant phenotype

Any gene mutation/knockoutOverexpression/ectopic expression of wild-type or mutant genesAnti-sense experimentsRNAi experimentsSpecific protein inhibitorsPolymorphism or allelic variation

IPI - inferred from physical interaction

2-hybrid interactionsCo-purificationCo-immunoprecipitationIon/protein binding experiments

13. April 202356

ISS - inferred from sequence or structural similarity

Sequence similarity (homologue of/most closely related to)Recognized domainsStructural similaritySouthern blotting

RCA - inferred from reviewed computational analysis

Large-scale protein-protein interaction experimentsMicroarray experimentsIntegration of large-scale datasets of several typesText-based computation

IEA - Inferred from Electronic Annotation

NAS - non-traceable author statement

ND - no biological data available

TAS - traceable author statement

NR - not recorded

13. April 202357

PMID: 15960829

Should we trust electronic annotations?

13. April 202358

http://www.geneontology.org/GO.indices.shtml

13. April 202359

13. April 202360

!version: $Revision: 1.67 $

!date: $Date: 2008/01/21 11:29:01 $

!Mapping of GO function_ontology "enzymes" to Enzyme Commission Numbers.

!original mapping by Michael Ashburner, Cambridge.

!This version parsed from function.ontology on 2008/01/15 14:01:16

!by Daniel Barrell, EBI, Hinxton

!

EC:1 > GO:oxidoreductase activity ; GO:0016491

EC:1.1 > GO:oxidoreductase activity, acting on CH-OH group of donors ; GO:0016614

EC:1.1.1 > GO:oxidoreductase activity, acting on the CH-OH group of donors, NAD or NADP as acceptor ; GO:0016616

EC:1.1.1.1 > GO:alcohol dehydrogenase activity ; GO:0004022

EC:1.1.1.10 > GO:L-xylulose reductase activity ; GO:0050038

EC:1.1.1.100 > GO:3-oxoacyl-[acyl-carrier-protein] reductase activity ; GO:0004316

EC:1.1.1.101 > GO:acylglycerone-phosphate reductase activity ; GO:0000140

EC:1.1.1.102 > GO:3-dehydrosphinganine reductase activity ; GO:0047560

EC:1.1.1.103 > GO:L-threonine 3-dehydrogenase activity ; GO:0008743

EC:1.1.1.104 > GO:4-oxoproline reductase activity ; GO:0016617

ec2go mapping

13. April 202361

interpro2go mapping

!date: 2008/01/15 13:01:24!Mapping of InterPro entries to GO!Nicola Mulder, Hinxton!InterPro:IPR000003 Retinoid X receptor > GO:DNA binding ; GO:0003677InterPro:IPR000003 Retinoid X receptor > GO:steroid binding ; GO:0005496InterPro:IPR000003 Retinoid X receptor > GO:regulation of transcription, DNA-dependent ; GO:0006355InterPro:IPR000003 Retinoid X receptor > GO:nucleus ; GO:0005634InterPro:IPR000005 Helix-turn-helix, AraC type > GO:transcription factor activity ; GO:0003700InterPro:IPR000005 Helix-turn-helix, AraC type > GO:intracellular ; GO:0005622InterPro:IPR000006 Metallothionein, vertebrate > GO:metal ion binding ; GO:0046872InterPro:IPR000013 Peptidase M7, snapalysin > GO:extracellular region ; GO:0005576InterPro:IPR000014 PAS > GO:signal transducer activity ; GO:0004871InterPro:IPR000015 Fimbrial biogenesis outer membrane usher protein > GO:transporter activity ; GO:0005215InterPro:IPR000018 P2Y4 purinoceptor > GO:purinergic nucleotide receptor activity, G-protein coupled ; GO:0045028InterPro:IPR000020 Anaphylatoxin/fibulin > GO:extracellular region ; GO:0005576InterPro:IPR000021 Hok/gef cell toxic protein > GO:membrane ; GO:0016020InterPro:IPR000022 Carboxyl transferase > GO:ligase activity ; GO:0016874InterPro:IPR000023 Phosphofructokinase > GO:6-phosphofructokinase activity ; GO:0003872InterPro:IPR000025 Melatonin receptor > GO:integral to membrane ; GO:0016021InterPro:IPR000026 Guanine-specific ribonuclease N1 and T1 > GO:endoribonuclease activity ; GO:0004521InterPro:IPR000028 Chloroperoxidase > GO:peroxidase activity ; GO:0004601

InterPro is a database of protein families, domains and functional sites in which identifiable features found in known proteins can be applied to unknown protein sequences.

13. April 202362

Manual annotation appears in AmiGO.

Manual and electronic annotationappears in QuickGO.

13. April 202363

Clark et al., 2005

Many species groups annotate.

We see the research of one function across all species.

13. April 202364

Exercise: Search for your favourite gene

and see if the annotation is electronic or manual.

http://www.ebi.ac.uk/ego/

13. April 202365

Submit new GO terms:

http://www.geneontology.org/

13. April 202366

13. April 2023

EBI is an Outstation of the European Molecular Biology Laboratory.

GO slims

13. April 202368

Clark et al., 2005

part_of

is_a

13. April 202369

Clark et al., 2005

part_of

is_a

13. April 202370

Whole genome analysis(J. D. Munkvold et al., 2004)

13. April 202371

Selected Gene Tree: pearson lw n3d ...Branch color classification:Set_LW_n3d_5p_...

Colored by: Copy of Copy of C5_RMA (Defa...Gene List: all genes (14010)

attacked

time

control

Puparial adhesionMolting cyclehemocyanin

Immune responseToll regulated genes

Amino acid catabolismLipid metobolism

Peptidase activityProtein catabloismImmune response

Selected Gene Tree: pearson lw n3d ...Branch color classification:Set_LW_n3d_5p_...

Colored by: Copy of Copy of C5_RMA (Defa...Gene List: all genes (14010)

Bregje Wertheim at the Centre for Evolutionary Genomics, Department of Biology, UCL and Eugene Schuster Group, EBI.

…analysis of high-throughput data according to GO

13. April 202372

Making Slims:

OBO-Edit

13. April 202373

Reapplying slimmed ontology to annotations:

AmiGO

http://amigo.geneontology.org/

13. April 202374

Converting IDs:PICR

http://www.ebi.ac.uk/Tools/picr/

13. April 202375

GOOSEhttp://www.berkeleybop.org/goose

13. April 202376

2006 Consortium Meeting,

St. Croix,

U.S. Virgin Islands, March 30 - April 3, 2006

13. April 202377

http://www.geneontology.org

Reactome

E. Coli hub

Recommended