45
Enzymatic Function Module (KEGG, MetaCyc, and EC Numbers)

Annotation Presentation Week 7

  • Upload
    micol

  • View
    42

  • Download
    0

Embed Size (px)

DESCRIPTION

Annotation Presentation Week 7. Enzymatic Function Module (KEGG, MetaCyc, and EC Numbers). Provide contextual information about your gene as component of pathway or cellular structure. Provides information about the class to which an enzyme belongs: oxidoreductases transferases - PowerPoint PPT Presentation

Citation preview

Page 1: Annotation Presentation Week 7

Enzymatic Function Module(KEGG, MetaCyc, and EC Numbers)

Page 2: Annotation Presentation Week 7

Provide contextual informationabout your gene as componentof pathway or cellular structure

Provides information aboutthe class to which an enzymebelongs:

oxidoreductases transferases hydrolases lysases isomerases ligases

Page 3: Annotation Presentation Week 7

Kyoto Encyclopedia of Genes and Genomes

- KEGG is a collection of biological information compiled from published material curated database.

- Includes information on genes, proteins, metabolic pathways, molecular interactions, and biochemical reactions associated with specific organisms

- Provides a relationship (map) for how these components are organized in a cellular structure or reaction pathway.

The Good: Information is reliable!

The Bad: Information is not available for many organisms.Recall: Many used Rhodopirellula baltica as the reference genomefor our initial gene hunt (Why? It is in the same family as P. limnophilus)

Page 4: Annotation Presentation Week 7

Recall: You have already enteredthe information from the KEGGdatabase into your team notebook.If not, you should do it now.

NOTE: Refer to the week 2student presentation for details

Page 5: Annotation Presentation Week 7

Refers not to the team pathway,but to your individual gene.

This information may be foundon the Gene Detail page ordetermined directly from theKEGG pathway database.

Recording results in your individual notebook

Go to Gene Detail page for your gene

Page 6: Annotation Presentation Week 7

Scroll down to Pathway Information for your gene

OID 2500609751

Notice your genemay be involved inKEGG pathwaysother than the oneyou are annotatingwith your team.

Why? Your gene mayfunction in more thanone pathway in the cell.

• Sequentially click on the link for each pathway. Note the EC number if your ORF is an enzyme in a biochemical pathway or the gene name if your ORF is an protein component of a cellular complex so you can locate your ORF on the KEGG map.

• SAVE each KEGG map in PNG format then upload to your notebook.

Page 7: Annotation Presentation Week 7

OID 2500609751

Note the color scheme for KEGG maps differ from that obtained directly from database

EC 6.3.1.2

Page 8: Annotation Presentation Week 7

KEGG Pathways for EC 6.3.1.2

Page 9: Annotation Presentation Week 7

1- Modify heading

2- Insert KEGG map

3- Include comment

Recording results in your individual notebook

Page 10: Annotation Presentation Week 7

4- Repeat for all KEGG maps

Scrolldown

Recording results in your individual notebook

Page 11: Annotation Presentation Week 7

What if no KEGG pathways listed on Gene Detail page?

1- Try pathway text search

Enter part or all of the gene product name

“Click”

Page 12: Annotation Presentation Week 7

Pathway text search results

KEGG pathway names

KEGG maps

“Click”

Page 13: Annotation Presentation Week 7

Pathway text search results

“Click” for larger image

Inspect map for EC number or gene name corresponding to your gene

Page 14: Annotation Presentation Week 7

What if no KEGG pathways listed on Gene Detail page?

2- Try KEGG2 search

“Click”

Enter part or all of the gene product name

“Click”

Page 15: Annotation Presentation Week 7

KEGG2 Search Results

“Click”

Page 16: Annotation Presentation Week 7

KEGG2 Search Results

“Click”

Inspect map for EC number or gene name corresponding to your gene

Page 17: Annotation Presentation Week 7

MetaCycMetaCyc

- Database of nonredundant, experimentally elucidated metabolic pathways curated database

- Goal is to catalog the universe of metabolism by storing a representative sample of all pathways that have been experimentally elucidated

Caspi et al. (2008)NAR 36: D623-631.

Page 18: Annotation Presentation Week 7

Click the link in your notebook

Note that this databaseonly works properly inFirefox browser.

Page 19: Annotation Presentation Week 7

Select the database to be used for your search

“click”

Select Pirellula sp.(It is in the same familyas P. limnophilus)

“click”

Page 20: Annotation Presentation Week 7

Perform a BLAST search against Pirellula sp. genome

Note the databasename has changed

Reveal drop-down menuunder Search function tab

“click”

Page 21: Annotation Presentation Week 7

Note the BLASTsearch functions

are on the BioCyc site

Make sure you performa protein BLAST search

Copy/paste the amino acidsequence for your gene inFASTA format into query box

Change E-value to 0.01

Leave other settingsas default

“click”

Page 22: Annotation Presentation Week 7

Look at your top hit: - check for a positive bit score - confirm that the E-value is less than 10-3

- make sure identity 30 to 35% and that the alignment covers at least 50% of query sequence

Inspect your BLAST search results for significant hits

If the search does notproduce significant hits,or if the top hits havefairly low score & highE-value, try searching adifferent database:E. coli K-12 substr. MG1655

Alignment correspondsto only 73 / 470 = 16%

Page 23: Annotation Presentation Week 7

Note the BLASTsearch functionsfor E. coli are onthe EcoCyc site

“click”

Page 24: Annotation Presentation Week 7

Look at your top hit: - check for a positive bit score - confirm that the E-value is less than 10-3

- make sure identity 30 to 35% and that the alignment covers at least 50% of query sequence

Inspect your BLAST search results for significant hits

Alignment correspondsto only 470 / 470 = 100%

Click on link for genename for top hit thatmeets above criteria;otherwise write“No significant hits” inyour lab notebook

Page 25: Annotation Presentation Week 7

The results will look something like this..

- Compare the name of the search hit to the name of the protein on your Gene Detail page in img/edu

- For example: “glutamine synthetase” to “L-glutamine synthetase”

Page 26: Annotation Presentation Week 7

Examine the “Gene-Reaction Schematic”which you will copy into your lab notebook

The Gene-Reaction Schematic depicts the relationshipsamong a set of genes, enzymes, and reactions:

NOTE: A line indicates a relationshipbetween two objects. For example, a line from a gene to a circle indicates that gene encodes that product)

The letters on the rightrepresent the gene

The box on the left represents the enzymecatalyzing the reaction

The circle representsa polypeptide

or protein product

Scrolling over each of the symbols will

show a text box with additional

information

For more informationabout the schematic,click on the ? symbol

Page 27: Annotation Presentation Week 7

Modify your notebook as follows:

Page 28: Annotation Presentation Week 7

Enter the following informationinto your lab notebook

Gene information:Name & EC number

Gene-ReactionSchematic

To SAVE the reaction schematic:• Scroll over a LINE (not circle,

box, or surrounding space)• Right-click to bring up window

with option to “Save Image As”• SAVE in .gif format to upload

to the notebook.

Page 29: Annotation Presentation Week 7

Recording results in your notebook

Page 30: Annotation Presentation Week 7

EC NumberEC Number

Enzyme Commission (EC) Number is a seriesof 4 numbers describing enzymatic function:

1: Indicates which of the 6 classes the enzyme belongs: oxidoreductase, transferase, hydrolase, lyase, isomerase, ligase

2 and 3: Depends on the enzyme class For example, in oxidoreductases: 2 describes the substrate,| and 3 describes the acceptor)

4: Gives the specific enzyme activity

Page 31: Annotation Presentation Week 7

How do you determine the EC number?

Four options to try:

1- Inspect the img/edu Gene Detail page

2- Look at KEGG results

3- Examine the MetaCyc results

4- Search the Expasy database

We will go through each option.All should agree on the EC number

if assigned accurately by gene caller.

First ask yourself: Is my gene an enzyme or a structural gene?Only enzymes have EC numbers. . .

Page 32: Annotation Presentation Week 7

Inspect the img/edu Gene Detail page

Scrolldown

“click”

Page 33: Annotation Presentation Week 7

EC name

EC number

Page 34: Annotation Presentation Week 7

Look at KEGG results

EC name

EC number

“click”

Page 35: Annotation Presentation Week 7

Examine the MetaCyc results

EC name

EC number

Page 36: Annotation Presentation Week 7

Scenario:Scenario: By gene name/description, you know you havean enzyme to annotate but are not sure what MetaCyc orKEGG pathway it belongs to – what do you do?

Search the Expasy database

1- Go to Expasy at http://www.expasy.org/enzyme/

NOTE: You must access the websitevia the above URL. The link in the

notebook takes you to a different partof the Expasy site – do not use it.

Page 37: Annotation Presentation Week 7

2- Enter the name or description of your gene then press [Search] button

Page 38: Annotation Presentation Week 7

3- Results should produce a list of possible genes with more detailed descriptions & associated EC numbers

EC number

Page 39: Annotation Presentation Week 7

4- Depending on the description for your gene in P. limnophilusor other evidence you've accumulated through the quarter(see BLAST results for example), you may be able to narrowdown which one it is . . . Put some thought into it and see whatyou can come up with. . .

. . . If you click on the EC number for a candidate, it willtake you to a page with links to KEGG and MetaCyc.

click

Page 40: Annotation Presentation Week 7

5- Click the link to KEGG to see a detailed results page

click

NOTE: There also are links to other databases,including PubMed (MEDLINE) and MetaCyc.

Page 41: Annotation Presentation Week 7

6- On detailed results page, click [Show all] for KEGG reaction to see schematic

click

Page 42: Annotation Presentation Week 7

7- Inspect the reaction schematic(s). Is the equation consistent with proposed enzyme function?

Click to obtainhigh-resolutionimage

EC number

Click here for names ofcompounds in the reaction

EC name

Page 43: Annotation Presentation Week 7

8- Optional: You may add high-resolution image to your notebook. SAVE as .gif file and upload to your notebook.

Page 44: Annotation Presentation Week 7

REMEMBER: Not all genes will have an EC number. Only genes with enzyme function are assigned an EC number. If your gene ends with ‘ase’, it could be an enzyme. Remember, enzymes have catalytic function.

Recording results in your notebook