32
PREditor P redictive R NA Editor for Plant Mitochondrial Genes Jeff Mower

PREditor P redictive R NA Editor for Plant Mitochondrial Genes

  • Upload
    avel

  • View
    16

  • Download
    0

Embed Size (px)

DESCRIPTION

PREditor P redictive R NA Editor for Plant Mitochondrial Genes. Jeff Mower. What is RNA Editing? A process that alters the RNA sequence Nt insertion, deletion, or conversion Does not include RNA maturation processes. RNA Editing in Plants Occurs in mitochondria and chloroplasts - PowerPoint PPT Presentation

Citation preview

Page 1: PREditor P redictive  R NA  Editor  for  Plant Mitochondrial Genes

PREditorPredictive RNA Editor for Plant Mitochondrial Genes

Jeff Mower

Page 2: PREditor P redictive  R NA  Editor  for  Plant Mitochondrial Genes

What is RNA Editing?– A process that alters the RNA sequence

– Nt insertion, deletion, or conversion

– Does not include RNA maturation processes

Page 3: PREditor P redictive  R NA  Editor  for  Plant Mitochondrial Genes

RNA Editing in Plants– Occurs in mitochondria and chloroplasts

– C to U and U to C conversions

– Mechanism is not known

Page 4: PREditor P redictive  R NA  Editor  for  Plant Mitochondrial Genes

RNA Editing in Plants– In seed plants (conifers, flowering plants, etc.)

• Widespread in mitochondrion• Rare in chloroplast• Predominantly C to U

– In non-seed plants (mosses, ferns, etc.)• Frequent in mitochondrion and chloroplast• Both C to U and U to C are common

Page 5: PREditor P redictive  R NA  Editor  for  Plant Mitochondrial Genes

AUG UGAAGACGGUC CAAAAUCGU UCU UGCGGCGUA

M R N S V G C Q *

UGGC

Page 6: PREditor P redictive  R NA  Editor  for  Plant Mitochondrial Genes

AUG UGAAGACGGUC CAAAAUCGU UCU UGCGGCGUA

M R N S V G C Q *

UGGC

AUGAUG CAAAAUCGU UCU UGCGGCGUA

M V M R N S V G C Q *

GUC

Creation of new start codon

AG UGA UGGC

Page 7: PREditor P redictive  R NA  Editor  for  Plant Mitochondrial Genes

AUG UGAAGACGGUC CAAAAUCGU UCU UGCGGCGUA

M R N S V G C Q *

UGGC

AUGAUG CAAAAUUGU UUU UGCGGCGUA

M V M C N F V G C Q *

GUC

Alteration ofprotein sequence

Creation of new start codon

AG UGA UGGC

Page 8: PREditor P redictive  R NA  Editor  for  Plant Mitochondrial Genes

AUG UGAAGACGGUC CAAAAUCGU UCU UGCGGCGUA

M R N S V G C Q *

UGGC

AUGAUG CAAAAUUGU UUU UGCGGUGUA

M V M C N F V G C Q *

GUC

Alteration ofprotein sequence

Creation of new start codon

AG UGA UGGC

No effect onprotein sequence

Page 9: PREditor P redictive  R NA  Editor  for  Plant Mitochondrial Genes

AUG UGAAGACGGUC CAAAAUCGU UCU UGCGGCGUA

M R N S V G C Q *

UGGC

AUG UGAUGGCAUG UAAAAUUGU UUU UGCGGUGUA

M V M C N F V G C *

GUC

Alteration ofprotein sequence

Creation of new stop codon

Creation of new start codon

AG

No effect onprotein sequence

Page 10: PREditor P redictive  R NA  Editor  for  Plant Mitochondrial Genes

Identifying Edit Sites1. Determine experimentally

• Need to isolate and reverse transcribe RNA• Need multiple reads (editing is not always complete)

Page 11: PREditor P redictive  R NA  Editor  for  Plant Mitochondrial Genes

Identifying Edit Sites1. Determine experimentally

• Need to isolate and reverse transcribe RNA• Need multiple reads (editing is not always complete)

2. Predict based on sequence context• Upstream and downstream regions are important• Unambiguous motifs have not been identified

Page 12: PREditor P redictive  R NA  Editor  for  Plant Mitochondrial Genes

Identifying Edit Sites1. Determine experimentally

• Need to isolate and reverse transcribe RNA• Need multiple reads (editing is not always complete)

2. Predict based on sequence context• Upstream and downstream regions are important• Unambiguous motifs have not been identified

3. Predict based on protein conservation• Proteins are more conserved after editing • Editing tends to “correct” amino acid sequences

Page 13: PREditor P redictive  R NA  Editor  for  Plant Mitochondrial Genes

PREditor Methodology – Aligned sequence database (ASD) Construction

• 363 DNA sequences RNA editing information is known Organisms do not perform RNA editing

• Proteins were translated using the editing information

• Homologous proteins were aligned 42 different alignments All known mt proteins are covered

Page 14: PREditor P redictive  R NA  Editor  for  Plant Mitochondrial Genes

PREditor Methodology – Input Sequence Manipulation

• Accept a protein-coding DNA sequence as input• Translate input sequence• Align translation to homologous proteins in ASD

Page 15: PREditor P redictive  R NA  Editor  for  Plant Mitochondrial Genes

PREditor Methodology – The Underlying Principle

• ASD sequences translated from edited RNA• Input sequence translated from unedited DNA• “Where can RNA editing in the input sequence increase

conservation to the ASD sequences?”

Page 16: PREditor P redictive  R NA  Editor  for  Plant Mitochondrial Genes

ASD1 A T L G

ASD2 E M L G

ASD3 A M L G

Page 17: PREditor P redictive  R NA  Editor  for  Plant Mitochondrial Genes

ASD1 A T L G

ASD2 E M L G

ASD3 A M L G

Input E (GAA) T (ACG) P (CCU) G (GGC)

Page 18: PREditor P redictive  R NA  Editor  for  Plant Mitochondrial Genes

ASD1 A T L G

ASD2 E M L G

ASD3 A M L G

Input E (GAA) T (ACG) P (CCU) G (GGC)

Unedited E (GAA) 0.33

Edited N/A N/A

Page 19: PREditor P redictive  R NA  Editor  for  Plant Mitochondrial Genes

ASD1 A T L G

ASD2 E M L G

ASD3 A M L G

Input E (GAA) T (ACG) P (CCU) G (GGC)

Unedited E (GAA) 0.33 T (ACG) 0.33

Edited N/A N/A M (AUG) 0.67

Page 20: PREditor P redictive  R NA  Editor  for  Plant Mitochondrial Genes

ASD1 A T L G

ASD2 E M L G

ASD3 A M L G

Input E (GAA) T (ACG) P (CCU) G (GGC)

Unedited E (GAA) 0.33 T (ACG) 0.33 P (CCU) 0.0

Edited N/A N/A M (AUG) 0.67 S (UCU) 0.0

L (CUU) 1.0

F (UUU) 0.0

Page 21: PREditor P redictive  R NA  Editor  for  Plant Mitochondrial Genes

ASD1 A T L G

ASD2 E M L G

ASD3 A M L G

Input E (GAA) T (ACG) P (CCU) G (GGC)

Unedited E (GAA) 0.33 T (ACG) 0.33 P (CCU) 0.0 G (GGC) 1.0

Edited N/A N/A M (AUG) 0.67 S (UCU) 0.0 G (GGU) 1.0

L (CUU) 1.0

F (UUU) 0.0

Page 22: PREditor P redictive  R NA  Editor  for  Plant Mitochondrial Genes

Performance Analysis – Remove one protein sequence from the database

– Use the unedited DNA sequence as input

– Calculate statistics • Accuracy = (TP + TN) / (TP + FP + TN + FN)• Sensitivity = TP / (TP + FN)• Specificity = TN / (TN + FP)

– Repeat for each sequence

Page 23: PREditor P redictive  R NA  Editor  for  Plant Mitochondrial Genes

Performance Analysis– Total # C’s = 58,982

– True edited sites = 3,548 (6.0%)• TP = 2,922• FN = 626

– True non-edited sites = 55,434• TN = 54,829• FP = 605

Page 24: PREditor P redictive  R NA  Editor  for  Plant Mitochondrial Genes

Performance Analysis – Sensitivity = 82.4%

• Proportion of true edited sites that were predicted correctly• Increases to 94.6% if you ignore missed silent edited sites

– Specificity = 98.9%• Proportion of true non-edited sites that were predicted correctly

– Accuracy = 97.9%• Proportion of all sites that were predicted correctly• Increases to 98.7% if you ignore missed silent edited sites

Page 25: PREditor P redictive  R NA  Editor  for  Plant Mitochondrial Genes

Limitations– Cannot predict editing at silent sites

• 458 of 626 FN are at silent sites

Page 26: PREditor P redictive  R NA  Editor  for  Plant Mitochondrial Genes

Limitations– Cannot predict editing at silent sites

• 458 of 626 FN are at silent sites

– Not a major problem in practice• Silent editing sites do not affect the protein sequence • Many silent sites are only occasionally edited• Only 13% of editing sites are silent (expect ~38%)

Page 27: PREditor P redictive  R NA  Editor  for  Plant Mitochondrial Genes

Limitations– Aligned sequence database is skewed

Origin of

RNA editing

Angiosperms 266 (74%)

Gymnosperms 2 (1%)

Ferns 0

Horsetails 0

Hornworts 0

Mosses 0

Liverworts 32 (9%)

Charophytes 59 (16%)

Chlorophytes ―

Page 28: PREditor P redictive  R NA  Editor  for  Plant Mitochondrial Genes

Limitations– The skewed database effect

ASD1 Z

ASD2 Z

ASD3 Z

ASD4 Z

ASD4 X

Input X or Z?

Page 29: PREditor P redictive  R NA  Editor  for  Plant Mitochondrial Genes

Ongoing Work– Reducing the skewed database effect

• Weighted sequences and phylogenetics

ASD1 Z

ASD2 Z

ASD3 Z

ASD4 Z

ASD4 X

Input X!

Page 30: PREditor P redictive  R NA  Editor  for  Plant Mitochondrial Genes

Ongoing Work – Making the online resource more appealing

– Making the online resource more user-friendly

Page 31: PREditor P redictive  R NA  Editor  for  Plant Mitochondrial Genes

Future Directions – Increase diversity in the ASD

– Analyze sequence context using the ASD

– Apply methodology to editing in chloroplasts

– Apply methodology to U to C editing

Page 32: PREditor P redictive  R NA  Editor  for  Plant Mitochondrial Genes

Thanks – Jeff Palmer

– Sun Kim

– Danny Rice and other members of the Palmer lab