Lecture 7: RNA foldingcs425/fall19/slides/... · Lecture 7: RNA folding Chapter 6 – Problem 6.51...

Preview:

Citation preview

Lecture7:RNAfolding

Chapter6–Problem6.51inJonesandPevznerandtheTurnermodel

Fall2019

September19,2019

RNABasics

2

�  RNAbasesA,C,G,U�  CanonicalBasePairs◦  A-U◦  G-C◦  G-U“wobble”pairing◦  Basescanonlypairwithoneotherbase.

Image: http://www.bioalgorithms.info/

RNAStructuralLevels

3

Primary AAUCG...CUUCUUCCA Primary

Secondary Tertiary

RNASecondaryStructure

4

Hairpin loop Junction (Multiloop)

Bulge Loop

Single-Stranded

Internal Loop

Stack

Pseudoknot

BasePairMaximization

5

U

C

C

A G

G

A

C

Zuker (1981) Nucleic Acids Research 9(1) 133-149

BasePairMaximization–DynamicProgrammingAlgorithm

6

Simple Example: Maximizing Base Pairing

BasePairMaximization–DynamicProgrammingAlgorithm

7

S(i,j) is the folding of the subsequence of the RNA strand from index i to index j which results in the highest number of base pairs

BasePairMaximization–DynamicProgrammingAlgorithm

8

BasePairMaximization–DynamicProgrammingAlgorithm

9

BasePairMaximization–DynamicProgrammingAlgorithm

10

BasePairMaximization–DynamicProgrammingAlgorithm

11

CircularRepresentation

12 Images – David Mount

Pseudoknots

13

�  PseudoknotscauseabreakdowninthepresentedDynamicProgrammingAlgorithm.

�  Inordertoformapseudoknot,checksmustbemadetoensurebaseisnotalreadypaired–thisbreaksdownthedivideandconquerrecurrencerelations.

Images – David Mount

SimplifyingAssumptions

•  RNAfoldsintooneminimumfree-energystructure.

•  Therearenoknots(basepairsnevercross).•  Theenergyofaparticularbasepairinadoublestrandedregionissequenceindependent.•  Neighborsdonotinfluencetheenergy.

• Wassolvedbydynamicprogramming,ZuckerandSteigler1981

14

SequenceDependentBasePairEnergyValues(NearestNeighborModel)

15

U U C G G C A U G C A UCGAC 3’ 5’

U U C G U A A U G C A UCGAC 3’

5’

Example values: GC GC GC GC AU GC CG UA -2.3 -2.9 -3.4 -2.1

FreeEnergyComputation(NearestNeighborModel)

16

U U A A G C G C A G C U A A U C G A U A 3’ A 5’

-0.3

-0.3

-1.1 mismatch of hairpin -2.9 stacking

+3.3 1nt bulge -2.9 stacking

-1.8 stacking

5’ dangling

-0.9 stacking -1.8 stacking -2.1 stacking

G= - 4.9 kcal/mol

+5.9 4 nt loop

RNASecondaryStructure

17

Stack

NearestNeighborModel

•  Stackingenergy-assignnegativeenergiestothesebetweenbasepairregions.•  Energyisinfluencedbythenearestclosingbasepair•  Theseenergiesareestimatedexperimentallyfromsmall

syntheticRNAs.

•  Positiveenergy-addedforlowentropyregionssuchasbulges,loops,etc.

18

RNASecondaryStructure

19

Hairpin loop

NearestNeighborModel

•  Hairpinenergy:•  Experimentallymeasuredforhairpinsoflength5,6,7,8,…

uptoamaximum.Extrapolationabovethemaximum.

•  Theclosingpairaffectstheenergy.DistinguishbetweenA-UandC-G.

20

RNASecondaryStructure

21

Bulge Loop

Internal Loop

NearestNeighborModel

•  Bulge/Internalenergy:•  LetL1,L2denotethelengthsofthetwosidesofthebulge/

internalloop.

•  ExperimentallymeasuredfordifferentvaluesofL1,L2.

•  Inpracticeforcomputationalconvenience,theenergyisgivenasfunctionofL1+L2byalookuptableandextrapolation.

22

RNASecondaryStructure

23

Junction (Multiloop)

NearestNeighborModel

•  Multiloopenergy:•  LetUdenotethenumberofunpairedbases.

•  LetPdenotethenumberofbasepairs.•  ThefreeenergyisanaffinefunctionofUandP:

a1 + a2 U + a3 P.

•  ThisistheleastaccuratecomponentoftheNNmodel.

24

Recommended