24
Homotopy Optimization Methods and Protein Structure Prediction Daniel M. Dunlavy Applied Mathematics and Scientific Computation University of Maryland, College Park

Homotopy Optimization Methods and Protein Structure Prediction Daniel M. Dunlavy Applied Mathematics and Scientific Computation University of Maryland,

Embed Size (px)

Citation preview

Page 1: Homotopy Optimization Methods and Protein Structure Prediction Daniel M. Dunlavy Applied Mathematics and Scientific Computation University of Maryland,

Homotopy Optimization Methods and Protein Structure Prediction

Daniel M. Dunlavy Applied Mathematics and Scientific Computation

University of Maryland, College Park

Page 2: Homotopy Optimization Methods and Protein Structure Prediction Daniel M. Dunlavy Applied Mathematics and Scientific Computation University of Maryland,

2/19

Protein Structure Prediction

Ala Arg Asp Gly Arg

Protein StructureAmino Acid Sequence

C CN

R

H OH

Given the amino acid sequence of a protein (1D), is it possible to predict it’s native structure (3D)?

Page 3: Homotopy Optimization Methods and Protein Structure Prediction Daniel M. Dunlavy Applied Mathematics and Scientific Computation University of Maryland,

3/19

Protein Structure Prediction• Given:

– Protein model• Properties of constituent particles • Potential energy function (force field)

• Goal:– Predict native (lowest energy) conformation

• Thermodynamic hypothesis [Anfinsen, 1973]

– Develop hybrid method, combining:• Energy minimization [numerical optimization]

• Comparative modeling [bioinformatics]

– Use template (known structure) to predict target structure

Page 4: Homotopy Optimization Methods and Protein Structure Prediction Daniel M. Dunlavy Applied Mathematics and Scientific Computation University of Maryland,

4/19

• Backbone model– Single chain of particles with residue attributes

– Particles model C atoms in proteins

• Properties of particles– Hydrophobic, Hydrophilic, Neutral – Diverse hydrophobic-hydrophobic interactions

Protein Model: Particle Properties

[Veitshans, Klimov, and Thirumalai. Protein Folding Kinetics, 1996.]

Page 5: Homotopy Optimization Methods and Protein Structure Prediction Daniel M. Dunlavy Applied Mathematics and Scientific Computation University of Maryland,

5/19

Potential Energy Function

Page 6: Homotopy Optimization Methods and Protein Structure Prediction Daniel M. Dunlavy Applied Mathematics and Scientific Computation University of Maryland,

6/19

Potential Energy Function

Page 7: Homotopy Optimization Methods and Protein Structure Prediction Daniel M. Dunlavy Applied Mathematics and Scientific Computation University of Maryland,

7/19

• Goal– Minimize energy function of target protein:

• Steps to solution– Energy of template protein:– Define a homotopy function:

• Deforms template protein into target protein

– Produce sequence of minimizers of starting at and ending at

Homotopy Optimization Method (HOM)

Page 8: Homotopy Optimization Methods and Protein Structure Prediction Daniel M. Dunlavy Applied Mathematics and Scientific Computation University of Maryland,

8/19

Energy Landscape Deformation Dihedral Terms

Page 9: Homotopy Optimization Methods and Protein Structure Prediction Daniel M. Dunlavy Applied Mathematics and Scientific Computation University of Maryland,

9/19

Illustration of HOM

Page 10: Homotopy Optimization Methods and Protein Structure Prediction Daniel M. Dunlavy Applied Mathematics and Scientific Computation University of Maryland,

10/19

Homotopy Optimization using Perturbations & Ensembles (HOPE)

• Improvements over HOM– Produces ensemble of sequences of

local minimizers of by perturbing intermediate results

– Increases likelihood of predicting global minimizer

• Algorithmic considerations– Maximum ensemble size– Determining ensemble members

Page 11: Homotopy Optimization Methods and Protein Structure Prediction Daniel M. Dunlavy Applied Mathematics and Scientific Computation University of Maryland,

11/19

Illustration of HOPEMaximum ensemble size = 2

Page 12: Homotopy Optimization Methods and Protein Structure Prediction Daniel M. Dunlavy Applied Mathematics and Scientific Computation University of Maryland,

12/19

Numerical Experiments9 chains (22 particles) with known structure

Loop Region

Hydrophobic HydrophilicNeutral

ABCDE F GH I

A B C D E F G H I

A 100

B 77 100

C 86 91 100

D 91 86 77 100

E 73 82 73 82 100

F 68 68 59 77 86 100

G 68 68 59 77 86 100 100

H 68 68 59 77 86 100 100 100

I 73 59 64 68 77 73 73 73 100

Sequence Homology (%)

Page 13: Homotopy Optimization Methods and Protein Structure Prediction Daniel M. Dunlavy Applied Mathematics and Scientific Computation University of Maryland,

13/19

Numerical Experiments

Page 14: Homotopy Optimization Methods and Protein Structure Prediction Daniel M. Dunlavy Applied Mathematics and Scientific Computation University of Maryland,

14/19

Numerical Experiments• 62 template-target pairs

– 10 pairs had identical native structures

• Methods– HOM vs. Newton’s method w/trust region (N-TR)

– HOPE vs. simulated annealing (SA)• Different ensemble sizes (2,4,8,16)

• Averaged over 10 runs

• Perturbations where sequences differ

• Measuring success– Structural overlap function:

• Percentage of interparticle distances off by more than 20% of the average bond length ( )

– Root mean-squared deviation (RMSD)

Ensemble SABasin hoppingT0 = 105

Cycles = 10Berkeley schedule

Page 15: Homotopy Optimization Methods and Protein Structure Prediction Daniel M. Dunlavy Applied Mathematics and Scientific Computation University of Maryland,

15/19

Structural Overlap Function

NativePredicted

Page 16: Homotopy Optimization Methods and Protein Structure Prediction Daniel M. Dunlavy Applied Mathematics and Scientific Computation University of Maryland,

16/19

RMSD

Measures the distance between corresponding particles in the predicted and lowest energy conformations when they are optimally superimposed.

where is a rotation and translation of

Page 17: Homotopy Optimization Methods and Protein Structure Prediction Daniel M. Dunlavy Applied Mathematics and Scientific Computation University of Maryland,

17/19

Results

MethodEnsemble

Size c = 0 Success Mean cMean RMSD

Time (sec)

HOPE 2 33.40 0.54 0.14 0.17 354 43.10 0.70 0.08 0.11 658 54.60 0.88 0.03 0.04 115

16 59.00 0.95 0.01 0.02 200

SA 2 13.10 0.21 0.27 0.36 524 20.80 0.34 0.19 0.26 1078 28.50 0.46 0.13 0.19 22916 40.20 0.65 0.08 0.12 434

Method c = 0 Success Mean cMean RMSD

Time (sec)

HOM 15 0.24 0.36 0.38 10N-TR 4 0.06 0.45 0.55 1

Page 18: Homotopy Optimization Methods and Protein Structure Prediction Daniel M. Dunlavy Applied Mathematics and Scientific Computation University of Maryland,

18/19

ResultsSuccess of HOPE and SA with ensembles of size 16 for each template-target pair. The size of each circle represents the percentage of successful predictions over the 10 runs.

SAHOPE

Page 19: Homotopy Optimization Methods and Protein Structure Prediction Daniel M. Dunlavy Applied Mathematics and Scientific Computation University of Maryland,

19/19

Conclusions

• Homotopy optimization methods– More successful than standard minimizers

• HOPE– For problems with readily available– Solves protein structure prediction problem– Outperforms ensemble-based simulated annealing

• Future work– Protein Data Bank (templates), TINKER (energy)– Convergence analysis for HOPE

Page 20: Homotopy Optimization Methods and Protein Structure Prediction Daniel M. Dunlavy Applied Mathematics and Scientific Computation University of Maryland,

20/19

Acknowledgements

• Dianne O’Leary (UM)– Advisor

• Dev Thirumalai (UM), Dmitri Klimov (GMU)– Model, numerical experiments

• Ron Unger (Bar-Ilan)– Problem formulation

• National Library of Medicine (NLM)– Grant: F37-LM008162

Page 21: Homotopy Optimization Methods and Protein Structure Prediction Daniel M. Dunlavy Applied Mathematics and Scientific Computation University of Maryland,

21/19

Thank You

Daniel Dunlavy – HOPEhttp://www.math.umd.edu/~ddunlavy

[email protected]

Page 22: Homotopy Optimization Methods and Protein Structure Prediction Daniel M. Dunlavy Applied Mathematics and Scientific Computation University of Maryland,

22/19

HOPE Algorithm

Page 23: Homotopy Optimization Methods and Protein Structure Prediction Daniel M. Dunlavy Applied Mathematics and Scientific Computation University of Maryland,

23/19

Homotopy Parameter Functions• Split low/high frequency dihedral terms

• Homotopy parameter functions for each term

Page 24: Homotopy Optimization Methods and Protein Structure Prediction Daniel M. Dunlavy Applied Mathematics and Scientific Computation University of Maryland,

24/19

Different for individual energy terms

Homotopy Function for Proteins

Template Target