15
Doshisha Univ. Japan GECCO2002 Energy Minimization of Protein Tertiary Structure by Parallel Simulated Annealing using Genetic Crossover Takeshi Yoshida Tomoyuki Hiroyasu Mitsunori Miki Maki Ogura Doshisha University, Kyoto, Japa Yuko Okamoto Institute for Molecular Science, Aichi, Ja Doshisha Univ. Japan Doshisha Univ. Japan GECCO2002 GECCO2002

Doshisha Univ. JapanGECCO2002 Energy Minimization of Protein Tertiary Structure by Parallel Simulated Annealing using Genetic Crossover Takeshi YoshidaTomoyuki

Embed Size (px)

Citation preview

Doshisha Univ. JapanGECCO2002

Energy Minimization of Protein Tertiary Structureby Parallel Simulated Annealing

using Genetic Crossover

Takeshi Yoshida Tomoyuki HiroyasuMitsunori Miki Maki Ogura

Doshisha University, Kyoto, Japan

Yuko OkamotoInstitute for Molecular Science, Aichi, Japan

Doshisha Univ. JapanDoshisha Univ. JapanGECCO2002GECCO2002

Doshisha Univ. JapanDoshisha Univ. JapanGECCO2002GECCO2002

BackgroundProtein tertiary structure is closely related with biological function.

• lead to development new medicines.• lead to manifestation mechanism of pathology.

Prediction of protein tertiary structure

Molecular simulation

• high searching ability

• need huge calculate time

Apply Heuristic method        to this problem.

Doshisha Univ. JapanDoshisha Univ. JapanGECCO2002GECCO2002

Protein Tertiary Structure

Energy function of protein

Folding protein to stable state

Tertiary StructureAmino acid array

Analyzing structureas Optimization problem

• Protein structure naturally exist with lowest energy

• Protein is composed of an array with 20 amino acids. • Amino acid array is folding to the lowest energy

Doshisha Univ. JapanDoshisha Univ. JapanGECCO2002GECCO2002

Simulated Annealing

High temperature

Local minimum

Global minimum

Low temperatureBy the parameters, SA’s searching point moves to worse a state in a certain probability.

T : Temperature of current step

Probability; P = - (Enext- Ecurrent)

Texp

SA is often applied to prediction of Protein tertiary structure.

SA has parameters, those are a temperature and a step.

Doshisha Univ. JapanDoshisha Univ. JapanGECCO2002GECCO2002

Energy function of Protein

Minimum in local areaMinimum in local area

Energy function of Protein has many minima in local areaAnd a few minima in global area.

Minimum in global areaMinimum in global area

No success result has ever prediction of protein using SA

Doshisha Univ. JapanDoshisha Univ. JapanGECCO2002GECCO2002

Purpose of this study

Parallel SA using Genetic Crossover(PSA/GAc)

• Genetic Algorithm(GA) is good at searching a solution in a wider area.(global search)

Parallel SA

GA operation+

PSA/GAc is a hybrid method of SA with GA operation

• Simulated Annealing(SA) is good at searching a solution in narrower area.(local search)

PSA/GAc is good at searching not only locally but also globally. We apply PSA/GAc to Protein tertiary structure.

Doshisha Univ. JapanDoshisha Univ. JapanGECCO2002GECCO2002

Modeling of Protein tertiary structure

Design variable : dihedral angle between main chain and side chain.

Minimize energy function of protein

Protein is composed of an array with 20 amino acids.

Changing dihedral angle

dihedral angle

Doshisha Univ. JapanDoshisha Univ. JapanGECCO2002GECCO2002

PSA/GAcPSA/GAc is based on Parallel SA.

SA

SA

SA

crossover

1 2 3

1 2 3

1 2 3

1 2 3

1 2 3

1 2 3

1 2 3

1 2 3

SA

cross

over

cross

over

en

d

temperaturetemperaturehighhigh lowlow

n n nn : crossover interval

Searching points : individuals Total number of SA : Population size

Genetic Crossover is used to exchange the information of individuals.

n

Doshisha Univ. JapanDoshisha Univ. JapanGECCO2002GECCO2002

PSA/GAc

Genetic Crossover is performed as follows. e.q continuous optimization problem(3 dimensions)

1 2 3parent1

1 2 3parent2

crossover

1 2 3

1 2 3

child1

child2

cross point

-1.3

-1.8

-1.1

-2.0

energy

1 2 3

1 2 3

Nextsearching points

Doshisha Univ. JapanDoshisha Univ. JapanGECCO2002GECCO2002

PSA/GAcPSA/GAc is based on Parallel SA.

SA

SA

SA

crossover

1 2 3

1 2 3

1 2 3

1 2 3

1 2 3

1 2 3

1 2 3

1 2 3

SA

cross

over

cross

over

en

d

temperaturetemperaturehighhigh lowlow

n n nn : crossover interval

Searching points : individuals Total number of SA : Population size

Each process reduce temperature from high to low as parameter of SA.

n

Doshisha Univ. JapanDoshisha Univ. JapanGECCO2002GECCO2002

Target Protein Structures

C-peptide ; 13 amino acidsIn case of using ECEPP/2 program,

Protein for numerical example

• Lowest-energy conformation

E < - 42 kcal/mol [okamoto,1991]

64 dihedral angles

• Design variables

• There are 64 times annealing per 1MCsweep

C-peptide structure

Doshisha Univ. JapanDoshisha Univ. JapanGECCO2002GECCO2002

Num of Processors

Parameters

Parameter

Population sizeInitial temperature

Crossover intervalRange size

Value

2.0(100k)24

0.1(50K)32

6

Last temperature

180 (180 0.3)

We tried two types of simulations.

Simulation1 : 4164 MCsweeps and 10 trials. (100,000 MCsweeps totally)

Simulation2 : 41646 MCsweeps and 7 trials.

Doshisha Univ. JapanDoshisha Univ. JapanGECCO2002GECCO2002

Optimum [Okamoto, 1991]

Result : Energy

PSA/GAc Simulation1

Energy (kcal/mol)

- 46.7

• To derive a good solution , PSA/GAc with long MCsweep annealing is more effective than small Mcsweep annealing.

PSA/GAc Simulation2 - 57.8

- 42

Simulation Type

• PSA/GAc has high searching ability in predicting protein tertiary structure problem

Doshisha Univ. JapanDoshisha Univ. JapanGECCO2002GECCO2002

Result : Protein structure

Simulation1: -46.7kcal/mol

• To derive a good solution , PSA/GAc with long MCsweep annealing is more effective than small Mcsweep annealing.

Simulation2: -57.8kcal/mol

• PSA/GAc has high searching ability in the prediction of protein tertiary structure.

Doshisha Univ. JapanDoshisha Univ. JapanGECCO2002GECCO2002

Conclusion

This study show a new hybrid method, Parallel SimulatedAnnealing using Genetic Crossover(PSA/GAc).

We apply PSA/GAc to energy function of protein, this result shows that PSA/GAc has good searching abilityfor prediction of protein tertiary structure.

PSA/GAc has follow features

• use Genetic crossover to exchange the information between the individuals

• is good at searching not only locally but also globally.

• is based on Parallel SA, so calculate time is less than sequential SA.