Upload
others
View
1
Download
0
Embed Size (px)
Citation preview
Computer Simulations of
Protein-Peptide Complexes Using the
Myosin Light Chain Protein as a Model
Thesis submitted for the degree
"Doctor of Philosophy"
by
Assaf Ganoth
Submitted to the Senate of Tel-Aviv University
July 2006
II
This work was carried out under the supervision of
Prof. Menachem Gutman
III
ACKNOWLEDGMENTS
It has been a long journey. During the journey, I was fortunate to come in touch
with a variety of people who helped me to pursuit my dreams.
It is my pleasure to thank them.
Prof. Hemi Gutman,
For guidance, unconditional support, brainstorm sessions and, most important,
believing in me. His special personality and profound knowledge accompanied me
throughout the research. I doubt I will ever be able to convey my appreciation fully,
but I owe him my scientific career.
Dr. Eti Nachliel,
For continuous help, professional advices, inspirational ideas, and endless assistance.
Ran Friedman,
For teaching me Molecular Dynamics, expert knowledge, being for me whenever I
encountered scientific problems or difficulties, and for bicycle trips.
Elad Project,
For assistance with Molecular Dynamics simulations, programming skills, and
unlimited help with solving computer problems.
Dr. Dani Canaani, Dr. Yossi Tsfadia and Prof. Rimona Margalit,
For professional and moral support during the last years. Each provided unique
insights and challenged my thinking, substantially contributing to a successful
completion of this thesis.
Finally, I would like to express my deepest gratitude to my friends in the lab, past and
present, for close collaboration: Anna Seltzer, Assaf Amitay, Cintia Sbarsky, Dana
Baron, Eran Bosis, Tom Mark, Aviv Mezer, Limor Radozkowicz, and Yael Rabin.
IV
"Two roads diverged in a wood, and I-
- I took the one less traveled by,
And that has made all the difference"
Robert Lee Frost (1874-1963)
V
TABLE OF CONTENTS
ABSTRACT IX
LIST OF ABBREVIATIONS XII
I. INTRODUCTION
1. Biological and Biochemical Background................................................................ 1
1.1 The Myosin Family......................................................................................... 1
1.2 The Myosin V Protein..................................................................................... 2
1.3 The Mlc1p Protein.......................................................................................... 5
1.4 IQ Motif Peptides............................................................................................ 6
1.5 The Calmodulin Protein.................................................................................. 7
1.6 Calmodulin-Peptide Complexes..................................................................... 8
1.7 Mlc1p-IQ Complexes...................................................................................... 9
2. Computer Simulations............................................................................................. 9
2.1 Molecular Dynamics....................................................................................... 10
2.1.1 General Overview.................................................................................. 10
2.1.2 Methodology.......................................................................................... 11
2.1.3 Molecular Dynamics of Calmodulin...................................................... 13
2.1.4 Molecular Dynamics of Calmodulin's Complexes................................. 14
2.2 Free Energy..................................................................................................... 14
2.2.1 Free Energy of Interaction..................................................................... 15
2.2.2 The MM-PBSA Approach..................................................................... 15
3. Significance of Study.............................................................................................. 16
VI
II. METHODS
1. MD Simulations...................................................................................................... 18
1.1 Simulations of Protein-Peptide Complexes.................................................... 18
1.1.1 Mlc1p-IQ2 Complex.............................................................................. 18
1.1.2 Mlc1p-IQ4 Complex.............................................................................. 19
1.2 Simulations of Peptides................................................................................... 19
1.2.1 IQ2 Peptide............................................................................................ 19
1.2.2 IQ4 Peptide............................................................................................ 20
2. Visual Presentations................................................................................................ 20
3. Inter-Helical Angles................................................................................................ 21
4. Dihedral Angle Calculations................................................................................... 21
5. The Electrostatic Potential Around the Peptides..................................................... 21
6. The Protein-Peptide Interaction Free Energies....................................................... 22
6.1 Molecular Mechanics Calculations................................................................. 23
6.2 Polar Solvation Calculations........................................................................... 23
6.3 Nonpolar Solvation Calculation...................................................................... 24
6.4 Entropy Calculations....................................................................................... 25
III. RESULTS & DISCUSSION
1. Simulations of Protein-Peptide Complexes............................................................ 26
1.1 Mlc1p-IQ2 Complex....................................................................................... 26
1.1.1 The Crystallographic and Simulated Structures..................................... 26
1.1.2 The Dynamics of the Protein-Peptide Complexes................................. 27
1.1.3 Summary................................................................................................ 29
VII
1.2 Mlc1p-IQ4 Complex....................................................................................... 30
1.2.1 Overall Conformational Changes During the Simulation...................... 30
1.2.2 Relative Rotation of the Helices During the Simulation........................ 34
1.2.3 The Structural Characteristics of the Compaction Event....................... 37
1.2.4 The Forces that Stabilize the Refolded Model Structure of the
Complex.................................................................................................
39
1.2.5 The Interactions Between Residues During the Structural Change of
the Complex...........................................................................................
44
1.2.6 MD Simulation of the Mlc1p-IQ4 Complex at 400 K........................... 47
1.2.7 Summary................................................................................................ 49
2. Comparison Between the Mlc1p-IQ2 and the Mlc1p-IQ4 Complexes................... 52
2.1 Crystallographic and Simulated Structures Comparison................................ 52
2.2 Structural Evolution of the Simulated Mlc1p Protein at the Protein-Peptide
Complexes.......................................................................................................
54
2.3 The Root Mean Square Fluctuation (RMSF) of the Mlc1p Protein at the
Protein-Peptide Complexes.............................................................................
57
2.4 The Electrostatic Field Around the IQ Peptides............................................. 60
2.5 The Protein-Peptide Interaction Free Energies............................................... 62
2.6 The Contacts Between the Protein and the Peptides....................................... 67
2.7 Summary......................................................................................................... 70
3. Simulations of Free IQ Peptides............................................................................. 72
3.1 Synopsis of the Presented Simulations........................................................... 72
3.2 Overall Conformational Changes During the Simulations............................. 73
3.3 Structural Characteristics of the Refolding Process of the IQ Peptides.......... 78
3.4 Secondary Structures of the IQ Peptides......................................................... 81
VIII
3.5 Salt Bridges Analysis of the IQ4 Peptide Simulation..................................... 84
3.6 Dynamics of the IQ4 Peptide at Different Salt Concentrations...................... 86
3.7 Summary......................................................................................................... 88
4. The Structure of the Light Chain-Binding Domain (LCBD) of Myosin V............ 89
4.1 The Current Structural Model......................................................................... 89
4.2 Structure and Dynamics of an Extended IQ4 Peptide..................................... 92
4.3 The Suggested Solution Model....................................................................... 94
4.4 Summary......................................................................................................... 96
IV. OVERALL GENERAL DISCUSSION................................................ 98
V. SUPPLEMENT............................................................................................... 102
1.1 Articles............................................................................................................ 102
(I) Ganoth, A., E. Nachliel, R. Friedman, and M. Gutman. 2006.
Molecular dynamics study of a calmodulin-like protein with an IQ peptide:
Spontaneous refolding of the protein around the peptide. Proteins 64: 133-146
(II) Ganoth, A., R. Friedman, E. Nachliel, and M. Gutman. 2006. A
molecular dynamics study and free energy analysis of complexes between the
Mlc1p protein and two IQ motif peptides. Biophys J 91:2436-2450
REFERENCES..................................................................................................... 103
IX
ABSTRACT
The Mlc1p protein, which is a member of the Calmodulin family, is an
essential component of the mechano-chemical myosin system. At the budding yeast
Saccharomyces cerevisiae, six Mlc1p proteins bind the neck domain of myosin V,
composed of six IQ motif peptides. The structures of a few Mlc1p complexes with IQ
peptides had been resolved by X-ray crystallography. The present thesis expands our
knowledge, by investigating, through molecular dynamics simulations and subsequent
calculations, of the structures, dynamics and energetics of Mlc1p-IQ complexes.
Besides grossly extending the structural data embedded in the crystalline structures,
our study provides atomistic understanding about the movement mode of myosin V
over the actin filament. We found out how bending of a specific site, located on the
neck of the myosin, can serve as the flexible joint along the myosin, securing a proper
function of its stroke lever arm.
The thesis includes four main chapters: (I) Molecular dynamics simulations of
the IQ2 and the IQ4 peptides in a complex with the Mlc1p protein; (II) A detailed
comparison between the simulations of both Mlc1p-IQ complexes; (III) Molecular
dynamics simulations of free IQ peptides; (IV) Reevaluation of the structure of the
light chain-binding domain of myosin V.
In the first chapter of the thesis, we present molecular dynamics simulations of
the Mlc1p-IQ2 and the Mlc1p-IQ4 complexes, following their relaxation in a
physiological salt solution. The Mlc1p-IQ2 complex relaxed without loosing its main
packing features, exhibiting a limited conformational change throughout the
simulation. The other complex, with the IQ4 peptide, experienced a major refolding
process, where the protein transformed its conformation from an extended to a
compact one, and the peptide was snapped into two sections. In the second chapter of
X
the thesis, we offer a detailed comparative study between the molecular dynamics
simulations of both complexes. We performed a comprehensive comparison between
the structure and the dynamics of the Mlc1p-IQ complexes and an extensive
interaction free energy analysis. The latter includes an analysis of the various forces
operating on the protein-peptide complexes by indicating their specific contributions.
In the third chapter of the thesis, molecular dynamics simulations of free IQ peptides
at different conditions for various durations are given. Variations between the
conformations of the bound and the free peptides, and differences between the
configurations that the two IQ peptides make take in solution, are discussed. The
results of the molecular dynamics simulations of the protein-peptide complexes and
the free peptides enable to assess the structural model of the light chain-binding
domain of myosin V, being composed of six complexes between light chain proteins
(such as the Mlc1p) and IQ peptides, presented in the fourth chapter of the thesis. At
the core of our suggested model stands the notion that the light chain-binding domain
is a dynamic cellular entity, and hence the proposed model incorporates the ability of
the Mlc1p protein and the IQ peptides to flex and curve in a mutual manner.
The results of the thesis were summed up in three articles. The first article,
which describes the molecular dynamics simulation of the Mlc1p-IQ4 complex, was
recently published (Ganoth, A., E. Nachliel, R. Friedman, and M. Gutman. 2006.
Molecular dynamics study of a calmodulin-like protein with an IQ peptide:
Spontaneous refolding of the protein around the peptide. Proteins 64:133-146). The
second paper presents a comprehensive comparison between the simulations of the
Mlc1p-IQ4 and the Mlc1p-IQ2 complexes, including an energetic analysis and a
reevaluation of the structure of the light chain-binding domain of myosin V (Ganoth,
A., R. Friedman, E. Nachliel, and M. Gutman. 2006. A molecular dynamics study and
XI
free energy analysis of complexes between the Mlc1p protein and two IQ motif
peptides. Biophys J 91:2436-2450 (Cover article)); A third paper, dealing with
simulations of the free IQ peptides, is in preparation.
The current research is a novel MD study of IQ peptides in a complex with a
CaM-like protein and at its absence. In view of the major role taken by the IQ
peptides in mechano-chemical processes, the thesis provides a comprehensive
description at atomic resolution for their interaction with CaM and CaM-like proteins.
The study, which portrays a computerized journey, starting from molecular dynamics
simulations and ending at physiological insights concerning myosin V, exemplifies
that cooperation between crystallographers and biophysicists may contribute to a
better understanding of structure-function relationship.
XII
LIST OF ABBREVIATIONS
APBS Adaptive Poisson-Boltzmann Solver
Apo-CaM Ca+2
-deprived CaM
ATP Adenosine Triphosphate
CaM Calmodulin
CL C-lobe
FIONA Fluorescence Imaging with One-Nanometer Accuracy
GTP Guanosine Triphosphate
Holo-CaM Ca+2
-bound CaM
ID Inter-domain
LCBD Light Chain-Binding Domain
LJ Lennard-Jones
MD Molecular Dynamics
MM-PBSA Molecular Mechanics - Poisson-Boltzmann Surface Area
mRNA messenger RNA
NL N-lobe
NMR Nuclear Magnetic Resonance
PB Poisson-Boltzmann
PDB Protein Data Bank
PME Particle Mesh Ewald
RMSD Root Mean Square Deviation
RMSF Root Mean Square Fluctuation
SASA Solvent Accessible Surface Area
SMFP Single Molecule Fluorescence Polarization
SPC Single Point Charge
XIII
TFP Trifluoperazine
VdW Van der Waals
VMD Visual Molecular Dynamics
1
I. INTRODUCTION
1. Biological and Biochemical Background
1.1 The Myosin Family
Within all eukaryotic cells, the generation of mechanical force is provided by
specific motor proteins necessary for performing vast array of sub-cellular tasks required
to sustain life. The motors that move along actin filaments make up a diverse protein
family, collectively referred to as myosins (1). The myosin family of molecular motors
consists of at least 20 structurally and functionally distinct classes. Myosins have been
implicated in a variety of intra-cellular functions, including cell migration and adhesion,
intra-cellular transport and localization of organelles and macromolecules, signal
transduction, and tumor suppression. The myosins constitute a large family of actin-
dependent motors found in many organisms from yeast to humans. Upon interaction with
actin filaments, they convert energy from ATP hydrolysis into mechanical force
(reviewed in (2)).
The first identified motor protein was skeletal muscle myosin, which is
responsible for generating the force for muscle contraction (3). This protein, called
myosin II (for two-headed), was also found in non-muscle cells, including protozoan
cells. Myosin II is composed of two heavy chains; each bears a globular motor domain
that includes a binding site for ATP and a domain that interacts with actin. Two light
chains wrap around the elongated neck region of each heavy chain. In the late 1970's (4),
one-headed myosin (called myosin I), was found in the fresh water amoeba
Acanthamoeba castellanii. Myosin I is considered an unconventional myosin since it
functions as a monomer and consists of very short neck domain. Since then, many other
2
myosin types were discovered. The new types of myosins include a number of one-
headed and two-headed varieties that are related to myosin I and myosin II, and the
nomenclature now reflects their approximate order of discovery (myosin III through at
least myosin XX).
The myosin tails (and the tails of motor proteins generally) have apparently
diversified during evolution to permit the proteins to dimerize with other subunits and to
interact with different cargoes. Some myosins (such as VIII and XI) have been found
only in plants, and some are exclusive to vertebrates (IX). However, most of them are
common to all eukaryotes, suggesting that myosins arose early in eukaryotic evolution
(reviewed in (5, 6)).
1.2 The Myosin V Protein
The family of myosin V is a class that differs structurally from other myosins by
having an extended neck domain and a tail domain that allows dimerization, but not the
formation of filaments due to the presence of a globular carboxy-terminus. Members of
the myosin V family have been identified in humans, mice, chickens, flies, fungi, worms,
yeast and plants. The myosin V proteins are molecular motors involved in a range of
organelle-transporting functions, including the transport of melanosomes and synaptic
vesicles in mammals and vacuoles and mRNA in yeast (reviewed in (7, 8)). Myosin V is
essential for transport of vesicles in actin-rich cortical regions of neurons, i.e. dendritic
spines and axon termini. Moreover, vesicle-associated myosins, like myosin V, may do
more than serve as mechanical feet, transporting vesicles from one location to another
along actin tracks. They may interact with membrane-associated F-actin to "stitch"
3
membranes together during membrane fusion (9), and to sort proteins from the fluid
phase into lipid rafts during the assembly of membrane protein complexes (10). Based on
phylogenetic analysis of the myosin family, myosin V evolved after myosins II and I, the
two most ancient members of the myosin super-family (1, 11).
Myosin V is composed of two identical heavy chains, each one is capable of
binding six light chains (reviewed in (12, 13)). The two heavy chains dimerize to produce
a two-headed protein consisting of three distinct domains (Fig. 1).
The first domain is the head domain (green), the second domain constitutes the
neck (red), and third domain is the tail domain (colored yellow for its proximal and
medial modules, and blue for its distal module). When bound to the actin filaments, it has
the ability to convert the energy released by ATP hydrolysis into mechanical work, i.e.
Figure 1: A cartoon diagram of myosin V. The three domains are color coded: the head
domain is green, the neck domain is red, and the tail domain is yellow/blue. The light
chains, which engulf the neck domain, are colored in gray.
4
movement (14). Successive cycles of ATP hydrolysis allow the two heads to "step" on
the actin filament towards its barbed or plus end (15).
The head domain of myosin V contains ATP and actin binding sites (16). Since
the force required for transport of cargoes is generated in the head domain, it is often
referred also as motor domain. The neck domain contains six repeating amino acid
sequences called IQ motifs (designated IQ1-IQ6), each of which binds a light chain. The
light chains have been shown to be CaM or CaM-like proteins, and their primary function
is to regulate the ATPase activity of the globular heads (17, 18). According to the lever
arm hypothesis (19, 20), the step size of myosin V motors is proportional to the length of
the neck domain, which functions as a lever. The long neck domain of myosin V gives
rise to a step size of ~ 36 nm, the largest step size thus far measured for a myosin motor.
Not surprisingly, the step size of myosin V shortens when one or more of the IQ motifs
from the neck domain is truncated (21).
It has been clearly established that myosin V motors from vertebrates are
processive, i.e. have a long duty cycle, enabling it to undergo multiple steps before
dissociating from the actin filament. This allows a single motor to transport cargo, in ~ 36
nm steps, for several micrometers along the actin filament (20). Yet, not all class V
myosins are processive. A recent kinetic analysis of myosin V from drosophila concluded
that it is not a processive motor (22). The two classes of myosin V from the yeast
Saccharomyces cerevisiae, Myo2p and Myo4p (the former is the myosin V discussed in
this research), have been reported to be low duty non-processive motors, based upon
motility and landing rate assays (23).
The tail domain can be divided into two modules: proximal/medial and distal. The
5
proximal/medial module is the site of dimerization of the two heavy chains. In this
module of the tail, alpha-helical regions of the heavy chains are predicted to interact to
form a coiled coil segment that is interrupted by small globular regions (16). A PEST site,
which has been shown to be a cleavage site for the calcium-dependent protease calpain
(14), is located in the N-terminus of the proximal/medial tail module. The proximal/distal
module of the tail domain, in addition to its primary role in dimerization, also serves as
the link between the neck domain and the cargo-binding module of the tail. The distal
module of the tail domain is the cargo-binding domain (24). When bound to vesicles by
the distal tail module, myosin V becomes part of a large proteinous complex that includes
the membrane receptor.
1.3 The Mlc1p Protein
The Mlc1p protein is a CaM-like protein from the budding yeast Saccharomyces
cerevisiae, associated with the yeast's mechano-chemical myosin system. It was
originally identified as a light chain protein of the Myo2p protein, which is a myosin type
V (25). The Mlc1p protein is essential for viability and secretory vesicle delivery at the
mother bud neck domain during cytokinesis due to its ability to bind to the IQ motifs of
the Myo2p protein (26). While calcium binding to CaM promotes its binding/release
to/from the IQ motifs, the Mlc1p protein is unable to bind calcium since its EF-hand
motifs are abortive (27, 28). The signal that stimulates its interaction with target IQ
motifs is still unknown, but recently it was found that its interaction with the Myo2p
protein is mediated by GTP (29). Though also the exact function of the Mlc1p protein has
not been clarified yet, it is assumed that its binding to the neck domain of the Myo2p
6
protein has a stabilizing effect on the latter (25). The three-dimensional structure of the
Mlc1p was determined when it was co-crystallized with the IQ2, IQ2/3 and IQ4 peptides
of the Myo2p protein (28). The Mlc1p protein shares a structural similarity with CaM;
both possess a dumbbell-shaped conformation, consisting N- and C- globular lobes
(referred, at the thesis, as NL for the N-lobe and CL for the C-lobe) that are connected by
a flexible inter-domain (referred, at the thesis, as ID).
1.4 IQ Motif Peptides
The IQ motifs constitute common target peptides for CaM and CaM-like proteins.
These motifs are widely distributed among different kinds of proteins, contributing >3700
Pfam entries in the database (30). The motifs are ~ 25 amino acids long, alpha-helical,
and carry a net positive charge. They conform to the general consensus sequence
IQXXXRGXXXXR, but in many cases, the sequence is rather loosely adhered to this
consensus. For example, the isoleucine in the first position is frequently replaced by other
branched-chain amino acids (leucine or valine) or, rarely, by a methionine. The arginines
in both the sixth and the terminal positions are sometimes replaced by lysine or histidine,
and the seventh position glycine is poorly conserved. Despite the lack of strict
conservation, there is no doubt that this sequence is a recognizable protein motif that
binds CaM and CaM-related proteins, such as Troponin C and myosin light chains.
IQ motifs are present in a number of neuronal growth proteins, sodium and
calcium voltage-dependent channels, EF-hand-containing protein phosphatases, spindle
pole and centrosomal proteins, transient receptor potential proteins, plant cyclic
nucleotide-regulated channels, certain signaling molecules, and at the neck region of the
7
myosins. In most cases, the IQ motifs appear in tandem repeats, and therefore enable to
bind multiple copies of CaM and CaM-like proteins. IQ motifs were first identified as
Apo-CaM (Ca+2
-deprived CaM) binding sites; however, it is now clear that IQ motifs can
show different levels of Ca+2
dependency (31-33).
1.5 The Calmodulin Protein
The Calmodulin (CaM) protein is recognized as a prototypical calcium sensor and
regulation orchestrator of cellular events through its interaction with a diverse group of
proteins. The CaM protein is a small (16.7 kDa), highly acidic, ubiquitous, evolutionary
conserved protein. It is considered as the major calcium sensor and is expressed in all
eukaryotic cells, where it participates in signaling pathways that regulate crucial
processes such as growth, proliferation, and movement. The concentration and location of
CaM appear to play an important role in regulating its biological activity. CaM
constitutes at least 0.1% of the total protein present in cells (10-6
M – 10-5
M) and is
expressed at even higher levels in rapidly growing cells, especially those undergoing cell
division and differentiation. The local intra-cellular availability of CaM is likely to be
biologically significant because various CaM-dependent proteins are regulated over a
wide range of CaM concentrations. The dynamics of CaM and its interactions with target
proteins had been extensively studied (reviewed in (34, 35)).
CaM and CaM-like proteins belong to a family of soluble proteins that share a
common structure. These proteins are built of three structural domains: the NL, the CL,
and an elongated, mostly helical, ID that connects the two lobes to form a dumbbell-like
shape. Each lobe possesses two Ca+2
-binding domains, each of which is made of two
8
helix-loop-helix EF-hand motifs. These proteins are present at two configurations: a
dumbbell extended state, where the ID is stretched; and a folded configuration, where the
ID bends, bringing the lobes into close contact. The CaM and CaM-like proteins can form
protein-protein complexes with specific target peptides, a reaction that is a key event in
many intra-cellular regulatory processes. On the binding of Ca+2
and in the presence of
target peptides, the CaM undergoes a conformational change that enables it to bind its
target. This basic mechanism activates well over 100 distinct target proteins. However, in
some cases (as at the Mlc1p protein), the reaction with target peptides can be mediated by
the Apo-CaM. The promiscuity of CaM on one hand, and binding specificity on the other
hand, made it one of the most studied proteins (reviewed in (36-38)).
1.6 Calmodulin-Peptide Complexes
Investigations of the CaM protein in a complex with different bound peptides
have been carried out by genetic methods, such as site-directed mutagenesis of the
protein (39-41), or the target peptide (42-44), and their thermodynamical properties were
explored (45, 46). In addition to these studies, the structure of the protein-peptide
complexes was investigated by NMR (47), and crystallographic studies (48-54). These
studies revealed that, upon binding of the peptides, the ID of Holo-CaM (Ca+2
-bound
CaM) adopts a bent conformation accompanied by its partial unwinding. The bending of
the ID brings together the two lobes of the protein. Thus, the flexibility of the ID region is
critical for the ability of CaM to interact with target peptides. The various aspects of these
interactions, such as recognition and activation, are reviewed by Vetter et al. (55).
9
1.7 Mlc1p-IQ Complexes
The structure of the Mlc1p protein was solved by X-ray crystallography in
complexes with the IQ2, IQ2/3, and IQ4 peptides (28, 56). These Mlc1p-IQ protein-
peptide complexes were formed and crystallized in the absence of Ca+2
ions. According
to these structures, when bound to the IQ2 peptide, the Mlc1p protein adopts a compact
conformation in which both the NL and CL interact with the IQ motif. However, in the
complex with the IQ4 peptide, the NL of the protein does not interact with the IQ motif,
resulting in an extended conformation of the protein. Based on these crystal structures,
combined with comparative sequence analysis, a model for the entire poly-IQ (IQ1-IQ6)
sequence of the neck domain of myosin V, together with six light chain proteins, was
built (57). The model of the Light Chain-Binding Domain (LCBD) of myosin V has
important implications for the understanding of the structure-function relationship of the
lever arm of myosin V. In the present study, the crystal structures of the Mlc1p-IQ4,
Mlc1p-IQ2 and the model of the LCBD are discussed, analyzed, and reevaluated.
2. Computer Simulations
Dynamic simulation methods are widely used to obtain information on the time
evolution of conformations of proteins and other biological macromolecules (58-61), as
well as kinetic and thermodynamic information. Simulations can provide fine details
concerning the motions of individual particles as a function of time. They can be utilized
to quantify the properties of a system at a precision and on a time scale that is otherwise
inaccessible, providing a valuable tool in extending our understanding of model systems.
Theoretical computational consideration of a system additionally allows one to
10
investigate specific contributions of property through “computational alchemy” (62), that
is, modifying the system in a way that is nonphysical but nonetheless allows a model’s
characteristics to be probed.
2.1 Molecular Dynamics
MD simulations have become valuable tools for investigating the basis of protein
structure and function. Given the structure of a biomolecular system, that is, the relative
coordinates of the constituent atoms, MD is used to investigate and study the dynamics of
that system. A brief outline of its history and methodology is offered below.
2.1.1 General Overview
The conformational dynamics of proteins, which is encoded in their structures, is a
critical element of their function. A fundamental appreciation for how proteins work
requires an understanding of the connection between three-dimensional
structures,
obtained by X-ray crystallography and NMR, and dynamics, which is much more difficult
to probe experimentally. Molecular Dynamics (MD) simulations provide links
between
structure and dynamics by enabling the exploration of
the conformational energy
landscape accessible to protein molecules (59, 63, 64). The first MD simulation of a
protein was reported in the year 1977 and consisted of a 9.2-ps trajectory for
a small
protein in vacuum (65). Eleven years later, a 210-ps simulation of the same protein in
water was reported (66), and the phenomenal increase in computing power since then
makes it routine to run simulations of much larger proteins for 1000-10000 times longer
than the original simulation (tenths of nanoseconds), in which the protein is surrounded
11
by water and salt. Significant improvements in the potential
functions have also been
achieved, making the simulations much more stable and accurate (67).
Biomolecular dynamics simulations find two major areas of application today.
Firstly, MD simulations are used to bring biomolecular structures alive, giving insights
into the natural dynamics on different timescales of biomolecules in solution. Secondly,
MD simulations can explore which conformations of a molecule or a complex are
energetically accessible, by searching the conformational space. Driven by improvements
in simulation methodology, increasing accuracy of biomolecular force fields and the
ever-increasing power of computers, MD simulations is a rapidly progressing scientific
discipline. Nowadays, it is well established that MD is not a mere historical phenomenon,
but an important fundamental developing field of science (68-71).
2.1.2 Methodology
We briefly sum up the methodology of MD, but for a comprehensive inclusive
literature the reader is referred to (72, 73). MD simulations calculate the "real" dynamics
of the system, from which time averages of properties can be calculated. Sets of atomic
positions are derived in sequence by applying Newton's equations of motion. MD is a
deterministic method, i.e. the state of the system at any future time can be predicted from
its current state. The first MD simulations were performed using very simple potentials
such as the hard-sphere potential. The behavior of the particles in this potential is similar
to that of billiard balls: the particles move in straight lines at a constant velocity between
collisions. The collisions are perfectly elastic and occur when the separation between a
pair of spheres equals the sum of their radii. After a collision, the new velocities of the
12
colliding spheres are calculated using the principle of conservation of linear momentum.
The hard-sphere model has provided many useful results but is obviously not ideal for
simulating atomic or molecular systems. In potentials such as the Lennard-Jones (LJ)
potential, the force between two atoms or molecules changes continuously with their
separation. The continues nature of more realistic potentials requires the equations of
motion to be integrated by breaking the calculation into a series of very short time steps
(in our simulations the time step was usually 2-fs). At each step, the forces operating on
the atoms are computed and combined with the current positions and velocities to
generate new positions and velocities. The force acting on each atom is assumed to be
constant during the time interval. The atoms are then moved to the new positions; an
updated set of forces is computed, and so on. In this way, an MD simulation generates a
trajectory that specifies how the positions and velocities of the particles in a system
change with time.
In biomolecular simulations, such as presented in this study, successive
configurations of the system are produced by iterative numerical calculations of
instantaneous forces present in a molecular mechanical system and the consequential
movements in that system. The molecular mechanical system consists of a set of particles
that move in response to their interactions according to the equations of motion defined in
classical Newtonian laws of motion. Newton's laws of motion can be stated as follows:
(I) A body continues to move in a straight line at a constant velocity unless a force acts
upon it.
(II) Force equals the rate of change of momentum.
(III) To every action there is an equal and opposite reaction.
13
The trajectory is obtained by solving the differential equation embodied in
Newton's second law (F = m·a):
(1)
Equation 1 describes the motion of a particle of mass (mi) along one coordinate
(xi) with (Fxi) being the force on the particle in that direction.
2.1.3 Molecular Dynamics of Calmodulin
The unique structure of CaM and its fundamental role in Ca+2
-signaling processes
has also drawn the attention of computational scientists besides the experimental
biologists and biochemists. Early MD simulations of Holo-CaM confirmed the flexibility
of its ID (74-76). However, due to limited computational power available at that time,
these simulations were carried out over sub nanosecond time scales and included
relatively few, if any, water molecules. In the recent decade, MD simulations of CaM
were extensively performed under more realistic conditions, addressing issues concerning
the relative positions of its NL and CL and the nature of its flexible ID. In the year 1996,
Van Der Spoel et al. (77) performed an MD simulation of the ID of Holo-CaM, and
observed its bending. Since this pioneering MD simulation of CaM in an aqueous
solution, simulations of Apo- (78, 79) and Holo- CaM (79-87) have been performed for
durations ranging from hundreds of picoseconds to 20-ns. These MD studies
demonstrated the unwinding and curvature of the ID of the protein. Moreover, it was
found out that, although bound Ca+2
ions harden the structure of the protein, it collapses
i
Xi
m
F
dt
xdi=
2
2
14
to a form resembling a dumbbell with a hinged bar connecting its lobes.
2.1.4 Molecular Dynamics of Calmodulin's Complexes
Only a few MD simulations of CaM together with target molecules or peptides
had been performed. An MD simulation of CaM-TFP complex was carried out,
comparing the binding affinities of the NL and CL of Holo-CaM to TFP (88). The
binding dynamics of melatonin to Holo-CaM were examined by means of MD after
molecular docking (89). The dynamics and entropy of a Holo-CaM complexed with a
peptide were studied by NMR and MD. A good agreement was found between amide
order parameters measured by NMR, and those obtained from the simulation (90).
Finally, an MD simulation of a complex between Holo-CaM and a peptide examined the
structure, dynamics and the interaction mode between the protein and the peptide. In a 4-
ns MD simulation, the CaM-peptide complex was quite rigid and did not exhibit any
large amplitude domain motions (91, 92).
2.2 Free Energy
The calculation of free energy from molecular simulations is an area of intense
research activity (reviewed in (93)). This is because free energy is at once one of the most
central and one of the most difficult thermodynamic quantities to compute from atomic
level simulations. It is of paramount importance in efforts to relate microscopic details
stemming from atomic interactions to measurable macroscopic quantities, and to
understand the physical and structural basis of biological phenomena. In particular, free
energy of interaction is a measure of the stability of a complex, a measure that is
15
fundamental to all studies of biomolecular binding processes.
2.2.1 Free Energy of Interaction
The interactions between proteins and other molecules are critical to all biological
systems and processes. Signal transduction, metabolic regulation, enzyme cooperativity,
physiological response, and other processes are all dependent upon noncovalent binding.
These processes may be investigated through modeling and computer simulations. At this
study, we estimate interaction free energy for the protein-peptide complexes through MD
simulations and complementary computation techniques. Approaches available for
estimating interaction free energies cover a broad range of accuracies and computational
requirements. Given that the purpose of this research is not to carry a comparative study
of interaction free energy calculations' techniques, but rather use the interaction free
energy to characterize the protein-peptide complexes, we chose to calculate the
interaction free energy in a nonrigorous technique, called the MM-PBSA approach.
2.2.2 The MM-PBSA Approach
Molecular Mechanics Poisson-Boltzmann Surface Area (MM-PBSA) is basically
a post-processing method to evaluate the standard free energies of molecules or the
interaction free energies of molecular complexes in a relatively computationally efficient
manner. The MM-PBSA approach was developed by Srinivasan and co-workers in 1998
(94). Since then, it has been widely used for estimation of free energies of different RNA
(94), DNA (95, 96) and protein conformations (97), binding affinities of protein
complexes and mutational analysis on them (98-102), binding affinities of small
16
compound-protein complexes (103, 104), interaction free energies of RNA-protein (105),
RNA and metal ions (106), and RNA-ligand (107) complexes. The strategy of
calculations is based on applying of a continuum model to solute configurations derived
from an MD simulation in explicit solvent. For each selected solute configuration,
molecular mechanics energy is determined. Free energies of solvation are estimated by
using Poisson-Boltzmann (PB) calculations for the electrostatic contribution and a
surface-area-dependent term for the nonelectrostatic contribution to solvation. Solute
entropic contributions are estimated from a normal mode analysis. To get a statistically
meaningful value of the interaction free energy of a complex, calculations are commonly
carried out on several snapshots extracted from an MD trajectory with explicit solvent.
3. Significance of Study
The general significance of this research lies in the fact that it demonstrates that
careful application of MD can be used for evaluation of proteinous structures, offering
close analysis as well as deeper insights of crystalline configurations. We have employed
a multi-stage approach in order to investigate the Mlc1p-IQ complexes. For this purpose
we performed MD simulations of the Mlc1p-IQ2 complex, the Mlc1p-IQ4 complex, and
the free IQ2 and IQ4 peptides at various conditions. Our MD simulations enable to
propose solution conformations of protein complexes, which were not offered by X-ray
crystallography studies. Besides grossly expanding the structural data embedded in the
crystalline configurations of these protein complexes, our study provides detailed analysis
of the protein-peptide interaction energy, and atomistic understanding about the
movement mode of myosin V over the actin filament. Thus, this research, which starts
17
from MD simulations and ends at an enlightment of the dynamic aspects of a
physiological phenomenon, illustrates the biophyiological relevance of the MD
methodology.
18
II. METHODS
1. MD Simulations
1.1 Simulations of Protein-Peptide Complexes
1.1.1 Mlc1p-IQ2 Complex
The MD simulations were performed using the GROMACS 3.2.1 package of
programs (108-110), with the GROMOS96 43a1 force field (111). The crystal structure
of the Mlc1p protein bound to an IQ2 peptide of the Myo2p protein (PDB file 1M45),
determined by X-ray crystallography at 1.65 Å (56), was downloaded from the Protein
Data Bank (112). Four missing residues (D-50, S-51, R-54, D-55) were added to the
structure using the PROFIX program, which is incorporated in the JACKAL molecular
modeling package (113). The protein-peptide complex was embedded in a box containing
the SPC water model (114), that extended to at least 12 Å between the protein-peptide
structure and the edge of the box. Assuming normal charge states of ionizable groups
corresponding to pH 7, the net charge of the Mlc1p-IQ2 structure is -7e. Hence, 35
sodium and 28 chloride ions were added to the simulation box at random positions, to
neutralize the system at a physiological salt concentration of ~ 100 mM. Prior to the
dynamics simulation, internal constraints were relaxed by energy minimization.
Following the minimization, an MD equilibration run was performed under position
restraints for 40-ps. Then, an unrestrained MD run was initiated. The first 100-ps of the
run were treated as a further equilibration simulation, and the remainder 12-ns were saved
and used for the analysis. During the MD run, the LINCS algorithm (115) was used in
order to constrain the lengths of all bonds; the waters were restrained using the SETTLE
algorithm (116). The time step for the simulation was 2-fs. The simulation was run under
19
NPT conditions, using Berendsen's coupling algorithm for keeping the temperature and
the pressure constant (117) (P = 1 bar; τP = 0.5 ps; τT = 0.1 ps; T = 300 K). Van der
Waals (VdW) forces were treated using a cutoff of 12 Å. Long-range electrostatic forces
were treated using the PME method (118). The coordinates were saved every 1-ps.
1.1.2 Mlc1p-IQ4 Complex
The simulations' conditions for the Mlc1p-IQ4 protein-peptide complex were the
same as those for the Mlc1p-IQ2 protein-peptide complex. The calculations were carried
out using the crystal structure of the Mlc1p protein bound to an IQ4 peptide of the Myo2p
protein (PDB code 1M46) determined by X-ray crystallography at 2.1 Å (56), that was
downloaded from the Protein Data Bank (112). The net charge of the complex is -3e, and
hence 43 sodium and 40 chloride ions were added in random positions to neutralize the
system at a physiological salt concentration of ~ 100 mM. The simulations were
performed at two temperatures, 300 K and 400 K. The simulations' conditions were
similar at both temperatures, except for the time step and τP that, at the 400 K, were 1.5-fs
and 2-ps, respectively.
1.2 Simulations of Peptides
1.2.1 IQ2 Peptide
The coordinates for the IQ2 peptide were derived from the crystal structure of the
Mlc1p-IQ2 protein-peptide complex. The net charge of the IQ2 peptide is +2e, and hence
13 sodium and 15 chloride ions were added in random positions to neutralize the system
at a physiological salt concentration of ~ 100 mM. The conditions of the simulation were
20
similar to those described for the Mlc1p-IQ2 complex.
1.2.2 IQ4 Peptide
The coordinates for the IQ4 peptide were derived from the crystal structure of the
Mlc1p-IQ4 protein-peptide complex. MD simulations were performed at various
conditions as elaborated on Table 1. The net charge of the IQ4 peptide is +6e, whereas
the number of ions that were added at the simulations varied according to the specific
conditions of each simulation. The conditions of the simulations were similar to those
described for the Mlc1p-IQ4 complex.
Table 1
Summary of the MD Simulations of the IQ Peptides
* High temperature (400 K). †
Low salt concentration. ‡
High salt concentration. ¶ Very high salt concentration.
§ An IQ4 peptide containing 10 additional amino acids flanking each of its terminals was
built using Swiss PDB Viewer (119). These added 20 residues belong to the IQ3 and IQ5
peptides. Thus, the 45 amino acids elongated IQ4 peptide constitutes a portion of the
poly-IQ sequence as present at the neck of the myosin.
2. Visual Presentations
All protein and peptides figures were created using the VMD computer program
(120).
Duration (ns)
Concentration
of Salt
Number of
added Na+
ions
Number of
added Cl-
ions
IQ4 peptide 100 ~ 100 mM 12 18
IQ4 peptide * 30 ~ 100 mM 12 18
IQ4 peptide † 20 ~ 30 mM 4 10
IQ4 peptide ‡ 20 ~ 300 mM 36 42
IQ4 peptide ¶ 20 ~ 2.4 M 288 294
Extended IQ4 peptide § 20 ~ 100 mM 10 17
21
3. Inter-Helical Angles
For the protein-peptide complexes, inter-helical angles were calculated according
to the following procedure: The secondary structure of the protein and the peptide were
determined using the STRIDE algorithm (121). Three residues were selected at the N-
and C- terminals of each alpha helix. The C-alpha atoms' center of mass of each triplet of
residues was determined. A vector-defining-helix was drawn between these centers of
mass along each helix. The inter-helical angle was calculated between each two
successive vectors. For the IQ peptides, intra-helical angle was calculated according to
the same procedure.
4. Dihedral Angle Calculations
The position of the protein’s lobes towards each other can be expressed by
measuring the dihedral angle between the planes defined by the two lobes and the ID.
Each plane was defined by the straight section of the ID and a selected representative
residue located at each lobe. The C-alpha atoms of residues N-47, L-58, and V-69 defined
one plane; whereas the C-alpha atoms of residues L-58, V-69, and E-129 defined the
other. The calculation of the dihedral angles was performed for the last snapshot of the
simulations at t = 12-ns using a standard GROMACS utility.
5. The Electrostatic Potential Around the Peptides
The electrostatic potential around the IQ peptides was calculated for the model
structures of the peptides as derived from the Mlc1p-IQ protein-peptide simulations. The
coordinates of 21 snapshot structures, extracted every 100-ps from t = 10 until t = 12-ns,
22
were used for the electrostatic potentials calculations. The electrostatic potential surface
around each peptide was calculated by solving the nonlinear Poisson-Boltzmann (PB)
equation through the use of the APBS (Adaptive Poisson-Boltzmann Solver) software
package (122) with a grid spacing of 0.5 Å. The volumes of the averaged positive and
negative electrostatic fields of both peptides (for the time frame t = 10 until t = 12-ns) and
the Coulomb cages at the last snapshot (t = 12-ns) of the simulations are presented.
6. The Protein-Peptide Interaction Free Energies
The general strategy used for calculating the protein-peptide interaction free
energy is based on the MM-PBSA method. This method was successfully employed by
numerous studies (94-107), and involves calculating energies for snapshot configurations
taken from the MD trajectories of the Mlc1p-IQ complexes. The configurations of the
protein-peptide complexes, the protein and the peptides were obtained from the MD
simulations of the Mlc1p-IQ2 and the Mlc1p-IQ4 structures. The coordinates of 21
snapshot model structures, extracted at 100-ps intervals during the last 2-ns of the
simulations, where both complexes appeared to gain a stable configuration, were used for
the calculations. The calculations, which were performed for each of these snapshots and
their average values are presented, were intended for estimation of the protein-peptide
free energy interaction.
The changes in the Gibbs free energy of interaction were calculated from the
atomic model structures of the protein and the peptide undergoing the binding to form the
protein-peptide complex. Thus, the free energy of interaction was defined as presented in
equation 2:
23
(2) ∆Ginteraction = (Gcomplex) − (Gprotein) − (Gpeptide)
The calculations of the free energy of each molecule were carried out according to
equation 3:
(3) TSGGE)(G solvation nonpolar,solvation polar,MMmolecule −⟩⟨+⟩⟨+⟩⟨=
where the free energy was decomposed into molecular mechanics (⟨EMM⟩), polar
solvation (⟨Gpolar,solvation⟩), nonpolar solvation (⟨Gnonpolar,solvation⟩) and entropy (TS)
contributions. ⟨ ⟩ denotes an average over a set a snapshots along an MD trajectory. Each
term on the right side of the equation was calculated as detailed below.
6.1 Molecular Mechanics Calculations
The molecular mechanics contribution to the free energy of interaction energy
was calculated according to equation 4:
(4) ⟩⟨+⟩⟨+⟩⟨=⟩⟨ VdWticelectrostaintMM EEEE
(⟨Eint⟩) includes bond, angle, and torsional angle energies, while (⟨Eelectrostatic⟩) and
(⟨EVdW⟩) denote the intra-molecular electrostatic and VdW energies. (⟨Eelectrostatic⟩) was
calculated using the APBS software package (122). (⟨EVdW⟩) was calculated using a
standard GROMACS utility.
6.2 Polar Solvation Calculations
The electrostatic contribution to the solvation energy, (⟨Gpolar,solvation⟩), was
determined by using a continuum electrostatic with the Poisson-Boltzmann (PB)
approach (123). We used the APBS software package (122), with a grid spacing of 0.5 Å
and solution of 100 mM NaCl, for the numerical solution of the nonlinear PB equation.
24
The term (⟨Gpolar,solvation⟩) refers to the energy associated with the transfer of the solute
from a continuum medium with a low dielectric constant (ε = 4) to a continuum medium
with the dielectric constant of water (ε = 78.4).
Crucial to the application of PB models, and a source of many scientific disputes,
is the so-called macromolecule dielectric constant, ε. It is generally accepted that, in a
continuum electrostatics approach, the dielectric constant of the solute is a scaling factor
that represents all the contributions that are not treated explicitly, rather than a true
dielectric constant (124, 125). Different protein-associated dielectric constants are
frequently used in the literature, and we chose to perform the calculation with a dielectric
constant of 4 as commonly employed (123, 126-128). To make sure that the conclusions
derived from our calculations are not dependent upon the choice of the value used, we
repeated the calculation for representative snapshots with a lower (ε = 2) and a higher (ε
= 8) dielectric constants. Comparison between the results indicated that, although the
value of the calculated (⟨Gpolar,solvation⟩) varies with ε, its trend is independent from the
dielectric constant used.
6.3 Nonpolar Solvation Calculations
The nonpolar contribution to the solvation free energy, (⟨Gnonpolar,solvation⟩), was
determined by using the Solvent Accessible Surface Area (SASA). The SASA was
calculated by a standard GROMACS utility, which implements the double cube lattice
method (129) with a probe radius of 1.4 Å. The nonpolar solvation energy was described
as Gnonpolar,solvation= γ·(SASA) + β. The constants γ and β are 2.2 kJ mol-1
nm-2
and 3.84 kJ
mol-1
, respectively. These values of γ and β are in accord with the MM-PBSA approach
25
(101, 103-105, 107).
6.4 Entropy Calculations
Entropy changes upon binding of the peptide were calculated by use
of normal mode analysis. The conformations of the protein, the peptide and
the protein-peptide complex were extracted from the last frame of each of the trajectories
as performed in (101). The model structures were subjected to rigorous energy
minimization, until the maximal force operating on an atom was less than 10-6
kJ mol-1
nm-1
. Normal mode analysis (130-132) was performed by calculating and diagonalizing
the mass-weighted Hessian matrix. The frequency of the normal mode was then used to
calculate the vibration entropy (133) as given by equation 5:
(5)
where Svib is the vibrational entropy, h is Planck's constant, ν0 is the frequency of the
normal mode, k is the Boltzmann constant, T is the absolute temperature and NA is
Avogadro's number. All calculations were performed with the GROMACS program,
compiled with double precision.
)T(
hvN)(R
vibee
eS /kThv
/kThv
A/kThv
0
0
0
11ln
0
−
−
−
−+−−=
26
III. RESULTS & DISCUSSION
1. Simulations of Protein-Peptide Complexes
In this chapter of the thesis, we describe an investigation, through MD
simulations, of the dynamics of two Mlc1p-IQ complexes: the Mlc1p-IQ2 and the Mlc1p-
IQ4 complexes.
1.1 Mlc1p-IQ2 Complex
The Mlc1p-IQ2 complex, which had been resolved by crystallography to 1.65 Å,
confers to a Ca+2
-independent stable structure. Its 12-ns MD simulation is presented.
1.1.1 Crystallographic and Simulated Structures
The crystallographic structures and those obtained by the simulations of the
Mlc1p protein with the IQ2 peptides are presented in Fig. 1.1. The NL, the ID, the CL
and the IQ peptides are colored in blue, red, green and yellow, respectively.
At the crystalline structure of the Mlc1p-IQ2 complex (Fig. 1.1A), the Mlc1p
protein is found at a compact state as evident by its curved ID. In this configuration, the
CL of the protein engulfs the IQ2 peptide, which interacts also with the NL and the ID of
the protein. Overall, the configurations of the Mlc1p protein and the IQ2 peptide only
slightly change throughout the 12-ns long simulation, and hence their final simulated
state structures (Fig. 1.1B) resemble the crystalline ones. Thus, the simulated structure of
the Mlc1p-IQ2 complex exhibits just a few minor conformational deformations compared
to its crystalline structure. These deformations consist of appearance of a new kink
located at helix D of the protein's ID (Fig. 1.1B, see arrow), a consequent rotation of the
27
NL, and minor changes of the inter-helical angles observed mainly at its CL, as
elaborated below.
1.1.2 The Dynamics of the Protein-Peptide Complexes
A quantitative expression of the conformational change is given by the RMSD
(Root Mean Square Deviation) of the backbone atoms of the protein-peptide complexes.
Fig. 1.2A depicts the RMSD values as calculated for the whole Mlc1p-IQ2 complex
(black), and its components: the Mlc1p protein (red), and the IQ2 peptide (green). The
RMSD of the protein-peptide complex exhibits some structural fluctuations that can be
fully attributed to the Mlc1p protein. It increases until ~ 0.35 nm, stays around this value
for ~ 4-ns, decreases for a short while and then stabilizes at ~ 0.28 nm. The RMSD track
Figure 1.1: Cartoon diagrams of the crystal and the simulated structures of the Mlc1p
protein when it binds the IQ2 peptide (PDB 1M45). The NL (residues 1−59), the ID
(residues 60−92), the CL (residues (93−148), and the IQ peptide are shown in blue, red,
green and yellow, respectively. Both crystal structures, and both simulated solution
structures, are presented with the same orientation, where the NLs are structurally
aligned. (A) The crystal structure of the Mlc1p-IQ2 complex; (B) The simulated
structure of the Mlc1p-IQ2 complex after 12-ns simulation. The arrow points towards a
kink discussed at the text.
B A
C-lobe
N-lobe Inter-domain
28
of the IQ2 peptide, which contributes little to the RMSD of the complex, exhibits a
different pattern. It appears to explore the configurational space until ~ 3.2-ns, and then
increases to a value of ~ 0.23 nm. From this time point until the end of the simulation, the
RMSD of the IQ2 peptide is relatively stable.
The fluctuations of the Mlc1p protein can be resolved into a contribution of its
structural domains. Accordingly, the RMSD of its NL (black), ID (red), and CL (green)
are presented in Fig. 1.2B.
The RMSD of the NL exhibits a sharp increase at ~ 5-ns, and then stabilizes at a
value of ~ 0.24 nm. The RMSD of the ID, which is the flexible domain of the Mlc1p
protein, hardly changes. The stability of the ID throughout the simulation time is not
surprising since its structure is already bent and curved at the Mlc1p-IQ2 protein-peptide
A B
Figure 1.2: (A) The RMSD of the backbone atoms of the Mlc1p-IQ2 complex (black), the
Mlc1p protein (red), and the IQ2 peptide (green) as a function of the simulation time; (B)
The RMSD of the backbone atoms of the different domains of the Mlc1p protein as a
function of the simulation time. The domains of the protein are defined as follows: residues
1−59 for the NL (black), 60−92 for the ID (red), and 93−148 for the CL (green).
29
crystalline structure. Apparently, the CL is much more flexible than the other structural
domains, exhibiting the largest variations in its RMSD value. Inspection of the Mlc1p-
IQ2 model structure reveals that most of its eight alpha helices retain their structures and
the angles between them, except the angles found at the CL. The angle between its G and
H helices increases from ~ 100° at the beginning of the simulation to ~ 115° at the end of
it. The angle between its E and F helices changes from ~ 110° to ~ 123° during the time
frame of 2.7 until 5.5-ns, but settles back at its original value. The fluctuations of the
RMSD of the CL are due to these inter-helical motions.
1.1.3 Summary
Overall, the configurations of the Mlc1p protein and the IQ2 peptide barely
change throughout the 12-ns long simulation. This relative stability of the complex
inspired us to perform MD simulation for another complex, the Mlc1p-IQ4 complex,
which is composed of the same protein and a slightly different peptide.
30
1.2 Mlc1p-IQ4 Complex
The Mlc1p-IQ4 complex, which had been resolved by crystallography to 2.1 Å,
confers to a Ca+2
-independent stable structure. During its MD simulations, the complex
undergoes a complicated modulation process, which involves bending of the angles
between the alpha-helices of the protein, breaking of the alpha-helical structure of the
IQ4 peptide into two sections, and formation of new contact points between the protein
and the peptide. The dynamics of the process consist of fast sub picosecond events and
much slower ones that take a few nanoseconds to completion. Our study expands the
information embedded in the crystal structure of the Mlc1p-IQ4 complex by describing
its dynamic behavior as it evolves from the crystal structure to a form stable in solution.
1.2.1 Overall Conformational Changes During the Simulation
Fig. 1.3 depicts the structure of the Mlc1p-IQ4 complex as present in the crystal
structure (Fig. 1.3A), and after 12-ns of simulation at 300 K (Fig. 1.3B). The structures
were aligned over their NLs, to present them from the same orientation. In its crystalline
configuration, the Mlc1p-IQ4 complex, (PDB file 1M46), has an extended structure,
where the IQ4 peptide is mainly bound to the CL of the Mlc1p protein (for details see
Fig. 1.3 below). The ID (residues 60−92, colored red) that connects the NL (1−59,
colored blue) with the CL (residues 93−148, colored green) is a straight alpha helix. The
IQ4 peptide (colored in yellow) also keeps a straight alpha helix configuration,
perpendicular to the long axis of the ID. It forms few contacts with the CL and the ID of
the Mlc1p protein. At the end of the 12- ns simulation, the complex reached a new
conformation. The most stable part of the Mlc1p-IQ4 complex is the NL, which even
31
after 12-ns retained most of its original conformation and its contact points with the N-
terminal section of the ID. The ID had refolded, so that its two helices (helices D and E),
became perpendicular each to the other and are in contact with the NL and CL. The CL
itself rotated and refolded into a shape that engulfs the IQ4 peptide, snapping its straight
helix into two, almost perpendicular sections. In the final configuration, the number of
contact points between the peptide and protein increased and the three sections of the
protein are in contact with the peptide.
The modulation of the protein and peptide with time can be followed by
examination of the backbone RMSD relative to the crystal structure. Fig. 1.4A depicts the
RMSD values as calculated for the whole complex (black), and its components: the
A
C-lobe
N-lobe
Inter-domain
B
Figure 1.3: Cartoon diagram of the crystal and the simulated structures of the Mlc1p
protein bound to the IQ4 peptide (PDB 1M46). The N-lobe, the inter-domain, the C-lobe,
and the IQ4 peptide are shown in blue, red, green, and yellow, respectively. The domains
of the protein are defined as residues 1–59, NL; 60–92, ID; and 93–148, CL. (A): The
crystal structure; (B): The simulated structure at 300 K after the 12-ns simulation in the
presence of water and ions. Both structures are presented with the same orientation.
32
Mlc1p protein (red) and the IQ4 peptide (green).
Some 3.2-ns after the initiation of the simulation, the RMSD trace reveals a major
structural modulation. The RMSD value of the entire Mlc1p-IQ4 complex and that of the
Mlc1p protein almost double (from ~ 0.4 to ~ 0.7 nm). The rise continues in a moderate
fashion till the end of the simulation. In contrast to the complex and the protein, the
RMSD calculated for the IQ4 peptide (green line) exhibits different dynamics. The trace
A B
C D
Figure 1.4: (A) The RMSD of the backbone atoms of the Mlc1p-IQ4 complex (black), the
Mlc1p protein (red), and the IQ4 peptide (green) as a function of the simulation time; (B)
The RMSD of the backbone atoms of the different domains of the Mlc1p protein as a
function of the simulation time. The domains of the protein are defined as residues 1−59
for the NL (black), 60−92 for the ID (red), and 93−148 for the CL (green); (C) Expansion
of frame B between 2- and 3.5- ns; (D) The RMSD of the backbone atoms of the last two
ns of the simulation (10−12 ns) relative to the position of the same set of atoms at t = 10-
ns.
33
reveals a reversible increment at ~ 1.75-ns, followed by a minor response to the structural
change of the Mlc1p protein at 3.2-ns. A short time later, the peptide undergoes some
structural modification, which increases its RMSD to ~ 0.4 nm. From this time point on,
the RMSD of the peptide is practically constant. The fact that the RMSD values of the
peptide hardly reflect the transition detected in the RMSD of the Mlc1p protein, implies
that the structural changes are driven by the Mlc1p protein itself, as evident in Fig. 1.4B.
Fig. 1.4B presents the RMSD values of the structural elements of the protein: the
NL, the CL and the ID as shown (marked in black, green and red, respectively). The NL
and the CL exhibit only moderate deviations from their initial structures, reaching, at the
end of the simulation, RMSD values of ~ 0.27 nm and ~ 0.36 nm, respectively. The ID
section of the protein is much more flexible than the lobes, as its RMSD exhibits large
variations before and after the conformational change of the complex. These events are
depicted with higher temporal resolution in Fig. 1.4C. During the 2.1−2.8-ns time
interval, the ID flexes for a very short time. Its RMSD increases to > 0.3 nm, but
eventually the ID relaxes to its original structure. Within this time frame, the CL slowly
regains a new conformation, roughly doubling its RMSD from 0.1 to 0.2 nm. Once the
new conformation of the CL was attained, the next structural modulation of the ID
appears to lead to the main conformational change at t ~ 3.2-ns. The RMSD of the CL
further increases and, after a short delay, the NL undergoes its conformation change.
Thus, the structural transition process experiencing the protein-peptide complex is
composed of a set of sequential steps, not a concerted motion.
Once the complex had reached its new stable conformation, the protein and the
peptide do not exhibit further major conformational changes. The calculation of the
34
backbone RMSD value for the last 2-ns of the simulation, relative to the backbone heavy
atoms’ position at t = 10-ns, reveals only moderate deviations around 0.25 nm (Fig.
1.4D). This is an indication that, in the refolded state, the complex does not sample many
other conformations but rather settles down to a stable configuration with no further
evolution.
1.2.2 Relative Rotation of the Helices During the Simulation
The Mlc1p protein, like CaM, consists of 8 well-defined helices (A−H).
Inspection of the Mlc1p-IQ4 complex's structure (Fig. 1.3) reveals that most of the alpha
helices retained their structure. The variations of the RMSD reflect mostly the rotation of
the helices one with respect to the other. This motion can be expressed in changes of the
angle between the helices. Thus, in order to examine the relative rotation of the helices
during the simulation, we have calculated the inter-helical angles between each
successive pair of them, and present the results in Fig. 1.5A−C. A snapshot from the time
point t = 12-ns is shown on the right of each panel in order to highlight the discussed
helices. It should be noted that the cartoon diagram of the Mlc1p-IQ4 complex is
presented solely clarification purposes, and we can not deduce the angle itself from it.
The first three helices (A−C) are located at the NL, which hardly experienced
conformational changes (see Fig. 1.3). The angles between the helices of this lobe barely
changed throughout the simulation course (data not shown). The angle between the NL
and the first helix of the ID was also stable with time, and not affected by the structural
transition of the complex. The rest of the protein was more flexible, and the dynamics of
the relative motions of its helices is presented in Fig. 1.5. The first global motion of the
35
Mlc1p protein is a slow (~ 1-ns), ~ 35° modulation in the F-G angle (Fig. 1.5A),
representing a deformation of the CL (~ 1−2 ns). Immediately later, this angle exhibits a
reversible bending, a process that may be considered as a preparation for the major
structural deformation event. This event is characterized by rotations of two pairs of
helices (D vs. E and G vs. H).
A
F G
B
D
E
C
H
G
D
N-section
C-section
Figure 1.5: The dynamics of the relative rotation of the alpha helices of the Mlc1p
protein and the IQ4 peptide during the 12-ns simulation time. In order to clarify which
angle is presented on each panel, a cartoon diagram of the complex after 12-ns of
simulation is shown on the right of each panel in black, whereas the discussed helices
are shown in the domain's characteristic color. (A) The inter-helical angle between
helix F (residues 102−111) and helix G (residues 119−126) of the CL; (B) The inter-
helical angle between helix D (residues 60−70) and helix E (residues 81−92) of the
ID; (C) The inter-helical angle between helix G (residues 119−126) and helix H
(residues 138−147) of the CL; (D) The inter-helical angle between the N- (residues
3−8) and C- (residues 18−23) sections of the IQ peptide.
36
The D and E helices are the components of the ID, with a hinge located between
them at residues 71−80. This hinge exhibits a considerable flexibility, demonstrated by
NMR studies of Apo-CaM (134, 135), crystal structures of Holo-CaM by itself (136) and
in a complex with peptides (49, 50), and MD simulations of Apo- and Holo-CaM (78,
81). Hence, it is well established that the ID is a flexible bar rather than a stiff stick. Also
in the present simulation the hinge flexes, initiating the collapse of the extended structure.
The angle between the helices of the ID increases by almost 80° within 400-ps starting at
~ 3.2-ns (Fig. 1.5B). This bending leads to the refolding of the dumbbell-like shape of the
Mlc1p protein to its final model solution structure.
Another pair of rotating helices (G and H) is located at the CL (Fig. 1.5C). The
angle between them decreases gradually by ~ 50° during a time frame that precedes the
major structural change event (~ 2−3 ns). This change is followed by a sharp increase in
the angle, corresponding with the snapping of the straight helix of the peptide into two,
almost perpendicular, sections.
In order to demonstrate the modulation of the IQ4 peptide during the simulation,
two alpha helices, in its N- and C- sections, were defined. The angle between them is
presented in Fig. 1.5D. The IQ4 peptide, which at the initial state consists of a single
alpha helix, shows a significant flexibility. Its straight alpha helical structure is snapped
in its middle during the complex's structural transition, in a process that is slightly
delayed with respect to the major conformational change, when the two shorter helices
attain an almost perpendicular relation. The process, in which the angle between the
helices increases from ~ 20° to ~ 70°, is sharp and quick.
The major conformational change of the Mlc1p protein is preceded by reversible
37
changes at the RMSD values of the ID (Fig. 1.4C), and the inter-helical angle between
helices of the CL (Figs. 1.5A and 1.5C). Analysis of these events shows that the protein
undergoes several minor conformational changes that precede the major conformational
modulation. These changes may be needed to allow the modulation of the protein
structure, and to enable the structural change (i.e. the accumulation of many minor
changes drives the major refolding of the complex). It is of interest to point out that the
rotation of the CL protein’s helices (Figs. 1.4A and 1.4C) precedes the sharp rise at the
RMSD values (Figs. 1.3A−C), and then proceeds, although after that the RMSD function
seemed to reach constant values. This may indicate that the structural modulation of the
structure is not an instantaneous event, but rather a complex “Plate Tectonics”, whereas
the forces are operating well before and after the deformation event.
1.2.3 The Structural Characteristics of the Compaction Event
The variations of three major structural characteristics of the Mlc1p protein with
simulation time (namely, the length of its ID, the distance between its N- and C- lobes'
centers of mass, and its gyration radius) are presented in Fig. 1.6A.
A B
Figure 1.6: (A) The modulation of the structure of the Mlc1p protein as a function of the
simulation time. ID's length (green); Distance between the NL center of mass and the CL
center of mass (red); Radius of gyration (black); (B) Solvent Accessible Surface Area
(SASA) of the structure of the Mlc1p-IQ4 complex as a function of the simulation time.
38
As the ID of the Mlc1p protein experiences the most dramatic changes of
structure, we quantified its contraction during the simulation. The distance between the
first and the last alpha carbons of the ID of the protein was followed throughout the
course of the 12-ns simulation (Fig. 1.6A, green). The ID gradually shortens from 4.77 to
2.78 nm, demonstrating a significant conformational flexibility. The flexible ID
extensively curves, bringing the two lobes of the Mlc1p protein closer. The approach of
the lobes towards each other is also expressed by a decline of the distance between the
mass centers of the NL and the CL of the protein (Fig. 1.6A, red), from 4.11 to 2.99 nm.
This modulation reflects the transition from the extended complex structure at the crystal
to a more compact structure at the end of the simulation. The decrease of the distance
between the lobes occurs due to a twist of the ID. This can be manifested by calculation
of the gyration radius of the Mlc1p protein (Fig. 1.6A, black), which drops from 2.22 to
1.78 nm. The decrease in the radius of gyration demonstrates the decline in the overall
dimension of the Mlc1p protein. Overall, the conformational change at t ~ 3.2 ns leads to
a compaction of the protein. This can be observed by variations of the ID's length, the
distance between the N- and C- lobes' centers of mass, and the gyration radius.
The effect of the compaction of the protein on the Solvent Accessible Surface
Area (SASA) of the complex was calculated for both the hydrophobic and the hydrophilic
residues, and the combined SASA is presented in Fig. 1.6B. Both parameters exhibited a
significant reduction. The total SASA decreased with time, exhibiting a sharp drop (at t ~
3.2-ns) of the surface area within less than 100-ps from ~ 112 nm2 to ~ 102 nm
2. The time
point of this drop is associated with the major structural change of the complex. This
suggests that when one inspects the SASA criteria, taken as a whole, the structural
39
change is quick rather than gradual.
The dynamics of the various parameters reflecting the contraction, as presented in
Fig. 1.6A, demonstrate that the structural changes are not an instantaneous event, but
rather a prolonged event stretching over ~ 1500-ps (~ 2.5-4 ns). Thus, the transition that
appears to be rapid when the RMSD (as displayed in Figs. 1.4A−C), the inter-helical
angles of the protein and the peptide (Figs. 1.5B and 1.5D), or the SASA (as displayed in
Fig. 1.6B) are examined, is actually a sum of individual steps as presented in Fig. 1.6A.
Some aspects of the overall compaction are abrupt, while other aspects are gradual. What
is more, after the major contraction takes place, the protein-peptide complex continues to
reshape for a long period. This shows that the post compaction relaxation is also a
complex, multi-component process.
1.2.4 The Forces that Stabilize the Refolded Model Structure of the Complex
To gain a comprehensive evaluation of the energetic aspects of the structural
modulation process, one needs to examine not only the protein-peptide complex but also
its interaction with the solvent, and the solvent-solvent interactions. However,
determination of parameters such as the solvent's cavitation energy, solvent's entropy,
solvation energy and the Long-Range electrostatic energies in a periodic system pose
both practical and conceptual difficulties when dealing with MD simulations. Therefore,
we shall limit our discussion only to certain specific aspects of the protein's energy, being
aware that only part of the overall system is analyzed. Accordingly, the variations of the
contraction of the protein in terms of VdW interactions (the Lennard-Jones component of
the potential energy), electrostatic interactions (Short-Range Coulomb potential), and
40
entropy changes are presented below.
The Lennard-Jones (LJ) component of the potential energy interaction among the
residues of the Mlc1p protein, and the LJ component of the potential energy of the
protein-peptide interaction, are presented in Figs. 1.7A and 1.7B, respectively. The LJ
contribution to the potential energy, calculated for the interaction between the residues of
the Mlc1p protein, is changing rapidly, covering the time frame associated with the major
contraction event (Fig. 1.7A). The gain in stability, due to the better interaction between
hydrophobic residues of the Mlc1p protein, is ~ 200 kJ/mol. The contraction also
increases the contact between the protein and the IQ4 peptide, a process that contributes a
similar amount of stabilization but follows a different time course (Fig. 1.7B). Instead of
a rapid formation of LJ stabilization energy, the process is stretched over ~ 4-ns, slowly
gaining its final level. Thus, the slow evolution of the LJ stabilization, in a time frame
where there is hardly a change at the RMSD value, is attributed to minor rearrangements
of the side chains; these small motions make a large contribution to the stabilizing
energy. The LJ potential of the peptide with itself was constant in time (data not shown).
Figure 1.7: The contribution of the Lennard-Jones interactions to the stabilization of the
refolded model state of the Mlc1p-IQ4 complex. (A) The LJ component of potential
energy of the interactions among the residues of the Mlc1p protein; (B) The LJ
component of potential energy of the interactions between the Mlc1p protein and the
IQ4 peptide.
B A
41
The contribution of the Short-Range (SR) electrostatic interactions to the stability
of the contracted complex is rather small (Fig. 1.8). However, it should be mentioned that
the mode of calculation is more qualitative than quantitative as described below.
The Coulomb SR intra-complex electrostatic interactions between the charges on
the protein and the peptide exhibit a transient nature. As the structural elements of the
system shift their relative position (2.5–4 ns time frame), there is a gradual decrease of
the overall electrostatic potential. Yet, when the side chains readjust (4–6 ns time frame),
the Coulomb SR potential increases and becomes less favorable. This leads to initial and
final levels of the overall electrostatic interactions being about the same.
Though the calculation of the Coulomb SR interactions was performed using a
cut-off of 12 Å, inspection of the variation of the Coulomb SR interactions, calculated for
longer cut-off distances (14, 16, 18, and 20 Å), revealed pattern similar in shape with that
presented in Fig. 1.8. As the amplitude of the electrostatic potential varies with the cut-off
distance, the values presented in Fig. 1.8 should not be taken as a quantitative
Figure 1.8: The electrostatic potential of the Mlc1p-IQ4 complex as a function of the
simulation time. The data is a summation of all Short-Range Coulomb potentials
between charges on the complex.
42
representation of the electrostatic interactions. Apparently, the qualitative description of
the process is independent of the cut-off used. Thus, we conclude that the overall
electrostatic potential responses transiently to the structural modification process,
relaxing after few nano-seconds to its original value. According to our simulation results,
the stabilization of the Mlc1p-IQ4 complex is the sum of VdW interactions and a
transient variation of the electrostatic potential of the system. However, the VdW
interactions seem to be the dominant stabilizing force operating on the complex, while
the electrostatic interactions exhibit reversible changes. Apparently, the stability gained
by the rearrangements of the side chains, located both on the Mlc1p protein and on the
IQ4 peptide, follows the major changes of the main structural elements of the protein.
While the compaction event is brief, the LJ forces leading to the refolding operating for a
relatively long time.
In addition to the potential energy and electrostatic terms, one should also
consider the entropic changes associated with the compaction of the complex's structure.
The refolding of the protein and the tighter packing of its side chains reduced the freedom
of motion of the interacting residues, thus affecting the entropy of the system. To
estimate the contribution of this term, we calculated the entropy of the protein-peptide
complex (prior to and following the compaction event). These calculations indicated that
the entropy of the complex decreased, upon compaction, from an average value of 26.02
± 0.16 kJ mol-1
K-1
(calculated in time frames of 1000-ps between 1000 and 3000 ps) to
25.31 ± 0.29 kJ mol-1
K-1
(calculated in time frames of 1000-ps between 4000 and 12000
ps). This is equivalent to an entropic destabilization of the refolded conformation by ~
210 kJ/mol (300 K). The decrease of the complex’s entropy is due to formation of a
43
tighter, more compact conformation of the refolded protein. However, the entropy of the
solvent, which cannot be directly obtained from the MD simulation, is likely to increase
upon the compaction event due to release of water molecules from the protein's surface to
the bulk.
To account for the release of water molecules from the solvation layer of the
Mlc1p protein, its first solvation layer was defined as a 2.8-Å-thick shell around it. The
number of water molecules found within the solvation layer, as it varies with time, is
presented in Fig. 1.9.
It can be seen that the compaction process is accompanied with a sharp decrease
in the number of water molecules that are in the VdW contact with the protein. Some 20
water molecules are released from the interface of the protein to the bulk at a time point
associated with the major structural modification event. Besides the abrupt fast reduction,
a slow trend of a decrease in the number of water molecules in the solvation layer
throughout the time is also observed. The detachment of the water molecules from the
Water Molecules at the First Hydration Shell
Figure 1.9: The number of water molecules at the first solvation layer (2.8Å) of the
Mlc1p protein as a function of the simulation time.
44
hydration shell of the solute and their rejoining to the solvent, produce an entropic
advantage for the new, refolded model structure of the protein-peptide complex.
Apparently, the release of water molecules from the solvation layer compensates for the
reduction at the protein's entropy, thus assisting in the overall free energy balance.
1.2.5 The Interactions Between Residues During the Structural Change of the
Complex
The detailed interactions between individual residues of the Mlc1p protein and the
IQ4 peptide were followed at three stages: at the crystal structure, during the simulation
but before the compaction event (from the beginning of the simulation till 2-ns), and at
the last half of the simulation when the new compact configuration was already formed
(6−12-ns). For each state we listed the residues on the peptide which are in contact (less
than 4 Å) with the protein. In the case of the crystal structure, we searched for present
contacts between residues at the static presentation. For the simulated solution structures,
either before or after the compaction, we chose to present only contacts between residues
that were present in at least 80 % of the snapshots. This restriction eliminated transient
interactions that make a negligible contribution to the stabilization of the compacted
complex. The results are summarized in Fig. 1.10.
At the crystal structure, 31 contacts between the IQ4 peptide and the Mlc1p
protein are found (Fig. 1.10A). 24 of these contacts involve the CL of the protein (green),
and the rest are with residues of ID (red). During the MD simulation, the complex
underwent an initial relaxation and assumed the initial solution model structure, in which
the number of contacts between the IQ4 peptide and the CL was reduced to 18 (Fig.
45
1.10B), 3 of them not being present at the crystal structure. In this intermediate state, the
number of contacts with the ID was reduced to 5.
Figure 1.10: Contacts between residues of the IQ4 peptide with side chains of the
Mlc1p protein. The residues of the protein are marked by their location for identification
of their presence. The color code: NL (blue), ID (red), CL (green). (A) Contacts (less
than 4 Å) between residues of the Mlc1p protein and the bound IQ4 peptide obtained
from the crystal structure PDB 1M46; (B) Contacts (less than 4 Å) between residues of
the Mlc1p protein and the bound IQ4 peptide obtained from the simulation at t <= 2 ns.
Only contacts persist at least 80 % of the examined period are presented; (C) Contacts
(less than 4 Å) between residues of the Mlc1p protein and the bound IQ4 peptide
obtained from the simulation at t >= 6 ns. Only contacts persist at least 80 % of the
examined period are presented. In Figs. 1.10B and 1.10C, contacts that are common for
the solution states and the crystal structure, are marked by bold letters.
46
After the structural transition of the complex's structure, the number of contacts
between the peptide and the protein increased to 42 (Fig. 1.10C). 25 of these contacts are
with the CL, 4 new contacts are with the NL (colored in blue), and 13 with the ID (8 of
which were not present at the crystal structure). It is of interest to point out that the
residues 21−23 of the IQ4 peptide, which at the crystallized complex made no contacts
with the Mlc1p protein, interacted with its CL during the simulation.
Each of the interactions summarized in Fig. 1.10 had its own time evolution. Most
of the contacts appear in conjunction with the major structural transition event, yet in
some cases the interaction appears to be composed from more than two states. Hence, in
order to examine more closely the interaction between the amino acids formed the
contacts presented in Fig. 1.10, the minimal distances between all of the pairs were
followed as a function of time. Two representative examples are shown in Fig. 1.11.
As presented in Fig. 1.11A, the interaction of R4 (of the IQ4 peptide) with N36
(located at the NL of the Mlc1p protein) has at least two states. Before the compaction
event, the distance between the residues was larger than 1 nm; after that, it fluctuated
wildly since there was no interaction between the residues. At t ~ 3.2-ns, the structural
deformation of the complex' model structure imposed a stable separation of ~ 0.7 nm
between the residues. Yet, at t ~ 5-ns, a new structural organization took place and atoms
of the two residues almost reached a contact of their VdW radii (d = 2.5 Å). A different
scenario was observed concerning the interaction of Q10 of the IQ4 peptide (a conserved
residue of the IQ family), with residue F92 (located at the ID of the Mlc1p protein),
presented in Fig. 1.11B. In this case, the approach of the two residues towards each other
is a slow, gradual process, which resembles the slow evolution of the LJ stabilization of
47
the complex (Fig. 1.7B). Apparently, the initiation of the approach precedes the structural
deformation event, yet proceeds for several ns.
1.2.6 MD Simulation of the Mlc1p-IQ4 Complex at 400 K
CaM and CaM-like proteins are rather stable and, in some cases, their purification
employs a brief boiling of the proteins’ mixture (137-139). Most of the proteins, unlike
the CaM and CaM-like proteins, cannot stand this treatment and precipitate. To check
whether the elevated temperature may cause further structural changes of the protein, and
detect changes not observed at 300 K, we repeated the simulation at 400 K.
According to our simulation results, the elevated temperature does not affect the
refolding mechanism of the protein-peptide complex, but rather its rate. Fig. 1.12A
depicts the structure of the Mlc1p-IQ4 complex after 12-ns of simulation at 400 K., while
superposition between the structures of the complex at both temperatures is presented on
Fig. 1.12B. Inspection of the model structure of the complex at the end of the simulation
A B
Figure 1.11: Representative examples of contacts between the IQ4 peptide and the
Mlc1p protein as a function of the simulation time. (A) Minimal distance between
residue R4 of the peptide and residue N36 of the protein; (B) Minimal distance between
residue Q10 of the peptide and residue F92 of the protein.
48
Figure 1.12: (A) Cartoon diagram of the Mlc1p protein bound to the IQ4 peptide (PDB
file 1M46). Simulated model structure, after 12-ns of MD simulation at 400 K in the
presence of water and ions. The NL, the ID, the CL and the IQ4 peptide are shown in
blue, red, green and yellow, respectively. The domains of the protein are defined as
residues 1−59, NL; 60−92, ID; and 93−148, CL. Structure is presented with the same
orientation as in Fig. 1; (B) Superposition of the cartoon diagrams of the simulated model
structures of the Mlc1p-IQ4 complexes at 300 K and 400K. The Mlc1p protein and the
IQ4 peptide at 300 K are shown in black and gray, respectively; whereas the Mlc1p
protein and the IQ4 peptide at 400 K are shown in red and orange, respectively. (C) The
backbone atom RMSD of the different domains of the Mlc1p protein as a function of the
simulation time, for the MD simulation at 400 K. The NL, ID and CL of the Mlc1p
protein are shown in black, red and green, respectively.
A
C B
at 400 K reveals similarity with the 300 K simulation: 1. The Mlc1p protein lost its
extended configuration, exhibiting a compact configuration; 2. The NL retained two of its
helices and a good contact with the N-terminal section of the peptide; 3. The ID folded in
a mode similar to that gained at 300 K; 4. The CL retained its three helices while
engulfing the IQ4 peptide; 4. At the elevated temperature the IQ4 peptide lost a fair
fraction of its alpha helix structure. Yet, as can be seen from the superimposed structures,
it is located between the lobes of the protein at both temperatures.
49
The mechanism of the conformational transition is basically comparable at the
two temperatures with a clear acceleration at the higher temperature. The RMSD values,
calculated for the structural elements of the protein: the NL, the CL and the ID (marked
in black, green and red, respectively), are presented in Fig. 1.12C. The quick rise of the
RMSDs of the structural elements of the protein within the first nano-second of the
simulation, especially that of the ID, illustrates the fast alternation of the protein's initial
structure when the temperature is elevated. Evidently, the RMSDs of the MD simulation
at 400 K increased at the beginning of the simulation to levels achieved only after ~ 3.2-
ns by the MD simulation at 300 K. A careful analysis of the trajectories at both
simulations reveals high similarity. Moreover, since also the final level of RMSDs is akin
at both simulations, we suggest that the modulation's speed of the complex' structure is
temperature-dependent.
1.2.7 Summary
In the present study we investigated, through MD simulations, the dynamics of a
complex between a CaM-like protein and its target peptide, the IQ4 peptide. This
complex, which is a Ca+2
-independent stable structure, underwent a complicated
structural transition process. The process appeared to be initiated by fluctuations of the
model structure in the multi-dimensional parameter space, with no clear triggering event.
Once the proper situation was attained, the reaction became unidirectional and the system
gained stability in few steps. Only 15 out of 31 contact points detected at the crystal
structure were retained in the solution model structure, while 27 new interactions were
established. The structural modification of the complex involved bending of the angles
50
between the alpha helices of the Mlc1p protein, and breaking of the alpha helical
structure of the IQ4 peptide into two, almost perpendicular sections. Some of the
structural changes coincide with the main conformational transition event. This event was
preceded by preparatory events that proceed in time, and followed by some “after-
shocks” that helped to seal and stabilize the refolded model structure.
It is likely that the progression of the simulation, even before the major structural
transition event, included stepwise preparatory steps. Since the MD simulation at 400 K
yielded a final conformation similar to the one at 300 K, we suggest that the pattern of the
structural transition is embedded as an inherent instability of the crystal structure of the
Mlc1p-IQ4 complex. Once the restraints imposed by the crystallization conditions were
released, the system progressed into a stable model solution structure at a temperature-
dependant rate.
The refolding of the protein-peptide complex is composed of both fast, sub pico-
seconds events, and also slow events that take up a few ns. The stabilization of the
compact state is mostly hydrophobic in nature, although electrostatic interactions also
take part in this process. The main deformation is probably directed by introductory
events, followed by the rapid process of compaction. The continuing stabilization of the
protein-peptide complex can be considered as optimization of the packing of the refolded
model structure, minimizing the LJ potential energy by local shuffling of atoms at the
Mlc1p-IQ4 complex contact surface. These proceeding events help to “close the deal”,
thus completing the process till a new configuration is formed.
The number of residues involved in the interaction between the IQ4 peptide and
the Mlc1p protein is much larger than the number of conserved residues that define the
51
IQ motif. What is more, the 4 conserved motif residues L9, Q10, R14 and R20 contribute
only 15 out of the 42 contact points that stabilize the refolded complex. Apparently, the
plethora of residues that participate in the interaction between the Mlc1p protein and the
IQ4 peptide allows some tolerance to a single point mutation of a given site.
We wish to note that the proposed simulated model solution structure of the
Mlc1p-IQ4 complex presented in this study differs from its crystalline structure. Our
suggested solution model structure resembles structures of Holo-CaM and Apo-CaM in a
complex with peptides, obtained by NMR (47) and crystallography studies (48-54).
Moreover, the crystal structure of the Mlc1p protein in a complex with the IQ2 peptide
(28, 56) is similar to the simulated model structure of the Mlc1p-IQ4 complex. However,
experimental methods, like Dynamic Light Scattering or NMR, may be carried out to
verify the proposed simulated model solution structure of the Mlc1p-IQ4 complex.
Nevertheless, this kind of experiments is beyond the scope of the present study.
52
2. Comparison Between the Mlc1p-IQ2 and the Mlc1p-IQ4 Complexes
In this chapter of the thesis, the MD simulations of the Mlc1p-IQ2 and the Mlc1p-
IQ4 are compared. The crystal and the simulated structures are judged against each other,
the dynamic properties of both complexes are described, and a comprehensive energetic
detailed analysis using the MM-PBSA approach is presented.
2.1 Crystallographic and Simulated Structures Comparison
The crystallographic structures and those obtained by the simulations of the
Mlc1p protein with the IQ2 and IQ4 peptides are presented at the previous chapter of the
thesis. At the crystal state of the Mlc1p-IQ2 complex, the protein assumes a compact
state; while at the crystal state of the Mlc1p-IQ4 complex, the Mlc1p confers to an
extended configuration. These distinctive states can be attributed either to complex-
specific interactions, to the different crystallization conditions of each complex (28, 56),
or to a combination of both. A close examination of the crystal and simulated structures
of both protein-peptide complexes (Fig. 2.1) reveals that, following the simulations, the
protein’s model structures are more similar than in their crystalline states.
The calculated backbone atoms RMSD between the Mlc1p proteins found at the
Mlc1p-IQ2 and the Mlc1p-IQ4 crystal structures (Figs. 2.1A and 2.1C), is as large as
1.296 nm. However, this value drops to 0.959 nm when calculated between the solution
model of the protein after 12-ns simulations (Figs. 2.1B and 2.1D). A value of 0.959 nm
may still seem to be quite high, but it should be noted that, due to the shape of the
protein, any attempt to align two of its structures is expected to result in a relatively high
RMSD. Therefore, the decline of the RMSD from 1.296 nm to 0.959 nm is structurally
53
notable, as seen in Fig. 2.1. On the top of the RMSD calculation, other structural
indicators (such as length of the ID, distance between the lobes center-of-mass and
gyration radius) were calculated for the crystal and the simulated structures of the protein
(data not shown). All these point out that the solution configurations of the Mlc1p protein
are more similar than its crystalline ones. Additionally, it should be mentioned that the
simulated model structures of the protein acquire special conformational features not
present in either of the crystals. Besides exhibiting a common compact form of the
simulated protein, during the simulations a new kink appears at both of its model
structures (Figs. 2.1B and 2.1D, see arrow). This kink is missing from the crystal
structures of neither of the Mlc1p-IQ complexes.
A CB D
Figure 2.1: Cartoon diagrams of the crystal and the simulated structures of the Mlc1p
protein when it binds the IQ2 peptide (PDB 1M45), and the IQ4 peptide (PDB 1M46).
The NL (residues 1−59), the ID (residues 60−92), the CL (residues (93−148), and the IQ
peptides are shown in blue, red, green and yellow, respectively. Both crystal structures,
and both simulated solution structures, are presented with the same orientation, where
the N-lobes are structurally aligned. (A) The crystal structure of the Mlc1p-IQ2
complex; (B) The simulated model structure of the Mlc1p-IQ2 complex after 12-ns
simulation; (C) The crystal structure of the Mlc1p-IQ4 complex; (D) The simulated
model structure of the Mlc1p-IQ4 complex after 12-ns simulation. The black arrows in
frames B and D mark the kink of helix D.
54
Though both simulated model structures are characterized by an overall similarity,
there is still a difference between them, as the CL of the protein points towards opposite
directions. The position of the CL of the Mlc1p protein in respect to its NL may be
expressed by calculation of the dihedral angle between the planes defined by the two
lobes and the ID. We found that, at the end of the simulations, the dihedral angle at the
Mlc1p-IQ2 simulated model structure is 128.54°, while the dihedral angle at the Mlc1p-
IQ4 simulated model structure is significantly smaller, 72.1°. The difference represents
two binding modes and orientations of the IQ peptides in respect to the protein, and two
different conformations of the latter. Apparently, various Mlc1p-binding partners may
affect and dictate its structure. The Mlc1p protein is very flexible, enabling it to wrap
around the IQ peptides in different ways. Its tolerance and adaptivity towards different IQ
peptides make it an appropriate candidate to bind a variety of target helical peptides.
Evidently, this unique property enables it to bind six IQ peptides of the LCBD of myosin
V, each distinguished by a unique sequence.
2.2 Structural Evolution of the Simulated Mlc1p Protein at the Protein-Peptide
Complexes
A dynamic comparison of the trajectories of the Mlc1p protein in both simulations
is shown in Fig. 2.2. The RMSD of the C-alpha atoms of the Mlc1p protein obtained
from the Mlc1p-IQ4 simulation's trajectory is presented in relation to the C-alpha atoms
of the Mlc1p protein obtained from the Mlc1p-IQ2 simulation's trajectory. The two-
dimensional matrix representation exemplifies the time evolution of the protein's
configurations as a function of the simulations time. A color code is used for visualizing
55
Figure 2.2: Matrix representation of mutual C-alpha atoms' RMSD of the Mlc1p protein
obtained upon comparison of the two simulations. The RMSD values of the Mlc1p protein
for the Mlc1p-IQ4 simulation's trajectory were calculated in relation to those of the Mlc1p
protein obtained from the Mlc1p-IQ2 simulation's trajectory and vice versa. The values
are given by color codes; where blue and red represent high and low similarity,
respectively.
how the two model structures of the Mlc1p protein, in the Mlc1p-IQ protein-peptide
complexes, approach a common compact shape during the simulations. The pattern of the
colors exhibits progressive yet reversible changes, indicating that both model structures
are fluctuating, and the similarity between them varies. The mutual evolution of the
conformational changes, as shown in Fig. 2.2, suggests that the convergence and settling
of the Mlc1p protein's model structures towards relatively similar compact configurations
can be divided into three phases: a relaxation phase (from the beginning of the simulation
till ~ 3.2-ns), a progression phase (from ~ 3.2 till ~ 10-ns), and a quiescent phase (from ~
10-ns till the end of the simulations).
56
The relaxation phase is represented by the yellow-reddish color laying at the
bottom of the figure, stretching over its full width and extending up to ~ 3.2-ns mark of
the ordinate. At this phase, the Mlc1p protein at the Mlc1p-IQ4 complex responds to the
absence of the packing forces present at the crystal lattice. Hence, the model structures
sampled by the protein from both simulations are still remarkably different, each
remaining close to its crystal form. At the progression phase, the Mlc1p protein at the
Mlc1p-IQ4 simulation undergoes a major conformational change (140), rendering it more
similar to that of the Mlc1p-IQ2 simulation. This newly gained configuration is stabilized
by hydrophobic interactions, where minor rearrangements of the side chains contribute to
a relatively slow progression stabilizing process. The refolded conformation of the Mlc1p
protein obtained from the Mlc1p-IQ4 simulation becomes more similar to the model
structure of the Mlc1p protein obtained from the Mlc1p-IQ2 simulation. This tendency
increases along the ordinate as seen by the shift from the yellow-greenish to the green-
bluish colors. Finally, at the quiescent phase, the new model structure of the Mlc1p
protein obtained from the Mlc1p-IQ4 simulation is already stabilized. The higher degree
of similarity between the protein’s model structures, as reflected by the smaller RMSD
values one with respect to the other, is observed at this phase (represented by the bluish
hue). Evidently, as the simulations progress, the model structures of the proteins become
more similar in a time-dependent manner. Thus, the final MD-derived solution model
structures of the protein are more similar than its crystal structures states. Furthermore,
these final MD-derived solution model structures of the protein resemble also compact
configurations of the CaM protein presented at crystal structures of Holo-CaM with target
peptides (48-54, 141).
57
It is of interest to point out that the model structures of the protein at both
simulations do not co-evolve in parallel. The model derived from the Mlc1p-IQ2
simulation experiences limited changes, whereas that derived from the Mlc1p-IQ4
simulation assumes a significant modification. Thus, the Mlc1p' model structures evolve
at different rates towards a more similar configuration.
Carrying out the same analysis only for the ID of the Mlc1p protein (data not
shown) reveals a pattern resembling that of the whole protein. The structure of the ID of
the Mlc1p protein, obtained from the Mlc1p-IQ2 simulation, is almost invariable, while
that obtained from the Mlc1p-IQ4 simulation evolves with time. A high degree of
similarity is obtained after ~ 10-ns, as observed for the whole protein.
2.3 The Root Mean Square Fluctuation (RMSF) of the Mlc1p Protein at the Protein-
Peptide Complexes
In order to further analyze the trajectories of the Mlc1p protein at both
simulations, we computed the standard deviation from the RMSD for each of its residues,
i.e. their Root Mean Square Fluctuations (RMSF). Fig. 2.3A presents the RMSF of the
Mlc1p protein at the Mlc1p-IQ2 structure simulation (solid line), and at the Mlc1p-IQ4
structure simulation (dashed line). Residues of the Mlc1p protein, that comprise alpha
helices at the crystalline configurations of the Mlc1p-IQ protein-peptide complexes, are
shown as bold horizontal bars parallel to the abscissa. The RMSF curves reveal the
different behavior of the protein at both simulations, characterizing the mobility of each
of its residues during the MD runs.
In general, the structural sections of the protein, namely its eight alpha helices, are
58
more rigid and confined and tend to be less flexible than its other sections (e.g. residues
39−50).
Correspondingly, the protein's unstructured sections show an increased motility.
Thus, at both simulations, the RMSF data indicate large fluctuations of segments
belonging to loops that connect secondary structure elements (e.g. residues 14−20 and
128−137), as well as of residues located at the edges of the alpha helical sections (e.g.
residues 90−92, and 123−125). While these features are common to both complexes, the
Mlc1p protein exhibits a different mobility when it binds the IQ2 or the IQ4 peptides as
the absolute values of the RMSF differ. The RMSF curve of the Mlc1p protein at the
Mlc1p-IQ4 simulation consistently reveals a higher extent of motion than that of the
protein at the Mlc1p-IQ2 simulation. Despite these different degrees of motion, the
A B
Figure 2.3: The Root Mean Square Fluctuation (RMSF) as a function of the residue
number of the Mlc1p protein. The RMSF was calculated for the backbone atoms of the
Mlc1p protein for each residue at both simulations. The solid line represents the RMSF of
the Mlc1p protein at the simulation of the Mlc1p-IQ2 structure, while the dashed line
represents the RMSF of the Mlc1p protein at the simulation of the Mlc1p-IQ4 structure.
The bold horizontal bars, drawn in parallel to the abscissa, represent the alpha helices that
the Mlc1p protein comprises. The RMSF of both MD trajectories is presented for the
entire simulations time (A), and for the time frame t = 10-ns till t = 12-ns (B).
59
RMSF of some sections of the protein is correlated (for example, the RMSF of the NL of
the protein, between both simulations, is correlated, with R2 = 0.69). However, other
sections of the protein do not exhibit such correlation. The variation in the correlations
between the structural domains of the protein implies that the main differences regarding
the dynamics of the protein at both simulations are located at its ID and CL.
It can be argued that the relatively high RMSF of the protein at the Mlc1p-IQ4
simulation is due to its structural modification process, by which it refolds, and that
process is still going-on. However, detailed structural and energy analysis presented in
our previous publication (140) suggests that its major conformational change had been
completed in the course of the simulation. In order to observe the behavior of the Mlc1p
protein at the stable period at both simulations, we repeated the RMSF analysis during the
time frame 10-ns till 12-ns (Fig. 2.3B). The RMSF curves reveal a pattern similar to that
calculated for the entire simulations time, although the extent of the fluctuations is
smaller. Nevertheless, the RMSF curves resemble each other more than those obtained
for the entire simulations' time. This indicates as well that the Mlc1p protein at the
Mlc1p-IQ4 simulation had already experienced its structural modification and may not
significantly evolve at the discussed time frame. We wish to note that, even at this time
frame, we found that the RMSF of the protein derived from the Mlc1p-IQ4 simulation is
still higher than that obtained by the protein from the Mlc1p-IQ2 simulation. This is in
accord with the observation that more contacts are involved in the Mlc1p-IQ2 interaction
than in the Mlc1p-IQ4 interaction (See Fig. 2.5, below). Thus, the protein at the latter
simulation is not strongly retained to its position and may be more mobile.
60
2.4 The Electrostatic Field Around the IQ Peptides
The Mlc1p-IQ protein-peptide complexes are composed of the same protein, and
similar, although not identical, peptides. Yet, the protein-peptide complexes differ one
from the other by their crystalline structures (Figs. 2.1A and 2.1C), and to some extent by
the simulated model structures obtained by means of MD simulations (Figs. 2.2B and
2.2D). Evidently, the differences between the structures of the protein-peptide complexes
reflect the variation sequence of the bound IQ peptides. To account for the difference
between the peptides, we calculated the electrostatic field surrounding the IQ2 and IQ4
peptides. The volumes of the averaged (for the time frame t = 10 until t = 12-ns) positive
and negative electrostatic fields of both peptides are presented in Table 2, while in Fig.
2.4 we show the Coulomb cages of the IQ peptides at the last snapshot (t = 12-ns) of the
simulations.
Table 2
The Electrostatic Field Around the IQ Peptides
* Calculated by summation of the volumes of the positive and the negative Coulomb
cages.
The positive (transparent blue) and negative (transparent red) domains are drawn
where the electrostatic potential equals 1 kBT/e. The C-alpha traces of both peptides are
IQ2 Peptide IQ4 Peptide
Charge +2 +6
Positively-charged residues K-7, K-14, R-19, R-21 R-4, K-11, K-12, R-14,
K-15, K-18, R-20, K-23
Negatively-charged residues D-24, E-25 E-16, E-25
Volume of the positive
Coulomb cage (Å3)
4257.12 ± 264.48 10180.42 ± 426.12
Volume of the negative
Coulomb cage (Å3)
2080.56 ± 147.25 796.02 ± 144.94
Total volume of the
Coulomb cage* (Å
3)
6337.69 ± 395.11 10976.76 ± 498.02
61
Figure 2.4: The electrostatic potential surface around the IQ2 peptide (A) and the IQ4
peptide (B) at the last snapshot (t = 12-ns) of both simulations. Both peptides are
presented in yellow with the same orientation, while their positive and negative residues
are drawn in blue and red, respectively. The Coulomb cages for the positive (transparent
blue) and negative (transparent red) domains are drawn at the distance where the
electrostatic potential equals 1 kBT/e.
A B
colored in yellow, while their positive and negative residues are shown in blue and red,
respectively. The potential field of the IQ peptides consists of two main lobes, one
positive and the other negative. However, although the IQ2 and IQ4 peptides are both
basic, alpha-helical, 25 amino acids long, they produce different electrostatic fields
around them. The volume of the positive Coulomb cage around the IQ2 peptide is
4257.12 ± 264.48 Å3, while the volume of its negative Coulomb cage is 2080.56 ± 147.25
Å3. The volume of the positive Coulomb cage around the IQ4 peptide is 10180.42 ±
426.12 Å3, while the volume of its negative Coulomb cage is 796.02 ± 144.94 Å
3.
62
The differences between the electrostatic fields surrounding the peptides are
caused by variations in their local charge and charge distribution. The total charge of the
IQ2 peptide is Z = +2, while that of the IQ4 peptide is Z = +6. The peptides differ not
only in their total net charge, but in the distribution of the charges along them as well.
Four positive residues (K-7, K-14, R-19, R-21) are scattered along the IQ2 peptide, while
its negative residues (D-24, E-25) are concentrated at its C-terminal edge. The IQ4
peptide consists a series of four positive residues (K-11, K-12, R-14, K-15) located in its
middle section, whereas its other positive residues (R-4, K-18, R-20, K-23) are
distributed along it. The mid-cluster of positive charge contributes to the high volume
positive Coulomb cage bulb at the center of the peptide. The differences between the
electrostatic fields of the two peptides may suggest that electrostatic forces play a major
key role in the protein-peptide interactions.
2.5 The Protein-Peptide Interaction Free Energies
In order to analyze the energetics of peptide binding to the Mlc1p protein, the
various components of the interaction free energy of the two protein-peptide complexes
were evaluated during the last 2-ns for each simulation. This time frame corresponds with
the stable MD-derived model solution structures of the protein-peptide complexes at both
simulations, from which we can calculate the energy associated with protein-peptide
interaction. The detailed results of the energetic analysis are presented in Table 3.
The analysis was based on the MM-PBSA approach (94), in which the interaction
free energy, (∆Ginteraction), for each of the complexes is composed of three energetic terms:
The molecular mechanics energy term (⟨∆EMM⟩), the solvation energy term (⟨∆Gsolvation⟩),
63
and the solute entropic contribution (T∆S). The first term includes internal (⟨∆Eint⟩), VdW
(⟨∆EVdW⟩) and electrostatic (⟨∆Eelectrostatic⟩) components. The second term consists of
electrostatic (⟨∆Gpolar,solvation⟩) and nonpolar (⟨∆Gnonpolar,solvation⟩) contributions. Solute
entropies were determined at the last snapshots of the MD trajectories. Note that the
internal component of the molecular mechanics energy, (⟨∆Eint⟩), is set per definition as
zero and thus cancels out, making no contribution at all (101, 103, 104).
Table 3
Components of the Mlc1p-IQ Interaction Free Energy
Energies are presented in kJ/mol. The calculations present the average values
obtained from t = 10 till t = 12-ns for both Mlc1p-IQ simulations. ⟨ ⟩ denotes an average
over a set a snapshots along an MD trajectory. Atomic charge and radii values were taken
from the PARSE parameter set (142).
Definitions of the energetic components are as follows: (⟨∆Eelectrostatic⟩),
electrostatic molecular mechanics energy; (⟨∆EVdW⟩), VdW molecular mechanics energy;
(⟨∆EMM⟩), total molecular mechanics energy defined as ⟨∆EMM⟩ = (⟨∆Eint⟩ + ⟨Eelectrostatic⟩ +
⟨∆EVdW⟩); ⟨∆Gpolar,solvation⟩, electrostatic contribution to the solvation energy calculated by
the PB approach; ⟨∆Gnonpolar,solvation⟩, nonpolar contribution to the solvation energy;
⟨∆Gsolvation⟩, total solvation energy defined as (⟨∆Gpolar,solvation⟩ + ⟨∆Gnonpolar,solvation⟩); T∆S,
solute entropic contribution; and (∆Ginteraction), total free energy of interaction defined as
(⟨∆EMM⟩ + ⟨∆Gsolvation⟩ – (T∆S)).
On combining the (⟨∆EMM⟩) with the (⟨∆Gsolvation⟩) and the (T∆S) terms, we end
up with interaction free energy, (∆Ginteraction), for the complexes' formation. The estimated
interaction free energy of the Mlc1p-IQ2 and the Mlc1p-IQ4 complexes is ~ -560 kJ/mol
Mlc1p-IQ2 Complex Mlc1p-IQ4 Complex
⟨∆Eelectrostatic⟩ -590.77 ± 49.96 -1534.02 ± 84.28
⟨∆EVdW⟩ -664.18 ± 25.2 -547.7 ± 25.88
⟨∆EMM⟩ -1254.95 -2081.72
⟨∆Gpolar,solvation⟩ 678.25 ± 41.85 1594.88 ± 78.84
⟨∆Gnonpolar,solvation⟩ -58.69 ± 5.59 -53.98 ± 5.48
⟨∆Gsolvation⟩ 619.56 1540.9
-T∆S 75 372
∆Ginteraction -560 -169
64
and ~ -169 kJ/mol respectively, consistent with their observed stability. These data
represent a balance between enthalpy and entropy in which, according to our calculations,
the complexes' formation is an enthalpically driven process and is entropically
unfavorable. The favorable formation of both Mlc1p-IQ complexes is driven by the
electrostatic (⟨∆Eelectrostatic⟩) and the VdW (⟨∆EVdW⟩) terms of the molecular mechanics
energy and the nonpolar component of the solvation energy (⟨∆Gnonpolar,solvation⟩).
Of particular interest is the total solvation energy, (⟨∆Gsolvation⟩), composed of
polar (⟨∆Gpolar,solvation⟩) and nonpolar (⟨∆Gnonpolar,solvation⟩) terms. The total solvation energy
is unfavorable at both complexes (619.56 kJ/mol for the Mlc1p-IQ2 complex, and 1540.9
kJ/mol for the Mlc1p-IQ4 complex). Thus, considering the solvation energy, it appears
that the protein-peptide complexes would rather not be formed at all. Yet, the molecular
mechanics energy component of the interaction energy strongly favors the complexes
over the unbound molecules.
Electrostatic interactions were assumed to play a dominant major role in the
interaction between CaM and target peptides (49, 143-145). This view was emerged from
X-ray structures of complexes showing close proximity between the negatively charged
CaM and positively charged peptides (47-54, 141). Accordingly, electrostatic interactions
have been anticipated to be significant in the Mlc1p-IQ systems, as the Mlc1p-IQ
complexes also present a close distance between a negatively charged protein and highly
positively charged peptides. It is of high importance to consider the electrostatic
component of the molecular mechanics energy, (⟨∆Eelectrostatic⟩), together with the
electrostatic contribution to solvation, (⟨∆Gpolar,solvation⟩), when examining the role of
electrostatics in the protein-peptide complexes formation. At both protein-peptide
65
complexes, the nature of the electrostatic interactions is similar: The molecular
mechanics electrostatic term per se favors the bound state of the complexes (⟨∆Eelectrostatic⟩
< 0), while the electrostatic PB solvation energy favors the unbound state of the protein-
peptide complexes (⟨∆Gpolar, solvation⟩ > 0). As the latter is dominant (|⟨∆Gpolar,solvation⟩| >
|⟨∆Eelectrostatic⟩|), their sum, representing the total electrostatic energy, opposes the
formation of the protein-peptide complexes. Thus, the positive solvation energy
electrostatic term penalty paid by the electrostatics of solvation is not completely covered
by favorable electrostatic interactions within the resulting protein-peptide complexes.
Evidently, the same phenomenon was also demonstrated by numerous studies (101, 103,
104, 128, 146-148), in which the total electrostatics between two interacting molecules
unfavors their bound state over the unbound due to intense solvation forces. Interestingly,
the electrostatic energy terms (⟨∆Eelectrostatic⟩) and (⟨∆Gpolar,solvation⟩) are more prominent in
the Mlc1p-IQ4 complex compared to the Mlc1p-IQ2 complex by approximately a factor
of three, which is proportional to the charge of the peptides (The charges of the IQ2 and
IQ4 peptides are +2 and +6, respectively). Hence, the differences in charges between the
protein and the peptide are more substantial in the Mlc1p-IQ4 complex, and consequently
its electrostatic terms are more profound.
It is not surprising that, at both complexes, the entropic term does not support the
interaction between the protein and the peptide. The Mlc1p protein, as well as the IQ
peptides, are characterized by higher entropy in their unbound states. In its free state, the
lobes of the Mlc1p protein can tumble more or less independently of one another,
constrained only by the ID. However, in the bound state, its lobes interact with the
peptide and hence are relatively at fixed positions. Similarly, the free IQ peptides may
66
also acquire more structural freedom when they are not constrained by the protein.
The energy calculations agree fairly well with isothermal titration calorimetry
(45, 149) and NMR relaxation (150) experiments, in which it was found that the binding
of a peptide to Holo-CaM is favored by enthalpy and opposed by entropy. Our results are
also in accord with an MD study of Holo-CaM complexed with a target peptide
suggesting that the protein-peptide free energy is enthalpy-dependent and not entropy-
dependent (92). As pointed out by these authors, identification of changes in entropy (45,
150) or enthalpy (92) upon complex formation is fraught with difficulty. Therefore, the
qualitative agreement between our calculations regarding the Mlc1p-IQ complexes
obtained by computer simulations, and experimental and theoretical calculations
regarding Holo-CaM peptide complexes, is encouraging and promising.
Finally, it must be stressed that the values given in Table 3 are model-dependent
and reflect all of the approximations implemented in the MM-PBSA formalism. Thus, the
numerical values should be taken as indicative, representing qualitative trends rather than
actual quantitative ones. The consistency of the results with the observed stability of the
complexes supports the acceptance of this mode of calculation as a proper representation
of the operating forces.
Owing to the opposite charges of the protein and the peptide, the electrostatic
interactions may serve as the initial driving force for long-range molecular recognition
between the Mlc1p protein and the IQ peptides. On the other hand, the highly charged
protein and peptides strongly interact with the solvent, leading to intensive solvation
forces. Upon formation of the protein-peptide complexes, the Mlc1p protein approaches
into a close vicinity to the IQ peptides. This desolvation process, which is unfavorable, is
67
accompanied by a release of water molecules from their interfaces, replacing solute-
solvent interactions by intra-complex interactions. The unfavorable change in the
electrostatics of solvation is mostly, but not fully, compensated by the favorable
electrostatic charge-charge interactions within the resulting Mlc1p-IQ complexes. The
close interaction of the Mlc1p protein with the IQ peptides is grossly mediated by the LJ
interactions, whereas their opposite highly charged surfaces contribute to their initial
attraction. The major role played by the molecular mechanics VdW interactions
demonstrates how inter-residues contacts, where the tight fitting of the surface atoms
occurs, contribute to the LJ stabilization energy term and consequently to the stability of
the complexes. In conclusion, electrostatic interactions seem to operate mostly during the
long-range attraction between the protein and the peptides before the complexes are
formed. Once protein-peptide contact occurs, VdW and non-specific hydrophobic
interactions stabilize the Mlc1p-IQ structures, whereas the contribution of salt bridges is
relatively negligible.
2.6 The Contacts Between the Protein and the Peptides
The detailed interactions between individual residues of the Mlc1p protein and the
IQ peptides were followed during the last half of the simulations (6−12-ns). Residues of
the peptides, which were in a contact (less than 4 Å) with the Mlc1p protein, were listed.
The rapid motion of the residues during the simulations led to numerous encounters, but
most of them were temporary and made a marginal contribution to the protein-peptide
interaction. To account for that, we selected only those contacts, between residues of the
IQ2 or the IQ4 peptides with the Mlc1p protein, which were present in at least 80 % of
68
the snapshots. These lasting interactions are presented in Fig. 2.5.
Altogether, 51 contacts between the peptide and the protein were found at the
Mlc1p-IQ2 simulation. 15 of these contacts involve the NL of the protein (blue), 14 the
ID (red), and 22 the CL (green). Similarly, at the Mlc1p-IQ4 simulation, 42 contacts were
found between the peptide and the protein (140). Only 4 of these contacts involve the NL
of the protein (blue), 13 the ID (red) and 25 the CL (green). It is of interest to point out
that 20 residues of the Mlc1p protein interact with both IQ peptides (underlined in Fig.
2.5), 12 of its residues interact only the IQ2 peptide, whereas 7 of its residues exclusively
interact the IQ4 peptide. Residues 24−25 of both IQ peptides made no contact with the
Mlc1p protein in neither of the simulations.
Figure 2.5: Contacts (less than 4 Å) between residues of the Mlc1p protein and the
bound IQ peptides obtained from the MD simulations at t >= 6-ns. Data regarding the
Mlc1p-IQ4 model structure simulation (140) are presented at the upper half of the
illustration, whereas data regarding the Mlc1p-IQ2 model structure simulation are
presented at its lower half. The IQ peptides, the NL of the protein, the ID of the protein,
and the CL of the protein are colored in black, blue, red, and green, respectively.
Residues of the Mlc1p protein that interact with both peptides are underlined. Only
contacts persist at least 80 % of the examined period are presented.
The IQ2 peptide has 4 positive residues (K-7, K-14, R-19, R-21), and 2 negative
69
residues (D-24, E-25) located in its C-terminal edge. These negative residues repel the
negatively charged Mlc1p protein, and hence only one residue of the protein (I-9)
interacts with the last 5 residues of the IQ2 peptide. In comparison, the IQ4 peptide is
more positive. It has a substantial cluster of positive residues (K-11, K-12, R-14, K-15,
K-18, R-20, K-23) located at its mid- and C-terminal parts. These positive residues are
bound to the CL of the Mlc1p protein mainly through hydrophobic interactions (i.e. R-14
− L-116, R-20 − V-128), but two salt bridges are also present (K-11 − E-114, R-14 − E-
120).
The different binding modes of the two peptides are reflected by their different
regions of contact with the Mlc1p protein at each simulation. At the Mlc1p-IQ2
simulation, the mid- and C-terminal regions of the peptide (residues 13−25) interact with
8 residues of the NL and 12 residues of the CL of the protein. On the other hand, at the
Mlc1p-IQ4 simulation, the mid- and C-terminal regions of the peptide (residues 13−25)
do not interact with the NL of the protein as they are bound to 17 residues of its CL. The
fact that the mid- and C-terminal sections of the IQ2 peptide interact with the NL of the
protein, while those regions of the IQ4 peptide do not interact with it, is manifested by
the various orientations of the peptides when bound to the protein (Figs. 2.2B and 2.2D).
Although the protein and the peptides are highly charged in opposite charges, out
of the 51 contacts between the IQ2 peptide and the Mlc1p protein, only 2 involve
electrostatic interactions (K-7 − E-114, K-14 − D-28). Similarly, out of the 42 contacts
between the IQ4 peptide and the Mlc1p protein, only 2 involve electrostatic interactions
(K-11 − E-114, R-14 − E-120). Besides these few electrostatic interactions, all the other
protein-peptide interactions are hydrophobic in nature. Matter of fact, even the positive
70
residues of the peptides interact with the positive residues of protein (K-14 − R-31, R-19
− R-147 for the Mlc1p-IQ2 protein-peptide complex; R-4 − R-31, R-14 − K-115, R-19 −
R-147, K-23 − R-147 for the Mlc1p-IQ4 protein-peptide complex). Considering the
repulsive force between positive charges, the interactions between the basic residues are
clearly hydrophobic. These data are consistent with the findings regarding the dominant
role played by the LJ component of the molecular mechanics energy and the nonpolar
component of the solvation energy in the stabilization of the protein-peptide complexes
(Table 3).
2.7 Summary
The present study provides a fundamental understanding of the Mlc1p protein’s
solution behavior in a complex with IQ peptides by sampling the conformational space of
two Mlc1p-IQ complexes. Our findings suggest that, although the IQ2 and the IQ4
peptides share similar sequence and structure, the fine details of each individual IQ
sequence determine its binding mode to the Mlc1p protein. The ability of the Mlc1p
protein to assume different conformations, which is driven by the specific IQ peptides, is
crucial. The flexibility of the protein and the dominance of its nonspecific hydrophobic
interactions with the IQ peptides are probably correlated with its ability to bind a wide
range of targets. Besides describing the structure and dynamics of the protein in the
presence of the peptides, we analyze the interaction free energy that governs the protein-
peptide formation. Using a combination of energies derived from MD simulations in an
explicit solvent, a continuum solvent model, and solute entropies contributions derived
from normal mode analysis, we have obtained approximate values for the protein-peptide
71
interaction free energy of both complexes. We found that favorable molecular mechanics
energy contribution profoundly supports this protein-peptide interaction, while the polar
solvation energy and the entropy oppose it.
72
3. Simulations of Free IQ Peptides
In the first two chapters of the thesis we found out that the solution model
structures of the Mlc1p-IQ complexes are both compact, while the IQ peptide is located
between the lobes of the protein. These observations may imply that the Mlc1p protein
contributes to the relative stability of the IQ peptides, while the peptides affect the
specific folding form of the protein. Since the only difference between the Mlc1p-IQ2
and the Mlc1p-IQ4 simulations is the IQ peptide, and in order to check how the IQ
peptides refold at the absence of the Mlc1p protein, we have performed MD simulations
of the peptides in solution. Accordingly, this chapter deals with MD simulations of free
IQ peptides. These simulations complement our research of the Mlc1p-IQ complexes, and
aim for elucidation the dynamic behavior of isolated IQ peptides without the presence of
the Mlc1p protein. Since none of the IQ peptides were crystallized at the absence of the
protein, their structures, which are basically alpha-helical, were derived from the
published crystal structures of the Mlc1p-IQ protein-peptide complexes (28, 56).
Considering the long duration of our simulations and the repetitive pattern of the models
obtained at different simulations' conditions, we argue that the predicted model structures
of both IQ peptides represent their solution conformations. Comparisons between the
dynamics of the free and the bound IQ peptides, between the free IQ2 and IQ4 peptides,
and between the simulations of the free IQ4 peptide at different conditions, were
performed.
3.1 Synopsis of the Presented Simulations
A series of MD simulations, presented in Table 4, was performed for various
73
durations and conditions. These simulations include one MD run of the IQ2 peptide, and
five MD runs of the IQ4 peptide. Additionally, an extended IQ4 was constructed and
subjected to an MD run as further elaborated.
Table 4
Summary of the MD Simulations of the IQ Peptides
* 400 K. †
~ 30 mM NaCl. ‡
~ 300 mM NaCl. ¶ ~ 2.4 M NaCl (mimicking the salt concentration in which the Mlc1p-IQ4 complex was
crystallized (56)). § An IQ4 peptide containing 10 additional amino acids flanking each of its terminals.
These added 20 residues belong to the IQ3 and IQ5 peptides. Thus, the 45 amino acids
elongated IQ4 peptide constitutes a portion of the poly-IQ sequence as present at the neck
of the myosin. This simulation is described in chapter 4 of the thesis.
3.2 Overall Conformational Changes During the Simulations
The crystallographic and simulated structures of the free IQ2 peptide are
presented in Fig. 3.1 (Snapshots from the simulation are shown every 20-ns). At its initial
configuration (Fig. 3.1A), the peptide is almost linear alpha-helix. After 20-ns, the
peptide stretches (Fig. 3.1B), keeping its linearity. At the next snapshots (Figs. 3.1C-F),
its N- and C- edges loose, while its mid-section retains its straight conformation. From
40-ns till 100-ns, the model structure of the peptide hardly changes.
In contrast to the IQ2 peptide, snapshots of the IQ4 peptide reveal a different
Duration
(ns) Conditions of simulation
IQ2 peptide 100 As written in section 1.2 of the Methods
IQ4 peptide 100 As written in section 1.2 of the Methods
IQ4 peptide 30 High temperature *
IQ4 peptide 20 Low salt concentration †
IQ4 peptide 20 High salt concentration ‡
IQ4 peptide 20 Very high salt concentration ¶
Extended IQ4 peptide 20 An elongated IQ4 peptide §
74
scenario (Fig. 3.2). The initial configuration of the IQ4 peptide is a linear alpha-helix
(Fig. 3.2A). After 20-ns of simulation it bends (Fig. 3.2B), whereas most of its secondary
structure is no longer alpha-helix. The conformation of the peptide continues to explore
the configurational space reaching a new configuration, observed at the next snapshots
(Figs. 3.2C-F). At this newly gained conformation, the IQ4 peptide is refolded in a
manner that its N- and C- edges are relatively close to each other. At the final
configuration, the IQ4 peptide is composed of two helices, separated by a coiled hinge.
The refolding process of the peptide, in which it loses most of its alpha-helical structure,
occurs in less than 20-ns. It can be seen that its conformation after only 20-ns of
simulation (Fig. 3.2B) grossly deviates from its initial one (Fig 3.2A). The secondary
structure analysis, presented in section 3.4 of this research, emphasizes the abrupt
transition of the peptide from an elongated alpha-helix into a conformation of two helices
separated by a coiled section.
A B C D E F
Figure 3.1: Snapshots of the crystal and the simulated structures of the free IQ2 peptide.
The structure of the IQ2 peptide as derived from the crystal structure of the Mlc1p-IQ2
complex (A). The simulated model structure of the IQ2 peptide after 20-ns (B); 40-ns (C);
60-ns (D); 80-ns (E) and 100-ns (F) of MD run.
75
From the comparison of the snapshots between the two free IQ peptides, it is
evident that the IQ2 peptide is more rigid and stiff than the IQ4 peptide. Whereas the IQ2
peptide keeps most of its structural features, the IQ4 peptide modifies into a helix-loop-
helix conformation through an intermediate state. This argument will be further
strengthened and elaborated
We repeated the simulation of the IQ4 peptide at a high temperature (400 K). Our
aim was to check whether the elevated temperature may cause further structural changes
Figure 3.2: Snapshots of the crystal and the simulated structures of the free IQ4 peptide.
The structure of the IQ4 peptide as derived from the crystal structure of the Mlc1p-IQ4
complex (A). The simulated model structure of the IQ4 peptide after 20-ns (B); 40-ns (C);
60-ns (D); 80-ns (E) and 100-ns (F) of MD run. The simulated model structure of the IQ4
peptide after 30-ns (G) of MD run at 400 K.
A B C D
GE F
76
to the peptide and detect changes not observed at 300 K. Fig. 3.2G depicts the model
structure of the IQ4 peptide after 30-ns of simulation at 400 K. Inspection of the peptide's
model at the end of the simulation at 400 K reveals a high similarity to the model
structure of the peptide after 100-ns simulation at 300 K. According to our simulations,
the elevated temperature does not affect the refolding mechanism of the peptide but,
rather, its rate (as also observed for the Mlc1p-IQ4 simulation at 400 K presented in
chapter 1 of the thesis). Thus, the progression of the conformational transition is basically
comparable at the two temperatures with a clear acceleration at the higher temperature.
Hence, we suggest that the modulation's speed of the peptide's model structure is
temperature-dependent.
The modulation of the IQ peptides' model structures with time can be followed by
examination of their backbone RMSD relative to their crystal structures in the protein-
peptide, as given in Fig. 3.3 (black for the IQ2 peptide, red for the IQ4 peptide). The
RMSD of the free IQ2 peptide (Fig. 3.3A, black) increases in a moderate fashion for the
first 13-ns of the simulation. Then, it rises abruptly to a value of ~ 0.76 nm, drops and
stabilizes at ~ 0.48 nm till the end of the simulation. The RMSD of the free IQ4 peptide
(Fig. 3.3A, red) increases at the very first nanoseconds of the simulation till a value of ~
0.5 nm, then gradually rises till it stabilizes on t ~ 50-ns at ~ 0.9 nm. Once the peptides
had reached their new stable conformations, they do not exhibit further configurational
changes. The calculation of the backbone RMSD value for the last 20-ns of the
simulations, relative to the backbone heavy atoms’ position at t = 80-ns (Fig. 3.3B),
reveals that both free IQ peptides hold steady solution model structures. These model
structures do not further evolve and thus represent settled down configurations.
77
Figure 3.3: (A) The RMSD of the
backbone atoms of the free IQ2 (black)
and IQ4 (red) peptides as a function of
the simulation time; (B) The RMSD of
the backbone atoms of the last 20-ns of
the simulations (80−100 ns) relative to
the position of the same set of atoms at t
= 80-ns; (C) The RMSD of the backbone
atoms of the bound IQ2 (black) and IQ4
(red) peptides as a function of the
simulation time. Note the differences in
the abscissa between the simulations of
the free and the bound peptides.
B
C
A
The evaluation of the structural stability of the free peptides calls for comparison
with their stability in a complex with the Mlc1p protein. The RMSDs of the bound IQ
peptides were reported in Figs. 1.2A and 1.4A, and presented also in Fig. 3.3C. That for
the bound IQ2 peptide increases after ~ 3-ns to a value of ~ 0.23 nm and stabilizes,
whereas that for the bound IQ4 peptide doubles after ~ 3.3-ns from ~ 0.2 nm to ~ 0.4 nm,
and stays at this value till the end of the simulation. When one compares the simulations
of the free and the bound peptides, it can be clearly seen that free peptides tend to loose
their structure in a more profound manner than the bound ones.
78
The Mlc1p protein is likely to shield or protect the bound peptides from a gross
deviation from their initial structure observed for the free peptides. It causes a steric
hindrance that prevents the peptide to assume conformations observed when simulated at
its absence. It should be mentioned that at both the free and at the bound peptide's
simulations, the IQ4 peptide is more flexible and tends more to deviate from its original
structure than the IQ2 peptide.
3.3 Structural Characteristics of the Refolding Process of the IQ Peptides
To demonstrate the modulation of the IQ peptides during the simulations, two
alpha-helices in their N- and C- sections, as present in their crystalline structures, were
defined. The angle between them is presented in Fig. 3.4.
Figure 3.4: The dynamics of the relative rotation of the free and the bound IQ peptides
during the simulations time. (A) The intra-helical angle between the N- and C- sections
of the free IQ2 (black). The arrow points towards a conformation of the free IQ2 peptide
at 15-ns; (B) The intra-helical angle between the N- and C- sections of the bound IQ2
(black) and IQ4 (red) peptides.
A B
79
For the free IQ2 peptide, the angle is relatively constant at a value of ~27°,
besides a transient increase to a value of ~125° at the time range ~13-ns till ~20-ns. This
temporary rise, corresponds to the short-lived configuration of the peptide marked with
an arrow, coincides with the increment of the RMSD observed at Fig. 3.3. Thus, even
though the IQ2 peptide explores the parametrical space and tries to refold, it relatively
returns to its initial conformation and does not grossly deviate from it. For the free IQ4
peptide, the intra-helical calculation is not given since the peptide does not hold two
defined sections throughout the entire simulation. However, its structural modulation is
clearly detected on Fig. 3.2.
A different scenario is observed when dealing with the bound IQ peptides (Fig.
3.4B). The angle of the bound IQ2 peptide remains constant throughout the entire
simulation time. The bound IQ4 peptide, which at the initial state consists of a single
alpha-helix, shows a significant flexibility. Its straight alpha-helical structure is snapped
in its middle during the complex's structural transition, when the two shorter helices
attain an almost perpendicular relation. The process, in which the angle between the
helices increases from ~ 20° to ~ 70°, is sharp and quick.
As demonstrated, the IQ4 peptide is characterized by a greater flexibility than the
IQ2 peptide, both in its free and its bound configurations. While the former experienced a
refolding process, the latter keeps its linearity. Upon comparison of the extent of the
curvature of the free and the bound IQ4 peptides in solution, it is evident that the free
peptide bends much more than the bound one. The considerable modification of the free
IQ4 peptide enables it to form intra-peptide bonds (between its N- and C- terminals),
whereas the bound IQ4 peptide exhibits two defined, almost vertical, sections.
80
Figure 3.5: The end-to-end distance (from C-alpha of the first residue till C-alpha of the
last residue) of the free IQ2 (black) and IQ4 (red) peptides.
The refolding of the IQ peptides can be also expressed when calculating the
distance between their first and last alpha-carbons (Fig. 3.5). For the free IQ2 peptide
(black), the end-to-end distance hardly changes during the simulation. After decreasing
from ~ 3.7 nm to ~ 2.5 nm, it keeps a relatively constant distance. However, this end-to-
end distance decreases for a short while (~ 13-ns till ~ 20-ns) to a distance that is even
shorter than 1 nm. The short end-to-end distance reflects its transient curved
conformation, which is not stable and alternates back to the initial straight configuration.
The refolding of the free IQ4 peptide (red) is clearly seen as its length shortens from ~
3.7 nm to ~ 2 nm. Afterwards, at t ~ 22-ns it drops to ~ 0.75 nm, rises to ~ 1.5 nm and
finally stabilizes at ~ 0.7 nm. The free IQ4 peptide shows a significant conformational
flexibility, shown when its bending brings its ends into a closer contact than seen at their
initial state. Overall, the structural characteristics of the extensive conformational change
of the peptide are presented by variation of its length as well as by the increase of the
angle between its N- and C- sections.
81
3.4 Secondary Structures of the IQ Peptides
We followed the secondary structure of the free IQ peptides as a function of the
simulation time and the analyses of the free IQ2 (Fig. 3.6) and the free IQ4 (Fig. 3.7) are
presented.
The initial alpha helical conformation of the free IQ2 (Fig. 3.6) peptide remains
stable for more than 20-ns, and then transforms into a 5-helix conformation while
keeping its linearity. A 5-helix conformation, also known as a Π-helix, is a helical
structure in which the backbone C=O group of the i residue forms a hydrogen bond with
Figure 3.6: Secondary structure analysis of the free IQ2 peptide as a function of the
simulation time. The residue number runs along the ordinate and time along the abscissa.
Color codes are used to represent secondary structure elements.
82
Figure 3.7: Secondary structure analysis of the free IQ4 peptide as a function of the
simulation time. The residue number runs along the ordinate and time along the abscissa.
Color codes are used to represent secondary structure elements.
the backbone N-H group of the residue five amino acids ahead (i → i + 5 hydrogen
bonding). The 5-helix conformation becomes the dominant secondary structure element
and holds till the end of the simulation of the free IQ2 peptide.
The secondary structure analysis of the free IQ4 peptide (Fig. 3.7) reveals a
different pattern than that for the IQ2 peptide.
After a short relaxation period, the peptide is no longer composed of only alpha
helical elements. While its first ~ 15 residues still keep their alpha helical configuration,
the remaining 10 residues become composed of bends and turns. At ~ 16-ns, a 5-helix
83
appears and gradually controls over the model structure till ~ 38-ns (besides a middle
coil/turn section). Apparently, the 5-helix conformation is not stable enough to persist for
the entire helix, and the first ~ 15 residues reverse back to the alpha-helical conformation.
From this time point on, the model structure of the peptide exhibits three secondary
structure elements, reflecting an energetically stable configuration of the peptide: the N-
section, composed of an alpha-helix; the C-section, composed of a 5-helix; and a hinge,
composed of turn and coil, separating between these N- and C- sections.
The secondary structure analyses of both peptides add an additional value to the
data presented by the snapshots and the RMSD of the peptides. Subtle changes in the
secondary structure composition of the peptides, that are not necessarily observed in Figs.
3.1−3.3, are clearly seen in this type of analysis. For the IQ2 peptide, although one can
allegedly think that the peptide barely changes during its simulation; this analysis
demonstrates that it loses most of its alpha-helical structure at the expense of gaining a 5-
helix structure. For the IQ4 peptide, it can be assumed that its alpha-helix had practically
lost. However, we demonstrated that it attains about 50 % from its initial alpha-helix
content, while the newly appearing 5-helix and coil/turn configurations form the other
half. Interestingly, the alpha- to 5- helix transition (full-length and partial for the IQ2 and
the IQ4 peptides, respectively), which was detected in both simulations, was observed, as
well, in previous MD simulations of peptides. 5-helix propagation at the expense of an
alpha-helix was noticed in simulations of the transmembrane domain of ErbB-2 (151,
152), the central domain of caldesmon (153) and synthetic peptides (154). Besides
theoretical MD simulations reporting a formation of the 5-helix structure, there are also
experimental evidences to the existence of this secondary structure element in proteins
84
and peptides. For example, 5-helix was identified by x-ray crystallography and NMR in
fumarase C (155), glycogen phosphorylase b (156), lipoxygenase (157), catechol O-
methyltransferase (158), serine carboxypeptidase (159), and the protein NMA1147 (160).
5-helix was also identified at the vasoactive intestinal protein, using an empirical
approach involving peptide design, construction, and distribution frequency techniques
(161).
Both peptides explore the conformational space till they find a stable
configuration. At more (for the IQ2 peptide) or less (for the IQ4 peptide) 20-ns, both
loose they initial structures, and then, following more exploration, new stable model
structures emerge.
3.5 Salt Bridges Analysis of the IQ4 Peptide Simulation
An analysis for the presence of salt bridges was performed for the 100-ns long
simulation of the IQ4 peptide.
The IQ4 peptide is a highly-charged peptide, containing 8 positive residues and 2
negative ones. The oppositively-charged residues can form salt bridges between them,
and thus we followed the minimal distance between each pair of oppositively-charged
residues, and present the main results in Fig. 3.8. A ribbon diagram of the IQ4 peptide,
presenting its positive and negative residues (blue and red, respectively) is shown on the
right.
Only one salt bridge (R20-E16) persisted over the whole simulation time (Fig.
3.8A), keeping the minimal distance between the residues as ~ 0.25 nm. Two salt bridges
(K12-E16, K11-E16) were broken during the simulation at a time point corresponding
85
Figure 3.8: Salt bridges analysis of the free IQ4 peptide simulation. (A) Minimal
distance between R20 and E16; (B) Minimal distance between K12 and E16; (C)
Minimal distance between K11 and E16; (D) Minimal distance between R20 and E25;
(E) Minimal distance between K11 and E25; (F) Minimal distance between R4 and
E25. The crystalline conformation of the IQ4 peptide, as derived from the crystal
structure of the Mlc1p-IQ4 complex, is presented on the right side. The positive and
negative residues of the IQ4 peptide are shown in blue and red, respectively.
A B C
D E F
R20
E25
R4
E16
K11
K12
K23
K18
R14
K15
with the major conformational change of the peptide (Figs. 3.8B-C). Two salt bridges
(R20-E25, K11-E25) were detached at ~ 62-ns (Figs. 3.8D-E). One salt bridge (R4-E25)
was formed (Fig. 3.8F) at the end of the simulation when the two edges of the peptide got
close enough to be linked through a salt bridge. This latter is of particular interest since it
represents the structural modification of the peptide and reflects its curvature. The
residues R4 and E25 are located at the edges of the IQ4 peptide. At the beginning of the
simulation, they are at a distance that exceeds 3 nm. As the simulation progresses, these
residues become closer to each other, keeping a distance of ~ 1.1 nm. Finally, the
structural deformation of the peptide imposed a stable contact of the residues, in which
their VdW radii almost reached (d = 0.25 nm).
86
From the salt bridge analysis it becomes clear that only two salt bridges maintain
the refolded model structure of the peptide, while four pre-existing salt bridges detached.
However, these bonding salt bridges contribute to the enhanced stability of the refolded
model structure of the IQ4 peptide.
3.6 Dynamics of the IQ4 Peptide at Different Salt Concentrations
We repeated the MD simulations of the free IQ4 peptide, for a duration of 20-ns
each, in three different salt concentrations: ~ 30 mM (low), ~ 300 mM (high) and ~ 2.4 M
(very high). Snapshots of the structures after 20-ns simulations (Fig. 3.9) are presented.
At the low salt concentration, the IQ4 peptide refolded into two distinctive
sections separated by a hinge (Fig. 3.9A). However, at the high salt concentration, it did
not refold, although its N- and C- terminals were loosen (Fig. 3.9B). The high salt
concentration prevented the peptide to assume a different conformation from its initial
Figure 3.9: Snapshots of the crystal and the simulated structures of the IQ4 peptide at
different salt concentrations. The simulated model structures of the IQ4 peptide after 20-
ns simulation at low (A), high (B), and very high salt (C) concentrations.
A B C
87
one, keeping it as an elongated helix.
Given that the Mlc1p-IQ4 complex was crystallized at a very high salt
concentration (56) (~ 2.4 M), we suggest that its crystal structure was imposed by its
crystallization conditions. According to this argument, which is compatible with the
results we present in chapter 1 of the thesis, the crystal structure of the Mlc1p-IQ4
complex does not represent its physiological solution structure. Thus, we repeated the
simulation of the free IQ4 peptide at salt conditions mimicking its crystallographic ones,
and a snapshot of the simulated model structure after 20-ns simulation (Fig. 3.9C) is
shown. The snapshot resembles the one obtained by the simulation at the high salt
because that already at ~ 300 mM of NaCl the screening effect caused by the ions is
extensive. The 8-fold increase at the NaCl concentration did not significantly change the
screening effect and thus the results of the high and the very high salt concentrations'
simulations are close.
Since the peptide hardly bends at the high salt concentrations (as opposed to the
bending at ~ 30 mM and ~ 100 mM salt concentrations), it is obvious that its refolding
pattern is influenced by the concentration of ions in its surrounding. High salt
concentrations probably prevent the peptide to assume a conformation that reflects its
solution structure, and thus yield a configuration that differs from a physiological one.
We can generalize the argument and suggest that the conditions of the crystallization,
such as the salt concentration of the crystallization buffer, may influence the
configuration of the protein inside the crystal lattice. A clear influence of the crystal
environment on protein structures was also found by others (162, 163). Therefore, the
simulations' results at the high salt concentrations strengthen our conclusion that the
88
crystallization conditions of the Mlc1p-IQ4 affected its crystal packing forces, while the
latter determined the crystalline structure. Hence, the simulations of the peptide at the
high and the very high salt concentrations exemplify how crystallographic conditions
may determine the outcome structure of the crystal.
3.7 Summary
MD simulations of IQ peptides indicate that the free (and also the bound) IQ2
peptides almost do not refold in a solution and maintain a stable helical structure. In
comparison, both the free and the bound IQ4 peptides are less stiff and tend to curve and
flex in a profound manner. This is attributed to the higher net charge (Z = +6) of the IQ4
peptide than this of the IQ2 peptide (Z = +2), and reflected by the electrostatic attraction
between positive and negative residues located on its N- and C- terminals (for example,
R4-E25). The IQ4 peptide presents an intrinsic instability and an increased tendency to
flexibility at a more or less defined location (further discussed in the next chapter).
Though the IQ4 peptide exhibits a structural modification when simulated at the presence
of the protein, its extent of refolding is considerably higher when it is simulated at its
absence. Secondary structure analysis of both free IQ peptides followed their secondary
structure change throughout the simulations, revealing their stable solution
conformations. The curved shape of the free IQ4 peptide was observed when it was
simulated under sub-physiological and physiological salt concentrations (~ 30 mM and ~
100 mM), and was not detected when the simulations were performed under over-
physiological salt concentrations (~ 300 mM and ~ 2.4 M).
89
4. The Structure of the Light Chain-Binding Domain (LCBD) of Myosin
V
Given our results presented in chapters 1−3 of the thesis, and an MD simulation
of extended IQ4 peptide presented below, we propose a dynamic solution model of the
LCBD of myosin V (section 4.3). The model, which involves mutual modulations of the
structures of the light chain proteins in respect to the IQ peptides of the myosin's neck
towards each other, may have important implications regarding the structure-function
relationship of the lever arm of myosin V.
4.1 The Current Structural Model
Myosin V is a versatile motor involved in the short-range transport of vesicles in
the actin-rich cortex of the cell. Its long neck domain, serving as a lever arm (19, 20),
gives rise to a step size of ~ 36 nm, the largest step size thus far measured for a myosin
motor. The LCBD neck of myosin V consists of six tandem IQ motifs, to which light
chain proteins, such as the CaM and the Mlc1p, are bound. The primary function of the
light chains is to regulate the ATPase activity of the globular head of myosin V (13, 17,
164). The crystal structures of the Mlc1p-IQ2, Mlc1p-IQ4 and Mlc1p-IQ2/3 complexes
had been determined by Terrak and co-workers (28, 56). On the basis of these structures,
and sequence similarity among the six IQ motifs, Terrak and co-workers suggested a
model for the LCBD (Fig. 4.1) (57). According to this model, the six IQ motifs, that
constitute the neck domain of the myosin V, adopt a straight long alpha helical
configuration (green). Moreover, two of the light chain proteins retain an extended
configuration (see the arrows pointing towards the light chains that bind the IQ4 and the
90
IQ6 peptides), in which their NL does not interact with the IQ motif, as determined by the
crystalline form of the Mlc1p-IQ4 complex.
Figure 4.1: Model of the LCBD of myosin V. The light chains, which can be either CaM
or CaM-related molecules such as the Mlc1p, are colored cyan (NLs) and magenta (CLs),
and the six-IQ fragment of the heavy chain is colored green. The black arrows point
towards the light chains that bind the IQ4 and the IQ6 peptides. An enlargement
illustrates the interaction between adjacent light chains. Adapted from (57).
This proposed model of the LCBD does not take into account the conformations
that the proteins may reveal in solution; rather, it is constrained by the packing forces of
the Mlc1p-IQ crystal structures. These packing forces may vary between the Mlc1p-IQ2
and the Mlc1p-IQ4 structures because each complex was crystallized solely under unique
crystallographic conditions. Attempts to grow crystals of both complexes under identical
or similar conditions failed (Terrak-M, personal communication). The elucidation, by
MD simulations, of the solution structures of two Mlc1p-IQ complexes (presented in
chapters 1-2), and with a combination of MD simulation of free IQ2 and IQ4 peptides
(presented in chapters 3-4), calls for reevaluation of the Mlc1p-LCBD model structure.
Our reevaluation of the current structural model and suggestion for a solution
model, which are discussed in section 4.3, are based on a wide range of experimental data
91
as well. Fluorescence imaging with one-nanometer accuracy (FIONA) (165-167) and
time-resolved single-molecule fluorescence polarization studies (SMFP) (168) suggest a
fundamental role to the elasticity of the LCBD during the movement of the myosin V.
Myosin V "walks" following an asymmetric hand-over-hand mechanism, where its heads
alternate leading and trailing positions along the actin filament, analogous to the hands of
a rope climber. In the course of its stride, a conformational change was demonstrated
during the transition of the lever-arm from a pre-stroke to post-stroke state. This change
is manifested by a tilting of the LCBD between two distinct conformations, a straight one
and a bent one. The curvature of the LCBD deduced from these experiments is in a very
good accord with the one predicted in our MD studies as further discussed. In addition,
when actin-bound myosin V was imaged by electron microscopy (Fig. 4.2) (169, 170), a
bent lever-arm was observed.
Figure 4.2: Two electron microscopy images showing filaments of actin with two-
headed myosin V "walking". The red arrows point towards the myosin's lead arm, seen in
a bent conformation. These electron microscopy images, by presenting myosins with
curved lever arms, strengthen our suggested model. The scale bar is 50 nm (note that the
myosin was found with fringes and thus spans ~ 50 nm). Images adapted from (169).
The leading head was curved backwards, whereas the rear head was straighter,
92
resembling a skier in a telemark stance. Thus, the electron microscopy data confirms the
curved shape of the LCBD of the lead head pre-stroke state. Furthermore, when the
crystal structure of scallop’s myosin S1 was determined (171), a ~ 90° curvature of the
myosin's IQ4 motif neck domain was observed. It was found that the lever arm of the
scallop's myosin S1 does not move as a rigid body, but rather flexes when the myosin is
in motion.
4.2 Structure and Dynamics of an Extended IQ4 Peptide
In order to verify that the kink presented by the bound (Fig. 3.4B and chapter 1)
and the free IQ4 peptides (Figs. 3.2 and 3.4A) has a physiological relevance, we build an
extended IQ4 peptide and performed an MD simulation for it. The sequence of this
peptide consists of the IQ4 peptide, as present at the crystal structure of the Mlc1p-IQ4
complex, and an additional twenty residues. Ten of these residues flank its N-terminus
and are part of the IQ3 peptide sequence, while the other ten of these residues flank its C-
terminus and belong to the IQ5 peptide sequence. Hence, this 45 long amino acids
sequence represents a section from the poly-IQ (IQ1-IQ6) of the neck of myosin V.
Comparison between the model solution structures of the free IQ4 peptide after
100-ns of simulation, the bound IQ4 peptide after 12-ns with the Mlc1p protein, and the
extended IQ4 peptide after 20-ns of simulation are, presented in Fig. 4.3 (A-C,
respectively). All model structures exhibit a major kink (see the black arrows), that is
roughly located at the same location along the peptides. The common position of the
angular shift of the peptide (from a linear peptide with a ~ 180° angle to a curved one
with a hinge) can be referred as a "hot spot", which tends to curve rather than assume a
93
Figure 4.3: Snapshots of the simulated model structures of the IQ4 peptides. (A) After
100-ns simulation of free IQ4 peptide; (B) After 12-ns simulation of bound IQ4 peptide;
(C) After 20-ns simulation of extended IQ4 peptide (the IQ4 peptide is drawn in blue,
while its extended residues are colored in red). The black arrows indicate the "hot spot"
position.
C B A
straight conformation. This "hot spot" is also seen at the simulation of the free IQ4
peptide at the low salt concentration (Fig. 3.9A).
We use the term "hot spot" not only as a phrase to describe the location of the
peptide's twist but literally as an idiom to characterize a sequence encompassing a high
B-factor. B-factor or temperature factor is an indicator of thermal motion of an atom, and
is commonly used as a measure of how much an atom oscillates or vibrates around a
specified position. Thus, the high B-factor that is observed at the location of the kink in
the simulations of the IQ4 peptide reflects its local inherent instability. Moreover, when
the RMSF was calculated for the IQ4 peptide's simulations, it turned out that residues that
present high RMSF values correspond to those that curve. Therefore, intrinsic flexibility
of the IQ4 peptide was already embedded in its X-ray structure, and, as such, our finding
is not surprising.
The RMSD analysis of the extended IQ4 peptide is presented in Fig. 4.4. It
increases from ~ 0.1 nm to ~ 0.5 nm, and stabilizes on this value. Since the RMSD
94
Figure 4.4: The RMSD of the backbone atoms of the MD run of the extended IQ4
peptide as a function of the simulation time.
reached to a relatively constant value for the last 10-ns of the simulation, further major
conformational changes are not expected. Thus, the curved extended IQ4 peptide seems
to be structurally steady and hence may represent a physiological entity. This finding
strengthens and supports our reevaluation regarding the structural model of the LCBD
presented in section 4.3.
4.3 The Suggested Solution Model
The following suggested solution model of the LCBD is a consequence of our
overall findings presented throughout this study. We propose that the light chains of the
myosin, namely the CaM and the Mlc1p proteins, may maintain a compact conformation
and not an extended one as their ID is bent (as in Figs. 1.1B and 1.3B). Their NL is
probably not free to engage in protein-protein interactions as claimed by Terrak and co-
workers, but rather interacts with the IQ peptides, coming into a close contact with their
CL. In addition, and not less important, we propose that the poly-IQ sequence of the
95
myosin's neck may curve when bound to the light chain proteins. This structural
flexibility of the poly-IQ is associated with the myosin's walking over the actin filament,
generating the bended knee conformation of the lead arm of the LCBD, which is a crucial
element in its mechano-chemical mechanism. Therefore, the banana-shaped arm of the
myosin, which was not included in the structural model of Terrak and co-workers, is
revealed by our simulations. Besides supplying a dynamic model of the poly-IQ
sequence, we point out the exact location of its knee. According to our simulations, it
appears that the curvature of the neck may be located within the IQ4 peptide sequence.
The hinge presented by the IQ4 peptide, observed in all our simulations under
physiological salt conditions (and also for the extended IQ4 peptide), may correspond to
a physiologically relevant bent solution structure of the lever arm of myosin. This unique
observation regarding the location of the curve, which was not specified by electron
fluorescence or electron microscopy studies, exemplifies the ability of MD simulations to
provide additional information to the one obtained by experimental biologists.
The tight refolding of the protein around the peptides and the bending of the IQ4
peptide point out the difficulty of predicting the solution structure of a large protein-
peptide complex based on crystal structures of some of its isolated components. The
schematic description of the IQ motifs as constitutes of a rigid straight alpha helix (as
shown on Fig. 4.1), which is not consistent with the dynamics of the peptides and
complexes, represents an unlikely physiological conformation.
Our view about myosin V is corroborated by an illustration, in which the
movement of myosin V over an actin filament is shown (Fig. 4.5). Though this is only an
illustration, we feel that it faithfully delivers our proposed solution model. The lead arm
96
of the neck is curved at a position that corresponds to the IQ4 peptide (see the white
arrow), exemplifying that the flexibility of the neck is embedded in the myosin's motion.
Thus, the flexibility of the neck, as opposed to the model proposed by Terrak and co-
workers, is necessary for the proper function of the mechano-chemical function of the
myosin molecules.
Figure 4.5: Myosin V (green), a biomolecular motor that moves in nanometer size steps
on actin (red), transports cargo within cells. By placing a fluorophore near one foot
(rainbow colored oval), and following the motion a single myosin V, it was determined
that myosin V "walks", placing one foot over the other, and does not "crawl". The
illustration demonstrates that the neck domain of myosin V, as predicted by our
simulations, curves when it strides over the actin filament. The white arrow points
towards the IQ4 peptide of the lead arm of the myosin. The light chains were omitted
from the illustration for simplicity. Adapted from Yildiz et al. (166).
4.4 Summary
On this chapter we addressed the structure of the LCBD of myosin V from the
yeast Saccharomyces cerevisiae. We presented the current structural model and proposed
a modified model, based upon the various MD simulations discussed on the thesis. The
97
crystal structure of the Mlc1p-IQ4 protein-peptide complex was obtained under high salt
conditions. As such, the IQ4 peptide exhibits a straight alpha-helical configuration and
the protein, restricted by the linear conformation of the peptide, warps itself around it.
According to our suggestion, which was elaborated throughout the thesis, the crystal
structure of the Mlc1p-IQ4 protein-peptide complex may not represent its physiological
one, and thus the model proposed by Terrak and co-workers may be misleading. We wish
to stress out that although a wide plethora of evidences point out to this conclusion, it
would be interesting to perform a simulation of that complex under high salt conditions.
Such an MD simulation will provide a further verification for the influence of the
crystallization conditions, i.e. high salt and ionic strength of the crystallization buffer, on
the configuration of the protein inside the crystal lattice.
Apparently, the solution structure of the LCBD of myosin V is more complicated
than the one based on crystal structures of a few individual Mlc1p-IQ complexes, as
predicted by Terrak and co-workers. When our simulations are taken into account
together with the experimental data presented above, it is reasonable to conclude that the
LCBD is not a passive structural device but dynamic proteinous machinery. We argue
that mutual structural flexibility of the light chain proteins and the IQ peptides represents
a more realistic model of the neck region of myosin V. Our proposed solution model of
the flexed neck of myosin V IQ4 peptide is in accord with all our MD simulations and
also in agreement with other research groups reporting the flexible nature of the neck.
However, a word of caution should be stated. We cannot preclude the possibility that the
neck may curve and break its linearity along other IQ peptides, besides the primary kink
at the IQ4 sequence.
98
IV. OVERALL GENERAL DISCUSSION
The current research is the first MD study of IQ peptides in a complex with a
CaM-like protein and at its absence. Since IQ motifs are widely spread in nature, serving
as binding sites for CaM and CaM-like proteins, and as the structure of CaM resembles
that of Mlc1p, it was of great importance to conduct this research in order to elucidate the
structural mechanism underlying the interaction of CaM and CaM-like proteins with IQ
peptides. Analysis of the structure, dynamics and energetic aspects of the Mlc1p-IQ
complexes was of high significance also due to their important role in the regulation of
the mechano-chemical myosin system of the yeast Saccharomyces cerevisiae. Given that
the Mlc1p protein functions and binds myosin V at the absence of Ca+2
, fulfilling roles
taken by Holo-CaM in other classes of myosins, our conclusions presented in this study
may, as well, hold for the entire family of myosins.
In this study, we described a usage of the MD methodology as a tool to analyze
the crystal structures of Mlc1p-IQ complexes. The reported simulations addressed the
manifold structures of the Mlc1p-IQ complexes as found in crystal structures and in
solution. We have clearly demonstrated that the crystal structure of the Mlc1p-IQ4
complex, obtained under high salt conditions, does not represent its solution structure. At
this high ionic strength, the IQ4 peptide assumes a straight alpha helix and does not bend.
It supplies a scaffold on which the protein binds but the latter keeps an extended
conformation. At a physiological ion strength, the IQ4 peptide curves and consequently
the protein may hold a compact structure. We speculate that at low-medium ionic
strength the Mlc1p-IQ4 could not well diffract, representing an inhomogeneous crystals
population.
99
We can generalize the above mentioned argument and claim that crystal structures
of proteins do not necessarily represent their solution structures. Though this claim may
not seem to be too adventurous or innovative, our research clarifies that a crystal structure
can be influenced by the crystallographic conditions, while the latter determine the
packing forces operating at the crystal lattice. Furthermore, crystalline forms of proteins
can be imposed by specific crystallographic conditions, consequently leading to a
fundamental deviation of the obtained X-ray structure from the native configuration.
We have noticed that the Mlc1p protein had a structural versatility, allowing it to
bind the IQ peptides in a mode reflecting their precise sequence, hardly using the "IQ
motif residues" as key anchoring sites. Apparently, the Mlc1p protein can be flexed in
many configurations, each suitable for a certain IQ structure. Thus, the sequence variation
among the IQ motif peptides is sufficiently large to induce non-identical interactions with
the same protein. Taking a broader view, we suggest that when two similar (but not
identical) peptides, which belong to the same family, bind the same protein, each protein-
peptide structure may assume a unique conformation.
Following a comprehensive energy analysis, we found that the protein-peptide
complexes formation is mediated by enthalpy, while entropy opposes it. The close
interaction between the Mlc1p protein and the IQ peptides is grossly mediated by LJ
interactions, whereas their opposite highly charged surfaces contribute to their initial
encounter. Electrostatic interactions may serve as the driving force for long-range
molecular recognition, but once protein-peptide contact occurs, highly specific VdW
interactions and non-specific hydrophobic interactions stabilize the Mlc1p-IQ complexes.
This finding may also be valid for pairs of highly opposite charged proteins that form
100
protein-protein or protein-ligand complexes.
The simulations of the protein-peptide complexes, combined with the simulations
of the free IQ peptides, allow us to offer a comprehensive solution model of the LCBD.
According to our simulations, and in contrast to a structural model for the LCBD
suggested by Terrak and co-workers, the solution structures of the Mlc1p-IQ protein-
peptide complexes may impose refolding of the light chain proteins around the IQ
peptides. Thus, the light chain proteins may reveal a compact configuration and not an
extended one. Considering a poly-sequence of IQ motifs, as present in the LCBD, it
appears that its final structure, saturated by the CaM-like proteins, is under intensive
internal stress. The internal stress may be associated with the mechano-chemical function
of the myosin molecules. When the myosin "walks" over the actin filament, its lead arm
of the neck domain must flex in every stride. Our simulations, by adding a dynamic
nature to the structural model, demonstrate the curvature of the LCBD. As suggested by
our simulations, the LCBD assumes a banana-like conformation, while the location of its
knee corresponds to the IQ4 peptide. The knee-shape structure that the LCBD may hold
is essential to its functionality. We wish to note that our model emphasizes that the
LCBD is a dynamic proteinous machinery, and not a passive structural device as
previously suggested.
In a broader perspective, our research shows that careful application of MD
simulations can be used for extending the structural information presented by crystal
structures, thereby revealing the dynamic configurations of proteins in their physiological
environment. It is to be hoped that experimental structural biologists will make increasing
use of MD simulations for obtaining a deeper understanding of particular biological
101
systems. When MD simulations for long durations will become routine procedures, a
constructive interplay between the simulations and experiments will be useful and
informative regarding both sides. Given the availability of several MD programs, large
amounts of computer time, and examples for which MD has really played a role in
furthering our understanding of protein structure and functions (as presented in this
research), we look forward to a wide field of biological and biophysical applications for
MD in the future.
102
V. SUPPLEMENT
1.1 Articles
The outcome of the thesis is three articles:
(I) Ganoth, A., E. Nachliel, R. Friedman, and M. Gutman. 2006. Molecular dynamics
study of a calmodulin-like protein with an IQ peptide: Spontaneous refolding of the
protein around the peptide. Proteins 64:133-146. (The article is attached at the following
pages).
(II) Ganoth, A., R. Friedman, E. Nachliel, and M. Gutman. 2006. A molecular
dynamics study and free energy analysis of complexes between the Mlc1p protein and
two IQ motif peptides. Biophys J 91:2436-2450. (Cover article. Both article and cover are
attached at the following pages).
(III) Ganoth, A., R. Friedman, E. Nachliel, and M. Gutman. The structural basis for
myosin V movement: A molecular dynamics study (temporary title). In Preparation.
103
REFERENCES
1. Berg, J. S., B. C. Powell, and R. E. Cheney. 2001. A millennial myosin census.
Mol Biol Cell 12:780-794.
2. Sellers, J. R. 2000. Myosins: a diverse superfamily. Biochim Biophys Acta
1496:3-22.
3. Weeds, A. G., and R. S. Taylor. 1975. Separation of subfragment-1 isoenzymes
from rabbit skeletal muscle myosin. Nature 257:54-56.
4. Pollard, T. D., W. F. Stafford, and M. E. Porter. 1978. Characterization of a
second myosin from Acanthamoeba castellanii. J Biol Chem 253:4798-4808.
5. Krendel, M., and M. S. Mooseker. 2005. Myosins: tails (and heads) of functional
diversity. Physiology (Bethesda) 20:239-251.
6. Baker, J. P., and M. A. Titus. 1998. Myosins: matching functions with motors.
Curr Opin Cell Biol 10:80-86.
7. Reck-Peterson, S. L., D. W. Provance, Jr., M. S. Mooseker, and J. A. Mercer.
2000. Class V myosins. Biochim Biophys Acta 1496:36-51.
8. Kalhammer, G., and M. Bahler. 2000. Unconventional myosins. Essays Biochem
35:33-42.
9. Eitzen, G., L. Wang, N. Thorngren, and W. Wickner. 2002. Remodeling of
organelle-bound actin is required for yeast vacuole fusion. J Cell Biol 158:669-
679.
10. Luna, E. J., K. N. Prestonjamasp, T. Nebl, Y. Chen, C. Gatto, S. W. Oh, and R. K.
Pope. 2001. Motile Membrane Skeletons: What Neutrophils and Muscle Have in
Common. Cell Mol Biol Lett 6:219.
11. Korn, E. D. 2000. Coevolution of head, neck, and tail domains of myosin heavy
chains. Proc Natl Acad Sci U S A 97:12559-12564.
12. Sellers, J. R., and C. Veigel. 2006. Walking with myosin V. Curr Opin Cell Biol
18:68-73.
13. Langford, G. M. 2002. Myosin-V, a versatile motor for short-range vesicle
transport. Traffic 3:859-865.
14. Nascimento, A. A., R. E. Cheney, S. B. Tauhata, R. E. Larson, and M. S.
Mooseker. 1996. Enzymatic characterization and functional domain mapping of
brain myosin-V. J Biol Chem 271:17561-17569.
15. Langford, G. M., S. A. Kuznetsov, D. Johnson, D. L. Cohen, and D. G. Weiss.
1994. Movement of axoplasmic organelles on actin filaments assembled on
acrosomal processes: evidence for a barbed-end-directed organelle motor. J Cell
Sci 107 ( Pt 8):2291-2298.
16. Cheney, R. E., M. K. O'Shea, J. E. Heuser, M. V. Coelho, J. S. Wolenski, E. M.
Espreafico, P. Forscher, R. E. Larson, and M. S. Mooseker. 1993. Brain myosin-V
is a two-headed unconventional myosin with motor activity. Cell 75:13-23.
17. Espreafico, E. M., R. E. Cheney, M. Matteoli, A. A. Nascimento, P. V. De
Camilli, R. E. Larson, and M. S. Mooseker. 1992. Primary structure and cellular
localization of chicken brain myosin-V (p190), an unconventional myosin with
calmodulin light chains. J Cell Biol 119:1541-1557.
18. Espindola, F. S., D. M. Suter, L. B. Partata, T. Cao, J. S. Wolenski, R. E. Cheney,
S. M. King, and M. S. Mooseker. 2000. The light chain composition of chicken
104
brain myosin-Va: calmodulin, myosin-II essential light chains, and 8-kDa dynein
light chain/PIN. Cell Motil Cytoskeleton 47:269-281.
19. Rayment, I., H. M. Holden, M. Whittaker, C. B. Yohn, M. Lorenz, K. C. Holmes,
and R. A. Milligan. 1993. Structure of the actin-myosin complex and its
implications for muscle contraction. Science 261:58-65.
20. Mehta, A. D., R. S. Rock, M. Rief, J. A. Spudich, M. S. Mooseker, and R. E.
Cheney. 1999. Myosin-V is a processive actin-based motor. Nature 400:590-593.
21. Sakamoto, T., F. Wang, S. Schmitz, Y. Xu, Q. Xu, J. E. Molloy, C. Veigel, and J.
R. Sellers. 2003. Neck length and processivity of myosin V. J Biol Chem
278:29201-29207.
22. Toth, J., M. Kovacs, F. Wang, L. Nyitray, and J. R. Sellers. 2005. Myosin V from
Drosophila reveals diversity of motor mechanisms within the myosin V family. J
Biol Chem 280:30594-30603.
23. Reck-Peterson, S. L., M. J. Tyska, P. J. Novick, and M. S. Mooseker. 2001. The
yeast class V myosins, Myo2p and Myo4p, are nonprocessive actin-based motors.
J Cell Biol 153:1121-1126.
24. Schott, D., J. Ho, D. Pruyne, and A. Bretscher. 1999. The COOH-terminal domain
of Myo2p, a yeast myosin V, has a direct role in secretory vesicle targeting. J Cell
Biol 147:791-808.
25. Stevens, R. C., and T. N. Davis. 1998. Mlc1p is a light chain for the
unconventional myosin Myo2p in Saccharomyces cerevisiae. J Cell Biol 142:711-
722.
26. Wagner, W., P. Bielli, S. Wacha, and A. Ragnini-Wilson. 2002. Mlc1p promotes
septum closure during cytokinesis via the IQ motifs of the vesicle motor Myo2p.
Embo J 21:6397-6408.
27. May, K. M., F. Z. Watts, N. Jones, and J. S. Hyams. 1997. Type II myosin
involved in cytokinesis in the fission yeast, Schizosaccharomyces pombe. Cell
Motil Cytoskeleton 38:385-396.
28. Terrak, M., G. Wu, W. F. Stafford, R. C. Lu, and R. Dominguez. 2003. Two
distinct myosin light chain structures are induced by specific variations within the
bound IQ motifs-functional implications. Embo J 22:362-371.
29. Bielli, P., E. C. Casavola, A. Biroccio, A. Urbani, and A. Ragnini-Wilson. 2006.
GTP drives myosin light chain 1 interaction with the class V myosin Myo2 IQ
motifs via a Sec2 RabGEF-mediated pathway. Mol Microbiol 59:1576-1590.
30. Bateman, A., L. Coin, R. Durbin, R. D. Finn, V. Hollich, S. Griffiths-Jones, A.
Khanna, M. Marshall, S. Moxon, E. L. Sonnhammer, D. J. Studholme, C. Yeats,
and S. R. Eddy. 2004. The Pfam protein families database. Nucleic Acids Res
32:D138-141.
31. Martin, S. R., and P. M. Bayley. 2004. Calmodulin bridging of IQ motifs in
myosin-V. FEBS Lett 567:166-170.
32. Rhoads, A. R., and F. Friedberg. 1997. Sequence motifs for calmodulin
recognition. Faseb J 11:331-340.
33. Bahler, M., and A. Rhoads. 2002. Calmodulin signaling via the IQ motif. FEBS
Lett 513:107-113.
34. Chin, D., and A. R. Means. 2000. Calmodulin: a prototypical calcium sensor.
Trends Cell Biol 10:322-328.
105
35. Yamniuk, A. P., and H. J. Vogel. 2004. Calmodulin's flexibility allows for
promiscuity in its interactions with target proteins and peptides. Mol Biotechnol
27:33-57.
36. Vogel, H. J., R. D. Brokx, and H. Ouyang. 2002. Calcium-binding proteins.
Methods Mol Biol 172:3-20.
37. Soderling, T. R. 1999. The Ca-calmodulin-dependent protein kinase cascade.
Trends Biochem Sci 24:232-236.
38. Jurado, L. A., P. S. Chockalingam, and H. W. Jarrett. 1999. Apocalmodulin.
Physiol Rev 79:661-682.
39. Persechini, A., P. M. Stemmer, and I. Ohashi. 1996. Localization of unique
functional determinants in the calmodulin lobes to individual EF hands. J Biol
Chem 271:32217-32225.
40. VanBerkum, M. F., S. E. George, and A. R. Means. 1990. Calmodulin activation
of target enzymes. Consequences of deletions in the central helix. J Biol Chem
265:3750-3756.
41. Craig, T. A., D. M. Watterson, F. G. Prendergast, J. Haiech, and D. M. Roberts.
1987. Site-specific mutagenesis of the alpha-helices of calmodulin. Effects of
altering a charge cluster in the helix that links the two halves of calmodulin. J Biol
Chem 262:3278-3284.
42. Lukas, T. J., W. H. Burgess, F. G. Prendergast, W. Lau, and D. M. Watterson.
1986. Calmodulin binding domains: characterization of a phosphorylation and
calmodulin binding site from myosin light chain kinase. Biochemistry 25:1458-
1464.
43. Findlay, W. A., S. R. Martin, K. Beckingham, and P. M. Bayley. 1995. Recovery
of native structure by calcium binding site mutants of calmodulin upon binding of
sk-MLCK target peptides. Biochemistry 34:2087-2094.
44. Findlay, W. A., M. J. Gradwell, and P. M. Bayley. 1995. Role of the N-terminal
region of the skeletal muscle myosin light chain kinase target sequence in its
interaction with calmodulin. Protein Sci 4:2375-2382.
45. Wintrode, P. L., and P. L. Privalov. 1997. Energetics of target peptide recognition
by calmodulin: a calorimetric study. J Mol Biol 266:1050-1062.
46. Brokx, R. D., M. M. Lopez, H. J. Vogel, and G. I. Makhatadze. 2001. Energetics
of target peptide binding by calmodulin reveals different modes of binding. J Biol
Chem 276:14083-14091.
47. Osawa, M., H. Tokumitsu, M. B. Swindells, H. Kurihara, M. Orita, T. Shibanuma,
T. Furuya, and M. Ikura. 1999. A novel target recognition revealed by calmodulin
in complex with Ca2+-calmodulin-dependent kinase kinase. Nat Struct Biol
6:819-824.
48. Meador, W. E., A. R. Means, and F. A. Quiocho. 1993. Modulation of calmodulin
plasticity in molecular recognition on the basis of x-ray structures. Science
262:1718-1721.
49. Meador, W. E., A. R. Means, and F. A. Quiocho. 1992. Target enzyme
recognition by calmodulin: 2.4 A structure of a calmodulin-peptide complex.
Science 257:1251-1255.
50. Kurokawa, H., M. Osawa, H. Kurihara, N. Katayama, H. Tokumitsu, M. B.
Swindells, M. Kainosho, and M. Ikura. 2001. Target-induced conformational
106
adaptation of calmodulin revealed by the crystal structure of a complex with
nematode Ca(2+)/calmodulin-dependent kinase kinase peptide. J Mol Biol
312:59-68.
51. Clapperton, J. A., S. R. Martin, S. J. Smerdon, S. J. Gamblin, and P. M. Bayley.
2002. Structure of the complex of calmodulin with the target sequence of
calmodulin-dependent protein kinase I: studies of the kinase activation
mechanism. Biochemistry 41:14669-14679.
52. Aoyagi, M., A. S. Arvai, J. A. Tainer, and E. D. Getzoff. 2003. Structural basis
for endothelial nitric oxide synthase binding to calmodulin. Embo J 22:766-775.
53. Yamauchi, E., T. Nakatsu, M. Matsubara, H. Kato, and H. Taniguchi. 2003.
Crystal structure of a MARCKS peptide containing the calmodulin-binding
domain in complex with Ca2+-calmodulin. Nat Struct Biol 10:226-231.
54. Matsubara, M., T. Nakatsu, H. Kato, and H. Taniguchi. 2004. Crystal structure of
a myristoylated CAP-23/NAP-22 N-terminal domain complexed with
Ca(2+)/calmodulin. Embo J 23:712-718.
55. Vetter, S. W., and E. Leclerc. 2003. Novel aspects of calmodulin target
recognition and activation. Eur J Biochem 270:404-414.
56. Terrak, M., L. R. Otterbein, G. Wu, L. A. Palecanda, R. C. Lu, and R.
Dominguez. 2002. Crystallization, X-ray characterization and selenomethionine
phasing of Mlc1p bound to IQ motifs from myosin V. Acta Crystallogr D Biol
Crystallogr 58:1882-1885.
57. Terrak, M., G. Rebowski, R. C. Lu, Z. Grabarek, and R. Dominguez. 2005.
Structure of the light chain-binding domain of myosin V. Proc Natl Acad Sci U S
A 102:12718-12723.
58. Cheatham, T. E., and P. A. Kollman. 2000. Molecular dynamics simulation of
nucleic acids. Annual Review of Physical Chemistry 51:435-471.
59. Karplus, M., and J. A. McCammon. 2002. Molecular dynamics simulations of
biomolecules. Nat Struct Biol 9:646-652.
60. Norberg, J., and L. Nilsson. 2002. Molecular dynamics applied to nucleic acids.
Accounts Chem Res 35:465-472.
61. Moraitakis, G., A. G. Purkiss, and J. M. Goodfellow. 2003. Simulated dynamics
and biological macromolecules. Rep Prog Phys 66:383-406.
62. Straatsma, T. P., and J. A. Mccammon. 1992. Computational Alchemy. Annual
Review of Physical Chemistry 43:407-435.
63. Wang, W., O. Donini, C. M. Reyes, and P. A. Kollman. 2001. Biomolecular
simulations: recent developments in force fields, simulations of enzyme catalysis,
protein-ligand, protein-protein, and protein-nucleic acid noncovalent interactions.
Annu Rev Biophys Biomol Struct 30:211-243.
64. Hansson, T., C. Oostenbrink, and W. van Gunsteren. 2002. Molecular dynamics
simulations. Curr Opin Struct Biol 12:190-196.
65. McCammon, J. A., B. R. Gelin, and M. Karplus. 1977. Dynamics of folded
proteins. Nature 267:585-590.
66. Levitt, M., and R. Sharon. 1988. Accurate simulation of protein dynamics in
solution. Proc Natl Acad Sci U S A 85:7557-7561.
67. Mackerell, A. D., Jr. 2004. Empirical force fields for biological macromolecules:
overview and issues. J Comp Chem 25:1584-1604.
107
68. Karplus, M. 2002. Molecular dynamics simulations of biomolecules. Acc Chem
Res 35:321-323.
69. Karplus, M., A. T. Brunger, R. Elber, and J. Kuriyan. 1987. Molecular dynamics:
applications to proteins. Cold Spring Harb Symp Quant Biol 52:381-390.
70. Karplus, M., and J. Kuriyan. 2005. Molecular dynamics and protein function.
Proc Natl Acad Sci U S A 102:6679-6685.
71. Lu, D., A. Aksimentiev, A. Y. Shih, E. Cruz-Chu, P. L. Freddolino, A. Arkhipov,
and K. Schulten. 2006. The role of molecular modeling in bionanotechnology.
Phys Biol 3:S40-53.
72. van Gunsteren, W. F., and H. J. C. Berenden. 1990. Computer simulation of
molecular dynamics: methodology, applications and perspectives in chemistry.
Angew Chem Int Ed Engl 29:992-1023.
73. Leach, A. R. 2001. Molecular modeling: principles and applications. Prentice
Hall.
74. Vorherr, T., O. Kessler, A. Mark, and E. Carafoli. 1992. Construction and
molecular dynamics simulation of calmodulin in the extended and in a bent
conformation. Eur J Biochem 204:931-937.
75. Mehler, E. L., J. L. Pascual-Ahuir, and H. Weinstein. 1991. Structural dynamics
of calmodulin and troponin C. Protein Eng 4:625-637.
76. Weinstein, H., and E. L. Mehler. 1994. Ca(2+)-binding and structural dynamics in
the functions of calmodulin. Annu Rev Physiol 56:213-236.
77. van der Spoel, D., B. L. de Groot, S. Hayward, H. J. Berendsen, and H. J. Vogel.
1996. Bending of the calmodulin central helix: a theoretical study. Protein Sci
5:2044-2053.
78. Yang, C., and K. Kuczera. 2002. Molecular dynamics simulations of calcium-free
calmodulin in solution. J Biomol Struct Dyn 19:801-819.
79. Komeiji, Y., Y. Ueno, and M. Uebayasi. 2002. Molecular dynamics simulations
revealed Ca(2+)-dependent conformational change of Calmodulin. FEBS Lett
521:133-139.
80. Vigil, D., S. C. Gallagher, J. Trewhella, and A. E. Garcia. 2001. Functional
dynamics of the hydrophobic cleft in the N-domain of calmodulin. Biophys J
80:2082-2092.
81. Shepherd, C. M., and H. J. Vogel. 2004. A molecular dynamics study of Ca2+-
calmodulin: evidence of interdomain coupling and structural collapse on the
nanosecond timescale. Biophys J 87:780-791.
82. Yang, C., G. S. Jas, and K. Kuczera. 2001. Structure and dynamics of calcium-
activated calmodulin in solution. J Biomol Struct Dyn 19:247-271.
83. Fiorin, G., R. R. Biekofsky, A. Pastore, and P. Carloni. 2005. Unwinding the
helical linker of calcium-loaded calmodulin: a molecular dynamics study. Proteins
61:829-839.
84. Likic, V. A., P. R. Gooley, T. P. Speed, and E. E. Strehler. 2005. A statistical
approach to the interpretation of molecular dynamics simulations of calmodulin
equilibrium dynamics. Protein Sci 14:2955-2963.
85. Wriggers, W., E. Mehler, F. Pitici, H. Weinstein, and K. Schulten. 1998. Structure
and dynamics of calmodulin in solution. Biophys J 74:1622-1639.
108
86. Goto, K., A. Toyama, H. Takeuchi, K. Takayama, T. Saito, M. Iwamoto, J. Z.
Yeh, and T. Narahashi. 2004. Ca2+ binding sites in calmodulin and troponin C
alter interhelical angle movements. FEBS Lett 561:51-57.
87. Project, E., R. Friedman, E. Nachliel, and M. Gutman. 2006. A molecular
dynamics study of the effect of Ca2+ removal on calmodulin structure. Biophys J
90:3842-3850.
88. Yamaotsu, N., M. Suga, and S. Hirono. 2001. Molecular dynamics simulation of
the calmodulin-trifluoperazine complex in aqueous solution. Biopolymers 58:410-
421.
89. Turjanski, A. G., D. A. Estrin, R. E. Rosenstein, J. E. McCormick, S. R. Martin,
A. Pastore, R. R. Biekofsky, and V. Martorana. 2004. NMR and molecular
dynamics studies of the interaction of melatonin with calmodulin. Protein Sci
13:2925-2938.
90. Prabhu, N. V., A. L. Lee, A. J. Wand, and K. A. Sharp. 2003. Dynamics and
entropy of a calmodulin-peptide complex studied by NMR and molecular
dynamics. Biochemistry 42:562-570.
91. Yang, C., and K. Kuczera. 2002. Molecular dynamics simulations of a
calmodulin-peptide complex in solution. J Biomol Struct Dyn 20:179-197.
92. Yang, C., G. S. Jas, and K. Kuczera. 2004. Structure, dynamics and interaction
with kinase targets: computer simulations of calmodulin. Biochim Biophys Acta
1697:289-300.
93. Rodinger, T., and R. Pomes. 2005. Enhancing the accuracy, the efficiency and the
scope of free energy simulations. Curr Opin Struct Biol 15:164-170.
94. Srinivasan, J., T. E. Cheatham, P. Cieplak, P. A. Kollman, and D. A. Case. 1998.
Continuum solvent studies of the stability of DNA, RNA, and phosphoramidate -
DNA helices. J Am Chem Soc 120:9401-9409.
95. Hazel, P., G. N. Parkinson, and S. Neidle. 2006. Predictive modelling of topology
and loop variations in dimeric DNA quadruplex structures. Nucleic Acids Res
34:2117-2127.
96. Cheatham, T. E., 3rd, J. Srinivasan, D. A. Case, and P. A. Kollman. 1998.
Molecular dynamics and continuum solvent studies of the stability of polyG-
polyC and polyA-polyT DNA duplexes in solution. J Biomol Struct Dyn 16:265-
280.
97. Lee, M. R., Y. Duan, and P. A. Kollman. 2000. Use of MM-PB/SA in estimating
the free energies of proteins: application to native, intermediates, and unfolded
villin headpiece. Proteins 39:309-316.
98. Villacanas, O., and J. Rubio-Martinez. 2006. Reducing CDK4/6-p16(INK4a)
interface. Computational alanine scanning of a peptide bound to CDK6 protein.
Proteins 63:797-810.
99. Gohlke, H., C. Kiel, and D. A. Case. 2003. Insights into protein-protein binding
by binding free energy calculation and free energy decomposition for the Ras-Raf
and Ras-RalGDS complexes. J Mol Biol 330:891-913.
100. Massova, I., and P. A. Kollman. 1999. Computational alanine scanning to probe
protein-protein interactions: A novel approach to evaluate binding free energies.
Journal of the American Chemical Society 121:8133-8143.
109
101. Chong, L. T., Y. Duan, L. Wang, I. Massova, and P. A. Kollman. 1999. Molecular
dynamics and free-energy calculations applied to affinity maturation in antibody
48G7. Proc Natl Acad Sci U S A 96:14330-14335.
102. Huo, S., I. Massova, and P. A. Kollman. 2002. Computational alanine scanning of
the 1:1 human growth hormone-receptor complex. J Comput Chem 23:15-27.
103. Huo, S., J. Wang, P. Cieplak, P. A. Kollman, and I. D. Kuntz. 2002. Molecular
dynamics and free energy analyses of cathepsin D-inhibitor interactions: insight
into structure-based ligand design. J Med Chem 45:1412-1419.
104. Kuhn, B., and P. A. Kollman. 2000. Binding of a diverse set of ligands to avidin
and streptavidin: an accurate quantitative prediction of their relative affinities by a
combination of molecular mechanics and continuum solvent models. J Med Chem
43:3786-3791.
105. Reyes, C. M., and P. A. Kollman. 2000. Structure and thermodynamics of RNA-
protein binding: using molecular dynamics and free energy analyses to calculate
the free energies of binding and conformational change. J Mol Biol 297:1145-
1158.
106. Tsui, V., and D. A. Case. 2001. Calculations of the absolute free energies of
binding between RNA and metal ions using molecular dynamics simulations and
continuum electrostatics. J Phys Chem B 105:11314-11325.
107. Gouda, H., I. D. Kuntz, D. A. Case, and P. A. Kollman. 2003. Free energy
calculations for theophylline binding to an RNA aptamer: MM-PBSA and
comparison of thermodynamic integration methods. Biopolymers 68:16-34.
108. Berendsen, H. J. C., D. van der Spoel, and R. van Drunen. 1995. GROMACS: A
message-passing parallel molecular dynamics implementation. Comp Phys Comm
91:43-56.
109. Lindahl, E., B. Hess, and D. van der Spoel. 2001. Gromacs 3.0: A package for
molecular simulation and trajectory analysis. J Mol Mod 7:306-317.
110. van Der Spoel, D., E. Lindahl, B. Hess, A. R. van Buuren, E. Apol, P. J.
Meulenhoff, D. P. Tieleman, A. L. T. M. Sijbers, A. K. Feenstra, R. van Drunen,
and H. J. C. Berendsen. 2004. GROningen Machine for Molecular Simulations.
Groningen.
111. van Gunsteren, W. F., S. R. Billeter, A. A. Eising, P. H. Hunenberger, P. Kruger,
A. E. Mark, W. R. P. Scott, and I. G. Tironi. 1996. Biomolecular Simulation: The
GROMOS96 Manual and User Guide. Vdf Hochschulverlag AG an der ETH
Zurich, Zurich:1-1024.
112. Berman, H. M., T. Battistuz, T. N. Bhat, W. F. Bluhm, P. E. Bourne, K.
Burkhardt, Z. Feng, G. L. Gilliland, L. Iype, S. Jain, P. Fagan, J. Marvin, D.
Padilla, V. Ravichandran, B. Schneider, N. Thanki, H. Weissig, J. D. Westbrook,
and C. Zardecki. 2002. The Protein Data Bank. Acta Crystallogr D Biol
Crystallogr 58:899-907.
113. Xiang, Z. 2002. JACKAL: A Protein Structure Modeling Package. Columbia
University, New York, NY.
114. Berendsen, H. J. C., J. P. M. Postma, W. F. van Gunsteren, and J. Hermans. 1969.
Interaction Models for Water in Relation to Protein Hydration. Nature 224:175-
177.
110
115. Hess, B., H. Bekker, H. J. C. Berendsen, and J. G. E. M. Fraaije. 1997. LINCS: A
linear constraint solver for molecular simulations. J Comp Chem 18:1463-1472.
116. Miyamoto, S., and P. A. Kollman. 1992. SETTLE: An Analytical Version of the
SHAKE and RATTLE Algorithms for Rigid water models. J Comp Chem 13:952-
962.
117. Berendsen, H. J. C., J. P. M. Postma, A. DiNola, and J. R. Haak. 1984. Molecular
dynamics with coupling to an external bath. J Chem Phys 81:3684-3690.
118. Essman, U., L. Perela, M. L. Berkowitz, T. Darden, H. Lee, and L. G. Pederse.
1995. A smooth particle mesh Ewald method. J Chem Phys 103:8577-8592.
119. Guex, N., and M. C. Peitsch. 1997. SWISS-MODEL and the Swiss-PdbViewer:
an environment for comparative protein modeling. Electrophoresis 18:2714-2723.
120. Humphrey, W., A. Dalke, and K. Schulten. 1996. VMD: visual molecular
dynamics. J Mol Graph 14:33-38, 27-38.
121. Frishman, D., and P. Argos. 1995. Knowledge-based protein secondary structure
assignment. Proteins 23:566-579.
122. Baker, N. A., D. Sept, S. Joseph, M. J. Holst, and J. A. McCammon. 2001.
Electrostatics of nanosystems: application to microtubules and the ribosome. Proc
Natl Acad Sci U S A 98:10037-10041.
123. Sharp, K. A., and B. Honig. 1990. Electrostatic interactions in macromolecules:
theory and applications. Annu Rev Biophys Biophys Chem 19:301-332.
124. King, G., F. S. Lee, and A. Warshel. 1991. Microscopic Simulations of
Macroscopic Dielectric-Constants of Solvated Proteins. Journal of Chemical
Physics 95:4366-4377.
125. Warshel, A., and J. Aqvist. 1991. Electrostatic energy and macromolecular
function. Annu Rev Biophys Biophys Chem 20:267-298.
126. Kasper, P., P. Christen, and H. Gehring. 2000. Empirical calculation of the
relative free energies of peptide binding to the molecular chaperone DnaK.
Proteins 40:185-192.
127. Sharp, K. A. 1998. Calculation of HyHel10-lysozyme binding free energy
changes: effect of ten point mutations. Proteins 33:39-48.
128. Novotny, J., R. E. Bruccoleri, M. Davis, and K. A. Sharp. 1997. Empirical free
energy calculations: a blind test and further improvements to the method. J Mol
Biol 268:401-411.
129. Eisenhaber, F., P. Lijnzaad, P. Argos, C. Sander, and M. Scharf. 1995. The
Double Cube Lattice Method: Efficient Approaches to Numerical Integration of
Surface Area and Volume and to Dot Surface Contouring of Molecular
Assemblies. J Comp Chem 16:273-284.
130. Levitt, M., C. Sander, and P. S. Stern. 1983. The normal modes of a protein:
native bovine pancreatic trypsin inhibitor. Int J Quantum Chem Quantum Biol
Symp 10:181-199.
131. Go, N., T. Noguti, and T. Nishikawa. 1983. Dynamics of a small globular protein
in terms of low-frequency vibrational modes. Proc Natl Acad Sci U S A 80:3696-
3700.
132. Brooks, B., and M. Karplus. 1983. Harmonic dynamics of proteins: normal modes
and fluctuations in bovine pancreatic trypsin inhibitor. Proc Natl Acad Sci U S A
80:6571-6575.
111
133. McQuarrie, D. A. 1973. Statistical thermodynamics. Harper & Row, New York.
134. Kuboniwa, H., N. Tjandra, S. Grzesiek, H. Ren, C. B. Klee, and A. Bax. 1995.
Solution structure of calcium-free calmodulin. Nat Struct Biol 2:768-776.
135. Zhang, M., T. Tanaka, and M. Ikura. 1995. Calcium-induced conformational
transition revealed by the solution structure of apo calmodulin. Nat Struct Biol
2:758-767.
136. Fallon, J. L., and F. A. Quiocho. 2003. A closed compact structure of native
Ca(2+)-calmodulin. Structure (Camb) 11:1303-1307.
137. Hayashi, N., M. Matsubara, A. Takasaki, K. Titani, and H. Taniguchi. 1998. An
expression system of rat calmodulin using T7 phage promoter in Escherichia coli.
Protein Expr Purif 12:25-28.
138. Gopalakrishna, R., and W. B. Anderson. 1982. Ca2+-induced hydrophobic site on
calmodulin: application for purification of calmodulin by phenyl-Sepharose
affinity chromatography. Biochem Biophys Res Commun 104:830-836.
139. Garbers, D. L., J. R. Hansbrough, E. W. Radany, R. V. Hyne, and G. S. Kopf.
1980. Purification and characterization of calmodulin from sea urchin
spermatozoa. J Reprod Fertil 59:377-381.
140. Ganoth, A., E. Nachliel, R. Friedman, and M. Gutman. 2006. Molecular dynamics
study of a calmodulin-like protein with an IQ peptide: Spontaneous refolding of
the protein around the peptide. Proteins 64:133-146.
141. Petegem, F. V., F. C. Chatelain, and D. L. Minor. 2005. Insights into voltage-
gated calcium channel regulation from the structure of the Ca(V)1.2 IQ domain-
Ca(2+)/calmodulin complex. Nat Struct Mol Biol.
142. Sitkoff, D., K. Sharp, and B. Honig. 1994. Accurate calculation of hydration free
energies using macroscopic solvent models. J Phys Chem 98:1978-1988.
143. Cox, J. A., M. Comte, J. E. Fitton, and W. F. DeGrado. 1985. The interaction of
calmodulin with amphiphilic peptides. J Biol Chem 260:2527-2534.
144. Crivici, A., and M. Ikura. 1995. Molecular and structural basis of target
recognition by calmodulin. Annu Rev Biophys Biomol Struct 24:85-116.
145. Ikura, M., G. M. Clore, A. M. Gronenborn, G. Zhu, C. B. Klee, and A. Bax. 1992.
Solution structure of a calmodulin-target peptide complex by multidimensional
NMR. Science 256:632-638.
146. Misra, V. K., J. L. Hecht, K. A. Sharp, R. A. Friedman, and B. Honig. 1994. Salt
effects on protein-DNA interactions. The lambda cI repressor and EcoRI
endonuclease. J Mol Biol 238:264-280.
147. Misra, V. K., K. A. Sharp, R. A. Friedman, and B. Honig. 1994. Salt effects on
ligand-DNA binding. Minor groove binding antibiotics. J Mol Biol 238:245-263.
148. Sharp, K. A. 1996. Electrostatic interactions in hirudin-thrombin binding. Biophys
Chem 61:37-49.
149. Andre, I., T. Kesvatera, B. Jonsson, and S. Linse. 2006. Salt enhances
calmodulin-target interaction. Biophys J 90:2903-2910.
150. Lee, A. L., S. A. Kinnear, and A. J. Wand. 2000. Redistribution and loss of side
chain entropy upon formation of a calmodulin-peptide complex. Nat Struct Biol
7:72-77.
112
151. Duneau, J. P., N. Garnier, and M. Genest. 1997. Insight into signal transduction:
structural alterations in transmembrane helices probed by multi-1 ns molecular
dynamics simulations. J Biomol Struct Dyn 15:555-572.
152. Duneau, J. P., S. Crouzy, Y. Chapron, and M. Genest. 1999. Dynamics of the
transmembrane domain of the ErbB-2 receptor. Theor Chem Acc 101:87-91.
153. Shepherd, C. M., D. van der Spoel, and H. J. Vogel. 2004. Molecular dynamics
simulations of peptides from the central domain of smooth muscle caldesmon. J
Biomol Struct Dyn 21:555-566.
154. Lee, K. H., D. R. Benson, and K. Kuczera. 2000. Transitions from alpha to pi
helix observed in molecular dynamics simulations of synthetic peptides.
Biochemistry 39:13737-13747.
155. Weaver, T., and L. Banaszak. 1996. Crystallographic studies of the catalytic and a
second site in fumarase C from Escherichia coli. Biochemistry 35:13955-13965.
156. Sprang, S. R., S. G. Withers, E. J. Goldsmith, R. J. Fletterick, and N. B. Madsen.
1991. Structural basis for the activation of glycogen phosphorylase b by
adenosine monophosphate. Science 254:1367-1371.
157. Boyington, J. C., B. J. Gaffney, and L. M. Amzel. 1993. The three-dimensional
structure of an arachidonic acid 15-lipoxygenase. Science 260:1482-1486.
158. Vidgren, J., L. A. Svensson, and A. Liljas. 1994. Crystal structure of catechol O-
methyltransferase. Nature 368:354-358.
159. Bullock, T. L., K. Breddam, and S. J. Remington. 1996. Peptide aldehyde
complexes with wheat serine carboxypeptidase II: implications for the catalytic
mechanism and substrate specificity. J Mol Biol 255:714-725.
160. Liu, G., D. K. Sukumaran, D. Xu, Y. Chiang, T. Acton, S. Goldsmith-Fischman,
B. Honig, G. T. Montelione, and T. Szyperski. 2004. NMR structure of the
hypothetical protein NMA1147 from Neisseria meningitidis reveals a distinct 5-
helix bundle. Proteins 55:756-758.
161. Musso, G. F., S. Patthi, T. C. Ryskamp, S. Provow, E. T. Kaiser, and G.
Velicelebi. 1988. Development of helix-based vasoactive intestinal peptide
analogues: identification of residues required for receptor interaction.
Biochemistry 27:8174-8181.
162. Eyal, E., S. Gerzon, V. Potapov, M. Edelman, and V. Sobolev. 2005. The limit of
accuracy of protein modeling: influence of crystal packing on protein structure. J
Mol Biol 351:431-442.
163. DePristo, M. A., P. I. de Bakker, and T. L. Blundell. 2004. Heterogeneity and
inaccuracy in protein structures solved by X-ray crystallography. Structure
12:831-838.
164. Espindola, F. S., E. M. Espreafico, M. V. Coelho, A. R. Martins, F. R. Costa, M.
S. Mooseker, and R. E. Larson. 1992. Biochemical and immunological
characterization of p190-calmodulin complex from vertebrate brain: a novel
calmodulin-binding myosin. J Cell Biol 118:359-368.
165. Yildiz, A., H. Park, D. Safer, Z. Yang, L. Q. Chen, P. R. Selvin, and H. L.
Sweeney. 2004. Myosin VI steps via a hand-over-hand mechanism with its lever
arm undergoing fluctuations when attached to actin. J Biol Chem 279:37223-
37226.
113
166. Yildiz, A., J. N. Forkey, S. A. McKinney, T. Ha, Y. E. Goldman, and P. R.
Selvin. 2003. Myosin V walks hand-over-hand: single fluorophore imaging with
1.5-nm localization. Science 300:2061-2065.
167. Yildiz, A., and P. R. Selvin. 2005. Fluorescence imaging with one nanometer
accuracy: application to molecular motors. Acc Chem Res 38:574-582.
168. Forkey, J. N., M. E. Quinlan, M. A. Shaw, J. E. Corrie, and Y. E. Goldman. 2003.
Three-dimensional structural dynamics of myosin V by single-molecule
fluorescence polarization. Nature 422:399-404.
169. Walker, M. L., S. A. Burgess, J. R. Sellers, F. Wang, J. A. Hammer, 3rd, J.
Trinick, and P. J. Knight. 2000. Two-headed binding of a processive myosin to F-
actin. Nature 405:804-807.
170. Burgess, S., M. Walker, F. Wang, J. R. Sellers, H. D. White, P. J. Knight, and J.
Trinick. 2002. The prepower stroke conformation of myosin V. J Cell Biol
159:983-991.
171. Houdusse, A., A. G. Szent-Gyorgyi, and C. Cohen. 2000. Three conformational
states of scallop myosin S1. Proc Natl Acad Sci U S A 97:11238-11243.
סימולציות מחשב של קומפלקסים
בין חלבון לפפטיד תוך שימוש
מודלמערכתבשרשרת הקלה של מיוזין כ
לשם קבלת התוארחיבור
"דוקטור לפילוסופיה"
מאת
אסף גנות
אביב-הוגש לסנאט אוניברסיטת תל
2006 יולי
עבודה זו נעשתה בהדרכת
מנחם גוטמן' פרופ
תקציר
. מית של מיוזיןיכ-הוא רכיב חיוני במערכת המכנו, חבר במשפחת הקלמודולין, Mlc1pהחלבון
ואר של ו קושרים את מתחם הצMlc1pשישה חלבוני , Saccharomyces cerevisiaeבשמר ההנצה
לבין Mlc1p מבנים של מספר קומפלקסים בין החלבון .IQ פפטידים מסוג המורכב משישה, Vמיוזין
, המחקר הנוכחי מרחיב את ההבנה שלנו.X של קרני קריסטלוגרפיהבאמצעות נפתרו IQפפטידי
דינמיקה ואנרגטיקה של , על מבנים,י דינמיקה מולקולרית וחישובים נוספים"באמצעות מחקר ע
המחקר , הרחבה ניכרת של המידע המבני האצור במבנים הגבישייםל בנוסף. Mlc1p-IQהקומפלקסים
כיצד קיפול באתר מסוים ו מצאנ. על סיב האקטיןVמספק הבנה אטומית אודות דגם התנועה של מיוזין
מנוף תקינה שלה תופעולאת ובכך להבטיח , כואר המיוזין יכול לשמש כמפרק גמיש לאורוהממוקם על צו
.זרוע המיוזין
4IQ ו 2IQסימולציות של דינמיקה מולקולרית של הפפטידים ) I: (כוללת ארבעה פרקיםהתיזה
-Mlc1p השוואה מפורטת בין הסימולציות של שני הקומפלקסים ) Mlc1p;) IIבקומפלקס עם החלבון
IQ;) III ( סימולציות של דינמיקה מולקולרית של פפטידיIQחופשיים ;) IV (של הערכה מחודשת
.Vתחם הקושר שרשראות קלות של מיוזין המבנה של המ
פלקסיםמבפרק הראשון של התיזה אנחנו מציגים סימולציות של דינמיקה מולקולרית של הקו
Mlc1p-IQ4 ו Mlc1p-IQ2 ,הקומפלקס.לאחר רלקסציה שלהם בתמיסת מלח פיזיולוגית Mlc1p-
IQ2 בות הסימולציה אופיין בשינוי עיקריים שלו ובעקהעבר רלקסציה ללא איבוד של כוחות האריזה
עבר תהליך של קיפול מחדש במהלכו החלבון שינה את , 4IQעם הפפטיד , פלקס הנוסףמ הקו.מבני מוגבל
אנו מציגים השוואה , בפרק השני של התיזה. מבנהו ממתוח לקומפקטי והפפטיד התקפל לשני חלקים
, בצענו השוואה מקיפה בין המבנים. קסיםפלממפורטת בין סימולציות הדינמיקה המולקולרית של שני הקו
אנליזת האנרגיה כוללת ניתוח של . פפטיד- חלבוןנמיקה והאנרגיה החופשית של האינטראקציהיהד
. פטיד וציון תרומתם של רכיבים אנרגטיים שונים והפחלבון בין הפלקסיםמהכוחות הפועלים על הקו
ך פרקי זמן משתנים ובתנאים מש חופשיים לIQהפרק השלישי של התיזה עוסק בסימולציות של פפטידי
שלושינויים בין מבני התמיסה, שינויים בין הקונפורמציות של פפטידים בנוכחות ובהעדר חלבון. שונים
פלקסים מקוה התוצאות של סימולציות הדינמיקה המולקולרית של . נידונים, החופשייםIQשני פפטידי ה
ושל פפטידים חופשיים מאפשרות לבצע הערכה של המבנה של המתחם הקושר שרשראות פפטיד -חלבון
. IQלבין פפטידי ) Mlc1pכמו (לקסים בין שרשראות קלות פה קומשהמורכב משי, Vקלות של מיוזין
במרכז המודל המוצע עומדת התפיסה שהמתחם הקושר . הערכה זו מוצגת בפרק הרביעי של התיזה
ולכן המודל לוקח בחשבון את יכולתם של החלבון ,נמיתי הוא ישות תאית דVוזין שרשראות קלות של מי
Mlc1p ושל פפטידי IQלהתגמש ולהתכופף באופן הדדי .
מתאר סימולציות של דינמיקה המאמר הראשון. תוצאות התיזה מסוכמות בשלושה מאמרים
4IQ-p1Mlc) . and M, Friedman.R, Nachliel. E, .A, Ganothמולקולרית של הקומפלקס
Gutman. 2006. Molecular dynamics study of a calmodulin-like protein with an IQ
peptide: Spontaneous refolding of the protein around the peptide. Proteins 64:133-
ללת הכו, Mlc1p-IQ2 ו Mlc1p-IQ4 מציג השוואה נרחבת בין הקומפלקסים המאמר השני;)146
V ). R, .A, anothGאנליזה אנרגטית והערכה מחודשת של המתחם הקושר שרשראות קלות של מיוזין
Friedman, E. Nachliel, and M. Gutman. 2006. A molecular dynamics study and free
energy analysis of complexes between the Mlc1p protein and two IQ motif peptides.
Biophys J 91:2436-2450(; העוסק בסימולציות של פפטידי , המאמר השלישיIQבהכנה, חופשיים.
IQהמחקר הנוכחי מהווה מחקר חדשני של סימולציות דינמיקה מולקולרית של פפטידי
- בתהליכים מכנוIQפפטידי תפקידם העיקרי של לאור . קלמודולין ובהעדרו בקומפלקס עם חלבון דמוי
ה מספקת תיאור ברזולוציה אטומית של האינטראקציות שלהם עם קלמודולין וחלבונים התיז, כימיים
ולציות דינמיקה מולקולרית ונגמר מחיל בסיתמשרטט טיול ממוחשב המ המחקר. דמויי קלמודולין
מדענים ששיתוף פעולה בין ומדגים ,Vבתובנות אודות המתחם הקושר שרשראות קלות של מיוזין
.מבנה-נה טובה יותר של יחסי תפקודהבפיזיקאים יכול לתרום ל-ביובין להגרפילוטקריסהעוסקים ב