1
Variational Multiscale Modeling of Biomolecular Complexes Kelin Xia 1 , Xin Feng 2 , Yiying Tong 2 , and Guo-wei Wei 1,3 1 Department of Mathematics, Michigan State University, MI 48824, USA 2 Department of Computer Science and Engineering, Michigan State University, MI 48824, USA 3 Department of Biochemistry and Molecular Biology, Michigan State University, MI 48824, USA Introduction Multiscale modeling is of paramount importance to the understanding of biomolecular structure, function, dynamics and transport. Geometric modeling provides structural representations of molecular data from the Protein Data Bank (PDB) and the Electron Microscopy Data Bank (EMDB). Commonly used geometric models, such as molecular surface (MS), van der Waals surface, and solvent accessible surface are ad hoc devision of solvent and solute regions and lead to troublesome geometric singularities, as demonstrated in the figure below. At fundamental level, solvent and solute electron densities overlap each other and there is no sharp solvent-solute interface. Surface definitions and defects Molecular surface of protein (ID:1PPL) and its geometric singularity. We discuss our variational multiscale models and associated geometric modeling of biomolecular complexes, based on dierential geometry of surfaces and geometric measure theory. Our models give rise to singularity-free surface representation, curvature characterization, electrostatic mapping, solvation energy and binding anity analysis of biomolecules. V ariational Multiscale Models Almost all the biological processes in a living cell occur in aqueous surroundings, because up to 65%-90% of human cell mass is water. In our multiscale solvation model, a total free energy functional is constructed to include polar and nonpolar free energies. I The free energy functional of the solvation process is, G total [S, Φ]= Z γ|S| + pS + S h - m 2 |Φ| 2 + Φρ m i +(1 - S) " - s 2 |Φ| 2 - k B T X α ρ α0 e - q α Φ+U α -μ α0 k B T - 1 !# dr, where γ is the surface tension, S is the hypersurface and can be viewed as a characteristic function of the solute domain, p is the hydrodynamic pressure, and U α denotes the solvent-solute non-electrostatic interactions, such as the van der Waals interaction. Here Φ is the electrostatic potential, s and m are the dielectric constants of the solvent and solute, respectively, ρ m represents the fixed charge density of the solute and k B T is the thermal energy. By applying the variational principle to minimize the total solvation free energy with respect to Φ and S, two equations are generated. I The Generalized Poisson-Boltzmann Equation describes the electrostatic potential, - ∇· ((S)Φ)= Sρ m +(1 - S) X α q α ρ α0 e - q α Φ+U α -μ α0 k B T , where ρ α0 denotes the bulk concentration, and μ α0 is a relative reference chemical potential. I The generalized Laplace-Beltrami equation governs the surface formation under potential driven geometric flows, S t = |S| ∇· γ S |S| + V 1 , where the potential driven term V 1 is given by V 1 =-p + m 2 |Φ| 2 - Φρ m - s 2 |Φ| 2 - k B T X α ρ α0 e - q α Φ+U α -μ α0 k B T - 1 ! . The external potential term can be adjusted to take into consideration of other eects. For instance, if we assume that the biomolecular system is far from equilibrium and account for the chemical potential related energy in the free energy functional, we result in a potential driven term of the form V 2 =-p + U + m 2 |Φ| 2 - Φρ m - s 2 |Φ| 2 + Φ X α ρ α q α + X α k B T ρ α ln ρ α ρ α0 - ρ α + ρ α0 - μ α0 ρ α . Our model can be easily modified to account for other physical interactions. Surface and Electrostatic Analysis The solution of the generalized Laplace-Beltrami equation gives rise to biomolecular surfaces of controllable resolutions. These surfaces are free from geometric singularities. Their resolutions can be tuned to avoid the local atomic fluctuation in the curvature analysis. V ariational multiresolu- tion surface A comparison of the molecular surface (Left Chart) and our variational surface (Right Chart) of a protein (ID:1PPL). The Generalized Poisson-Boltzmann equation is solved to obtain electrostatic potential, which oers an indication of possible binding sites. Electrostatic Potential A comparison of the electrostatic potentials on the molecular surface (Left Chart) and our variational surface (Right Chart) of a protein (ID:1PPL). EMD Data Preprocessing The EMDB collects protein structural information from electron microscopy. The associated data are in a volumetric format and usually suer from low signal-to-noise rate (SNR). Therefore, the noise reduction of EMDB data is mandatory. High order geometric flows, which can more eciently suppress the high-frequency components, are employed for EMDB data analysis. A specific form of arbitrarily high order geometric PDEs is given by, S t = (-1) q q g(|∇∇ 2q S|)∇· (2q S) q g(|∇∇ 2q S|) + P(S, |S|), where g(|∇∇ 2q S|)= 1 + |∇∇ 2q S| 2 is the generalized Gram determinant. When q = 0, we arrive at a generalized mean curvature flow. EMDB data noise reduction Noise reduction of the EMDB data (ID emd5119). References [1] Z. Chen, N. Baker, andG. W. Wei, Dierential geometry based solvation model I: Eulerian formulation, Journal of Computational Physics, 229: 8231-8258 (2010). [2] G. W. Wei, Q. Zheng, Z. Chen, and K. L. Xia, Variational multiscale models for charge transport, SIAM Review. 54, 699-754 (2012). [3] X. Feng, K. L. Xia, Y. Tong, andG. W. Wei, Geometric modeling of organelles, subcellular structures and multiprotein complexes, International Journal for Numerical Methods in Biomedical Engineering, 28, 1198-1223 (2012) [4] K. L. Xia, X. Feng, Z. Chen, Y. Tong, andG. W. Wei, Multiscale geometric modeling of macromolecules I: Cartesian representation,Journal of Computational Physics, 257, 912-936 (2014). Geometric Modeling —Meshing The Lagrangian representation of protein surfaces derived from our variational multiscale models can be used for volumetric meshing. The Delaunay triangulation algorithm is implemented. Mesh detail The meshing of protein complex (emd1590). High quality mesh is constructed with the Delaunay triangulation in TetGen. Geometric Modeling —Curvature The surfaces are characterized by using Gaussian, mean, minimal principal and maximal principal curvatures, which indicate potential binding sites. Mean curvature Mean curvature maps of six multiprotein complexes. Binding-Site Prediction The product of minimal curvature and electrostatic potential indicates binding sites. Binding site prediction The upper row shows experimental binding sizes, and the lower row illustrates our predictions. Conclusion Our variational multiscale modeling demonstrates a great promising for the geometric, physical, and biological analysis of biomolecular complexes. Acknowledgment This work was partially supported by NSF grants CCF-0936830, IIS-0953096 and DMS-1160352 (FRG), and NIH grants R01GM-090208. http://www.math.msu.edu/˜wei/ [email protected] [email protected] [email protected] [email protected]

Variational Multiscale Modeling of Biomolecular ComplexesThe external potential term can be adjusted to take into consideration of other e ects. For instance, if we assume that the

  • Upload
    others

  • View
    3

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Variational Multiscale Modeling of Biomolecular ComplexesThe external potential term can be adjusted to take into consideration of other e ects. For instance, if we assume that the

Variational Multiscale Modeling of Biomolecular Complexes

Kelin Xia1, Xin Feng2, Yiying Tong2, and Guo-wei Wei1,3

1Department of Mathematics, Michigan State University, MI 48824, USA2Department of Computer Science and Engineering, Michigan State University, MI 48824, USA3Department of Biochemistry and Molecular Biology, Michigan State University, MI 48824, USA

IntroductionMultiscale modeling is of paramount importance to the understanding of biomolecular structure,function, dynamics and transport. Geometric modeling provides structural representations ofmolecular data from the Protein Data Bank (PDB) and the Electron Microscopy Data Bank (EMDB).Commonly used geometric models, such as molecular surface (MS), van der Waals surface, andsolvent accessible surface are ad hoc devision of solvent and solute regions and lead to troublesomegeometric singularities, as demonstrated in the figure below. At fundamental level, solvent andsolute electron densities overlap each other and there is no sharp solvent-solute interface.

Surfacedefinitionsand defects

Molecular surfaceof protein(ID:1PPL) and itsgeometricsingularity.

We discuss our variational multiscale models and associated geometric modeling of biomolecularcomplexes, based on differential geometry of surfaces and geometric measure theory. Our modelsgive rise to singularity-free surface representation, curvature characterization, electrostaticmapping, solvation energy and binding affinity analysis of biomolecules.

VariationalMultiscaleModelsAlmost all the biological processes in a living cell occur in aqueous surroundings, because up to65%-90% of human cell mass is water. In our multiscale solvation model, a total free energyfunctional is constructed to include polar and nonpolar free energies.I The free energy functional of the solvation process is,

Gtotal[S,Φ] =

∫ {γ|∇S|+ pS + S

[−εm2|∇Φ|2 +Φ ρm

]+(1 − S)

[−εs2|∇Φ|2 − kBT

∑α

ρα0

(e−

qαΦ+Uα−µα0kBT − 1

)]}dr,

where γ is the surface tension, S is the hypersurface and can be viewed as a characteristic functionof the solute domain, p is the hydrodynamic pressure, and Uα denotes the solvent-solutenon-electrostatic interactions, such as the van der Waals interaction. HereΦ is the electrostaticpotential, εs and εm are the dielectric constants of the solvent and solute, respectively, ρmrepresents the fixed charge density of the solute and kBT is the thermal energy. By applying thevariational principle to minimize the total solvation free energy with respect toΦ and S, twoequations are generated.

I The Generalized Poisson-Boltzmann Equation describes the electrostatic potential,

−∇ · (ε(S)∇Φ) = Sρm + (1 − S)∑α

qαρα0e−

qαΦ+Uα−µα0kBT ,

where ρα0 denotes the bulk concentration, and µα0 is a relative reference chemical potential.I The generalized Laplace-Beltrami equation governs the surface formation under potential

driven geometric flows,

∂S∂t

= |∇S|[∇ ·(γ∇S|∇S|

)+ V1

],

where the potential driven term V1 is given by

V1 = −p +εm2|∇Φ|2 −Φ ρm −

εs2|∇Φ|2 − kBT

∑α

ρα0

(e−

qαΦ+Uα−µα0kBT − 1

).

The external potential term can be adjusted to take into consideration of other effects. Forinstance, if we assume that the biomolecular system is far from equilibrium and account for thechemical potential related energy in the free energy functional, we result in a potential driventerm of the form

V2 = −p + U +εm2|∇Φ|2 −Φ ρm −

εs2|∇Φ|2 +Φ

∑α

ραqα

+∑α

[kBT

(ραln

ρα

ρα0− ρα + ρα0

)− µα0ρα

].

Our model can be easily modified to account for other physical interactions.

Surface and Electrostatic AnalysisThe solution of the generalized Laplace-Beltrami equation gives rise to biomolecular surfaces ofcontrollable resolutions. These surfaces are free from geometric singularities. Their resolutions canbe tuned to avoid the local atomic fluctuation in the curvature analysis.

Variationalmultiresolu-tionsurface

A comparison of themolecular surface (LeftChart) and ourvariational surface(Right Chart) of aprotein (ID:1PPL).

The Generalized Poisson-Boltzmann equation is solved to obtain electrostatic potential, whichoffers an indication of possible binding sites.

ElectrostaticPotentialA comparison of theelectrostatic potentialson the molecularsurface (Left Chart) andour variational surface(Right Chart) of aprotein (ID:1PPL).

EMD Data PreprocessingThe EMDB collects protein structural information from electron microscopy. The associated dataare in a volumetric format and usually suffer from low signal-to-noise rate (SNR). Therefore, thenoise reduction of EMDB data is mandatory. High order geometric flows, which can moreefficiently suppress the high-frequency components, are employed for EMDB data analysis. Aspecific form of arbitrarily high order geometric PDEs is given by,

∂S∂t

= (−1)q√

g(|∇∇2qS|)∇ ·

∇(∇2qS)√g(|∇∇2qS|)

+ P(S, |∇S|),

where g(|∇∇2qS|) = 1 + |∇∇2qS|2 is the generalized Gram determinant. When q = 0, we arrive at ageneralized mean curvature flow.

EMDB datanoisereduction

Noise reduction of theEMDB data (IDemd5119).

References[1] Z. Chen, N. Baker, and G. W. Wei, Differential geometry based solvation model I: Eulerianformulation, Journal of Computational Physics, 229: 8231-8258 (2010).[2] G. W. Wei, Q. Zheng, Z. Chen, and K. L. Xia, Variational multiscale models for charge transport,SIAM Review. 54, 699-754 (2012).[3] X. Feng, K. L. Xia, Y. Tong, and G. W. Wei, Geometric modeling of organelles, subcellular structuresand multiprotein complexes, International Journal for Numerical Methods in BiomedicalEngineering, 28, 1198-1223 (2012)[4] K. L. Xia, X. Feng, Z. Chen, Y. Tong, and G. W. Wei, Multiscale geometric modeling ofmacromolecules I: Cartesian representation,Journal of Computational Physics, 257, 912-936 (2014).

GeometricModeling— MeshingThe Lagrangian representation of protein surfaces derived from our variational multiscale modelscan be used for volumetric meshing. The Delaunay triangulation algorithm is implemented.

Meshdetail

The meshing ofprotein complex(emd1590). Highquality mesh isconstructed withthe Delaunaytriangulation inTetGen.

GeometricModeling— CurvatureThe surfaces are characterized by using Gaussian, mean, minimal principal and maximal principalcurvatures, which indicate potential binding sites.

Meancurvature

Mean curvaturemaps of sixmultiproteincomplexes.

Binding-Site PredictionThe product of minimal curvature and electrostatic potential indicates binding sites.

Bindingsiteprediction

The upper rowshowsexperimentalbinding sizes, andthe lower rowillustrates ourpredictions.

ConclusionOur variational multiscale modeling demonstrates a great promising for the geometric, physical,and biological analysis of biomolecular complexes.

Acknowledgment

This work was partially supported by NSF grants CCF-0936830,IIS-0953096 and DMS-1160352 (FRG), and NIH grants R01GM-090208.

http://www.math.msu.edu/˜wei/ [email protected] [email protected] [email protected] [email protected]