37
In Silico In Silico Laboratories: Laboratories: The Virtual Parasite The Virtual Parasite Project - An Overview Project - An Overview SURA Grid Computing in the Life Sciences - 2006 SURA Grid Computing in the Life Sciences - 2006 Tarynn M. Witten, Ph.D., FGSA, FCSBC Tarynn M. Witten, Ph.D., FGSA, FCSBC Director, Research and Development Director, Research and Development Center for the Study of Biological Complexity Center for the Study of Biological Complexity Visual Parasite Project Visual Parasite Project Virginia Commonwealth University Virginia Commonwealth University Richmond, VA Richmond, VA [email protected]

In Silico Laboratories: The Virtual Parasite Project - An Overview SURA Grid Computing in the Life Sciences - 2006 Tarynn M. Witten, Ph.D., FGSA, FCSBC

Embed Size (px)

Citation preview

Page 1: In Silico Laboratories: The Virtual Parasite Project - An Overview SURA Grid Computing in the Life Sciences - 2006 Tarynn M. Witten, Ph.D., FGSA, FCSBC

In SilicoIn Silico Laboratories: Laboratories:The Virtual Parasite Project - The Virtual Parasite Project -

An OverviewAn OverviewSURA Grid Computing in the Life Sciences - 2006SURA Grid Computing in the Life Sciences - 2006

Tarynn M. Witten, Ph.D., FGSA, FCSBCTarynn M. Witten, Ph.D., FGSA, FCSBCDirector, Research and DevelopmentDirector, Research and Development

Center for the Study of Biological ComplexityCenter for the Study of Biological ComplexityVisual Parasite ProjectVisual Parasite Project

Virginia Commonwealth UniversityVirginia Commonwealth UniversityRichmond, VA Richmond, VA

[email protected]

Page 2: In Silico Laboratories: The Virtual Parasite Project - An Overview SURA Grid Computing in the Life Sciences - 2006 Tarynn M. Witten, Ph.D., FGSA, FCSBC

OVERVIEWOVERVIEW The TeamThe Team Computational Biology – A Larger ViewComputational Biology – A Larger View In Silico LaboratoriesIn Silico Laboratories Introduction to the VPPIntroduction to the VPP

– GoalsGoals Basic Parasite InformationBasic Parasite Information

– Pathology, life cycle, and formPathology, life cycle, and form The VPP ProgramThe VPP Program

– Architecture, model, equations, benchmarksArchitecture, model, equations, benchmarks– VisualizationVisualization

Patent ApplicationPatent Application Where to From HereWhere to From Here

Page 3: In Silico Laboratories: The Virtual Parasite Project - An Overview SURA Grid Computing in the Life Sciences - 2006 Tarynn M. Witten, Ph.D., FGSA, FCSBC

The VPP TeamThe VPP Team

Page 4: In Silico Laboratories: The Virtual Parasite Project - An Overview SURA Grid Computing in the Life Sciences - 2006 Tarynn M. Witten, Ph.D., FGSA, FCSBC

Computational BiologyComputational Biology Handling the “omic” hierarchy

– It’s not just genes any more– ETA Systems Computational Medicine and

BioSciences Group Simulations of biomedical elements

– Computational Chemistry– Computational Biomechanics– Computational “Fill in your favorite noun”

Frankenstein in the machine The “in silico laboratory”

Page 5: In Silico Laboratories: The Virtual Parasite Project - An Overview SURA Grid Computing in the Life Sciences - 2006 Tarynn M. Witten, Ph.D., FGSA, FCSBC

In SilicoIn Silico Laboratories Laboratories Not a simulation of an organism but Not a simulation of an organism but an extensible, portable, “in silico,” multi-scale, an extensible, portable, “in silico,” multi-scale,

high performance computational and high performance computational and mathematical laboratory for research into the mathematical laboratory for research into the dynamics of host-parasite interactionsdynamics of host-parasite interactions

To test this environment by examining the host-To test this environment by examining the host-parasite dynamics of the parasite dynamics of the Trypanosoma cruziTrypanosoma cruzi, , the causative agent in Chagas Diseasethe causative agent in Chagas Disease

Page 6: In Silico Laboratories: The Virtual Parasite Project - An Overview SURA Grid Computing in the Life Sciences - 2006 Tarynn M. Witten, Ph.D., FGSA, FCSBC

BASIC BASIC T. CRUZIT. CRUZI INFORMATION INFORMATION

Page 7: In Silico Laboratories: The Virtual Parasite Project - An Overview SURA Grid Computing in the Life Sciences - 2006 Tarynn M. Witten, Ph.D., FGSA, FCSBC

Epidemiology – 1 Epidemiology – 1

• Chagas disease is the 3Chagas disease is the 3rdrd most common parasitic disease after most common parasitic disease after malaria and schistosomiasis malaria and schistosomiasis•Estimates are that over 2 billion people worldwide are Estimates are that over 2 billion people worldwide are affected by these 3 parasitic diseasesaffected by these 3 parasitic diseases•Mortality estimates in Africa for schistosomiasis are Mortality estimates in Africa for schistosomiasis are 200,000/year and most are children200,000/year and most are children•12 species of Trypanasoma cruzi are known to occur in the 12 species of Trypanasoma cruzi are known to occur in the USUS•Trypanosoma cruzi is the causal agent in Chagas diseaseTrypanosoma cruzi is the causal agent in Chagas disease

Page 8: In Silico Laboratories: The Virtual Parasite Project - An Overview SURA Grid Computing in the Life Sciences - 2006 Tarynn M. Witten, Ph.D., FGSA, FCSBC

Epidemiology – 2 Epidemiology – 2 • Recent estimates suggest that more than 17 million Recent estimates suggest that more than 17 million people throughout Latin America are currently infected people throughout Latin America are currently infected with T. cruzi – currently present in 18 countrieswith T. cruzi – currently present in 18 countries•4.8-5.4 million individuals currently exhibiting clinical 4.8-5.4 million individuals currently exhibiting clinical symptomssymptoms•Annual incidence 700,000 – 800,000 new casesAnnual incidence 700,000 – 800,000 new cases•45,000 deaths due to the cardiac form of the disease45,000 deaths due to the cardiac form of the disease•There is no cure and therapeutic agents are highly toxicThere is no cure and therapeutic agents are highly toxic•There is no treatment for chronic Chagas There is no treatment for chronic Chagas •Chagas disease is fatalChagas disease is fatal•The life cycle of the parasite is complexThe life cycle of the parasite is complex

Page 9: In Silico Laboratories: The Virtual Parasite Project - An Overview SURA Grid Computing in the Life Sciences - 2006 Tarynn M. Witten, Ph.D., FGSA, FCSBC

T. CruziT. Cruzi Lifecycle Lifecycle

Page 10: In Silico Laboratories: The Virtual Parasite Project - An Overview SURA Grid Computing in the Life Sciences - 2006 Tarynn M. Witten, Ph.D., FGSA, FCSBC

Pathology in Chagas DiseasePathology in Chagas Disease

•Transmitted through the feces of biting insectsTransmitted through the feces of biting insects•Insects defecate while taking blood mealInsects defecate while taking blood meal•Infected individual scratches feces into wound Infected individual scratches feces into wound starting infectionstarting infection•Myocarditis and cardiomyopathyMyocarditis and cardiomyopathy•Alimentary tract dysfunction manifested by Alimentary tract dysfunction manifested by megaesophagus and megacolonmegaesophagus and megacolon•Acute stage occurs 1-2 weeks after exposureAcute stage occurs 1-2 weeks after exposure

Page 11: In Silico Laboratories: The Virtual Parasite Project - An Overview SURA Grid Computing in the Life Sciences - 2006 Tarynn M. Witten, Ph.D., FGSA, FCSBC

BASIC BASIC PROGRAM PROGRAM INFORMATIONINFORMATION

Page 12: In Silico Laboratories: The Virtual Parasite Project - An Overview SURA Grid Computing in the Life Sciences - 2006 Tarynn M. Witten, Ph.D., FGSA, FCSBC

VPP Program ArchitectureVPP Program Architecture Program is written in public domain languages and uses public Program is written in public domain languages and uses public

domain software (C++, C, Fortran)domain software (C++, C, Fortran) Currently VPP code exceeds 20,000 linesCurrently VPP code exceeds 20,000 lines Modular development of the VPP environment enables users to Modular development of the VPP environment enables users to

supply relevant lab modules for their particular research needssupply relevant lab modules for their particular research needs– Standard worlds are provided (flask, test tube, etc.)Standard worlds are provided (flask, test tube, etc.)– Standard parasite forms are provided (spherical, elliptical, helicoid, Standard parasite forms are provided (spherical, elliptical, helicoid,

amoeboid, flagellar)amoeboid, flagellar)– Standard fluids are provided (water, plasma/blood)Standard fluids are provided (water, plasma/blood)

Parasite we chose to model in the environment was the Parasite we chose to model in the environment was the T. cruzi T. cruzi parasiteparasite

Simulation code runs in a parallel processor (MPI) environment – Simulation code runs in a parallel processor (MPI) environment – Sun Grizzly (32 dual processor node cluster)Sun Grizzly (32 dual processor node cluster)

Visualization interface allows user to visualize actual data in a Visualization interface allows user to visualize actual data in a “video/interactive” format – Sun V880 with in house developed “video/interactive” format – Sun V880 with in house developed interfaceinterface

Page 13: In Silico Laboratories: The Virtual Parasite Project - An Overview SURA Grid Computing in the Life Sciences - 2006 Tarynn M. Witten, Ph.D., FGSA, FCSBC

Initial Approach To Initial Approach To T. cruziT. cruzi Modeling Modeling Macro-scale biophysics at a population Macro-scale biophysics at a population

levellevel Inclusion of host cells with cell cycle modelInclusion of host cells with cell cycle model Single parasite model (sphere with tail)Single parasite model (sphere with tail)

Page 14: In Silico Laboratories: The Virtual Parasite Project - An Overview SURA Grid Computing in the Life Sciences - 2006 Tarynn M. Witten, Ph.D., FGSA, FCSBC
Page 15: In Silico Laboratories: The Virtual Parasite Project - An Overview SURA Grid Computing in the Life Sciences - 2006 Tarynn M. Witten, Ph.D., FGSA, FCSBC

Basic Newtonian ModelBasic Newtonian Model Four basic forcesFour basic forces

– GravitationalGravitational– BuoyantBuoyant– SwimmingSwimming– DragDrag

Page 16: In Silico Laboratories: The Virtual Parasite Project - An Overview SURA Grid Computing in the Life Sciences - 2006 Tarynn M. Witten, Ph.D., FGSA, FCSBC

The Uncoupled EquationsThe Uncoupled Equations

Page 17: In Silico Laboratories: The Virtual Parasite Project - An Overview SURA Grid Computing in the Life Sciences - 2006 Tarynn M. Witten, Ph.D., FGSA, FCSBC
Page 18: In Silico Laboratories: The Virtual Parasite Project - An Overview SURA Grid Computing in the Life Sciences - 2006 Tarynn M. Witten, Ph.D., FGSA, FCSBC
Page 19: In Silico Laboratories: The Virtual Parasite Project - An Overview SURA Grid Computing in the Life Sciences - 2006 Tarynn M. Witten, Ph.D., FGSA, FCSBC

VPP VisualizationVPP Visualization

Page 20: In Silico Laboratories: The Virtual Parasite Project - An Overview SURA Grid Computing in the Life Sciences - 2006 Tarynn M. Witten, Ph.D., FGSA, FCSBC
Page 21: In Silico Laboratories: The Virtual Parasite Project - An Overview SURA Grid Computing in the Life Sciences - 2006 Tarynn M. Witten, Ph.D., FGSA, FCSBC
Page 22: In Silico Laboratories: The Virtual Parasite Project - An Overview SURA Grid Computing in the Life Sciences - 2006 Tarynn M. Witten, Ph.D., FGSA, FCSBC
Page 23: In Silico Laboratories: The Virtual Parasite Project - An Overview SURA Grid Computing in the Life Sciences - 2006 Tarynn M. Witten, Ph.D., FGSA, FCSBC
Page 24: In Silico Laboratories: The Virtual Parasite Project - An Overview SURA Grid Computing in the Life Sciences - 2006 Tarynn M. Witten, Ph.D., FGSA, FCSBC
Page 25: In Silico Laboratories: The Virtual Parasite Project - An Overview SURA Grid Computing in the Life Sciences - 2006 Tarynn M. Witten, Ph.D., FGSA, FCSBC

The New Patent ApplicationThe New Patent Application

Page 26: In Silico Laboratories: The Virtual Parasite Project - An Overview SURA Grid Computing in the Life Sciences - 2006 Tarynn M. Witten, Ph.D., FGSA, FCSBC

Charge Gradient PatentCharge Gradient Patent Results from the construction of the simulation Results from the construction of the simulation

lead to a patent application utilizing charge-lead to a patent application utilizing charge-gradients as a means of inhibiting and/or gradients as a means of inhibiting and/or stopping T. cruzi invasionstopping T. cruzi invasion

Patent application on 4 June 2004 has been Patent application on 4 June 2004 has been awarded provisional patent to VPP teamawarded provisional patent to VPP team

Literature research indicates that the Literature research indicates that the methodology may be applicable to a class of methodology may be applicable to a class of organisms including malaria.organisms including malaria.

Page 27: In Silico Laboratories: The Virtual Parasite Project - An Overview SURA Grid Computing in the Life Sciences - 2006 Tarynn M. Witten, Ph.D., FGSA, FCSBC

Patent MethodologiesPatent MethodologiesArgument is based upon charge-charge Argument is based upon charge-charge

interaction between parasite andinteraction between parasite and– Charged nano-beads experiment was Charged nano-beads experiment was

completed by an NSF BBSI studentcompleted by an NSF BBSI student– In addition, other approaches such as In addition, other approaches such as

synthesis of peptide or RNA aptamers synthesis of peptide or RNA aptamers positively charged and used to test invasion positively charged and used to test invasion inhibition efficacyinhibition efficacy

Page 28: In Silico Laboratories: The Virtual Parasite Project - An Overview SURA Grid Computing in the Life Sciences - 2006 Tarynn M. Witten, Ph.D., FGSA, FCSBC

WHERE TO FROM HERE?WHERE TO FROM HERE?

Page 29: In Silico Laboratories: The Virtual Parasite Project - An Overview SURA Grid Computing in the Life Sciences - 2006 Tarynn M. Witten, Ph.D., FGSA, FCSBC

Where To From Here – 1 ?Where To From Here – 1 ?Include more biologically accurate Include more biologically accurate

mammalian host cell modelmammalian host cell modelDevelop more biologically accurate model of Develop more biologically accurate model of

parasiteparasiteExpand module for environmental Expand module for environmental

definitionsdefinitionsInclude van der Waals force calculationsInclude van der Waals force calculations

Page 30: In Silico Laboratories: The Virtual Parasite Project - An Overview SURA Grid Computing in the Life Sciences - 2006 Tarynn M. Witten, Ph.D., FGSA, FCSBC

Where To From Here – 2 ?Where To From Here – 2 ?Continue algorithm optimization and Continue algorithm optimization and

developmentdevelopment– Generalized to n-processor distributed grid Generalized to n-processor distributed grid

environmentsenvironments

Continue GUI interface developmentContinue GUI interface development– Extension to 3D visualization (cave)Extension to 3D visualization (cave)– VRML interfaceVRML interface

Page 31: In Silico Laboratories: The Virtual Parasite Project - An Overview SURA Grid Computing in the Life Sciences - 2006 Tarynn M. Witten, Ph.D., FGSA, FCSBC

Where To From Here – 3 ?Where To From Here – 3 ? Invasion moduleInvasion module

– Host-parasite proximity factorsHost-parasite proximity factors Inclusion of recognition factors, reorientation factors, attachment Inclusion of recognition factors, reorientation factors, attachment

factorsfactors

– Binding factorsBinding factors– Transformation for invasionTransformation for invasion

Signal transduction pathwaysSignal transduction pathways Membrane factorsMembrane factors Attachment factorsAttachment factors

– Physical InvasionPhysical Invasion

Page 32: In Silico Laboratories: The Virtual Parasite Project - An Overview SURA Grid Computing in the Life Sciences - 2006 Tarynn M. Witten, Ph.D., FGSA, FCSBC

Molecular Scale ModelsMolecular Scale Models Once parasites are in Once parasites are in

the proximity of the the proximity of the host cell, modeling host cell, modeling must account for the must account for the various phases of various phases of invasioninvasion

Page 33: In Silico Laboratories: The Virtual Parasite Project - An Overview SURA Grid Computing in the Life Sciences - 2006 Tarynn M. Witten, Ph.D., FGSA, FCSBC

Where To From Here – 4 ?Where To From Here – 4 ?

Extension to whole Extension to whole human modelhuman model– Inclusion of tissue Inclusion of tissue

alteration factorsalteration factors– Pharmaco-dynamic Pharmaco-dynamic

intervention factorsintervention factors– Re-infection factorsRe-infection factors

Page 34: In Silico Laboratories: The Virtual Parasite Project - An Overview SURA Grid Computing in the Life Sciences - 2006 Tarynn M. Witten, Ph.D., FGSA, FCSBC

Take Home MessageTake Home Message Computational Biology is not just about “omics”Computational Biology is not just about “omics” Ultra-large scale environments are actively involved in Ultra-large scale environments are actively involved in

addressing complex biomedical problems at super-genomic addressing complex biomedical problems at super-genomic levelslevels

These environments have similar problems with respect to These environments have similar problems with respect to programming, visualization, user interface design, data programming, visualization, user interface design, data storage and access as the “omic” environments havestorage and access as the “omic” environments have

In silicoIn silico laboratories are the next extension of HPC to laboratories are the next extension of HPC to biomedical research and educationbiomedical research and education

Such laboratories can lead to insights into biomedical Such laboratories can lead to insights into biomedical dynamics that were not here-to-fore envisioneddynamics that were not here-to-fore envisioned

Page 35: In Silico Laboratories: The Virtual Parasite Project - An Overview SURA Grid Computing in the Life Sciences - 2006 Tarynn M. Witten, Ph.D., FGSA, FCSBC

Thank You For ComingThank You For Coming

Page 36: In Silico Laboratories: The Virtual Parasite Project - An Overview SURA Grid Computing in the Life Sciences - 2006 Tarynn M. Witten, Ph.D., FGSA, FCSBC

Simulation Benchmark Timing -1Simulation Benchmark Timing -1

-1

-0.5

0

0.5

1

1.5

2

2.5

3

100 1,000 10,000 100,000

Number of Parasites in Simulation

log(

Tot

al S

imul

atio

n T

ime)

124

Page 37: In Silico Laboratories: The Virtual Parasite Project - An Overview SURA Grid Computing in the Life Sciences - 2006 Tarynn M. Witten, Ph.D., FGSA, FCSBC

Simulation Benchmark Timing –2Simulation Benchmark Timing –2

-1.5

-1

-0.5

0

0.5

1

1.5

2

2.5

3

100 1,000 10,000 100,000

Number of Parasites in Simulation

log(

Tot

al S

imu

lati

on T

ime) 1

1 Threaded