1
New possibilities 1. EBI data pack – database modules for main databases supported by EBI: Ensembl, UniProt, ChEBI,Reactome, IntAct, GO, BioModels, SBO. These databases are installed on the special BioUML server and can be easily searched and used by BioUML workbench (client). 2. Composite database module – allows a user to specify a set of databases that he would like to use simultaneously as a source of components during creation and editing of diagrams. 3. Composite diagrams (hierarchical) – allow a user to join several existing diagrams/models into a bigger one. 4. Experiment allows to specify conditions (initial values, parameters, external stimulus, etc.) for virtual experiments on the diagram. 5. BioHub – an approach that allows to arrange relationships between biological objects from different databases formally and use this information for searching. Results of such search can be presented as an editable graph. 6. Graphic notation editor – it allows an advanced user to create new graphic notation or extend an existing one. As an example SBGN graphic notation was created. 7. Microarray data analysis - allows a user to load his microarray data, identify up/down regulated genes and highligt diagram elements according to microarray data or results of this data analysis. BioUML – new concepts and possibilities Nikita Tolstykh, Sergey Lapukhov, Ilya Kiselev, Aleksey Shadrin and Fedor Kolpakov Institute of Systems Biology, Novosibirsk, Russia; Design Technological Institute of Digital Techniques SB RAS, Novosibirsk, Russia *Contacts: [email protected] 0100100010011101 ISB 0100100010011101 ISB Metaphor Biological systems reconstruction is like solitaire (patience) game: desk – BioUML editor; solitaire – biological pathway; cards – biological objects (genes, proteins, lipids, etc.); pack of cardsdifferent biological databases. Levels of biological information BioHub concept BioHUB – an approach link information from different databases. Main usage: • binding microarray (omics) data to pathway diagrams; • graph search; • DBReferences editor; • microarray (omics) data analysis. BioHUB is conceptually similar with MIRIAM standard and defines: • references to database objects; • relationships between biological objects (for example: is-a, part-of, form-of ). Each bar corresponds to one value from corresponding microarray series. BioHub usage: graph search engine Graphic notation editor Motivation Reconstruction of complex biological systems requires integrated software that provides: 1. integration with a wide range of biological databases; 2. integration with omics data; 3. powerful search capabilities; 4. visual modelling approach; 5. client-server architecture for team work. refers refers refers refers refers U niProt Ensem bl ChEBI GO Level1: C atalogs Level2: Pathw ays, m odels GeneM odels B iologicalobjects Level3: Problem specific BMOND Experimentaldata, results ofanalysis BioH U B OM ICS data Results ofanalysis M SigDB G eneAtlas, NCI60 Externaldatabases: -KEGG -LipidMap,LipidBank -R eactom e,… C yclonet -leads -actions -targets LipidNet classifications: -lipids -genes U biProt classifications: E1,E2,E3,… w i k i w i k i w i k i Main ideas for data integration and pathway reconstruction: - escape information duplication. - classify components of biological pathways by levels. - each next level should refer but do not duplicate information from previous levels. - use free EBI databases whenever it is possible. BioHub usage: coloring diagrams according to microarray data BioUMLworkbench BioUML (http://www.biouml.org) is an open source integrated Java workbench for systems biology. Its core is a meta-model that provides an abstract layer for comprehensive formal description of wide range of biological and other complex systems as a compartmentalized graph. Connection types: 1) directed – input output. Transformation function can be used; 2) undirected – contact. Indicates that 2 nodes in mode is the same entity. Semantic constraints: There are semantic constraints, for example: block can have only one input for each variable. Two inputs are forbidden for the same variable. Flat model: Before Matlab or Java code generation composite model is transformed into flat model and usual genertions routines are used. block (EM odel) dx/dt= f1 dy/dt= f2 z = f3 block 2 (EM odel) dx/dt= f5 dy/dt= f6 + z k +z+f4 = 0 x x y y subdiagram (EModel) R s1 s2 e x e block 3 (EM odel) dx/dt= f5 dy/dt= f6 + block2.k k+z+f4 = 0 Indirectlink x x forbidden R s2 s3 s1 s1 f(x) s4 directparticipation ofsubdiagram element in a reaction Composite diagram (hierarchical) : main concepts Block types: 1) block only mathematical equations. Used mainly for physiological models; 2) subdiagram – other diagram Virtual experiment concept To make a virtual experiment it is frequently needed to modify initial model. To skip model duplications for each virtual experiment we introduce “changes” concept. Typical modifications (changes) are: • changing of initial values changing of model parameters to imitate different conditions or mutations • deleting of some model elements to imitate knock-out mutations • adding events to imitate external influences on the model. The figure below demonstrates how changes are shown on diagram for pRB knock-out experiment. Conclusion We believe that new version of BioUML workbench will greatly simplify, speed up and improve the process of formal description and reconstruction of biological pathways. Acknowledgements This work was supported by FP6 grant 037590 “Net2Drug”, FP7 grant 090107 "LipidomicNet" and interdisciplinary project 46 of SB RAS.

New possibilities 1. EBI data pack – database modules for main databases supported by EBI: Ensembl, UniProt, ChEBI,Reactome, IntAct, GO, BioModels, SBO

Embed Size (px)

Citation preview

Page 1: New possibilities 1. EBI data pack – database modules for main databases supported by EBI: Ensembl, UniProt, ChEBI,Reactome, IntAct, GO, BioModels, SBO

New possibilities1. EBI data pack – database modules for main databases supported by EBI: Ensembl, UniProt, ChEBI,Reactome, IntAct, GO, BioModels, SBO. These databases are installed on the special BioUML server and can be easily searched and used by BioUML workbench (client).

2. Composite database module – allows a user to specify a set of databases that he would like to use simultaneously as a source of components during creation and editing of diagrams.

3. Composite diagrams (hierarchical) – allow a user to join several existing diagrams/models into a bigger one.

4. Experiment – allows to specify conditions (initial values, parameters, external stimulus, etc.) for virtual experiments on the diagram.

5. BioHub – an approach that allows to arrange relationships between biological objects from different databases formally and use this information for searching. Results of such search can be presented as an editable graph.

6. Graphic notation editor – it allows an advanced user to create new graphic notation or extend an existing one. As an example SBGN graphic notation was created.

7. Microarray data analysis - allows a user to load his microarray data, identify up/down regulated genes and highligt diagram elements according to microarray data or results of this data analysis.

BioUML – new concepts and possibilitiesNikita Tolstykh, Sergey Lapukhov, Ilya Kiselev, Aleksey Shadrin and Fedor Kolpakov

Institute of Systems Biology, Novosibirsk, Russia; Design Technological Institute of Digital Techniques SB RAS, Novosibirsk, Russia

*Contacts: [email protected]

0100100010011101

ISB0100100010011101

ISB

Metaphor

Biological systems reconstruction is like solitaire (patience) game:desk – BioUML editor;solitaire – biological pathway;cards – biological objects (genes, proteins, lipids, etc.);pack of cards– different biological databases.

Levels of biological information

BioHub concept BioHUB – an approach link information from different databases. Main usage:• binding microarray (omics) data to pathway diagrams;• graph search;• DBReferences editor;• microarray (omics) data analysis.BioHUB is conceptually similar with MIRIAM standard and defines:• references to database objects;• relationships between biological objects (for example: is-a, part-of, form-of ).

Each bar corresponds to one value from corresponding microarray series.

BioHub usage: graph search engine

Graphic notation editor

MotivationReconstruction of complex biological systems requires integrated software that provides: 1. integration with a wide range of biological databases;2. integration with omics data;3. powerful search capabilities; 4. visual modelling approach;5. client-server architecture for team work. refers refers

refers refers refers

UniProtEnsembl ChEBI GOL evel 1: C atalog s

L evel 2: P athways ,models

GeneModels

B iolog ic al objec ts

L evel 3: P roblems pec ific

BMOND

E xperimental data,res ults of analys is

BioHUB

OMICS data

Results of analysis

MSigDB GeneAtlas,NCI60

E xternal databases :- K E G G- L ipidMap, L ipidB ank- R eactome, …

Cyclonet- leads- actions- targets

LipidNetclassifications:- lipids- genes

UbiProtclassifications:E1, E2, E3, …

wiki

wiki

wiki

Main ideas for data integration and pathway reconstruction: - escape information duplication.- classify components of biological pathways by levels.- each next level should refer but do not duplicate information from previous levels.- use free EBI databases whenever it is possible.

BioHub usage: coloring diagrams according to microarray data

BioUMLworkbenchBioUML (http://www.biouml.org) is an open source integrated Java workbench for systems biology. Its core is a meta-model that provides an abstract layer for comprehensive formal description of wide range of biological and other complex systems as a compartmentalized graph.

Connection types:1) directed – input output. Transformation function can be used;2) undirected – contact. Indicates that 2 nodes in mode is the same entity.

Semantic constraints:There are semantic constraints, for example: block can have only one input for each variable. Two inputs are forbidden for the same variable.

Flat model:Before Matlab or Java code generation composite model is transformed into flat model and usual genertions routines are used.

block (EModel)dx/dt = f1dy/dt = f2z = f3

block 2 (EModel)dx/dt = f5dy/dt = f6 + z k+z+f4 = 0

x x

y y

subdiagram (EModel)

Rs1 s2

e

x

e

block 3 (EModel)dx/dt = f5 dy/dt = f6 + block2.k k+z+f4 = 0

Indirect linkx

xforbidden

Rs2 s3

s1

s1

f(x)

s4direct participation of subdiagram elementin a reaction

Composite diagram (hierarchical) : main concepts

Block types:1) block – only mathematical equations. Used mainly for physiological models;2) subdiagram – other diagram

Virtual experiment concept To make a virtual experiment it is frequently needed to modify initial model. To skip model duplications for each virtual experiment we introduce “changes” concept. Typical modifications (changes) are:

• changing of initial values• changing of model parameters to imitate different conditions or mutations• deleting of some model elements to imitate knock-out mutations• adding events to imitate external influences on the model.The figure below demonstrates how changes are shown on diagram for pRB knock-out experiment.

ConclusionWe believe that new version of BioUML workbench will greatly simplify, speed up and improve the process of formal description and reconstruction of biological pathways.

AcknowledgementsThis work was supported by FP6 grant 037590 “Net2Drug”, FP7 grant 090107 "LipidomicNet" and interdisciplinary project 46 of SB RAS.