Upload
truongdiep
View
212
Download
0
Embed Size (px)
Citation preview
Systems biology
Systems biology is a biology-based inter-
disciplinary study field that focuses on the systematic study of complex interactions in biological
systems, thus using a new perspective (integrationinstead of reduction) to study them.
Systems biology
Systems biology is a biology-based inter-
disciplinary study field that focuses on the systematic study of complex interactions in biological
systems, thus using a new perspective (integrationinstead of reduction) to study them.
integration of complex interactions in a systematic way
Make use of rich public data● Thousands of expression datasets available
– GEO & ArrayExpress● Protein-protein interactions
– HPRD, Intact● Curated pathway information
– KEGG, Reactome● Annotations
– Gene Ontology, literature● Regulatory knowlede
– miRNA targets
– Transcription factor targets
Merge -omics
● Each dataset can be considered as a layer
● Combinations of different layers brings us to new knowledge
Real world zoom-in-out
● One gene with one function and few interactions
● Find more genes having similar properties● Connect the genes based on their
functions● Model the network in order to understand
the mechanisms
Real problems
We have more data than we can handle with traditional methods. Data has exploded
at all (regulatory) levels
Functions are distributed – cancer is usually not caused by one faulty gene
Motivation
● Network modelling helps to understand the whole picture
● Steady states represent for example differentiation stages
● Modelling as checking network reconstruction
● SQUAD dynamically models regulatory networks
Studing regulatory networks
● Modelling the behavior– ODEs describe reaction kinetics
● very demanding on the input data
– Boolean ● basic on/off data, discrete time steps
– Stochastic● time delayed reactions
TF-DNA binding
● Array design● Replicates
Replicates may go wrong Promoters are only part of the picture
● Each program gives different results● Intersection of targets are most reliable● Small number of targets is not bad
Peak finding
information layers
Oct4 targets from RNAi
Oct4 direct targets from ChIP-chip
Oct4 indirect targets from ChIP-chip
Previously published data:Oct4 ChIP-chip & RNAi
Sox2 & Nanog RNAiSox2 & Nanog ChIP-chip
DatabaseClustering genes by behaviour
Network
Research cycles
Biological experiments
Raw data analysis
Combination of datasets
Network building
Network modelling
Identification of new candidates
Aim of the project
● Identify genes playing a role in keeping cells in pluripotency and driving early differentiation using Boolean modelling
Network reconstruction
Pertubation experiments in hES and hEC cells
− Oct4, Sox2, Nanog − Gadd45g, Bmp4, Fgf2, ActA
Oct4
Oct4
RNAi Targets on microarrayTargets on microarray
Filtering the network
● Target first receptors and ligands (GO)– cell surface receptor linked signal transduction
– receptor binding
● Target genes having at least 5 incoming regulatory edges
– genes under strong regulation in ESC
Basics of Boolean modelling
● Each network node can be either ON or OFF
● Synchronous & asynchronous modelling -– nodes change their state one-by-one or
simultaneously
Steady states
● Network states where some genes are active, some inactive and they represent relevant biological conditions– pluripotency (OCT4, SOX2, NANOG active)– differentiation (GADD45G, BMP4 active)
Perturbations
● A node is activated on inhibited – overexpression– knockdown
● Signal is carried forward and affects the rest of the network
● A way to measure the effect of a node
Expanding and perturbing the network
Introduce a feedback loopPerturbe the nodeMeasure the effect on the network
Summary
● SysBio tries to find the behavior of the system
● Deals with interactions, not the properties of the parts
● We don't know how to build perfect models
Systems biology project
● Human data
– Given:
● Protein-protein interaction● Co-expression
– Find:
● Additional data layer (e.g. ChIP)
● Task
– Identify main hubs (5 at least, their function)
– Find largest components (after ribosome, proteasome), what is their biological function? Why are they connected?
– Find as many network motifs as possible from the data
– Write a short overview and be prepared to represent it