Upload
medku
View
258
Download
3
Tags:
Embed Size (px)
DESCRIPTION
http://www.ncbi.nlm.nih.gov/pubmed/20236959 J R Soc Interface. 2010 Sep 6;7(50):1341-54. doi: 10.1098/rsif.2010.0063. Epub 2010 Mar 17. Topological network alignment uncovers biological function and phylogeny. Kuchaiev O, Milenkovic T, Memisevic V, Hayes W, Przulj N. http://www.ncbi.nlm.nih.gov/pubmed/19259413 Cancer Inform. 2008;6:257-73. Epub 2008 Apr 14. Uncovering biological network function via graphlet degree signatures. Milenkoviฤ T, Przulj N.
Citation preview
Topological network alignment
20131216Statistics journal
Result
G H
G(V, E) H(U, F)
EC = 0.089
Motivation
HumanYeast
Are two networks the same or similar?
large-scale networks such as interactome
Theoretical background
Network or GraphCollection of nodes (vertex) and connections between them (edges).Biology, social communication, and web pages
Theoretical background
Graph G and HNode set V and U (V U)Edge set E V*V and F U*UPossible graphs: for G
G H
G(V, E) H(U, F)
Theoretical background
Graph comparisonSubgraph isomorphismIs G an exact subgraph of H?NP-completeEfficient algorithms are not known.
Graph alignmentFitting G into HEdge correctness (EC): the % of E aligned to FNP-hard
G H
G(V, E) H(U, F)
Previous approaches
Local alignment : ambiguous, different pairingMapping are chosen independently for local regions of similarity.PathBLAST : homology informationNetworkBLAST : conserved protein clusters with likelihood methodMaWISh : evolution (sequence alignment)GRAEMLIN : dense conserved subgraph with phylogeny
Global alignmentProvide unique alignment from each node in smaller graph to exactly one node in larger graphISORANK : maximize overall matchGRAEMLIN : training from known graph alignments and phylogeny
New approaches
Never use a priori informationSequence, Homology, Clusters, Phylogeny ,and Known alignments
Topological similarityOrbit, graphlet, and signature similarity
Of course, a priori information can be used.
ใใใ GRAAL ใชใใญ
n-node graphlet and automorphism orbits
n-node graphlet and automorphism orbits
graphlet
orbit
Topologically relevant
Topologically relevant
Topologically relevant
Graphlet Degree Vector
Graphlet Degree Vector
Graphlet Degree Vector
Graphlet Degree Vector
n-node graphlet and automorphism orbits
Orbit 15 in touches orbit 0, 1, 4, and 15 once.
Signature similarityWeight vector
[0, 1] 1 means is not affected by any other orbit.
๐15=4 ๐44=5
Signature similarity
Node , denotes the i-th coordinates of its signature vector. The distance is the i-th orbits of nodes and is
The total distance between and is
The signature similarity is
S = 1 is that and are identical (D = 0).
GRAph ALigner algorithm (GRAAL)
Compute costs of aligning each node with each node .
This matrix is row V and col U (all pairs of nodes).Align the densest parts (the minimal cost nodes, seed).Greedily alignment in the sphere.Repeat * while all nodes in the smaller graph will be aligned.
GRAAL uses only topological information.Biological information can be used by the equation
G H
G(V, E) H(U, F)
density topology
: degree of node
*
GRAALSearch the densest part and align.
Search the minimal cost nodes pair (seed).If multi-minimal cost pairs, chosen randomly.
G(V, E) H(U, F)
GRAALSearch the densest part and align.
Search the minimal cost nodes pair (seed).If multi-minimal cost pairs, chosen randomly.
Seed nodes pair
G(V, E) H(U, F)
GRAALMake spheres and align.
Make sphere .Greedily align and with the minimal cost.
๐ข๐ฃ
G(V, E) H(U, F)
: length of the shortest path
GRAALMake spheres and align.
Make sphere .Greedily align and with the minimal cost.
๐ข๐ฃ
G(V, E) H(U, F)
๐๐ฎ (๐ฃ ,๐ )
๐๐ฏ (๐ข ,๐ )
: length of the shortest path
GRAALMake spheres and align.
Make sphere .Greedily align and with the minimal cost.
๐ข๐ฃ
G(V, E) H(U, F)
๐๐ฎ (๐ฃ ,๐ )
๐๐ฏ (๐ข ,๐ )
Align
: length of the shortest path
GRAALExpand radii of spheres and align.
๐ข๐ฃ
: length of the shortest path
G(V, E) H(U, F)
๐๐ฎ (๐ฃ ,๐ )
๐๐ฏ (๐ข ,๐ )
Make sphere .Greedily align and with the minimal cost.
Aligned node
GRAALExpand radii of spheres and align.
๐ข๐ฃ
: length of the shortest path
G(V, E) H(U, F)
๐๐ฎ (๐ฃ ,๐ )
๐๐ฏ (๐ข ,๐ )
Make sphere .Greedily align and with the minimal cost.
Aligned node
radii :
GRAALExpand radii of spheres up to 3.
๐ข๐ฃ
: length of the shortest path
G(V, E) H(U, F)
๐๐ฎ (๐ฃ ,๐ )
๐๐ฏ (๐ข ,๐ )
Make sphere .Greedily align and with the minimal cost.
Aligned node
GRAALExpand radii of spheres up to 3.
๐ข๐ฃ
: length of the shortest path
G(V, E) H(U, F)
๐๐ฎ (๐ฃ ,๐ )
๐๐ฏ (๐ข ,๐ )
Make sphere .Greedily align and with the minimal cost.
Aligned node
radii :
GRAALExpand radii of spheres up to 3.
๐ข๐ฃ
: length of the shortest path
G(V, E) H(U, F)
๐๐ฎ (๐ฃ ,๐ )
๐๐ฏ (๐ข ,๐ )
Some nodes are not aligned.
Make sphere .Greedily align and with the minimal cost.
Aligned node
radii :
GRAALRepeat with new edge networks .
๐ฎ๐ (๐ฝ ,๐ฌ๐ )
The distance between and , Aligned node
๐ โค2๐ฏ ๐ (๐ผ ,๐ญ๐ )
๐๐ฏ ๐ (๐ข ,๐ )
: length of the shortest path
๐๐ฎ๐ (๐ฃ ,๐ )
๐ โค2
๐=1
๐=1
GRAALRepeat with new edge networks .
๐ฎ๐ (๐ฝ ,๐ฌ๐ )
The distance between and , Aligned node
๐ โค2
๐๐ฎ๐ (๐ฃ ,๐ )๐=1
: length of the shortest path
edge()
edge
Path: 6 12 25 can be replaced by , which is analogous for insertion or deletion.
GRAALRepeat with new edge networks .
๐ฎ๐ (๐ฝ ,๐ฌ๐ )
The distance between and , Aligned node
๐ โค2
New seed
๐ฏ ๐ (๐ผ ,๐ญ๐ )
๐๐ฏ ๐ (๐ข ,๐ )
: length of the shortest path
New seed
๐๐ฎ๐ (๐ฃ ,๐ )
๐ โค2
๐=1
๐=1
GRAALRepeat with new edge networks .
๐ฎ๐ (๐ฝ ,๐ฌ๐ )
The distance between and , Aligned node
๐ โค2
New seed
๐ฏ ๐ (๐ผ ,๐ญ๐ )
๐๐ฏ ๐ (๐ข ,๐ )
: length of the shortest path
New seed
๐๐ฎ๐ (๐ฃ ,๐ )
๐ โค2
๐=1
๐=1
GRAALNodes in G are aligned to exactly one node in H.
The distance between and , Aligned node
: length of the shortest path
G(V, E) H(U, F)
Alignment scoreEdge correctness: the % of edges in G are aligned to edges in H.
Node correctness: the % of nodes in G are aligned to nodes in H.Correct mapping is needed.
Interaction correctness: the % of interactions that aligned correctly.Correct interaction is needed.
G H
G(V, E) H(U, F)
GRAAL function
The correct node mapping G to H๐ :๐ฝโ๐ผ๐ :๐ฝโ๐ผ
Statistical significance
: a random mapping between nodes in G(V, E) and H(U, F).The probability P of successfully aligning k or more edges by chance is the tail of the hypergeometric distribution:
G H
G(V, E) H(U, F)
๐1=|๐|๐=โ๐=๐
๐ 2 (๐2
๐ )(๐โ๐2
๐1โ๐ )( ๐๐1
)
The number of edges from G that are aligned to edges in H.
The number of node pairs in H.
Edge correctness
Result
G H
G(V, E) H(U, F)
EC = 0.089