46
Detection and approximation of linear structures in metric spaces Stanford - July 12, 2012 MMDS 2012 Joint work with M. Aanjaneya, D. Chen, M. Glisse, L. Guibas, D. Morozov and on-going work with Jian Sun Fr´ ed´ eric Chazal Geometrica group INRIA Saclay

INRIA Saclay Geometrica group Fr ed eric Chazal · MMDS 2012 Joint work with M. Aanjaneya, D. Chen, M. Glisse, L. Guibas, D. Morozov ... Interpretation of the output of the algorithm

  • Upload
    vodung

  • View
    214

  • Download
    0

Embed Size (px)

Citation preview

Page 1: INRIA Saclay Geometrica group Fr ed eric Chazal · MMDS 2012 Joint work with M. Aanjaneya, D. Chen, M. Glisse, L. Guibas, D. Morozov ... Interpretation of the output of the algorithm

Detection and approximation of linearstructures in metric spaces

Stanford - July 12, 2012

MMDS 2012

Joint work with M. Aanjaneya, D. Chen, M. Glisse, L. Guibas, D. Morozovand on-going work with Jian Sun

Frederic ChazalGeometrica group

INRIA Saclay

Page 2: INRIA Saclay Geometrica group Fr ed eric Chazal · MMDS 2012 Joint work with M. Aanjaneya, D. Chen, M. Glisse, L. Guibas, D. Morozov ... Interpretation of the output of the algorithm

Introduction

• Branching filamentary structures (metric graphs) appear in a wide of realworld data sets.

• Data possibly not embedded in Euclidean space and only coming with(local) pairwise distance information → metric spaces

Problem:Can we recover the underlying metric graph structure from approximating data?

Page 3: INRIA Saclay Geometrica group Fr ed eric Chazal · MMDS 2012 Joint work with M. Aanjaneya, D. Chen, M. Glisse, L. Guibas, D. Morozov ... Interpretation of the output of the algorithm

1

Input: a metric space (Y, dY) (e.g. a point cloud with distance matrix).

Problem statement

Y

Page 4: INRIA Saclay Geometrica group Fr ed eric Chazal · MMDS 2012 Joint work with M. Aanjaneya, D. Chen, M. Glisse, L. Guibas, D. Morozov ... Interpretation of the output of the algorithm

Basically, a metric graph is a continuous metric space that has the structure of a graph, with edges and vertices, such that the edge interiors also belong to the space, and distances along each edge are given by arclength

1

Promise: the input is close to some unknown metric graph X

X

Input: a metric space (Y, dY) (e.g. a point cloud with distance matrix).

Problem statement

Y

Page 5: INRIA Saclay Geometrica group Fr ed eric Chazal · MMDS 2012 Joint work with M. Aanjaneya, D. Chen, M. Glisse, L. Guibas, D. Morozov ... Interpretation of the output of the algorithm

Basically, a metric graph is a continuous metric space that has the structure of a graph, with edges and vertices, such that the edge interiors also belong to the space, and distances along each edge are given by arclength

1

Promise: the input is close to some unknown metric graph X

X

Input: a metric space (Y, dY) (e.g. a point cloud with distance matrix).

A metric graph is a path metric space (X, dX) thatis homeomorphic to a 1-dimensional stratified space.A vertex of X is a 0-dimensional stratum of X andan edge of X is a 1-dimensional stratum of X.

the distance between any pair of points is equal to the infimum ofthe lengths of the continuous curves joining them

Problem statement

Y

Page 6: INRIA Saclay Geometrica group Fr ed eric Chazal · MMDS 2012 Joint work with M. Aanjaneya, D. Chen, M. Glisse, L. Guibas, D. Morozov ... Interpretation of the output of the algorithm

Basically, a metric graph is a continuous metric space that has the structure of a graph, with edges and vertices, such that the edge interiors also belong to the space, and distances along each edge are given by arclength

1

Promise: the input is close to some unknown metric graph X

Output:

- a metric graph (X, dX) that is close and if possible homeomorphic to X

X X

Input: a metric space (Y, dY) (e.g. a point cloud with distance matrix).

Problem statement

Y

Page 7: INRIA Saclay Geometrica group Fr ed eric Chazal · MMDS 2012 Joint work with M. Aanjaneya, D. Chen, M. Glisse, L. Guibas, D. Morozov ... Interpretation of the output of the algorithm

Basically, a metric graph is a continuous metric space that has the structure of a graph, with edges and vertices, such that the edge interiors also belong to the space, and distances along each edge are given by arclength

1

Promise: the input is close to some unknown metric graph X

Output:

- a metric graph (X, dX) that is close and if possible homeomorphic to X

X X

Input: a metric space (Y, dY) (e.g. a point cloud with distance matrix).

Problem statement

Y

- a map Y→ X that roughly preserves distances (quasi-isometry)

Page 8: INRIA Saclay Geometrica group Fr ed eric Chazal · MMDS 2012 Joint work with M. Aanjaneya, D. Chen, M. Glisse, L. Guibas, D. Morozov ... Interpretation of the output of the algorithm

Basically, a metric graph is a continuous metric space that has the structure of a graph, with edges and vertices, such that the edge interiors also belong to the space, and distances along each edge are given by arclength

1

Promise: the input is close to some unknown metric graph X

Output:

- a metric graph (X, dX) that is close and if possible homeomorphic to X

Input: a metric space (Y, dY) (e.g. a point cloud with distance matrix).

Problem statement

- a map Y→ X that roughly preserves distances (quasi-isometry)

In this talk:

- A few basic topological and geometric ideas to address this problem.

- Topology guaranteed graph reconstruction based upon degree inference.

- “Linear structure” detection and tree reconstruction through metric hyper-bolic geometry.

Page 9: INRIA Saclay Geometrica group Fr ed eric Chazal · MMDS 2012 Joint work with M. Aanjaneya, D. Chen, M. Glisse, L. Guibas, D. Morozov ... Interpretation of the output of the algorithm

Metric spaces approximation

An ε-correspondence between (X, dX) and (Y, dY) is a set C ⊂ X× Y s. t.

1. for any x ∈ X (resp. y ∈ Y), there exists y ∈ Y (resp. x ∈ X) s. t. (x, y) ∈ C.

2. For any (x, y), (x′, y′) ∈ C, |dX(x, x′)− dY(y, y′)| ≤ ε.

X

YC

dGH(X,Y) = 1

2inf{ε > 0 : there exists an ε-correspondence betweenXandY }

Let (X, dX), (Y, dY) be two metric spaces.

x x′

y

y′

The Gromov-Hausdorff distance:

Page 10: INRIA Saclay Geometrica group Fr ed eric Chazal · MMDS 2012 Joint work with M. Aanjaneya, D. Chen, M. Glisse, L. Guibas, D. Morozov ... Interpretation of the output of the algorithm

• The degree of a point x on X is the number of connected components of asufficiently small (intrinsic) sphere centered at x.

• Vertices of X are the points with degree 1 or larger than 2.

• The degree of (most) points can be inferred from Y by looking at (intrinsic)spherical shells:- chose a parameter r > 0,- degr(y) = nb. of clusters of BY(y, 5r/3) \ BY(y, r) (single linkage).

X Y

A first approach using the local topology of data[Aanjaneya, C., Chen, Glisse, Guibas, Morozov, SoCG’11, IJCGA (2012)]

Page 11: INRIA Saclay Geometrica group Fr ed eric Chazal · MMDS 2012 Joint work with M. Aanjaneya, D. Chen, M. Glisse, L. Guibas, D. Morozov ... Interpretation of the output of the algorithm

The algorithmInput: (Y, dY) approximating a metric graph (X, dX) and parameter r > 0.

Output: A metric graph (X, dX).

1. Labelling points as edge or branch• for all y ∈ Y do• if degr(y) = 2 then label y as an edge point.• else label y as a branch point.• Label all points within distance 2r from a preliminarybranch point as branch points.

Page 12: INRIA Saclay Geometrica group Fr ed eric Chazal · MMDS 2012 Joint work with M. Aanjaneya, D. Chen, M. Glisse, L. Guibas, D. Morozov ... Interpretation of the output of the algorithm

The algorithmInput: (Y, dY) approximating a metric graph (X, dX) and parameter r > 0.

Output: A metric graph (X, dX).

Graph structure reconstruction• E← points of Y labeled as edge points; V← points of Ylabeled as branch points.• Clusterize E and V (single linkage)• Vertices of X↔ clusters of V.• Put an edge between vertices of X if their correspondingclusters are close to a same cluster of E.

Page 13: INRIA Saclay Geometrica group Fr ed eric Chazal · MMDS 2012 Joint work with M. Aanjaneya, D. Chen, M. Glisse, L. Guibas, D. Morozov ... Interpretation of the output of the algorithm

The algorithmInput: (Y, dY) approximating a metric graph (X, dX) and parameter r > 0.

Output: A metric graph (X, dX).

Reconstructing the metric• To each edge e of X assign a length equal to the diameterof the corresponding cluster of E plus 4r.

Page 14: INRIA Saclay Geometrica group Fr ed eric Chazal · MMDS 2012 Joint work with M. Aanjaneya, D. Chen, M. Glisse, L. Guibas, D. Morozov ... Interpretation of the output of the algorithm

Theoretical guarantees

Let (Y, dY) be an ε-approximation of a metric graph (X, dX) for some ε > 0.Let b > 0 be the length of the shortest edge of X.

If r > 0 is correctly chosen with respect to ε and b then :

• Topological Reconstruction theorem: X is homeomorphic to X.

• Metric Reconstruction theorem: there is an almost distance preservingmap (quasi-isometry) φ : X→ X.

• Quasi-isometric embedding theorem: there is an almost distance preserv-ing map (quasi-isometry) ψ : Y→ X.

Page 15: INRIA Saclay Geometrica group Fr ed eric Chazal · MMDS 2012 Joint work with M. Aanjaneya, D. Chen, M. Glisse, L. Guibas, D. Morozov ... Interpretation of the output of the algorithm

Experimental results

• Data: GPS traces (sampled curves along a road network).

• (Y, dY): a neighborhood graph (Rips) built on the data set with its intrinsicmetric.

Page 16: INRIA Saclay Geometrica group Fr ed eric Chazal · MMDS 2012 Joint work with M. Aanjaneya, D. Chen, M. Glisse, L. Guibas, D. Morozov ... Interpretation of the output of the algorithm

Experimental results

• Data: earthquakes epicenters (preprocessed to remove “outliers/noise”).

• (Y, dY): a neighborhood graph (Rips) with its intrinsic metric.

Page 17: INRIA Saclay Geometrica group Fr ed eric Chazal · MMDS 2012 Joint work with M. Aanjaneya, D. Chen, M. Glisse, L. Guibas, D. Morozov ... Interpretation of the output of the algorithm

Advantages and drawbacks of the previous algorithm

• A simple algorithm for metric graph reconstruction:- coming with topological and metric guarantees,- relying on intrinsic metric information (no need of coordinates).

Good points:

Page 18: INRIA Saclay Geometrica group Fr ed eric Chazal · MMDS 2012 Joint work with M. Aanjaneya, D. Chen, M. Glisse, L. Guibas, D. Morozov ... Interpretation of the output of the algorithm

Advantages and drawbacks of the previous algorithm

• A simple algorithm for metric graph reconstruction:- coming with topological and metric guarantees,- relying on intrinsic metric information (no need of coordinates).

• Choice of the parameter r (that relies on ε and b): how to do it inpractice? How to make r dependent of the local quality of the approxi-mation?

• Interpretation of the output of the algorithm when the sampling condi-tions are not fullfilled: the algorithm might not output a graph (edgesadjacent to more than 2 vertices).

Weak points and open questions:

Good points:

Page 19: INRIA Saclay Geometrica group Fr ed eric Chazal · MMDS 2012 Joint work with M. Aanjaneya, D. Chen, M. Glisse, L. Guibas, D. Morozov ... Interpretation of the output of the algorithm

Another approach based upon basic metric geometry[F. C., Jian Sun, work in progress]

• In metric trees the geodesic triangles/tetrahedra have a specific shape :metric trees are 0-hyperbolic.

x

yz

w

x

y

z

• Locally, metric graphs are trees...

Page 20: INRIA Saclay Geometrica group Fr ed eric Chazal · MMDS 2012 Joint work with M. Aanjaneya, D. Chen, M. Glisse, L. Guibas, D. Morozov ... Interpretation of the output of the algorithm

Another approach based upon basic metric geometry[F. C., Jian Sun, work in progress]

• In metric trees the geodesic triangles/tetrahedra have a specific shape :metric trees are 0-hyperbolic.

x

yz

w

x

y

z

• Locally, metric graphs are trees...

Page 21: INRIA Saclay Geometrica group Fr ed eric Chazal · MMDS 2012 Joint work with M. Aanjaneya, D. Chen, M. Glisse, L. Guibas, D. Morozov ... Interpretation of the output of the algorithm

Another approach based upon basic metric geometry[F. C., Jian Sun, work in progress]

• In metric trees the geodesic triangles/tetrahedra have a specific shape :metric trees are 0-hyperbolic.

x

yz

w

x

y

z

Goal: Use these remarks/properties to reconstruct metric trees and graphsfrom approximations.

• Locally, metric graphs are trees...

Page 22: INRIA Saclay Geometrica group Fr ed eric Chazal · MMDS 2012 Joint work with M. Aanjaneya, D. Chen, M. Glisse, L. Guibas, D. Morozov ... Interpretation of the output of the algorithm

δ-hyperbolic spaces

Let (X, dX) be a metric space.

Given δ > 0, X is δ-hyperbolic if for any x, y, z, w ∈ X the two largest of thedistance sums dX(x,w)+dX(y, z), dX(x, y)+dX(z, w) and dX(x, z)+dX(y, w)differ by at most 2δ.

w

x

yz

d(x, y) + d(z, w)d(x,w) + d(z, y)d(x, z) + d(y, w)

Page 23: INRIA Saclay Geometrica group Fr ed eric Chazal · MMDS 2012 Joint work with M. Aanjaneya, D. Chen, M. Glisse, L. Guibas, D. Morozov ... Interpretation of the output of the algorithm

Real tree: 1. any two points are the ends of a unique geodesic segment; 2. the union of two geodesicsegments intersecting exactly at one of their ends is a geodesic segment. A geodesic metric space is0-hyperbolic iff it is a real tree.

δ-hyperbolic spaces

Given δ > 0, X is δ-hyperbolic if for any x, y, z, w ∈ X if the two largest of thedistance sums dX(x,w)+dX(y, z), dX(x, y)+dX(z, w) and dX(x, z)+dX(y, w)differ by at most 2δ.

w

x

y

z

d(x, y) + d(z, w) = d(x,w) + d(z, y)

• Metric trees are 0-hyperbolic. This is indeed a characterization of (real) trees.

Basic properties:

Page 24: INRIA Saclay Geometrica group Fr ed eric Chazal · MMDS 2012 Joint work with M. Aanjaneya, D. Chen, M. Glisse, L. Guibas, D. Morozov ... Interpretation of the output of the algorithm

Real tree: 1. any two points are the ends of a unique geodesic segment; 2. the union of two geodesicsegments intersecting exactly at one of their ends is a geodesic segment. A geodesic metric space is0-hyperbolic iff it is a real tree.

δ-hyperbolic spaces

Given δ > 0, X is δ-hyperbolic if for any x, y, z, w ∈ X if the two largest of thedistance sums dX(x,w)+dX(y, z), dX(x, y)+dX(z, w) and dX(x, z)+dX(y, w)differ by at most 2δ.

w

x

y

z

d(x, y) + d(z, w) = d(x,w) + d(z, y)

• Metric trees are 0-hyperbolic. This is indeed a characterization of (real) trees.

• if dGH(X,Y) < ε then (X is δ-hyperbolic) ⇒ (Y is (δ + 2ε)-hyperbolic).

Basic properties:

Page 25: INRIA Saclay Geometrica group Fr ed eric Chazal · MMDS 2012 Joint work with M. Aanjaneya, D. Chen, M. Glisse, L. Guibas, D. Morozov ... Interpretation of the output of the algorithm

Real tree: 1. any two points are the ends of a unique geodesic segment; 2. the union of two geodesicsegments intersecting exactly at one of their ends is a geodesic segment. A geodesic metric space is0-hyperbolic iff it is a real tree.

δ-hyperbolic spaces

Given δ > 0, X is δ-hyperbolic if for any x, y, z, w ∈ X if the two largest of thedistance sums dX(x,w)+dX(y, z), dX(x, y)+dX(z, w) and dX(x, z)+dX(y, w)differ by at most 2δ.

• Metric trees are 0-hyperbolic. This is indeed a characterization of (real) trees.

• if dGH(X,Y) < ε then (X is δ-hyperbolic) ⇒ (Y is (δ + 2ε)-hyperbolic).

Basic properties:

Question: Can we use δ-hyperbolicity tocharacterize (metric) data that are “close”(w.r.t. dGH) to a metric tree?

Page 26: INRIA Saclay Geometrica group Fr ed eric Chazal · MMDS 2012 Joint work with M. Aanjaneya, D. Chen, M. Glisse, L. Guibas, D. Morozov ... Interpretation of the output of the algorithm

Real tree: 1. any two points are the ends of a unique geodesic segment; 2. the union of two geodesicsegments intersecting exactly at one of their ends is a geodesic segment. A geodesic metric space is0-hyperbolic iff it is a real tree.

δ-hyperbolic spaces

Given δ > 0, X is δ-hyperbolic if for any x, y, z, w ∈ X if the two largest of thedistance sums dX(x,w)+dX(y, z), dX(x, y)+dX(z, w) and dX(x, z)+dX(y, w)differ by at most 2δ.

• Metric trees are 0-hyperbolic. This is indeed a characterization of (real) trees.

• if dGH(X,Y) < ε then (X is δ-hyperbolic) ⇒ (Y is (δ + 2ε)-hyperbolic).

Basic properties:

Question: Can we use δ-hyperbolicity tocharacterize (metric) data that are “close”(w.r.t. dGH) to a metric tree?

Not so easy:the Poincare disc is log(3)-hyperbolic!

Page 27: INRIA Saclay Geometrica group Fr ed eric Chazal · MMDS 2012 Joint work with M. Aanjaneya, D. Chen, M. Glisse, L. Guibas, D. Morozov ... Interpretation of the output of the algorithm

Distance functions and persistence tree

Let (X, dX) be a compact path metric space. Given r ∈ X, let d(.) = dX(r, .)the distance function to r in X (assume d has a finite number of local maxima).

Xr

Equivalence relation: x ∼ y iff d(x) = d(y)and there exists a continuous path from x to ycontained in d−1([d(x),+∞)).

Page 28: INRIA Saclay Geometrica group Fr ed eric Chazal · MMDS 2012 Joint work with M. Aanjaneya, D. Chen, M. Glisse, L. Guibas, D. Morozov ... Interpretation of the output of the algorithm

Distance functions and persistence tree

Let (X, dX) be a compact path metric space. Given r ∈ X, let d(.) = dX(r, .)the distance function to r in X (assume d has a finite number of local maxima).

Xr

Equivalence relation: x ∼ y iff d(x) = d(y)and there exists a continuous path from x to ycontained in d−1([d(x),+∞)).

Persistence (merge) tree: T = X/ ∼

T

Page 29: INRIA Saclay Geometrica group Fr ed eric Chazal · MMDS 2012 Joint work with M. Aanjaneya, D. Chen, M. Glisse, L. Guibas, D. Morozov ... Interpretation of the output of the algorithm

Distance functions and persistence tree

Let (X, dX) be a compact path metric space. Given r ∈ X, let d(.) = dX(r, .)the distance function to r in X (assume d has a finite number of local maxima).

Xr

Equivalence relation: x ∼ y iff d(x) = d(y)and there exists a continuous path from x to ycontained in d−1([d(x),+∞)).

Persistence (merge) tree: T = X/ ∼

T

Lemma:(i) T is a tree; its leaves are in one-to-one correspondence with the localmaxima of d.(ii) T inherits a metric structure from dT(Πr,Πx) := dX(r, x) for all x ∈ X.

Let Π : X→ T be the quotient map.

In practice: T can be efficiently constructed using an union find data structure(0-persistence algorithm) on the input data endowed with a neighboring graph.

Page 30: INRIA Saclay Geometrica group Fr ed eric Chazal · MMDS 2012 Joint work with M. Aanjaneya, D. Chen, M. Glisse, L. Guibas, D. Morozov ... Interpretation of the output of the algorithm

Tree reconstructionLet (X, dX) be a compact path metric space. Given r ∈ X, let d(.) = dX(r, .)the distance function to r in X (assume d has a finite number of local maxima).

Xr

T

Lemma: if X is a metric tree then T is isometricto X.

Page 31: INRIA Saclay Geometrica group Fr ed eric Chazal · MMDS 2012 Joint work with M. Aanjaneya, D. Chen, M. Glisse, L. Guibas, D. Morozov ... Interpretation of the output of the algorithm

Tree reconstructionLet (X, dX) be a compact path metric space. Given r ∈ X, let d(.) = dX(r, .)the distance function to r in X (assume d has a finite number of local maxima).

Xr

T

Lemma: if X is a metric tree then T is isometricto X.

What can we say about dGH(X,T) when X isδ-hyperbolic?

→ In general nothing... But....

Page 32: INRIA Saclay Geometrica group Fr ed eric Chazal · MMDS 2012 Joint work with M. Aanjaneya, D. Chen, M. Glisse, L. Guibas, D. Morozov ... Interpretation of the output of the algorithm

Tree reconstruction

Xr

T

mx

x

Let (X, dX) be a compact path metric space. Given r ∈ X, let d(.) = dX(r, .)the distance function to r in X (assume d has a finite number of local maxima).

For x ∈ X let mx ∈ X be in the c.c. ofd−1([d(x),+∞)) containing x that maximizes d onthis c.c. and let f : X→ R+ be defined by

f(x) = (r|mx)x := dX(x, r)+dX(x,mx)−dX(r,mx).

Not unique in general

In practice: f can be easily computed as T is constructed (using the 0-persistence algorithm).

Gromov product

Page 33: INRIA Saclay Geometrica group Fr ed eric Chazal · MMDS 2012 Joint work with M. Aanjaneya, D. Chen, M. Glisse, L. Guibas, D. Morozov ... Interpretation of the output of the algorithm

Tree reconstruction

Theorem: Let M = ‖f‖∞ = maxx∈X f(x). If X is δ-hyperbolic then

dGH(X,T) < M + 9δ

and for any x, y ∈ X, |dT (Πx,Πy)− dX(x, y)| ≤ 2(M + 9δ).

Xr

T

mx

x

Let (X, dX) be a compact path metric space. Given r ∈ X, let d(.) = dX(r, .)the distance function to r in X (assume d has a finite number of local maxima).

For x ∈ X let mx ∈ X be in the c.c. ofd−1([d(x),+∞)) containing x that maximizes d onthis c.c. and let f : X→ R+ be defined by

f(x) = (r|mx)x := dX(x, r)+dX(x,mx)−dX(r,mx).

Not unique in general

Gromov product

Page 34: INRIA Saclay Geometrica group Fr ed eric Chazal · MMDS 2012 Joint work with M. Aanjaneya, D. Chen, M. Glisse, L. Guibas, D. Morozov ... Interpretation of the output of the algorithm

Tree reconstruction

Theorem: Let M = ‖f‖∞ = maxx∈X f(x). If X is δ-hyperbolic then

dGH(X,T) < M + 9δ

and for any x, y ∈ X, |dT (Πx,Πy)− dX(x, y)| ≤ 2(M + 9δ).

Xr

T

mx

x

Let (X, dX) be a compact path metric space. Given r ∈ X, let d(.) = dX(r, .)the distance function to r in X (assume d has a finite number of local maxima).

For x ∈ X let mx ∈ X be in the c.c. ofd−1([d(x),+∞)) containing x that maximizes d onthis c.c. and let f : X→ R+ be defined by

f(x) = (r|mx)x := dX(x, r)+dX(x,mx)−dX(r,mx).

Not unique in general

Corollary: there exists a constant C < 100 s.t. if T0 is a tree and ifdGH(X,T0) < ε then dGH(T,T0) < Cε.

Gromov product

Page 35: INRIA Saclay Geometrica group Fr ed eric Chazal · MMDS 2012 Joint work with M. Aanjaneya, D. Chen, M. Glisse, L. Guibas, D. Morozov ... Interpretation of the output of the algorithm

Some practical considerationsWhen X is a neighboring graph (e.g. Rips-Vietoris) built on top of a finitemetric data set (Y, dY)...

• T, Π : Y → T and M can be computedefficiently using the 0-persistence algorithmin almost linear time (in the number of edgesof X)!

Page 36: INRIA Saclay Geometrica group Fr ed eric Chazal · MMDS 2012 Joint work with M. Aanjaneya, D. Chen, M. Glisse, L. Guibas, D. Morozov ... Interpretation of the output of the algorithm

Some practical considerationsWhen X is a neighboring graph (e.g. Rips-Vietoris) built on top of a finitemetric data set (Y, dY)...

• T, Π : Y → T and M can be computedefficiently using the 0-persistence algorithmin almost linear time (in the number of edgesof X)!

• δ can also be computed from the data but inO(|Y |4)→ can be improved to O(|Y |3);→ ε-nets (making the complexity dependentof the geometry)→ random samples of tetrahedra?

Page 37: INRIA Saclay Geometrica group Fr ed eric Chazal · MMDS 2012 Joint work with M. Aanjaneya, D. Chen, M. Glisse, L. Guibas, D. Morozov ... Interpretation of the output of the algorithm

Some practical considerationsWhen X is a neighboring graph (e.g. Rips-Vietoris) built on top of a finitemetric data set (Y, dY)...

• T, Π : Y → T and M can be computedefficiently using the 0-persistence algorithmin almost linear time (in the number of edgesof X)!

• δ can also be computed from the data but inO(|Y |4)→ can be improved to O(|Y |3);→ ε-nets (making the complexity dependentof the geometry)→ random samples of tetrahedra?

⇒ An algorithm that always output a tree and an upperbound on the Gromov-Hausdorff distance between the reconstructed tree and the data (no parameterto choose except to build X from Y).

Warning: even if dGH(X,T0) << 1, T is in general not homeomorphic to T0.

Page 38: INRIA Saclay Geometrica group Fr ed eric Chazal · MMDS 2012 Joint work with M. Aanjaneya, D. Chen, M. Glisse, L. Guibas, D. Morozov ... Interpretation of the output of the algorithm

From trees to graphs

• Locally, metric graphs are trees: if G is a metric graph and l(G) is thelength of the shortest non-null homotopic simple path then for any r ∈ G,(B(r, l(G)/4), dG) is a metric tree.

r

Page 39: INRIA Saclay Geometrica group Fr ed eric Chazal · MMDS 2012 Joint work with M. Aanjaneya, D. Chen, M. Glisse, L. Guibas, D. Morozov ... Interpretation of the output of the algorithm

From trees to graphs

• Locally, metric graphs are trees: if G is a metric graph and l(G) is thelength of the shortest non-null homotopic simple path then for any r ∈ G,(B(r, l(G)/4), dG) is a metric tree.

→ Data can be “semi-locally” approximated bytrees.

Page 40: INRIA Saclay Geometrica group Fr ed eric Chazal · MMDS 2012 Joint work with M. Aanjaneya, D. Chen, M. Glisse, L. Guibas, D. Morozov ... Interpretation of the output of the algorithm

From trees to graphs

• Locally, metric graphs are trees: if G is a metric graph and l(G) is thelength of the shortest non-null homotopic simple path then for any r ∈ G,(B(r, l(G)/4), dG) is a metric tree.

→ Data can be “semi-locally” approximated bytrees.

Using topological persistence:

• topology (homology) of G[C., de Silva, Oudot ’12],

• estimation of l(G) [Dey,Sun, Wang ’10].

Page 41: INRIA Saclay Geometrica group Fr ed eric Chazal · MMDS 2012 Joint work with M. Aanjaneya, D. Chen, M. Glisse, L. Guibas, D. Morozov ... Interpretation of the output of the algorithm

Advantages and drawbacks of the previous approach

Good points:

• A simple and efficient algorithm for tree recontruction/approximation.

• No parameter (except for the construction of the neighboring graph)and upper bounds!

• “Semi-local” exploration of the structure of metric data (δ-hyperbolicitycan be localized).

Page 42: INRIA Saclay Geometrica group Fr ed eric Chazal · MMDS 2012 Joint work with M. Aanjaneya, D. Chen, M. Glisse, L. Guibas, D. Morozov ... Interpretation of the output of the algorithm

Advantages and drawbacks of the previous approach

Good points:

• A simple and efficient algorithm for tree recontruction/approximation.

• No parameter (except for the construction of the neighboring graph)and upper bounds!

• “Semi-local” exploration of the structure of metric data (δ-hyperbolicitycan be localized).

Weak points:

• No guarantees on the topology (homeomorphism).

• No global graph reconstruction:- covering the data by trees and gluying them together might leadto choice of parameters issues and pretty bad bounds on the metricapproximation.

Page 43: INRIA Saclay Geometrica group Fr ed eric Chazal · MMDS 2012 Joint work with M. Aanjaneya, D. Chen, M. Glisse, L. Guibas, D. Morozov ... Interpretation of the output of the algorithm

Advantages and drawbacks of the previous approach

Good points:

• A simple and efficient algorithm for tree recontruction/approximation.

• No parameter (except for the construction of the neighboring graph)and upper bounds!

• “Semi-local” exploration of the structure of metric data (δ-hyperbolicitycan be localized).

Weak points:

• No guarantees on the topology (homeomorphism).

• No global graph reconstruction:- covering the data by trees and gluying them together might leadto choice of parameters issues and pretty bad bounds on the metricapproximation.

However...

Page 44: INRIA Saclay Geometrica group Fr ed eric Chazal · MMDS 2012 Joint work with M. Aanjaneya, D. Chen, M. Glisse, L. Guibas, D. Morozov ... Interpretation of the output of the algorithm

Perspectives

The previous ideas and results can be used to prove some approximation resultsfor the Reeb graph of the distance to a point in a path metric space.

Xr

Equivalence relation: x ∼ y iff d(x) = d(y)and x and y are contained in the same c.c. ofd−1(d(x)).

Reeb graph: G = X/ ∼

G

Page 45: INRIA Saclay Geometrica group Fr ed eric Chazal · MMDS 2012 Joint work with M. Aanjaneya, D. Chen, M. Glisse, L. Guibas, D. Morozov ... Interpretation of the output of the algorithm

Perspectives

The previous ideas and results can be used to prove some approximation resultsfor the Reeb graph of the distance to a point in a path metric space.

Page 46: INRIA Saclay Geometrica group Fr ed eric Chazal · MMDS 2012 Joint work with M. Aanjaneya, D. Chen, M. Glisse, L. Guibas, D. Morozov ... Interpretation of the output of the algorithm

Thank you for your attention!