Upload
others
View
1
Download
0
Embed Size (px)
Citation preview
Entropic Measure and Wasserstein Diffusion
Max-K. von Renesse, Karl-Theodor Sturm
Abstract
We construct a new random probability measure on the sphere and on the unit intervalwhich in both cases has a Gibbs structure with the relative entropy functional as Hamiltonian.It satisfies a quasi-invariance formula with respect to the action of smooth diffeomorphismof the sphere and the interval respectively. The associated integration by parts formula isused to construct two classes of diffusion processes on probability measures (on the sphere orthe unit interval) by Dirichlet form methods. The first one is closely related to MalliavinâsBrownian motion on the homeomorphism group. The second one is a probability valuedstochastic perturbation of the heat flow, whose intrinsic metric is the quadratic Wassersteindistance. It may be regarded as the canonical diffusion process on the Wasserstein space.
1 Introduction
(a) Equipped with the L2-Wasserstein distance dW (cf. (2.1)), the space P(M) of probabilitymeasures on an Euclidean or Riemannian space M is itself a rich object of geometric interest.Due to the fundamental works of Y. Brenier, R. McCann, F. Otto, C. Villani and many others(see e.g. [Bre91, McC97, CEMS01, Ott01, OV00, Vil03]) there are well understood and pow-erful concepts of geodesics, exponential maps, tangent spaces T”P(M) and gradients Du(”) offunctions on this space. In a certain sense, P(M) can be regarded as an infinite dimensionalRiemannian manifold, or at least as an infinite dimensional Alexandrov space with nonnegativelower curvature bound if the base manifold (M,d) has nonnegative sectional curvature.A central role is played by the relative entropy : P(M) â R âȘ +â with respect to theRiemannian volume measure dx on M
Ent(”) = â«
M Ï log Ï dx, if d”(x) dx with Ï(x) = d”(x)dx
+â, else.
The relative entropy as a function on the geodesic space (P(M), dW ) is K-convex for a givennumber K â R if and only if the Ricci curvature of the underlying manifold M is bounded frombelow by K, [vRS05, Stu06]. The gradient flow for the relative entropy in the geodesic space(P(M), dW ) is given by the heat equation â
ât” = â” on M , [JKO98]. More generally, a largeclass of evolution equations can be treated as gradient flows for suitable free energy functionalsS : P(M) â R, [Vil03].
What is missing until now, is a natural âRiemannian volume measureâ P on P(M). The basic re-quirement will be an integration by parts formula for the gradient. This will imply the closabilityof the pre-Dirichlet form
E(u, v) =â«P(M)
ăDu(”), Dv(”)ăT” dP(”)
in L2(P(M),P), â which in turn will be the key tool in order to develop an analytic and stochasticcalculus on P(M). In particular, it will allow us to construct a kind of Laplacian and a kindof Brownian motion on P(M). Among others, we intend to use the powerful machinery ofDirichlet forms to study stochastically perturbed gradient flows on P(M) which â on the level
1
of the underlying spaces M â will lead to a new concept of SPDEs (preserving probability byconstruction).Instead of constructing a âuniform distributionâ P on P(M), for various reasons, we prefer toconstruct a probability measure PÎČ on P(M) formally given as
dPÎČ(”) =1ZÎČ
eâÎČ·Ent(”) dP(”) (1.1)
for ÎČ > 0 and some normalization constant ZÎČ. (In the language of statistical mechanics, ÎČ isthe âinverse temperatureâ and ZÎČ the âpartition functionâ whereas the entropy plays the role ofa Hamiltonian.)
(b) One of the basic results of this paper is the rigorous construction of such a entropic measurePÎČ in the one-dimensional case, i.e. M = S1 or M = [0, 1]. We will essentially make use of therepresentation of probability measures by their inverse distributions function g”. It allows totransfer the problem of constructing a measure PÎČ on the space of probability measures P([0, 1])(or P(S1)) into the problem of constructing a measure QÎČ
0 (or QÎČ) on the space G0 (or G, resp.)of nondecreasing functions from [0, 1] (or S1, resp.) into itself.In terms of the measure QÎČ
0 on G0, for instance, the formal characterization (1.1) then reads asfollows
dQÎČ0 (g) =
1ZÎČ
eâÎČ·S(g) dQ0(g). (1.2)
Here Q0 denotes some âuniform distributionâ on G0 â L2([0, 1]) and S : G0 â [0,â] is theentropy functional
S(g) := Ent(gâLeb) = ââ« 1
0log gâČ(t) dt.
This representation is reminiscent of Feynmanâs heuristic picture of the Wiener measure, â nowwith the energy
H(g) =â« 1
0gâČ(t)2dt
of a path replaced by its entropy. QÎČ0 will turn out to be (the law of) the Dirichlet process or
normalized Gamma process.
(c) The key result here is the quasi-invariance â or in other words a change of variable formula âfor the measure PÎČ (or PÎČ0 ) under push-forwards ” 7â hâ” by means of smooth diffeomorphismsh of S1 (or [0, 1], resp.). This is equivalent to the quasi-invariance of the measure QÎČ undertranslations g 7â h g of the semigroup G by smooth h â G. The density
dPÎČ(hâ”)dPÎČ(”)
= XÎČh · Y
0h (”)
consists of two terms. The first one
XÎČh (”) = exp
(ÎČ
â«S1
log hâČ(t)d”(t))
can be interpreted as exp(âÎČEnt(hâ”))/ exp(âÎČEnt(”)) in accordance with our formal inter-pretation (1.1). The second one
Y 0h (”) =
âIâgaps(”)
âhâČ(Iâ) · hâČ(I+)|h(I)|/|I|
2
can be interpreted as the change of variable formula for the (non-existing) measure P. Heregaps(”) denotes the set of intervals I =]Iâ, I+[â S1 of maximal length with ”(I) = 0. Note thatPÎČ is concentrated on the set of ” which have no atoms and not absolutely continuous parts andwhose supports have Lebesgue measure 0.
(d) The tangent space at a given point ” in P = P(S1) (or in P0 = P([0, 1])) will be anappropriate completion of the space Câ(S1,R) (or Câ([0, 1],R), resp.). The action of a tangentvector Ï on ” (âexponential mapâ) is given by the push forward Ïâ”. This leads to the notionof the directional derivative
DÏu(”) = limtâ0
1t
[u((Id+ tÏ)â”)â u(”)]
for functions u : P â R. The quasi-invariance of the measure PÎČ implies an integration by partsformula (and thus the closability)
DâÏu = âDÏuâ VÏ Â· u
with drift VÏ = limtâ01t (Y
ÎČId+tÏ â 1).
The subsequent construction will strongly depend on the choice of the norm on the tangentspaces T”P. Basically, we will encounter two important cases.
(e) Choosing T”P = Hs(S1,Leb) for some s > 1/2 â independent of ” â leads to a regular,local, recurrent Dirichlet form E on L2(P,PÎČ) by
E(u, u) =â«P
ââk=1
|DÏku(”)|2 dPÎČ(”).
where ÏkkâN denotes some complete orthonormal system in the Sobolev space Hs(S1). Ac-cording to the theory of Dirichlet forms on locally compact spaces [FOT94], this form is associ-ated with a continuous Markov process on P(S1) which is reversible with respect to the measurePÎČ. Its generator is given by
12
âk
DÏkDÏk
+12
âk
VÏk·DÏk
. (1.3)
This process (gt)tâ„0 is closely related to the stochastic processes on the diffeomorphism groupof S1 and to the âBrownian motionâ on the homeomorphism group of S1, studied by Airault,Fang, Malliavin, Ren, Thalmaier and others [AMT04, AM06, AR02, Fan02, Fan04, Mal99].These are processes with generator 1
2
âkDÏk
DÏk. Hence, one advantage of our approach is to
identify a probability measure PÎČ such that these processes â after adding a suitable drift âare reversible.Moreover, previous approaches are restricted to s â„ 3/2 whereas our construction applies to allcases s > 1/2.
(f) Choosing T”G = L2([0, 1], ”) leads to the Wasserstein Dirichlet form
E(u, u) =â«P0
âDu(”)â2L2(”) dP
ÎČ0 (”)
on L2(P0,PÎČ0 ). Its square field operator is the squared norm of the Wasserstein gradient and itsintrinsic distance (which governs the short time asymptotic of the process) coincides with theL2-Wasserstein metric. The associated continuous Markov process (”t)tâ„0 on P([0, 1]), which weshall call Wasserstein diffusion, is reversible w.r.t. the entropic measure PÎČ0 . It can be regardedas a stochastic perturbation of the Neumann heat flow on P([0, 1]) with small time Gaussianbehaviour measured in terms of kinetic energy.
3
2 Spaces of Probability Measures and Monotone Maps
The goal of this paper is to study stochastic dynamics on spaces P(M) in case M is the unitinterval [0, 1] or the unit circle S1.
2.1 The Spaces P0 = P([0, 1]) and G0
Let us collect some basic facts for the space P0 = P([0, 1]) of probability measures on the unitinterval [0, 1] the proofs of which can be found in the monograph [Vil03]. Equipped with theL2-Wasserstein distance dW , it is a compact metric space. Recall that
dW (”, Îœ) := infÎł
(â«â«[0,1]2
|xâ y|2Îł(dx, dy)
)1/2
, (2.1)
where the infimum is taken over all probability measures Îł â P([0, 1]2) having marginals ” andÎœ (i.e. Îł(AĂM) = ”(A) and Îł(M ĂB) = Îœ(B) for all A,B âM).Let G0 denote the space of all right continuous nondecreasing maps g : [0, 1[â [0, 1] equippedwith the L2-distance
âg1 â g2âL2 =(â« 1
0|g1(t)â g2(t)|2dt
)1/2
.
Moreover, for notational convenience each g â G0 is extended to the full interval [0, 1] by g(1) :=1. The map
Ï : G0 â P0, g 7â gâLeb
(= push forward of the Lebesgue measure on [0, 1] under the map g) establishes an isometrybetween (G0, â.âL2) and (P0, dW ). The inverse map Ïâ1 : P0 â G0, ” 7â g” assigns to eachprobability measure ” â P0 its inverse distribution function defined by
g”(t) := infs â [0, 1] : ”[0, s] > t (2.2)
with inf â := 1. In particular, for all ”, Îœ â P0
dW (”, Îœ) = âg” â gÎœâL2 . (2.3)
For each g â G0 the generalized inverse gâ1 â G0 is defined by gâ1(t) = infs â„ 0 : g(s) > t.Obviously,
âg1 â g2âL1 = âgâ11 â gâ1
2 âL1 (2.4)
(being simply the area between the graphs) and (gâ1)â1 = g. Moreover, gâ1(g(t)) = t for allt provided gâ1 is continuous. (Note that under the measure QÎČ
0 to be constructed below thelatter will be satisfied for a.e. g â G0.)On G0, there exist various canonical topologies: the L2-topology of G0 regarded as subset ofL2([0, 1],R); the image of the weak topology on P0 under the map Ïâ1 : ” 7â g” (= inversedistribution function); the image of the weak topology on P0 under the map ” 7â gâ1
” (=distribution function). All these â and several other â topologies coincide.
Proposition 2.1. For each sequence (gn)n â G0, each g â G0 and each p â [1,â[ the followingare equivalent:
(i) gn(t) â g(t) for each t â [0, 1] in which g is continuous;
(ii) gn â g in Lp([0, 1]);
(iii) gâ1n â gâ1 in Lp([0, 1]);
4
(iv) ”gn â ”g weakly;
(v) ”gn â ”g in dW .
In particular, G0 is compact.
Let us briefly sketch the main arguments of the
Proof. Since all the functions gn and gâ1n are bounded, properties (ii) and (iii) obviously are
independent of p. The equivalence of (ii) and (iii) for p = 1 was already stated in (2.4) and theequivalence between (ii) for p = 2 and (v) was stated in (2.3). The equivalence of (iv) and (v)is the well known fact that the Wasserstein distance metrizes the weak topology. Another wellknown characterization of weak convergence states that (iv) is equivalent to (iâ): gâ1
n (t) â gâ1(t)for each t â [0, 1] in which gâ1 is continuous. Finally, (iâČ) â (i) according to the equivalence(ii) â (iii) which allows to pass from convergence of distribution functions gâ1
n to convergenceof inverse distribution functions gn. The last assertion follows from the compactness of P0 inthe weak topology.
2.2 The Spaces G, G1 and P = P(S1)
Throughout this paper, S1 = R/Z will always denote the circle of length 1. It inherits the groupoperation + from R with neutral element 0. For each x, y â S1 the positively oriented segmentfrom x to y will be denoted by [x, y] and its length by |[x, y]|. If no ambiguity is possible, thelatter will also be denoted by y â x. In contrast to that, |x â y| will denote the S1-distancebetween x and y. Hence, in particular, |[y, x]| = 1 â |[x, y]| and |x â y| = min|[y, x]|, |[x, y]|.A family of points t1, . . . , tN â S1 is called an âordered familyâ if
âNi=1 |[ti, ti+1]| = 1 with
tN+1 := t1 (or in other words if all the open segments ]ti, ti+1[ are disjoint).Put
G(R) = g : R â R right continuous nondecreasing with g(x+ 1) = g(x) + 1 for all x â R.
Due to the required equivariance with respect to the group action of Z, each map g â G(R)induces uniquely a map Ï(g) : S1 â S1. Put G := Ï(G(R)). The monotonicity of the functionsin G(R) induces also a kind of monotonicity of maps in G: each continuous g â G will beorder preserving and homotopic to the identity map. In the sequel, however, we often will haveto deal with discontinuous g â G. The elements g â G will be called monotone maps of S1.G is a compact subspace of the L2-space of maps from S1 to S1 with metric âg1 â g2âL2 =(â«S1 |g1(t)â g2(t)|2dt
)1/2.With the composition of maps, G is a semigroup. Its neutral element e is the identity map.Of particular interest in the sequel will be the semigroup G1 = G/S1 where functions g, h â Gwill be identified if g(.) = h(.+ a) for some a â S1.
Proposition 2.2. The mapÏ : G1 â P, g 7â gâLeb
(= push forward of the Lebesgue measure on S1 under the map g) and its inverse Ïâ1 : P âG1, ” 7â g” (with g” as defined in (2.2)) establish an isometry between the space G1 equippedwith the induced L2-distance
âg1 â g2âG1 =(
infsâS1
â«S1
|g1(t)â g2(t+ s)|2dt)1/2
and the space P of probability measures on S1 equipped with the L2-Wasserstein distance. Inparticular, G1 is compact.
5
Proof. The bijectivity of Ï and Ïâ1 is clear. It remains to prove that
dW (”, Îœ) = âg” â gÎœâG1 (2.5)
for all ”, Îœ â P. Obviously, it suffices to prove this for all absolutely continuous ”, Îœ (orequivalently for strictly increasing g”, gÎœ) since the latter are dense in P (or in G1, resp.). Forsuch a pair of measures, there exists a map F : S1 â S1 (âtransport mapâ) which minimizes thetransportation costs [Vil03]. Fix any point in S1, say 0, and put s = F (0). Then the map F isa transport map for the mass ” on the segment ]0, 1[ onto the mass Îœ on the segment ]s, s+ 1[.Since these segments are isometric to the interval ]0, 1[, the results from the previous subsectionimply that the minimal cost for such a transport is given by
â«S1 |g1(t)â g2(t+ s)|2dt. Varying
over s finally proves the claim.
3 Dirichlet Process and Entropic Measure
3.1 Gibbsean Interpretation and Heuristic Derivation of the Entropic Mea-sure
One of the basic results of this paper is the rigorous construction of a measure PÎČ formally givenas (1.1) in the one-dimensional case, i.e. M = S1 or M = [0, 1]. We will essentially make useof the isometries Ï : G1 â P = P(S1), g 7â gâLeb and Ï : G0 â P0 = P([0, 1]). They allow totransfer the problem of constructing measures PÎČ on spaces of probability measures P (or P0)into the problem of constructing measures QÎČ (or QÎČ
0 ) on spaces of functions G1 (or G0, resp.).In terms of the measure QÎČ
0 on G0, for instance, the formal characterization (1.1) then reads asfollows
QÎČ0 (dg) =
1ZÎČ
eâÎČ·S(g) Q0(dg). (3.1)
Here Q0 denotes some âuniform distributionâ on G0 â L2([0, 1]) and S : G0 â [0,â] is theentropy functional S(g) := Ent(gâLeb). If g is absolutely continuous then S(g) can be expressedexplicitly as
S(g) = ââ« 1
0log gâČ(t) dt.
The representation (3.1) is reminiscent of Feynmanâs heuristic picture of the Wiener measure.Let us briefly recall the latter and try to use it as a guideline for our construction of the measureQÎČ
0 .
According to this heuristic picture, the Wiener measure PÎČ with diffusion constant Ï2 = 1/ÎČshould be interpreted (and could be constructed) as
PÎČ(dg) =1ZÎČ
eâÎČ·H(g) P(dg) (3.2)
with the energy functional H(g) = 12
â« 10 g
âČ(t)2dt. Here P(dg) is assumed to be the âuniformdistributionâ on the space Gâ of all continuous paths g : [0, 1] â R with g(0) = 0. Even ifsuch a uniform distribution existed, typically almost all paths g would have infinite energy.Nevertheless, one can overcome this difficulty as follows.Given any finite partition 0 = t0 < t1 < · · · < tN = 1 of [0, 1], one should replace the energyH(g) of the path g by the energy of the piecewise linear interpolation of g
HN (g) = inf H(g) : g â Gâ, g(ti) = g(ti) âi =Nâi=1
|g(ti)â g(tiâ1)|2
2(ti â tiâ1).
6
Then (3.2) leads to the following explicit representation for the finite dimensional distributions
PÎČ (gt1 â dx1, . . . , gtN â dxN ) =1
ZÎČ,Nexp
(âÎČ
2
Nâi=1
|xi â xiâ1|2
ti â tiâ1
)pN (dx1, . . . , xN ). (3.3)
Here pN (dx1, . . . , xN ) = P (gt1 â dx1, . . . , gtN â dxN ) should be a âuniform distributionâ on RN
and ZÎČ,N a normalization constant. Choosing pN to be the N -dimensional Lebesgue measuremakes the RHS of (3.3) a projective family of probability measures. According to Kolmogorovâsextension theorem this family has a unique projective limit, the Wiener measure PÎČ on Gâ withdiffusion constant Ï2 = 1/ÎČ.
Now let us try to follow this procedure with the entropy functional S(g) replacing the energyfunctional H(g). Given any finite partition 0 = t0 < t1 < · · · < tN < tN+1 = 1 of [0, 1], wewill replace the entropy S(g) of the path g by the entropy of the piecewise linear interpolationof g
SN (g) = inf S(g) : g â G0, g(ti) = g(ti) âi = âN+1âi=1
logg(ti)â g(tiâ1)ti â tiâ1
· (ti â tiâ1).
This leads to the following expression for the finite dimensional distributions
QÎČ0 (gt1 â dx1, . . . , gtN â dxN )
=1
ZÎČ,Nexp
(ÎČ
N+1âi=1
logxi â xiâ1
ti â tiâ1· (ti â tiâ1)
)qN (dx1 . . . dxN ) (3.4)
where qN (dx1, . . . , xN ) = Q0 (gt1 â dx1, . . . , gtN â dxN ) is a âuniform distributionâ on the simplexÎŁN =
(x1, . . . , xN ) â [0, 1]N : 0 < x1 < x2 . . . < xN < 1
and x0 := 0, xN+1 := 1.
What is a âcanonicalâ candidate for qN? A natural requirement will be the invariance property
qN (dx1, . . . , dxN ) = [(Îxiâ1,xi+k)â qk(dxi, . . . , dxi+kâ1)]dqNâk(dx1, . . . , dxiâ1, dxi+k, . . . , dxN ) (3.5)
for all 1 †k †N and all 1 †i †N â k + 1 with the convention x0 = 0, xN+1 = 1 and therescaling map Îa,b : ]0, 1[kâ Rk, yj 7â yj(bâ a) + a for j = 1, · · · , k.If the qN , N â N, were probability measures then the invariance property admits the followinginterpretation: under qN , the distribution of the (N â k)-tuple (x1, . . . , xiâ1, xi+k, . . . , xN ) isnothing but qNâk; and under qN , the distribution of the k-tuple (xi, . . . , xi+kâ1) of points inthe interval ]xiâ1, xk[ coincides â after rescaling of this interval â with qk. Unfortunately, nofamily of probability measures qN , N â N with property (3.5) exists. However, there is a familyof measures with this property.By iteration of the invariance property (3.5), the choice of the measure q1 on the intervalÎŁ1 = ]0, 1[ will determine all the measures qN , N â N. Moreover, applying (3.5) for N = 2,k = 1 and both choices of i yields[
(Î0,x1)â q1(dx2)]dq1(dx1) =
[(Îx2,1)â q1(dx1)
]dq1(dx2) (3.6)
for all 0 < x1 < x2 < 1. This reflects the intuitive requirement that there should be no differencewhether we first choose randomly x1 â ]0, 1[ and then x2 â ]x1, 1[ or the other way round, firstx2 â ]0, 1[ and then x1 â ]0, x2[.
Lemma 3.1. A family of measures qN , N â N, with continuous densities satisfies property (3.5)if and only if
qN (dx1, . . . , dxN ) = CNdx1 . . . dxN
x1 · (x2 â x1) · . . . · (xN â xNâ1) · (1â xN )(3.7)
for some constant C â R+.
7
Proof. If q1(dx) = Ï(x)dx then (3.6) is equivalent to
Ï(y) · Ï(x
y
)· 1y
= Ï(x) · Ï(y â x
1â x
)· 11â x
for all 0 < x < y < 1. For continuous Ï this implies that there exists a constant C â R+ suchthat Ï(x) = C
x(1âx) for all 0 < x < 1. Iterated inserting this into (3.5) yields the claim.
Let us come back to our attempt to give a meaning to the heuristic formula (3.1). Combining(3.4) with the choice (3.7) of the measure qN finally yields
QÎČ0 (gt1 â dx1, . . . , gtN â dxN )
=1
ZÎČ,N
N+1âi=1
(xi â xiâ1)ÎČ(tiâti1 ) dx1 . . . dxNx1 · (x2 â x1) · . . . · (1â xN )
(3.8)
with appropriate normalization constants ZÎČ,N . Now the RHS of this formula indeed turns outto define a consistent family of probability measures. Hence, by Kolmogorovâs extension theoremit admits a projective limit QÎČ
0 on the space G0. The push forward of this measure under thecanonical identification Ï : G0 â P0, g 7â gâLeb will be the entropic measure PÎČ0 which we werelooking for.The details of the rigorous construction of this measure as well as various properties of it willbe presented in the following sections.
3.2 The Measures QÎČ and PÎČ
The basic object to be studied in this section is the probability measure QÎČ on the space G.
Proposition 3.2. For each real number ÎČ > 0 there exists a unique probability measure QÎČ onG, called Dirichlet process, with the property that for each N â N and for each ordered familyof points t1, t2, . . . , tN â S1
QÎČ (gt1 â dx1, . . . , gtN â dxN ) =Î(ÎČ)âN
i=1 Î(ÎČ(ti+1 â ti))
Nâi=1
(xi+1 â xi)ÎČ(ti+1âti)â1dx1 . . . dxN .
(3.9)
The precise meaning of (3.9) is that for all bounded measurable u : (S1)N â Râ«Gu (gt1 , . . . , gtN ) dQÎČ(g)
=Î(ÎČ)âN
i=1 Î(ÎČ Â· |[ti, ti+1]|)
â«ÎŁN
u(x1, . . . , xN )Nâi=1
|[xi, xi+1]|ÎČ·|[ti,ti+1]|â1dx1 . . . dxN .
with ÎŁN =
(x1, . . . , xN ) â (S1)N :âN
i=1 |[xi, xi+1]| = 1
and xN+1 := x1, tN+1 := t1. In
particular, with N = 1 this meansâ«G u(gt)dQ
ÎČ(g) =â«S1 u(x)dx for each t â S1.
Proof. It suffices to prove that (3.9) defines a consistent family of finite dimensional distributions.The existence of QÎČ (as a âprojective limitâ) then follows from Kolmogorovâs extension theorem.The required consistency means that
Î(ÎČ)âNi=1 Î(ÎČ Â· |[ti, ti+1]|)
â«ÎŁN
Nâi=1
|[xi, xi+1]|ÎČ·|[ti,ti+1]|â1u(x1, . . . , xN ) dx1 . . . dxN
=Î(ÎČ)
Î(ÎČ Â· |[t1, t2]|) · . . . · Î(ÎČ Â· |[tkâ1, tk+1]|) · . . . · Î(ÎČ Â· |[tN , t1]|)
·â«
ÎŁNâ1
|[x1, x2]|ÎČ·|[t1,t2]|â1 · . . . · |[xkâ1, xk+1]|ÎČ·|[tkâ1,tk+1]|â1 · . . . · |[xN , x1]|ÎČ·|[tN ,t1]|â1
·v(x1, . . . , xkâ1, xk . . . , xN ) dx1 . . . dxkâ1dxk+1 . . . dxN
8
whenever u(x1, . . . , xN ) = v(x1, . . . , xkâ1, xk . . . , xN ) for all (x1, . . . xN ) â ÎŁN . The latter is animmediate consequence of the well-known fact (Eulerâs beta integral) thatâ«
[xkâ1,xk+1]|[xkâ1, xk]|ÎČ·|[tkâ1,tk]|â1 · |[xk, xk+1]|ÎČ·|[tk,tk+1]|â1 dxk
=Î(ÎČ Â· |[tkâ1, tk]|)Î(ÎČ Â· |[tk, tk+1]|)
Î(ÎČ Â· |[tkâ1, tk+1]|)|[xkâ1, xk+1]|ÎČ·|[tkâ1,tk+1]|â1.
For s â S1 let Ξs : G â G, g 7â g Ξs be the isomorphism of G induced by the rotationΞs : S1 â S1, t 7â t+ s. Obviously, the measure QÎČ on G is invariant under each of the maps Ξs.Hence, QÎČ induces a probability measure QÎČ
1 on the quotient spaces G1 = G/S1.Recall the definition of the map Ï : G â P, g 7â gâLeb. Since (g Ξs)âLeb = gâLeb thiscanonically extends to a map Ï : G1 â P. (As mentioned before, the latter is even an isometry.)
Definition 3.3. The entropic measure PÎČ on P is defined as the push forward of the Dirichletprocess QÎČ on G (or equivalently, of the measure QÎČ
1 on G1) under the map Ï. That is, for allbounded measurable u : P â Râ«
Pu(”) dPÎČ(”) =
â«Gu(gâLeb) dQÎČ(g).
3.3 The Measures QÎČ0 and PÎČ0
The subspaces g â G : g(0) = 0 and g â G0 : g(0) = 0 can obviously be identified.Conditioning the probability measure QÎČ onto this event thus will define a probability measureQÎČ
0 on G0. However, we prefer to give the direct construction of QÎČ0 .
Proposition 3.4. For each real number ÎČ > 0 there exists a unique probability measure QÎČ0 on
G0, called Dirichlet process, with the property that for each N â N and each family 0 = t0 <t1 < t2 < . . . < tN < tN+1 = 1
QÎČ0 (gt1 â dx1, . . . , gtN â dxN ) =
Î(ÎČ)âi Î(ÎČ Â· (ti+1 â ti))
âi
(xi+1 â xi)ÎČ·(ti+1âti)â1dx1 . . . dxN .
(3.10)
The precise meaning of (3.10) is that for all bounded measurable u : [0, 1]N â Râ«G0
u (gt1 , . . . , gtN ) dQÎČ0 (g)
=Î(ÎČ)âN
i=1 Î(ÎČ Â· (ti+1 â ti))
â«ÎŁN
u(x1, . . . , xN )Nâi=1
(xi+1 â xi)ÎČ·(ti+1âti)â1dx1 . . . dxN .
with ÎŁN =(x1, . . . , xN ) â [0, 1]N : 0 < x1 < x2 . . . < xn < 1
and xN+1 := x1, tN+1 := t1.
Remark 3.5. According to these explicit formulae, it is easy to calculate the moments of theDirichlet process. For instance,
EÎČ0 (gt) :=â«G0
gt dQÎČ0 (g) = t
andVarÎČ0 (gt) :=
â«G0
(gt â t)2 dQÎČ0 (g) =
11 + ÎČ
t(1â t)
for all ÎČ > 0 and all t â [0, 1].
9
Definition 3.6. The entropic measure PÎČ0 on P0 = P([0, 1]) is defined as the push forward ofthe Dirichlet process QÎČ
0 on G0 under the map Ï. That is, for all bounded measurable u : P0 â Râ«P0
u(”) dPÎČ0 (”) =â«G0
u(gâLeb) dQÎČ0 (g).
Remark 3.7. (i) According to the above construction QÎČ0 ( . ) = QÎČ( . |g(0) = 0) andâ«
G0
u(g) dQÎČ0 (g) =
â«Gu(g â g(0)) dQÎČ(g),
â«Gu(g) dQÎČ(g) =
â« 1
0
â«G0
u(g + x) dQÎČ0 (g) dx.
(ii) Analogously, the entropic measures on the sphere and on the are linked as followsâ«Pu(”) dPÎČ(”) =
â« 1
0
â«P0
u((Ξx)â”)dPÎČ0 (”) dx
or briefly
dPÎČ =â« 1
0
[(Ξx)âdPÎČ0
]dx
where Ξx : S1 â S1, y 7â x + y and Ξx : P â P : ” 7â (Ξx)â”. We would like to emphasize,however, that PÎČ 6= PÎČ0 . For instance, consider u(”) :=
â«f d” for some f : S1 â R (which may
be identified with f : [0, 1] â R). Thenâ«P(S1)
u(”) dPÎČ(”) =â«S1
f(x) dx
whereas â«P([0,1])
u(”) dPÎČ0 (”) =â«
[0,1]f(x)ÏÎČ(x) dx
with ÏÎČ(x) = Î(ÎČ)Î(ÎČt)Î(ÎČ(1ât))
â« 10 x
ÎČtâ1(1â x)ÎČ(1ât)â1 dt.
According to the last remark, it suffices to study in detail one of the four measures QÎČ, QÎČ0 ,
PÎČ, and PÎČ0 . We will concentrate in the rest of this chapter on the measure QÎČ0 which seems to
admit the most easy interpretations.
3.4 The Dirichlet Process as Normalized Gamma Process
We start recalling some basic facts about the real valued Gamma processes. For α > 0 denoteby G(α) the absolutely continuous probability measure on R+ with density 1
Î(α)xαâ1eâx.
Definition 3.8. A real valued Markov process (Îłt)tâ„0 starting in zero is called standard Gammaprocess if its increments Îłt â Îłs are independent and distributed according to G(t â s) for 0 â€s < t. Without loss of generality we may assume that almost surely the function tâ Îłt is rightcontinuous and nondecreasing.
Alternatively the Gamma-Process may be defined as the unique pure jump Levy process withLevy measure Î(dx) = 1ââx>0
eâx
x dx. The connection between pure jump Levy and Poisson pointprocesses gives rise to several other equivalent representations of the Gamma process [Kin93,
10
Ber99]. For instance, let Î = p = (px, py) â R2 be the Poisson point process on R+ĂR+ withintensity measure dxĂ Î(dy) with Î as above, then a Gamma process is obtained by
Îłt :=â
pâÎ :pxâ€tpy. (3.11)
For ÎČ > 0 the process Îłt·ÎČ is a Levy process with Levy measure ÎÎČ(dx) = ÎČ Â· 1ââx>0eâx
x dx. Itsincrements are distributed according to
P (ÎłÎČ·t â ÎłÎČ·s â dx) =1
Î(ÎČ Â· (tâ s))xÎČ·(tâs)â1eâxdx.
Proposition 3.9. For each ÎČ > 0, the law of the process (Îłt·ÎČÎłÎČ
)tâ[0,1] is the Dirichlet process QÎČ0 .
Proof. This well-known fact is easily obtained from Lukacsâ characterization of the Gammadistribution [EY04].
3.5 Support Properties
Proposition 3.10. (i) For each ÎČ > 0, the measure QÎČ0 has full support on G0.
(ii) QÎČ0 -almost surely the function t 7â g(t) is strictly increasing but increases only by jumps
(that is, the jumps heights add up to 1 and the jump locations are dense in [0, 1]).(iii) For each fixed t0 â [0, 1], QÎČ
0 -almost surely the function t 7â g(t) is continuous at t0.
Proof. (i) Let g â G â L2([0, 1], dx) and Δ > 0 then we have to show QÎČ(BΔ(g)) > 0 whereBΔ(g) = h â G0 : âhâ gâL2([0,1]) < Δ. For this choose finitely many points ti â [0, 1] togetherwith ÎŽi > 0 such that the set S := f â G
âŁâŁ |f(ti)âg(ti)| †Ύi âi is contained in BΔ(g). Clearly,from (3.10) QÎČ(S) > 0 which proves the claim.(ii) (3.10) implies that QÎČ
0 -almost surely g(s) < g(t) for each given pair s < t. Varying over allsuch rational pairs s < t, it follows that a.e. g is strictly increasing on R+.In terms of the probabilistic representation (3.9), it is obvious that g increases only by jumps.(iii) This also follows easily from the representation as a normalized gamma process (3.9).
Restating the previous property (ii) in terms of the entropic measure yields that PÎČ0 -a.e. ” â P0
is âCantor likeâ. More precisely,
Corollary 3.11. PÎČ0 -almost surely the measure ” â P0 has no absolutely continuous part andno discrete part. The topological support of ” has Lebesgue measure 0. Moreover,
Ent(”) = +â. (3.12)
Proof. The assertion on the entropy of ” is an immediate consequence of the statement on thesupport of ”. The second claim follows from the fact that the jump heights of g add up to 1.
In terms of the measure QÎČ0 , the last assertion of the corollary states that S(g) = +â for QÎČ
0 -a.e.g â G0.
3.6 Scaling and Invariance Properties
The Dirichlet process QÎČ0 on G0 has the following Markov property: the distribution of g|[s,t]
depends on g[0,1]\[s,t] only via g(s), g(t).And the Dirichlet process QÎČ
0 on G0 has a remarkable self-similarity property: if we restrict thefunctions g onto a given interval [s, t] and then linearly rescale their domain and range in orderto make them again elements of G0 then this new process is distributed according to QÎČâČ
0 withÎČâČ = ÎČ Â· |tâ s|.
11
Proposition 3.12. For each ÎČ > 0, and each s, t â [0, 1], s < t
QÎČ0
(g|[s,t] â .
âŁâŁ g[0,1]\[s,t]) = QÎČ0
(g|[s,t] â .
âŁâŁ g(s), g(t)) (3.13)
and(Îs,t)âQÎČ
0 = QÎČ·|tâs|0 (3.14)
where Îs,t : G0 â G0 with Îs,t(g)(r) = g((1âr)s+rt)âg(s)g(t)âg(s) for r â [0, 1].
Proof. Both properties follow immediately from the representation in Proposition 3.10.
Corollary 3.13. The probability measures QÎČ0 , ÎČ > 0 on G0 are uniquely characterized by the
self-similarity property (3.14) and the distributions of g1/2:
QÎČ0 (g1/2 â dx) =
Î(ÎČ)Î(ÎČ/2)2
· [x(1â x)]ÎČ/2â1dx.
Proposition 3.14. (i) For ÎČ â 0 the measures QÎČ0 weakly converge to a measure Q0
0 definedas the uniform distribution on the set 1[t,1] : t â ]0, 1] â G0.Analogously, the measures QÎČ weakly converge for ÎČ â 0 to a measure Q0 defined as the uniformdistribution on the set of constant maps t : t â S1 â G.(ii) For ÎČ ââ the measures QÎČ
0 (or QÎČ) weakly converge to the Dirac mass ÎŽe on the identitymap e of [0, 1] (or S1, resp.).
Proof. (i) Since the space G0 (equipped with the L2-topology) is compact, so is P(G0) (equippedwith the weak topology). Hence the family QÎČ
0 , ÎČ > 0 is pre-compact. Let Q00 denote the limit of
any converging subsequence of QÎČ0 for ÎČ â 0. According to the formula for the one-dimensional
distributions, for each t â ]0, 1[
QÎČ0 (gt â dx) =
Î(ÎČ)Î(ÎČt)Î(ÎČ(1â t))
· xÎČtâ1(1â x)ÎČ(1ât)â1dx
ââ (1â t)ÎŽ0(dx) + tÎŽ1(dx)
as ÎČ â 0. Hence, Q00 is the uniform distribution on the set 1[t,1] : t â ]0, 1] â G0.
(ii) Similarly, QÎČ0 (gt â dx) â ÎŽt(dx) as ÎČ â â. Hence, ÎŽe with e : t 7â t will be the unique
accumulation point of QÎČ0 for ÎČ ââ.
Restating the previous results in terms of the entropic measures, yields that the entropic mea-sures PÎČ0 converge weakly to the uniform distribution P0
0 on the set (1 â t)ÎŽ0 + tÎŽ1 : t â[0, 1] â P0; and the measures PÎČ converge weakly to the uniform distribution P0 on the setÎŽt : t â S1 â P whereas for ÎČ ââ both, PÎČ0 and PÎČ, will converge to ÎŽLeb, the Dirac masson the uniform distribution of [0, 1] or S1, resp.The assertions of Proposition 3.12 imply the following Markov property and self-similarity prop-erty of the entropic measure.
Proposition 3.15. For each each x, y â [0, 1], x < y
PÎČ0(”|[x,y] â .
âŁâŁÂ”|[0,1]\[x,y]) = PÎČ0(”|[x,y] â .
âŁâŁÂ”([x, y])
andPÎČ0(”|[x,y] â .
âŁâŁÂ”([x, y]) = α)
= PÎČ·α0 (”x,y â . )
with ”x,y â P0 (ârescaling of ”|[x,y]â) defined by ”x,y(A) = 1”([x,y])”(x+(yâx) ·A) for A â [0, 1].
12
3.7 Dirichlet Processes on General Measurable Spaces
Recall Fergusonâs notion of a Dirichlet process on a general measurable space M with parametermeasure m on M . This is a probability measure Qm
P(M) on P(M), uniquely defined by the fact
that for any finite measurable partition M =âN+1
i=1 Mi and Ïi := m(Mi).
QmP(M) (” : ”(M1) â dx1, . . . , ”(MN ) â dxN )
=Î(m(M))âN+1i=1 Î(Ïi)
xÏ1â11 · · ·xÏNâ1
N
(1â
Nâi=1
xi)ÏN+1â1
dx1 · · · dxN ,
If a map h : M âM leaves the parameter measure m invariant then obviously the induced maph : P(M) â P(M), ” 7â hâ” leaves the Dirichlet process Qm
P(M) invariant.
In the particular case M = [0, 1] and m = ÎČ Â· Leb, the Dirichlet process QmP(M) can be obtained
as push forward of the measure QÎČ0 (introduced before) under the isomorphism ζ : G0 â P([0, 1])
which assigns to each g the induced Lebesgue-Stieltjes measure dg (the inverse 릉1 assigns toeach probability measure its distribution function):
QmP([0,1]) = ζâQÎČ
0 . (3.15)
Note that the support properties of the measure QmP([0,1]) are completely different from those of
the measure PÎČ0 . In particular, QmP([0,1])-almost every ” â P([0, 1]) is discrete and has full topo-
logical support, cf. Corollary 3.11. The invariance properties of QmP([0,1]) under push forwards by
means of measure preserving transformations of [0, 1] seems to have no intrinsic interpretationin terms of QÎČ
0 .
4 The Change of Variable Formula for the Dirichlet Process andfor the Entropic Measure
Our main result in this chapter will be a change of variable formula for the Dirichlet process.To motivate this formula, let us first present an heuristic derivation based on the formal repre-sentation (3.1).
4.1 Heuristic Approaches to Change of Variable Formulae
Let us have a look on the change of variable formula for the Wiener measure. On a formal level,it easily follows from Feynmanâs heuristic interpretation
dPÎČ(g) =1Zeâ
ÎČ2
R 10 g
âČ(t)2dt dP(g)
with the (non-existing) âuniform distributionâ P. Assuming that the latter is âtranslation invari-antâ (i.e. invariant under additive changes of variables, â at least in âsmoothâ directions h) weimmediately obtain
dPÎČ(h+ g) =1Zeâ
ÎČ2
R 10 (h+g)âČ(t)2dt dP(h+ g)
=1Zeâ
ÎČ2
R 10 h
âČ(t)2dtâÎČR 10 h
âČ(t)gâČ(t)dt · eâÎČ2
R 10 g
âČ(t)2dt dP(g)
= eâÎČ2
R 10 h
âČ(t)2dtâÎČR 10 h
âČ(t)dg(t) dPÎČ(g).
If we interpretâ« 10 h
âČ(t)dg(t) as the Ito integral of hâČ with respect to the Brownian path g thenthis is indeed the famous Cameron-Martin-Girsanov-Maruyama theorem.
13
In the case of the entropic measure, the starting point for a similar argumentation is the heuristicinterpretation
dQÎČ0 (g) =
1ZeÎČ
R 10 log gâČ(t)dt dQ0(g),
again with a (non-existing) âuniform distributionâ Q0 on G0. The natural concept of âchangeof variablesâ, of course, will be based on the semigroup structure of the underlying space G0;that is, we will study transformations of G0 of the form g 7â h g for some (smooth) elementh â G0. It turns out that Q0 should not be assumed to be invariant under translations butmerely quasi-invariant:
dQ0(h g) = Y 0h (g) dQ0(g)
with some density Yh. This immediately implies the following change of variable formula for QÎČ0 :
dQÎČ0 (h g) =
1ZeÎČ
R 10 log(hg)âČ(t)dt dQ0(h g)
=1ZeÎČ
R 10 log hâČ(g(t))dt · eÎČ
R 10 log gâČ(t)dt · Y 0
h (g) dQ0(g)
= eÎČR 10 log gâČ(t)dt · Y 0
h (g) dQÎČ0 (g).
This is the heuristic derivation of the change of variables formula. Its rigorous derivation (andthe identification of the density Yh) is the main result of this chapter.
4.2 The Change of Variables Formula on the Sphere
For g, h â G with h â C2 we put
Y 0h (g) :=
âaâJg
âhâČ (g(aâ)) · hâČ (g(a+))
ÎŽ(hg)ÎŽg (a)
, (4.1)
where Jg â S1 denotes the set of jump locations of g and
ÎŽ(h g)ÎŽg
(a) :=h (g(a+))â h (g(aâ))
g(a+)â g(aâ).
To simplify notation, here and in the sequel (if no ambiguity seems possible), we write y â xinstead of |[x, y]| to denote the length of the positively oriented segment from x to y in S1. Wewill see below that the infinite product in the definition of Y 0
h (g) converges for QÎČ-a.e. g â G.Moreover, for ÎČ > 0 we put
XÎČh (g) := exp
(ÎČ
â« 1
0log hâČ (g(s)) ds
), Y ÎČ
h (g) := XÎČh (g) · Y 0
h (g). (4.2)
Theorem 4.1. Each C2-diffeomorphism h â G induces a bijective map Ïh : G â G, g 7â h gwhich leaves the measure QÎČ quasi-invariant:
dQÎČ(h g) = Y ÎČh (g) dQÎČ(g).
In other words, the push forward of QÎČ under the map Ïâ1h = Ïhâ1 is absolutely continuous w.r.t.
QÎČ with density Y ÎČh :
d(Ïhâ1)âQÎČ(g)dQÎČ(g)
= Y ÎČh (g).
The function Y ÎČh is bounded from above and below (away from 0) on G.
By means of the canonical isometry Ï : G â P, g 7â gâLeb, Theorem 4.1 immediately implies
14
Corollary 4.2. For each C2-diffeomorphism h â G the entropic measure PÎČ is quasi-invariantunder the transformation ” 7â hâ” of the space P:
dPÎČ(hâ”) = Y ÎČh (Ïâ1(”)) dPÎČ(”).
The density Y ÎČh (Ïâ1(”)) introduced in (4.2) can be expressed as follows
Y ÎČh (Ïâ1(”)) = exp
[ÎČ
â« 1
0log hâČ(s)”(ds)
]·
âIâgaps(”)
âhâČ(Iâ) · hâČ(I+)|h(I)|/|I|
where gaps(”) denotes the set of segments I = ]Iâ, I+[â S1 of maximal length with ”(I) = 0and |I| denotes the length of such a segment.
4.3 The Change of Variables Formula on the Interval
From the representation of QÎČ as a product of QÎČ0 and Leb (see Remark 3.7) and the change of
variable formulae for QÎČ and Leb, one can deduce a change of variable formula for QÎČ0 similar to
that of Theorem 4.1 but containing an additional factor 1hâČ(0) . In this case, one has to restrict
to translations by means of C2-diffeomorphisms h â G with h(0) = 0.More generally, one might be interested in translations of G0 by means of C2-diffeomorphismsh â G0. In contrast to the previous situation, it now may happen that hâČ(0) 6= hâČ(1).
For g â G0 and C2-ismorphism h : [0, 1] â [0, 1] we put
Y ÎČh,0(g) := XÎČ
h (g) · Yh,0(g) (4.3)
withYh,0(g) =
1âhâČ(0) · hâČ(1)
· Y 0h (g)
and XÎČh (g) and Y 0
h (g) defined as before in (4.1), (4.2). Note that here and in the sequel by aC2-isomorphism h â G0 we understand an increasing homeomorphism h : [0, 1] â [0, 1] such thath and hâ1 are bounded in C2([0, 1]), which in particular implies hâČ > 0.
Theorem 4.3. Each translation Ïh : G0 â G0, g 7â h g by means of a C2-isomorphism h â G0
leaves the measure QÎČ0 quasi-invariant:
dQÎČ0 (h g) = Y ÎČ
h,0(g) dQÎČ0 (g)
or, in other words,d(Ïhâ1)âQÎČ
0 (g)
dQÎČ0 (g)
= Y ÎČh,0(g).
The function Y ÎČh,0 is bounded from above and below (away from 0) on G0.
Corollary 4.4. For each C2-isomorphism h â G0 the entropic measure PÎČ0 is quasi-invariantunder the transformation ” 7â hâ” of the space P0:
dPÎČ0 (hâ”)
dPÎČ0 (”)= exp
[ÎČ
â« 1
0log hâČ(s)”(ds)
]· 1â
hâČ(0) · hâČ(1)·
âIâgaps(”)
âhâČ(Iâ) · hâČ(I+)|h(I)|/|I|
where gaps(”) denotes the set of intervals I = ]Iâ, I+[â [0, 1] of maximal length with ”(I) = 0and |I| denotes the length of such an interval.
Remark 4.5. Theorem 4.3 seems to be unrelated to the quasi-invariance of the measure QmP([0,1])
under the transformation dg â h · dg/ăh, dgă shown in [Han02]. Nor is it anyhow implied bythe quasi-ivarariance formula for the general measure valued gamma process as in [TVY01] withrespect to a similar transformation. In our present case the latter would correspond to themapping dÎł â h · dÎł of the (measure valued) Gamma process dÎł.
15
4.4 Proofs for the Sphere Case
Lemma 4.6. For each C2-diffeomorphism h â G
XÎČh (g) = lim
kââ
kâ1âi=0
[h (g(ti+1))â h (g(ti))
g(ti+1)â g(ti)
]ÎČ(ti+1âti)(4.4)
Here ti = ik for i = 0, 1, . . . , k â 1 and tk = 0. Thus ti+1 â ti := |[ti, ti+1]| = 1
k for all i.
Proof. Without restriction, we may assume ÎČ = 1. According to Taylorâs formula
h (g(ti+1)) = h (g(ti)) + hâČ (g(ti)) · (g(ti+1)â g(ti)) + 12h
âČâČ(Îłi) · (g(ti+1)â g(ti))2
for some Îłi â [g(ti), g(ti+1)]. Hence,
limkââ
kâ1âi=0
[h (g(ti+1))â h (g(ti))
g(ti+1)â g(ti)
]ti+1âti=
= limkââ
kâ1âi=0
[hâČ (g(ti)) + 1
2hâČâČ(Îłi) · (g(ti+1)â g(ti))
]ti+1âti
= limkââ
exp
(kâ1âi=0
[log hâČ (g(ti)) + log
(1 + 1
2
hâČâČ(Îłi)hâČ (g(ti))
(g(ti+1)â g(ti)))]
· (ti+1 â ti))
(?)= exp
(limkââ
kâ1âi=0
log hâČ (g(ti)) · (ti+1 â ti)
)
= exp(â« 1
0log hâČ (g(t)) dt
)= X1
h(g).
Here (?) follows from the fact that
1 + 12
hâČâČ(Îłi)hâČ (g(ti))
· (g(ti+1)â g(ti)) =h (g(ti+1))â h (g(ti))
g(ti+1)â g(ti)· 1hâČ (g(ti))
= hâČ(ηi) ·1
hâČ (g(ti))℠Δ > 0
for some ηi â [g(ti), g(ti+1)] and some Δ > 0, independent of i and k. Thus
kâ1âi=0
âŁâŁâŁâŁlog[1 + 1
2
hâČâČ(Îłi)hâČ (g(ti))
· (g(ti+1)â g(ti))]âŁâŁâŁâŁ · (ti+1 â ti)
†C1 ·kâ1âi=0
12
âŁâŁâŁâŁ hâČâČ(Îłi)hâČ (g(ti))
âŁâŁâŁâŁ · (g(ti+1)â g(ti)) · (ti+1 â ti)
†C2 ·kâ1âi=0
(g(ti+1)â g(ti)) · (ti+1 â ti)
†C3 · 1k .
Lemma 4.7. For each C3-diffeomorphism h â G
Y 0h (g) := lim
kââ
kâ1âi=0
[hâČ (g(ti)) ·
g(ti+1)â g(ti)h (g(ti+1))â h (g(ti))
](4.5)
where ti = ik for i = 0, 1, . . . , k â 1 and tk = 0.
16
Proof. Let h and g be given. Depending on some Δ > 0 let us choose l â N large enough (to bespecified in the sequel) and let a1, . . . , al denote the l largest jumps of g. Put Jâg = Jg\a1, . . . , aland for simplicity al+1 := a1. For k very large (compared with l) and j = 1, . . . , l let kj denotethe index i â 0, 1, . . . , k â 1, for which aj â [ti, ti+1[. Then again by Taylorâs formula
kj+1â1âi=kj+1
[hâČ (g(ti)) ·
g(ti+1)â g(ti)h (g(ti+1))â h (g(ti))
]â1
=kj+1â1âi=kj+1
[1 + 1
2
hâČâČ (g(ti))hâČ (g(ti))
· (g(ti+1)â g(ti)) + 16
hâČâČâČ(ηi)hâČ (g(ti))
· (g(ti+1)â g(ti))2
](1a)
†exp
kj+1â1âi=kj+1
log[1 +
12
(log hâČ
)âČ (g(ti)) + Δl
· (g(ti+1)â g(ti))
](1b)
†eΔ/l · exp
12
kj+1â1âi=kj+1
(log hâČ
)âČ (g(ti)) · (g(ti+1)â g(ti))
,
provided l and k are chosen so large that
|g(ti+1)â g(ti)| â€Î”
C1 · l
for all i â 0, . . . , k â 1 \ k1, . . . , kl, where C1 = supx,y
|hâČâČâČ(x)|6·hâČ(y) .
On the other hand,âhâČ(g(tkj+1
))
hâČ(g(tkj+1)
) = exp
(â« g(tkj+1)
g(tkj+1)
(12 log hâČ
)âČ (s) ds)
= exp
kj+1â1âi=kj+1
[(12 log hâČ
)âČ (g(ti)) · (g(ti+1)â g(ti)) +(
12 log hâČ
)âČâČ (Îłi) · 12 (g(ti+1)â g(ti))
2]
(2)
â„ eâΔ/l · exp
12
kj+1â1âi=kj+1
(log hâČ
)âČ (g(ti)) · (g(ti+1)â g(ti))
,
provided l and k are chosen so large that
|g(ti+1)â g(ti)| â€Î”
C2 · l
for all i â 0, 1, . . . , k â 1 \ k1, . . . , kl, where C2 = supx
âŁâŁâŁ(12 log hâČ
)âČâČ (x)âŁâŁâŁ.Therefore,
âiâ0,1,...,kâ1\k1,...,kl
[hâČ (g(ti)) ·
g(ti+1)â g(ti)h (g(ti+1))â h (g(ti))
]â1
†e2Δ ·lâ
j=1
âhâČ(g(tkj+1
))
hâČ(g(tkj+1)
) = (I).
In order to derive the corresponding lower estimate, we can proceed as before in (1a) and (2)(replacing Δ by âΔ and †by â„ and vice versa). To proceed as in (1b) we have to argue as
17
follows
exp
kj+1â1âi=kj+1
log[1 +
(12 log hâČ
)âČ (g(ti))â Δl
· (g(ti+1)â g(ti))
](1c)
â„ eâΔ/l · exp
kj+1â1âi=kj+1
(1â Δ) ·(
12 log hâČ
)âČ (g(ti)) · (g(ti+1)â g(ti))
,
provided l and k are chosen so large that
log (1 + C3 · (g(ti+1)â g(ti))) â„ (1â Δ) · C3 · (g(ti+1)â g(ti))
for all i â 0, 1, . . . , k â 1 \ k1, . . . , kl, where C3 = supx
âŁâŁâŁ(12 log hâČ
)âČ (x)âŁâŁâŁ.Thus we obtain the following lower estimateâ
iâ0,1,...,kâ1\k1,...,kl
[hâČ (g(ti)) ·
g(ti+1)â g(ti)h (g(ti+1))â h (g(ti))
]â1
â„ eâ2Δ ·
lâj=1
âhâČ(g(tkj+1
))
hâČ(g(tkj+1)
)1âΔ
â„ eâ2Δ · CâΔ/23 ·
lâj=1
âhâČ(g(tkj+1
))
hâČ(g(tkj+1)
) = (II),
since lâj=1
âhâČ(g(tkj+1
))
hâČ(g(tkj+1)
)Δ = exp
Δ2
lâj=1
[log hâČ
(g(tkj+1
))â log hâČ
(g(tkj+1)
)]†exp
Δ2
lâj=1
C3 ·[g(tkj+1
)â g(tkj+1)]
†exp(Δ2C3
),
where C3 = supx
âŁâŁ(log hâČ)âČ (x)âŁâŁ.
Now for fixed l as k ââ the bound (I) converges to
(IâČ) = e2Δ ·lâ
j=1
âhâČ (g(aj+1â))hâČ (g(aj+))
and the bound (II) to
(IIâČ) = eâ2Δ · CâΔ/23 ·
lâj=1
âhâČ (g(aj+1â))hâČ (g(aj+))
.
Finally, it remains to considerâiâk1,...,kl
[hâČ (g(ti)) ·
g(ti+1)â g(ti)h (g(ti+1))â h (g(ti))
]â1
= (III).
Again for fixed l and k ââ this obviously converges to
(IIIâČ) =lâ
j=1
[1
hâČ (g(ajâ))· ÎŽ(h g)
ÎŽg(aj)
].
Putting together these estimates and letting lââ, we obtain the claim.
18
Lemma 4.8. (i) For all g, h â G with h â C2 strictly increasing, the infinite product in thedefinition of Y 0
h (g) converges. There exists a constant C = C(ÎČ, h) such that âg â G
1C †Y ÎČ
h (g) †C.
(ii) If hn â h in C2 then Y 0hn
(g) â Y 0h (g).
(iii) Let Y 0h,k, X
ÎČh,k, Y
ÎČh,k denote the sequences used in Lemma 4.6 and 4.7 to approximate Y 0
h , XÎČh , Y
ÎČh .
Then there exists a constant C = C(ÎČ, h) such that âg â G, âk â N
1C †Y ÎČ
h,k(g) †C.
Proof. (i) Put C = sup |(log hâČ)âČ|. Given g â G and Δ > 0, we choose k large enough such thatâaâJg(k) |g(a+)âg(aâ)| †Δ where Jg(k) = Jg \a1, a2, . . . , ak denotes the âset of small jumpsâ
of g. Here we enumerate the jump locations a1, a2, . . . â Jg according to the size of the respectivejumps. Then with suitable Οa â [g(aâ), g(a+)]
âaâJg(k)
âŁâŁâŁâŁâŁâŁlog
âhâČ(g(aâ))
âhâČ(g(a+))
ÎŽ(hg)ÎŽg (a)
âŁâŁâŁâŁâŁâŁâ€
âaâJg(k)
âŁâŁâŁâŁ12 log hâČ(g(aâ)) +12
log hâČ(g(aâ))â log hâČ(Ο(a))âŁâŁâŁâŁ
â€â
aâJg(k)
|C · (g(a+)â g(aâ))| = C · Δ.
Hence, the infinite sumâaâJg
log
âhâČ(g(aâ))
âhâČ(g(a+))
ÎŽ(hg)ÎŽg (a)
= limkââ
âaâJg(k)
log
âhâČ(g(aâ))
âhâČ(g(a+))
ÎŽ(hg)ÎŽg (a)
is absolutely convergent and thus also infinite product in the definition of Y 0h (g) converges. The
same arguments immediately yieldâŁâŁlog Y 0h (g)
âŁâŁ â€âaâJg
âŁâŁâŁâŁ12 log hâČ(g(aâ)) +12
log hâČ(g(aâ))â log hâČ(Ο(a))âŁâŁâŁâŁ †C. (4.6)
(ii) In order to prove the convergence Y 0hn
(g) â Y 0h (g), for given g â G we split the product over
all jumps into a finite product over the big jumps and an infinite product over all small jumps.Obviously, the finite products will converge (for any choice of k)
âaâa1,...,ak
âhâČn(g(aâ))
âhâČn(g(a+))
ÎŽ(hng)ÎŽg (a)
âââ
aâa1,...,ak
âhâČ(g(aâ))
âhâČ(g(a+))
ÎŽ(hg)ÎŽg (a)
as nââ provided hn â h in C2. Now let C = supn supx |(log hâČn)âČ(x)| and choose k as before.
Then uniformly in n âŁâŁâŁâŁâŁâŁlogâ
aâJg\a1,...,ak
âhâČn(g(aâ))
âhâČn(g(a+))
ÎŽ(hng)ÎŽg (a)
âŁâŁâŁâŁâŁâŁ †C · Δ.
(iii) Let C1 = supx|hâČ(x)| and C2 = sup
x
âŁâŁ(log hâČ)âČ (x)âŁâŁ. Then for all g and k:
Xh,k(g) =kâ1âi=0
hâČ(ηi)ti+1âti †C1
19
and
Y 0h,k(g) =
kâ1âi=0
hâČ (g(ti))hâČ(Îłi)
= exp
[kâ1âi=0
(log hâČ
)âČ (ζi) · (g(ti)â Îłi)
]
†exp
[C2 ·
kâ1âi=0
|g(ti)â Îłi|
]†exp(C2)
(with suitable Îłi, ηi â [g(ti), g(ti+1)] and ζi â [g(ti), Îłi]). Analogously, the lower estimatesfollow.
Proof of Theorem 4.1. In order to prove the equality of the two measures under consideration, itsuffices to prove that all of their finite dimensional distributions coincide. That is, for each m âN, each ordered family t1, . . . , tm of points in S1 and each bounded continuous u : (S1)m ââ Rone has to verify thatâ«
Gu(hâ1 (g(t1)) , hâ1 (g(t2)) , . . . , hâ1 (g(tm))
)dQÎČ(g)
=â«Gu (g(t1), g(t2), . . . , g(tm)) · Y ÎČ
h (g) dQÎČ(g).
Without restriction, we may restrict ourselves to equidistant partitions, i.e. ti = im for i =
1, . . . ,m. Let us fix m â N, u and h. For simplicity, we first assume that h is C3. Then byLemmas 4.6 - 4.8 and Lebesgueâs theoremâ«Gu(g(
1m
), . . . , g (1)
)· Y ÎČ
h (g) dQÎČ(g)
=â«Gu(g(
1m
), . . . , g (1)
)· limkââ
Y ÎČh,k(g) dQ
ÎČ(g)
= limkââ
â«Gu(g(
1m
), . . . , g (1)
)·mkâ1âi=0
[hâČ(g(ikm
))·
g(i+1km
)â g
(ikm
)h(g(i+1km
))â h
(g(ikm
))]
·mkâ1âi=0
[h(g(i+1km
))â h
(g(ikm
))g(i+1km
)â g
(ikm
) ] ÎČkm
dQÎČ(g)
= limkââ
Î(ÎČ)[Î(ÎČ/km)]km
â«Smk
1
u(xk, x2k, . . . , xmk)mkâ1âi=0
hâČ(xi) ·mkâ1âi=0
[h(xi+1)â h(xi)]ÎČkmâ1 dx1 . . . dxmk
= limkââ
Î(ÎČ)[Î(ÎČ/km)]km
â«Smk
1
u(xk, x2k, . . . , xmk) ·mkâ1âi=0
[h(xi+1)â h(xi)]ÎČkmâ1 dh(x1) . . . dh(xmk)
= limkââ
Î(ÎČ)[Î(ÎČ/km)]km
â«Smk
1
u(hâ1(yk), hâ1(y2k), . . . , hâ1(ymk)
)·mkâ1âi=0
[yi+1 â yi]ÎČkmâ1 dy1 . . . dymk
=â«Gu(hâ1
(g(
1m
)), hâ1
(g(
2m
)), . . . , hâ1 (g (1))
)dQÎČ(g).
Now we treat the general case h â C2. We choose a sequence of C3-functions hn â G with hn â h
20
in C2. Then â«Gu(hâ1 (g(t1)) , hâ1 (g(t2)) , . . . , hâ1 (g(tm))
)dQÎČ(g)
= limnââ
â«Gu(hâ1n (g(t1)) , hâ1
n (g(t2)) , . . . , hâ1n (g(tm))
)dQÎČ(g)
= limnââ
â«Gu (g(t1), g(t2), . . . , g(tm)) · Y ÎČ
hn(g) dQÎČ(g)
=â«Gu (g(t1), g(t2), . . . , g(tm)) · Y ÎČ
h (g) dQÎČ(g).
For the last equality, we have used the dominated convergence Y ÎČhn
(g) â Y ÎČh (g) (due to Lemma
4.8).
4.5 Proof for the Interval Case
The proof of Theorem 4.3 uses completely analogous arguments as in the previous section. Tosimplify notation, for h â C1([0, 1]), k â N let Xh,k, Y
0h,k : G0 â R be defined by
Xh,k(g) :=kâ1âi=0
[h (g(ti+1))â h (g(ti))
g(ti+1)â g(ti)
]ti+1âti
and
Y 0h,k(g) :=
[g(t1)â g(t0)
h (g(t1))â h (g(t0))
] kâ1âi=1
[hâČ (g(ti)) ·
g(ti+1)â g(ti)h (g(ti+1))â h (g(ti))
]where ti = i
k with i = 0, 1, . . . , k. Similar to the proof of theorem 4.1 the measure QÎČ0 satisfies
the following finite dimensional quasi-invariance formula.For any u : [0, 1]mâ1 â R, m, l â N and C1-isomorphism h : [0, 1] â [0, 1]â«
G0
u(hâ1 (g(t1)) , hâ1 (g(t2)) , . . . , hâ1 (g(tmâ1))
)dQÎČ
0 (g)
=â«G0
u (g(t1), g(t2), . . . , g(tmâ1)) ·XÎČh,l·m(g) · Y 0
h,l·m(g) dQÎČ0 (g),
where ti = im , i = 1, · · · ,mâ1. The passage to the limit for letting first l and then m to infinity
is based on the following assertions.
Lemma 4.9. (i) For each C2-isomorphism h â G0 and g â G0
Xh(g) = limkââ
Xh,k(g).
(ii) For each C3-isomorphism h â G0 and g â G0
limkââ
Y 0h,k(g) =
âaâJg
âhâČ(g(a+)) · hâČ(g(aâ))
ÎŽ(hg)ÎŽg (a)
Ă 1âhâČ(g(0)) · hâČ(g(1â))
·
1 if g(1â) = g(1)hâČ(g(1â))ÎŽ(hg)
ÎŽg(1)
else,
where Jg â]0, 1[ is the set of jump locations of g on ]0, 1[. In particular,
limkââ
Y 0h,k(g) = Yh,0(g) for QÎČ
0 -a.e.g.
21
(iii) For all g â G0 and C2-isomorphism h â G0, the infinite product in the definition of Yh,0(g)converges. There exists a constant C = C(ÎČ, h) such that âg â G0
1C †Y ÎČ
h,0(g) †C.
(iv) If hn â h in C2([0, 1], [0, 1]) with h as above, then Y 0,hn(g) â Y0,h(g).(v) For each C3-isomorphism h â G0 there exists a constant C = C(ÎČ, h) such that âg â G,âk â N
1C †XÎČ
h,k(g) · Y0h,k(g) †C.
Proof. The proofs of (i) and (iii)-(iv) carry over from their respective counterparts on the sphere,lemmas 4.6 and 4.8 above. We sketch the proof of statement (ii) which needs most modification.For Δ > 0 choose l â N large enough and let a2, . . . , alâ1 denote the l â 2 largest jumps of gon ]0, 1[. For k very large (compared with l) we may assume that a2, . . . , alâ2 â] 2k , 1â
2k [. Put
a1 := 1k , al := 1 â 1
k . For j = 1, . . . , l let kj denote the index i â 1, . . . , k â 1, for whichaj â [ti, ti+1[. In particular, k1 = 1 and kl = kâ1. Then using the same arguments as in lemma4.7 one obtains, for k and l sufficiently large, the two sided bounds
(I) = e2Δ ·lâ1âj=1
âhâČ(g(tkj+1
))
hâČ(g(tkj+1)
)â„
âiâ1,...,kâ1\k1,...,kl
[hâČ (g(ti)) ·
g(ti+1)â g(ti)h (g(ti+1))â h (g(ti))
]â1
â„ eâ2Δ · CâΔ/23 ·
lâ1âj=1
âhâČ(g(tkj+1
))
hâČ(g(tkj+1)
) = (II)
For fixed l and k ââ the bounds (I) and (II) converge to
(IâČ) = e2Δ
âhâČ(g(a2â))hâČ(g(0))
·lâ2âj=2
âhâČ (g(aj+1â))hâČ (g(aj+))
·
âhâČ(g(1â))
hâČ(g(alâ1+))
and
(IIâČ) = eâ2Δ · CâΔ/23 ·
âhâČ(g(a2â))hâČ(g(0))
·lâ2âj=2
âhâČ (g(aj+1â))hâČ (g(aj+))
·
âhâČ(g(1â))
hâČ(g(alâ1+)).
It remains to consider the three remaining terms
(III) =â
iâk2,...,klâ1
[hâČ (g(ti)) ·
g(ti+1)â g(ti)h (g(ti+1))â h (g(ti))
]â1
,
which for fixed l and k ââ converges to
(IIIâČ) =lâ1âj=2
[1
hâČ (g(ajâ))· ÎŽ(h g)
ÎŽg(aj)
],
(IV) =
[g( 1k )â g(0)
h(g( 1k ))â h (g(0))
]â1
·
[hâČ(g(
1k))·
g( 2k )â g( 1
k )h(g( 2k ))â h
(g( 1k ))]â1
,
converging by right continuity of g to
(IVâČ) = hâČ(g(0))
22
and
(V) =
[hâČ(g(k â 1k
))·
g(1)â g(kâ1k )
h (g(1))â h(g(kâ1
k ))]â1
,
which tends, also for k ââ, to
(VâČ) =
1 if g continuous in 1ÎŽ(hg)ÎŽg (1) 1
hâČ(g(1â)) else.
Combining these estimates and letting l â â, we obtain the first claim. The second claim instatement (ii) follows from the fact that g is continuous in t = 1 QÎČ
0 -almost surely.
5 The Integration by Parts Formula
In order to construct Dirichlet forms and Markov processes on G, we will consider it as an infinitedimensional manifold. For each g â G, the tangent space TgG will be an appropriate completionof the space Câ(S1,R). The whole construction will strongly depend on the choice of the normon the tangent spaces TgG. Basically, we will encounter two important cases:
âą in Chapter 6 we will study the case TgG = Hs(S1,Leb) for some s > 1/2, independentof g; this approach is closely related to the construction of stochastic processes on thediffeomorphism group of S1 and Malliavinâs Brownian motion on the homeomorphismgroup on S1, cf. [Mal99].
âą in Chapters 7-9 we will assume TgG = L2(S1, gâLeb); in terms of the dynamics on thespace P(S1) of probability measures, this will lead to a Dirichlet form and a stochasticprocess associated with the Wasserstein gradient and with intrinsic metric given by theWasserstein distance.
In this chapter, we develop the basic tools for the differential calculus on G. The main resultwill be an integration by parts formula. These results will be independent of the choice of thenorm on the tangent space.
5.1 The Drift Term
For each Ï â Câ(S1,R), the flow generated by Ï is the map eÏ : R Ă S1 â S1 where for eachx â S1 the function eÏ(., x) : R â S1, t 7â eÏ(t, x) denotes the unique solution to the ODE
dxtdt
= Ï(xt) (5.1)
with initial condition x0 = x. Since eÏ(t, x) = etÏ(1, x) for all Ï, t, x under consideration, wemay simplify notation and write etÏ(x) instead of eÏ(t, x).Obviously, for each Ï â Câ(S1,R) the family etÏ, t â R is a group of orientation preserving,Câ-diffeomorphism of S1. (In particular, e0 is the identity map e on S1, etÏ esÏ = e(t+s)Ï forall s, t â R and (eÏ)â1 = eâÏ.)Since â
âtetÏ(x)|t=0 = Ï(x) we obtain as a linearization for small t
etÏ(x) â x+ tÏ(x). (5.2)
More precisely,|etÏ(x)â (x+ tÏ(x))| †C · t2
as well as| ââxetÏ(x)â (1 + t
â
âxÏ(x))| †C · t2
23
uniformly in x and |t| †1.
For Ï â Câ(S1,R) and ÎČ > 0 we define functions V ÎČÏ : G â R by
V ÎČÏ (g) := V 0
Ï (g) + ÎČ
â«S1
ÏâČ(g(x))dx
where
V 0Ï (g) :=
âaâJg
[ÏâČ(g(a+)) + ÏâČ(g(aâ))
2â Ï(g(a+))â Ï(g(aâ))
g(a+)â g(aâ)
]. (5.3)
Lemma 5.1. (i) The sum in (5.3) is absolutely convergent. More precisely,
|V 0Ï (g)| â€
âaâJg
âŁâŁâŁâŁÏâČ(g(a+)) + ÏâČ(g(aâ))2
â Ï(g(a+))â Ï(g(aâ))g(a+)â g(aâ)
âŁâŁâŁâŁ †12
â«S1
|ÏâČâČ(x)|dx
and|V ÎČÏ (g)| †(1/2 + ÎČ) ·
â«S1
|ÏâČâČ(x)|dx.
(ii) For each ÎČ â„ 0
V ÎČÏ (g) =
â
âtY ÎČetÏ
(g)âŁâŁâŁâŁt=0
=â
âtY ÎČe+tÏ(g)
âŁâŁâŁâŁt=0
. (5.4)
Proof. (i) According to Taylorâs formula, for each a â Jg
ÏâČ(g(a+)) + ÏâČ(g(aâ))2
â ÎŽ(Ï g)ÎŽg
(a) =1
2(g(a+)â g(aâ))
â« g(a+)
g(aâ)
â« g(a+)
g(aâ)sgn(yâx) ·ÏâČâČ(y)dydx.
Hence, âaâJg
âŁâŁâŁâŁÏâČ(g(a+)) + ÏâČ(g(aâ))2
â ÎŽ(Ï g)ÎŽg
(a)âŁâŁâŁâŁ
†12
âaâJg
âŁâŁâŁâŁâŁ 1(g(a+)â g(aâ))
â« g(a+)
g(aâ)
â« g(a+)
g(aâ)sgn(y â x) · ÏâČâČ(y)dydx
âŁâŁâŁâŁâŁâ€ 1
2
âaâJg
â« g(a+)
g(aâ)|ÏâČâČ(y)|dy =
12
â«S1
|ÏâČâČ(y)|dy.
Finally,
|â«S1
ÏâČ(g(x))dx| †supyâS1
|ÏâČ(y)| â€â«S1
|ÏâČâČ(y)|dy.
(ii) Let us first consider the case ÎČ = 0.
â
âtlog Y 0
etÏ(g)âŁâŁâŁâŁt=0
=â
ât
âaâJg
[12
log(â
âxetÏ)(g(a+)) +
12
log(â
âxetÏ)(g(aâ))â log
ÎŽ(etÏ g)ÎŽg
(a)]âŁâŁâŁâŁâŁâŁt=0
=âaâJg
â
ât
[12
log(â
âxetÏ)(g(a+)) +
12
log(â
âxetÏ)(g(aâ))â log
ÎŽ(etÏ g)ÎŽg
(a)]âŁâŁâŁâŁt=0
.
In order to justify that we may interchange differentiation and summation, we decompose (aswe did several times before) the infinite sum over all jumps in Jg into a finite sum over bigjumps a1, . . . , ak and an infinite sum over small jumps in Jg(k) = Jg \ a1, . . . , ak. Of course,
24
the finite sum will make no problem. We are going to prove that the contribution of the smalljumps is arbitrarily small. Recall from Lemma 4.8 thatâaâJg(k)
[12
log(â
âxetÏ)(g(a+)) +
12
log(â
âxetÏ)(g(aâ))â log
ÎŽ(etÏ g)ÎŽg
(a)]†Ct·
âaâJg(k)
[g(a+)â g(aâ)]
where Ct := supxâŁâŁ ââx log( ââxetÏ)(x)
âŁâŁ. Now Ct †C · |t| for all |t| †1 and an appropriate constantC. Thus for any given Δ > 0âŁâŁâŁâŁâŁâŁ âât
âaâJg(k)
[12
log(â
âxetÏ)(g(a+)) +
12
log(â
âxetÏ)(g(aâ))â log
ÎŽ(etÏ g)ÎŽg
(a)]âŁâŁâŁâŁâŁâŁt=0
†Δ
provided k is chosen large enough (i.e. such that C ·â
aâJg(k) |g(a+)âg(aâ)| †Δ). This justifiesthe above interchange of differentiation and summation.Now for each x â S1
â
ât
(log
â
âxetÏ(x)
)âŁâŁâŁâŁt=0
= ÏâČ(x)
since the linearization of etÏ for small t yields
etÏ(x) â x+ tÏ(x),â
âxetÏ(x) â 1 + tÏâČ(x).
Similarly, for small t we obtain
ÎŽ(etÏ g)ÎŽg
(a) â 1 + t · ÎŽ(Ï g)ÎŽg
(a)
and thusâ
ât
ÎŽ(etÏ g)ÎŽg
(a)âŁâŁâŁâŁt=0
=ÎŽ(Ï g)ÎŽg
(a).
Therefore,â
âtlog Y 0
etÏ(g)âŁâŁâŁâŁt=0
= V 0Ï (g).
On the other hand, obviously
â
âtlog Y 0
etÏ(g)âŁâŁâŁâŁt=0
=â
âtY 0etÏ
(g)âŁâŁâŁâŁt=0
since Y 0e0(g) = 1.
Finally, we have to consider the derivative of XetÏ . Based on the previous arguments and usingthe fact that â
ât log(ââxetÏ
)(x) is uniformly bounded in t â [â1, 1] and x â S1 we immediately
seeâ
âtlogXetÏ(g)
âŁâŁâŁâŁt=0
=â
ât
â«S1
log(â
âxetÏ
)(g(y))dy
âŁâŁâŁâŁt=0
=â«S1
â
âtlog(â
âxetÏ
)âŁâŁâŁâŁt=0
(g(y)) dy =â«S1
ÏâČ(g(y))dy.
Again Xe0(g) = 1. Therefore,
â
ât
[XetÏ
]ÎČ (g)âŁâŁâŁâŁt=0
= ÎČ Â·â«S1
ÏâČ(g(y))dy
and thusâ
âtY ÎČetÏ
(g)âŁâŁâŁâŁt=0
= V ÎČÏ (g).
this proves the first identity in (5.4). The proof of the second one V ÎČÏ (g) = â
âtYÎČe+tÏ(g)
âŁâŁâŁt=0
is
similar (even slightly easier).
25
5.2 Directional Derivatives
For functions u : G â R we will define the directional derivative along Ï â Câ(S1,R) by
DÏu(g) := limtâ0
1t
[u(etÏ g)â u(g)] (5.5)
provided this limit exists. In particular, this will be the case for the following âcylinder functionsâ.
Definition 5.2. We say that u : G â R belongs to the class Sk(G) if it can be written as
u(g) = U(g(x1), . . . , g(xm)) (5.6)
for some m â N, some x1, . . . , xm â S1 and some Ck-function U : (S1)m â R.
It should be mentioned that functions u â Sk(G) are in general not continuous on G.
Lemma 5.3. The directional derivative exists for all u â S1(G). In particular, for u as above
DÏu(g) = limtâ0
1t
[u(g + t · Ï g)â u(g)]
=mâi=1
âiU(g(x1), . . . , g(xm)) · Ï(g(xi))
with âiU := ââyiU . Moreover, DÏ : Sk(G) â Skâ1(G) for all k â N âȘ â and
âDÏuâL2(QÎČ) â€âm · ââUââ · âÏâL2(S1).
Proof. The first claim follows from
DÏu(g) =â
âtU(etÏ(g(x1)), . . . , etÏ(g(xm)))
âŁâŁâŁâŁt=0
=mâi=1
âiU(etÏ(g(x1)), . . . , etÏ(g(xm))) · ââtetÏ(g(xi))
âŁâŁâŁâŁt=0
=mâi=1
âiU(g(x1), . . . , g(xm)) · Ï(g(xi))
=â
âtU(g(x1) + tÏ(g(x1)), . . . , g(xm) + tÏ(g(xm)))
âŁâŁâŁâŁt=0
= limtâ0
1t
[u(g + t · Ï g)â u(g)] .
For the second claim,
âDÏuâ2L2(QÎČ) =
â«G
(mâi=1
âiU(g(x1), . . . , g(xm)) · Ï(g(xi))
)2
dQÎČ(g)
â€â«G
(mâi=1
(âiU)2(g(x1), . . . , g(xm)) ·mâi=1
Ï2(g(xi))
)dQÎČ(g)
†ââUâ2â ·
mâi=1
â«GÏ2(g(xi)) dQÎČ(g)
= m · ââUâ2â ·â«S1
Ï2(y) dy.
26
5.3 Integration by Parts Formula on P(S1)
For Ï â Câ(S1,R) let DâÏ denote the operator in L2(G,QÎČ) adjoint to DÏ with domain S1(G).
Proposition 5.4. Dom(DâÏ) â S1(G) and for all u â S1(G)
DâÏu = âDÏuâ V ÎČ
Ï Â· u. (5.7)
Proof. Let u, v â S1(G). Thenâ«DÏu · v dQÎČ = lim
tâ0
1t
â«[u(etÏ g)â u(g)] · v(g) dQÎČ(g)
= limtâ0
1t
â« [u(g) · v(eâtÏ g) · Y ÎČ
eâtÏâ u(g) · v(g)
]dQÎČ(g)
= limtâ0
1t
â«u(g) · [v(eâtÏ g)â v(g)] dQÎČ(g)
+ limtâ0
1t
â«u(g) · v(g) ·
[Y ÎČeâtÏ
â 1]dQÎČ(g)
+ limtâ0
1t
â«u(g) · [v(eâtÏ g)â v(g)] ·
[Y ÎČeâtÏ
â 1]dQÎČ(g)
= ââ«u ·DÏv dQÎČ(g)â
â«u · v · V ÎČ
Ï dQÎČ(g) + 0.
To justify the last equality, note that according to Lemma 4.8 | log Y ÎČetÏ | †C · |t| for |t| †1.
Hence, the claim follows with dominated convergence and Lemma 5.4.
Corollary 5.5. The operator (DÏ,S1(G)) is closable in L2(QÎČ). Its closure will be denoted by
(DÏ,Dom(DÏ)).
In other words, Dom(DÏ) is the closure (or completion) of S1(G) with respect to the norm
u 7â(â«
[u2 + (DÏu)2] dQÎČ
)1/2
.
Of course, the space Dom(DÏ) will depend on ÎČ but we assume ÎČ > 0 to be fixed for the sequel.
Remark 5.6. The bilinear form
EÏ(u, v) :=â«DÏu ·DÏv dQÎČ, Dom(EÏ) := Dom(DÏ) (5.8)
is a Dirichlet form on L2(G,QÎČ) with form core Sâ(G). Its generator (LÏ,Dom(LÏ)) is theFriedrichs extension of the symmetric operator
(âDâÏ DÏ, S2(G)).
5.4 Derivatives and Integration by Parts Formula on P([0, 1])
Now let us have a look on flows on [0, 1]. To do so, let a function Ï â Câ([0, 1],R) withÏ(0) = Ï(1) = 0 be given. (Note that each such function can be regarded as Ï â Câ(S1,R)with Ï(0) = 0.) The flow equation (5.1) now defines a flow etÏ, t â R, of order preserving Câdiffeomorphisms of [0, 1]. In particular, etÏ(0) = 0 and etÏ(1) = 1 for all t â R.
Lemma 5.1 together with Theorem 4.3 immediately yields
Lemma 5.7. For Ï â Câ([0, 1],R) with Ï(0) = Ï(1) = 0 and each ÎČ â„ 0
â
âtY ÎČetÏ,0
(g)âŁâŁâŁâŁt=0
= V ÎČÏ (g)â ÏâČ(0) + ÏâČ(1)
2=: V ÎČ
Ï,0(g). (5.9)
27
For functions u : G0 â R we will define the directional derivative along Ï â Câ([0, 1],R) withÏ(0) = Ï(1) = 0 as before by
DÏu(g) := limtâ0
1t
[u(etÏ g)â u(g)] (5.10)
provided this limit exists. We will consider three classes of âcylinder functionsâ for which theexistence of this limit is guaranteed.
Definition 5.8. (i) We say that a function u : G0 â R belongs to the class Ck(G0) (for k âN âȘ 0,â) if it can be written as
u(g) = U(â«
~f(t)g(t)dt)
(5.11)
for some m â N, some ~f = (f1, . . . , fm) with fi â L2([0, 1],Leb) and some Ck-function U : Rm âR. Here and in the sequel, we write
â«~f(t)g(t)dt =
(â« 10 f1(t)g(t)dt, . . . ,
â« 10 fm(t)g(t)dt
).
(ii) We say that u : G0 â R belongs to the class Sk(G0) if it can be written as
u(g) = U (g(x1), . . . , g(xm)) (5.12)
for some m â N, some x1, . . . , xm â [0, 1] and some Ck-function U : Rm â R.(iii) We say that u : G0 â R belongs to the class Zk(G0) if it can be written as
u(g) = U(â«~α(gs)ds
)(5.13)
with U as above, ~α = (α1, . . . , αm) â Ck([0, 1],Rm) andâ«~α(gs)ds =
(⫠10 α1(gs)ds, . . . ,
⫠10 αm(gs)ds
).
Remark 5.9. For each Ï â Câ(S1,R) with Ï(0) = 0 (which can be regarded as Ï â Câ([0, 1],R)with Ï(0) = Ï(1) = 0), the definitions of DÏ in (5.5) and (5.10) are consistent in the followingsense. Each cylinder function u â S1(G0) defines by v(g) := u(g â g0) (âg â G) a cylinderfunction v â S1(G) with DÏv = DÏu on G0. Conversely, each cylinder function v â S1(G)defines by u(g) := v(g) (âg â G0) a cylinder function u â S1(G0) with DÏv = DÏu on G0.
Lemma 5.10. (i) The directional derivative DÏu(g) exists for all u â C1(G0)âȘS1(G0)âȘZ1(G0)(in each point g â G0 and in each direction Ï â Câ([0, 1],R) with Ï(0) = Ï(1) = 0) andDÏu(g) = limtâ0
1t [u(g + t · Ï g)â u(g)]. Moreover,
DÏu(g) =mâi=1
âiU(â«
~f(t)g(t)dt)·â«fi(t)Ï(g(t))dt
for each u â C1(G0) as in (5.11),
DÏu(g) =mâi=1
âiU(g(x1), . . . , g(xm)) · Ï(g(xi))
for each u â S1(G0) as in (5.12), and
DÏu(g) =mâi=1
âiU(â«~α(gs)ds
)·â«Î±âČi(gs)Ï(gs)ds
for each u â Z1(G0) as in (5.13).(ii) For Ï â Câ([0, 1],R) with Ï(0) = Ï(1) = 0 let Dâ
Ï,0 denote the operator in L2(G0,QÎČ0 )
adjoint to DÏ. Then for all u â C1(G0) âȘS1(G0) âȘ Z1(G0)
DâÏ,0u = âDÏuâ V ÎČ
Ï,0 · u. (5.14)
Proof. See the proof of the analogous results in Lemma 5.3 and Proposition 5.4.
Remark 5.11. The operators (DÏ,C1(G0)), (DÏ,S
1(G0)), and (DÏ,Z1(G0)) are closable in
L2(QÎČ0 ). The closures of (DÏ,C
1(G0)), (DÏ,Z1(G0)) and (DÏ,S
1(G0)) coincide. They will bedenoted by (DÏ,Dom(DÏ)). See (proof of) Corollary 6.11.
28
6 Dirichlet Form and Stochastic Dynamics on on GAt each point g â G, the directional derivative DÏu(g) of any âniceâ function u on G definesa linear form Ï 7â DÏu(g) on Câ(S1). If we specify a pre-Hilbert norm â.âg on Câ(S1) forwhich this linear form is continuous then there exists a unique element Du(g) â TgG withDÏu(g) = ăDu(g), Ïăg for all Ï â Câ(S1). Here TgG denotes the completion of Câ(S1) w.r.t.the norm â.âg.The canonical choice of a Dirichlet form on G will then be (the closure of)
E(u, v) =â«GăDu(g), Dv(g)ăg dQÎČ(g), u, v â S1(G). (6.1)
Given such a Dirichlet form, there is a straightforward procedure to construct an operator (âgen-eralized Laplacianâ) and a Markov process (âgeneralized Brownian motionâ). Different choices ofâ.âg in general will lead to completely different Dirichlet forms, operators and Markov processes.We will discuss in detail two choices: in this chapter we will choose â.âg (independent of g) tobe the Sobolev norm â.âHs for some s > 1/2; in the remaining chapters, â.âg will always be theL2-norm Ï 7â (
â«S1 Ï(gt)2dt)1/2 of L2(S1, gâLeb).
For the sequel, fix â once for ever â the number ÎČ > 0 and drop it from the notations, i.e.Q := QÎČ, VÏ := V ÎČ
Ï etc.
6.1 The Dirichlet Form on G
Let (Ïk)kâN denote the standard Fourier basis of L2(S1). That is,
Ï2k(x) =â
2 · sin(2Ïkx), Ï2k+1(x) =â
2 · cos(2Ïkx)
for k = 1, 2, . . . and Ï1(x) = 1. It constitutes a complete orthonormal system in L2(S1): eachÏ â L2(S1) can uniquely be written as Ï(x) =
ââk=1 ck · Ïk(x) with Fourier coefficients of Ï
given by ck :=â«S1 Ï(y)Ïk(y)dy. In terms of these Fourier coefficients we define for each s â„ 0
the norm
âÏâHs :=
(c21 +
ââk=1
k2s · (c22k + c22k+1)
)1/2
(6.2)
on Câ(S1). The Sobolev space Hs(S1) is the completion of Câ(S1) with respect to the normâ.âHs . It has a complete orthonormal system consisting of smooth functions (Ïk)kâN. Forinstance, one may choose
Ï2k(x) =â
2 · kâs · sin(2Ïkx), Ï2k+1(x) =â
2 · kâs · cos(2Ïkx) (6.3)
for k = 1, 2, . . . and Ï1(x) = 1.A linear form A : Câ(S1) â R is continuous w.r.t. â.âHs â and thus can be represented asA(Ï) = ăÏ,ÏăHs for some Ï â Hs(S1) with âÏâHs = âAâHs â if and only if
âAâHs :=
(|A(Ï1)|2 +
ââk=1
k2s · (|A(Ï2k)|2 + |A(Ï2k+1)|2)
)1/2
<â. (6.4)
Proposition 6.1. Fix a number s > 1/2. Then for each cylinder function u â S(G) and eachg â G, the directional derivative defines a continuous linear form Ï 7â DÏu(g) on Câ(S1) âHs(S1). There exists a unique tangent vector Du(g) â Hs(S1) such that DÏu(g) = ăDu(g), ÏăHs
for all Ï â Câ(S1).In terms of the family Ί = (Ïk)kâN from (6.3)
Du(g) =ââk=1
DÏku(g) · Ïk(.)
29
and
âDu(g)â2Hs =
ââk=1
|DÏku(g)|2. (6.5)
Proof. It remains to prove that the RHS of (6.5) is finite for each u and g under consideration.According to Lemma 5.3, for any u â S(G) represented as in (5.12)
ââk=1
|DÏku(g)|2 =
ââk=1
(mâi=1
âiU(g(x1), . . . , g(xm)) · Ïk(g(xi))
)2
†m · ââUâ2â · â
ââk=1
Ï2kââ = m · ââUâ2
â · (1 + 4ââk=1
kâ2s).
And, indeed, the latter is finite for each s > 1/2.
For the sequel, let us now fix a number s > 1/2 and define
E(u, v) =â«GăDu(g), Dv(g)ăHs dQ(g) (6.6)
for u, v â S1(G). Equivalently, in terms of the family Ί = (Ïk)kâN from (6.3)
E(u, v) =ââk=1
â«GDÏk
u(g) ·DÏkv(g) dQ(g). (6.7)
Theorem 6.2. (i) (E ,S1(G)) is closable. Its closure (E ,Dom(E)) is a regular Dirichlet formon L2(G,Q) which is strongly local and recurrent (hence, in particular, conservative).(ii) For u â S1(G) with representation (5.6)
E(u, u) =ââk=1
â«G
(mâi=1
âiU(g(x1), . . . , g(xm)) · Ïk(g(xi))
)2
dQ(g).
The generator of the Dirichlet form is the Friedrichs extension of the operator L given on S2(G)by
Lu(g) =mâ
i,j=1
ââk=1
âiâjU (g(x1), . . . , g(xm))Ïk(g(xi))Ïk(g(xj))
+mâi=1
ââk=1
âiU (g(x1), . . . , g(xm)) [ÏâČk(g(xi)) + VÏk(g)]Ïk(g(xi)).
(iii) Z1(G) is a core for Dom(E) (i.e. it is contained in the latter as a dense subset). Foru â Z1(G) with representation (5.13)
E(u, u) =ââk=1
â«G
(mâi=1
âiU(â«~α(gt)dt) ·
â«Î±âČi(gt)Ïk(gt)dt
)2
dQ(g).
The generator of the Dirichlet form is the Friedrichs extension of the operator L given on Z2(G)by
Lu(g) =mâ
i,j=1
ââk=1
âiâjU(â«~α(gt)dt
)·â«Î±âČi(gt)Ïk(gt)dt ·
â«Î±âČj(gt)Ïk(gt)dt
+mâi=1
ââk=1
âiU(â«~α(gt)dt
)VÏk
(g) +â«
[αâČâČi (gt)Ï2k(gt) + αâČi(gt)Ï
âČk(gt)Ïk(gt)]dt.
30
(iv) The intrinsic metric Ï can be estimated from below in terms of the L2-metric:
Ï(g, h) â„ 1âCâg â hâL2 .
Remark 6.3. All assertions of the above Theorem remain valid for any E defined as in (6.7)with any choice of a sequence Ί = (Ïk)kâN of smooth functions on S1 with
C := âââk=1
Ï2kââ <â. (6.8)
(This condition is satisfied for the sequence from (6.3) if and only if s > 1/2.)
The proof of the Theorem will make use of the following
Lemma 6.4. (i) Dom(E) contains all functions u which can be represented as
u(g) = U(âg â f1âL2 , . . . , âg â fmâL2) (6.9)
with some m â N, some f1, . . . , fm â G and some U â C1(Rm,R).For each u as above, each Ï â Câ(S1) and Q-a.e. g â G
DÏu(g) =mâi=1
âiU(âg â f1âL2 , . . . , âg â fmâL2) ·â«S1
sign(g(t)â fi(t))|g(t)â fi(t)|âg â fiâL2
Ï(g(t))dt
where sign(z) := +1 for z â S1 with |[0, z]| †1/2 and sign(z) := â1 for z â S1 with |[z, 0]| < 1/2.(ii) Moreover, Dom(E) contains all functions u which can be represented as
u(g) = U(gΔ1(x1), . . . , gΔm(xm)) (6.10)
with some m â N, some x1, . . . , xm â S1, some Δ1, . . . , Δm â ]0, 1[ and some U â C1((S1)m,R).Here gΔ(x) :=
â« x+Δx g(t)dt â S1 for x â S1 and 0 < Δ < 1. More precisely,
gΔ(x) := Ï(â« x+Δ
xÏâ1g(t)dt)
where Ï : G(R) â G (cf. section 2.2) denotes the projection and Ïâ1 : G â G(R) the canonicallift with Ïâ1(g)(t) â [g(x), g(x) + 1] â R for t â [x, x+ 1] â R.For each u as above, each Ï â Câ(S1) and each g â G
DÏu(g) =mâi=1
âiU(gΔ1(x1), . . . , gΔm(xm)) · 1Δi
⫠xi+Δi
xi
Ï(g(t))dt.
(iii) The set of all u of the form (6.10) is dense in Dom(E).
Proof. (i) Let us first prove that for each f â G, the map u(g) = âg â fâL2 lies in Dom(E). Forn â N, let Ïn : G â G be the map which replaces each g by the piecewise constant map:
Ïn(g)(t) := g(i
n) for t â [
i
n,i+ 1n
[.
Then by right continuity Ïn(g) â g as nââ and thus
1n
nâ1âi=0
|g( in
)â f(i
n)|2 ââ
â«S1
|g(t)â f(t)|2dt.
31
Therefore, for each g â G as nââ
un(g) := Un(g(0), g(1n
), . . . , g(nâ 1n
)) ââ u(g) (6.11)
where Un(x1, . . . , xn) :=(
1n
ânâ1i=0 dn(xi+1 â f( in))2
)1/2and dn is a smooth approximation of
the distance function x 7â |x| on S1 (which itself is non-differentiable at x = 0 and x = 12) with
|dâČn| †1 and dn(x) â |x| as nââ. Obviously, un â S1(G).By dominated convergence, (6.11) also implies that un â u in L2(G,Q). Hence, u â Dom(E) if(and only if) we can prove that
supnE(un) <â.
But
E(un) =ââk=1
â«G
âŁâŁâŁâŁâŁnâi=1
âiUn(g(0), g(1n
), . . . , g(nâ 1n
)) · Ï(g(iâ 1n
))
âŁâŁâŁâŁâŁ2
dQ(g)
â€ââk=1
â«G
1n
nâi=1
Ï2k(g(
iâ 1n
)) dQ(g) =ââk=1
âÏkâ2L2 <â,
uniformly in n â N. This proves the claim for the function u(g) = âg â fâL2 .From this, the general claim follows immediately: if vn, n â N, is a sequence of S1(G) approx-imations of g 7â âg â 0âL2 then un(g) := U(vn(g â f1), . . . , vn(g â fm)) defines a sequence ofS1(G) approximations of u(g) = U(âg â f1âL2 , . . . , âg â fmâL2).
(ii) Again it suffices to treat the particular case m = 1 and U = id, that is, u(g) = gΔ(x)for some x â S1 and some 0 < Δ < 1. Let g â G(R) be the lifting of g and recall thatu(g) = Ï(1
Δ
â« x+Δx g(t)dt). Define un â S1(G) for n â N by un(g) = Ï( 1
n
ânâ1i=0 g(x + i
nΔ)). Rightcontinuity of g implies un â u as nââ pointwise on G and thus also in L2(G,Q). To see theboundedness of E(un) note that DÏun(g) = 1
n
ânâ1i=0 Ï(g(x+ i
nΔ)). Thus
E(un) â€ââk=1
â«G
1n
nâ1âi=0
Ï2k(g(x+
i
nΔ))dQ(g) =
ââk=1
âÏkâ2L2 <â.
(iii) We have to prove that each u â S1(G) can be approximated in the norm (â.â2 + E(.))1/2 byfunctions un of type (6.10). Again it suffices to treat the particular case u(g) = g(x) for somex â S1. Choose un(g) = g1/n(x). Then by right continuity of g, un â u pointwise on G and
thus also in L2(G,Q). Moreover, DÏun(g) = nâ« x+1/nx Ï(g(t))dt (for all Ï and g) and therefore
E(un) â€ââk=1
n
â« x+1/n
xÏ2k(g(t))dtdQ(g) =
ââk=1
âÏkâ2L2 <â.
Proof of the Theorem. (a) The sum E of closable bilinear forms with common domain S1(G)is closable, provided it is still finite on this domain. The latter will follow by means of Lemma5.3 which implies for all u â S1(G) with representation (5.11)
E(u, u) =ââk=1
â«G
(mâi=1
âiU(g(x1), . . . , g(xm)) · Ïk(g(xi))
)2
dQ(g)
†m · ââUâ2â ·
ââk=1
âÏkâ2L2(S1) <â.
32
Hence, indeed E is finite on S1(G).(b) The Markov property for E follows from that of the EÏk
(u, v) =â«G DÏk
u ·DÏkv dQ.
(c) According to the previous Lemma, the class of continuous functions of type (6.10) is densein Dom(E). Moreover, the class of finite energy functions of type (6.9) is dense in C(G) (withthe L2 topology of G â L2(S1), cf. Proposition 2.1). Therefore, the Dirichlet form E is regular.(e) The estimate for the intrinsic metric is an immediate consequence of the following estimatefor the norm of the gradient of the function u(g) = âg â fâL2 (which holds for each f â Guniformly in g â G):
âDu(g)â2 =ââk=1
(â«S1
sign(g(t)â fi(t))|g(t)â fi(t)|âg â fiâL2
Ïk(g(t))dt)2
â€ââk=1
â«S1
Ï2k(g(t))dt †â
ââk=1
Ï2kââ =: C.
(f) The locality is an immediate consequence of the previous estimate: Given functions u, v âDom(E) with disjoint supports, one has to prove that E(u, v) = 0. Without restriction, one mayassume that supp[u] â Br(g) and supp[v] â Br(h) with âg â hâL2 > 2r + 2ÎŽ. (The generalcase will follow by a simple covering argument.) Without restriction, u, v can be assumed to bebounded. Then |u| †CwÎŽ,g and |v| †CwÎŽ,h for some constant C where
wÎŽ,g(f) =[1ÎŽ(r + ÎŽ â âf â gâL2) ⧠1
]âš 0.
Given un â S1(G) with un â u in Dom(E) put
un = (un ⧠wÎŽ,g) âš (âwÎŽ,g).
Then un â u in Dom(E). Analogously, vn â v in Dom(E) for vn = (vn ⧠wÎŽ,h) âš (âwÎŽ,h). Butobviously, E(un, vn) = 0 since un · vn = 0. Hence, E(u, v) = 0.(g) In order to prove that Z1(G) is contained in Dom(E) it suffices to prove that each u â Z1(G)of the form u(g) =
â«Î±(gt)dt can be approximated in Dom(E) by un â S1(G). Given u as above
with α â C1(S1,R) put un(g) = 1n
âni=1 α(gi/n). Then un â S1(G), un â u on G and
DÏun(g) =1n
nâi=1
αâČ(gi/n)Ï(gi/n) ââ«Î±âČ(gt)Ï(gt)dt = DÏu(g).
Moreover,
E(un, un) =â«G
âk
âŁâŁâŁâŁâŁ 1nnâi=1
αâČ(gi/n)Ï(gi/n)
âŁâŁâŁâŁâŁ2
dQ(g)
†C ·â«G
âk
1n
nâi=1
αâČ(gi/n)2 dQ(g) = C ·
â«S1
αâČ(t)2dt
uniformly in n â N. Hence, u â Dom(E) and
E(u, u) = limnââ
E(un, un) =â«G
âk
âŁâŁâŁâŁâ«S1
αâČ(gt)Ïk(gt)dtâŁâŁâŁâŁ2 dQ(g).
(h) The set Z1(G) is dense in Dom(E) since according to assertion (ii) of the previous Lemmaalready the subset of all u of the form (6.10) is dense in Dom(E).Finally, one easily verifies that Z2(G) is dense in Z1(G) and (using the integration by partsformula) that L is a symmetric operator on Z2(G) with the given representation.
33
Corollary 6.5. There exists a strong Markov process (gt)tâ„0 on G, associated with the Dirichletform E. It has continuous trajectories and it is reversible w.r.t. the measure Q. Its generatorhas the form
12L =
12
âk
DÏkDÏk
+12
âk
VÏk·DÏk
with ÏkkâN being the Fourier basis of Hs(S1).
Remark 6.6. This process (gt)tâ„0 is closely related to the stochastic processes on the diffeo-morphism group of S1 and to the âBrownian motionâ on the homeomorphism group of S1,studied by Airault, Fang, Malliavin, Ren, Thalmaier and others [AMT04, AM06, AR02, Fan02,Fan04, Mal99]. These are processes with generator 1
2L0 = 12
âkDÏk
DÏk. For instance, in the
case s = 3/2 our process from the previous Corollary may be regarded as âBrownian motionplus driftâ. All the previous approaches are restricted to s â„ 3/2. The main improvements ofour approach are:
âą identification of a probability measure Q such that these processes â after adding asuitable drift â are reversible;
âą construction of such processes in all cases s > 1/2.
6.2 Finite Dimensional Noise Approximations
In the previous section, we have seen the construction of the diffusion process on G under minimalassumptions. However, the construction of the process is rather abstract. In this section, we tryto construct explicitly a diffusion process associated with the generator of the Dirichlet form Efrom Theorem 6.2. Here we do not aim for greatest generality.Let a finite family Ί = (Ïk)k=1,...,n of smooth functions on S1 be given and let (Wt)tâ„0 withWt = (W 1
t , . . . ,Wnt ) be a n-dimensional Brownian motion, defined on some probability space
(Ω,F ,P). For each x â S1 we define a stochastic processes (ηt(x))tâ„0 with values in S1 as thestrong solution of the Ito differential equation
dηt(x) =nâk=1
Ïk(ηt(x))dW kt +
12
nâk=1
ÏâČk(ηt(x))Ïk(ηt(x))dt (6.12)
with initial condition η0(x) = x. Equation (6.12) can be rewritten in Stratonovich form asfollows
dηt(x) =nâk=1
Ïk(ηt(x)) dW kt . (6.13)
Obviously, for every t and for P-a.e. Ï â Ω, the function x 7â ηt(x, Ï) is an element of thesemigroup G. (Indeed, it is a Câ-diffeomorphism.) Thus (6.13) may also be interpreted as aStratonovich SDE on the semigroup G:
dηt =nâk=1
Ïk(ηt) dW kt , η0 = e. (6.14)
This process on G is right invariant: if gt denotes the solution to (6.14) with initial conditiong0 = g for some initial condition g â G then gt = ηt g. One easily verifies that the generatorof this process (gt)tâ„0 is given on S2(G) by 1
2
ânk=1DÏk
DÏk. What we aim for, however, is a
process with generator
â12
nâk=1
DâÏkDÏk
=12
nâk=1
DÏkDÏk
+12
nâk=1
VÏk·DÏk
.
34
Define a new probability measure Pg on (Ω,F), given on Ft by
dPg = exp
(nâk=1
â« t
0VÏk
(ηs g)dW ks â
12
nâk=1
â« t
0|VÏk
(ηs g)|2ds
)dP (6.15)
and a semigroup (Pt)tâ„0 acting on bounded measurable functions u on G as follows
Ptu(g) =â«
Ωu(ηt(g(.), Ï)) dPg(Ï).
Proposition 6.7. (Pt)tâ„0 is a strongly continuous Markov semigroup on G. Its generator is anextension of the operator 1
2L = â12
ânk=1D
âÏkDÏk
with domain S2(G). That is, for all u â S2(G)and all g â G
limtâ0
1t
(Ptu(g)â u(g)) =12Lu(g). (6.16)
Proof. The strong continuity follows easily from the fact that ηt(x, .) â x a.s. as t â 0 whichimplies by dominated convergence
Ptu(g) =â«
Ωu(ηt g) dPg â u(g)
for each continuous u : G â R.Now we aim for identifying the generator. According to Girsanovâs theorem, under the measurePg the processes
W kt = W k
t â12
â« t
0VÏk
(ηs g)ds
for k = 1, . . . , n will define n independent Brownian motions. In terms of these driving processes,(6.12) can be reformulated as
dgt(x) =nâk=1
Ïk(gt(x))dW kt +
12
nâk=1
[ÏâČk(gt(x)) + VÏk(gt)]Ïk(gt(x))dt (6.17)
(recall that gs = ηs g). The chain rule applied to a smooth function U on (S1)m, therefore,yields
dU (gt(y1), . . . , gt(ym))
=mâi=1
â
âxiU (gt(y1), . . . , gt(ym)) dgt(yi)
+12
mâi,j=1
â2
âxiâxjU (gt(y1), . . . , gt(ym)) dăg.(yi), g.(yj)ăt
=mâi=1
nâk=1
â
âxiU (gt(y1), . . . , gt(ym))Ïk(gt(yi))dW k
t
+12
mâi=1
nâk=1
â
âxiU (gt(y1), . . . , gt(ym)) [ÏâČk(gt(yi)) + VÏk
(gt)]Ïk(gt(yi))dt
+12
mâi,j=1
nâk=1
â2
âxiâxjU (gt(y1), . . . , gt(ym))Ïk(gt(yi))Ïk(gt(yj))dt.
35
Hence, for a cylinder function of the form u(g) = U(g(y1), . . . , g(ym)) we obtain
limtâ0
1t
(Ptu(g)â u(g))
= limtâ0
1t
â«Î©
[U (gt(y1), . . . , gt(ym))â U (g0(y1), . . . , g0(ym))] dPg
= limtâ0
1t
â«Î©
â« t
0
[12
mâi=1
nâk=1
â
âxiU (gs(y1), . . . , gs(ym)) [ÏâČk(gs(yi)) + VÏk
(gs)]Ïk(gs(yi))
+12
mâi,j=1
nâk=1
â2
âxiâxjU (gs(y1), . . . , gs(ym))Ïk(gs(yi))Ïk(gs(yj))
ds dPg
(â)=
12
mâi=1
nâk=1
â
âxiU (g(y1), . . . , g(ym)) [ÏâČk(g(yi)) + VÏk
(g)]Ïk(g(yi))
+12
mâi,j=1
nâk=1
â2
âxiâxjU (g(y1), . . . , g(ym))Ïk(g(yi))Ïk(g(yj))
=12
nâk=1
[DÏkDÏk
u(g) + VÏk(g) ·DÏk
u(g)] = â12
nâk=1
DâÏkDÏk
u(g).
In order to justify (â), we have to verify continuity in s in all the expressions preceding (â). Theonly term for which this is not obvious is VÏk
(gs). But gs = ηs g with a function ηs(x, Ï) whichis continuous in x and in s. Thus VÏk
(ηs(., Ï) g) is continuous in s.
Remark 6.8. All the previous argumentations in principle also apply to infinite families of(Ïk)k=1,2,..., provided they have sufficiently good integrability properties. For instance, thefamily (6.3) with s > 5
2 will do the job. There are three key steps which require a carefulverification:
âą the solvability of the Ito equation (6.12) and the fact that the solutions are homeomor-phisms of S1; here s â„ 3
2 suffices, cf. [Mal99];
âą the boundedness of the quadratic variation of the drift to justify Girsanovâs transformationin (6.15); for s > 5
2 this will be satisfied since Lemma 5.1 implies (uniformly in g)
ââk=1
|VÏk(g)|2 †(ÎČ + 1)2
ââk=1
â« 1
0|ÏâČâČk(x)|2dx †4(ÎČ + 1)2
ââk=1
k4â2s;
âą the finiteness of the generator and Itoâs chain rule for C2-cylinder functions; here s > 32
will be sufficient.
Remark 6.9. Another completely different approximation of the process (gt)tâ„0 in terms offinite dimensional SDEs is obtained as follows. For N â N, let S1
N denote the set of cylinderfunctions u : G â R which can be represented as u(g) = U(g(1/N), g(2/N), . . . , g(1)) for someU â C1((S1)N ). Denote the closure of (E ,S1
N ) by (EN ,Dom(EN )). It is the image of theDirichlet form (EN ,Dom(EN )) on ÎŁN â (S1)N given by
EN (U) =â«
ÎŁN
Nâi,j=1
âiU(x)âjU(x) aij(x)Ï(x) dx (6.18)
with
aij(x) =ââk=1
Ïk(xi)Ïk(xj), Ï(x) =Î(ÎČ)
Î(ÎČ/N)N
Nâi=1
(xi+1 â xi)ÎČ/Nâ1dx.
36
and (as before) ÎŁN =
(x1, . . . , xN ) â (S1)N :âN
i=1 |[xi, xi+1]| = 1
. That is,
EN (u) = EN (U)
for cylinder functions u â S1N as above. Let (Xt,Px)tâ„0,xâÎŁN
be the Markov process on ÎŁN
associated with EN . Then the semigroup associated with EN is given by
TNt u(g) = Eg(1/N),...,g(1) [U(Xt)] .
Now let (gt,Pg)tâ„0,gâG and (Tt)tâ„0 denote the Markov process and the L2-semigroup associatedwith E . Then as N ââ
T 2N
t â Tt strongly in L2
sinceE2N E
in the sense of quadratic forms, [RS80], Theorem S.16. (Note that âȘNâNS12N is dense in Dom(E).)
6.3 Dirichlet Form and Stochastic Dynamics on G1 and P
In order to define the derivative of a function u : G1 â R we regard it as a function u on G withthe property u(g) = u(g Ξz) for all z â S1. This implies that DÏu(g) = (DÏu)(g Ξz) wheneverone of these expressions is well-defined. In other words, DÏu defines a function on G1 which willbe denoted by DÏu and called the directional derivative of u along Ï.
Corollary 6.10. (i) Under assumption (6.8), with the notations from above,
E(u, u) =ââk=1
â«G1
|DÏku|2 dQ.
defines a regular, strongly local, recurrent Dirichlet form on L2(G1,Q).(ii) The Markov process on G analyzed in the previous section extends to a (continuous, re-versible) Markov process on G1.
In order to see the second claim, let g, g â G with g = g Ξz for some z â S1. Then obviously,
gt(., Ï) = ηt(g(.), Ï) = ηt(g(.+ z), Ï) = gt(., Ï) Ξz.
Moreover,Pg = Pg
since VÏ(g Ξz) = VÏ(g) for all Ï under consideration and all z â S1.
The objects considered previously â derivative, Dirichlet form and Markov process on G1 â havecanonical counterparts on P. The key to these new objects is the bijective map Ï : G1 â P.The flow generated by a smooth âtangent vectorâ Ï : S1 â R through the point ” â P will begiven by ((etÏ)â”)tâR. In these terms, the directional derivative of a function u : P â R at thepoint ” â P in direction Ï â Câ(S1,R) can be expressed as
DÏu(”) = limtâ0
1t
[u((etÏ)â”)â u(”)] ,
provided this limit exists. The adjoint operator to DÏ in L2(P,P) is given (on a suitable densesubspace) by
DâÏu(”) = âDÏ(”)â VÏ(Ïâ1(”)) · u(”).
37
The drift term can be represented as
VÏ(Ïâ1(”)) = ÎČ
â« 1
0ÏâČ(s)”(ds) +
âIâgaps(”)
[ÏâČ(Iâ) + ÏâČ(I+)
2â Ï(I+)â Ï(Iâ)
|I|
].
Given a sequence Ί = (Ïk)kâN of smooth functions on S1 satisfying (6.8), we obtain a (regular,strongly local, recurrent) Dirichlet form E on L2(P,P) by
E(u, u) =âk
â«P|DÏk
u(”)|2dP(”). (6.19)
It is the image of the Dirichlet form defined in (6.7) under the map Ï. The generator of E isgiven on an appropriate dense subspace of L2(P,P) by
L = âââk=1
DâÏkDÏk
. (6.20)
For P-a.e. ”0 â P, the associated Markov process (”t)tâ„0 on P starting in ”0 is given as
”t(Ï) = gt(Ï)âLeb
where (gt)tâ„0 is the process on G, starting in g0 := Ïâ1(”0). (As mentioned before, (gt)tâ„0 admitsa more direct construction provided we restrict ourselves to a finite sequence Ί = (Ïk)k=1,...,n.)
6.4 Dirichlet Form and Stochastic Dynamics on G0 and P0
For s > 0 and Ï : [0, 1] â R let the Sobolev norm âÏâHs be defined as in (6.2) and let Hs0([0, 1])
denote the closure of Câc (]0, 1[), the space of smooth Ï : [0, 1] â R with compact supportin ]0, 1[. If s â„ 1/2 (which is the only case we are interested in) Hs
0([0, 1]) can be identifiedwith Ï â Hs([0, 1]) : Ï(0) = Ï(1) = 0 or equivalently with Ï â Hs(S1) : Ï(0) = 0.For the sequel, fix s > 1/2 and a complete orthonormal basis Ί = ÏkkâN of Hs
0([0, 1]) withC := â
âk Ï
2kââ <â, and define
E0(u, u) =ââk=1
â«G0
|DÏk,0u(g)|2 dQ0(g).
Corollary 6.11. (E0,S1(G0)), (E0,Z
1(G0)) and (E0,C1(G0)) are closable. Their closures coincide
and define a regular, strongly local, recurrent Dirichlet form (E0,Dom(E0)) on L2(G0,Q0).
Proof. For the closability (and the equivalence of the respective closures) of (E0,S1(G0)) and
(E0,Z1(G0)), see the proof of Theorem 6.2. Also all the assertions on the closure are deduced in
the same manner. For the closability of (E0,C1(G0)) (and the equivalence of its closure with the
previously defined closures), see the proof of Theorem 7.8 below.
As explained in the previous subsection, these objects (invariant measure, derivative, Dirichletform and Markov process) on G0 have canonical counterparts on P0 defined by means of thebijective map Ï : G0 â P0.
7 The Canonical Dirichlet Form on the Wasserstein Space
7.1 Tangent Spaces and Gradients
The aim of this chapter is to construct a canonical Dirichlet form on the L2-Wasserstein spaceP0. Due to the isometry Ï : G0 â P0 this is equivalent to construct a canonical Dirichlet formon the metric space (G0, â.âL2). This can be realized in two geometric settings which seem to becompletely different:
38
âą Like in the preceding two chapters, G0 can be considered as a group, with composition offunctions as group operation. The tangent space TgG0 is the closure (w.r.t. some norm)of the space of smooth functions Ï : [0, 1] â R with Ï(0) = Ï(1) = 0. Such a function Ïinduces a flow on G0 by (g, t) 7â etÏ g â g + t Ï g and it defines a directional derivativeby DÏu(g) = limtâ0
1t [u(etÏ g)â u(g)] for u : G0 â R. The norm on TgG0 we now choose
to be âÏâTg := (â«Ï(gs)2ds)1/2. That is,
TgG0 := L2([0, 1], gâLeb).
For given u and g as above, a gradient Du(g) â TgG0 exists with
DÏu(g) = ăDu(g), ÏăTg (âÏ â Tg)
if and only if supÏDÏu(g)âÏgâL2
<â.
âą Alternatively, we can regard G0 as a closed subset of the space L2([0, 1],Leb). The lin-ear structure of the latter (with the pointwise addition of functions as group operation)suggests to choose as tangent space
TgG0 := L2([0, 1],Leb).
An element f â TgG0 induces a flow by (g, t) 7â g+tf and it defines a directional derivative(âFrechet derivativeâ) by Dfu(g) = limtâ0
1t [u(g + tf) â u(g)] for u : G0 â R, provided u
extends to a neighborhood of G0 in L2([0, 1],Leb) or the flow (induced by f) stays withinG0. A gradient Du(g) â TgG0 exists with
Dfu(g) = ăDu(g), făL2 (âÏ â L2)
if and only if supfDfu(g)âfâL2
<â. In this case, Du(g) is the usual L2-gradient.
Fortunately, both geometric settings lead to the same result.
Lemma 7.1. (i) For each g â G0, the map Îčg : Ï 7â Ï g defines an isometric embeddingof TgG0 = L2([0, 1], gâLeb) into TgG0 = L2([0, 1],Leb). For each (smooth) cylinder functionu : G0 â R
DÏu(g) = DÏgu(g).
If Du â L2(Leb) exists then Du â L2(gâLeb) also exists.(ii) For Q0-a.e. g â G0, the above map Îčg : TgG0 â TgG0 is even bijective. For each u as aboveDu(g) = Du(g) gâ1 and
âDu(g)âTg = âDu(g)âTg .
Proof. (i) is obvious, (ii) follows from the fact that for Q0-a.e. g â G0 the generalized inversegâ1 is continuous and thus gâ1(gt) = t for all t (see sections 3.5 and 2.1). Hence, the mapÎčg : TgG0 â TgG0 is surjective: for each f â TgG0
Îčg(f gâ1) = f gâ1 g = f.
Example 7.2. (i) For each u â Z1(G0) of the form u(g) = U(â« 10 ~α(gt)dt) with U â C1(Rm,R)
and ~α = (α1, . . . , αm) â C1([0, 1],Rm), the gradients Du(g) â TgG0 = L2([0, 1], gâLeb) andDu(g) â TgG0 = L2([0, 1],Leb) exist:
Du(g) =mâi=1
âiU(â«~α(gt)dt) · αâČi(g(.)), Du(g) =
mâi=1
âiU(â«~α(gt)dt) · αâČi(.)
39
and their norms coincide:
âDu(g)â2Tg
= âDu(g)â2Tg
=â« 1
0
âŁâŁâŁâŁâŁmâi=1
âiU(â«~α(gt)dt) · αâČi(g(s))
âŁâŁâŁâŁâŁ2
ds.
(ii) For each u â C1(G0) of the form u(g) = U(â« 10~f(t)g(t)dt) with U â C1(Rm,R) and ~f =
(f1, . . . , fm) â L2([0, 1],Rm), the gradient
Du(g) =mâi=1
âiU(â«~f(t)g(t)dt) · αi(.) â L2([0, 1],Leb)
exists and
âDu(g)â2Tg
=â« 1
0
âŁâŁâŁâŁâŁmâi=1
âiU(â«~f(t)g(t)dt) · fi(s)
âŁâŁâŁâŁâŁ2
ds.
For u â C1(G0) âȘ Z1(G0), the gradient Du can be regarded as a map G0 Ă [0, 1] â R, (g, t) 7âDu(g)(t). More precisely,
D : C1(G0) âȘ Z1(G0) â L2(G0 Ă [0, 1],Q0 â Leb).
Proposition 7.3. The operator D : Z1(G0) â L2(G0Ă[0, 1],Q0âLeb) is closable in L2(G0,Q0).
Proof. LetW â L2(G0Ă[0, 1],Q0âLeb) be of the formW (g) = w(g)·Ï(gt) with some w â Z1(G0)and some Ï â Câ([0, 1]) satisfying Ï(0) = Ï(1) = 0. Then according to the integration by partsformula for each u â Z1(G0) with u(g) = U(
â« 10 ~α(gs)ds)â«
G0Ă[0,1]Du ·W d(Q0 â Leb) =
â«G0
â« 1
0
mâi=1
âiU(â«~α(gs)ds)αâČi(gt)w(g)Ï(gt)dtdQ0(g)
=â«G0
DÏu(g)w(g) dQ0(g) =â«G0
u(g)DâÏw(g) dQ0(g).
To prove the closability of D, consider a sequence (un)n in Z1(G0) with un â 0 in L2(Q0) andDun â V in L2(Q0 â Leb). Thenâ«
V ·W d(Q0 â Leb) = limn
â«Dun ·W d(Q0 â Leb) = lim
n
â«unD
âÏw dQ0 = 0 (7.1)
for all W as above. The linear hull of the latter is dense in L2(Q0 â Leb). Hence, (7.1) impliesV = 0 which proves the closability of D.
The closure of (D,Z1(G0)) will be denoted by (D,Dom(D). Note that a priori it is not clearwhether D coincides with D on C1(G0). (See, however, Theorem 7.8 below.)
7.2 The Dirichlet Form
Definition 7.4. For u, v â Z1(G0) âȘ C1(G0) we define the âWasserstein Dirichlet integralâ
E(u, v) =â«G0
ăDu(g),Dv(g)ăL2 dQ0(g). (7.2)
Theorem 7.5. (i) (E,Z1(G0)) is closable. Its closure (E,Dom(E)) is a regular, recurrentDirichlet form on L2(G0,Q0).Dom(E) = Dom(D) and for all u, v â Dom(D)
E(u, v) =â«G0Ă[0,1]
Du · Dv d(Q0 â Leb).
40
(ii) The set Zâ0 (G0) of all cylinder functions u â Zâ(G0) of the form u(g) = U(â«~α(gs)ds) with
U â Câ(Rm,R) and ~α = (α1, . . . , αm) â Câ([0, 1],Rm) satisfying αâČi(0) = αâČi(1) = 0 is a corefor (E,Dom(E)).(iii) The generator (L,Dom(L) of (E,Dom(E)) is the Friedrichs extension of the operator(L,Zâ0 (G0) given by
Lu(g) = âmâi=1
Dâαiui(g)
=mâ
i,j=1
âiâjU(â«~α(gs)ds
)·⫠1
0αâČi(gs)α
âČj(gs)ds +
mâi=1
âiU(â«~α(gs)ds
)· V ÎČ
αâČi(g)
where ui(g) := âiU(â«~α(gs)ds) and V ÎČ
αâČi(g) denotes the drift term defined in section 5.1 with
Ï = αâČi; ÎČ > 0 is the parameter of the entropic measure fixed throughout the whole chapter.(iv) The Dirichlet form (E,Dom(E)) has a square field operator given by
Î(u, v) := ăDu,DvăL2(Leb) â L1(G0,Q0)
with Dom(Î) = Dom(E) â© Lâ(G0,Q0). That is, for all u, v, w â Dom(E) â© Lâ(G0,Q0)
2â«w · Î(u, v) dQ0 = E(u, vw) + E(uw, v)â E(uv,w). (7.3)
Proof. (a) The closability of the form (E,Z1(G0)) follows immediately from the previous Propo-sition 7.3. Alternatively, we can deduce it from assertion (iii) which we are going to prove first.(b) Our first claim is that E(u,w) = â
â«u · Lw dQ0 for all u,w â Zâ0 (G0). Let u(g) =
U(â«~α(gs)ds) and w(g) = W (
â«~Îł(gs)ds) with U,W â Câ(Rm,R) and ~α = (α1, . . . , αm), ~Îł =
(Îł1, . . . , Îłm) â Câ([0, 1],Rm) satisfying αâČi(0) = αâČi(1) = ÎłâČi(0) = ÎłâČi(1) = 0. Observe that
ăDu(g),Dw(g)ăL2 =mâ
i,j=1
âiU(â«~α(gs)ds) · âjW (
â«~Îł(gs)ds) ·
â« 1
0αâČi(gs)Îł
âČj(gs)ds
=mâi=1
ui(g) ·DαâČiw(g).
Hence, according to the integration by parts formula from Proposition 5.10
E(u,w) =â«G0
ăDu(g),Dw(g)ăL2 dQ0(g)
=mâi=1
â«G0
ui(g) ·DαâČiw(g) dQ0(g)
=mâi=1
â«G0
DâαâČiui(g) · w(g) dQ0(g)
= ââ«G0
Lu(g) · w(g) dQ0(g).
This proves our first claim. In particular, (L,Zâ0 (G0)) is a symmetric operator. Therefore, theform (E,Zâ0 (G0)) is closable and its generator coincides with the Friedrichs extension of L.(c) Now let us prove that Zâ0 (G0) is dense in Z1(G0). That is, let us prove that each functionu â Z1(G0) can be approximated by functions uΔ â Zâ0 (G0). For simplicity, assume that u is of theform u(g) = U(
â«Î±(gs)ds) with U â C1(R) and α â C1([0, 1]). (That is, for simplicity, m = 1.)
Let UΔ â Câ(R) for Δ > 0 be smooth approximations of U with âU â UΔââ + âU âČ â U âČΔââ â 0
41
as Δ â 0 and let αΔ â Câ(R) with αâČΔ(0) = αâČΔ(1) = 0 be smooth approximations of α withâαâαΔââ â 0 and αâČΔ(t) â αâČ(t) for all t â]0, 1[ as Δâ 0. Moreover, assume that supΔ âαâČââ <â.Define uΔ â Zâ0 (G0) as uΔ(g) = UΔ(
â«Î±Î”(gs)ds). Then uΔ â u in L2(G0,Q0) by dominated
convergence relative Q0.Since
supΔ
supgâG
(U âČΔ(â«Î±Î”(g(s))ds)
)2â«[0,1]α
âČΔ(gs)
2ds †C,
(αâČΔ)2(g(s)) Δâ0ââ αâČ(gs)2 âs â [0, 1] \
(g = 0 â© g = 1
),
and[0, 1] \
(g = 0 â© g = 1
)=]0, 1[ for Q0-almost all g â G0
one finds by dominated convergence in L2([0, 1],Leb), for Q0-almost all g â G0(U âČΔ(â«Î±Î”(gs)ds)
)2â«[0,1]α
âČΔ(gs)
2dsΔâ0ââ
(U âČ(â«Î±(gs)ds)
)2â«[0,1]α
âČ(gs)2ds.
Hence also with
E(uΔ, uΔ) =â«G0
(U âČΔ(â«Î±Î”(gs)ds)
)2 · ⫠αâČΔ(gs)2dsQ0(dg)
Δâ0âââ«G0
(U âČ(â«Î±(gs)ds)
)2 · ⫠αâČ(gs)2dsQ0(dg)
by dominated convergence in L2(G0,Q0). In particular, uΔΔ constitutes a Cauchy sequencerelative to the norm âvâ2
E,1 := âvâ2L2(G,Q) + E(v, v). In fact, since the sequence uΔ is uniformly
bounded w.r.t. to â.âE,1, by weak compactness there is a weakly converging subsequence in(Dom(E), â.âE,1). Since the associated norms converge, the convergence is actually strong in(Dom(E), â.âE,1). Moreover, since uΔ â u in L2(G0,Q0), this limit is unique. Hence the entiresequence converges to u â (Dom(E), â.âE,1), such that in particular E(u, u) = limΔâ0 E(uΔ, uΔ).This proves our second claim. In particular, it implies that also (E,Z1(G0)) is closable and thatthe closures of Zâ0 (G0) and Z1(G0) coincide.(d) Obviously, (E,Dom(E)) has the Markovian property. Hence, it is a Dirichlet form. Sincethe constant functions belong to Dom(E), the form is recurrent. Finally, the set Z1(G0) is densein (C(G0), â.ââ) according to the theorem of Stone-Weierstrass since it separates the points inthe compact metric space G0. Hence, (E,Dom(E)) is regular.(e) According to Leibnizâ rule, (7.3) holds true for all u, v, w â Z1(G0). Arbitrary u, v, w âDom(E)â©Lâ(G0,Q0) can be approximated in (E(.) + â.â2)1/2 by un, vn, wn â Z1(G0) which areuniformly bounded on G0. Then unvn â uv, unwn â uw and vnwn â vw in (E(.) + â.â2)1/2.Moreover, we may assume that wn â w Q0-a.e. on G0 and thusâ«|wÎ(u, v)â wnÎ(un, vn)| dQ0 â€
â«|wâwn|Î(u, v)dQ0 +
â«|wn| · |Î(u, v)âÎ(un, vn)|dQ0 â 0
by dominated convergence. Hence, (7.3) carries over from Z1(G0) to Dom(E) â©Lâ(G0,Q0).
Lemma 7.6. For each f â G0 the function u : g 7â ăf, găL2 belongs to Dom(E).
Proof. (a) For f, g â G0 put ”f = fâLeb and ”g = gâLeb. Recall that by Kantorovich duality
12âf â gâ2
L2 =12d2W (”f , ”g)
= supÏ,Ï
â« 1
0Ïd”f +
â« 1
0Ïd”g
= sup
Ï,Ï
â« 1
0Ï(ft)dt+
â« 1
0Ï(gt)dt
42
where the supÏ,Ï is taken over all (smooth, bounded) Ï â L1([0, 1], ”f ), Ï â L1([0, 1], ”g)satisfying Ï(x) + Ï(y) †1
2 |x â y|2 for ”f -a.e. x and ”g-a.e. y in [0, 1]. Replacing Ï(x) by|x|2/2â Ï(x) (and Ï(y) by . . .) this can be restated as
ăf, găL2 = infÏ,Ï
â« 1
0Ï(ft)dt+
â« 1
0Ï(gt)dt
(7.4)
where the infÏ,Ï now is taken over all (smooth, bounded) Ï â L1([0, 1], ”f ), Ï â L1([0, 1], ”g)satisfying Ï(x) + Ï(y) â„ ăx, yă for ”f -a.e. x and ”g-a.e. y in [0, 1]. If g is strictly increasingthen Ï can be chosen as
ÏâČ = f gâ1,
cf. [Vil03], sect. 2.1 and 2.2.(b) Now fix a countable dense set gnnâN of strictly increasing functions in G0 and an arbitraryfunction f â G0. Let (Ïn, Ïn) denote a minimizing pair for (f, gn) in (7.4) and define un : G0 â Rby
un(g) := mini=1,...,n
â« 1
0Ï(fi(t))dt+
â« 1
0Ïi(g(t))dt
.
Note that ÏâČi = f gâ1i and thus un(gi) = ăf, giă for all i = 1, . . . , n. Therefore,
|un(g)â un(g)| †maxi
â« 1
0|Ïi(g(t))dtâ Ïi(g(t))|dt †max
iâÏâČiââ ·
â« 1
0|g(t)â g(t)|dt †âg â gâL1
for all g, g â G0. Hence, un â u pointwise on G0 and in L2(G0,Q0) where u(g) := ăf, gă.(c) The function un is in the class Z0(G0):
un(g) = Un(â«~α(gt)dt
)with Un(x1, . . . , xn) = minc1 + x1, . . . , cn + xn, ci =
â«Ïi(f(t))dt and αi = Ïi. The function
Un can be easily approximated by C1 functions in order to verify that un â Dom(E) and
Dun(g) =nâi=1
1Ai(g) · ÏâČi(g(.))
with a suitable disjoint decomposition G0 = âȘiAi. (More precisely, Ai denotes the set of allg â G0 satisfying
â« 10 Ï(fi(t))dt +
â« 10 Ïi(g(t))dt <
â« 10 Ï(fj(t))dt +
â« 10 Ïj(g(t))dt for all j < i andâ« 1
0 Ï(fi(t))dt+â« 10 Ïi(g(t))dt â€
â« 10 Ï(fi(t))dt+
â« 10 Ïi(g(t))dt for all j > i.) Thus
âDun(g)â2 =âi
1Ai(g) ·⫠1
0ÏâČi(g(t))
2dt
andE(un) †max
iâ€n
â«G0
âÏâČi gâ2L2dQ0(g).
In particular, since |ÏâČi| †1,supn
E(un) †1
and thus u â Dom(E).
Lemma 7.7. For all u â Z1(G0) and all w â C1(G0) â©Dom(E)
E(u,w) =â«G0
ăDu(g),Dw(g)ăL2dQ0(g) (7.5)
(with Du(g) and Dw(g) given explicitly as in Example 7.2).
43
Proof. Recall that for u â Zâ0 (G0) of the form u(g) = U(â«~α(gt)dt)
Lu(g) =mâi=1
DâαâČiui(g)
with ui(g) = âiU(â«~α(gt)dt). Hence, for w â C1(G0) of the form w(g) = W (ă~h, gă)
E(u,w) = ââ«
Lu(g)w(g) dQ0(g)
=mâi=1
â«G0
DâαâČiui(g)w(g) dQ0(g) =
mâi=1
â«G0
ui(g)DαâČiui(g)w(g) dQ0(g)
=mâ
i,j=1
â«G0
âiU(â«~α(gt)dt) · âjW (
â«~h(t)g(t)dt) ·
â«Î±âČi(g(t))hj(t)dt dQ0(g)
=â«G0
ăDu(g),Dw(g)ădQ0(g).
This proves the claim provided u â Zâ0 (G0). By density this extends to all u â Z1(G0).
Theorem 7.8. (i) (E,C1(G0)) is closable and its closure coincides with (E,Dom(E)). Similarly,(D,C1(G0)) is closable and its closure coincides with (D,Dom(D)).(ii) For all u,w â Z1(G0) âȘ C1(G0)
Î(u,w)(g) = ăDu(g),Dw(g)ăL2 , (7.6)
in particular, E(u,w) =â«G0ăDu(g),Dw(g)ăL2dQ0(g) (with Du(g) and Dw(g) given explicitly as
in Example 7.2).(iii) For each f â G0 the function uf : g 7â âf â gâL2 belongs to Dom(E) and Î(uf , uf ) †1Q0-a.e. on G0.(iv) (E,Dom(E)) is strongly local.
Proof. (a) Claim: For each f â L2([0, 1],Leb) the function uf : g 7â ăf, găL2 belongs to Dom(E)and E(uf , uf ) = âfâ2
L2.Indeed, if f â L2 â© C1 then f = c0 + c1f1 + c2f2 with f1, f2 â G0 and c0, c1, c2 â R. Hence,uf â Dom(E) according to Lemma 7.6 and E(uf , uf ) =
â«âDufâ2dQ0 = âfâ2 according to
Lemma 7.7. Finally, each f â L2 can be approximated by fn â L2 â© C1 with âf â fnâ â 0.Hence, uf â Dom(E) and E(uf , uf ) = âfâ2.(b) Claim: C1(G0) â Dom(E).Let u â C1(G0) be given with u(g) = U(ă~f, gă), U â C1(Rm,R), ~f = (f1, . . . , fm) â L2([0, 1],Rm).For each i = 1, . . . ,m let (wi,n)nâN be an approximating sequence in (Z1(G0), (E + â.â2)1/2) forwi : g 7â ăfi, gă. Put un(g) = U(w1,n(g), . . . , wm,n(g)). Then un â Z1(G0), un â u pointwise onG0 and in L2(G0,Q0). Moreover,
E(un, un) =â«ââi
âiU(w1,n(g), . . . , wm,n(g))Dwi,n(g)â2L2 dQ0(g)
ââ«ââi
âiU(ăf1, gă, . . . , ăfm, gă)Dwi(g)â2L2 dQ0(g)
=â«âDu(g)â2dQ0(g).
Hence, u â Dom(E) and E(u, u) =â«âDu(g)â2dQ0(g).
(c) Assertion (ii) then follows via polarization and bi-linearity. Assertion (iii) is an immediateconsequence of assertion (ii). Assertion (iii) allows to prove the locality of the Dirichlet form(E,Dom(E)) in the same manner as in the proof of Theorem 6.2.
44
(d) Claim: C1(G0) is dense in Dom(E).We have to prove that each u â Z1(G0) can be approximated by un â C1(G0). As usual, it sufficesto treat the particular case u(g) =
â« 10 α(gt)dt for some α â C1([0, 1]). Put Un(x1, . . . , xn) =
1n
âni=1 α(xi) and fn,i(t) = n · 1[ iâ1
n, in
[(t). Then
un(g) := Un(ăfn,1, gă, . . . ăfn,n, gă) =1n
nâi=1
α
(n
â« in
iâ1n
gtdt
)
defines a sequence in C1(G0) with un(g) â u(g) pointwise on G0 and in L2(G0,Q0).Moreover,
Dun(g) =nâi=1
αâČ
(n
â« in
iâ1n
gtdt
)· 1[ iâ1
n, in
[(.) (7.7)
and therefore
E(un) =â«G0
1n
nâi=1
αâČ
(n
â« in
iâ1n
gtdt
)2
dQ0(g) âââ«G0
â« 1
0αâČ(gt)2dtdQ0(g) = E(u). (7.8)
Thus (un)n is Cauchy in Dom(E) and un â u in Dom(E).
7.3 Rademacher Property and Intrinsic Metric
We say that a function u : G0 â R is 1-Lipschitz if
|u(g)â u(h)| †âg â hâL2 (âg, h â G0).
Theorem 7.9. Every 1-Lipschitz function u on G0 belongs to Dom(E) and Î(u, u) †1 Q0-a.e.on G0.
Before proving the theorem in full generality, let us first consider the following particular case.
Lemma 7.10. Given n â N, let h1, . . . , hn be a orthonormal system in L2([0, 1],Leb) and letU be a 1-Lipschitz function on Rn. Then the function u(g) = U(ăh1, gă, . . . , ăhn, gă) belongs toDom(E) and Î(u, u) †1 Q0-a.e. on G0.
Proof. Let us first assume that in addition U is C1. Then according to Theorem 7.8, u is inDom(E) and Du(g) =
âni=1 âiU(ă~h, gă) · hi. Thus
Î(u, u)(g) = âDu(g)âL2 =nâi=1
|âiU(ă~h, gă)|2 †1.
In the case of a general 1-Lipschitz continuous U on Rn we choose an approximating sequenceof 1-Lipschitz functions Uk, k â N, in C1(Rn) with Uk â U uniformly on Rn and put uk(g) =Uk((ă~h, gă) for g â G0. Then uk â u pointwise and in L2(G0,Q0). Hence, u â Dom(E) andÎ(u, u) †1 Q0-a.e. on G0.
Proof of Theorem 7.9. Every 1-Lipschitz function u on G0 can be extended to a 1-Lipschitzfunction u on L2([0, 1],Leb) (âKirszbraun extensionâ). Hence, without restriction, assume thatu is a 1-Lipschitz function on L2([0, 1],Leb). Choose a complete orthonormal system hiiâN ofthe separable Hilbert space L2([0, 1],Leb) and define for each n â N the function Un : Rn â Rby
Un(x1, . . . , xn) = u
(nâi=1
xihi
)
45
for x = (x1, . . . , xn) â Rn. This function Un is 1-Lipschitz on Rn:
|Un(x)â Un(y)| â€
â„â„â„â„â„nâi=1
xihi ânâi=1
yihi
â„â„â„â„â„L2
†|xâ y|.
Hence, according to the previous Lemma the function
un(g) = Un(ăh1, gă, . . . , ăhn, gă)
belongs belongs to Dom(E) and Î(un, un) †1 Q0-a.e. on G0.Note that
un(g) = u
(nâi=1
ăhi, găhi
)for each g â L2([0, 1],Leb). Therefore, un â u on L2([0, 1],Leb) since
âni=1ăhi, găhi â g
on L2([0, 1],Leb) and since u is continuous on L2([0, 1],Leb). Thus, finally, u â Dom(E) andÎ(u, u) †1 Q0-a.e. on G0.
Our next goal is the converse to the previous Theorem.
Theorem 7.11. Every continuous function u â Dom(E) with Î(u, u) †1 Q0-a.e. on G0 is1-Lipschitz on G0.
Lemma 7.12. For each u â C1(G0) âȘ Z1(G0) and all g0, g1 â G0
u(g1)â u(g0) =â« 1
0ăDu ((1â t)g0 + tg1) , g1 â g0ăL2dt. (7.9)
Proof. Put gt = (1â t)g0 + tg1 and consider the C1 function η : [0, 1] â R defined by ηt = u(gt).Then
ηt = Dg1âg0u(gt) = ăDu(gt), g1 â g0ă
and thus
η1 â η0 =â« 1
0ηtdt =
â« 1
0ăDu(gt), g1 â g0ădt.
Lemma 7.13. Let g0, g1 â G0 â©C3 and put gt = (1â t)g0 + tg1. Then for each u â Dom(E) andeach bounded measurable Κ : G0 â Râ«
G0
[u(g1 h)â u(g0 h)]Κ(h) dQ0(h) =â« 1
0
â«G0
ăDu(gt h, (g1 â g0) hăΚ(h) Q0(h)dt. (7.10)
Proof. Given g0, g1, Κ and u â Dom(E) as above, choose an approximating sequence in Z1(G0)âȘC1(G0) with un â u in Dom(E) as nââ. According to the previous Lemma for each nâ«
G0
[un(g1h)âun(g0h)]Κ(h) dQ0(h) =â« 1
0
â«G0
ăDun (gt h) , (g1âg0)hăΚ(h) dQ0(h)dt. (7.11)
By assumption un â u in L2(G0,Q0) and Dun â Du in L2(G0 Ă [0, 1],Q0 â Leb) as n â â.Using the quasi-invariance of Q0 (Theorem 4.3) this impliesâ«
G0
|u(gt h)â un(gt h)|Κ(h) dQ0(h) =â«G0
[u(h)â un(h)|Κ(gâ1t h) · Y ÎČ
gâ1t
(h) dQ0(h) â 0
46
as nââ as well asâ«G0
âDu(gt h)â Dun(gt h)|2L2Κ(h) Q0(h)
=â«G0
âDu(h)â Dun(h)|2L2Κ(gâ1t h) · Y ÎČ
gâ1t
(h) Q0(h) â 0
Hence, we may pass to the limit nââ in (7.11) which yields the claim.
Proof of Theorem 7.11. Let a continuous u â Dom(E) be given with Î(u, u) †1 Q0-a.e. on G0.We want to prove that u(g1)â u(g0) †âg1 â g0âL2 for all g0, g1 â G0. By density of G0 â© C3 inG0 and by continuity of u it suffices to prove the claim for g0, g1 â G0 â© C3.Choose a sequence of bounded measurable Κk : G0 â R+ such that the probability measuresΚkdQ0 on G0 converge weakly to ÎŽe, the Dirac mass in the identity map e â G0. Then accordingto the previous Lemma and the assumption âDuâ †1â«
G0
[u(g1 h)â u(g0 h)]Κk((h)dQ0(h)
=â« 1
0
â«G0
ăDu(gt h, (g1 â g0) hăΚk(h) dQ0(h)dt
â€â« 1
0
â«G0
âDu(gt h)âL2 · â(g1 â g0) hâL2 ·Κk(h) dQ0(h)dt
â€â«G0
â(g1 â g0) hâL2 ·Κk(h) dQ0(h).
Now the integrands on both sides, h 7â u(g1 h)âu(g0 h) as well as h 7â â(g1â g0) hâL2 , arecontinuous in h â G0. Hence, as k ââ by weak convergence ΚkdQ0 â ÎŽe we obtain
u(g1)â u(g0) †âg1 â g0âL2 .
Corollary 7.14. The intrinsic metric for the Dirichlet form (E,Dom(E)) is the L2-metric:
âg1 â g0âL2 = sup u(g1)â u(g0) : u â C(G0) â©Dom(E), Î(u, u) †1 Q0-a.e. on G0
for all g0, g1 â G0.
7.4 Finite Dimensional Noise Approximations
The goal of this section is to present representations â and finite dimensional approximations âof the Dirichlet form
E(u, v) =â«G0
ăDu(g),Dv(g)ăL2 dQ0(g)
in terms of globally defined vector fields.If (Ïi)iâN is a complete orthonormal system in Tg = L2([0, 1], gâLeb) for a given g â G0 thenobviously
ăDu(g),Dv(g)ăL2 =ââi=1
DÏiu(g)DÏiv(g). (7.12)
Unfortunately, however, there exists no family (Ïi)iâN which is simultaneously orthonormal inall Tg = L2([0, 1], gâLeb), g â G0. For a general family, the representation (7.12) should bereplaced by
ăDu(g),Dv(g)ăL2 =ââ
i,j=1
DÏiu(g) · aij(g) ·DÏjv(g) (7.13)
47
where a(g) = (aij(g))i,jâN is the âgeneralized inverseâ to Ί(g) = (Ίij(g))i,jâN with
Ίij(g) := ăÏi, ÏjăTg =â« 1
0Ïi(gt)Ïj(gt)dt.
In order to make these concepts rigorous, we have to introduce some notations.
For fixed n â N let S+(n) â RnĂn denote the set of symmetric nonnegative definite real (nĂn)-matrices. For each A â S+(n) a unique element Aâ1 â S+(n), called generalized inverse to A,is defined by
Aâ1x :=
0 if x â Ker(A),y if x â Ran(A) with x = Ay
This definition makes sense since (by the symmetry of A) we have an orthogonal decompositionRn = Ker(A)â Ran(A). Obviously,
Aâ1 ·A = A ·Aâ1 = ÏA
where ÏA denotes the projection onto Ran(A).Moreover, for each A â S+(n) there exists a unique element A1/2 â S+(n), called nonnegativesquare root of A, satisfying
A1/2 ·A1/2 = A.
Let Κ(n) denote the map A 7â Aâ1, regarded as a map from S+(n) â RnĂn to RnĂn, withΚ(n)ij (A) = (Aâ1)ij for i, j = 1, . . . , n. Similarly, put
Î(n) : S+(n) â S+(n), A 7â (A1/2)â1 = (Aâ1)1/2.
Note that Κ(n)(A) = Î(n)(A) · Î(n)(A) for all A â S+(n).The maps Κ(n) and Î(n) are smooth on the subset of positive definite matrices A â S+(n) butunfortunately not on the whole set S+(n). However, they can be approximated from below(in the sense of quadratic forms) by smooth maps: there exists a sequence of Câ maps Î(n,l) :RnĂn â RnĂn with
Ο · Î(n,k)(A) · Ο †Ο · Î(n,l)(A) · Ο
for all A â S+(n), Ο â Rn and all k, l â N with k †l and
Î(n,l)ij (A) â Î(n)
ij (A) = (Aâ1/2)ij
for all A â S+(n), i, j â 1, . . . , n as lââ. Put Κ(n,l)(A) = Î(n,l)(A) ·Î(n,l)(A) for A â RnĂn.Then the sequence (Κ(n,l))lâN approximates Κ(n) from below in the sense of quadratic forms.
Now let us choose a family ÏiiâN of smooth functions Ïi : [0, 1] â R which is total in C0([0, 1])w.r.t. uniform convergence (i.e. its linear hull is dense). Put
Ίij(g) := ăÏi, ÏjăTg =â« 1
0Ïi(gx)Ïj(gx)dx
anda
(n,l)ij (g) = Κ(n,l)
ij (Ί(g)) , Ï(n,l)ij (g) = Î(n,l)
ij (Ί(g)).
Note that the maps g 7â a(n,l)ij (g) and g 7â Ï
(n,l)ij (g) (for each choice of n, l, i, j) belong to the
class Zâ(G0). Moreover, puta
(n)ij (g) = Κ(n)
ij (Ί(g)) .
48
Then obviously the orthogonal projection Ïn onto the linear span of Ï1, . . . , Ïn â Tg =L2([0, 1], gâLeb) is given by
Ïnu =nâ
i,j=1
a(n)ij (g) · ău, ÏiăTg · Ïj
and
ăÏnu, ÏnvăTg =nâ
i,j=1
ău, ÏiăTg · a(n)ij (g) · ăv, ÏjăTg
for all u, v â Tg.
Theorem 7.15. (i) For each n, l â N the form (E(n,l),Z1(G0)) with
E(n,l)(u, v) =nâ
i,j=1
â«G0
DÏiu(g) · a(n,l)ij (g) ·DÏjv(g) dQ0(g)
is closable. Its closure is a Dirichlet form with generator being the Friedrichs extension of thesymmetric operator (L(n,l),Z2(G0)) given by
L(n,l) =nâ
i,j=1
a(n,l)ij ·DÏiDÏj +
nâi,j=1
[DÏia
(n,l)ij + a
(n,l)ij · V ÎČ
Ïi
]DÏj . (7.14)
(ii) As lââE(n,l) E(n)
where
E(n)(u, v) =nâ
i,j=1
â«G0
DÏiu(g) · a(n)ij (g) ·DÏjv(g) dQ0(g).
for u, v â Z1(G0). Hence, in particular, E(n) is a Dirichlet form.(iii) As nââ
E(n) E
(which provides an alternative proof for the closability of the form (E,Z1(G0))).
Proof. (i) The function a(n,l)i,j on G0 is a cylinder function in the class Z1(G0). The integration
by parts formula for the DÏi , therefore, implies that for all u, v â Z2(G0)
E(n,l)(u, v) =âi,j
â«DÏiu(g)DÏjv(g)a
(n,l)ij (g)dQ0(g)
=âi,j
â«u(g) ·Dâ
Ïi
(a
(n,l)ij DÏjv
)(g) dQ0(g) = â
â«u(g) · L(n,l)v(g) dQ0(g).
with
L(n,l) = ânâ
i,j=1
DâÏi
(a
(n,l)ij DÏj
).
Hence, (E(n,l),Z2(G0)) is closable and the generator of its closure is the Friedrichs extension of(L(n,l),Z2(G0)).(ii) The monotone convergence E(n,l) E(n) of the quadratic forms is an immediate consequenceof the fact that a(n,l)(g) a(n)(g) (in the sense of symmetric matrices) for each g â G0 which inturn follows from the defining properties of the approximations Κ(n,l) of the generalized inverseΚ(n).
49
The limit of an increasing sequence of Dirichlet forms is itself again a Dirichlet form provided itis densely defined which in our case is guaranteed since it is finite on Z2(G0).(iii) Obviously, the En, n â N constitute an increasing sequence of Dirichlet forms with En †Efor all n. Moreover, Z1(G0) is a core for all the forms under consideration. Hence, it suffices toprove that for each u â Z1(G0) and each Δ > 0 there exists an n â N such thatâŁâŁâŁE(n)(u, u)â E(u, u)
âŁâŁâŁ †Δ.
To simplify notation, assume that u is of the form u(g) = U(â«Î±(gt)dt) for some U â C1
c (R)and some α â C1([0, 1]). By assumption, the set Ïi, i â N is total in C0([0, 1]) w.r.t. uniformconvergence. Hence, for each ÎŽ > 0 there exist n â N and Ï â span(Ï1, . . . , Ïn) with âαâČâÏâsup â€ÎŽ which implies
ăαâČ, ÏăTg
âÏâTg
â„ âÏâTg â ÎŽ â„ âαâČâTg â 2ÎŽ.
Thus
E(u, u) â„ E(n)(u, u) â„â«G0
U âČ(â«Î±(gt)dt)2 · ăαâČ, Ïă2Tg
· 1âÏâ2
Tg
dQ0(g)
â„â«G0
U âČ(â«Î±(gt)dt)2 ·
(âαâČâTg â 2ÎŽ
)2dQ0(g)
â„â«G0
U âČ(â«Î±(gt)dt)2 ·
(1
1 + ÎŽâαâČâ2
Tgâ 4ÎŽ
)dQ0(g)
â„ 11 + ÎŽ
E(u, u)â 4ÎŽâU âČâ2sup.
Hence, for ÎŽ sufficiently small, E(u, u) and E(n)(u, u) are arbitrarily close to each other.
Remark 7.16. For any given g0 â G0, let (gt)tâ„0 with gt : (x, Ï) 7â gxt (Ï) be the solution to theSDE
dgxt =nâ
i,j=1
Ï(n,l)ij (gt) · Ïj(gxt ) dW i
t
+12
nâi,j=1
a(n,l)ij (gt) · Ïj(gxt ) ·
(ÏâČi(g
xt ) + V ÎČ
Ïi(gt))dt
+12
nâi,j=1
nâk,m=1
âkmΚ(n,l)ij (Ί(gt)) · ă(ÏkÏm)âČ, ÏiăTg · Ïj(gxt )dt
where âkmΚ(n,l)ij for (k,m) â 1, . . . , n2 denotes the 1st order partial derivative of the function
Κ(n,l)ij : RnĂn â R with respect to the coordinate xkm. Then the generator of the process coincides
on Z2(G0) with the operator 12L(n,l) from (7.14), the generator of the Dirichlet form E(n,l).
Let us briefly comment on the various terms in the SDE from above:
âą The first one,ân
i,j=1 Ï(n,l)ij (gt) · Ïj(gxt ) dW i
t is the diffusion term, written in Ito form;
âą the second one, 12
âni,j=1 a
(n,l)ij (gt) ·Ïj(gxt ) ·ÏâČi(gxt )dt is a drift which comes from the trans-
formation between Stratonovich and Ito form (it would disappear if we wrote the diffusionterm in Stratonovich form).
50
âą The next one, 12
âni,j=1 a
(n,l)ij (gt) · Ïj(gxt ) · V
ÎČÏi(gt)dt is a drift which arises from our change
of variable formula. Actually, since
V ÎČÏi
(g) = ÎČ
â« 1
0ÏâČi(g(y))dy +
âaâJg
[ÏâČi(g(a+)) + ÏâČi(g(aâ))
2â Ïi(g(a+))â Ïi(g(aâ))
g(a+)â g(aâ)
],
it consists of two parts, one originates in the logarithmic derivative of the entropy of thegâs (which finally will force the process to evolve as a stochastic perturbation of the heatequation), the other one is created by the jumps of the gâs.
âą The last term, 12
âni,j=1
ânk,m=1 âkmΚ(n,l)
ij (Ί(gt)) · ă(ÏkÏm)âČ, ÏiăTg · Ïj(gxt )dt involves thederivative of the diffusion matrix. It arises from the fact that the generator is originallygiven in divergence form.
7.5 The Wasserstein Diffusion (”t) on P0
The objects considered previously â derivative, Dirichlet form and Markov process on G0 â havecanonical counterparts on P0. The key to these objects is the bijective map Ï : G0 â P0,g 7â gâLeb.
We denote by Zk(P0) the set of all (âcylinderâ) functions u : P0 â R which can be written as
u(”) = U
(â« 1
0α1d”, . . . ,
â« 1
0αmd”
)(7.15)
with some m â N, some U â Ck(Rm) and some ~α = (α1, . . . , αm) â Ck([0, 1],Rm) . The subsetof u â Zk(P0) with αâČi(0) = αâČi(1) = 0 for all i = 1, . . . ,m will be denoted by Zk0(P0). Foru â Z1(P0) represented as above we define its gradient Du(”) â L2([0, 1], ”) by
Du(”) =mâi=1
âiU(â«~αd”) · αâČi(.)
with norm
âDu(”)âL2(”) =
â« 1
0
âŁâŁâŁâŁâŁmâi=1
âiU(â«~αd”) · αâČi
âŁâŁâŁâŁâŁ2
d”
1/2
.
The tangent space at a given point ” â P0 can be identified with L2([0, 1], ”). The action of atangent vector Ï â L2([0, 1], ”) on ” (âexponential mapâ) is given by the push forward Ïâ”.
Theorem 7.17. (i) The image of the Dirichlet form defined in (7.2) under the map Ï is theregular, strongly local, recurrent Wasserstein Dirichlet form E on L2(P0,P0) defined on its coreZ1(P0) by
E(u, v) =â«P0
ăDu(”), Dv(”)ă2L2(”)dP0(”). (7.16)
The Dirichlet form has a square field operator, defined on Dom(E) â© Lâ, and given on Z1(P0)by
Î(u, v)(”) = ăDu(”), Dv(”)ă2L2(”).
The intrinsic metric for the Dirichlet form is the L2-Wasserstein distance dW . More precisely,a continuous function u : P0 â R is 1-Lipschitz w.r.t. the L2-Wasserstein distance if and onlyif it belongs to Dom(E) and Î(u, u)(”) †1 for P0-a.e. ” â P0.
51
(ii) The generator of the Dirichlet form is the Friedrichs extension of the symmetric operator(L,Z2
0(P0) on L2(P0,P0) given as L = L1 + L2 + ÎČ Â· L3 with
L1u(”) =mâ
i,j=1
âiâjU(â«~αd”) ·
â« 1
0αâČiα
âČjd”;
L2u(”) =mâi=1
âiU(â«~αd”) ·
âIâgaps(”)
[αâČâČi (Iâ) + αâČâČi (I+)
2â αâČi(I+)â αâČi(Iâ)
|I|
]â αâČâČi (0) + αâČâČi (1)
2
L3u(”) =
mâi=1
âiU(â«~αd”) ·
â« 1
0αâČâČi d”.
Recall that gaps(”) denotes the set of intervals I = ]Iâ, I+[â [0, 1] of maximal length with”(I) = 0 and |I| denotes the length of such an interval.(iii) For P0-a.e. ”0 â P0, the associated Markov process (”t)tâ„0 on P0 starting in ”0, calledWasserstein diffusion, with generator 1
2L is given as
”t(Ï) = gt(Ï)âLeb
where (gt)tâ„0 is the Markov process on G0 associated with the Dirichlet form of Theorem 7.5,starting in g0 := Ïâ1(”0).For each u â Z2
0(P0) the process
u(”t)â u(”0)â12
â« t
0Lu(”s)ds
is a martingale whenever the distribution of ”0 is chosen to be absolutely continuous w.r.t. theentropic measure P0. Its quadratic variation process is⫠t
0Î(u, u)(”s)ds.
Remark 7.18. L1 is the second order part (âdiffusion partâ) of the generator L, L2 and L3
are first order operators (âdrift partsâ). The operator L1 describes the diffusion on P0 in alldirections of the respective tangent spaces. This means that the process (”t) at each time t â„ 0experiences the full âtangentialâ L2([0, 1], ”t)-noise.L3 is the generator of the deterministic semigroup (âNeumann heat flowâ) (Ht)tâ„0 on L2(P0,P0)given by
Htu(”) = u(ht”).
Here ht is the heat kernel on [0, 1] with reflecting (âNeumannâ) boundary conditions and ht”(.) =â« 10 ht(., y)”(dy). Indeed, for each u â Z1
0(P0) given as u(g) = U(â«~αd”) we obtain Htu(”) =
U(â« â«
~α(x)ht(x, y)”(dy)dx)
and thus
âtHtu(”) =mâi=1
âiU(ht”) · âtâ« â«
αi(x)ht(x, y)”(dy)dx
=mâi=1
âiU(ht”) ·⫠â«
αi(x)hâČâČt (x, y)”(dy)dx
=mâi=1
âiU(ht”) ·⫠â«
αâČâČi (x)ht(x, y)”(dy)dx = L3Htu(”).
Note that L depends on ÎČ only via the drift term L3 and 1ÎČL â L3 as ÎČ ââ.
52
The following statement, which in the finite dimensional case is known as Varadhanâs formula,exhibits another close relationship between (”t) and the geometry of (P([0, 1]), dW ). The Gaus-sian short time asymptotics of the process (”t)tâ„0 are governed by the L2-Wasserstein distance.
Corollary 7.19. For measurable sets A,B â P0 with positive P0-measure, let dW (A,B) =infdW (Îœ, Îœ) | Îœ â A, Îœ â B and pt(A,B) =
â«A
â«B pt(Îœ, dÎœ)P0(dÎœ) where pt(Îœ, dÎœ) denotes the
transition semigroup for the process (”t)tâ„0.Then
limtâ0
t log pt(A,B) = âdW (A,B)2
2. (7.17)
Proof. This type of result is known as Varadhanâs formula. Its respective form for (E,Dom(E) onL2(P0,P0) holds true by the very general results of [HR03] for conservative symmetric diffusions,and the identification of the intrinsic metric as dW in our previous Theorem.
Due to the sample path continuity of (”t) the Wasserstein diffusion is equivalently characterizedby the following martingale problem. Here we use the notation ăα, ”tă =
⫠10 α(x)”t(dx).
Corollary 7.20. For each α â C2([0, 1]) with αâČ(0) = αâČ(1) = 0 the process
Mt = ăα, ”tă âÎČ
2
â« t
0ăαâČâČ, ”săds
â12
â« t
0
âIâgaps(”s)
[αâČâČ(Iâ) + αâČâČ(I+)
2â αâČ(I+)â αâČ(Iâ)
|I|
]â αâČâČ(0) + αâČâČ(1)
2
ds
is a continuous martingale with quadratic variation process
[M ]t =â« t
0ă(αâČ)2, ”săds.
Remark 7.21. For illustration one may compare corollary 7.20 for (”t) in the case ÎČ = 1 tothe respective martingale problems for four other well-known measure valued process, say onthe real line, namely the so-called super-Brownian motion or Dawson-Watanabe process (”DWt ),the Fleming-Viot process (”FW ), both of which we can consider with the Laplacian as drift, theDobrushin-Doob process (”DDt ) which is the empirical measure of independent Brownian motionswith locally finite Poissonian starting distribution, cf. [AKR98], and finally simply the empiricalmeasure process of a single Brownian motion (”BMt = ÎŽXt). For each i â DW,FV,DD,BMand sufficiently regular α : R â R the process M i
t := ăα, ”ită â 12
â« t0 ăα
âČâČ, ”isăds is a continuousmartingale with quadratic variation process
[MDW ]t =â« t
0ăα2, ”DWs ăds,
[MFV ]t =â« t
0[ăα2, ”FVs ă â (ăα, ”FVs ă)2]ds,
[MDD]t =â« t
0ă(αâČ)2, ”DDs ăds,
[MBM ]t =â« t
0ă(αâČ)2, ”BMs ăds.
In view of corollary 7.19 the apparent similarity of ”DD and ”BM to the Wasserstein diffusion” is no suprise. However, the effective state spaces of ”DD, ”BM and ”t are as much differentas their invariant measures.
53
References
[AKR98] S. Albeverio, Yu. G. Kondratiev, and M. Rockner. Analysis and geometry on con-figuration spaces. J. Funct. Anal., 154(2):444â500, 1998.
[AM06] Helene Airault and Paul Malliavin. Quasi-invariance of Brownian measures on thegroup of circle homeomorphisms and infinite-dimensional Riemannian geometry. J.Funct. Anal. 241 (1): 99-142, 2006.
[AMT04] Helene Airault, Paul Malliavin, and Anton Thalmaier. Canonical Brownian motionon the space of univalent functions and resolution of Beltrami equations by a con-tinuity method along stochastic flows. J. Math. Pures Appl. (9), 83(8):955â1018,2004.
[AR02] Helene Airault and Jiagang Ren. Modulus of continuity of the canonic Brownianmotion âon the group of diffeomorphisms of the circleâ. J. Funct. Anal. 196 (2):395-426, 2002.
[Ber99] Jean Bertoin. Subordinators: examples and applications. In Lectures on probabilitytheory and statistics (Saint-Flour, 1997), volume 1717 of Lecture Notes in Math.,pages 1â91. Springer, Berlin, 1999.
[Bre91] Yann Brenier. Polar factorization and monotone rearrangement of vector-valuedfunctions. Comm. Pure Appl. Math., 44(4):375â417, 1991.
[CEMS01] Dario Cordero-Erausquin, Robert J. McCann, and Michael Schmuckenschlager. ARiemannian interpolation inequality a la Borell, Brascamp and Lieb. Invent. Math.,146(2):219â257, 2001.
[Daw93] Donald A. Dawson. Measure-valued Markov processes. In Ecole dâEte de Probabilitesde Saint-Flour XXIâ1991, volume 1541 of Lecture Notes in Math., pages 1â260.Springer, Berlin, 1993.
[DZ06] Arnaud Debussche and Lorenzo Zambotti. Conservative Stochastic Cahn-Hilliardequation with reflection. 2006. Preprint.
[EY04] Michel Emery and Marc Yor. A parallel between Brownian bridges and gammabridges. Publ. Res. Inst. Math. Sci., 40(3):669â688, 2004.
[Fan02] Shizan Fang. Canonical Brownian motion on the diffeomorphism group of the circle.J. Funct. Anal. 196 (1): 162-179, 2002.
[Fan04] Shizan Fang. Solving stochastic differential equations on Homeo(S1). J. Funct. Anal.,216(1):22â46, 2004.
[FOT94] Masatoshi Fukushima, Yoichi Oshima, and Masayoshi Takeda. Dirichlet forms andsymmetric Markov processes. Walter de Gruyter & Co., Berlin, 1994.
[Han02] Kenji Handa. Quasi-invariance and reversibility in the Fleming-Viot process. Probab.Theory Related Fields, 122(4):545â566, 2002.
[HR03] Masanori Hino and Jose A. Ramırez. Small-time Gaussian behavior of symmetricdiffusion semigroups. Ann. Probab., 31(3):1254â1295, 2003.
[JKO98] Richard Jordan, David Kinderlehrer, and Felix Otto. The variational formulation ofthe Fokker-Planck equation. SIAM J. Math. Anal., 29(1):1â17 (electronic), 1998.
54
[Kin93] J. F. C. Kingman. Poisson processes, volume 3 of Oxford Studies in Probability.The Clarendon Press Oxford University Press, New York, 1993. , Oxford SciencePublications.
[Mal99] Paul Malliavin. The canonic diffusion above the diffeomorphism group of the circle.C. R. Acad. Sci. Paris Ser. I Math., 329(4):325â329, 1999.
[McC97] Robert J. McCann. A convexity principle for interacting gases. Adv. Math.,128(1):153â179, 1997.
[NP92] D. Nualart and E. Pardoux. White noise driven quasilinear SPDEs with reflection.Probab. Theory Related Fields, 93(1):77â89, 1992.
[Ott01] Felix Otto. The geometry of dissipative evolution equations: the porous mediumequation. Comm. Partial Differential Equations, 26(1-2):101â174, 2001.
[OV00] F. Otto and C. Villani. Generalization of an inequality by Talagrand and links withthe logarithmic Sobolev inequality. J. Funct. Anal., 173(2):361â400, 2000.
[RS80] Michael Reed and Barry Simon. Functional Analysis I. Academic Press 1980.
[vRS05] Max-K. von Renesse and Karl-Theodor Sturm. Transport inequalities, gradient es-timates, entropy, and Ricci curvature. Comm. Pure Appl. Math., 58(7):923â940,2005.
[Sch97] Alexander Schied. Geometric aspects of Fleming-Viot and Dawson-Watanabe pro-cesses. Ann. Probab., 25(3):1160â1179, 1997.
[Sta03] Wilhelm Stannat. On transition semigroups of (A,Κ)-superprocesses with immigra-tion. Ann. Probab., 31(3):1377â1412, 2003.
[Stu06] Karl-Theodor Sturm. On the geometry of metric measure spaces. I. Acta Math.,196(1):65â131, 2006.
[TVY01] Natalia Tsilevich, Anatoly Vershik, and Marc Yor. An infinite-dimensional analogueof the Lebesgue measure and distinguished properties of the gamma process. J.Funct. Anal., 185(1):274â296, 2001.
[Vil03] Cedric Villani. Topics in optimal transportation, volume 58 of Graduate Studies inMathematics. American Mathematical Society, Providence, RI, 2003.
55