Weak Convergence Methods for Nonlinear Partial …Weak Convergence Methods for Nonlinear Partial Diﬀerential Equations (PDE 2) Lecture Notes Summer Term 2010 Bernd Schmidt∗ version

Weak Convergence Methods for

Nonlinear Partial Differential

Equations (PDE 2)

Lecture Notes

Summer Term 2010

Bernd Schmidt∗

version as of July 25, 2010

∗ Zentrum Mathematik, Technische Universitat Munchen, Boltzmannstr. 3, 85747Garching, [email protected]

1

Contents

Contents 2

1 Introduction 3

2 Convergence properties of nonlinear functionals 52.1 Weak convergence in normed spaces . . . . . . . . . . . . . . . . . 52.2 Weak convergence in Lp spaces . . . . . . . . . . . . . . . . . . . 72.3 Distributionen . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 122.4 Negative Sobolev-Spaces . . . . . . . . . . . . . . . . . . . . . . . 172.5 A-quasiconvexity . . . . . . . . . . . . . . . . . . . . . . . . . . . 212.6 The Legendre-Hadamard condition . . . . . . . . . . . . . . . . . 272.7 Temperierte Distributionen . . . . . . . . . . . . . . . . . . . . . . 352.8 Sobolevraume und Fouriertransformation . . . . . . . . . . . . . . 412.9 Compensated compactness . . . . . . . . . . . . . . . . . . . . . . 442.10 Young measures . . . . . . . . . . . . . . . . . . . . . . . . . . . . 50

3 Selected Applications 543.1 Conservation laws . . . . . . . . . . . . . . . . . . . . . . . . . . . 543.2 A Convergence result for quasilinear elliptic systems . . . . . . . . 713.3 Homogenization of elliptic PDE . . . . . . . . . . . . . . . . . . . 74

4 Variationsmethoden fur vektorwertige Probleme 814.1 Euler-Lagrange-Gleichung . . . . . . . . . . . . . . . . . . . . . . 824.2 Die direkte Methode . . . . . . . . . . . . . . . . . . . . . . . . . 844.3 Polykonvexitat . . . . . . . . . . . . . . . . . . . . . . . . . . . . 884.4 Quasikonvexitat . . . . . . . . . . . . . . . . . . . . . . . . . . . . 95

5 Bonus Tracks 1025.1 Relaxation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1025.2 Young-Maße . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1085.3 Mikrostrukturen und Laminate . . . . . . . . . . . . . . . . . . . 119

Bibliography 122

2

Chapter 1

Introduction

Writing a general partial differential equation (PDE for short) as

A(u) = f, (1.1)

where A is a (nonlinear) partial differential operator (PDO), one is often inter-ested in solving a appropriate approximate problem

Aε(uε) = fε, (1.2)

This way one hopes, e.g., if it is too hard to show that (1.1) does have solutions,to first solve the easier problem (1.2) and then find a solution of (1.1) in thelimit εց 0. Another motivation is dictated by numerics: In order to solve (1.1)numerically, one needs to discretize the equation, thus arriving at an approximateproblem of the form (1.2). But also the reverse point of view is interesting inapplications. Many (physical) systems involve some small parameter (or scale)ε. Then (1.2) desribes a complicated system which we would like to approximateby a simpler equation of the form (1.1) for small ε.

In any case, the main task is to show that solutions uε of (1.2) do convergeto solutions u of (1.1), possibly up to extracting subsequences. Now typicallyone does not have much knowledge about the sequence (uε). In particular, if kis a PDO of order k, there is no hope that (uε) converges strongly in Ck or W k,p

at all! However, suitable a priori estimates may guarantee at least that (uε) isbounded and so—up to subsequences—weakly convergent to some function u.

Now the notion of weak convergence is taylored so as to give convergentquantityies under linear operations. Ususally, if (uε) converges to u weakly (writeuε u) and A is nonlinear, one cannot deduce that A(uε) A(u), let aloneAε(uε) A(u). In order to succeed we will therefore have to use the onlyavailable piece of information, namely, that uε solves the PDE (1.2), in a crucialway. The core theme of this course will be how this fact ‘compensates’ for thelack of strong compactness of (uε).

In this sense, we are basically investigating weak continuity properties ofnonlinear operations on function spaces. A strongly related question arises in

3

the calculus of variations. For nonlinear functionals u 7→ F (u), a fundamentalquestion is if F is lower semicontinuous. If so and if in addition certain coercivityassumptions are satisfied, then the direct method in the calculus of variationsyields minimizers of F , which under suitable conditions solve the correspondingEuler-Lagrange equation. Indeed, as we will see, there are many interestingsituations where F is not weakly continuous but still weakly lower semicontinuous.

The largest part of these lecture notes is devoted to a thorough investigationof weak convergence properties of nonlinear functionals, see Chapter 2. We usethe opportunity to introduce all the supplementary material which is neededcarefully. Most notably, Chapter 2 contains an introduction to the theory ofdistributions, negative Sobolev spaces and Young measures. These sections areput in between the other sections, so as to introduce new methods directly whenthey are needed to advance our development of the theory.

Chapter 3 describes three applications of our theoretic results in Chapter2. We discuss modern developments in the theory of hyperbolic conservationlaws at some length. This is then complemented by convergence studies andhomogenization results for elliptic equations.

The last Chapters 4 and 5 finally discuss the important ‘variational case’ indetail, thus giving an introduction to the vector valued calculus of variations.(Note that Chapter 5 contains extra material not covered in class).

References The basic references for this course are [Dac 82], [Ta 79] (for Chap-ters 2 and 3), [Ev 98] and [Ev 90] (for Chapter 3) and [Mu 98], [Dac 08] and[Sch 09] (for Chapters 4 and 5). These last chapters are in fact a slightly modifedversion of Chapter 5 in [Sch 09].

Acknowledgement I am grateful to Eva-Maria Piel for preparing a large partof the first LATEX-version of these notes.

4

Chapter 2

Convergence properties ofnonlinear functionals

2.1 Weak convergence in normed spaces

Definition 2.1 Let X be a normed space with dual X ′.

(a) A sequence (xk) ⊂ X is said to converge weakly to x ∈ X (notation: xk x) if

ϕ(xk) → ϕ(x) ∀ϕ ∈ X ′.

(b) A sequence (ϕk) ⊂ X ′ is said to converge weakly* to ϕ ∈ X ′ (notation:

ϕk∗ ϕ) if

ϕk(x) → ϕ(x) ∀x ∈ X.

Note:

• xn → x =⇒ xn x

• ϕk → ϕ =⇒ ϕk∗ ϕ

But the converse is false.Most important are the following compactness properties.

Theorem 2.2 If (xn) is a bounded sequence in a reflexive Banach space, thenthere exists a weakly convergent subsequence of (xn).

Theorem 2.3 (Alaoglu) If (ϕn) ⊂ X ′ is bounded in the dual X ′ of X and X isa separable Banach space, then there exists a weakly* convergent subsequence.

By the Hahn-Banach theorem we have:

Theorem 2.4 Let X be a normed space and xn x. If V ⊂ X is closed andconvex with xn ∈ V then also x ∈ V .

5

Corollary 2.5 Let X be a normed space, xn x. Then there exists a sequenceof convex combinations

yn =

N(n)∑

i=n

λ(n)i xi, λ

(n)i ≥ 0,

N(n)∑

i=n

λ(n)i = 1

such that ‖yn − x‖ → 0.

Proof. Let Vn = co(xn, xn+1, ...). As x ∈ Vn there exists λnk , ..., λ

nN(n),

∑λn

i = 1such that ∥∥∥∥∥∥

N(n)∑

i=n

λni − x

∥∥∥∥∥∥≤ 1

n.

Corollary 2.6 The norm ‖ · ‖X on X (resp. ‖ · ‖X′ on X ′) is weakly (resp.weakly*) lower semicontinuous.

Proof. Suppose xn x inX. Let R = lim inf ‖xn‖. Then for ε > 0 ∃ subsequencesuch that ‖xnk

‖ ≤ R + ε and thus x ∈ BR+ε. As ε was arbitrary,

‖x‖ ≤ R = lim infn→∞

‖xn‖.

If ϕn∗ ϕ in X ′, then lim inf ‖ϕn‖ ≥ lim inf ϕn(x) = ϕ(x) for any x ∈ X with

‖x‖ = 1. Passing to the supremum over those x yields lim inf ‖ϕn‖ ≥ ‖ϕ‖.

Weakly(*) convergent sequences are necessarily bounded:

Theorem 2.7 Suppose xn

(∗) x. Then (xn) is bounded in X (resp. X ′).

In order to check if a sequence (xn) converges weakly/weakly*, it is often con-venient to consider particularly simple objects (test functions) in X ′ resp. X onwhich to test (xn). This is often made possible by the following result.

Theorem 2.8 A sequence (xn) converges weakly to x if and only if

• (xn) is bounded and

• ϕ(xn) → ϕ(x) ∀ϕ ∈ A where spanA = X ′.

An analogous statement holds for weak*-convergence.

6

Proof. ”⇒”: clear by the preceding theorem.”⇐”: By linearity ϕ(xn) → ϕ(x) ∀ϕ ∈ spanA. For ψ ∈ X ′, ε > 0, chooseϕ ∈ spanA such that ‖ϕ− ψ‖X′ < ε. Then

lim supn→∞

|ψ(xn) − ψ(x)|

≤ lim sup(|ψ(xn) − ϕ(xn)| + |ϕ(xn) − ϕ(x)| + |ϕ(x) − ψ(x)|)≤ ‖ψ − ϕ‖X′︸︷︷︸

≤ε

‖xn‖X︸︷︷︸bd.

+ ‖ψ − ϕ‖X′︸︷︷︸≤ε

‖x‖X

≤ Cε.

The analogous statement for weak*-convergence is proved similarly.

2.2 Weak convergence in Lp spaces

We consider Lp = Lp(Ω), Ω ⊂ Rn open, with Lebesgue-measure. Then

1 ≤ p <∞ : fk f iff

∫

Ω

fkg →∫

Ω

fg ∀g ∈ Lq,1

p+

1

q= 1,

p = ∞ : fk∗ f iff

∫

Ω

fkg →∫

Ω

fg ∀g ∈ L1.

Notation: In the sequel we will sometimes just write “fk

(∗) f” meaning “fk f”

for p <∞ and “fk∗ f” for p = ∞.

As span χA : A ⊂ Ω measurable is dense in Lq for all q ∈ [1,∞], by Theorem

2.8 we deduce that for a bounded sequence (fk) we have fk

(∗) f iff

∫

A

fk →∫

A

f ∀A ⊂ Ω measurable,

i.e. if “local averages” of the fk converge.In fact, it will be sufficient to choose an even smaller class of test functions:

the characteristic functions of hypercubes.

Lemma 2.9 Let (fk) ⊂ Lp(Ω) be a bounded sequence. If 1 < p ≤ ∞, then

fk

(∗) f iff ∫

A

fk →∫

A

f ∀A = (0, a)n + b ⊂ Ω, a, b ∈ Rn.

If p = 1, then fk f iff (fk) is equiintegrable and

∫

A

fk →∫

A

f ∀A = (0, a)n + b ⊂ Ω, a, b ∈ Rn.

7

Proof. The detailed proof is left as an excercise. For p > 1 it suffices to showthat span χA : A = (0, a)n + b ⊂ Ω measurable is dense in Lq for 1 ≤ q < ∞,which follows from the fact that C0(Ω) is dense in Lq. For p = 1 one inducesfrom the Dunford-Pettis theorem (Theorem 2.14) that (fk) is relatively weaklysequentially compact if and only if (fk) is equiintegrable. It then only remains toidentify weak limit points uniquely.

One of our main tasks is to investigate nonlinear operations on weakly con-vergent sequences.

Example: Suppose fn f in Lp(0, 1). Let ψ : R → R. Question: When can weguarantee ψ(fn) ψ(f)? Surely, if ψ is affine. But indeed the converse is true,too:

Proposition 2.10 If ψ(fn) ψ(f) for any sequence (fn) with fn f , then ψis affine.

Before we prove this, we consider one of the most important examples ofweakly but not strongly convergent sequences.

Lemma 2.11 Suppose D = (0, 1)n (or a general hypercube in Rn). Let f ∈Lp(D) and extend f : Rn → R periodically. Let fk ∈ Lp(Ω), Ω ⊂ Rn bounded andopen,

fk(x) := f(kx) “a highly oscillatory function”.

Then

fk

(∗)

1

|D|

∫

D

f(x) dx = −∫

D

f(x) dx on Lp(Ω).

(Here the right hand side is the constant function taking the value −∫

Df , i.e., the

mean value of f over the periodic unit cell.)

Proof. Choose k0 ∈ N such that k0D ⊃ Ω. If p = ∞, then clearly (fk) is bounded.For p <∞ this follows from

‖fk‖pLp =

∫

Ω

|f(kx)|pdx = k−n

∫

kΩ

|f(y)|pdy

≤ k−n

∫

kk0D

|f(y)|pdy = k−nknkn0

∫

D

|f(y)|pdy = ‖f‖pLpk

n0 .

Similarly for any subhypercube Q ⊂ Ω:∫

Q

fk = k−n

∫

kQ

f(y).

Now consider a partitioning of Rn by translates of D. For large k, the numberof translates of D completely contained in kQ is kn |Q|

|D| +O(kn−1), the number of

translates of D hitting ∂Ω is O(kn−1). It follows that∫

Q

fk =|Q||D|

∫

D

f +O(k−1).

8

(Note that on any cube∫

D|f | ≤ c‖f‖Lp is bounded.)

As∫

Q( 1|D|∫

Df)dx = |Q|

|D|∫

Df , this shows

fk 1

|D|

∫

D

f

by Lemma 2.9. (Note that for p = 1,∫|fk|≥M |fk| =

∫χ|fk|≥M|fk| can be

estimated from above by kn0

∫Dχ|f |≥M|f | simlarly as before. Since this converges

to 0 for M ր ∞, (fk) is indeed equiintegrable, see the remark after Definition2.13.)

The above proposition is now easily proved: For a, b ∈ R, λ ∈ [0, 1] let

f =

a on (0, λ),

b on (λ, 1),

extended periodically.Then fk, fk(x) = f(kx), converges weakly to f =

∫ 1

0f = λa + (1 − λ)b.

Similarly ψ(fk) λψ(a) + (1 − λ)ψ(b). So if ψ(fk) ψ(f), then

ψ(λa+ (1 − λ)b) = λψ(a) + (1 − λ)ψ(b)

and ψ is affine.

In the Calculus of Variations it is of particular interest to consider functionalsof the form

F (u) =

∫

Ω

ψ(u(x)) dx.

The previous result shows, however, that F cannot be expected to be weaklycontinuous in any interesting situation. But the direct method in the calculusof variations (to be introduced later, also cf. [Sch 10]), which leads to existenceresults for minimizers, however, will still work under weaker conditions on F : Oneneeds a good criterion that guarantees that F be at least lower semicontinuous.

Theorem 2.12 Suppose Ω ⊂ Rn bounded and open, f : Rm → R continuous.Let

F (u) =

∫

Ω

f(u(x)) dx for u ∈ L∞(Ω; Rm).

Then

(i) F is sequentially continuous w.r.t. weak*-convergence iff f is affine.

(ii) F is seq. lower semicontinuous w.r.t. weak*-convergence iff f is convex.

9

Proof. Clearly (i) =⇒ (ii), so we only prove (ii). ”⇒”: Suppose

lim infn→∞

F (un) ≥ F (u) ∀(un) s.t. un∗ u.

In particular, for a, b ∈ Rm, λ ∈ [0, 1], let f : [0, 1]n → R

u(x) =

a, x1 ≤ λ,

b, x1 > λ,

extended to Rn by periodicity and define uk(x) = u(kx)χΩ(x).

As uk∗ u =

∫[0,1]n

u = λa + (1 − λ)b in L∞(Rn), also u|Ω ∗ u in L∞(Ω).

Similarly, f(un)∗ λf(a) + (1 − λ)f(b) and consequently

F (un) =

∫

Ω

f(un)n→∞→

∫

Ω

λf(a) + (1 − λ)f(b) = |Ω|[λf(a) + (1 − λ)f(b)].

But

F (u) =

∫f(u) = |Ω|f(λa+ (1 − λ)b),

and so f is convex.”⇐:” As f is convex there are affine functions l1, l2, ... such that f = supi li.

For fixed n let

Ej := x ∈ Rm : lj(x) > li(x) for 1 ≤ i < j and lj(x) ≥ li(x) for j < i ≤ n ,

so that E1∪ · · · ∪En = Rm and max1≤i≤n li(x) = lj(x) on Ej. Suppose uk∗ u.

Then also χAuk∗ χAu for all measurable A. But then

lim infk→∞

F (uk) = lim infk→∞

∫

Ω

f(uk)

≥ lim infk→∞

n∑

j=1

∫

u−1(Ej)

lj(uk)

If lj(x) = ajx+ bj , this reads

lim inf F (uk) ≥n∑

j=1

lim inf

∫ajχu−1(Ej)uk + χu−1(Ej)bj

=n∑

j=1

∫

u−1(Ej)

lj(u)

=

n∑

j=1

∫

u−1(Ej)

max1≤i≤n

li(u)

=

∫

Ω

max1≤i≤n

li(u).

10

Finally sending n→ ∞, by monotone convergence we find

lim infk→∞

F (uk) ≥∫

Ω

f(u) = F (u).

As L1 is not reflexive, bounded sequences need not admit weakly convergentsubsequences. This can be seen already in the following simple one-dimensionalsituation.

Example: The sequence nχ(0, 1n

) ⊂ L1 is bounded, but does not admit a convergent

subsequence. (If so, then nχn 0, but∫nχn ≡ 1 6→ 0.)

This lack of compactness can be cured by embedding L1 in a larger space ofmeasures: Naturally L1(Ω) ⊂ M(Ω), the set of finite (signed) Radon measures.By the Riesz theorem, this space can be identified with the dual of C0(Ω) :=

Cc(Ω)‖.‖∞

:C0(Ω)′ = M(Ω).

So naturally we have the notion of weak*-convergence on M(Ω):

µk∗ µ M(Ω) :⇔

∫fdµk →

∫fdµ ∀f ∈ C0(Ω).

Since ‖f‖L1 = ‖fdx‖M, every L1-bounded sequence has a weak*-convergentsubsequence in M.Note: If Ω is compact, then C(Ω) = C0(Ω) = Cc(Ω).

Example: uχ[0, 1n

]∗ δ0 in M(−1, 1).

For easy reference we finally recall the definition and some characterizationsof equiintegrability from general measure theory.

Definition 2.13 A family/set F ⊂ L1(Ω, µ) in a measure space (Ω, µ) is calledequintegrable if

(i) ∀ε > 0 ∃A measurable with µ(A) <∞ such that

∫

Ω\A|f |dµ < ε ∀ f ∈ F .

(ii) ∀ε > 0 ∃ δ > 0 such that ∀E measurable with

µ(E) < δ =⇒∫

E

|f |dµ < ε ∀ f ∈ F .

11

Note that if µ(Ω) <∞, then (i) is trivial and one has:

F is equintegrable

⇔ limMր∞

supf∈F

∫

|f |≥M|f | dµ

⇔∃ monotone ϕ : [0,∞) → [0,∞] with limt→∞

ϕ(t)

t= ∞ and C > 0 such that

∫

Ω

ϕ(|f |) ≤ C ∀ f ∈ F .

We finally mention (without proof) the Dunford-Pettis theorem.

Theorem 2.14 F ⊂ L1(Ω, µ) relatively weakly sequentially compact if and onlyif F is equiintegrable.

2.3 Distributionen

Distributionen sind ‘verallgemeinerte Funktionen’. Wahrend wir bisher Funk-tionen und ihre Ableitungen untersucht haben, werden wir den Gegenstand un-serer Untersuchungen nun wesentlich verallgemeinern. Schon in der Theorie derSobolevraume (vgl. [Sch 10]) haben wir gesehen, dass es von großem Nutzen seinkann, auch nicht-glatte Funktionen in einem verallgemeinerten Sinne zu differen-zieren. So ist etwa im schwachen Sinne f : R → R, f(x) = |x|, differenzierbarmit

f ′(x) :=

−1, x < 0,

1, x > 0.

f ′′ ist nun jedoch noch nicht einmal im schwachen Sinne mehr definiert: Es kannkeine L1

loc-Funktion g geben, so dass f ′′ = g ist, denn g musste gleich 0 auf(−∞, 0) und auf (0,∞) sein, somit g = 0 fast uberall. f ′ ist aber nicht konstant.

Um auch solche Funktionen noch differenzieren zu konnen, mussen wir dieKlasse der Funktionen geeignet verallgemeinern: Wir werden die Menge der Dis-tributionen D einfuhren, deren Elemente wir als ‘verallgemeinerte Funktionen’verstehen. Es wird sich herausstellen, dass f ′′ tatsachlich sinnvoll zu definierenist, allerdings nicht als Funktion auf R.

Die wohl wichtigste Eigenschaft einer Distribution ist, dass sie unendlich oftdifferenzierbar ist. Aber auch andere auf Funktionen definierte Operationenhaben eine naturliche Entsprechung auf den verallgemeinerten Funktionen, diewir im Folgenden untersuchen werden.

Ausgangspunkt fur die Definition einer verallgemeinerten Funktion auf Ω ⊂Rn (offen) ist die Beobachtung, dass f ∈ L1

loc(Ω) durch die Werte∫

Ω

fϕ, ϕ ∈ D(Ω) := C∞c (Ω)

12

eindeutig festgelegt wird. (In der Distributionentheorie wird der Raum der Test-funktionen C∞

c (Ω) meist mit D(Ω) bezeichnet.)Beachte, dass ϕ 7→

∫fϕ eine lineare Abbildung von D(Ω) in den Skalarenkorper

K (K = R oder C) ist. Wir definieren nun die Menge der Distributionen als dieMenge der linearen Abbildungen, die einer (sehr milden) Stetigkeitsbedingunggenugen.

Definition 2.15 Sei Ω ⊂ Rn offen. Eine Distribution auf Ω ist eine lineareAbbildung T : D(Ω) → K, so dass gilt: Fur jede kompakte Teilmenge K von Ωexistieren CK > 0 und NK ∈ N0, so dass

|Tϕ| ≤ CK

∑

|α|≤NK

‖∂αϕ‖L∞(K) ∀ϕ ∈ D(Ω) mit suppϕ ⊂ K

gilt. (Gibt es ein kleinstes NK, welches fur alle Kompakta in Ω funktioniert, soheißt NK die Ordnung von T .) Die Menge der Distributionen auf Ω wird mitD′(Ω) bezeichnet.

Beispiele:

1. Jede L1loc-Funktion f induziert eine Distribution Tf gemaß Tfϕ :=

∫Ωfϕ.

Die Linearitat dieser Abbildung ist klar. Außerdem gilt

|Tϕ| ≤ ‖f‖L1(K)‖ϕ‖L∞(K)

fur alle ϕ ∈ D(Ω) mit suppϕ ⊂ K. Insbesondere ist Tf von nullter Ord-nung. Wir werden in Zukunft einfach f statt Tf schreiben.

2. Jedes Borelmaß µ mit |µ|(K) < ∞ fur kompakte K ⊂ Ω ist eine Distribu-tion nullter Ordnung gemaß ϕ 7→

∫ϕdµ, denn

|Tϕ| ≤ |µ|(K)‖ϕ‖L∞(K).

(Dies verallgemeinert 1.)

3. Ist x ∈ Ω, so definiert T : D(Ω) → K, Tϕ := ϕ(x) eine Distribution.Dies ist in der Tat gerade T = δx, wobei δx das Diracmaß im Punkte xbezeichnet:

δx(A) =

1, x ∈ A,

0, x /∈ A.

Nach 2. ist δx ∈ D′(Ω). Speziell fur x = 0 schreibt man auch oft einfach δstatt δ0.

4. Ist Ω = (0, 1) ⊂ R, T : D(Ω) → K definiert durch Tϕ =∑∞

k=2dkϕ

dxk ( 1k), so

ist T ∈ D′(Ω). T ist jedoch nicht von endlicher Ordnung.

13

Definition 2.16 Es seien ϕ, ϕ1, ϕ2, . . . ∈ D(Ω). Wir sagen (ϕk) konvergiert in D(Ω)gegen ϕ, wenn es ein Kompaktum K ⊂ Ω gibt, so dass suppϕk ⊂ K gilt fur allek ∈ N und ∂αϕk → ∂αϕ gleichmaßig auf Ω konvergiert fur jeden Multiindex α.

Beachte: Dies definiert eine außerst starke Konvergenz auf D. Eine Folgekonvergiert nur dann, wenn es ein Kompaktum gibt, außerhalb dessen alle Funk-tionen verschwinden, und wenn alle Ableitungen gleichmaßig konvergieren.

Theorem 2.17 Eine lineare Abbildung T : D(Ω) → K ist genau dann eine Dis-tribution, wenn gilt

ϕk → ϕ in D(Ω) =⇒ Tϕk → Tϕ in K.

Da die Konvergenz in D(Ω) sehr stark ist, zeigt dieser Satz, dass die Stetigkeits-bedingung fur Distributionen eine sehr schwache Bedingung ist.

Proof. Sei T ∈ D′(Ω), ϕk → ϕ in D(Ω). Nach Definition existiert ein KompaktumK ⊂ Ω, so dass suppϕk ⊂ K ist fur alle k und ∂αϕk → ∂αϕ gleichmaßig auf Ωkonvergiert fur jedes α. Dann aber ist auch suppϕ ⊂ K und

|Tϕk − Tϕ| ≤ CK

∑

|α|≤NK

‖∂α(ϕk − ϕ)‖L∞(K) → 0.

Ist nun umgekehrt T /∈ D′(Ω), so gibt es ein Kompaktum K ⊂ Ω, so dass zujedem k ∈ N eine Testfunktion ϕk ∈ D(Ω) mit suppϕk ⊂ K und

|Tϕk| ≥ k∑

|α|≤k

‖∂αϕk‖L∞

existiert. Nach Multiplikation mit einem geeignetem Skalar konnen wir o.B.d.A.|Tϕk| = 1 fur alle k annehmen. Dann aber folgt ∂αϕk → 0 gleichmaßig fur jedesα und damit ϕk → 0 in D(Ω). Wegen |Tϕk| = 1 fur alle k gilt jedoch nichtTϕk → 0 = T0.

Definition 2.18 Wir sagen eine Folge von Distributionen Tn konvergiert in D′(Ω)gegen eine Distribution T , wenn Tnϕ→ Tϕ in K konvergiert fur alle ϕ ∈ D(Ω).

Beispiel: Ist ηε, ε > 0, der skalierte Standardglattungskern, so gilt ηε → δ inD′(Rn) mit ε → 0. (Beachte ηε(ϕ) =

∫ηε(x)ϕ(x) dx = ηε ∗ ϕ(0) → ϕ(0) fur

ϕ ∈ D(Rn).)

Um die Definition der Ableitung einer Distribution zu motivieren, uberlegenwir zunachst, wie die Ableitung ∂α einer (schwach) differenzierbaren Funktionf : Ω → K als Distribution wirkt: Fur alle ϕ ∈ D(Ω) ist

∫

Ω

∂αf ϕ = (−1)|α|∫

Ω

f ∂αϕ.

Wir definieren daher:

14

Definition 2.19 Ist T ∈ D′(Ω), α ein Multiindex, so wird durch

∂αT (ϕ) := (−1)|α|T (∂αϕ) ∀ϕ ∈ D(Ω)

eine Distribution ∂αT definiert.

Beachte, dass ϕk → ϕ in D(Ω) impliziert ∂αϕk → ∂αϕ in D(Ω), so dass dieAbleitung ∂αT tatsachlich wohldefiniert ist. Ist u ∈ Cm(Ω) oder u ∈ Wm,1

loc (Ω),|α| = m, so ist die distributionelle Ableitung offenbar gerade die klassische bzw.schwache Ableitung von u.

Beispiele:

1. Die Heavisidefunktion H : R → R,

H(x) =

1, x > 0,

0, x < 0,

ist lokal integrierbar und also eine Distribution auf R. Fur Testfunktionenϕ ist

H ′(ϕ) = −∫

R

H(x)ϕ′(x) dx = −∫ ∞

0

ϕ′(x) dx = ϕ(0) = δ(ϕ).

Dies zeigt H ′ = δ.

2. Die Ableitungen der Deltadistribution sind gerade die Auswertungsfunk-tionale der Ableitungen: Fur Testfunktionen ϕ ist

∂αδ(ϕ) = (−1)|α|δ(∂αϕ) = (−1)|α|∂αϕ(0).

3. Sei T = log | · |. Dann ist T ∈ L1loc(R) und also eine Distribution. Der

kanonische Kandidat fur die Ableitung ist x 7→ 1x. Dies ist jedoch nicht lokal

integrierbar und somit nicht offensichtlich als Distribution zu interpretieren.Andererseits muss die distributionelle Ableitung ja existieren. Was also istT ′?

Sei ϕ ∈ D(R). Partielle Integration liefert

T ′(ϕ) = −T (ϕ′) = −∫

R

log |x|ϕ′(x) dx

= −∫

|x|≤εlog |x|ϕ′(x) dx+ log ε ϕ(ε) − log ε ϕ(−ε) +

∫

|x|>ε

ϕ(x)

xdx

fur ε > 0. Nun ist∫|x|≤ε log |x|ϕ′(x) dx → 0 wegen log | · | ∈ L1

loc und

log ε (ϕ(ε)−ϕ(−ε)) = 2ε log ε ϕ(ε)−ϕ(−ε)2ε

→ 0 ·ϕ′(0) = 0 fur ε→ 0. Es folgt

T ′(ϕ) = limεց0

∫

|x|>ε

ϕ(x)

xdx.

15

Man schreibt T ′ = HW 1x

und nennt HW 1x

den Hauptwert1 von 1x.

Die Menge der Distributionen ist zwar offensichtlich ein Vektorraum, das Pro-dukt zweier Distributionen kann jedoch im Allgemeinen nicht sinnvoll definiertwerden. Die Multiplikation einer Distribution mit einer glatten Funktion ist abermoglich. Zur Motivation betrachten wir wieder f ∈ L1

loc(Ω). Ist ψ ∈ C∞(Ω), sogilt ∫

Ω

(ψf)ϕ =

∫

Ω

f(ψϕ) ∀ϕ ∈ D(Ω).

Wir definieren daher

Definition 2.20 Ist ψ ∈ C∞(Ω), T ∈ D′(Ω), so wird durch

(ψT )(ϕ) = T (ψϕ) ∀ϕ ∈ D(Ω)

eine Distribution ψT ∈ D′(Ω) definiert.

Beachte, dass dies wohldefiniert ist: Mit der Leibniz-Regel sieht man, dassϕk → ϕ in D(Ω) ψϕk → ψϕ in D(Ω) impliziert.

Lemma 2.21 (Produktregel) Ist ψ ∈ C∞(Ω), T ∈ D′(Ω), so ist

∂i(ψT ) = (∂iψ)T + ψ(∂iT ).

Proof. Fur ϕ ∈ D(Ω) ist

∂i(ψT )(ϕ) = −ψT (∂iϕ) = −T (ψ ∂iϕ) = −T (∂i(ψϕ) − (∂iψ)ϕ)

= (∂iT )(ψϕ) + T ((∂iψ)ϕ) = ψ∂iT (ϕ) + (∂iψ)T (ϕ).

Nach diesem Schema kann man auch weitere Operationen auf (geeigneten)Distributionen definieren wie etwa Faltung, Spiegelung und Fouriertransforma-tion. Motiviert durch die auf gewohnlichen Funktionen bekannten Eigenschaftendefiniert man diese Operationen ‘durch Dualitat’, indem man sie durch ihr Wirkenauf Testfunktionen beschreibt. Die Beweise von Aussagen uber Distributionenbenutzen dann typischer Weise gerade die entsprechenden (schon bekannten) Aus-sagen uber Testfunktionen.

Wir werden die Theorie erst bei Bedarf weiter ausbauen, untersuchen hier abernoch den Zusammenhang von distributionellen und klassischen Ableitungen.

Theorem 2.22 Es seien u, ∂αu ∈ C(Ω) fur alle |α| ≤ k, wobei ∂αu die distribu-tionelle Ableitung bezeichnet. Dann ist u ∈ Ck(Ω).

1oder nauch CH 1

xfur Cauchy-Hauptwert, englisch: PV 1

xder principal value of 1

x

16

Proof. O.B.d.A. ist |α| = 1, etwa α = ej , ej der j-te Einheitsvektor. Derallgemeine Fall ergibt sich hieraus durch Induktion. Zu x0 ∈ Ω beliebig betrachtedas Segment K = [x0 − t0ej , x0 + t0ej ], wobei t0 > 0 so klein sei, dass K ⊂ Ω ist.Sei ηε der skalierte Standardglattungskern. Dann gilt fur uε := ηε ∗ u auf einerUmgebung von K fur hinreichend kleine ε

∂juε(x) = (∂jηε) ∗ u(x) =

∫(∂jηε)(x− y)u(y) dy

= −∫

∂ηε

∂yj

(x− y)u(y) dy =

∫ηε(x− y)∂ju(y) dy,

denn ηε(x− ·) ∈ D(Ω). Dies zeigt ∂juε = ηε ∗ ∂ju in einer Umgebung von K undsomit

uε(x0 + tej) = uε(x0) +

∫ t

0

(ηε ∗ ∂ju)(x0 + sej) ds

fur |t| < t0. Da uε → u und ηε ∗ ∂ju → ∂ju gleichmaßig auf K konvergieren furε→ 0, folgt

u(x0 + tej) = u(x0) +

∫ t

0

∂ju(x0 + sej) ds.

Daraus folgt nun die Behauptung.

2.4 Negative Sobolev-Spaces

Since for u ∈W k,p(Ω) every ∂αu, |α| ≤ k, is an Lp(Ω)-function, W k,p(Ω) may beviewed as a subspace of Lp over a disjoint union of copies of Ω: For |α| ≤ k let

Ωα ⊂ Rn × γ ∈ Nn0 : |γ| ≤ k ,

Ωα = (x, α) : x ∈ Ω ,

so that Ωα ∩ Ωβ = ∅ for α 6= β, and define

Ω(k) =⋃

α

Ωα

equipped with natural σ-algebra, topology and measure:

|A| =∑

α

| x ∈ Ω : (x, α) ∈ Ωα |

The integral over a function f : Ω(k) → R is

∫

Ω(k)

f(z) dz =∑

α

∫

Ω

f(x, α) dx

17

and Lp(Ω(k)) consists of functions u such that

‖u‖Lp(Ω(k)) :=

(∑

α

∫

Ω

|f(x, α)|p) 1

p

<∞,

i.e. u(·, α) ∈ Lp(Ωα) for all α with |α| ≤ k.The mapping Ψ : W k,p → Lp(Ω(k)), defined by

(Ψu)(x, α) = ∂αu(x)

is an isometry, and in particular Ψ(W k,p(Ω)) is a closed subspace of Lp(Ω(k)).This implies:

Theorem 2.23 W k,p(Ω) is separable if 1 ≤ p <∞ and reflexive if 1 < p <∞.

Proof. This follows immediately from the fact that subsets of separable spaces areagain separable and the fact that closed subspaces of reflexive spaces are reflexivethemselves.

As Lp(Ω(k)), p <∞, is an Lp-space over the σ-finite set Ω(k), its dual is givenby (Lp(Ω(k)))′ ∼= Lq(Ω(k)), 1

p+ 1

q= 1, in the usual way:

For ϕ ∈ (Lp(Ω(k)))′ there exists v ∈ (Lq(Ω(k))) such that

ϕ(u) =

∫

Ω(k)

u(z)v(z) dz =∑

|α|≤k

∫

Ω

u(x, α)v(x, α) dx.

We can now characterize functionals on W k,p in the following way:

Theorem 2.24 Let 1 ≤ p < ∞. For every l ∈ (W k,p(Ω))′ there exists v ∈Lp(Ω(k)) such that

l(u) =∑

|α|≤k

∫

Ω

v(x, α)∂αu(x) dx. (2.1)

Moreover,

‖l‖(W k,p)′ = inf‖v‖Lp(Ω(k)) : v ∈ Lp(Ω(k)) satisfies (2.1)

= min‖v‖Lp(Ω(k)) : v ∈ Lp(Ω(k)) satisfies (2.1)

.

Proof. If l ∈ (W k,p(Ω))′, then l Ψ−1 is a continuous linear functional onΨ(W k,p(Ω)) ⊂ Lp(Ω(k)). By the Hahn-Banach Theorem there exists a norm

18

preserving extension L of l Ψ−1, represented by some v ∈ Lq(Ω(k)), 1p

+ 1q

= 1.Thus,

l(u) = (l Ψ−1)(Ψu) =∑

α

∫

Ω

v(x, α)(Ψu)(x, α) dx

=∑

α

∫

Ω

v(x, α)∂αu(x) dx.

Also by choice of L and Ψ being isometric

‖l‖(W k,p)′ = ‖l Ψ−1‖(Ψ(W k,p(Ω)))′ = ‖L‖(Lp(Ω(k)))′ = ‖v‖Lq(Ω(k)).

If w is any element of Lq(Ω(k)) satisfying (2.1) then w, viewed as an element of(Lp(Ω(k)))′, is an extension of l Ψ−1, and so

‖w‖Lq(Ω(k)) ≥ ‖l Ψ−1‖(Ψ(W k,p))′ = ‖l‖(W k,p)′ .

Remark 2.25 This justifies saying uk u in W k,p, 1 ≤ p <∞, iff ∂αuk ∂αuin Lp for all α with |α| ≤ k.

The elements of (W k,p(Ω(k)))′ are extensions of distributions:If v satisfies (2.1), let

T =∑

|α|≤k

(−1)α∂αTvα, Tvα

(ϕ) =

∫vαϕ. (2.2)

Then

Tϕ =∑

|α|≤k

(−1)α∂αTvα(ϕ) =

∑

|α|≤k

∫vα∂

αϕ

= l(ϕ).

Conversely, any distribution of the form (2.2) (with vα ∈ Lq for all α) extendsto a continuous linear functional on W k,p(Ω), even uniquely to a continuous linearfunctional on W k,p

0 (Ω). This follows from the fact that T is continuous withrespect to ‖ · ‖W k,p:

|Tϕ| =

∣∣∣∣∣∣∑

|α|≤k

∫vα∂

αϕ

∣∣∣∣∣∣=

∣∣∣∣∫

Ω(k)

vΨϕdz

∣∣∣∣

≤ ‖v‖Lq(Ω(k))‖Ψϕ‖Lp(Ω(k))

= ‖v‖Lq(Ω(k))‖ϕ‖(W k,p)′ ,

and by definition D(Ω) is dense in W k,p0 (Ω).

19

Definition 2.26 Let 1 < p ≤ ∞.

W−k,p :=

T ∈ D′(Ω) : ∃vα ∈ Lp(Ω) : T =

∑

α

∂αvα

with norm

‖T‖W−k,p := min

‖v‖Lp(Ω(k)) : T =

∑

α

∂αvα

.

As a corollary to our previous considerations we obtain:

Theorem 2.27 If 1 ≤ p <∞, 1p

+ 1q

= 1, then

(W k,p0 (Ω))′ ∼= W−k,q(Ω).

Recall the Rellich-Kondrachov Theorem:

Theorem 2.28 If Ω ⊂ Rn is open and bounded, then the embeddings

• W 1,p0 → Lq for 1 ≤ p < n, 1 ≤ q < p∗ = pn

n−pand

• W 1,p0 → C0(Ω) for p > n

are compact. If in addition Ω has a Lipschitz boundary ∂Ω, then the embeddings

• W 1,p → Lq for 1 ≤ p < n, 1 ≤ q < p∗ = pn

n−pand

• W 1,p → C(Ω) for p > n

are compact, too.

Corollary 2.29 Under the assumptions of Theorem 2.28, the embedding W 1,p0 (Ω)

(resp. W 1,p(Ω)) → Lp(Ω) is compact for any 1 ≤ p ≤ ∞. In particular, anybounded sequence in W 1,p(Ω) has an Lp-strongly convergent subsequence and

un

(∗) u in W 1,p implies un → u in Lp.

Remark 2.30 Compactness theorems are important to handle nonlinear expres-sions. E.g., suppose uε is W 1,α-bounded sequence of solutions of the quasilinearPDE

a(x, uε) · ∇uε = b(x, uε).

Then for a subsequence uε∗ u in W 1,∞ and, in particular, uε → u uniformly.

If a and b are continuous, we find that u also solves

a(x, u) · ∇u = b(x, u).

20

Recall Schauder’s theorem from functional analysis: A linear operator T : X → Ybetween Banach spaces X and Y is compact if and only if the adjoint operatorT ′ : Y ′ → X ′ is compact. Applying this to the compact embeddings W 1,p

0 →Lp, 1 ≤ p <∞, yields:

Theorem 2.31 Suppose Ω ⊂ Rn is open and bounded. The embeddings Lq(Ω) →W−1,q(Ω), 1 < q ≤ ∞ are compact.

Proof. Let T : W 1,p0 → Lp, Tu = u be the compact embedding operator. Then

T ′ : (Lp)′ → (W 1,p0 )′

T ′ϕ(u) = ϕ(Tu) =

∫ϕu ∀ϕ ∈ (Lp)′, u ∈ W 1,p

0 ,

i.e. T ′ : Lq → W−1,q is the natural embedding from Lq into W−1,q and compact,too.

For p = 1 we have argued earlier that it is convenient to embed L1 in thelarger space M of Radon measures. We therefore prove a compact embeddingresult directly for M.

Theorem 2.32 Suppose Ω ⊂ Rn is open and bounded. The embeddings M(Ω) →W−1,q(Ω), q < n

n−1, are compact.

Proof. For p > n, the embedding W 1,p0 → C0 is compact. Similarly as before it

follows that then also

M ∼= (C0)

′ → (W 1,p0 )′

∼= W−1,q

is compact for q = p

p−1. It remains to note that p > n is equivalent to q < n

n−1.

2.5 A-quasiconvexity

We now resume our disccussion of Section 2.2 on the (semi-)continuity propertiesof nonlinear functionals. Now, however, under additional differential constraints.More precisely, we will consider sequences (u(ν)) ⊂ L∞(Ω; Rm) such that

(H)

u(ν) ∗ u L∞

Au(ν) =

(∑j,k aijk

∂u(ν)j

∂xk

)

i=1,...,q

bd. in L2(Ω; Rq),

f(u(ν))∗ l L∞,

where the aijk ∈ R are constants and so A is a linear partial differential operatorwith constant coefficients.If (H) holds and additionally Au(ν) = 0 ∀ν we say that (H0) holds.

Examples:

21

1. Suppose u(ν) ∈ L∞(Ω; R), Ω ⊂ Rn, is of the form u(ν) = u(ν)(x1). Then u(ν)

satisfies∂u(ν)

∂xj

= 0 for j = 2, ..., n,

i.e., (H0) is satisfied with

ai,j,k = ai,1,k (due to m = 1)

= δi+1,k ∀i = 1, ..., q = k − 1.

2. Suppose u(ν) ∈ L∞(Ω; Rn), Ω ⊂ Rn, is a sequence of gradients:

u(ν) = ∇v(ν), (v(ν)) ⊂W 1,∞.

Then u(ν) satisfies∂(u(ν))j

∂xk

=∂(u(ν))k

∂xj

and hence (H0), where i labels the elements of(i1, i2) ∈ 1, ...n2 , i1 < i2

andaijk = δ(i1,i2),(j,k) − δ(i2,i1),(j,k).

Already this (easy) example shows that it can be cumbersome to writedown aijk explicitly. In the following we will therefore often just list theside conditions as in

∂(u(ν))j

∂xk

=∂(u(ν))k

∂xj

∀j, k ∈ 1, ..., n (⇔: curl u(ν) = 0)

This is the so-called “scalar variational case”.

3. In the sequel, in particular sequences

u(ν) = (v(ν), w(ν)) ∈ (L∞)m × (L∞)m

with Au(ν) = (div v(ν), curlw(ν)), curlw = (wi,j −wj,i)i,j will be interesting.

Recall: By Jensen’s inequality, a function f : Rm → R is convex iff

−∫

D

f(z + ζ(x)) dx ≥ −∫

D

f(z) dx = f(z) (2.3)

∀ζ ∈ L∞(D; Rm) with −∫

Dζ(x) dx = 0.

Proof. Sufficiency is clear by considering a, b ∈ Rm, λ ∈ (0, 1) andz = λa+ (1 − λ)b and

ζ(x) =

a− z on D ⊂ D with |D|

|D| = λ

b− z on D\D.

22

Then

−∫

D

f(z + ζ(x)) =1

|D|(f(a)|D| + f(b)|D\D|) = λf(a) + (1 − λ)f(b)

and

f(z) = f(λa+ (1 − λ)b).

Necessity is just Jensen’s inequality (2.3): Suppose l is a supporting hyperplaneat z: l(z) = f(z), l ≤ f everywhere and l affine, say l(x) = Ax+ b. Then

−∫f(z + ζ(x)) ≥ −

∫l(z + ζ(x)) = Az + b+ A−

∫ζ = Az + b = l(z) = f(z).

In the presence of differential constraints, this motivates:

Definition 2.33 A continuous function f : Rm → R is called A-quasiconvex if

−∫

D

f(z + ζ(x)) dx ≥ −∫

D

f(z) = f(z)

for every z ∈ Rm and every hypercube D = (0, a)n ⊂ Rn and all ζ ∈ L(D), where

L(D) =

ζ ∈ C∞

per(D; Rm) : −∫

D

ζ = 0 and Aζ = 0

.

The space C∞per(D) consists of the restriction to D of C∞-smooth functions

that are D-periodic. Note that without differential constraints, the restrictionto this space does not introduce additional constraints as C∞

per(D) is dense inL∞(D). However, if A 6= 0, then it will not be sufficient to require that Jensen’sinequality be satisfied for fields ζ ∈ L∞(D) with Aζ = 0 in D′(D). By way ofcontrast, it is important that their periodic extension ζ ∈ L∞(Rn) satisfy Aζ = 0in D′(Rn). (Note that jumps could occur on ∂D, whose distributional derivativeis incompatible with the differential constraints.)

In fact, this leads to an equivalent condition: If ζ ∈ L∞(D; Rm), −∫

Dζ = 0 and

its periodic extension satisfies Aζ = 0, then the mollified fields ζε = ηε ∗ ζ , whereηε is a scaled standard mollfier, are easliy seen to lie in C∞

per(D) with∫

Dζε = 0.

Furthermore, Aζε = ηε ∗ Aζ = 0. So if f is A-quasiconvex, we deduce fromζε → ζ in L1(D) as ε → 0 that

−∫

D

f(z + ζ(x))dx ≥ f(z).

Theorem 2.34 Suppose f is continuous and such that l ≥ f(u) whenever asequence u(ν) satifies (H0). Then f is A-quasiconvex.

23

Proof. Let ζ ∈ C∞per(D) be periodically extended to Rn and define

ζ (ν)(x) = ζ(νx), ν ∈ N.

Then ζ (ν) ∗ 0 and

∑

i,j

aijk

∂ζ(ν)j

∂xk

(x) = ν∑

i,j

aijk

∂ζj∂xk

(νx) = 0 ∀x, ∀i, ∀ν.

From∫

D

f(z + ζ (ν)(x))dx = ν−n

∫

νD

f(z + ζ(y))dy =

∫

D

f(z + ζ(y))dy

and the assumption f(z + ζ(·)) ∗ l ≥ f(z) we now deduce that

∫

D

f(z + ζ(y))dy = lim infν→∞

∫

D

f(z + ζ (ν)(x))dx ≥ |D|f(z).

Under an additional technical assumption, the converse of the preceedingtheorem is also known to be true. In order to formulate it we introduce the linearmappings B(ξ) : Rm → Rq (ξ ∈ Rn) by setting

B(ξ)λ =

(m∑

j=1

n∑

k=1

aijkλjξk

)

i=1,...,q

.

One says that the “constant rank assumption” holds, if

RankB(ξ) = RankB(ξ′) ∀ ξ, ξ′ ∈ Rn \ 0.

Under the constant rank assumption it was shown only rather recently byFonseca and Muller that A-quasiconvexity is also sufficient for weak* lower semi-continuity, cf. [FoMu 99]. Their proof, however, is quite involved and so we willgive only a weaker result here.

Theorem 2.35 Suppose f is continuous and such that

−∫

D

f(z + ζ(x)) dx ≥ −∫

D

f(z) = f(z)

for every z ∈ Rm and every hypercube D = (0, a)n ⊂ Rn and all ζ ∈ L∞(D) with

−∫

D

ζ = 0 and Aζ = 0.

24

Suppose (u(ν)), u satisfy (H) and, in addition,

(u(ν)) − u ∈ ker(A) ∀ν.

Then

lim infν→∞

∫

Ω

f(u(ν)(x))dx ≥∫

Ω

f(u(x)) ∀Ω ⊂ Rn open.

Proof. Approximate Ω by unions of hypercubes of side-length 1k:

• Hk =⋃

1≤i≤IkDki

, Dki⊂ Ω translates of (0, 1

k)n

• |Ω\Hk| → 0 k → ∞.

If x ∈ Hk, then set

uk(x) = −∫

Dki

u(y)dy for x ∈ Dki.

Then∫

Hk

|u(x) − uk(x)|dx =∑

i

∫

Dki

|u(x) −−∫

Dki

u(y) dy| dx

≤∑

i

∫

Dki

1

|Dki|

∫

Dki

|u(x) − u(y)| dy dx.

If u is uniformly continuous, then this expression converges to zero as k → ∞.But also for general u ∈ L1(Ω) we have

∫

Hk

|uk| ≤∑

i

∫

Dki

|Dki|−1

∫

Dki

|u(y)| dy =

∫

Hk

|u|,

So that u 7→ uk is a contraction on L1(Hk). Let ε > 0. Then approximate u bya uniformly continuous v such that ‖u− v‖L1 < ε. This implies

‖u− uk‖L1 ≤ ‖u− v‖ + ‖v − vk‖ + ‖uk − vk‖ ≤ ‖v − vk‖ + 2‖u− v‖ ≤ 3ε

for k large. So∫

Hk

|u(x) − uk(x)|dx→ 0 (2.4)

as k → ∞.Note also that if f is continuous and bounded, then for given ε > 0 there

exists a constant C = C(ε) such that f(y1)− f(y2)| ≤ ε4|Ω| +C|y1− y2| holds true

for all y1, y2 ∈ Rm. It follows that∫

Hk

|f(u+ (u(ν) − u)) − f(uk + (u(ν) − u))| ≤ ε

4+ C

∫

Hk

|u− uk| <ε

2(2.5)

25

and∫

Hk

|f(u) − f(uk)| ≤ε

4+ C

∫

Hk

|u− uk| <ε

2(2.6)

for large k by (2.4). Since

f(u(ν)) − f(u) = f(u+ (u(ν) − u)) − f(uk + (u(ν) − u))

+ f(uk + (u(ν) − u)) − f(uk) + f(uk) − f(u),

it follows from equations (2.5) and (2.6) that

∫

Hk

f(u(ν)) −∫

Hk

f(u) ≥∫

Hk

[f(uk + (u(ν) − u)) − f(uk)] dx− ε.

Setting ζ (ν) = u(ν) − u ∈ kerA we get ζ (ν) ∗ 0. Centering

η(ν)(x) := ζ (ν)(x) −−∫

Dki

ζ (ν)(x) for x ∈ Dki,

the second term on the right hand side converges to zero and thus for all i wealso have

η(ν)|Dki∈ kerA, η(ν) ∗

0

and moreover,

−∫

Dki

η(ν) dx = 0.

It follows that∫

Hk

f(u(ν)) −∫

Hk

f(u)

≥∫

Hk

f(uk + ζ (ν)) − f(uk + η(ν))

︸︷︷︸→0 as before

+∑

i

∫

Dki

f(uk + η(ν)) − f(uk)

︸︷︷︸≥0 on each Dki

by assumption

−ε

This implies

lim infν→∞

∫

Hk

f(u(ν)k ) ≥

∫

Hk

f(u) − ε

Let first k ր ∞, then ε ց 0, and the claim follows.

Definition 2.36 Let f be continuous. If f and −f are A-quasiconvex, then fis said to be A-quasiaffine.

As a corollary to Theorem 2.34 we have the following result.

26

Corollary 2.37 Suppose f is continuous and such that l = f(u) whenever asequence u(ν) satifies (H0). Then f is A-quasiconvex.

Note: If f is affine it follows that f is also A-quasiaffine.Important examples of A-quasiconvex functions can be constructed as follows:

Theorem 2.38 Suppose g : RN → R is convex and f : Rm → R can be writtenin the form

f(z) = g(Φ1(z), ...,ΦN (z)) ( = g(Φ(z)) )

where the Φi are A-quasiaffine. Then f is A-quasiconvex.

Proof. For fixed v ∈ RN there exists a linear map a(v)

w 7→ a(v) · w, w ∈ Rn

such thatg(w) ≥ g(v) + a(v) · (w − v).

Applying this inequality to wi = Φi(z + ζ(x)), vi = Φi(z), we obtain for ζ ∈C∞

per(D):

−∫

D

f(z + ζ(x)) dx = −∫

D

g(Φ(z + ζ(x))︸︷︷︸w(x)

) dx

≥ −∫

D

g(Φ(z)) + a(Φ(z)) · (Φ(z + ζ(x)) − Φ(z)) dx

= f(z) +∑

i

ai(Φ(z))

(−∫

D

Φi(z + ζ(x)) dx−−∫

D

Φi(z) dx

)

= f(z)

by A-quasiconvexity of Φi.

A major shortcoming of the general theorems in this section is that they maybe very hard to verify in specific situations: There are no good criterions forA-quasiconvexity. This notion is formulated in terms of a ‘non-local’ integralinequality which has to be satisfied for every test field ζ . We will therefore needother criteria which might be weaker but which can be checked more directly.

2.6 The Legendre-Hadamard condition

In this section we will discuss a necessary condition for lower semicontinuity whichcan be checked locally.

27

Recall the differential constraints in assumption (H)

Au(ν) =

(m∑

j=1

n∑

k=1

aijk

∂u(ν)j

∂xk

)

i=1,...,q

bounded in L2(Rq)

and the associated linear maps B(ξ) : Rm → Rq (ξ ∈ Rn) given by

B(ξ)λ =

(m∑

j=1

n∑

k=1

aijkλjξk

)

i=1,...,q

.

Also define the sets

V = (λ, ξ) ∈ Rm × Rn : B(ξ)λ = 0 ⊂ Rm × Rn,

Λ = λ ∈ Rm : ∃ξ ∈ Rn\ 0 : (λ, ξ) ∈ V ⊂ Rm.

Remark 2.39 The less restrictive our constraints that Au(ν) be bounded are, thebigger V is. If (λ, ξ) ∈ V, λ 6= 0, ξ 6= 0, then these constraints do not prevent fastoscillations with direction Λ in the target space and direction ξ in the originaldomain: For a periodic function ψ : R → R let

u(ν)(x) = λψ(νξ · x).

Then

(Au(ν))i =∑

j,k

aijk

∂u(ν)j

∂xk

=∑

j,k

aijkλjψ′(νξ · x)νξk

= ψ′(νξ · x)ν(B(ξ)λ)i

= 0.

Examples:

1. No constraints: A = 0. Then Λ = Rm.

2.∂u

(ν)j

∂xkbounded in L2 ∀j, k. Then Λ = 0.

(Note: By Rellich-Kondrachov u(ν) is compact in L2. If (H) holds, thenu(ν) → u strongly in L2.)

3. If u(ν) : Ω → Rm,∂u

(ν)j

∂xk= 0, k = 2, ..., n, then (H0) holds with

ai,j,k = ai,1,k = δi+1,k, i = 1, ..., n− 1

28

B(ξ)λ = (∑

k δi+1,kλξk)i = λ(ξ2, ..., ξn).I.e.

V = (0 × Rn) ∪(R × 0 × Rn−1

)⊂ R × Rn

Λ = R.

4. The scalar variational case: u(ν) : Ω → Rn with

u(ν) = ∇v(ν), i.e.

Au(ν) = (∂u

(ν)j

∂xk

− ∂u(ν)k

∂xj

)j,k=1,...,n = 0.

It follows that B(ξ)λ = (λjξk − λkξj)j,k.Now B(ξ)λ = 0 iff λjξk − λkξj = 0 ∀j, k. If without loss of generalityλ1 6= 0, then ξk = ξ1

λ1λk implies ξ ‖ λ.

Conversely, ξ ‖ λ implies w.l.o.g.

λjξk − λkξj = µλkλj − µλkλj = 0.

It followsV = (λ, ξ) : λ ‖ ξ ⊂ Rn × Rn

and thusΛ = Rn.

5. The general variational case:If u = Dv, v : Ω → Rp, then u satisfies

Au = (curl∇v1, ..., curl∇vp) = 0.

With ujk = ∂kvj this is equivalent to

∂ujk

∂xl

− ∂ujl

∂xk

= 0 ∀j = 1, ..., p, k, l = 1, ..., n.

Consequently, for ξ ∈ Rn, λ ∈ Rpn,

B(ξ)λ = (λjkξl − λjlξk)j,k,l.

As B(ξ)λ = 0 is equivalent to (λjkξl − λjlξk)k,l = 0 for every fixed j andhence, as before, λj . ‖ ξ, it follows

V = (λ, ξ) ∈ Rpn × Rn : λj. ‖ ξ ,where λj. denotes the j-th row of the matrix (λjk) 1≤j≤p

1≤k≤n.

Therefore

Λ = λ ∈ Rpn : ∃ξ ∈ Rn\ 0 : λj . ‖ ξ ∀j= λ ∈ Rpn : λj. ‖ λi. ∀i, j= a⊗ b : a ∈ Rp, b ∈ Rn “the Rank-1-matrices”.

29

We now prove an important necessary condition for weak lower semicontinuity.

Theorem 2.40 1. If l ≥ f(u) for any sequence satisfying (H0), then f isconvex in the directions of Λ, i.e., t 7→ f(a+ tb) is convex ∀a ∈ Rm, b ∈ Λ.

2. If l = f(u) for any sequence satisfying (H0), then f is affine in the directionsof Λ.

In the variational case this is called the “Legendre-Hadamard-” or “ellipticitycondition”.Proof. Only the first statement is to be shown; the second is an immediateconsequence of the first one.Let t1, t2 ∈ R, y1 = a + t1b, y2 = a + t2b and µ ∈ (0, 1). b ∈ Λ implies ∃ξ 6= 0such that ∑

j,k

aijkbjξk = 0 ∀i ∈ 1, ..., q .

Let ψ : R → R be 1-periodic with

ψ(t) =

(1 − µ)(t1 − t2) for 0 ≤ t < µ

µ(t2 − t1) for µ ≤ t < 1.

Define u(ν) ∈ L∞(Ω) (Q-periodic with Q = R(0, 1)n, R ∈ SO(n) such thatRe1 = ξ

|ξ|) by

u(ν)(x) = z + bψ

(νξ

|ξ| · x), z = µy1 + (1 − µ)y2.

Then u(ν) is highly oscillating and converges to

w*- lim u(ν) = z

because

−∫

Q

u(ν) = z + b−∫

Q

ψ(ξ

|ξ| · x) = z +

∫ 1

0

ψ(t)dt

= z + µ(1 − µ)(t1 − t2) + (1 − µ)µ(t2 − t1) = z.

Similarly,

w*- lim f(u(ν)) = −∫

Q

f

(z + bψ

(ξ

|ξ| · x))

=

∫ 1

0

f(z + bψ(t))dt,

30

where

z + bψ(t) = µy1 + (1 − µ)y2 + b

(1 − µ)(t1 − t2), t < µ

µ(t2 − t1), t > µ

= µa+ µt1b+ (1 − µ)(a+ t2b) + b

(1 − µ)(t1 − t2), t < µ

µ(t2 − t1), t > µ

= a +

t1b

t2b=

y1

y2.

Hence,w*- lim f(u(ν)) = µf(y1) + (1 − µ)f(y2).

Now by assumption

f(µy1 + (1 − µ)y2) = f(z) = f(w*- lim u(ν))

≥ w*- lim f(u(ν))

= µf(y1) + (1 − µ)f(y2).

As alsoAu(ν) = 0 in D′

(Exercise), the proof is complete.

Examples:

1. Suppose Λ = Rm (e.g., if A = 0, i.e., no side-conditions). Then l ≥ f(u)for any sequence satisfying (H0) if and only if f is convex.

2. Λ = 0 (E.g. ai,j,k = a(i1,i2),jk= δi1jδi2k) leads to the ”compact case”.

3. In the general variational case (see above):

Λ = a⊗ b : a ∈ Rp, b ∈ Rn .

By Theorem 2.40, a necessary condition for weakly* lower semicontinuityis that t 7→ f(A + tB) to be convex ∀B ∈ Λ, i.e., ∀B of rank 1, i.e., ”f isrank-1-convex”.

Remark 2.41 If f ∈ C2(Rm) then convexity in directions of Λ is equivalent to

D2f(a)(b, b) ≥ 0 ∀a ∈ Rm, b ∈ Λ.

31

The following important example shows that the converse of the precedingtheorem is wrong: Example:Ω ⊂ Rm, m = 3. Let

u(ν) =

u(ν)1 (x1, x2)

u(ν)2 (x1, x2)

u(ν)3 (x1, x2)

=

sin(νx2)cos(νx1)

sin(ν(x1 − x2))

Then u(ν) satisfies (H0) with

Au(ν) =

(∂u

(ν)1

∂x1

,∂u

(ν)2

∂x2

,∂u

(ν)3

∂x1

+∂u

(ν)3

∂x2

).

We calculateB(ξ)λ = (λ1ξ1, λ2ξ2, λ3(ξ1 + ξ2))

and so

Λ = (λ1, λ2, λ3) : at least two of the λj vanish= Re1 ∪ Re2Re3.

Clearly, f : R3 → R with f(x, y, z) = xyz is affine in the directions of Λ. Fur-thermore,

u(ν)1 , u

(ν)2 , u

(ν)3

∗ 0.

But

f(u(ν)(x)) = u(ν)1 (x)u

(ν)2 (x)u

(ν)3 (x)

= sin(νx2) cos(νx1) (sin(νx1) cos(νx2) − sin(νx2) cos(νx1))

=1

4sin(2νx1) cos(2νx2) − sin2(νx2) cos2(νx1)

and thus

f(u(ν))∗ 0 − 1

4< 0 = f(0).

Theorem 2.42 Let f ∈ Cr, r ≥ 2 and suppose l = f(u) for any sequencesatisfying (H). Then f satisfies the following condition:If (λ1, ξ1), ..., (λr, ξr) ∈ V with ξj 6= 0 ∀j are such that Rank(ξ1, ...ξr) ≤ r − 1,then

Drf(y)[λ1, ..., λr] = 0 ∀y ∈ Rm.

Proof. (only for r = 2 or 3) Use induction on r.

32

1. r = 2. As Rank(ξ1, ξ2) = 1, ξ1 and ξ2 are linearly dependent and so, forsome c ∈ R, B(ξ1)λ2 = cB(ξ2)λ2 = 0. So in fact

B(ξ1) = 0 ∀λ ∈ span λ1, λ2 ,

and, in particular, span λ1, λ2 ⊂ Λ.By Theorem 2.40 (also cf. Remark 2.41) the quadratic form D2f(y)[·, ·]vanishes on Λ. It follows that

D2f(y)[λ1, λ2] = 0.

2. Consider ϕ1, ϕ2, ϕ3 periodic with average zero and define

u(ν)(x) = u(x) + t[λ1ϕ1(νξ1 · x) + λ2ϕ2(νξ2 · x) + λ3ϕ3(νξ3 · x)],

u(x) ≡ y ∈ Rm. For any t,

u(ν) ∗ u in L∞ and Au(ν) = Au = 0.

For |t| small we have

f(u(ν)(x)) = f(y) + tDf(y)[λ1ϕ1(νξ1 · x) + λ2ϕ2(νξ2 · x) + λ3ϕ3(νξ3 · x)]

+1

2t2D2f(y)[λ1ϕ1(νξ1 · x) + . . . , λ1ϕ1(νξ1 · x) + . . . ]

+1

6t3D3f(y)[. . . , . . . , . . .]

+O(t4),

where the linear term converges weakly* to zero.

3. Consider the quadratic term

D2f(y)[..., ...] =∑

1≤i,j≤3

D2f(y)[λi, λj]ϕi(νξi · x)ϕj(νξj · x).

If Rank(ξi, ξj) = 1, then as in the first step we have

D2f(y)[λi, λj] = 0.

If Rank(ξi, ξj) = 2, then

ϕi(νξi · x)ϕj(νξj · x) ∗ 0

(Exercise!). Consequently,

D2f(y)[..., ...]∗ 0.

33

4. We finally investigate the cubic term

D3f(y)[. . . , . . . , . . .] =∑

i,j,k

D3f(y)[λi, λj, λk]ϕi(νξi · x)ϕj(νξj · x)ϕk(νξk · x).

Fix i, j, k ∈ 1, 2, 3. If two of the vectors ξ1, ξ2, ξ3 are parallel, say ξi ‖ ξj(w.l.o.g.), then as in the first step

D2f(y)[λi, λj] = 0 ∀y.

This remains true if y is replaced by y + tλk. So

D3f(y)[λi, λj, λk] =d

dt

∣∣∣∣t=0

D2f(y + tλk)[λi, λj] = 0.

If, on the other hand, ξi is not parallel to ξj for i 6= j, then one can chooseϕ1, ϕ2, ϕ3 in such a way as to have

ϕ1(νξ1 · x)ϕ2(νξ2 · x)ϕ3(νξ3 · x) ∗ c 6= 0

(Exercise!). As by assumption l = lim f(u(ν)) = f(y), we obtain

0 = lim∑

i,j,k

D3f(y)[λi, λj , λk]ϕiϕjϕk

= 3 limD3f(y)[λ1, λ2, λ3]ϕ1ϕ2ϕ3

= 3cD3f(y)[λ1, λ2, λ3].

Corollary 2.43 Let dim span Λ = d and choose coordinates so that

span Λ = y ∈ Rm : yd+1 = . . . = yd = 0 .

If l = f(u) for every sequence satisfying (H), then f is a polynomial in y1, ..., yd

of degree at most minn, d whose coefficients depend on yd+1, ..., ym.

Proof. Suppose (e1, ..., ed) ∈ Λ is a basis of span Λ and (e1, ..., en) is a basis of

34

Rm. For y =∑yiei we calculate, using that f is affine in directions of Λ,

f(y) = f

y1

e1︸︷︷︸

∈Λ

+

m∑

i=2

yiei

+ (1 − y1)

m∑

i=2

yiei

= y1f

(e1 +

m∑

i=2

yiei

)+ (1 − y1)f

(m∑

i=2

yiei

)

= y1

[f

(y2

(e1 + e2 +

m∑

i=3

yiei

)+ (1 − y2)

(e1 +

m∑

i=3

yiei

))]

+ (1 − y1)

[f

(y2

(e2 +

m∑

i=3

yiei

)+ (1 − y2)

m∑

i=3

yiei

)]

= y1

[y2f

(e1 + e2 +

m∑

i=3

yiei

)+ (1 − y2)f

(e1 +

m∑

i=3

yiei

)]

+ (1 − y1)

[y2f

(e2 +

m∑

i=3

yiei

)+ (1 − y2)f

(m∑

i=3

yiei

)]

= . . .

Continuing this way, we see that f is a polynomial in y1, ...yd of degree ≤ d.But also deg f ≤ n: If (λ1, ..., λn+1) ∈ Λ, choose ξj 6= 0 such that (λj, ξj) ∈ V.

SinceRank(ξ1, ...ξn+1) ≤ Rank Rn = n,

one hasD(n+1)f(y)[λ1, ..., λn+1] = 0

by Theorem 2.42, and thusD

(n+1)span Λf = 0.

2.7 Temperierte Distributionen

One of the most important tool in analysis, in particular for questions in PDEtheory, is the Fourier transform. In this section we will extend this notion todistributions. From the functional analytic point of view this is done by dualityof distributions with test functions. Unfortunately, it turns out that the spaceD(Rn) is not suitable for this undertaking as it is not closed under taking Fouriertransforms. However, it holds true that ϕ(ξ) for ϕ ∈ D(Rn) is still smooth and‘rapidly decreasing’ as |ξ| → ∞. Ths observation leads to a new test functionspace S of rapidly decreasing smooth functions, which in particular contains

35

D(Rn). Correspondingly we identify a new class S ′ of tempered distributionswhich act as linear functionals on S. Roughly speaking this class consists ofthose (normal) distributions, which do not grow too fast at infinity.

Definition 2.44 (i) Der Schwartz-Raum S ist definiert durch

S := ϕ ∈ C∞(Rn) : ‖ϕ‖N <∞ ∀N ∈ N,wobei ‖ · ‖N die Norm

‖ϕ‖ := max|α|,|β|≤N

supx∈Rn

|xα∂βϕ(x)|

bezeichnet.

(ii) Eine Folge (ϕj) ⊂ S von Schwartz-Funktionen konvergiert in S gegen ϕ ∈S, wenn ‖ϕj − ϕ‖N → 0 f”ur alle N ∈ N.

Beispiele:

1. Offenbar ist D(Rn) ⊂ S. Die Funktion x 7→ e−x2aber liegt in S, jedoch

nicht in D(Rn).

2. Gilt ϕk → ϕ in D(Rn), so gilt auch ϕk → ϕ in S. (Der Beweis ist einfach.)

Remark 2.45 S ist ein lokalkonvexer Raum, dessen Topologie von der Familieder Normen (‖ · ‖N)N∈N induziert wird. Es gibt eine Metrik, die diese Topologieerzeugt. D(Rn) liegt dicht in S. (Ubung!)

Lemma 2.46 (i) Ist ϕ ∈ S, so ist auch x 7→ xα∂βϕ(x) ∈ S fur alle Multiin-dizes α, β.

(ii) Fur jeden Sobolevraum W k,p(Rn) mit k ∈ N0 und 1 ≤ p ≤ ∞ gilt S ⊂W k,p(Rn) und es gibt eine Konstante C = C(k, p, n) und ein N = N(k, p, n),so dass

‖ϕ‖W k,p(Rn) ≤ C‖ϕ‖N ∀ϕ ∈ S.(iii) ϕ, ψ ∈ S =⇒ ϕψ ∈ S.

Proof. (i) und (iii) sind einfach (Leibniz-Regel!).(ii) Seien ϕ ∈ S und α ein Multiindex mit |α| ≤ k. Es gilt∫

Rn

|∂αϕ|p ≤∫

B1(0)

|∂αϕ|p +

∫

Rn\B1(0)

|∂αϕ|p

≤ |B1(0)| supx∈B1(0)

|∂αϕ(x)|p +

∫

Rn\B1(0)

|x|−n−1|x|n+1|∂αϕ(x)|p dx

≤ C‖ϕ‖pk + sup

x∈Rn\B1(0)

|x|n+1|∂αϕ(x)|p ·∫

Rn\B1(0)

|x|−n−1 dx

≤ C‖ϕ‖pk + C‖ϕ‖p

maxk,n+1

≤ C‖ϕ‖p

maxk,n+1.

36

Daraus ergibt sich die Behauptung.

Eine wichtige Eigenschaft des Schwartz-Raumes ist seine Invarianz unter Fouri-ertransformation. Wegen S ⊂ L1 ist die Fouriertransformierte einer Schwartz-Funktion ϕ ∈ S gegeben durch

Fϕ(ξ) = ϕ(ξ) =1

(2π)n2

∫

Rn

e−ix·ξϕ(x) dx.

Wir stellen einige (teils schon bekannte) Tatsachen uber die Fouriertransforma-tion auf S zusammen:

Lemma 2.47 Es seien ϕ, ϕk, ψ ∈ S, k = 1, 2, . . ., α ein Multiindex und h ∈ Rn.Dann gilt

(i) ∂αϕ(ξ) = (iξ)αϕ(ξ) und ∂αϕ(x) = (−ix)αϕ(ξ),

(iia) τhϕ(ξ) = e−ih·ξϕ(ξ) und τhϕ(ξ) = eih·xϕ(ξ),

(iib) ϕλ(ξ) = λnϕ(λξ) fur ϕλ(x) := ϕ(λx),

(iic) ˆϕ = ϕ,

(iii) ϕ ∈ S,

(iv) ϕk → ϕ in S =⇒ ϕk → ϕ in S.

Proof. (ii) und (v) sind schon bekanntAuch (i) folgt aus schon bekannten Eigenschaften der Fouriertransformation

auf L1: Die erste Gleichung ergibt sich aus ∂βϕ ∈ S ⊂ L1 fur alle Multiindizesβ; fur die zweite Gleichung beachte, dass (−ix)βϕ ∈ S ⊂ L1 fur alle Multiindizes

β gilt, so dass in der Tat ϕ differenzierbar ist mit ∂αϕ(x) = (−ix)αϕ(ξ).(iii) & (iv): Wie eben begrundet ist ϕ C∞-glatt mit

|ξβ∂γϕ(ξ)| = | ∂β(xγϕ)(ξ)| ≤ 1

(2π)n2

∫|∂β(xγϕ)| dx

=1

(2π)n2

‖∂β(xγϕ)‖L1 ≤ C‖∂β(xγϕ)‖N

fur hinreichend großes N ∈ N nach Lemma 2.46(ii). Nach Vergroßerung von Nergibt sich mit Hilfe der Leibniz-Regel

|ξβ∂γ ϕ(ξ)| ≤ C‖ϕ‖N .

Dies zeigt, dass es zu jedem N ∈ N ein N ∈ N und eine Konstante C gibt, sodass

‖ϕ‖N ≤ C‖ϕ‖N ∀ϕ ∈ S.

37

Das beendet den Beweis von (iii) und zeigt außerdem (iv).

Wir erinnern hier noch an die Tatsache, dass F : S → S sich zu einer linearenIsometrie F : L2(Rn) → L2(Rn) fortsetzt.

Wie in der Theorie der Fouriertransformation ublich betrachten wir kom-plexwertige Funktionen.

Definition 2.48 Eine lineare Abbildung T : S → C ist eine temperierte Distribution(man schreibt T ∈ S ′), wenn

ϕk → ϕ in S =⇒ Tϕk → Tϕ in C.

Wegen D(Rn) ⊂ S und ϕk → ϕ in D(Rn) =⇒ ϕk → ϕ in S gilt S ′ ⊂ D′(Rn).(Genauer: T |D(Rn) ∈ D′(Rn) fur alle T ∈ S ′.) Da D(Rn) dicht in S liegt (Ubung!),gilt sogar S ′ → D(Rn).

Theorem 2.49 Es sei T : S → C eine lineare Abbildung. T ist genau dann einetemperierte Distribution, wenn C > 0 und N ∈ N existieren, so dass

|Tϕ| ≤ C‖ϕ‖N ∀ϕ ∈ S

gilt.

Proof. Dass die Bedingung hinreichend fur T ∈ S ′ ist, ist klar.Um die Notwendigkeit zu beweisen, nehmen wir an, es gabe zu jedem k ∈ N

ein ϕk ∈ S, so dass|Tϕk| > k‖ϕk‖k

gilt. O.B.d.A. ist zudem Tϕk = 1 fur alle k. (Multipliziere mit geeignetenSkalaren.) Fur jedes k0 ∈ N ist dann aber

1 > k‖ϕk‖k ≥ k‖ϕk‖k0 ∀ k ≥ k0,

so dass limk→∞ ‖ϕk‖k0 = 0. Dies zeigt ϕk → 0 in S. Jedoch konvergiert Tϕk = 1nicht gegen 0.

Beispiele:

1. Lp ⊂ S ′ fur alle 1 ≤ p ≤ ∞, nicht jedoch Lploc, wie 4. zeigen wird.

2. Alle Polynome liegen in S ′: Fur ϕ ∈ S, p ein Polynom gilt |∫p ϕ| ≤

‖pϕ‖L1 ≤ C‖ϕ‖N fur hinreichend großes N .

3. Endliche Borel-Maße µ sind temperierte Distributionen gemaß ϕ 7→∫ϕdµ,

denn es gilt ∣∣∣∣∫ϕdµ

∣∣∣∣ ≤ |µ|(Rn)‖ϕ‖L∞.

38

4. Die Funktion x 7→ ex2liegt nicht in S ′ (aber in D′(Rn)). (Ubung.)

Definition 2.50 Eine Folge von temperierten Distributionen Tn konvergiert in S ′

gegen T ∈ S ′, wenn Tnϕ in C gegen Tϕ konvergiert fur alle ϕ ∈ S.

Genau wie fur D′ definiert man die Ableitungen ∂αT , die Reflektion T unddie Verschiebung τhT fur T ∈ S: Zur Motivation bemerkt man zunachst, dassfur f ∈ L1

loc, ϕ ∈ D(Rn) gilt∫τhf ϕ =

∫f(x− h)ϕ(x) dx =

∫f(x)ϕ(x+ h) dx =

∫f τ−hϕ,

∫f ϕ =

∫f(−x)ϕ(x) dx =

∫f(x)ϕ(−x) dx =

∫f ϕ.

Dies erhebt man nun zur Definition:

Definition 2.51 Fur T ∈ D′(Rn) oder T ∈ S definiert man die Distributionen(bzw. temperierten Distributionen) τhT und T durch

τhT (ϕ) := T (τ−hϕ), T (ϕ) := T (ϕ) ∀ϕ ∈ D(Rn) bzw. S.

Man muss sich davon uberzugen, dass diese Ausdrucke als Elemente vonD′(Rn) bzw. S ′ wohldefiniert sind. Das ist aber einfach.

Die Multiplikation mit glatten Funktionen ist jedoch i.A. nur auf D′(Rn)wohldefiniert. (Z.B. ist 1 ∈ L∞ ⊂ S ′ aber ex2

= ex2 · 1 /∈ S ′.)

Lemma 2.52 Es sei ϕ ∈ C∞, so dass jede Ableitung ∂αϕ hochstens polynomielldivergiert: Es gibt Konstanten C = C(α), N = N(α), so dass

|∂αϕ(x)| ≤ C(1 + |x|N) ∀x ∈ Rn.

Dann ist ϕT wohldefiniert.

Proof. Fur ψ ∈ S und Multiindizes α und β ist

|xα∂β(ϕψ)(x)| =

∣∣∣∣∣xα∑

γ≤β

(β

γ

)∂γϕ(x)∂β−γψ(x)

∣∣∣∣∣

≤∑

γ≤β

(β

γ

)C(γ)(1 + |x|N(γ))|xα∂γψ(x)|

≤ C‖ψ‖N

fur C und N (nur von α, β abhangend) groß genug. Dies zeigt ϕψ ∈ S undψk → ψ in S =⇒ ϕψk → ϕψ in S.

Beispiel: Insbesondere darf man temperierte Distributionen also mit Polynomenund Funktionen der Form x 7→ eia·x, a ∈ Rn, oder auch (1 + |x|2)s, s ∈ R,multiplizieren.

39

Wir kommen nun zur Fouriertransformation fur temperierte Distributionen.Fur ϕ, ψ ∈ S gilt

∫ϕψ =

∫ϕψ, denn die Fouriertransformation ist eine L2-

Isometrie, so dass∫ϕψ =

∫ϕψ =

∫ˆϕψ =

∫ϕ

ˇψ =

∫ϕψ,

wobei wir im dritten Schritt ausgenutzt haben, dass

χ(ξ) =1

(2π)n2

∫eix·ξχ(x) dx = ˇχ(x)

fur alle χ ∈ S gilt. Dadurch motiviert definieren wir:

Definition 2.53 Ist T ∈ S ′, so wird durch

Tϕ := T ϕ ∀ϕ ∈ S

die Fouriertransformierte FT := T ∈ S ′ definiert.

Dies ist wohldefiniert nach Lemma 2.47(iii) und (iv).

Theorem 2.54 Die Abbildung F : S ′ → S ′ ist linear und stetig. Fur ϕ ∈ S,T ∈ S ′, Multiindizes α und h ∈ Rn gilt

(i) ∂αT = (iξ)αT und ∂αT = (−ix)αT ,

(ii) τhT = e−ih·ξT und τhT = eih·xT ,

(iii)ˆT = T .

Proof. Die Stetigkeit von F ist klar.(i) Aus den entsprechenden Eigenschaften fur Schwartz-Funktionen folgt

∂αTψ = ∂αT ψ = (−1)|α|T (∂αψ) = (−1)|α|T ( (−ix)αψ)

= (−1)|α|T ((−ix)αψ) = (iξ)αT ψ

sowie

∂αT ψ = (−1)|α|T (∂αψ) = (−1)|α|T (∂αψ) = (−1)|α|T ((iξ)αψ)

= (−iξ)|α|T (ψ) = (−iξ)|α|Tψ

fur Schwartz-Funktionen ψ.(ii) Dies folgt nach dem gleichen Schema aus den entsprechenden Eigen-

schaften fur Schwartz-Funktionen.

(iii) Fur Schwartz-Funktionen ψ giltˆTψ = T

ˆψ = T ψ = T ψ.

Beispiele:

40

1. δ = 1

(2π)n2, denn

δϕ = δ(ϕ) =1

(2π)n2

∫

Rn

e−i0·xϕ(x) dx =

∫

Rn

1

(2π)n2

ϕ(x) dx ∀ϕ ∈ S.

2. 1 = (2π)n2 δ, denn nach 1. gilt

1 = (2π)n2ˆδ = (2π)

n2 δ = (2π)

n2 δ.

2.8 Sobolevraume und Fouriertransformation

In [Sch 10] haben wir insbesondere die Sobolveraume

Hk = Hk(Rn) = u ∈ L2 : ∂αu ∈ L2 ∀ |α| ≤ kuntersucht. In Abschnitt 2.4 haben wir zudem die Raume H−k, k ∈ N, betrach-tet. Hier werden wir Hs fur beliebige s ∈ R definieren. Dazu benotigen wirzunachst eine Charakterisierung von Hk, die nicht ausnutzt, dass k ∈ Z ist. DerEinfachheit halber beschranken wir uns hier im Wesentlichen auf Funktionen, dieauf ganz Rn definiert sind.

Theorem 2.55 Sei u ∈ L2. Es gilt u ∈ Hk, k ∈ N0, genau dann, wenn ξ 7→(1 + |ξ|2) k

2 u(ξ) ∈ L2 ist. Die Norm

u 7→ ‖(1 + |ξ|2) k2 u‖L2

ist aquivalent zur Hk-Norm.

Proof. Aus der Formel von Plancherel ergibt sich

∑

|α|≤k

‖∂αu‖2L2 =

∑

|α|≤k

‖∂αu‖2L2 =

∑

|α|≤k

‖ξαu‖2L2 =

∫

Rn

∑

|α|≤k

|ξα|2 |u(ξ)|2 dξ

(mit ‖T‖L2 := ∞ fur T ∈ S ′ \ L2). Nun gilt einerseits∑

|α|≤k

|ξα|2 ≤ C(1 + |ξ|2)k

(Fallunterscheidung, ob |ξα| ≤ 1 oder > 1), andererseits

(1 + |ξ|2)k ≤ 2k(1 + |ξ|2k) = C(1 + (|ξ1|2 + . . .+ |ξn|2)k) ≤ C∑

|α|≤k

|ξα|2,

d.h. c(1+ |ξ|2)k ≤∑|α|≤k |ξα|2 ≤ C(1+ |ξ|2)k fur geeignete c, C > 0. Daraus folgt

c‖(1 + |ξ|2) k2 u‖2

L2 ≤ ‖u‖2Hk ≤ C‖(1 + |ξ|2) k

2 u‖2L2 .

Dies motiviert die folgende Definition:

41

Definition 2.56 Fur s ∈ R setze

Hs := Hs(Rn)

:= f ∈ S ′(Rn) : f ∈ L1loc(R

n) mit ‖f‖s := ‖(1 + |ξ|2) s2 f‖L2 <∞.

(Ist g eine Funktion mit (1 + |ξ|2) s2 g ∈ L2, dann ist gϕ ∈ L1 fur ϕ ∈ S, so dass g

via ϕ 7→∫gϕ als temperierte Distribution aufgefasst werden kann. Ein solches g

lasst sich also immer als g = f fur ein f ∈ S ′ schreiben.)

Remark 2.57 1. Es gilt Hs ⊂ Hs′ fur s > s′.

2. Nach Satz 2.55 stimmt diese Definition mit der fruheren Definition furs ∈ N uberein (bis auf den Ubergang zu einer aquivalenten Norm).

3. Hs ist ein Hilbertraum bezuglich

〈f, g〉s =

∫(1 + |ξ|2)sf(ξ)¯g(ξ) dξ.

Der nachste Satz zeigt, dass (Hs)′ in kanonischer Weise isomorph zu H−s ist.

Theorem 2.58 Sei s ∈ R. Die Abbildung

Φ : H−s → (Hs)′ mit (Φg)(f) := (f , g)L2 =

∫f ¯g

ist ein antilinearer isometrischer Isomorphismus.

Proof. Zunachst beachte, dass f ∈ Hs, g ∈ H−s

f ¯g = (1 + |ξ|2) s2 f · (1 + |ξ|2)− s

2 ¯g ∈ L2 · L2 ⊂ L1

impliziert, so dass f 7→∫f ¯g ein stetiges Funktional Φg auf Hs definiert mit

|(Φg)(f)| =

∣∣∣∣∫f ¯g

∣∣∣∣ ≤ ‖(1 + |ξ|2) s2 f‖L2‖(1 + |ξ|2)− s

2 ¯g‖L2 = ‖f‖s‖g‖−s.

Da in dieser Ungleichung Gleichheit gilt, wenn f = (1 + |ξ|2)−sg ist, folgt, dassΦ eine antilineare Isometrie ist.

Es bleibt zu begrunden, dass Φ surjektiv ist. Sei dazu 〈·, g〉s ∈ (Hs)′, g ∈ Hs.Dann ist (1 + |ξ|2)s ˆg ∈ S ′ und indem wir g = F−1((1 + |ξ|2)s ˆg) setzen, erhaltenwir g ∈ S ′ mit g = (1 + |ξ|2)s ˆg und (1 + |ξ|2)− s

2 g = (1 + |ξ|2) s2 ˆg ∈ L2. Es folgt

(Φg)(f) =

∫f ¯g =

∫(1 + |ξ|2)sf ¯g = 〈f, g〉s.

fur f ∈ Hs.

Beispiel: Fur 〈·, g〉s ∈ (H1)′, g ∈ H1, erhalt man g = F−1((1 + |ξ|2)s ˆg) =(1 − ∆)g ∈ H−1.

42

Theorem 2.59 Sei s ∈ R, k ∈ N. Dann ist f ∈ Hs genau dann, wenn ∂αf ∈Hs−k ist fur alle |α| ≤ k. Die Normen

‖f‖s und

∑

|α|≤k

‖∂αf‖2s−k

12

sind aquivalent.

Proof. Das geht ahnlich wie der Beweis von Satz 2.55.

Beispiele:

1. Es gilt S ⊂ Hs fur alle s ∈ R. Umgekehrt impliziert der SobolevscheEinbettungssatz (s. Skript PDG 1 und Satz 2.60 unten), dass

⋂s∈R

Hs ⊂C∞.

2. Sei f : R → R gegeben durch f(x) =√

2π

sinxx

. Dann ist f = χ(−1,1) und

damit f ∈ Hs fur alle s. Beachte aber, dass f nicht in S liegt.

3. Im Rn gilt δ = 1

(2π)n2. Damit ist

‖δ‖2s =

1

(2π)n2

∫

Rn

(1 + |ξ|2)s dξ =|∂B1(0)|(2π)

n2

∫ ∞

0

(1 + r2)srn−1 dr <∞

genau dann, wenn 2s+ n− 1 < −1, also wenn s < −n2

ist.

Der Sobolevsche Einbettungssatz fur die Raume Hs lautet wir folgt.

Theorem 2.60 Fur s > m+ n2

gilt Hs → Cm.

Fur s ∈ N haben wir diesen Satz schon im Skript PDG 1 bewiesen. In derTat ist der folgende Beweis fur allgemeine s mit Hilfe der Fouriertransformationsogar einfacher (funktioniert aber nicht in der allgemeinen Form fur W k,p, p 6= 2).Proof. Wegen F−1 : L1(Rn) → C(Rn) mit ‖F−1g‖L∞ ≤ ‖g‖L1, genugt es zuzeigen, dass fur alle Multiindizes α mit |α| ≤ m gilt

f ∈ Hs =⇒ ∂αf ∈ L1 mit ‖∂αf‖L1 ≤ C‖f‖s.

Dies wiederum sieht man wie folgt:∫

|ξαf(ξ)| dξ ≤ C

∫(1 + |ξ|2)m

2 |f(ξ)| dξ

= C

∫(1 + |ξ|2) s

2 |f(ξ)| · (1 + |ξ|2)m−s2 dξ

≤ C‖f‖s

(∫(1 + |ξ|2)m−s dξ

)12

≤ C‖f‖s,

43

da ∫ ∞

0

(1 + r2)m−srn−1 dr <∞

ist fur 2(m− s) + n− 1 < −1 ⇐⇒ s > m+ n2.

Wir erwahnen noch, wie man Hs-Distributionen auf allgemeineren GebietenΩ erklart. Beachte, dass f 7→ ϕf ein stetiger Operator auf allen Hs ist fur ϕ ∈ S.Fur Ω ⊂ Rn offen setzt man

Hsloc(Ω) := T ∈ D′(Ω) : ∀U ⊂⊂ Ω ∃f ∈ Hs(Rn) mit T = S auf U.

Es gilt dann T ∈ Hsloc(Ω) ⇐⇒ ϕT ∈ Hs(Rn) ∀ϕ ∈ D(Ω).

Wie fur naturliche Exponenten definiert man

Hs0(Ω) := C∞

c (Ω)

(Abschluss in der Hs-Norm).Ist schließlich ∂Ω hinreichend gutartig (z.B. Lipschitz), so kann man auch

Hs(Ω) durch Einschrankung auf Ω definieren.

2.9 Compensated compactness

In the quadratic case the Legendre-Hadamard condition even turns out to besufficient.

Theorem 2.61 Let M ∈ Rm×m be a symmetric matrix and set

f(a) = aTMa for a ∈ Rm.

Assume that

(H)

u(ν) u in L2(Ω; Rm),

f(u(ν)) l in D′(Ω),

Au(ν) =

(∑j,k

∂u(ν)j

∂xk

)

i=1,...,q

is compact in W−1,2loc (Ω).

Then l ∈ M and the following implications hold true.

(i) If f(λ) ≥ 0 ∀ λ ∈ Λ, then l ≥ f(u) (as measures).

(ii) If f(λ) = 0 ∀ λ ∈ Λ, then l = f(u).

Recall that

Λ =

λ ∈ Rm : ∃ ξ ∈ Rn\ 0 s.t.

∑

j,k

aijkλjξk = 0 ∀i.

44

Note that if Au(ν) is bounded in L2, then it is compact in W−1,2 by Rellich’stheorem, and hence in W−1,2

loc . (If u(νk) u in W−1,2, then also ϕu(νk) → ϕu inW−1,2 ∀ ϕ ∈ C∞

c (Ω).)

Proof. We only need to prove the first statement. First note that l ∈ M becausef(u(ν)) is bounded in L1 and so f(u(ν))

∗ l in M.

1. Let v(ν) = u(ν) − u. Then v(ν) satisfies (H) with u = 0, f(u) = 0. If wesucceed to prove the theorem for v(ν), then

0 ≤ lim f(v(ν)) = lim(u(ν) − u)TM(u(ν) − u)

= lim f(u(ν)) + f(u) + lim(u(ν))TMu− lim uTMu(ν)

= lim f(u(ν)) − f(u).

Thuslim f(u(ν)) ≥ f(u).

2. For ϕ ∈ C∞c (Ω) let w(ν) = ϕv(ν). Then

w(ν) 0 in L2,∑

j,k aijk∂w

(ν)j

∂xk→ 0 in W−1,2(Rn), i ∈ 1, ..., q ,

suppw(ν) ⊂ K ⊂⊂ Ω

for some suitable K. To see this, note that

∂w(ν)j

∂xk

= ϕ∂v

(ν)j

∂xk

+∂ϕ

∂xk

v(ν)j

︸︷︷︸bounded in L2

and soAw(ν) = ϕAv(ν) + something bounded in L2.︸︷︷︸

compact in W−1,2

Now w(ν) 0 in L2 implies Aw(ν) 0 in W−1,2. By compactness thisgives

Aw(ν) → 0 in W−1,2.

We need to prove that

lim infν→∞

∫

Rn

(w(ν))TMw(ν) ≥ 0.

If this is done, the proof is finished: ∀ ϕ ∈ D(Ω)

0 ≤ lim infν→∞

∫

Rn

(w(ν))TMw(ν) = lim infν→∞

∫ϕ2

︸︷︷︸∈D

(v(ν))TMv(ν)

︸︷︷︸→l in D′

= l(ϕ2)

and sol ≥ 0 as measures.

45

3. Consider

w(ν)(ξ) = (2π)−n2

∫

Rn

e−ix·ξw(ν) dx.

For every ξ, e−ix·ξ is in L2(K), so

w(ν)(ξ) → 0 ∀ ξ.

Furthermore, w(ν) is bounded in L∞:

|w(ν)(ξ)| ≤ (2π)−n2

∫|w(ν)(x)| dx ≤ C‖w(ν)‖L2 ≤ C.

By Lebesgue’s theorem of dominated convergence∫

Ω

|w(ν)(ξ)|2 → 0 ∀ Ω ⊂⊂ Ω,

i.e.,

w(ν) → 0 strongly in L2loc(Ω). (2.7)

4. Note that∑

j,k aijk

∂w(ν)j

∂xk→ 0 in W−1,2(Ω) iff

∥∥∥∥∥∑

aijk

∂w(ν)j

∂xk

∥∥∥∥∥−1

=

∥∥∥∥∥∥(1 + |ξ|2)− 1

2

∑aijk

∂w

(ν)j (ξ)

∂xk

∥∥∥∥∥∥L2

→ 0

i.e.,

1

(1 + |ξ|2)− 12

∑aijk w

(ν)j (ξ) ξk → 0 in L2(Rn) for every i. (2.8)

5. Extend f(a) = aTMa from Rm to Cm by

f(w) = aTMa.

Then∫

Rn

(w(ν))TMw(ν) = Re

∫

Rm

w(ν)TMw(ν)

= Re

∫

Rm

w(ν)T

Mw(ν).

So we need to prove that

lim infν→∞

∫

Rm

f(w(ν)) dξ ≥ 0 (2.9)

46

ifRef(λ) = Reλ

TMλ ≥ 0 ∀ λ ∈ Λ + iΛ

(because if λ = λ1+iλ2 ∈ Λ+iΛ, then Re(λ1 + iλ2)TM(λ1+iλ2)+λ

T1Mλ1+

λ2Mλ2 ≥ 0 by assumption.)

6. ∀ ε > 0 ∃Cε > 0 such that

Ref(λ) ≥ −ε|λ|2 − Cε

q∑

i=1

∣∣∣∣∣∑

j,k

aijk λjηk

∣∣∣∣∣

2

∀λ ∈ Cm, η ∈ Rm with |η| = 1.

(2.10)

Proof of this inequality:If the statement were wrong, then there existed ε0 > 0 such that for allν ∈ N there are λ(ν) ∈ Cm, η(ν) ∈ Rm with |η(ν)| = 1 ∀ ν such that

Ref(λ(ν)) < −ε0|λ(ν)|2 − ν

q∑

i=1

∣∣∣∣∣∑

j,k

aijk λ(ν)j η

(ν)k

∣∣∣∣∣

2

and – without loss of generality – |λ(ν)| = 1 ∀ ν. Extract subsequences suchthat λ(ν) → λ, η(ν) → η.

Then on the one hand we have

q∑

i=1

∣∣∣∣∣∑

j,k

aijk λjηk

∣∣∣∣∣

2

= 0, whence∑

aijk λjηk = 0 ∀ i

and so λ ∈ Λ + iΛ. On the other hand we obtain

Ref ≤ −ε0|λ|2 = −ε0,

a contradiction.

7. Now we prove (2.9):

∫

Rn

Ref(w(ν)(ξ)) dξ =

∫

|ξ|≤1(· · · ) +

∫

|ξ|>1(· · · ).

As w(ν) → 0 strongly in L2loc(R

n), c.f. (2.7), it follows that

∫

|ξ|≤1Ref(w(ν)(ξ)) dξ → 0 as ν → ∞.

47

Using (2.10) (with η = ξ

|ξ|) the second term can be estimated by

∫

|ξ|>1Ref(w(ν)(ξ)) ≥ −ε

∫

|ξ|>1

∣∣∣w(ν)(ξ)∣∣∣2

dξ − Cε

∫

|ξ|>1

∑

i

∣∣∣∣∣∑

j,k

aijk w(ν)j

ξk|ξ|

∣∣∣∣∣

2

.

Noting that 1t≤ 2

(1+t2)12

for t ≥ 1, by (2.8) we get

lim inf

∫

|ξ|>1Ref(w(ν)(ξ)) dξ ≥ −ε

∫

|ξ|>1|w(ν)(ξ)|2 dξ

≥ −ε‖w(ν)‖L2 ≥ −Cε.

Since ε > 0 was arbitrary, we indeed obtain (2.9).

Remark 2.62 Up to extracting subsequences, the condition that f(u(ν)) con-verges in D′ (even weakly* in M) is always satisfied.

Corollary 2.63 Let E = span Λ, d = dimE. Choose coordinates so thatE = y ∈ Rm : yd+1 = . . . = ym = 0.

If u(ν) ∗ u in L∞(Ω; Rm), the sequence Au(ν) is compact in W−1,2

loc and

f : Rm−d → R is continuous,

thenf(u

(ν)d+1, . . . , u

(ν)m ) → f(ud+1, . . . , um)

strongly in Lp(Ω) ∀ p <∞.

Proof. The mapping

y = (y1, . . . , ym) 7→ y2d+1 + . . .+ y2

m

vanishes on Λ and so

(u(ν)d+1)

2 + . . .+ (u(ν)m )2 → u2

d+1 + . . .+ u2m in D′.

By boundedness in L∞ this convergence holds also w*-L∞. But then

∫(u

(ν)d+1)

2 + . . .+ (u(ν)m )2 →

∫u2

d+1 + . . .+ u2m,

which shows that‖u(ν)

j ‖2L2 → ‖uj‖2

L2 ∀ j ≥ d+ 1

48

and thusu

(ν)j → uj in L2 ∀ j ≥ d+ 1.

Consequently,

f(u(ν)d+1, ..., u

(ν)m ) → f(ud+1, ..., um)

strongly in L1 while being bounded in L∞. Hence,

f(u(ν)d+1, ..., u

(ν)m )

Lp

→ f(ud+1, ..., um) ∀ p <∞.

Exercise: Show that fn → f in L1 and (fn) bounded in Lq implies fn → f in Lp

for all p < q.

Corollary 2.64 (The ‘div-curl lemma’) Suppose v, w ∈ L2(Ω; Rn) are vectorfields such that

div v(ν) =∑n

j=1

∂v(ν)j

∂xjis bounded in L2,

curlw(ν) =

(∂w

(ν)j

∂xk− ∂w

(ν)k

∂xj

)

j,k

is bounded in L2

(or just compact in W−1,2loc (Ω), resp.).

Then v(ν) v, w(ν) w in L2(Ω; Rn) implies

v(ν) · w(ν) D′

→ v · w.

Proof.

Λ =

(λ, µ) : ∃ ξ 6= 0 :

m∑

j=1

λjξj = 0, µjξk − µkξj = 0 ∀ j, k

= (λ, µ) : ∃ ξ 6= 0 : λ · ξ = 0, µ ‖ ξ= (λ, µ) : λ · µ = 0 .

u(ν) = (u(ν)1 , u

(ν)2 ) := (v(ν), w(ν)) satisfies (H) with

f(y1, y2) = y1 · y2

vanishing on Λ. The assertion thus follows.

49

2.10 Young measures

Young measures provide “statistic information” on the value distribution of se-quences (u(ν)).

Theorem 2.65 Let K ⊂ Rm be bounded, Ω ⊂ Rn a bounded domain, f : Rm →R continuous. Suppose u(ν) ∈ L∞(Ω; Rm) is such that u(ν) ∈ K almost every-where. Then there exists a subsequence of probability measures (νx)x∈Ω such that

suppνx ⊂ K

and

f(u(ν))∗ f in L∞, f(x) :=

∫

Rm

f(y) dνx(y).

Remark 2.66 Young measures provide explicit limits for all nonlinearly trans-formed sequences (f(u(ν)))ν∈N!

We will construct Yound measures by recourse to the following measure the-oretic lemma, which will be stated without proof.

Theorem 2.67 (Disintegration) Let E ⊂ Rn, F ⊂ Rk be open sets, ν anRm-valued Radon measure on E × F . Let µ be the projection

µ(A) = |ν|(A× F )

of |ν| onto the first factor and assume that µ is a Radon measure, i.e. µ(K) <∞for compact K ⊂ E.Then there exist Rm-valued finite Radon measures νx such that x 7→ νx is µ-measurable,

|νx|(F ) = 1 for µ- a.e. x

and

f(x, ·) ∈ L1(F, |νx|) for µ-a.e. x,

x 7→∫

F

f(x, y) dνx(y) ∈ L1(E, µ)

and we have∫

E×F

f(x, y) dνx(y) =

∫

E

(∫

F

f(x, y) dνx(y)

)dµ(x).

Notation: ν = µ⊗ νx.

Example: Let dν = g(x, y) dxdy, so that dµ =∫

Fg(x, y) dydx and suppose g ∈

L1(E × F ) with g > 0. Then νx with

dνx =g(x, y) dy∫Fg(x, y) dy

50

does the job:

∫

E

(∫

F

f(x, y)g(x, y)∫

Fg(x, y) dy

dy

)∫

F

g(x, y) dy dx

=

∫

E×F

f(x, y)g(x, y) dy dx.

Proof of Theorem 2.65. To a measurable function v : Ω → K one associates anon-negative measure ν ∈ M(Ω × Rm) via

〈ν, ϕ(x, y)〉M,C0=

∫ϕ(x, v(x)) dx ∀ ϕ ∈ C0(Ω × Rm).

(i.e., “ν = Ln × δv(x)”). In particular, for v = u(j) we have

⟨ν(j), ϕ

⟩M,C0

=

∫ϕ(x, u(j)(x)) dx.

This defines a bounded (by |Ω|) sequence of Radon measures on Ω × Rm fromwhich we may extract a convergent subsequence (not relabeled)

ν(j) ∗ ν in M(Ω × Rm).

The limiting measure ν satisfies

1. ν ≥ 0,

2. supp ν ⊂ Ω ×K and

3. projΩ ν = Ln,

because for ϕ ∈ C0(Ω × Rm):

1. ϕ ≥ 0 implies 〈ν, ϕ〉 = limj

⟨ν(j), u

⟩= limj

∫ϕ(x, u(j)(x)) dx ≥ 0.

2. ∀ ϕ such that ϕ ≡ 0 on Ω ×K:

〈ν, ϕ〉 = limj

∫ϕ(x, u(j)(x)) dx = 0.

3. ϕ(x, y) = ψ(x) implies

〈ν, ϕ〉 = limj

∫ψ(x) =

∫ψ(x) dx.

51

By the disintegration theorem 2.67 there exists a family (νx) such that |νx| = 1and

ν = Ln ⊗ νx.

So for all χ ∈ C0(Ω), f ∈ C0(Rm):

limj→∞

∫

Ω

χ(x)f(u(j)(x)) dx = limj

∫

Ω×Rm

χ(x)f(y) dν(j)(x, y)

=

∫

Ω×Rm

χ(x)f(y) dν(x, y)

=

∫

Ω

χ(x)

∫

Rm

f(y) dνx(y)

︸︷︷︸=f(x)

dx

This proves

f(u(j))M∗

∫

Rm

f(y) dνx(y) = f .

By boundedness of f(u(j)) and density of C0 in L1, the claim now follows. (Notethat w.l.o.g. we may assume that f ∈ C0 as the range of values of u(j) is bounded.)

Corollary 2.68 Let u(j) u in L∞. Then u(j) → u in Lp (∀ p <∞) iff each νx

is the Dirac-measure δu(x).(Equivalently: u(j) u in L∞ and u(j) → u pointwise a.e.)

Proof. ”⇒”: If u(j) → u strongly Lp (any p), then by the theorem

f(u(x)) = limjf(u(j)(x)) =

∫f(y) dνx(y)

∀ f continuous. It follows that νx = δu(x).”⇐”: Let νx = δu(x) ∀x. By the theorem,

(u(j))2 ∗

∫y2 dνx(y) =

∫y2 dδu(x)(y) = (u(x))2

Since also uj ∗ u, it follows that u(j) → u L2. By boundedness in L∞ this implies

u(j) → u Lp ∀ p <∞.

Examples:

1. u(j) ∗ u L∞, f(x) = x gives

u(x) =

∫y dνx(y).

52

2. u periodic with unit cell [0, 1]n, u(j)(x) = u(jx)

(so that u(j) ∗ u ≡ −

∫[0,1]

u(y) dy), then

f(u)∗ −∫

[0,1]

f(u(x)) dx =

∫f(y) dν(y),

where ν (independent of x) is the image measure of the uniform distributionunder u.

Remark 2.69 1. One can show that any measurably parameterized measure(νx) arises as the Young measure of a suitable sequence (u(j)).

2. Uniqueness: Suppose, (νx), (νx) are two Young measures associated to(u(j)). Then for all continuous f :

∫f(y) dνx(y) dx = w∗- lim

jf(u(j)) =

∫f(y) dνx dx for a.e. x

and thusνx = νx for a.e. x.

53

Chapter 3

Selected Applications

3.1 Conservation laws

Our aim in this section is to study equations of the form

∂∂tu+ divF (u) = 0 in Rn × (0,∞),

u = g on Rn × t = 0 , (3.1)

where u : Rn × [0,∞) → Rm is to be found and F : Rm → Rm×n is given: a“system of conservation laws”.

Motivation: In physical applications, u describes some quantity whose rate ofchange

d

dt

∫

V

u dx

within some test volume V is given by

−∫

∂V

F (u) ν dS,

where F : Rm → Rm×n describes the flux F (u)ν through the boundary of V .Equating these terms gives

∫

V

ut dx = −∫

∂V

F (u)ν dS = −∫

V

divF (u) dx.

V being arbitrary shows that indeed

∂

∂tu+ divF (u) = 0.

Systems of conservation laws are very difficult to handle. We will therefore oftenreduce to one space dimension:

∂tu+ ∂xF (u) = 0 in R × (0,∞),

u = g on R × 0 .(3.2)

54

Examples:

1. The p-system: ∂tu1 − ∂xu2 = 0,

∂tu2 − ∂xp(u1) = 0,

p a given function. Here, F (y) =

(c − y2

−p(y1)

). This system arises in the study

of nonlinear wave equation

∂ttu− ∂x(p(∂xu)) = 0,

when u1 = ∂tu, u2 = ∂xu.

2. Euler’s equations for compressible gas flow:Let

• ρ = mass density

• v = velocity

• E = e+ 12v2 energy per unit mass, where e is the ”internal energy”,

• p = p(ρ, e) pressure.

The last equation p = p(e, v) is a “constitutive equation”: p is assumed tobe a known function, which models the material specific properties.

Euler’s equations (in the variables: u = (u1, u2, u3) = (ρ, ρv, ρE)) are

∂tρ+ ∂x(ρv) = 0 (conservation of mass),

∂t(ρv) + ∂x(ρv2 + p) = 0 (conservation of momentum),

∂t(ρE) + ∂x(ρEv + pv) = 0 (conservation of energy).

For sufficiently small time intervals, it is not hard to prove that a singleconservation law has a classical solution. (One uses the method of characteristicsto construct it - cf. [Sch 10].) But already for quite simple PDEs (e.g. Burger’sequation) such a solution does not exist for all times.

Recall: For a quasilinear equation of first order

a(x, u) · ∇u = b(x, u)

the characteristic equations are

dxj

dt= aj(x, y), j = 1, ..., n,

dy

dx= b(x, y).

55

In our case xn+1 = t, b ≡ 0, an+1 = 1, (a1, ...an) = ∇F .Since dxn+1

dt= 1, xn+1 = 0 is xn+1 ≡ t. Also y ≡ y(0). The solution is given

byy(t) = u(x(t), t)

and in particular u is constant along characteristics.

Example: Burgers equation ut + uux = 0 (i.e.: a = (y, 1)). For initial valuesgiven, e.g., by

u(x, 0) =

1, x < 0

1 − x, 0 ≤ x ≤ 1

0, x ≥ 1

there are charactersitic curves t 7→ x(t) that cross for times t > 0 (cf. Fig. 3.1)and the solution is not defined unambiguously any longer. We therefore need aweaker notion of solution. 3.1.

Figure 3.1: Crossing Characteristics.

Motivated by our earlier studies of weak solution, for a solution u and a testfunction v ∈ C∞

c (Rn × [0,∞); Rm) we compute

0 =

∫ ∞

0

∫ ∞

−∞(∂tu+ divxF (u)) · v dx dt

= −∫ ∞

0

∫ ∞

−∞u · ∂tv + F (u) : Dxv dx dt−

∫ ∞

−∞u(x, 0)︸︷︷︸=g(x)

·v(x, 0) dx

and make the following

Definition 3.1 u ∈ L∞(Rn × (0,∞); Rm) is called and integral solution of (3.1)if

∫ ∞

0

∫ ∞

−∞u · ∂tv + F (u) : Dxv dx dt+

∫ ∞

−∞g(x) v(x, 0) dx = 0

for all v ∈ C∞c (Rn × [0,∞); Rm).

56

Exercise: Prove that any smooth integral solution is a classical solution of (3.1).

We consider the special situation in 1 + 1 dimensions, where where u is asmooth solution on R × [0,∞)\C for a (smooth) curve C. Along C u is supposedto hava a single jump discontiuity. Let V ⊂ R × (0,∞) be open and such that Cbisects V into two parts V1 (left of C) and V2 (to the right of C): V = (C ∩ V )

·∪V1

·∪ V2.

Figure 3.2: Test volume around a shock.

For v ∈ C∞c (V ) the integral solution u satisfies

0 =

∫ ∞

0

∫ ∞

−∞u · ∂tv + F (u) · ∂xv dx dt

=

∫

V1

u · ∂tv + F (u) · ∂xv dx dt+

∫

V2

u · ∂tv + F (u) · ∂xv dx dt

Noting that v vanishes on ∂V , partial integration gives

∫

V1

u · ∂tv + F (u) · ∂xv = −∫

V1

(∂tu+ ∂xF (u)) · v +

∫

C

(u(V1)ν2 + F (u(V1))ν1

)· v.

Here u(V1) denotes the trace of u|V1 on C, ν = (ν1, ν2) is the outer normal of V1

along C. As ∂tu+ ∂xF (u) = 0 within V1 (cf. the above exercise), we obtain

∫

V1

u · ∂tv + F (u) · ∂xv =

∫

C

(u(V1)ν2 + F (u(V1))ν1

)· v.

Similarly,

∫

V2

u · ∂tv + F (u) · ∂xv = −∫

C

(u(V2)ν2 + F (u(V2))ν1

)· v.

57

Since v was arbitrary, this shows that

(F (u(ν1)) − F (u(ν2))

)ν1 +

(u(ν1) − u(ν2)

)ν2 = 0 on C.

In particular, if C is given by (x, t) : x = s(t), then ν = ± 1√1+s2

(1−s

). So

(choose ”+”, i.e. V1 left, V2 right)

F (u(V1)) − F (u(V2)) = s(u(V1) − u(V2)

).

One often writes [[F (u)]] = F (u(V1) − u(V2)), [[u]] = u(V1) − u(V2). Also set σ = s.We arrive at the so-calles Rankine-Hugoniot jump condition:

[[F (u)]] = σ[[u]]

Example: Recall our example of the Burgers equation

∂tu+ u ∂xu = 0

(where F (y) =

1

2y2

)

with initial condition

u(x, 0) = g(x) =

1, x < 0,

1 − x, 0 ≤ x ≤ 1,

0, x ≥ 1.

A possible solution for t ≥ 1 is given by a shock

u(x, t) =

1, x < s(t),

0, x > s(t),

(s

1

)some curve with s(1) = 1.

If u is an integral solution, the Rankine-Hugoniot condition requires that

[[F (u)]] = s[[u]] along C⇔ F (1) − F (0) = s(1 − 0) ∀ t ≥ 1

⇔ 1

2= s(t) ∀ t ≥ 1

So

(s

1

)must be the straight line given by

s(t) = 1 +1

2(t− 1) =

1

2(t+ 1),

58

Figure 3.3: Shock solution of the Burgers equation.

see Fig. 3.3 A draw-back of the notion of “integral solutions” is that such asolution need not be unique.

Example: Consider the initial value problem∂tu+ u ∂xu = 0 in R × (0,∞),

u = g on R × t = 0 ,

where

g(x) =

0 if x < 0,

1 if x > 0.

A possible solution is given by the shock

u(x, t) =

0 if x < t

2,

1 if x > t2.

But also the so-called “rarefaction wave”

u(x, t) =

0 if x ≤ 0,xt

if 0 < x < t,

1 if x > t.

is a possible solution, see Fig. 3.4

Exercise: Prove that both u and u are integral solutions of the Burgers equation.

We need additional (physically motivated) criteria to single out the relevantsolution! Let us begin with the following observation: If u is a smooth solutionand Φ,Ψ : Rm → R satisfy the relation

DΦ(y)DF (y) = DΨ(y) ∀ y ∈ Rm

59

Figure 3.4: A “rarefaction wave”.

then

∂tΦ(u) = DΦ(u) ∂tu

= −DΦ(u)DF (u) ∂xu (u is a solution)

= −DΨ(u) ∂xu

= − div Ψ(u).

That is, Φ(u) satisfies a scalar conservation law with flux Ψ(u):

∂tΦ(u) + div Ψ(u) = 0.

Now for a non-smooth u, we cannot expect this to be true in general.The guiding idea for a selection criterion is that information (about the solu-

tion) is transported along characteristics of the equation, i.e., along the charac-

teristic curves

(x(t)

t

), where

x = x(t) = F ′(y(t)) = F ′(y(0)).

Such information is lost in shocks and characteristic curves whose origin lies in ashock will in general not carry physically relevant bits of information.

As a paradigm we consider a shock in a single equation (i.e., m = 1) alonga curve C, parametrized by x = s(t). Suppose for simplicity that F is convex.

Then F ′ is increasing and in order that the characteristic curves hit at C we musthave u1 > u2. Suppose Φ,Ψ satisfy Φ′F ′ = Ψ′.

Claim: We claim that

Ψ(u1) − Ψ(u2) ≥F (u1) − F (u2)

u1 − u2

(Φ(u1) − Φ(u2))

whenever Φ is convex.

60

Figure 3.5: Characteristics running into a shock.

Proof. By shifting and adding constants we may without loss of generality assumethat u2 = 0, u1 = u ≥ 0 and F (0) = Φ(0) = Ψ(0) = 0. We need to show that

uΨ(u) ≥ F (u)Φ(u) for u ≥ 0.

As both sides are equal to zero for u = 0, this is implied by

Ψ(u) + uΨ′(u) ≥ F ′(u)Φ(u) + F (u)Φ′(u), u ≥ 0.

Again both sides are equal to zero for u = 0, so this equation holds true if

2Ψ′(u) + uΨ′′(u) ≥ F ′′(u)Φ(u) + 2F ′(u)Φ′(u) + F (u)Φ′′(u), u ≥ 0.

Now using that Ψ′ = F ′Φ′ and hence also Ψ′′ = F ′′Φ′ + F ′Φ′′, we arrive at theequivalent inequalities

u(F ′′(u)Φ′(u) + F ′(u)Φ′′(u)) ≥ F ′′(u)Φ(u) + F (u)Φ′′(u)

⇔ F ′′(u)(uΦ′(u) − Φ(u)) + Φ′′(uF ′(u) − F (u)) ≥ 0

As F ′′(u),Φ′′(u) ≥ 0, this inequality is implied by the inequalities

uΦ′(u) − Φ(u) = 0 and uF ′(u) − F (u) ≥ 0,

which in turn follow from the convexity of Φ and F : They are true for u = 0 andwith f ∈ Φ, F:

(uf ′ − f)′ = f ′ + uf ′′ − f ′ = uf ′′ ≥ 0.

Also observe that in general equality does not hold true: For F (y) = 12y2,Φ(y) =

12y2 we have Ψ(y) = 1

3y3 and (with u2 = 0, u = u1 > 0)

uΨ(u) =1

3u4 >

1

4u4 =

1

2u2 · 1

2u2 = F (u)Φ(u)

61

Summarizing our discussion, we have found that if Φ is convex and Ψ′ = F ′Φ′

then

∂tΦ(u) + ∂xΨ(u) = 0 (3.3)

in regions where u is smooth and

[[Ψ(u)]] ≥ [[F (u)]]

[[u]][[Φ(u)]]

along a jump discontinuity curve C. By the Rankine-Hugoniot condition [[F (u)]][[u]]

=σ, this inequality can be written as

[[Ψ(u)]] ≥ σ[[Φ(u)]]. (3.4)

Now a calculation similar to the derivation of the Rankine-Hugoniot conditionshows that conditions (3.3) and (3.4) are implied by the inequality

∂tΦ(u) + ∂xΨ(u) ≤ 0.

(In the distributional sense, i.e., after testing with arbitrary v ∈ C∞c such that

v ≥ 0.)

Remark 3.2 In physical applications, −Φ can be an entropy as, e.g., for a massdensity u = ρ with

−Φ(ρ) = −ρ log ρ.

(Indeed, (−Φ(ρ))′′ = (− log ρ− 1)′ = −1ρ≤ 0.)

This motivates the following definition:

Definition 3.3 1. (Φ,Ψ) is called an entropy/entropy-flux pair if Φ is convexand DΨ = DF DΦ.

2. An integral solution u of (3.2) is called an entropy solution if

∂tΦ(u) + ∂xΨ(u) ≤ 0

holds true for every entropy/entropy-flux pair (Φ,Ψ).

This negativity condition is to be understood in the distributional sense, i.e.

∫ ∞

0

∫ ∞

−∞Φ(u) ∂tv + Ψ(u)∂xv dxdt ≥ 0 ∀ v ∈ C∞

c (R × (0,∞)), v ≥ 0.

62

We will try to find solutions by first looking at the “singularly perturbed”system

∂tuε + ∂xF (uε) − ε∂xxu

ε = 0 in R × (0,∞)

uε = g on R × 0.Physically the term ε∂xxu

ε takes account of (small) viscosity effects. Mathemat-ically it regularizes the solutions.

The scalar equation in one space dimension:We now restrict out attention to the special case of a scalar conservation law

in 1 + 1 dimensions. Note that then for any convex Φ there exists a Ψ such that

Φ′(y)F ′(y) = Ψ′(y) ∀ y ∈ R,

so that (Φ,Ψ) is an entropy/entropy-flux pair.We follow the approach by L. Tartar. The main ingredient into the existence

proof in the following theorem:

Theorem 3.4 Let Ω ⊂ R2 be open and bounded, F : R → R in C1. Suppose(uε) ⊂ L∞(Ω) satisfies

uε ∗ u in L∞

and that for every entropy/entropy-flux pair (Φ,Ψ) there is a compact subset ofW−1,2

loc containing∂tΦ(uε) + ∂xΨ(uε)

for every ε > 0. Then

F (uε)∗ F (u) in L∞,

F ′(uε) → F ′(u) in Lp ∀ p <∞

and moreover, if there is no interval on which F is affine, then even

F (uε)∗ F (u) in Lp ∀ p <∞.

Proof. Fix an entropy/entropy-flux pair (Φ,Ψ) and consider the bounded se-quence

(uε, vε, wε, zε) := (uε, F (uε),Φ(uε),Ψ(uε)) ∈ L∞(Ω; R4).

Passing to a subsequence we have

(uε, vε, wε, zε)∗ (u, v, w, z) in L∞(Ω; R4)

for suitable u, v, w, z ∈ L∞(Ω).

In order to prove the first assertion, namely, F (uε)∗ F (u), it thus remains

to be seen that v = F (u) a.e.

63

By assumption on the entropy/entropy flux pairs (id, F ) and (Φ,Ψ) we havethat both

∂tuε + ∂xv

ε and ∂twε + ∂xz

ε

lie in a compact subset of W−1,2(Ω). We also observe that the correspondingsingular cone Λ ⊂ R4 is given by

Λ = (λ1, λ2, λ3, λ4) : ∃ ξ ∈ R2 \ 0 s.t. ξ2λ1 + ξ1λ2 = 0 = ξ2λ3 + ξ1λ4= (λ1, λ2, λ3, λ4) : (λ1, λ2) ‖ (λ3, λ4)= (λ1, λ2, λ3, λ4) : λ1λ4 − λ2λ3 = 0.

By Theorem 2.61 on compensated compactness we thus obtain

uεzε − vεwε ∗ uz − vw,

i.e.,

uεΨ(uε) − F (uε)Φ(uε)∗ uz − vw. (3.5)

After passing, if necessary, to a further subsequence we may assume that (uε)generates a Young measure (ν(x,t))(x,t)∈Ω. In terms of this Young measure weobtain the following representations for a.e. (x, t):

u(x, t) =

∫y dν(x,t),

v(x, t) =

∫F (y) dν(x,t),

w(x, t) =

∫Φ(y) dν(x,t) and

z(x, t) =

∫Ψ(y) dν(x,t).

Similarly for the weak limit uz− vw from (3.5) of uεΨ(uε)−F (uε)Φ(uε) one gets∫yΨ(y) − F (y) Φ(y) dν(x,t) = uz − vw

= u(x, t)

∫Ψ(y) dν(x,t) − v(x, t)

∫

Ω

Φ(y) dν(x,t)

and hence∫

(y − u(x, t))Ψ(y) − (F (y) − v(x, t))Φ(y) dν(x,t) = 0. (3.6)

In fact for a.e. (x, t) the above equations hold true for every entropy/entropy-flux pair (Φ,Ψ). To see this, it suffices to observe that there is a compact interval

64

[−R,R] such that supp ν(x,t) ⊂ [−R,R] for a.e. (x, t) and that every convexfunction Φ can be approximated on [−R,R] by a sequence of convex functionsΦn whose graph is a piecewise affine interpolation of rational points in such away that also the associated entropy-flux functions Ψn converge to Ψ uniformly.

For fixed (x, t) such that all of the previous a.e.-equations are satisfied at (x, t)we now choose

Φ(y) = |y − u(x, t)|and, correspondingly,

Ψ(y) =

F (u(x, t)) − F (y) for y ≤ u(x, t),

F (y) − F (u(x, t)) for y ≥ u(x, t).

Then

(y − u(x, t))Ψ(y) − (F (y)− v(x, t))Φ(y)

= |y − u(x, t)|(F (y) − F (u(x, t))) − (F (y) − v(x, t))|y − u(x, t)|= (v(x, t) − F (u(x, t)))|y − u(x, t)|.

It follows from (3.6) that

(v(x, t) − F (u(x, t)))

∫|y − u(x, t)| dν(x,t) = 0.

Now if∫|y−u(x, t)| dν(x,t) 6= 0, then we immediately obtain v(x, t) = F (u(x, t)).

But also from∫|y − u(x, t)| dν(x,t) 6= 0 we can deduce that ν(x,t) = δu(x,t) and so

v(x, t) =

∫F (y) dν(x,t) = F (u(x, t)).

This completes the proof of F (uε)∗ F (u). (Note that by identifying the limit

F (u) uniquely we could indeed pass to subsequences w.l.o.g.)In order to prove the remaining assertions of the theorem we first note that

it suffices to show that, for a.e. (x, t), supp ν(x,t) is contained in an interval whereF is affine:

• Then F ′(uε) generates the Young measure (F ′(ν(x,t)))(x,t)∈Ω consisting ofthe image measures of ν(x,t) under F ′ which are all Dirac measures. ByCorollary 2.68 we therefore have

F ′(uε) →∫F ′(y) dν(x,t) = F ′

(∫y dν(x,t)

)= F ′(u)

in Lp for all p <∞.

65

• If there is no interval at all on which F is affine, then all the ν(x,t) themselvesare Dirac measures and analogously we see that then

uε → u

in Lp for all p <∞.

Shifting u and F by a constant, we may w.l.o.g. assume that u(x, t) =F (u(x, t)) = 0, so that

∫y dν(x,t) = 0 and

∫F (y) dν(x,t) = 0. (3.7)

Let [α, β] be the closed convex hull of supp ν(x,t). By the first equation in (3.7)we have α ≤ 0 ≤ β and, if α (or β) is zero, then also β (resp. α) is zero and theclaim holds true. So let us assume now that α < 0 < β.

For every y ∈ R we define

g(y) :=

∫ y

α

z dν(x,t)(z), h(y) :=

∫ y

α

F (z) dν(x,t)(z).

Then g(y) = h(y) = 0 for y /∈ [α, β] by (3.7). From∫ β

αy dν(x,t) = 0 and the

construction of α and β we furthermore deduce that g(y) < 0 in (α, β). Notethat the distributional derivatives of g and h are given by the measures g′ and h′

withdg′(y) = y dν(x,t)(y) resp. dh′(y) = F (y) dν(x,t)(y),

so that, for C1-Funktions ϕ,

∫ϕ(y) dg′(y) = −

∫ϕ′(y) y dν(x,t)(y) and

∫ϕ(y) dh′(y) = −

∫ϕ′(y)F (y) dν(x,t)(y).

Exercise: Prove this!

From (3.6) and (3.7) we get

∫yΨ(y) − F (y) Φ(y) dν(x,t) = 0.

A standard approximation argument (approximate Φ and Ψ with C1-Functionsuniformly) shows that this equation can be rewritten in terms of g′ and h′ as

−∫g(y) Ψ′(y) dy +

∫h(y) Φ′(y) dy = 0.

66

Using that Ψ′ = F ′Φ′ we arrive at∫

(h(y) − g(y)F ′(y))Φ′(y) dy = 0.

Since this equation holds true for any convex Φ and thus any increasing Φ′, bylinearity it also holds true for differences of increasing functions and hence for anysmooth function. (Every w ∈ C1 can be written as w(y) = w(0)+

∫ y

0(w′)+(s) ds−∫ y

0(w′)−(s).) But then we must have

h− g F ′ = 0.

Now this proves that

(F (y) g(y)− y h(y))′ = F (y) g′(y) − y h′(y) + g(y)F ′(y) − h(y)

= F (y) g′(y) − y h′(y) = 0,

where the last equality followed from

d(F (y) g′(y) − y h′(y)) = F (y) y dν(x,t) − y F (y) dν(x,t) = 0.

Since g and h vanish outside [α, β] we deduce

F (y) g(y)− y h(y) = 0.

Recalling that g < 0 on (α, β) and h− gF ′ = 0, i.e., hg

= F ′, we finally obtainthat

F (y) − y F ′(y) = 0 ∀ y ∈ (α, β).

The only solutions of this differential equation are given by

F (y) = cy ∀ y ∈ (α, β)

for some constant c, which was to be proved.

In order to apply this theorem to conservation laws we will also need thefollowing lemma.

Lemma 3.5 If E1 is compact in W−1,2(Ω), E2 bounded in M(Ω) and E3 boundedin W−1,∞(Ω), then

(E1 + E2) ∩ E3 is precompact in W−1,2loc (Ω).

Proof. Let (g(ν)) be a sequence in E := (E1 +E2) ∩E3. W.l.o.g. we may assumethat supp g(ν) ⊂ Ω for some smoothly bounded subdomain Ω ⊂ Ω.

Write g(ν) = g(ν)1 + g

(ν)2 with g

(ν)i ∈ Ei (i = 1, 2) and let v

(ν)i be the solution of

−∆v(ν)i = g

(ν)i in Ω, v(ν) = 0 on ∂Ω.

67

Now note that the Laplace operator −∆ with Dirichlet boundary conditionsinduces an isomorphism between W 1,2

0 (Ω) and W−1,2(Ω). Without proof we willuse the fact that this is indeed true for the spaces W 1,p

0 (Ω) and W−1,p(Ω) for anyp ∈ (1,∞).

It follows that (v(ν)1 ) is compact in W 1,2

0 (Ω). As for v(ν)2 , we note that M(Ω)

embeds compactly into W−1,p(Ω) for p < nn−1

by Theorem 2.32 and so (v(ν)2 )

is precompact in W 1,p(Ω). So v(ν) := v(ν)1 + v

(ν)2 is precompact in W 1,p(Ω) for

p < nn−1

.But for any q <∞ we have

−∆v(ν) = g(ν)1 + g

(ν)2 = g(ν) in Ω, v(ν) = v

(ν)1 + v

(ν)2 = 0 on ∂Ω,

whence v(ν) is bounded in W 1,q(Ω). For q > 2 this boundedness together withprecompactness in W 1,p(Ω) for some p ≥ 1 proves that (v(ν)) is precompact inW 1,2(Ω) and so (g(ν)) is precompact in W−1,2(Ω).

Now consider the approximating equations

∂tuε + ∂xF (uε) − ε∂xxu

ε = 0 in R × (0,∞), uε(x, 0) = g(x). (3.8)

Lemma 3.6 There exist classical solutions uε of (3.8) such that both uε andε∂xu

ε are bounded in L∞.

We will not prove this lemma. But note that it suffices to provide a solutionu for ε = 1 and initial conditions u(x, 0) = g(εx) and then set uε(x) := u(x

ε, t

ε).

Lemma 3.7 If uε is a solution of (3.8) as given by Lemma 3.6, then√ε∂xu

ε isbounded in L2(Ω) for each Ω ⊂ R × [0,∞) bounded.

Proof. Let Ω ⊂ R×[0,∞) be bounded and, w.l.o.g., of the form Ω = (a, b)×[0, T ).By testing (3.8) with uε itself we obtain

ε

∫

Ω

uεxx u

ε dx dt =

∫

Ω

uεt u

ε + F (uε)x uε dx dt

and so, after partial integrations,

ε

∫ T

0

[uεx u

ε]x=b

x=a dt− ε

∫ T

0

∫ b

a

uεx u

εx dx dt

=1

2

∫ T

0

∂t

∫ b

a

(uε)2 dx dt+

∫ T

0

[F (uε) uε]x=bx=a dt−

∫ T

0

∫ b

a

F (uε) uεx dx dt.

Now since uε and εuεx are uniformly bounded, the first term on the left hand

side and the second term on the right hand side of this equation are bounded

68

independently of ε. The first term on the right hand side is bounded, too, becauseit is equal to

1

2

∫ b

a

[(uε)2

]t=T

t=0dx.

But also the last term is bounded as we can choose G(y) :=∫ y

0F (s) ds and write

∫ T

0

∫ b

a

F (uε) uεx dx dt =

∫ T

0

∫ b

a

G(uε)x dx dt =

∫ T

0

[G(uε)]x=b

x=a dt.

In summary, it follows that

‖√εuε

x‖2L2(Ω) =

∣∣∣∣ε∫ T

0

∫ b

a

|uεx|2 dx dt

∣∣∣∣ ≤ C.

Theorem 3.8 Suppose F : R → R is smooth, g ∈ C1 ∩ L1 ∩ L∞. Then

ut + F (u)x = 0 in R × (0,∞), u(x, 0) = g(x)

has an integral solution. If there is no interval on which F is affine, then thereeven exists an entropy solution.

Proof. Let (uε) be a sequence of solutions of (3.8) as given by Lemma 3.6. W.l.o.g.we have

uε ∗ u

for some u ∈ L∞. Let Φ be convex and choose Ψ such that Ψ′ = F ′Φ′. LetΩ ⊂ R × [0,∞) be bounded.

Claim: Φ(uε)t + Ψ(uε)x lies in a compact subset of W−1,2loc (Ω).

In order to prove this claim, we first note that w.l.o.g. we may assume that Φand hence Ψ is smooth: By mollification one can approximate Φ by convex smoothfunctions uniformly such that also Φ′ is approximated boundedly in measure.

Now calculate

Φ(uε)t + Ψ(uε)x = Φ′(uε) uεt + Ψ′(uε) uε

x

= Φ′(uε) (εuεxx − F (uε)x) + Φ′(uε)F ′(uε) uε

x

= εΦ′(uε) uεxx

= εΦ(uε)xx − εΦ′′(uε) (uεx)

2.

Clearly, Φ(uε)t + Ψ(uε)x is bounded in W−1,∞ because Φ(uε) and Ψ(uε) arebounded in L∞. Also, Φ′′(uε) is bounded in L∞ and ε(uε

x)2 is bounded in L1

by Lemma 3.7 and so εΦ′′(uε)(uεx)

2 is bounded in M(Ω). Finally, εΦ(uε)xx is

69

contained in a compact subset of W−1,2(Ω). In fact, it even is a null-sequence inthat space: Viewing the smooth function εΦ(uε)xx ∈ W−1,2(Ω) as a distributionextended to a linear functional on W 1,2

0 which acts on test functions ϕ (and henceall ϕ ∈W 1,2

0 ) through ϕ 7→∫εΦ(uε)xx ϕ, we can estimate

‖εΦ(uε)xx‖W−1,2 = sup

∫εΦ(uε)xx ϕ : ϕ ∈W 1,2

0 , ‖ϕ‖W

1,20

= 1

= sup

ε

∫Φ(uε)x ϕx : ‖ϕ‖

W1,20

= 1

≤ sup√

ε ‖Φ′(uε)‖L∞‖√εuε

x‖L2‖ϕx‖L2 : ‖ϕ‖W1,20

= 1

≤ C√ε,

where we have used Φ(uε)x = Φ′(uε) uεx in the second step and Lemma 3.7 in the

fourth step. Lemma 3.5 now implies the claim.

uε being a solution of (3.8) we have∫ ∞

0

∫ ∞

−∞uε ϕt + F (uε)ϕx + εuεϕxx dx dt+

∫ ∞

−∞g(x)ϕ(x, 0) dx = 0

for any ϕ ∈ C∞c (R × [0,∞)). As a consequence of the above claim we can

now deduce from Theorem 3.4 that F (uε)∗ F (u). With Ω bounded such that

suppϕ ⊂ Ω we finally get that indeed∫ ∞

0

∫ ∞

−∞uϕt + F (u)ϕx dx dt+

∫ ∞

−∞g(x)ϕ(x, 0) dx = 0,

which shows that u is an integral solution.If there is no interval on which F is affine, then also uε → u strongly in any

Lp, p <∞. But then∫ ∞

0

∫ ∞

−∞Φ(u)ϕt + Ψ(u)ϕx dx dt = lim

ε→0

∫ ∞

0

∫ ∞

−∞Φ(uε)ϕt + Ψ(uε)ϕx dx dt ≥ 0.

Remark 3.9 Other methods show that in fact there always exists an entropy so-lution. The proof by compensated compactness, however, appears most amenableto systems of conservation laws. But little is known rigorously for such systems.

We conclude the section by making the remark that the entropy conditiondoes indeed single out a unique integral solution:

Theorem 3.10 There exists at most one entropy solution of (3.2).

A proof of this statement can be found, e.g., in [Ev 98]. (XXX)

70

3.2 A Convergence result for quasilinear elliptic

systems

In this section we will consider an elliptic system of PDEs:

− div(E(Du)) = 0 in Ω. (3.9)

Here E : Rm×n → Rm×n is a smooth function satisfying the growth estimate

|E(X)| ≤ C(1 + |X|) ∀X ∈ Rm×n,

Ω is a bounded domain in Rn and u : Ω → Rm is to be found.Equation (3.9) is elliptic in the sense that the linearization DE(X) is assumed

to satisfy the strict Legendre-Hadamard condition

(η ⊗ ξ) : DE(X)(η ⊗ ξ) ≥ γ|η|2|ξ|2 (3.10)

for some γ > 0 and all X ∈ Rm×n, η ∈ Rm and ξ ∈ Rn. Note that in the varia-tional case, when E = DF for some F , this amounts to strict rank-1-convexityof F .

Suppose now we are given a sequence (u(ν)) ⊂ W 1,2(Ω; Rm) of weak solutionsof (3.9). E.g., the u(ν) may have been obtained by solving (3.9) with respect todifferent boundary data. Assume that (maybe after the extraction of a subse-quence) u(ν) u in W 1,2. Then the natural question arises if we can guaranteethat also u is a weak solution of (3.9).

Of course, a weak solution of (3.9) is a function v ∈ W 1,2(Ω; Rm) such thatfor all ϕ ∈W 1,2

0 (Ω; Rm) (or, equivalently, ϕ ∈ C∞c (Ω; Rm))

∫

Ω

E(Dv(x)) : Dϕ(x) dx = 0.

The difficulty in this undertaking is – as always – that (3.9) is a nonlinearsystem of PDEs and it is not clear if one may pass to weak limits in the aboveformula. Under an additional technical assumption on the smallness of the oscil-lation within Du(ν), however, we indeed obtain the following positive result:

Theorem 3.11 There exists a number ε0 > 0 such that, if

supν

oscΩDu(ν) ≤ ε0,

then u is a weak solution of (3.9).

Here, for a function f , oscΩ f = supx,y∈Ω |f(x) − f(y)|.Note that a localization argument shows that it would be sufficient to assume

that there exist a δ > 0 such that supν oscB∩ΩDu(ν) ≤ ε0 for every ball B of

radius at most δ. This also follows immediately from the following proof.As a preparation we slightly generalize our compensated compactness result

in Theorem 2.61 first. Let A be a PDO of first order with singular cone Λ as inTheorem 2.61.

71

Theorem 3.12 Let M ∈ L∞(Ω; Rm×msym ) and set

f(x, a) = aTM(x)a for x ∈ Ω, a ∈ Rm.

Suppose u(ν) u in L2, f(u(ν)) → l in D′ and Au(ν) is compact in W−1,2loc (Ω; Rm).

If f(x, λ) ≥ 0 for all λ ∈ Λ and almost every x ∈ Ω, then l ≥ f(x, u) as measures.

Proof. This follows directly from Theorem 2.61 by localization. Assume first thatM is continuous. Let ε > 0 and write Ω = Ω ∩ (Bδ(x1) ∪ . . . ∪ Bδ(xN)), where

|M(x) −M(xi)| < ε for x ∈ Bδ(xi).

Since u(ν)|Ω∩Bδ(xi) u|Ω∩Bδ(xi) in L2 and f(u(ν)|Ω∩Bδ(xi)) → l|Ω∩Bδ(xi) in D′, wehave

l ≥ uTM(xi)u ≥ uTMu− ε|u|2

on Ω ∩ Bδ(xi) and sol ≥ F (·, u) − ε|u|2

on Ω. Now let ε → 0.For general M we mollify by setting Mε = ηε ∗M , where ηε is the rescaled

standard mollifier. Then Mε is smooth and satisfies

aTMεa = aT ηε ∗Ma = ηε ∗ (aTMa︸︷︷︸≥0

) ≥ 0

for all a ∈ Λ. So by the first part of the poof we have l ≥ uTMεu for all ε > 0.Noting that Mε

∗M , we have uTMεu uTMu in L1 and so

l ≥ uTMu.

Proof of Theorem 3.11. As ‖Du(ν)‖L2 and oscDu(ν) are bounded, Du(ν) isbounded in L∞ and we may – after extraction of a subsequence – assume that(Du(ν)) generates a Young measure (νx).

Since Du(ν) ∗ Du, we have for a.e. x ∈ Ω:

Du(x) =

∫

Rm×n

Y dνx(Y ) =: Y .

Furthermore, E(Du(ν))∗ E, where

E(x) :=

∫

Rm×n

E(Y ) dνx(Y ).

Similarly, Du(ν) : E(Du(ν))∗ Y : E(Y ), where

Y : E(Y )(x) :=

∫

Rm×n

Y : E(Y ) dνx(Y ).

72

On the other hand, the div-curl lemma Corollary 2.64 shows that

Du(ν) : E(Du(ν)) =∑

ij

∂ju(ν)i Eij(Du

(ν))∗ Du : E(Du)

since, for every i, curl∇ui = 0 and divEi(Du(ν)) = 0. So

Y : E(Y ) = Du : E.

This shows that∫

Rm×n

(Y − Y ) : (E(Y ) −E(Y )) dνx(Y )

= Du : E − Y :

∫E(Y ) dνx −

∫Y dνx : E(Y ) + Y : E(Y )

= Y : E − Y : E − Y : E(Y ) + Y : E(Y ) = 0.

Noting that by assumption supp νx is contained in a ball of radius ε0 and that

E(Y ) = E(Y ) +DE(Y )(Y − Y ) +O(|Y − Y |2),

we find∫

Rm×n

(Y − Y ) : DE(Y )(Y − Y ) dνx(Y ) ≤ Cε0

∫

Rm×n

|Y − Y |2 dνx(Y ). (3.11)

Now consider the function

F (x, P ) := P : DE(Y )P − γ

2|P |2 for P ∈ Rm×n. (3.12)

By the strict Legendre-Hadamard condition (3.10) this function is quadratic andrank-1-convex. But then Theorem 3.12 gives

∫

U

F (x,Du(x)) dx ≤ lim infν→∞

∫

U

F (x,Duν) dx.

for every U ⊂ Ω open. An approximation procedure similarly as in the proof ofTheorem 3.12 now shows that the term on the right hand side of this equalitycan be expressed in terms of the Young measure, although F depends explicitlyon x:

limν→∞

∫

U

F (Duν) dx =

∫

U

∫

Rm×n

F (x, Y ) dνx(Y ) dx.

U being arbitrary, this proves that

F (x,Du(x)) ≤∫

Rm×n

F (x, Y ) dνx(Y ) for a.e. x.

73

Writing F (x, P ) = P : M(x)P , M(x) ∈ R(m×n)×(m×n), we see that the lastinequality implies that

∫F (x, Y − Y ) dνx(Y )

=

∫F (x, Y ) +

∫F (x, Y ) dνx − Y : M(x)

∫Y dνx −

∫Y dνx : M(x)Y

=

∫F (x, Y ) − F (x, Y ) dνx ≥ 0

for a.e. x. By construction of F in (3.12) and by (3.11), this finally proves that

γ

2

∫

Rm×n

|Y − Y |2 dνx(Y ) =

∫

Rm×n

(Y − Y ) : DE(Y − Y ) − F (x, Y − Y ) dνx(Y )

≤ Cε0

∫

Rm×n

|Y − Y |2 dνx(Y ).

For ε0 so small that Cε0 <γ

2we thus obtain

∫

Rm×n

|Y − Y |2 dνx(Y ) = 0

and hence Y = Y νx-a.e., i.e., νx = δY (x) = δDu(x).

But then the convergence Du(ν) → Du is even strong in Lp for every p <∞,see Corollary 2.68. We thus deduce that E(Du(ν)) → E(Du) strongly in Lp forevery p <∞ and weak* in L∞. This completes the proof.

3.3 Homogenization of elliptic PDE

For ε > 0 we consider the elliptic problem

−n∑

i,j=1

∂xj

(aij

(xε

)∂xiuε(x)

)= f(x) in Ω, uε = 0 on ∂Ω. (3.13)

Here Ω is – as usual – a bounded domain in Rn with sufficiently smooth boundary,say Lipschitz or even C1. The coefficient matrix A = (aij) is assumed to satisfyA ∈ L∞(Rn; Rn×n

sym ) and, for some suitable θ > 0

ξTA(y)ξ ≥ θ|ξ|2 for all ξ ∈ Rn and almost every y ∈ Rn,

i.e., A is uniformly elliptic. Moreover, A shall be Q-periodic, where Q is somecube in Rn. For the sake of simplicity, we will set Q = (0, 1)n.

ε is a small parameter that will eventually tend to zero. In applications, sucha highly osscillating matrix describes a material with “microstructure” at a very

74

fine scale. Our hope is that for small ε the solution uε can be approximatedby some limiting function u which in turn can be found by solving an easier“effective” equation that, in particular, does not depend on ε.

Heuristic approach:We will first follow an heuristic approach. The important idea, which helps

to understand this kind of problems, is that for small ε the quantities x and xε

behave in a sense as independent variables. Whereas xε

describes “microscopicphenomena such as oscillations on a fine scale, x keeps track of the averagedbehavior on the large “macroscopic” scale. This motivates to make a multi-scaleansatz (actually rather a two-scale ansatz in our example) of the form

uε(x) = u0

(x,x

ε

)+ εu1

(x,x

ε

)+ ε2u0

(x,x

ε

)+ . . . (3.14)

and to view y = xε

as a new variable independent of x. The dominant behavioras ε→ 0 is then given by the zero-th order term u0.

Noting that for a function v of the form v(x) = w(x, y)|y= xε

we have

∂xiv(x) =

(∂xiw(x, y) + ε−1∂yi

w(x, y))∣∣

y= xε

and setting Lε = −∑ni,j=1 ∂xj

(aij

(xε

)∂xi

·), we thus obtain

Lεv = −n∑

i,j=1

(∂xj

+ ε−1∂yj

) (aij(y)

(∂xiw + ε−1∂yi

w))

= ε−2L1w + ε−1L2w + L3w,

(to be evaluated at y = xε), where

L1w = −n∑

i,j=1

∂yj(aij(y) ∂yi

w) ,

L2w = −n∑

i,j=1

(∂yj

(aij(y) ∂xiw) + ∂xj

(aij(y) ∂yiw)),

L3w = −n∑

i,j=1

∂xj(aij(y) ∂xi

w) .

By (3.13) we have Lεuε = f and thus by (3.14)

ε−2L1u0 + ε−1 (L1u1 + L2u0) + (L1u2 + L2u1 + L3u0) +O(ε) = f.

Assuming “scale separation”, i.e., that effects on different scales behave essentiallyindependent, we now ask that each individual coefficient in this expansion with

75

respect to powers of ε be zero! We are then led to the equations

L1u0 = 0, (3.15)

L1u1 + L2u0 = 0, (3.16)

L1u2 + L2u1 + L3u0 = f, (3.17)

which are to be solved for functions functions ui = ui(x, y), i = 1, 2, 3, on Ω ×Qwith zero boundary conditions in x on ∂Ω and periodic boundary conditions iny on ∂Q.

Multiplying (3.15) with u0 itself and integrating over Q we obtain

∫

Q

(∇yu0(x, y))TA(y)∇yu0(x) = 0

by partial integration. Note that there are no boundary terms due to the peri-odicity of u0 in y. But then our ellipticity assumptions shows that ∇yu0 = 0 andthus u0 = u0(x) is a function of x only.

Consequently,

L2u0 = −n∑

i,j=1

∂yjaij(y) ∂xi

u0

and (3.16) implies

L1u1 =

n∑

i,j=1

∂yjaij(y) ∂xi

u0.

In order to determine the y-dependency of u1, we solve the so-called correctorproblems

L1χk =

n∑

j=1

∂yjakj(y) in Q (3.18)

for χ = χ(y), k = 1, . . . , n, with periodic boundary conditions on ∂Q. Note thatthis is possible by Fredholm’s alternative: As we just saw, the kernel of L1 = L∗

1

with respect to periodic boundary conditions precisely consists of all constantfunctions. Since ∫

Q

n∑

j=1

∂yjakj(y) = 0

(again due to periodicity), the right hand side of (3.18) is indeed orthogonal toall these constant functions.

This allows us to re-write the above equation as

L1

(u1 −

∑

k

χk ∂xku0

)= 0

76

and thus

u1 =∑

k

χk ∂xku0 + u1 (3.19)

for some u1 = u1(x) only depending on x.The effective equation for u0 is now found by observing that due to the solv-

ability of (3.17), i.e. the fact that there exists u2 solving

L1u2 = f − L2u1 − L3u0,

Fredholm’s alternative implies∫

Q

(f(x) − L2u1(x, y) − L3u0(x)) dy = 0. (3.20)

Now clearly∫

Q

f(x) dy = f(x) (3.21)

and∫

Q

L3u0(x) dy = −n∑

j,k=1

∂xk

(∫

Q

ajk(y) dy ∂xju0

). (3.22)

Furthermore, due to the periodicity of aij∂xiu1 in y and (3.19),

∫

Q

L2u1(x, y) dy = −∫

Q

n∑

i,j=1

(∂yj

(aij(y) ∂xiu1) + ∂xj

(aij(y) ∂yiu1))

= 0 −∫

Q

n∑

i,j,k=1

∂xj(aij(y) ∂yi

(χk ∂xku0 + u1))

= −n∑

j,k=1

∫

Q

n∑

i=1

aij(y) ∂yiχk ∂xj

∂xku0. (3.23)

Thus defining the “homogenized coefficients” ajk by

ajk :=

∫

Q

(ajk(y) +

n∑

i=1

aij(y) ∂yiχk

)dy,

we obtain from (3.20), (3.21), (3.22) and (3.23) that the dominant term u = u0

in (3.14), which only depends on x, solves the equation

−n∑

i,j=1

∂xj(aij ∂xi

u(x)) = f(x) in Ω, u = 0 on ∂Ω.

77

Although this approach does not lead to a rigorous proof of uε u0, thesolution of the equation above, this multi-scale analysis is an important firststep: It provides us with an explicit formula for the homogenized coefficients.

Rigorous analysis:In the second part of the section we will give a rigorous convergence result.

Suppose uε is the (unique) weak solution of (3.13). We will now also distinguishmore carefully between the operator L1 and its formal adjoint L∗

1. (Of course,L1 = L∗

1 if A is symmetric.) Define χk = χk(y), k = 1, . . . , n, to be a weaksolution of the corrector problem

n∑

i,j=1

∂yi

(aij(y)∂yj

χk(y))

=n∑

i=1

∂yiaik(y) in Q

subject to periodic boundary conditions. More precisely, χk ∈ H1per(Q) is such

that

∫

Q

n∑

i,j=1

aij(y) ∂yjχk(y) ∂yi

ψ(y) dy =

∫

Q

n∑

i=1

∂yiaik(y)ψ(y) dy (3.24)

for every ψ ∈ H1per(Q). Here the Hilbert spaceH1

per(Q) ofQ-periodicH1-functionsconsists of the restrictions to Q of Q-periodic functions belonging to H1

loc(Rn).

By Fredholm’s alternative, such functions χk do exist. Also note that χk is deter-mined uniquely up to an additive constant. Define the homogenized coefficientsby

aij :=

∫

Q

n∑

k=1

aik(y) (δkj + ∂ykχj(y)) dy.

Theorem 3.13 uε converges to u weakly in W 1,20 (Ω), where u is the (unique)

weak solution of

−n∑

i,j=1

∂xj(aij ∂xi

u(x)) = f(x) in Ω, u = 0 on ∂Ω. (3.25)

Proof. As solutions to (3.13), the sequence (uε) is bounded in W 1,20 (Ω). We may

therefore extract a converging subsequence uε u in W 1,20 (Ω). Since the weak

solution of Equation (3.25) is unique, it suffices to show that u solves (3.25).For a further subsequence (not relabeled) we have

n∑

i=1

aij

( ·ε

)∂xiuε ξj (3.26)

78

for suitable ξj ∈ L2. Passing to the limit in the weak form of (3.13) we get

∫

Ω

ξ · ∇ϕdx =

∫

Ω

f ϕ dx (3.27)

for every ϕ ∈W 1,20 (Ω).

We define the “corrector functions” ϕεk by

ϕεk(x) = xk + εχk

(xε

)

for x ∈ Rn. From (3.24) we the deduce that for every ψ ∈W 1,20 (Ω) (extended by

0 outside Ω)

∫

Ω

n∑

i,j=1

aij

(xε

)∂xj

ϕεk(x) ∂xi

ψ(x) dx

=

∫

Ω

n∑

i,j=1

aij

(xε

)(δjk + ∂jχk

(xε

))∂xiψ(x) dx

=

∫

Rn

n∑

i,j=1

aij (y)(δjk + ∂yj

χk (y))∂iψ(εy) εndy.

With a partition of unity we can write ψ(εy) =∑

m∈ 12

Zn ψm(z + y), where

suppψm ⊂ Q. But then

∫

Ω

n∑

i,j=1

aij

(xε

)∂xj

ϕεk(x) ∂xi

ψ(x) dx

= εn∑

m∈ 12

Zn

∫

Q

n∑

i,j=1

aij (y)(δjk + ∂yj

χk (y))∂iψm(z + y) dy = 0

by (3.24) and so ϕεk is a weak solution of

n∑

i,j=1

∂xi

(aij

(xε

)∂xj

ϕεk

)= 0. (3.28)

Let ζ ∈ C∞c (Ω). Since uε solves (3.13), we have

∫

Ω

f(x) ζ(x)ϕεk(x) dx

=

∫

Ω

n∑

i,j=1

aij

(xε

)∂xiuε(x)

(ϕε

k(x) ∂xjζ(x) + ζ(x) ∂xj

ϕεk(x)

)dx.

79

Using that ∂xiuε ζ = ∂xi

(uεζ) − uε∂xiζ and that ϕε

k solves (3.28) and thus

∫

Ω

n∑

i,j=1

aij

(xε

)∂xi

(uεζ) ∂xjϕε

k dx = 0,

we arrive at

∫

Ω

f ζ ϕεk =

∫

Ω

n∑

i,j=1

aij

(xε

)∂xiuε ∂xj

ζ −∫

Ω

n∑

i,j=1

aij

(xε

)uε ∂xi

ζ ∂xjϕε

k. (3.29)

Now by the definition of ϕεk and by uε u in W 1.2 we have

ϕεk → xk and uε → u

strongly in L2 as ε→ 0. Also,

n∑

j=1

aij

(xε

)∂xj

ϕεk(x) =

n∑

j=1

aij

(xε

)(δkj + ∂jχk

(xε

)) aik

as the L2-weak limit of a highly oscillating sequence. Together with (3.26) wefind by first sending ε to 0 in (3.29) and then using (3.27) with ϕ = xkζ

∫

Ω

n∑

j=1

ξj xk ∂xjζ −

∫

Ω

n∑

i=1

aik u ∂xiζ =

∫

Ω

f xk ζ =

∫

Ω

ξ · ∇(xk ζ)

=

∫

Ω

n∑

j=1

ξj xk ∂xjζ +

∫

Ω

n∑

j=1

ξj ζ δjk

and so

∫

Ω

n∑

i=1

aik ∂xiu ζ = −

∫

Ω

n∑

i=1

aik u ∂xiζ

∫

Ω

ξk ζ.

Since ζ was arbitrary, this yields

ξk =n∑

i=1

aik∂xiu.

But then again from (3.27) we obtain that for every ϕ ∈W 1,20 (Ω)

∫

Ω

n∑

i,k=1

aik ∂xiu ∂xk

ϕ =

∫

Ω

n∑

k=1

ξk ∂xkϕ =

∫

Ω

f ϕ.

This proves that u is indeed a weak solution of (3.25).

80

Chapter 4

Variationsmethoden furvektorwertige Probleme

Achtung: Dieser Teil des Skripts ist dem Skript “Partielle Differen-tialgleichungen 2” des Sommersemesters 2009 entnommen. Es enthaltalle in der VL besprochenen Resultate, jedoch in anderer Reihenfolge.

Im letzten Kapitel widmen wir uns nun ausschließlich dem “variationellenFall”: Wir untersuchen Funktionale auf (Teilmengen von) Sobolevraumen W 1,p

auf die Existenz von Minimierern. Unter geeigneten Voraussetzungen erfullendiese Minimierer eine PDG, die sich aus dem betrachteten Funktional ableitet,die Euler-Lagrange-Gleichung.

Im Skript PDG 1, Kap. 5.4 hatten wir uns bereits mit Problemen dieser Artbeschaftigt. Dort war I ein Funktional von der Form

I(u) =

∫

U

f(x, u(x), Du(x))

auf einer Teilmenge des Sobolevraums W 1,p(U), U ⊂ Rn offen, zu minimieren.Im Unterschied hierzu wollen wir nun vektorwertige Probleme untersuchen, d.h.Funktionale auf (Teilmengen von) W 1,p(U ; Rm). Wie im Skript PDG 1 wollenwir Minimierer mit der sog. direkten Methode finden.

Es stellt sich nun heraus, dass die Situation fur vektorwertige Probleme wesentlichkomplizierter ist als im skalaren Fall. Die skalaren Ergebnisse lassen sich zwarunschwer auf den allgemeinen Fall ubertragen. Doch dies fuhrt in den Anwen-dungen zu viel zu starken Voraussetzungen. Im Falle der dreidimensionalen Elas-tizitatstheorie werden wir sehen, dass man tatsachlich neue Konzepte benotigt,um physikalisch relevante Probleme behandeln zu konnen.

81

4.1 Euler-Lagrange-Gleichung

Wir untersuchen zunachst den Zusammenhang zwischen Minimierern und derEuler-Lagrange-Gleichung, der fast vollig analog zum skalaren Fall verlauft (vgl.Skript PDG 1).

Es sei U ⊂ Rn offen und beschrankt und

f : U × Rm × Rm×n → R ∪ ∞,(x, y, z) = (x1, . . . , xn, y1, . . . , ym, z11, z12, . . . , zmn) 7→ f(x, y, z)

eine glatte Funktion. Betrachte das Funktional

I(u) =

∫

U

f(x, u(x), Du(x)) dx.

Erfullt f die Wachstumsbedingung

|f(x, y, z)| ≤ C (1 + |y|p + |z|p) ,

fur ein 1 ≤ p < ∞, so ist es naturlich, den Sobolevraum W 1,p(U ; Rm) (odereinen Teilraum davon) als Definitionsbereich von I anzunehmen. Da wir auchRandwertprobleme betrachten wollen, nehmen wir an, dass ∂U C1-glatt1 ist undbetrachten I auf

A = Ag := u ∈W 1,p(U ; Rm) : u = g auf ∂U im Spursinne= u ∈W 1,p(U ; Rm) : u− g ∈W 1,p

0 (U ; Rm),

wobei g eine gegebene Funktion in W 1,p(U ; Rm) sei.Es sei nun u eine Minimalstelle von I. Ist dann v ∈W 1,p

0 (U ; Rm), so ist auchu+ tv = g auf ∂U fur t ∈ R und die Funktion

t 7→ i(t) := I(u+ tv)

nimmt bei t = 0 ein Minimum an. Wenn also i differenzierbar ist, muss i′(0) = 0gelten. Um dies nachzuweisen, nehmen wir zusatzlich an, dass f die Wachstums-bedingungen

|Dyf(x, y, z)|, |Dzf(x, y, z)| ≤ C(1 + |y|p−1 + |z|p−1

)

fur x ∈ U, y ∈ Rm, z ∈ Rm×n erfullt. Dann ist nach der Youngschen Ungleichung

1Es wurde auch reichen anzunehmen, dass U einen Lipschitz-Rand hat.

82

(also ab ≤ ap

p+ bq

qfur a, b ≥ 0, 1

p+ 1

q= 1)

∣∣∣∣d

dtf(x, u(x) + tv(x), Du(x) + tDv(x))

∣∣∣∣= |Dyf(x, u(x) + tv(x), Du(x) + tDv(x)) v(x)

+ Dzf(x, u(x) + tv(x), Du(x) + tDv(x))Dv(x)|≤ C(1 + |u(x) + tv(x)|p−1 + |Du(x) + tDv(x)|p−1)(|v(x)| + |Dv(x)|)≤ C(1 + |u(x) + tv(x)|p + |Du(x) + tDv(x)|p) + C(|v(x)|p + |Dv(x)|p)≤ C(1 + |u(x)|p + |Du(x)|p) + C(|v(x)|p + |Dv(x)|p)

fur |t| ≤ 1. Die letzte Funktion ist unabhangig von t ∈ (−1, 1) und integrierbar.Wir durfen also unter dem Integral differenzieren und erhalten

0 = i′(0) =

∫

U

d

dt

∣∣∣∣t=0

f(x, u(x) + tv(x), Du(x) + tDv(x)) dx

=

∫

U

m∑

i=1

∂yif(x, u(x), Du(x))vi(x) +

m∑

i=1

n∑

j=1

∂zijf(x, u(x), Du(x))∂jvi(x) dx.

Durch diese Rechnung und partielle Integration motiviert definieren wir:

Definition 4.1 Wir sagen u ∈ A ist eine schwache Losung des Randwertprob-lems

− div(Dzf(x, u(x), Du(x)) +Dyf(x, u(x), Du(x)) = 0 in U, u = g auf ∂U.

wenn∫

U

Dyf(x, u(x), Du(x)) · v(x) +Dzf(x, u(x), Du(x)) : Dv(x) dx = 0

fur alle v ∈W 1,p0 (U ; Rm).

Hier ist Dzf als m × n Matrix mit den Eintragen ∂zijf zu verstehen. (Das

Skalarprodukt im Matrizenraum wird oft mit einem Doppelpunkt bezeichnet.Fur A = (aij), B = (bij) ∈ Rm×n ist A : B :=

∑m

i=1

∑n

j=1 aij bij = Spur(ATB).)In Indexschreibweise entspricht das den m Gleichungen

−∑

j

∂j∂zijf(x, u(x), Du(x)) + ∂yi

f(x, u(x), Du(x)) = 0 fur i = 1, . . . , m.

Unsere Rechnung von oben zeigt dann:

83

Theorem 4.2 Es sei f : U × Rm × Rm×n → R eine glatte Funktion, die denWachstumsbedingungen

|f(x, y, z)| ≤ C (1 + |y|p + |z|p) ,|Dyf(x, y, z)|, |Dzf(x, y, z)| ≤ C

(1 + |y|p−1 + |z|p−1

)

fur alle x ∈ U, y ∈ Rm, z ∈ Rm×n genugt. Ist dann I(u) = minv∈A I(v), so ist ueine schwache Losung des Randwertproblems

− div(Dzf(x, u(x), Du(x)) +Dyf(x, u(x), Du(x)) = 0 in U, u = g auf ∂U.

Remark 4.3 Die PDG

− div(Dzf(x, u(x), Du(x)) +Dyf(x, u(x), Du(x)) = 0

heißt die zum Funktional I gehorige Euler-Lagrange-Gleichung. Diese Gleichungist eine PDG in Divergenzform.

Beispiel: Ein genuin vektorwertiges variationelles Problem ist das Auffinden vonDeformationen kleiner Energie in der dreidimensionalen Elastizitatstheorie. Essei U ⊂ R3 ein elastischer Korper, U ⊂ R3 offen und beschrankt. Ordnet man je-dem Punkt x aus U einen Punkt y(x) ∈ R3 zu, so wird dadurch eine Deformationy definiert. Die zur Deformation y notige Energie, die aus den lokalen Verzerrun-gen herruhrt, ist fur sog. hyperelastische Materialien durch ein Integralfunktionalvon der Form

E(y) =

∫

U

W (x,Dy(x)) dx

gegeben. Hierbei ist W die gespeicherte Energiedichte, die im Falle homogenerMaterialien nicht explizit von x abhangt. Die Abhangigkeit von y durch den De-formationsgradienten Dy erklart sich dadurch, dass Dy gerade die lokalen Verz-errungen des Korpers misst, die die elastische Energie speichern. Ein Minimierervon E in der Klasse A beschreibt dann die energetisch gunstigste Konfiguration,die die vorgegebenen Randwerte realisiert.

4.2 Die direkte Methode

Mit der direkten Methode der Variationsrechnung lasst sich unter geeignetenVoraussetzungen die Existenz von Minimierern bestimmter Funktionale ‘direkt’zeigen, ohne die zugehorigen Euler-Lagrange-Gleichungen zu untersuchen. Wirwiederholen zunachst das allgemeine Prinzip, das schon im Skript PDG 1 vorgestelltwurde.

84

Es sei I : X → R ein Funktional auf einem metrischen (oder auch nur topol-ogischen) Raum X. Ist I nach unten beschrankt, so kann man eine Minimalfolge(un) betrachten, d.h. eine Folge (un) mit

limn→∞

I(un) = infv∈X

I(v).

Nun wurde man gerne folgern, dass (un) (oder auch nur eine Teilfolge) gegenein u ∈ X konvergiert. Im Allgemeinen ist jedoch die Annahme, dass etwa Xkompakt ist, viel zu stark. Da auf Minimalfolgen die Werte I(un) beschranktsind, genugt es vielmehr zu fordern, dass die Niveaumengen v : I(v) ≤ c,c ∈ R, relativ kompakt (bzw. relativ folgenkompakt) sind. Ist dies gewahrleistet,konnen wir in der Tat un → u annehmen.

Nun mochten wir noch u als Minimierer von I identifizieren. Ohne jede Vo-raussetzung an I ist das sicher falsch. Ist aber I unterhalbstetig (bzw. unterhalb-folgenstetig), so ist

infv∈X

I(v) = limnI(un) = lim inf

nI(un) ≥ I(u) ≥ inf

v∈XI(v)

und damit tatsachlich I(u) = infv∈X I(v).

Sei wieder f : U × Rm × Rm×n → R glatt, U ⊂ Rn offen und beschrankt mitC1- (oder Lipschitz-)Rand, und I das Funktional

I(u) =

∫

U

f(x, u(x), Du(x)) dx

auf dem Raum A = Ag = u ∈ W 1,p(U) : u = g im Spursinne der zulassigenFunktionen. Genau wie im skalaren Fall (vgl. Skript PDG 1), ergeben sich dienotigen Kompaktheitseigenschaften der Niveaumengen aus Koerzivitatsannahmenan f .

Lemma 4.4 Erfullt f die Wachstumsbedingung

f(x, y, z) ≥ c1|z|p − c2

fur geeignete Konstanten c1 > 0, c2 ∈ R, p ∈ (1,∞), so ist v ∈ A : I(v) ≤ cschwach relativ folgenkompakt fur jedes c ∈ R.

Beachte, dass unter dieser Wachstumsbedingung I auf A wohldefiniert ist,wenn man den Wert I(u) = +∞ zulasst.

Proof. Ist v ∈ A mit I(v) ≤ c, so gilt

c1

∫

U

|Dv(x)|p dx ≤∫

U

f(x, v(x), Dv(x)) + c2 dx = I(v) + c2|U | ≤ c+ c2|U |.

85

Nach der Poincareschen Ungleichung ist dann auch

‖v‖Lp ≤ ‖v − g‖Lp + ‖g‖Lp ≤ C‖D(v − g)‖Lp + ‖g‖Lp

≤ C(‖Dv‖Lp + ‖Dg‖Lp) + ‖g‖Lp.

beschrankt.Ist daher (uk) eine Folge in v ∈ A : I(u) ≤ c, so ist (uk) beschrankt

in W 1,p(U). Da Lp reflexiv ist, existieren also u, v1, . . . vn ∈ Lp(U), so dassukj

u und ∂iukj vi fur eine geeignete Teilfolge ukj

. Da fur ϕ ∈ C∞c (U) gilt∫

vi ϕ = limj

∫∂iukj

ϕ = − limj

∫ukj

∂iϕ = −∫u ∂iϕ, ist tatsachlich vi = ∂iu

und es giltukj

u, Dukj Du in Lp.

Weil die Spurabbildung stetig ist, ist auch u = g auf ∂U , so dass u ∈ A gilt.

Auch die Unterhalbstetigkeit kann man wie im skalaren Fall folgern, wenn fkonvex ist. Wir werden jedoch gleich im Anschluss an den folgenden Satz sehen,dass im vektorwertigen Fall die Annahme f sei konvex fur wichtige Anwendungenzu restriktiv ist.

Lemma 4.5 Es sei f : U ×Rm ×Rm×n → R eine glatte, nach unten beschrankteFunktion, so dass

z 7→ f(x, y, z)

konvex ist fur alle x ∈ U, y ∈ Rm. Dann ist I schwach unterhalbfolgenstetig aufW 1,p(U ; Rm).

Proof. Es sei (uk) eine Folge mit uk u in W 1,p(U ; Rm), d.h. uk u inLp(U ; Rm) undDuk Du in Lp(U ; Rm×n). Es ist zu zeigen, dassm := lim infk→∞I(uk) ≥ I(u). Nach Ubergang zu einer Teilfolge (wieder mit (uk) bezeichnet)konnen wir annehmen, dass m = limk→∞ I(uk) ist.

Aus dem Satz von Rellich-Kondrachov folgt uk → u stark in Lp. Indem wirzu einer Teilfolge (wieder mit (uk) bezeichnet) ubergehen, konnen wir außerdemuk → u fast uberall annehmen.

Nach dem Satz von Egorov2 existiert nun zu jedem j ∈ N eine Menge Aj , sodass

uk → u gleichmaßig auf Aj und |U \ Aj| ≤ j−1.

Dabei konnen wir naturlich Aj so wahlen, dass Aj+1 ⊃ Aj . Wahle nun nochMengen Bj := x ∈ U : |u(x)|+|Du(x)| ≤ j und beachte, dass limj→∞ |U\Bj | =0, da |u|+ |Du| integrierbar ist. Dann aber ist auch limj→∞ |U \ (Aj ∩Bj)| = 0.

2Der Satz von Egorov besagt: Ist U ⊂ Rn messbar mit |U | < ∞ und fk eine Funktionenfolgemit fk → f fast uberall, so gibt es zu jedem ε > 0 eine Menge Aε mit |U \ Aε| ≤ ε, so dassfk → f gleichmaßig auf Aε konvergiert. (Dieser Satz wird in der Maßtheorie bewiesen.)

86

Da f nach unten beschrankt ist, konnen wir nach Addition mit einer geeignetenKonstanten annehmen, dass f ≥ 0 gilt. Aus der Konvexitat von f in z erhaltenwir fur jedes j:

I(uk) =

∫

U

f(x, uk, Duk) dx ≥∫

Aj∩Bj

f(x, uk, Duk) dx

≥∫

Aj∩Bj

f(x, uk, Du) +Dzf(x, uk, Du) · (Duk −Du) dx.

Auf Aj ∩Bj konvergieren nun f(x, uk, Du) und Dzf(x, uk, Du) gleichmaßig gegenf(x, u,Du) bzw. Dzf(x, u,Du). Da zudem Duk schwach gegen Du konvergiert,ergibt sich

m = limk→∞

I(uk) ≥∫

Aj∩Bj

f(x, u,Du) dx.

Da f ≥ 0 vorausgesetzt ist, folgt nun mit monotoner Konvergenz (wegenAj+1 ∩ Bj+1 ⊃ Aj ∩Bj)

m ≥ limj→∞

∫

Aj∩Bj

f(x, u,Du) dx = I(u).

Corollary 4.6 Es sei f : U × Rm × Rm×n → R eine glatte Funktion, die dieWachstumsbedingung

f(x, y, z) ≥ c1|z|p − c2

fur geeignete Konstanten c1 > 0, c2 ∈ R, p ∈ (1,∞) erfulle, so dass

z 7→ f(x, y, z)

konvex ist fur alle x ∈ U, y ∈ Rm. Dann existiert ein u ∈ A mit

I(u) = infv∈A

I(v).

Proof. Das folgt mit der direkten Methode aus Lemma 4.4 und Lemma 4.5.

Insbesondere ist der Minimierer nach Satz 4.2 eine schwache Losung derEuler-Lagrange-Gleichungen unter geeigneten Wachstumsbedingungen an f undD(y,z)f .

Beispiel: Um einzusehen, dass Konvexitat i.A. eine zu starke Annahme ist, be-trachten wir wieder das Energiefunktional

E(y) =

∫

U

W (Dy)

87

aus dem vorigen Beispiel. Liegen in der Ausgangslage (der sog. Referenzkonfigura-tion) keine internen Verspannungen vor, so wird man nach geeigneter Normierungannehmen durfen, dass W ≥ 0 ist und W (F ) = 0 ist, wenn F die EinheitsmatrixF = Id ist. Da reine Drehungen keine elastische Energie kosten sollten, sollte Wsogar auf ganz SO(3) verschwinden. Ware nun W konvex, so ware

0 ≤W

0 0 00 0 00 0 1

≤ 1

2W

−1 0 00 −1 00 0 1

+

1

2W

1 0 00 1 00 0 1

≤ 0.

Das ist aber unphysikalisch: Man kann einen dreidimensionalen elastischen Korpernicht auf einen eindimensionalen Strich zusammenpressen, ohne dem System En-ergie zuzufuhren. (Im Gegenteil: Eine solche Deformation sollte sogar unendlichviel Energie kosten.)

4.3 Polykonvexitat

In diesem Abschnitt werden wir das eben angesprochene Dilemma losen, indemwir Integranden zulassen, die nicht mehr konvex sein mussen. Um die direkteMethode zur Auffindung von Minimierern anwenden zu konnen, mussen wir je-doch sicherstellen, dass die betrachteten Funktionale noch unterhalbstetig sind.Als geeignete Klasse von Integranden werden sich die sog. polykonvexen Funk-tionen herausstellen. Diese Funktionen fuhren einerseits zu unterhalbstetigenFunktionalen, andererseits lassen sich mit ihnen Energiefunktionale modellieren,die auch physikalisch sinnvoll sind.

Im folgenden Abschnitt werden wir sehen, dass diese Klasse der guten In-tegranden noch erweitert werden kann. Im Allgemeinen kann es jedoch sehrschwierig sein zu entscheiden, ob die direkte Methode anwendbar ist. Polykon-vexitat (gemeinsam mit geeigneten Wachstumsbedingungen) liefert hier ein wichtigeshinreichendes Kriterium.

Wir erinnern zunachst an den Begriff der Kofaktormatrix aus der linearenAlgebra. Ist A eine r×r Matrix und bezeichnet man mit A(i, j) die (r−1)×(r−1)Matrix, die dadurch entsteht, dass man die i-te Zeile und die j-te Spalte in Astreicht, so ist die Kofaktormatrix cof A definiert als die r × r Matrix mit denEintragen

(cof A)ij = (−1)i+j detA(i, j).

In der linearen Algebra zeigt man, dass

detA Id = AT cof A = A(cof A)T (4.1)

gilt. Beachte, dass cof A gerade die Ableitung von detA nach den Eintragen von

88

A ist: Nach (4.1) gilt

detAδij =

n∑

k=1

aki(cof A)kj (4.2)

fur alle i, j ∈ 1, . . . r. Setzt man i = j = m in (4.2), so ergibt sich

∂ detA

∂alm

=∂

∂alm

n∑

k=1

akm(cof A)km = (cof A)lm. (4.3)

Das wesentliche Hilfsresultat ist nun, dass die Kofaktormatrix einer Jakobi-matrix divergenzfrei ist:

Lemma 4.7 Es sei u : U → Rn, U ⊂ Rn offen, glatt. Dann gilt

div(cof Du) = 0,

wobei die Divergenz zeilenweise zu nehmen ist, also

n∑

j=1

∂j((cof Du)ij) = 0 fur i = 1, . . . , n.

Proof. Wendet man (4.2) auf A = Du an und bildet die Divergenz, so ergibt sich

∑

j

∂j (detDu δij) =∑

j

∂j

(n∑

k=1

∂iuk(cofDu)kj

)

Mit Hilfe von (4.3) folgt daraus

∑

j,l,m

δij(cof Du)lm∂j∂mul =∑

j,k

∂i∂juk(cof Du)kj +∑

j,k

∂iuk∂j((cofDu)kj)

Fuhrt man hier auf der linken Seite die Summation uber j aus, so erhalt mangerade die erste Summe auf der linken Seite. Damit ist

∑

j,k

∂iuk∂j((cofDu)kj) = 0,

d.h.

Du div(cofDu)) = 0

gezeigt. Fur alle x0, fur dieDu(x0) nicht singular ist, folgt nun div(cofDu))(x0) =0.

89

Ist detDu(x0) = 0, so betrachte die Abbildung uε(x) := u(x) + εx. Hinre-ichend klein gewahlt ist ε > 0 kein Eigenwert von Du(x0), so dass Duε(x0) =Du(x0)+ε Id nicht singular ist. Wie oben gezeigt folgt daher div(cofDuε)(x0)) =0. Im Grenzwert ε→ 0 ergibt sich daraus div(cofDu)(x0)) = 0.

Ein analoges Resultat gilt fur die Minoren von Jakobimatrizen. Zur Erin-nerung: Ist F eine m× n Matrix, so wird die Determinante einer quadratischenTeilmatrix von F ein Minor von F genannt. Im Folgenden bezeichnen wir furfestes r ≤ minm,n und feste 1 ≤ i1 < i2 < . . . < ir ≤ m, 1 ≤ j1 < j2 < . . . <jr ≤ n mit S(F ) = S i1,...,ir

j1,...,jr

(F ) die r × r Submatrix

S(F ) =

fi1j1 · · · fi1jr

......

firj1 · · · firjr

fur Matrizen F = (fij) ∈ Rm×n sowie mit M(F ) = M i1,...,irj1,...,jr

(F ) den r-Minor

M(F ) = detS(F ).

Corollary 4.8 Es seien u : U → Rm, U ⊂ Rn offen, glatt und 1 ≤ i1 < i2 <. . . < ir ≤ m, 1 ≤ j1 < j2 < . . . < jr ≤ n. Dann gilt

r∑

t=1

∂jt((cof S i1,...,ir

j1,...,jr

(Du))isjt) = 0 fur s = 1, . . . , r.

Proof. Dies folgt indem man Lemma 4.7 lokal auf die Abbildung anwendet, diesich dadurch ergibt dass man

(ui1(x), . . . , uir(x))

als Funktion von (xj1, . . . , xjr) auffasst, wobei die ubrigen Koordinaten fix sind.

Eine wichtige Konsequenz dieser Beobachtung ist, dass die Minoren von Jako-bimatrizen schwach stetig sind.

Theorem 4.9 Es sei r < p < ∞, r ∈ 1, . . . ,minm,n. Ist (u(k)) eine Folgeaus W 1,p(U ; Rm) mit

u(k) u in W 1,p(U ; Rm),

dann gilt fur alle r-Minoren M

M(Du(k)) M(Du) in Lpr (U ; Rm).

Proof. Der Beweis erfolgt durch Induktion uber r. Dabei ist der Fall r = 1 klar,da die 1-Minoren einer Matrix gerade deren Eintrage sind.

90

Betrachte nun den r-Minor M(F ) = M i1,...,irj1,...,jr

(F ). Ist w ∈ C∞(U ; Rm), so gilt

nach (4.1)

M(Dw) =r∑

s=1

∂jswit(cof S(Dw))itjs

fur t = 1, . . . , r.

Nach Korollar 4.8 istr∑

s=1

wit∂js(cof S(Dw))itjs

= 0,

so dass M(Dw) als Ableitung geschrieben werden kann:

M(Dw) =

r∑

s=1

∂js(wit(cof S(Dw))itjs

) fur t = 1, . . . , r. (4.4)

Fur ϕ ∈ C∞c (U) ergibt sich daraus

∫

U

M(Dw)ϕ = −∫

U

r∑

s=1

wit(cof S(Dw))itjs∂jsϕ fur t = 1, . . . , r. (4.5)

Da die TermeM(Dw) und wit(cof S(Dw))itjsaus r-fachen Produkten von Eintragen

in w und Dw bestehen, zeigt ein Standard-Approximationsargument (benutze dieallgemeine Holder-Ungleichung), dass (4.5) fur alle w ∈W 1,p, p ≥ r, gilt.

Nun konvergiert u(k) u schwach in W 1,p(U ; Rm) und damit wegen p ≥ rauch in W 1,r(U ; Rm). Nach dem Satz von Rellich-Kondrachov konvergiert u(k) →u (stark) in Lq(U) fur 1 ≤ q < r∗ = nr

n−r. Außerdem gilt nach Induktionsannahme

cof S(Du(k)) cof S(Du) in Lp

r−1 (U) und damit auch in Lr

r−1 (U), da die Eintragevon cof S(F ) ja gerade (r − 1)-Minoren von F sind. Dann aber konvergiert

u(k)it

(cof S(Du(k)))itjs uit(cof S(Du))itjs

schwach in Lq fur 1 ≤ q < nn−1

fur alle it, js, denn es ist r−1r

+ n−rrn

= n−1n

. Mit

Hilfe von (4.5) fur w = u(k) bzw. w = u folgt nun

limk

∫

U

M(Du(k))ϕ = − limk

∫

U

r∑

s=1

u(k)it

(cof S(Du(k)))itjs∂jsϕ

= −∫

U

r∑

s=1

uit(cof S(Du))itjs∂jsϕ =

∫

U

M(Du)ϕ.

(4.6)

Dies zeigt, dass M(Du(k)) gegen M(Du) im Distributionensinne konvergiert,sogar wenn nur p ≥ r vorausgesetzt ist. Da nun Du(k) eine beschrankte Folgein W 1,p ist, ist M(Du(k)) beschrankt in L

pr . Ist nun p > r, so ergibt sich aus

der Reflexivitat von Lpr , dass jede Teilfolge von M(Du(k)) eine in L

pr schwach

konvergente Teilfolge besitzt. Nach (4.6) muss dieser Limes M(Du) sein. Dannaber konvergiert die gesamte Folge M(Du(k)) gegen M(Du).

91

Remark 4.10 Dieser Satz gilt auch fur p = ∞, wenn man die schwache durchschwach*-Konvergenz ersetzt. Das folgt unmittelbar aus der Version fur p < ∞und der Beobachtung, dass M(Du(k)) in L∞ beschrankt ist, wenn u(k) schwach*-konvergiert in W 1,∞.

Corollary 4.11 Sind u, v ∈W 1,p(U ; Rm) mit u−v ∈W 1,p0 (U ; Rm) fur ein p ≥ r,

so gilt ∫

U

M(Du) =

∫

U

M(Dv)

fur alle r-Minoren M .

Proof. Nach Approximation von u und u−v durfen wir o.B.d.A. u ∈ C∞(U) undu− v ∈ C∞

c (U) annehmen. Wie im Beweis von Satz 4.9 gezeigt gilt

M(Dw) =

r∑

s=1

∂js(wit(cof S(Dw))itjs

) fur t = 1, . . . , r

(s. Gleichung (4.4)) fur glatte Funktionen w. Insbesondere fur w = u und w = vergibt sich damit

∫

U

M(Du) =

∫

∂U

uit(cof S(Du))itjsνjs

=

∫

∂U

vit(cof S(Dv))itjsνjs

=

∫

U

M(Dv),

wenn ν die außere Normale an ∂U bezeichnet.

Eine Funktion f : Rm×n → R mit dieser Eigenschaft, dass∫

Uf(Du) nur von

den Werten von u auf dem Rand ∂U abhangt, nennt man Null-Lagrangefunktion.

Definition 4.12 (i) Ist F ∈ Rm×n, so bezeichne M(F ) den aus allen Minoren

von F bestehenden Vektor der Dimension d(m,n) =∑minm,n

r=1

(m

r

)(n

r

).

(ii) Eine Funktion f : Rm×n → R∪∞ heißt polykonvex, wenn es eine konvexe

Funktion g : Rd(m,n) → R ∪ ∞ gibt, so dass

f(F ) = g(M(F )) ∀F ∈ Rm×n

gilt.

Insbesondere ist jede konvexe Funktion polykonvex. Die Umkehrung davongilt nicht, wenn m,n ≥ 2 ist wie das Beispiel des Minors

F = (fij) 7→ f11f22 − f12f21

92

zeigt. Diese Funktion ist sogar affin im ersten 2-Minor, aber sicherlich nichtkonvex auf f11 = f22 = 0, f12 = f21.Beispiel: Eine Energiedichte W in der Elastizitatstheorie, die zu starke Kompres-sionen energetisch bestraft ist z.B.

W (F ) = |F |2 + ψ(detF ), mit ψ(t) =

1t, t > 0,

∞, t ≤ 0.

Dieses W ist polykonvex.

Wir kommen nun zum wesentlichen Unterhalbstetigkeitsresultat fur Integranden,die polykonvex in Du sind:

Theorem 4.13 Es sei f : U×Rm×Rm×n → R eine glatte, nach unten beschrankteFunktion, so dass fur fast alle x ∈ U und alle y ∈ Rm, z ∈ Rm×n

f(x, y, z) = g(x, y,M(z))

fur eine geeignete (glatte) Funktion g gilt, die konvex in z sei. Dann ist I schwachunterhalbfolgenstetig auf W 1,p(U ; Rm) fur p > n.

Der Beweis verlauft ahnlich wie der Beweis von Lemma 4.5.

Proof. Es sei (uk) eine Folge mit uk u in W 1,p(U ; Rm). Es ist zu zeigen,dass lim infk→∞ I(uk) ≥ I(u) gilt. Genau wie im Beweis von Lemma 4.5 siehtman, dass wir annehmen durfen, dass lim infk→∞ I(uk) = limk→∞ I(uk) gilt, dassuk → u stark in Lp und fast uberall konvergiert und dass f ≥ 0 ist.

Wir definieren die Mengen Aj und Bj genau wie im Beweis von Lemma 4.5.Aus der Polykonvexitat von f in z erhalten wir fur jedes j:

I(uk) =

∫

U

f(x, uk, Duk) dx ≥∫

Aj∩Bj

f(x, uk, Duk) dx

=

∫

Aj∩Bj

g(x, uk,M(Duk)) dx

≥∫

Aj∩Bj

g(x, uk,M(Du))

+DM(z)g(x, uk,M(Du)) · (M(Duk) −M(Du)) dx.

Wie im Beweis von Lemma 4.5 ergibt sich daraus nun

limk→∞

I(uk) ≥∫

Aj∩Bj

g(x, u,M(Du)) dx =

∫

Aj∩Bj

f(x, u,Du) dx

und mit monotoner Konvergenz schließlich

limk→∞

I(uk) ≥ limj→∞

∫

Aj∩Bj

f(x, u,Du) dx = I(u).

93

Corollary 4.14 Erfullt f : U × Rm × Rm×n → R zusatzlich zu den Vorausset-zungen von Satz 4.13 die die Wachstumsbedingung

f(x, y, z) ≥ c1|z|p − c2

fur geeignete Konstanten c1 > 0, c2 ∈ R, p > n, so existiert ein u ∈ A mit

I(u) = infv∈A

I(v).

Proof. Das folgt mit der direkten Methode aus Lemma 4.4 und Satz 4.13.

Als Anwendung der Tatsache, dass Determinanten Null-Lagrangefunktionensind (vgl. Korollar 4.11), geben wir hier einen Beweis des Brouwerschen Fixpunk-tsatzes.

Theorem 4.15 (Brouwerscher Fixpunktsatz) Jede stetige Abbildung der abgeschlosse-nen Einheitskugel B1(0) in sich hat einen Fixpunkt.

Der Beweis ergibt sich leicht aus dem folgenden Lemma.

Lemma 4.16 Es gibt keine stetige Abbildung w : B1(0) → ∂B1(0) mit w(x) = xfur alle x ∈ ∂B1(0).

Beweis von Satz 4.15. Es sei u : B1(0) → B1(0) eine stetige Funktion ohneFixpunkt. Fur x ∈ B1(0) definiere w(x) als den Schnittpunkt des Strahles, dervon u(x) ausgeht und durch den Punkt x fuhrt, mit ∂B1(0). Dies definiert einestetige Abbildung w : B1(0) → ∂B1(0) mit w(x) = x fur alle x ∈ ∂B1(0) imWiderspruch zu Lemma 4.16.

Beweis von Lemma 4.16. Wir fuhren die Annahme, es gabe ein solches w zumWiderspruch.

Sei zunachst w als glatt angenommen. Ist v die identische Abbildung v(x) = x,so folgt aus Korollar 4.11

∫

B1(0)

detDw =

∫

B1(0)

detDv = |B1(0)| 6= 0, (4.7)

denn es gilt w = v auf ∂B1(0). Andererseits ist |w|2 ≡ 1, so dass ∂iw · w = 0 furalle i gilt und damit (Dw)Tw = 0. Wegen w 6= 0 ist Null also ein Eigenwert vonDw und somit detDw ≡ 0 auf B1(0), im Widerspruch zu (4.7).

Ist nun w nur als stetig vorausgesetzt, so setzen wir w gemaß w(x) = x aufRn \ B1(0) fort. Ist ηε der skalierte Standard-Glattungskern, so ist w := ηε ∗ wglatt und 6= 0 fur ε hinreichend klein. (Beachte uε → u gleichmaßig fur ε → 0.)

Fur ε < 1 gilt auf ∂B2(0) außerdem

w(x) =

∫ηε(y)(x− y) dy = x−

∫y ηε(y) dy = x,

94

da ηε symmetrisch ist. Damit erfullt nun die Abbildung

˜w : B1(0) → B1(0), ˜w(x) :=w(2x)

|w(2x)| ,

erstens ˜w ∈ C∞(B1(0)), zweitens ˜w(B1(0)) = ∂B1(0) und drittens ˜w(x) = x furx ∈ ∂B1(0) im Widerspruch zum schon behandelten Fall.

4.4 Quasikonvexitat

Obgleich polykonvexe Integranden viele interessante Beispiele von Funktionalenliefern, ist es aus theoretischen Grunden sehr nutzlich einen weiteren, allge-meineren Konvexitatsbegriff fur Funktionen, die auf Matrizen definiert sind, zuuntersuchen: Die Quasikonvexitat. Wir lassen uns dabei von der Frage leiten, furwelche Funktionen f das Integralfunktional

I(u) =

∫

U

f(Du)

unterhalbfolgenstetig auf W 1,p(U ; Rm) ist. Der Einfachheit halber betrachten wirin diesem Abschnitt nur solche Integranden f , die nicht explizit von x oder uabhangen.

Definition 4.17 Eine Funktion f : Rm×n → R heißt quasikonvex, wenn fur einebeschrankte offene Menge U ⊂ Rn mit |∂U | = 0 gilt

−∫

U

f(F +Dϕ(x)) dx ≥ f(F ) ∀ϕ ∈W 1,∞0 (U ; Rm).

Remark 4.18 1. Wir betrachten hier nur R-wertige Funktionen. Will manauch +∞ als Wert von f zulassen, so muss man im Folgenden ein paarzusatzliche technische Detail beachten.

2. Im R-wertigen Fall kann man sogar auf die Voraussetzung |∂U | = 0 verzichten.(Ubung.)

3. Gilt die definierende Ungleichung fur eine beschrankte offene Menge U mit|∂U | = 0, so gilt sie automatisch fur alle solchen Teilmengen von Rn. Diesergibt sich direkt aus dem folgenden Lemma.

Im Hinblick auf spatere Anwendungen beweisen wir die Unabhangigkeit vonU in einer etwas allgemeineren Fassung.

95

Lemma 4.19 Fur f ∈ L1loc(R

m×n) und U ⊂ Rn offen und beschrankt mit |∂U | =0 setze

Qf(F, U) := infϕ∈W 1,∞(U ;Rm)

−∫

U

f(F +Dϕ).

Dann ist Qf(F, U) unabhangig von U .

Beachte, dass Qf(·, U) = f ist, wenn f quasikonvex ist.

Proof. Es seien U, V ⊂ Rn offen und beschrankt mit |∂U | = |∂V | = 0. Nachdem Uberdeckungssatz von Vitali3 gibt es eine hochstens abzahlbare Familie vonpaarweise disjunkten Mengen ajV + bj ⊂ U , aj > 0 und bj ∈ Rn, so dass

∣∣∣∣∣U \⋃

j

(ajV + bj)

∣∣∣∣∣ = 0

ist.Ist nun ϕ ∈ W 1,∞

0 (V ; Rm), so definiert

ψ(x) :=

ajϕ

(x−bj

aj

)fur x ∈ ajV + bj ,

0 fur x /∈ ⋃j ajV + bj

ein Element von W 1,∞0 (U ; Rm). Da |∂V | = 0 vorausgesetzt ist, konnen wir ab-

schatzen

|U |Qf(F, U) ≤∫

U

f(F +Dψ) =∑

j

∫

ajV +bj

f

(F +Dϕ

(x− bjaj

))

=∑

j

∫

ajV +bj

f

(F +Dϕ

(x− bjaj

))

=∑

j

anj

∫

V

f (F +Dϕ (x)) = |U |−∫

V

f (F +Dϕ (x)) .

Dies zeigt Qf(F, U) ≤ Qf(F, V ); die Umkehrung ergibt sich analog.

Wir benotigen noch einen weiteren Konvexitatsbegriff:

Definition 4.20 Eine Funktion f : Rm×n → R heißt Rang-1-konvex, wenn fentlang jeder ‘Rang-1-Geraden’ konvex ist, d.h. wenn die Abbildung

R ∋ t 7→ f(tA+ (1 − t)B)

konvex ist, wann immer Rang(A− B) = 1 ist.

3Der Uberdeckungssatz von Vitali: Es sei U ⊂ Rn beschrankt und offen sowie C eine Familievon abgeschlossenen Teilmengen von U . Es gebe 1. eine Konstante M > 1, so dass zu jedemC ∈ C ein x ∈ U und ein r > 0 existiert mit Br(x) ⊂ C ⊂ BMr(x). 2. sei fur fast alle x ∈ U

infdiamC : C ∈ C mit x ∈ C = 0. Dann gibt es eine hochstens abzahlbare Familie (Cj) von

paarweise disjunkten Mengen aus C mit∣∣∣U \⋃j Cj

∣∣∣ = 0.

In unserer Situation ist dieser Satz fur C = aV +b : a > 0, b ∈ Rn, aV +b ⊂ U zu verwenden.

96

Der folgende Satz klart die Zusammenhange der verschiedenen Konvexitatsbegriffe.

Theorem 4.21 Es sei f : Rm×n → R eine Abbildung. Dann gilt

f konvex =⇒ f polykonvex =⇒ f quasikonvex =⇒ f Rang-1-konvex.

Remark 4.22 1. Da eine Rang-1-konvexe Funktion insbesondere separat kon-vex, d.h. konvex in jedem Eintrag, wenn die ubrigen Argumente festgehaltensind, ist f in jedem Falle stetig.

2. Ist m = 1 oder n = 1, so fallen all diese Konvexitatsbegriffe zusammen.(Klar, dass in diesem Fall f konvex ist genau dann, wenn f Rang-1-konvexist.) Im Allgemeinen sind die Umkehrungen in Satz 4.21 jedoch falsch; dazuspater mehr.

Proof. Dass Konvexitat Polykonvexitat impliziert, haben wir bereits im vorigenAbschnitt bemerkt.

Sei nun f als polykonvex vorausgesetzt, so dass es eine konvexe Funktiong : Rd(m,n) → R gibt mit f(F ) = g(M(F )) fur alle F ∈ Rm×n. Mit Hilfe derJensenschen Ungleichung4 ergibt sich fur F ∈ Rm×n, ϕ ∈ W 1,∞

0 (U ; Rm), U ⊂ Rn

offen,

−∫

U

f(F +Dϕ) = −∫

U

g(M(F +Dϕ))

≥ g

(−∫

U

M(F +Dϕ)

)= g(M(F )) = f(F ),

wobei wir im vorletzten Schritt Korollar 4.11 ausgenutzt haben.Um die letzte Implikation zu zeigen, mussen wir begrunden, dass fur A,B, F ∈

Rm×n mit Rang(A−B) = 1 und F = λA+ (1−λ)B, λ ∈ (0, 1), die Ungleichung

f(F ) ≤ λf(A) + (1 − λ)f(B)

erfullt ist, wenn f quasikonvex ist. Da Rang(A − B) = 1 ist, gibt es Vektorena ∈ Rm, e ∈ Rn mit |e| = 1, so dass5 A− B = a⊗ e und damit

A = F + (1 − λ)a⊗ e und B = F − λa⊗ e

4Die Jensensche Ungleichung: Es sei ϕ : U → Rd eine integrierbare Abbildung, U ⊂ Rn, undg : V → R, V ⊂ Rd offen, eine konvexe Funktion mit ϕ(U) ⊂ V , so dass auch g ϕ integrierbarsei. Dann gilt

−∫

U

g(ϕ(x)) dx ≥ g

(−∫

U

ϕ(x) dx

).

Beweis: Schreibe g = supi∈I gi als Supremum uber affine Funktionen gi. Dann ist −

∫g ϕ ≥

−

∫gi ϕ = gi(−

∫ϕ) fur alle i. Nun bilde das Supremum uber i ∈ I.

5Fur a ∈ Rm, b ∈ Rn bezeichnet a ⊗ b die m × n Matrix abT = (aibj)ij .

97

gilt. Nach einer Drehung des Koordinatensystem konnen wir o.B.d.A. annehmen,dass e = e1 ist.

Es sei z ∈W 1,∞(R; R) die 1-periodische Sagezahnfunktion mit

z(0) = 0, z′(t) =

1 − λ fur t ∈ (0, λ),

−λ fur t ∈ (λ, 1).

Auf dem Quader Q = (0, 1)n betrachten wir die Funktionen

uk(x) = Fx+ az(kx1)

k, vk(x) = Fx+ amin

z(kx1)

k, dist(x, ∂Q)

.

Dann liegt die Funktion x 7→ vk(x) − Fx in W 1,∞0 , so dass

f(F ) ≤ −∫

Q

f(Dvk) ∀ k

gilt. Nun ist aber vk = uk bis auf eine im Limes k → ∞ verschwindende Rand-schicht, so dass

limk

−∫

Q

f(Dvk) = limk

−∫

Q

f(Duk) = λf(A) + (1 − λ)f(B)

ist. Um die letzte Gleichung einzusehen, beachte, dass (fur jedes k) Duk = A aufeiner Menge vom Maße λ und Duk = B auf einer Menge vom Maße 1− λ ist.

Remark 4.23 Die Umkehrungen der ersten beiden Implikationen in Satz 4.21sind falsch wann immer m,n ≥ 2 ist. Die Umkehrung der dritten ist falschfur m ≥ 3, n ≥ 2. Die Frage, ob moglicherweise Quaiskonvexitat aus Rang-1-Konvexitat folgt fur m = 2, n ≥ 2 ist offen. Das folgende Beispiel von Alibert,Dacorogna und Marcellini zeigt, dass man selbst explizit gegenenen Funktionennicht so einfach ansehen kann, ob sie quasikonvex sind.

Beispiel: Es sei m = n = 2,

f(F ) = |F |4 − γ|F |2 detF.

Dann gibt es ein ε > 0, so dass gilt

f konvex ⇐⇒ |γ| ≤ 4

3

√2,

f polykonvex ⇐⇒ |γ| ≤ 2,

f quasikonvex ⇐⇒ |γ| ≤ 2 + ε,

f Rang-1-konvex ⇐⇒ |γ| ≤ 4√3.

Es ist offen, ob ε = 4√3− 2 ist. (Mehr dazu findet man etwa [Dac 08].)

Der wesentliche Zusammenhang zwischen der Unterhalbstetigkeit des Funk-tionals I(u) =

∫f(Du) und der Quasikonvexitat von f ist Inhalt des folgenden

Satzes.

98

Theorem 4.24 Es sei f : Rm×n → R stetig.

(i) I ist schwach*-unterhalbfolgenstetig (σ∗uhfs) auf W 1,∞(U ; Rm) genau dann,wenn f quasikonvex ist.

(ii) Gilt im Falle p ∈ (1,∞) zudem

0 ≤ f(F ) ≤ C(1 + |F |p) ∀F ∈ Rm×n,

so ist I schwach-unterhalbfolgenstetig (σuhfs) auf W 1,p(U ; Rm) genau dann,wenn f quasikonvex ist.

Wir werden hier nur (i) beweisen.

Proof. Sei Q = (0, 1)n und ϕ ∈ W 1,∞0 (Q; Rm). Setze ϕ periodisch zu einer auf

ganz Rn definierten Funktion fort und definiere uk ∈ W 1∞(U ; Rm) durch

uk(x) := Fx+1

kϕ(kx).

Es ist nicht schwer zu sehen, dass dann uk∗ u in W 1,∞(U ; Rm) fur u(x) = Fx

gilt und ∫

U

f(Duk) → |U |∫

Q

f(F +Dϕ)

konvergiert. (Ubung.) Ist nun I unterhalbstetig, so folgt

f(F ) =1

|U |I(u) ≤ lim infk→∞

1

|U |I(uk) = lim infk→∞

1

|U |

∫

U

f(Duk) ≤∫

Q

f(F +Dϕ).

Dies zeigt aber, dass f quasikonvex ist.Sei nun umgekehrt f als quasikonvex vorausgesetzt und uk

∗ u inW 1,∞(U ; Rm).

1. Fall: u(x) = Fx + c ist affin. Wahle U ′ ⊂⊂ U und eine Abschneidefunktionθ ∈ C∞

c (U) mit θ ≡ 1 auf U ′ und setze

vk = u+ θ(uk − u).

Da uk gleichmaßig gegen u konvergiert, gibt es eine von U ′ unabhangige KonstanteC > 0, so dass

|Dvk| ≤ |Du|+ |Dθ| · |uk − u| + |θ| · |Duk −Du| ≤ C

ist fur hinreichend große k ≥ k0, wobei k0 von U ′ abhangt. Da f stetig ist, ergibtsich nun mit M = sup|f(F )| : |F | ≤ 2C

lim infk→∞

I(uk) ≥ lim infk→∞

(∫

U

f(Dvk) +

∫

U\U ′

f(Duk) − f(Dvk)

)

≥ |U |f(F ) − 2M |U \ U ′|,

99

wobei wir vk − u ∈ W 1,∞0 (U ; Rm) und die Quasikonvexitat von f ausgenutzt

haben. Da U ′ ⊂⊂ U beliebig war, folgt daraus nun die Behauptung.

2. Fall: u ist stuckweise affin. lim inf I(uk) ≥ I(u) folgt hier unmittelbar aus Fall1 angewandt auf die Mengen, auf denen u affin ist.

3. Fall: u ist eine allgemeine W 1,∞-Funktion. Es sei U ′′ ⊂⊂ U ′ ⊂⊂ U . Indemwir u zunachst durch Faltung mit Glattungkernen durch eine glatte Funktion vapproximieren, dann fur eine Abschneidefunktion θ ∈ C∞

c (U ′) mit θ ≡ 1 auf U ′′

die Funktionθv + (1 − θ)u

konstruieren und diese schließlich auf U ′′ mit Hilfe einer feinen Triangulisierungdurch ihre stuckweise affine Interpolation ersetzen, erhalten wir eine Folge vj vonApproximationen an u, so dass 1. vj stuckweise affin in U ′′ ist, 2. vj ≡ u aufU \ U ′ und 3. Dvj → Du konvergiert mit j → ∞ stark in Lp fur alle p <∞ undschwach* in L∞.

Setzeuj,k : uk + vj − u.

Dann ist |Duj,k| ≤ C und uj,k∗ vj in W 1,∞ mit k → ∞. Es folgt

limj→∞

lim infk→∞

∫

U ′′

f(Duj,k) ≥ limj→∞

∫

U ′′

f(Duj,k) (nach Fall 2)

=

∫

U ′′

f(Du) (majorisierte Konvergenz)

≥∫

U

f(Du) − C|U \ U ′′|.

(Beachte, dass o.B.d.A. Dvj → Du fast uberall.) Da außerdem (majorisierteKonvergenz)

limj→∞

supk

∫

U

|f(Duj,k) − f(Duk)| ≤ limj→∞

supk

∫

U

ω(|Duj,k −Duk|)

= limj→∞

∫

U

ω(|Dvj −Du|) = 0

gilt, wenn ω den Stetigkeitsmodul von f bezeichnet, folgt nun

lim infk→∞

∫

U

f(Duk) ≥ limj→∞

lim infk→∞

∫

U ′′

f(Duj,k) − C|U \ U ′′|

≥∫

U

f(Du) − 2C|U \ U ′′|.

Da U ′′ ⊂⊂ U beliebig war, ergibt sich daraus die Behauptung.

100

Corollary 4.25 Sei p ∈ (1,∞), g ∈ W 1,p(U), U ⊂ Rn offen und beschrankt.f : Rm×n → R sei quasikonvex und erfulle die Wachstumsbedingung

c1|F |p − c2 ≤ f(F ) ≤ c2 + c2|F |p ∀F ∈ Rm×n

fur geeignete Konstanten c1, c2 > 0. Dann nimmt I auf Ag = v ∈W 1,p(U ; Rm) :u− g ∈W 1,p

0 (U ; Rm) sein Minimum an.

Proof. Das folgt mit der direkten Methode aus Lemma 4.4 und Satz 4.24, wennman o.B.d.A. f ≥ 0 annimmt.

101

Chapter 5

Bonus Tracks

This chapter collects some extra material beyond the scope of the present lecture.In particular on

• Relaxation of integral functionals in Section 5.1,

• an alternative approach to Young measures and some generalizations inSection 5.2 and

• microstructures and laminates in Section 5.3.

5.1 Relaxation

Wir untersuchen nun Integralfunktionale, deren Integranden nicht einmal quasikon-vex sind. Im Allgemeinen nehmen diese Funktionale ihr Minimum nicht an. Ob-gleich das auf den ersten Blick pathologisch erscheint, werden wir sehen, dass ger-ade die Nicht-Existenz von Minimierern ein Indikator fur interessante physikalis-che Phanomene wie etwa die Ausbildung von Mikrostrukturen in Materialiendarstellen kann.

Beispiel: Betrachte das eindimensionale Variationsproblem

Minimiere I(w) =

∫ 1

0

f(w(x)) dx unter der Nebenbedingung

∫ 1

0

w(x) dx = α.

Ist f ≥ 0 eine Funktion mit mehr als einem Minimierer, etwa f(z1) = f(z2) = 0,so kann ein solches Funktional als Modell fur ein physikalisches System dienen,dass sich bevorzugt (also mit geringer Energie) in den ‘Phasen’ w = z1 oder

w = z2 aufhalt, wobei der Mittelwert∫ 1

0w = α festgelegt ist, so dass sich das

System i.A. nicht ausschließlich in ‘Phase z1’ bzw. ‘Phase z2’ aufhalten kann.Mit Hilfe der Jensenschen Ungleichung sieht man, dass

∫f(w) ≥

∫f ∗∗(w) ≥ f ∗∗

(∫w

)= f ∗∗(α)

102

nach unten abgeschatzt werden kann, wobei f ∗∗ die konvexe Einhullende von fbezeichne. Diese Abschatzung ist in der Tat scharf, denn zu ε > 0 kann manw1, w2 ∈ R und λ ∈ [0, 1] wahlen mit

α = λw1 + (1 − λ)w2, f ∗∗(α) ≥ λf(w1) + (1 − λ)f(w2) − ε,

so dass fur

w(x) =

w1, x ∈ (0, λ),

w2, x ∈ (λ, 1),

gilt

∫f(w) = λf(w1) + (1 − λ)f(w2) ≤ f ∗∗(α) + ε.

Ist man also nur am Minimalwert des Problems interessiert, so kann mandas Funktional I(w) =

∫f(w) durch das analytisch gutartigere ‘relaxierte’ Funk-

tional Irel(w) :=∫f ∗∗(w) ersetzen.

Fur w = y′ lasst sich dieses Funktional als ein elastisches EnergiefunktionalI(y) =

∫ 1

0f(y′) fur einen (eindimensionalen) elastischen Stab interpretieren.

‘Bevorzugte Phasen’ von f sind dann Deformationen minimaler Energie. Beachtedass hier die Nebenbedingung

∫y′ = α gerade die Randbedingung y(1)−y(0) = α

ist. Es wird also die Frage untersucht, welche Energie notig ist, um den Stab au-seinanderzuziehen bzw. zusammenzudrucken.

Ziel dieses Abschnitts ist es, die in diesem Beispiel beschriebene Vorgehensweiseauf vektorwertige Probleme zu verallgemeinern.

Definition 5.1 Zu f : Rm×n → R definieren wir die quasikonvexe Einhullende

fqk : Rm×n → [−∞,∞) als die großte quasikonvexe Funktion, die kleiner odergleich f ist.

Es ist leicht zu sehen, dass das Supremum quasikonvexer Funktionen wiederquasikonvex ist, so dass fqk wohldefiniert ist und

fqk = supg ≤ f : g ist quasikonvex

gilt. Beachte, dass fqk R-wertig oder identisch −∞ ist.

Theorem 5.2 Ist f ∈ L1loc(R

m×n), so gilt fur jede beschrankte offene MengeU ⊂ Rn mit |∂U | = 0

fqk(F ) = infϕ∈W

1,∞0 (U ;Rm)

−∫

U

f(F + ∇ϕ).

103

Proof. Mit der Notation aus Lemma 4.19 ist fqk = Qf(·, U) zu zeigen, wobeiwir nach Lemma 4.19 schon wissen, dass Qf(·, U) nicht von U abhangt. Nun isteinerseits

Qf(·, U) ≥ Qfqk(·, U) = fqk.

Um andererseits Qf(·, U) ≤ fqk nachzuweisen, genugt es wegen Qf(·, U) ≤ f zuzeigen, dass Qf(·, U) quasikonvex ist.

Dazu durfen wir o.B.d.A. Qf(·, U) > −∞ annehmen, denn gibt es ein G ∈Rm×n mit Qf(G,U) = −∞, dann ist Qf(·, U) ≡ −∞: Zu F ∈ Rm×n wahleψ ∈ W 1,∞

0 (U,Rm) mit F + Dψ ≡ G auf einer Teilmenge U ′ ⊂⊂ U . Dann aberist |U |Qf(F, U) ≤

∫U\U ′ f(F +Dψ) + infϕ∈W

1,∞0 (U ;Rm)

∫U ′ f(G+Dϕ) = −∞.

Sei nun ψ ∈W 1,∞0 stuckweise affin: Es gebe endlich viele paarweise disjunkte

offene Mengen Ui, auf denen ψ affin sei, mit∣∣∣∣∣U \

⋃

i

Ui

∣∣∣∣∣ = 0.

Zu ε > 0 wahle ϕi ∈W 1,∞0 (Ui; R

m), so dass

Qf(F +Dψ,Ui) ≥ −∫

Ui

f(F +Dψ +Dϕi) − ε.

Fur ϕ := ψ+∑

i ϕi ∈W 1,∞0 (U,Rm), wobei die ϕ durch Null auf ganz U fortgesetzt

wurden, ist∫

U

Qf(F +Dψ,U) =∑

i

|Ui|Qf(F +Dψ,Ui)

≥∫

U

f(F +Dϕ) − ε|U | ≥ (Qf(F, U) − ε) |U |.

Da ε beliebig war, ergibt sich

−∫

U

Qf(F +Dψ,U) ≥ Qf(F, U). (5.1)

Da diese Ungleich nur fur alle stuckweise affinen ψ gezeigt ist, konnen wirnoch nicht unmittelbar folgern, dass Qf(·, U) quasikonvex ist. Eine Inspektiondes Beweises von Satz 4.21 (insbesondere der Implikation ‘quasikonvex =⇒Rang-1-konvex’) zeigt jedoch, dass die Gultigkeit von (5.1) fur alle stuckweiseaffinen ψ schon ausreicht, um zu schließen, dass Qf(·, U) Rang-1-konvex ist.Damit aber ist Qf(·, U) separat konvex und insbesondere stetig. Nun erhaltman, dass (5.1) tatsachlich fur alle ψ ∈ W 1,∞

0 (U ; Rm) gilt durch ein Standard-Approximationsargument.

Wir konnen nun das Hauptergebnis dieses Paragraphen uber die Relaxierungvon Integralfunktionalen I(u) =

∫Uf(Du(x)) dx formulieren.

104

Theorem 5.3 Es seien U ⊂ Rn offen und beschrankt mit C1-Rand und 1 < p <∞. f erfulle eine p-Wachstumsbedingung der Form

c1|F | − c2 ≤ f(F ) ≤ c2(1 + |F |p).

Dann gilt fur das relaxierte Funktional Irel(u) :=∫

Ufqk(Du(x)) dx:

infAg

I = minAg

Irel.

Des Weiteren ist u ein Minimierer von Irel genau dann, wenn u (W 1,p-schwacher)Haufungspunkt einer minimierenden Folge fur I ist.

Dreh- und Angelpunkt zum Beweis dieses Satzes ist das folgende Lemma, dasin Verbindung mit Satz 4.24 zeigt, dass Irel die (W 1,p-schwach-) unterhalbstetigeEinhullende von I ist.

Lemma 5.4 Unter den Voraussetzungen von Satz 5.3 gilt: Ist u ∈W 1,p, so gibtes eine Folge (uk) mit uk − u ∈W 1,p

0 und

uk u in W 1,p sowie I(uk) → Irel(u).

Proof. Wahle U ′′j ⊂⊂ U ′

j ⊂⊂ U mit |U \ U ′′j | → 0 fur j → ∞. Ahnlich wie im

Beweis von Satz 4.24 konstruieren wir vj , so dass vj stuckweise affin auf U ′′j und

gleich u auf U \ U ′j ist. Wir durfen zudem annehmen, dass Dvj → Du in Lp(U)

konvergiert.Es seien Uj,i ⊂ U ′′

j disjunkte offene Mengen mit |U ′′j \⋃i Uj,i| = 0, auf denen

vj affin ist. Wahle εj → 0, ϕj,i ∈ W 1,∞0 (Ui) (durch 0 auf U fortgesetzt), so dass

auf Uj,i gilt

fqk(Dvj) ≥ −∫

Uj,i

f(Dvj +Dϕj,i) − εj

(vgl. Satz 5.2). Dann ist ϕj :=∑

i ϕj,i ∈ W 1,∞0 (U) und

∫

U ′′j

fqk(Dvj) =∑

i

|Uj,i|−∫

Uj,i

fqk(Dvj) ≥∫

U ′′j

f(Dvj +Dϕj) − εj|U |. (5.2)

Setze nun uj := vj + ϕj. Offensichtlich ist uj − u ∈ W 1,p0 . Des Weiteren ist

wegen Dvj → Du in Lp

limj→∞

∫

U ′′j

fqk(Dvj) = limj→∞

∫

U

fqk(Dvj) =

∫

U

fqk(Du) = Irel(u). (5.3)

Wegen (5.2) und da ϕj auf U \ U ′′j verschwindet, folgt nun aus der Wachstums-

bedingung an f , dass

c1‖uj‖p

Lp(U) − c2|U | ≤ I(uj) =

∫

U ′′j

f(Duj) +

∫

U\U ′′j

f(Dvj) ≤ C

105

ist. Nach Ubergang zu einer Teilfolge folgt daher uj w fur ein w ∈W 1,p. Nungilt nach (5.2) und (5.3)

lim supj→∞

I(uj) = lim supj→∞

∫

U ′′j

f(Duj) +

∫

U\U ′′j

f(Dvj) ≤ Irel(u).

Andererseits ist nach Satz 4.24 auch

lim infj→∞

I(uj) ≥ lim infj→∞

Irel(uj) ≥ Irel(w).

Es bleibt also nur noch u = w zu zeigen.Dazu genugt es, limj

∫χDuj = limj

∫χDvj fur χ in einer dichten Teilmenge

von Lp′, 1p

+ 1p′

= 1, nachzuweisen. Indem wir die Mengen Uj,i gegebenen-falls in mehrere Mengen zerteilen, durfen wir o.B.d.A. annehmen, dass jedesUj,i hochstens einen Durchmesser vom Betrag 1

jhat und dass (Uj+1,i)i eine Ver-

feinerung von (Uj,i)i ist fur alle j. Eine geeignete dichte Teilmenge von Lp istdann z.B. durch die Menge derjenigen Funktionen χ gegeben, fur die ein j = j(χ)existiert, so dass χ konstant auf den Uj,i ist. Fur ein solches χ ist namlich

∫χDuj =

∫χDvj +

∑

i

∫

Uj,i

χDϕj,i =

∫χDvj ∀ j ≥ j(χ).

Beweis von Satz 5.3. Offensichtlich ist infAgI ≥ infAg

Irel und nach Lemma 5.4auch umgekehrt infAg

I ≤ infAgIrel. Nach Korollar 4.25 ist außerdem infAg

Irel =minAg

Irel, so dass infAgI = minAg

Irel gezeigt ist.Ist nun u Haufungspunkt einer I-minimierenden Folge (uk), so gilt nach Satz

4.24

Irel(u) ≤ lim infk→∞

Irel(uk) ≤ lim infk→∞

I(uk) = infAg

I = minAg

Irel.

Ist umgekehrt u als Minimierer von Irel vorausgesetzt, so konnen wir nach Lemma5.4 eine Folge (uk) ⊂ Ag wahlen, so dass

uk u in W 1,p sowie I(uk) → Irel(u) = minAg

Irel = infAg

I

gilt, so dass u Haufungspunkt der I-minimierenden Folge (uk) ist.

Remark 5.5 1. Man nenn Irel die Relaxierung von I. Analoge Ergebnissegelten auch fur Funktionale der Form

I(u) =

∫

U

f(x, u(x), Du(x)) dx.

106

Hier wird das relaxierte Funktional durch

Irel(u) =

∫

U

fqk(x, u(x), Du(x)) dx

definiert, wobei fqk als die Quasikonvexifizierung der Funktion F 7→ f(x, u, F )fur feste x ∈ U und u ∈ Rm definiert wird.

2. Der wichtige Punkt ist, dass Irel – im Gegensatz zu I – sein Minimum im-mer annimmt. Minimierern von Irel entsprechen schwache Haufungspunktevon I-minimierenden Folgen. In diesen Folgen stecken jedoch unter Umstandenwesentliche Informationen uber das zugrunde liegende physikalische Prob-lem, die durch den Ubergang zu Irel verlorengehen, s. das folgende Beispiel.

Beispiel: Betrachte das Funktional

I(u) =

∫ 1

0

((u′)2 − 1)2 + u2

auf W 1,40 . Das relaxierte Funktional ist gegeben durch

Irel =

∫ 1

0

f ∗∗(u′) + u2,

wobei f ∗∗ die Konvexifizierung von f(v) = (v2 − 1)2 ist, also

f ∗∗(v) =

(v2 − 1)2 fur |v| ≥ 1,

0 fur |v| ≤ 1.

Nun ist u ≡ 0 ein Minimierer von Irel mit Irel(u) = 0, das Minimum von Iwird jedoch nicht angenommen. (I(u) = 0 =⇒

∫u2 = 0 =⇒ u ≡ 0 =⇒∫

((u′)2 − 1)2 = 1 > 0.)Physikalisch von Interesse sind nun solche u mit moglichst geringem I(u), also

gerade die minimierenden Folgen. Ein Beispiel einer minimierenden Folge ist

uk(x) =φ(kx)

kφ(kx), mit φ(x) =

1

2−∣∣∣∣x− ⌊x⌋ − 1

2

∣∣∣∣ .

Die Wahl einer minimierenden Folge ist jedoch nicht eindeutig. Trotzdem aberkann man hoffen, universelle Eigenschaften dieser Folgen zu identifizieren. In un-serem Beispiel etwa gilt fur jede minimierende Folge uk → 0 in L2. Daruberhinauswurden wir erwarten, dass

• u′k ≈ ±1 sein muss,

• der Wechsel zwischen u′k ≈ −1 und u′k ≈ +1 mit großerem k immer schnellerwird und

• im Mittel genauso oft u′k ≈ −1 wie u′k ≈ +1 gilt.

Wie man diese Aussagen prazise fassen kann, darauf werden wir im nachstenAbschnitt eingehen.

107

5.2 Young-Maße

Bei Young-Maßen handelt es sich eigentlich um eine Familie von Maßen ν =(νx)x∈Ω, Ω ⊂ Rn eine messbare Menge. Ist wk : Ω → Rd eine Folge messbarerFunktionen, so erzeugt (wk) das Young-Maß ν = (νx), wobei jedes νx ein (Sub-)Wahrscheinlichkeitsmaß auf Rd ist, wenn fur alle x0 ∈ Ω gilt:

νx0(dy) ist ‘die Wahrscheinlichkeit fur wk(x) ∈ dy im Limes k → ∞fur x nahe x0’.

Young-Maße liefern also eine Werte-Satistik von wk(x) fur spate Folgenglieder.Wir werden dies im Folgenden prazisieren. Mit dieser Interpretation lassen sichdie Vermutungen uber das universelle Verhalten von u′k aus dem letzten Beispieldes vorigen Abschnitts umformulieren zu der Aussage:

Fur große k ist u′k(x) mit hoher Wahrscheinlichkeit nahe ±1. Dabeisollte u′k ≈ −1 genauso wahrscheinlich wie u′k ≈ +1 sein und zwarunabhangig vom betrachteten Punkt x.

Zur Konstruktion von Young-Maßen benotigen wir eine technische Vorbere-itung. Es sei

C0(Rd) := f ∈ C(Rd) : lim

|x|→∞f(x) = 0 = Cc(Rd)

der – mit der sup-Norm versehene – Raum der im Unendlichen verschwindendenstetigen Funktionen. (Allgemeiner definiert man C0(U) auch fur Teilmengen Uvon Rd als den Abschluss der stetigen Funktionen mit kompaktem Trager in Ubezuglich der sup-Norm.) Wir bezeichnen den Raum der signierten Radonmaßeendlicher Masse auf Rd mit M(Rd). Nach dem Rieszschen Darstellungssatz istM(Rd) isometrisch isomorph zum Dualraum von C0(R

d), wobei ein Maß µ gemaß

C0(Rd) ∋ f 7→ 〈µ, f〉 =

∫

Rd

f(x)µ(dx)

als Funktional auf C0(Rd) wirkt. Ist Ω ⊂ Rn messbar, so ist der Raum L1(Ω;C0(R

d))als der Raum der Bochner-integrierbaren Funktionen definiert, s. etwa [Ed 65].Es stellt sich nun heraus, dass der Dualraum von L1(Ω;C0(R

d)) gerade durch denRaum L∞

w∗(Ω;M(Rd)) der schwach*-messbaren wesentlich beschrankten Funktio-nen mit Werten in M(Rd) gegeben ist.1 Ist ν ∈ L∞

w∗(Ω;M(Rd)), so schreiben wirmeist νx fur ν(x) ∈ M(Rd).

Theorem 5.6 (Haupsatz fur Young-Maße) Es sei Ω ⊂ Rn messbar mit |Ω| <∞ und wk : Ω → Rd eine Folge messbarer Funktionen. Dann gibt es eine Teilfolge(wkj

) und ein ν ∈ L∞w∗(Ω;M(Rd)), so dass

1Einen Beweis findet man etwa in [Ed 65, S. 588f].

108

(i) νx ≥ 0 und ‖νx‖M(Rd) =∫

Rd dνx ≤ 1 fur fast alle x ist.

(ii) Fur alle f ∈ C0(Rd) gilt

f(wkj)

∗ f in L∞(Ω),

wobei f gegeben ist durch

f(x) := 〈νx, f〉 =

∫

Rd

f(y) dνx(y).

(iii) Es sei K ⊂ Rd kompkt. Dann gilt

dist(wkj, K) → 0 d.M.n. =⇒ supp νx ⊂ K f.f.a. x.

Hierbei steht ‘d.M.n.’ fur Konvergenz ‘dem Maße nach”.2

(iv) Es ist ‖νx‖ = 1 f.f.a x genau dann, wenn

limM→∞

supj

||wkj| ≥M| = 0

gilt, wenn also keine Masse nach ∞ entkommt.

(v) Sei ‖νx‖ = 1 f.f.a x, f ∈ C(Rd) und A ⊂ Ω messbar. Ist dann

(f(wkj)) relativ schwach folgenkompakt in L1(A),

so folgtf(wkj

) f in L1(A).

(vi) Gilt ‖νx‖ = 1 f.f.a x, so stimmt in (iii) auch die umgekehrte Implikation‘⇐=’.

Definition 5.7 Die Abbildung ν : Ω → M(Rd) ist das von (wkj) erzeugte

Young-Maß.

Remark 5.8 1. Der zentrale Punkt ist (ii): Das Young-Maß verschlusselt dieschwach*-Limites aller nicht-linearer Funktionen der wkj

. Zur Erinnerung:In einer Hausaufgabe wurde gezeigt, dass schwache Limites nicht mit nicht-linearen Operationen kommutieren. Selbst wenn der schwach*-Limes derFolge (wkj

) existiert und bekannt ist, so kann man daraus allein also keineRuckschlusse auf die Werte der schwach*-Grenzwerte von f(wkj

) gwinnen.

2Seien v, v1, v2, . . . : Ω → R messbar, Ω ⊂ Rn messbar mit |Ω| < ∞. Man sagt die Folge vk

konvergiert dem Maße nach gegen v, wenn limk→∞ |x : |vk(x) − v(x)| ≥ ε| = 0 gilt fur alleε > 0. (Das entspricht dem Begriff der stochastischen Konvergenz in der Wahrscheinlichkeits-theorie.)

109

2. Eine (technisch etwas kompliziertere) Version dieses Satzes gilt auch fur|Ω| = ∞.

3. Gilt∫Ω

Φ(|wk|) ≤ C fur ein Φ : [0,∞) → R mit Φ(t) → ∞ fur t → ∞, soist limM→∞ supk ||wk| ≥ M| = 0: Zu ε > 0 wahle Mε, so dass Φ(t) ≥ ε−1

fur t > Mε gilt. Dann ist

supk

||wk| ≥ M| ≤ supk

ε

∫

|wk|≥MΦ(|wk|) ≤ Cε ∀M ≥ Mε.

4. Aus (v) ergibt sich: Ist (wk) beschrankt in Lp und f ∈ C(Rd) mit |f(y)| ≤C(1 + |y|q), q < p, dann gilt

f(wkj) f in L

pq .

Das folgt aus der Tatsache, dass f(wkj) beschrankt in L

pq ist: Einerseits

impliziert dies, dass f(wkj) relativ schwach folgenkompakt in L1 ist, so dass

nach (v) f(wkj) f in L1 gilt. Andererseits erhalt man daraus, dass jede

Teilfolge eine in Lpq konvergente Teilfolge besitzt. Zusammen ergibt sich die

Behauptung.

Fur p > 1, f = id zeigt dies

wkj w in Lp, w(x) = 〈νx, id〉.

Proof. (i) & (ii) Setze Wk(x) := δwk(x). Dann ist ‖Wk(x)‖M = 1 fur alle xund x 7→ 〈Wk(x), f〉 = f(wk(x)) messbar fur alle f ∈ C0. Damit ist (Wk) alsFolge in L∞

w∗(Ω;M(Rd)) = (L1(Ω;C0(Rd)))′ erkannt mit ‖Wk‖L∞

w∗ (Ω;M) = 1. Da

nun L1(Ω;C0(Rd)) separabel ist, ist die schwach*-Topologie auf beschrankten

Teilmengen von L∞w∗(Ω;M(Rd)) metrisierbar und wir erhalten aus dem Satz von

Alaoglu eine konvergente Teilfolge Wkj

∗ ν mit ‖ν‖L∞

w∗ (Ω;M) ≤ 1.

Fur ϕ ∈ L1(Ω), f ∈ C0(Rd) betrachte die Funktion ϕ ⊗ f ∈ L1(Ω;C0(R

d)),definiert durch ϕ⊗ f(x) = ϕ(x)f ∈ C0(R

d). Es gilt

∫

Ω

ϕ(x)f(wkj(x)) dx =

∫

Ω

ϕ(x)〈Wkj(x), f〉 dx =

∫

Ω

〈Wkj(x), ϕ⊗ f〉 dx

→∫

Ω

〈νx, ϕ⊗ f〉 dx =

∫

Ω

ϕ(x)〈νx, f〉 dx

fur j → ∞, was (ii) zeigt.Des Weiteren zeigt diese Rechnung

∫

Ω

ϕ(x)〈νx, f〉 dx ≥ 0 ∀ϕ ≥ 0 ∀ f ≥ 0.

110

Dann aber gilt 〈νx, f〉 ≥ 0 f.f.a x fur alle f ≥ 0. Da C0 separabel ist, ergibt sichdaraus 〈νx, f〉 ≥ 0 fur alle f ≥ 0 f.f.a x und damit auch νx ≥ 0 f.f.a. x, was denBeweis von (i) beendet.

(iii) Wir mussen 〈νx, f〉 = 0 fur alle f ∈ C0(Rd \ K) nachweisen. Zu f ∈

C0(Rd \K) und ε > 0 wahle Cε > 0, so dass

|f(y)| ≤ ε+ Cεdist(y,K)

ist. (Das ist moglich, da f(y) → 0 geht fur |y| → ∞.) Dann aber folgt

(|f | − ε)+(wkj) ≤ Cεdist(wkj

, K) → 0 d.M.n.

und (|f | − ε)+(wkj)

∗ (|f | − ε)+, so dass

〈νx, (|f | − ε)+〉 = (|f | − ε)+(x) = 0 f.f.a. x

gilt. Da ε > 0 beliebig war, folgt daraus nun mit monotoner Konvergenz 〈νx, |f |〉 =0 und somit 〈νx, f〉 = 0 f.f.a. x.

(iv) Es gilt ‖νx‖M ≤ 1 fast uberall. Daher ist ‖νx‖M = 1 f.f.a. x genau dann,wenn

∫Ω‖νx‖M = |Ω| ist.

Definiere θm ∈ C0(Rd), m ∈ N, durch

θm(y) :=

1, |y| ≤ m,

1 +m− |y|, m ≤ |y| ≤ m+ 1,

0, |y| ≥ m+ 1.

(5.4)

Dann ist einerseits

limj→∞

∫

Ω

θm(wkj) =

∫

Ω

〈νx, θm〉 und limm→∞

∫

Ω

〈νx, θm〉 =

∫

Ω

‖νx‖M,

wobei letzteres aus θm ր 1 und einer zweimaligen Anwendung des Satzes vonder monotonen Konvergenz folgt. Andererseits ist

∫

Ω

θm(wkj)

≥ ||wkj

| ≤ m| = |Ω| − ||wkj| > m|,

≤ ||wkj| ≤ m+ 1| = |Ω| − ||wkj

| > m+ 1|,

so dass sich

|Ω| − supj

||wkj| > m| ≤ lim

j→∞

∫

Ω

θm(wkj) ≤ |Ω| − lim inf

j→∞||wkj

| > m+ 1|

ergibt.Ist also limm→∞ supj ||wkj

| > m| = 0, so erhalten wir tatsachlich

|Ω| ≤ limm→∞

limj→∞

∫

Ω

θm(wkj) =

∫

Ω

‖νx‖M.

111

Ist nun umgekehrt∫Ω‖νx‖M = |Ω|, dann schließen wir

limm→∞

lim infj→∞

||wkj| > m+ 1| = 0. (5.5)

Da auch jede Telfolge von (wkj) das Young-Maß ν generiert, bleibt diese Aussage

auch fur alle Teilfolgen von (wkj) richtig. Das zeigt, dass sogar

limm→∞

supj

||wkj| > m+ 1| = 0

gilt: Ware dies nicht der Fall, so gabe es ein ε > 0, naturliche Zahlen m1 < m2 <. . . und Indizes j(m1), j(m2), . . . mit

||wkj(mi)| > mi + 1| ≥ ε ∀ i.

Da fur endlich viele Folgenglieder wk1, wk2, . . . , wkNstets

limm→∞

supj=1,...,N

||wkj| > m+ 1| = 0

ist, gilt j(mi) → ∞ mit mi → ∞. Ggf. nach Ubergang zu einer weiteren Teilfolgeist dann i 7→ j(mi) streng monoton in i und wir erhalten eine Teilfolge (wkj(mi))i

von (wkj)j mit

lim infi→∞

||wkj(mi)| > m+ 1| ≥ ε ∀ m > 0

im Widerspruch zu (5.5).(v) Sei f(wkj

) relativ schwach folgenkompakt in L1(A). Mit Hilfe des Satzesvon Dunford-Pettis3 sieht man leicht, dass dies genau dann der Fall ist, wennsowohl f+(wkj

) als auch f−(wkj) relativ schwach folgenkompakt in L1(A) sind.

Wir konnen also o.B.d.A. f ≥ 0 voraussetzen. Setze fm := θmf ∈ Cc(Rd), wobei

θm wie in (5.4) definiert ist.Wir zeigen zunachst, dass fur alle ϕ ∈ L∞(A)

limm→∞

∫

A

ϕfm(wkj) =

∫

A

ϕf(wkj) (5.6)

gleichmaßig in j gilt: Da f ≥ 0 ist, gilt∣∣∣∣∫

A

ϕ(fm(wkj

) − f(wkj))∣∣∣∣ ≤ C

∫

|wkj|≥m

f(wkj)

≤ C

∫

f(wkj)≥M

f(wkj) + C

∫

|wkj|≥m,f(wkj

)<Mf(wkj

)

≤ C supj

∫

f(wkj)≥M

f(wkj) + CM sup

j

||wkj| ≥ m|

3Der Satz von Dunford-Pettis besagt, dass eine beschrankte Familie F ⊂ L1(µ), µ einendliches Maß, genau dann relativ schwach folgenkompakt in L1(µ) ist, wenn F gleichgradigintegrierbar ist.

112

fur M > 0. Nun ist (f(wkj)) nach dem Satz von Dunford-Pettis gleichgradig inte-

grierbar auf A, so dass zu ε > 0 ein M existiert mit C supj

∫f(wkj

)≥M f(wkj) < ε

2.

Wahlt man nun m – unabhangig von j – hinreichend groß, so wird nach der schonbewiesenen Aussage (iv) auch CM supj ||wkj

| ≥ m| < ε2. Dies zeigt die Be-

hauptung.Nun gilt fur fm ∈ Cc(R

d) nach (ii)

limj→∞

∫

A

ϕfm(wkj) =

∫

A

ϕ〈νx, fm〉.

Mit Hilfe der gleichmaßigen Konvergenz in (5.6) folgt daraus dann

limj→∞

∫

A

ϕf(wkj) = lim

m→∞

∫

A

ϕ〈νx, fm〉 =

∫

A

ϕ〈νx, f〉,

wobei sich die letzte Gleichheit aus dem Satz von der monotonen Konvergenzergibt, indem man

∫Aϕ〈νx, fm〉 =

∫ϕ<0 ϕ〈νx, fm〉 +

∫ϕ≥0 ϕ〈νx, fm〉 schreibt.

Das war zu zeigen.(vi) Betrachte

f = mindist(·, K), 1 ∈ L∞ ∩ C.Ist supp νx ⊂ K f.f.a. x, so ist auch 〈νx, f〉 = 0 f.f.a. x. Mit f ∈ L∞ ∩ C folgtandererseits aus (v), dass f(wkj

) f in L1 konvergiert, so dass insbesondere∫f(wkj

) → 0 gilt. Dann aber ist fur alle ε > 0

|dist(wkj, K) ≥ ε| ≤ 1

ε

∫

dist(wkj,K)≥ε

f(wkj) → 0.

Beispiele:

1. Sei h : R → R 1-periodisch mit

h(x) =

a, 0 ≤ x < λ,

b, λ ≤ x < 1,a, b ∈ R, λ ∈ [0, 1].

Definiere wk : [0, 1] → R durch wk(x) := h(kx). Dann gilt wk∗ w in

L∞(0, 1) mit w ≡ λa + (1 − λ)b (Ubung) und genauso konvergiert f(wk)schwach* gegen die Konstante Funktion λf(a)+(1−λ)f(b) in L∞(0, 1) furalle f : R → R. Dies zeigt, dass (wk) das Young-Maß (νx) mit

νx = λδa + (1 − λ)δb ∀x

generiert. Beachte, dass νx hier nicht von x abhangt. Man sagt in diesemFall, das Young-Maß ν ist homogen.

113

2. Allgemeiner sei h ∈ L1loc(R

n) periodisch mit Einheitszelle [0, 1]n, d.h. f(x+z) = f(x) fur alle z ∈ Z. Definiere wk : [0, 1]n → R durch wk(x) := h(kx).Dann gilt fur alle f ∈ C0(R) (Ubung)

f(wk)∗ const. =

∫

[0,1]nf(h(z)) dz in L∞([0, 1]n).

(wk) generiert also das homogene Young-Maß ν, wobei νx das Bildmaß desLebesgue-Maßes auf [0, 1]n unter der Abbildung h ist:

νx(A) = |(h|[0,1]n)−1(A)| = |[0, 1]n ∩ h−1(A)| ∀x.

3. Wir konnen nun insbesondere die eingangs gestellten Fragen nach uni-versellen Eigenschaften von minimierenden Folgen des Funktionals

I(u) =

∫ 1

0

((u′)2 − 1)2 + u2, u ∈W 1,40

rigoros beantworten. Sei (uk) eine solche minimierende Folge, wk := u′k.Dann gibt es eine Teilfolge (wkj

) die ein Young-Maß ν induziert. Da (wk)beschrankt in L4 ist, gilt ‖νx‖M = 1 f.f.a. x (s. Bemerkung 5.8,3 mit Φ(t) =t4).

Zu ε > 0 wahle nun δ > 0, so dass (x2−1)2 < δ =⇒ max|x−1|, |x+1| <ε. Dann gilt

|dist(wkj, −1, 1) ≥ ε| ≤ |(w2

kj− 1)2 ≥ δ|

≤ 1

δ

∫ 1

0

(w2kj− 1)2 ≤ 1

δI(ukj

) → 0

mit j → ∞. Dann aber folgt aus Satz 5.6(iii) supp νx ⊂ −1, 1 f.f.a. x.Zusammenfassend konnen wir festhalten, dass es λ(x) ∈ [0, 1] gibt, so dass

νx = λ(x)δ−1 + (1 − λ(x))δ1

ist.

Aus Bemerkung 5.8,4 folgt nun

u′kj= wkj

w in L4

mit

w(x) = 〈νx, id〉 =

∫

R

y dνx(y) = −λ(x) + (1 − λ(x)) = 1 − 2λ(x).

114

Andererseits gilt wegen I(uk) → 0 auch ukj→ 0 in L2 und somit

∫wϕ = lim

j→∞

∫wkj

ϕ = limj→∞

∫u′kj

ϕ = − limj→∞

∫ukj

ϕ′ = 0

fur ϕ ∈ C∞c (0, 1). Es muss also w ≡ 0 sein, d.h. λ(x) = 1

2f.f.a. x.

(wkj) generiert also das homogene Young-Maß

νx =1

2(δ−1 + δ1) f.f.a. x.

Da ν dadurch eindeutig gegeben ist, wird ν sogar von der ganzen Folge (u′k)erzeugt.

Bevor wir uns weiteren Anwendungen zuwenden, wollen wir noch prazisieren,in welchem Sinne ein von (wk) erzeugtes Young-Maß als Werte-Statistik von wk(x)fur große k aufzufassen ist. Sei Ω ⊂ Rn offen, so dass Bδ(x) ⊂ Ω fur hinreichendkleine δ > 0 ist. Durch

〈ν(k)x,δ , f〉 = −

∫

Bδ(x)

f(wk(z)) dz

wird dann ein lineares Funktional auf C0(Rd), also ein Maß ν

(k)x,δ auf Rd definiert,

das “die Wahrscheinlichkeit misst, dass wk(z) in dy liegt fur z ∈ Bδ(x)”:

ν(k)x,δ (A) = −

∫

Bδ(x)

χA(wk(z)) dz =1

|Bδ(x)||z ∈ Bδ(x) : wk(z) ∈ A|,

d.h. ν(k)x,δ ist das Bildmaß der Gleichverteilung auf Bδ(x) unter wk.

Corollary 5.9 Es giltlimδց0

limk→∞

ν(k)x,δ = νx

in der schwach*-Topologie auf M(Rd) fur fast alle x ∈ Ω.

Proof. Fur f ∈ C0(Rd) gilt f(wk)

∗ f mit f(z) = 〈νz, f〉, so dass

limk→∞

〈ν(k)x,δ , f〉 = lim

k→∞−∫

Bδ(x)

f(wk(z)) dz = −∫

Bδ(x)

〈νz, f〉 dz.

Dies zeigt

ν(k)x,δ

∗ νx,δ fur 〈νx,δ, f〉 = −

∫

Bδ(x)

〈νz, f〉 dz.

Fur festes f ∈ C0(Rd) ist nun fast jedes x ∈ Ω ein Lebesgue-Punkt von f , so

dass

limδց0

〈νx,δ, f〉 = limδց0

−∫

Bδ(x)

〈νz, f〉 dz = 〈νx, f〉

115

fur fast alle x folgt. Damit gilt aber auch

limδց0

〈νx,δ, f〉 = 〈νx, f〉

auf einer abzahlbar dichten Teilmenge von C0(Rd) fur alle x ∈ Ω \ N , N eine

geeignete Nullmenge. Da nun ‖νx,δ‖M ≤ 1 ist fur alle δ und x, so dass jedeTeilfolge konvergente Teilfolgen besitzt, zeigt dies, dass fur x /∈ N tatsachlich

νx,δ∗ νx mit δ ց 0

gilt.

Remark 5.10 Gemaß dieser Interpretation des Young-Maßes als Werte-Statistikkann man erwarten, dass starke Konvergenz voliegt, wenn jedes νx bei einemeinzigen Wert konzentriert ist. Tatsachlich gilt:

wk → w d.M.n ⇐⇒ νx = δw(x) f.f.a. x.

Beweis: Ubung.

Zur Anwendung von Young-Maßen auf Integralfunktionale geben wir zunachstdie folgenden beiden Satze (ohne Beweis) an.

Theorem 5.11 wk : Ω → Rd generiere das Young-Maß ν. Es sei f : Ω×Rd → R

stetig und nach unten beschrankt. Dann gilt

lim infk→∞

∫

Ω

f(x, wk(x)) dx ≥∫

Ω

∫

Rd

f(x, y) dνx(y) dx.

Ist (f(·, wk(·)))k schwach relativ folgenkompakt in L1(Ω), so gilt sogar

f(·, wk(·)) f in L1(Ω), f(x) =

∫

Rd

f(x, y) dνx(y).

Theorem 5.12 Es seien uk : Ω → Rd, vk : Ω → Rd′ Funktionenfolgen, so dassuk → u fast uberall konvergiere und (vk) das Young-Maß ν generiere. Dannerzeugt (uk, vk) : Ω → Rd+d′ das Young-Maß x 7→ δu(x) ⊗ νx.

Wir untersuchen nun die Unterhalbstetigkeit des Funktionals

I(u) =

∫

Ω

f(x, u(x), Du(x)) dx

auf W 1,p(Ω; Rm), p > 1. Gilt uk u in W 1,p, so gibt es eine Teilfolge (wieder mituk bezeichnet), so dass uk → u fast uberall konvergiert und (Duk) ein Young-Maßν erzeugt. Nach Satz 5.12 erzeugt dann (uk, Duk) das Young-Maß δu(x) ⊗ νx.

116

Nach Satz 5.11 wiederum gilt dann fur stetiges nach unten beschranktes f

lim infk→∞

∫

Ω

f(x, uk(x), Duk(x)) dx ≥∫

Ω

∫

Rm

∫

Rm×n

f(x, y, z) dδu(x) ⊗ νx(y, z) dx

=

∫

Ω

∫

Rm×n

f(x, u(x), z) dνx(z) dx.

Unterhalbstetigkeit fur I ergabe sich also, wenn wir∫

Rm×n

g(z) dνx(z) ≥ g(〈νx, id〉) (5.7)

mit g = f(x, u(x), ·) abschatzen konnten. (Beachte 〈νx, id〉 = Du(x).) Im Fol-genden werden wir sehen, dass das gerade fur quasikonvexe Funktionen richtigist.

Dazu mussen wir die von Gradienten induzierten Young-Maße genauer unter-suchen. Wir setzen im Folgenden voraus, dass Ω ⊂ Rn offen und beschrankt mitC1-Rand (oder auch nur Lipschitz-Rand) ist.

Definition 5.13 ν ∈ L∞w∗(Ω;M(Rd)) heißt W 1,p-Gradienten-Young-Maß (oder

kurz W 1,p-GYM), wenn es eine Folge (uk) ⊂W 1,p(Ω; Rm) gibt, so dass

uk u in W 1,p(Ω; Rm) (bzw. “∗” falls p = ∞)

undδDuk

∗ ν in L∞

w∗(Ω;M(Rd))

gelten.

Die folgenden Satze, die wir wieder ohne Beweis angeben, liefern eine vollstandigeCharakterisierung der GYMs:

Theorem 5.14 ν ∈ L∞w∗(Ω;M(Rd)) ist ein W 1,∞-GYM genau dann, wenn νx ≥

0 f.u. ist und es eine kompakte Menge K und ein u ∈W 1,∞(Ω; Rm) gibt, so dassgilt:

(i) supp νx ⊂ K f.f.a. x,

(ii) 〈νx, id〉 = Du(x) f.f.a. x und

(iii) 〈νx, f〉 ≥ f(〈νx, id〉) f.f.a. x fur alle quasikonvexen Funktionen f : Rm×n →R.

Die Version fur p <∞ dieses Satzes lautet

Theorem 5.15 ν ∈ L∞w∗(Ω;M(Rd)) ist ein W 1,p-GYM, p < ∞, genau dann,

wenn νx ≥ 0 f.u. ist und es ein u ∈W 1,p(Ω; Rm) gibt, so dass gilt:

117

(i)∫Ω

∫Rm×n |F |p dνx(F ) dx <∞,

(ii) 〈νx, id〉 = Du(x) f.f.a. x und

(iii) 〈νx, f〉 ≥ f(〈νx, id〉) f.f.a. x fur alle quasikonvexen Funktionen f : Rm×n →R, die einer Wachstumsbedingung der Form |f(F )| ≤ C(1+ |F |p) genugen.

Dieser Satz zeigt insbesondere, dass (5.7) tatsachlich fur alle quasikonvexenFunktionen g unter geeigneten Wachstumsvoraussetzungen gilt.

GYMs verhalten sich also in gewisser Hinsicht ‘dual’ zu den quasikonvexenFunktionen: Wahrend quasikonvexe Funktionen die Jensensche Ungleichung furalle Gradientenfelder erfullen, erfullen die Gradienten-Young-Maße die Jensen-sche Ungleichung fur alle quasikonvexen Funktionen.

Wie wir zu Beginn dieses Abschnitts gesehen haben, liefern die Young-Maßeaber gerade auch dann wertvolle Informationen, wenn die Integranden nichtquasikonvex sind und ein Minimierer im Allgemeinen nicht angenommen wird.Ahnlich wie man fur manche Differentialgleichungen, die keine klassische Losungbesitzen, immer noch ‘schwache Losungen’ konstruieren kann, kann auch derDefinitionsbereich eines Integralfunktionals geeignet erweitert werden, so dass‘verallgemeinerte Minimierer’ existieren.

Betrachte das Funktional

I(u) =

∫

Ω

f(Du)

aufA = u ∈W 1,p(Ω; Rm) : u− g ∈W 1,p

0 .Wir setzen I zu einem Funktional J auf die Menge

Y := ν : Ω → M(Rm×n) : ν ist W 1,p-GYM mit 〈νx, id〉 = Du fur ein u ∈ A

gemaß

J(ν) =

∫

Ω

〈νx, f〉 dx

fort. Es gilt dann der folgende Satz (o. Beweis):

Theorem 5.16 Sei p > 1, f stetig mit c1|F |p − c2 ≤ f(F ) ≤ c2(1 + |F |p) furgeeignete Konstanten c1, c2 > 0. Dann gilt

infAI = min

YJ.

Die Minimierer von J sind gerade die von den minimierenden Folgen erzeugtenGYMs.

Insbesondere hat I einen Minimierer in A genau dann, wenn ein Minimiererν von J existiert, so dass νx ein Dirac-Maß ist f.f.a. x.

118

5.3 Mikrostrukturen und Laminate

In Satz 4.21 haben wir gesehen, dass Quasikonvexitat Rang-1-Konvexitat im-pliziert. Die wesentliche Konstruktion im Beweis dafur war eine Feinschichtungvon Lagen mit Deformationsgradient A bzw. B, Rang(A − B) = 1, so dass dieresultierende gemittelte Deformation λA + (1 − λ)B ergab. Iteriert man dieseKonstruktion, so gelangt man zum Begriff des Laminats. Dabei handelt es sichum diejenigen homogenen GYMs (also Maße), die durch Deformationen solcherArt induziert werden. (Mehr Einzelheiten hierzu, insbesondere die exakte Defi-nition von Laminaten, findet man in [Mu 98].)

Die interessante Frage ist nun, ob tatsachlich alle GYMs auf diese Weiseentstehen. Wie wir im letzten Abschnitt bemerkt haben, sind die GYMs geradedie verallgemeinerten Minimierer von Integralfunktionalen, die von Gradientenabhangen. Sie verschlusseln die Mikrostrukturen, die von den minimierendenFunktionenfolgen dieser Funktionale erzeugt werden. Unsere Frage lautet also:

Sind alle Mikrostrukturen Laminate?

Es stellt sich nun heraus, dass – ahnlich wie GYMs die dualen Objektezu den quasikonvexen Funktionen sind – die Laminate dual zu den Rang-1-konvexen Funktionen sind. (Ein Wahrscheinlichkeitsmaß mit kompaktem Tragerist genau dann ein Laminat, wenn die Jensensche Ungleichung fur alle Rang-1-konvexen Funktionen erfullt ist.) Daraus ergibt sich schließlich, dass unsere Frageaquivalent zu der schon fruher erorterten Frage

Gilt Quasikonvexitat =⇒ Rang-1-Konvexitat?

ist.Die Vermutung von Morrey aus dem Jahre 1952, dass das nicht stimmt, wurde

erst 1993 von Sverak bewiesen. Zum Schluss dieser Vorlesung geben wir hierseinen Beweis wieder. In Anwendung auf die mathematische Theorie der Mate-rialwissenschaften bedeutet dies, dass es Mikrostrukturen gibt, die kompliziertersind als selbst auf verschiedensten Skalen beliebig verschachtelte Materialschich-tungen.

Theorem 5.17 Es sei m ≥ 3, n ≥ 2. Dann gibt es eine Rang-1-konvexe Funk-tion f : Rm×n → R, die nicht quasikonvex ist.

Wir benotigen die folgende nutzliche Charakterisierung der Quasikonvexitat.

Lemma 5.18 f : Rm×n → R ist quasikonvex genau dann, wenn∫

Q

f(F +Dϕ(x)) dx ≥ f(F )

fur alle ϕ ∈W 1,∞(Rn), die Q = (0, 1)n-periodisch sind, gilt.

119

Direkt aus der Definition 4.17 ergibt sich, dass diese Bedingung hinreichendfur die Quasikonvexitat von f ist.

Ist nun umgekehrt f als quasikonvex vorausgesetzt, so wahle Abschneidefunk-tionen θk ∈ C∞

c (Rn) mit 0 ≤ θk ≤ 1, θk ≡ 1 auf (−k + 1, k − 1)n, θk ≡ 0 aufRn \ (−k, k)n und |Dθk| ≤ C. Fur ϕk = θkϕ folgt dann

(2k)n

∫

Q

f(F +Dϕ) =

∫

(−k,k)n

f(F +Dϕ)

≥∫

(−k,k)n

f(F +Dϕk) − Ckn−1 ≥ (2k)nf(F ) − Ckn−1.

Teilt man durch (2k)n und lasst k → ∞ gehen, so erhalt man die Behauptung.

Beweis von Satz 5.17. O.B.d.A. sei m = 3 und n = 2. Betrachte die (0, 1)2-periodische Funktion u : R2 → R3 mit

u(x) =1

2π

sin 2πxsin 2πy

sin 2π(x+ y)

.

Es gilt

Du(x) =

cos 2πx 00 cos 2πy

cos 2π(x+ y) cos 2π(x+ y)

,

so dass

L := spanDu(x) : x ∈ R2 =

r 00 st t

: r, s, t ∈ R

ist. Beachte, dass die einzigen Rang-1-Geraden in L die Geraden von der FormF + Ra⊗ b mit

a⊗ b =

1 00 00 0

,

0 00 10 0

oder

0 00 01 1

sind.Betrachte nun die Funktion g : L→ R mit

g

r 00 st t

= −rst.

120

Offenbar ist g Rang-1-affin auf L. Außerdem gilt∫

(0,1)2g(Du) =

∫

(0,1)2− cos 2πx cos 2πy cos 2π(x+ y)

=

∫ 1

0

∫ 1

0

− cos2 2πx cos2 2πy + cos 2πx sin 2πx cos 2πy sin 2πy

= −1

2· 1

2+ 0 = −1

4< 0 = g(0).

Der Beweis ist hier jedoch noch nicht beendet, da g ja nur auf L definiert ist.Wir konstruieren nun auf ganz R3×2 eine Rang-1-konvexe Funktion, die auf L

nahe bei g liegt: Es sei P die orthogonale Projektion von R3×2 auf L. Setze

fε,k(F ) = g(PF ) + ε(|F |2 + |F |4

)+ k|F − PF |2.

Wir uberlegen uns zunachst, dass fur jedes ε > 0 ein k(ε) > 0 existiert, sodass fε,k Rang-1-konvex ist: Ware dies nicht der Fall, so gabe es ε > 0, so dasskein fε,k, k ∈ N Rang-1-konvex ist. Da die fε,k glatte Funktionen sind, bedeutetdas, dass es zu jedem k ∈ N Matrizen Fk ∈ R3×2 und Vektoren ak ∈ R3, bk ∈ R2

mit |ak| = |bk| = 1 existieren, so dass

D2fε,k(Fk)(ak ⊗ bk, ak ⊗ bk) ≤ 0. (5.8)

Nun ist

D2fε,k(F )(X,X) (5.9)

= D2g(PF )(PX, PX) + 2ε|X|2 + ε(4|F |2|X|2 + 8|F : X|2

)+ k|X − PX|2.

(5.10)

Da g(PF ) kubisch in F ist, skaliert D2g(PF ) linear in F . Aus (5.8) und (5.9)ergibt sich damit, dass |Fk| ≤ C beschrankt ist. (Beachte |ak⊗bk| = |ak|·|bk| = 1.)

Nach Ubergang zu Teilfolgen (wieder mit k indiziert) erhalten wir

Fk → F, ak → a, bk → b.

Im Limes k → ∞ folgt dann aber aus (5.8) und (5.9)

D2g(PF )(Pa⊗ b, Pa⊗ b) + 2ε+ j|a⊗ b− Pa⊗ b|2 ≤ 0 ∀ j > 0.

Daher ist Pa⊗ b = a⊗ b, also a⊗ b ∈ L. Die Abbildung t 7→ g(PF + tPa⊗ b) istalso Rang-1-affin, so dass D2g(PF )(Pa ⊗ b, Pa ⊗ b) = 0 ist. Zusammengefasstergibt sich der Widerspruch 2ε ≤ 0.

Wir konnen also ε > 0 so klein wahlen, dass∫

Q

fε,k(ε)(Du) =

∫

Q

g(Du) + ε(|Du|2 + |Du|4

)< 0 = fε,k(ε)(0),

fε,k(ε) aber Rang-1-konvex ist.

121

Bibliography

[Dac 82] B. Dacorogna: Weak continuity and weak lower semicontinuity of non-linear functionals. Lecture Notes in Mathematics, 922. Springer-Verlag,Berlin · New York, 1982.

[Dac 08] B. Dacorogna: Direct methods in the calculus of variations. Secondedition. Applied Mathematical Sciences, 78. Springer-Verlag, New York,2008.

[Ed 65] R. E. Edwards: Functional Analysis. Holt, Rinehart and Winston, NewYork, 1965.

[Ev 90] L. C. Evans: Weak convergence methods for nonlinear partial differentialequations. CBMS Regional Conference Series in Mathematics, 74. AmericanMathematical Society, Providence, RI, 1990.

[Ev 98] L. C. Evans: Partial Differential Equations. Graduate Studies in Math-ematics 19. American Mathematical Society, Providence, RI, 1998.

[FoMu 99] I. Fonseca, S. Muller: A-quasiconvexity, lower semicontinuity, andYoung measures. SIAM J. Math. Anal. . 30 (1999), 1355–1390.

[Mu 98] S. Muller: Variational models for microstructure and phase transitions.Lecture Notes Max-Planck-Institute for Mathematics in the Sciences 1998.http://www.mis.mpg.de/preprints/ln/lecturenote-0298.pdf

[Sch 10] B. Schmidt: Partielle Differentialgleichungen. Lecture Notes TUMunchen 2010. http://www-m7.ma.tum.de/foswiki/pub/M7/Analysis/PDE09/Maerz 05.pdf

[Sch 09] B. Schmidt: Partielle Differentialgleichungen 2. Lecture Notes TUMunchen 2009.

[St 08] M. Struwe: Variational methods. Applications to nonlinear partial differ-ential equations and Hamiltonian systems. Fourth edition. Ergebnisse derMathematik und ihrer Grenzgebiete. Springer-Verlag, Berlin, 2008.

122

[Ta 79] L. Tartar: Compensated compactness and applications to partial differen-tial equations. Nonlinear analysis and mechanics: Heriot-Watt Symposium,Vol. IV, pp. 136–212, Res. Notes in Math., 39, Pitman, Boston, Mass.-London, 1979.

[We] D. Werner: Funktionalanalysis. Springer-Verlag, Berlin · Heidelberg · NewYork 2000.

123

Documents

Weak Convergence Methods for Nonlinear Partial …Weak Convergence Methods for Nonlinear Partial Diﬀerential Equations (PDE 2) Lecture Notes Summer Term 2010 Bernd Schmidt∗ version