Upload
others
View
1
Download
0
Embed Size (px)
Citation preview
B4. Gauge Field Theory
It is a striking fact about Nature that there exist gauge fields which play a key role inmediating interactions. At the āfundamental levelā of particle physics one has the electro-magnetic field, the various fields involved in the standard model, and the gravitational field.In various areas of condensed matter physics it has also been found useful to introduce gaugefield descriptions of certain kinds of collective modes.
It is not possible to survey all of these. In what follows I will (i) discuss some of theunderlying general features of all gauge fields, and why we have them, then (ii) discuss howthey are described in a path integral formulation, using the ideas first developed by Fadeevand Popov. I will show how this works in both Abelian gauge theories (like QED) and innon-Abelian theories (the Yang-Mills model), along with brief remarks about the electroweaktheory. I will then give a brief description of gauge theory for a condensed matter system.It is not possible to go into great detail here - there is no space - but we can at least see howthese applications come about.
B.4.1 GAUGE FIELDS - GENERAL FEATURES
The long history of this topic means that before we launch into the modern formulationof the theory, it is necessary to look at some more general aspects of the theory. We begin bya quick resume of some of the history, followed by a comparison of the classical and quantumversions of electromagnetic theory. A key point emerges from this comparison: there is acrucial difference in the role played by the electrodynamic gauge potential AĀµ(x) in the 2versions, as shown in the Aharonov-Bohm effect, which is discussed thoroughly here.
B.4.1 (a) SOME HISTORICAL REMARKS
the discovery of the electromagnetic field and of the General Relativistic description ofspacetime also ushered in the idea of gauge fields, in early work by Weyl - we have knownsince the 1960ās that they are here to stay. The discovery of QM, and the slow development ofQFT made it clear that the quantized versions of such fields mediated interactions in physics- the EM and spacetime fields being bosonic, with spin-1 and spin-2 respectively. Once thespin-statistics theorem was first given (by Pauli) it was also clear that only bosonic fieldscould be associated with classical macroscopic force fields. By the early 1960ās (notably in thework of Weinberg) it was known that the only consistent theories of massless bosonic fieldshad to be either spin-0 (Higgs field - given a mass by the āHiggsā mechanism, actually firstdiscussed by PW Anderson and N. Bogoliubov), spin-1 (EM field), or spin-2 (gravitationalfield). In condensed matter theory, the key mechanism needed to give these gauge fields amass (i.e., a finite energy gap) was found - the appearance of an āorder parameterā, a concept
1
first defined by Landau and Lifshitz in 1935, and developed with great effect by Landau, andlater by others, including London(1938), and BCS and Bogoliubov (1956-1962). Andersonāspaper, with the hypothesis of the Higgs boson, appeared in 1963. The papers of Higgs, andof Englert and Brout, were all written and published in 1964, and very quickly followed bypapers of Kibble (1964) and Guralnilk and Hagen (1964). The appelation āHiggs bosonā isa misnomer.
However, there were serious mathematical problems, already noted in the 1950ās in thecontext of QED. These concerned the renormalizability of QED, and the practical task ofrenormalizing calculations of specific physical quantities. The very small coupling constant inQED made practical calculations tractable, but the same was not true of the weak or stronginteractions - by the late 1950ās many physicists were in despair over the application of QFTto these interactions, and Landau and others led the way to alternative formulations, suchas the S-matrix theory (or ābootstrapā theory), reminiscent of behaviourist āblack boxāpsychology. This detour wasted the time of many physicists (although it produced someuseful mathematics), until the tide began to turn in 1966-1967.
This change of attitude, and the ārediscovery of field theoryā happened in a curiousfashion. There were two key developments. First, the remarkable paper of Yang and Millson 1954, which generalized the idea of the U(1) gauge transformation used in QED to anSU(N) gauge transformation. This idea was largely ignored at the time since it predictedmassless bosons as intermediaries of the strong force (and mesons are not massless!). Secondwas the slow development of General Relativity (GR). the late 1950ās, very few physicists(with the exception of astrophysicists in the UK, Dirac, the Russians surrounding E. M.Lifshitz, and those in the former circle of Einstein in the USA) paid any attention to gravityor to GR. This astonishing neglect showed how narrow-minded some communities in sciencecan be; but it also arose because GR seemed to have little relevance to earth-bound physics,and moreover, seemed to be quite irreconcilable with QM. At that time, only Einstein wastalking publicly about his idea of a āunified field theoryā.
Curiously it was Feynman who first broke away from this mindset, at the Chapel Hillconference in 1957, where he argued, using thought experiments, that gravity had to bequantized. In a period of intense work between 1957-1963 he tried to do this, but ran intoa fundamental problem - gravity is non-renormalizable. It is a matter of some mystery why,after the remarkable sucess Feynman had using path integral methods on both superfluid4He and the polaron problem, in the period 1952-1956 (diagrams are almost useless for thesesystems), that in the study of gravity he should abandon path integrals for the āSHut UpAnd Calculateā (SHUAC) method of diagrams - and this is what he did. His approach ledto the discovery of ghosts in gauge field theory, and was pursued to a successful but almostunreadable conclusion by B. de Witt (1964-1967); but it was completely unusable for anycalculations.
All this changed in the period 1967-1970 with three key developments. First, in 1966-1967, Fadeev and Popov succeeded in formulating gauge field theory in path integral language- not just for QED, but also for Yang-Mills theory, and even in principle, for gravity. Theresult was equivalent to deWittās (published a week later), but in contrast, was simple to
2
understand and use. Second, in 1967-1968, Salam and Weinberg separately published the-ories unifying the weak and EM interactions into one āelectroweakā gauge theory. Thistheory incorporated the yang-Mills idea of non-Aberlian gauge fields, and the Higgs mech-anism to give the gauge bosons a mass. Nobody paid any attention to it until 1970, whenāt Hooft, a beginning PhD student in Utrecht, showed - to everyoneās astonishment - thatthe Salam-Weinberg theory was renormalizable; and then, in a tour de force, he showed howto do calculations with it, using a combination of path integral methods and ādimensionalregularizationā, a technique introduced by āt Hooft and his supervisor Veltman (and foundby others independently at around the same time).
It is hard to imagine what Feynmanās reaction must have been when he saw what hemight have achieved, had he stuck with his own path integral methods!
All of this work was subsequently developed into what is now the āstandard modelā.The key further element to be found was the discovery of asymptotic freedom in the stronginteractions (mediated by gluons, with a role also played by other bosonic fields). Thisdiscovery was actually made by āt Hooft in 1972, but he was discouraged from publishing itby Veltman; it was rediscovered by Gross and Wilczek in 1973, and also by Politzer in 1973(whose supervisor, Coleman, was in contact with āt Hooft).
At the same time in condensed matter physics, the idea of the āgauge principleā wasbeing applied to various systems. Here the impact was less clear, because we always have aāmore microscopicā theory which does not require the gauge formulation; and gauge theoriesare often hard to work with. Thus, e.g., the gauge theories of high-Tc superconductivity andof the FQH (Fractional Quantum Hall Liquid) have had little practical impact. Howeverthey have motivated mainly interesting new developments, and the idea of āspontaneouslybroken gauge symmetryā, with the appearance of an order parameter, is central to all ofcondensed matter physics. Indeed, in focussing on this, CM physicists are really returningto the basic original idea of gauge theory, which goes back to Weyl in the 1920ās, viz., thatthere had to be an apparent arbitrariness in the way that we parametrize physical variablessuch as phase, or even length and time in spacetime. In classical physics these variablesreally are redundant - they are eliminated by fixing a āstandard of measurementā. But thisis not so in a quantum theory.
This difference between classical and quantum physics, in the role played by a gaugefield, is crucial. So let us now see how it happens, in the context of one of the simplestgauge theories, viz., QED and the quantized EM field. The remarks we make here aroseoriginally from the analyses of Yang-Mills in 1954, of Aharonov and Bohm in 1957-59, andtheir subsequent elaborations by many authors.
B.4.1 (b) THE AHARONOV-BOHM EFFECT
When first proposed in the late 1950ās, this analysis by Aharonov and Bohm caused hugecontroversy (NB: the mathematical analysis was actually done by MHL Pryce in Bristol,who later spent the years 1968-2001 at UBC). This was not least because Bohm, a brilliantyoung protege of Oppenheimer during the war years, had been expelled from Princeton and
3
from the USA in the early 1950ās, accused of being a communist during the McCarthy years.With the help of Einstein and Pryce, then both at Princeton, Bohm went first to Brazil andthen to Israel; Pryce then recruited him to Bristol in the UK in the late 1950ās.
There are several key works by Aharonov and Bohm; the one we will be looking atconcerns gauge fields. You may also find it interesting to look at Feynmanās lectures inPhysics, Vol. 3, on this - Feynman was one of the early supporters of Bohm and his work.
(i) Particle on a Ring: Consider a situation where a single non-relativistic particle isforced to move on a 2-dimensional circle of radius R0 - such a situation is now easy to organizewith mesoscopic rings or superconducting SQUIDS, or quantum wells, or even optically withphotons. However when Chambers did the first experiment in 1961 it was not so simple.
The quantum mechanics of the problem is quite straightforward, and easily done usingthe Schrodinger equation. The Hamiltonian is assumed static; then
H =1
2m
(p + qA(r)
)2+ q Ļ(r) (1)
and we will assume that a flux Ī¦ is enclosed inside the ring, i.e., thatā®c
dl Ā·A(l) = R0
ā«dĪøA(Īø) = Ī¦ (2)
This is not actually the problem solved by Aharonov and Bohm, and it is easier to solvethen their scattering problem. Notice that the full EM Lagrangian, derived from H bycanonical transformation, viz.,
L =1
2mr2 + qA(r) Ā· rā q Ļ(r) (3)
where the momentum is
p =āLār
; = mr + qA(r) (4)
can be replaced by a truncated version only valid on the 2-dimensional ring of radius R0.Let us assume that the electrostic potential Ļ(r) = Ļ0, a constant (no electric field). Then,
4
on the ring, we have
L ā 1
2mr2 + q r Ā·A(r)
ā 1
2I0Īø
2 + q R0ĪøA(Īø) (if |r| = R0) (5)
where the āmoment of inertiaāI0 = mR2
0 (6)
(i) Consider first this problem when Ī¦ = 0; we just have a particle circulating on a ring,with
H0 = L0 =1
2I0Īø
2 (7)
Then the normalized solutions to the Schrodinger equation:
HĻl(Īø) = ĪµlĻl(Īø) (8)
are given by
Ļl(Īø) =1ā2Ļ
eilĪø (9)
where l = 0, Ā±1, Ā±2, . . . etc., is an integer (the angular momentum quantum number), andĪø is, as above the angular coordinate. Here we can treat it as compact, i.e., 0 ā¤ Īø ā¤ 2Ļ.
The propagator for the particle is also easily found; we have
G(Īø1Īø2; t1t2) = G(Īø, t) =āl
Ļl(Īø)Ļāl (Īø)e
i~ Īµ
0l t
=āl
ā« āāā
dĻ
2Ļ
Ļl(Īø)Ļāl (Īø)
Ļ ā Īµ0l(10)
where the eigenvalues are just
Īµ0l =~2
2I0
l2 (11)
Using either form in (10), we get the result
G(Īø, t) =
(I0
2Ļi~t
) 12āā
l=āā
exp
i
~I0
(Īø + 2Ļl)2
2t
(12)
which we can write in more compact form as
G(Īø, t) =
(I0
2Ļi~t
) 12
ei~ I0
Īø2
2t Ī3
(ĻI0Īø
~t;
2ĻI0
~t)
(13)
5
where Ī3(z, t) is the biperiodic Jacobi Īø-function, defined in the complex plane z, and withthe series representation
Ī3(z, t) =āā
n=āā
ei(Ļtn2+2zn) (14)
i.e., a discretized version of a Gaussian integral.Why do we have such a complicated result? The first part of (13) looks just like a free
particle of mass I0; so where does the Jacobi Īø-function come from? In this calculation itcomes from the compactness of the variable Īø; we are dealing with a 1-d āparticle in a boxā,with periodic boundary conditions. But now we can rederive this result in a quite differentway, extending the domain of Īø so that āā < Īø < ā. Letās rederive the result (13) usingpath integrals. Then we have
G(Īø, t) =
ā« Īø(t)=Īø
Īø(0)=0
D r(t) ei~S[r,r] = A(t) e
i~Scl(Īø,t) (15)
where A(t) is just the fluctuation determinant for a particle of mass I0 in free particle motion:
A(t) =
(I0
2Ļi~t
) 12
(16)
Now, however, we must be careful with the classical action. Recall that we are supposedto sum over all paths. In Fig(a) at the top of the page we see 2 simple paths which begin atĪø(0) = 0, and terminate at Īø(t) = Īø. Notice that in both cases the winding number n is zero.
However, this is not true of the path in Fig(b). This has winding number n = 1. Andyet this path must also be included in G(Īø, t).
The simplest way to then derive the answer for Scl(Īø, t) is to either (a) note that anypath beginning at Īø(0) = 0 and ending at Īø(t) = Īø can be decomposed into a path goingfrom Īø(0) = 0 to Īø(Ļ) = 0, where Ļ ā¤ t, and then another path going to Īø(t) = Īø; but wenow sum over all possible winding numbers for the first at these 2 paths. Or else (b), we justāunfoldā the ring, as shown in the figure below. Now in this figure we see that any point onthe line at
Īøā²= Īø + 2nĻ, n = 0, Ā±1, Ā±2, . . . (17)
6
must be identified with Īø, and we must then sum over all paths leading to any of thesepoints. The paths shown on the previous figure are shown again below.
However it is clear from this diagram that we are dealing with a free particle on the line,and so we immediately find that
ei~Scl(Īø,t) =
āān=āā
ei~ I0
(Īø+2nĻ)2
2t (18)
have summed over the different winding numbers. Inserting (18) and (16) into (15), we againrecover (13).
(ii) Now letās go to the finite flux case. The energy levels are shifted, and the eigenfunc-tions change; we have
Ļl(Īø, Ļ) =1ā2Ļ
ei(l+Ļ/2Ļ)Īø
Īµl(Ļ) =~2
2I0
(l + Ļ/2Ļ)2 (19)
and from this we can derive the new answer. Before doing so, notice a crucial point. Theanswer does not depend on how the flux is distributed inside the rings, or even on whetherthe flux density (i.e., the magnetic field) is finite at the ring itself - it depends only on thetotal flux, or rather, the ādimensionless fluxā Ļ, defined as
Ļ =q
~
ā®dl Ā·A(l) =
2Ļq
~Ī¦ = Ī¦/Ī¦0, where Ī¦0 = h/q (20)
and Ī¦0 = h/q is the flux quantum for charge q.From here on it is clear that for a path with winding number n, we must add a phase nĻ
7
to the action exponent, and so now we get
G(Īø, t; Ļ) =
(I0
2Ļi~t
) 12āā
n=āā
exp
inĻ+
i
~I0
(Īø + 2nĻ)2
2t
=
(I0
2Ļi~t
) 12
ei~ I0
Īø2
2t Ī3
(ĻI0Īø
~tā Ļ
2;
2ĻI0
~t
)(21)
and in both forms of this answer, we see what is already obvious from the new eigenfunctionsand energies in (19), i.e., that the flux has shifted the answer - it is a āphase shiftā operator.
(ii) Charged Particle scattering off a flux tube: Now letās go to the problem thatAharonov and Bohm actually looked at in their famous paper. This problem is shown below- we have an infinitesimal flux tube, still carrying flux Ī¦, but now confined to a very thinfilament, which we will treat as a Ī“-function in 2-d space. We then have
H =1
2m
(Ļ+ qA(r)
)2(22)
where
A(r) = Ī¦z Ć r2Ļr
= ĻĪ¦0z Ć r2Ļr
= ĪøĪ¦
2Ļr(23)
where z and r are unit vectors, as is Īø. To make the point even more clearly, letās surroundthe flux tube at the origin by an infinite potential barrier outside the flux tube, but witha radius r0 which we will also take to be infinitesimal. Then nothing from outside canpenetrate, and the flux tube is isolated from the outside world.
And yet, from what we have done above, we know that even though the electric fieldE(r) = 0 everywhere, and B(r) = 0 except inside the flux tube, yet still a quantum particlemoving outside will feel the flux! This is utterly different from classical mechanics, wherethe particle dynamics is governed by the Lorentz equation, which is local:
mr(t) = qE(r) + q(rĆB(r)
)(24)
8
Now, we imagine a plane wave state of the particle, incident on the flux tube. If welet the barrier potential radius r0 ā 0, then for some finite wavelength incident wave, withwavelength Ī» = 2Ļ/k, the scattering cross-section (in 2d) off the potential barrier on its own(ie., with no enclosed flux) is given from elementary scattering theory by
Ļk ā¼ kr0 ln(kr0) (25)
so that in this limit, the particle does not scatter off the potential barrier. Nevertheless itstill scatters off the flux tube, even though it never sees the flux. The Schrodinger equationin 2d, for the Hamiltonian (1) with Ļ(r) = 0, is
ā2r +
1
rār +
[k2 ā 1
r2(iāĪø + Ļ)2
]ĪØ(r, Īø) = 0 (26)
and for r 6= 0, the solution can be written in terms of the eigenfunctions of this equation:
ĪØ(r, Īø) =ālk
clkĻkl(r, Īø) =āl
cleilĪø J|l+Ļ|(kr), (if r = r0) (27)
with the eigenvalues:
HĻlk(r, Īø) = ĪµlkĻlk(r, Īø)
Īµlk = ~2k2/2m (28)
from which we have
G(r2r1; Īø2Īø1; t) =1
2Ļ
āl
ā«k dk J|l+Ļ|(kr1) J|l+Ļ|(kr2) ei[(Īø2āĪø1)lāĪµlkt] (29)
However this part is easy. More messy is the determination of the scattering amplitudefk(Īø), defined by the asymmetric form of ĪØ(r, Īø) according to
ĪØ(r, Īø) = eikx +1ārfk(Īø) e
ikr, quad(when r āā ) (30)
This was the problem that Pryce solved for Aharonov, to give the result
fk(Īø) ' ā eiĻ/4ā2Ļk
sin ĻĻeiĪø2
cos Īø2
(31)
The form of this result is rather complex, and is illustrated in the figure. From the resultin (31) we see that the scattering amplitude is periodic in the flux Ļ; indeed, when Ļ = n,an integer, it has no effect at all.
Notice that (31) diverges for Īø ā Ļ, and in fact it breaks down for both forward andbackward scattering. The figure gives an idea of how it actually behaves.
9
Of course what is so strange about this result is that it shows that the particle dynamics,in the quantum theory of this problem, is controlled not by E(r) or B(r), but by A(r).This means that in the quantum theory, it is A(r) (and more generally A(r, t)) that isfundamental object, and not B(r, t) or E(r, t), which are merely derived from A(r, t).
This is the exact opposite of classical EM theory. There, the fundamental physicalquantities are B(r, t) and E(r, t); and from the Lorentz eqtn. (24), we see that these 2 fieldsentirely control the dynamics of electric charge.
(iii) Classical EM vs. QED: What is the reason for this fundamental difference,between classical EM theory and its quantized version? For many years this was not properlyunderstood, and yet the reason is to be found in equation (21). We notice that the term in theexponential appears without an accompanying ~, which is actually buried in the definitionof Ļ; letās rewrite (21) as
G(Īø, t; Ļ) = A(t)āā
n=āā
expi
~
n~Ļ+ I
(Īø + 2Ļn)2
2t
(32)
where
A(t) =
(I0
2Ļi~t
) 12
(33)
Now suppose we consider the limit ~ ā 0. It is singular; the phase in the exponentdiverges. However we notice that the term in Ļ is independent of ~, and it disappears fromthe exponent. This is because there are actually 2 quantum parameters here, ~ and q, and
10
we are perfectly entitled to treat them as independent. Thus letās write (32) as
G(Īø, t; ~, q) = A(~t)āā
n=āā
ei2nqĪ¦/Ļ e
i2I0
(Īø+2nĻ)2
~t
ā” A(~t)āā
n=āā
einĻc(qĪ¦) eiĻn(~t,Īø) (34)
where the prefactor A(~t) and the phase Ļn(~t, Īø) both diverge as ~tā 0:
A(~t) =( I0
2Ļi~t) 1
2 āāāā~tā0
ā
Ļn(~t, Īø) =I0
2
(Īø + 2nĻ)2
~tāāāā~tā0
ā (35)
whereas the topological phase Ļc(qĪ¦) does not:
Ļc =qĪ¦
2Ļ, (independent of ~, t). (36)
Thus we see already, from looking at a simple non-relativistic problem in the motion ofa charged particle coupled to a static EM field, that there is a crucial term in the dynamicsthat exists in the quantum version of the theory, but not in the classical.
If we think about this a little more, it is not all that obvious why there ought to beany relationship between the quantum and classical versions of electrodynamics. After all,classical EM theory makes no reference to Dirac electrons and holes, which exist in QED,and it is not obvious why the form of the Lagrangian or action for the 2 theories shouldlook the same. Thus we can, if we like, start by asking - why should there be a quantumgeneralization of a classical gauge theory like QED, and what should it look like?
B.4.2 DERIVATION of GAUGE FIELD THEORIES
In this sub-section we will give some of the well-known arguments that are used to derivethe existence and form of gauge theories. A key part of what follows is the comparisonbetween the quantum and classical versions of these theories. I will focus mainly on thegeneral form of these theories, with not too much attention paid to examples. Note, asalready emphasized in the previous sub-section, that physics is full of different sorts of gaugetheory - why this is so should become clear in the following.
B.4.2 (a) QUANTUM ELECTRODYNAMICS: a U(1) GAUGE THEORY
QED is the simplest sort of gauge theory - we shall see why this is in the course of thederivations to be given. It is important in what follows to compare the classical and quantum
11
versions of electrodynamics, so we begin by reviewing classical EM theory. In what followsit will be assumed that everyone is familiar with special relativity and relativistic notation.
(i) Classical Electrodynamics: Classical Electrodynamics is a theory of sources cou-pled to fields, and it can be phrased entirely in terms of the fields E(x) and B(x), and thecharge 4-current JĀµ(x) = (Ļ(x),J(x)). However, as is well known, this is not the best wayto formulate it, and it is not ideal for the generalization to QED.
Letās see how we can set up classical EM theory starting with an experimental fact, viz.,that there exist fields E(x) and B(x), and that they act upon electric charges according tothe Lorentz force law in (24) above. In 4-vector notation we rewrite the Lorentz equation as
fĀµ = qF ĀµĪ½uĪ½ (37)
where uĪ½ is the 4-velocity vector, i.e.,
uĪ½ =dxĪ½
dĻ(38)
where Ļ is the proper time interval (on a worldline). In what follows will assume the followingconventions; the infinitesimal interval ds is related to the metric by
c2dĻ 2 = ds2 = gĀµĪ½dxĀµ dxĪ½ āāāāāā
flat spaceĪ·ĀµĪ½dx
Āµ dxĪ½ (39)
where the coordinate 4-vector is xĀµ = (x0; x1, x2, x3) = (ct; x, y, z), and the flat spaceMinkowski metric is taken to be
Ī·ĀµĪ½ = Ī·ĀµĪ½ =
1 0 0 00 ā1 0 00 0 ā1 00 0 0 ā1
(40)
with signature ā2. If in some general pseudo-Riemannian spacetime we define a set of basisvectors eĀµ (cotravariant) and eĪ½ (covariant), then
ds = eĀµdxĀµ = eĀµdxĀµ
gĀµĪ½ = eĀµ Ā· eĪ½ , gĀµĪ½ = eĀµ Ā· eĪ½ , gĀµĪ½ = eĀµ Ā· eĀµ = Ī“ĀµĪ½ (41)
The 4-vectors of main interest to us will be:
4-velocity: u = uĪ½eĪ½ , with uĪ½ = Ī³u(c,u), where Ī³u =(1 ā u2
)ā1/2, with u = u/c, and
with individual components u =(dxdt, dydt, dzdt
).
Note we then have dĻ = dt/Ī³u, and also
u2 = uĪ½uĪ½ =
(ds
dĻ
)2
= c2 (42)
12
4-acceleration:
a = aĪ½eĪ½ , where aĪ½ =d2xĪ½
dĻ 2=
duĪ½
dĻ(43)
so thatuĪ½a
Ī½ = 0 (44)
4-current density:J(x) = Ļ0(x)u(x) = JĪ½(x)eĪ½(x) (45)
where Ļ0(x) is the proper charge density; then
JĪ½ = Ļ0Ī³u(c,u) = (Ļc, ) (46)
where Ļ(x) = Ī³uĻ0(x), and j(x) = Ļ(x)u(x) is the 3-current density; from eq. (42), we have
J2 = JĀµJĪ½ = Ļ2c2 ā 2 (47)
Returning now to the Lorentz force equation, we notice that a contraction of (37) witha 4-vector uĀµ gives fĀµuĀµ = qF ĀµĪ½uĀµuĪ½ = 0, because a 4-force, like a 4-acceleration, must beperpendicular to its associated velocity (cf. eq. (44)); thus FĀµĪ½ must be antisymmetric, i.e.,
FĀµĪ½ = āFĪ½Āµ (48)
We have introduced the tensor FĀµĪ½(x) as a way of describing the Lorentz force law ina relativistically invariant way - the presence of the velocity vector r = u in (24) naturallyleads to a formulation in terms of the velocity 4-vector u(x), and then to an equation of theform in (37). If we now want to recover (24), we write FĀµĪ½(x) in the form
FĀµĪ½ =
0 Ex/c Ey/c Ez/c
āEx/c 0 āBz āBy
āEy/c Bz 0 āBx
āEz/c By Bx 0
(49)
Now, we also wish to find a source equation for the EM field - we may treat this as ageneral theoretical requirement, or again base it on experiment (which show that EM fieldsact on charges, and are acted on by them as well). Rather than appeal to experiment, wecan instead simply ask what is the natural relativistically invariant form required for electriccharge to generate the field. Since the charge Ļ(x) is simply one component of the 4-currentJ(x), we look for a linear equation relating F ĀµĪ½(x) to JĀµ(x). The obvious way to do this isto write
āĀµFĀµĪ½(x) = ĪŗJĪ½(x), Īŗ =
4Ļ/c (cgs)Āµ0 (MKS)
(50)
(any other 4-vector AĀµ contracted with FĀµĪ½(x) would do, but experiment reveals no othersuch quantities); the constant Īŗ is given by experiment.
13
At first it might seem that eqs. (37) and (50) characterize the theory properly. Butactually the components of FĀµĪ½ are not independent of each other - there are 6 apparentlyindependent components in (49), but only 4 independent quantities in (50). One can eitherargue that FĀµĪ½(x) has to be therefore constructed from a 4-vector, or appeal to experimentfor the relationship between the components of FĀµĪ½(x). Both lines of argument lead to theconclusion that we can write FĀµĪ½(x) in the form
FĀµĪ½(x) = āĀµAĪ½(x)ā āĪ½AĀµ(x) (51)
where
AĀµ(x) =
(Ļ(x)/c, A(x)
)(MKS)(
Ļ(x), A(x))
(cgs)
(52)
and Ļ(x), A(x) are the electric and magnetic potentials. Another way to enforce the restric-tion to 4 independent components is via the Bianchi identity, written as
ā[Ī»FĀµĪ½] ā” āĪ»FĀµĪ½ + āĪ½FĪ»Āµ + āĀµFĪ½Ī» = 0 (53)
which actually follows directly from (51). There are thus 2 different ways we can write thefield equations of classical electromagnetism; we assume:
āĀµFĀµĪ½(x) = Āµ0J
Ī½(x)
and
F ĀµĪ½ = (āĀµAĪ½ ā āĪ½AĀµ) OR āĪ»FĀµĪ½ + āĪ½FĪ»Āµ + āĀµFĪ½Ī» = 0 (54)
So far so good. Now observe that we have a rather peculiar theory, for the key observablevariables are E(x) and B(x), and of course J(x), but the underlying field variable for theEM field, which is not observable, is AĀµ(x).
It then follows that if we make any changes of variables, coordinate transformations,etc., the quantities FĀµĪ½(x) and J(x) ought to be invariant, but AĀµ(x) does not have to be.This immediately leads to the possibility of Gauge Transformations. Suppose we make thetransformation
AĀµ(x) āā AĀµ(x) = AĀµ(x) + Ī±Āµ(x) (55)
Then we haveFĀµĪ½ = (āĀµAĪ½ ā āĪ½AĀµ) + (āĀµĪ±Ī½ ā āĪ½Ī±Āµ) (56)
and for FĀµĪ½ = FĀµĪ½ , we require Ī±Āµ = āĀµĻ, so:
AĀµ(x) = AĀµ + āĀµĻ(x) (57)
Thus any one of an infinite set of functions AĪ½(x) is just as valid as AĀµ(x) for the treatmentof the classical EM fields.
We now wish to write an action functional for the classical EM theory, in preparation forthe transition to QED. There will be 2 parts to this action, the matter part and the fieldpart.
14
Particle Action in EM theory: For a set of particles of mass mj, at spacetime coordinatexj, it is well known that the action has the form
Sp = āāj
mjc
ā«dsj = āc2
āj
ā«dtj (1ā u2
j/c2)1/2 (58)
Suppose we ignore this term, concentrating only on the electromagnetic part that comesfrom the interaction of the current JĀµ(x) with the gauge field AĀµ(x), which from now onwe take to be the fundamental EM field. The simplest scalar term for a Lagrangian densitycombining these two is just JĀµA
Āµ, and actually experiment tells us that we should have
Sint = āā«
d4x JĀµ(x)AĀµ(x) (59)
At first glance this coupling seem problematic, because it is apparently not invariantunder the gauge transformation of (57). However, one can actually show that it is gaugeinvariant, as follows. Suppose we gauge transform (59):
AĀµ(x) ā AĀµ + āĀµĻ(x) =ā Sint ā Sint
Sint = āā«d4x JĀµ(AĀµ + āĀµĻ) = ā
ā«d4x [JĀµAĀµ + āĀµ(JĀµĻ)ā (āĀµJ
Āµ)Ļ] (60)
Now the 2nd term, āĀµ(JĀµĻ), can be rewritten as a surface integral of JĀµĻ at infinity,and we will assume no current sources at infinity. This leaves the 3rd term, and we see thecoupling term (59) will be gauge invariant if the current conservation equation
āĀµJĀµ(x) = 0 (61)
is satisfied. But the truth of this is easily demonstrated, starting from the 1st equation ofmotion in (54); for if we take the derivative āĪ½ of (54), the left side of (54) must vanishbecause of the antisymmetric property (48) of the field FĀµĪ½(x).
Field action in EM theory: We want to find a scalar quantity which is gauge invariantfor the EM field action. This time we cannot get away with using combinations of AĪ½(x),because any combination like AĪ½AĪ½ will not be gauge invariant. Thus we must resort to FĀµĪ½ ,and the correct form of the field action is in fact
SEM = ā 1
4Āµ0
ā«d4xF ĀµĪ½(x)FĀµĪ½(x) (MKS units) (62)
where the prefactor is determined by experiment.Thus we find that the total EM action is
Scl[JĀµ, AĀµ] = ā
ā«d4x
1
4Āµ0
F ĀµĪ½(x)FĀµĪ½(x) + JĀµ(x)AĀµ(x)
(63)
and, by varying this action, we can recover the equations of motion.
15
(ii) Quantum Electrodynamics: : Now let us start again, this time with the classicalelectrons of EM theory replaced by the electrons described by the Dirac equation for aspin-1/2 fermionic field having the bare action
S0 =
ā«d4x Ļ(x) [iĪ³ĀµāĀµ ām]Ļ(x) (64)
What we now wish to show is that a quantum action corresponding to the classical actionabove can be derived for the Dirac electron coupled to an EM field - but the arguments usedare a little different (at least at first glance). They have the great advantage of being easilygeneralizable to a variety of matter fields (although one can quibble rather strongly aboutthis in the case of quantum gravity).
We begin by noting that one can realize a simple āgauge transformationā on (64), becausethe phase of Ļ(x) is arbitrary. Thus we make the global gauge transformation
Ļ(x) āā eāiĪøĻ(x) (65)
Under this transformation, S0 is invariant; the system possesses a global U(1) symmetry.However, suppose we allow Īø to depend on the spacetime coordinates; this move is of coursehighly non-trivial, and is suggested by the following considerations:
(i) the original motivation of Weyl, in the 1920ās; since one can envisage, in generalrelativity, a charge of ācoordinate measureā as one moves around in space, why not the samefor phase measure (here the name āgaugeā, meaning a measurement scale).
(ii) Quantum Mechanics naturally suggests, particularly in its path integral form, the roleof phase as a fundamental variable in the theory. This idea reappears in the Aharonov-Bohmeffect.
We shall see that the consequences of introducing such a local gauge transformation givefurther reason to look at it. So we now consider the transformation:
Ļ(x) āā eiĪø(x)Ļ(x) (66)
We immediately see that S0 is not invariant under this transformation; in fact we get
S0 āā«d4x Ļ(x)
[iĪ³Āµ(āĀµ ā iāĀµĪø(x)
)ām
]Ļ(x)
= S0 +
ā«d4x Ļ(x)Ī³ĀµāĀµĪø(x)Ļ(x) (67)
However, we can have a gauge-invariant theory if we do the following:
- We replace āĀµ by a āgauge-covariantā derivative DĀµ so that DĀµĻ(x)ā eāiĪø(x)DĀµĻ(x)is invariant. This will work if
- we write DĀµ = āĀµ+ iqAĀµ, and at the same time suppose that the field AĀµ(x) transformsaccording to
AĀµ(x) āā AĀµ(x) +1
qāĀµĪø(x) (68)
16
thereby cancelling the extra term in (67).Thus we are led to replace (64) by
S0 =
ā«d4x Ļ(x) [iĪ³ĀµDĀµ ām]Ļ(x)
=
ā«d4x Ļ(x)
[iĪ³Āµ(āĀµ + iqAĀµ(x)
)ām
]Ļ(x) (69)
and then, recognizing that the field AĀµ(x) must have its own individual term in the La-grangian (it is a now field), which must be gauge invariant, we are finally led to the totalQED action:
SQED =
ā«d4x
Ļ(x) [iĪ³ĀµāĀµ āM ]Ļ(x)ā JĀµ(x)AĀµ(x)ā 1
4Āµ0
F ĀµĪ½(x)FĀµĪ½(x)
(70)
where the Dirac current JĀµ(x) is
JĀµ(x) = qĻ(x)Ī³ĀµĻ(x) (71)
Thus we have shown that the gauge invariance of the Dirac electron terms in the action,under a local phase transformation, leads naturally to the existence of a gauge field A2(x) ofthe same kind as appears in the original classical action for the EM field! We now see anotherreason to take all this seriously; the gauge invariance naturally prevents the existence of aterm ā¼ AĪ½(x), which would give the photon a mass, something excluded by experiment. Itis also interesting to note that the field FĀµĪ½(x) acquires a new significance in the quantummechanical theory (which in a path integral formulation can be related directly back to theclassical theory, as we shall see). Consider the action of the curvature operator
RĀµĪ½ = [DĀµ, DĪ½ ] ā” DĀµDĪ½ āDĪ½DĀµ (72)
on the wave-function; we immediately find that
RĀµĪ½ Ļ(x) = iqFĀµĪ½(x) Ļ(x) (73)
so that the different components of the field intensity FĀµĪ½(x) (i.e., the āphysicalā fieldsE(x), B(x)) are just components of the curvature. Thus we can argue that in reality wehave
- AĀµ(x): The underlying āEM fieldā, or āEM vacuumā, which is not directly accessibleto us in classical EM theory; it is accessible as a phase variable in QM, modulo gaugetransformation;
- FĀµĪ½(x): The ācurvatureā - a kind of generalized āpolarizationā - of A(x), withcomponents E(x), B(x), which is directly accessible to us via its effects on charge, in bothclassical EM and QED.
We note of course that FĀµĪ½(x) is gauge invariant, as we have already seen. Notice twokey features of this U(1) theory:
17
(i) The form of the coupling to the gauge field (indeed, the existence of the gauge field,in the argument) arises solely from the transformation of the matter field under local phasetransformation; one ends up with a covariant or āminimalā coupling. The requirement ofgauge invariance then uniquely determines the coupling (as well as the masslessness of thegauge field).
(ii) The photon does not couple to itself. There is no reason coming from gauge invariancewhy this āphoton self-couplingā does not happen (it could, e.g., come from higher powers ofFĀµĪ½(x) in the action); nevertheless there are no such terms in the bare action.
There is also a 3rd key feature which we will come to below, concerning the connectionbetween gauge transformations and conservation laws (Noetherās Theorem).
B.4.2 (b) NON-ABELIAN GAUGE FIELDS - YANG-MILLS THEORY
In their pioneering paper published in 1954, Yang and Mills vastly extended the ideas ofgauge fields beyond the U(1) discussion given above. This paper, inspired by ideas from GRas well as by questions arising in particle physics, was almost entirely ignored for a decade- it was well ahead of its time. There was nothing in classical physics at that time whichsuggested such a quantum theory, apart from GR - although now we have very nice examplesfrom condensed matter physics (e.g., superfluid 3He, where the quantum order parameterobeys a set of classical equations of motion which look like a non-Abelian gauge theory ona background dynamics curved spacetime). The main reasons that the Yang-Mills theoryreceived so little attention until the early-mid 1960ās were
(i) Almost nobody was interested in or familiar with classical GR; and quantum gravityhardly existed even as an idea.
(ii) The Yang-Mills (YM) theory predicted massless particles. Adding a mass term brokethe SU(N) gauge invariance. No mechanism at that time was known, at least in particlephysics, that would break this invariance in a physically satisfactory way.
(iii) Nobody knew how to calculate with such theories.
As we will see, objections (i) and (ii) were slowly overcome; the key to solving (ii) wasthe Anderson-Higgs mechanism, implemented for YM theories by Salam and Weinberg. Thekey to (iii) was the use of path integral methods, without which even QED remained hardto understand. The main initiative came from āt Hooft, and from āt Hooft and Veltman.
In what follows we will begin by going through the important special case of SU(2) gaugesymmetry, which is a nice pedagogical example. I will then sketch how this is generalized tohigher non-Abelian groups, and discuss the physical significance of all this.
(i) SU(2) Gauge Theory: Recall that the simple SU(2) group can be represented bythe Pauli matrices ĻĪ±Ī²j , with
ĻiĻj = Ī“ij + iĪµijkĻk
[Ļi, Ļj] = 2iĪµijkĻk (81)
18
and we may define the unitary operator U(Ī©) = U(nĪ©), where n is a unit vector on theBloch sphere, as
U(Ī©) = eāi2Ī©Ā·Ļ = eā
i2
Ī©jĻĪ±Ī²j ā” lim
Nāā
(1ā i
2NĪ© Ā· Ļ
)N(82)
where the last form gives us the infinitesimal operator. Note that U(Ī©) = UĪ±Ī²(Ī©) is amatrix operator in āspin spaceā, and the unit vector n tells us the direction around whichĪ© is effecting a rotation.
We may now go through much the same manoeuvres as we did for the Abelian gaugefield. We introduce in this case a 2-component spinor Dirac field, ĻĪ±(x), and note that theLagrangian
L0 = ĻĪ±(x)[iĪ³ĀµāĀµĪ“
Ī±Ī² āmĪ“Ī±Ī²
]ĻĪ²(x)
is invariant under a global SU(2) rotation. However, let us now apply the local operator,where Ī©ā Ī©(x); then
U(Ī©(x)
)Ļ(x) = UĪ±Ī²
(Ī©(x)
)ĻĪ²(x)
= eāi2Ī©(x)Ā·Ļ Ī±Ī²
ĻĪ²(x) = ĻĪ±(x) (83)
and so we find that L0 is not invariant, for the same reason as before, i.e., we get an extragradient term:
S0 āā S0 +
ā«d4x ĻĪ±(x)
[Uā1Ī±Ī³
(Ī©(x)
)āĀµU
Ī³Ī²(Ī©(x)
)]ĻĪ²(x) (84)
We therefore introduce the gauge covariant derivative, with a charge g0:
DĀµ =(āĀµ + i
g0
2Ļ Ā·AĀµ(x)
)DĪ±Ī²Āµ =
(āĀµĪ“
Ī±Ī² + ig0
2ĻĪ±Ī²j Ā· AjĀµ(x)
)(85)
where we use a vector notation ĻĪ±Ī² = (ĻĪ±Ī²j , ĻĪ±Ī²x , ĻĪ±Ī²y ) ā” ĻĪ±Ī²j ā Ļ , so as to suppress
the clutter of indices, with the boldface indicating a vector in real spacetime. Since ĻĪ±(x)
transforms to ĖĻĪ± = uā1Ī±Ī²ĻĪ², we clearly want a transformation such that
DĀµĻ āā U(Ī©(x)
)DĀµĻ (86)
and, going through the algebra, analogous to that for U(1) gauge fields, we find that
1
2Ļ Ā·AĀµ āā U(Ī©)
1
2Ļ Ā·AĀµU
ā1(Ī©) + igā10
(āĀµU(Ī©)
)Uā1(Ī©) (87)
or, for an infinitesimal transformation Ī“Ī©(x) (cf. eq. (82)), we get
1
2Ļ Ā·AĀµ āā
1
2Ļ Ā·AĀµ +
1
2Ļ Ā· (Ī“Ī©ĆAĀµ) +
1
2g0
Ļ Ā· āĀµĪ“Ī©
1
2ĻiA
iĀµ āā
1
2ĻiA
iĀµ +
1
2ĪµijkĻ
iĪ“Ī©jAkĀµ +1
2g0
ĻiāĀµĪ“Ī©i (88)
19
where we write out all the components in the 2nd form. Thus we can now write everythingin terms of a new non-Abelian gauge field, given by (85), with the transformation property
AĀµ(x) ā AĀµ(x) + Ī“AĀµ(x)
Ī“AĀµ(x) =(Ī“Ī©(x)ĆAĀµ(x)
)+
1
g0
āĀµĪ“Ī©(x) (89)
and we can immediately generalize this from the infinitesimal transformation to the generaltransformation
AĀµ(x) āā AĀµ(x) +1
g0
āĀµĪ“Ī©(x) +(Ī©(x)ĆAĀµ(x)
)AiĀµ(x) āā AiĀµ(x) +
1
g0
āĀµĪ“Ī©i(x) +
(ĪµijkĪ©
j(x)AkĀµ(x))
(90)
which we should compare with the Abelian transformation in (68). We can also relate thisto a ācurvature tensorā, or field intensity tensor; as with (72) above, we have an operator
RĪ±Ī²ĀµĪ½ = DĪ±Ī³
Āµ DĪ³Ī²Ī½ āDĪ±Ī³
Ī½ DĪ³Ī²Āµ (91)
and applying this to ĻĪ±(x), we find that
RĪ±Ī²ĀµĪ½ ĻĪ²(x) =
ig0
2(ĻĪ±Ī² Ā· FĀµĪ½) ĻĪ²(x) (92)
where we define
FĀµĪ½ =(āĀµAĪ½ ā āĪ½AĀµ
)+ g(AĀµ ĆAĪ½)
F iĀµĪ½ =
(āĀµA
iĪ½ ā āĪ½AiĀµ
)+ gĪµijkA
jĀµA
kĪ½ (93)
Unlike the Abelian FĀµĪ½(x), this tensor is not gauge-invariant, as we see by making thetransformation; we have
Ļ Ā· FĀµĪ½(x) āā U(Ī©(x)
)(Ļ Ā· FĀµĪ½(x)) Uā1
(Ī©(x)
)FĀµĪ½(x) āā FĀµĪ½(x) + Ī©(x)Ć FĀµĪ½(x) (94)
However, the analogue of the Abelian field action term in (63) and (70) is gauge-invariant;i.e., the term
Tr (Ļ Ā· FĀµĪ½) (Ļ Ā· FĀµĪ½) =1
2F iĀµĪ½F
ĀµĪ½i (95)
is gauge-invariant. Thus we are led to the form of our spinor generalization of the Abeliangauge theory, taking the form of an action in which a vector gauge field AĀµ(x) is coupled toa spinor fermion field Ļ(x), with the action
S[Ļ, Ļ; AĀµ
]=
ā«d4x Ļ(x) [iĪ³ĀµDĀµ ām]Ļ(x) ā 1
4F iĀµĪ½(x)F ĀµĪ½
i (x) (96)
20
with DĀµ given by (85). Now we would like to write this in a way analogous to (63), using acurrent operator. We can do this if we define
JĀµ(x) ā” JĀµi (x) =g
2Ļ(x)Ī³Āµ Ļ Ļ(x)
=g
2ĻĪ±(x)Ī³ĀµĻĪ±Ī²i ĻĪ²(x) (97)
A little later we will see how such a choice can be justified (and in the same way justifythe choice (71) for Abelian QED). In any case, with this choice we can write the final formfor the action:
S =
ā«d4x
ĻĪ±(x) [iĪ³ĀµāĀµ ām]ĻĪ±(x) ā JĀµ(x) Ā·AĀµ(x) ā 1
4FĀµĪ½(x)FĀµĪ½(x)
(98)
with the quantities defined as we have already seen.
This concludes the discussion of a simple U(1) non-Abelian gauge theory, which has theadvantage of being easily understandable in temrs of spinors. Let us now move to a moregeneral discussion.
(ii) General Non-Abelian Gauge Theories: We can reformulate all of this for ageneral non-Abelian gauge group. Thus, one can imagine some simple Lie group G withgenerators ga satisfying the algebra
[ga, gb] = ifabcgc (99)
and we will represent this Lie algebra with matrices T which operate on a n-dimensionalfermion field ĪØ(x) = ĪØa(x). The matrices T then have the commutation relation
[Ta, Tb] = ifabcTc (100)
and the general local gauge transformation will act on ĪØ(x) according to
U(Ī(x)
)ĪØ(x) = eāig0TĀ·Ī(x) (101)
where we can think of Ī(x) as a āhyperangleā in the n-dimensional space. We introduce ageneralized gauge field AĀµ with components AĀµa(x), which transform according to
AĀµa āā AĀµa + āĀµĪa + g0fabcĪbAĀµc (102)
and we define the gauge covariant derivative
DĀµ =(āĀµ + ig0T Ā·AĀµ
)(103)
Then the action of the commutator RĀµĪ½ = [DĀµ, DĪ½ ] on the fermion field is given by
RĀµĪ½ = [DĀµ, DĪ½ ] = ig0T Ā· FĀµĪ½ (104)
21
withFĀµĪ½(x) =
(āĀµAĪ½ ā āĪ½AĀµ
)+ ig0 [AĀµ, AĪ½ ] (105)
which transforms according to
FĀµĪ½(x) āā U(Ī)FĀµĪ½(x)Uā1(Ī) (106)
Finally, we write the total action for this theory as
SYM =
ā«d4x
ĪØ(x) [iĪ³ĀµDĀµ ām] ĪØ(x) ā 1
4FĀµĪ½(x) Ā· FĀµĪ½(x)
=
ā«d4x
ĪØ(x) [iĪ³ĀµāĀµ ām] ĪØ(x) ā JĀµ(x) Ā·AĀµ(x) ā 1
4FĀµĪ½(x) Ā· FĀµĪ½(x)
(107)
of which the SU(2) theory is obviously a special case.
B.4.2 (c) PHYSICAL PROPERTIES of GAUGE FIELDS
There are many interesting things that one can say about gauge fields, particularly aboutnon-Abelian gauge fields. In the following I will confine the remarks to some fairly basicpoints.
(i) Equations of Motion and Current Conservation: Up until now we have justlooked at the action and the quantities in it. But an obvious physical question is - how dothe fields affect each otherās motion?
To answer this we find the equations of motion of the 2 fields, by functionally differenti-ating the action. We then have, as usual, that
Ī“SYM =
ā«d4x
[āĀµ
āLā(āĀµAĪ½)
ā āLāAĪ½
]Ī“AĀµ
+
[āĀµ
āLā(āĀµĪØ)
ā āLāĪØ
]Ī“ĪØ +
[āĀµ
āLā(āĀµĪØ)
ā āLāĪØ
]Ī“ĪØ (108)
giving us equations of motion for AĪ½ , ĪØ, ĪØ. The simplest of these equations is for thefermion field ĪØ(x); we have(
iĪ³ĀµDĀµ ām)ĪØ(x) =
[(iĪ³ĀµāĀµ ām
)+ ig0ĪØ(x)Ī³Āµ T Ā·AĪ½(x)
]ĪØ(x) = 0 (109)
so that the fermion field is acted upon by a āsourceā which combines the anti-field ĪØ(x) andthe gauge field term T Ā·AĪ½(x). A similar equation is obeyed by the anti-field ĪØ(x).
The gauge field equation of motion is taken from the 1st variation in (108), and it gives
DĀµFĀµĪ½(x) = g0ĪØ(x) Ī³ĀµT ĪØ(x) = JĀµ(x) (110)
which is just the generalization of the usual sourced equation of motion for the EM field.
22
It is very interesting and useful to look more closely at the connection between currentslike JĪ½(x) in (110) and the symmetries that exist in the field theory of interest. Let us recallwhere an expression like (108) comes from; our theory has a Lagrangian L
(Xp, āĀµXp
), where
Xp =(Ļ(x), ĻĪ±(x), Ļ(x), AĪ½(x), . . .
)is the set of all fields that L depends on, collected into
one āsuperfieldā Xp. We then have
Ī“S =
ā«dDx
[āLĪ“Xp
Ī“Xp +āL
Ī“(āĀµXp)Ī“(āĀµXp)
]=
ā«dDx
[āLĪ“Xp
+āL
Ī“(āĀµXp)āĀµ
]Ī“Xp (111)
Now from this we derive a form like (108) by throwing away boundary terms, arguingthat at the boundary of our spacetime they vanish - this is done using integration by parts,to show that
āĀµ( Ī“LĪ“(āĀµXp)
)ā Ī“LĪ“Xp
= 0 (112)
once we set Ī“S = 0. However, substituting this into (111), we immediately find that Ī“S in(111) can be written as a total derivative, i.e.,
Ī“S =
ā«dDx āĀµ
[Ī“L
Ī“(āĀµXp)Ī“Xp
]=
ā«dDx Ī“L(Xp, āĀµXp) (113)
where we now identify the quantity in bracelets as a ācurrentā. Before continuing withargument, letās just consider what form the current JĀµ(x) might take. This clearly dependson what field Xp we are dealing with. In what follows we will assume that all transformationsof the Lagrangian we are interested in can be effected by unitary operators acting on thefield Xp, i.e., that we can write
Xp = eiGapqĻaXq (114)
for the transformed field, where Gapq is the generator of the transformation and Ļa is the
āangleā by which it is effected. Then, e.g., we have
GapqĻa āā g0T
aĪ±Ī² Ā·Īa (general non-Abelian transformation)
GapqĻa āā
g
2Ļ aĪ±Ī² Ā·Ī©a (SU(2) transformation)
GapqĻa āā qĪø (U(1) transformation) (115)
where in the case of a global transformation, Ļa is independent of x, whereas for a localtransformation, Ļa ā Ļa(x).
From (114) we have
Ī“Xp = iĪ“Ļa(x)GapqX
q =
(āXp
āĻa
)Ī“Ļa(x) (117)
23
and so now we can write Ī“L in (113) as
Ī“L = āĀµ
[āL
ā(āĀµXp)
āXp
āĻa
]Ī“Ļa (118)
Now, the key point. Suppose we make a transformation of the fields, which will be as-sumed infinitesimal, parametrized by the infinitesimal Ī“Ļa, and we find that L is unchanged.It then immediately follows that
āĀµJĀµa(x) = 0
JĀµa(x) =
[āL
ā(āĀµXp)
āXp
āĻa
](119)
This is usually called āNoetherās theoremā, and evaluation of it for any of the fieldswe have looked at so far immediately gives us a conservation law for the currents we havedefined. As an example of (119), consider the bare Lagrangian
L0 = ĪØĪ±(x) [iĪ³ĀµāĀµ ām] ĪØĪ±(x) (120)
The infinitesimal field transformation is
ĪØĪ±(x) =[āig0T
aĪ±Ī²ĪØ
Ī²(x)]Ī“Īa (121)
so that the current is
JĀµa(x) =(iĪØĪ±(x)Ī³Āµ
)(ā g0T
aĪ±Ī²ĪØ
Ī²(x))
= g0ĪØĪ±(x)Ī³ĀµTĪ±Ī²
a ĪØĪ²(x) (1)
as previously derived in (110). From (119) we then see that for the Lagrangian in (120),JĀµa(x) is conserved.
One can say a lot more about such conserved currents, but the basic message here is clear- symmetries lead to conservation laws, just as in classical physics and in ordinary QM.
(ii) Physical Interpretation of FĀµĪ½(x): It has already been stated that FĀµĪ½(x) can bethought of as a kind of curvature, and here we amplify on this statement.
Consider the case where the transformation of the field we have been talking about nowconsists in looking at the change of the field as we move from one point to another. Thuswe are interested in the transformation
Ļ(x2) = U(x2, x1)Ļ(x1) (123)
and more generally in the correlator
g2(x2, x1) = ć0 | T Ļ(x2)Ļ(x1) | 0ć (124)
24
Now letās consider the effect of adding a gauge field into the unitary transformationU(x2, x1) Ļ(x1). In path integral language, we can write an expression of the form
U(x2, x1) Ļ(x1) = ei~ā« x2x1
dx (L0+LA)= G0
21[x] ei~ā« x2x1
dx LA (125)
where the āamplitudeā G021[x] is the result of making the transformation in the absence of
the gauge field, along a special path x.Letās now focus on the extra contribution here, which we call
PA(x2, x1 | x) = ei~ā«dx LA(Ļ,Ļ;AĀµ) (126)
for a general gauge field; thus, e.g., for the non-Abelian Yang-Mills theory of (107), we have
PA(x2, x1 | x) = eig0ā« x2x1
dxĀµ TĀ·AĀµ(x)(127)
Now the interesting question here is - what happens if we take the system through acircuit, and bring it back to the same place? This question leads us to a specific example ofa āBerry phaseā argument, which is more usually discussed for a simple wave-function, as inQM.
To extract the curvature it is sufficient to look at an infinitesimal circuit, which weassume to be oriented arbitrarily in spacetime. Let us imagine following this circuit alongthe counterclockwise path 1ā 2ā 3ā 4; i.e., we wish to calculate the contribution
P(x, xā² | 4 3 2 1) = eig0ā®dxĀµTĀ·AĀµ(x) = P(4, 3)P(3, 2)P(2, 1)P(1, 4) (128)
25
Now there are 2 obvious ways to do this. One is the simple and quick method of usingStokesās theorem, i.e., we write
eig0ā®dxĀµTĀ·AĀµ(x) = e
ig0ā®dxĀµ dxĪ½
(āĀµAĪ½(x)āāĪ½AĀµ(x)
)+g0[AĀµ(x),AĪ½(x)]
(129)
where AĀµ(x) = T Ā·AĀµ(x). This result is obtained by noting the identity
e(A+B)x = eAx eBx eā[A, B]x2/2 +O(x3) (130)
for the non-commuting operators AĀµ(x) and AĪ½(x), and then using two of the four integra-tions in the commutator term to remove the derivatives from the integral [āĀµAĪ½ , āĪ½AĀµ, ] inthe exponent.
If this manoeuvre seems too much of a trick (and it looks much better if we phrase it interms of differential forms ), then we can do the path integral long-hand. We have
P(3, 2)P(2, 1) = eig0AĀµ(x)+dxĪ½) dxĪ½ eig0AĀµ(x) dxĀµ
= exp
ig0
[AĀµdxĀµ +
(AĪ½dxĪ½ + āĀµAĪ½dxĀµ dxĪ½
)]ā g2
0
2[AĀµ, AĪ½ ] dxĀµ dxĪ½
and so we get
P(x, xā² | 4 3 2 1) = expig0
[(āĀµAĪ½ ā āĪ½AĀµ
)ā ig0 [AĀµ, AĪ½ ]
]dxĀµ dxĪ½
(131)
From this result, which agrees with (129), we see that the net effect of moving aroundthis path is to change the amplitude. Now we have already seen this effect in our discussionof the Aharonov-Bohm effect, for which the curvature at a point is just the magnetic fieldB(x) (assuming the field AĀµ(x) is static). What we see in (131) is just the generalization ofthis to the non-Abelian case (cf. eq. (105)).
Why do we call this a curvature? Actually this is because of the analogy with GR,where we measure the curvature of spacetime by parallel transporting some 4-vector arounda loop. Here we are actually transporting a field around the space of field configurations, inthe Hilbert space of these configurations, by moving along a circuit in spacetime. Thus wecan think of this as a curvature of the gauge field itself.
The effect of the commutator, and of the non-commutating property of the fields, hasa profound effect on the field dynamics. Let us go back to the result we have for the totalaction, eq. (107), then has terms of 3rd and 4th-order in the gauge field:
ā1
4FaĀµĪ½ Ā· FĀµĪ½
a = ā1
2Tr(
T Ā· FĀµĪ½
)(T Ā· FĀµĪ½
)= āg0f
bca āĀµAaĪ½A
ĀµbA
Ī½c ā
g20
4fabcfadeAĀµbA
Ī½cAdĀµAeĪ½ (132)
where the fabc are, as before, the elements characterizing the Lie group algebra (so for simpleangular momentum-style operators, we would have fabc = Īµabc).
26
Thus we immediately expect to see terms corresponding to diagram like those in (a) and(b) above, which correspond to the 1st and 2nd self-interaction terms in (132) above. Thisis in addition to the free gauge line shown in (c).
From this we see a really crucial difference between U(1) gauge fields like those in QED,and the non-Abelian Yang-Mills theory. If we refer back to the discussion just after equation(80), a key feature of ordinary electrodynamics was highlighted, i.e., that photons do notinteract with themselves. However, Yang-Mills gauge fields have their own āself-chargeā,and act as a source for themselves (in addition to having matter fields as their source). TheYang-Mills gauge field is thus fundamentally non-linear. This has profound effects on thedynamics of YM fields; many of these effects have yet to be explored, and we still do nothave anything like a full understanding of them.
And of course all this takes no account of the coupling of YM fields to matter itself, whichmakes it all the more complicated! To properly deal with all of this would take us deep intothe standard model.
Finally, notice that we have not yet quantized this theory! To do this we have to adaptpath integrals to gauge theory, which we now do.
B.4.3: PATH INTEGRALS for GAUGE FIELDS
It is perfectly possible to deal with QED using a conventional canonical approach, andthe results of doing this are strewn across dozens of textbooks and thousands of papers.What is less often emphasized are certain difficulties in such an approach, which provedinsurmountable in the case of non-Abelian gauge theories (at least at until the end of the1960ās). For this reason the success of the path integral approach proved decisive for thesubsequent development of particle physics.
The technical key which opened the door was the development by Fadeev and Popov in1966-67 of their method of integrating over gauge-equivalent fields, following the discoveryof āghostā contributions by Feynman in the early 1960ās. In the following I will explain thesimple picture behind the Fadeev-Popov idea, and then give its formal elaboration.
27
B.4.3 (a) FUNCTIONALLY INTEGRATING over REDUNDANT VARI-ABLES
We wish first of all to understand exactly what is the problem that arises when we haveto deal with gauge fields, in the paht integral formalism and elsewhere. To do this we willbegin with a simpler analogous problem, and see how its solution can be generalized to dealwith gauge fields.
(i) Redundant Variables and Jacobians: Letās start by considering the followingmathematical problem. Suppose we are given some functional I[f ] of a function f(Q) of
a variable Q =(xi; qj
). To focus things, imagine the functional I tells us the total
energy on the earth coming from the sun at some time t (we suppress the dependence ont, and f(Q) is the intensity of sunlight above some point on the earthās surface, havingangular coordinates Ī© = (Īø, Ļ)). Then xi = Ī© = (Īø, Ļ), and the qj are a set of atmosphericvariables like pressure P , temperature T , humidity, etc.
Now as physicists we understand clearly that the intensity depends only on the angle Ī©(measured with respect to the sunās direction); all other variables are irrelevant (and we alsoknow that I[f ] = C
ā«dĪ©f(Ī©), where C is a constant). Suppose, however, we did not know
this. Then we might have some trouble finding the relevant variables amongst all the others- a common problem in, e.g., medical trials. The question then is - if we have the functionalI[f ] written in some arbitrary way as a functional of f(Ī©, qj), how can we extract themeaningful information, ie., how can we get rid of the redundant variables qj?
This problem suggests a rather simpler mathematical question, which I present by wayof an example. Suppose we are given the simple integral
I =
ā«d2R f(R) (133)
where R = (x, y, z) is a vector in 3-d space; BUT we have the restriction that we are onlyallowed to integrate on the surface shown in the figure, defined by z = g(r), where g(r) isjust the āheightā of the surface, as a function of the 2d planar variable r = (x, y, ). Now wemight think we could then write
I =
ā«d3R f(R)Ī“
(z ā g(r
)(??) (134)
Now this would work if the surface was a flat horizontal one, ie., if
f(R) = f(r) āzānf(R)
āzn= 0 (135)
so that f(R) is independent of z. Obviously in this case the problem is trivial - the variablez is quite irrelevant to our considerations.
28
Suppose however that it turns out that we donāt know the explicit form of the equationfor the surface in the form Z = g(z), but we only know that the surface obeys an implicitequation of form
G(R) = 0 (136)
so that the surface might have some arbitrary shape, ie., not just be a plane parallel to thexy plane (in the above example, we have G(R) = z ā g(r)). Then we would write 1
I =
ā«d3R f(R)
ā£ā£ā£āG(R)
āz
ā£ā£ā£Ī“(G(R))
(137)
to take care of the change of variables - we have introduced a Jacobian which takes care ofthe variable rate at which we pass through the surface, at different points in the space, whenwe vary z. All of this is a low-dimensional example of a more general problem; we have afunction
I =
ā«dx f(x) x = (x1, . . . , xn)
=
ā«dQ f(Q)Ī“(q) q = (q1, . . . , qm) (138)
where Q = (x, q) = (x1, . . . , xn; q1, . . . , qm), and f(Q) = f(x), āq; this is the analogue ofthe above problem where the surface is flat and horizontal. Then if we define a surface inQ-space upon which f(Q) varies, but where it is independent of the other coordinates inQ-space that are orthogonal to the surface variables, we can make the same arguments asabove. We define the surface as
G(Q) = 0 (139)
and then we have
I =
ā«dQ f(Q) det
ā£ā£ā£āG(Q)
āq
ā£ā£ā£Ī“(G(Q))
(140)
as the solution to our problem.
1Note that for some function f(x), with zeros at points xoj, with j = 1, 2, . . . , we have Ī“(f(x)
)=
|f ā²(x)|ā1Ī“(xā xoj)
). For I in (137) we have, in the case where G(R) = z ā g(r) (cf. (124), that Ī“
(G(R)
)=ā£ā£ā£āG
āz
ā£ā£ā£ā1Ī“(z ā g(r)). The determinant in (140) is the multi-variable version of this.
29
(ii) Redundancy for Gauge Fields: What we now wish to do is to apply this ob-servation to the problem of functional integration over gauge fields, where the key problemis that instead of a function f(Q) with redundant variables in it, we deal with a functionalZ[Ļ, Ļ; AĀµ
]of a gauge field AĀµ(x) which also has redundant variables in it - the redundancy
being generated from some given AĀµ(x) by making a gauge transformation. The generatingfunctional Z should be invariant under any gauge transformation, since no physical quantityshould depend on which gauge we choose; and indeed we know it is invariant, because theaction is invariant.
What this means is that the obvious form for the generating functional,
Z[Ļ, Ļ; AĀµ
]=
ā«DĻDĻDAĀµ e
i~S[Ļ, Ļ; AĀµ] (??) (141)
cannot be right, because it contains a āhidden infinityā, coming from the integration overall gauge-transformed configuration of AĀµ.
One might argue here that all one needs to do is to fix a gauge, and then calculate fromthere. In the old canonical formulation, this is what was done with QED, but it led tosevere technical problems, which we will note in passing below. But in the path integralformulation, all that one has to do is to extract the Jacobian determinant in (14) (or rather,its generalization to functionals).
What happens if we do just apply (141) naively? We can do this most simply with QED,and so we are interested in the following integral:
I[JĀµ] =
ā«DAĀµ e
i~S0[AĀµ]+
ā«d4x JĀµ(x)AĀµ(x) (142)
where S0[AĀµ] is given by (62), which we rewrite as
S0[AĀµ] = ā 1
4Āµ0
ā«d4x FĀµĪ½(x)FĀµĪ½(x)
=1
2Āµ0
ā«d4x AĀµ(x)
[Ī·ĀµĪ½ā2 ā āĀµāĪ½
]AĀµ(x) (143)
which we can also write in k-space as
S0[AĀµ] = ā 1
2Āµ0
āq
AĀµ(q)[q2Ī·ĀµĪ½ + qĀµqĪ½
]AĪ½(āq) (144)
and you will notice the similarity of this result to that for the phonon system; we can in thesame way divide this free field term into transverse and longitudinal parts, i.e., write
AĀµ(q) = AāĪ½(q)qĀµqĪ½ + Aā„Ī½ (q) [Ī“ĀµĪ½ ā qĀµqĪ½ ] (145)
and we see that the operator QĀµĪ½0 (q) = q2Ī·ĀµĪ½ ā qĀµqĪ½ , or equivalently the operator QĀµĪ½
0 (x) =Ī·ĀµĪ½ā2āāĀµāĪ½ , are both entirely transverse (which is what we would expect for photons, whichare indeed transverse excitations).
30
The next move is then clearly supposed to be to do the functional integral in (142),following the usual line of development:ā«
DAĀµ ei~ā«dDx
[1
2Āµ0
(AĀµ,QĀµĪ½o ,AĪ½
)+(JĀµAĀµ
)]ā¼ 1
|QĀµĪ½o |
eāiĀµo2~ā«dDx (JQā1
o J) (146)
where the inverse operator is:
Qā10 =
(Ī·ĀµĪ½ā2 ā āĀµāĪ½
)ā1(147)
However this operator inverse is formally infinite; writing[Ī·ĀµĪ½ā2 ā āĀµāĪ½
] (Q0(x, xā²)
)ĀµĪ²
= Ī“ĀµĪ²Ī“(xā xā²) (148)
and multiplying to the left by āĀµ, we get
0ĆQ0 = āĪ²Ī“(xā xā²) (149)
which is a contradiction unless Q0 is infinite. This infinity is a reflection of the gaugeinvariance, because AĀµ(q) contains longitudinal (as well as transverse) degrees of freedom,which are untouched by Q0, i.e., they have zero eigenvalue when acted upon by Q0 (andhence infinite eigenvalue when operated on by Qā1
0 ). Any gauge transformation of the form(57), adding a term āĀµĻ(x) to AĀµ, will thus also have zero eigenvalue when acted upon Q0,as we see easily:
QĀµĪ½0 āĀµĻ(x) =
(Ī·ĀµĪ½ā2 ā āĀµāĪ½
)āĀµĻ(x) = 0 (150)
So how do we deal with this? The naive answer is to fix the gauge before doing thecalculation, but this has the disadvantage that we immediately lose Lorentz invariance inthe calculations. In the early days of QFT this was a big problem, actually first solved byTomonaga. Other methods introduced a photon mass into the calculation - this removed theproblem of zero eigenvalues, but destroyed gauge invariance (the mass was set to zero at theend of the calculation). However all such techniques were extremely messy to implement,and quite impossibly complicated for non-Abelian gauge theories.
B.4.3 (b) FADDEEV-POPOV TECHNIQUE
Letās first describe the solution of Fadeev and Popov in geometric terms, and then go onto see how it works in detail.
Imagine the space of all possible configurations of some gauge field AĀµ(x). This is ofcourse a very large space, which we show here in the figure in the form of a 3-d caricature.Now suppose we are in some specific gauge, and we look at all such configurations in thisgauge. We show all such configurations in this gauge on a hyper-sheet, depicted as a simple2-d sheet in the figure. We then write all of these configurations, in this gauge, as
AĀµ(x) āā AĀµ(x) (fixed gauge) (151)
31
and then consider the set of all possible gauge transformation on AĀµ(x) that are requiredto reverse this many-one mapping, ie., to produce the full set of possible configurations; wewrite these as
AĀµa(x) = AĀµa(x) + Ī±Āµa(x) ā” AĀµa(x) +(āĀµĪa(x) + g0fabcĪ
b(x)AĀµc (x)) (152)
where it is understood that we deal here with infinite sets - the set of all gauge configurationsAĀµ(x) ā” AĀµa(x) is produced by starting with the set of all configurations possible in somefixed gauge AĀµ(x), and then adding all possible gauge transformations Ī±Āµ(x) ā” Ī±Āµa(x),for arbitrary differentiable functions Ī±Āµ(x). More precisely, we define the group G of allpossible equivalence classes of a given AĀµ(x), called in mathematics the āorbitā of the gaugegroup; the full set of functions AĀµ(x) is then produced by the set of all orbits of all possibleconfigurations AĀµ(x) in a fixed gauge. To show that this geometrical picture is meaningfulis a job for mathematicians, which we will not enter into here.
Consider now the functional integral in (142) again - we will now write
I[JĀµ] =
ā«DAĀµ e
i~S0[AĀµ]+
ā«d4x JĀµ(x)AĀµ(x)
=
ā«DĪ±Āµ(x)
ā«DAĀµ e
i~S0[AĀµ+Ī±Āµ]+
ā«d4x JĀµ(x)[AĀµ+Ī±Āµ]
=
ā«DĪ±Āµ
ā«DAĀµ(x) e
i~S0[AĀµ]+
ā«d4x JĀµ(x)AĀµ(x) (153)
and we see that the problem in (153) is that integrand appearing in the functional integral,i.e., the function exp
i/~(S0[AĀµ] +
ā«d4x JĀµ(x)AĀµ(x)
), is invariant under changes in gauge,
so that the functional integrationā«DĪ±Āµ(x) simply produces an infinite multiplication of the
answer - what we would like to do is to get rid of this redundant multiplication factor.To do so, letās recall the development in eqs. (138)-(140), and do the same now for
gauge functionals, instead of just ordinary functions. Notice that we cannot just stick a
32
factor like Ī“(Ļ(x)
)into the functional integral, for two reasons. First, we want to keep
the theory gauge invariant, for many different reasons (most of which have not yet becomeapparent). Second, in the functional integral (142) and (153), we donāt actually yet knowhow to properly āmeasureā the āvolumeā, in the hyperspace of functionals, of the domaindefined by a specific gauge choice. To see this we simply recall the determinant appearing inthe finite-dimensional integral in (137), which acts as a Jacobian for the change of variable- it is not yet clear how to generalize this to cover for the much larger space of functions weare now dealing with.
Thus it is not enough to just stick a factor Ī“(G(AĀµ(x)
))into the functional integral
(141), and integrate over the full AĀµ(x). What we want is something like
Z[Ļ, Ļ, AĀµ
]=
ā«DĪ±Āµ(x)
ā«DAĀµ āFP Ī“
(G(AĀµ(x)
)) ā«DĻDĻ e
i~S[Ļ, Ļ,AĀµ] (154a)
where āFP is the relevant determinant, now called the āFadeev-Popov determinantā. Ourjob is to find an expression for it, which is valid for any coneeivable form for the gaugetransformation.
Formally this is easy. In exact analogy with (140), we have
āFP = detā£ā£ā£Ī“G(AĀµ(x)
)Ī“Ī±Āµ(xā²)
ā£ā£ā£ ā” detā£ā£ā£Ī“G(AaĀµ(x)
)Ī“Ī±Āµb (xā²)
ā£ā£ā£ = āabFP (x, xā²)
for G(AĀµ(x)
)= 0 (gauge constraint) (155a)
so that
ā«DĪ±ĀµāFP [Ī±] = 1 (155b)
However these formal expressions are not terribly illuminating until we try to use them- this we will see in the next sub-section, for both QED and for non-Abelian gauge theories.
In preparation for this, letās first rewrite (155a) for a general non-Abelian gauge theory.If we assume that the non-Abelian gauge transformation can always be characterized byan āangleā in some space of gauge transformations (this angle being Īø(x) for U(1) trans-formations and the hyperangle Ī(x) = Īa(x) for YM theories), then we can rewrite theFadeev-Popov factor in (154) asā«
DĪ±Āµ(x) āFP Ī“G(AĀµ(x)
)ā
ā«DĪ(x) āFP (x, xā²) Ī“G
(AĀµ(xā²)
)(156)
where now
āFP (x, xā²) ā” āabFP (x, xā²) = det
ā£ā£ā£Ī“G(AaĀµ(x))
Ī“Īb(xā²)
ā£ā£ā£ ā” det |Mab(x, xā²)| (157)
This is a much more transparent formula since the functional integral is over hyperanglesĪa(x) in some finite-dimensional space, something we know how to do. Note that the Fadeev-Popov determinant has a physical meaning that is evident from (157). If we make an
33
infinitesimal change in the gauge by effecting a change Ī“Ī(xā²) in the gauge angle, thenāFP (x, xā²) measures the āresponseā of the gauge-fixing function G
(AĀµ(x)
)to this change.
In other words, if we write
Ī“G(AĀµa(x)
)=
ā«d4xā² Mab(x, x
ā²) Ī“Īb(xā²) (158)
then Mab(x, xā²) measures the response of the a-th component of the gauge constraint functionG(AĀµa(x)
)to the change Ī“Īb(x
ā²) in the gauge angle.
B.4.4: GAUGE FIELDS in HIGH-ENERGY PHYSICS
We will not go into too much detail here. The case of QED is relatively easy to under-stand, as we will see. Non-Abelian gauge theories are more messy because the number ofdegrees of freedom is large. It would take us too far afield to discuss the application of theresults to either QED or to the electroweak theory (and the latter requires, in any case, anappeal to a spontaneous symmetry-breaking, i.e., to the Anderson-Higgs mechanism).
B.4.4 (a) QUANTUM ELECTRODYNAMICS
This is of course the simplest theory. We have parametrized gauge transformations inthis theory by the shifty AĀµ(x) āā AĀµ(x) + āĀµĻ(x), but it is now time to formulate this alittle more generally. We assume the gauge constraint in the form
G(AĀµ(x, Īø(x)
))= 0
G(AĀµ(x)
)= āĀµA
Āµ(x) (Lorentz)G(AĀµ(x)
)= ā Ā·A(x) (Coulomb)
G(AĀµ(x)
)= Az(x) (Axial)
(159)
when on the RHS of (159) we give 3 common examples of fixed gauge choices in QED. Inwhat follows we will use a slight variation on the Lorentz gauge, by adding an extra functionĻ(x); this can simply be viewed as part of the gauge transformation, and its usefulness willbecome apparent later. We then write
G(AĀµ) = āĀµAĀµ(x)ā Ļ(x) ā”(āĀµAĀµ(x) + ā2Īø(x)
)ā Ļ(x) (160)
34
where AĀµ(x) is some fixed gauge satisfying G(AĀµ) = 0. Now eqtn (155b) implies thatā«DĪø(x) Ī“
(G(AĀµ)
)= āā1
FP ; thus we have 2ā«DĪø(x) Ī“
(G(AĀµ)
)=(
det |ā2|)ā1
(161)
so that the FP determinant is
āFP (x, xā²) = det |(āĀµāĀµ)x,xā²| ā” det |ā2| (162)
which is a constant āPF ; recall that the determinant det |ā2| ā” det |ā2xxā² |, where āxxā² ā”
ćx|ā2|xā²ć = āxĪ“(x ā xā²), should be interpreted as a matrix, which is diagonal in x-space(compare notes on path integrals in section A). Thus we get
I[AĀµ] =
ā«DAĀµ(x) e
i~S[AĀµ]
= āFP
ā«DAĀµ(x) Ī“
(āĀµAĀµ ā Ļ(x)
)ei~S[A] (163)
with the constant outside the integration. However we still have to deal with the Ī“-functionalĪ“(āĀµA ā Ļ), and this is where the āt Hooft trick comes in handy. Suppose we functionallyintegrate now over Ļ(x), but now inserting some functions Ht(x) in the integral, i.e., wemultiply I by the factor
ā«DĻ(x) Ht(x), (where we if we choose Ht = 1, then we get rid of
the Ļ-function as though it had never been there at all). We then have
I[AĀµ] =1
N
ā«DAĀµ Ht(ā
ĀµAĀµ) ei~S[A] (164)
where N , the normalizing factor, is just a constant:
N =
ā«DĻ Ht(Ļ)
āFP
(165)
The nice thing about this trick is that we can now make Ht(Ļ) the exponential of some-thing - this way everything is now in the exponential, and we can read off the Feynman rules.The choice made by āt Hooft was
Ht(Ļ) = expāi2Ī±
ā«d4x
1
Āµ0~Ļ2(x) (166)
where Ī± is just a number; we then finally get (ignoring the factor 1/N ):
ZQED[JĀµ] =
ā«DĻDĻ
ā«DAĀµ e
i~
S0[A]ā 1
2Ī±
ā«d4x 1
Āµ0(āĀµAĀµ)2+
ā«d4x JĀµ(x)AĀµ(x)
(167)
2Recall the footnote to eqtn. (137). We are just using the fundamental generalization of the usual formulathat
ā«dx Ī“(axā b) = 1/a to
ā«DĪø(x) Ī“
(ā2Īø(x)ā f(x)
)= 1/ det(ā2).
35
with S0[AĀµ] given by (143) or (144). We may now do the usual functional integration thatled to (146) and (147), but now we can write
ZQED[JĀµ] =
ā«DĻDĻ
ā«DAĀµ eā
i~ā«d4x
[1
2Āµ0AĀµ(x)D0
ĀµĪ½(x,xā²)AĪ½(xā²)]āā«d4x JĀµ(x)AĀµ(x)
(168)
whereD0ĀµĪ½(x, x
ā²) =(Ī·ĀµĪ½ā
2 + (Ī±ā1 ā 1)āĀµāĪ½)ā1
(169)
and this operator does not have the pathology of Q0(x, xā²) in (147); it has an equivalentrepresentation in momentum space as
D0ĀµĪ½(q) = ā 1
q2 + iĪµ[Ī·ĀµĪ½ + (Ī±ā 1)qĀµqĪ½ ] (170)
where as usual, qĀµ = qĀµ/|q|. The parameter Ī± now acts as a āregularizerā, getting rid ofthe āzero modeā problem we had before. Different values of Ī± give different gauges that hadbeen used in QED long before Fadeev and Popov; for example
Ī± = 1 Feynman gaugeĪ± = 0 Landau gauge
We may now read off the Feynman rules for QED (compare the rules already derived forDirac fermions in section B.3, eqs. (44)-(48)). We have, for the Lagrangian
L = ĻĪ±(iĪ³ĀµDĀµ ām
)ĻĪ± + JĀµA
Āµ ā 1
2Ī±Āµ0
(āĀµAĀµ)2 ā 1
2Āµ0
FĀµĪ½FĀµĪ½ (171)
the following rules:
1. The fermion propagator is GĪ±Ī²0 (kj) = i~SĪ±Ī²F (kj), for a fermion line carrying momentum
kj, as shown in the figure (172);
2.The photon propagator is given by the factor, as shown in the figure (173),
i~D0ĀµĪ½(q) =
āi~q2 + iĪµ
[Ī·ĀµĪ½ + (Ī±ā 1)qĀµqĪ½ ]
3. Each interaction vertex contributes the factor, as shown in the figure (174)
ā i~g = ā i
~eĪ³ĀµĪ±Ī² Ī“
(āi
ki +āj
qj)
36
where the Ī³-matrix now contains spin indices Ī±, Ī² acting between ĻĪ± and ĻĪ², and e is thecharge of the particle.
Then, as usual, we sum over all spinor indices, integrate over all moments ki and qj, andmultiply by a symmetry factor; and the momentum integrations appear in the form
(ā1)Lā«
dk1
(2Ļ)DĀ· Ā· Ā·ā«
dkn(2Ļ)D
ā«dq1
(2Ļ)DĀ· Ā· Ā·ā«
dqm(2Ļ)D
(175)
where L is the number of fermion loops.Actually, the rules for QED are almost identical in form to those for the electron-phonon
problem, covered in section B.3; and the topology of the diagrams is identical to that for thecoupled field problem of section B.3.
B.4.4 (b) YANG-MILLS SU(N) GAUGE THEORY
Let us go back to our key formula in (157), for the FP determinant. To make thingsclear, we will simply go through the same manoeuvres as we did for the U(1) gauge field, wewill pick the same generalized Lorentz gauge,
G(AĀµa(x)
)= āĀµA
Āµa ā Ļa(x)
= āĀµAĀµa + āĀµ
(āĀµĪa + g0fabcĪ
bAĀµc)ā Ļa(x) (176)
Following through the steps as for the QED calculation, we then find that
āabFP (x, xā²) = det |Ī“ab āz + g0f
abcāĀµAĀµc | Ī“(xā xā²) (177)
We may also carry out the integration using the āt Hooft trick, introducing the obviousgeneralization of (166) as
Ht(Ļa) = expāi2Ī±
ā«d4x
(1
~Ļa(x)Ļa(x)
)(178)
37
and so we can write the partition function or generating functional as
ZYM [JaĀµ ] =
ā«DAĀµ(x) āFP e
i~
(S0YM [A]+
ā«d4x[JĀµ(x)AĀµ(x)ā 1
2Ī±(āĀµAĀµ)2]
)
=
ā«DAĀµa(x) āab
FP (x) ei~
(S0YM [AĀµb (x)]+
ā«d4x[JbĀµ(x)AĀµb (x)ā 1
2Ī±(āĀµAĀµ)2]
)(179)
with S0YM [AĀµ] given by:
S0YM [AĀµ] = ā1
~
ā«d4x FĀµĪ½(x)FĀµĪ½(x) (180)
However there is now a big difference. In the case of QED, the Faddeev-Popov determi-nant āFP in (162) was just a constant, independent of the field AĀµ(x). This is no longerthe case - the determinant in (177) is clearly dependent on AĀµ(x), and so we cannot takeit outside the functional integral in (179), as we did for QED in (163). This makes thenon-Abelian case much more difficult.
At this point the founders of modern QFT introduced a trick that had been invented byFeynman, during his earlier research into quantum gravity. He wrote the determinant āab
FP
as a functional integration over a set of fake fermion fields - this takes us back to the resultwe found in section B.2 for integration over Fermion fields, that they give a determinant inthe numerator (cf. eqs. (25) and (28) in section B.2). Thus we write
āabFP (x, xā²) = det |Mab(x, xā²)| =
ā«Dc(x)
ā«Dc(x) eiSGH [c,c]
SGH [c, c] =
ā«d4x
ā«d4xā² ca(x)Mab(x, xā²)cb(x) (181)
where the fermion fields c(x) ā” ca(x) and c(x) obey all the Grassmann rules discussed before.The fields were called āghost fieldsā by Feynman, and they created much confusion at thetime (Feynman introduced them to prevent the theory from losing unitarity - it was onlylater that their role as a determinant was understood).
Thus we can finally write that
ZYM [JaĀµ ] =
ā«DcDc
ā«DAĀµ e
i~
(S0YM [AĀµ]+SGH [c,c]+
ā«d4x [JĀµ(x)AĀµ(x)ā 1
2Ī±(āĀµAĀµ)2]
)(182)
The Feynman rules for this functional are extracted by writing the action in terms of anon-interacting āfree fieldā part
S0[c, c; AĀµ] =
ā«d4x
[c(x)ā2c(x)ā 1
4FĀµĪ½(x)FĀµĪ½(x)ā 1
2Ī±(āĀµA
Āµ)2
](183)
38
and an interacting part
Sint[c, c; AĀµ] = āā«d4x
ig0(cfc āĀµA
Āµ) +1
2g0FĀµĪ½f AĀµAĪ½ ā 1
4g2
0 (f f)(AĀµAĀµ)2
= ā
ā«d4x
ig0(caf
abccbāĀµAĀµc ) +
g0
2F aĀµĪ½f
bca A
ĀµbA
Ī½c ā
g20
4fabcf
dea A
bĀµA
cĀµA
ĀµdA
Ī½e
(184)
and this allows us to read off the Feynman rules for the Yang-Mills gauge field (which, Iemphasize, is still not coupled to the real world fermions that one might expect to exist ina real theory like the electroweak theory). We then have
1. A vector boson propagator given by the YM generalization of (173), as shown in thediagram (185)
i~D0ĀµĪ½(q) =
āi~q2 + iĪµ
Ī“ab [Ī·ĀµĪ½ + (Ī±ā 1)qĀµqĪ½ ]
2. A ghost propagator given by the diagram (186)
āiāab(k) = āiĪ“ab 1
k2 + iĪµ
where there is no ~ because of the definition (181); we notice that the ghost propagator ismassless, as is obvious from (184), and is a scalar field as well.
3. Finally, from the interaction in (184) we get a whole variety of vertices, of the followingform:
(a) 3-boson vertex, given by the 2nd term in (184), as
ā i~Ī»3 =
i
~g0f
abc [Ī·ĀµĪ½(q1 ā q2)Ī» + Ī·Ī½Ī»(q2 ā q3)Āµ + Ī·Ī»Āµ(q3 ā q1)Ī½ ] Ī“(q1 + q2 + q3)
in which 3 vector bosons AĀµ(q) interact at a point in spacetime, each scattering off theirmutual ācurvatureā;
(b) A 4-boson vertex, given by the 3rd term in (184), as
i~Ī»4 = ā i
~g20[fabe f
cde (Ī·ĀµĪ»Ī·Ī½Ļ ā Ī·ĀµĻĪ·Ī½Ī»)
+ face fbde (Ī·ĀµĪ½Ī·Ī»Ļ ā Ī·ĀµĻĪ·Ī»Ī½)
+ fade fcbe (Ī·ĀµĪ»Ī·Ī½Ļ ā Ī·ĀµĪ½Ī·Ī»Ļ)]Ī“(q1 + q2 + q3 + q4)
39
with 4 bosons coupled at a point;(c) A boson-ghost fermion interaction, coming from the 1st term in (184), given by
i
~g3 = g0f
abcqĀµ
which is produced by the absorption of a gauge boson by the ghost fermion.
Finally, we integrate over momenta as before - however, there are no external ghostfermion lines, and we still have a loop factor (ā1)L for L ghost fermion loops.
This concludes this introduction to gauge fields in QFT. One of the most interestingtopics that is usefully examined in this formalism, outside of particle physics, is the varietyof non-relativistic fermionic and bosonic superfluid in Nature, as well as spin fluids. And ofcourse, if we go to spin-2 gauge fields, we can also discuss quantum gravity.
40