The first time I ever heard the term, "gauge invariance", I didn't know what to make of it. Of course it didn't help that Hungarian physics literature uses the term, "mértékinvariancia", which, literally translated, means something like "invariance of measure". But to be honest, the English phrase doesn't appear to be much more meaningful either.
It wasn't until I came across an excellent informal introduction to the concept in Aitchison's and Hey's book, Gauge Theories in Particle Physics, that I finally began to understand what this whole concept was all about. And it didn't take very long to realize just how powerful a concept it really is!
The idea is simple and has its roots in the well-known (non-gauge) invariances in classical physics. For instance, everyone knows that the equations of physics remain invariant under, say, a translation of the coordinate system. This is just a fancy way of saying that there are no absolute positions; what matters is not where an object is in absolute terms, but where it is relative to other objects, i.e., coordinate differences.
So when you have an equation of physics involving the x coordinate of two
objects, x1 and x2, the equation may contain their
differences but not the absolute coordinates themselves. An equation in the form,
Or, even for a single object, you may have a physical equation that contains the time
derivative of its x coordinate. Since
Translations, rotations, time translations, Lorentz-boosts... these transformations are
all geometrical. But is it only geometrical transformations under which physical equations
remain invariant? Of course not. While much of physics is about geometry (some hope that
one day, all of it will be), physics also deals with non-geometrical quantities. Take, for
instance, voltage. It is another one of those "relative" quantities: what
matters is not the absolute voltage but the relative potential between two conductors. In
other words, adding a constant to all voltages should leave physics equations invariant.
This may, at first, look to be at odds with equations like
There is one common characteristic to these transformations under which physical laws remain unchanged. Namely that all are global: the value that characterizes the transformation is the same everywhere in all of spacetime.
What if this is not so? What if we use a transformation that is characterized by a value that is different everywhere (e.g., a smooth function of spacetime coordinates, as opposed to a constant?)
At first sight, the idea may appear like madness. Such an arbitrary transformation must surely destroy the validity of any physics equation.
Then again... there's another way of looking at this. Okay, so our physics equation was mangled by that strange transformation. But is there any way to unmangle it?
Surprisingly, the answer is yes. Even more surprisingly, we find that when the equations are unmangled, i.e., changed to accommodate our strange transformation, the new components that appear in them will correspond with known physical forces!
By far the simplest gauge theory is electromagnetism. And by far the simplest way to present electromagnetism as a gauge theory is through the non-relativistic Schrödinger equation of a particle moving in empty space:
iħ ∂φ= –ħ²∇²φ ∂t 2m
Although the equation contains the wave function φ, we know that the actual probability of finding a particle in some
state is a function of
φ → eip(x, t)φ
where
Let's see what happens to the Schrödinger equation though when we apply this transformation. First, the left-hand side:
∂φ'= ∂eip(x, t)φ + eip(x, t) ∂φ= eip(x, t) ┌
│
│
└i ∂p(x, t)φ + ∂φ┐
│
│
┘∂t ∂t ∂t ∂t ∂t
Next, the right-hand side, which is a bit more difficult to tackle, but hey, it's just straightforward algebraic manipulation:
∇²φ' = ∇[∇(eip(x, t)φ)] = ∇[∇(eip(x, t))φ + eip(x, t)∇φ] =
∇(eip(x, t)i∇p(x, t)φ + eip(x, t)∇φ) = ∇[eip(x, t)(i∇p(x, t)φ + ∇φ)] =
∇eip(x, t)(i∇p(x, t)φ + ∇φ) + eip(x, t)∇(i∇p(x, t)φ + ∇φ) =
eip(x, t)i∇p(x, t)[i∇p(x, t)φ + ∇φ] + eip(x, t)[i∇²p(x, t)φ + i∇p(x, t)∇φ + ∇²φ] =
eip(x, t){i∇p(x, t)[i∇p(x, t)φ + ∇φ] + i∇²p(x, t)φ + i∇p(x, t)∇φ + ∇²φ} =
eip(x, t){∇²φ + 2i∇p(x, t)∇φ – [∇p(x, t)]²φ + i∇²p(x, t)φ} =
eip(x, t){[∇ + i∇p(x, t)]²φ}
On both sides of the equation, we now have an extra factor eip(x, t) which we can safely drop, resulting in the following equation:
iħ ┌
│
│
└i ∂p(x, t)φ + ∂φ┐
│
│
┘= –ħ²[∇ + i∇p(x, t)]2φ ∂t ∂t 2m
Or
iħ ∂φ= –ħ²┌
│
│
└[∇ + i∇p(x, t)]2 – 2m ∂p(x, t)┐
│
│
┘φ ∂t 2m ħ ∂t
Whatever it is, it is definitely not the Schrödinger equation of a particle in empty
space. In other words, we can conclude that the Schrödinger equation is not
invariant under the gauge transformation
Now of course that is not exactly surprising. We have, after all, mangled the wave function beyond recognition by changing its complex phase with an arbitrary amount at each point of spacetime.
But what if we start with a Schrödinger equation that already includes components that look like the ones we ended up with? For instance:
(1)
iħ ∂φ= –ħ²[(∇ + iA)2 + V] φ ∂t 2m
Starting with this equation, when we perform the gauge transformation
(2)A → A + ∇p(x, t)
V → V – 2m ∂p(x, t)ħ ∂t
after which our new Schrödinger equation remains valid.
What we have here is a vectorial and a scalar quantity, and a pair of transformation
laws that supposedly do not alter the validity of our physics. But we already have just
such a set of quantities in physics, in electromagnetism. If A were the
electromagnetic vector potential and V were the scalar potential, the
transformation rules (2) would leave measurable physical quantities—namely, the
magnetic field,
This is nothing short of remarkable. We have, after all, made no a priori assumptions about electromagnetism. We started off with the Schrödinger equation of a particle moving in empty space, observed that the probability of finding a particle in a state does not depend on the complex phase of its wave function, and made an attempt to incorporate this invariance into the equation itself. We were successful, and in the process, we managed to recover a vectorial and a scalar quantity that satisfy Maxwell's equations: in a sense, we "invented" electromagnetism!
Pure magic, if you ask me. But this really is just the beginning. The transformations
represented by
Aitchison, I. J. R. & Hey, A. J. G., Gauge Theories in Particle Physics, Institute of Physics Publishing, 1996