My thesis advisor Adam Bincer gave marvelous lectures in 732 and 831 on the Dirac equation. I took 831 in 1979 (no, it was 1980) from him as a college junior, a happy chemistry major. His influence was so strong that two days into the semester I decided to switch to physics for graduate school.
I am so glad that in those days we students took meticulous notes on paper, so that I can share them with you today…
If we simply wish to force the Schrodinger equation to comply with relativity then a simple replacement of the kinetic energy
should suffice. So one could be led to believe. This introduces correction terms into the Hamiltonian for an atom that we can handle using perturbation theory
and replacement of the dynamical variables with quantum operators via the canonical quantization procedure (equivalent to replacing Poisson brackets with commutators)
results in the Klien-Gordon wave equation.
Unfortunately the positive-definite quantity
no longer occurs in the “probability flux conservation law”
It is trivial to show that for the Klien-Gordon equation
obeys
but this is not always positive, and therefore cannot be interpreted as a probability density. We certainly want probabilities of finding particle in any region to be positive or zero. We can get around this by interpreting this quantity as a charge density instead, but that doesn’t let us use it as a probability, so that functionality is lost, which breaks the measurement theory of the system. This is not the only problem. If we look for solutions such as
and so we have negative energy states! Furthermore the negative energy states have a negative probability density. Since the positive energy states all have positive and the negative energy states all have negative . As far as Fourier analysis is concerned this is no problem, we simply have two Hilbert spaces that with proper choice of norm can be made disjoint and orthogonal. The negative energy states do pose a stability problem, since the energy can be arbitrarily large and negative the theory has no stable ground state.
The current for these plane wave states can be seen to be
where the index can be , meaning , and respectively with . Therefore is the time component of the relativistic current four vector. Normalization of these relativistic states is a trickier matter. Recall that
must be the probability of finding the particle in a small volume in a particular location . If we Lorentz boost into another frame moving parallel to the -axis at then
and in order to keep the probability the same in that volume for all observers the factor must be compensated for.
Fortunately contains a hint; normalize so that there are particles per unit volume so that
and so we choose so that
where we have calibrated the volume of all of space to be one. This is legal since any scattering results must be volume independent anyway.
Since the square root operator appears to be the source of all evil, Dirac proposed a linear wave equation in order to avoid it. He suggested the form
where from now on we will take . In order that this comform to the requirements of relativity we must have
The square of the operator acting on must be the same as the Klein-Gordon equation.
There must exist a positive probability density such that
We want an intact measurement theory.
The operators and can have no coordinate or momentum dependence.
Requirement one means that
which after the operator substitution becomes
If we define the
we see that these relations can be met by
and
It should be fairly clear that no numbers satisfy these requirements (these are the algebraic rules of a Clifford algebra) We have three matrices that satisfy these relations
However we need four such objects, not three. It is easy to show that
where the “” means a unit matrix, satisfies all of the required algebraic relations. It appears that simply satisfying relativistic requirements is intimately tied up with spin. The Dirac equation is therefore a matrix equation whose eigenstates are spinors, irreducible representations of a Clifford algebra.
We solve this first for the relativistic plane wave states
results in four linear equations for the four components (not space-time components, something quite new) which have a solution if the determinant
from which we find
which was a requirement laid down by Dirac. We can procede to find the actual spinors by defining
Then we find that
We now choose
We see that there will be four solutions, what do they describe? A spin one-half electron only needs two degrees of freedom, so clearly these have more structure than we need. To find out what these wavefunctions represent, multiply the Dirac equation by from the left
and define the four-vector (which {\bf is} a space-time four-vector)
and we obtain a form of the Dirac equation that does not distinguish from any other coordinate (the covariant form)
where we make the short-hand notation
with denoting , denoting , and so forth, and we sum over repeated indices so that
In space-time (Minkowski space) we distinguish between upper (contravariant) and lower (covariant)indices, and translate between them with a metric tensor
The Dirac matrices satisfy
also the action of conjugation can be accomplished with
which you should verify. Recall the basic matrix manipulations
At any rate these relations can be used to obtain the probability and currents that we need to construct a meaningful theory.
Conjugation of the Dirac equation leads to a form
with different signs for the time and spatial derivative terms. Since
multiplication on the right by and moving through results in a covariant form
where
which is called the adjoint spinor to distinguish it from the simple conjugate. Multiply the Dirac equation on the left by the adjoint spinor and the adjoint equation on the right by and subtract to obtain the desired current conservation law
which has a probability density that is manifestly positive (so a big win here).
What do the Dirac states represent?
We now re-interpret the spectrum of the Dirac theory. Return to
We now choose
and get the corresponding object from one of these equations
These states will be shown to represent spin up/down (meaning the -component of angular momentum in the rest frame is ), and make sense for , since if the two lower components would blow up rather than vanish in the rest frame. We calculate as .
There are two other linearly independent solutions. Pick
and get the corresponding object from the other equation
This pair of solutions only makes sense if , since , which blows up in the nonrelativistic limit if , and so we have not escaped the negative energy problem.
We have four linearly independent solutions, and the spinor portions of the respective wavefunctions are
The first two are positive energy, and the second two are negative energy. We will interpret the spin-up, spin-down components of the electron wavefunction to be
and the other two components to represent the anti-electron. Unfortunately since the spectrum has no lowest state, there will be no stable states for one particle systems. We will later prove that the electron obeys the Pauli Exclusion Principle; one or no particles per quantum state. Our quantum states are labeled by spin up/down and the energy.
If we think of a “naive” vacuum as a state containing none of these particle states and invent creation operators , then there are positive and negative energy excitations.
create positive energy excitation associate with wavefunctions
However
create negative energy excitation associate with wavefunctions
This will result in a sort of negative-feedback in which any interactions will stimulate the production of negative energy states in an effort to descend into a ground state, which is lower in energy than the state.
The figure above illustrates such a negative energy, spin-down state
Dirac solved the instability problem by reinventing the concept of vacuum. The vacuum (lowest energy state)consists of the state with all negative energy levels filled. This is called the Dirac sea, and we will denote it by .
We define the energy of this to be zero, and its momentum, charge, and spin to be zero as well and measure the quantum numbers of all states relative to this zero point.
Note that (once we establish the exclusion principle)
since all negative energy states are already filled!
Antiparticles.of greater significance is that
is a hole in the vacuum, subtracting a negative energy, a momentum and an up-spin from the vacuum. This “hole” in the vacuum can be thought of as an excitation created by a “hole creation” operator
producing a spin-down, positive energy state (the dotted circle).
If the states created by are electrons, this hole is the anti-electron, called a positron. It is a positive energy excitation
(remember that Dirac solutions had wavefunctions). and its wavefunction is
Schrodinger wrote down the wave equation in 1926. Dirac wrote his down in 1928, and by 1932 anti-electrons were detected and identified in cloud chambers.
We also need to normalize the spinors, keep in mind the wave portions of free particle wavefunctions are normalized to a delta function. The only constraint placed on this normalization by relativity is that it should be a Lorentz invariant, and so we again normalize to particles per unit volume so that the factors from the energy and the volume element induced under Lorentz transformation cancel one another
Contract onto ;
and so we find that
the contraction of this onto is
It is easy to show that the contractions below are also true