2 Free quantised scalar field
We are now almost ready to quantise the scalar field , that is, find a QFT that follows the KG field as specified by the Lagrangian (51). In this section we will consider this field to be a free field, i.e. one that has no interactions, not even with itself. This is not a very realistic description of nature but it is a very important first step. Our description of realistic interacting fields will largely be based on free fields and we will consider the interaction as a small perturbation.
Our discussion is analogous to going from a finite-dimensional Lagrangian problem, where we had degrees of freedom that each took a numeric value, to a classical field theory, where we assigned each point in spacetime a value . When describing such a system using QM, we defined operators for the position and momentum of each of these particles. Similarly, a quantum field is assigns an operator to every point in spacetime. Before we can actually write down these fields we need to do a bit more revising.
2.1 Heisenberg and Schrödinger pictures
Most discussions of QM describe the wavefunction as dynamic and time dependent and the operators acting on them as static. This is called the Schrödinger picture which describes wavefunctions as manifestly time dependent through the Schrödinger equation (2)
| (2) |
The Hamiton operator itself does not depend on time.33 3 Note that for a brief period we will use a hat to indicate that an object is an operator Starting from some initial state , the time evolution of the state is described by
| (65) |
The operator fulfils the differential equation
| (66) |
Please keep in mind that this is a differential equation for an operator rather than a function. To ensure that remains unchanged, we need the operator to be unitary, i.e. .
This picture is not ideal to derive a QFT in because the language we have developed for field theories explicitly talks about time dependence. Instead, we will use the Heisenberg picture where operators depend on time while states do not. To transform between the two pictures, we will use a unitary operator. The state of our system as it was at obviously does not depend on time which makes it our new state vector
| (67) |
To make use of this new picture, we also need to consider how an arbitrary operator transforms. It is not difficult to see that
| (68) |
leaves observables invariant. Crucially for what we are about to attempt, this transformation also leaves the commutation relation between and invariant
| (69) |
This new operator fulfils the EoM if has explicit time dependence through the Heisenberg equation assuming has no explicit time dependence
| (70) |
This is very similar to the classical equivalent Poisson bracket
| (71) |
Suggested Exercise
Show that (68) indeed leaves expectation values invariant and that the EoM is follows from the Schrödinger equation.
From now on we will always assume the Heisenberg picture and drop in the subscript .
2.2 Equal-time canonical commutation relations
One of the first things we have learned about QM was that the position and momentum operators do not commute, i.e. . Extending this for multiple particles, we can define the operator for the position (or momentum) of the -th particle as (). In position space, these just take the form
| (72) |
The commutation relation is now extended to
| (73a) | ||||
| (73b) | ||||
since operators acting on different degrees of freedom will always commute. These are referred to as canonical commutation relations.
In the continuum limit, we have locations in space instead indices , field operators instead of coordinate operators , and conjugate momenta operator instead of momentum operators . We can now write the CCRs at equal time
| (74) | ||||
| (75) |
Note that these are in the Heisenberg picture as they depend on time! While this looks like it might not be Lorentz invariant (after all, we are treating time and space very different here), the formalism we are developing is invariant which will become clearer later.
We will get a better idea of what the operator actually does once we write down a solution that satisfies both and the CCRs. However, it is helpful to get an idea early on so it is a bit less abstract. Consider the vacuum state , i.e. a completely empty universe devoid of any particles (we will formalise this later as well but this suffices for now), as our initial state. We will see shortly that applying on the vacuum will create a new particle at position .
We can verify the EoM and follow from the CCRs. For this, we use that the Hamiltonian is (55)
| (76) |
where we have used that for the KG field which translates to operators . We now calculate commutators
| (77) |
Here the derivative in the operator is w.r.t. and therefore it commutes with . Therefore,
| (78) |
which is fulfilling the EoM as .
Suggested Exercise
Show that
| (79) |
Together, we can deduce the KG equation as
| (80) |
2.3 Solution for the field operators
In (63) we have seen a solution to the KG wave equation. To turn this into a solution for the field operator , we promote the coefficients and to operators, i.e. and . Explicitly, we have for and (now written in terms of the four-vector rather than and )
| (81a) | |||
| (81b) | |||
where we still have .
It is easy to be overwhelmed by the number of symbols in the equation above so let us focus on the relevant terms to schematically write
| (82a) | |||
| (82b) | |||
These equations should look very familiar as they are (up to the dependence), the decomposition of and in terms of ladder operators and of the quantum harmonic oscillator. We can find the CCRs for the and operators as
| (83a) | ||||
| (83b) | ||||
The harmonic oscillator became a lot simpler once we wrote the Hamiltonian in terms of the ladder operators. We can do something similar here with one exception
| (85) | ||||
| (86) | ||||
Let us swap and in the terms that have a positive exponential. Note that this only changes the spatial components, i.e. .
| (87) | ||||
We can use that
| (88) |
to set and
| (89) | ||||
| (90) | ||||
where we have used since this on-shell condition was always assumed. We now need to multiply out the ladder operators to find
| (91) |
Swapping the integration variable in the second term and commuting the to the right of the
| (92) |
This is again very similar to the harmonic oscillator where we ended up setting and . Here we have a problem though as the commutator evaluates to which is not defined.
2.4 Vacuum energy and normal ordering
What does it mean for to be infinite like this? In practice, nothing. Experiments can only ever measure the difference between energies so that this term will always cancel. However, this is rather unsatisfactory as it does not really solve the underlying problem that our result is infinite, nor does it help us work with the expression. It is quite unwieldy to have infinities like this that are not regulated somehow as it is all too easy to accidentally have the infinity appear “for real” if we for example had actually used the commutation relation rather than just writing the commutator. We need a formal and rigorous way of handling these terms from the beginning.
Lattice regularisation
The simplest way to formalise the removal of this to put our system in a box. Rather than having an infinitely large universe which allows for uncountably infinitely many momenta, we have a finite volume of length (for example with periodic boundary conditions). This means only discrete momenta are allowed
| (93) |
and we translate
| (94) | ||||
| (95) |
Now we have which is perfectly regular as long as is finite. In the limit , still divergences but we can now define exactly how we want to subtract the problem. Unfortunately, we are still not done because
| (96) |
Even though the has cancelled, we still have a sum over all possible momenta that will diverge since . There are multiple ways of regularising this sum such as using Ramanujan summation (which is common in string theory and would assign ) or lattice regularisation (which defines a largest possible momentum or equivalently a smallest lattice spacing). Whatever way we end up choosing, we can modify our original to avoid this problem as a function of the regulator.
This is our first encounter with regularisation (which makes the problem explicit) and renormalisation (which removes the problem at the level of ).
Another more elegant way to deal with this problem is to backtrace a bit. Fundamentally, our problem arises from the commutator of and which in turn arises from the field operator. In our classical field theory, and were numbers so their order did not matter. Similarly, when we derived the Hamiltonian we wrote because is a number. During quantisation, we translated which lead to the commutator when we decided to move to the right of . The trick to avoid this is to always ensure that is to the right of by using normal ordering. In a normal-ordered expression, which we indicate with colons, the operators are defined such that always comes before when we go from a classical to a quantum theory. For example
| (97) |
instead of the unordered result
| (98) |
Both of these have the same classical limit but are different operators. However, only the normal-ordered one avoids the problem of the if we define as
| (99) |
We can now finally answer the question raised earlier about the interpretation of by using the ladder operators and . Similar to the discussion of the harmonic oscillator, we begin by defining a vacuum state such that destroys it
| (100) |
and it is properly normalised, i.e.
| (101) |
The (normal-ordered) energy of this state is just zero since it is immediately destroyed by . This is the reason that is called the vacuum state and why the problematic contribution of the to we encountered earlier is called vacuum energy.
2.5 Particles
Now that we know what the vacuum is, what happens if we let act upon it? What is the interpretation of the new state
| (102) |
This state is clearly orthogonal
| (103) |
A good and Lorentz-invariant choice of the normalisation is because then
| (104) |
which is Lorentz invariant.
Lorentz invariance and integrals
Now is a good time to look at this term a bit closer. We will very often see integrals of the form
| (105) |
with . Even though it does not appear to be Lorentz-invariant, it is. This means that if is Lorentz-invariant, so is the integral. Because it is so very common, the measure it is often abbreviated as , i.e.
| (106) |
To show this, let us begin by adding an additional integration over which is forced on-shell by a delta function
| (107) |
We can use the identity
| (108) |
with to write
| (109) |
Here we have used that, as long as (as enforced by the Heaviside function), . Expanding the argument and using the definition of
| (110) |
For proper, orthochronous Lorentz transformations, i.e. those that do not change the sign of , this is manifestly Lorentz-invariant.
We can understand what this state is by calculating its energy
| (111) |
and momentum
| (112) |
Suggested Exercise
Show that the total momentum operator can be written as
| (113) |
This means that, as expected, is again an eigenstate of but also that has create an excited state with energy and momentum . Since , it is not unreasonable to say that this is a particle of mass that has been created with momentum . Note that this particle is a completely de-localised plane wave because it has definite momentum . If we instead wanted to create a particle at a given position, we would have to use itself.
Note that we can create multiple particles as well. If we apply and we get
| (114) |
Since the two creation operators commute, . This allows us to conclude that the particles described by the KG equation follow Bose-Einstein statistics, i.e. do not pick up a sign under permutation The spin-statistics theorem, for which no simple proof is known, states that particles following the Bose-Einstein statistics (i.e. bosons) have integer spin while those that follow the Fermi-Dirac statistics (i.e. fermions for which ) have half-integer spin.
2.6 Propagators
The first physics question we can ask of our theory is the amplitude of a particle travelling from to . This is an important point from a causality perspective as well: a particle created at should only be able to reach if the distance between them is time-like, i.e. .
The most naive way of phrasing this question is to create a particle at position and consider the overlap with a particle being created at position , i.e.
| (115) |
It is easy to see that
| (116) |
which is Lorentz-invariant as we saw earlier.
Suggested Exercise
Show this by working through the operator algebra.
However, this is not quite physical. A particle may propagator from to even across space-like separations as long as it does not influence anything outside its future light cone. Therefore we should ask whether the creation at influences the destruction at , i.e. consider the commutator
| (117) |
If , it is possibly to continuously Lorentz transform into meaning that the expectation value vanishes, ensuring causality44 4 This is also true for the commutator itself and does not rely on the vacuum states on either side.. This is not possible for so that events that are within each other’s light cones can influence each other.
(117) is our first example of a vacuum expectation value (vev) which is an object of the form
| (118) |
for some (potentially complicated) operator . The term is more commonly used for the vev of a single field . The only field in nature with a non-zero is the Higgs field which has and is the reason for masses of the and bosons.
Let us study (117) further, now assuming a frame where happens before , i.e. , so that we can cleanly speak of being cause and being effect. Taking the difference of the two terms using (116)
| (119) | ||||
where we have swapped in the second term and modified accordingly. We can use an inverted version of the residue theorem
| (120) |
where the contour is defined as shown in Figure 1. This contour is valid only for as otherwise the exponential would diverge. We therefore write
| (121) |
where we have neatened up the denominator using the definition of . We will call this object the delayed Green’s function55 5 You will find this object often called the retarded Green’s function. This term is quite problematic for a number of reasons, not least of all because it is a very archaic term that is not very descriptive. . A Green’s function is the ‘inverse’ of a differential operator. In our case
| (122) |
It turns out that there is a slightly more useful way of defining which is called the Feynman prescription. Rather than closing the contour above the real axis as in Figure 1, we have the contour weave between the poles as in Figure LABEL:fig:prop:feynman. The result is a mixture of the delayed and advanced Green’s functions
| (123) | ||||
In this Feynman propagator we ensure the order of events using the time-ordering symbol . We are required to place the last event on the left and the first event on the right. We will see soon why this is a good choice.