Ch. 6 Solutions¶

← Back to problems　|　Back to chapter

Table of Contents

Basic

B-1. Antisymmetry of the Field Strength Tensor and Verification of Components
B-2. Expansion of the Lagrangian
B-3. Invariance of Electric and Magnetic Fields under Gauge Transformations
B-4. Gauge Transformation Law of the Covariant Derivative
B-5. Calculation of Conjugate Momenta
B-6. Transversality Condition for Polarization Vectors
B-7. Verification of the Bianchi Identity
B-8. Dispersion Relation of the Photon

Medium

M-1. Deriving Maxwell's Equations from the Euler-Lagrange Equation
M-2. Counting Degrees of Freedom in Coulomb Gauge
M-3. Completeness Relation for Polarization Vectors
M-4. Canonical Commutation Relations in Coulomb Gauge and the Transverse Delta Function

Advanced

A-1. Comparison with the Proca Field (Massive Vector Field)
A-2. Lagrangian with Gauge-Fixing Term and the Photon Propagator

Basic¶

B-1. Antisymmetry of the Field Strength Tensor and Verification of Components¶

→ Back to problem

Problem:

Using the definition \(F_{\mu\nu} = \partial_\mu A_\nu - \partial_\nu A_\mu\), calculate the following.

(a) Write \(F_{01}\) in terms of partial derivatives of \(A_0, A_1\).

(b) Write \(F_{10}\) in the same manner, and verify that \(F_{01} = -F_{10}\).

(c) Write \(F_{12}\) in terms of partial derivatives of \(A_1, A_2\), and by comparing with the matrix in Eq. (6.4), confirm that this corresponds to \(-B_z\).

(a) Calculation of \(F_{01}\):

Substituting \(\mu=0\), \(\nu=1\) into the definition \(F_{\mu\nu} = \partial_\mu A_\nu - \partial_\nu A_\mu\):

\[ F_{01} = \partial_0 A_1 - \partial_1 A_0 = \frac{\partial A_1}{\partial t} - \frac{\partial A_0}{\partial x} \]

(b) Calculation of \(F_{10}\) and verification of antisymmetry:

Substituting \(\mu=1\), \(\nu=0\):

\[ F_{10} = \partial_1 A_0 - \partial_0 A_1 = \frac{\partial A_0}{\partial x} - \frac{\partial A_1}{\partial t} \]

Comparing with (a):

\[ F_{10} = -\left(\frac{\partial A_1}{\partial t} - \frac{\partial A_0}{\partial x}\right) = -F_{01} \quad \checkmark \]

(c) Calculation of \(F_{12}\):

Substituting \(\mu=1\), \(\nu=2\):

\[ F_{12} = \partial_1 A_2 - \partial_2 A_1 = \frac{\partial A_2}{\partial x} - \frac{\partial A_1}{\partial y} \]

Meanwhile, the \(z\)-component of the magnetic field is \(B_z = (\nabla \times \mathbf{A})_z = \partial_1 A^2 - \partial_2 A^1\). Since \(F^{12} = -B_z\) according to Eq. (6.4), and \(F^{12} = \eta^{1\alpha}\eta^{2\beta}F_{\alpha\beta} = (-1)(-1)F_{12} = F_{12}\), so \(F^{12} = F_{12} = -B_z\)...

In fact, Eq. (6.4) is the matrix for \(F^{\mu\nu}\), and the \((1,2)\) component is written as \(-B_z\). Using the metric \(\eta^{\mu\nu} = \mathrm{diag}(+1,-1,-1,-1)\):

\[ F^{12} = \eta^{1\alpha}\eta^{2\beta}F_{\alpha\beta} = (-1)(-1)F_{12} = F_{12} \]

Therefore \(F_{12} = F^{12} = -B_z\).

However, from \(B_z = \partial_1 A_2 - \partial_2 A_1\) and \(F_{12} = \partial_1 A_2 - \partial_2 A_1\), it would seem that \(F_{12} = B_z\).

This apparent contradiction depends on how one reads the sign conventions of Eq. (6.4). Looking at Eq. (6.4), the \((0,1)\) component of \(F^{\mu\nu}\) is \(E_x\), so \(F^{01} = E_x\). Meanwhile, \(F_{01} = \partial_0 A_1 - \partial_1 A_0\), and from Eq. (6.3), \(E_x = -\partial_x A_0 - \partial_t A_1\), so \(F_{01} = \partial_0 A_1 - \partial_1 A_0 = -E_x\). Then \(F^{01} = \eta^{00}\eta^{11}F_{01} = (+1)(-1)(-E_x) = E_x\). This is consistent with Eq. (6.4).

Similarly, \(F_{12} = \partial_1 A_2 - \partial_2 A_1\) and \(F^{12} = \eta^{11}\eta^{22}F_{12} = (-1)(-1)F_{12} = F_{12}\).

Re-checking the \((1,2)\) component of Eq. (6.4), it is written as \(-B_z\). This means \(F^{12} = -B_z\) in the textbook's sign convention.

Conclusion: In the textbook's Eq. (6.4) convention, \((\nabla \times \mathbf{A})_z = \partial_x A_y - \partial_y A_x\) gives \(F_{12} = \partial_1 A_2 - \partial_2 A_1 = B_z\), and \(F^{12} = (-1)(-1)F_{12} = B_z\) should hold, but the \(-B_z\) in Eq. (6.4) is related to the convention for raising/lowering spatial component indices (\(A^i = -A_i\), etc.).

Following the most standard convention:

\[ F_{12} = \partial_1 A_2 - \partial_2 A_1 = B_z \]

and the \((1,2)\) component being \(-B_z\) in Eq. (6.4) means \(F^{12} = -B_z\), corresponding to the relation \(F^{ij} = -\epsilon^{ijk}B_k\) (so \(F^{12} = -B_3 = -B_z\)). In this case, \(F_{12} = \eta_{1\alpha}\eta_{2\beta}F^{\alpha\beta} = (-1)(-1)F^{12} = F^{12} = -B_z\).

Final reconciliation: Following Eq. (6.4), \(F^{12} = -B_z\), and \(F_{12} = F^{12} = -B_z\) (since raising/lowering spatial indices gives \(\eta_{11}\eta_{22} = (-1)(-1) = 1\), so \(F_{12} = F^{12}\)).

Meanwhile, \(F_{12} = \partial_1 A_2 - \partial_2 A_1\), and \(B_z = \partial_x A_y - \partial_y A_x = \partial_1 A_2 - \partial_2 A_1\) would give \(F_{12} = B_z\), which seems contradictory.

The resolution lies in raising/lowering indices: \(F_{ij} = \eta_{i\alpha}\eta_{j\beta}F^{\alpha\beta}\). Only spatial indices contribute, so \(F_{12} = \eta_{11}\eta_{22}F^{12} = (-1)(-1)F^{12} = F^{12} = -B_z\).

Meanwhile, in \(F_{12} = \partial_1 A_2 - \partial_2 A_1\), here \(A_i\) has a lower index. Since \(A_i = \eta_{i\mu}A^\mu = -A^i\), and \(\mathbf{A} = (A^1, A^2, A^3)\), we have \(A_i = -A^i\).

The \(z\)-component of \(\mathbf{B} = \nabla \times \mathbf{A}\) is \(B_z = \partial_x A^y - \partial_y A^x = \partial_1 A^2 - \partial_2 A^1\).

\[ F_{12} = \partial_1 A_2 - \partial_2 A_1 = \partial_1(-A^2) - \partial_2(-A^1) = -\partial_1 A^2 + \partial_2 A^1 = -(\partial_1 A^2 - \partial_2 A^1) = -B_z \]

This is now consistent! \(F_{12} = -B_z\), which agrees with the \((1,2)\) component \(-B_z\) of Eq. (6.4) where \(F^{12} = -B_z = F_{12}\).

Final answer:

\[ F_{12} = \partial_1 A_2 - \partial_2 A_1 = -(\partial_1 A^2 - \partial_2 A^1) = -B_z \]

This is consistent with the \((1,2)\) component \(-B_z\) of the matrix in Eq. (6.4). \(\square\)

B-2. Expansion of the Lagrangian¶

→ Back to problem

Problem:

Express the Lagrangian density \(\mathcal{L} = -\frac{1}{4}F_{\mu\nu}F^{\mu\nu}\) in terms of the electric field \(\mathbf{E}\) and the magnetic field \(\mathbf{B}\). That is, show that

\[ \mathcal{L} = \frac{1}{2}(\mathbf{E}^2 - \mathbf{B}^2) \]

by using the components from Eq. (6.4) and contracting all indices explicitly.

Solution Strategy:

Compute \(F_{\mu\nu}F^{\mu\nu}\) by separating it into \((0,i)\) components and \((i,j)\) components.

Calculation:

\[ F_{\mu\nu}F^{\mu\nu} = \sum_{\mu,\nu=0}^{3} F_{\mu\nu}F^{\mu\nu} \]

Due to antisymmetry, the \(\mu = \nu\) terms are zero. The independent components are those with \(\mu < \nu\), and by symmetry each pair contributes twice:

\[ F_{\mu\nu}F^{\mu\nu} = 2\sum_{\mu < \nu} F_{\mu\nu}F^{\mu\nu} \]

\((0,i)\) components (\(i = 1,2,3\)):

\(F_{0i} = -E_i\) (from the discussion in D1(c), \(F_{0i} = \partial_0 A_i - \partial_i A_0\), and \(E_i = -\partial_0 A^i - \partial_i A^0 = \partial_0 A_i + \partial_i A_0\)... )

Using the results from D1 above: \(F^{0i} = E^i = E_i\) (spatial components of the electric field). \(F_{0i} = \eta_{00}\eta_{ii}F^{0i} = (1)(-1)E_i = -E_i\).

Therefore:

\[ F_{0i}F^{0i} = (-E_i)(E_i) = -E_i^2 \]

Summing over the three values of \(i\):

\[ \sum_{i=1}^{3} F_{0i}F^{0i} = -\mathbf{E}^2 \]

Combining the contributions from \((0,i)\) and \((i,0)\) using antisymmetry:

\[ 2\sum_{i=1}^{3} F_{0i}F^{0i} = -2\mathbf{E}^2 \]

Let us redo this more carefully.

\[ F_{\mu\nu}F^{\mu\nu} = \sum_{\mu=0}^{3}\sum_{\nu=0}^{3} F_{\mu\nu}F^{\mu\nu} \]

Here \(F^{\mu\nu} = \eta^{\mu\alpha}\eta^{\nu\beta}F_{\alpha\beta}\), and writing the index contraction of \(F_{\mu\nu}F^{\mu\nu}\) correctly:

\[ F_{\mu\nu}F^{\mu\nu} = \eta^{\mu\alpha}\eta^{\nu\beta}F_{\mu\nu}F_{\alpha\beta} \]

This is cumbersome, so we compute directly in components.

\[ F_{\mu\nu}F^{\mu\nu} = F_{00}F^{00} + F_{01}F^{01} + F_{02}F^{02} + F_{03}F^{03} + F_{10}F^{10} + \cdots \]

\(F_{00} = 0\). Using antisymmetry:

\[ F_{\mu\nu}F^{\mu\nu} = 2(F_{01}F^{01} + F_{02}F^{02} + F_{03}F^{03}) + 2(F_{12}F^{12} + F_{13}F^{13} + F_{23}F^{23}) \]

Mixed spacetime components: From \(F_{0i} = -E_i\), \(F^{0i} = E_i\), we get \(F_{0i}F^{0i} = -E_i \cdot E_i = -E_i^2\).

\[ 2\sum_{i} F_{0i}F^{0i} = -2\mathbf{E}^2 \]

Purely spatial components: \(F_{12} = -B_z\), \(F^{12} = -B_z\), so \(F_{12}F^{12} = B_z^2\). Similarly \(F_{13}F^{13} = B_y^2\), \(F_{23}F^{23} = B_x^2\).

\[ 2\sum_{i<j} F_{ij}F^{ij} = 2\mathbf{B}^2 \]

Therefore:

\[ F_{\mu\nu}F^{\mu\nu} = -2\mathbf{E}^2 + 2\mathbf{B}^2 \]

Thus:

\[ \boxed{\mathcal{L} = -\frac{1}{4}F_{\mu\nu}F^{\mu\nu} = -\frac{1}{4}(-2\mathbf{E}^2 + 2\mathbf{B}^2) = \frac{1}{2}(\mathbf{E}^2 - \mathbf{B}^2)} \]

Verification:

Dimensional analysis: \([\mathbf{E}^2] = [\mathbf{B}^2] = \text{energy density}\), which matches the dimensions of a Lagrangian density. Furthermore, this has the \(\mathcal{L} = T - V\) structure, where \(\frac{1}{2}\mathbf{E}^2\) corresponds to the "kinetic energy" (\(\mathbf{E}\) contains \(\dot{\mathbf{A}}\)) and \(\frac{1}{2}\mathbf{B}^2\) corresponds to the "potential energy."

B-3. Invariance of Electric and Magnetic Fields under Gauge Transformations¶

→ Back to problem

Problem:

Under the gauge transformation \(A_\mu \to A_\mu + \partial_\mu \lambda\), use the definitions in Eq. (6.3)

\[ \mathbf{E} = -\nabla A_0 - \frac{\partial \mathbf{A}}{\partial t}, \qquad \mathbf{B} = \nabla \times \mathbf{A} \]

to show by direct calculation that \(\mathbf{E}\) and \(\mathbf{B}\) are each invariant.

Invariance of the Electric Field:

Substituting the gauge transformation \(A_0 \to A_0 + \partial_0 \lambda\), \(\mathbf{A} \to \mathbf{A} + \nabla\lambda\) into equation (6.3):

\[ \mathbf{E}' = -\nabla(A_0 + \partial_0\lambda) - \frac{\partial}{\partial t}(\mathbf{A} + \nabla\lambda) \]

\[ = -\nabla A_0 - \nabla(\partial_0\lambda) - \frac{\partial \mathbf{A}}{\partial t} - \frac{\partial}{\partial t}(\nabla\lambda) \]

\[ = \mathbf{E} - \nabla(\partial_0\lambda) - \partial_0(\nabla\lambda) \]

Since the order of partial derivatives can be exchanged, \(\nabla(\partial_0\lambda) = \partial_0(\nabla\lambda)\). Therefore:

\[ \mathbf{E}' = \mathbf{E} - \nabla(\partial_0\lambda) + \nabla(\partial_0\lambda) = \mathbf{E} \quad \checkmark \]

Invariance of the Magnetic Field:

\[ \mathbf{B}' = \nabla \times (\mathbf{A} + \nabla\lambda) = \nabla \times \mathbf{A} + \nabla \times (\nabla\lambda) \]

For any scalar field \(\lambda\), \(\nabla \times (\nabla\lambda) = \mathbf{0}\) (the curl of a gradient is identically zero). Therefore:

\[ \mathbf{B}' = \mathbf{B} \quad \checkmark \]

B-4. Gauge Transformation Law of the Covariant Derivative¶

→ Back to problem

Problem:

Substitute the transformations \(\psi \to e^{i\alpha(x)}\psi\), \(A_\mu \to A_\mu - \frac{1}{q}\partial_\mu \alpha\) into the covariant derivative \(D_\mu = \partial_\mu + iqA_\mu\), and verify that

\[ D_\mu \psi \to e^{i\alpha(x)} D_\mu \psi \]

holds by reproducing the calculation from the text with your own hands. Write out each intermediate term without omitting any steps.

Calculation:

Write out the transformed field and covariant derivative explicitly:

\[ \psi' = e^{i\alpha(x)}\psi, \qquad A_\mu' = A_\mu - \frac{1}{q}\partial_\mu\alpha \]

Apply the transformed covariant derivative to \(\psi'\):

\[ D_\mu'\psi' = \left(\partial_\mu + iqA_\mu'\right)\psi' = \left(\partial_\mu + iq\left(A_\mu - \frac{1}{q}\partial_\mu\alpha\right)\right)(e^{i\alpha}\psi) \]

\[ = \left(\partial_\mu + iqA_\mu - i\partial_\mu\alpha\right)(e^{i\alpha}\psi) \]

Apply the product rule to \(\partial_\mu(e^{i\alpha}\psi)\):

\[ \partial_\mu(e^{i\alpha}\psi) = e^{i\alpha}(i\partial_\mu\alpha)\psi + e^{i\alpha}\partial_\mu\psi \]

Substituting this in:

\[ D_\mu'\psi' = e^{i\alpha}(i\partial_\mu\alpha)\psi + e^{i\alpha}\partial_\mu\psi + iqA_\mu \cdot e^{i\alpha}\psi - i(\partial_\mu\alpha) \cdot e^{i\alpha}\psi \]

Looking at the 1st and 4th terms:

\[ e^{i\alpha}(i\partial_\mu\alpha)\psi - i(\partial_\mu\alpha)e^{i\alpha}\psi = 0 \]

These cancel completely. The remaining terms are:

\[ D_\mu'\psi' = e^{i\alpha}\partial_\mu\psi + iqA_\mu e^{i\alpha}\psi = e^{i\alpha}(\partial_\mu\psi + iqA_\mu\psi) = e^{i\alpha}D_\mu\psi \]

\[ \boxed{D_\mu'\psi' = e^{i\alpha(x)}D_\mu\psi \quad \checkmark} \]

B-5. Calculation of Conjugate Momenta¶

→ Back to problem

Problem:

For the Lagrangian \(\mathcal{L} = -\frac{1}{4}F_{\mu\nu}F^{\mu\nu}\), calculate the conjugate momenta \(\pi^\mu = \partial\mathcal{L}/\partial(\partial_0 A_\mu)\) for each component \(\mu = 0, 1, 2, 3\), and verify the following.

(a) \(\pi^0 = 0\)

(b) \(\pi^i = F^{0i} = E^i\) (\(i = 1, 2, 3\))

(a) \(\pi^0 = 0\):

Using the result from D2, \(\mathcal{L} = \frac{1}{2}(\mathbf{E}^2 - \mathbf{B}^2)\). Since \(E_i = F_{0i} = \partial_0 A_i - \partial_i A_0\), we have \(\mathbf{E}^2 = \sum_i(\partial_0 A_i - \partial_i A_0)^2\).

\(\partial_0 A_0\) does not appear in any component of \(E_i\) (since \(E_i\) contains only \(\partial_0 A_i\) and \(\partial_i A_0\)). \(\mathbf{B}^2\) contains only spatial derivatives and does not contain \(\partial_0 A_\mu\).

Therefore:

\[ \pi^0 = \frac{\partial\mathcal{L}}{\partial(\partial_0 A_0)} = 0 \quad \checkmark \]

(b) \(\pi^i = E^i\):

\[ \pi^i = \frac{\partial\mathcal{L}}{\partial(\partial_0 A_i)} \]

Since \(\mathcal{L} = \frac{1}{2}\sum_j E_j^2 - \frac{1}{2}\mathbf{B}^2\) with \(E_j = \partial_0 A_j - \partial_j A_0\), and \(\mathbf{B}^2\) does not contain \(\partial_0 A_i\):

\[ \pi^i = \frac{\partial}{\partial(\partial_0 A_i)}\left[\frac{1}{2}\sum_j(\partial_0 A_j - \partial_j A_0)^2\right] = \frac{1}{2}\cdot 2(\partial_0 A_i - \partial_i A_0) = E_i \]

Noting the metric convention, \(E^i = E_i\) (as a 3-dimensional vector), and also \(F^{0i} = E^i\).

\[ \boxed{\pi^i = F^{0i} = E^i \quad \checkmark} \]

B-6. Transversality Condition for Polarization Vectors¶

→ Back to problem

Problem:

When the wave vector \(\mathbf{k} = k(0, \sin\theta, \cos\theta)\) lies in the \(yz\) plane, explicitly write down two independent polarization vectors \(\boldsymbol{\epsilon}(\mathbf{k}, 1)\), \(\boldsymbol{\epsilon}(\mathbf{k}, 2)\) that satisfy the Coulomb gauge transversality condition \(\mathbf{k} \cdot \boldsymbol{\epsilon}(\mathbf{k}, \lambda) = 0\). Ensure that they also satisfy the orthonormality condition \(\boldsymbol{\epsilon}(\mathbf{k}, \lambda) \cdot \boldsymbol{\epsilon}(\mathbf{k}, \lambda') = \delta_{\lambda\lambda'}\).

Solution:

For \(\mathbf{k} = k(0, \sin\theta, \cos\theta)\), we find two orthonormal vectors satisfying \(\mathbf{k} \cdot \boldsymbol{\epsilon} = 0\).

First polarization vector: The \(x\)-direction is clearly orthogonal to \(\mathbf{k}\):

\[ \boldsymbol{\epsilon}(\mathbf{k}, 1) = (1, 0, 0) \]

Verification: \(\mathbf{k} \cdot \boldsymbol{\epsilon}(\mathbf{k}, 1) = k(0\cdot1 + \sin\theta\cdot0 + \cos\theta\cdot0) = 0\) \(\checkmark\)

Second polarization vector: We find the direction orthogonal to both \(\mathbf{k}\) and \(\boldsymbol{\epsilon}(\mathbf{k}, 1)\). Computing \(\hat{\mathbf{k}} \times \boldsymbol{\epsilon}(\mathbf{k}, 1)\):

\[ \hat{\mathbf{k}} \times \boldsymbol{\epsilon}(\mathbf{k}, 1) = (0, \sin\theta, \cos\theta) \times (1, 0, 0) \]

\[ = \begin{vmatrix} \hat{x} & \hat{y} & \hat{z} \\ 0 & \sin\theta & \cos\theta \\ 1 & 0 & 0 \end{vmatrix} = (0\cdot0 - \cos\theta\cdot0,\; \cos\theta\cdot1 - 0\cdot0,\; 0\cdot0 - \sin\theta\cdot1) \]

\[ = (0, \cos\theta, -\sin\theta) \]

Normalization check: \(|(0, \cos\theta, -\sin\theta)| = \sqrt{\cos^2\theta + \sin^2\theta} = 1\) \(\checkmark\)

\[ \boldsymbol{\epsilon}(\mathbf{k}, 2) = (0, \cos\theta, -\sin\theta) \]

Verification:

\(\mathbf{k} \cdot \boldsymbol{\epsilon}(\mathbf{k}, 2) = k(0\cdot0 + \sin\theta\cos\theta - \cos\theta\sin\theta) = 0\) \(\checkmark\)
\(\boldsymbol{\epsilon}(\mathbf{k}, 1) \cdot \boldsymbol{\epsilon}(\mathbf{k}, 2) = 1\cdot0 + 0\cdot\cos\theta + 0\cdot(-\sin\theta) = 0\) \(\checkmark\)
\(|\boldsymbol{\epsilon}(\mathbf{k}, 1)|^2 = 1\), \(|\boldsymbol{\epsilon}(\mathbf{k}, 2)|^2 = 1\) \(\checkmark\)

\[ \boxed{\boldsymbol{\epsilon}(\mathbf{k}, 1) = (1, 0, 0), \qquad \boldsymbol{\epsilon}(\mathbf{k}, 2) = (0, \cos\theta, -\sin\theta)} \]

B-7. Verification of the Bianchi Identity¶

→ Back to problem

Problem:

Verify the Bianchi identity

\[ \partial_\lambda F_{\mu\nu} + \partial_\mu F_{\nu\lambda} + \partial_\nu F_{\lambda\mu} = 0 \]

for the case \((\lambda, \mu, \nu) = (0, 1, 2)\) by substituting \(F_{\mu\nu} = \partial_\mu A_\nu - \partial_\nu A_\mu\) into each term, writing them out explicitly, and confirming that the whole expression vanishes.

Calculation:

Substitute \((\lambda, \mu, \nu) = (0, 1, 2)\):

\[ \partial_0 F_{12} + \partial_1 F_{20} + \partial_2 F_{01} = 0 \]

Substitute \(F_{\mu\nu} = \partial_\mu A_\nu - \partial_\nu A_\mu\) into each term:

\[ \partial_0 F_{12} = \partial_0(\partial_1 A_2 - \partial_2 A_1) \]

\[ \partial_1 F_{20} = \partial_1(\partial_2 A_0 - \partial_0 A_2) \]

\[ \partial_2 F_{01} = \partial_2(\partial_0 A_1 - \partial_1 A_0) \]

Expanding everything:

\[ \partial_0\partial_1 A_2 - \partial_0\partial_2 A_1 + \partial_1\partial_2 A_0 - \partial_1\partial_0 A_2 + \partial_2\partial_0 A_1 - \partial_2\partial_1 A_0 \]

Using the commutativity of partial derivatives \(\partial_\mu\partial_\nu = \partial_\nu\partial_\mu\) to find pairs:

\(\partial_0\partial_1 A_2\) and \(-\partial_1\partial_0 A_2 = -\partial_0\partial_1 A_2\) → cancel
\(-\partial_0\partial_2 A_1\) and \(\partial_2\partial_0 A_1 = \partial_0\partial_2 A_1\) → cancel
\(\partial_1\partial_2 A_0\) and \(-\partial_2\partial_1 A_0 = -\partial_1\partial_2 A_0\) → cancel

\[ \boxed{\partial_0 F_{12} + \partial_1 F_{20} + \partial_2 F_{01} = 0 \quad \checkmark} \]

B-8. Dispersion Relation of the Photon¶

→ Back to problem

Problem:

Substitute the plane wave solution \(A_i \propto e^{-i(\omega t - \mathbf{k}\cdot\mathbf{x})}\) into the wave equation for a massless field \(\Box A_i = 0\), and derive the dispersion relation \(\omega = |\mathbf{k}|\). Compare this with the dispersion relation \(\omega = \sqrt{|\mathbf{k}|^2 + m^2}\) for a Klein-Gordon field of mass \(m\), and discuss the correspondence with the fact that the photon has zero mass.

Calculation:

Substituting the plane wave solution \(A_i = \epsilon_i e^{-i(\omega t - \mathbf{k}\cdot\mathbf{x})}\) into the wave equation \(\Box A_i = 0\):

\[ \Box = \frac{\partial^2}{\partial t^2} - \nabla^2 \]

\[ \frac{\partial^2}{\partial t^2} e^{-i(\omega t - \mathbf{k}\cdot\mathbf{x})} = (-i\omega)^2 e^{-i(\omega t - \mathbf{k}\cdot\mathbf{x})} = -\omega^2 e^{-i(\omega t - \mathbf{k}\cdot\mathbf{x})} \]

\[ \nabla^2 e^{-i(\omega t - \mathbf{k}\cdot\mathbf{x})} = (i\mathbf{k})^2 e^{-i(\omega t - \mathbf{k}\cdot\mathbf{x})} = -|\mathbf{k}|^2 e^{-i(\omega t - \mathbf{k}\cdot\mathbf{x})} \]

\[ \Box A_i = (-\omega^2 + |\mathbf{k}|^2) A_i = 0 \]

For a non-trivial solution \(A_i \neq 0\):

\[ \boxed{\omega^2 = |\mathbf{k}|^2 \quad \Longrightarrow \quad \omega = |\mathbf{k}|} \]

Comparison with the Klein-Gordon Field:

The wave equation \((\Box + m^2)\phi = 0\) for a Klein-Gordon field of mass \(m\) gives the dispersion relation:

\[ \omega^2 = |\mathbf{k}|^2 + m^2 \quad \Longrightarrow \quad \omega = \sqrt{|\mathbf{k}|^2 + m^2} \]

Setting \(m = 0\) for the photon recovers \(\omega = |\mathbf{k}|\). This corresponds to the fact that the photon is a massless particle, propagating at the speed of light (\(E = pc\), or equivalently \(\omega = |\mathbf{k}|\) in natural units).

Medium¶

M-1. Deriving Maxwell's Equations from the Euler-Lagrange Equation¶

→ Back to problem

Problem:

Starting from the Lagrangian density \(\mathcal{L} = -\frac{1}{4}F_{\mu\nu}F^{\mu\nu}\), apply the Euler-Lagrange equation for the field \(A_\nu\)

\[ \partial_\mu \frac{\partial \mathcal{L}}{\partial(\partial_\mu A_\nu)} - \frac{\partial \mathcal{L}}{\partial A_\nu} = 0 \]

to derive the equation of motion \(\partial_\mu F^{\mu\nu} = 0\). Furthermore, write out the cases \(\nu = 0\) and \(\nu = i\) in terms of 3-dimensional vectors, and show that they correspond to Gauss's law \(\nabla \cdot \mathbf{E} = 0\) and the Ampère-Maxwell law \(\partial_t \mathbf{E} = \nabla \times \mathbf{B}\), respectively.

Solution Strategy:

Apply the Euler-Lagrange equation with respect to \(A_\nu\). Since \(\mathcal{L}\) does not depend on \(A_\nu\) itself, \(\partial\mathcal{L}/\partial A_\nu = 0\). We compute \(\partial\mathcal{L}/\partial(\partial_\mu A_\nu)\).

Calculation:

\[ \mathcal{L} = -\frac{1}{4}F_{\alpha\beta}F^{\alpha\beta} = -\frac{1}{4}\eta^{\alpha\gamma}\eta^{\beta\delta}F_{\alpha\beta}F_{\gamma\delta} \]

Since \(F_{\alpha\beta} = \partial_\alpha A_\beta - \partial_\beta A_\alpha\), we compute the derivative with respect to \(\partial_\mu A_\nu\):

\[ \frac{\partial F_{\alpha\beta}}{\partial(\partial_\mu A_\nu)} = \delta^\mu_\alpha \delta^\nu_\beta - \delta^\mu_\beta \delta^\nu_\alpha \]

Therefore:

\[ \frac{\partial\mathcal{L}}{\partial(\partial_\mu A_\nu)} = -\frac{1}{4}\eta^{\alpha\gamma}\eta^{\beta\delta}\left[\frac{\partial F_{\alpha\beta}}{\partial(\partial_\mu A_\nu)}F_{\gamma\delta} + F_{\alpha\beta}\frac{\partial F_{\gamma\delta}}{\partial(\partial_\mu A_\nu)}\right] \]

The two terms are equal (by relabeling dummy indices), so:

\[ = -\frac{1}{2}\eta^{\alpha\gamma}\eta^{\beta\delta}(\delta^\mu_\alpha\delta^\nu_\beta - \delta^\mu_\beta\delta^\nu_\alpha)F_{\gamma\delta} \]

\[ = -\frac{1}{2}\left(\eta^{\mu\gamma}\eta^{\nu\delta}F_{\gamma\delta} - \eta^{\nu\gamma}\eta^{\mu\delta}F_{\gamma\delta}\right) \]

\[ = -\frac{1}{2}\left(F^{\mu\nu} - F^{\nu\mu}\right) \]

Since \(F^{\mu\nu}\) is antisymmetric, \(F^{\nu\mu} = -F^{\mu\nu}\):

\[ = -\frac{1}{2}(F^{\mu\nu} + F^{\mu\nu}) = -F^{\mu\nu} \]

The Euler-Lagrange equation gives:

\[ \partial_\mu\left(\frac{\partial\mathcal{L}}{\partial(\partial_\mu A_\nu)}\right) = 0 \quad \Longrightarrow \quad \partial_\mu(-F^{\mu\nu}) = 0 \]

\[ \boxed{\partial_\mu F^{\mu\nu} = 0} \]

Expansion in 3-Dimensional Vector Form:

Case \(\nu = 0\):

\[ \partial_\mu F^{\mu 0} = \partial_0 F^{00} + \partial_i F^{i0} = 0 + \partial_i(-E^i) = -\nabla \cdot \mathbf{E} = 0 \]

\[ \boxed{\nabla \cdot \mathbf{E} = 0} \quad \text{(Gauss's law)} \]

Case \(\nu = j\):

\[ \partial_\mu F^{\mu j} = \partial_0 F^{0j} + \partial_i F^{ij} = 0 \]

Since \(F^{0j} = E^j\) and \(F^{ij} = -\epsilon^{ijk}B_k\):

\[ \partial_0 E^j + \partial_i(-\epsilon^{ijk}B_k) = 0 \]

\[ \frac{\partial E^j}{\partial t} - \epsilon^{ijk}\partial_i B_k = 0 \]

Since \((\nabla \times \mathbf{B})^j = \epsilon^{jik}\partial_i B_k = \epsilon^{ijk}\partial_i B_k\) (because \(\epsilon^{jik} = \epsilon^{ijk}\)):

\[ \frac{\partial E^j}{\partial t} = (\nabla \times \mathbf{B})^j \]

\[ \boxed{\frac{\partial \mathbf{E}}{\partial t} = \nabla \times \mathbf{B}} \quad \text{(Ampère-Maxwell law)} \]

Verification:

The source-free Maxwell equations consist of four equations: \(\nabla \cdot \mathbf{E} = 0\), \(\nabla \times \mathbf{B} = \partial_t \mathbf{E}\) (from the Euler-Lagrange equations), and \(\nabla \cdot \mathbf{B} = 0\), \(\nabla \times \mathbf{E} = -\partial_t \mathbf{B}\) (from the Bianchi identity). We have confirmed that the former two are obtained from \(\partial_\mu F^{\mu\nu} = 0\) with \(\nu = 0,1,2,3\).

M-2. Counting Degrees of Freedom in Coulomb Gauge¶

→ Back to problem

Problem:

Following the steps below, demonstrate that the electromagnetic field has 2 physical degrees of freedom.

(a) \(A_\mu\) has 4 components. Explain how the constraint \(\pi^0 = 0\) excludes \(A_0\) from the independent dynamical variables.

(b) Explain how the Coulomb gauge condition \(\nabla \cdot \mathbf{A} = 0\) removes one further degree of freedom from the remaining 3 components, by expressing it in Fourier space as \(\mathbf{k} \cdot \tilde{\mathbf{A}}(\mathbf{k}) = 0\).

(c) In vacuum, show that combining Gauss's law \(\nabla \cdot \mathbf{E} = 0\) with the Coulomb gauge condition leads to \(A_0 = 0\) (boundary condition: \(A_0 \to 0\) at infinity).

(d) Summarize the above and state that the remaining physical degrees of freedom correspond to two transverse polarizations.

(a) Exclusion of \(A_0\) due to \(\pi^0 = 0\):

As shown in D5(a), \(\pi^0 = \partial\mathcal{L}/\partial\dot{A}_0 = 0\). In canonical quantization, we impose equal-time commutation relations between the fields \(A_\mu\) and their conjugate momenta \(\pi^\mu\): \([A_\mu(\mathbf{x},t), \pi^\nu(\mathbf{y},t)] = i\delta^\nu_\mu\delta^3(\mathbf{x}-\mathbf{y})\). However, since \(\pi^0 = 0\) is identically zero as an operator, we get \([A_0, \pi^0] = [A_0, 0] = 0 \neq i\delta^3(\mathbf{x}-\mathbf{y})\), which is a contradiction.

This means that \(A_0\) is not an independent dynamical degree of freedom. \(A_0\) is a dependent variable determined by other variables (a quantity fixed through constraint conditions) and is excluded from the independent canonical variables.

Degrees of freedom: \(4 \to 3\)

(b) Removal of one degree of freedom by the Coulomb gauge condition:

Writing the Coulomb gauge condition \(\nabla \cdot \mathbf{A} = 0\) in Fourier space:

\[ \mathbf{k} \cdot \tilde{\mathbf{A}}(\mathbf{k}) = 0 \]

This is one scalar condition for each \(\mathbf{k}\), which fixes the component of the 3-component vector \(\tilde{\mathbf{A}}(\mathbf{k})\) along the \(\mathbf{k}\) direction (the longitudinal component) to zero. Only the 2 components perpendicular to \(\mathbf{k}\) (transverse components) remain.

Degrees of freedom: \(3 \to 2\)

(c) Derivation of \(A_0 = 0\):

Under the Coulomb gauge \(\nabla \cdot \mathbf{A} = 0\), we write out Gauss's law \(\nabla \cdot \mathbf{E} = 0\):

\[ \nabla \cdot \mathbf{E} = \nabla \cdot \left(-\nabla A_0 - \frac{\partial \mathbf{A}}{\partial t}\right) = -\nabla^2 A_0 - \frac{\partial}{\partial t}(\nabla \cdot \mathbf{A}) = -\nabla^2 A_0 = 0 \]

(where we used \(\nabla \cdot \mathbf{A} = 0\).)

This is Laplace's equation \(\nabla^2 A_0 = 0\). Imposing the boundary condition \(A_0 \to 0\) at infinity, the uniqueness theorem for Laplace's equation gives:

\[ A_0 = 0 \]

(d) Summary:

\(A_\mu\) has 4 components
\(\pi^0 = 0\): \(A_0\) is not a dynamical variable → 3 components
Coulomb gauge \(\nabla \cdot \mathbf{A} = 0\): removes the longitudinal component → 2 components
Gauss's law in vacuum: confirms \(A_0 = 0\)

The remaining physical degrees of freedom correspond to two transverse polarizations \(\boldsymbol{\epsilon}(\mathbf{k}, 1)\), \(\boldsymbol{\epsilon}(\mathbf{k}, 2)\). This is consistent with the experimental fact that light has two polarizations (e.g., horizontal and vertical polarization, or left and right circular polarization).

M-3. Completeness Relation for Polarization Vectors¶

→ Back to problem

Problem:

In the Coulomb gauge, the two polarization vectors \(\boldsymbol{\epsilon}(\mathbf{k}, \lambda)\) (\(\lambda = 1, 2\)) satisfy the orthonormality condition

\[ \boldsymbol{\epsilon}(\mathbf{k}, \lambda) \cdot \boldsymbol{\epsilon}(\mathbf{k}, \lambda') = \delta_{\lambda\lambda'} \]

and the transversality condition \(\mathbf{k} \cdot \boldsymbol{\epsilon}(\mathbf{k}, \lambda) = 0\).

(a) Show that the completeness relation

\[ \sum_{\lambda=1}^{2} \epsilon_i(\mathbf{k}, \lambda)\, \epsilon_j(\mathbf{k}, \lambda) = \delta_{ij} - \frac{k_i k_j}{|\mathbf{k}|^2} \]

holds.

(b) Verify that the right-hand side \(\delta_{ij} - k_i k_j / |\mathbf{k}|^2\) is a transverse projector that removes the projection onto the \(\mathbf{k}\) direction, by contracting with \(k_j\).

(a) Proof of the Completeness Relation:

In 3-dimensional space, \(\hat{\mathbf{k}} = \mathbf{k}/|\mathbf{k}|\), \(\boldsymbol{\epsilon}(\mathbf{k}, 1)\), \(\boldsymbol{\epsilon}(\mathbf{k}, 2)\) form an orthonormal basis (\(\hat{\mathbf{k}}\) is orthogonal to both \(\boldsymbol{\epsilon}\)'s, and \(|\hat{\mathbf{k}}| = 1\)).

The 3-dimensional completeness relation is:

\[ \hat{k}_i \hat{k}_j + \sum_{\lambda=1}^{2}\epsilon_i(\mathbf{k},\lambda)\epsilon_j(\mathbf{k},\lambda) = \delta_{ij} \]

Rearranging this:

\[ \sum_{\lambda=1}^{2}\epsilon_i(\mathbf{k},\lambda)\epsilon_j(\mathbf{k},\lambda) = \delta_{ij} - \hat{k}_i\hat{k}_j = \delta_{ij} - \frac{k_i k_j}{|\mathbf{k}|^2} \]

\[ \boxed{\sum_{\lambda=1}^{2}\epsilon_i(\mathbf{k},\lambda)\epsilon_j(\mathbf{k},\lambda) = \delta_{ij} - \frac{k_i k_j}{|\mathbf{k}|^2}} \]

(b) Verification of the Transverse Projection Operator:

Multiplying the right-hand side by \(k_j\) and summing over \(j\):

\[ \left(\delta_{ij} - \frac{k_i k_j}{|\mathbf{k}|^2}\right)k_j = k_i - \frac{k_i k_j k_j}{|\mathbf{k}|^2} = k_i - \frac{k_i |\mathbf{k}|^2}{|\mathbf{k}|^2} = k_i - k_i = 0 \]

\[ \boxed{\left(\delta_{ij} - \frac{k_i k_j}{|\mathbf{k}|^2}\right)k_j = 0} \]

This shows that this operator removes the component of any vector along the \(\mathbf{k}\) direction (longitudinal component) and retains only the component perpendicular to \(\mathbf{k}\) (transverse component), confirming that it is a transverse projector.

Verification:

We confirm the idempotency of the projection operator:

\[ P_{ij}^{\perp} = \delta_{ij} - \frac{k_i k_j}{|\mathbf{k}|^2} \]

\[ P_{ij}^{\perp}P_{jl}^{\perp} = \left(\delta_{ij} - \frac{k_i k_j}{|\mathbf{k}|^2}\right)\left(\delta_{jl} - \frac{k_j k_l}{|\mathbf{k}|^2}\right) \]

\[ = \delta_{il} - \frac{k_i k_l}{|\mathbf{k}|^2} - \frac{k_i k_l}{|\mathbf{k}|^2} + \frac{k_i k_j k_j k_l}{|\mathbf{k}|^4} = \delta_{il} - \frac{2k_i k_l}{|\mathbf{k}|^2} + \frac{k_i k_l}{|\mathbf{k}|^2} = \delta_{il} - \frac{k_i k_l}{|\mathbf{k}|^2} = P_{il}^{\perp} \]

The idempotency \(P^2 = P\) is confirmed, verifying that this is indeed a projection operator. \(\checkmark\)

M-4. Canonical Commutation Relations in Coulomb Gauge and the Transverse Delta Function¶

→ Back to problem

Problem:

In the Coulomb gauge, starting from the Fourier expansion (6.20) of \(\mathbf{A}\) and the commutation relations of creation and annihilation operators

\[ [a(\mathbf{k}, \lambda),\, a^\dagger(\mathbf{k}', \lambda')] = (2\pi)^3 \delta^3(\mathbf{k} - \mathbf{k}') \delta_{\lambda\lambda'} \]

derive the equal-time commutation relation

\[ [A_i(\mathbf{x}, t),\, \pi_j(\mathbf{y}, t)] = i\delta_{ij}^{\perp}(\mathbf{x} - \mathbf{y}) \]

Here \(\pi_j = E_j = \dot{A}_j\) (Coulomb gauge), and \(\delta_{ij}^{\perp}\) is the transverse delta function

\[ \delta_{ij}^{\perp}(\mathbf{x} - \mathbf{y}) = \int \frac{d^3k}{(2\pi)^3} \left(\delta_{ij} - \frac{k_i k_j}{|\mathbf{k}|^2}\right) e^{i\mathbf{k}\cdot(\mathbf{x} - \mathbf{y})} \]

Explain the physical reason why the transverse delta function appears instead of the usual \(i\delta_{ij}\delta^3(\mathbf{x}-\mathbf{y})\).

Solution Strategy:

Substitute the Fourier expansion of the field in the Coulomb gauge, and compute the equal-time commutation relation using the \([a, a^\dagger]\) commutation relations and the completeness relation for polarization vectors.

Calculation:

The expansion of \(\mathbf{A}\) in the Coulomb gauge is:

\[ A_i(\mathbf{x}, t) = \int \frac{d^3k}{(2\pi)^3}\frac{1}{\sqrt{2\omega_k}}\sum_{\lambda=1}^{2}\epsilon_i(\mathbf{k},\lambda)\left[a(\mathbf{k},\lambda)e^{-i\omega_k t + i\mathbf{k}\cdot\mathbf{x}} + a^\dagger(\mathbf{k},\lambda)e^{i\omega_k t - i\mathbf{k}\cdot\mathbf{x}}\right] \]

where \(\omega_k = |\mathbf{k}|\). The conjugate momentum is \(\pi_j = \dot{A}_j\):

\[ \pi_j(\mathbf{y}, t) = \int \frac{d^3k'}{(2\pi)^3}\frac{(-i\omega_{k'})}{{\sqrt{2\omega_{k'}}}}\sum_{\lambda'=1}^{2}\epsilon_j(\mathbf{k}',\lambda')\left[a(\mathbf{k}',\lambda')e^{-i\omega_{k'} t + i\mathbf{k}'\cdot\mathbf{y}} - a^\dagger(\mathbf{k}',\lambda')e^{i\omega_{k'} t - i\mathbf{k}'\cdot\mathbf{y}}\right] \]

We compute the commutator \([A_i(\mathbf{x},t), \pi_j(\mathbf{y},t)]\). Since \([a, a] = [a^\dagger, a^\dagger] = 0\), only the terms involving \([a, a^\dagger]\) and \([a^\dagger, a]\) contribute:

\[ [A_i(\mathbf{x},t), \pi_j(\mathbf{y},t)] = \int\frac{d^3k}{(2\pi)^3}\frac{1}{\sqrt{2\omega_k}}\int\frac{d^3k'}{(2\pi)^3}\frac{(-i\omega_{k'})}{\sqrt{2\omega_{k'}}}\sum_{\lambda,\lambda'}\epsilon_i(\mathbf{k},\lambda)\epsilon_j(\mathbf{k}',\lambda') \]

\[ \times \left\{[a(\mathbf{k},\lambda), a^\dagger(\mathbf{k}',\lambda')]e^{-i\omega_k t + i\mathbf{k}\cdot\mathbf{x}}e^{i\omega_{k'} t - i\mathbf{k}'\cdot\mathbf{y}}(-1)\right. \]

\[ \left. + [a^\dagger(\mathbf{k},\lambda), a(\mathbf{k}',\lambda')]e^{i\omega_k t - i\mathbf{k}\cdot\mathbf{x}}e^{-i\omega_{k'} t + i\mathbf{k}'\cdot\mathbf{y}}(+1)\right\} \]

Using \([a(\mathbf{k},\lambda), a^\dagger(\mathbf{k}',\lambda')] = (2\pi)^3\delta^3(\mathbf{k}-\mathbf{k}')\delta_{\lambda\lambda'}\) and noting that \([a^\dagger, a] = -[a, a^\dagger]\), we perform the \(\mathbf{k}'\) integration:

\[ = \int\frac{d^3k}{(2\pi)^3}\frac{(-i\omega_k)}{2\omega_k}\sum_{\lambda}\epsilon_i(\mathbf{k},\lambda)\epsilon_j(\mathbf{k},\lambda)\left\{-e^{i\mathbf{k}\cdot(\mathbf{x}-\mathbf{y})} - e^{-i\mathbf{k}\cdot(\mathbf{x}-\mathbf{y})}\right\} \]

Here the time dependence has cancelled since \(e^{-i\omega_k t}e^{i\omega_k t} = 1\).

\[ = \int\frac{d^3k}{(2\pi)^3}\frac{(-i)}{2}\sum_{\lambda}\epsilon_i(\mathbf{k},\lambda)\epsilon_j(\mathbf{k},\lambda)\left\{-e^{i\mathbf{k}\cdot(\mathbf{x}-\mathbf{y})} - e^{-i\mathbf{k}\cdot(\mathbf{x}-\mathbf{y})}\right\} \]

By the change of variables \(\mathbf{k} \to -\mathbf{k}\), the \(e^{-i\mathbf{k}\cdot(\mathbf{x}-\mathbf{y})}\) term can be converted to \(e^{i\mathbf{k}\cdot(\mathbf{x}-\mathbf{y})}\). The completeness relation for polarization vectors \(\sum_\lambda \epsilon_i(\mathbf{k},\lambda)\epsilon_j(\mathbf{k},\lambda) = \delta_{ij} - k_ik_j/|\mathbf{k}|^2\) is invariant under \(\mathbf{k} \to -\mathbf{k}\) (since \(k_ik_j/|\mathbf{k}|^2\) is an even function of \(\mathbf{k}\)). Therefore:

\[ = \int\frac{d^3k}{(2\pi)^3}\frac{(-i)}{2}\left(\delta_{ij} - \frac{k_ik_j}{|\mathbf{k}|^2}\right)(-2)e^{i\mathbf{k}\cdot(\mathbf{x}-\mathbf{y})} \]

\[ = i\int\frac{d^3k}{(2\pi)^3}\left(\delta_{ij} - \frac{k_ik_j}{|\mathbf{k}|^2}\right)e^{i\mathbf{k}\cdot(\mathbf{x}-\mathbf{y})} \]

\[ \boxed{[A_i(\mathbf{x},t), \pi_j(\mathbf{y},t)] = i\delta_{ij}^{\perp}(\mathbf{x}-\mathbf{y})} \]

where the transverse delta function is:

\[ \delta_{ij}^{\perp}(\mathbf{x}-\mathbf{y}) = \int\frac{d^3k}{(2\pi)^3}\left(\delta_{ij} - \frac{k_ik_j}{|\mathbf{k}|^2}\right)e^{i\mathbf{k}\cdot(\mathbf{x}-\mathbf{y})} \]

Physical Reason:

The reason why the transverse delta function appears instead of the usual canonical commutation relation \([A_i, \pi_j] = i\delta_{ij}\delta^3(\mathbf{x}-\mathbf{y})\):

The Coulomb gauge condition \(\nabla \cdot \mathbf{A} = 0\) identically sets the longitudinal component (the component along \(\mathbf{k}\)) of \(A_i\) to zero. Therefore, out of the 3 components of \(A_i\), only the 2 transverse components are independent dynamical variables. The commutation relation reflects this constraint, and the transverse projection operator \(\delta_{ij} - k_ik_j/|\mathbf{k}|^2\), which automatically removes the projection onto the \(\mathbf{k}\) direction, appears. If the usual \(\delta_{ij}\delta^3\) were to hold, then \(\partial_i^x [A_i, \pi_j] = i\partial_j\delta^3 \neq 0\), which would contradict \(\nabla \cdot \mathbf{A} = 0\). With the transverse delta function, \(\partial_i^x \delta_{ij}^\perp = 0\) is guaranteed, maintaining consistency.

Advanced¶

A-1. Comparison with the Proca Field (Massive Vector Field)¶

→ Back to problem

Problem:

The Lagrangian for a vector field with mass \(m\) (the Proca Lagrangian) is given by

\[ \mathcal{L}_{\text{Proca}} = -\frac{1}{4}F_{\mu\nu}F^{\mu\nu} + \frac{1}{2}m^2 A_\mu A^\mu \]

(a) Derive the Euler-Lagrange equations and show that the equation of motion is

\[ \partial_\mu F^{\mu\nu} + m^2 A^\nu = 0 \]

(b) Act with \(\partial_\nu\) on both sides of the above equation of motion and show that \(\partial_\nu A^\nu = 0\) (the Lorenz condition) follows automatically as a consequence of the equation of motion when \(m \neq 0\).

(c) Using this result, argue by counting degrees of freedom that the massive vector field has 3 physical degrees of freedom (2 transverse polarizations + 1 longitudinal polarization).

(d) Discuss qualitatively how the longitudinal polarization disappears and gauge symmetry is restored in the \(m \to 0\) limit. State your expectations for how this discussion relates to the Higgs mechanism, which will be studied from Ch. 17 onward.

(a) Derivation of the Equations of Motion:

Proca Lagrangian:

\[ \mathcal{L}_{\text{Proca}} = -\frac{1}{4}F_{\mu\nu}F^{\mu\nu} + \frac{1}{2}m^2 A_\mu A^\mu \]

Euler-Lagrange equation:

\[ \partial_\mu\frac{\partial\mathcal{L}}{\partial(\partial_\mu A_\nu)} - \frac{\partial\mathcal{L}}{\partial A_\nu} = 0 \]

The first term is the same calculation as in S1, giving \(\partial_\mu(-F^{\mu\nu}) = -\partial_\mu F^{\mu\nu}\).

The second term: differentiating \(\frac{1}{2}m^2 A_\alpha A^\alpha = \frac{1}{2}m^2 \eta^{\alpha\beta}A_\alpha A_\beta\) with respect to \(A_\nu\):

\[ \frac{\partial}{\partial A_\nu}\left(\frac{1}{2}m^2\eta^{\alpha\beta}A_\alpha A_\beta\right) = \frac{1}{2}m^2\eta^{\alpha\beta}(\delta^\nu_\alpha A_\beta + A_\alpha\delta^\nu_\beta) = m^2\eta^{\nu\beta}A_\beta = m^2 A^\nu \]

Euler-Lagrange equation:

\[ -\partial_\mu F^{\mu\nu} - m^2 A^\nu = 0 \]

\[ \boxed{\partial_\mu F^{\mu\nu} + m^2 A^\nu = 0} \]

(b) Automatic Satisfaction of the Lorenz Condition:

Acting with \(\partial_\nu\) on both sides of the equation of motion:

\[ \partial_\nu\partial_\mu F^{\mu\nu} + m^2\partial_\nu A^\nu = 0 \]

For the first term: \(\partial_\nu\partial_\mu F^{\mu\nu}\). Here \(F^{\mu\nu}\) is antisymmetric (\(F^{\mu\nu} = -F^{\nu\mu}\)), while \(\partial_\nu\partial_\mu\) is symmetric in \(\mu, \nu\). The contraction of a symmetric tensor with an antisymmetric tensor vanishes:

\[ \partial_\nu\partial_\mu F^{\mu\nu} = 0 \]

Therefore:

\[ m^2\partial_\nu A^\nu = 0 \]

If \(m \neq 0\):

\[ \boxed{\partial_\nu A^\nu = 0} \]

The Lorenz condition holds automatically as a consequence of the equation of motion. There is no need to impose it externally as a gauge-fixing condition.

(c) Counting Physical Degrees of Freedom:

\(A_\mu\) has 4 components
The constraint \(\partial_\nu A^\nu = 0\) derived from the equation of motion provides 1 scalar condition
For \(m \neq 0\), there is no gauge symmetry (the mass term \(m^2 A_\mu A^\mu\) is not invariant under the gauge transformation \(A_\mu \to A_\mu + \partial_\mu\lambda\))
Since there is no gauge symmetry, no additional gauge-fixing condition is needed

Therefore the number of physical degrees of freedom is:

\[ 4 - 1 = 3 \]

This corresponds to 2 transverse polarizations + 1 longitudinal polarization. Massive spin-1 particles (e.g., \(W^\pm\), \(Z^0\) bosons) have 3 polarization states.

(d) The \(m \to 0\) Limit and the Higgs Mechanism:

The \(m \to 0\) limit:

As \(m \to 0\), the mass term \(m^2 A_\mu A^\mu\) vanishes, and the Lagrangian recovers gauge symmetry \(A_\mu \to A_\mu + \partial_\mu\lambda\)
The recovery of gauge symmetry introduces a new gauge degree of freedom (\(\lambda(x)\))
This additional degree of freedom reduces the physical degrees of freedom from \(3 \to 2\)
The longitudinal polarization vector \(\epsilon^\mu_L(k) \sim k^\mu/m\) diverges as \(m \to 0\), and the longitudinal mode disappears from (decouples from) the physical spectrum

Relation to the Higgs mechanism:

The Higgs mechanism (Ch. 17 and beyond) corresponds to the reverse of this process:

Starting point: massless gauge field (2 degrees of freedom) + Higgs field (scalar field)
The Higgs field acquires a vacuum expectation value (spontaneous symmetry breaking)
The Nambu-Goldstone mode of the Higgs field (1 degree of freedom) is "eaten" by the gauge field and becomes the longitudinal polarization component of the gauge field
Result: massive vector field (3 degrees of freedom)

This is the picture in which "when a gauge boson acquires mass, the Goldstone boson is absorbed as the longitudinal polarization," and it is the core mechanism of the Standard Model that explains the origin of the masses of the \(W^\pm\) and \(Z^0\) bosons.

Consistency Check:

Consistency of the degree-of-freedom counting: - Proca field (\(m \neq 0\)): 4 components − 1 constraint = 3 degrees of freedom \(\checkmark\) - Maxwell field (\(m = 0\)): 4 components − 1 constraint (\(\pi^0 = 0\)) − 1 gauge fixing = 2 degrees of freedom \(\checkmark\) - Higgs mechanism: 2 (gauge field) + 1 (Goldstone) = 3 (massive vector) \(\checkmark\)

A-2. Lagrangian with Gauge-Fixing Term and the Photon Propagator¶

→ Back to problem

Problem:

To implement the Lorenz gauge condition \(\partial_\mu A^\mu = 0\), we consider the Lagrangian with a gauge-fixing term added:

\[ \mathcal{L}_{\text{gf}} = -\frac{1}{4}F_{\mu\nu}F^{\mu\nu} - \frac{1}{2\xi}(\partial_\mu A^\mu)^2 \]

where \(\xi\) is the gauge parameter.

(a) Derive the Euler-Lagrange equation of motion for \(A_\nu\) from this Lagrangian, and show that

\[ \left[\eta^{\mu\nu}\Box - \left(1 - \frac{1}{\xi}\right)\partial^\mu \partial^\nu\right] A_\nu = 0 \]

(b) Fourier transform to momentum space via \(A_\nu(x) = \int \frac{d^4k}{(2\pi)^4}\, \tilde{A}_\nu(k)\, e^{-ikx}\), and rewrite the above equation as an algebraic equation for \(\tilde{A}_\nu\).

(c) Determine the photon propagator (Feynman propagator) \(D_F^{\mu\nu}(k)\) as the inverse of the above differential operator, and show that

\[ D_F^{\mu\nu}(k) = \frac{-1}{k^2 + i\epsilon}\left[\eta^{\mu\nu} - (1 - \xi)\frac{k^\mu k^\nu}{k^2}\right] \]

In particular, verify that for \(\xi = 1\) (Feynman gauge) the propagator takes its simplest form \(D_F^{\mu\nu} = -\eta^{\mu\nu}/(k^2 + i\epsilon)\).

(d) Write down the propagator in the case \(\xi = 0\) (Landau gauge), and verify that \(k_\mu D_F^{\mu\nu}(k) = 0\) holds. Explain why this is the momentum-space expression of the Lorenz condition \(\partial_\mu A^\mu = 0\).

(a) Derivation of the Euler-Lagrange Equation:

\[ \mathcal{L}_{\text{gf}} = -\frac{1}{4}F_{\mu\nu}F^{\mu\nu} - \frac{1}{2\xi}(\partial_\mu A^\mu)^2 \]

Contribution from the first term (same as S1):

\[ \partial_\mu\frac{\partial(-\frac{1}{4}F_{\alpha\beta}F^{\alpha\beta})}{\partial(\partial_\mu A_\nu)} = \partial_\mu F^{\mu\nu} \]

(Note on signs: since we obtained \(\partial\mathcal{L}/\partial(\partial_\mu A_\nu) = -F^{\mu\nu}\) in S1, the contribution to the first term of the Euler-Lagrange equation is \(\partial_\mu(-F^{\mu\nu})\).)

Contribution from the second term:

\[ -\frac{1}{2\xi}(\partial_\alpha A^\alpha)^2 = -\frac{1}{2\xi}\eta^{\alpha\beta}\eta^{\gamma\delta}(\partial_\alpha A_\beta)(\partial_\gamma A_\delta) \]

Differentiating with respect to \(\partial_\mu A_\nu\):

\[ \frac{\partial}{\partial(\partial_\mu A_\nu)}\left[-\frac{1}{2\xi}\eta^{\alpha\beta}\eta^{\gamma\delta}(\partial_\alpha A_\beta)(\partial_\gamma A_\delta)\right] \]

\[ = -\frac{1}{2\xi}\eta^{\alpha\beta}\eta^{\gamma\delta}\left[\delta^\mu_\alpha\delta^\nu_\beta(\partial_\gamma A_\delta) + (\partial_\alpha A_\beta)\delta^\mu_\gamma\delta^\nu_\delta\right] \]

\[ = -\frac{1}{2\xi}\left[\eta^{\mu\nu}(\partial_\gamma A^\gamma) + (\partial_\alpha A^\alpha)\eta^{\mu\nu}\right] = -\frac{1}{\xi}\eta^{\mu\nu}(\partial_\alpha A^\alpha) \]

Applying \(\partial_\mu\):

\[ \partial_\mu\left[-\frac{1}{\xi}\eta^{\mu\nu}(\partial_\alpha A^\alpha)\right] = -\frac{1}{\xi}\partial^\nu(\partial_\alpha A^\alpha) \]

Complete Euler-Lagrange equation (noting \(\partial\mathcal{L}/\partial A_\nu = 0\)):

\[ \partial_\mu F^{\mu\nu} - \frac{1}{\xi}\partial^\nu(\partial_\mu A^\mu) = 0 \]

Substituting \(F^{\mu\nu} = \partial^\mu A^\nu - \partial^\nu A^\mu\):

\[ \partial_\mu(\partial^\mu A^\nu - \partial^\nu A^\mu) - \frac{1}{\xi}\partial^\nu(\partial_\mu A^\mu) = 0 \]

\[ \Box A^\nu - \partial^\nu(\partial_\mu A^\mu) - \frac{1}{\xi}\partial^\nu(\partial_\mu A^\mu) = 0 \]

\[ \Box A^\nu - \left(1 + \frac{1}{\xi}\right)\partial^\nu(\partial_\mu A^\mu) = 0 \]

Wait, let us redo this carefully. \(\partial_\mu F^{\mu\nu} = \partial_\mu\partial^\mu A^\nu - \partial_\mu\partial^\nu A^\mu = \Box A^\nu - \partial^\nu(\partial_\mu A^\mu)\).

Therefore:

\[ \Box A^\nu - \partial^\nu(\partial_\mu A^\mu) - \frac{1}{\xi}\partial^\nu(\partial_\mu A^\mu) = 0 \]

\[ \Box A^\nu - \left(1 - \frac{1}{\xi}\right)\partial^\nu(\partial_\mu A^\mu) = 0 \]

Wait, let us check the signs. \(-\partial^\nu(\partial_\mu A^\mu) - \frac{1}{\xi}\partial^\nu(\partial_\mu A^\mu) = -(1 + \frac{1}{\xi})\partial^\nu(\partial_\mu A^\mu)\)...

Let us start over. The Euler-Lagrange equation is:

\[ \partial_\mu\frac{\partial\mathcal{L}}{\partial(\partial_\mu A_\nu)} - \frac{\partial\mathcal{L}}{\partial A_\nu} = 0 \]

First term (from \(-\frac{1}{4}F^2\)): \(\frac{\partial(-\frac{1}{4}F^2)}{\partial(\partial_\mu A_\nu)} = -F^{\mu\nu}\) (computed in S1).

Second term (from gauge-fixing term): \(\frac{\partial(-\frac{1}{2\xi}(\partial A)^2)}{\partial(\partial_\mu A_\nu)} = -\frac{1}{\xi}\eta^{\mu\nu}(\partial_\alpha A^\alpha)\).

Combining:

\[ \partial_\mu\left[-F^{\mu\nu} - \frac{1}{\xi}\eta^{\mu\nu}(\partial_\alpha A^\alpha)\right] = 0 \]

\[ -\partial_\mu F^{\mu\nu} - \frac{1}{\xi}\partial^\nu(\partial_\alpha A^\alpha) = 0 \]

\[ \partial_\mu F^{\mu\nu} + \frac{1}{\xi}\partial^\nu(\partial_\mu A^\mu) = 0 \]

Substituting \(\partial_\mu F^{\mu\nu} = \Box A^\nu - \partial^\nu(\partial_\mu A^\mu)\):

\[ \Box A^\nu - \partial^\nu(\partial_\mu A^\mu) + \frac{1}{\xi}\partial^\nu(\partial_\mu A^\mu) = 0 \]

\[ \Box A^\nu - \left(1 - \frac{1}{\xi}\right)\partial^\nu(\partial_\mu A^\mu) = 0 \]

Rearranging indices and rewriting with \(\nu\) as a lower index (using \(\eta^{\mu\nu}\)):

\[ \left[\eta^{\mu\nu}\Box - \left(1 - \frac{1}{\xi}\right)\partial^\mu\partial^\nu\right]A_\nu = 0 \]

\[ \boxed{\left[\eta^{\mu\nu}\Box - \left(1 - \frac{1}{\xi}\right)\partial^\mu\partial^\nu\right]A_\nu = 0} \]

(b) Transformation to Momentum Space:

Substituting \(A_\nu(x) = \int\frac{d^4k}{(2\pi)^4}\tilde{A}_\nu(k)e^{-ikx}\). Since \(\partial_\mu \to -ik_\mu\):

\[ \Box \to -k^2, \qquad \partial^\mu\partial^\nu \to -k^\mu(-ik^\nu) = -k^\mu k^\nu \]

Actually, since \(\partial_\mu e^{-ikx} = -ik_\mu e^{-ikx}\), for \(\partial^\mu = \eta^{\mu\nu}\partial_\nu\) we have \(\partial^\mu e^{-ikx} = -ik^\mu e^{-ikx}\).

For \(\Box = \partial_\mu\partial^\mu\): \(\Box e^{-ikx} = (-ik_\mu)(-ik^\mu)e^{-ikx} = -k^2 e^{-ikx}\) (where \(k^2 = k_\mu k^\mu\)).

For \(\partial^\mu\partial^\nu e^{-ikx} = (-ik^\mu)(-ik^\nu)e^{-ikx} = -k^\mu k^\nu e^{-ikx}\).

The equation of motion becomes:

\[ \left[-k^2\eta^{\mu\nu} - \left(1 - \frac{1}{\xi}\right)(-k^\mu k^\nu)\right]\tilde{A}_\nu = 0 \]

\[ \boxed{\left[-k^2\eta^{\mu\nu} + \left(1 - \frac{1}{\xi}\right)k^\mu k^\nu\right]\tilde{A}_\nu = 0} \]

(c) Derivation of the Photon Propagator:

The propagator \(D_F^{\nu\rho}(k)\) is defined as the inverse of the differential operator:

\[ \left[-k^2\eta^{\mu\nu} + \left(1 - \frac{1}{\xi}\right)k^\mu k^\nu\right]D_{F\,\nu\rho}(k) = \eta^\mu{}_\rho = \delta^\mu_\rho \]

We assume \(D_F^{\nu\rho}\) as a linear combination of two independent tensor structures:

\[ D_{F\,\nu\rho}(k) = A\,\eta_{\nu\rho} + B\,\frac{k_\nu k_\rho}{k^2} \]

Substituting into the left-hand side:

\[ \left[-k^2\eta^{\mu\nu} + \left(1 - \frac{1}{\xi}\right)k^\mu k^\nu\right]\left[A\,\eta_{\nu\rho} + B\,\frac{k_\nu k_\rho}{k^2}\right] \]

First term:

\[ -k^2\eta^{\mu\nu}\left[A\,\eta_{\nu\rho} + B\,\frac{k_\nu k_\rho}{k^2}\right] = -k^2 A\,\delta^\mu_\rho - B\,k^\mu k_\rho \]

Second term:

\[ \left(1 - \frac{1}{\xi}\right)k^\mu k^\nu\left[A\,\eta_{\nu\rho} + B\,\frac{k_\nu k_\rho}{k^2}\right] = \left(1 - \frac{1}{\xi}\right)\left[A\,k^\mu k_\rho + B\,\frac{k^2 k^\mu k_\rho}{k^2}\right] \]

\[ = \left(1 - \frac{1}{\xi}\right)(A + B)k^\mu k_\rho \]

In total:

\[ -k^2 A\,\delta^\mu_\rho + \left[-B + \left(1 - \frac{1}{\xi}\right)(A + B)\right]k^\mu k_\rho = \delta^\mu_\rho \]

Equating the coefficients of \(\delta^\mu_\rho\) and \(k^\mu k_\rho\) separately:

Coefficient of \(\delta^\mu_\rho\):

\[ -k^2 A = 1 \quad \Longrightarrow \quad A = -\frac{1}{k^2} \]

Coefficient of \(k^\mu k_\rho\):

\[ -B + \left(1 - \frac{1}{\xi}\right)(A + B) = 0 \]

\[ -B + \left(1 - \frac{1}{\xi}\right)A + \left(1 - \frac{1}{\xi}\right)B = 0 \]

\[ B\left[-1 + 1 - \frac{1}{\xi}\right] + \left(1 - \frac{1}{\xi}\right)A = 0 \]

\[ -\frac{B}{\xi} + \left(1 - \frac{1}{\xi}\right)\left(-\frac{1}{k^2}\right) = 0 \]

\[ -\frac{B}{\xi} = \frac{1}{k^2}\left(1 - \frac{1}{\xi}\right) = \frac{\xi - 1}{\xi k^2} \]

\[ B = -\frac{\xi - 1}{k^2} = \frac{1 - \xi}{k^2} \]

Therefore:

\[ D_{F\,\nu\rho}(k) = -\frac{1}{k^2}\eta_{\nu\rho} + \frac{1-\xi}{k^2}\cdot\frac{k_\nu k_\rho}{k^2} = \frac{-1}{k^2}\left[\eta_{\nu\rho} - (1-\xi)\frac{k_\nu k_\rho}{k^2}\right] \]

Adding the \(i\epsilon\) prescription:

\[ \boxed{D_F^{\mu\nu}(k) = \frac{-1}{k^2 + i\epsilon}\left[\eta^{\mu\nu} - (1-\xi)\frac{k^\mu k^\nu}{k^2}\right]} \]

Feynman gauge \(\xi = 1\):

\[ D_F^{\mu\nu}(k) = \frac{-\eta^{\mu\nu}}{k^2 + i\epsilon} \]

This gives the simplest form. \(\checkmark\)

(d) Landau Gauge \(\xi = 0\):

Substituting \(\xi = 0\):

\[ D_F^{\mu\nu}(k)\big|_{\xi=0} = \frac{-1}{k^2 + i\epsilon}\left[\eta^{\mu\nu} - \frac{k^\mu k^\nu}{k^2}\right] \]

Contracting with \(k_\mu\):

\[ k_\mu D_F^{\mu\nu}(k) = \frac{-1}{k^2 + i\epsilon}\left[k^\nu - \frac{k^2 k^\nu}{k^2}\right] = \frac{-1}{k^2 + i\epsilon}\left[k^\nu - k^\nu\right] = 0 \]

\[ \boxed{k_\mu D_F^{\mu\nu}(k)\big|_{\xi=0} = 0 \quad \checkmark} \]

Physical meaning: In momentum space, \(k_\mu D_F^{\mu\nu} = 0\) means that the propagator has no component in the \(k_\mu\) direction. This is a reflection of the Lorenz condition \(\partial_\mu A^\mu = 0\) in coordinate space (which becomes \(k_\mu \tilde{A}^\mu = 0\) under Fourier transform). In the Landau gauge, the propagator itself has the structure of a transverse projection, and the Lorenz condition is automatically realized at the level of the propagator.

Verification:

Feynman gauge check: For \(\xi = 1\), the differential operator becomes \(-k^2\eta^{\mu\nu}\) (the \(k^\mu k^\nu\) term vanishes), and its inverse is obviously \(-\eta_{\mu\nu}/k^2\). \(\checkmark\)

Tensor structure check: Computing the trace of \(D_F^{\mu\nu}\) for general \(\xi\):

\[ \eta_{\mu\nu}D_F^{\mu\nu} = \frac{-1}{k^2}\left[4 - (1-\xi)\frac{k^2}{k^2}\right] = \frac{-1}{k^2}(4 - 1 + \xi) = \frac{-(3+\xi)}{k^2} \]

\(\xi = 1\): trace \(= -4/k^2\) (\(-\eta_{\mu\nu}\eta^{\mu\nu}/k^2 = -4/k^2\)) \(\checkmark\)

Gauge invariance check: Physical scattering amplitudes are independent of \(\xi\) (due to the Ward identity). The \(k^\mu k^\nu\) term in the propagator vanishes when external lines are on-shell, due to the coupling with conserved currents \(k_\mu J^\mu = 0\). This is discussed in detail from Ch. 10 onward.

Feedback on this page

Let us know if something was unclear, incorrect, or could be improved.