Skip to article frontmatterSkip to article content

54-Momentum & E=mc²

5.1Proper time

We have seen that in Special Relativity events are described by four coordinates: (ct,x,y,z)(ct,x,y,z). Moreover, distance is measured via a inner product AμBμ=A0B0A1B1A2B2A3B3A^\mu \cdot B^\mu = A^0B^0 - A^1B^1 - A^2B^2 - A^3 B^3. That opens the question: what about other quantities that we use in mechanics.

If position is Xμ=(ct,x,y,z)X^\mu = (ct,x,y,z) then what is velocity? Is vμdXμdtv^\mu \equiv \frac{dX^\mu}{dt} a good choice? It is what we are used to: velocity is change in position over time. However, we need to be careful. We require that our quantities are four-vectors, i.e. they transform according to the Lorentz Transformation. And the length, i.e. the inner product with itself, is the same for all inertial observes.

However, in our our first choice of the definition, we take the derivative with respect to time. But time is not the same for different observers!

We do know that the distance ds2ds^2 is LT invariant, as is c2c^2, therefore we can combine both into another invariant - of time

dτ2ds2c2d\tau^2 \equiv \frac{ds^2}{c^2}

If we spell out ds2ds^2 we can write

dτ2=ds2c2=dt21c2(dx2+dy2+dz2)d\tau^2 = \frac{ds^2}{c^2} = dt^2-\frac{1}{c^2}(dx^2+dy^2+dz^2)

dτd\tau is called proper time or Eigenzeit because for the rest frame SS' we have (dx=dy=dz=0)(dx'=dy'=dz'=0) and thus

dτ2=dt2d\tau^2 = dt'^2

We associate to a moving particle the 3-velocity u=(ux,uy,uz)=(dxdt,dydt,dzdt)\vec{u}=(u_x,u_y,u_z)=(\frac{dx}{dt}, \frac{dy}{dt}, \frac{dz}{dt}). This is the velocity that we normally use: it is distance as measured in our frame of reference over time as we see on our clocks. We can relate the proper time dτd\tau to the frame/coordinate time dtdt:

dτ2=dt21c2(dx2+dy2+dz2)=dt2[11c2((dxdt)2+(dydt)2+(dzdt)2)]\begin{split} d\tau^2 &= dt^2-\frac{1}{c^2}(dx^2+dy^2+dz^2)\\ &=dt^2\left [ 1-\frac{1}{c^2} \left ( \left (\frac{dx}{dt} \right )^2+ \left (\frac{dy}{dt} \right )^2+ \left (\frac{dz}{dt} \right )^2 \right) \right ] \end{split}

Here we use the magnitude of the 3-velocity uu. In other words

dτ2dt2=1u2c2dt=γ(u)dτ\frac{d\tau^2}{dt^2}=1-\frac{u^2}{c^2} \Rightarrow dt = \gamma (u) d\tau

The proper time interval relates to the frame time via the gammagamma-factor for the velocity uu.

5.24-velocity

Now we can tackle the 4-velocity. In order to make any sense we must define a velocity whose length is an invariant. Furthermore, velocity must be something like displacement over time interval. For the displacement the obvious choice is: dXμdX^\mu, i.e. a particle has moved from XμX^\mu to Xμ+dXμX^\mu + dX^\mu. The displacement dXμdX^\mu transforms, of course, via the Lorentz Transformation. Moreover, its length is a Lorentz Invariant. In order to arrive at an adequate velocity, we must thus divide the displacement by a time interval that is also a Lorentz Invariant. Luckily, we have just seen that proper time is a Lorentz Invariant.

Therefore the 4-velocity U\vec{U} is

UμdXμdτU^\mu \equiv \frac{dX^\mu}{d\tau}

where the derivative of the 4-position vector is taken with respect to the proper time τ\tau. We obtain the relation to the 3-velocity u\vec{u} just from filling in dτ=dt/γ(u)d\tau = dt/\gamma(u)

Uμ=γ(u)(dctdt,dxdt,dydt,dzdt)=(γ(u)c,γ(u)u)U^\mu = \gamma (u) \left ( \frac{dct}{dt}, \frac{dx}{dt}, \frac{dy}{dt}, \frac{dz}{dt} \right ) = (\gamma (u) c, \gamma (u)\vec{u})

4-velocity transfers between frames moving with speed VV as given by the Lorentz transformation as U\vec{U} is a 4-vector.

5.2.1Be careful with 4-vector interpretation

We compute the inner product of U\vec{U} with itself U2=γ2(u)(c2u2)U^2 = \gamma^2(u) (c^2-u^2). That is a LT invariant of course. Therefore we can choose the frame such that u=0u=0, or in other words U2=c2U^2=c^2. The 4-velocity length is constant! That is not intuitive at all. Even stranger as the vector has constant length, it follows that the 4-velocity is always perpendicular to the 4-acceleration.

ddτU2=2UddτU=0\frac{d}{d\tau}U^2 = 2\vec{U}\cdot \frac{d}{d\tau}\vec{U}=0

The counter intuitive stuff happens of course due to the pseudo-Euclidean metric.

5.2.2Revisit 3-velocity transformation

Earlier we transformed the velocity uu of a particle in SS to SS' which was moving with VV. This was quite complicated and the formula is difficult to remember. However, there is no need to remember the formula, you can always derive it from the transformation of the 4-velocity.

For the 4-velocity U=(γ(u)c,γ(u)u)\vec{U}=(\gamma(u)c,\gamma(u)\vec{u}) we can write down the LT of a 4-vector between SS and SS'.

γ(u)c=γ(V)(γ(u)cVcγ(u)ux)γ(u)ux=γ(V)(γ(u)uxVcγ(u)c)γ(u)uy=γ(u)uyγ(u)uz=γ(u)uz\begin{array}{rcl} \gamma(u') c &=& \gamma(V) \left ( \gamma(u)c - \frac{V}{c}\gamma(u)u_x\right )\\ \gamma(u') u'_x &=& \gamma(V) \left ( \gamma(u)u_x - \frac{V}{c}\gamma(u)c\right )\\ \gamma(u') u'_y &=& \gamma(u)u_y\\ \gamma(u') u'_z &=& \gamma(u)u_z \end{array}

If we now divide the second of these equations by the first we obtain

uxc=uxcVc1Vuxc2\frac{u'_x}{c} = \frac{\frac{u_x}{c}-\frac{V}{c}}{1-\frac{Vu_x}{c^2}}

and if we divide the third of these equations by the first we obtain

uyc=uycγ(V)(1Vuxc2)\frac{u'_y}{c} =\frac{\frac{u_y}{c}}{\gamma(V) \left( 1-\frac{Vu_x}{c^2}\right )}

Just what we have derived before, but now in a way that you can always do this on the spot if you know the definition of the 4-velocity and the LT of a 4-vector.

5.34-momentum

If we postulate that the mass mm is LT invariant we can define the 4-momentum simply by

P=mU=(mγ(u)c,mγ(u)u)(P0,p)\vec{P} = m\vec{U} = (m\gamma (u)c,m\gamma(u)\vec{u})\equiv (P^0,\vec{p})

with the 3-momentum p=mγ(u)u=mdxdτ\vec{p}=m\gamma(u)\vec{u}=m\frac{d\vec{x}}{d\tau}.

5.3.1Conservation of 4-momentum

For collisions now the total 4-momentum is conserved (per component)

i,beforePi=j,afterPj\sum_{i,before} \vec{P}_i = \sum_{j,after} \vec{P}_j

If the total momentum is conserved than this must hold for the components (mγ(u)c,p)(m\gamma (u)c,\vec{p}).

Note, that we did not write “mass is conserved”. We postulate that it is a LT invariant, that is: it is the same for all inertial observers. But that does not imply that for collisions the mass should equal before and after the collision.

5.4E=mc²

The most famous equation in physics.

We will derive it by looking at N2 in its relativistic form.

F=dpdt=ddt(mγ(u)u)=mdudτ\vec{F} = \frac{d\vec{p}}{dt} = \frac{d}{dt}(m\gamma(u)\vec{u}) = m\frac{d\vec{u}}{d\tau}

Kinetic energy was defined as work done on a mass. We again start from that and fill in N2 and take it step by step

ΔEkin=12Fdr=12Fudt=12ddt(mγ(u)u)udt=m0u~udγ(u)u\begin{array}{rcl} \Delta E_{kin} &=& \displaystyle{\int_1^2 \vec{F}\cdot d\vec{r} = \int_1^2 \vec{F}\cdot \vec{u}dt} \\ &=& \displaystyle{\int_1^2 \frac{d}{dt}(m\gamma(u)\vec{u})\cdot \vec{u}dt}\\ &=& \displaystyle{m\int_0^{\tilde{u}} \vec{u}\cdot d\gamma(u)\vec{u}} \end{array}

This integration is more difficult than what we had before as the γ(u)\gamma(u) factor appears additional in the differential (for small velocities we have γ(u)=1\gamma(u)=1 and we just get 12mu2\frac{1}{2}mu^2 as before). Now we apply integration by parts

ΔEkin=m[uγ(u)u]0u~m0u~γ(u)udu=mγ(u~)u~2m0u~udu1u2c2=mγ(u~)u~2m0u~12du21u2c2=mγ(u~)u~2mc2[1u2c2]0u~=mγ(u~)u~2mc2(1u~2c2+1)=mγ(u~)u~2+mc2γ(u~)mc2=mc2+mc2γ(u~)(u~2c2+1u~2c2)=mc2(γ(u~)1)\begin{array}{rcl} \Delta E_{kin} &=& \displaystyle{m[\vec{u}\cdot \gamma(u)\vec{u}]_0^{\tilde{u}} - m\int_0^{\tilde{u}} \gamma(u)\vec{u} \cdot d\vec{u}}\\ &=& \displaystyle{m\gamma (\tilde{u}){\tilde{u}}^2 - m\int_0^{\tilde{u}} \frac{\vec{u} \cdot d\vec{u}}{\sqrt{1-\frac{u^2}{c^2}}}}\\ &=& \displaystyle{m\gamma (\tilde{u}){\tilde{u}}^2 - m\int_0^{\tilde{u}}\frac{\frac{1}{2}du^2}{\sqrt{1-\frac{u^2}{c^2}}}}\\ &=& \displaystyle{m\gamma (\tilde{u}){\tilde{u}}^2 - mc^2 \left[\sqrt{1-\frac{u^2}{c^2}}\right]_0^{\tilde{u}}}\\ &=& m\gamma (\tilde{u})\tilde{u}^2 - mc^2\left ( - \sqrt{1-\frac{\tilde{u}^2}{c^2}}+1 \right )\\ &=& m\gamma (\tilde{u})\tilde{u}^2 + \frac{mc^2}{\gamma(\tilde{u})}-mc^2\\ &=& \displaystyle{-mc^2+mc^2\gamma(\tilde{u}) \left ( \frac{\tilde{u}^2}{c^2}+1-\frac{\tilde{u}^2}{c^2} \right ) }\\ &=& mc^2(\gamma(\tilde{u})-1) \end{array}

Integration by parts

Easy to remember integration by parts formula, from the product rule

(fg)=fg+fg(fg)=fg+fgfg=[fg]fg\begin{array}{rcl} (fg)' &=& f'g+fg' \\ \Rightarrow \int (fg)' &=& \int f'g + \int fg'\\ \int f'g &=& [fg] - \int fg' \end{array}

In the derivation of the kinetic energy we used f=dγ(u)uf'=d\gamma(u)\vec{u} and g=ug=\vec{u}.

If we now inspect the limiting cases for the velocity

ΔEkin=mc2(γ(u)1)\Delta E_{kin} = mc^2(\gamma(u)-1)

The limiting cases work out. Very reassuring.

We can add a constant (LT invariant) to the kinetic energy E=Ekin+mc2=mγ(u)c2E=E_{kin}+mc^2 = m\gamma(u)c^2. Adding constants to the energy/potential is always allowed as only the change of it is physically relevant (or the relative energies). The reason for this constant will be apparent below as this allows to include the energy in 4-momentum nicely.

We obtain

E=mγ(u)c2E=m\gamma(u) c^2

or in the rest frame (u=0γ(u)=1)(u=0 \Rightarrow \gamma(u)=1)

E=mc2E=mc^2

With this energy E=mγ(u)c2E=m\gamma(u)c^2 we can define the 4-momentum as follows (we had P=(mγ(u)c,p)\vec{P}=(m\gamma(u)c,\vec{p}))

P=(Ec,p)\vec{P}=\left ( \frac{E}{c},\vec{p} \right )

4-momentum with a different energy?

With a different energy (addition of another constant to EkinE_{kin} than what we did above) the length of the 4-momentum would not be LT invariant and P\vec{P} not a 4-vector. If we would have used E=mc2(γ1)E=mc^2(\gamma -1) then P2P^2 would not be LT invariant. You see this by computing P2=Ekin2c2p2c2=m2c2(22γ)P^2=\frac{E^2_{kin}}{c^2}-p^2c^2=m^2c^2(2-2\gamma).

And we have finally derived the most famous equation in physics. We will use, however, E=mγ(u)c2E=m\gamma(u)c^2 most of the time as we are not always in the rest frame. The equation says essentially that mass is the same as energy. They are different manifestations of the same thing. A particle has energy in itself at rest without being in any potential.

5.4.1Mass in units of energy

The mass of an electron me=9.131031kgm_e = 9.13\cdot 10^{-31} \mathrm{kg} is often given as 512keV,[kiloelectronVolts].Massofallelementaryparticlesisgivenactuallyinunitsof512 \mathrm{keV}, [kilo electron Volts]. Mass of all elementary particles is given actually in units of \mathrm{eV}$.

One electron volt is

1eV=1.61019C1V=1.61019J1 eV = 1.6\cdot 10^{-19} C \cdot 1V = 1.6\cdot 10^{-19} J

The conversion to mass via E=mc2E=mc^2

mec2=8.21014J=8.210141.61019=512keVm_e c^2 = 8.2 \cdot 10^{-14}J = \frac{8.2 \cdot 10^{-14}}{1.6\cdot 10^{-19}} = 512 \mathrm{keV}

5.4.2The fame

The origin of the fame is probably twofold.

5.5Energy-momentum relation

The 4-momentum is, of course, a 4-vector and therefore P2P^2 is LT invariant. Let us have a look at the outcome with P=(Ec,p)\vec{P}=\left ( \frac{E}{c},\vec{p} \right )

P2=E2c2p2=m2γ2(u)c2m2γ2(u)u2=m2γ2(u)c2(1u2c2)=m2c2E2p2c2=m2c4\begin{array}{rcl} P^2 = \frac{E^2}{c^2}-p^2 &=& m^2\gamma^2(u) c^2 - m^2\gamma^2(u)u^2\\ &=& m^2\gamma^2(u)c^2 \left ( 1-\frac{u^2}{c^2}\right ) = m^2c^2\\ \Rightarrow E^2-p^2c^2 &=& m^2c^4 \end{array}

Indeed, we find that P2P^2 is LT invariant as mm and cc are LT invariants. Rearranging the equation, we obtain

E2=(mc2)2+(pc)2E^2 = (mc^2)^2 + (pc)^2

This converts back to E=mc2E=mc^2 in the rest frame.

Einstein triangle.

Figure 1:Einstein triangle.

You can visualize the energy momentum relation with the Einstein triangle shown here, as the relation has the form of c2=a2+b2c^2=a^2+b^2. With the kinetic energy as Ekin=mc2(γ(u)1)E_{kin}=mc^2(\gamma(u)-1). E=E0+Ekinmc2+EkinE=E_0+E_{kin}\equiv mc^2 +E_{kin}.

5.5.1LT invariance of P²

Above we found a very useful, but bit hidden relation in the derivation

P2=m2c2P^2 = m^2c^2

This is of course LT invariant, as mm and cc are LT invariants (and the momentum is a 4-vector), but more importantly we can use this for computations of relativistic collisions. By the conservation of 4-momentum we can of course compute all collisions by equating the 4 components of the momentum before and after the collision. It is often, however, mathematically easier to write down the conservation of momentum and then square it. Because you can write down P2=m2c2P^2=m^2c^2 directly, this saves often computations.

5.6Photons

For photons we have the energy given by E=ωE= \hbar \omega and the momentum as p=ωcp= \frac{\hbar\omega}{c}. The 4-momentum of a photon is

P=Pμ=(Ec,p)=(ωc,ωc)(hνc,hνc)\vec{P} = P^\mu = \left ( \frac{E}{c},\vec{p} \right ) = \left ( \frac{\hbar \omega}{c}, \frac{\hbar \omega}{c} \right ) \left ( \frac{h \nu}{c}, \frac{h \nu}{c} \right )

It is directly clear that for photons the LT invariant P2=0P^2=0.

We could substitute the photon 4-momentum into the energy-momentum relation, we find

E2=(pc)2+(mc2)2m=0E^2 = (pc)^2+(mc^2)^2 \Rightarrow m=0

This seems to confirm that photons do not have mass. But we need to be careful: photons do not have a 4-momentum of the form Pμ=(mγc,mγu)P^\mu = ( m\gamma c, m\gamma u). They can’t: (1) their velocity is always c, which would lead to \infty for their γ(c)\gamma (c), (2) with a mass m=0m=0 we multiply γc\gamma c by zero. Together, this would gives us 0×0 \times \infty which is not defined in a unique way.

Thus: photons do not have mass. Do not get confused with E=mc2E=mc^2.

5.6.1Rest frame of a photon?

Does a photon have a rest frame? It travels with the speed of light cc (obviously) in all frames.

The answers is no and we give three good arguments.

5.6.2Doppler revisited

In chapter 14 we discussed the Doppler effect from a relativistic point of view. With the concept of 4-momentum it is easy to derive the Doppler shift of photons as observed in different frames of reference. We take the usual LT between SS' and SS. In SS’ a photon is moving along the xx'-direction. It has frequency ff'. Its 4-momentum is

Pphotonμ=(hfc,±hfc)P'^\mu_{photon} = \left ( \frac{hf'}{c}, \pm \frac{hf'}{c} \right )

The ±-sign indicates the direction of the photon: + for moving in the positive xx'-direction, - for moving in the negative xx'-direction.

Using the Lorentz Transformation, we can easily transform the 4-momentum to the frame of SS:

hfc=γ(hfc+Vc±hfc)=γ(1±Vc)hfcff=1±V1V2\begin{split} \frac{hf}{c} &= \gamma \left (\frac{hf'}{c} + \frac{V}{c} \frac{\pm hf'}{c}\right ) = \gamma \left ( 1 \pm \frac{V}{c} \right ) \frac{hf'}{c} \Rightarrow \\ \frac{f}{f'} &= \frac{1 \pm V}{\sqrt{1-V^2}} \end{split}

Note that we didn’t use the transformation of Pphoton1P'^1_{photon} as this will give the same result.

5.7Speed of light as limiting velocity

The gammagamma factor increases strongly if the speed approaches the speed of light u/c1u/c\to 1 as can be seen in this plot

Source
import numpy as np
import matplotlib.pyplot as plt

c = 299792458 # speed of light in m/s
x = np.linspace(0, c, 1000)
y = 1 / np.sqrt(1 - (x / c)**2)

plt.figure(figsize=(8, 6))
plt.plot(x/c, y, 'b-')
plt.vlines(1,0,20,color='red')
plt.xlim(0,1.1)
plt.ylim(0,20)
plt.xlabel('$u/c$')
plt.grid()
plt.savefig('../gamma_v.svg')
plt.show()
The gamma factor increases strongly if the speed approaches the speed of light u/c\to 1

Figure 2:The gammagamma factor increases strongly if the speed approaches the speed of light u/c1u/c\to 1

For a massive particle this has strong consequences. In the limit ucu\to c the factor goes towards infinity. If we consider that the kinetic energy is E=m(γ(u)1)c2E=m(\gamma(u) -1)c^2, the amount of work done to increase the speed increases with gammagamma. Therefore no massive particle can move with the speed of light (or faster) as this would require an infinite amount of energy for the acceleration.

NB: cc is the speed of light in vacuum. In matter the speed of light vv is smaller than cc, characterized by the refractive index nn as n=c/vn=c/v. This leads e.g. to refraction by Snell’s law at an interface. In matter the speed of massive particles can be larger than the speed of light there. This happens e.g. in a nuclear reactor when electrons move faster than the speed of light in water (0.75c0.75c). As water is a dielectric, the light waves generated from the response to the moving charge lag behind and a phenomena similar to a sonic boom is created. This phenomena is termed Cherenkov radiation. If you have the opportunity to see it in a nuclear reactor, we highly recommend to take it. The color is a very intense deep blue.

Cherenkov radiation glowing in the core of the Advanced Test Reactor at Idaho National Laboratory (Wikipedia Commons, CC BY-SA 2.0)

Figure 3:Cherenkov radiation glowing in the core of the Advanced Test Reactor at Idaho National Laboratory (Wikipedia Commons, CC BY-SA 2.0)