Skip to article frontmatterSkip to article content

3Velocity Transformation & Doppler shift

Imagine we have two space ships moving each with a speed of 34c\frac{3}{4}c as shown below. What is the speed that either the red or yellow space ships sees for the other space ship speed?

We should, first of all realize, that the information regarding the velocity of the two space ships is given by an observer SS who is neither in the red nor the yellow ship. We need to transform this information to an observer in the red or in the yellow ship.

Two space ships approaching each other.

Figure 1:Two space ships approaching each other.

For the GT we have derived the velocity transformation to be

vx=vxVv'_{x'} = v_x-V

So, let’s translate our velocity information from the observer SS to someone in the red ship. The relative velocity between SS and the red ship is V=34cV=\frac{3}{4}c. Thus according to the observer in the red ship, SRS_R, her velocity is VR=VRVR=0V'_R = V_R -V_R =0, obviously.

However, she will denote the velocity of the yellow ship as Vy=VyVR=(3/43/4)c=32c>cV'_y = V_y - V_R = (-3/4 - 3/4)c = \frac{3}{2}c > c. In the world of Galilei and Newton, this is no problem at all: velocities can be as big as you can imagine. However, in reality, this is not true. We have to use Special relativity if the velocities start to approach cc. It is not possible for any object to move faster than the speed of light, as we will see later.

In the above, we have only looked at the velocity component in the xx-direction. We have in addition found vy=vy,vz=vzv'_{y'}=v_y, v'_{z'}=v_z.

As our universe does not follow Galilei and Newton, we need to derive the transformation/addition formula for velocities with the LT. So let’s do it.

3.1Velocity Transformation

Let us start from the definition of velocity (assuming we deal with constant velocities, so we don’t need to worry about differentiation and integration). We will denote velocities by uu to avoid confusion with VV, the relative velocity between the two observers.

Observer SS' will write down:

ux=x2x1t2t1=ΔxΔt and  uy=y2y1t2t1=ΔyΔt()u'_{x'} = \frac{x'_2-x'_1}{t'_2-t'_1} =\frac{\Delta x'}{\Delta t'} \text{\,\,\,\,\ and \,\,\,\, } u'_{y'} = \frac{y'_2-y'_1}{t'_2-t'_1} =\frac{\Delta y'}{\Delta t'} \quad (*)

We have left out the zz'-component as that will be completely analogous to the yy'-coordinate.

Observer SS will use similar definitions. How do these observers translate velocity information that they get from each other?

We need to use the LT to transform (ct,x,y)(ct',x',y') to (ct,x,y)(ct,x,y):

x2x1=γ(x2Vcct2)γ(x1Vcct1)=γ(x2x1)γVc(ct2ct1)y2y1=y2y1\begin{array}{rcl} x'_2-x'_1 &=& \gamma \left ( x_2 -\frac{V}{c} ct_2 \right ) -\gamma \left ( x_1 -\frac{V}{c} ct_1 \right )\\ &=& \gamma (x_2-x_1)-\gamma \frac{V}{c}(ct_2-ct_1) \\ \\ y'_2-y'_1 &=& y_2 - y_1 \end{array}

and

ct2ct1=γ(ct2Vcx2)γ(ct1Vcx1)=γ(ct2ct1)γVc(x2x1)\begin{array}{rcl} ct'_2-ct'_1 &=& \gamma \left ( ct_2 -\frac{V}{c} x_2 \right ) -\gamma \left ( ct_1 -\frac{V}{c} x_1 \right )\\ &=& \gamma (ct_2-ct_1)-\gamma \frac{V}{c}(x_2-x_1) \end{array}

From the last line it is clear that also the y,zy,z components of the velocity u\vec{u} will be influenced by the transformation although the relative motion between the two observers is only along the xx-direction. Substituting the expressions for the space and time difference into vxv'_{x'}, we obtain

ux=γΔxγVcΔctγΔctγVcΔx=ΔxΔtV1Vc2ΔxΔt=uxV1Vuxc2\begin{array}{rcl} u'_{x'}&=& \displaystyle{\frac{\gamma \Delta x - \gamma \frac{V}{c}\Delta ct}{\gamma \Delta ct - \gamma \frac{V}{c}\Delta x} = \frac{\frac{\Delta x}{\Delta t}-V}{1-\frac{V}{c^2}\frac{\Delta x}{\Delta t}}}\\ \\ &=& \displaystyle{\frac{u_x-V}{1-\frac{Vu_x}{c^2}}} \end{array}

For the transverse components y,zy,z, we obtain due to the change of the time interval

uy=ΔyγΔctγVcΔx=uyγ(1Vuxc2)u'_{y'} = \frac{\Delta y}{\gamma \Delta ct - \gamma \frac{V}{c}\Delta x} = \frac{u_y}{\gamma \left (1-\frac{Vu_x}{c^2}\right )}

In the limit of ux,Vcu_x,V\ll c both formulas reduce to the Galileo transformation as required. For uxcu_x\to c and VcV\to -c the combined velocity will stay smaller than cc. Check yourself that this is true.

The formula for the velocity transformation/addition are not so easy to remember. Later you will see how to derive them from the transformation properties of the 4-velocity, which is easy to remember.

For our example of the two approaching space ships, ux=34c,V=34cu_x=-\frac{3}{4}c, V=\frac{3}{4}c we find for the speed of the yellow approaching the red ship

ux=34341+3434c=2425c<cu''_{x''} = \frac{-\frac{3}{4}-\frac{3}{4}}{1+\frac{3}{4}\frac{3}{4}}c=-\frac{24}{25}c <c

This is again smaller (in absolute sense) than cc. For the other ship we find of course the same, but with different sign.

3.2Doppler effect

The Doppler effect is well-known from waves. You know it from daily life. If a car is passing you at high speed, the frequency of the sound you hear changes from approaching to driving away from you. The received frequency fobsf_{obs} by you is higher than the emitted frequency fsrcf_{src} while the car is approaching, and smaller when it drives off.

Here we investigate the effect of a moving source that is emitting light with fsrcf_{src} (electro-magnetic waves). This is one of the few cases where the relativistic case is easier than the classical effect. In the latter it matters if the source is moving or the medium. For EM-waves, however, there is no medium (aether) as we have seen which simplifies things.

Effect on sound waves due to motion.

Figure 2:Effect on sound waves due to motion.

For the case of an observer with speed vobsv_{obs} and speed of sound in the medium uu and moving source vsrcv_{src} (e.g. car) the classical formula of the frequency shift is

fobs=fsrcu±vobsuvsrcf_{obs} = f_{src}\frac{u\pm v_{obs}}{u\mp v_{src}}

where for the stationary observer and medium, we have +/+/- and for the moving observer and stationary source /+-/+.

The origin of the observed frequency shift of a moving source is visible in the picture. In the direction of motion, more wave maxima arrive per unit time, as the car is moving closer between two wave maxima.

For the relativistic effect we consider a moving source with velocity u\vec{u} moving with observer SS' relative to SS. The frequency of the source is f0=1T0f_0=\frac{1}{T_0} in the rest frame, with T0T_0 the period of the oscillation.

Observer S’ and source both moving with respect to S.

Figure 3:Observer S’ and source both moving with respect to S.

We now consider the situation for SS as shown in the figure below. The position of the source r0\vec{r}_0 is indicated with the star *.

.

Figure 4:.

We do know that according to SS', the proper frequency is f0f_0 and the proper period T0=1/f0T_0 = 1/f_0. Thus if a maximum is send at t0t'_0 the next one will be at t0+1f0t'_0 + \frac{1}{f_0}.

SS will denote the first maximum with time t1=t0t_1 = t_0, but will have to take time dilation into account for the second one: t2=t0+γf0t_2 = t_0 + \frac{\gamma}{f_0}. Note that these two time instants are the moments, according to SS when the two maxima are send, not when they are received by SS.

During this time interval γf0\frac{\gamma}{f_0} the source moves from r0\vec{r}_0 to r1=r0+uγf0\vec{r}_1=\vec{r}_0+\vec{u}\frac{\gamma}{f_0}. Thus, the distance that the second maximum has to travel is different from that of the first one, just like in the classical Doppler case.

We consider the 2 consecutive wave maxima that are emitted in SS' and received in SS:

To move further we split the velocity of the source into a radial component (in line to the observer in SS) and a tangential part perpendicular u=ur+ut=urr^+utt^\vec{u}=\vec{u}_r+\vec{u}_t =u_r \hat{r} +u_t \hat{t}. If the distance r0uγf0r_0 \gg \vec{u} \frac{\gamma}{f_0} then the distance r1=r0+urγf0r_1 = r_0 + u_r \frac{\gamma}{f_0}.

Note that we could drop the vector notation here from the exact relation above. Classically only the radial component is relevant as only the distance matters.

With this simplification on the distances we can compute t2t_2

t2=(t0+γf0)+r1ct0+γf0+r0+urγf0ct_2 = (t_0+\frac{\gamma}{f_0})+\frac{r_1}{c} \approx t_0+\frac{\gamma}{f_0} + \frac{r_0+u_r \frac{\gamma}{f_0}}{c}

For the frequency ff in SS we now subtract the two arrival times

1f=t2t1=γf0+urcγf0\frac{1}{f} =t_2-t_1 = \frac{\gamma}{f_0}+ \frac{u_r}{c}\frac{\gamma}{f_0}

Rewriting this into a ratio of the emitted and received frequency, we obtain for the relativistic Doppler effect

f0f=γ(1+urc)=1+urc1u2c2\frac{f_0}{f} = \gamma \left (1+\frac{u_r}{c} \right ) = \frac{1+\frac{u_r}{c}}{\sqrt{1-\frac{u^2}{c^2}}}

Please observe that the gammagamma factor is of γ(u)\gamma(u) that means even for only tangential movement (ur=0)(u_r=0) there is a Doppler shit.

3.2.1Cosmic background radiation

The most well-known frequency shift is the red-shift from the expanding universe.

The astronomer Edwin Hubble first found in the 1920s that the universe does not only consists out of our own galaxy, the milky way, but there must be (many) other galaxies, which were called nebula at that time. Second he could show that all further away galaxies move away from us by measuring the Doppler shift of well-known emission lines of stars and their distance from periodic intensity variation of Cepheid Variable stars. It turned out that the distance of the galaxies dd was roughly linearly proportional to the red-shift which is again linearly related to the the radial velocity vv as we derived. This is known now as Hubble’s law v=H0dv=H_0 d with the Hubble constant (H070 km/s/MpcH_0 \sim 70\ km/s/Mpc). Further away galaxies move faster away, but why? And why is no galaxy approaching us?

At end of the 1920s Georges Lemaître applied Einstein general theory of relativity to cosmology and found that the universe must be expanding, while it started in a “primeval atom”, now known as the Big Bang. He could explain the red-shift relation from the expanding universe hypothesis.

The Big Bang theory was highly debated early on, in particular by Einstein, but is now fully accepted. The strongest experimental evidence was the discovery of the cosmic background radiation in 1965 (by accident).

The whole cosmos is nearly uniformly filled by a background radiation of about 2.7K2.7 \mathrm{K} (wavelength in the μ\mum range) with small inhomogeneities as shown in the picture by the Planck satellite around 2013.

Background radiation in the universe as observed from earth.
By NASA / WMAP Science Team - http://map.gsfc.nasa.gov/media/121238/ilc_9yr_moll4096.png, Public Domain.

Figure 5:Background radiation in the universe as observed from earth. By NASA / WMAP Science Team - http://map.gsfc.nasa.gov/media/121238/ilc_9yr_moll4096.png, Public Domain.

This radiation is the red shifted radiation from around 380,000 years after the Big Bang when the universe became transparent. At that time the temperature had dropped (due to the adiabatic expansion) to around 3000 K, at which protons and electrons can form stable hydrogen atoms p+eHp+e^- \to H. This event is called recombination. At higher temperatures photons are scattered from the free electrons (and protons) constantly, effectively the photons have a very short mean free path and the universe is opaque. At the recombination temperature all of a sudden the photon could travel without strong scattering, the universe was transparent. The 3K cosmic background radiation that we measure today is the red-shifted version of this 3000 K light. It gives us a glimpse of how the universe looked at that time. Apart from the background radiation there were no other light sources in the universe as stars had not formed yet, the Dark Ages of the universe began.

The red-shift here is actually caused by the expansion of the universe itself (the universe expands causing the photons’ wavelength to expand). NB: Time in cosmology is often given in units of red-shift (e.g. the red-shift for recombination is z=1089z=1089).

Wavelength temperature relation

How can we relate the wavelength of electro-magnetic radiation to temperature?

Matter emits electro-magnetic radiation depending on its temperature. This relation is given by Planck’s law with which quantum mechanics started in 1900 as he considered black body radiation. The emitted spectral density per solid angle depends on the thermal energy kTkT and is given by

u(λ,T)=2hcλ51ehc/λkT1u(\lambda, T) = \frac{2hc}{\lambda^5}\frac{1}{e^{hc/\lambda kT}-1}

Here for the first time hh, Planck’s constant, was introduced to quantize energy packages hfhf of oscillation.

Phenomenologically, the relation between the peak of the spectrum and the temperature was found by Wien already earlier to be λpeak=bT\lambda_{peak} = \frac{b}{T} with bb Wien’s constant b2900 μmKb\sim 2900\ \mu m\cdot K.

NB: If you buy a light bulb for a lamp, then a temperature is indicated on the packaging, e.g. 2700K2700 \mathrm{K} for “warm white”, 4000K4000 \mathrm{K} for “cool white” to describe the light color. Quantum physics at your local super market.

import numpy as np
import matplotlib.pyplot as plt
import ipywidgets
from IPython.display import display

# Constants
h = 6.62607015e-34  # Planck constant, J*s
c = 3.0e8  # Speed of light, m/s
k = 1.380649e-23  # Boltzmann constant, J/K

# Wavelength range (from 1 nm to 3000 nm)
wavelengths = np.linspace(1e-9, 3000e-9, 1000)

# Planck's Law function with overflow protection
def planck(wavelength, T):
    exp_term = (h * c) / (wavelength * k * T)
    # Prevent overflow in the exponential function
    with np.errstate(over='ignore'):
        return (2.0 * h * c**2) / (wavelength**5) * np.where(exp_term < 700, 1/(np.exp(exp_term) - 1), 0)

# Visible spectrum range in meters
visible_start = 380e-9
visible_end = 750e-9

# Colors of the visible spectrum in the correct order from violet to red
colors = ['violet', 'indigo', 'blue', 'green', 'yellow', 'orange', 'red']
num_colors = len(colors)
wavelength_visible = wavelengths[(wavelengths >= visible_start) & (wavelengths <= visible_end)]
color_indices = np.linspace(0, len(wavelength_visible), num_colors + 1, dtype=int)

# Function to plot the Planck curve
def plot_planck_curve(T):
    spectral_radiance = planck(wavelengths, T)
    
    plt.figure(figsize=(10, 6))
    
    # Plot the Planck curve
    plt.plot(wavelengths * 1e9, spectral_radiance, label=f'T = {T} K')
    
    # Highlight the visible spectrum with rainbow bands
    for i in range(num_colors):
        plt.fill_between(wavelength_visible[color_indices[i]:color_indices[i+1]] * 1e9, 
                         spectral_radiance[(wavelengths >= visible_start) & (wavelengths <= visible_end)][color_indices[i]:color_indices[i+1]], 
                         color=colors[i])
    
    # Plot settings
    plt.xlim(0, 3000)
    plt.ylim(0, 1.1 * max(spectral_radiance))
    plt.xlabel('Wavelength (nm)')
    plt.ylabel('Spectral Radiance')
    plt.title('Planck Curve')
    plt.legend()
    plt.grid(True)
    plt.show()

ipywidgets.interact(plot_planck_curve,T=(1,10000,100))