Skip to article frontmatterSkip to article content

4Spacetime and 4-vectors

4.1Space time

In 3D space we define a point/coordinate by its components (x,y,z)(x,y,z) where all components have the same unit. We can do this also in 4D space time by an event (ct,x,y,z)(ct,x,y,z) as ctct has unit length (it should be called time space by this ordering, but what ever). The same unit for all components is needed if we want to do geometry with the coordinates.

If we want to measure distances Δs\Delta s between two points (x1,y1,z1)(x_1,y_1,z_1) and (x2,y2,z2)(x_2,y_2,z_2) we do this in 3D Euclidean space as Δs2=(x2x1)2+(y2y1)2+(z2z1)2=Δx2+Δy2+Δz2\Delta s^2 = (x_2-x_1)^2+(y_2-y_1)^2+(z_2-z_1)^2 = \Delta x^2 + \Delta y^2 + \Delta z^2. These distances are Galileo invariant, observer SS and SS' moving with V\vec{V} measure the same distance Δs2=Δs2\Delta s^2 = \Delta s^{'2}. Note, that we take these two pints at the sam time tt: t1=t2t_1=t_2. Or rephrased: we perform the measurement in the rest frame of the object we measure. That makes sense: measuring the length of an object that is moving requires that we measure the left and right side at the same time. Otherwise, the motion of the object will interfere with our measurements of the length.

The above statement is easily shown by invoking the Galilei Transformation:

x=xVty=yz=zt=t\begin{split} x' &= x - V t \\ y' &= y \\ z' &= z \\ t' &= t \end{split}

We transform the two points (x1,y1,z1)(x_1,y_1,z_1) and (x2,y2,z2)(x_2,y_2,z_2) at the same time tt, we get:

x1=x1Vt,x2=x2Vtx2x1=x2x1y1=y1,y2=y2y2y1=y2y1z1=z1,z2=z2z2z1=z2z1t=t\begin{split} x'_1 &= x_1 - V t, x'_2 = x_2 - V t \Rightarrow x'_2 - x'_1 = x_2 - x_1\\ y'_1 &= y_1, y'_2 = y_2 \Rightarrow y'_2 - y'_1 = y_2 - y_1\\ z'_1 &= z_1, z'_2 = z_2 \Rightarrow z'_2 - z'_1 = z_2 - z_1\\ t' &= t \end{split}

If we want to measure distances in space time and require that the distance is now Lorentz invariant, we cannot measure distance the same way! If we measure in SS the positions at the same time, that will in general be at different times according to SS'. Time is relative!

To do geometry, measure angles etc. we need an inner product and the inner product provides a distance measure (a metric) by the norm. For 3D you know that for two vectors r1\vec{r}_1 and r2\vec{r}_2: Δs2=r1r22=(r1r2)(r1r2)=Δx2+Δy2+Δz2\Delta s^2 = || \vec{r}_1-\vec{r}_2 || ^2 = (\vec{r}_1-\vec{r}_2)\cdot (\vec{r}_1-\vec{r}_2)= \Delta x^2 + \Delta y^2 + \Delta z^2. Clearly the inner product in 4D space time cannot be defined in the same way.

We want that two relativistic observers measure the same distance (e.g. between two events), that is, it must be Lorentz invariant. We start by noting that the speed of light is constant for both observers. A light wave traveling in SS and SS' must therefore obey

c2t2x2y2z2=0=c2t2x2y2z2c^2t^2-x^2-y^2-z^2 = 0 = c^2t^{'2}-x^{'2}-y^{'2}-z^{'2}

Given this observation it is needed (and natural) to define the distance in space time as

ds2=c2dt2dx2dy2dz2d s^2 = c^2d t^2 - d x^2 - d y^2 - d z^2

!!! important “Warning” Notice directly that the distance Δs2\Delta s^2 can be negative! (And we are OK with that).

It is straightforward to show that the above distance ds2ds^2 is indeed a Lorentz Invariant, i.e. ds2=ds2ds'^2 = ds^2. Suppose we have two events: E1:(ct,x,y,z)E_1: (ct,x,y,z) and E2:(c(t+dt),x+dx,y+dy,z+dz)E_2: (c(t+dt),x+dx,y+dy,z+dz). We can transform these to SS' via the standard Lorentz Transformation:

ct2=γ(c(t+dt)Vc(x+dx))cdt=γ(cdtVcdx)x2=γ((x+dx)Vcc(t+dt))cdx=γ(dxVccdt)y2=y2dy=dyz2=z2dz=dz\begin{array}{rcl} ct'_2 &=& \gamma \left ( c(t+dt) - \frac{V}{c}(x+dx) \right ) \Rightarrow &cdt' &=& \gamma \left ( cdt - \frac{V}{c}dx \right ) \\ x'_2 &=& \gamma \left ( (x+dx) - \frac{V}{c}c(t+dt) \right ) \Rightarrow &cdx' &=& \gamma \left ( dx - \frac{V}{c}cdt \right ) \\ y'_2 &=& y_2 \Rightarrow &dy' &=& dy\\ z'_2 &=& z_2 \Rightarrow &dz' &=& dz \end{array}

Clearly, we do only have to concentrate on the cdtcdt and dxdx terms:

cdt2dx2=γ2(cdtVcdx)2γ2(dxVccdt)2=γ2(c2dt22Vccdtdx+V2c2dx2dx2+2VcdxcdtV2c2c2dt2)=γ2(1V2c2)=1(c2dt2dx2)=c2dt2dx2\begin{array}{rcl} cdt'^2 - dx'^2 &=& \gamma^2 \left ( cdt - \frac{V}{c}dx\right )^2 - \gamma^2 \left ( dx - \frac{V}{c}cdt \right )^2 \\ &=& \gamma^2 \left ( c^2 dt^2 - 2 \frac{V}{c} cdtdx + \frac{V^2}{c^2}dx^2 - dx^2 + 2 \frac{V}{c} dxcdt - \frac{V^2}{c^2}c^2dt^2\right ) \\ &=& \underbrace{\gamma^2 \left ( 1 - \frac{V^2}{c^2} \right )}_{=1} \left ( c^2dt^2 - dx^2 \right) \\ &=& c^2dt^2 - dx^2 \\ \end{array}

Note that if we had used a + sign, that is ds2c2dt2+dx2ds^2 \equiv c^2dt^2 + dx^2, we would not have arrived at a Lorentz Invariant.

4.24-vector

The idea of having to work with a ‘position’ vector with 4 components with an inproduct as we discussed above, is generalized to vectors, i.e. quantities with a direction and a magnitude.

We define a 4-vector A=Aμ=(A0,A1,A2,A3)\vec{A}=A^\mu=(A^0,A^1,A^2,A^3) to be a vector that transforms between two observers SS and SS' moving with VV along the xx-direction by the LT

A0=γ(A0VcA1)A1=γ(A1VcA0)A2=A2A3=A3\begin{array}{rcl} A^{0'} &=& \gamma \left ( A^0-\frac{V}{c}A^1\right ) \\ A^{1'} &=& \gamma \left ( A^1-\frac{V}{c}A^0\right ) \\ A^{2'} &=& A^2\\ A^{3'} &=& A^3 \end{array}

Other tuples of 4 values are not 4-vectors. The requirement that the 4-vector must transform via the LT is essential. We will use this later for the 4-velocity and 4-momentum.

4.2.1Inner product & conventions

Like the distance also the inner product can be defined between two 4-vectors. We use a capital letter for a 4-vector

A=Aμ=(A0,Ak)=(A0,A1,A2,A3)=(A0,a)\vec{A} = A^\mu = (A^0,A^k)=(A^0,A^1,A^2,A^3)=(A^0, \vec{a})

This notation is just to a make clear distinction with 3-vectors that only have spatial coordinates. With a Greek index μ,Aμ\mu, A^\mu we indicate all 4 components of the vector, while with a Latin index k,Akk,A^k we only indicate the spatial components. We also start counting at 0 for the first component, which is ‘time’.

The inner product between two 4-vectors A,B\vec{A}, \vec{B} is now defined according to the rule we already saw before

ABA0B0A1B1A2B2A3B3\vec{A}\cdot \vec{B} \equiv A^0B^0 - A^1B^1-A^2B^2-A^3B^3

This is not a “choice” for the inner product, but follows strictly from the requirement that distance or length should not change under LT. A space with this inner product is called Minkowski space or the space has a Minkowski metric after Hermann Minkowski.

Notice that time component (+)(+) is treated differently than the spatial components ()(-) in the inner product. Sometimes the inner product is also called pseudo Euclidean as there are -1 and +1 present in the inner product (instead of only +1 for Euclidean space).

4.2.2Lorentz invariants

As is clear by the above construction the inner product of two 4-vectors must be LT invariant, that is for observers S:A,BS:\vec{A},\vec{B} and S:A,BS':\vec{A}',\vec{B}' it holds

AB=AB\vec{A}\cdot \vec{B} = \vec{A}'\cdot \vec{B}'

This property can be a very powerful tool (OK, we constructed it that way). If we know the value of the inner product in one frame of reference, it will be the same in all other inertial frames of reference! We will use that later often. It is also clear that the distance interval ds2ds^2 is a Lorentz invariant.

??? “Inner product LT invariant: the hard way”

If you do not believe that the inner product is LT invariant you can write it out of course (with βVc\beta \equiv \frac{V}{c}, a short hand notation that is frequently used).

We compute AB\vec{A}'\cdot \vec{B}'. We will concentrate on only A0B0A1B1A^0B^0 - A^1B^1, as with the standard Lorentz Transformation the A2A^2 and A3A^3 component are trivial.

AB=γ(A0βA1)γ(B0βB1)γ(A1βA0)γ(B1βB0)=γ2(A0B0βA1B0βA0B1+β2A1B1)γ2(A1B1βA0B1βA1B0+β2A0B0)=γ2(1β2)(A0B0A1B1)=A0B0A1B1=AB\begin{array}{rcl} \vec{A}'\cdot \vec{B}' &=& \gamma ( A^0 - \beta A^1 ) \cdot \gamma ( B^0 - \beta B^1) - \gamma ( A^1 - \beta A^0) \cdot \gamma ( B1 - \beta B^0) \\ &=& \gamma^2 \left ( A^0B^0 - \beta A^1B^0 -\beta A^0B^1 + \beta^2 A^1B^1 \right ) \\ &-& \gamma^2 \left ( A^1B^1 - \beta A^0B^1 - \beta A^1B^0 + \beta^2 A^0B^0\right ) \\ &=& \gamma^2 (1 - \beta^2) (A^0B^0 - A^1B^1 ) \\ &=& A^0B^0 - A^1B^1 \\ &=& \vec{A}\cdot \vec{B} \end{array}

4.3The light cone

Let us consider an event in space time X=Xμ=(ct,x,y,z)=(x0,x1,x2,x3)\vec{X}=X^\mu=(ct,x,y,z)=(x^0,x^1,x^2,x^3). For sake of simplicity we only consider one space like component here. In the sketch we have the space axis (xx or x1x^1) to the right and the time axis (ctct or x0x^0) up. We consider 3 events A,B,CA,B,C (points in space time) and their connection to the origin OO

Here you visually can observe that the sign of the distance using the Minkowski inner product classifies parts of space time.

This is even more evident if you look at the light cone in the sketch. The cone mantel is generated by revolving the line x=ctx=ct, a light line. Here only a 2D cone is shown (ct,x,y)(ct,x,y), but of course this should be a 3D cone (ct,x,y,z)(ct,x,y,z). The inside of the cone at negative times is the past that could have influenced me at now. My now can influence my future (inside the cone to positive times). All the rest, outside the cone is not causally connected to me.

4.4Minkowski-diagram

Now we can have a look at world lines of an observer SS' with respect to SS traveling with VV along the xx-axis in a graphical manner. The world line of an object is the path that an object travels in the 4-dimensional spacetime.

We plot the coordinate system of SS' (blue) in the coordinate system of S (black).

Both lines of SS' make the same angle α\alpha with the coordinates axis of SS. They lie symmetric around the light line x=ctx=ct (diagonal with α=45\alpha=45 deg). The higher the speed VV the higher the angle and the closer the lines lie to the light line. See the animation below, where the (ct,x)(ct',x') axis are plotted in the (ct,x)(ct,x) diagram of SS for different values of V/cV/c.

To further investigate how this plot can help us, let us consider lines of equal time in SS. These are just the lines perpendicular to the ctct-axis, and parallel to the xx-axis, as you expect. And of course, lines parallel to ctct, perpendicular to xx are lines of constant space coordinate.

For the frame of reference SS' that is only a bit different.

With this information in hand, we can investigate how events are transferred from SS to SS'. We can graphically do a LT without the explicit computation.

In the animation below, we see the effect of different values of V/cV/c on the lines of constant ctct' and xx' as seen by SS. For clarity, these are only drawn for V/c0V/c \geq 0

4.4.1The ladder & barn revisited

We will now take a look back at the ladder and barn paradox. We had a barn of 10m10 \mathrm{m} wide and a ladder of 26m26 \mathrm{m} long (both measured in their rest frame). The ladder was moving towards the barn with high velocity. We start by drawing the barn SS (black) and ladder SS' (blue) coordinate systems in the Minkowski diagram. Now we add the barn world line into the diagram (light blue) with 2 lines of constant space coordinate (parallel to ctct) in SS.

Now we can add the ladder to SS'. It has rest length of 26m26 \mathrm{m} and in the (x,ct)(x',ct') plane it is a world line of constant space coordinate, therefore parallel to ctct'. The ladder itself is a line of constant time in ctct' and therefore parallel to xx'.

As the ladder moves (we move it parallel to xx' between the world lines) it will eventually enter the barn and hit the right door of the barn (dashed red line). This event is indicated by the space time point AA. For SS' the other end of the ladder is then still outside the barn at space time point CC. According to SS' the ladder does not fit into the barn.

When the ladder hits the right door for SS at space time point AA, he makes a measurement of the ladder. To this end we draw a line of constant time (dashed light blue, parallel to xx) until it intersects the world line of the ladder at space time point BB. Observer SS measures that the ladder fits into the barn.

From this diagram it is obvious that the events BB and CC are not the same, therefore it is not strange that SS and SS' disagree about the outcome of the measurement. Both are right! But they would not be able to agree that both doors shut at the same time, to capture the ladder.

4.4.2The twin paradox

Let there be two twins, Alice and Bob. Bob leaves earth in a space ship with relativistic speed v\vec{v}, while Alice remains back home on earth. At some time Bob turns around, with v-\vec{v} and comes back to Alice. Based on time dilation Alice will argue that Bob is younger than she due to ΔT=γΔT0\Delta T = \gamma \Delta T_0. For the gammagamma-factor it does not matter if Bob is moving away or approaching as it is quadratic in the velocity. For each year she ages, her brother only ages 1/γ1/\gamma years. Bob can argue that due to the principle of relativity, he is at rest and his twin sister is moving away and then coming back, therefore she will be younger than he - and we have a paradox.

This paradox has two issues:

  1. The principle of relativity is not applicable as Bob must turn around. This requires acceleration of his frame and breaks the symmetry of the problem.
  2. Bob will be younger than Alice, due to the relativity of simultaneity changing around the turning point. We can see this by looking at the Minkowski-diagram below. Just before Bob is turning around, his line of simultaneity is xx', but just after turning around his line of simultaneity is xx''. On the time line of Alice, Bob lines of simultaneity first is at point A, but then makes a jump around the turning point to B. Bob will be younger than Alice, by the length of this jump on her time line from A to B.
.

Figure 14:.

Extra: We symmetries the problem. Both Alice and Bob move in space ships away from each other at the same but opposite speed, then turn around and meet again. Who is older now?

Answer

They are the same age. You can now reason with symmetry even though both are accelerated. You can also draw the Minkowski-diagram similar to the above and see that both make the same “jump” in the time, and thus are the same age.

4.4.3Lines of invariant distance

We have seen that the length interval ds2ds^2 is a Lorentz invariant. Therefore we can use it to also indicate corresponding time and space units in a Minkowski diagram for two moving observers. If we fix ds2ds^2 then the equation ds2=c2dt2dx2ds^2=c^2 dt^2-dx^2 describes a hyperbola in (ct,x)(ct,x) of the Minkowski diagram.

Image from

Figure 18:Image from Idema (2023)

For ds2<0ds^2<0 we find the corresponding space units (the interval is space-like), and for ds2>0ds^2>0 the corresponding time units (the interval is time-like. All hyperbola have the light line ds2=0ds^2=0 as asymptotes.

4.5LT as a rotation

This part is optional, but insightful.

You can think of the LT as a rotation of the 4 coordinates of Minkowski space time. Obviously it is not a “normal” rotation with a rotation matrix RSO(n)R\in SO(n) as we encountered in change to polar coordinates.

The LT in matrix notation reads as follows with γ=11β2\gamma = \frac{1}{\sqrt{1-\beta^2}} and β=V/c\beta=V/c.

(ctxyz)=(γγβ00γβγ0000100001)(ctxyz)\left ( \begin{array}{c} ct' \\ x' \\ y' \\z' \end{array} \right ) = \left ( \begin{array}{cccc} \gamma & -\gamma\beta & 0 & 0\\ -\gamma\beta & \gamma & 0 & 0\\ 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 1\\ \end{array} \right ) \left ( \begin{array}{c} ct \\ x \\ y \\z \end{array} \right )

The matrix transfers the space time coordinates between two observers moving with VV. From this it is clear that transferring between more than two observers SSSS\to S' \to S^{''} \to \dots can be done easily by multiplying the respective Lorentz transformation matrices into one overall LT. This must be possible, of course, as the LT is a linear transformation in space time (ct,x)(ct,x).

From the matrix notation it is also clear that for rotations around “different axis”, speeds in x,y,zx,y,z direction, the order of change of frame matters as matrix multiplication does not commute.

In 3D normal space, distance is persevered under rotation with RSO(n)R\in SO(n), in Minkowski space distance is preserved under Lorentz transformation which too is a rotation.

You can see the rotation clearer if we introduce the quantity rapidity α\alpha, which is defined as tanhαVc\tanh \alpha \equiv \frac{V}{c} (a relativistic generalization of the modulus of the velocity. It goes from 0 for v=0v=0 to \infty for v=cv=c). We will not use the rapidity except here, however, it is used for relativistic velocity decompositions. With tanhα=Vc\tanh \alpha = \frac{V}{c} we can write the Lorentz transformation as (using γ=11tanh2α=coshα\gamma = \frac{1}{\sqrt{1-\tanh^2 \alpha}}=\cosh \alpha and γβ=tanhα1tanh2α=sinhα\gamma\beta=\frac{\tanh\alpha}{\sqrt{1-\tanh^2 \alpha}}=\sinh\alpha)

(ctxyz)=(coshαsinhα00sinhαcoshα0000100001)(ctxyz)\left ( \begin{array}{c} ct' \\ x' \\ y' \\z' \end{array} \right ) = \left ( \begin{array}{cccc} \cosh\alpha & -\sinh\alpha & 0 & 0\\ -\sinh\alpha & \cosh\alpha & 0 & 0\\ 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 1\\ \end{array} \right ) \left ( \begin{array}{c} ct \\ x \\ y \\z \end{array} \right )

Notice the similarity to the rotation with sine and cosine.

With that LT is a rotation in hyperbolic space with “angle” α\alpha (where α\alpha is the rapidity), we identify the matrix as L(α)L(\alpha). That the hyperbolic functions appear should not be a surprise as they are equivalent to the sine and cosine for the circle, (ct2+x2=1)(ct^2+x^2=1), for the hyperbola (ct2x2=1)(ct^2-x^2=1). Notice the relation to the inner products for standard and Minkowski space.

Minkowski made the sketch below to show that the Lorentz transformation is a rotation over a hyperbola not a circle as we were used to. The asymptotes of the hyperbola are given by the light lines.

Drawing by Minkowski

Figure 19:Drawing by Minkowski

The addition of velocities that we derived earlier is easy with this notation with rotations and rapidity L(α1)L(α2)=L(α1+α2)L(\alpha_1)L(\alpha_2)=L(\alpha_1+\alpha_2). In terms of speeds this reads

β=tanh(α1+α2)=tanhα1+tanhα21+tanhα1tanhα2=β1+β21+β1β2\beta = \tanh (\alpha_1+\alpha_2)= \frac{\tanh \alpha_1 +\tanh \alpha_2}{1+\tanh \alpha_1 \tanh \alpha_2}=\frac{\beta_1 + \beta_2}{1+\beta_1\beta_2}

The addition of velocities is brought back to hyperbolic identities.

References
  1. Idema, T. (2023). Introduction to particle and continuum mechanics. TU Delft OPEN Publishing. 10.59490/tb.81