Skip to article frontmatterSkip to article content

4Angular Momentum, Torque & Central Forces

4.1Torque & Angular Momentum

From experience we know that if we want to unscrew a bottle, lift a heavy object on one side, try to unscrew a screw, we better use ‘leverage’.

Lifting is easier using leverage.

Figure 1:Lifting is easier using leverage.

With a relatively small force FSF_S, we can lift the side of a heavy object. The key concept to use here is torque, which in words is loosely formulated: apply the force using a long arm and the force seems to be magnified. The torque is then force multiplied by arm: Γ=Forcetimesarm\Gamma = Force \, \text{times} \, arm

This is, of course, too sloppy for physicists. We need strict, formal definitions. So, we put the above into a mathematical definition.

That is: torque (or krachtmoment in Dutch) is the outer product of ‘arm’ as a vector(!) and the force. We notice a few peculiarities.

  1. like force, torque is a vector. That is: it has a magnitude and a direction. In principle: three components.
  2. its direction is perpendicular to the force vector F\vec{F} and perpendicular to the arm r\vec{r}.
  3. the arm is not a number: it is a vector!

We further know from experience that we can balance torques, like we can balance forces. Rephrased: the net effect of more than one force is found by adding all the forces (as vectors!) and using the net force in Newtons second law: ma=Fi=Fnetm\vec{a} = \sum \vec{F}_i = \vec{F}_{net}. From Newtons first law, we immediately infer: if Fi=Fnet=0\sum \vec{F}_i = \vec{F}_{net}=0 then the object moves at constant velocity. We can move with the object at this speed and conclude that it from this perspective has zero velocity: it doesn’t move, i.e. it is in equilibrium.

The same hols for torque: we can work with the sum of all torques that act on an object: Γi=Γnet\sum \vec{\Gamma}_i = \vec{\Gamma}_{net}. And if this sum is zero, the object is in equilibrium.

However, there is a catch: using torques requires that we are much more explicit and precise about the choice of our origin. Why? The reason is in the ‘arm’. That is only defined if we provide an origin.

4.1.1The seesaw and torque

Let’s consider a simple example (simple in the sense that we are all familiar with it): the seesaw.

An adult (left) and a child (right) on a seesaw.

Figure 2:An adult (left) and a child (right) on a seesaw.

It is obvious that the adult -seesawing with the child- should sit much closer to the pivot point than the child. That is: we assume that the mass of the adult is greater than that of the child.

Let’s turn this picture into one that captures the essence and includes the necessary physical quantities, and then draw a free-body diagram.

Free-body diagram of the seesaw and the masses.

Figure 3:Free-body diagram of the seesaw and the masses.

What did we draw?

  1. The force of gravity acting on the two masses MM and mm. That is obvious: without forces nothing will happen and there is nothing to be analyzed.
  2. The ‘reaction forces’ from the seesaw on both masses. Why? If the seesaw is in equilibrium, then each of the masses is in equilibrium and the sum of forces on each mass must be zero.
  3. The distance of each of the masses to the pivot point. Why? Leverage! The heavy MM must be closer to the pivot point the get equilibrium.
  4. An origin OO. Why? We need a point to measure the ‘arm’, ‘leverage’, from.
  5. The zz-coordinate. Why? We deal with forces in the vertical direction. Hence a coordinate, a direction that we all use, is handy.

Analysis
Time for a first analysis: what keeps this seesaw in equilibrium?

  1. The sum of forces on each of the masses is zero. As gravity pulls them down, the seesaw must exert a force of the same magnitude but in the opposite direction. These are the green forces.
  2. With this idea we have the masses in equilibrium, but not necessarily the seesaw. Why? We did not consider forces on the seesaw. Which are these: (a) the reaction force (i.e. the N3 pair) of the green force from the seesaw on mass MM. We did not draw that! Similarly, for the mass mm.
  3. Now that we focus on the seesaw itself: this is in equilibrium (that is given), but there are two forces acting on it in the negative zz-direction as we found in (2). Even if we consider the mass of the seesaw: that will not help, gravity will pull it downwards. What did we forget? The force at the pivot point, of course! The pivot will exert an upward force, preventing the seesaw from falling down. For simplicity, we assume that the seesaw has zero mass. Thus, there are three forces acting on it: Mg-Mg, mg-mg, FpF_p with FpMgmg=0F_p - Mg - mg = 0.

Let’s redraw, now concentrating on the forces on the seesaw.

Free-body diagram of the seesaw.

Figure 4:Free-body diagram of the seesaw.

Analysis part 2
We know that the seesaw is in equilibrium, thus

FpMgmg=0F_p - Mg - mg = 0

This guarantees that the seesaw does not change its velocity, and as it does not move at some time t0t_0, it doesn’t move for all t>t0t>t_0.

But this doesn’t guarantee that the seesaw doesn’t rotate around the pivot point. For that we need that the ‘leverages’ on the left and right side ‘perform’ the same.
Making this precise: the torques exerted on the seesaw must also equate to zero.
We have 3 forces, thus 3 torques: Mg-Mg with arm LL, mg-mg with arm ll and FpF_p with arm zero.

Now we need to be even more precise: torque is a vector and it is made as an outer product of the vector ‘arm’ and the force.
We have already drawn an xx-coordinate in the figure, that will allow us to write the ‘arm’ as a vector. After all, we need to evaluate the outer product r×F\vec{r} \times \vec{F}. We do that for the three forces, starting on the left:

Γ1=Lx^×(Mg)z^=MLgx^×z^=MLg(y^)=MLgy^\vec{\Gamma}_1 = -L\hat{x} \times (-Mg)\hat{z} = MLg \, \hat{x} \times \hat{z} = MLg \left ( -\hat{y} \right ) = -MLg \, \hat{y}

We have used here, that the outer product of x^\hat{x} with z^\hat{z} is equal to y^-\hat{y} with y^\hat{y} the unit vector in the yy-direction pointing into the screen.

Similarly for the force coming from the small mass mm on the right side:

Γ2=lx^×(mg)z^=mlgx^×z^=mlgy^\vec{\Gamma}_2 = l\hat{x} \times (-mg)\hat{z} = -mlg \, \hat{x} \times \hat{z} = mlg \, \hat{y}

Finally, the torque from the force exerted by the pivot point:

Γ3=0x^×Fpz^=0\vec{\Gamma}_3 = 0\hat{x} \times F_p\hat{z} = 0

Next, we evaluate the total torque:

Γ1+Γ2+Γ3=(mlgMLg)y^\vec{\Gamma}_1 + \vec{\Gamma}_2 +\vec{\Gamma}_3 = ( mlg - MLg )\hat{y}

In order for the seesaw not to start rotating, we must have that the torque is zero and thus:

Γi=0mlg=MLgmM=Ll\sum \vec{\Gamma}_i = 0 \Rightarrow mlg = MLg \rightarrow \frac{m}{M} = \frac{L}{l}

A result we expected: the greater mass should be closer to the pivot point.

4.1.2Different origin

So far, so good. But what if we had chosen the origin not at the pivot point, but somewhere to the left? Then all ‘arm’ will change length. And all torques will be different. Wouldn’t that make Γi0\sum \vec{\Gamma}_i \neq 0?
No, it wouldn’t! Let’s just do it and recalculate. In the figure below, we have moved the origin to the left end of the seesaw. The distance from the heavy mass to the origin is Λ\Lambda (green arrow).

Free-body diagram with the origin located at the seesaw’s end.

Figure 5:Free-body diagram with the origin located at the seesaw’s end.

We still have that the sum of forces is zero. But what about the sum of torques? Obviously, the choice of the origin can not affect the seesaw: it is still in balance, regardless of our choice of the origin. Let’s see if that is correct:

Γi=Λx^×Mgz^+(Λ+L)×Fpz^+(Λ+L+l)x^×mgz^\sum \vec{\Gamma}_i = \Lambda \hat{x} \times -Mg\hat{z} + \left ( \Lambda + L \right ) \times F_p\hat{z} + \left ( \Lambda + L + l \right )\hat{x} \times -mg\hat{z}

We have drawn the three unit vectors x^,y^,z^\hat{x}, \hat{y}, \hat{z} in the figure. We will use again: x^×z^=y^\hat{x} \times \hat{z} = -\hat{y}. This simplifies the torque equation above to:

Γi=[MgΛ(Λ+L)Fp+mg(Λ+L+l)]y^\sum \vec{\Gamma}_i = \left [ Mg\Lambda - (\Lambda + L ) F_p + mg( \Lambda + L + l ) \right ]\hat{y}

This is clearly more complicated than the expression we had with the first choice of the origin. Moreover, the force from the pivot point shows up in our expression.

Luckily, we have equilibrium. Hence: FpMgmg=0Fp=Mg+mgF_p - Mg - mg = 0 \Rightarrow F_p = Mg + mg. We substitute this into our torque equation:

Γi=[MgΛ(Λ+L)(Mg+mg))+mg(Λ+L+l)]y^=[Mg(Λ(Λ+L))+mg((Λ+L)+Λ+L+l)]y^=[MgL+mgl]y^\begin{split} \sum \vec{\Gamma}_i &= \left [ Mg\Lambda - (\Lambda + L ) (Mg + mg)) + mg( \Lambda + L + l ) \right ] \hat{y} \\ &= \left [ Mg(\Lambda - (\Lambda + L )) + mg( - (\Lambda + L ) + \Lambda + L + l ) \right ]\hat{y}\\ &= \left [ -MgL + mgl \right ]\hat{y} \end{split}

Which is exactly the same expression as we found before. So, indeed, the choice of the origin doesn’t matter.

Conclusion

For equilibrium we need that the sum of torques is zero:

iΓi=0\sum_i \vec{\Gamma}_i = 0

4.2Angular Momentum

From our seesaw example we learn: the seesaw can only be in equilibrium if the sum of torques is zero. What if this sum is non-zero? That is, a net torque operates on the seesaw.

We know that the seesaw will rotate and in order to balance it, we have to shift at least one of the masses.

In which direction will it rotate?

Before answering: first we need to think about direction of rotation. Does it have direction and if so: how do we make clear what that is?

Again the seesaw will give guidance. Suppose we remove the smaller mass all together. Then, it is obvious: the ‘heavy’ left side will rotate to the ground and the light right side upwards. From the point of view we are standing: the seesaw will rotate counter clockwise.

We will use the corkscrew rule or right hand rule to give that a direction: rotate a corkscrew clockwise and the screw will move into the cork away from you; rotate a corkscrew counter clockwise and it will move out of the cork, towards you. Of course, in stead of a corkscrew you can think of a screwdriver or a water tap: closing is rotating 'clock wise, opening the tap is counter clockwise.

The rotation is given by the black arrow. You can find it by using the corkscrew rule: rotating a corkscrew as the blue arrow indicates gives that the screw moves forward like the black arrow.

Figure 6:The rotation is given by the black arrow. You can find it by using the corkscrew rule: rotating a corkscrew as the blue arrow indicates gives that the screw moves forward like the black arrow.

With this, we can define the direction of rotation better than via clock or counter clock wise. In our seesaw example, we will say: if the seesaw rotates clockwise, its rotation is described by a vector that points in the positive yy-direction as given in the figure, that is pointing into the paper (or screen).

Now, we can couple this to the direction of the torque. We saw -taking the origin at the pivot point- two torques acting on the seesaw. The large mass has its torque pointing in the negative yy-direction: it points out of the screen/paper. And this torque tries to rotate the seesaw counter clockwise. On the other hand, the small mass has a torque pointing in the positive yy-direction which is in line with it trying to rotate the seesaw clockwise.
Which of the two is ‘strongest’ determines the direction of rotation: if MgL>mglMgL > mgl then the net torque is in the minus-yy direction. That is, the torque of the larger mass is more negative than the smaller one is positive: MgL+mgl<0-MgL + mgl <0 and the net torque points towards us.

The quantity that goes with this, is the angular momentum. It is defined as

Note that it is a cross product as well. Hence it is a vector itself. Further note that r×pp×r\vec{r} \times \vec{p} \neq \vec{p} \times \vec{r}. The order matters! First r\vec{r} then p\vec{p}. If you do it the other way around, you unwillingly have introduced a minus sign that should not be there.
Furthermore, note that since lr×p\vec{l} \equiv \vec{r} \times \vec{p}, l\vec{l} is perpendicular to the plane formed by r\vec{r} and p\vec{p}.

Angular momentum of a particle at a certain position with momentum.

Figure 7:Angular momentum of a particle at a certain position with momentum.

4.2.1Torque & Analogy to N2

Angular momentum obeys a variation of Newton’s second law that ties it directly to torque.

l=r×p\vec{l} = \vec{r} \times \vec{p} \Rightarrow
dldt=d(r×p)dt=drdt=v×p=0 since v//p+r×dpdt N2: =F=r×F\frac{d \vec{l}}{dt} = \frac{d (\vec{r} \times \vec{p} )}{dt} = \underbrace{\underbrace{\frac{d \vec{r}}{dt}}_{=\vec{v}} \times \vec{p} }_{=0 \text{ since } \vec{v} // \vec{p}}+ \vec{r} \times \underbrace{\frac{d\vec{p}}{dt}}_{\text{ N2: }= \vec{F} } = \vec{r} \times \vec{F}

Thus, we find a general law for the angular momentum:

Again, note that the right hand side is a cross product, so the order does matter.

With the torque denoted by Γ\vec{\Gamma}, we have

Γr×F\vec{\Gamma}\equiv \vec{r} \times \vec{F}

then we can write down an equation similar to N2 (p˙=F)(\dot{\vec{p}}=\vec{F}) but now for angular motion

l˙=Γ\dot{\vec{l}}=\vec{\Gamma}

where the force is replaced by the torque and the linear momentum by the angular momentum.

NB: Note that the torque and angular moment change if we choose a different origin as this changes the value of r\vec{r}.

It is a common mistake to identify angular momentum with rotational motion. That is not correct. A particle that travels in a straight line will, in general, also have a non-zero angular momentum, see Figure 11. Here we look at a free particle: there are no forces working on it. So it travels in a straight line, with constant momentum.

Angular momentum of a free particle.

Figure 11:Angular momentum of a free particle.

However, the particle position does change over time. So: is its angular momentum constant or not?
That is easy to find out. We could take ‘N2’ for angular momentum:

dldt=r×F=0 free particle=0l=const\frac{d \vec{l}}{dt} = \vec{r} \times \underbrace{\vec{F}}_{\substack{=0 \\ \text{ free particle} }} = 0 \Rightarrow \vec{l} = \text{const}

Clearly, the angular momentum of a free particle is constant. Moreover, the momentum of a free particle is also constant. But what about the position vector: isn’t that changing over time and eventually becomes very, very long? Why does that not change r×p\vec{r} \times \vec{p}?
Take a look at Figure 12. We have chosen the xyxy-plane such that both r\vec{r} and p\vec{p} are in it. Furthermore, we have taken it such that p\vec{p} is parallel to the xx-axis.

Angular momentum of a free particle is constant.

Figure 12:Angular momentum of a free particle is constant.

At some point in time, the particle is at position r1\vec{r}_1. Its angular momentum is perpendicular to the xyxy-plane and has magnitude r1×p=rp|| \vec{r}_1 \times \vec{p} || = r_\perp p. Later in time it is at position r2\vec{r}_2. Still, its angular momentum is perpendicular to the xyxy-plane and has magnitude r2×p=rp|| \vec{r}_2 \times \vec{p} || = r_\perp p, indeed identical to the earlier value. This shows that indeed the angular momentum of a free particle is constant.

4.3Examples

4.4Exercises

Solution to Exercise 1

The particle falls under a force that points in the negaive yy-direction. As a consequence, it will start moving vertically downwards:

x: mdvxdt=0vx=C1=0y: mdvydt=mavy=at+C2=at\begin{split} \text{x: } \hspace{1cm} &m\frac{dv_x}{dt} = 0 \rightarrow v_x = C_1 = 0\\ \text{y: } \hspace{1cm} &m\frac{dv_y}{dt} = -ma \rightarrow v_y = -a t + C_2 = -at \end{split}

Thus, we find for the momentum om the particle: p=(0,mat)\vec{p} = \left (0, -mat \right ).

The position of mm follows from:

x: dxdt=vx=0x(t)=C3=x0y: dydt=vy=aty(t)=12at2+C4=H12at2\begin{split} \text{x: } \hspace{1cm} \frac{dx}{dt} &= v_x = 0 \rightarrow x(t) = C_3 = x_0 \\ \text{y: } \hspace{1cm} \frac{dy}{dt} &= v_y = -at \rightarrow y(t) = -\frac{1}{2}at^2 + C_4 = H -\frac{1}{2}at^2 \end{split}

We can now compute the angular momentum:

l=r×p=(x0x^+(H12at2)y^)×(maty^)=mx0atx^×y^=z^+x0(H12at2)y^×y^=0=mx0atz^\begin{split} \vec{l} &= \vec{r} \times \vec{p} \\ &=\left ( x_0 \hat{x} + (H -\frac{1}{2}at^2 ) \hat{y} \right ) \times \left ( -mat\hat{y}\right ) \\ &= -mx_0at \underbrace{\hat{x} \times \hat{y}}_{=\hat{z}} + x_0 (H -\frac{1}{2}at^2 )\underbrace{\hat{y} \times \hat{y}}_{=0} \\ &= -mx_0 a t \, \hat{z} \end{split}

Thus, the angular momentum is pointing in the negative zz-direction and increases linearly with time in magnitude.

We could have tried to find this via the variation of N2 for angular momentum. Now, we need to compute the torque on mm and solve dldt=Γ\frac{d\vec{l}}{dt} = \vec{\Gamma}. This goes as follows:

Γ=r×F=(xx^+yy^)×may^=maxz^\begin{split} \vec{\Gamma} &= \vec{r} \times \vec{F} \\ &= \left ( x\hat{x} + y\hat{y}\right ) \times -ma\hat{y} \\ &= -ma\,x\hat{z} \end{split}

And thus:

dldt=maxz^\frac{d\vec{l}}{dt} = -ma\,x\hat{z}

To get any further, we need information about x(t)x(t). From the xx-component of N2 we know (see above): x(t)=x0x(t) = x_0. If we plug this in, we get:

dldt=max0z^l=mx0at+C5=mx0at\frac{d\vec{l}}{dt} = -ma\,x_0 \hat{z} \rightarrow \vec{l} = -mx_0at + C_5 = -mx_0at

where we have used: t=0p=0l=0C5=0t=0 \rightarrow \vec{p}=0 \rightarrow \vec{l}=0 \Rightarrow C_5 = 0

Solution to Exercise 2

We can follow the same procedure as in exercise (6.1). But now, the outcome of the xx-component of N2 changes.

x: mdvxdt=0vx=C1=v0y: mdvydt=mavy=at+C2=at\begin{split} \text{x: } \hspace{1cm} &m\frac{dv_x}{dt} = 0 \rightarrow v_x = C_1 = v_0\\ \text{y: } \hspace{1cm} &m\frac{dv_y}{dt} = -ma \rightarrow v_y = -a t + C_2 = -at \end{split}

Thus, we find for the momentum om the particle: p=(mv0,mat)\vec{p} = \left (mv_0, -mat \right ).

The position of mm follows from:

x: dxdt=vx=v0x(t)=v0t+C3=x0+v0ty: dydt=vy=aty(t)=12at2+C4=H12at2\begin{split} \text{x: } \hspace{1cm} \frac{dx}{dt} &= v_x = v_0 \rightarrow x(t) = v_0 t + C_3 = x_0 + v_0 t\\ \text{y: } \hspace{1cm} \frac{dy}{dt} &= v_y = -at \rightarrow y(t) = -\frac{1}{2}at^2 + C_4 = H -\frac{1}{2}at^2 \end{split}

We can now compute the angular momentum:

l=r×p=((x0+v0t)x^+(H12at2)y^)×(mv0x^maty^)=m(x0+v0t)atx^×y^=z^+(H12at2)mv0y^×x^=z^=m(Hv0+x0at+12v0at2 )z^\begin{split} \vec{l} &= \vec{r} \times \vec{p} \\ &=\left ( (x_0 + v_0 t) \hat{x} + (H -\frac{1}{2}at^2 ) \hat{y} \right ) \times \left ( mv_0 \hat{x} - mat\hat{y}\right ) \\ &= -m(x_0 + v_0 t)at \underbrace{\hat{x} \times \hat{y}}_{=\hat{z}} + (H -\frac{1}{2}at^2 )mv_0 \underbrace{\hat{y} \times \hat{x}}_{=\hat{-z}} \\ &= -m\left ( Hv_0 + x_0 a t + \frac{1}{2}v_0 a t^2\ \right ) \hat{z} \end{split}

Thus, the angular momentum still points in the negative zz-direction but increases quadratically with time in magnitude.

Solution to Exercise 3

We can take the situation of Exercise 1, but shift our origin such that at t=0x=0t=0 \rightarrow x=0. Now the particle will fall along the yy-axis. It has its momentum also in the yy-direction and consequently l=r×p=0\vec{l} = \vec{r} \times \vec{p} =0 and stays zero!

4.5Central Forces

We have looked at a specific class of forces: the conservative ones. Here we will inspect a second class, that is very useful to identify: the central forces.

A force is called a central force if:

F=F(r)r^\vec{F} = |\vec{F}(\vec{r})| \hat{r}

In words: a force is central of it points always into the direction of the origin or exactly in the opposite direction. The reason to identify this class is in the consequences it has for the angular momentum.

Suppose, a particle of mass mm is subject to a central force. Then we can immediately infer that its angular momentum is a constant:

if F=F(r)r^ then dldt=r×F=F(r)r×r^=0\text{if } \vec{F} = |\vec{F}(\vec{r})| \hat{r} \text{ then } \frac{d\vec{l}}{dt} = \vec{r} \times \vec{F} = |\vec{F}(\vec{r})| \vec{r} \times \hat{r} = 0

where we have use that r\vec{r} and r^\hat{r} are always parallel so their cross-product is zero.

The above is rather trivial, but has a very important consequence: a particle that moves under the influence of a central force, moves with a constant angular momentum (vector!) and must move in a plane. It can not get out of that plane. Thus its motion is at maximum a 2-dimensional problem. We can always use a coordinate system, such that the motion of the particle is confined to only two of the three coordinates, e.g. we can choose our x,yx,y plane such that the particle moves in it and thus always has z(t)=0z(t) = 0.

Why is this so? Why does the fact that the angular momentum vector is a constant immediately imply that the particle motion is in a plane? The argumentation goes as follows.
Imagine a particle that moves under the influence of a central force. At some point in time it will have position r0\vec{r}_0 and momentum p0\vec{p}_0. Neither of them is zero. We will assume that r0\vec{r}_0 and p0\vec{p}_0 are not parallel (in general they will not be). Thus they define a plane. Due to the cross-product l0=r0×p0\vec{l}_0 = \vec{r}_0 \times \vec{p}_0 is perpendicular to this plane.
A little time later, say Δt\Delta t later, both position and momentum will have changed. Since the force is central, the force is also in the plane defined by the initial position and momentum. Thus the change of momentum is in that plane as well: p(t+Δt)=p(t)+FΔt\vec{p} (t + \Delta t) = \vec{p} (t) + \vec{F} \Delta t. The right hand side is completely in our plane. And thus, the new momentum is also in the plane. But that means that the velocity is also in the same plane. An thus the new position r(t+Δt)=r(t)+pmΔt\vec{r} (t + \Delta t) = \vec{r}(t) + \frac{\vec{p}}{m} \Delta t must be in the same plane as well. We can repeat this argument for the next time and thus see, that both momentum and position can not get out of the plane. This is, of course, fully in agreement with the fact that l=const\vec{l} = const for a central force.

4.6Central forces: conservative or not?

We can further restrict our class of central forces:

 if F(r)=f(r)r^ then F is central and conservative\text{ if } \vec{F}(\vec{r}) = f(r) \hat{r} \text{ then F is central and conservative}

In the above, F(r)=f(r)|\vec{F}(\vec{r})| = f(r), that is: the magnitude of the force only depends on the distance from the origin not on the direction. Rephrased: the force is spherically symmetric. If that is the case, the force is automatically conservative and a potential does exist.

Both the concept of central forces and potential energy play a pivotal role in understanding the motion of celestial bodies, like our earth revolving the sun. The planetary motion is an example of using the concept of central forces. It is, however, also an example in its own right. Using his new theory, Newton was able to prove that the motion of the earth around the sun is an ellipsoidal one. It helped changing the way we viewed the world from geo-centric to helio-centric.

4.6.1Keppler’s Laws

Before we embark at the problem of the earth moving under the influence of the sun’s gravity, we will go back in time a little bit.

Kepler has formulated three laws that describe features of the orbits of the planets around the sun.

  1. The orbit of a planet is an ellipse with the Sun at one of the two focal points.
  2. A line segment joining a planet and the Sun sweeps out equal areas during equal intervals of time (Law of Equal Areas).
  3. The square of a planet’s orbital period is proportional to the cube of the length of the semi-major axis of its orbit.
Kepler’s 2nd Law of Equal Area.

Figure 19:Kepler’s 2nd Law of Equal Area.

TA2RA3=TB2RB3=const.\frac{T_A^2}{R_A^3} = \frac{T_B^2}{R_B^3}= \text{const.}
Source
import numpy as np
import matplotlib.pyplot as plt
import matplotlib.animation as animation
from IPython.display import HTML

# Ellipse parameters
a = 5  # semi-major axis
b = 3  # semi-minor axis
c = np.sqrt(a**2 - b**2)
sun_x, sun_y = -c, 0

# Generate full orbit for reference
theta = np.linspace(0, 2*np.pi, 1000)
orbit_x = a * np.cos(theta)
orbit_y = b * np.sin(theta)

# Simplified approximation: divide into equal 'time' intervals
# (In reality we would use Kepler's Equation for true equal time intervals)
num_sectors = 12
angles = np.linspace(0, 2*np.pi, num_sectors+1)
planet_x = a * np.cos(angles)
planet_y = b * np.sin(angles)

# Set up figure
fig, ax = plt.subplots(figsize=(8,8))
ax.plot(orbit_x, orbit_y, label='Orbit')
ax.plot(sun_x, sun_y, 'yo', markersize=12, label='Sun')
planet, = ax.plot([], [], 'ro', markersize=8)
area_patch = plt.Polygon([[sun_x, sun_y]], closed=True, alpha=0.3, color='blue')
ax.add_patch(area_patch)
time_text = ax.text(0.05, 0.95, '', transform=ax.transAxes, fontsize=12)

ax.set_aspect('equal')
ax.set_xlim(-a-1, a+1)
ax.set_ylim(-b-1, b+1)
ax.set_title("Kepler's 2nd Law: Equal Areas in Equal Times")
ax.set_xlabel("X")
ax.set_ylabel("Y")
ax.legend()
ax.grid()

# Initialization function
def init():
    planet.set_data([], [])
    area_patch.set_xy([[sun_x, sun_y]])
    time_text.set_text('')
    return planet, area_patch, time_text

# Animation function
def update(frame):
    # Update planet position
    px = planet_x[frame]
    py = planet_y[frame]
    planet.set_data(px, py)

    # Draw current sector area
    sector_x = [sun_x, planet_x[frame], planet_x[frame+1], sun_x]
    sector_y = [sun_y, planet_y[frame], planet_y[frame+1], sun_y]
    coords = np.column_stack((sector_x, sector_y))
    area_patch.set_xy(coords)

    # Update timer
    time_text.set_text(f'Time = {frame+1}')

    return planet, area_patch, time_text

ani = animation.FuncAnimation(
    fig, update, frames=num_sectors,
    init_func=init, blit=True, interval=1000
)

HTML(ani.to_jshtml())
Loading...

It is important to realize, that Kepler came to his laws by -what we would now call- curve fitting. That is, he was looking for a generic description of the orbits of planets that would match the Brahe data. He abandoned the Copernicus idea of circles with epi-circles with the sun in the center of the orbit. Instead he arrived at ellipses with the sun out of the center, in one of the focal points of the ellipse.

But, there was no scientific theory backing this up. It is purely ‘data-fitting’. Nevertheless, it is a major step forward in the thinking about our universe and solar system. It radically changed from the idea that the universe is ‘eternal’, that is for ever the same and build up of circles and spheres: the mathematical objects with highest symmetry showing how perfect the creation of the universe is.

Kepler had formulated his laws by 1619 AD. It would take another 60 years before Isaac Newton showed that these laws are actually imbedded in his first principle approach: all that is needed is Newton’s second law and his Gravitational Law.

4.7Newton’s theory and Kepler’s Laws

The planets move under the influence of the gravitational force between them and the sun. We start with inspecting and classifying the force of gravity. Newton had formulated the Law of gravity: two objects of mass m1m_1 and m2m_2, respectively, exert a force on each other that is inversely proportional to the square of the distance between the two masses and is always attractive. In a mathematical equation, we can make this more precise:

Fg=Gm1m2r122r^12\vec{F}_g = - G \frac{m_1 m_2}{r_{12}^2} \hat{r}_{12}

In the figure below, the situation is sketched. We have chosen the origin somewhere and denote te position of the sun and the planet by r1\vec{r}_1 and r2\vec{r}_2. Gravity works along the vector r12=r2r1\vec{r}_{12} = \vec{r}_2 - \vec{r}_1. The corresponding unit vector is defined as r^12=r12r12\hat{r}_{12} = \frac{\vec{r}_{12}}{r_{12}}.

The sun and a planet.

Figure 20:The sun and a planet.

Newton realized that he could make a very good approximation. Given that the mass of the sun is much bigger than that of a planet, the acceleration of the sun due to the gravitational force of the planet on the sun is much less than the acceleration of the planet due to the sun’s gravity. For this, we only need Newton’s 3rd law:

Fg,sun on planet=Fg,planet on sunF_{g, sun \text{ } on\text{ } planet} = - F_{g, planet \text{ } on\text{ } sun}

Hence

msunasun=mplanetaplanetasun=mplanetmsunaplanetaplanetm_{sun} a_{sun} = -m_{planet} a_{planet} \rightarrow a_{sun} = \frac{m_{planet}}{m_{sun}} a_{planet} \ll a_{planet}

Newton concluded, that for all practical purposes, he could treat the sun as not moving. Next, he took the origin at the position of the sun. And from here on, we can ignore the sun and pretend that the planet feels a force given by

F(r)=GmMr2r^\vec{F}(\vec{r}) = - G \frac{mM}{r^2}\hat{r}

with MM the mass of the sun and mm that of the planet. rr is now the distance from the planet to the origin and r^\hat{r} the unit vector pointing from the origin to the planet.

First observation: The force is central!

First conclusion: Then the angular momentum of the planet is conserved (is a constant during the motion of the planet) and the motion is in a plane, i.e. we deal with a 2-dimensional problem!

Second Observation: The force is of the form F(r)=f(r)r^\vec{F}({\vec{r}}) = f(r) \hat{r}

Second conclusion: Thus, we do know that a potential energy can be associated with it. It is a conservative force. This also implies that the mechanical energy of the planet, that is the sum of kinetic en potential energy, is a constant over time. In other words, there is no frictional force and the motion can continue forever. This seems to be inline with our observation of the universe: the time scales are so large that friction must be small.

4.7.1Constant Angular Momentum: Equal Area Law

The first clue towards the Kepler Laws comes from angular momentum. Let’s consider the earth-sun system (ignoring all other planets in our solar system). As we saw above, gravity with the sun pinned in the origin, is a central force and thus

dldt=r×(GmMr2rr)=0\frac{d\vec{l}}{dt} = \vec{r} \times \left ( -G\frac{mM}{r^2} \frac{\vec{r}}{r} \right ) = 0

Thus, l=const.\vec{l} = \text{const.} both in length and in direction. From the latter, we can infer that the motion of the earth around the sun is in a plane. Hence, we deal with a 2-dimensional problem.

A free body diagram to help determine the area.

Figure 21:A free body diagram to help determine the area.

At some point in time, the earth is at location r\vec{r} (see red arrow in Figure 21). It has velocity v\vec{v}, given by the black arrow. In a small time interval dtdt, the earth will move a little and arrive at r+dr\vec{r} + d\vec{r} (the green arrow). As the time interval is very short, we can treat the velocity as a constant and thus write: dr=vdtd\vec{r} = \vec{v} dt.

Instead of concentrating on the motion of the earth, we focus on the area traced out by the earth orbit in the interval dtdt. That is the yellow area in the figure. This area is composed of two parts: the light yellow part and a smaller, bright yellow part. The light yellow part has an area A1=12height x baseA_1 = \frac{1}{2} \text{height x base}. If we make dtdt very small, the height is almost equal to rr and the base becomes vdtv_{\perp} dt and thus A112rvdtA_1 \approx \frac{1}{2} r v_\perp dt. For the smaller yellow triangle we have: A2=12dr x base12(v//dt)(vdt)=12v//vdt2A_2 = \frac{1}{2} dr \text{ x base} \approx \frac{1}{2} (v_{//}dt) \cdot (v_\perp dt) = \frac{1}{2}v_{//}v_\perp dt^2.

The total area traced out by the earth orbit during dtdt is thus in good approximation:

dA=A1+A2=12(rv+v//vdt)dtdA = A_1 + A_2 = \frac{1}{2} \left ( r v_\perp + v_{//}v_\perp dt \right ) dt

We divide both sides by dtdt and take the limit dt0dt \rightarrow 0:

dAdt=(12rv+12vv//dt)12rv\frac{dA}{dt} = ( \frac{1}{2} r v_\perp + \frac{1}{2}v_\perp v_//dt ) \rightarrow \frac{1}{2} r v_\perp

In stead of vv_\perp we can also write pm\frac{p_\perp}{m}:

dAdt=12mrp\frac{dA}{dt} = \frac{1}{2m} r p_\perp

But rpr p_\perp is the magnitude of r×p\vec{r} \times \vec{p}. And that is the magnitude of the angular momentum: l=r×p=rpl = || \vec{r} \times \vec{p} || = r p_\perp!!!

We know ll is constant, thus we have found:

dAdt=12mrp=l2m=constant\frac{dA}{dt} = \frac{1}{2m} r p_\perp = \frac{l}{2m} = \text{constant}

We can easily integrate this equation:

dAdt=l2mA(t)=l2mt+C\frac{dA}{dt} = \frac{l}{2m} \rightarrow A(t) = \frac{l}{2m}t + C

We can set the constant CC to zero at some point in time t0t_0 and start counting the increase of the swept area. But we immediately infer that if we check the swept area between tt and t+Δtt+\Delta t, this will be ΔA=l2mΔt\Delta A = \frac{l}{2m} \Delta t regardless of where the earth is in its orbit. In words: in equal time intervals, the earth sweeps an area that is the same for any position of the earth. We have established the Equal Area Law!

4.7.2Newton’s theory and Kepler’s Laws - part 2

We have:

  • The sun is replaced by a force field originating at the origin. This force field is a central force.
    1. Thus, the angular momentum is conserved.
    2. The orbit is in a plane: we deal with a 2-dimensional problem.
  • The force is conserved: a potential exists.

Based on these, we will derive Kepler’s laws only using Newtonian Mechanics. This is easiest in polar coordinates (r,ϕ)(r, \phi ). However, in this course we do not deal with these coordinates. We will thus give a coarse overview of the steps that should be taken.

The first thing we notice, is that the constant angular momentum provides a constraint on the relation between r\vec{r} and p\vec{p}. This constraint can be used to rewrite the kinetic energy Ekin=12mv2E_{kin} = \frac{1}{2} m v^2 into Ekin=12mr˙2+l22mr2E_{kin} = \frac{1}{2}m\dot{r}^2 + \frac{l^2}{2mr^2}.

What does this mean? The coordinate rr is the distance from the sun to the earth. Its time derivative (r˙=drdt=vr)(\dot{r} = \frac{dr}{dt} = v_r) is the velocity of the earth away from the sun. This is called the radial component of the velocity. Figure 22 illustrates this.

The coordinate r is the distance from the sun to the earth. Its time derivative (\dot{r} = \frac{dr}{dt} = v_r) is the velocity of the earth away from the sun.

Figure 22:The coordinate rr is the distance from the sun to the earth. Its time derivative (r˙=drdt=vr)(\dot{r} = \frac{dr}{dt} = v_r) is the velocity of the earth away from the sun.

It is important to realize that r˙\dot{r} tells us if we are moving such that we are getting closer to the sun or further away. But it does not tells us how we move ‘around’ the sun. For that we need the information of the component of the velocity perpendicular toe r\vec{r} (the other grey vector in the figure).

So, it seems that we are working with incomplete information. And in a sense we do. But it will turn out to be very useful to understand the physics of the earth’s orbit.

We already saw that in this case gravity is a conservative force. The potential energy is found by solving V(r)=rrefrFgdrV(r) = -\int_{r_{ref}}^r \vec{F}_g \cdot d\vec{r} . We can plug in Fg=GmMr2r^\vec{F}_g = - G \frac{mM}{r^2} \hat{r}. Thus only the radial coordinate is of importance in the inner product in the integral. Furthermore, we will use as reference boundary: \infty. Thus, the potential energy is:

V(r)=rrefrFgdr=GmMrdrr2=GmMr\begin{split} V(r) &= -\int_{r_{ref}}^r \vec{F}_g \cdot d\vec{r} \\ &= GmM \int_\infty^r \frac{dr}{r^2} \\ & = -G\frac{mM}{r} \end{split}

Thus, energy conservation can be written as:

12m(vx2+vy2)GmMr=E0=const\frac{1}{2} m (v_x^2 + v_y^2) - -G\frac{mM}{r} = E_0 = const

As expected: we have an equation with two unknowns (x(t),y(t))(x(t), y(t)). Once we solved the problem, we will thus have the coordinates of the planet’s trajectory as a function of time. However, we will not do that. Reason: it is complicated and we don’t need it! What we need is to find what kind of figure the trajectory is.

Our first step is to bring the number of unknowns in the energy equation down from two to one. For that, we use Ekin=12mr˙2+l22mr2E_{kin} = \frac{1}{2}m\dot{r}^2 + \frac{l^2}{2mr^2}.

12r˙2+l22mr2GmMr=E0=const\frac{1}{2} \dot{r}^2 + \frac{l^2}{2mr^2} - -G\frac{mM}{r} = E_0 = const

Now we have an equation with only one unknown r(t)r(t).

We can interpret this in a different way: the second term, with the angular momentum, originates from kinetic energy, but now looks like a potential energy. And that is exactly what we are going to do: treat it as a potential energy.

Hence, we can first inspect the global features of our energy equation. Notice that the gravity potential energy is an increasing function of the distance from the planet to the sun (located and fixed in the origin). This shows that the underlying force attractive is. The new part, coming from angular momentum, on the other hand is a decreasing function of distance. Thus, the related force is repelling.

We can make a drawing of the energy. See Figure 23.

Source
import numpy as np
import matplotlib.pyplot as plt

G = 6.67430e-11  # gravitational constant
M = 1.989e30    # mass of the sun (kg)
m = 5.972e24    # mass of the Earth (kg)
l = 2.65E40     # angular momentum (kg m^2/s)
r = np.linspace(.5e11, 2.5e11, int(1e5))  # distance from the sun to Earth (m)


U1 = l**2/(2*m*r**2) 
U2 = -G*M*m/r
Ut = U1 + U2

plt.figure(figsize=(10, 6))
plt.xlabel('Distance from Sun (m)')
plt.ylabel('Energy (Joules)')

plt.plot(r, Ut, label='Total Energy', color='black')
plt.plot(r, U1, label='Angular momentum', color='red')
plt.plot(r, U2, label='gravity', color='blue')
plt.grid()
plt.legend()
plt.savefig('../images/KeplerEnergy', dpi=300)
plt.show()
Output
<Figure size 1000x600 with 1 Axes>
Energies related to our planet, with a minimum around 1.5e11m.

Figure 23:Energies related to our planet, with a minimum around 1.5e11m1.5e11m.

The blue line is the potential energy of gravity. The red one stems from the kinetic energy associated with the angular velocity. The black line is the sum of the two, a kind of effective potential:

Ueff=l22mr2GmMrU_{eff} = \frac{l^2}{2mr^2} - -G\frac{mM}{r}

We see, that the energy can not be just any value: the kinetic energy of our quasi-one-dimensional particle (12mr˙2\frac{1}{2}m\dot{r}^2) can not be negative and the total potential energy has, according to Figure 23 a clear minimum. The total energy can not be below this minimum. On the other hand: there is no maximum.

Case 1: Effective potential = minimal
Case 2: Effective potential < Total energy < 0$
Case 3: Total energy > 0$

Suppose, we would prepare the system such that its total energy was equal to the minimum of the black line, i.e. of the total potential energy. Then, of course, via the arguments we have given above this is only possible if the kinetic energy is zero.

Ekin+Ueff(r)=Ueff,minEkin=0E_{kin} + U_{eff}(r) = U_{eff, min} \Rightarrow E_{kin} = 0

This implies that r˙=0\dot{r} = 0:

Ekin=12mr˙2=0r˙=0E_{kin} = \frac{1}{2} m \dot{r}^2 = 0 \rightarrow \dot{r} = 0

At first glance, this seems strange: r˙=0\dot{r} = 0 suggests that the earth doesn’t move, it has zero velocity. That would indeed be strange: after all we are dealing here with a planet that is attracted via gravity towards the sun. How can it possible have zero velocity?

We are about to make a mistake: r˙=0\dot{r} = 0 doesn’t mean that the velocity is zero. It means that r(t)=constr(t) = const. The planet still has a velocity perpendicular to its position vector r\vec{r}. Earlier we found: l=mrv=constl = mrv_\perp = const. We now have, since

r˙=0r=r0=const,l=mr0v=constv=lmr0=cst\dot{r}=0 \rightarrow r=r_0=const, l = mr_0 v_\perp =const \rightarrow v_\perp = \frac{l}{m_{r_0}}= cst

Thus, if a planet orbits its sun such that its (pseudo-)potential Ueff=minimumU_{eff} = minimum, then its orbit is a circle of radius r0r_0 that corresponds to the minimum in UeffU_{eff} and the planet has a velocity that is constant in magnitude v=lmr0v = \frac{l}{mr_0}.

4.7.3Ellipsoidal orbits

We are left with the task of showing that planets ‘circle’ the sun in an ellipse. From the above, we now know that this must mean that the total energy is smaller than zero: E<0E<0. We will not go over the details of the derivation, but leave that for another course.

The outcome of the analysis would be the following expression for the orbit in case of an ellipse:

(x+ea)2a2+y2b2=1\frac{(x+ea)^2}{a^2} + \frac{y^2}{b^2} = 1
Ellips in Cartesian coordinates.

Figure 26:Ellips in Cartesian coordinates.

This is an ellipse with semi major and minor-axis aa and bb, respectively. The center of the ellipse is located at (ea,0)(-ea,0). Note that the sun is in the origin and that seen from the center of the ellipse, the origin is at one of the focal points of the ellipse. Consequently, the orbit is not symmetric as viewed from the sun. We notice this on earth: the summer and winter (when the sun is closest respectively furthest from the sun) are not symmetric, even if we take the tilted axis of the earth into account.

The half and short long axis are given by:

a=α1e2=GMm2Ea = \frac{\alpha}{1-e^2} = \frac{GMm}{2|E|}
b=aα=l22mEb = a\alpha = \frac{l^2}{2m|E|}

with

e=1+2El2(GMm)2me = \sqrt{1 + \frac{2El^2}{(GMm)^2m}}

and

αl22GMm2\alpha \equiv \frac{l^2}{2GMm^2}

This type of curve is know as the conic sections. That is, they can be found by intersecting a cone with a plane. See the animation below, where a plane is at various positions and at various angles intesecting a cone.

Conic sections   animation created by Sara van der Werf

Figure 27:Conic sections animation created by Sara van der Werf

Note that in the definition of ee, the total energy of the system plays a role. This energy can be negative (see Figure 23). The minimum value of the effective potential energy is easily computed. It is Ueff,min=12(GmM)2ml2U_{eff, min} = -\frac{1}{2} \frac{(GmM)^2m}{l^2} and is realized when the planet is at a distance r=l2GMm2r = \frac{l^2}{GMm^2}. For this case we have e=0e = 0 and the planet is moving in a circle around the sun, as we already argued above.

For 0e<10 \leq e < 1 the orbit is an ellipse as Kepler already had postulated (for these values of ee the orbit is a closed one).

For e=1e=1, the orbit is a parabola: the object will eventually move to infinity where it has exactly zero radial velocity.

Finally, for e>1e > 1 the trajectory is a hyperbola with the planet again moving to infinity.

Conclusion: according to Newton’s laws of mechanics, combined with the Gravitation force proposed by Newton, planets must move in ellipses around their star.

This holds for our solar system, but for any other star with planets as well. Research has shown that there are hundreds of solar systems out in the universe with thousands of planets moving around their star. See e.g. https://exoplanets.nasa.gov/

4.7.4Kepler 3

We are left with proving Kepler’s third law:

TA2RA3=TB2RB3=const\frac{T_A^2}{R_A^3} = \frac{T_B^2}{R_B^3}= const

Now that we know the orbit, this is not difficult. We concentrate on the motion during one lapse (one ‘year’). From Keppler’s 1st^\text{st} law we know that the area a planet sweeps out of its ellipse is given by

A(t)=l2mt+CA(t) = \frac{l}{2m}t + C

where CC is an integration constant. Furthermore, this way of writing makes that the area swept keeps increasing: after one round along the ellipse, we simply keep counting.

However, we can easily back out what happens after exactly one round, or one ‘year’. The total area swept is then, of course, the area of the ellipse itself, that is: in one year (time TT) the area swept is πab\pi a b. Hence we conclude:

A(T)=πabπab=l2mTA(T) = \pi a b \Rightarrow \pi ab = \frac{l}{2m}T

If we put back what we found for aa and bb, we get

T2a3=4π2GM\frac{T^2}{a^3} = \frac{4\pi^2}{GM}

Thus, indeed Kepler was right. Moreover, we note that the constant is only depending on the mass of the sun. The same law will hold for other solar systems, but with a different constant.

In Figure 28 Kepler’s third law is shown for our solar system. The red data points are based on the measured ‘year’ of each planet and the distance to the sun. The blue line is the prediction from Newton’s theory.

Kepler 3 for our solar system.

Figure 28:Kepler 3 for our solar system.

4.8Speed of the planets & dark matter

Starting from Kepler 3, we can compute the orbital speed of a planet around the sun

T2=4π2GMa3ω2=GMa3,T=2πω,ω=vr,arv=GMr\begin{split} T^2 &= \frac{4\pi^2}{GM}a^3\\ \omega^2 &= \frac{GM}{a^3}, \quad T=\frac{2\pi}{\omega}, \omega=\frac{v}{r}, a \approx r \\ \Rightarrow v &= \sqrt{\frac{GM}{r}} \end{split}

Indeed if we measure the speed of the planets in the solar system this prediction holds, the velocity drops with the distance from the sun as r1/2\propto r^{-1/2} (see figure). As MM we use the mass of the sun here.

From LibreTexts Physics, licensed under CC BY-NC-SA 4.0.

Figure 30:From LibreTexts Physics, licensed under CC BY-NC-SA 4.0.

The distance is measured in Astronomical Units [AU], the distance from the earth to the sun (about 8.3 light minutes). Note that the earth is moving with an unbelievable 30 km/s, that is 105 km/h! Do you notice any of that? We will use this motion later with the Michelson-Morley experiment.

If we plot the same speed versus distance curve not for the planets in our solar system, but for stars orbiting the center of our galaxy, the milky way, then the picture looks very different. The far away stars orbit at a much higher speed than expected and the form of the found curve does not match r1/2\propto r^{-1/2}.

From Wikimedia Commons, licensed under CC-SA 3.0.

Figure 31:From Wikimedia Commons, licensed under CC-SA 3.0.

This mismatch is not understood to this day! The mass MM here is calculated from the visible stars and the supermassive black holes at the center of the galaxy. But even if the mass is calculated wrongly, the shape of the dependency does not match. It turns out, this mismatch is observed in all galaxies! Apparently the law of gravity does not hold for large distances or there must be extra mass that increases the speed that we do not see. This mismatch has lead to the postulation of dark matter and an alternative formulation for the laws of gravity. This is the most disturbing problem in physics today; second is probably the interpretation of measurement in quantum mechanics (collapse of the wave function/Kopenhagen interpretation of Quantum Mechanics; multiverse theories).

The majority of all matter in the universe is believed to be dark. And we have no clue what it could be! Most scientist even think it must be non-baryonic, that is, other stuff than our well-known protons or neutrons. It remains most confusing.

The usual distance unit for distances in astronomy outside the solar system is not light years (ly), but parsec [pc], or kpc, or Mpc. One parsec is about 3.3 ly (or 1013 km). Note: stars visible to the eye are typically not more than a few hundred parsec away. The Milky Way is perfectly visible to the naked eye as a band/stripe of “milk” sprayed over the night sky. But you cannot see it anywhere close to Delft, there is much too much light from cities and greenhouses. Go to Scandinavia in the winter (“wintergatan”) or any place remote where there are few people. The reason you see a “band” in the night sky, is that the Milky Way is a spiral galaxy, sort of pancake shaped, and you see the band in the direction of the pancake.