Skip to article frontmatterSkip to article content

3Work & Energy

3.1Work

Work and energy are two important concepts. Work is the transfer of energy that occurs when a force is applied to an object and causes displacement in the direction of that force, calculated as ‘force times path’. However, we need a formal definition:

if a point particle moves from r \vec{r} to r+dr \vec{r} + d\vec{r} and during this interval a force F\vec{F} acts on the particle, then this force has performed an amount of work equal to:

dW=FdrdW = \vec{F} \cdot d\vec{r}
Path of a particle.

Figure 1:Path of a particle.

Note that this an inner product between two vectors, resulting in a scalar . In other words, work is a number, not a vector. It has no direction. That is one of the advantages over force.

dW=Fdr=Fxdx+Fydy+FzdzdW = \vec{F} \cdot d\vec{r} = F_x dx + F_y dy + F_z dz

Work done on mm by FF during motion from 1 to 2 over a prescribed trajectory:

W12=12FdrW_{12} = \int_1^2 \vec{F} \cdot d\vec{r}

Keep in mind: in general the work depends on the starting point 1, the end point 2 and on the trajectory. Different trajectories from 1 to 2 may lead to different amounts of work.

3.2Kinetic Energy

Kinetic energy is defined and derived using the definition of work and Newton’s 2nd 2^{nd} Law.

The following holds: if work is done on a particle, then its kinetic energy must change. And vice versa: if the kinetic energy of an object changes, then work must have been done on that particle. The following derivation shows this.

W12=12Fdr=12Fdrdtdt=12Fvdt=12mdvdtvdt=m12vdv=m[12v2]12=12mv2212mv12W_{12} = \int_1^2 \vec{F} \cdot d\vec{r} = \int_1^2 \vec{F} \cdot \frac{d\vec{r}}{dt}dt = \int_1^2 \vec{F} \cdot \vec{v} dt \\ = \int_1^2 m\frac{d\vec{v}}{dt} \cdot \vec{v} dt = m\int_1^2 \vec{v} \cdot d\vec{v} =m [\frac{1}{2}\vec{v}^2]_1^2 \\ = \frac{1}{2}m\vec{v}_2^2-\frac{1}{2}m\vec{v}_1^2

It is from the above that we indicate 12mv2\frac{1}{2}m\vec{v}^2 as kinetic energy. It is important to realize that the concept of kinetic energy does not bring anything that is not contained in N2 to the table. But it does give a new perspective: kinetic energy can only be gained or lost if a force performs work on the particle. And vice versa: if a force performs work on a particle, the particle will change its kinetic energy.

Obviously, if more than one force acts, the net work done on the particle determines the change in kinetic energy. It is perfectly possible that force 1 adds an amount W W to the particle, whereas at the same time force 2 will take out an amount W -W . This is the case for a particle that moves under the influence of two forces that cancel each other: F1=F2 \vec{F}_1 = -\vec{F}_2 . From Newton 2, we immediately infer that if the two forces cancel each other, then the particle will move with a constant velocity. Hence, its kinetic energy stays constant. This is completely in line with the fact that in this case the net work done on the particle is zero:

W1+W2=12F1dr+12F2dr=12F1dr12F1dr=0W_1 + W_2 = \int_1^2 \vec{F}_1 \cdot d\vec{r} + \int_1^2 \vec{F}_2 \cdot d\vec{r} = \int_1^2 \vec{F}_1 \cdot d\vec{r} - \int_1^2 \vec{F}_1 \cdot d\vec{r} = 0

3.3Worked Examples

Solution to Exercise 1

The answer is, of course, zero! That is because the path (from Delft to Rotterdam) is perpendicular to the gravitational force. Therefore the inner product Fdr=0\vec{F}\cdot d\vec{r}=0 over the whole way. Let us look at it more formally, this will help us when things get more complicated later.

The force is F(x,y,z)=(0,0,mg)=mgz^\vec{F}(x,y,z)=(0,0,-mg)=-mg\hat{z} and we choose our coordinate system such that the path be along the xx-axis, the yy-coordinate is zero and we the backpack is at height z=1z=1 m.

W=DelftRottFxdx+Fydy+Fzdz=Fxdxy=0,z=1=0dx=0W = \int_{Delft}^{Rott} F_x dx + F_y dy + F_z dz = \int F_x dx|_{y=0,z=1} = \int 0\, dx = 0
Solution to Exercise 2
W=x1x2Fdx=0xkxdx=12kx2W = \int_{x_1}^{x_2} F \mathrm{d}x = \int_{0}^{x} kx \mathrm{d}x = \frac{1}{2}kx^2
Solution to Exercise 3

We can split up the integral in these two parts as the direction in both parts is constant, therefore the inner product can be separated out.

W=(0,0)(1,0)Fdr+(1,0)(1,1)Fdr=(0,0)(1,0)Fxdxy=0+(1,0)(1,1)Fydyx=1=(x=0)(x=1)ydxy=0+(y=0)(y=1)x2dyx=1=yxx=0x=1y=0+x2yy=0y=1x=1=1\begin{array}{rcl} W &=& \int_{(0,0)}^{(1,0)} \vec{F}\cdot d\vec{r} + \int_{(1,0)}^{(1,1)} \vec{F}\cdot d\vec{r} \\ &=& \int_{(0,0)}^{(1,0)} F_x dx|_{y=0} + \int_{(1,0)}^{(1,1)} F_y dy|_{x=1}\\ &=& \int_{(x=0)}^{(x=1)} -y\,dx|_{y=0} + \int_{(y=0)}^{(y=1)} x^2\,dy|_{x=1}\\ &=& -yx|_{x=0}^{x=1}|_{y=0} + x^2y|_{y=0}^{y=1}|_{x=1} = 1 \end{array}

Try to integrate the force field yourself along a different path (0,0)(0,1)(1,1)(0,0)\to(0,1)\to(1,1) to the same end point.

W=y=0y=1Fydyx=0+x=0x=1Fxdxy=1=y=0y=1x2dyx=0+x=0x=1ydxy=1=1+0=1\begin{array}{rcl} W &=& \int_{y=0}^{y=1} F_y \,dy|_{x=0} + \int_{x=0}^{x=1} F_x\,dx|_{y=1} \\ &=& \int_{y=0}^{y=1} x^2\,dy|_{x=0} + \int_{x=0}^{x=1} -y\,dx|_{y=1} \\ &=& -1 +0 =-1 \end{array}

The work done is not the same over this path. This is already obvious from the graph showing the path and the force field: the second path clearly moves against the force, where the first is moving with direction of the force.

3.4Gravitational potential energy

Let’s consider an object close to the surface of any planet, where the acceleration due to gravity can be described by Fg=mgF_g=mg. Raising the object to a height HH requires us to do work:

W=0HFdx=0Hmgdx=mgHW = \int_0^H F dx = \int_0^H -m g dx = -m g H

Note that there is a minus sign, we have done work against the gravitational force. As energy is a conservative quantity, the object has the ‘gained’ some energy. We call this potential energy, more particular in this case gravitational potential energy.

When the object is released from that height HH, this gravitational potential energy is converted to kinetic energy. The gravitational force does work on the object:

W==H0Fdx=H0mgdx=mgH=ΔEkinW = = \int_H^0 F dx = \int_H^0 m g dx = m g H = \Delta E_{kin}

From this, it follow that the object will reach a velocity of v=2gHv=\sqrt{2gH}.

3.5Conservative force

Work done on mm by FF during motion from 1 to 2 over a prescribed trajectory, is defined as:

W12=12FdrW_{12} = \int_1^2 \vec{F} \cdot d\vec{r}

In general, the amount of work depends on the path followed. That is, the work done when going from r1 \vec{r}_1 to r2 \vec{r}_2 over the red path in the figure below, will be different when going from r1 \vec{r}_1 to r2 \vec{r}_2 over the blue path. Work depends on the specific trajectory followed.

Two different paths.

Figure 3:Two different paths.

However, there is a certain class of forces for which the path does not matter, only the start and end point do. These forces are called conservative forces. As a consequence, the work done by a conservative force over a closed path, i.e start and end are the same, is always zero. No matter which closed path is taken.

conservative force Fdr=0 for ALL closed paths\text{conservative force } \Leftrightarrow \oint \vec{F} \cdot d\vec{r} = 0 \text{ for }\textbf{ALL}\text{ closed paths}

3.5.1Stokes’ Theorem

It was George Stokes who proved an important theorem, that we will use to turn the concept of conservative forces into a new and important concept.

Sir George Stokes (1819-1903). From Wikimedia Commons, public domain.

Figure 4:Sir George Stokes (1819-1903). From Wikimedia Commons, public domain.

His theorem reads as:

Fdr=×Fdσ\oint \vec{F} \cdot d\vec{r} = \iint \vec{\nabla} \times \vec{F} \cdot d\vec{\sigma}

In words: the integral of the force over a closed path equals the surface integral of the curl of that force. The surface being ‘cut out’ by the close path. The term ×F\vec{\nabla} \times \vec{F} is called the curl of FF: it is a vector. The meaning of it and some words on the theorem are given below.

3.5.2Conservative force and ×F\vec{\nabla} \times \vec{F}

For a conservative force the integral over the closed path is zero for any closed path. Consequently, ×F=0 \vec{\nabla} \times \vec{F} = 0 everywhere. How do we know this? Suppose ×F0 \vec{\nabla} \times \vec{F} \neq 0 at some point in space. Then, since we deal with continuous differentiable vector fields, in the close vicinity of this point, it must also be non-zero. Without loss of generality, we can assume that in that region ×Fdσ>0 \vec{\nabla} \times \vec{F} \cdot d\vec{\sigma}> 0 . Next, we draw a closed curve around this point, in this region. We now calculate the Fdr\oint \vec{F} \cdot d\vec{r} along this curve. That is, we invoke Stokes Theorem. But we know that ×Fdσ>0 \vec{\nabla} \times \vec{F} \cdot d\vec{\sigma} > 0 on the surface formed by the closed curve. Consequently, the outcome of the surface integral is non-zero. But that is a contradiction as we started with a conservative force and thus the integral should have been zero.
The only way out, is that ×F=0 \vec{\nabla} \times \vec{F} = 0 everywhere.

Thus we have:

conservative force ×F=0 everywhere\text{conservative force } \Leftrightarrow \vec{\nabla} \times \vec{F} = 0 \text{ everywhere}

3.6Potential Energy

A direct consequence of the above is:

if ×F=0 \vec{\nabla} \times \vec{F} = 0 everywhere, a function V(r) V (\vec{r}) exists such that F=V \vec{F} = -\vec{\nabla}V

conservative force ×F=0 everywhereF=VV(r)=refFdr\text{conservative force } \Leftrightarrow \vec{\nabla} \times \vec{F} = 0 \text{ everywhere} \\ \big \Updownarrow \\ \vec{F} = -\vec{\nabla}V \Leftrightarrow V(\vec{r}) = -\int_{ref} \vec{F} \cdot d\vec{r}

where in the last integral, the lower limit is taken from some, self picked, reference point. The upper limit is the position r \vec{r} .

This function V is called the potential energy or the potential for short. It has a direct connection to work and kinetic energy.

Ekin,2Ekin,1=W12=12Fdr=V(r2)V(r1)E_{kin,2} - E_{kin,1} = W_{12} = \int_1^2 \vec{F} \cdot d\vec{r} = V(\vec{r}_2) - V(\vec{r}_1)

or rewritten:

Ekin,1+V(r1)=Ekin,2+V(r2)E_{kin,1} + V(\vec{r}_1) = E_{kin,2} + V(\vec{r}_2)

In words: for a conservative force, the sum of kinetic and potential energy stays constant.

3.6.1Energy versus Newton’s Second Law

We, starting from Newton’s Laws, arrived at an energy formulation for physical problems.
Question: can we also go back? That is: suppose we would start with formulating the energy rule for a physical problem, can we then back out the equation of motion?
Answer: yes, we can!

It goes as follows. Take a system that can be completely described by its kinetic plus potential energy. Then: take the time-derivative and simplify, we will do it for a 1-dimensional case first.

12mv2+V(x)=E0ddt[12mv2+V(x)]=dE0dt=0mvv˙+dVdxdxdt=v=0v(mv˙+dVdx)=0\begin{split} &\frac{1}{2}mv^2 + V(x) = E_0 \Rightarrow \\ &\frac{d}{dt} \left [ \frac{1}{2}mv^2 + V(x) \right ] = \frac{dE_0}{dt} = 0 \Rightarrow \\ &mv\dot{v} + \frac{dV}{dx}\underbrace{\frac{dx}{dt}}_{=v} = 0 \Rightarrow \\ &v \left (m\dot{v} + \frac{dV}{dx} \right ) = 0 \end{split}

The last equation must hold for all times and all circumstances. Thus, the term in brackets must be zero.

mv˙+dVdx=0mx¨=dVdx=Fm\dot{v} + \frac{dV}{dx} = 0 \Rightarrow m\ddot{x} = -\frac{dV}{dx} = F

And we have recovered Newton’s second law.

In 3 dimensions it is the same procedure. What is a bit more complicated, is using the chain rule. In the above 1-d case we used dVdt=dV(x(t))dt=dVdxdx(t)dt\frac{dV}{dt} = \frac{dV(x(t))}{dt} = \frac{dV}{dx}\frac{dx(t)}{dt}. In 3-d this becomes:

dVdt=dV(r(t))dt=dVdrdr(t)dt=Vv\frac{dV}{dt} = \frac{dV(\vec{r}(t))}{dt} = \frac{dV}{d\vec{r}}\cdot \frac{d\vec{r}(t)}{dt} = \vec{\nabla} V \cdot \vec{v}

Thus, if we repeat the derivation, we find:

12mv2+V(r)=E0ddt[12mv2+V(r)]=0mvv˙+Vv=0v(ma+V)=0ma=V=F\begin{split} &\frac{1}{2}mv^2 + V(\vec{r}) = E_0 \Rightarrow \\ &\frac{d}{dt} \left [ \frac{1}{2}mv^2 + V(\vec{r}) \right ] = 0 \Rightarrow \\ &m\vec{v} \cdot \dot{\vec{v}} + \vec{\nabla} V \cdot \vec{v} = 0 \Rightarrow \\ &v \left (m\vec{a} + \vec{\nabla} V \right ) = 0 \Rightarrow \\ &m\vec{a} = -\vec{\nabla} V = \vec{F} \end{split}

And we have recovered the 3-dimensional form of Newton’s second Law. This is a great result. It allows us to pick what we like: formulate a problem in terms of forces and momentum, i.e. Newton’s second law, or reason from energy considerations. It doesn’t matter: they are equivalent. It is a matter of taste, a matter of what do you see first, understand best, find easiest to start with. Up to you!

3.7Stable/Unstable Equilibrium

A particle (or system) is in equilibrium when the sum of forces acting on it is zero. Then, it will keep the same velocity, and we can easily find an inertial system in which the particle is at rest, at an equilibrium position.
The equilibrium position (or more general state) can also be found directly from the potential energy.

Potential energy and (conservative) forces are coupled via:

F=V\vec{F} = - \vec{\nabla}V

The equilibrium positions (iFi=0)\left ( \sum_i \vec{F}_i = 0 \right ) can be found by finding the extremes of the potential energy:

equilibrium position V=0\text{equilibrium position } \Leftrightarrow \vec{\nabla}V = 0

Once we find the equilibrium points, we can also quickly address their nature: is it a stable or unstable solution? That follows directly from inspecting the characteristics of the potential energy around the equilibrium points.

For a stable equilibrium, we require that a small push or a slight displacement will result in a force pushing back such that the equilibrium position is restored (apart from the inertia of the object that might cause an overshoot or oscillation).

However, an unstable equilibrium is one for which the slightest push or displacement will result in motion away from the equilibrium position.

The second derivative of the potential can be investigated to find the type of extremum. For 1D functions that is easy, for scalar valued functions of more variables that is a bit more complicated. Here we only look at the 1D case V(x):RRV(x): \mathbb{R} \rightarrow \mathbb{R}

equilibrium: V=0{stable: d2Vdx2>0unstable: d2Vdx2<0\text{equilibrium: } \vec{\nabla}V = 0 \begin{cases} \text{stable: } & \frac{d^2V}{dx^2} > 0 \\ \text{unstable: } & \frac{d^2V}{dx^2} < 0 \end{cases}

Luckily, the definition of potential energy is such that these rules are easy to visualize in 1D and remember, see fig.(?.?).

Stable and unstable position of a particle in a potential.

Figure 7:Stable and unstable position of a particle in a potential.

A valley is stable; a hill top is unstable.
NB: Now the choice of the minus sign in the definition of the potential is clear. Otherwise a hill would be stable, but that does not feel natural at all.

It is also easy to visualize what will happen if we distort that particle from the equilibrium state:

  • The valley, i.e., the stable system, will make the particle move back to the lowest point. Due to inertia, it will not stop but will continue to move. As the lowest position is one of zero force, the particle will 'climb' toward the other end of the valley and start an oscillatory motion.

  • The top, i.e., the unstable point, will make the particle move away from the stable point. The force acting on the particle is now pushing it outwards, down the slope of the hill.

3.7.1Taylor Series Expansion of the Potential

The Taylor expansion or Taylor series is a mathematical approximation of a function in the vicinity of a specific point. It uses an infinite series of polynomial terms with coefficients given by value of the derivative of the function at that specific point: the more terms you use, the better the approximation. If you use all terms, then it is exact. Mathematically, it reads for a 1D scalar function f:RRf: \mathbb{R} \rightarrow \mathbb{R}:

f(x)f(x0)+11!f(x0)(xx0)+12!f(x0)(xx0)2+13!f(x0)(xx0)3+...f(x) \approx f(x_0) + \frac{1}{1!} f'(x_0) (x-x_0 ) + \frac{1}{2!} f''(x_0) (x-x_0)^2 + \frac{1}{3!} f'''(x_0) (x-x_0)^3 + ...

For our purpose here, it suffices to stop after the second derivative term:

f(x)f(x0)+f(x0)(xx0)+12f(x0)(xx0)2+O(x3)f(x) \approx f(x_0) + f'(x_0) (x-x_0 ) + \frac{1}{2} f''(x_0) (x-x_0)^2 + \mathcal{O}(x^3)

A way of understanding why the Taylor series actually works is the following.
Imagine you have to explain to someone how a function looks around some point x0x_0, but you are not allowed to draw it. One way of passing on information about f(x)f(x) is to start by giving the value of f(x)f(x) at the point x0x_0:

f(x)f(x0)f(x) \approx f(x_0)

Next, you give how the tangent at x0x_0 is; you pass on the first derivative at x0x_0. The other person can now see a bit better how the function changes when moving away from x0x_0:

f(x)f(x0)+f(x0)(xx0)f(x) \approx f(x_0) + f'(x_0) (x-x_0 )

Then, you tell that the function is not a straight line but curved, and you give the second derivative. So now the other one can see how it deviates from a straight line:

f(x)f(x0)+11!f(x0)(xx0)+12!f(x0)(xx0)2f(x) \approx f(x_0) + \frac{1}{1!} f'(x_0) (x-x_0 ) + \frac{1}{2!} f''(x_0) (x-x_0)^2

Note that the prefactor is placed back. But the function is not necessarily a parabola; it will start deviating more and more as we move away from x0x_0. Hence we need to correct that by invoking the third derivative that tells us how fast this deviation is. And this process can continue on and on.

Important to note: if we stay close enough to x0x_0 the terms with the lowest order terms will always prevail as higher powers of (xx0)(x-x_0) tend to zero faster than a lower powers (for instance: 0.54<<0.520.5^4 << 0.5^2).

This 3Blue1Brown clip explains the 1D Taylor series nicely.

A 3blue1brown clip on Taylor series.

For scalar valued functions as our potentials V(r):R3RV(\vec{r}): \mathbb{R}^3 \rightarrow \mathbb{R} the extension of the Taylor series is not too difficult. If we expand the function around a point

V(r)V(r0)+V(r0)(rr0)+12(rr0)(2V)(r0)(rr0)+O(r3)\begin{split} V(\vec{r}) &\approx V(\vec{r}_0) + \vec{\nabla}V(\vec{r}_0) \cdot (\vec{r}-\vec{r}_0 ) \\ &+ \frac{1}{2} (\vec{r}-\vec{r}_0 ) \cdot (\partial^2 V)(\vec{r}_0) \cdot (\vec{r}-\vec{r}_0 ) + \mathcal{O}(r^3) \end{split}

The second derivative of the potential indicated by 2V\partial^2 V is the Hessian matrix.

Conceptually the extrema of the function are again the hills and valleys. The classification of the extrema has next to hills and valleys also saddle points etc. In this course we will not bother about these more dimensional cases, but only stick to simple ones.

Footnotes
  1. Exercise from Idema, T. (2023). Introduction to particle and continuum mechanics. Idema (2023)

References
  1. Idema, T. (2023). Introduction to particle and continuum mechanics. TU Delft OPEN Publishing. 10.59490/tb.81