S1: Kinematics
Summary
■ The equation for velocity may also be represented as $v = \frac{\Delta x}{\Delta t}$.
Cambridge Pre-U Physics
Learning Outcomes
■ use calculus to describe motion, with differentials corresponding to gradients of graphs and
integrals corresponding to areas under graphs
Later in your studies of maths and physics you will learn more about calculus (differentiation and
integration).
The two methods in calculus are differentiation, which is the equivalent when using graphs
of calculating the gradient of a function, and integration, which when using graphs involves
calculating the area under a line. We have seen already that gradients and areas are important in
calculating displacement, velocity and acceleration. So why is calculus needed?
Calculus can be applied directly to mathematical functions (equations) and this means we
do not need to plot graphs. Calculus can also give us calculated answers when the graph of a
function is curved and difficult to measure visually. For example, if we have a mathematical
function for the position of an object (such as $s = ut + \frac{1}{2}at^2$) we can differentiate
this function to get the velocity.
We write the function and the differentiated function as follows:
$$s = ut + \frac{1}{2}at^2$$
$$v = \frac{ds}{dt} = u + at$$
Here, ‘ds/dt’ means ‘the function of displacement s differentiated with respect to time t’.
Using calculus allows us to solve problems with complicated and non-uniform functions for
acceleration. Calculus is especially useful when making calculations involving friction or air
resistance, where the acceleration changes depending on the size and direction of the velocity.
Note that differentiation of velocity produces a function that is closely related to an expression
we have already seen for acceleration, $a = \frac{\Delta v}{\Delta t}$. For the differentiation of
velocity v to give us the acceleration a, we write $a = \frac{dv}{dt}$.
So what is the difference between $a = \frac{\Delta v}{\Delta t}$ and $a = \frac{dv}{dt}$?
Differentiation provides an instantaneous value for a gradient, for example a true
instantaneous acceleration, whereas using $\frac{\Delta v}{\Delta t}$ means we are calculating only
an approximation to the true instantaneous value.
The same applies to integration when compared to counting squares on a graph. When we
count squares under a graph, we are determining an approximate value. Integration of a
function gives us an exact, calculated answer instead. For example, integrating the function
of velocity, v = u + at, gives us a function for the displacement s, which is written as
$$s = \int v\,dt = \int (u + at)\,dt = ut + \frac{1}{2}at^2$$
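The contrast between approximate and exact values can be checked numerically. The sketch below is only an illustration: the values of u, a and t are chosen arbitrarily and do not come from the text. It compares the average velocity over a shrinking time interval with the differentiated function v = u + at, and compares the area under the velocity–time graph with s = ut + ½at².

```python
def displacement(t, u, a):
    """s = ut + (1/2)at^2 for constant acceleration."""
    return u * t + 0.5 * a * t**2


def velocity(t, u, a):
    """v = ds/dt = u + at."""
    return u + a * t


def average_velocity(t, dt, u, a):
    """Change in displacement divided by the time interval dt, starting at t."""
    return (displacement(t + dt, u, a) - displacement(t, u, a)) / dt


def area_under_velocity(t_end, steps, u, a):
    """Approximate the area under the v-t graph using thin trapezia."""
    dt = t_end / steps
    total = 0.0
    for i in range(steps):
        t0 = i * dt
        total += 0.5 * (velocity(t0, u, a) + velocity(t0 + dt, u, a)) * dt
    return total


# Illustrative values only (not taken from the text).
u, a, t = 2.0, 3.0, 4.0
for dt in (1.0, 0.1, 0.001):
    print(dt, average_velocity(t, dt, u, a), velocity(t, u, a))
print(area_under_velocity(t, 100, u, a), displacement(t, u, a))
```

As the interval dt shrinks, the average value approaches the instantaneous one. Because this velocity–time graph is a straight line, the trapezia reproduce the integral almost exactly; for a curved graph they would only approximate it.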
Summary
■ Calculus can be used to describe motion, with differentials corresponding to the
gradients of graphs and integrals corresponding to areas under graphs.
End-of-chapter questions
S2.1
A cheetah sets off in pursuit of its prey, an antelope. The antelope is initially 100 m
ahead of the cheetah and running at 20 m s−1. The cheetah accelerates at 15 m s−2 to its
top speed of 30 m s−1 and can keep that speed up for 10 s. Does the cheetah catch the
antelope, and if so, how long does it take?
Learning Outcomes
■ understand how to add three or more forces using a diagram
Figure S4.1 The three vectors a, b and c add up to a resultant R, which is the same no matter whether
we start with a, b or c.
If the vectors form a closed polygon, the resultant force is zero. This can occur in a
number of situations. For example:
• When an object is in equilibrium (not accelerating) because all the forces on it balance.
This follows from Newton’s second law: zero acceleration means zero resultant force.
• When only internal forces act within a system, and no external forces. Every internal
force is part of a pair. The other force of the pair is equal in magnitude and opposite in
direction, so every internal pair of forces cancels out. This leaves zero resultant and,
again, a closed polygon.
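Adding forces tip-to-tail on a diagram is equivalent to adding their components. The following sketch assumes two-dimensional forces written as (x, y) components in newtons; the particular values are hypothetical. It shows that the resultant does not depend on the order of addition, and that a closed polygon corresponds to zero resultant.

```python
import math


def resultant(forces):
    """Add 2-D force vectors given as (x, y) components in newtons."""
    rx = sum(f[0] for f in forces)
    ry = sum(f[1] for f in forces)
    return rx, ry


def is_closed_polygon(forces, tol=1e-9):
    """True when the vectors, drawn tip-to-tail, return to the start point."""
    rx, ry = resultant(forces)
    return math.hypot(rx, ry) < tol


# Hypothetical forces chosen for illustration.
a, b, c = (3.0, 0.0), (0.0, 4.0), (-3.0, -4.0)
d = (1.0, 1.0)

print(resultant([a, b, d]), resultant([d, b, a]))  # same resultant either way
print(is_closed_polygon([a, b, c]))                # forces balance
print(is_closed_polygon([a, b, d]))                # non-zero resultant
```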
Learning Outcomes
■ understand that frictional forces depend upon the surfaces, the normal force and whether the
surfaces are in motion
S4.2 Friction
In Worked Example 1 in Chapter 4 of the Coursebook we are told the frictional force, but it
is often possible to calculate it. Consider the following: a book is placed on a table and the
table is tilted upwards. At some angle the book begins to slide. Once the book has started
sliding, it will move at a constant acceleration down the slope. How can we explain this, and
calculate the forces that act?
Frictional forces depend on two things:
• the nature of the two surfaces in contact with each other
• the strength of the force normal to those surfaces.
Here, ‘normal’ means ‘at right angles to’. The force that acts at right angles to a surface
is often called the normal contact force. You can observe this easily – it is harder to make
an object slide over a surface if you push down firmly on the object while trying to slide it.
If you only push down lightly, the object is easier to slide. Pushing down hard produces a
larger normal contact force than pushing down lightly.
We can write an equation for the frictional force, F, that acts parallel to the surface in
contact in terms of the normal contact force, N:
F ≤ µN
where µ is a constant called the coefficient of friction. The value of μ depends on the
properties of the two surfaces that are in contact with each other. The coefficient of friction is
a number – it has no units because the frictional force and the normal contact force are both
measured in newtons. The most important part of this equation is the ‘≤’ symbol, which
means ‘less than or equal to’. The equation tells us that the frictional force between two
particular surfaces can increase to exactly balance any applied force up to a limit.
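Figure S4.2 A heavy book placed on a table. The forces shown are the normal contact force N = 20 N,
the weight W = 20 N, the applied horizontal force X and the frictional force F.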
In Figure S4.2, a heavy book of weight W = 20 N is placed on a table. The coefficient of
friction µ between the book and the table is 0.3. A horizontal force, X, is gradually increased
until the book starts to move.
The book is not accelerating vertically, so from Newton’s first law of motion we know that
the normal contact force N will be 20 N. As long as the applied horizontal force X is less than
µN = 0.3 × 20 = 6 N, then the frictional force will exactly balance the value of X. Once X is
greater than 6 N, F cannot be greater than 6 N and the horizontal forces become unbalanced.
Friction is a variable force, one that balances and opposes other forces –
up to a limit.
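The behaviour of the book in Figure S4.2 can be sketched as a short function. This is a simplified model that uses a single coefficient of friction, as in the discussion so far; the function names are our own.

```python
def friction_force(applied, mu, normal):
    """Friction balances the applied force, up to the limit mu * N."""
    return min(applied, mu * normal)


def net_horizontal_force(applied, mu, normal):
    """Resultant horizontal force once friction has been subtracted."""
    return applied - friction_force(applied, mu, normal)


# Values from the book-on-table example: mu = 0.3, N = 20 N.
for x in (2.0, 4.0, 6.0, 8.0, 10.0):
    print(x, friction_force(x, 0.3, 20.0), net_horizontal_force(x, 0.3, 20.0))
```

For applied forces up to 6 N the resultant is zero, so the book stays at rest. Beyond that limit the friction is capped and a non-zero resultant remains.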
We have one other experience to explain. If the force applied is a small amount greater
than the maximum frictional force µN, we would expect the book to accelerate very slowly.
In fact, it accelerates more than we would expect, so that a smaller force is enough to keep it
moving. This suggests that two different explanations of friction are needed, one for before
an object starts moving, and another for after the object has started moving. There are two
different coefficients of friction:
• µS is the coefficient of static friction which applies when two surfaces are not moving
relative to each other
• µK is the coefficient of kinetic friction which applies when two surfaces are moving
relative to each other.
Generally, µS is greater than µK between the same surfaces.
The worked example shows a typical calculation.
A book of mass 0.8 kg is placed on a surface with coefficient of static friction 0.4.
a The surface is gradually tilted up until the book begins to slide. Find the angle at which it
begins to slide.
b Given that the coefficient of kinetic friction is 0.25, find the acceleration of the book down
the slope.
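[Figure: the forces acting on the book on the slope, including the normal contact force N and the
frictional force F.]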
Step 1 We find the angle, θ, at which the book just begins to slide. We do this by taking
components of the forces parallel and perpendicular to the slope:
Perpendicular to the slope N = W cos θ
Parallel to the slope F = W sin θ
Dividing the second equation by the first we get sin θ/cos θ = tan θ = F/N
However, we also know that at the limit of static friction F = µSN and so F/N = µS
Hence the book begins to slide when tan θ = µS.
θ = tan−1(µS) = tan−1(0.4) = 21.8°
Note that the normal force and the component of the weight down the slope are
both proportional to the weight. This means that the value of F/N at the limit of
static friction stays the same for any weight. Therefore, for a particular value of the
coefficient of static friction, the angle at which the book starts to slide is the same
for any weight.
Step 2 Once the angle is very slightly greater than 21.8°, the static frictional force reaches
its maximum value and the book begins to slide. Once it is moving, the coefficient of
friction reduces to μK = 0.25 and so the frictional force drops to μKN.
The force down the slope is now given by W sin θ − F = W sin θ − µKN
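One way to finish the calculation numerically is sketched below. It assumes g = 9.81 m s−2 and combines the expressions above (N = W cos θ and resultant force W sin θ − µKN) with Newton's second law, so that the mass cancels. Treat it as an illustration of the method rather than the worked solution itself.

```python
import math

G = 9.81  # m s^-2, assumed value of g


def slide_angle_deg(mu_static):
    """Angle at which sliding begins: tan(theta) = mu_S."""
    return math.degrees(math.atan(mu_static))


def acceleration_down_slope(theta_deg, mu_kinetic, g=G):
    """a = (W sin(theta) - mu_K W cos(theta)) / m = g (sin(theta) - mu_K cos(theta))."""
    theta = math.radians(theta_deg)
    return g * (math.sin(theta) - mu_kinetic * math.cos(theta))


theta = slide_angle_deg(0.4)
a = acceleration_down_slope(theta, 0.25)
print(round(theta, 1), "degrees")
print(round(a, 2), "m s^-2")
```

Because the weight appears in every term, the book's mass of 0.8 kg does not affect either answer.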
Summary
■ The value of the frictional force is given by the expression F ≤ μN where μ is the
coefficient of friction.
■ There are two coefficients of friction:
■ The coefficient of static friction μS, which applies when the two surfaces in
contact are stationary.
■ The coefficient of kinetic friction μK, which applies when the two surfaces in
contact are moving.
■ Generally, kinetic friction is less than static friction.
End-of-chapter questions
S4.1
A large shopping trolley is filled up and has a mass of 80 kg. The coefficient of static
friction is 0.2.
a Find the acceleration of the trolley if it is initially stationary and a force of 100 N is
applied to it.
b Find the acceleration of the trolley if it is initially stationary and a force of 200 N is
applied to it.
c Find the acceleration under the same forces if the trolley is already moving. The
coefficient of kinetic friction is 0.08.
Learning Outcomes
■ understand that a heat engine is a device that is supplied with thermal energy and converts
some of this energy into useful work
are often only about 30–40% efficient, but electric motors can be over 90% efficient. You will
look at heat engines in much more detail in Chapter 22.
The force produced by an expanding gas is not always constant. For example, when a gas
expands in a combustion engine, the pressure of the gas reduces so that the force on the
piston reduces. We cannot simply multiply the force by the displacement to calculate
the total work done; instead, we must plot a graph of force (y-axis) against displacement (x-axis).
The work done can then be found by measuring or calculating the area under the force–
displacement graph.
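When the force is known at a series of displacements, the area can be estimated by splitting the graph into trapezia. The readings in this sketch are hypothetical and are only meant to show the method.

```python
def work_done(displacements, forces):
    """Estimate the area under a force-displacement graph using trapezia."""
    total = 0.0
    for i in range(1, len(displacements)):
        width = displacements[i] - displacements[i - 1]
        total += 0.5 * (forces[i] + forces[i - 1]) * width
    return total


# Hypothetical readings for a force that falls as a gas expands.
x = [0.00, 0.02, 0.04, 0.06, 0.08]        # displacement / m
f = [400.0, 300.0, 240.0, 200.0, 180.0]   # force / N

print(work_done(x, f), "J")
print(work_done([0.0, 2.0], [5.0, 5.0]), "J")  # constant force: force x displacement
```

For a constant force the trapezium collapses to a rectangle, and the result reduces to force multiplied by displacement.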
Learning Outcomes
■ understand that gravitational potential is the energy per unit mass of a system
1 A car park has floors spaced 3 m apart. Calculate the change in gravitational potential in
going from:
a Level 1 to level 2
b Level 2 to level 7
c Level 5 to level 4
Calculate the change in g.p.e. of a person of mass 60 kg and a car of mass 1500 kg
in each case.
Step 1 Use the equation change of gravitational potential = g∆h
a change of gravitational potential = 9.81 × (2 − 1) × 3 = 29.4 J kg−1
b change of gravitational potential = 9.81 × (7 − 2) × 3 = 147 J kg−1
c change of gravitational potential = 9.81 × (4 − 5) × 3 = −29.4 J kg−1
The change of g.p.e. is found by multiplying the mass by the change of gravitational
potential:
a change of g.p.e. = 60 × 29.4 = 1770 J for the person and 1500 × 29.4 = 44 100 J
for the car
b change of g.p.e. = 60 × 147 = 8830 J for the person and 1500 × 147 = 221 000 J
for the car
c change of g.p.e. = 60 × −29.4 = −1770 J for the person and 1500 × −29.4 = −44 100 J
for the car
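The same calculation can be written as a short script, which makes it easy to try other floors or masses. The function names are our own. The script works with unrounded values, so its output agrees with the answers above once rounded to three significant figures.

```python
G = 9.81             # m s^-2
FLOOR_SPACING = 3.0  # m between floors, from the example


def potential_change(level_from, level_to, spacing=FLOOR_SPACING, g=G):
    """Change in gravitational potential, g * delta-h, in J kg^-1."""
    return g * (level_to - level_from) * spacing


def gpe_change(mass, level_from, level_to):
    """Change in g.p.e. = mass * change in gravitational potential, in J."""
    return mass * potential_change(level_from, level_to)


for start, end in [(1, 2), (2, 7), (5, 4)]:
    print(start, end,
          potential_change(start, end),
          gpe_change(60.0, start, end),     # person
          gpe_change(1500.0, start, end))   # car
```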
Learning Outcomes
■ understand that the efficiency equation can also be written in terms of power as well as energy
Summary
■ A heat engine is supplied with thermal energy and converts some of this energy into
useful work.
S6: Momentum
Learning Outcomes
■ calculate impulses and relate them to change in momentum
S6.1 Impulse
Newton’s second and third laws of motion can be used to explain why momentum is
conserved. Consider a collision between two objects, A and B. Object A exerts a force $F_A$ on
object B with mass $m_B$, causing B to accelerate with acceleration $a_B = \frac{F_A}{m_B}$. The change in
velocity of B is given by $\Delta v_B = a_B t$, where t is the time the collision takes (we call this time the
duration of the collision).
Hence $\Delta v_B = \frac{F_A t}{m_B}$ by Newton’s second law.
By a similar argument, the change in velocity of A is $\Delta v_A = \frac{F_B t}{m_A}$, where $F_B$ is the force
exerted by B on A. (See if you can use the
logic of the explanation above for object B to write down the explanation for object A.) If we
multiply each side of the equation for each object’s change in velocity by the relevant object’s
mass we get:
$$m_B \Delta v_B = F_A t$$
$$m_A \Delta v_A = F_B t$$
The two objects collide with each other, and are in contact with each other for the same
length of time, t, so t is the same in both equations. Newton’s third law tells us that $F_A = -F_B$,
as the force exerted by object A on object B has the same magnitude as, but the opposite direction
to, the force exerted by object B on object A. It follows that:
$$m_A \Delta v_A + m_B \Delta v_B = 0$$
In other words, during any interaction (for example, a collision), the total change in
momentum of any pair of interacting objects is zero.
This analysis has another point to it. The change in momentum of an object, m∆v, can be
calculated by multiplying the force F acting on it by the time t over which the force acts.
The quantity Ft is called impulse and it equals the change in momentum. Impulse can be
measured in kg m s−1, the same units as momentum. The units that are more often used for
impulse are N s.
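The argument above can be checked with numbers. In this sketch the masses, the force and the collision time are hypothetical; the force on A is taken to be equal and opposite to the force on B, as Newton's third law requires.

```python
def velocity_change(force, duration, mass):
    """Delta-v = F t / m, from Newton's second law with a constant force."""
    return force * duration / mass


def total_momentum_change(m_a, m_b, force_a_on_b, duration):
    """Sum of the momentum changes of A and B during their interaction."""
    dv_b = velocity_change(force_a_on_b, duration, m_b)
    dv_a = velocity_change(-force_a_on_b, duration, m_a)  # third law: F_B = -F_A
    return m_a * dv_a + m_b * dv_b


# Hypothetical collision: 2.0 kg and 0.5 kg objects, 10 N for 0.2 s.
print(velocity_change(10.0, 0.2, 0.5))               # change in velocity of B
print(velocity_change(-10.0, 0.2, 2.0))              # change in velocity of A
print(total_momentum_change(2.0, 0.5, 10.0, 0.2))    # total change in momentum
```

The two objects undergo different changes in velocity, but each receives an impulse of magnitude 2 N s, in opposite directions, so the total change in momentum is zero.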
1 A ball of mass 100 g is travelling due west at 50 m s−1. It is struck by a racket that exerts a
force due east. The ball and racket are in contact with each other for 160 ms, after which the
ball travels east with a speed of 30 m s−1. Find the average force exerted by:
a the racket on the ball
b the ball on the racket.
Step 1 First we find the change in momentum of the ball. We shall use west as the
positive direction. The ball’s momentum initially is 0.1 kg × 50 m s−1 = 5 kg m s−1.
After the collision the ball has momentum 0.1 kg × 30 m s−1 east = −3 kg m s−1, where
the negative sign indicates that the direction has reversed. Hence the change in
momentum is:
−3 − 5 = −8 kg m s−1
The impulse Ft equals this change in momentum, with t = 160 ms = 0.16 s:
−8 = F × t
F = −8/0.16 = −50 N
The negative sign shows that the force acts to the east, so the racket exerts an average
force of 50 N due east on the ball.
For part (b) we use Newton’s third law, which tells us that the force exerted on the racket by
the ball is 50 N to the west.
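The worked example can be reproduced in a few lines, which is a useful check on the signs. West is taken as the positive direction, as in Step 1; the function name is our own.

```python
def average_force(mass, v_initial, v_final, contact_time):
    """Average force = change in momentum / contact time."""
    return mass * (v_final - v_initial) / contact_time


# West is positive: the ball arrives at +50 m s^-1 and leaves at -30 m s^-1.
force_on_ball = average_force(0.100, 50.0, -30.0, 0.160)
force_on_racket = -force_on_ball  # Newton's third law

print(force_on_ball)    # negative, so directed east
print(force_on_racket)  # positive, so directed west
```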
Figure S6.1 When the force varies with time, the impulse is found by taking the area under the
graph (shaded region). The peak of the graph marks the maximum value of the force.
We can still calculate a value for the force, by taking Ft = m∆v, with t being the total collision
time. This will be the average force.
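A sketch of both ideas follows, using a hypothetical triangular force pulse: the impulse is estimated from the area under the force–time graph, and the average force is that impulse divided by the total collision time.

```python
def impulse_from_graph(times, forces):
    """Estimate the area under a force-time graph using trapezia (N s)."""
    total = 0.0
    for i in range(1, len(times)):
        total += 0.5 * (forces[i] + forces[i - 1]) * (times[i] - times[i - 1])
    return total


def average_force_from_graph(times, forces):
    """Average force = impulse / total collision time."""
    return impulse_from_graph(times, forces) / (times[-1] - times[0])


# Hypothetical force pulse that rises to a peak of 80 N and falls back to zero.
t = [0.00, 0.01, 0.02, 0.03, 0.04]   # time / s
f = [0.0, 40.0, 80.0, 40.0, 0.0]     # force / N

print(impulse_from_graph(t, f), "N s")
print(average_force_from_graph(t, f), "N")
```

In this example the average force is half the maximum force, which is what we would expect for a triangular pulse.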
Question
6.1 A toy cart of mass 0.5 kg is first pushed by a force of 2 N for 4 s, and then a force of 6 N for
1 s. Find:
a the total impulse acting on the cart
b the change in momentum of the cart
c the change in velocity of the cart
d the average force acting on the cart.
Summary
■ Change of momentum is a vector called impulse, given by Ft, and measured in N s.
■ If the force varies, the impulse can be found from the area under a force–time graph.
End-of-chapter questions
S6.1
A toy rocket has a spring in its base which is used to launch it sideways from a wall.
Figure S6.2 shows a force–time graph of the spring during the launch. The rocket has
mass 25 g. Using the graph find the speed of the rocket just after it launches.
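[Figure S6.2: force–time graph of the spring during the launch. The force axis (in N) is marked at 0
and 0.06; the time axis (in s) is marked at 0, 0.5 and 0.7.]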
S6.2
A tennis ball hits a wall at 30 m s−1, reversing its motion. The ball has mass 80 g. If the
average force between the ball and the wall is 60 N, find the contact time.
Learning Outcomes
■ distinguish between elastic and plastic deformation of a material
■ recall the terms brittle, ductile, hard, malleable, stiff, strong and tough, explain their meaning
and give examples of materials exhibiting such behaviour
■ explain the meaning of strength, breaking stress and stiffness
■ draw force–extension, force–compression and tensile/compressive stress–strain graphs, and
explain the meaning of the limit of proportionality, elastic limit, yield point, breaking force
and breaking stress
■ state Hooke’s law and identify situations in which it is obeyed
■ account for the stress–strain graphs of metals and polymers in terms of the microstructure
of the material
remains elastic up to strains of several hundred percent. We can conclude that both the
shape of a stress–strain graph and the numerical values on the axes indicate the type of
material being investigated.
To begin with, contrast the behaviour of two different metals in Figure S7.1.
Figure S7.1 Stress–strain graph for two different metals, A and B. Stress (0 to 300 MPa) is plotted
against strain (0 to 1.0%).
Both metals obey Hooke’s law, although they have different Young moduli, with A being
stiffer than B. However, the graphs tell us much more.
• Metal A obeys Hooke’s law but then breaks suddenly, with only a very small region
beyond the straight-line section. If metal A were to be tested in the experiment shown
in Figure 7.10 (in the Coursebook), it would extend steadily by a fixed amount with each
weight added, and then snap. Metal B would behave very differently. As with A, at first it
would extend a fixed amount with each weight. Then it would extend much, much more
with each weight – perhaps 10 times as much – and it would be possible to see the wire
getting thinner. At some point the wire would continue to stretch even with no more
weight added, and then snap.
• Metal A will break at a larger load than B – it is said to be stronger. It extends up to the
limit of proportionality (the end of the straight-line region of the graph) and then a
little beyond, to the elastic limit. Remember that when a material extends past the elastic
limit, it will not return to its original length because it is deformed. Rather than then
deforming, metal A snaps. A material showing this behaviour is described as brittle.
Among metals, a good example would be cast iron. Among non-metals, most types of
glass show the same behaviour.
• Metal B will extend past the limit of proportionality, past the elastic limit and then will
deform substantially before breaking. When loaded in this way it will be drawn out into a
thinner and thinner wire. We call this behaviour ductility. In fact, electrical wire is often
made this way – by being drawn through a small hole and stretched. A good example
would be copper.
Stretching a metal wire is something that involves deformation in one dimension. A metal
can also be deformed in two dimensions by hammering or rolling and stretching it out flat.
Metals that deform easily in this fashion are called malleable (from the Latin word for a
hammer). The most malleable metal is gold, which is often made into exceptionally thin
sheets called gold leaf, used for decoration. Lead is also very malleable; in the past, lead was
often used as a roofing material.
Some metals such as steel and titanium are not very malleable unless they are heated. If
they are hammered when at normal temperatures they show very little or no deformation.
Sometimes a sample of these metals may shatter into pieces when hammered. These metals
are said to be hard.
More generally, a harder material will scratch the surface of a less hard material.
Geologists use this relative hardness to test and identify minerals. By using a small number
of items with different hardness, such as a piece of glass, a steel penknife and a couple of
different stones, it is easy to determine the hardness of a mineral relative to those materials.
Figure S7.2 summarises some of the important terms discussed so far.
Figure S7.2 A typical stress–strain graph for a metal, plotting stress (MPa) against strain (%). The
labels mark the yield point, the plastic region and the ultimate tensile strength.
The behaviour of a metal like copper can be explained using this graph. Once it is loaded past
the elastic limit, it begins to deform. Beyond that point the graph’s gradient is very shallow,
showing that the material is much less stiff and so the wire extends much more. Eventually
the graph starts to curve downwards (at the ‘ultimate tensile strength’ point). Beyond this
point, the load on the wire is sufficient to keep the wire extending – it actually takes less and
less force to continue to stretch the wire. Eventually the wire reaches its breaking force and
snaps. By knowing the cross-sectional area of the wire at this moment, the breaking stress
can be calculated using the equation stress = force/area.
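As an illustration of the equation stress = force/area, the sketch below assumes a wire of circular cross-section. The diameter and breaking force are hypothetical values, not data from the text.

```python
import math


def cross_sectional_area(diameter):
    """Area of a circular cross-section, in m^2, for a diameter in m."""
    return math.pi * (diameter / 2) ** 2


def stress(force, area):
    """Stress = force / area, in Pa when force is in N and area in m^2."""
    return force / area


# Hypothetical wire: diameter 0.5 mm at the moment of breaking, breaking force 60 N.
area = cross_sectional_area(0.5e-3)
breaking_stress = stress(60.0, area)
print(area, "m^2")
print(breaking_stress / 1e6, "MPa")
```

Because a ductile wire becomes thinner before it breaks, the diameter used should be the one measured at the moment of breaking.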
Testing Materials
You may wonder how a graph like that in Figure S7.2 is plotted, given that a wire will extend
rapidly once the load exceeds the ultimate tensile strength. Professional testing apparatus
works differently – the test sample is trapped between jaws and stretched (see Figure S7.3). Both
the force applied and the extension are measured. As the material stretches, the apparatus can
alter the force as necessary in order to stretch the wire by constant increments.
Figure S7.4 In a brittle material stress can be concentrated at the tip of a crack.
A brittle material, such as glass, does not deform plastically. Brittle materials are much more
likely to form cracks that propagate. A ductile material can deform plastically, so the atoms
can rearrange as the material deforms. This acts to ‘blunt’ the crack and share the load
among more bonds. Resistance to cracking is called toughness. However, a tough material is
not necessarily strong – for example, polythene is tough but it is not very strong.
Cracks will only propagate when a material is placed under tension. Under compression,
a crack will close up. Some materials can be very much stronger under compression than
under tension – examples include stone and concrete. Such materials are often used in
construction. They are wonderful at supporting walls, where they are compressed, but less
good at spanning gaps, where they are stretched (see Figure S7.5).
Figure S7.5 A horizontal structure under load will have areas both of compression and of tension.
To solve this, concrete lintels (horizontal supports such as those over windows or doors) are
often reinforced by the addition of steel rods. The steel performs well under tension, and
prevents the concrete from cracking.
Metals
Although metals are crystalline (they are made up of atoms in a regular arrangement), there
are two important reasons why they do not behave in the ‘ideal’ way just described:
1 A crystal is never perfect – the planes (layers) of atoms do not always align. The point where
one plane does not align with the next is called a dislocation. Dislocations allow whole
planes of atoms to slide over one another, and so the material can deform without breaking.
However, this ability to deform is limited. If there are too many dislocations, they can
tangle and restrict the movement of planes of atoms. This leads to something called work-
hardening, where a material that is repeatedly stretched will eventually go brittle and snap.
This is easily demonstrated: a steel paperclip has been bent into shape and can be bent out of
shape; however, if this is done too many times, the paperclip will break.
2 Most metals are polycrystalline. Instead of being formed from a single crystal, most
metals are made up of many grains (crystallites or small crystals). Within any single
grain the atoms will be nicely ordered (apart from dislocations), but each grain is
randomly arranged relative to the others. The presence of boundaries between the grains
(called grain boundaries) also limits the movement of the planes of atoms.
These two factors mean that a sample of a metal made from a single, large crystal will be
brittle. However, a sample containing many much smaller grains will also be brittle.
For metals, the behaviour under compression is very similar to the behaviour under
tension, as the atomic bonds behave the same way in each case. The planes of atoms can slide
over each other in the same fashion.
Polymers
The molecules in polymers consist of many repeated units of atoms bonded together. We call
these long-chain molecules, and their length means they are often found coiled and wound
around each other. When a force of tension is applied, the molecules begin to straighten out.
This requires much less force than stretching the bonds between atoms, and so polymers
(such as polythene or rubber) are much less stiff than metals.
The amount of ‘unwinding’ of long-chain molecules is usually not proportional to the force
applied, so polymers often do not obey Hooke’s law. However, the unwinding means that the
maximum strain can be much greater than that of individual bonds between atoms. Some
polymers can withstand a strain of several hundred percent. Once the molecules are fully
stretched, though, a polymer can become much stiffer. You may have noticed that an elastic
band will extend significantly up to a point, but then stiffen and sometimes break.
Different polymers behave differently under compression and tension. The amount of
compressive and tensile strain a polymer may undergo before strain acts directly on the
bonds within molecules depends on how straight or coiled up the polymer molecules are.
Stretching molecules is not necessarily an elastic process. In some polymers, such as
hardened rubber, there are many cross-links, which are weak bonds either between curved
sections of one molecule, or between one molecule and another (see Figure S7.6). These
cross-links have to be broken before the molecules can stretch. Having more cross-links
makes a polymer stiffer, and it means that more energy is needed for the stretching to take
place. After stretching, cross-links can re-form, so the material returns to its original length.
Stretching and shrinking a material made from polymers can cause the material to give off
heat. This is due to energy being released when the cross-links re-form.
Amorphous materials
Not all materials are crystalline. In some materials, the atoms or molecules are arranged in
an apparently random pattern. These are amorphous materials, such as glass and ceramics.
Amorphous materials are brittle, because there are no crystal planes able to move, no
dislocations and no cross-links to absorb energy.
Summary
■ A ductile material can be drawn out into a wire, a malleable material can be
flattened into a sheet.
■ Brittle materials break cleanly without deforming.
■ Tough materials deform and so resist cracking.
■ A strong material requires a large stress to break it. This can be measured by a
quantity called the ultimate tensile stress, the breaking stress or the yield stress.
■ A material can show different properties depending on whether forces applied are
tensile or compressive.
■ The characteristics of a stress–strain graph can be explained by the small-scale
structure of a material.
End-of-chapter questions
S7.1
Two identical steel wires are tested. The first wire is heated and quenched (placed
quickly in cold liquid) so that it becomes brittle. The second wire is left untreated. Each
wire in turn is loaded with equal masses, one at a time, until they break. Predict how
each of the two wires would behave. Highlight any similarities and differences.
S7.2
A wire of diameter 0.2 mm is gradually loaded with masses. Once a total mass of 2.3 kg
is loaded, the wire starts stretching rapidly and then breaks. Calculate the ultimate
tensile force and thus the stress of the wire.
S13: Waves
Learning Outcomes
■ describe sound waves in terms of the displacement of molecules or changes in pressure
■ explain what is meant by a plane-polarised wave, and use Malus’ law to calculate the
amplitude and intensity of transmission through a polarising filter
■ understand refraction of waves at the interface between two media, and relate the refractive
index to the wave speeds in those media
■ derive the equation for the critical angle and use it to solve problems
■ recall that total internal reflection occurs when a wave is incident at an angle greater than the
critical angle, and that optical fibres use total internal reflection to transmit signals
■ recall that, in general, waves are partially transmitted and partially reflected at an interface
between media
S13.1 Terminology
Another name for a progressive wave is a travelling wave.
We can use the terms frequency and period to describe other periodic (or cyclic)
phenomena too, as you will see in later chapters on oscillations and rotation. The period,
T, is the time for one cycle, and the frequency, f, is the number of cycles per unit time. They
are always related by the reciprocal relationship $f = \frac{1}{T}$.
Figure 13.8 in the Coursebook (Chapter 13) shows how we can represent longitudinal and
transverse waves. The high pressure regions of a longitudinal wave are called compressions,
and the low pressure regions are called rarefactions.
The sine graph used to represent a longitudinal wave may be plotted as pressure change
against distance, with zero on the pressure axis corresponding to the equilibrium pressure.
Alternatively it may be plotted as displacement against distance, where the displacement
refers to the displacement of the particles from their equilibrium position. The maximum
displacement does not correspond to the maximum pressure, though! At the centre of a
compression (maximum pressure) or rarefaction (minimum pressure), the displacement is
zero. The largest displacements correspond to the points that are between and equidistant
(equal distances) from the compressions and rarefactions. We can describe the displacement
and pressure in a sound wave as being 90° out of phase with each other. Phase difference is
discussed further in the next section.
So far, we have described waves in terms of how their displacement varies with distance
along the direction of travel of the wave. The graphs we have been plotting are a ‘snapshot’ of
what the wave looks like at a particular instant in time. If we were to take a second ‘snapshot’
half a period later, we would see that the wave had moved half a wavelength to the right
(along the distance axis). This is shown in Figure S13.1a. Instead of plotting displacement
against position (at a given time), we could plot displacement against time, at a given position.
This produces the graph shown in Figure S13.1b, on which we can identify the period of the
wave. We measured this period in Box 13.1 and the accompanying worked example.
Figure S13.1 a A progressive wave travels along the direction of propagation, so at later times
the graph of displacement against distance will be shifted along the distance axis. b A graph of
displacement against time, for a fixed point along the direction of travel of the wave. The time for
one complete oscillation to pass that point is known as the period, T.
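The two ways of plotting a wave can be explored with a simple model. The sketch below assumes a sinusoidal wave travelling in the positive direction along the distance axis; the function name and the values of amplitude, wavelength and period are our own choices. It checks that a snapshot taken half a period later matches the original snapshot shifted half a wavelength along the distance axis.

```python
import math


def wave_displacement(x, t, amplitude, wavelength, period):
    """Assumed sinusoidal progressive wave travelling in the +x direction."""
    return amplitude * math.sin(2 * math.pi * (x / wavelength - t / period))


# Hypothetical wave: amplitude 1.0, wavelength 2.0 m, period 0.5 s.
A, wavelength, T = 1.0, 2.0, 0.5
t0 = 0.1

for x in (0.0, 0.3, 1.7):
    later = wave_displacement(x, t0 + T / 2, A, wavelength, T)
    shifted = wave_displacement(x - wavelength / 2, t0, A, wavelength, T)
    print(x, later, shifted)  # the two values agree
```

Holding x fixed and varying t instead gives the displacement–time graph, which repeats after one period T.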
Note that phase difference can be measured in radians, where a complete cycle of
360° = 2π radians. (See also Chapter 17.) For example, this means that a phase difference
of 90° is $\frac{\pi}{2}$ radians.
Reflection
You will be very familiar with the phenomenon of reflection from your everyday life. You
probably see your own reflection in a mirror or a reflective surface several times daily, and
often you will hear the reflection of sound as echoes. The reflection of seismic waves can
be used to investigate the structure of rocks beneath the surface and search for oil. Police
radar detectors reflect radio waves off vehicles. If the vehicle is moving, the reflected wave
undergoes a Doppler shift, which can be used to calculate the speed of the vehicle.
All types of waves can be reflected, although the properties of the surface required to
reflect them vary depending on the type of wave. When waves are reflected, they obey the
law of reflection, illustrated in Figure S13.2:
The incident and reflected rays are at equal angles to the normal at the reflection point.
Figure S13.2 a The law of reflection, i = r. The normal is a line drawn at right angles to the surface. For
curved surfaces, at any point the normal is a line at right angles to the tangent to the curve at that
point. b Reflections from a curved surface: a parabolic mirror.
We have already met the idea of wavefronts, which ‘join up’ points of equal phase on the
wave. A ray is a line that is at right angles to the wavefront. If we start a ray from the wave
source in a given direction, it will follow a path that is at right angles to all the wavefronts
it crosses. You will already be familiar with the idea of light rays from your earlier physics
courses, but we can extend the use of rays to any other types of wave.
We can use a ray diagram to analyse the properties of the reflection. Figure S13.3 is a ray
diagram showing how an image is formed from a reflection in a plane mirror. This image
is known as a virtual image since no real rays of light actually cross (or converge) at the
image location. To find the image, we have to project the reflected rays backwards behind the
mirror to the point where they meet (the dotted lines in the diagram). The reflected light rays
are said to be diverging (spreading apart) in front of the mirror. They diverge in the same
way as light would if it travelled directly from an object placed at the image location (if the
mirror were not there).
Questions
13.1 Use a ray diagram to prove that the image of a point in a plane mirror is the same
distance behind the mirror as the object point is in front.
13.2 Using the result in question 13.1, explain what the image looks like when a
three-dimensional object is placed in front of the mirror. Use diagrams to help you.
Figure S13.4 A wave pulse passing along a string. In a the end of the string is fixed, and the pulse
undergoes a phase change of π radians (180°) on reflection. In b the end of the string is free and the
pulse is not inverted on reflection.
We can explain the phase change on reflection using our knowledge of Newton’s laws of
motion. Think about the case where the Slinky was held fixed at one end (Figure S13.4a), and
imagine that an upward pulse is arriving at the fixed point. The upward movement of the
Slinky exerts an upward force on the fixed point as it arrives. Therefore, by Newton’s third
law, the fixed point must exert an equal downwards force on the Slinky. This accelerates this
part of the Slinky downwards, and so the pulse is inverted.
The same phase change on reflection can happen with light. A light ray travelling through
air and reflecting off the surface of a piece of glass undergoes a phase shift of π radians on
reflection. However, a ray travelling through glass and reflecting off the interface between the
glass and air does not undergo a phase shift on reflection.
The general principle can be summarised as:
• when a wave travels through a more dense medium and reflects off a less dense medium,
there is no phase shift
• when a wave travels through a less dense medium and reflects off a more dense medium,
the wave is inverted.
For light, we say that the medium with the higher refractive index (see below) is more
optically dense. In the case of a mechanical wave on a spring or a rope, we are referring to
density in the usual sense of mass per unit volume, assuming that the tension in the spring or
rope remains the same across the boundary.
Refraction
You may have noticed that when you put a straw in a glass of water, the straw appears bent
(Figure S13.5). Of course, the straw itself is not bent, but light rays travelling from the straw
change direction as they leave the water. This phenomenon is called refraction, and occurs
whenever a wave travels through a boundary between two different materials and changes
speed. The human eye uses refraction to form an image of the world around us on the retina.
If you wear spectacles or use contact lenses, the refraction of light in the lens provides the
correction necessary for an image to be formed in focus on the retina.
Figure S13.5 A straw placed in a glass of water appears bent because the light rays reflected from
the bottom of the straw are refracted when they leave the water.
Imagine a car driving along a straight road with a hard surface. At the edge of the road there
is soft mud. If the wheels on the left side of the car roll off the road into the mud, then they
will be slowed down compared to the wheels that remain on the road. The car will turn to
the left: its velocity vector will change from being almost parallel to the road, to pointing
to the left of the road. This models what happens when waves are refracted.
Figure S13.6 The speed of a wave depends on the medium (material) through which it travels. When a wave is
transmitted across a boundary between two media that have a different wave speed, it is refracted. a shows how
the ray and wavefronts are refracted on passing from medium 1 (speed v1, wavelength λ1) into medium 2 (speed v2,
wavelength λ2), where v1 > v2 and λ1 > λ2. b shows a close-up view of a, with the normal to the boundary, the
angles θ1 and θ2 and the points A, C and D marked, and allows us to derive the law of refraction (see text).
In Figure S13.6, the wavefronts are continuous across the boundary between the two
materials – that is, although the wavefronts change direction, each is an unbroken line as it
crosses the boundary. The wavefronts are continuous because the frequency of the waves is
the same on either side of the boundary. The frequency of the wave is set at the moment it
leaves the source. The frequency cannot change as the wave crosses the boundary (otherwise
a number of wavefronts would disappear completely). However, the wavelength does change
as the wave crosses the boundary, because the speed of the wave is different in the two
materials. Earlier in Chapter 13 of the Coursebook, we used the equation v = f λ to relate the
wavespeed, frequency and wavelength. If the frequency remains constant but v decreases as
the wave moves from medium 1 to medium 2, the wavelength λ must decrease. To allow this
while keeping the wavefronts continuous across the boundary, the wavefronts have to change
direction as they cross the boundary: they are refracted and the ray appears to bend.
We can use the geometry of the two right-angled triangles shown on the diagram to
produce two different expressions for the length of line AC, in terms of the wavelengths on
each side of the boundary:
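$$AC = \frac{\lambda_1}{\sin\theta_1}$$
$$AC = \frac{\lambda_2}{\sin\theta_2}$$
Using v = f λ:
$$\frac{v_1}{f \sin\theta_1} = \frac{v_2}{f \sin\theta_2}$$
then rearranging:
$$\frac{v_1}{v_2} = \frac{\sin\theta_1}{\sin\theta_2} = n$$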
Here n is called the refractive index of medium 2 with respect to medium 1, and is the
ratio of the wavespeeds in the two media, v1/v2. We could also call this the boundary refractive
index when travelling from medium 1 to medium 2. The line at right angles to the boundary,
at the point at which the ray crosses the boundary, is called the normal. The angles θ1 and
θ 2 are the angles the ray makes with the normal on either side of the boundary. Notice that
the incident ray, refracted ray and normal are all in the same plane. Because we measure the
angles to the normal, we can use the same law for curved surfaces.
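As a rough numerical check of the derivation above, the following Python sketch computes the two wavelengths and the angle of refraction from the wave speeds. The function name and the sample values (a 10 Hz wave slowing from 0.30 m/s to 0.20 m/s) are illustrative assumptions, not values from the text.

```python
import math

def refract_from_speeds(frequency, v1, v2, theta1_deg):
    """Return (wavelength1, wavelength2, theta2_deg) for a wave crossing a boundary.

    Uses v = f * wavelength (frequency unchanged across the boundary) and
    v1 / v2 = sin(theta1) / sin(theta2). Angles are measured from the normal.
    """
    wavelength1 = v1 / frequency
    wavelength2 = v2 / frequency
    sin_theta2 = math.sin(math.radians(theta1_deg)) * v2 / v1
    if sin_theta2 > 1:
        raise ValueError("no refracted ray for this angle of incidence")
    theta2_deg = math.degrees(math.asin(sin_theta2))
    return wavelength1, wavelength2, theta2_deg

# Hypothetical values chosen only for illustration.
lam1, lam2, theta2 = refract_from_speeds(frequency=10.0, v1=0.30, v2=0.20, theta1_deg=40.0)
print(f"wavelengths: {lam1:.3f} m and {lam2:.3f} m, angle of refraction: {theta2:.1f} degrees")
```

Because the wave slows down, the wavelength shortens and the ray bends towards the normal. The two expressions for the length AC (λ1/sin θ1 and λ2/sin θ2) come out equal, as the geometry requires.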
$$n_{\text{abs}} = \frac{c}{v}$$
where nabs is the (absolute) refractive index, c is the speed of light in a vacuum (3.00 × 10⁸ m s⁻¹),
and v is the speed of light in the medium. You will see in most cases the absolute refractive
index is simply called the refractive index of a medium. The speed of light in any medium can
never be greater than the speed of light in a vacuum (empty space), so the refractive index is
always greater than 1. For light travelling in air, the speed of light is very close to the speed of
light in a vacuum, so we often approximate the refractive index of air to be 1. Table S13.1 gives
some refractive indices for common media.
We can use this new definition to write a new expression for Snell’s law. If the refractive
index of medium 1 is n1 and that of medium 2 is n2, we can write:
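$$v_1 = \frac{c}{n_1}$$
and
$$v_2 = \frac{c}{n_2}$$
Substituting these into v1/v2 = sin θ1/sin θ2 gives n2/n1 = sin θ1/sin θ2, so:
$$n_1 \sin\theta_1 = n_2 \sin\theta_2$$
Note that we can write the refractive index of medium 2 with respect to medium 1, n, as
$$n = \frac{n_2}{n_1}$$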
The use of the absolute refractive indices makes it easier to solve problems. To see how this
works in practice, look at Worked example S13.1.
A ray of light falls on a glass block at an angle of incidence (angle to the normal) of 45°.
The angle of refraction inside the block is measured to be 30°. What is the refractive index
of the glass?
Step 1 Decide which material is medium 1 and which is medium 2. You may wish to draw a
labelled diagram to show the materials and the angles. In this case, we are going from
air into glass, and so medium 1 is air and medium 2 is glass. The refractive index of air is
1.00 (to 3 s.f.).
Diagram: a ray passing from air (n = 1.00) into glass at an angle of incidence of 45°.
Step 2 We need to find the refractive index of medium 2 (n2). Rearrange Snell’s law to find this
quantity, then substitute in the values given in the question.
$$n_1 \sin\theta_1 = n_2 \sin\theta_2$$
$$\Rightarrow n_2 = \frac{n_1 \sin\theta_1}{\sin\theta_2}$$
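Carrying out the substitution for this worked example can be sketched in Python as follows. The function name is hypothetical; the angles and the refractive index of air are those given in the question.

```python
import math

def refractive_index_from_angles(n1, theta1_deg, theta2_deg):
    """Solve n1 sin(theta1) = n2 sin(theta2) for n2 (angles measured from the normal)."""
    return n1 * math.sin(math.radians(theta1_deg)) / math.sin(math.radians(theta2_deg))

# Air into glass: angle of incidence 45 degrees, angle of refraction 30 degrees.
n_glass = refractive_index_from_angles(n1=1.00, theta1_deg=45.0, theta2_deg=30.0)
print(f"refractive index of the glass = {n_glass:.2f}")
```

With these values the refractive index of the glass comes out at about 1.41.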
Step 1 Read the question carefully! Here we are given and asked for angles to the horizontal, but
remember that Snell’s law works with angles to the normal. Draw a labelled diagram with
the given quantities and angles marked, and work out the angles to the normal.
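Diagram: a boundary with air (n = 1.00), with the angle θ2 and an angle of 55° marked.
$$n_1 \sin\theta_1 = n_2 \sin\theta_2$$
$$\Rightarrow \theta_2 = \sin^{-1}\left(\frac{n_1 \sin\theta_1}{n_2}\right)$$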
1 The incident ray, refracted ray and normal to the point of incidence are all in the same plane.
2 If light travels from a medium of refractive index n1 into a medium of refractive index n2, then
the angles that the rays make to the normal to the boundary are given by the relationship
n1 sinθ1 = n2 sinθ 2
θ1 is the angle between the ray and the normal in the medium of refractive index n1 and
θ2 is the angle between the ray and the normal in the medium of refractive index n2
(see Figure S13.7).
Figure S13.7 A ray crossing the boundary from medium 1 (refractive index n1) into medium 2 (refractive index n2), making angles θ1 and θ2 with the normal.
Apparent depth
You may have noticed that when you look down into a pool of water, it appears to be less
deep than it actually is. This effect is due to refraction – if you look back at the photograph
of the straw at the start of this section (Figure S13.5) you will notice that the straw looks bent
upwards in the water.
In fact, if we look directly down into the water (at right angles to the surface), the
refractive index of the water is given by the ratio
$$n = \frac{\text{real depth}}{\text{apparent depth}}$$
If you look at the water at a smaller angle than a right angle, you will find that the apparent
depth is reduced, so this formula only applies if you are looking directly down.
We can explain this using our knowledge of refraction (see Figure S13.8).
Figure S13.8 Refraction means that water appears to be less deep than it really is.
A ray of light coming from the bottom of the container of water at an angle to the normal
is refracted away from the normal as it leaves the water and passes into air (ray CO). A ray
of light that comes from the bottom of the container but is normal to the surface passes
through without changing angle (ray CA). If we trace the first ray back into the water (dotted
line OB), then it meets the ray that came out along the normal at point B. The distance AB is
the apparent depth (the real depth is the distance AC).
If you redraw Figure S13.8 with a larger angle θ1, then you will notice that the rays cross
higher up in the water and the apparent depth is reduced. So to find the maximum possible
apparent depth, we need to work out what happens as θ1 tends to zero.
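Snell’s law tells us that:
$$\sin\theta_1 = n \sin\theta_2$$
We can see that in triangle AOB
$$\sin\theta_1 = \frac{OA}{OB}$$
and that in triangle AOC
$$\sin\theta_2 = \frac{OA}{OC}$$
Substituting these into Snell’s law:
$$\frac{OA}{OB} = n \frac{OA}{OC}$$
As θ1 tends to zero, OB tends to AB and OC tends to AC, so:
$$\frac{OA}{AB} = n \frac{OA}{AC}$$
$$n = \frac{AC}{AB} = \frac{\text{real depth}}{\text{apparent depth}}$$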
This equation can be used to measure the refractive index of a rectangular block of solid
or a liquid, using a travelling microscope (a microscope that moves up and down on a scale).
1. Focus the microscope on a mark on a piece of paper laid on the bench. Call this
measurement on the microscope scale a.
2. Put the block or liquid in place and refocus the microscope so it is again focused on the
mark. Call this measurement on the microscope scale b.
3. Focus the microscope on the top of the block. Call this measurement on the microscope
scale c.
4. The real depth is (c – a), and the apparent depth is (c – b).
5. You can use these measurements in the formula above to calculate the refractive index.
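The apparent-depth geometry and the travelling-microscope procedure above can both be checked numerically. The Python sketch below is only an illustration: the function names, the pool depth and the scale readings are assumptions, and the first function simply follows the triangles AOB and AOC described in the text.

```python
import math

def apparent_depth(real_depth, n, theta1_deg):
    """Apparent depth AB when the emerging ray makes angle theta1 with the normal in air.

    Follows the geometry in the text: OA = AC tan(theta2) and AB = OA / tan(theta1),
    with sin(theta1) = n sin(theta2).
    """
    if theta1_deg == 0:
        return real_depth / n  # limiting case: looking straight down
    theta1 = math.radians(theta1_deg)
    theta2 = math.asin(math.sin(theta1) / n)
    oa = real_depth * math.tan(theta2)
    return oa / math.tan(theta1)

def index_from_microscope(a, b, c):
    """Refractive index from travelling-microscope readings a, b and c (same units)."""
    real = c - a       # top of block minus mark seen directly
    apparent = c - b   # top of block minus mark seen through the block
    return real / apparent

# Hypothetical values: water 2.0 m deep with n = 1.33, viewed at three angles.
for angle in (0, 10, 40):
    print(angle, round(apparent_depth(2.0, 1.33, angle), 3))

# Hypothetical scale readings in centimetres.
print(index_from_microscope(a=1.20, b=2.20, c=4.20))
```

The apparent depth is largest when looking straight down and falls as θ1 increases, which is why the ratio of real depth to apparent depth only equals n for viewing along the normal.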
Question
13.3 If you stand at one end of a swimming pool of constant depth, as you look to the far end
it looks like the swimming pool gets shallower. Explain this effect using a ray diagram
and your knowledge of refraction.
Figure S13.9 White light entering a prism. Glass has a different refractive index for different
colours (wavelengths), so the colours are refracted differently.
White light was known to be split into colours by a prism before Isaac Newton’s experiments
with light, but the colours were thought to originate from the prism in some way. To test this
idea, Newton took the coloured light from the prism and tried to split it further. Since no further
colours were produced, he deduced that the white light was made up of a mixture of colours.
The angle of incidence required for the angle of refraction to be 90° is known as the
critical angle. A critical angle only exists for a ray going from one medium into another
medium with a lower refractive index. (Think about the opposite situation: if the ray were
going into a medium with a higher refractive index, it would be bent towards the normal, and
we would not reach an angle of refraction of 90° before the angle of incidence reached 90°).
For a ray of light going from a medium of refractive index n into air (which we will take
to have a refractive index of 1), the critical angle can be found by using Snell’s law, with the
angle of refraction set to 90°. We will call the critical angle c.
$$n \sin c = 1.0 \sin 90^\circ$$
$$\Rightarrow c = \sin^{-1}\left(\frac{1}{n}\right)$$
More generally, if light travels from a medium with refractive index n1 into a medium with
refractive index n2, where n1 > n2, then the critical angle is:
$$c = \sin^{-1}\left(\frac{n_2}{n_1}\right)$$
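A short Python sketch of this calculation is given below. The function name is an assumption made for illustration; the refractive indices used (1.5 for glass and 1.33 for water) are values quoted elsewhere in this chapter.

```python
import math

def critical_angle_deg(n1, n2=1.0):
    """Critical angle in degrees for light going from index n1 into a lower index n2."""
    if n1 <= n2:
        raise ValueError("a critical angle only exists when n1 > n2")
    return math.degrees(math.asin(n2 / n1))

print(f"glass (n = 1.5) to air: {critical_angle_deg(1.5):.1f} degrees")
print(f"water (n = 1.33) to air: {critical_angle_deg(1.33):.1f} degrees")
```

The function refuses to return a value when n1 is not greater than n2, reflecting the point made above that a critical angle only exists for light heading into a medium of lower refractive index.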
Once the angle of incidence becomes greater than or equal to the critical angle, no
refraction takes place and the ray undergoes total internal reflection (see Figure S13.10),
which obeys the laws of reflection discussed earlier in the chapter.
Figure S13.10 a Total internal reflection occurs when the angle of incidence i is greater than the
critical angle c; below the critical angle there is only a weak reflection inside the medium with refractive index n.
b Photograph showing total internal reflection in an acrylic block.
The critical angle is defined as the angle of incidence for a ray crossing the boundary from
a medium of higher refractive index to one of lower refractive index for which the law of
refraction predicts an angle of refraction of 90°. No refracted ray can form and the incident
ray undergoes total internal reflection at all angles greater than or equal to the critical angle.
Diamond has a very high refractive index and therefore a small critical angle. Diamonds
used for jewellery are cut so that light entering through the top surface is totally internally
reflected and comes back out of the top, so it looks like light is streaming out of the diamond.
Getting the cut right is critical to this – if one cut is not correct, then the light will exit through
the sides of the diamond after being internally reflected. The small critical angle means that
most light entering the diamond is totally internally reflected, and a small movement of the
diamond can cause the light to illuminate a different facet – the diamond appears to sparkle.
Many optical instruments such as binoculars and periscopes use total internal reflection
in 45° prisms. The light meets the internal face of the prism at an angle of incidence of 45°.
Since the critical angle for glass with a refractive index of 1.5 is around 42°, this is greater
than the critical angle, and the light is totally internally reflected (see Figure S13.11).
Figure S13.11 Light is totally internally reflected inside 45° prisms in a periscope.
Fibre optics
Transparent glass fibres (often called optical fibres) guide light along them by total internal
reflection. Light rays that pass into one end of a fibre meet the inner surface at an angle
greater than the critical angle, and are therefore totally internally reflected. This continues
to occur even when the fibre is bent, as long as the radius of the bend in the fibre is much
greater than the radius of the fibre. Most optical fibres produced have a diameter less than a
millimetre, so the condition for total internal reflection is easy to achieve. The fibre used to
transmit the light is usually clad (coated) in a layer of glass with a lower refractive index. This
means that the critical angle is quite large, so the rays travel very close to the axis of the fibre.
Optical fibres are mainly used for communication. In some cities, optical fibre is used
instead of copper wire for high-speed internet communications. It is possible to send
information down an optical fibre much more quickly, and with less signal loss, than sending
electrical pulses down a copper cable (see also Chapter 20 on communications systems). This
is because the high frequency of light (>10¹⁴ Hz) means that very short pulses can be used
and detected. Light with a single frequency (monochromatic light) is used since the glass
is dispersive, and light with a mix of different frequencies (colours) would travel at different
speeds. If the fibres were used to communicate over long distances, then the different frequency
components in non-monochromatic light would spread out and cause the signal to degrade.
Optical fibres are also used in medicine. A device called an endoscope can be inserted
into the body and used to see inside. An endoscope contains one bundle of optical fibres to
transmit light inside the body to illuminate the area under investigation, and another bundle
of fibres to transmit the image back to the physician. Endoscopes are used for diagnosis –
determining the nature of a medical condition. They are also used in operations with special
surgical instruments that can be inserted through a small incision in the patient’s tissue.
This minimises the need to cut through large amounts of tissue to perform an operation and
helps to reduce the patient’s recovery time.
Figure S13.12 a An optical fibre used for digital audio connections between devices. b Diagram
showing transmission of light through an optical fibre with a core of refractive index n1 and cladding of
refractive index n2. This shows the maximum possible angle θmax to
the axis at which light can be incident, as it meets the fibre boundaries at the critical angle θc.
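The geometry in Figure S13.12b can be turned into a rough estimate of the maximum angle. The Python sketch below assumes that the light enters the flat end of the fibre from air (refractive index 1.00) and applies Snell's law there. The function name and the core and cladding indices are illustrative assumptions and do not come from the text.

```python
import math

def max_entry_angle_deg(n_core, n_cladding, n_outside=1.00):
    """Largest angle to the fibre axis, outside the fibre, for which the ray
    still meets the core-cladding boundary at the critical angle or more."""
    theta_c = math.asin(n_cladding / n_core)   # critical angle at the core wall
    angle_inside = math.pi / 2 - theta_c       # ray's angle to the axis inside the core
    sin_max = n_core * math.sin(angle_inside) / n_outside  # Snell's law at the end face
    if sin_max >= 1:
        return 90.0  # every ray entering the end face is guided
    return math.degrees(math.asin(sin_max))

# Hypothetical core and cladding indices.
print(f"{max_entry_angle_deg(1.50, 1.45):.1f} degrees")
```

When the cladding index is only slightly lower than the core index, the critical angle is large and the guided rays inside the core stay close to the axis.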
Partial reflection
Figure S13.13 Partial reflection. a In this photograph, you can see a reflection from the buildings
on the surface of the water – however, if you were viewing from under the surface of the water,
you would be able to see a refracted image of the buildings, too. We can also see a refracted
image of the bottom of the lake, but from within the water, you would be able to see a reflection
of the pebbles on the upper surface of the water. b When a ray of light is incident on a boundary
between media, some of the light is transmitted (refracted) and some is reflected.
So far, we have discussed refraction and total internal reflection. When light is incident on an
interface between two media at less than the critical angle, most of the light is refracted and
transmitted, but some is reflected too (see Figure S13.13). The amount of light that is reflected
depends on the angle of incidence in a complicated way, but once we get beyond the critical
angle, we know that no light is transmitted – it is all reflected, hence the name total internal
reflection.
You will have experienced partial reflection on a daily basis, but may not have thought
about it. If you look out of a window when it is dark outside, you will see your reflection
in the window. From the outside, though, a passer-by will be able to see you clearly. That’s
one of the reasons we usually draw curtains or blinds across windows at night (although
of course blinds or curtains also have other uses, such as thermal insulation). If there are
streetlights nearby, you may be able to see the reflection and the view outside superimposed
in the window (Figure S13.14). In fact, there will always be some reflection, but we usually
do not notice the reflection so much when it is bright outside. This is because only a small
fraction (<10%) of the light is reflected – so that when it is bright outside, the transmitted
light is much brighter than the reflection.
The same effect is used in so-called ‘two-way glass’. If you set up one side of the glass
with much brighter lighting than the other, it appears to be mirror-like on that side, while
allowing an observer on the dimly lit side to see through. This effect can be enhanced by
partly silvering the side you wish to be reflective (coating the glass with a thin layer of
reflective paint).
We will look again at partial reflection when we discuss thin-film interference.
Figure S13.14 Partial reflection in a window. The inside of the room is clearly visible in the right-
hand half of the window, but it is also possible to see outside through the left-hand half.
Figure S13.15 Transverse waves on a string can be polarised. The polarisation plane of each wave
is shown by the orange line. In a, we see a vertically plane-polarised wave, which passes through a
vertical slit. In b, a horizontally plane-polarised wave is unable to pass through the vertical slit, so
there is no transmission (the wave would be absorbed or reflected). In c, a wave with a polarisation
between horizontal and vertical is partially transmitted, as it has a component which is in the
vertical plane.
Since electromagnetic waves are transverse, they can have a polarisation. For example,
light from the Sun or from a light bulb is described as unpolarised, since it consists of
light in all possible polarisation states superposed. This light can be polarised by a linear
polarising filter, which is often called a Polaroid sheet. This filter only allows light in one
plane of polarisation through. If two Polaroid sheets are placed so that the directions of
polarisation are at right angles to each other, then no light is transmitted (all polarisations
of light are blocked). A Polaroid sheet consists of a transparent polymer in which all of the
long-chain molecules have been aligned in the same direction. The action of this is similar
to the slit in Figure S13.15, except that the waves are absorbed if they are polarised parallel
to the chains of molecules, and transmitted if they are at right angles to the chains. Figure
S13.16 shows the effect of Polaroid sheets on unpolarised light.
Figure S13.16 No light is transmitted in the region where these polarising filters overlap, because
their directions of polarisation are at right angles to each other. Where they are not overlapped,
the light passing through is plane-polarised. There is a reduction in the intensity of light because
not all the light incident on the filter is able to pass through. When unpolarised light of intensity I
is incident on a polarising filter, the transmitted intensity is I/2.
Light reflected from the surface of a still lake is partially plane-polarised. This means more
of the reflected light is plane-polarised parallel to the lake’s surface than would be expected in
unpolarised light. At an angle of approximately 37° to the lake’s surface, the reflected light is
fully plane-polarised parallel to the surface. Reflections from other transparent media, such as
glass, are also partially polarised (the angle at which the reflection is fully polarised is different:
it depends on the refractive index). Why this happens is explained in Figure S13.17. If we take a
photograph of a lake or a window through a polarising filter, we can reduce the intensity of the
reflected light compared to the intensity of the transmitted light. If you have polarising sunglasses
you may have noticed this effect: you may be able to see through a car window when, without the
polarising sunglasses, you would have just seen your reflection in the window (see Figure S13.18).
Figure S13.17 Unpolarised light incident from air on water (refractive index n = 1.33) at 53° to the normal gives a polarised reflected ray, also at 53° to the normal, and a partially polarised refracted ray.
When light is incident on water at 53° to the normal, the fraction of the light that is reflected
from the water is fully plane-polarised. When light enters the surface, it causes electrons in
the surface to oscillate. These oscillations are in the two directions that are perpendicular
to the refracted ray, and are the source of both the refracted and the reflected rays. However,
the oscillations that are in the plane of the diagram (represented by the bars across the ray)
are parallel to the reflected ray. They cannot therefore contribute to it, since the oscillations
making up the reflected ray must be perpendicular to the ray.
For this reason, the reflected ray is polarised and only consists of the oscillations out of the
plane of the diagram (represented by the circles on the ray). The full polarisation of the reflected
ray only occurs when the reflected and refracted ray are at right angles to each other. At other
angles, the oscillations in the plane of the diagram have a component that is perpendicular to
the reflected ray, and so can contribute to it. This gives rise to a partially polarised reflected ray.
Show that the angle of incidence required for the reflected and refracted ray to be at right
angles to each other in Figure S13.17 is 53°.
Let the angle of incidence (and therefore angle of reflection) be θ. The diagram below shows
all the angles in our problem:
Diagram: unpolarised light incident from air on water (refractive index n = 1.33), with the polarised reflected ray and the partially polarised refracted ray.
We know that the angles along one side of the normal must add up to 180°, hence we can
work out that the angle of refraction is 90° – θ.
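$$n_1 \sin\theta_1 = n_2 \sin\theta_2$$
$$1.00 \sin\theta = 1.33 \sin(90^\circ - \theta)$$
Step 3 Recall that sin(90° − θ) = cos θ and tan θ = sin θ / cos θ, and hence solve the equation for θ.
$$\sin\theta = 1.33 \cos\theta$$
$$\tan\theta = 1.33$$
$$\theta = \tan^{-1}(1.33) = 53.1^\circ$$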
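The result of this worked example can be confirmed numerically. The Python sketch below uses a hypothetical function name; it computes the angle of incidence from tan θ = n2/n1 and then uses Snell's law to check that the angles of incidence and refraction add up to 90°, which is the condition for the reflected and refracted rays to be at right angles.

```python
import math

def polarising_angle_deg(n2, n1=1.00):
    """Angle of incidence at which reflected and refracted rays are perpendicular.

    From n1 sin(theta) = n2 sin(90 - theta), so tan(theta) = n2 / n1.
    """
    return math.degrees(math.atan(n2 / n1))

theta = polarising_angle_deg(1.33)
# Angle of refraction from Snell's law, to confirm the two rays are at right angles.
theta_refracted = math.degrees(math.asin(math.sin(math.radians(theta)) / 1.33))
print(f"angle of incidence = {theta:.1f} degrees")
print(f"incidence + refraction = {theta + theta_refracted:.1f} degrees")
```

Changing the argument to the refractive index of another transparent medium gives the corresponding angle for that medium, which is why the fully polarising angle for glass differs from that for water.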
Figure S13.18 An example of photographs taken a without and b with a polarising filter. With the
filter, most of the light reflected from the surface of the water is absorbed, allowing the refracted
light from beneath the water to be seen.
Polarisation of microwaves
Microwaves are another form of electromagnetic radiation, so they can also be polarised. You
can use the equipment shown in Figure S13.19 to investigate the polarisation of microwaves.
The microwaves used have a wavelength of a few centimetres, much larger than the
wavelengths of visible light. Microwaves therefore need a different kind of polarising filter.
The metal grids shown in Figure S13.19 are used as polarising filters for microwaves. When the
electric field of the electromagnetic wave oscillates parallel to the metal bars, it makes electrons
in the metal move up and down the bars. This absorbs the wave energy and the microwaves
are not transmitted. However, when the electric field of the waves oscillates at right angles to
the bars, then the electrons are not moved up and down the bars. (The electrons do not move
across the bars provided the bars are thin compared to the wavelength of the microwaves.) The
wave energy is not absorbed and hence the wave is transmitted.
If the grids are placed so that the metal bars cross each other at right angles, then no microwaves
will be transmitted through the combination of grids. This is the same effect as in Figure S13.16,
where we observed that no light was transmitted through two crossed polarising filters.
Figure S13.19 Apparatus for investigating the polarisation of microwaves (diagram labels: R, T, 20 cm, polarisation grid).
Another source of light that is partially polarised is the light from the daytime sky. Sunlight
is scattered by molecules in the atmosphere and this scattered light is partially plane-
polarised. It is completely plane-polarised when the scattered light is at 90° to the incident
light, as the oscillations in the scattered ray cannot have a component in the direction of the
incident ray. This means that there is only one possible oscillation direction for the scattered
ray in this case. Light reflected from the clouds is unpolarised, so by using a polarising filter
on a camera, we are able to reduce the intensity of the light from the sky and increase the
contrast with the clouds.
Malus’ law
When plane-polarised light falls on a linear polarising filter at an angle θ to the polarisation
direction of the filter, the component of the light that is parallel to the filter’s polarisation
direction is transmitted. The remainder of the light is absorbed by the filter – this energy
must be transferred to thermal energy in the filter.
If the incident polarised light has an amplitude A0 (see Figure S13.20), then the component
of light that is transmitted must have an amplitude given by:
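$$A_t = A_0 \cos\theta$$
Intensity is proportional to the square of the amplitude, so the transmitted intensity is:
$$I_t = I_0 \cos^2\theta$$
where I0 is the intensity of the incident polarised light. This result is called Malus’ law, and the graph of the function It is plotted in Figure S13.21.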
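Malus' law is used later in this section in its intensity form, It = I0 cos²θ, which follows because intensity is proportional to amplitude squared. A minimal Python sketch of both forms is shown below; the function names are assumptions made for illustration and the filter is treated as ideal.

```python
import math

def transmitted_amplitude(a0, theta_deg):
    """Amplitude passed by a polarising filter at angle theta to the light's polarisation."""
    return a0 * math.cos(math.radians(theta_deg))

def transmitted_intensity(i0, theta_deg):
    """Malus' law for polarised light: intensity scales as amplitude squared."""
    return i0 * math.cos(math.radians(theta_deg)) ** 2

# Fraction of the incident polarised intensity transmitted at several angles.
for angle in (0, 30, 60, 90):
    print(angle, round(transmitted_intensity(1.0, angle), 3))
```

At 60° the amplitude is halved but the intensity falls to a quarter, which shows why the two quantities must not be confused.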
Figure S13.20 When plane-polarised light is incident on a linear polarising filter, the component
in the direction of polarisation of the filter is transmitted.
We can use this idea to show that the intensity of polarised light transmitted through a
Polaroid sheet is half the incident intensity of unpolarised light on the sheet:
• Let the intensity of the unpolarised light be I0. Since the light is unpolarised, this
intensity is equally distributed over all polarisation angles.
• To work out how much light is transmitted, we need to add up the contributions
transmitted at each possible polarisation angle in the incident light. The size of the
contribution at a particular angle is the value read from the graph, and we have already
said that the intensity is equally distributed over all angles. If we work out the area under
the graph from 0 to π radians, this will be π times the total transmitted intensity. So,
considering the graph of It in Figure S13.21a, if we find the area under the curve between 0
and π radians, we get a value of I0π/2 (see Figure S13.21b), so the transmitted intensity is I0/2.
• We do not need to calculate this over the full circle from 0 to 2π because polarisation
angles of, say, π/2 and 3π/2 refer to the same polarisation. However, we would get the same
result if we did the calculation between 0 and 2π (and divided by 2π rather than π; look
at the graph and convince yourself of this).
We can also approach this problem more formally using calculus:
• The intensity is distributed evenly over all angles, so the incident intensity over a small
range of polarisation angles θ to (θ + dθ) is:
I0dθ
dI =
π
(where the possible range of polarisation angles is 0 to π).
• The transmitted intensity over a small range of polarisation angles θ to (θ + dθ) is:
I0 cos 2 θ dθ
dIt =
π
• Therefore the total transmitted intensity is the integral (the sum) of this over all possible
polarisation angles, that is:
π π
I0 cos 2 θ dθ
∫
It = dIt =
0
∫
0
π
Cambridge Pre-U Physics
• We use the trigonometric identity $\cos^2\theta = \frac{1}{2}(1 + \cos 2\theta)$ to do this integral, so:
$I_t = \int_0^{\pi} \frac{I_0 (1 + \cos 2\theta)}{2\pi} \, d\theta = \frac{I_0}{2\pi} \Big[\theta\Big]_0^{\pi} = \frac{I_0}{2}$
Note that the cos 2θ term gives zero when integrated over this range: you can either do
the calculation to show this or use symmetry considerations.
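The integral above can also be checked numerically. The following Python sketch is an illustration only: the function name `transmitted_fraction` and the number of steps are assumptions made here, not part of the course material. It adds up the contributions cos²θ dθ/π over polarisation angles from 0 to π.

```python
import math


def transmitted_fraction(n_steps=100_000):
    """Estimate I_t / I_0 by summing cos^2(theta) * d_theta / pi from 0 to pi."""
    d_theta = math.pi / n_steps
    total = 0.0
    for k in range(n_steps):
        theta = (k + 0.5) * d_theta  # midpoint of each small angle range
        total += math.cos(theta) ** 2 * d_theta / math.pi
    return total


fraction = transmitted_fraction()
print(f"I_t / I_0 = {fraction:.6f}")
```

The printed value should be very close to 0.5, in agreement with It = I0/2.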
[Figure S13.21 a Graph of intensity against θ (angle to polarisation axis of filter), for θ from 0 to 2π, with peak value I0. b The same axes, showing the curves I0 cos²θ and I0 sin²θ and the level I0/2.]
You can investigate Malus’ law in the laboratory using some Polaroid sheets and a light level
meter. Take two Polaroid sheets and place them so that their directions of polarisation are at
90° to each other (‘crossed Polaroids’). Place another sheet between them and vary the angle
of this middle sheet’s polarisation direction to the polarisation direction of the top sheet
(see Figure S13.22). Use the light meter to investigate how the intensity of transmitted light
changes as you change the angle.
Figure S13.22 Two crossed Polaroid sheets with a third polariser inserted between them at an angle.
Two pieces of polaroid sheet are placed with their directions of polarisation at 90° to each
other, so that no light passes through. A third piece is placed between them so that its
polarisation direction is at 45° to the polarisation direction of the top sheet. Unpolarised
light of intensity I 0 is incident on the stack of sheets. What fraction of this light is
transmitted through the stack?
Step 1 Determine how much light passes through the first sheet.
We know that the intensity of the plane-polarised light transmitted by the polarising
sheet is half the intensity of the unpolarised light incident on it. So after the first
sheet, we are left with an intensity:
$I_1 = \frac{I_0}{2}$
Step 2 Determine how much of that light passes through the second sheet.
Malus’ law tells us that if polarised light is incident on a polarising filter, the
transmitted intensity is given by:
$I_t = I_0 \cos^2\theta$
The second sheet has a polarisation direction at 45° to the polarisation direction of
the light transmitted through the first sheet. So the intensity after the second sheet is
given by:
$I_2 = I_1 \cos^2 45^\circ = \frac{I_1}{2} = \frac{I_0}{4}$
Step 3 Determine how much of that light passes through the third sheet.
The polarisation of light passing through the second sheet is also at 45° to the
polarisation direction of the third sheet, so the same reasoning as in Step 2 applies,
and the intensity passing through the third sheet is:
$I_3 = I_2 \cos^2 45^\circ = \frac{I_2}{2} = \frac{I_1}{4} = \frac{I_0}{8}$
So one-eighth of the incident light intensity is transmitted through the stack of sheets.
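The three steps above can be repeated for any stack of ideal polarisers. The Python sketch below is a minimal illustration, assuming ideal sheets with no absorption; the function name `stack_transmission` and the convention of listing each sheet's polarisation direction in degrees are choices made here for the example.

```python
import math


def stack_transmission(angles_deg):
    """Fraction of unpolarised light passed by ideal polarisers.

    angles_deg lists the polarisation direction of each sheet, in order.
    """
    fraction = 0.5  # the first sheet transmits half of the unpolarised light
    for previous, current in zip(angles_deg, angles_deg[1:]):
        # Malus' law for each following sheet
        fraction *= math.cos(math.radians(current - previous)) ** 2
    return fraction


crossed = stack_transmission([0, 90])
with_middle_sheet = stack_transmission([0, 45, 90])
print(f"crossed sheets: {crossed:.3f}")
print(f"with a 45 degree sheet between them: {with_middle_sheet:.3f}")
```

With the middle sheet at 45° the result is one-eighth, as in the worked example; changing the middle angle shows how the transmitted fraction varies.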
Applications of polarisation
Certain complex molecules have a ‘handedness’. This means that they exist in two forms
which are mirror images of each other, called enantiomers. These mirror images cannot be
superimposed by rotating the molecules, in the same way as you cannot lay one hand on top
of the other and match up the fingers. If the substance is dissolved to form a solution:
• a solution of one of the two enantiomers of a molecule will rotate the plane of polarisation
of plane-polarised light clockwise
• a solution of the other enantiomer will rotate plane-polarised light anticlockwise.
Molecules that do this are said to be optically active. A solution containing a 1:1 mixture
of the two enantiomers will not rotate the plane of polarisation of the light, since the two
contributions to the rotation from each enantiomer cancel out. The amount of rotation of the
light that passes through a solution can be measured. If we have a solution consisting of only
one of the enantiomers, then this measurement can be used to determine its concentration.
Glucose (a sugar) is an optically active molecule, and its concentration in solution can be
determined in this way. This is used in the food industry.
The polymer molecules in certain plastics such as Perspex can also rotate the plane of
polarisation of light (see Figure S13.23a). The extent to which a plastic rotates the plane of
polarisation depends on the stress exerted on the polymer sample and the colour of the light.
This leads to concentrations of stress showing up as colourful patterns under plane-polarised
light. This is used by engineers to investigate stresses in structures, by building and loading a
Perspex model and examining it under plane-polarised light.
Thin slices of rocks, thin enough for light to be transmitted through the mineral crystals,
allow us to investigate the optical properties of those crystals (see Figure S13.23b). Many
mineral crystals rotate the plane of polarised light in a similar way to Perspex, and the extent
of that rotation is frequency (colour) dependent. This property is known as birefringence. By
examining the crystals under polarised light, it is possible to identify the minerals present in
the rock. Usually, polarised light is shone through the crystals and a second polarising filter,
placed at 90° to the plane of polarisation of the first one, is placed at the eyepiece or detector
of a microscope. Only light that has had its plane of polarisation rotated can be seen at the
eyepiece or detector.
Figure S13.23 a A plastic protractor viewed in plane-polarised light. The patterns of stress
that were locked in to the structure of the plastic as the shape was formed and cooled are visible.
b The lower image shows a thin-section (a very fine slice) of the rock in the upper image, viewed in
cross-polarised light. Birefringence colours are visible.
Summary
■ The law of reflection: when rays are reflected from a surface, the incident and
reflected rays are at equal angles to the normal at the reflection point.
■ When a wave travels through a less dense medium and is reflected off a more dense
medium, there is a 180° (π radians) phase shift in the reflected wave (the wave is
inverted). This also applies to light, where a more optically dense medium is one
with a higher refractive index.
■ When a wave travels between two media with different wave speeds, it is refracted
according to the law of refraction:
$n_1 \sin\theta_1 = n_2 \sin\theta_2$
■ Many transparent materials, such as glass or plastic, are dispersive, which means
that different frequencies of light travel through them at different speeds – they have
a different refractive index for different colours of light.
■ The critical angle is the angle of incidence for a ray crossing the boundary from a
medium of higher refractive index to one of lower refractive index for which the law
of refraction predicts an angle of refraction of 90°.
■ No refracted ray can form and the incident ray undergoes total internal reflection at
all angles greater than or equal to the critical angle.
■ When a wave is refracted, not all of the wave is transmitted – some is reflected too.
This is known as partial reflection.
■ Transverse waves may be polarised, which means that their oscillations are confined
to a particular plane.
■ Polarising filters only allow one polarisation of light to pass through. If light is
incident on them at an angle θ to the filter’s polarisation direction, then only a
component of that light will pass through. If the incident wave has intensity I0 the
transmitted wave has intensity:
$I_t = I_0 \cos^2\theta$
End-of-chapter questions
S13: Waves and Optics
S13.1
A semi-circular convex mirror is attached to the wall.
[Diagram: the convex mirror attached to the wall, with viewer position A.]
Using a ray diagram, explain why a viewer positioned at A has almost a 180° field of view in the mirror
(that is, they can almost see directly along the wall). [4]
S13.2
a Define the term critical angle. [2]
b Explain what happens when light is incident on an interface at an angle greater than the critical
angle. [2]
d Using these diagrams, and your knowledge of total internal reflection, explain why the cut of a
diamond is very important if you wish to have a ‘brilliant’ gemstone (one which appears to be
illuminated from within). [3]
e A ray of light meets a flat surface of the diamond at an angle of 85° to the surface. Calculate the
angle at which the light leaves the diamond into the air. Take the refractive index of the air to
be 1.00. [3]
S13.3
a Explain how an optical fibre is able to transmit light efficiently over a long distance. Use a diagram in
your answer. [3]
b An optical fibre consists of an inner cylindrical core, through which the light is transmitted, and a
cladding with a lower refractive index. The core has a refractive index of 1.50, and the cladding a
refractive index of 1.45. Calculate the critical angle within the fibre. [2]
c Calculate the maximum angle θmax, relative to the axis of the fibre, at which the light may enter and be
[Diagram: optical fibre with core (refractive index n1) and cladding (refractive index n2), marking the angles θc, 90° − θc and θmax.]
S13.4
In a seismic survey, an explosion is triggered. The P-waves (compressional waves, like sound) generated by
the explosion travel through the Earth, and are detected by seismometers (sensors). The diagram below
shows some possible paths the wave can take.
[Diagram: an explosion and the paths of the direct wave, the reflected wave and the refracted wave. Labels: 10 km, 1 km, 10°, α, wave speed 6 km/s, wave speed 8 km/s.]
a At a distance of 10 km from the source, calculate the difference in arrival time between the direct
wave and the reflected wave. [3]
b When a light wave travels from one medium to another, it obeys Snell’s law relating the angles and the
refractive indices. Write down Snell’s law and then express it in terms of the speed of light in the two
media, v1 and v2. [3]
c Seismic waves also obey a version of Snell’s law. Using the expression you derived in b, calculate
the angle of refraction α in the diagram. [2]
S13.5
An interview room in a police station is set up as shown in the diagram below.
[Diagram: the interview room and the viewing room, separated by a glass window.]
Explain carefully why someone in the viewing room is able to see into the interview room, but someone in the
interview room cannot see into the viewing room and only sees their own reflection in the glass. [3]
S13.6
S13.7
A Polaroid sheet, when held in front of an unpolarised light source, reduces the intensity of the light by
a factor of 2.
Two Polaroid sheets are placed ‘crossed’, so that their directions of polarisation are perpendicular to each
other. No light passes through the two sheets.
a If the angle θ is set at 45°, what fraction of incident light does the combination of Polaroid sheets
allow through? [4]
b Sketch how the light intensity varies with θ. [2]
$\cos^2(45^\circ) = \frac{1}{2}$
Learning Outcomes
■ determine the resultant amplitude when two waves superpose, making use of phasor
diagrams
■ recall that waves can be diffracted and that substantial diffraction occurs when the size of the
gap or obstacle is comparable to the wavelength
■ recall qualitatively the diffraction patterns for a slit, a circular hole and a straight edge
■ recognise and use the equation nλ = bsinθ to locate the positions of destructive interference
for single-slit diffraction
■ recognise and use the Rayleigh criterion θ ≈ λ/b for the resolving power of a single aperture
Figure S14.1 Phasor arrows indicate the point in the cycle that we have reached, and allow us to
calculate the wave displacement at that point.
If we wish to add up the displacements from two different waves, then we can do this very
straightforwardly with phasors. All we need to do is find the vector sum of the two phasor
arrows (add them ‘tip to tail’). The resultant phasor gives us the amplitude and phase of the
superposition of the two waves. If the two waves are in phase, the resultant wave has the
same phase as the two original waves but the amplitude is the sum of their amplitudes. If
the two waves are in antiphase, then the amplitude of the resultant wave is the difference
between the amplitudes of the two original waves, and the phase is the same as the wave
that had a larger amplitude. In the case where the waves have the same amplitude, then the
two waves ‘cancel out’. For waves with a phase difference in between these two extremes,
then we go through the procedure of adding the phasor arrows. If the two waves have the
same amplitude, then you will find that the phase of the resultant wave is the average of the
phases of the two original waves. The phasor diagrams for these three cases are shown in
Figure S14.2.
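Adding phasors ‘tip to tail’ can be tried out numerically, because Python's built-in complex numbers add in the same way as arrows in a plane. The sketch below is an illustration only: the function name `add_phasors` and the example amplitudes and phases are assumptions chosen here to match the three cases described above.

```python
import cmath
import math


def add_phasors(amplitude_1, phase_1, amplitude_2, phase_2):
    """Return (amplitude, phase) of the vector sum of two phasors."""
    resultant = cmath.rect(amplitude_1, phase_1) + cmath.rect(amplitude_2, phase_2)
    return abs(resultant), cmath.phase(resultant)


# In phase: the amplitudes add.
in_phase = add_phasors(3.0, 0.0, 2.0, 0.0)
# In antiphase: the amplitude is the difference, with the phase of the larger wave.
antiphase = add_phasors(3.0, 0.0, 2.0, math.pi)
# Equal amplitudes: the resultant phase is the average of the two phases.
equal = add_phasors(1.0, 0.2, 1.0, 1.0)

for label, (amplitude, phase) in [("in phase", in_phase),
                                  ("antiphase", antiphase),
                                  ("equal amplitudes", equal)]:
    print(f"{label}: amplitude = {amplitude:.3f}, phase = {phase:.3f} rad")
```

Phases are in radians; the third case gives a phase of 0.6 rad, the average of 0.2 rad and 1.0 rad.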
[Phasor diagrams: a Oscillations in phase; b Oscillations in antiphase; c Oscillations with an intermediate phase difference. In each case the resultant phasor is III = I + II.]
Figure S14.2 Using phasor arrows to determine the superposition of two waves.
This idea of adding phasor arrows as vectors is very useful when considering situations where
there are multiple contributions to the wave displacement at a particular point, for example
in a diffraction grating where we are adding up contributions that come from many of the
lines in the grating. We will use phasor diagrams to analyse the diffraction pattern for N-slits
(N > 2) and the diffraction grating later.1
Figure S14.3 Path difference between two interfering rays in double-slit diffraction.
If constructive interference occurs, then the path difference between the two rays must be
an integer multiple of the wavelength: i.e. for an integer n, the condition for constructive
interference is:
$\text{path difference} = n\lambda = a \sin\theta$
If destructive interference occurs, then the path difference between the two rays must be
(n + ½) wavelengths (leading to the waves being out of phase by half a wavelength). The
condition for destructive interference is therefore:
$\text{path difference} = \left(n + \frac{1}{2}\right)\lambda = a \sin\theta$
1 For those of you that have come across complex numbers in your mathematics, the idea of phasor diagrams
leads neatly into representing waves by complex numbers (which also add as vectors in the Argand plane).
The equation for constructive interference above allows us to calculate the angular
separation of two maxima in the interference pattern. If we consider the geometry of the
situation, we can derive the equation for the separation on the screen of the maxima, as given
in Chapter 14 of the Coursebook. We set up the double slits so that they are a distance D
away from the screen. The geometry is shown in Figure S14.4.
Figure S14.4 The geometry of double-slit diffraction.
The position of the nth order maximum is given by nλ = a sinθ. However, since distance D is
large, angle θ is relatively small, and therefore (with θ in radians):
$\sin\theta \approx \tan\theta \approx \frac{x}{D}$
Substituting this into the equation giving the position of our nth order maximum, we get:
$n\lambda = \frac{ax}{D}$
n increases by 1 between successive maxima, so the separation between maxima is given by:
$x = \frac{D\lambda}{a}$
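As a rough numerical illustration of this result, the Python sketch below compares the small-angle fringe separation Dλ/a with the position of the first maximum found without the approximation. The values used (a 633 nm laser, slits 0.25 mm apart, a screen 2.0 m away) and the function names are assumptions chosen for the example.

```python
import math


def fringe_separation(wavelength, slit_separation, screen_distance):
    """Separation of neighbouring maxima, x = D * lambda / a (small-angle result)."""
    return screen_distance * wavelength / slit_separation


def exact_position(order, wavelength, slit_separation, screen_distance):
    """Position of the nth maximum from n*lambda = a*sin(theta) and x = D*tan(theta)."""
    theta = math.asin(order * wavelength / slit_separation)
    return screen_distance * math.tan(theta)


wavelength = 633e-9        # m (assumed)
slit_separation = 0.25e-3  # m (assumed)
screen_distance = 2.0      # m (assumed)

separation = fringe_separation(wavelength, slit_separation, screen_distance)
first_maximum = exact_position(1, wavelength, slit_separation, screen_distance)
print(f"small-angle separation: {separation * 1e3:.4f} mm")
print(f"exact first maximum:    {first_maximum * 1e3:.4f} mm")
```

For these values the two results differ by only a few parts in a million, which is why the small-angle approximation is normally good enough for double-slit experiments.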
Therefore, the phase difference between the two rays that have travelled at an angle θ to the
normal to the slits is:
$\phi = a \sin\theta \times \frac{2\pi}{\lambda} = \frac{2\pi a x}{\lambda D}$
where in the second equality we have used the approximation sinθ ≈ tanθ ≈ x/D.
Constructive interference occurs when the phase difference is an integer multiple of
2π (φ = 2mπ ), and if the amplitude of one of the waves is A0, the amplitude here will be 2A0.
Destructive interference occurs when the phase difference is an odd integer multiple of
π (φ = ( 2m + 1) π ), and the amplitude at these points will be 0. At phase differences between
these two extremes, the amplitude will lie between 0 and 2A0. These situations are shown in
Figure S14.5.
Figure S14.5 Adding up phasors for double-slit interference. a shows constructive interference,
b shows destructive interference and c shows a phase difference φ which lies between
constructive and destructive interference.
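If we use the cosine rule to calculate the amplitude A of the addition of two phasors, each of
amplitude A0, with a phase difference φ between them, we find:
$A^2 = 2A_0^2 (1 + \cos\phi) = 4A_0^2 \cos^2\left(\frac{\phi}{2}\right)$
Since intensity is proportional to the square of the amplitude, the intensity on the screen is:
$I = I_0 \cos^2\left(\frac{\phi}{2}\right) = I_0 \cos^2\left(\frac{\pi a x}{\lambda D}\right)$
where I0 is the maximum intensity seen on the screen. This function is shown in Figure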
S14.6. In reality, the finite width of the slit means that the pattern decays in intensity as you
move away from the centre, so the central maximum does in fact have the largest intensity.
The actual pattern is in fact a combination of the double-slit pattern we have derived here
and the single-slit pattern we will investigate shortly.
[Graph: relative intensity I/I0, from 0 to 1, against position x, with the x axis marked at 0, ±λD/a and ±2λD/a.]
Figure S14.6 The intensity of the double-slit pattern as a function of position on the screen.
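The shape of this graph can be reproduced with a short calculation. Assuming that intensity is proportional to the square of the resultant amplitude, adding two phasors of amplitude A0 with phase difference φ gives I/I0 = cos²(φ/2). The Python sketch below checks this against a direct phasor sum; the function names and the numerical values are assumptions chosen for illustration.

```python
import cmath
import math


def intensity_from_phasors(phi):
    """Relative intensity I/I0 from adding two unit phasors with phase difference phi."""
    resultant = 1 + cmath.exp(1j * phi)  # amplitudes measured in units of A0
    return abs(resultant) ** 2 / 4       # the largest possible amplitude is 2*A0


def intensity_on_screen(x, wavelength, slit_separation, screen_distance):
    """Relative intensity at position x, using phi = 2*pi*a*x / (lambda*D)."""
    phi = 2 * math.pi * slit_separation * x / (wavelength * screen_distance)
    return math.cos(phi / 2) ** 2


wavelength, slit_separation, screen_distance = 633e-9, 0.25e-3, 2.0  # assumed values
spacing = screen_distance * wavelength / slit_separation             # lambda*D/a

for multiple in (0.0, 0.25, 0.5, 1.0):
    x = multiple * spacing
    value = intensity_on_screen(x, wavelength, slit_separation, screen_distance)
    print(f"x = {multiple:.2f} lambda*D/a: I/I0 = {value:.3f}")
```

The maxima fall at whole-number multiples of λD/a and the minima halfway between them, as in Figure S14.6.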
S14.3 Multiple-slit interference and diffraction gratings
Now we will investigate what happens when we have more than two slits. Again we will use
the idea of phasor diagrams to work out the intensity of the pattern that will be seen on the
screen.
Imagine that we now have three slits, the centre of each slit being separated from the
centre of the next slit by a distance d (note that in the section above we called this a to match
with the notation given in Chapter 14 of the Coursebook; here we call the slit separation d to
match with the equation for the diffraction grating given in Chapter 14 of the Coursebook).
The maxima of the pattern that results correspond to the phasors from all three slits being
lined up at a particular point on the screen (i.e. the phase difference between the rays is a
multiple of 2π). However, in between these primary maxima, there is a secondary maximum,
which corresponds to two of the phasors being in phase, and the other being out of phase.
There are also two minima between the primary maxima. These occur when the phasors add
up as a closed figure: with three slits, this is a triangle. The phasor diagrams and the graph of
the resulting intensity on the screen for three-slit interference are shown in Figure S14.7.
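[Graph: intensity against position x, with the x axis marked at ±λD/2d and ±λD/d. Phasor diagrams are shown for the points labelled 2 (φ = 2π/3), 3 (φ = π) and 4 (φ = 4π/3).]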
Figure S14.7 The diffraction pattern for three-slit interference. φ is the phase difference between
neighbouring slits, and D is the distance to the screen.
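The phasor addition for several slits can be carried out numerically. The Python sketch below is an illustration only, assuming N identical, very narrow slits with a phase difference φ between neighbouring slits; the function name `relative_intensity` is a choice made here. Intensity is taken as the square of the resultant amplitude, scaled so that a primary maximum has the value 1.

```python
import cmath
import math


def relative_intensity(n_slits, phi):
    """Intensity of an N-slit pattern relative to a primary maximum.

    phi is the phase difference between neighbouring slits, in radians.
    """
    resultant = sum(cmath.rect(1.0, k * phi) for k in range(n_slits))
    return abs(resultant) ** 2 / n_slits ** 2


for label, phi in [("0", 0.0), ("2pi/3", 2 * math.pi / 3),
                   ("pi", math.pi), ("4pi/3", 4 * math.pi / 3)]:
    print(f"three slits, phi = {label}: I/I0 = {relative_intensity(3, phi):.3f}")
```

For three slits the phasors close into a triangle at φ = 2π/3 and φ = 4π/3, giving zero intensity, while at φ = π two phasors cancel and the third is left over, so the secondary maximum has one-ninth of the intensity of a primary maximum.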
Now consider what happens as we add more slits. With four slits, we get two secondary
maxima between primary maxima, and minima between each of the maxima. However,
because the amplitude at the primary maximum now comes from four phasors lined up in
phase, but at the secondary maxima the in phase contribution is smaller (see the question
below), we notice that the primary maxima are brighter compared to the secondary maxima.
The first minimum is also closer to the primary maximum than it was in the three-slit
pattern. This trend continues, and as we add more and more slits, the primary maxima
become brighter and narrower and the secondary maxima gradually disappear. When we
have a large number of slits, we have a diffraction grating, which has bright, narrow maxima
at the positions of the primary maxima in the n-slit pattern. This leads to the diffraction
grating equation, relating the nth order maximum to the slit separation d:
nλ = d sinθ
Because the peaks are so narrow for a grating, we are better able to distinguish
lines of different wavelength. So a diffraction grating can be used in an instrument such as a
spectrometer to measure the wavelength of spectral lines to a high degree of accuracy.
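As an illustration of the grating equation, the Python sketch below lists the angles of the maxima for one wavelength. The grating (300 lines per mm) is a hypothetical example, the wavelength of 589 nm is that of yellow sodium light, and the function name `grating_angles` is an assumption made here.

```python
import math


def grating_angles(wavelength, slit_separation):
    """Angles in degrees of the maxima from n*lambda = d*sin(theta), n = 0, 1, 2, ..."""
    max_order = int(slit_separation / wavelength)  # sin(theta) cannot exceed 1
    return [math.degrees(math.asin(n * wavelength / slit_separation))
            for n in range(max_order + 1)]


lines_per_mm = 300                    # hypothetical grating
slit_separation = 1e-3 / lines_per_mm  # d in metres
angles = grating_angles(589e-9, slit_separation)

for order, angle in enumerate(angles):
    print(f"order {order}: theta = {angle:.2f} degrees")
```

The highest order that appears is the largest whole number n for which nλ/d is less than 1.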
question
14.1 Use the ideas shown in Figure S14.7 for three-slit interference to sketch the phasor
diagrams and intensity of the diffraction pattern for four-slit interference.
Figure S14.8 Huygens’ principle: each point on a wavefront is a point source of wavelets (semi-
circular waves). These wavelets superpose to form future wavefronts.
This means that we can treat points along the single slit as being sources of secondary
wavelets, or equivalently we can consider the effect of many rays, each equidistant from the
next ray, coming from the slit. The result of this analysis is shown in Figure S14.9. In order
to get destructive interference, we have to have a path difference of a wavelength (a phase
difference of 2π) across the width of the slit.
[Diagram: a single slit of width b, with rays leaving at angle θ; the path difference across the whole slit is b sin θ.]
Figure S14.9 Diffraction at a single aperture.
We find that we get minima at angle θ from the centre of the pattern, where
nλ = bsinθ
In this equation, b is the width of the slit. If you look at the pattern in Figure S14.9 you
will see that there is a maximum at the centre of the pattern, as you might expect. So this
equation is only valid when n ≠ 0. Remember, of course, that in an experiment, you might
measure the angular distance between the first minima on either side of the central peak
(since this is easier to determine than the position of the centre of the pattern). This angle
would be twice the angle from the centre of the pattern. Therefore, you must be careful to
use the correct angle when doing calculations. You must also be careful with calculations
involving double slits or diffraction gratings.
A more detailed analysis of the addition of the phasors for each ray, in the limit where the
spacing between the rays goes to zero, gives us the following expression for the intensity of
the pattern as viewed on the screen (relative to the maximum intensity I0):
$I = I_0 \, \frac{\sin^2\left(\frac{\pi b x}{\lambda D}\right)}{\left(\frac{\pi b x}{\lambda D}\right)^2}$
If you have a graphical calculator you could plot this function to verify that it looks like what
we have drawn in Figure S14.9. You are not expected to know this formula.
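If you do not have a graphical calculator, the same function can be evaluated with a few lines of Python. This is a sketch under stated assumptions: the function name `single_slit_intensity` and the numerical values (633 nm light, a 0.20 mm slit, a screen 2.0 m away) are chosen here for illustration.

```python
import math


def single_slit_intensity(x, wavelength, slit_width, screen_distance):
    """Relative intensity I/I0 of the single-slit pattern at position x on the screen."""
    beta = math.pi * slit_width * x / (wavelength * screen_distance)
    if beta == 0:
        return 1.0  # limiting value at the centre of the pattern
    return (math.sin(beta) / beta) ** 2


wavelength, slit_width, screen_distance = 633e-9, 0.20e-3, 2.0  # assumed values
# First minimum from n*lambda = b*sin(theta) with n = 1 and sin(theta) ~ x/D
first_minimum = wavelength * screen_distance / slit_width

for multiple in (0.0, 0.5, 1.0, 1.5, 2.0):
    value = single_slit_intensity(multiple * first_minimum,
                                  wavelength, slit_width, screen_distance)
    print(f"x = {multiple:.1f} x first minimum: I/I0 = {value:.4f}")
```

The output shows the bright central maximum, zeros at whole-number multiples of λD/b, and much fainter maxima in between.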
A real double-slit interference pattern uses slits that have a finite width (our previous
analysis assumed that the slits were point sources of waves). The slits are narrower than the
separation between them, and this means that the distance between minima of the single-slit
pattern of one of these slits would be much wider than the separation between minima of the
double-slit pattern we derived earlier. In order to work out what the pattern actually looks
like, we use the single-slit pattern as an ‘envelope’ over the double-slit pattern. So the double-
slit pattern ends up brighter where the single-slit pattern has a maximum, and disappears
where the single-slit pattern would have a minimum. Since the single-slit pattern decays
away quite quickly off-axis, this explains why, if you do the double-slit experiment, you will
only see a bright diffraction pattern near the centre of the screen. You should be able to make
out the minima which correspond to the single-slit pattern and work out the width of the
slits in the apparatus you are using. You will see the pattern again outside of these minima,
but it will be fainter. The question below asks you to work through what you might see in
such a case.
question
14.2 Light of wavelength 500 nm is incident on a double slit. The slit separation is 0.50 mm,
and the width of each slit is 0.10 mm. The diffraction pattern is viewed on a screen at a
distance of 5.0 m from the slits.
a Calculate the fringe separation in the double slit interference pattern, assuming that
the slits are point sources of light.
b Calculate the position of the first minimum of the diffraction pattern of a single slit of
width 0.10 mm.
c Use your answers to a and b to sketch the diffraction pattern for these double slits.
In the case of a diffraction grating, the single-slit envelope to the diffraction pattern can
cause us to have ‘missing orders’ – which is where one of the maxima of the pattern we
would expect from the grating lines up with one of the minima of the single-slit pattern. The
expected maximum disappears and we get a ‘gap’ in the pattern.
However, since we are resolving two objects that are close together, the angle θ is small, so
we can use the small angle approximation sinθ ≈ θ . This gives us the Rayleigh criterion: an
aperture of size b allows us to resolve two point sources of light of wavelength λ if they are
separated by an angle greater than
$\theta \approx \frac{\lambda}{b}$
Figure S14.10 The light from two point sources which has passed through an aperture of width
b can be resolved if the sources are separated by a minimum angular distance of θ = λ/b. This
corresponds to the maximum of the diffraction pattern of one source lining up with the first
minimum of the other source.
There are other factors which affect the resolution of a telescope, such as the quality of
the optical components (lenses and mirrors), and on Earth, the effects of the atmosphere.
Telescopes are therefore often placed on mountains to reduce distortion due to the
atmosphere. The Hubble Space Telescope is in orbit to avoid all atmospheric effects. This
telescope is quite close to being limited in its resolution only by the diffraction limit.
Most optical instruments have a circular aperture. This changes the analysis presented above,
but it turns out that it only changes the Rayleigh criterion slightly: θ ≈ 1.22 λ/b. Remember
that this equation gives us the angular distance between the centre of the pattern and the
first minimum. The diffraction pattern from a circular aperture is also circular: it has a
central maximum, surrounded by a minimum that takes the form of a ring. This in turn
is surrounded by further maxima and minima, with the maxima getting much less bright
the further you go from the centre. An example of the diffraction pattern from a circular
aperture is shown in Figure S14.11.
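The Rayleigh criterion is simple to evaluate for a given instrument. The Python sketch below is an illustration only: the function name `rayleigh_angle` is an assumption, and the example uses the 2.4 m mirror of the Hubble Space Telescope with an assumed wavelength of 550 nm for visible light.

```python
import math


def rayleigh_angle(wavelength, aperture, circular=True):
    """Smallest resolvable angle in radians: lambda/b for a slit, 1.22*lambda/b for a circle."""
    factor = 1.22 if circular else 1.0
    return factor * wavelength / aperture


theta = rayleigh_angle(550e-9, 2.4)        # assumed wavelength, 2.4 m mirror
arcseconds = math.degrees(theta) * 3600    # 1 degree = 3600 arcseconds
print(f"theta = {theta:.3e} rad = {arcseconds:.3f} arcseconds")
```

This is the diffraction limit only; as noted above, the quality of the optics and the atmosphere can make the real resolution worse.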
questions
14.3 The pupil in your eye has a diameter of about 5 mm. The wavelength of light is
approximately 500 nm.
a What is the limit on the angular resolution of your eye set by the size of the pupil?
b What width does this correspond to on the retina, which is approximately 25 mm
behind the pupil?
c The cones on your retina are separated by about 0.003 mm. Comment on this value,
in light of your answer to part (b).
14.4 The 300 m diameter Arecibo radio dish in Puerto Rico is used with 100 mm radio waves.
Estimate the angular resolution that can be achieved at this wavelength.
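[Diagram a: the wavefronts of an incident wave meet the edge of an absorbing screen; secondary sources on the wavefronts send waves into the shadow region. Graph b: intensity against distance, with the edge of the geometrical shadow marked.]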
Figure S14.12 a Huygens’ principle (treating each wavefront as a source of spherical wavelets)
tells us that when a wavefront meets an edge, there will be some diffraction into the shadow
region. b Graph showing the wave intensity as we cross the geometric edge. Notice that as well as
having some intensity in the shadow region, we also get maxima and minima in intensity in the
non-shadowed region – a diffraction pattern. c The diffraction pattern for a straight edge.
Summary
■ Phasor diagrams can be used to track the phase and amplitude of a wave. Phasors
are added like vectors, and the sum of the phasors from two or more different waves
gives us the amplitude and phase of the superposition of those waves.
■ Using phasor diagrams, we can work out the positions of constructive and
destructive interference in the diffraction patterns from double slits, multiple slits
and single slits.
■ When a wave passes through a single slit that is of comparable size to its wavelength,
it is diffracted. The positions of the minima in the diffraction pattern are given by the
equation nλ = bsinθ , where b is the width of the slit.
■ The diffraction patterns produced by double slits and diffraction gratings are a
combination of the interference pattern for that arrangement of slits and the
single-slit diffraction pattern.
■ Diffraction through an aperture limits the resolution of optical instruments such
as telescopes. We can use the Rayleigh criterion θ ≈ λ/b to work out the minimum
angular distance that can be resolved by an optical instrument.
■ Diffraction also happens at the edge of a barrier. There is both diffraction into the
geometric shadow region and a series of maxima and minima in the non-shadow
region, close to the edge.
S16: Radioactivity
Learning Outcomes
■ show an awareness of the existence and main sources of background radiation
■ recall that the standard model classifies matter into three families: quarks (including up and
down), leptons (including electrons and neutrinos) and force carriers (including photons
and gluons)
■ recall that matter is classified as baryons and leptons, and that baryon numbers and lepton
numbers are conserved in nuclear transformations
question
16.1 A proton has about 2000 times the mass of an electron and so the mass of a hydrogen
atom can be assumed to be the same as the mass of the proton. Use the data given for
the radius of a proton and an atom to find the ratio of the densities of hydrogen and the
bare proton.
S16.1 Background radiation
We are constantly surrounded by radiation, emitted by radioactive substances in the
environment. This naturally occurring radiation is called background radiation and it
comes from a number of sources including the following.
• Cosmic rays – These are high energy particles from the Sun and other stars which hit our
atmosphere. Some reach ground level, and others interact with atoms in the atmosphere,
changing their nuclei. This is how radioactive carbon-14 (used in carbon dating) is formed.
• Radon – This is a radioactive gas, present in very small quantities in the air and which
can also build up in rocks such as granite. Radon levels vary greatly from place to place
around the world, depending on the underlying geology.
• Terrestrial – Most rocks and soil contain radioactive substances such as uranium in small
quantities. These substances also find their way into building materials.
• Biological – There are radioactive isotopes of many of the atoms of elements that plants
and animals use. As a result, our own bodies and the food we eat are slightly radioactive.
Carbon-14 and potassium-40 are specific examples. The high levels of potassium in nuts
and bananas have led to the joking suggestion that they be used as a unit for radioactivity!
• Nuclear testing and accidents – Open-air tests of nuclear weapons through the 1950s and
1960s and the small number of leaks from nuclear power stations have released radioactive
substances into the atmosphere or environment.
• Medical – Some people count exposure to radiation through medical procedures as part
of background radiation, although it is unevenly distributed and people are usually aware
of the exposure, whereas for the other forms they are not.
Because living things have evolved in an environment of low-level nuclear radiation, all
living things have a certain tolerance for very low doses. In addition, the background level
allows us to set a gauge to measure other radiations by. For example, you may have seen
experiments that demonstrate radioactivity. If you were told that the additional exposure
to radiation was less than 1% of the annual total background radiation you would normally
experience, you would probably find that acceptable. If the exposure turned out to double the
annual background radiation, you may think it was not worth the risk. As it is impossible to
have zero radiation exposure anywhere on Earth, the average background radiation sets a
reasonable level for safe working.
Table S16.1 shows that Cornwall in the UK has a particularly high level of radon, which
otherwise contributes about 50% of a typical person’s background exposure. A transatlantic
flight increases the exposure to cosmic radiation, and working in a nuclear power station
adds very little more to one’s exposure than two flights.
$e^+ + e^- \to p^+ + p^-$
where e+ is the positron and p− (which you may see written as p̄) is the antiproton. Neither of
the particles on the left is a baryon, so their baryon number is zero. On the right, the baryon
numbers are +1 and −1, for a total of zero as well.
We see a second example of conservation in the beta-decay equation. When an electron (a
lepton) is produced, so is an antineutrino. As there are no leptons on the left of the equation
(just one quark) there must be zero total lepton number on the right. This is achieved by
having two particles produced as well as a change of quark. One is the electron, a lepton with
lepton number of +1, and the other is an antineutrino, with a lepton number of −1. Again,
the total lepton number remains zero. In β+ decay, the positron, with a lepton number of
−1 (an antiparticle) is accompanied by a neutrino (lepton number of +1). This then ensures
conservation of lepton number as well.
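The bookkeeping in these examples can be sketched in a few lines of Python. This is only an illustration: the particle labels and the helper names `totals` and `is_conserved` are assumptions made for this sketch, and beta decay is written at the level of whole nucleons (n → p + e− + antineutrino) rather than quarks.

```python
# Baryon number B and lepton number L for the particles mentioned in the text.
# The dictionary keys are illustrative labels, not standard notation.
QUANTUM_NUMBERS = {
    "e-": (0, 1), "e+": (0, -1),          # electron, positron
    "nu": (0, 1), "anti-nu": (0, -1),     # neutrino, antineutrino
    "p": (1, 0), "anti-p": (-1, 0),       # proton, antiproton
    "n": (1, 0),                          # neutron
}

def totals(particles):
    """Return (total baryon number, total lepton number) for a list of labels."""
    baryon = sum(QUANTUM_NUMBERS[p][0] for p in particles)
    lepton = sum(QUANTUM_NUMBERS[p][1] for p in particles)
    return baryon, lepton

def is_conserved(before, after):
    """True if both baryon number and lepton number are unchanged."""
    return totals(before) == totals(after)

pair_production = is_conserved(["e+", "e-"], ["p", "anti-p"])
beta_minus = is_conserved(["n"], ["p", "e-", "anti-nu"])
beta_plus = is_conserved(["p"], ["n", "e+", "nu"])
no_antineutrino = is_conserved(["n"], ["p", "e-"])   # lepton number would change

print(pair_production, beta_minus, beta_plus, no_antineutrino)
```

The last case shows why the antineutrino is needed: without it, the lepton number on the right would be +1 rather than zero.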
Mesons are hadrons with zero baryon number. They consist of a quark and an antiquark
(+⅓ and −⅓) and so can be created from the energy of collisions or decays. An example is
the π+ meson, which is an up quark (u, charge +⅔) and an anti-down quark (d̄, charge +⅓).
Mesons are all short-lived because they can decay into lighter lepton–antilepton pairs whilst
still conserving baryon and lepton number.
Summary
■ Hadrons are particles made of quarks, which are affected by the strong force. They
include baryons and mesons.
■ Leptons are fundamental particles that are not affected by the strong force.
■ Baryon number is conserved, with baryons having a baryon number of +1, antibaryons
−1 and mesons 0. Quarks have a baryon number of +⅓ and antiquarks −⅓.
■ Lepton number is conserved, with leptons (such as electrons and neutrinos) having a
lepton number of +1 and antileptons (such as positrons and antineutrinos) −1.
■ Background radiation is present everywhere on Earth from natural sources
(including cosmic rays and some types of rocks) and artificial sources (including
medical devices such as X-ray machines and nuclear weapons tests).
■ The standard model classifies matter into three families: quarks (including up and
down), leptons (including electrons and neutrinos) and force carriers (including
photons and gluons).
■ Matter can be classified into baryons, mesons and leptons.
■ Baryon numbers and lepton numbers are conserved in nuclear transformations.
Learning Outcomes
■ describe qualitatively the motion of a rigid solid object under the influence of a single force in
terms of linear acceleration and rotational acceleration
■ recall and use $I = \sum mr^2$ to calculate the moment of inertia of a body consisting of three or
fewer point particles fixed together
■ use integration to calculate the moment of inertia of a ring, a disk and a rod
■ deduce equations for rotational motion by analogy with Newton’s laws for linear motion,
including $E = \frac{1}{2}I\omega^2$, $L = I\omega$ and $\Gamma = I\frac{d\omega}{dt}$
■ apply the laws of rotational motion to perform kinematic calculations for a rotating object
when the moment of inertia is given
Figure S17.1 A force that acts on an object in a line that does not pass through the centre of mass
causes the object to undergo linear and angular acceleration.
This combination of linear and rotational acceleration requires some thought to analyse in
detail. First, we will look at how we can describe and explain the rotation of a rigid body
about a fixed axis or pivot point.
We can define the instantaneous angular velocity as the rate of change of angular
displacement, measured in radians per second:
$$\omega = \frac{d\theta}{dt}$$
If we plot a graph of angular displacement against time, the instantaneous angular velocity at
a particular time is the gradient of the graph taken at that time. Note that this has the same
form as the equation for linear velocity:
$$v = \frac{dx}{dt}$$
If the angular displacement ∆θ changes over a time ∆t, then we can calculate the average
angular velocity as:
$$\omega_{av} = \frac{\Delta\theta}{\Delta t}$$
Just as with linear velocity, if the angular velocity is constant, then the graph of angular
displacement against time is a straight line with the gradient equal to the average angular
velocity. If the angular velocity is changing, the graph of angular displacement against time
is curved and the instantaneous angular velocity is the gradient of a tangent to the curve.
or as we had before:
v = rω
[Figure: labels θ, s = rθ and v = rω]
question
17.1 a A turntable rotates at 33 revolutions per minute. Determine the period, frequency
and angular frequency for this rotation (in standard units).
b The diameter of the turntable is 30 cm. Calculate the speed of a point on the edge of
the turntable.
moment (in N m) = force (in N) × perpendicular distance from the pivot (in m)
The angular acceleration is the rate of change of angular velocity. In this book, we give
angular acceleration the symbol α. It is measured in radians per second per second (rad s⁻²):
$$\alpha = \frac{d\omega}{dt}$$
Note that since $\omega = \frac{d\theta}{dt}$, the angular acceleration is $\alpha = \frac{d\omega}{dt} = \frac{d^2\theta}{dt^2}$.
We call this the second derivative of angular displacement with respect to time.
We have seen how we can determine equations of motion for rotational motion that have
the same form as the equations for linear motion. However, before we can write down the
equivalent of Newton’s second law (F = ma) for rotational motion, we need to answer the
following question: what is the rotational equivalent to mass? This is not a straightforward
question to answer!
Figure S17.3 Loading supermarket trolleys: a It is easier to make this supermarket trolley turn
around a corner. b It is much harder to get this supermarket trolley to turn around a corner. Once
you have started the trolley rotating, it is also harder to stop.
We can do a simple experiment in the lab to show the same thing (Figure S17.4). Take a metre
rule, and tape equal masses on either side of the centre, as shown in a. Try to rotate the rule.
Now move the masses further away from the centre towards each end, as shown in b, and try
to rotate the rule again. It should be harder to start and stop the rotation in b than in a.
Figure S17.4 A metre rule with masses attached. It is easier to start and stop the ruler rotating
with the masses in position a than with them in position b.
Figure S17.5 A rotating rigid body.
Figure S17.5 shows a rigid body rotating about an axis O at angular velocity ω. Imagine it as
being made up of a series of point particles, of masses m1, m2, m3 . . . , each at a distance of r1,
r2, r3 . . . from the rotation axis. Particle 1 is moving at a speed
$$v_1 = r_1\omega$$
so its kinetic energy is
$$KE_1 = \frac{1}{2}m_1v_1^2 = \frac{1}{2}m_1r_1^2\omega^2$$
We can write down similar equations for the rest of the particles. The total kinetic energy of
the rotating body is the sum of the kinetic energies of the particles:
$$KE = \frac{1}{2}m_1r_1^2\omega^2 + \frac{1}{2}m_2r_2^2\omega^2 + \frac{1}{2}m_3r_3^2\omega^2 + \dots$$
$$KE = \frac{1}{2}\omega^2\left(m_1r_1^2 + m_2r_2^2 + m_3r_3^2 + \dots\right)$$
We call the quantity in brackets the moment of inertia and give it the symbol I. Note that
the angular velocity ω is the same for all the particles. We can write this quantity using
mathematical notation:
$$I = \sum_i m_i r_i^2$$
(the ‘Σ’ means ‘sum over all values of the index i’)
This means that our kinetic energy equation for a rotating body becomes
$$KE = \frac{1}{2}I\omega^2$$
This equation has a similar form to the kinetic energy of linear motion, where the moment
of inertia is the rotational equivalent of mass (measured in kg m2) and the angular velocity is
equivalent to velocity.
Look again at the formula for the moment of inertia. We can see that if the mass is
distributed further from the pivot point, the moment of inertia is larger. (In fact, if you
double the distance from the pivot, you increase the moment of inertia by a factor of 4.) Just
as a more massive object is more difficult to accelerate, an object with a larger moment of
inertia is harder to rotate. This explains why the supermarket trolley we discussed is hard to
get around a corner with the mass distributed towards the front of the trolley – its moment
of inertia is much larger, and so a larger torque is required to produce a given angular
acceleration. The same logic applies to the experiment with the metre rule.
We can now write down the equivalent to Newton’s second law for angular acceleration:
$$\Gamma = I\alpha = I\frac{d\omega}{dt}$$
[Figure: a 0.25 kg mass and a 1.00 kg mass fixed 20 cm apart]
The 0.25 kg mass is at 0.5 m from the pivot, and the 1.00 kg mass is at 0.7 m from the pivot.
Therefore the moment of inertia is
$$I = 0.25\,\text{kg} \times (0.5\,\text{m})^2 + 1.00\,\text{kg} \times (0.7\,\text{m})^2 = 0.55\,\text{kg m}^2$$
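The sum over point masses is easy to check numerically. The following is a minimal Python sketch; the function name `moment_of_inertia` and the list-of-pairs input format are assumptions chosen for illustration.

```python
def moment_of_inertia(point_masses):
    """Sum m * r**2 over (mass in kg, distance from pivot in m) pairs.

    Returns the moment of inertia in kg m^2.
    """
    return sum(m * r**2 for m, r in point_masses)

# The two masses from the example: 0.25 kg at 0.5 m and 1.00 kg at 0.7 m.
I_example = moment_of_inertia([(0.25, 0.5), (1.00, 0.7)])
print(f"I = {I_example:.2f} kg m^2")

# Doubling the distance of a single mass from the pivot quadruples I.
ratio = moment_of_inertia([(1.0, 2.0)]) / moment_of_inertia([(1.0, 1.0)])
print(f"ratio = {ratio}")
```

The second calculation illustrates the factor of 4 mentioned earlier: because the distance is squared, mass placed far from the pivot dominates the total.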
$$I = \int r^2\,dm$$
We will demonstrate how to use this formula in three examples: a rod, a disk and a ring.
Figure S17.7 Calculating the moment of inertia of a uniform rod.
We will calculate the moment of inertia of a rod about its centre point (see Figure S17.7).
The rod is uniform, has mass M and total length L. Since the rod is uniform, it will have a
constant mass per unit length:
$$\rho = \frac{M}{L}$$
We will divide the rod up into small elements, each of length dx. Each element therefore
has mass:
$$dm = \frac{M}{L}\,dx$$
The x coordinate is the displacement from the pivot point. We can say that the element of
length dx at position x has the moment of inertia:
$$dI = x^2\,dm$$
Of course, the contribution of each element to the total moment of inertia varies in size
depending on how far you are from the pivot point – this is taken account of here, because
we have the $x^2$ term. In order to find the total moment of inertia, we need to sum up these
contributions over the entire length of the rod. The x-axis is defined as being along the rod,
with the origin of coordinates at the centre of the rod. So the rod extends from:
$x = -L/2$ to $x = L/2$
We can find the total moment of inertia by integrating over x, from −L/2 to L/2.
$$I = \int dI = \int_{x=-L/2}^{x=L/2} \frac{x^2 M}{L}\,dx = \left[\frac{x^3 M}{3L}\right]_{-L/2}^{L/2} = \frac{M}{3L}\left(\frac{L^3}{8} + \frac{L^3}{8}\right) = \frac{ML^2}{12}$$
You may want to check you have followed each step in obtaining this result by doing the full
calculation yourself.
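One way to check the result without calculus is to chop the rod into many small elements and add up x² dm directly, which is what the integral does in the limit. The sketch below does this in Python; the function name, the number of elements and the example values of M and L are all assumptions chosen for illustration.

```python
def rod_moment_of_inertia(M, L, n=10_000):
    """Approximate I = ∫ x^2 dm for a uniform rod pivoted at its centre.

    The rod is split into n elements of length dx, each of mass (M / L) * dx,
    and x is taken at the middle of each element.
    """
    dx = L / n
    dm = (M / L) * dx
    total = 0.0
    for i in range(n):
        x = -L / 2 + (i + 0.5) * dx   # displacement of this element from the pivot
        total += x**2 * dm
    return total

M, L = 2.0, 1.5                        # kg and m (hypothetical values)
I_numeric = rod_moment_of_inertia(M, L)
I_formula = M * L**2 / 12
print(I_numeric, I_formula)
```

With a large number of elements the two values agree closely, which is a useful check that the limits and the factor of 1/12 are right.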
When we calculate a moment of inertia, we always put the origin of coordinates at the
pivot point. In the example of the rod, if we instead pivot the rod about one end then we
should place the origin at that end of the rod, and x will take values between 0 and L in the
integration.
question
17.2 Prove that the moment of inertia of a uniform rod about one of its ends is $\frac{ML^2}{3}$. Hint:
follow the same steps we used above, but change the origin of coordinates to one end
of the rod.
$$I = MR^2$$
This result is also the moment of inertia of a thin-walled, hollow cylinder about its axis, as
the distribution of mass about the rotation axis is identical.
Figure S17.8 shows a solid disk, with total mass M and radius R. To calculate its moment of
inertia about an axis through the centre and perpendicular to the disk, we need to divide it
up into infinitesimally small rings (annuli). Each ring has a different radius, so to add up all
the infinitesimally small rings we integrate over radii from 0 to R.
The mass per unit area of the disk is
$$\rho = \frac{M}{\pi R^2}$$
Consider an element of this disk: a thin ring, of width dr and at radius r from the centre
of the disk. Its circumference is 2πr. (We can ignore the fact that the inner and outer
circumferences are very slightly different, because if we took this into account, they would
contribute terms to the expression with (dr)² in them. As we integrate and dr tends to zero,
then these terms go to zero much faster than terms where dr is the only small quantity.)
The area of the thin ring is therefore
dA = 2π r dr
Note carefully the difference between R, which is the radius of the whole disk, and r, which is
the radius of the thin ring whose moment of inertia we are adding to the total.
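The formula for the moment of inertia of a thin ring is:
$$I = mr^2$$
The thin ring at radius r has mass $dm = \rho\,dA$, so its contribution to the moment of inertia of the disk is
$$dI = r^2\,dm = r^2 \times \frac{M}{\pi R^2} \times 2\pi r\,dr = \frac{2M}{R^2}r^3\,dr$$
We now need to add up these contributions for the whole disk, so we integrate over r from 0
to R:
$$I = \int dI = \int_{r=0}^{r=R} \frac{2M}{R^2}r^3\,dr = \frac{2M}{R^2}\left[\frac{r^4}{4}\right]_{r=0}^{r=R} = \frac{2M}{R^2} \times \frac{R^4}{4} = \frac{MR^2}{2}$$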
question
17.3 Without doing any further calculation, write down the moment of inertia of a solid
cylinder. Justify your answer.
We can use the result for a disk to calculate the moment of inertia of a sphere. We can
consider a sphere as being made up of lots of thin disks, and their radius varies as a function
of how far they are above the centre of the sphere. Doing this is beyond the scope of this
course, but you might like the challenge! The result is given in the table of moments of
inertia below.
Figure S17.9 shows a ring, or annulus, with inner radius R1, outer radius R2 and mass M. If
we want to calculate its moment of inertia about an axis through the centre of the ring and
perpendicular to the ring, then our calculation is very similar to that for the disk. In fact, the
only changes we need to make are:
• adjusting the mass per unit area, so it takes account of the missing central part of the ring
• changing the limits of integration.
The new mass per unit area is (subtracting the area of the central missing part of the disk
from the area of a solid disk):
$$\rho = \frac{M}{\pi\left(R_2^2 - R_1^2\right)}$$
When we divide the ring up into infinitesimally thin rings, the moment of inertia of each
thin ring is:
$$dI = r^2\,dm = r^2 \times \frac{M}{\pi\left(R_2^2 - R_1^2\right)} \times 2\pi r\,dr = \frac{2M}{\left(R_2^2 - R_1^2\right)}r^3\,dr$$
To calculate the moment of inertia of the whole ring, we need to integrate this result from
r = R1 to r = R2:
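$$I = \int dI = \int_{r=R_1}^{r=R_2} \frac{2M}{\left(R_2^2 - R_1^2\right)}r^3\,dr = \frac{2M}{\left(R_2^2 - R_1^2\right)}\left[\frac{r^4}{4}\right]_{r=R_1}^{r=R_2} = \frac{2M}{\left(R_2^2 - R_1^2\right)} \times \frac{\left(R_2^4 - R_1^4\right)}{4}$$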
However, since
$$R_2^4 - R_1^4 = \left(R_2^2 - R_1^2\right)\left(R_2^2 + R_1^2\right)$$
we can simplify our result to:
$$I = \frac{M}{2}\left(R_2^2 + R_1^2\right)$$
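The same thin-ring argument can be tested numerically for both the annulus and the solid disk (which is just the case of zero inner radius). The Python sketch below adds up r² dm over many thin rings; the function name and the example values of M, R1 and R2 are assumptions for illustration only.

```python
import math

def annulus_moment_of_inertia(M, R1, R2, n=10_000):
    """Approximate I for a flat uniform annulus by summing over n thin rings.

    M is the total mass; R1 and R2 are the inner and outer radii.
    Setting R1 = 0 gives a solid disk.
    """
    rho = M / (math.pi * (R2**2 - R1**2))   # mass per unit area
    dr = (R2 - R1) / n
    total = 0.0
    for i in range(n):
        r = R1 + (i + 0.5) * dr             # radius at the middle of this thin ring
        dm = rho * 2 * math.pi * r * dr     # mass of the thin ring, rho * dA
        total += r**2 * dm
    return total

M, R1, R2 = 3.0, 0.2, 0.5                   # kg, m, m (hypothetical values)
I_annulus = annulus_moment_of_inertia(M, R1, R2)
I_disk = annulus_moment_of_inertia(M, 0.0, R2)

print(I_annulus, M * (R2**2 + R1**2) / 2)   # compare with (M/2)(R2^2 + R1^2)
print(I_disk, M * R2**2 / 2)                # compare with MR^2/2
```

Note that for the same mass and outer radius the annulus has the larger moment of inertia, because its mass sits further from the axis.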
Thin cylindrical shell with open ends, of radius R and mass M, about its axis: $I = MR^2$
Solid cylinder of radius R and mass M, about its axis: $I = \frac{MR^2}{2}$
Hollow sphere of radius R and mass M: $I = \frac{2MR^2}{3}$
Solid sphere of radius R and mass M: $I = \frac{2MR^2}{5}$
A vinyl record rotates on a turntable with an angular speed of 3.49 radians per second. The
record’s diameter is 0.305 m and its moment of inertia is 1.28 × 10⁻³ kg m².
a Calculate the mass of the record.
b Calculate its rotational kinetic energy.
c The record is brought to a standstill in 0.50 s by the application of a constant torque.
Calculate the torque exerted on the record.
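a The moment of inertia of a solid disk is
$$I = \frac{MR^2}{2}$$
Step 1 We have been given the moment of inertia and radius of the disk, so rearrange
the formula for the moment of inertia to make mass the subject:
$$M = \frac{2I}{R^2}$$
Step 2 Substitute the values given in the question, remembering to divide the
diameter of the record by two to get its radius, to calculate the total mass:
$$M = \frac{2 \times 1.28 \times 10^{-3}\,\text{kg m}^2}{(0.1525\,\text{m})^2} = 0.110\,\text{kg}$$
b The rotational kinetic energy is
$$KE = \frac{1}{2}I\omega^2$$
Step 1 Substitute the given values to determine the rotational kinetic energy:
$$KE = \frac{1}{2} \times 1.28 \times 10^{-3}\,\text{kg m}^2 \times \left(3.49\,\text{rad s}^{-1}\right)^2 = 7.80 \times 10^{-3}\,\text{J}$$
c Step 1 Calculate the angular acceleration of the record as it is brought to rest:
$$\alpha = \frac{\Delta\omega}{\Delta t} = \frac{-3.49\,\text{rad s}^{-1}}{0.50\,\text{s}} = -7.0\,\text{rad s}^{-2}$$
Step 2 Use this value to calculate the magnitude of the torque exerted on the record:
$$\Gamma = I|\alpha| = 1.28 \times 10^{-3}\,\text{kg m}^2 \times 6.98\,\text{rad s}^{-2} \approx 8.9 \times 10^{-3}\,\text{N m}$$
(using the unrounded value of α).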
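The arithmetic for all three parts of this worked example can be reproduced with a short Python sketch. The variable names are illustrative; the numbers are those given in the question, and the torque is assumed to follow from the rotational form of Newton’s second law, Γ = Iα.

```python
# Values given in the question
I = 1.28e-3        # moment of inertia, kg m^2
diameter = 0.305   # m
omega = 3.49       # initial angular speed, rad s^-1
t_stop = 0.50      # time taken to stop, s

# a: rearrange I = M R^2 / 2 for the mass of a solid disk
R = diameter / 2
mass = 2 * I / R**2

# b: rotational kinetic energy
kinetic_energy = 0.5 * I * omega**2

# c: constant angular acceleration, then torque from Gamma = I * alpha
alpha = (0 - omega) / t_stop
torque = I * alpha

print(f"M     = {mass:.3f} kg")
print(f"KE    = {kinetic_energy:.2e} J")
print(f"alpha = {alpha:.2f} rad s^-2")
print(f"Gamma = {torque:.2e} N m")
```

The negative signs of the angular acceleration and torque simply show that they act against the direction of rotation; the question asks for the magnitude.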
Angular momentum, L
Remember that in a system where no external force acts, momentum is conserved. This is a
powerful law in mechanics. The rotational equivalent is that in a system where no external
torque (moment) acts, angular momentum is conserved.
L = Iω
You may have experienced this if you have watched or taken part in ice skating or ballet.
An ice dancer who starts spinning with his arms outstretched will increase his rotation rate
as he brings his hands in (see Figure S17.10). As he brings his arms in, the mass of his arms
moves closer to his rotation axis. This means that his moment of inertia is reduced. Since
no external torque has acted, angular momentum must be conserved. The reduction in his
moment of inertia must be balanced by an increase in his angular velocity. He therefore
spins faster. Interestingly, his kinetic energy might increase during this process. Think about
where this energy might come from before reading on. As the skater pulls his arms in, he
causes them to accelerate – they do not follow the path that they would follow if no force
acted on them. He therefore has to do work to bring the arms in, and that work increases the
kinetic energy stored in his rotating body.
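A short numerical sketch makes the point about energy concrete. The moments of inertia and the initial angular velocity below are hypothetical values chosen only to give round numbers; they are not measurements of a real skater.

```python
# Hypothetical values for a spinning skater
I_out = 4.0        # moment of inertia with arms outstretched, kg m^2
I_in = 1.6         # moment of inertia with arms pulled in, kg m^2
omega_out = 2.0    # initial angular velocity, rad s^-1

# No external torque acts, so L = I * omega is the same before and after
L = I_out * omega_out
omega_in = L / I_in

# Rotational kinetic energy before and after
ke_out = 0.5 * I_out * omega_out**2
ke_in = 0.5 * I_in * omega_in**2
work_done = ke_in - ke_out   # supplied by the skater pulling his arms in

print(f"omega_in = {omega_in:.2f} rad s^-1")
print(f"KE: {ke_out:.1f} J -> {ke_in:.1f} J, work done = {work_done:.1f} J")
```

Because L is fixed, the kinetic energy can be written as L²/(2I), so any reduction in the moment of inertia raises the kinetic energy.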
Figure S17.10 An ice skater speeds up his rotation as he pulls his arms in. Angular momentum is
conserved, so reducing his moment of inertia means that his angular velocity must increase.
The attitude indicator on an aircraft may use a device called a gyroscope to maintain
an artificial horizon (see Figure S17.11). The gyroscope contains a rotating disk, which is
mounted in a framework containing three gimbals so that it is able to rotate freely in three
dimensions. The rotating disk has angular momentum, and since no external torque acts on it
(the gimbals have little friction), the disk remains horizontal while the aircraft and gyroscope
gimbals tilt around it. This enables the pilot to see what angle the aircraft is tilted at.
This is a solid cylinder, so it has the same moment of inertia as a solid disk. The question
does not state a radius, so the end result is probably independent of radius, but for now we
will call the radius R.
$$I = \frac{MR^2}{2}$$
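The centre of mass is moving with linear speed v. If the cylinder rolls without slipping, the
point at which the cylinder touches the ground is instantaneously at rest, so relative to the
centre of mass the edge of the cylinder moves with speed v. The angular velocity is therefore
$$\omega = \frac{v}{R}$$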
Step 3 Determine the rotational kinetic energy and the kinetic energy of the centre of mass.
The total kinetic energy is the sum of the rotational kinetic energy of the cylinder and
the (linear) kinetic energy of the centre of mass.
Rotational KE:
$$KE = \frac{1}{2}I\omega^2 = \frac{MR^2v^2}{4R^2} = \frac{Mv^2}{4}$$
Total KE:
$$KE = \frac{1}{2}Mv^2 + \frac{1}{4}Mv^2 = \frac{3}{4}Mv^2$$
Think about what this result means for rolling a cylinder down a slope. At a given speed, it
has greater kinetic energy than would be expected from its centre of mass alone, as there
is also energy stored in the rotational motion. If the cylinder was dropped from a height h,
or slid down a frictionless slope from that height, it would achieve the same final velocity –
all of the initial potential energy would have been converted to kinetic energy of the centre
of mass. However, if it rolls down a slope from this same height, its centre of mass will
end up moving more slowly. The energy is now partitioned (split) between the KE of the
linear motion of the centre of mass and the rotational KE. You could test this for yourself
by rolling a full and an empty cylindrical jam jar or food tin down a slope, and see if the
difference in times you measure is the same as the difference you calculate. You will need to
work through the same calculation steps for a hollow cylinder (the moment of inertia is the
same as a thin hoop).
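The comparison between sliding and rolling can be sketched numerically. Writing the moment of inertia as I = kMR² (k = ½ for a solid cylinder, k = 1 for a thin hoop or hollow cylinder), energy conservation with ω = v/R gives Mgh = ½Mv²(1 + k). The Python sketch below uses this; the height h and the function name are assumptions for illustration.

```python
import math

g = 9.81   # gravitational field strength, N kg^-1
h = 0.50   # vertical height of the slope, m (hypothetical value)

def final_speed(k):
    """Speed of the centre of mass after descending height h.

    k is the factor in I = k * M * R**2; k = 0 corresponds to sliding
    without friction (no rotation). Assumes rolling without slipping.
    """
    return math.sqrt(2 * g * h / (1 + k))

v_slide = final_speed(0.0)    # frictionless sliding or free fall
v_solid = final_speed(0.5)    # solid cylinder
v_hollow = final_speed(1.0)   # thin-walled hollow cylinder

print(f"sliding: {v_slide:.2f} m/s")
print(f"solid cylinder: {v_solid:.2f} m/s")
print(f"hollow cylinder: {v_hollow:.2f} m/s")
```

Neither the mass nor the radius appears in the result, which is consistent with the earlier remark that the answer is independent of radius.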
Summary
■ The circular motion of a rigid solid object under the influence of a single force can be
modelled in terms of linear acceleration and rotational acceleration.
■ The moment of inertia of a body consisting of point particles fixed together is given
by $I = \sum mr^2$.
■ The moment of inertia of a ring, a disk and a rod can be calculated using integration.
■ Angular momentum is defined by the equation
angular momentum = moment of inertia × angular velocity.
■ In a system where no external torque (moment) acts, angular momentum is
conserved.
■ The equations for rotational motion can be remembered by analogy with Newton’s
laws for linear motion, including $E = \frac{1}{2}I\omega^2$, $L = I\omega$ and $\Gamma = I\frac{d\omega}{dt}$.
■ When given the moment of inertia for a rotating object, the equations of rotational
motion and the conservation of angular momentum can be used to perform
kinematic calculations.
End-of-chapter questions
S17.1
A car is travelling up one side of a hill and down the other side. The crest of the hill is a circular arc with
a radius of 45.0 m. Determine the maximum speed that the car can have while moving over the crest
without losing contact. [6]
S17.2
Find the moment of inertia of an equilateral triangle consisting of three point masses of mass m joined
by light rods of length L, about the midpoint of one of the sides. [5]
S17.3
Explain why a tightrope walker uses a long pole to maintain their balance as they are walking. [3]
S17.4
A vehicle called a Gyrobus was developed in the 1950s. It used a flywheel to store the energy required to
power the bus: the wheel was spun up at a charging stop before setting off, and was then used to drive a
generator and an electric motor.
a When fully ‘charged’, the flywheel rotates about a vertical axis at 3000 revolutions per minute.
Calculate the angular speed ω of the disc. [2]
b Laws of rotational motion can be deduced by comparison with Newton’s laws of linear motion.
Copy out and complete the table below by stating the equivalent formulae, in words, for rotational
motion. [2]
c The diagram below shows a flywheel of mass M and thickness t with radius R. The uniform density of the
flywheel is ρ.
[Diagram: flywheel of radius R and thickness t rotating at angular velocity ω]
(i) Use integration to derive an expression for the moment of inertia I of the disc. You may wish to
draw a diagram to illustrate your working. [4]
(ii) The flywheel has a mass of 1500 kg and a moment of inertia of 4.8 × 10² kg m². Calculate the radius
of the flywheel. [2]
(iii) Determine the rotational kinetic energy of the disc, when rotating at 3000 rpm. [3]
d The drivers of the Gyrobus found that it did not handle as expected, particularly when the bus tilted
during a turn (for example on a slightly banked turn). Suggest why they found this. [2]
S18: Gravitation
Learning Outcomes
■ state Kepler’s laws of planetary motion:
■ planets move in elliptical orbits with the Sun at one focus
■ the orbital period squared of a planet is proportional to its mean distance from the
Sun cubed
■ understand energy transfer by analysis of the area under a gravitational force–distance graph
■ calculate escape velocity using the ideas of gravitational potential energy (or area under a
force–distance graph) and energy transfer
[Figure: the Earth at the centre, with a planet moving on an epicycle; the centre of the epicycle (the point about which the planet rotates) moves around the deferent]
A key principle in science is that of ‘Occam’s Razor’, named after the English monk and
philosopher William of Occam. This states that ‘among competing hypotheses, the one
with the fewest assumptions should be selected’. In other words, if there are two competing
theories that make exactly the same predictions (and match the experimental data), the
simpler one is better. The Polish astronomer and mathematician Nicolaus Copernicus
(1473–1543) developed a different model using another Greek idea from the philosopher
Aristarchus. This model had the Sun at the centre (a heliocentric model), where the planets
move in circular orbits around the Sun, and the Moon orbits the Earth. In this model, the
stars remained on a fixed sphere but at a very great distance from the Sun. This was a much
simpler theory than that of Ptolemy and explained some, but not all of the measurements of
the motion of the planets.
The Danish astronomer Tycho Brahe (1546–1601) made a large number of extremely
accurate observations of the apparent movements of the planets and stars. Many of these
observations could not be explained by a Copernican model using circular orbits. The
German astronomer Johannes Kepler (1571–1630) inherited Brahe’s data after Brahe’s death.
Kepler accepted Copernicus’ idea of a heliocentric solar system (which was controversial at
the time for philosophical and religious reasons), but he realised that in order to fit the data,
the planets had to move in elliptical orbits. The key point here is that the uncertainties in the
observations were small enough to distinguish between these two similar models, which was
incredible given that they were taken without the aid of a telescope. Kepler developed the
following three laws.
[Graph: period, T / years, plotted against mean distance, r / 10⁹ km, for Mars, Jupiter, Saturn, Uranus, Neptune and Pluto, with curves T ∝ r, T ∝ r^(3/2) and T ∝ r²]
Figure S18.2 a Kepler’s first and second laws – the planets follow elliptical orbits with the Sun at
one focus (Kepler’s first law), and the line joining the planet to the Sun sweeps out equal areas in
equal times (Kepler’s second law). b Kepler's third law for our solar system.
Kepler’s laws were empirical – which means that they were developed from observations
without being based on a physical theory. The English scientist Isaac Newton (1642–1726)
proposed just such a theory, which suggested (as we have seen) that the force of gravity
between two objects is inversely proportional to the square of the distance between them.
Newton showed that this ‘universal theory of gravitation’ could be used to explain all of
Kepler’s laws. We have already used Newton’s theory to derive Kepler’s third law for the case
of a circular orbit in Chapter 18 of the Coursebook. Kepler’s third law can also be derived for
a more general, elliptical orbit, but that is beyond the scope of this course.
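Kepler’s third law is easy to test against data. The sketch below uses approximate, commonly quoted values for the mean orbital radius (in astronomical units, where the Earth’s mean distance is 1) and the period (in years); these figures are assumptions brought in for illustration and are not read from Figure S18.2. In these units the law predicts that T²/r³ should be close to 1 for every planet.

```python
# Approximate mean orbital radius (AU) and orbital period (years).
# Rounded reference values, used here only for illustration.
PLANETS = {
    "Earth":   (1.000, 1.000),
    "Mars":    (1.524, 1.881),
    "Jupiter": (5.203, 11.86),
    "Saturn":  (9.537, 29.46),
    "Uranus":  (19.19, 84.01),
    "Neptune": (30.07, 164.8),
}

def kepler_ratio(r, T):
    """Return T^2 / r^3, which Kepler's third law says is the same for all planets."""
    return T**2 / r**3

ratios = {name: kepler_ratio(r, T) for name, (r, T) in PLANETS.items()}

for name, ratio in ratios.items():
    print(f"{name:8s} T^2 / r^3 = {ratio:.3f}")
```

The ratio stays constant to within a fraction of a per cent even though the periods range from 1 year to more than 160 years.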
question
18.1 The Earth’s orbit is not very elliptical – the Earth’s closest approach to the Sun is
1.47 × 10⁸ km and its greatest distance from the Sun is 1.52 × 10⁸ km.
a Draw a sketch of the orbit and indicate the points of closest approach (A) and
greatest distance to the Sun (B). Exaggerate your sketch so that the ellipticity is
apparent.
b By considering the time taken to sweep out a small area ∆A, use Kepler’s second law
to estimate the ratio of the Earth’s orbital speeds at points A and B.
c Repeat the calculation in part b for Pluto, where the distance of closest approach to
the Sun is 4.44 × 10⁹ km and the greatest distance from the Sun is 7.38 × 10⁹ km.
S18.3 Potential energy and gravitational force–distance graphs
Remember that we defined gravitational potential at a point as the work done per unit mass
in bringing a mass from infinity to the point. Since the gravitational force is always attractive,
in the opposite direction to the displacement from the object with mass M, the expression for
gravitational potential contains a minus sign:
$$\varphi = -\frac{GM}{r}$$
The minus sign means that even though the magnitude of the potential decreases as you
move the test mass away from the mass M, the change in potential as you move the test mass
away is positive (i.e. work is done to separate the masses).
Two objects have gravitational potential energy because they are each within the other
object’s gravitational field. We define the objects to have zero potential energy when they
are infinitely far apart. Using the expression for the gravitational potential given above, we
can calculate the potential energy of one object within the gravitational field of another. For
example, if we know the mass of a satellite orbiting a planet, and the gravitational potential
of the planet at the position of the satellite, we multiply the potential by the mass of the
satellite to get the gravitational potential energy E. This quantity is the equivalent of the work
done in bringing the satellite from infinity to that point within the planet’s gravitational
field. For two objects of mass m1 and m2, the gravitational potential energy is given by the
equation below. The GPE is negative because the force is attractive:
$$E = -\frac{Gm_1m_2}{r}$$
Another way of deriving this result is from Newton’s law of gravitation. We can do this in
two ways – graphically, or by integration. Figure S18.3 shows a force–distance graph for a
mass in a gravitational field.
Figure S18.3 Force–distance graph for a mass in a gravitational field. The force has a minus sign
because it is in the opposite direction to the displacement r of the mass.
The shaded area on the graph represents the change in gravitational potential energy as the
mass is moved from one position to another. Remember that if we are moving the masses
together, the change in potential energy will be negative, and if we are moving them apart, it
will be positive. It is always worth double-checking whether you have this the right way round!
Let’s try doing this by integration. We are going to bring mass m2 into the gravitational
field of mass m1, and see how much work is done. This will be the gravitational potential
energy that these masses have in that particular configuration (compared to when they are
infinitely far apart). Since the force changes as the mass is moved, we must move the mass a
small increment dx and multiply by the force at that radius, and then add up contributions
from the range of radii we are interested in. The work done in moving the mass by dx (we
will take dx as being positive moving away from mass m1) is:
$$dW = \frac{Gm_1m_2}{x^2}\,dx$$
Let us double check the signs: we are moving the masses away from each other as we increase
x, so because gravity is attractive we expect to have to do work to do this, so we expect dW to
be positive, as it is.
Now, to get the potential energy in moving the mass from infinity to r, we need to
integrate between limits. Notice that infinity is the lower limit as we are starting there.
$$E_{grav} = \int dW = \int_{\infty}^{r} \frac{Gm_1m_2}{x^2}\,dx = \left[-\frac{Gm_1m_2}{x}\right]_{\infty}^{r} = -\frac{Gm_1m_2}{r}$$
Now let us check that this still makes sense. If we’re bringing the object in from infinity to
a point in the field, then because the field is attractive we expect the potential energy to be
negative – giving us the minus sign that we indeed have!
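The integration can also be carried out numerically, which is the same as adding up thin strips of area under the force–distance graph. The Python sketch below computes the work needed to move a mass outwards from r to a very large distance, then changes the sign to get the potential energy at r. The satellite mass, the starting radius, the choice of "very far away" (a million times r) and the step-size rule are all assumptions made for this illustration.

```python
G = 6.67e-11   # gravitational constant, N m^2 kg^-2

def work_to_separate(m1, m2, r, far_factor=1e6, growth=1.001):
    """Approximate the work done against gravity moving m2 from r out to far_factor * r.

    Each step is 0.1% larger than the last, so that small steps are used
    where the force is large and bigger steps where it is small.
    """
    work = 0.0
    x = r
    while x < far_factor * r:
        x_next = x * growth
        x_mid = 0.5 * (x + x_next)
        force = G * m1 * m2 / x_mid**2      # magnitude of the force at the middle of the step
        work += force * (x_next - x)        # area of one thin strip under the graph
        x = x_next
    return work

m_earth = 6.0e24   # kg
m_sat = 1000.0     # kg (hypothetical satellite)
r = 7.0e6          # m from the centre of the Earth (hypothetical)

E_numeric = -work_to_separate(m_earth, m_sat, r)
E_formula = -G * m_earth * m_sat / r
print(f"{E_numeric:.4e} J  (numerical)")
print(f"{E_formula:.4e} J  (formula)")
```

Bringing the mass in from far away is the reverse of separating it, so the potential energy at r is minus the work calculated, in agreement with the sign check above.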
question
18.2 The Earth has a radius of 6400 km, and a mass of 6.0 × 10²⁴ kg.
a Calculate the change in gravitational potential in moving from the surface of the
Earth, at a distance of 6400 km from the centre of the Earth, to the orbit of the
International Space Station (ISS), at 410 km above the Earth. Explain the sign of the
change in gravitational potential that you calculated.
b An astronaut of mass 75 kg travels to the ISS. What is the change in her potential
energy between the start and end of the journey?
$$\frac{GMm}{r} = \frac{1}{2}mv_e^2$$
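Rearranging this energy balance gives $v_e = \sqrt{2GM/r}$, which does not depend on the mass m of the escaping object. As a hedged illustration, the Python sketch below applies the formula to Mars; the mass and radius used are approximate reference values brought in for this example and do not come from the text.

```python
import math

G = 6.67e-11   # gravitational constant, N m^2 kg^-2

def escape_velocity(mass, radius):
    """Escape velocity in m/s from the surface of a body of given mass (kg) and radius (m)."""
    return math.sqrt(2 * G * mass / radius)

# Approximate values for Mars (assumed for illustration)
mars_mass = 6.42e23     # kg
mars_radius = 3.39e6    # m

v_mars = escape_velocity(mars_mass, mars_radius)
print(f"Escape velocity from Mars: {v_mars / 1000:.1f} km/s")
```

Remember to convert radii given in kilometres into metres before substituting.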
questions
18.3 Calculate the escape velocity at the surface of each of these objects:
The Earth (mass 5.97 × 10²⁴ kg, radius 6370 km)
The Moon (mass 7.35 × 10²² kg, radius 1740 km)
The Sun (mass 1.99 × 10³⁰ kg, radius 6.96 × 10⁵ km)
18.4 A star three times the mass of our Sun can collapse to form a black hole after all the
resources it needs for nuclear fusion to occur have been used up. A black hole is a
region of space where the escape velocity from the gravitational field is greater than
the speed of light. Calculate the radius of an object with three times the mass of the
Sun, where the escape velocity at the surface would be the speed of light (this radius
is known as the Schwarzschild radius).
Summary
■ Kepler’s first law of planetary motion: all the planets move in elliptical orbits with the
Sun at one focus of the ellipse.
■ Kepler’s second law of planetary motion: a line drawn from the Sun to the planet will
sweep out equal areas in equal times as the planet moves in its orbit.
■ Kepler’s third law of planetary motion: the period of a planet’s orbit squared is
proportional to its mean distance from the Sun cubed: $T^2 \propto r^3$.
■ The area under a gravitational force–distance graph provides a way to analyse
changes in gravitational potential energy of a mass in a gravitational field.
■ Escape velocity can be determined by calculating the energy required to take a mass
from its initial position in the gravitational field to infinity (by using the expression
for gravitational potential or the area under a force-distance graph). The kinetic
energy that the body has at escape velocity is equal to the potential energy it gains
when it is taken out of the gravitational potential well.
■ The velocity required to escape from the gravitational field of a body of mass M is
given by $v_e = \sqrt{\frac{2GM}{r}}$.
S19: Oscillations
Learning Outcomes
■ show that the condition for simple harmonic motion leads to a differential equation of the
form $\frac{d^2x}{dt^2} = -\omega^2 x$ and that $x = A\cos\omega t$ is a solution to this equation
■ use differential calculus to derive the expressions $v = -A\omega\sin\omega t$ and $a = -A\omega^2\cos\omega t$ for
simple harmonic motion
■ recognise and use the expressions $x = A\cos\omega t$, $v = -A\omega\sin\omega t$, $a = -A\omega^2\cos\omega t$ and
$F = -m\omega^2 x$ to solve problems
■ understand the phase differences between displacement, velocity and acceleration in simple
harmonic motion
■ show that the total energy of an undamped simple harmonic system is given by
$E = \frac{1}{2}mA^2\omega^2$ and recognise that this is a constant
■ recognise and use $E = \frac{1}{2}mA^2\omega^2$ to solve problems
In this section, we are going to work from what we already know about the conditions for
simple harmonic motion, and derive the differential equation that governs it. We can then
show that the solutions to this equation are the sinusoidal oscillations that we have come to
expect for simple harmonic oscillations.
Remember that to have s.h.m. we require a restoring force which is directly proportional
to the displacement from the equilibrium position and acts in the opposite direction to
the displacement (towards the equilibrium point). In a mechanical system we will have
an oscillating mass; if you study physics further you will come across many other examples
where a system can be modelled as a simple harmonic oscillator (or where this model is a
good approximation).
Consider a mass hanging from a spring, as shown in Figure S19.1.
[Figure: the mass on the spring shown in two positions, in equilibrium and displaced from equilibrium.]
Figure S19.1 Mass m suspended from a spring with spring constant k. Displacing the mass from
its equilibrium position results in simple harmonic motion.
Once this system is set up, the mass will rest in equilibrium with the spring extended by an
extension $x_0$. Hooke’s law tells us that if the spring is extended by a distance $x_0$, the restoring
force exerted by the spring is given by $F = kx_0$. In equilibrium, this is balanced by the weight
of the mass, mg. So we can calculate the equilibrium position as:
$$kx_0 = mg \quad\Rightarrow\quad x_0 = \frac{mg}{k}$$
If we displace the mass by a distance x downwards from its equilibrium position, the
restoring force from the spring increases to $k(x + x_0)$. Remember that in equilibrium, the
restoring force was balanced by the weight, and there was no net force on the mass: $kx_0 = mg$.
Taking downwards as positive, the net force is $mg - k(x + x_0) = -kx$. Therefore
we know that the unbalanced restoring force is, in fact:
$$F = -kx$$
We include the negative sign because the force is in the opposite direction to the
displacement.
Since we know the unbalanced force on the mass, by using F = ma we can calculate the
acceleration. The equation of motion for the mass is therefore:
ma = − kx
Remember, however, that acceleration is the time derivative (rate of change) of velocity,
and velocity is the time derivative of displacement. We say that acceleration is the second
derivative of displacement with respect to time (we differentiate twice). So in fact,
$$a = \frac{d^2x}{dt^2}$$
and we can express the equation of motion as
$$m\frac{d^2x}{dt^2} = -kx \quad\Rightarrow\quad \frac{d^2x}{dt^2} = -\frac{k}{m}x$$
This is a differential equation, and we can solve it for x to determine how the displacement
of the mass changes with time. Since it is a second-order differential equation (it contains
a second derivative), to solve it we must integrate twice. This means that our solution will
contain two arbitrary constants. This makes sense, because we know that the motion will
depend on the initial position (first constant) and velocity (second constant) of the mass.
In other systems undergoing s.h.m., we may end up with an equation that has a different
coefficient for the term in x. The general form of the simple harmonic motion equation is:
$$\frac{d^2x}{dt^2} = -\omega^2 x$$
and its general solution is
$$x = \alpha\cos\omega t + \beta\sin\omega t$$
where α and β are constants that depend on the initial conditions (position and velocity at
time t = 0). ω is the angular frequency of the oscillation: ω = 2π f . If we compare this general
form of the s.h.m. equation to the equation we derived for the mass on a spring, we can see
that for this system, the angular frequency of oscillation is
$$\omega = \sqrt{\frac{k}{m}}$$
and therefore the frequency of oscillation is
$$f = \frac{1}{2\pi}\sqrt{\frac{k}{m}}$$
When we use these equations for the mass on a spring, in order to get ω in the correct units
of rad s−1, we must express the stiffness k in N m−1.
In the case where the oscillator starts at its maximum displacement (as is often the case),
the solution can be written as:
x = A cos(ω t )
Here, A is the amplitude of the oscillations and ω is the angular frequency discussed above.
In order to show that this is the correct solution to the s.h.m. equation, we need to
differentiate it twice, since the second derivative of x appears in the differential equation. As
we are doing this, we will also produce equations for the velocity and the acceleration.
If we differentiate the equation for x with respect to t we get the equation for the velocity
of the simple harmonic oscillator at time t.
$$\frac{dx}{dt} = v = -A\omega\sin(\omega t)$$
In deriving this equation, we have used the mathematical technique called the chain rule and
the standard result for the derivative of the cosine function. We can then differentiate this
velocity equation again to get an equation for the acceleration of the oscillator at time t.
$$\frac{d^2x}{dt^2} = a = -A\omega^2\cos(\omega t) = -\omega^2 x$$
Since the acceleration is $-\omega^2 x$, this is clearly the correct solution for our original differential
equation.
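The differentiation above can also be checked numerically. The following is a minimal sketch, not part of the original text: it estimates the second derivative of x = A cos(ωt) by central differences and confirms that it matches −ω²x to within numerical error. The values of `A` and `OMEGA` and the function names are illustrative assumptions.

```python
import math

A = 0.05      # amplitude in m (illustrative value, an assumption)
OMEGA = 4.0   # angular frequency in rad/s (illustrative value, an assumption)

def x(t):
    """Trial solution x = A cos(omega t)."""
    return A * math.cos(OMEGA * t)

def second_derivative(f, t, h=1e-4):
    """Central-difference estimate of the second derivative of f at time t."""
    return (f(t + h) - 2.0 * f(t) + f(t - h)) / h**2

def max_residual(n_samples=50, dt=0.1):
    """Largest value of |d2x/dt2 + omega^2 x| over the sampled times."""
    return max(
        abs(second_derivative(x, i * dt) + OMEGA**2 * x(i * dt))
        for i in range(n_samples)
    )

print(f"largest residual: {max_residual():.2e}")
```

A residual close to zero at every sampled time is what we expect if the trial function satisfies the s.h.m. equation; the small remainder comes from the finite-difference approximation, not from the physics.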
Figure S19.2 shows sketch graphs of the displacement, velocity and acceleration for a
simple harmonic oscillator. We can use the trigonometric identity
$$\cos(\theta + \phi) = \cos\theta\cos\phi - \sin\theta\sin\phi$$
to show that
$$\cos\left(\theta + \frac{\pi}{2}\right) = \cos\theta\cos\frac{\pi}{2} - \sin\theta\sin\frac{\pi}{2} = -\sin\theta$$
So looking at our expressions for v and x, we can say that the phase of v leads x by $\frac{\pi}{2}$ radians
(90°); this means that we obtain the graph of v by shifting the graph of x by $\frac{\pi}{2}$ radians along
the axis in the negative direction.
Similarly, a leads v by $\frac{\pi}{2}$ radians, and a and x are π radians (180°) out of phase.
[Figure: three graphs plotted against ωt from 0 to 7π/2. Displacement x varies between −A and +A, velocity v between −Aω and +Aω, and acceleration a between −Aω² and +Aω².]
Figure S19.2 The relationship between displacement, velocity and acceleration
for a simple harmonic oscillator.
question
19.1 Show that x = α cosω t + β sin ω t is also a solution to the s.h.m. equation.
A 500 g mass is hung from a spring with spring constant 0.1 N cm−1. Assume the acceleration
due to gravity, g, is 10 m s−2.
a Calculate the extension of the spring when it is at equilibrium.
b The mass is displaced to 5.0 cm below its equilibrium position and released at time t = 0 s. In
the motion that follows, if the displacement below the equilibrium position is x, determine
the equation that describes the motion.
c Calculate the speed of the mass as it passes through the equilibrium position.
d Calculate the magnitude of the maximum acceleration experienced by the mass.
a At the equilibrium extension x0, the restoring force balances the weight:
kx 0 = mg
Therefore
$$x_0 = \frac{mg}{k} = \frac{0.5\ \text{kg} \times 10\ \text{N kg}^{-1}}{0.1\ \text{N cm}^{-1}} = 50\ \text{cm}$$
b Start from the solution to the s.h.m. equation:
$$x = A\cos(\omega t)$$
We can either derive the differential equation and compare it to the standard form to
work out ω, or remember that for a mass m on a spring of stiffness k,
$$\omega = \sqrt{\frac{k}{m}} = \sqrt{\frac{10\ \text{N m}^{-1}}{0.5\ \text{kg}}} = 4.5\ \text{rad s}^{-1}$$
Remember that to get ω in rad s−1, we need to put m and k into SI units: m in
kg and k in N m−1. Note that we have to do this even though we are measuring A and x in
centimetres.
d The maximum acceleration has magnitude
$$a = \omega^2 A = \left(4.5\ \text{rad s}^{-1}\right)^2 \times 5.0\ \text{cm} = 100\ \text{cm s}^{-2}$$
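The arithmetic in this worked example can be reproduced with a short script. This is a sketch under the values stated in the question (m = 500 g, k = 0.1 N cm−1, g = 10 N kg−1, amplitude 5.0 cm); the variable names are our own. It also evaluates the maximum speed ωA, the standard s.h.m. result for the speed at the equilibrium position, which part c asks for.

```python
import math

m = 0.5            # mass in kg (500 g)
k = 0.1 * 100      # spring constant: 0.1 N/cm converted to N/m
g = 10.0           # gravitational field strength in N/kg
A_cm = 5.0         # amplitude in cm (release point below equilibrium)

x0_cm = (m * g / k) * 100    # equilibrium extension, converted from m to cm
omega = math.sqrt(k / m)     # angular frequency in rad/s
v_max = omega * A_cm         # maximum speed in cm/s, at the equilibrium position
a_max = omega**2 * A_cm      # maximum acceleration in cm/s^2, at maximum displacement

print(f"x0 = {x0_cm:.0f} cm")
print(f"omega = {omega:.2f} rad/s")
print(f"v_max = {v_max:.1f} cm/s")
print(f"a_max = {a_max:.0f} cm/s^2")
```

Keeping A in centimetres means the speed and acceleration come out in cm s−1 and cm s−2, while ω itself must still be computed from SI values of k and m.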
question
19.2 Write down the equation describing the motion in the following cases:
a An oscillator which starts from a maximum displacement of 0.2 m and has a
frequency of 10.0 Hz.
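[Figure: a simple pendulum of length L with a bob of mass m displaced through angle θ. Labels show the string tension FT, the distance L sin θ, the displacement x, the weight mg and its components mg cos θ and mg sin θ.]
Figure S19.3 A free-body force diagram of a simple pendulum. The dotted lines represent the
components of the weight resolved in directions parallel and perpendicular to the string.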
Applying the angular form (τ = Iα ) of Newton’s second law to the pendulum, we get:
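$$-Lmg\sin\theta = mL^2\frac{d^2\theta}{dt^2}$$
The negative sign appears because the torque acts to reduce θ. Rearranging and cancelling, we can write this as:
$$\frac{d^2\theta}{dt^2} + \frac{g}{L}\sin\theta = 0$$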
This is the equation of motion for the pendulum. Notice that this equation is non-linear
(because of the sine term) and does not represent s.h.m.
However, for small angles θ (say, less than 10°), we can use the approximation sinθ ≈ θ ,
and then the equation becomes:
$$\frac{d^2\theta}{dt^2} + \frac{g}{L}\theta = 0$$
This is now the s.h.m. equation, with angular frequency $\omega = \sqrt{\dfrac{g}{L}}$.
Note that we could also express the equation in terms of the arc length, by using x = Lθ :
$$\frac{1}{L}\frac{d^2x}{dt^2} + \frac{g}{L^2}x = 0$$
which simplifies to the s.h.m. equation in x, with the same angular frequency:
$$\frac{d^2x}{dt^2} + \frac{g}{L}x = 0$$
If we wanted to determine how good an approximation s.h.m. is to the motion of a real
pendulum, we could make a computer model of the original equation and examine how
different it is to s.h.m. for a range of given swing angles.
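One possible form of such a computer model is sketched below. It is an illustration, not the method of the original text: it integrates the full equation d²θ/dt² = −(g/L) sin θ with the velocity Verlet scheme (our choice of method), times the first quarter-swing, and compares the resulting period with the s.h.m. value 2π√(L/g). The length L = 1.0 m, g = 9.81 m s−2, the time step and the function names are all assumptions.

```python
import math

def pendulum_period(theta0_deg, L=1.0, g=9.81, dt=1e-4):
    """Period of a pendulum released from rest at a positive angle theta0.

    Integrates d2theta/dt2 = -(g/L) sin(theta) using velocity Verlet and
    times the first quarter-swing (from release until theta reaches zero).
    """
    theta = math.radians(theta0_deg)
    ang_vel = 0.0                          # released from rest
    acc = -(g / L) * math.sin(theta)
    t = 0.0
    while True:
        theta_new = theta + ang_vel * dt + 0.5 * acc * dt**2
        acc_new = -(g / L) * math.sin(theta_new)
        ang_vel_new = ang_vel + 0.5 * (acc + acc_new) * dt
        if theta_new <= 0.0:
            # Linear interpolation to locate the zero crossing within this step
            frac = theta / (theta - theta_new)
            return 4.0 * (t + frac * dt)
        theta, ang_vel, acc = theta_new, ang_vel_new, acc_new
        t += dt

def shm_period(L=1.0, g=9.81):
    """Small-angle (s.h.m.) period, T = 2 pi sqrt(L/g)."""
    return 2.0 * math.pi * math.sqrt(L / g)

for angle in (5, 10, 30, 60):
    ratio = pendulum_period(angle) / shm_period()
    print(f"{angle:2d} degrees: T / T_shm = {ratio:.4f}")
```

The ratio stays very close to 1 for small release angles and grows as the angle increases, which is one way to judge over what range of swing angles the s.h.m. approximation is acceptable.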
question
19.5 Determine the length of a pendulum that completes one oscillation per second,
when displaced by a small angle.
Consider the work done in stretching or compressing the spring. Work is done against
the restoring force F = kx . Since the force changes depending on the extension, we cannot
just substitute this simple equation for force into W = Fd . There are two possible ways to
proceed. One is to plot a graph of F against x: the area under the graph is the work done.
By considering the graph in Figure S19.5, we can see that the work done in stretching or
compressing the spring by a distance x is
$$W = E_p = \frac{1}{2}kx^2$$
This energy is stored as potential energy in the spring (assuming the spring is ‘ideal’, meaning
that it does not heat up when stretched). We can also obtain this result by integration. If the
spring is stretched by a small increment dx, then a small amount of work, dW, is done:
$$dW = F\,dx = kx\,dx$$
Integrating this with respect to x gives us the same equation as we found from plotting the
graph and taking the area under it:
$$W = \int_0^x kx'\,dx' = \frac{1}{2}kx^2$$
[Figure: graph of force F against displacement x between −x0 and +x0, a straight line through the origin with gradient −k.]
The system also has kinetic energy due to the motion of the mass:
$$E_k = \frac{1}{2}mv^2$$
The total energy of the oscillator is the sum of the kinetic and potential energies:
$$E = E_p + E_k = \frac{1}{2}kx^2 + \frac{1}{2}mv^2$$
However, we already have expressions for x and v for a simple harmonic oscillator:
$$x = A\cos(\omega t + \delta)$$
$$v = -A\omega\sin(\omega t + \delta)$$
Substituting these expressions into the energy equation, we get
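$$E = \frac{1}{2}kA^2\cos^2(\omega t + \delta) + \frac{1}{2}mA^2\omega^2\sin^2(\omega t + \delta)$$
and using $\omega^2 = \dfrac{k}{m}$, this becomes
$$E = \frac{1}{2}mA^2\omega^2\cos^2(\omega t + \delta) + \frac{1}{2}mA^2\omega^2\sin^2(\omega t + \delta)$$
Since $\cos^2\phi + \sin^2\phi = 1$ for any angle φ, this simplifies to
$$E = \frac{1}{2}mA^2\omega^2$$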
This total energy is constant at all times during the oscillations (for undamped oscillations).
Over the course of one oscillation, the energy is transferred from kinetic to potential and
back. All of the energy is in the form of kinetic energy at the point when the mass passes
through the equilibrium point, and all of the energy is in the form of potential energy when
the mass is at its maximum displacement from the equilibrium point. Figures 19.22 and
19.23 in the Coursebook illustrate this graphically.
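The exchange between kinetic and potential energy can be illustrated numerically. The sketch below evaluates both energies at several times and checks that their sum always equals ½mA²ω². The values of m, k, A and δ are illustrative assumptions, as are the function and variable names.

```python
import math

def energies(t, m=0.5, k=10.0, A=0.05, delta=0.0):
    """Return (potential, kinetic, total) energy in joules at time t."""
    omega = math.sqrt(k / m)
    x = A * math.cos(omega * t + delta)            # displacement
    v = -A * omega * math.sin(omega * t + delta)   # velocity
    e_p = 0.5 * k * x**2
    e_k = 0.5 * m * v**2
    return e_p, e_k, e_p + e_k

# Predicted constant total energy, (1/2) m A^2 omega^2, with omega^2 = k/m
expected_total = 0.5 * 0.5 * 0.05**2 * (10.0 / 0.5)

for i in range(5):
    t = 0.2 * i
    e_p, e_k, total = energies(t)
    print(f"t = {t:.1f} s: Ep = {e_p:.5f} J, Ek = {e_k:.5f} J, E = {total:.5f} J")
```

At t = 0 the mass is at maximum displacement, so all the energy is potential; at other times the split changes but the total does not.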
Although we have derived this result for the case of a mass on a spring, it is in fact a
general result for mechanical simple harmonic oscillators. Certain problems are more easily
solved by first considering the energy of the system, so this equation is a useful problem-
solving tool – see the Worked example.
When a 100 g mass is placed on the pan of a spring balance, the scale reads 100 g and the pan
is displaced downwards by 0.5 cm. The 100 g mass is removed, and then dropped onto the
spring balance from a height of 2 cm above the pan. What is the maximum reading observed
on the scale during the resulting oscillations? Assume that the scale reading and the pan’s
displacement are linearly related, and use g = 10 N kg−1. Also assume that the pan’s mass is
negligible compared to the mass that is dropped into it.
Step 1 Calculate the spring constant for the balance. A force of 1.0 N gives a compression of
0.5 cm, so
$$k = \frac{F}{x} = \frac{1.0\ \text{N}}{0.005\ \text{m}} = 200\ \text{N m}^{-1}$$
Step 2 Calculate the angular frequency of oscillations for the 100 g mass on the balance.
$$\omega = \sqrt{\frac{k}{m}} = \sqrt{\frac{200\ \text{N m}^{-1}}{0.1\ \text{kg}}} = 45\ \text{rad s}^{-1}$$
Step 3 Calculate the total energy of the oscillations. Since the mass of the pan is much less
than the mass that is landing in the pan, we do not need to include the effects of the
collision and can assume that the mass retains all its kinetic energy. (Note that if the
mass of the pan was significant compared to the dropped mass, we would have to
analyse this as an inelastic collision.) So, the total energy is equal to the potential
energy that the mass had at the start of the drop:
$$E = mgh = 0.1\ \text{kg} \times 10\ \text{N kg}^{-1} \times 0.02\ \text{m} = 0.02\ \text{J}$$
Step 4 Use the formula for the total energy of a simple harmonic oscillator to work out the
amplitude of the oscillations. Rearranging the formula, we get
$$A = \sqrt{\frac{2E}{m\omega^2}} = \sqrt{\frac{2 \times 0.02\ \text{J}}{0.1\ \text{kg} \times \left(45\ \text{rad s}^{-1}\right)^2}} = 0.014\ \text{m}$$
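Steps 1 to 4 can be checked with a few lines of code. This is a sketch using only the numbers given in the question, with variable names of our own choosing. Because mω² = k, the amplitude can be cross-checked with the equivalent expression A = √(2E/k).

```python
import math

g = 10.0            # gravitational field strength in N/kg
m = 0.1             # dropped mass in kg (100 g)
compression = 0.005 # pan displacement under the 100 g mass, in m (0.5 cm)
drop_height = 0.02  # height of the drop above the pan, in m (2 cm)

k = (m * g) / compression                   # Step 1: spring constant in N/m
omega = math.sqrt(k / m)                    # Step 2: angular frequency in rad/s
E = m * g * drop_height                     # Step 3: energy in J, from E = mgh
A = math.sqrt(2 * E / (m * omega**2))       # Step 4: amplitude in m
A_check = math.sqrt(2 * E / k)              # same result, since m * omega^2 = k

print(f"k = {k:.0f} N/m, omega = {omega:.1f} rad/s")
print(f"E = {E:.3f} J, A = {A:.4f} m (check: {A_check:.4f} m)")
```

Note that ω must be squared in the denominator; omitting the square changes the answer by a large factor.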
Summary
■ The condition for simple harmonic motion leads to a differential equation of the
form $\dfrac{d^2x}{dt^2} = -\omega^2 x$.
■ $x = A\cos\omega t$ is a solution to this equation.
■ The expressions for velocity, $v = -A\omega\sin\omega t$, and acceleration, $a = -A\omega^2\cos\omega t$, can
be derived by differentiating the solution to the s.h.m. equation.
■ In simple harmonic motion, the restoring force is $F = -m\omega^2 x$.
■ Phase differences arise between displacement, velocity and acceleration; these arise
naturally from the solutions to the differential equation.
■ The total energy of an undamped simple harmonic system is constant and is given by
$E = \frac{1}{2}mA^2\omega^2$.
Learning Outcomes
■ explain how empirical evidence leads to the gas laws and to the idea of an absolute scale of
temperature
■ understand that a model will begin to break down when the assumptions on which it is based
are no longer valid, and explain why this applies to kinetic theory at very high pressures or
very high or very low temperatures
■ recall and use the first law of thermodynamics expressed in terms of the change in internal
energy, the heating of the system and the work done on the system
■ recognise and use $W = p\Delta V$ for the work done on or by a gas
■ understand qualitatively how the random distribution of energies leads to the Boltzmann
factor $e^{-E/kT}$ as a measure of the chance of a high energy
■ apply the Boltzmann factor to activation processes including rate of reaction, current in a
semiconductor and creep in a polymer
■ describe entropy qualitatively in terms of the dispersal of energy or particles and realise
that entropy is related to the number of ways in which a particular macroscopic state can be
realised
■ recall that the second law of thermodynamics states that the entropy of an isolated system
cannot decrease and appreciate that this is related to probability
■ understand that the second law provides a thermodynamic arrow of time that distinguishes
the future (higher entropy) from the past (lower entropy)
■ understand that systems in which entropy decreases (e.g. humans) are not isolated and that
when their interactions with the environment are taken into account their net effect is to
increase the entropy of the Universe
■ understand that the second law implies that the Universe started in a state of low entropy
and that some physicists think that this implies it was in a state of extremely low probability.
ΔU = q + w
where ΔU is the change in the internal energy of a gas. This law states that internal energy can
be changed either by supplying energy through heating (q) or by doing work on the gas (w).
It is now time to look at the work done on or by a gas in more detail. Consider a piston
containing a gas (Figure S22.1). It has cross-sectional area A and the gas is at pressure p. If
the piston is slowly pushed in to compress the gas, then work is done by the force applied to
the piston (force, F = pA) and it moves through a small distance, x. Hence the work done will
be given by:
$$w = Fx = pAx$$
But Ax is the change in volume of the gas, ΔV, so this work done can be written as
$$w = p\Delta V$$
[Figure S22.1: a gas at pressure p in a cylinder, compressed through a distance x by a piston of area A.]
It is important to keep track of the signs in this equation. If the volume decreases then the
gas is compressed, work is done on the gas and its internal energy increases. If the volume
of the gas increases, then work is done by the gas in pushing the piston out and the internal
energy of the gas decreases.
In order to apply this equation, the change in volume must be small enough that the
pressure does not change. It is also important to measure any heating or cooling that occurs.
• If a gas is compressed very slowly, then there is time for energy to flow out into the
environment as work is being done on the gas. Hence there can be a positive w (work
done on the gas) and a negative q (heat flows out of the gas), leading to no change in
internal energy and hence no change in temperature.
• If a gas expands quickly, it does work (large negative w) but there is no time for heat flow
(q = 0) so the gas cools. This rapid expansion is used in refrigerators.
When solving problems that involve the first law of thermodynamics, it is important
to understand how the description of a situation can be interpreted using the relevant
thermodynamic variables. The effects on key variables of particular conditions are
summarised in Table S22.1.
1 A gas in a syringe is compressed by the piston. Its volume is reduced by 10 cm³, by applying a
pressure of 200 kPa.
a Find the work done on the gas.
b Does the internal energy of the gas increase or decrease?
c How could the gas remain at constant temperature even though work is done?
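For part a, the work follows directly from w = pΔV once the volume is converted to m³. The sketch below is one way to set this out; the function name and the sign convention (ΔV is the final volume minus the initial volume, and a positive result means work done on the gas) are our assumptions, chosen to match the first law written as ΔU = q + w.

```python
CM3_TO_M3 = 1e-6   # 1 cm^3 = 1e-6 m^3

def work_done_on_gas(pressure_pa, delta_v_m3):
    """Work done ON the gas at constant pressure.

    delta_v_m3 is V_final - V_initial, so a compression has a negative
    delta_v_m3 and gives a positive amount of work done on the gas.
    """
    return -pressure_pa * delta_v_m3

# Volume reduced by 10 cm^3 at a pressure of 200 kPa
w = work_done_on_gas(200e3, -10 * CM3_TO_M3)
print(f"work done on the gas = {w:.1f} J")
```

A positive w with no heat flow would raise the internal energy; for the temperature to stay constant, an equal amount of energy must leave the gas as heat, as described in the first bullet point above.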
The pV graph below shows how the pressure and volume of the gas in a cylinder change
around a cycle.
a Use the ideal gas equation, pV = nRT, to explain why returning to the same point on the
graph indicates no change in internal energy.
b Describe in words what happens along each of the lines AB, BC, CD and DA.
c What is the significance of the area enclosed by the box?
[Graph: pressure against volume, showing a rectangular cycle with A and B at the higher pressure and D and C at the lower pressure.]
a If p and V are the same then pV is unchanged. This means that nRT is unchanged. As no gas is
added or lost, T must be constant. In a gas the internal energy, U, depends only on T and so
the internal energy must also be constant.
b Along AB the gas is expanding at constant pressure. It is doing work. Along BC the
gas remains at the same volume as the pressure drops. It is not doing work so it must be
cooling – energy must be being taken out of the gas in the form of heat. Along CD the gas is
compressed, and work is done on it. Along DA the pressure increases again, so heat must be
taken in while no work is done. Along CD the pressure is lower, so less work is done on the
gas than the work done by the gas during expansion.
Around the loop ABCD the sum wAB + wCD + qBC + qDA = 0 (as ∆U = 0) and wCD is less than wAB
(and of opposite sign). This means that the amount of heat taken in (qDA) has to be greater
than that taken out (qBC). The net effect is that thermal energy (q) is put in and work (w) is
taken out.
This is an example of a thermodynamic cycle. A thermodynamic cycle can be followed
repeatedly to do work. The combustion engines in vehicles follow similar cycles to extract
work from the thermal energy released by burning fuel. The cycle can also be reversed to
use mechanical work (e.g. from an electric motor) to extract heat, and this is used in the
cooling unit of an air conditioner or refrigerator.
c The area enclosed is $(p_H - p_L)\Delta V$, where $p_H$ and $p_L$ are the high and low pressures on the
graph. This is the difference between the work done by the gas and the work done on the
gas. In other words, this is the net energy transferred from heat to work by the cycle.
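To make the meaning of the enclosed area concrete, the sketch below evaluates the work along the two constant-pressure sides of a rectangular cycle and compares the difference with the area of the box. The pressures and volume change are hypothetical values chosen for illustration; they do not come from the graph in this example.

```python
CM3_TO_M3 = 1e-6   # 1 cm^3 = 1e-6 m^3

# Hypothetical values for the rectangular cycle ABCD (assumptions)
p_high = 200e3                 # pressure along AB, in Pa
p_low = 100e3                  # pressure along CD, in Pa
delta_v = 10 * CM3_TO_M3       # volume change along AB and CD, in m^3

work_by_gas_AB = p_high * delta_v    # expansion at the higher pressure
work_on_gas_CD = p_low * delta_v     # compression at the lower pressure
# No work is done along BC or DA because the volume does not change.

net_work_out = work_by_gas_AB - work_on_gas_CD
enclosed_area = (p_high - p_low) * delta_v

print(f"work by gas along AB = {work_by_gas_AB:.1f} J")
print(f"work on gas along CD = {work_on_gas_CD:.1f} J")
print(f"net work out = {net_work_out:.1f} J, enclosed area = {enclosed_area:.1f} J")
```

Because the internal energy returns to its starting value after each cycle, this net work out must be supplied by a net input of heat.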
question
22.1 A gas of volume 100 cm³ is at temperature 300 K and pressure 100 kPa. It is compressed
slowly to a volume of 90 cm³. It then expands rapidly back to 100 cm³.
a Determine the temperature and pressure after the initial compression.
b What happens to the work done on the gas?
c How much work does the gas do in expanding?
d Describe what happens to the gas (i) as it expands and (ii) in the following few
minutes.
[Figure: fraction of molecules plotted against kinetic energy for a lower temperature and a higher temperature, with the threshold energy ET marked on the energy axis.]
Figure S22.2 Distribution of energy across molecules in a gas at different temperatures
The peak of the distribution represents the highest fraction of molecules with a particular
energy. The value of the kinetic energy corresponding to the peak is the most probable
energy for any individual molecule. At higher temperatures, the peak of the distribution
shifts to the right, meaning the most probable kinetic energy is higher, and overall the
distribution gets wider. The area under the curve represents the total number of molecules.
The distribution also shows that all possible energies are represented: some of the molecules
move very slowly, others move much faster.
This distribution is crucial to understanding a wide range of physical phenomena from
evaporation to chemical reactions. For such processes to take place, some molecules must
have an energy greater than a threshold value, ET, as shown in Figure S22.2. The number
of molecules with an energy greater than this threshold depends on the temperature. We
can see this using the area under the curve, which corresponds to the number of molecules.
The area under the red curve beyond the threshold (higher temperature) is much greater
than that under the blue curve (lower temperature). The number of molecules beyond the
threshold energy is proportional to a quantity called the Boltzmann factor:
$$N \propto e^{-E/kT}$$
Here T is the absolute temperature and k is the Boltzmann constant, which has the value
k = 1.38 × 10−23 J K−1.
We can use this quantity to determine the effect of changing temperature on physical and
chemical processes, for example to find the effect of a 10 °C rise in temperature on the rate of
a chemical reaction – see Worked example S22.3.
A particular chemical reaction requires an activation energy of 3 × 10−19 J and only molecules
with that energy or greater will take part in the reaction. The rate of reaction is proportional to
the number of molecules with an energy greater than this activation energy. Find the ratio of
the number of molecules which can be involved at 30 °C compared to 20 °C. Hence determine
the effect of changing temperature on the rate of reaction.
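First convert the temperatures to kelvin: 293 K and 303 K. Then the ratio will be:
$$\frac{N_{303}}{N_{293}} = \frac{e^{-E/(k \times 303\ \text{K})}}{e^{-E/(k \times 293\ \text{K})}} = \exp\left[\frac{E}{k}\left(\frac{1}{293\ \text{K}} - \frac{1}{303\ \text{K}}\right)\right] \approx 11.6$$
Therefore a 10 K rise in temperature leads to more than a tenfold increase in the
reaction rate.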
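The ratio in this worked example is easy to evaluate with a short script. The sketch below uses the activation energy and temperatures from the question and the value of k given earlier; the function name `boltzmann_factor` is our own.

```python
import math

K_B = 1.38e-23   # Boltzmann constant in J/K

def boltzmann_factor(energy_j, temperature_k):
    """Boltzmann factor exp(-E / kT) for energy E and absolute temperature T."""
    return math.exp(-energy_j / (K_B * temperature_k))

E_ACTIVATION = 3e-19   # activation energy in J

ratio = boltzmann_factor(E_ACTIVATION, 303.0) / boltzmann_factor(E_ACTIVATION, 293.0)
print(f"N(303 K) / N(293 K) = {ratio:.1f}")
```

Each Boltzmann factor is extremely small on its own; it is the ratio between the two temperatures that determines how much the reaction rate changes.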
Other important processes that depend on a threshold energy are the current in a
semiconductor and creep in polymers.
A semiconductor relies on a small number of electrons being excited to a conduction
band which lies above the valence band, separated by a large energy gap. (See the section
‘Electron energies in solids’ in Chapter 30.) The number of electrons able to enter the
conduction band is determined by the Boltzmann distribution and hence the conductivity
of a semiconductor is very dependent on temperature. This is the basis of the thermistor (see
Chapter 11). Most semiconductors are doped, meaning atoms of other elements are added to
provide conduction electrons. As a result, the conductivity of doped semiconductors is much
less dependent on temperature.
When a material, especially a polymer, is placed under tension, it will extend. The amount
of extension depends on two factors:
• the magnitude of the applied tension, which causes an instantaneous extension
• a quantity called creep, which causes a material to extend more depending on the time for
which the tension is applied.
The amount of creep depends on the material. Even under a constant load, a material
may continue to stretch over time. For many materials this is such a slow process, it may
take hundreds of years to be measurable. However, for many polymers creep is significant,
even at room temperature. For example, very thin plastic shopping bags are easily stretched.
Creep is very important in materials used to make the fan blades of aircraft jet engines,
which operate at very high temperatures and under huge loads. If the blades extend even by
a tiny amount they can hit the outer casing of the engine and cause serious damage. Creep is
again dependent on the Boltzmann distribution: for a material to undergo creep, individual
molecules must exceed a certain threshold energy before they can move within the structure
of the material.
Cambridge Pre-U Physics
question
22.2 The creep rate of a polymer is proportional to the Boltzmann factor. For a given
polymer the threshold energy for creep to occur is 4.5 × 10−18 J. Compare the creep rate
at 0 °C and 25 °C.
question
22.3 Semiconductor A has a band gap of 1.01 × 10−19 J and semiconductor B has a band gap
of 2.23 × 10−19 J. Which will show a greater temperature-dependent current over a
range of 10 °C to 60 °C?
S22.5 Entropy
Entropy is a very important concept in physics, but a difficult one to understand at first. Just
because a reaction can happen energetically, does not mean it will happen. We can determine
only the probability that a reaction will occur. A particular reaction may be possible, but also
highly unlikely.
First consider this example. A box has a divider in the middle. To the left of the divider
are molecules of gas A; to the right are molecules of gas B. When the divider is removed,
the gases will mix, but what causes that? The answer is simply that mixing is the most likely
thing to happen. What we observe is called the macrostate – that the gases are mixed. Other
macrostate observables include thermodynamic variables such as pressure and temperature.
To understand the reasons for this, we need to think about the microstates – the
arrangements of all the individual molecules, which is obviously something we cannot
observe and measure directly for each individual molecule. Look at Figure S22.3, which
shows possible arrangements of just 10 molecules – shown in the diagram as black and white.
Only one arrangement has all of the black on the left and all of the white on the right. There
are 5 ways to have a 4:1 mix on each side as any one of the 5 molecules could cross into the
other half. There are 5 × 4 = 20 ways for there to be a 3:2 mix on each side (any of the 5 can
first cross and then any one of the remaining 4). A 2:3 mix has 20 ways and so on.
[Bar chart: number of ways (axis from 0 to 20) against arrangement (5:0, 4:1, 3:2, 2:3, 1:4, 0:5).]
Figure S22.4 Chart of the possible arrangements of ten molecules in a box separated into two
sections by a divider
This is just for 5 molecules of each gas; even in this very limited example we can see it is
20 times more likely that the gases will mix more or less evenly than that they will separate.
With a mole of molecules, the probability of the gases separating is so small that you could
wait for the entire lifetime of the Universe and still not observe that state. It is not completely
impossible – it is just overwhelmingly improbable.
The quantity called entropy measures the number of possible microstates when the
conditions of a particular macrostate (such as temperature and pressure) are applied.
Entropy is sometimes described as the amount of disorder within a system. In our example
of a small number of molecules forming a mixture, the entropy is highest for there being a
3:2 mix of A and B on either side of the divider when it is removed. The entropy is lowest for
the states in which A and B remain separate. So the most probable outcomes are the ones
with the highest entropy.
The mixture macrostate described above is easy to visualise. However, the main use of
entropy is in understanding the distribution of energy, using the same reasoning as our
mixture example to explain why heat is transferred from hot places to cold places: it results
in a situation that is considerably more probable. One way to picture energy distribution is
to think of energy as little packets, distributed among molecules much like the molecules
in our mixture example were distributed within the box. The most probable outcome is a
distribution of different energies across the molecules rather than all the energy being with
just a few molecules. Mixing hot (high energy) and cold (low energy) is far more likely to
lead to an even distribution of energies producing an equalised temperature, rather than any
other outcome.
In many ways this is one of the most satisfying results possible. Rather than an arbitrary
law that forces something to produce a particular outcome, the most likely outcome happens
because of chance.
A  B  C   Microstate   Arrangement
1  1  1       1           111
3  0  0       2           300
0  3  0       3           300
0  0  3       4           300
0  1  2       5           210
0  2  1       6           210
1  0  2       7           210
1  2  0       8           210
2  1  0       9           210
2  0  1      10           210
Among the ten possible microstates, there are three different arrangements of energy. The
macrostate with the energy arranged evenly can actually only happen in one way. Having all
the energy in one molecule can happen in three ways and the third macrostate, with 2 units
of energy in one molecule and 1 in another, can happen in six different ways. It is important
to remember that the energy will constantly redistribute between these ten microstates. The
most probable macrostate is the third arrangement, because there are six different ways to
achieve this distribution.
It is worth re-reading this example and explanation. At first, it may seem surprising. You
might think that the even distribution is, logically, the most likely. However, this distribution
of energy and the probabilities tell us that the ‘2 1 0’ arrangement is much more likely to
occur: if we could stop time and take a snapshot of the energies, there is a 6 in 10 chance we
will find the ‘2 1 0’ arrangement, a 3 in 10 chance we will find ‘3 0 0’ and just a 1 in 10 chance
we will find ‘1 1 1’.
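The counting in this example can be reproduced by listing every microstate. The sketch below shares three units of energy among three molecules A, B and C, then groups the microstates by arrangement. The function name and the use of the standard-library tools `itertools.product` and `collections.Counter` are our own choices for illustration.

```python
from collections import Counter
from itertools import product

def count_arrangements(units=3, molecules=3):
    """Count the microstates belonging to each arrangement of energy.

    A microstate lists how many units each molecule holds, e.g. (0, 1, 2).
    Its arrangement is the same numbers sorted from largest to smallest,
    written as a string such as '210'.
    """
    counts = Counter()
    for state in product(range(units + 1), repeat=molecules):
        if sum(state) == units:
            arrangement = "".join(str(n) for n in sorted(state, reverse=True))
            counts[arrangement] += 1
    return counts

counts = count_arrangements()
total = sum(counts.values())
for arrangement, ways in counts.most_common():
    print(f"arrangement {arrangement}: {ways} ways out of {total}")
```

Changing `units` and `molecules` shows how quickly the number of microstates grows, which is why particular arrangements become overwhelmingly likely in real gases.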
The power of entropy becomes apparent when we consider energy distributed across very
many more molecules. Particular arrangements become overwhelmingly likely. We see that
the distribution of energy which led to the Boltzmann factor arises simply from the laws of
probability.
Entropy measures the number of ways in which something can be arranged. Because the
laws of probability make the most likely macroscopic state to be the one where there are the
largest number of microstates, systems will tend to evolve into higher entropy situations just
by chance.
The second law of thermodynamics strictly only applies to isolated systems – ones where
there is no energy transfer in or out. It is possible to decrease the entropy of a system, but to
do so requires another system to do work on it, and this second system generates heat and
increases the entropy of the surroundings. An example is a refrigerator: the contents can be
cooled but only if another system, the motor on the outside, extracts the heat and transfers it
to the surrounding room. The refrigerator is not an isolated system.
Another way of expressing the second law of thermodynamics is that in an isolated
system, entropy cannot decrease. This is important to remember when we consider how
living things appear to decrease entropy, arranging molecules into useful structures,
concentrating energy in non-random ways. However, living things are not isolated systems –
the fact that they are alive means there is a constant transfer of energy in and out. For plants,
that energy transfer comes from the Sun and nutrients; for animals it comes mainly from
the chemical energy released in respiration, fuelled by food. It is true that living creatures can
reduce entropy locally – for themselves – but only at the cost of increasing entropy globally.
The largest isolated system known is the entire Universe. No energy is transferred into or
out of the Universe and so its entropy cannot decrease. The second law of thermodynamics
implies that the Universe must have begun in a very low entropy state in order for its entropy
to be increasing continually. Some physicists think that this, in turn, suggests that the state
in which the Universe formed was one of very low probability, which raises interesting
philosophical questions beyond the scope of this text.
Summary
■ The ideal gas laws can only be explored by controlling two of the variables (mass,
pressure, volume and temperature) while changing one and measuring the fourth.
■ The work done on a gas by compressing is pDV
■ On a p-V graph, a cycle which returns to the same point will return the state to the
same internal energy, but the area enclosed by the graph shows the amount of
energy exchanged between work and heat
■ In a gas, the molecules have a distribution of energies
■ The Boltzmann factor e(−E/kT) gives the proportion of molecules in a gas which have
energy above a certain value E at absolute temperature T
■ The distribution of energy amongst molecules is purely due to chance, with the
likelihood of a given state being measured by its entropy
■ The Second Law of Thermodynamics states that in an isolated system, the entropy
cannot decrease over time
■ This gives an “arrow of time” to physical systems where the individual laws of
physics show no asymmetry with time
end-of-chapter questions
S13: Waves and optics
S22.1
Explain the sequence of events in the thermodynamic cycle ABCD shown below:
[Figure: graph of pressure against volume showing the cycle ABCD; the stage starting at A is labelled isothermal (constant temperature).]
S22.2
A gas is compressed by 1500 cm³ at a pressure of 100 kPa. The internal energy of the gas increases by 100 J.
Determine the amount of heat transferred into or out of the gas.
S22.3
The physicist James Clerk Maxwell suggested a thought experiment, in which two containers A and B are filled
with a gas. A and B are connected by a tiny door. A tiny creature controls the door. The creature only allows fast
molecules to pass the door into box A. Slow molecules are only allowed into box B. Gradually, the gas in box A
will increase in temperature, and the gas in box B will decrease in temperature. These two boxes could then run
a ‘heat engine’ that can provide useful work, and the gas will be mixed again. Suggest how the second law of
thermodynamics affects this thought experiment.
S22.4
A teacup falls to the floor and smashes. Is this a reversible process in principle? What about in practice?
Learning Outcomes
■ understand the relationship between electric field and potential gradient, and recall
and use $E = -\frac{dV}{dx}$
■ use integration to derive $W = \frac{Q_1 Q_2}{4\pi\varepsilon_0 r}$ from $F = \frac{Q_1 Q_2}{4\pi\varepsilon_0 r^2}$ for point charges
■ recognise and use $W = \frac{Q_1 Q_2}{4\pi\varepsilon_0 r}$ for the electrostatic potential energy for point charges
The electric potential (V) at a point is equal to the work done in bringing unit positive
charge from infinity to that point.
In the language of calculus, we can write this as the derivative of potential with respect to
distance:
$$E = -\frac{dV}{dx}$$
However, when we use this relationship to find the field strength at a given point, we need
to remember that the electric field strength is a vector – so at any point in space it has both
a magnitude and a direction. In which direction must we take our step dx in order that the
corresponding potential gradient gives us the correct field strength? It turns out that we must
move in the direction of fastest change in electric potential (the steepest slope). This distance
is perpendicular to the equipotential lines – although we need to remember that in three
dimensions, these equipotentials are actually surfaces, not just lines. Figure S23.1 illustrates
this idea.
[Diagram: equipotentials around a point charge +Q; the field E = −dV/dx is perpendicular to the equipotentials.]
Figure S23.1 The electric field strength at any point is a vector which is perpendicular to the
equipotential lines or surfaces, with a magnitude equal to the negative of the potential gradient in
that direction.
Worked example 2 in Chapter 23 of the AS & A Level Coursebook shows how to calculate
the electric field strength from a graph of electric potential against the distance moved
perpendicular to the equipotential lines. If we know a function V(x) that describes how
the potential changes along such a graph, we can use calculus and the expression above to
calculate the electric field strength.
We already know the functions V(x) for particular situations. For example, for a point
charge Q we know that the potential at a distance r from Q is
$$V = \frac{Q}{4\pi\varepsilon_0 r}$$
The equipotentials are spheres, centred on the charge. This means that the electric field,
which is at right angles to the equipotentials, must be radial. So we can differentiate our
expression for the potential V with respect to r to obtain the electric field strength:
$$E = -\frac{dV}{dr} = -\frac{d}{dr}\left(\frac{Q}{4\pi\varepsilon_0 r}\right) = \frac{Q}{4\pi\varepsilon_0 r^2}$$
You will recognise this as the correct expression for the electric field strength at a distance r
from a point charge Q.
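The differentiation above can also be checked numerically. The following Python sketch is illustrative only: the 1 nC charge, the 0.50 m distance and the function names are assumptions chosen for the example. It estimates −dV/dr from the potential using a small step either side of r and compares the result with Q/(4πε0r²).

```python
import math

EPSILON_0 = 8.854e-12  # permittivity of free space in F m^-1


def potential(q, r):
    """Potential V = Q / (4 pi eps0 r) at distance r from a point charge q."""
    return q / (4 * math.pi * EPSILON_0 * r)


def field_from_potential(q, r, dr=1e-6):
    """Estimate E = -dV/dr using a central difference of width 2*dr."""
    return -(potential(q, r + dr) - potential(q, r - dr)) / (2 * dr)


def field_exact(q, r):
    """Field strength E = Q / (4 pi eps0 r^2) for a point charge."""
    return q / (4 * math.pi * EPSILON_0 * r**2)


q = 1.0e-9  # hypothetical charge of 1 nC
r = 0.50    # hypothetical distance in metres

E_numeric = field_from_potential(q, r)
E_exact = field_exact(q, r)
print(E_numeric, E_exact)
```

The two printed values should agree closely, which is what we expect if the field strength really is the negative of the potential gradient in the radial direction.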
In fact, any field where the force changes according to an inverse square law has the same
property: the field strength vector can be expressed as the gradient of a scalar potential
function. In Section S18.3 we saw this with the gravitational field. This has the consequence
that in any such field, when moving from one position to another there is a change in
potential energy that is independent of the path taken. You can follow any path from one
point to another, whether the path is short and direct or long and taking many turns, and the
net work done between the two points against the force produced by the field is the same. In
your physics studies, you have been making use of this idea for a long time. For example, you
know that the same amount of work is done by gravity regardless of whether you jump off a
cliff to reach the bottom or take a gentle, winding path down (nevertheless, you may prefer
one route for other reasons!). Remember, though, that not all forces have this property –
if you push a box against a frictional force, you do more work if you take a longer path.
Another, less obvious, example where the work done is not independent of path is the force
on an electrically charged particle in a magnetic field.
$$F = \frac{Q_1 Q_2}{4\pi\varepsilon_0 r^2}$$
$$dW = F\,dx = -\frac{Q_1 Q_2}{4\pi\varepsilon_0 x^2}\,dx$$
Notice the minus sign. When both charges have the same sign, the work done is negative.
This is to be expected – when both charges have the same sign, the force is repulsive, so the
charges are in a lower energy configuration when they are moved further apart. When the
charges have opposite signs, the work done is positive: the charges attract each other, so work
has to be done to separate them.
In order to calculate the total work done, we need to integrate the expression for dW
from infinity to radius r. We use the variable x in our expression for the derivative to
avoid confusion between the radius r (which is one of the limits for x) and the variable of
integration. The integral we need to evaluate is:
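$$W = \int_{\infty}^{r} -\frac{Q_1 Q_2}{4\pi\varepsilon_0 x^2}\,dx = \left[\frac{Q_1 Q_2}{4\pi\varepsilon_0 x}\right]_{\infty}^{r} = \frac{Q_1 Q_2}{4\pi\varepsilon_0 r} - 0$$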
So the potential energy associated with two point charges Q1 and Q2, separated by
distance r is:
$$W = \frac{Q_1 Q_2}{4\pi\varepsilon_0 r}$$
You need to remember how to produce this derivation.
If we consider the potential energy of a unit positive charge by setting Q2 to 1 C, then we
get the expression for electric potential:
$$V = \frac{Q_1}{4\pi\varepsilon_0 r}$$
Remember that the units of potential V are J C⁻¹ and the units of potential energy
(work done) W are J, so the dimensions are consistent.
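The integration from infinity to r can be imitated numerically by adding up many small contributions dW = F dx. The Python sketch below is a rough check under stated assumptions: the charges (2 µC and 3 µC), the separation (0.80 m), the large starting distance standing in for infinity and the function names are all hypothetical choices, not values from the text.

```python
import math

EPSILON_0 = 8.854e-12  # permittivity of free space in F m^-1


def work_exact(q1, q2, r):
    """Potential energy W = Q1 Q2 / (4 pi eps0 r)."""
    return q1 * q2 / (4 * math.pi * EPSILON_0 * r)


def work_numeric(q1, q2, r, x_far=1.0e6, steps=20_000):
    """Sum dW = -Q1 Q2 / (4 pi eps0 x^2) dx from x_far inwards to r.

    x_far stands in for infinity. The steps shrink geometrically so that
    they are small where the force changes quickly (close to the charge).
    """
    k = q1 * q2 / (4 * math.pi * EPSILON_0)
    ratio = (r / x_far) ** (1.0 / steps)
    total = 0.0
    x = x_far
    for _ in range(steps):
        x_next = x * ratio
        x_mid = math.sqrt(x * x_next)        # representative point in the step
        dx = x_next - x                      # negative: we are moving inwards
        total += -k / x_mid**2 * dx
        x = x_next
    return total


q1, q2 = 2.0e-6, 3.0e-6   # hypothetical charges in coulombs
r = 0.80                  # hypothetical separation in metres

W_sum = work_numeric(q1, q2, r)
W_formula = work_exact(q1, q2, r)
print(W_sum, W_formula)
```

The small difference between the two values comes from starting at a large but finite distance rather than at infinity.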
questions
23.1 What is the potential energy associated with a +40 µC charge at a distance of 1.5 m from
a +20 µC charge?
23.2 What is the potential energy associated with a +40 µC charge at a distance of 1.5 m from
a –20 µC charge?
Summary
■ We can express the relationship between field strength and potential gradient in
mathematical terms as $E = -\frac{dV}{dx}$. When using this, we must calculate $\frac{dV}{dx}$ in a direction
at right angles to the equipotential lines or surfaces.
■ By integrating an expression for work done against the Coulomb force as we move
a charged particle a distance dx in the electric field of another charged particle, we
can obtain the electrostatic potential energy associated with two charged particles
separated by a distance r. This expression is $W = \frac{Q_1 Q_2}{4\pi\varepsilon_0 r}$.
S24: Capacitance
Learning Outcomes
■ analyse graphs of the variation with time of potential difference, charge and current for a
capacitor discharging through a resistor
■ define and use the time constant of a discharging capacitor $\tau = RC$
■ analyse the discharge of a capacitor using equations of the form $x = x_0 e^{-t/RC}$
The capacitor is connected to the resistor at time t = 0. The charge, Q(t), potential
difference, VC(t), and current, I(t), all vary with time as the capacitor discharges. The equation
relating the charge to the potential difference is:
$$Q(t) = C V_C(t)$$
The equation that governs the potential difference across the resistor is:
$$V_R(t) = I(t) R$$
Remember that the current in the capacitor is the rate of flow of charge, so we can write
$$I(t) = \frac{dQ}{dt}$$
and thereby express the equation governing the potential difference across the resistor in
terms of charge:
$$V_R(t) = R\frac{dQ}{dt}$$
Kirchhoff’s second law tells us that the sum of the potential differences around a loop in a
circuit must be zero, therefore
$$V_C(t) + V_R(t) = 0$$
So the potential differences across the capacitor and resistor are of the same magnitude but
opposite in sign. Now, we can substitute in the equations governing the capacitor and the
resistor, to get a differential equation for the charge, Q(t):
$$\frac{Q}{C} + R\frac{dQ}{dt} = 0$$
$$\Rightarrow \frac{dQ}{dt} + \frac{Q}{RC} = 0$$
You may be familiar with this equation: its solution (provided 1/RC is positive, which it is)
is an exponential decay. We will solve this equation now; this form of equation comes up so
often in physics that it is well worth remembering the differential equation and its solution.
We start by separating the variables and then integrating with respect to time t.
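$$\frac{1}{Q}\frac{dQ}{dt} = -\frac{1}{RC}$$
$$\Rightarrow \int \frac{1}{Q}\frac{dQ}{dt}\,dt = \int -\frac{1}{RC}\,dt$$
$$\Rightarrow \int \frac{1}{Q}\,dQ = -\frac{t}{RC} + k$$
$$\Rightarrow \ln Q = -\frac{t}{RC} + k$$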
This means we can also determine the potential difference and the current. The potential
difference is given by
$$V(t) = \frac{Q_0}{C} e^{-t/RC} = V_0 e^{-t/RC}$$
In order to find the current, we need to differentiate the expression for charge with respect to
time:
$$I(t) = \frac{dQ}{dt} = -\frac{Q_0}{RC} e^{-t/RC} = I_0 e^{-t/RC}$$
(the minus sign produced by differentiation tells us that the charge flows out in the opposite
direction to which it flowed in).
Now we have the expressions for charge, current and potential difference as functions of
time; all three exhibit an exponential decay from their initial value. Figure S24.2 illustrates
these functions.
$$Q(t) = Q_0 e^{-t/RC}$$
$$I(t) = I_0 e^{-t/RC}$$
$$V(t) = V_0 e^{-t/RC}$$
[Graphs: a Q(t)/C against t/s, starting at Q0; b V(t)/V against t/s, starting at V0; c I(t)/A against t/s, starting at I0. Each graph marks t = RC, the time at which the quantity has fallen to 1/e of its initial value.]
Figure S24.2 Charge, potential difference and current as a capacitor is discharged.
In terms of R and C, how long does it take for the charge in a capacitor to drop to half its
initial value?
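Setting Q = Q0/2 in the equation for the charge gives $\frac{Q_0}{2} = Q_0 e^{-t/RC}$.
Cancelling the Q0, and taking natural logs of both sides, we get:
$$\ln\frac{1}{2} = -\frac{t}{RC}$$
$$t = RC \ln 2$$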
In the worked example we showed that the time taken for the charge in a capacitor to drop
to half its initial value was RC ln 2. The quantity RC is known as the time constant for the
circuit containing a capacitor connected to a resistor. The time to discharge to a certain level
is, as we have seen, proportional to RC.
In fact, the time RC is the time that it takes for the charge in, current in and potential
difference across a discharging capacitor to drop to a factor of 1/e of their original values.
This is shown on the graphs in Figure S24.2.
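The exponential discharge equations lend themselves to a short calculation. The Python sketch below is a minimal illustration: the component values (10 kΩ and 220 µF) and the function names are assumptions made for this example. It evaluates the time constant, the time to fall to half the initial value, and the fraction remaining after one time constant.

```python
import math


def discharge_value(x0, t, resistance, capacitance):
    """x = x0 * exp(-t / RC) for charge, current or potential difference."""
    return x0 * math.exp(-t / (resistance * capacitance))


def time_to_fraction(fraction, resistance, capacitance):
    """Time for a discharging quantity to fall to `fraction` of its initial value."""
    return -resistance * capacitance * math.log(fraction)


R = 10e3     # hypothetical resistance in ohms
C = 220e-6   # hypothetical capacitance in farads

tau = R * C                              # time constant in seconds
t_half = time_to_fraction(0.5, R, C)     # equals RC ln 2
fraction_after_tau = discharge_value(1.0, tau, R, C)  # equals 1/e

print(tau, t_half, fraction_after_tau)
```

Because every quantity follows the same exponential form, the same two functions apply to charge, current and potential difference.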
question
24.1 A capacitor with a capacitance of 1000 µF is used in a time-delay circuit. The capacitor
is charged to 4.0 V and discharged through a 47 kΩ resistor. When the potential
difference across the capacitor drops to 0.7 V, a transistor circuit is switched off.
a Calculate the time taken for the circuit to switch off (i.e. for the capacitor to
discharge to 0.7 V).
b An electrical engineer swaps the capacitor for a 2500 µF capacitor, but wants the
time taken for the circuit to switch off to remain the same. What value of resistance
should they substitute for the 47 kΩ resistor?
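[Circuit diagram: a cell of e.m.f. Vcell connected to a capacitor C (potential difference VC) and a resistor R (potential difference VR).]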
Therefore we have:
$$\frac{Q}{C} + R\frac{dQ}{dt} = V_{\text{cell}}$$
$$\Rightarrow Q + RC\frac{dQ}{dt} = CV_{\text{cell}}$$
In Section S23.1, we showed that when the left-hand side of this differential equation equals
zero, it has a solution of $Q = Ae^{-t/RC}$. Any multiple of $e^{-t/RC}$ put into this equation will always
give zero. So we need to add something of a different form to the solution in order to get a
non-zero right-hand side. (This form of differential equation with a non-zero right-hand
side is known as an inhomogeneous differential equation; you may have seen this type
of equation in your mathematics studies, where you will have seen its solution called the
particular integral.)
One method of solving this type of equation is to try different forms of solution to see
whether a particular function works. It turns out that if we have a function of the form
$$Q = Ae^{-t/RC} + CV_{\text{cell}}$$
we will get the correct right-hand side of the differential equation. The initial conditions for
charging are somewhat different. When the capacitor is fully charged, its potential difference
will be equal (and opposite) to that of the cell. Therefore it will have a charge Q = CVcell when
t is large. If we charge up a capacitor that is initially completely discharged, we know that the
initial charge is zero. This information tells us that in this case, the constant of integration A
must be −CVcell . The solution is therefore that the charge increases according to the following
equation:
$$Q = CV_{\text{cell}}\left(1 - e^{-t/RC}\right)$$
We can deduce that the potential difference across the capacitor will follow a similar
relationship (increasing until it reaches the same potential difference as the cell):
$$V = V_{\text{cell}}\left(1 - e^{-t/RC}\right)$$
To calculate the current, we need to consider the potential difference across the resistor,
which is
$$V_R(t) = V_{\text{cell}} - V_C(t) = V_{\text{cell}}\,e^{-t/RC}$$
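One way to gain confidence in the charging solution is to step the differential equation forward in small time intervals and compare with the formula. The Python sketch below does this with a simple Euler method. The component values (1 kΩ, 1000 µF, a 6 V cell), the step count and the function names are assumptions chosen for illustration only.

```python
import math


def charge_analytic(t, resistance, capacitance, v_cell):
    """Q = C Vcell (1 - exp(-t / RC)) for a capacitor charged from empty."""
    return capacitance * v_cell * (1 - math.exp(-t / (resistance * capacitance)))


def charge_euler(t_end, resistance, capacitance, v_cell, steps=100_000):
    """Step Q + RC dQ/dt = C Vcell forward in time, starting from Q = 0."""
    dt = t_end / steps
    q = 0.0
    for _ in range(steps):
        dq_dt = (capacitance * v_cell - q) / (resistance * capacitance)
        q += dq_dt * dt
    return q


R = 1.0e3      # hypothetical resistance in ohms
C = 1.0e-3     # hypothetical capacitance in farads
V_CELL = 6.0   # hypothetical cell potential difference in volts

q_formula = charge_analytic(2.0, R, C, V_CELL)
q_stepped = charge_euler(2.0, R, C, V_CELL)
print(q_formula, q_stepped)
```

The stepped value approaches the formula as the number of steps increases, and both tend towards C Vcell at long times.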
Summary
■ When a capacitor discharges through a resistor, the potential difference, current and
charge follow the exponential form $x = x_0 e^{-t/RC}$
Learning Outcomes
■ explain the Hall effect, and derive and use VH = Bvd
■ derive, recall and use $r = \frac{mv}{BQ}$ for the radius of curvature of a charged particle moving
in a magnetic field
This equation can be applied to other charged particles, if we consider a charge Q in place of
the electron. The equation becomes:
$$r = \frac{mv}{BQ}$$
Remember, though, that a positively charged particle will travel in the opposite direction
around the path compared to the negatively charged electron.
$$\frac{eV_H}{d} = Bev$$
We can re-arrange this equation to express the Hall voltage in the form:
VH = Bvd
This form of equation for the Hall voltage may be more appropriate when solving particular
types of problems.
Summary
■ The Hall voltage can be expressed in the form: VH = Bvd
Learning Outcomes
■ recognise and use $E = -\frac{d(N\phi)}{dt}$ and explain how it is an expression of Faraday’s and
Lenz’s laws
$$E = \frac{\Delta(N\phi)}{\Delta t}$$
Expressed in words, this means that the magnitude of the induced e.m.f. is proportional to
the rate of change of magnetic flux linkage (Nφ). We can also write this law as a derivative:
$$E = \frac{d(N\phi)}{dt}$$
If we have a formula that expresses the flux linkage as a function of time, we can use calculus
to determine the magnitude of the induced e.m.f. (One example of such a function is when
we have a coil that turns at a known rate.)
We can also combine Faraday’s law and Lenz’s law into a single equation:
$$E = -\frac{d(N\phi)}{dt}$$
This tells us that the induced e.m.f. and the change in magnetic flux linkage have opposite
signs. This is a mathematical way of expressing Lenz’s law: the induced e.m.f. will be established
in a direction so as to produce effects which oppose the change that is producing it.
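As an illustration of using calculus on a flux linkage function, consider a coil turning at a known rate. The Python sketch below assumes, for the purpose of the example only, that the flux linkage varies as Nφ = NBA cos(ωt). The coil values (200 turns, 0.050 T, 0.010 m², 50 rotations per second) and the function names are hypothetical. The code estimates −d(Nφ)/dt numerically and compares it with the result of differentiating the assumed function by hand, NBAω sin(ωt).

```python
import math


def flux_linkage(t, turns, b_field, area, omega):
    """Assumed flux linkage N*phi = N B A cos(omega t) for a rotating coil."""
    return turns * b_field * area * math.cos(omega * t)


def induced_emf(t, turns, b_field, area, omega, dt=1e-6):
    """Estimate E = -d(N phi)/dt with a central difference."""
    later = flux_linkage(t + dt, turns, b_field, area, omega)
    earlier = flux_linkage(t - dt, turns, b_field, area, omega)
    return -(later - earlier) / (2 * dt)


N_TURNS = 200                 # hypothetical number of turns
B_FIELD = 0.050               # hypothetical flux density in tesla
AREA = 0.010                  # hypothetical coil area in m^2
OMEGA = 2 * math.pi * 50      # hypothetical angular speed in rad s^-1

t = 0.003  # time in seconds
emf_numeric = induced_emf(t, N_TURNS, B_FIELD, AREA, OMEGA)
emf_by_hand = N_TURNS * B_FIELD * AREA * OMEGA * math.sin(OMEGA * t)
print(emf_numeric, emf_by_hand)
```

At this instant the flux linkage is decreasing, so the induced e.m.f. given by the equation with the minus sign is positive, in line with Lenz’s law.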
Summary
■ The equation for the e.m.f. induced across a coil when the magnetic flux linking the
coil changes is $E = -\frac{d(N\phi)}{dt}$, which combines Faraday’s and Lenz’s laws.
Learning Outcomes
■ explain atomic line spectra in terms of photon emission and transitions between discrete
energy levels
■ apply E = hf to radiation emitted in a transition between energy levels
■ show an understanding of the hydrogen line spectrum, photons and energy levels as
represented by the Lyman, Balmer and Paschen series
■ recognise and use the energy levels of the hydrogen atom as described by the empirical
equation $E_n = -\frac{13.6}{n^2}$ eV
■ explain energy levels using the model of standing waves in a rectangular one-dimensional
potential well
■ derive the hydrogen atom energy level equation $E_n = -\frac{13.6}{n^2}$ eV algebraically using the
model of electron standing waves, the de Broglie relation and the quantisation of angular
momentum
■ understand the use of stopping potential to find the maximum kinetic energy of
photoelectrons
■ plot a graph of stopping potential against frequency to determine the Planck constant, work
function and threshold frequency
Hydrogen series
The wavelengths of light emitted by hydrogen atoms to form the lines of an emission
spectrum are best understood by thinking about different series of lines. All the lines in a
given series involve transitions that end on the same energy level, and which start at each of
the higher levels. These series are named after the various scientists involved in measuring
them – see Figure S30.1.
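[Energy-level diagram, n against E/eV: n = 1 (ground state) −13.6; n = 2, −3.40; n = 3, −1.51; n = 4, −0.85; n = 5, −0.54; n = 6, −0.38; top of diagram 0.00. Levels above n = 1 are excited states. The Lyman series (UV) ends on n = 1, the Balmer series on n = 2 and the Paschen series (IR) on n = 3.]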
Figure S30.1 Energy levels of the hydrogen atom with some of the transitions between them that
give rise to the spectral lines indicated.
In Figure S30.1, a new notation is introduced. Alongside the energies in the diagram there is
also a numerical label for each energy level, called the principal quantum number, n. The
Lyman series of lines are all transitions to the lowest energy level, n = 1, called the ground
state. All of these transitions have a minimum energy of 13.6 − 3.4 = 10.2 eV, which is the
energy difference between n = 2 and n = 1. The lowest frequency photon emitted in the Lyman
series has an energy of 10.2 eV and hence a wavelength of $\lambda = \frac{hc}{E} = 121$ nm, which is in the
ultraviolet. This is the energy calculated in the section ‘Photon energies’ of Chapter 30 of the
Coursebook.
All the other lines in the Lyman series are of greater energy, and so greater frequency and
shorter wavelength, but they converge towards a limit. No transition will have an energy
greater than 13.6 eV, as this would involve transitions from an energy level above zero
(an electron with such energy would not be bound to the hydrogen atom). The observed
spectrum of hydrogen shows many lines getting closer and closer together, converging to a
limit corresponding to an energy of 13.6 eV.
The next series involves transitions to the level n = 2. This Balmer series is one of the most
important for observations, because the transitions largely fall into the visible spectrum and so
were amongst the first observed (see Worked example S30.1). The next series to level n = 3, the
Paschen series, involves transitions of much lower energy and longer wavelength, in the infrared
area of the electromagnetic spectrum.
Find the wavelength of the light emitted due to a transition from n = 3 to n = 2. This is called
the Balmer alpha line.
The energy gap is −1.51 − (−3.40) eV = 1.89 eV = $1.89 \times 1.6 \times 10^{-19}$ J = $3.02 \times 10^{-19}$ J
Using $\lambda = \frac{hc}{E}$ we find $\lambda = 6.58 \times 10^{-7}$ m = 658 nm, which is in the red part of the visible
spectrum.
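The steps of this worked example can be collected into a small calculation. The Python sketch below is illustrative: the function names are our own, and the rounded constants (h = 6.63 × 10⁻³⁴ J s, c = 3.00 × 10⁸ m s⁻¹, e = 1.60 × 10⁻¹⁹ C) are assumptions, so the last digit of a result may differ slightly from values quoted elsewhere. Energy levels are taken as −13.6 eV divided by n².

```python
H_PLANCK = 6.63e-34   # Planck constant in J s (rounded, assumed)
C_LIGHT = 3.00e8      # speed of light in m s^-1 (rounded, assumed)
E_CHARGE = 1.60e-19   # elementary charge in C (rounded, assumed)


def energy_level_ev(n):
    """Hydrogen energy level in eV for principal quantum number n."""
    return -13.6 / n**2


def transition_wavelength_nm(n_upper, n_lower):
    """Wavelength in nm of the photon emitted when falling from n_upper to n_lower."""
    gap_ev = energy_level_ev(n_upper) - energy_level_ev(n_lower)
    gap_joules = gap_ev * E_CHARGE
    return H_PLANCK * C_LIGHT / gap_joules * 1e9


balmer_alpha = transition_wavelength_nm(3, 2)   # red line of the Balmer series
lyman_alpha = transition_wavelength_nm(2, 1)    # lowest-energy Lyman line
print(round(balmer_alpha), round(lyman_alpha))
```

The same function can be applied to any pair of levels, which makes it easy to see how the lines of a series crowd together towards the series limit.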
question
30.1 Find the longest wavelength and the shortest wavelength lines in the Paschen series.
As we have seen, n = 1 corresponds to the ground state. Higher values of n correspond to the
excited states, which get closer and closer to the ionisation energy as n tends towards infinity.
For n = 1 this clearly gives the value E1 = −13.6 eV. The other energies are also straightforward
to calculate (see Table S30.1).
n    E/eV
1    −13.6
2    −3.40
3    −1.51
4    −0.85
5    −0.54
6    −0.38
7    −0.28
Table S30.1 Energy levels of hydrogen compared to the ionisation energy.
questions
30.2 Calculate the energy levels of hydrogen for n = 8 and n = 10. From these results,
calculate the transition energy.
30.3 Find the equivalent formula to $E_n = -13.6\ \text{eV}/n^2$ for the hydrogen energy levels, but in J
instead of eV.
[Diagram: potential well with walls at −a and a, containing three standing waves.]
Figure S30.2 Representation of an infinite potential well and three electron (standing) waves
within it.
Because the walls of the well are infinitely high the electron wave has a value of zero at
±a. This means the electron can have a wavelength of 4a (blue line), 2a (red line), 4a/3
(orange line) and so on. In general, the allowed electron waves have wavelength 4a/n
where n = 1, 2, 3…
$$p = \frac{h}{\lambda} = \frac{nh}{4a}$$
and we can also write the kinetic energy, $KE = \frac{1}{2}mv^2$, as $KE = \frac{p^2}{2m}$.
Hence the electron’s KE is given by
$$KE = \frac{n^2h^2}{32a^2m}$$
As the potential energy at the bottom of the well is 0, the formula above represents the
electron’s total energy. It predicts that the electron in this (artificial) potential well can have
only specific energies governed by the integer values of n. This is an example of quantisation
of energy and it arises from the wave-like behaviour of electrons.
Another way to write this same rule is to say (as Bohr did) that the orbital angular
momentum of the electron can only take on fixed values.
The requirement that there are a fixed number of wavelengths in an orbit means that:
$$n\lambda = 2\pi r$$
This is the quantum part of the calculation – the rest is classical mechanics. In order for the
electron to follow a circular orbit there must be a centripetal force, which is provided by the
attraction between the electron and the nucleus:
$$\frac{mv^2}{r} = \frac{Ze^2}{4\pi\varepsilon_0 r^2}$$
where e is the fundamental electron charge and Z is the proton (atomic) number. Although
the equation is only true for hydrogen, with one electron, by including Z we can make
predictions for hydrogen-like atoms, such as He+ and Li++ (doubly ionised lithium).
This equation can be rearranged to make v the subject:
$$v = \sqrt{\frac{Ze^2}{4\pi\varepsilon_0 m r}}$$
$$KE = \frac{1}{2}mv^2 = \frac{Ze^2}{8\pi\varepsilon_0 r}$$
The potential energy of the electron is not zero, but is given by the laws of electrostatics:
$$PE = -\frac{Ze^2}{4\pi\varepsilon_0 r}$$
$$E = KE + PE = -\frac{Ze^2}{8\pi\varepsilon_0 r}$$
That the energy is negative is a sign that the electron is bound within the atom. The energy of the
electron is fixed by its radius. We now use the quantisation rule relating r and the wavelength:
$$n\lambda = 2\pi r$$
$$p = \frac{h}{\lambda} = \frac{nh}{2\pi r}$$
$$KE = \frac{p^2}{2m} = \frac{n^2h^2}{8\pi^2 r^2 m}$$
But we also have derived that $KE = \frac{Ze^2}{8\pi\varepsilon_0 r}$.
Putting these two expressions equal to each other gives an equation for r:
$$\frac{n^2h^2}{8\pi^2 r^2 m} = \frac{Ze^2}{8\pi\varepsilon_0 r}$$
So $r = \frac{\varepsilon_0 n^2 h^2}{\pi m Z e^2}$
There are only specific values of r allowed. Putting those values back into the expression for
the total electron energy, E:
$$E = -\frac{Ze^2}{8\pi\varepsilon_0 r} = -\frac{Z^2e^4m}{8\varepsilon_0^2h^2n^2} = \frac{R}{n^2}$$
where, for hydrogen (Z = 1), $R = -\frac{Z^2e^4m}{8\varepsilon_0^2h^2} = -21.7 \times 10^{-19}$ J = −13.6 eV
Bohr’s analysis produces the empirical (found by experiment) formula for the energy levels
of hydrogen. It also predicts that the energy levels of ionised helium (He+) will be 4 times
greater.
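The numerical value of the constant in Bohr’s formula can be checked by substituting values for the fundamental constants. The Python sketch below is a minimal check: the function name is our own and the constants are rounded to four significant figures, which is an assumption of this example.

```python
E_CHARGE = 1.602e-19   # elementary charge in C
M_ELECTRON = 9.109e-31 # electron mass in kg
EPSILON_0 = 8.854e-12  # permittivity of free space in F m^-1
H_PLANCK = 6.626e-34   # Planck constant in J s


def bohr_energy_joules(n, z=1):
    """E = -Z^2 e^4 m / (8 eps0^2 h^2 n^2) for a one-electron atom."""
    return -(z**2 * E_CHARGE**4 * M_ELECTRON) / (
        8 * EPSILON_0**2 * H_PLANCK**2 * n**2
    )


ground_state_ev = bohr_energy_joules(1) / E_CHARGE          # hydrogen, n = 1
helium_ion_ev = bohr_energy_joules(1, z=2) / E_CHARGE       # He+, n = 1
print(ground_state_ev, helium_ion_ev)
```

The hydrogen ground state comes out close to −13.6 eV, and setting Z = 2 multiplies every level by 4, as stated above for ionised helium.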
Angular momentum
An alternative and equivalent approach to deriving Bohr’s formula (in fact, the one Bohr
himself used) is to start from the assumption that the angular momentum (L) of the
electrons is quantised – it can only take on specific values given by:
$$L = \frac{nh}{2\pi}$$
As L = mvr (see Chapter 17 of the Coursebook ‘Circular motion’ Sections S17.1 to S17.4), this
is equivalent to saying that:
$$mv = p = \frac{L}{r} = \frac{nh}{2\pi r}$$
which is identical to the quantisation rule given by the standing wave argument. (In
fact, the standing wave version we saw earlier was developed in 1924 by de Broglie as an
interpretation of this angular momentum rule.)
You should be able to use both methods to derive Bohr’s formula.
question
30.4 A generalised formula for the energy levels of an atom with one electron is:
En = − Z 2 × 13.6 eV / n 2
where Z is the proton (atomic) number. Find the ionisation energy of a lone electron
orbiting the nucleus of a silicon atom ( Z = 14 , so this would be Si13+).
hf = Φ + k.e.max
rearranged to give:
k.e.max = hf − Φ
We shine light of different frequencies onto a metal surface and measure the kinetic energy
of the emitted electrons. A graph of k.e.max against f will have an intercept of −Φ.
[Circuit diagram: monochromatic radiation incident on a photocell with an anode and a cathode, connected to a voltmeter V.]
Monochromatic radiation of different wavelengths can be generated from a bright white light
source such as a slide projector, with different coloured filters placed in front. The wavelength
passed by the filters is marked on them and so the frequency of the light can be calculated.
The photocell is in a vacuum so the electrons emitted from the cathode do not lose any
energy in collisions. For each colour of light available, the voltage of the supply is gradually
increased until the microammeter registers zero current. This voltage, called the stopping
potential, is noted, and the experiment repeated with a new colour of light.
The stopping potential is related to the maximum kinetic energy of the electrons by the
following equation:
e × Vstopping = k.e.max
To see why this is, think about the electrons emitted from the cathode. They have kinetic
energies from zero to k.e.max and will travel freely to the anode. However, the anode is at
a negative potential due to the power supply and so the electrons are repelled from it. The
energy they need to cross a potential difference V is eV. The current will drop to zero once
there are no electrons with sufficient energy to reach the anode, that is, once eV is equal to
k.e.max.
[Graph: stopping potential/V against light frequency/Hz; a straight line with gradient = h/e and a marked y-intercept.]
Figure S30.5 A graph of stopping potential against light frequency enables us to determine the
work function.
Instead of plotting k.e.max on the y-axis, we have plotted the stopping potential, V = k.e.max /e.
Again by considering Einstein’s equation:
k.e.max = hf − Φ = eV
V = (h / e ) f − Φ / e
We can see that the y-intercept is −Φ/e and the gradient is h/e. The x-intercept shows the
point at which electrons would just be emitted with zero kinetic energy, the threshold
frequency.
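The graphical analysis can be sketched as a straight-line fit. In the Python example below the data are synthetic, not experimental: the stopping potentials are generated from an assumed work function of 2.0 eV and rounded values of h and e, so the fit simply recovers the numbers that were put in. The frequencies and all names are hypothetical. The point is to show how the gradient, y-intercept and x-intercept turn into the Planck constant, the work function and the threshold frequency.

```python
H_ASSUMED = 6.63e-34   # Planck constant in J s, used only to make the synthetic data
E_CHARGE = 1.60e-19    # elementary charge in C (rounded, assumed)
PHI_EV = 2.0           # hypothetical work function in eV

# Synthetic (made-up) readings: V = (h/e) f - phi/e
frequencies = [5.5e14, 6.0e14, 6.5e14, 7.0e14, 7.5e14]   # Hz
stopping_potentials = [
    (H_ASSUMED * f - PHI_EV * E_CHARGE) / E_CHARGE for f in frequencies
]


def fit_line(xs, ys):
    """Least-squares straight line through the points; returns (gradient, intercept)."""
    n = len(xs)
    x_mean = sum(xs) / n
    y_mean = sum(ys) / n
    sxx = sum((x - x_mean) ** 2 for x in xs)
    sxy = sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, ys))
    gradient = sxy / sxx
    intercept = y_mean - gradient * x_mean
    return gradient, intercept


gradient, intercept = fit_line(frequencies, stopping_potentials)
h_from_fit = gradient * E_CHARGE          # gradient = h/e
phi_from_fit_ev = -intercept              # y-intercept = -phi/e, so phi in eV
threshold_frequency = -intercept / gradient   # x-intercept
print(h_from_fit, phi_from_fit_ev, threshold_frequency)
```

With real measurements the points would scatter about the line, and the uncertainty in the gradient would limit how precisely h can be found.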
Summary
■ The energy levels of hydrogen are given by $E_n = -13.6\ \text{eV}/n^2$ where n is the principal
quantum number and is a positive integer.
■ Electrons create standing waves and so can only have fixed de Broglie wavelengths
in a potential well or an atom.
■ The de Broglie wavelength is linked to electron momentum by $p = \frac{h}{\lambda}$ and
momentum is linked to kinetic energy by $KE = \frac{p^2}{2m}$.
■ The formula for energy levels in an atom can be derived either by using electron
standing waves or by using the quantisation of angular momentum.
End-of-chapter questions
S30.1
The Balmer series starts with a red line of wavelength 658 nm. Further lines in the series
are of shorter wavelength. A transition from which excited state is the first one to have a
wavelength below 400 nm, i.e. in the ultraviolet?
S30.2
Find the radius of orbit of the ground state of hydrogen and hence the orbital velocity
and angular momentum (mvr). Express this in units of (h/2π).
Learning Outcomes
■ show that the random nature of radioactive decay leads to the differential equation
$\frac{dN}{dt} = -\lambda N$ and that $N = N_0 e^{-\lambda t}$ is a solution to this equation
■ recognise and use the equation $I = I_0 e^{-\mu x}$ as applied to attenuation losses
■ recall that radiation emitted from a point source and travelling through a non-absorbent
material obeys an inverse square law and use this to solve problems
■ estimate the size of a nucleus from the distance of closest approach of a charged particle
■ relate the equation $\Delta E = \Delta m c^2$ to the creation or annihilation of particle–antiparticle pairs
■ understand how the conservation laws for energy, momentum and charge in beta-minus
decay were used to predict the existence and properties of the anti-neutrino
We shall now use these definitions to take a more mathematical approach to radioactive
decay.
The two equations for A must be the same and so we can write that:
$$A = -\frac{\Delta N}{\Delta t} = \lambda N$$
$$\frac{dN}{dt} = -\lambda N$$
This is a differential equation and there are several ways in which to solve it. We meet
equations of this form often in physics, so it is easiest simply to recall that the solution is of
the form:
$$N(t) = N_0 e^{-\lambda t}$$
where N(t) is the number of undecayed nuclei after a time t and N0 is the number of
undecayed nuclei at time t = 0. We can substitute these values of N into the differential
equation to show that these values do indeed solve the equation:
$$N(t) = N_0 e^{-\lambda t}$$
$$\frac{dN}{dt} = -\lambda N_0 e^{-\lambda t} = -\lambda N(t)$$
The number of radioactive atoms decays exponentially with time, as shown in the graph in
the Coursebook (Figure 31.10).
From the definition of activity, $A = -\frac{dN}{dt}$, it follows that:
$$A = \lambda N_0 e^{-\lambda t} = \lambda N(t)$$
and
$$A = A_0 e^{-\lambda t}$$
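The link between random decay and the exponential law can be explored with a simple simulation. In the Python sketch below, each undecayed nucleus is given the same small chance of decaying in every time step. The starting number (20 000), the decay chance per step (0.01), the number of steps and the function name are all assumptions chosen for illustration. The final count is compared with N0e^(−λt), taking λ as the decay chance per step.

```python
import math
import random


def simulate_decay(n0, decay_probability, steps, seed=1):
    """Return the number of undecayed nuclei after each time step.

    Every remaining nucleus decays independently with the same
    probability in each step, which models the random nature of decay.
    """
    rng = random.Random(seed)   # fixed seed so the run is repeatable
    remaining = n0
    history = [remaining]
    for _ in range(steps):
        decayed = sum(1 for _ in range(remaining) if rng.random() < decay_probability)
        remaining -= decayed
        history.append(remaining)
    return history


N0 = 20_000     # hypothetical initial number of nuclei
LAMBDA = 0.01   # hypothetical decay probability per time step
STEPS = 100

history = simulate_decay(N0, LAMBDA, STEPS)
predicted = N0 * math.exp(-LAMBDA * STEPS)
print(history[-1], predicted)
```

The simulated count fluctuates from run to run if the seed is changed, but it stays close to the exponential prediction, and the agreement improves as N0 is made larger.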
Attenuation of radiation
The radiation emitted by unstable nuclei ionises matter. This means that the radiation
steadily loses energy as it progresses through matter. Therefore, the intensity of the radiation
also reduces as it progresses through matter. This absorption is called attenuation of the
radiation and is expressed mathematically as follows (see Figure S31.1):
$$I = I_0 e^{-\mu x}$$
In this equation, I is the intensity of the radiation, I0 is the intensity just before the radiation
enters the matter and x is the distance travelled through the matter. The quantity μ is called
the attenuation coefficient, which depends both on the type and the energy of the radiation
as well as the nature of the matter itself. The attenuation coefficient μ has units of m−1 (see
Worked example S31.1).
[Diagram: radiation of intensity I0 enters matter with absorption coefficient µ and emerges with intensity I0e^(−µx).]
Figure S31.1 The transmission and absorption of radiation as it passes through matter.
Gamma rays of energy 1.0 MeV are fired at a sheet of lead of thickness 5.0 cm. Lead has an
attenuation coefficient of 80 m−1. If the incident gamma rays have an intensity of 10 mW m−2,
find the intensity after passing through the lead.
$$I = I_0 e^{-\mu x}$$
$$= 10 e^{-4}$$
$$= 0.18 \text{ mW m}^{-2}$$
Notice in this calculation that the intensity was left in units of mW m−2, so the answer should
be in the same units, but that the thickness of the lead had to be converted to metres in
order to match the unit of the attenuation coefficient.
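The same calculation can be written as a short function, which makes the unit conversion explicit. This Python sketch uses the values of the worked example. The function name is our own choice for illustration.

```python
import math


def transmitted_intensity(i0, mu, thickness_m):
    """I = I0 * exp(-mu * x), with mu in m^-1 and x in metres.

    The result has the same unit as i0.
    """
    return i0 * math.exp(-mu * thickness_m)


thickness_cm = 5.0
thickness_m = thickness_cm / 100   # convert cm to m to match mu in m^-1

intensity = transmitted_intensity(10.0, 80.0, thickness_m)  # in mW m^-2
print(round(intensity, 2))
```

Forgetting the conversion and using x = 5.0 would give an exponent of −400 instead of −4, so it is worth checking that µx has no units before evaluating the exponential.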
Two sources of gamma radiation are of equal power. A detector is placed 1.0 cm from the first
source and 5.0 cm from the second. What is the ratio of the intensity of radiation from the
first compared to the second?
Let the power of each source be P. The intensity at 1.0 cm distance will be
$$I_1 = \frac{P}{4\pi \times (0.010)^2}$$
$$I_5 = \frac{P}{4\pi \times (0.050)^2}$$
This could also have been solved by simply squaring the ratio of the distances, $(5:1)^2 = 25:1$, and
recalling that I1 must be greater than I5.
In practice, we do not rely on just attenuation or distance alone to protect ourselves from
sources of radiation. For example, a medical radiographer taking an X-ray image will stand
at some distance from the source and stand behind an attenuating screen. In school and
college laboratories, all radioactive sources are kept in lead boxes. These are then stored a
long way from where people usually work.
questions
31.1 A material has an attenuation coefficient of 65 m−1 for gamma rays of energy 3.0 MeV.
a Express the attenuation coefficient in units of cm−1.
b Find the fractional reduction in intensity after i 1 cm and ii 30 cm.
31.2 The maximum safe level of a particular radiation is deemed to be 100 nW cm−2. How far
from a source of power 10 W would it be safe to stand, assuming no attenuation by the
surrounding medium?
[Diagram: an alpha particle approaching a nucleus and turning back at the radius of closest approach.]
Figure S31.2 An alpha particle reflected back along its path of approach cannot approach closer
than distance r from the centre of the nucleus.
We can use the idea of electrostatic potential (see Chapter 23 in the Coursebook) to work
out that closest distance. If the alpha particle has a kinetic energy E initially and zero at the
instant it turns around, at that same instant it must have electrostatic potential energy E
because energy is conserved:
kinetic energy + electrostatic potential energy = constant
Using the equation for potential energy from Chapter 20 (Electric fields) we can write:
$$E = \frac{Q_1 Q_2}{4\pi\varepsilon_0 r}$$
Rutherford’s experiment used alpha particles with a charge $Q_1 = +2e = 3.2 \times 10^{-19}$ C and
gold nuclei with a charge $Q_2 = +79e = 1.3 \times 10^{-17}$ C. The alpha particles had kinetic energy
$E = 1.07 \times 10^{-12}$ J and so the distance of closest approach can be found to be $3.4 \times 10^{-14}$ m.
The importance of this result is that it is many times smaller than the radius of an atom,
which Rutherford estimated to be about $10^{-10}$ m, and so he could prove that an atomic
nucleus is very small. Note that this gives an upper limit to the size of the nucleus –
Rutherford realised that it could, in fact, be still smaller. In order to investigate the actual
size of a nucleus, it was necessary to use higher and higher energy alpha particles or protons
from particle accelerators. However, particle accelerators were only developed over 20 years
after Rutherford’s alpha-scattering experiment.
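As a check on the distance of closest approach, the energy equation can be rearranged for r and evaluated. This is a minimal sketch: the values of ε0 and e are standard constants that are not quoted in the text, and the function name is hypothetical.

```python
import math

EPSILON_0 = 8.85e-12  # permittivity of free space in F m^-1 (standard value, assumed)
E_CHARGE = 1.60e-19   # elementary charge in C (standard value, assumed)

def closest_approach(q1, q2, kinetic_energy):
    """Distance r at which all the kinetic energy has become electrostatic
    potential energy: E = q1 * q2 / (4 * pi * eps0 * r), rearranged for r."""
    return q1 * q2 / (4 * math.pi * EPSILON_0 * kinetic_energy)

q_alpha = 2 * E_CHARGE   # alpha particle, charge +2e
q_gold = 79 * E_CHARGE   # gold nucleus, charge +79e
r = closest_approach(q_alpha, q_gold, 1.07e-12)
print(f"distance of closest approach = {r:.2e} m")
```

Doubling the kinetic energy halves r, which is why higher-energy particles were needed to probe the nucleus more closely.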
A positron and an electron, each of mass 9.11 × 10⁻³¹ kg, annihilate each other to produce
two gamma rays. In order to conserve momentum, the gamma rays are emitted in opposite
directions with equal energy. We will assume that the electron and positron were both
initially at rest.
The kinetic energy of each photon is given by ∆E = ∆mc², where ∆m is the mass of an
electron. So
$$\Delta E = 9.11 \times 10^{-31} \times c^2 = 9.11 \times 10^{-31} \times (3.00 \times 10^{8})^2 = 8.20 \times 10^{-14}\ \mathrm{J}$$
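Using the Einstein relation $E = hc/\lambda$ (see Chapter 30 of the Coursebook), we can find the
wavelength of these gamma rays:
$$\lambda = \frac{hc}{E} = \frac{6.63 \times 10^{-34} \times 3.00 \times 10^{8}}{8.20 \times 10^{-14}} = 2.43 \times 10^{-12}\ \mathrm{m}$$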
Note that we could have left out a step in this calculation by using the de Broglie equation
$\lambda = h/(mc)$.
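The two routes to the wavelength can be compared numerically. This is a sketch only: the value of the Planck constant is a standard figure assumed here, and the variable names are illustrative.

```python
H = 6.63e-34    # Planck constant in J s (standard value, assumed)
C = 3.00e8      # speed of light in m s^-1
M_E = 9.11e-31  # mass of an electron (or positron) in kg

# Two-step route: photon energy first, then wavelength from E = hc / lambda.
photon_energy = M_E * C ** 2
wavelength = H * C / photon_energy

# One-step route: lambda = h / (m c).
wavelength_direct = H / (M_E * C)

print(f"photon energy = {photon_energy:.2e} J")
print(f"wavelength (two steps) = {wavelength:.2e} m")
print(f"wavelength (one step)  = {wavelength_direct:.2e} m")
```

The two results agree because substituting E = mc² into λ = hc/E cancels one factor of c.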
In particle accelerators, such as the Large Hadron Collider, new particles can be created from the
kinetic energy of the colliding beams of particles. Particles and antiparticles are created together.
question
31.3 A proton has a rest mass of 1.67 × 10⁻²⁷ kg. A proton and an antiproton are created at
rest by colliding a beam of electrons and a beam of positrons head-on, so that one
electron annihilates one positron. Calculate the kinetic energy of each beam.
Plutonium
Plutonium-239 can undergo a similar reaction to uranium-235 but, because it is more
fissile (more likely to undergo fission) and also produces more neutrons per reaction, less
plutonium-239 is needed to start a chain reaction than uranium-235. Uranium-238 is not
useful for nuclear fission, but makes up about 99% of natural uranium. Plutonium-239 can be
created from uranium-238 when fast neutrons strike uranium, creating uranium-239. Beta
decay rapidly turns this isotope of uranium into first neptunium-239 and then plutonium-239.
Like uranium, plutonium can split into a variety of possible fission products, most of which are
radioactive. One example is:
$${}^{1}_{0}\mathrm{n} + {}^{239}_{94}\mathrm{Pu} \rightarrow {}^{100}_{40}\mathrm{Zr} + {}^{137}_{54}\mathrm{Xe} + 3\,{}^{1}_{0}\mathrm{n}$$
A nucleus of uranium-235 undergoes induced fission when struck by a neutron. It splits into
nuclei of krypton-89 and barium-144. How many neutrons are emitted?
You will need to use a Periodic Table to look up the atomic numbers of krypton (36) and barium
(56) and then write the equation:
$${}^{235}_{92}\mathrm{U} + {}^{1}_{0}\mathrm{n} \rightarrow {}^{89}_{36}\mathrm{Kr} + {}^{144}_{56}\mathrm{Ba} + x\,{}^{1}_{0}\mathrm{n}$$
We have to find x, the number of neutrons. The atomic number (proton number) is equal on
both sides but the mass number has to be as well. There is a total mass number of 236 on the
left and so x = 3 for it to be the same on the right. So this reaction emits three neutrons.
Don’t forget to include the original neutron that caused the fission in the first place.
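The same bookkeeping can be written as a short routine. This is an illustrative sketch, not a general nuclear-physics tool: the function name and the tuple layout (mass number, proton number) are assumptions made for this example.

```python
def neutrons_emitted(parent, fragments):
    """Number of neutrons released in neutron-induced fission.

    parent and each fragment are (mass_number, proton_number) tuples.
    The incoming neutron (mass number 1, proton number 0) is added to the left-hand side.
    """
    mass_left = parent[0] + 1      # parent nucleus plus the incoming neutron
    protons_left = parent[1] + 0   # a neutron carries no charge
    mass_fragments = sum(a for a, _ in fragments)
    protons_fragments = sum(z for _, z in fragments)
    if protons_left != protons_fragments:
        raise ValueError("proton numbers do not balance")
    x = mass_left - mass_fragments  # each emitted neutron has mass number 1
    if x < 0:
        raise ValueError("fragments have more nucleons than are available")
    return x

# Uranium-235 splitting into krypton-89 and barium-144.
x_uranium = neutrons_emitted((235, 92), [(89, 36), (144, 56)])
# Plutonium-239 splitting into zirconium-100 and xenon-137.
x_plutonium = neutrons_emitted((239, 94), [(100, 40), (137, 54)])
print(x_uranium, x_plutonium)
```

Checking the proton numbers first catches a wrongly identified fragment before the neutron count is worked out.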
where A and Z have to be found. Once Z is known we can identify the element written as ‘X’. The
conservation of mass means A = 241 – 4 = 237. Conservation of charge means Z = 95 – 2 = 93.
The Periodic Table then tells us that the element is neptunium, Np and the full equation is:
$${}^{241}_{95}\mathrm{Am} \rightarrow {}^{4}_{2}\alpha + {}^{237}_{93}\mathrm{Np}$$
The conservation laws simply mean that the top line of numbers adds up to be equal on both
sides of the reaction arrow, and similarly with the bottom line of numbers.
Applying the conservation laws to beta decay is a little more difficult because the beta
particle does not have a nucleon number or proton number. However, if we remember that
we are looking to balance mass and charge we can write the beta particle as ${}^{0}_{-1}\beta$ and the
positron as ${}^{0}_{+1}\beta$. We give them zero mass because we are only using whole numbers and the
mass of a beta particle is about 1/2000 that of a proton or neutron.
For example, strontium-90 decays by emitting a beta-minus particle:
$${}^{90}_{38}\mathrm{Sr} \rightarrow {}^{0}_{-1}\beta + {}^{90}_{39}\mathrm{Y}$$
This time, in order to balance the bottom line, the proton number of the daughter nucleus is
one higher than that of the parent, so strontium produces yttrium, proton number 39. The
mass number remains unchanged.
Sodium-22 undergoes beta-plus decay:
$${}^{22}_{11}\mathrm{Na} \rightarrow {}^{0}_{+1}\beta + {}^{22}_{10}\mathrm{Ne}$$
Once again the top line, the mass, remains unchanged, but this time the daughter nucleus
has a lower proton number, because the positive charge of the positron is lost from the parent
nucleus.
The principles of conservation of mass, energy and momentum can have very important
consequences in physics, as we will see.
One such example comes from the discovery of neutrinos. In alpha decay, the alpha
particle is always emitted with the same amount of energy and momentum (measured in a
cloud chamber by the length and curvature of the track) for a given isotope. Each radioactive
decay produces the same amount of energy and only one particle is produced, so it carries
off that full amount as kinetic energy. In beta decay it was observed that the beta particle
can vary in energy and momentum (including direction). For energy and momentum to
be conserved, scientists suggested the existence of a new particle which shared the KE and
momentum with the beta particle. For charge to be conserved, this particle had to be neutral.
As the electron sometimes carried nearly all the energy of the decay there was little energy
left to create the mass of this new particle so it must be very light. Hence it was called the
neutrino, meaning “little neutral one” in Italian.
Summary
■ The intensity of radiation is reduced when passing through matter, according to the
equation $I = I_0 e^{-\mu x}$.
■ Radiation is reduced in intensity by the inverse square law as it spreads over a larger
area.
■ The Rutherford scattering experiment reveals a maximum size for the nucleus, which
is known to be around 10⁻¹⁵ m.
■ Nuclear fission can happen spontaneously or be induced by a neutron colliding with
the nucleus.
■ Fission can result in the release of further neutrons, causing more fission events and
a chain reaction.
■ Fusion can be caused by the high temperatures generated in a fission explosion.
■ Fission and fusion can both release enormous amounts of energy.
End-of-chapter questions
S31.1 A narrow beam of gamma rays is attenuated by 20 cm of material with an attenuation
coefficient of 1.2 m⁻¹. What is the fractional reduction in intensity?
S31.2 Safety rules recommend that no one should work within 2.0 m of a stored radioactive
source. A desk is placed 1.0 m from the source but a lead shield with an attenuation
coefficient of 15.0 m⁻¹ is added. What thickness of lead should be used to offer the same
reduction in intensity?
S31.3
a Potassium-40 undergoes beta decay. Write a balanced equation for the process.
b Thorium-232 (atomic number 90) decays by a sequence of alpha and beta decays to
lead-208 (atomic number 82), which is stable. How many of each of alpha and beta
particles are emitted?
Learning Outcomes
■ interpret the double-slit experiment using the Copenhagen interpretation (and collapse of
the wavefunction), Feynman’s sum-over-histories and Everett’s many-worlds theory
■ describe and explain Schrödinger’s cat paradox and appreciate the use of a thought
experiment to illustrate and argue about fundamental principles
■ recognise and use ∆p∆x ≥ h/2π as a form of the Heisenberg uncertainty principle and
interpret it
■ recognise that the Heisenberg uncertainty principle places limits on our ability to know the
state of a system and hence to predict its future
■ recall that Newtonian physics is deterministic, but quantum theory is indeterministic
■ understand why Einstein thought that quantum theory undermined the nature of reality
by being:
● indeterministic (initial conditions do not uniquely determine the future)
The wave-function
Fundamental to the idea of interpreting quantum mechanics is the concept of the wave-
function. This is a mathematical function that contains all the information about a system
or particle. How it changes with time then depends on the surroundings. To calculate the
outcome of an experiment we use the wave-function, just as we used the wave nature of light
to calculate interference effects. However, the wave-function associated with a particle such
as a photon or electron is not a physical wave that we can measure and display. Instead, it is
a mathematical model of what happens. We can calculate the intensity of the wave-function
much as we would do for other types of wave, using the square of the amplitude; the intensity
gives us the probability of finding the particle at a given position. This is very significant:
it suggests that, until a particle arrives and is detected, there is uncertainty associated with
the outcome of an experiment. A particle could arrive at one of a number of different places,
and we do not know with certainty where it is going to arrive until it actually arrives; an
interpretation of this is that until we detect the particle, it actually is in a number of places.
In terms of the double-slit experiment, the wave-function for a single electron behaves
like a wave that passes through both slits and interferes to create maxima and minima.
However, these are variations in probability, not the measured intensity of a single electron.
The important fact is that we cannot ‘see’ a single electron or photon split up into pieces; in
the end, one particle enters the apparatus and one particle arrives at the detector – not 10%
of a particle at one point and 90% somewhere else. This is what is meant by something being
detected as a particle – when measured, it is definitely in one place.
The important thing to remember is that a single particle could arrive at any point where
the predicted intensity of the wave-function is not zero, so at any of the ‘peaks’ we can
calculate. Once the particle is detected, then the outcome of the experiment is known and
there is only one possible outcome for a single particle; yet until it is detected, mathematically
we have to consider that it could be at one of a number of different places. This is one of the
strangest things about quantum physics, and can take some time to get used to.
In learning about the double-slit experiment with light we discussed the idea of light
waves interfering. So is this a case of the wave-functions of particles interfering with each
other? A beautiful experiment in 1909 by G.I. Taylor reduced the light intensity in the
double-slit experiment so much that only a single photon was present in the system at a
time, yet interference fringes still appeared. Such experimental evidence suggests that the
photon interferes with itself rather than with other photons. Somehow, the wave-function
simultaneously and instantly ‘knows about’ both slits.
The process by which a wave-function changes from probability and uncertainty to a
definite outcome is the source of much of the disagreement that arose about interpreting
quantum theory.
[Figure S33.1 panels: a sum of two single slits; b two-slit pattern. Each panel shows a probability distribution.]
Figure S33.1 Pattern due to a particles passing through two separate slits and b particles passing
through two slits and interfering.
wave-function is 100% correlated with another version of ourselves knowing that the particle
is at point B (see Figure S33.3).
The difference with the Copenhagen interpretation is that in the ‘many worlds’
interpretation, the wave-function never collapses and all possibilities remain true. What
changes is that in each reality we can only know of one of the outcomes. This means
there are multiple different realities being generated all the time.
The different realities are sometimes referred to as different worlds or Universes, which is
why this is called the ‘many worlds’ or ‘multiple Universes’ interpretation.
[Figure labels: no knowledge of outcome; particle known to be at A; particle known to be at B; photon]
Figure S33.5 Schrödinger's cat thought experiment: inside the box, the radioactive isotope both
has and has not decayed, and so the cat is both dead and alive.
S33.3 Uncertainty
If we think about the double-slit experiment for an electron, all we know is that the electron
arrived in a particular place at a particular time. Because it travelled as a wave that passed
through both slits, we don’t know where the electron was at the moment it arrived at the slits.
If we move our detector to the slits, the electron doesn’t pass through them because it has
been detected.
You might suggest that we could work out the exact path of the electron if, at the same
time as we detected its position on the screen, we also measured its velocity or momentum.
Another surprising aspect of quantum theory is that this combination of measurements –
knowing exactly the position and momentum of a particle at the same time – is impossible.
In fact, quantum theory teaches us that even asking such a question is meaningless – we
cannot know which slit the electron passed through, nor can we measure quantities later that
would enable us to calculate exactly where it was and how fast it was moving.
This is a difficult concept to grasp, as it seems very different from what we observe in the
‘real’, macroscopic world of objects, position, momentum and collisions. Let us think about
the microscopic electron, and how we might try to measure exactly where it is in the double-
slit experiment. We need to appreciate that the wave-function of a particle does not describe
a single, easily measured wave with precise wavelength. The correct description is a ‘wave
packet’, meaning a number of wavelengths superposed onto each other.
In order to deduce exactly where an electron is, according to quantum theory we need to
localise the electron’s wave-function – meaning that the spread of the electron wave-packet
would have to be known to sufficient precision that we can assign it only a very narrow
range of position. The nature of the wave-function is such that by narrowing down the range
of position, the range of momentum the electron can have gets broader. In other words,
the more precisely we know the position, the less precisely we can know the momentum.
Similarly, if we know the momentum more precisely, we know how fast the electron is going,
but we don’t know where it has been! This is a mathematical property of the wave-function.
We call position and momentum conjugate variables, because knowing either one with
more precision means the other must be known less precisely.
There are other pairs of conjugate variables affected in exactly the same way, for example
energy and time, and angular momentum and angular displacement.
This turns out to be a fundamental problem not just of quantum mechanics, but of these
types of conjugate variables more generally. At the microscopic scales involved, there is a
trade-off between knowledge of position and knowledge of momentum that is impossible to
get round – it is deep-rooted in nature and is not a limitation of our measuring instruments.
This was first understood in quantum theory by Werner Heisenberg and he expressed it
mathematically as the ‘uncertainty principle’:
$$\Delta p \Delta x \geq \frac{h}{2\pi} = \hbar$$
In this equation Δp is the uncertainty in the momentum – the spread of possible values
the momentum might have. Δx is the uncertainty in the position. If the uncertainty in one
variable is small then the uncertainty in the other variable must be large, because the product
has to be greater than or equal to the constant on the right: Planck’s constant divided by 2π.
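To get a feel for the sizes involved, the inequality can be evaluated for an electron confined to a region about the size of an atom. This is a rough sketch under stated assumptions: the value of h and the electron mass are standard constants, the confinement width of 1.0 × 10⁻¹⁰ m is an illustrative choice, and the function name is hypothetical.

```python
import math

H = 6.63e-34    # Planck constant in J s (standard value, assumed)
M_E = 9.11e-31  # electron mass in kg (standard value, assumed)

def min_momentum_uncertainty(delta_x):
    """Smallest momentum uncertainty allowed by delta_p * delta_x >= h / (2 * pi)."""
    return H / (2 * math.pi * delta_x)

dx = 1.0e-10                          # assumed position uncertainty in m (about one atom)
dp = min_momentum_uncertainty(dx)     # minimum momentum uncertainty in kg m s^-1
dv = dp / M_E                         # corresponding spread in speed in m s^-1

print(f"minimum delta_p = {dp:.2e} kg m/s")
print(f"corresponding speed spread = {dv:.2e} m/s")

# Halving the position uncertainty doubles the minimum momentum uncertainty.
dp_half = min_momentum_uncertainty(dx / 2)
```

The spread in speed comes out at around a million metres per second, which shows why the principle matters for electrons in atoms but is unnoticeable for everyday objects of much greater mass.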
Heisenberg’s uncertainty principle is especially significant when we consider how we
calculate what happens to a particle in the future. At a microscopic level, to be able to
calculate exactly what will happen to every particle in a room at every moment thereafter,
we would need to know precisely the position and momentum of every single particle in the
room. However, we cannot know both the position and the momentum of any one particle –
if we know where a particle is, we cannot know precisely how fast and in which direction
it is travelling, and vice versa. This means the future of any individual microscopic system
cannot be predicted with absolute certainty, which is a profoundly different situation from
the classical physics that came before quantum mechanics. What can usually be predicted,
however, is the general behaviour of the macroscopic system, because we can sum across all
the probabilities of the individual microscopic parts.
Heisenberg’s uncertainty principle is sometimes confused with something called the
‘observer effect’. Even Heisenberg himself first thought about his uncertainty principle in
these terms, although he soon realised his mistake. In the observer effect, we consider how we
might find the position of an electron – for example, in a double-slit experiment, we consider
how we might know which slit it passed through. To do that we must look at it (observe it) in
some way – for example, we might shine light on it. To get a sufficiently precise observation
of the position, we need to shine light waves of very short wavelength and high energy. To
get an observation of an electron, one of these high-energy photons needs to ‘bounce off’ the
electron, and this collision would cause the electron to change speed and/or direction. So
any attempt to observe the electron with high precision will in itself change the momentum
and/or position of the electron. In other words, the act of observing the electron changes the
very things we were trying to observe. This ‘observer effect’ is different from the uncertainty
principle, although both affect our ability to observe and predict quantum effects.
The observer effect seems to be a limitation on our abilities to make measurements. One
can try to think of clever ways around it. However, Heisenberg’s uncertainty principle is
much more basic than this – there is no clever way round it. This is illustrated by double-slit
experiments that attempt to measure which slit the electron passes through on its way to
the detector. Any experiment sensitive enough to locate the electron at the slits destroys the
interference pattern and gives a pattern of electrons at the detector which is the sum of that
due to two separate single slits. If the electron’s position is known, it cannot pass through
both slits so the interference pattern disappears.
These differences call into question our very understanding of reality. The Universe
does not follow the simple rules we used to expect. Many 20th century physicists were
uncomfortable with the way in which quantum theory appeared to undermine reality,
including Albert Einstein who famously said ‘God does not play dice’, meaning that nature
cannot be truly random. He was convinced that particles must have had properties that
did determine the outcome of experiments, but those properties were not measurable
directly – so-called ‘hidden variables’. Einstein spent a great deal of his later life trying to
make quantum theory deterministic, local and complete by adding hidden variables. Recent
experiments, based on a theorem by John Bell developed in the 1960s, have ruled out
local hidden variables of the kind Einstein hoped for. Quantum theory is every bit as strange as it seems!
Summary
■ The double-slit experiment tells us what happens as a result of wave-particle duality,
but not why it happens.
■ Different interpretations of quantum theory explain this and other experiments in
different ways. These interpretations include the Copenhagen interpretation (and
collapse of the wave-function), Feynman’s sum-over-histories and Everett’s many-
worlds theory.
■ A thought experiment is a way of viewing a new or challenging scientific theory to
highlight its conclusions or prompt discussion of its consequences.
■ Schrödinger’s cat thought-experiment shows how apparently microscopic quantum
effects can affect the macroscopic ‘real world’.
■ Quantum theory is indeterministic, meaning that the outcome of an experiment is
not fully determined by the state of the particles and the system.
■ Quantum theory is incomplete because we cannot fully determine the values of all
the variables at the same time.
■ Quantum theory is non-local because wave-function collapse appears to happen
instantly, affecting all entangled wave-functions in a system.
■ Heisenberg’s uncertainty principle tells us that the precision with which we can
measure the position and momentum of a particle is limited by the equation
$$\Delta p \Delta x \geq \frac{h}{2\pi} = \hbar$$
Learning Outcomes
■ recall that Maxwell’s equations describe the electromagnetic field and predict the existence
of electromagnetic waves that travel at the speed of light
■ recall that at the end of the 19th century, most physicists assumed that these electromagnetic
waves were vibrations in a medium called the aether, filling absolute space
■ recall that experiments looking for variations in the speed of light caused by the Earth’s
motion through this aether gave null results
■ understand that Einstein’s theory of special relativity dispensed with the idea of the aether
■ state the postulates of Einstein’s special principle of relativity
■ explain how Einstein’s postulates lead to the idea of time dilation and length contraction, and
therefore undermine the idea of absolute time and space
■ understand the idea of a frame of reference (an inertial frame)
■ recognise the equations for time dilation and length contraction
■ understand that two events which are simultaneous in one frame of reference may not be
simultaneous in another, and explain this in terms of the fundamental postulates of relativity;
distinguish this from the phenomenon of time dilation
The derivations of the time dilation and length contraction formulae are beyond the requirements
of the syllabus, but the formulae themselves must be known. The mathematical treatment
of the loss of simultaneity is also beyond the requirements of the syllabus, as is the detailed
explanation of the twin paradox. The Lorentz transformations are also not required. This
material is included here to allow a more complete understanding of the topic.
S34.1 Introduction
At the end of the 19th century and the beginning of the 20th century, many physicists believed
that they had discovered most of the laws of the Universe. A quote attributed to Lord Kelvin
(perhaps erroneously) in 1900 was: “There is nothing new to be discovered in physics now.
All that remains is more and more precise measurement.” His sentiments were echoed by
Albert Michelson, an American physicist about whom we will learn more in this chapter.
He said “The more important fundamental laws and facts of physical science have all
been discovered, and these are so firmly established that the possibility of their ever being
supplanted in consequence of new discoveries is exceedingly remote…”. There were, however,
a number of loose ends remaining, which would ultimately lead to the theories of relativity
and quantum mechanics (as discussed in earlier chapters). These topics are often referred
to as ‘modern physics’, and earlier physics as ‘classical physics’ or ‘Newtonian physics’. We
can solve many problems in physics with purely classical physics, but as we start to consider
things moving close to the speed of light, classical physics begins to break down and we must
use relativity.
It should also be possible to change the motion of the light relative to the aether by rotating
the equipment in the laboratory, so that the light moved in the opposite direction compared
to the Earth’s motion. This is the approach that Albert Michelson and Edward Morley took
in an experiment they set up in 1887 to determine the effects of the aether.
A diagram of the set-up used in the Michelson-Morley experiment is shown in
Figure S34.2: this equipment is known as the Michelson interferometer. Light enters the
interferometer and is split into two beams, which travel at right angles to each other, are
then each reflected from a mirror, and return to the point at which they were split. The two
beams are then recombined and this recombined beam is observed on a screen (or through
an eyepiece). Depending on the optical path difference between the two paths the light took,
there may be constructive or destructive interference observed on the screen, or something
in between. There would be constructive interference if the path difference were equal to
a complete wavelength or a multiple of a wavelength (nλ), and destructive interference if
the path difference were an odd multiple of half a wavelength ((2n+1)λ/2). Of course, since
v = fλ, and because the frequency remains constant, if we change the velocity of the light,
we will change the wavelength. So if an aether wind exists, the equipment can be arranged
so that one of the light paths is parallel to this wind, and the other is perpendicular. If
the two different paths are exactly the same physical distance, the aether wind should
cause an optical path difference to arise between the two paths, and so interference effects
should be observed. If the apparatus were to be rotated, so the speed of light along the two
paths changed due to the altered direction relative to the aether wind, then the optical
path difference and thus the interference effects would change. This should be particularly
noticeable if a white light source were to be used, as this produces a range of wavelengths.
Changing the optical path length would change the colour pattern produced, much as the
colours change when you view an oil slick on a puddle from different angles.
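The conditions for constructive and destructive interference described above can be expressed as a simple test on the path difference. This is a minimal sketch: the function name, the tolerance and the sample wavelength of 600 nm are illustrative assumptions, and the path difference and wavelength only need to be in the same units.

```python
def interference_type(path_difference, wavelength, tol=1e-9):
    """Classify interference from the optical path difference.

    Constructive: path difference = n * wavelength.
    Destructive:  path difference = (2n + 1) * wavelength / 2.
    Anything else is labelled 'intermediate'.
    """
    cycles = path_difference / wavelength
    frac = cycles % 1.0  # fractional number of wavelengths, in the range [0, 1)
    if min(frac, 1.0 - frac) < tol:
        return "constructive"
    if abs(frac - 0.5) < tol:
        return "destructive"
    return "intermediate"

# Lengths in nanometres, for light of wavelength 600 nm.
for path in (1200, 900, 700):
    print(path, interference_type(path, 600))
```

If an aether wind changed the wavelength along one arm, the same physical path lengths would give a different number of wavelengths of path difference, so the classification (and the observed pattern) would change as the apparatus was rotated.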
[Figure S34.2 labels: laser; mirror 1 (M1); mirror 2 (M2); L2; beam (2); recombined beams (1) + (2); screen]
Figure S34.2 Diagram of the Michelson interferometer.
Michelson and Morley’s experiment showed precisely zero change in the interference pattern
when the equipment was rotated. They repeated the experiment six months later, just in case
they had happened to perform the original experiment at a point in the Earth’s orbit where
there was no motion relative to the aether. They still found no change in the pattern as the
equipment was rotated. Their equipment was sensitive enough to detect changes of the size
of those expected (it was sensitive enough to detect an aether wind of just a few km s⁻¹). They
had to conclude that they could not detect any motion relative to the aether. Either there
was no aether, or it was being ‘dragged along’ by the moving Earth. This experiment is often
called the ‘most famous null result in history.’ It carried serious implications for classical
physics, as we will see.
Other experiments (such as those by Fizeau, earlier in the 19th century) had shown that
in water, light was ‘dragged along’ by the water, but not completely – the measured speed
of travel of the light was less than the sum of the speed of the water and the speed of light
in stationary water. So, if the concept of an aether was correct, we have two experiments
showing apparently contradictory results. The solution, as we will see, is that there is no
aether. Light is unlike mechanical waves, in that it does not require a medium to travel
through.
Let us investigate what such a pion experiment would reveal if we used classical,
‘Newtonian’ physics. Figure S34.3 shows a model of the decay. We would have expected the
two emitted photons to each have a different momentum and hence velocity, due to the initial
high velocity of the pion before the decay. A ‘forward-emitted’ photon would travel faster
than the ‘backward emitted’ photon. However, this is certainly not what is observed. Both
photons are measured as having speed c, even though they are emitted from a moving source.
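The classical expectation can be written out explicitly. The sketch below simply adds or subtracts the source speed, as Newtonian physics would; the pion speed of 0.9998c is the value shown in Figure S34.3, and the function name is hypothetical.

```python
C = 3.00e8  # speed of light in m s^-1

def classical_photon_speeds(source_speed):
    """Newtonian (Galilean) prediction for light emitted forwards and backwards
    from a source moving at source_speed. This is the prediction that the
    experiment contradicts: both photons are actually measured at speed c."""
    forward = C + source_speed
    backward = C - source_speed
    return forward, backward

v_pion = 0.9998 * C
forward, backward = classical_photon_speeds(v_pion)
print(f"classical forward photon:  {forward / C:.4f}c")
print(f"classical backward photon: {backward / C:.4f}c")
print("measured speed of both photons: 1.0000c")
```

The large gap between the classical predictions and the measured value is what makes the pion experiment such a clear test.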
[Figure S34.3: a π⁰ moving at v = 0.9998c; in the classical model the two emitted photons have speeds 0.0002c and 1.9998c.]
Classical physics cannot explain the results of these experiments. Neither the wave model
nor the particle model of classical physics are sufficient to give us an explanation, even when
we take all the potential effects of imprecise or inaccurate measurements into account. To
produce an explanation, we need a new model that can be applied to light and other particles
travelling close to the speed of light.
Note that Einstein’s 1905 paper was the ‘special theory of relativity’, which applies in
particular situations. Einstein also realised that there were even greater consequences of his
ideas, which he would later develop an entirely new type of mathematics to explain. In 1915,
he published his paper on the ‘general theory of relativity’, which took his ideas further still
and made us view gravitation, space and time in a wholly new way. This is well beyond the
scope of this book; for now, we are considering just the special theory and how it explains the
Michelson-Morley and pion experiments.
The special theory of relativity is summarised by two postulates (statements that are
assumed to be true):
First postulate (the principle of relativity): The laws of physics are the same in all inertial
frames of reference.
Second postulate: The speed of light in free space (in a vacuum) has the same value c in
all inertial frames of reference.
[Figure S34.4: two sets of perpendicular x, y and z axes, labelled ‘Inertial frame B’ and ‘Inertial frame A’, with an arrow labelled ‘velocity v’.]
Figure S34.4 Inertial frame A is moving at a constant velocity v with respect to inertial frame B.
In the case shown, the frames do not occupy the same point in space at time t = 0. Note that we
could choose any perpendicular axes xA, yA and zA for frame A, and a different set of perpendicular
axes for frame B, xB, yB and zB, and the constant relative velocity of the two frames can be in any
direction, and they will both still be inertial frames of reference.
What if the train starts to accelerate? A frame that is accelerating is a non-inertial frame of
reference. We can observe the effects of this. Imagine that you, as an observer, place a ball in the
centre of an otherwise empty train carriage with a smooth floor. Assume there is no friction
between the ball and the floor. As the train accelerates, the velocity of the train carriage
increases. However, the ball is free to move and the concept of inertia tells us that the ball
does not accelerate.
Yet from the point of view of you, the observer sitting in the carriage, the ball moves to the
back of the carriage. From your point of view, you are stationary relative to the train, and it
would appear to you that a force instead must be acting on the ball, accelerating it towards
the back of the train. From your frame of reference, the ball appears to be in a non-inertial
frame of reference. However, someone measuring the motion from the side of the train
track would observe the train accelerating beneath a ball that continued moving at constant
velocity. To them, you and the train are in a non-inertial frame of reference, not the ball. You
can see that considering non-inertial frames of reference can get complicated!
Obviously, if you were sitting in the train carriage, it is not correct to think you would be
unaware of the force acting on the train. For example, as the train accelerated you would feel
yourself being pushed back against your seat, and you might see from the countryside passing
by outside that you were moving faster. The important concept to grasp is that an external
resultant force causes an acceleration, and an accelerating frame of reference is non-inertial.
A rotating frame of reference is also non-inertial. An object that rotates at a constant
speed is accelerating, because although its speed is constant, the direction of its velocity is
constantly changing.
In special relativity, it is important to remember that we are going to deal exclusively
with inertial frames of reference. We only consider objects that are stationary or moving
at constant speed in a straight line relative to each other. Einstein extended his theory of
relativity later to deal with accelerating frames of reference: the general theory of relativity.
The word ‘special’ indicates that we are dealing with this special case of inertial frames.
In classical physics, we can easily take into account the differences between inertial
frames. In Figure S34.4 the two frames are labelled A and B. The frame A is moving at
speed v relative to frame B, in the direction of both frames’ x axes. Imagine that two events
happened one after the other in different places in frame A, with a time difference ∆tA.
The same two events are observed in the stationary frame B. If these events as measured
in frame A are separated by a distance ∆xA between the x co-ordinates, ∆yA between the y
co-ordinates, ∆zA between the z co-ordinates, then the separations in frame B are given by:
of reference approaches the speed of light, the results of the transformation are very different.
So thankfully in most circumstances, we can still add velocities in the way we are used to
from classical physics!
A good resource for exploring the ideas of frames of reference is a video entitled ‘Frames
of Reference’, produced in 1960 and presented by University of Toronto professors Patterson
Hume and Donald Ivey (available on YouTube at the time of writing).
Time dilation
Now, we will look at the first of the unexpected consequences of the postulates of relativity –
time dilation. We are used to thinking of time as absolute, but what we are about to show
is that it is not! The idea of absolute time is something we take for granted: for example,
imagine you and a friend had identical clocks that are extremely precise and never run out
of power. You then spend a long time apart – it could be minutes, hours, days or even many
years – and when you meet up again you compare your clocks. The idea of absolute time is
that those clocks would show exactly the same time. A consequence of the special theory of
relativity is that these clocks may not show the same time.
The following is a classic thought experiment, due to the Nobel Prize-winning physicist
Richard Feynman. Einstein used the German word gedankenexperiment, which translates
as ‘thought experiment’, to describe the conceptual experiments he used in creating the
theory of relativity. In 1905, the fastest way to travel as a passenger was in a train, so just like
Einstein, let’s set our thought experiment on a train.
[Figure S34.5 Feynman’s light clock thought experiment. (a) In A’s frame, the light travels straight up to the mirror, a height y above the source, and back down at speed c. (b) In B’s frame, the light travels a different, longer path, but it still travels at speed c.]
Figure S34.5 shows the set-up. We have a device called a ‘light clock’ on a moving train.
Observer A is in the train carriage; observer B is at rest by the side of the track. We will
call A’s frame of reference the train frame, and B’s frame of reference the Earth frame. The
clock consists of a light source and receiver which are in the same position. The light source
flashes, and the flash is reflected from a mirror on the roof of the carriage, back down to the
receiver. Let’s call the time from emission to reception of the light one ‘tick’ of the clock. We
are going to look at the time taken for one tick of the clock from the point of view of observer
A, and then from the point of view of observer B.
In the train, we will call the time taken for one tick ∆tA. The carriage is of height y, so the
time for the light to reach the ceiling and return to the detector is (using time = distance/speed):
$$\Delta t_A = \frac{2y}{c}$$
For observer B, in the Earth frame, the light follows the path shown in Figure S34.5b. The
light must be observed by B (as well as A) to travel at speed c (the second postulate of special
relativity), but you can see from the diagram that in B’s frame, the Earth frame, it travels a
greater distance. If the speed has not changed but the distance travelled is greater, then the
time elapsed for one tick is longer in the Earth frame. We can actually work out exactly
how much further it travels, and thus work out the time for one tick in frame B. Let’s call the
time taken for the light to travel from the source to the mirror and back to the receiver in B’s
frame ∆tB. During that time, the carriage travels a distance
x = v∆tB
Using Pythagoras’ theorem, the total distance travelled by the light is twice the hypotenuse
of the right angled triangle with sides y and x/2. Therefore the total distance travelled, 2d, is
$$2d = 2\sqrt{y^2 + \left(\frac{v\,\Delta t_B}{2}\right)^2}$$
But we also know that the time taken must be such that the speed of light is measured to be c.
So we can write that:
$$\Delta t_B = \frac{2d}{c} = \frac{2\sqrt{y^2 + \left(\frac{v\,\Delta t_B}{2}\right)^2}}{c}$$
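Because ∆tB appears on both sides of this relation, it can be explored numerically as well as rearranged algebraically. The sketch below is only an illustration: the carriage height of 3 m and the train speed of 0.6c are assumed values, and the function name is invented for this example. It substitutes a trial value of ∆tB into the right-hand side repeatedly until the value settles, then compares the result with ∆tA.

```python
import math

C = 3.0e8  # speed of light in m/s (rounded, as in the text)


def tick_in_earth_frame(y, v, iterations=200):
    """Solve dt_B = 2*sqrt(y**2 + (v*dt_B/2)**2)/C by repeated substitution.

    y (carriage height in m) and v (train speed in m/s) are illustrative inputs.
    """
    dt_b = 2 * y / C  # start from the train-frame tick, dt_A
    for _ in range(iterations):
        dt_b = 2 * math.sqrt(y**2 + (v * dt_b / 2) ** 2) / C
    return dt_b


y = 3.0       # assumed carriage height in m
v = 0.6 * C   # assumed train speed
dt_a = 2 * y / C
dt_b = tick_in_earth_frame(y, v)
ratio = dt_b / dt_a
print(dt_a, dt_b, ratio)
```

For these assumed values the ratio ∆tB/∆tA settles at 1.25, which is the value of 1/√(1 − v²/c²) when v = 0.6c.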
question
34.1 Prove that the expression above follows from the previous expression for ∆tB.
appears a lot, so we give it the symbol γ , and often call it the γ -factor (gamma factor). Think
carefully about this expression: you can see that γ is always greater than 1, and that it is
approximately equal to 1 for speeds that are small compared to c. It becomes very large
(tending to infinity) for speeds close to the speed of light.
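To get a feel for how γ = 1/√(1 − v²/c²) behaves, it can help to tabulate it for a few speeds. The sketch below is a minimal illustration; the function name and the list of speeds are choices made for this example only.

```python
import math


def gamma_factor(beta):
    """Return the gamma-factor for a speed v = beta * c, with 0 <= beta < 1."""
    return 1.0 / math.sqrt(1.0 - beta**2)


# Speeds as fractions of c, chosen only for illustration.
for beta in (0.001, 0.1, 0.6, 0.9, 0.99, 0.999):
    print(f"v = {beta:<5} c   gamma = {gamma_factor(beta):.4f}")
```

The printed values stay very close to 1 until v is a sizeable fraction of c, and then rise steeply.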
Let’s write our expression relating the times in the two frames using γ :
∆tB = γ∆tA
Now let’s think about what this means. Since γ is always greater than 1, more time elapses
between ticks of the clock in frame B (the Earth frame) than in frame A (the train frame). If
we think carefully about this, it means that time is running more slowly in the train frame,
since the time between the emission and reception of the light is shorter. This phenomenon is
known as time dilation. It is often quoted as ‘moving clocks run slow’. Remember of course
that moving in this case means moving relative to another frame of reference!
You may be thinking: “From observer A’s point of view, B is moving past him at velocity
–v. So, since –v gives the same γ-factor as +v, we could write the time-dilation equation as
∆tA = γ∆tB. This is an apparent contradiction unless γ = 1.” What we have forgotten is that the
equation we derived assumes that the light clock remains at the same x coordinates in frame
A, so the journey of the light in frame A is straight up and down. This assumption breaks
the symmetry between the frames, so we can’t just switch frames as suggested. In fact the
equation ∆tA = γ∆tB would only be valid if the light clock were instead stationary in B’s frame,
and ∆tA and ∆tB referred to times measured on this clock. So in fact, both observers see the
other’s frame as being time dilated, but there is no contradiction! Also, if time is running
more slowly, observer A in the train is aging less quickly than observer B in the Earth
frame. Later we will look at the famous thought experiment where one observer sets off on a
relativistic journey and returns having aged less than people who stayed on Earth. Again, the
situation is not as symmetrical as it might first appear.
The muons travel at high relativistic speeds, i.e. close to the speed of light. A muon with an
energy of 20 GeV has a γ-factor of approximately 190. This means that its speed is 0.999986c.
We know the half-life of these muons because of very precise laboratory measurement. We
can therefore calculate what fraction of all the muons should remain undecayed after a 15 km
trip through the atmosphere. The time taken to travel 15 km is:
$$T = \frac{15\,000\ \mathrm{m}}{0.999\,986 \times 3 \times 10^{8}\ \mathrm{m\,s^{-1}}} = 5.00 \times 10^{-5}\ \mathrm{s} = 32.1\ \text{half-lives}$$
Therefore we would expect $2^{-32.1} = 2.1 \times 10^{-10}$ to be the fraction of muons that reach ground
level, i.e. less than one in a billion. When we make measurements of what actually takes place
in the atmosphere, many more muons than this are observed. This is because the muon,
moving at high relativistic speed, experiences less time passing in its frame of reference than
the observer in the Earth frame of reference. We need to take into account relativistic time
dilation. The half-life of a muon moving at this speed, observed from the Earth frame, is
γ × 1.56 μs = 0.296 ms. Now our travel time becomes 0.17 half-lives, and therefore the fraction
that is able to reach the ground is $2^{-0.17} = 0.89$. So, in fact, after taking relativity into account,
most of the muons reach the ground. This prediction is consistent with experimental
measurements.
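The muon arithmetic above can be reproduced in a few lines. This is a sketch using only the figures quoted in the text (15 km, 0.999986c, a half-life of 1.56 μs and γ ≈ 190); the constant and function names are made up for the example.

```python
C = 3.0e8            # speed of light in m/s (rounded, as in the text)
HALF_LIFE = 1.56e-6  # muon half-life at rest in s, from the text
GAMMA = 190          # gamma-factor quoted in the text
BETA = 0.999986      # speed as a fraction of c, from the text
DISTANCE = 15_000    # path length through the atmosphere in m


def surviving_fraction(travel_time, half_life):
    """Fraction of particles left undecayed after travel_time."""
    return 2 ** (-travel_time / half_life)


travel_time = DISTANCE / (BETA * C)  # travel time in the Earth frame
without_dilation = surviving_fraction(travel_time, HALF_LIFE)
with_dilation = surviving_fraction(travel_time, GAMMA * HALF_LIFE)

print(f"travel time: {travel_time:.3e} s")
print(f"fraction without time dilation: {without_dilation:.2e}")
print(f"fraction with time dilation:    {with_dilation:.2f}")
```

Small differences from the figures quoted above come from rounding the number of half-lives before raising 2 to that power.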
Length contraction
[Figure S34.6 Thought experiment for length contraction. The carriage has length lA in A’s frame and lB in B’s frame; the light pulse travels at speed c along the carriage from start to finish.]
velocity v past observer B, who is in the Earth frame of reference. This time, our ‘light clock’
is arranged so that it sends a pulse of light along the direction of motion of the carriage, to a
mirror at the far end, and receives it back. We measure the time taken for the pulse to travel
to the mirror and back.
If we call the length of the carriage in A’s frame of reference (the train frame) lA and the
travel time of the pulse ∆tA, then since the light travels distance 2lA in time ∆tA:
$$\Delta t_A = \frac{2 l_A}{c}$$
In B’s frame of reference (the Earth frame), after the light pulse is emitted, the mirror
is moving away from the light: the light is travelling at speed c, so the relative speed of
approach of the light to the mirror is c – v. Once the light reflects off the mirror and reverses
its direction, in B’s frame it approaches the detector at relative speed c + v. We can
calculate the travel times for the light to get to the mirror, t1, and the time for the light to
return from the mirror to the detector, t2:
$$t_1 = \frac{l_B}{c - v}; \qquad t_2 = \frac{l_B}{c + v}$$
Here, lB is the length of the carriage as measured in B’s frame. We can sum these times and
re-arrange to get the total travel time of the pulse in frame B:
$$\Delta t_B = l_B\left(\frac{1}{c - v} + \frac{1}{c + v}\right) = \frac{2 l_B c}{c^2 - v^2} = \frac{2 l_B}{c}\,\frac{1}{1 - \frac{v^2}{c^2}} = \frac{2 l_B}{c}\,\gamma^2$$
However, since the emission and reception of the light occur at the same spatial co-ordinates
in A’s frame, we can use our earlier time dilation result to relate ∆tA and ∆tB too:
∆tB = γ∆tA
Combining our two expressions for ∆tB , and the expression for ∆tA, we can deduce:
$$\frac{2 l_B}{c}\,\gamma^2 = \gamma\,\Delta t_A = \frac{2 l_A}{c}\,\gamma$$
$$\Rightarrow\quad l_B = \frac{l_A}{\gamma}$$
What does this mean? Remembering that γ is always > 1, then it tells us that observer B
measures the length of the carriage to be shorter than observer A. More generally, if we
make a length measurement of an object in a frame where the object is stationary, otherwise
known as the rest frame of the object, then we are measuring the longest possible length for
the object. The length of the object in its rest frame is called the proper length. In any other
frame its length will be less than or equal to its proper length: we say it is length contracted.
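The consistency of these results can be checked numerically. In the sketch below the carriage length (20 m) and speed (0.8c) are assumed values chosen for illustration. It takes the contracted length lB = lA/γ, adds up the two legs of the light pulse in B’s frame, and confirms that the total agrees with the time-dilation result ∆tB = γ∆tA.

```python
import math

C = 3.0e8  # speed of light in m/s (rounded, as in the text)


def gamma_factor(v):
    """Gamma-factor for a speed v in m/s."""
    return 1.0 / math.sqrt(1.0 - (v / C) ** 2)


l_a = 20.0    # assumed carriage length in its rest frame (A), in m
v = 0.8 * C   # assumed speed of the carriage relative to B

g = gamma_factor(v)
l_b = l_a / g          # contracted length measured in B's frame

dt_a = 2 * l_a / C     # round trip of the pulse in A's frame
t1 = l_b / (C - v)     # outward leg in B's frame (mirror moving away)
t2 = l_b / (C + v)     # return leg in B's frame (detector moving towards the light)
dt_b = t1 + t2

print(f"gamma = {g:.4f}, l_B = {l_b:.2f} m")
print(f"dt_B / dt_A = {dt_b / dt_a:.4f}")
```

With any other length in place of lA/γ, the ratio ∆tB/∆tA would no longer equal γ, which is the content of the derivation above.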
You could reason that from A’s point of view, B is moving past with velocity –v, so if A
measures an object which is at rest in B’s frame, A will measure it as shorter than an observer
in B’s frame would measure it. You would be right, but this does not contradict the idea
that the longest possible length for the object is in its rest frame, since we are considering
measuring two different objects.
When we measure an object, it means that we determine the coordinates of the two ends
of the object simultaneously (at exactly the same time). We can consider the act of measuring
coordinates to be an ‘event’. Therefore, measuring the two ends of the object means there are
two events. Although the two events are simultaneous in one frame of reference, we will see
below that if they are separated in space, they will not be simultaneous in another frame
of reference that is moving with respect to the first frame. For example, if an object’s length
is measured in its rest frame (by taking the coordinates of the ends simultaneously in that
frame), the two events involved in the measurement are not simultaneous in any other frame,
so they are not a measurement of length in any other frame!
question
34.2 Look back at the previous section, where we used the time dilation formula to show
that the lifetime of the muon in the Earth frame was long enough that approximately
90% of the muons reach the surface of the Earth. Analyse the situation again in a
frame travelling at the same velocity as the muon (the muon’s rest frame), where the
half-life is 1.56 µs. From this frame, the distance the muons have to travel is length
contracted.
a Calculate the length-contracted distance that the muons have to travel.
b Use this length and the muon’s half-life of 1.56 µs to calculate the fraction of muons
that reach the surface of the Earth. This should be the same as the answer we
arrived at by considering the effect of time dilation on the muon’s lifetime.
Loss of simultaneity
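[Figure S34.7 (image not reproduced): a flash of light emitted from the centre of the carriage travels at speed c towards both ends, shown in A’s frame and in B’s frame, in which the carriage moves at velocity v.]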
Here is another thought experiment. In Figure S34.7, observer A is once again in a train
carriage, moving at velocity v, relative to observer B in the Earth frame. In the centre of the
carriage is a light source, which emits a flash of light. In observer A’s frame, the flash of light
reaches the two ends of the carriage simultaneously – it travels at speed c and has to travel
an equal distance to each end. However, in observer B’s frame, the front of the carriage is
moving away from the point at which the light was emitted, and the back of the carriage
towards it. Since the light has to travel at speed c in B’s frame, it therefore reaches the back
of the carriage first. The two events – light reaching the front of the carriage and light
reaching the back of the carriage – which happen simultaneously in A’s frame, do not happen
simultaneously in B’s frame.
[Figure S34.8 Positioning the light source so that it illuminates the two ends of the carriage
simultaneously in the Earth (B’s) frame: in the carriage (A’s) frame, the light now reaches the back later.
So if it illuminates clocks moving with the carriage (in A’s frame), the rear clock will be ahead
(show a larger reading) when illuminated. In frame A the source divides the carriage, of proper length L, into L(c + v)/2c towards the rear and L(c – v)/2c towards the front; in frame B the carriage, of length LB, is divided into LB(c + v)/2c and LB(c – v)/2c. Event 1: light emitted. In frame A, event 2 (light reaches the front) happens before the light reaches the back; in frame B, events 2 and 3 (light reaches front and back) are simultaneous.]
Now, imagine that we position the light source further forward in the carriage, so that in B’s
frame the light now reaches both ends of the carriage simultaneously, and illuminates a clock
at each end. The clocks are synchronised in A’s frame. In A’s frame, the light takes longer to
reach the rear of the carriage, so when the clocks are illuminated by the light, the clock at the
rear of the carriage will be ahead of the clock at the front (that is, the time elapsed since it
was set will be larger) – see Figure S34.8.
If we continue to emit pulses of light, the rear clock will continue to be ahead, but
always by the same amount. The rate of passage of time on the two clocks is the same – the
rear clock is just ahead by a constant amount. This effect is therefore a completely different
effect to time dilation. From B’s perspective, the passage of time in the train carriage will
be slower, that is, it will be time dilated, but this time dilation affects both clocks equally. It
is worth restating this, as it is important: the fact that the rear clock is ahead by a constant
amount is unrelated to any time dilation effect. The effect we are dealing with here is called
loss of simultaneity. The clock at the rear is illuminated later after the emission of the light
in A’s frame, but both clocks are illuminated simultaneously in B’s frame. Since the clocks
show the time elapsed in A’s frame, when they are illuminated, the rear clock shows the
higher reading (is ahead).
With a bit of further consideration, we can work out how much the rear clock is ahead, in
a carriage of proper length L and moving at velocity v relative to B’s frame. A light source,
stationary in the carriage frame, emits photons. A photon travelling backwards approaches the
rear wall of the carriage at speed c + v in B’s frame. A photon travelling forward approaches the
front wall of the carriage at speed c – v. If we divide the train in this ratio in B’s frame, as shown in
Figure S34.8, then in B’s frame the photons will reach the walls simultaneously. The ratio is the
same in A’s frame, because length contraction contracts all lengths by the same factor. We can
work out the required position of the light source by knowing that the lengths are divided in this
ratio and must add up to L. Figure S34.8 shows the position required.
Now, the light travelling to the rear clock travels an extra distance of
$$\frac{L(c + v)}{2c} - \frac{L(c - v)}{2c} = \frac{Lv}{c}$$
in A’s frame. If we divide this by the speed of light, we get the extra time taken for the light to
travel to the rear of the carriage. So the rear clock is ahead by a time
$$\frac{Lv}{c^2}$$
We will refer to this difference in our analysis of the twin ‘paradox’. This effect also has nothing
to do with the travel time of light – there is a true difference between the time coordinates at the
two different locations in space.
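The positioning argument can be checked with numbers. The sketch below assumes a carriage of proper length 30 m moving at 0.5c; both values, and the variable names, are illustrative only. It places the source as described, then works out the light travel times to each end in A’s frame and in B’s frame.

```python
import math

C = 3.0e8  # speed of light in m/s (rounded, as in the text)

L = 30.0      # assumed proper length of the carriage in m
v = 0.5 * C   # assumed speed of the carriage relative to B

# Position of the source in A's frame, measured from each end wall.
to_rear = L * (C + v) / (2 * C)
to_front = L * (C - v) / (2 * C)

# A's frame: the walls are at rest, so the light covers these distances at c.
t_rear_a = to_rear / C
t_front_a = to_front / C

# B's frame: all lengths shrink by gamma; the rear wall closes on the light
# at c + v, while the front wall recedes, giving a closing speed of c - v.
g = 1.0 / math.sqrt(1.0 - (v / C) ** 2)
t_rear_b = (to_rear / g) / (C + v)
t_front_b = (to_front / g) / (C - v)

rear_clock_lead = t_rear_a - t_front_a
print(f"A's frame: rear clock lit {rear_clock_lead:.3e} s after the front clock")
print(f"L*v/c^2 = {L * v / C**2:.3e} s")
print(f"B's frame arrival times: {t_rear_b:.3e} s and {t_front_b:.3e} s")
```

The two arrival times in B’s frame agree, while in A’s frame they differ by Lv/c², as derived above.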
situation carefully, in the turn around and switch to the new inertial frame, the Earth clock
suddenly jumps ahead. This is related to the ideas of loss of simultaneity that we have been
discussing. On both the outward and return legs, the travelling twin ‘sees’ time passing more
slowly in the Earth frame (he could determine this from a transmission from Earth, taking
into account the effects of the time a radio signal would take to reach him), but the change of
frame on turn-around means that in the end, the Earth-bound twin is older.
The sudden ‘jump’ in the Earth clock is an effect of the change in inertial frames, and is
not an effect of the acceleration (although if we also tried to take the required acceleration
into account, it would get more complicated to calculate, as we must introduce the general
theory of relativity into the argument).
We can set the experiment up differently to avoid having to include the effects of the
acceleration. As a spaceship containing a clock travels at velocity v past the Earth, it
synchronises its clock with a clock on the Earth. It travels to a nearby star, maintaining its
velocity. When it gets there, another ship, with velocity v in the opposite direction passes it,
heading for Earth. As they pass, they synchronise their clocks. When the second spaceship
passes the Earth, they compare the reading on its clock and the clock that remained on
Earth. More time has passed on the Earth clock. This scenario gives us the same change in
frame as in the classic ‘twin paradox’, but without the acceleration.
Let’s analyse what happens more quantitatively, in the situation where the spacecraft is
travelling at 3c/5, to a star 4 light years (ly) away, Alpha Centauri. (You do not need to be
able to remember the steps of this worked example; it is included to give you another way of
understanding the twin ‘paradox’.)
The γ-factor for 3c/5 is:
$$\gamma = \frac{1}{\sqrt{1 - \left(\frac{3}{5}\right)^2}} = \frac{5}{4}$$
In the Earth frame, the return journey distance is 8 ly, so a journey at the speed of light would
take 8 years. At 3c/5, the journey takes
$$\frac{5}{3} \times 8 = \frac{40}{3}\ \text{years}$$
This is the time that will elapse on the clock that is left on Earth during the journey. As the
outgoing spaceship reaches the ‘turn-around’ point, its clock is synchronised with the incoming
spaceship’s clock while they’re at the same point in space (avoiding any problems from lack
of simultaneity). From the point of view of the observer on Earth, the clock on the spaceship
is time-dilated. The outgoing and incoming journeys take the same time (in both frames), and
therefore the total time elapsed on the spaceship clock as it returns to Earth is
$$\frac{1}{\gamma} \times \frac{40}{3}\ \text{years} = \frac{32}{3}\ \text{years}$$
Now let’s look at what happens in the spaceships’ frames. In those frames, the distance that the
spaceship needs to travel in each direction is length contracted. The distance to Alpha Centauri
is therefore, in this frame:
$$\frac{4\ \text{ly}}{\gamma} = \frac{16}{5}\ \text{ly}$$
At a speed of 3c/5, the outward journey in this frame therefore takes:
$$\frac{\frac{16}{5}\ \text{ly}}{\frac{3}{5}c} = \frac{16}{3}\ \text{years}$$
Since the return journey will take the same amount of time, we can already see that this matches
up with our calculation in the Earth frame: the clock on the spaceship will have advanced by
32/3 years.
Now, let’s look at what happens to the clock on Earth, in the ships’ frames. During the
outward journey, the astronaut sees the Earth’s clock as running slow (reads less), due to time
dilation. So as he arrives at Alpha Centauri, the Earth clock reads
$$\frac{16}{3}\ \text{years} \times \frac{1}{\gamma} = \frac{16}{3} \times \frac{4}{5}\ \text{years} = \frac{64}{15}\ \text{years}$$
Now, imagine a clock on Alpha Centauri which was synchronised with the Earth clock at the time
the journey started. From the astronaut’s point of view during the outward journey, the Alpha
Centauri clock is the ‘rear clock’ (look back to our analysis of loss of simultaneity). So it is ahead
of the Earth clock by a constant amount Lv/c². So on arrival at Alpha Centauri, the Alpha Centauri
clock reads:
$$\frac{64}{15}\ \text{years} + \frac{Lv}{c^2} = \frac{64}{15}\ \text{years} + 4 \times \frac{3}{5}\ \text{years} = \frac{100}{15}\ \text{years}$$
Now, when the incoming ship arrives, it is in an inertial frame moving in the opposite direction to
the original outgoing ship. It is also at the same spatial location as the Alpha Centauri clock, so
it must see the same reading on that clock as the outgoing ship. However, from its point of view,
now the Earth clock is the rear clock (as Alpha Centauri is moving away at the front of the ‘train’).
So, in this change of frames, the Earth clock instantaneously jumps from being behind the Alpha
Centauri clock by Lv/c² to being ahead of it by Lv/c². The reading on
the Earth clock from the point of view of the ship has now become:
$$\frac{100}{15}\ \text{years} + \frac{Lv}{c^2} = \frac{100}{15}\ \text{years} + 4 \times \frac{3}{5}\ \text{years} = \frac{136}{15}\ \text{years}$$
Now, on the return journey, the Earth clock is again time-dilated, so running slow from the
astronaut’s point of view. During the journey it advances the same amount as it did in the
outward journey, 64/15 years. Therefore the reading on the Earth clock, as the spaceship arrives
at Earth, is:
$$\frac{136}{15}\ \text{years} + \frac{64}{15}\ \text{years} = \frac{200}{15}\ \text{years} = \frac{40}{3}\ \text{years}$$
This is the same as our calculation in the Earth frame. All is consistent, and the clock on the
spaceship has advanced less than the clock on Earth. There is, indeed, no paradox!
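The bookkeeping in this worked example can be repeated with exact fractions. The sketch below works in units where c = 1 (distances in light years, times in years) and uses the values from the text: a speed of 3c/5 and a distance of 4 ly. The variable names are invented for this illustration, and the exact square root only works because 1 − v² happens to be a perfect square here.

```python
from fractions import Fraction
from math import isqrt

# Units: distances in light years, times in years, so c = 1.
v = Fraction(3, 5)   # speed of each ship as a fraction of c
L = Fraction(4)      # Earth to Alpha Centauri distance in the Earth frame

# gamma = 1/sqrt(1 - v^2); exact here because 1 - v^2 = 16/25 is a perfect square.
inv_gamma_sq = 1 - v**2
gamma = Fraction(isqrt(inv_gamma_sq.denominator), isqrt(inv_gamma_sq.numerator))

# Earth frame
earth_elapsed = 2 * L / v             # time on the Earth clock for the round trip
ship_elapsed = earth_elapsed / gamma  # time on the ship clocks (time dilated)

# Ships' frames
one_way_distance = L / gamma                   # length-contracted distance
one_way_time = one_way_distance / v            # ship-clock time for each leg
earth_advance_per_leg = one_way_time / gamma   # Earth clock runs slow in these frames
rear_clock_offset = L * v                      # L*v/c^2 with c = 1

earth_on_arrival = earth_advance_per_leg                    # outgoing ship's frame
alpha_on_arrival = earth_on_arrival + rear_clock_offset     # Alpha Centauri clock reading
earth_after_switch = alpha_on_arrival + rear_clock_offset   # incoming ship's frame
earth_at_reunion = earth_after_switch + earth_advance_per_leg

print(gamma, earth_elapsed, ship_elapsed)
print(one_way_distance, one_way_time, earth_advance_per_leg)
print(alpha_on_arrival, earth_after_switch, earth_at_reunion)
```

`Fraction` prints values in lowest terms, so 100/15 years appears as 20/3. The final reading on the Earth clock equals the Earth-frame result, and twice the one-way ship time equals the ship-clock total, in both descriptions.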
Usually we move at speeds where the effects of relativity are virtually unnoticeable. However,
atomic clocks are accurate enough to measure time dilation at the speed that jet airliners
travel. Hafele and Keating did an experiment in 1971 where they flew four caesium atomic
beam clocks around the world on scheduled airline flights, both eastwards and westwards.
They found that the results were consistent with the predictions of relativity to within the
experimental error. They needed to take both special and general relativity into account, as
at altitude, the gravitational field is weaker. Their paper states that ‘these results provide an
unambiguous empirical resolution of the famous clock “paradox” with macroscopic clocks’.
What they refer to as the ‘clock “paradox”’ is what we have called the ‘twin “paradox”’.
Similar, more accurate, experiments conducted later have also confirmed the predictions of
relativity.
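To see why atomic clocks are needed, it is worth estimating the size of the special-relativistic effect at airliner speeds. The sketch below is a rough illustration only: the cruising speed of 250 m/s and the 24 hours of flying time are assumed round numbers, not values from the experiment, and the estimate ignores gravitational effects and the rotation of the Earth. It uses the low-speed approximation γ − 1 ≈ v²/2c².

```python
C = 3.0e8  # speed of light in m/s (rounded, as in the text)


def clock_lag(v, duration):
    """Approximate time lost by a clock moving at speed v << c for `duration` seconds.

    Uses gamma - 1 ≈ v**2 / (2 * C**2), which avoids rounding problems when
    gamma is extremely close to 1.
    """
    return duration * v**2 / (2 * C**2)


v_airliner = 250.0        # assumed cruising speed in m/s (illustrative only)
flight_time = 24 * 3600   # assumed total time in the air in s (illustrative only)

lag = clock_lag(v_airliner, flight_time)
print(f"moving clock lags by about {lag * 1e9:.0f} ns")
```

Under these assumptions the lag is a few tens of nanoseconds, far too small to notice in everyday life but within the reach of an atomic clock.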
The Global Positioning System (GPS), used for satellite navigation, relies on accurate
timing to determine your position on the Earth. The GPS satellites also use atomic clocks,
and these must be corrected for the effects of relativity.
S34.7 Some hints to remember how to apply relativistic effects
Remember that the γ-factor is always greater than or equal to 1.
$$\gamma = \frac{1}{\sqrt{1 - \frac{v^2}{c^2}}}$$
Time dilation
Moving clocks run slow: less time elapses between events in a frame that is moving with
respect to you. So if frame A is moving at velocity v with respect to frame B, then more time
elapses between events in frame B – so the γ-factor must multiply ∆tA:
$$\Delta t_B = \gamma\,\Delta t_A$$
Length contraction
Moving objects are measured as being shorter: an object is longest in its rest frame. If
an object is stationary in frame A, with length lA in that frame, and frame A is moving with
velocity v with respect to frame B, then the object will be measured as having a shorter length in
frame B:
$$l_B = \frac{l_A}{\gamma}$$
Don’t forget, though, that it’s equally valid for observer A, for whom frame B is moving
at velocity –v, to say that lengths in frame B are length contracted. So the equation is
equally valid with A and B exchanged: but in this case we are measuring, in frame A, an object
which is stationary in frame B, so there is no contradiction!
Loss of simultaneity
Rear clock ahead – if you observe two clocks separated in space that are both in the same
inertial frame, which itself is moving relative to you, then the rear clock (the one that would
pass you second if they were approaching) is a constant amount ahead (whenever you
observe them). This comes about because two observations that are simultaneous in your
frame are not simultaneous in the frame that is moving relative to you. Often, apparent
paradoxes in relativity can be answered by considering the loss of simultaneity.
Traditional notation
In many relativity textbooks, you will often see the transformations expressed between a
primed frame (Δx', Δy', Δz', Δt') and an un-primed frame (Δx, Δy, Δz, Δt). Conventionally,
the primed frame is the frame moving with velocity v along the x-axis with respect to the un-
primed frame. Often, books also drop the Δ (but it is implicitly there).
So this means that we can write our time-dilation and length contraction effects in the
following form:
$$\Delta t' = \gamma\,\Delta t$$
$$l' = \frac{l}{\gamma}$$
If we combine our knowledge of all of these effects and the conditions under which they apply,
we can write down coordinate transformations for going from one frame to another. This is the
relativistic equivalent of the Galilean transformation we discussed initially. Using the prime/
un-primed frame notation, the transformations are:
$$\Delta x = \gamma\left(\Delta x' + v\,\Delta t'\right)$$
$$\Delta t = \gamma\left(\Delta t' + \frac{v\,\Delta x'}{c^2}\right)$$
You do not need to remember these now, but they are presented for completeness. They allow
us to work with events that do not fit the restrictions that we built into our derivations of time
dilation, length contraction and loss of simultaneity – i.e. cases where we would expect a
combination of these effects.
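As a sketch of how these transformations are used, the function below applies them to a pair of events. It works in units where c = 1 and assumes a speed of 0.6c; the function name and the example events are made up for this illustration. In the first example the clock is taken to be at rest in the primed frame, so it is the un-primed interval that comes out dilated.

```python
import math


def to_unprimed(dx_p, dt_p, v, c=1.0):
    """Transform separations (dx', dt') in the primed frame to (dx, dt).

    The primed frame moves at velocity v along the x-axis of the un-primed frame.
    """
    g = 1.0 / math.sqrt(1.0 - (v / c) ** 2)
    dx = g * (dx_p + v * dt_p)
    dt = g * (dt_p + v * dx_p / c**2)
    return dx, dt


v = 0.6  # assumed speed in units where c = 1

# Two ticks of a clock at rest in the primed frame: same place, 1 time unit apart.
dx, dt = to_unprimed(0.0, 1.0, v)
print(dx, dt)    # the un-primed observer sees the clock move and measures a longer interval

# Two events simultaneous in the primed frame but 1 length unit apart.
dx2, dt2 = to_unprimed(1.0, 0.0, v)
print(dx2, dt2)  # dt2 is non-zero: the events are not simultaneous in the un-primed frame
```

The first case recovers time dilation and the second recovers loss of simultaneity, each as a special case of the same pair of equations.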
Summary
■ In the late 19th century, most physicists thought that electromagnetic waves travelled
in a medium that they called the aether. However, experiments to measure the
variation in the speed of light due to Earth’s motion through the aether all yielded
null results.
■ Einstein’s two postulates of relativity are:
■ The laws of physics are the same in all inertial frames of reference (frames of
reference/coordinate systems moving at a constant speed with respect to each
other).
■ The speed of light in free space has the same value c in all inertial frames of
reference.
■ Einstein’s postulates of relativity dispense with the idea of the aether – light does not
require a medium in the same way as a mechanical wave.
■ The postulates of relativity give rise to time dilation and length contraction: space
and time are no longer absolute quantities: distances and times between events
change depending on which inertial frames they are measured in.
■ Time dilation: $\Delta t' = \gamma\,\Delta t = \dfrac{\Delta t}{\sqrt{1 - \frac{v^2}{c^2}}}$
■ Length contraction: $l' = l\,\sqrt{1 - \frac{v^2}{c^2}}$
■ Two events that are simultaneous in one frame of reference may not be
simultaneous in another frame of reference.
End-of-chapter questions
S34.1.
a What does Einstein’s special theory of relativity state about the laws of physics? [1]
b What does Einstein’s special theory of relativity state about the speed of light? [1]
c Filippas and Fox conducted an experiment to test special relativity. They measured the speed of the
gamma rays emitted when a particle called a neutral pion decays into a pair of gamma rays. The
gamma rays are emitted in opposite directions, and there are no other products of the decay.
i Explain why the gamma rays are expected to travel at the speed of light. [1]
ii Explain why a stationary pion could not decay to a single gamma ray photon. [2]
d The pions used in the experiment in (c) were moving at a speed of 0.20c in the laboratory frame of
reference. The gamma rays were emitted parallel to the motion, as shown in the diagram below.
[Diagram: neutral pion, with the gamma rays emitted parallel to its motion]
i The results of the experiment showed that the velocities of the photons relative to the laboratory
were equal to c in both directions, to within the limits of the experimental uncertainty. What
conclusion can be drawn from this? [1]
ii What is the velocity of the forward photon relative to the pion, i.e. seen from a reference frame
moving with the same velocity as the pion when it decays? [1]
iii The momentum of a photon is related to its energy by the formula E = pc. What can be said about
the frequency of the two photons emitted in this decay? [4]
iv In the laboratory, the half-life for the decay of a stationary neutral pion is 0.18 ns. Calculate the
half-life of the pion when it is moving at 0.2 c. [2]
S34.2.
The principle of relativity states that the laws of physics are the same for all uniformly moving observers.
a State what is meant by uniformly moving. [1]
b What does this imply about c, the speed of light in a vacuum? [1]
c Explain what is meant by time dilation (it is not necessary to derive any formulae). [3]
d A muon has a mean lifetime of 2.2 µs when it is stationary in the laboratory. Sketch a graph to show
how the particle’s observed lifetime in the laboratory depends on its velocity through the laboratory.
Label your graph carefully. [4]
S34.3.
One of the consequences of special relativity is that if an astronaut were to take a lengthy journey at
speeds close to the speed of light, leaving and returning to the Earth, for her the journey would take a
relatively short length of time, but several generations may have passed on Earth.
a Using your knowledge of special relativity, explain the statement above. [3]
b The total distance travelled on such a trip is 50 light years and the astronaut travels at a speed of 0.98c.
i Calculate how much time has passed during the journey for the people remaining on Earth.
ii Calculate how much time has passed for the astronaut during her journey. [3]
c Explain why this could be considered to be an example of time travel. [2]
d The calculations you have done are in the frame of reference of the Earth. Explain why it would not be
justified to carry out the same analysis in the same way from the reference frame of the astronaut. [2]
S34.4
second due to time dilation. You may wish to use the following approximation to the time dilation
equation:
$$t' = \frac{t}{\sqrt{1 - v^2/c^2}} \approx t\left(1 + \frac{v^2}{2c^2}\right)$$
[3]
c How long would it take to accumulate an error of 100 m in position (given that the signals travel at
the speed of light), if this error were not corrected for? [2]
(The second effect on the clock comes from the general theory of relativity, and is due to gravitational
time dilation.)
S34.5
Two trains, A and B, each have proper length (length in their rest frame) L, and move in the same direction.
A’s speed is 4c/5, and B’s speed is 3c/5. A starts immediately behind B (see diagram below).
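[Diagram: train A, moving at 4c/5, immediately behind train B, moving at 3c/5 in the same direction, with observer C on the ground.]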
a How long, as viewed by person C on the ground, does it take for A to overtake B? This is the time
elapsed between them being in the position shown in the diagram until the back of A is level with the
front of B. [6]
b Explain why we cannot use the time dilation result to calculate the time taken for the trains to
overtake in A’s frame (or B’s frame). [3]
S34.6
Two painters stand on a train platform, a distance L apart. As a train passes by at speed v, both painters
simultaneously (in the platform frame) make a mark with their brushes on the train. Due to the length
contraction of the train, we know that the marks on the train are a distance γL apart when viewed in
the train’s frame of reference, because this distance is the distance that is length contracted down to a
distance L in the platform frame.
a How would someone on the train qualitatively explain why the marks are a distance γL apart, even
though in their frame the painters stood a distance of L/γ apart? [2]
b Can you explain part (a) quantitatively (harder!)? [5]
S35: Astronomy and cosmology
Learning Outcomes
■ understand the terms luminosity and luminous flux
■ recall and use the inverse square law for flux F = L/(4πd²)
■ understand the need to use standard candles to help determine distances to galaxies
■ recognise and use Wien’s displacement law λmax ∝ 1/T to estimate the peak surface
temperature of a star either graphically or algebraically
■ recognise and use Stefan’s law for a spherical body L = 4πr²σT⁴
■ use Wien’s displacement law and Stefan’s law to estimate the radius of a star
■ understand that the successful application of Newtonian mechanics and gravitation to the
Solar System and beyond indicated that the laws of physics apply universally and not just
on Earth
■ recognise and use Δλ/λ ≈ Δf/f ≈ v/c for a source of electromagnetic radiation moving relative
to an observer
■ state Hubble’s law and explain why galactic redshift leads to the idea that the Universe is
expanding and to the Big Bang theory
■ explain how microwave background radiation provides empirical support for the Big Bang
theory
■ understand that the theory of the expanding Universe involves the expansion of space-time
and does not imply a pre-existing empty space into which this expansion takes place or a
time prior to the Big Bang
■ recall and use the equation v ≈ H0d for objects at cosmological distances
■ derive an estimate for the age of the Universe by recalling and using the Hubble time t = 1/H0
S35.1 Introduction
Since ancient times, humans have sought to understand and explain what they have seen in
the night sky. Earlier we discussed how the ancient Greek geocentric (Earth centred) model
of the Universe gave way in the Renaissance to a heliocentric model consisting of elliptical
orbits. Empirically described by Kepler’s Laws, the elliptical orbits of the solar system were
explained by Newton’s theory of gravity. Newton’s theory, and the modifications made
by Einstein in his general theory of relativity, apply across the entire visible Universe. The
same physical laws that have been experimentally determined on Earth and within the
solar system can be seen to apply universally. Astronomical phenomena offer us a natural
laboratory, the observation of which allows us to test our physical theories under extreme
conditions and large scales not available in a laboratory on Earth.
from the star at the surface of the Earth, which is known as the luminous flux, F.
This is defined as the power per unit area of surface perpendicular to the radiation at a
distance d from the star, and has units W m⁻² (it is an intensity).
We can relate the star’s luminosity and the luminous flux by the equation:
$$F = \frac{L}{4\pi d^2}$$
The flux follows an inverse square law. The equation assumes that all the radiation of the
star is spread out evenly in all directions. At a distance d, the total radiation from the star
is spread out over the surface of a sphere of radius d, which has area 4πd² (see Figure S35.1). This law means
that if we have a star of a known luminosity, and can measure the luminous flux on Earth,
we can work out how far away the star is. Alternatively, for some stars there are other ways
of determining the distance, in which case we can use the equation to determine the star’s
luminosity.
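As a rough numerical illustration of the inverse square law, the sketch below computes the flux from the Sun at the Earth and then inverts the law to recover the distance. The function names are illustrative assumptions; the solar luminosity (3.85 × 10²⁶ W) and the astronomical unit (1.496 × 10¹¹ m) are the values used elsewhere in this chapter.

```python
import math

def luminous_flux(luminosity, distance):
    """Flux F = L / (4 pi d^2), in W m^-2."""
    return luminosity / (4.0 * math.pi * distance ** 2)

def distance_from_flux(luminosity, flux):
    """Invert the inverse square law: d = sqrt(L / (4 pi F)), in m."""
    return math.sqrt(luminosity / (4.0 * math.pi * flux))

L_SUN = 3.85e26   # W, solar luminosity
AU = 1.496e11     # m, mean Earth-Sun distance

flux_at_earth = luminous_flux(L_SUN, AU)
print(flux_at_earth)                             # roughly 1.4e3 W m^-2
print(distance_from_flux(L_SUN, flux_at_earth))  # recovers the distance used above
```

The same two functions cover both uses described above: a known luminosity and a measured flux give the distance, while a known distance and a measured flux give the luminosity after rearranging.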
[Figure S35.1: a star of luminosity L radiating equally in all directions; the intensity at a distance r is L/(4πr²), shown for two distances r1 and r2.]
The spectrum of the radiation that is emitted from this black body follows Planck’s law,
which means that its spectrum only depends on its temperature (see Figure S35.2). At room
temperature, the spectrum of a black body peaks in the infrared, so to a human eye the
object would appear matt black at visible wavelengths.
[Graph: intensity (arbitrary units) against wavelength (µm) for black bodies at 3000 K, 4000 K, 5000 K and 6000 K, with the peak wavelength λmax marked.]
Figure S35.2 The spectrum of a black body at various temperatures.
It may seem surprising, but the spectrum of a filament bulb as it is heated, and the spectrum
of a star, are close to that of an ideal black body, even though they are not in thermal
equilibrium with their surroundings. The black body spectrum is a good first approximation
to the spectrum of these objects. (The observed spectrum closest to a perfect black body
spectrum is that of the cosmic microwave background radiation, which we will discuss later.)
The temperature of the black body spectrum that most closely matches a star’s spectrum
is known as the effective temperature of the star. This temperature is a good estimate for the
peak surface temperature of a star; the star will be hotter inside. We can estimate the surface
temperature of a star by using Wien’s displacement law, which relates the wavelength at the
maximum of a black body spectrum to the temperature of the black body.
$$\text{wavelength of maximum (m)} = \frac{\text{Wien's displacement constant (m K)}}{\text{absolute temperature (K)}}$$
$$\lambda_{\max} = \frac{B}{T}$$
Wien’s law was developed by Wilhelm Wien several years before Max Planck derived the
general form of the black body spectrum.
WORKED EXAMPLE
Estimate the surface temperature of a red-orange star with a spectrum that peaks at 700 nm.
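One way to carry out this estimate is sketched below. The value of Wien's displacement constant (2.898 × 10⁻³ m K) is the standard value and is an assumption here, since it is not quoted in this section; the function name is illustrative.

```python
WIEN_B = 2.898e-3  # Wien's displacement constant in m K (standard value, assumed)

def surface_temperature(peak_wavelength):
    """Rearranged Wien's law: T = B / lambda_max, with lambda_max in metres."""
    return WIEN_B / peak_wavelength

T = surface_temperature(700e-9)  # peak at 700 nm
print(round(T), "K")
```

This gives a surface temperature of roughly 4100 K.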
This enables us to classify stars according to their colour. Red stars are (relatively!) cool,
with surface temperatures of around 3000 K. Yellow stars such as our Sun have surface
temperatures closer to 6000 K. Some blue stars will have surface temperatures greater than
20 000 K (in fact the peak of their spectrum falls in the ultraviolet).
$$I = \sigma T^4$$
If we multiply this power per unit area by the surface area of the star, we get the luminosity,
L, for the star. For a star of radius r, the luminosity is:
$$L = 4\pi r^2 \sigma T^4$$
WORKED EXAMPLE
The spectrum of Sirius A (the brightest star in the night sky) has its maximum at 292 nm. Its
luminosity is 25.4 times the luminosity of our Sun, which has a luminosity of 3.85 × 10²⁶ W. Use
this data to estimate the radius of Sirius A.
Step 1 Use Wien’s displacement law to estimate the surface temperature of the star:
Step 2 Use Stefan’s Law for a spherical body to calculate the radius:
$$L = 4\pi r^2 \sigma T^4$$
For comparison, the radius of the Sun is 6.96 × 10⁸ m, so the radius of Sirius A is
approximately 1.7 times larger.
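The two steps of this worked example can be checked with a short calculation. This is a sketch under stated assumptions: the constants are standard values (Wien's displacement constant 2.898 × 10⁻³ m K, Stefan–Boltzmann constant 5.67 × 10⁻⁸ W m⁻² K⁻⁴) that are not quoted in this section, and the function names are illustrative.

```python
import math

WIEN_B = 2.898e-3   # Wien's displacement constant, m K (standard value, assumed)
SIGMA = 5.67e-8     # Stefan-Boltzmann constant, W m^-2 K^-4 (standard value, assumed)
L_SUN = 3.85e26     # solar luminosity in W, as given in the example

def temperature_from_peak(peak_wavelength):
    """Step 1: Wien's law, T = B / lambda_max."""
    return WIEN_B / peak_wavelength

def radius_from_luminosity(luminosity, temperature):
    """Step 2: Stefan's law rearranged, r = sqrt(L / (4 pi sigma T^4))."""
    return math.sqrt(luminosity / (4.0 * math.pi * SIGMA * temperature ** 4))

T = temperature_from_peak(292e-9)            # peak at 292 nm
r = radius_from_luminosity(25.4 * L_SUN, T)  # luminosity is 25.4 L_Sun
print(T, r)
```

With these inputs the temperature comes out a little under 10 000 K and the radius at about 1.2 × 10⁹ m.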
questions
35.1 The background radiation, a remnant from the Big Bang, has the spectrum of thermal
radiation from a black body at a temperature of 2.7 K.
a Calculate the peak wavelength of this spectrum.
b What region of the electromagnetic spectrum does this peak wavelength belong to?
35.2 Mintaka is a star system at a distance of 1200 light years from Earth, in the
constellation Orion. One of the component stars is a class O star with a surface
temperature of 29 500 K. Its luminosity is 190 000 times the luminosity of our Sun
(which has a luminosity of 3.85 × 10²⁶ W).
a Calculate the peak wavelength in this star’s spectrum.
b Calculate an estimate for the radius of the star, using the Stefan–Boltzmann law.
35.3 Using the data for the Mintaka star in question 35.2, determine how far it would have
to be from the Earth for the luminous flux of radiation arriving from it to be equal to the
luminous flux from the Sun. Leave your answer in terms of the mean distance between
the Earth and the Sun, which is called 1 AU (astronomical unit). (1 AU = 1.496 × 10¹¹ m)
According to the Stefan–Boltzmann law, a star could be very luminous either because its radius
is very large, or because it is very hot, or a combination of these two factors. Astronomers
classify different types of stars into categories using the spectral class, a classification system
based on the elements observed in a star’s absorption spectrum (see Chapter 30), which is
closely connected to the temperature of a star’s outer layers. Astronomers have observed
that there is a clear relationship between the spectral class of a star and its luminosity. This
relationship is shown in a plot known as the Hertzsprung–Russell diagram (named after the
two scientists who independently discovered this relationship). The diagram is shown in Figure
S35.3. The y-axis of the diagram is the luminosity of the star, relative to the Sun, on a logarithmic
scale. On the x-axis is effective surface temperature, also on a logarithmic scale.
You may also see this diagram presented in terms of the stars’ magnitudes. Astronomers often
describe the luminosities of stars in terms of magnitudes. The apparent magnitude is related
to the luminous flux (the brightness as it appears in the sky), while the absolute magnitude is
related to the luminosity (the total power output of the star). Magnitudes are expressed on a
logarithmic scale, and the lower the magnitude, the brighter the star.
The most notable feature of the Hertzsprung–Russell diagram is the main sequence, along
which luminosity rises with surface temperature. Our Sun is currently on the main sequence,
and is labelled on the diagram. The relationship between luminosity and mass for a main
sequence star can be modelled by an approximate power law
$$\frac{L}{L_{\text{Sun}}} = \left(\frac{M}{M_{\text{Sun}}}\right)^{3.5}$$
where L is the luminosity and M is the mass. Conventionally, we have divided each by the
value for the Sun, as often values of these quantities are quoted in terms of the Sun’s luminosity
and mass. Note that if we look at a particular part of the main sequence, we can work out the
particular power law for that type of star, which will fit the particular trend for that part of the
main sequence more precisely than this approximate power law.
Inside the core of a star on the main sequence, hydrogen nuclei fuse together to form helium
nuclei. This is a nuclear fusion reaction (see Chapter 31) that generates huge amounts of thermal
energy. This energy spreads outwards, creating a thermal pressure outwards from the
core, which balances the gravitational pressure caused by the mass of the star pulling inwards.
This balance of forces means that the star maintains a particular radius.
The gravitational pressure is greater for a star with larger mass, so more massive stars
are hotter – a greater thermal pressure is needed to balance this gravitational pressure. The
more massive star generates more power, and so has a larger luminosity. Since the luminosity
increases as approximately M^3.5, but the amount of fuel a star has for fusion is only proportional to its mass M,
it follows that more massive stars burn their fuel more quickly and have a shorter lifetime on the
main sequence (the lifetime scales roughly as M/L, that is, as M^−2.5).
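The scaling argument can be made concrete with a small calculation. This is a minimal sketch, assuming the approximate power law with exponent 3.5 and assuming that the available fuel is simply proportional to the mass; the function names and the sample masses are illustrative.

```python
def relative_luminosity(mass_ratio, exponent=3.5):
    """L / L_Sun for a main sequence star of mass M / M_Sun."""
    return mass_ratio ** exponent

def relative_lifetime(mass_ratio, exponent=3.5):
    """Main sequence lifetime relative to the Sun.

    Fuel is taken to scale with M and the burn rate with L,
    so the lifetime scales as M / L.
    """
    return mass_ratio / relative_luminosity(mass_ratio, exponent)

# Sample masses in units of the Sun's mass (illustrative values)
for m in (0.5, 1.0, 2.0, 10.0):
    print(m, relative_luminosity(m), relative_lifetime(m))
```

On this model a star of ten solar masses is a few thousand times more luminous than the Sun but lasts only a fraction of a percent as long on the main sequence.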
Stars in other regions of the diagram exist under different conditions. Above and to the right
of the main sequence are the red giant stars. These are very luminous, but comparatively cool.
The Stefan–Boltzmann law tells us that a cooler star emits much less power per unit area. In
order to be more luminous than stars on the main sequence, a red giant must therefore have
a much larger radius. A red giant is typically formed when a main sequence star of average
mass has used up the supply of hydrogen in its core. The star is now fusing hydrogen in a shell
surrounding the core. The core has contracted under gravity, bringing this additional shell of
hydrogen into a zone where it can undergo fusion. The temperature is higher and the reaction
rate of nuclear fusion is increased, increasing the star’s luminosity. This causes the outer layers
of the star to expand greatly, but because the radius is much larger, the surface temperature of
the star drops.
The supergiants evolve from more massive stars on the main sequence. A supergiant is
massive enough that when it runs out of hydrogen in its core, the additional gravitational forces
almost immediately cause helium nuclei to fuse in the core. This means that the luminosity does
not increase in the same way as a red giant star, and so supergiants move horizontally across the
Hertzsprung–Russell diagram.
Below the main sequence are the white dwarfs. These are hot stars (by their colour) but they
are not particularly luminous. A white dwarf is typically an older star that no longer produces
energy by nuclear fusion; the luminosity comes from stored thermal energy. Since there is
no longer any outward pressure from the nuclear reactions, a white dwarf contracts until it
reaches a state in which the inward gravitational pressure is balanced by electron degeneracy
pressure. This is a quantum mechanical effect, and occurs because each quantum state can
only contain one electron.
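[Figure S35.3: the Hertzsprung–Russell diagram. Luminosity (compared to the Sun) is plotted on a logarithmic scale from 10⁻⁵ to 10⁶; the labelled regions are the supergiants, the giants, the main sequence (with the Sun marked) and the white dwarfs.]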
questions
35.4 Why must a cool star be large in order to have a large luminosity?
35.5 Explain why a very massive star on the main sequence is likely to have a large
luminosity. Why is it likely to have a very short life?
35.6 Why is there a lower limit to the mass of a star?
35.7 Our Sun is approximately 300 times more luminous than 40 Eridani B, a white
dwarf star (the first to be discovered). Its mass is approximately half that of
the Sun. Does 40 Eridani B obey the mass–luminosity relationship for the main
sequence?
[Diagram: the rungs of the distance ladder, from the Solar System (10⁻⁴ ly), nearby stars (10² ly), the Milky Way (10⁵ ly) and nearby galaxies (10⁷ ly) to galaxy clusters (10¹⁰ ly), with the standards used at each scale, including Venus, the Sun, white dwarf supernovae and Hubble’s law, d = v/H0.]
Figure S35.4 The cosmic distance ladder. We measure nearby objects using direct measurements,
such as parallax, and then use ‘standard candles’ to extend our distance scale to more distant objects.
the two observation points, allow us to calculate the distance to Venus. The measurement
was first made in 1761, the first transit of Venus after Halley developed the method, but
unfortunately after his death. It led to a measurement of the AU that was respectably close to
our current best estimate. We now determine the distance using radar signals bounced off
Venus and received by radio telescopes.
In fact, in 2012 the astronomical unit was redefined to be exactly
149 597 870 700 m. This definition means the AU is no longer exactly the same as the mean
distance between the Earth and the Sun; it is based around other constants. However, the
original definition is important in order that we understand how other measurements of
distance have been based upon it.
From the Earth, the stars appear fixed in place over long periods of time: we can observe the
same constellations as the ancient Greeks or Babylonians. However, the reality is that nothing
is fixed in space – everything moves relative to other objects. We orbit the Sun, the Sun moves
about the galactic centre, the other stars in the galaxy are moving relative to the Sun, and so on.
The stars are far enough away that although they are moving relative to us, on the timescales
that we observe them, these motions, termed ‘proper motions’ by astronomers, are very small
(but can be measured). For the closest star system, the Alpha Centauri system, these motions
are on the order of 1/1000th of a degree per year.
Although 1/1000th of a degree sounds tiny, remember that this difference is measured from
Earth according to the relative movement across the sky that we observe. Given the enormous
distances between Earth and the stars and galaxies we observe, that 1/1000th of a degree can
mean a very large distance has been moved by the object itself.
Astronomical parallax
Once we have a measurement for the AU (the Earth’s mean orbital distance), we can use it
to measure the distance to nearby stars. The trigonometric or astronomical parallax is the
amplitude of annual shift in position of a star as the Earth moves around its orbit, measured
as an angle (see Figure S35.5).
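[Figure S35.5: astronomical parallax. As the Earth moves a distance of 1 AU across its orbit, a near star appears to shift against the distant stars; the parallax angle P is the angle at the near star subtended by 1 AU.]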
A star with a larger parallax is closer to the Earth than a star with a smaller parallax. You can
observe this easily for yourself. Try standing inside a building near a window and placing an
object on the windowsill. Now bring your eyes level with the object and look past it through
the window. If you move your head from side to side, the object will appear to move further
through your field of view than a distant object outside the window.
Often parallax is a small fraction of a degree, so we use the arcsecond as the basic unit
of measurement. Just as one hour of time is divided into 60 minutes, and each minute into
60 seconds:
1 degree of arc = 60 minutes of arc = 3600 seconds of arc.
A star that has a parallax of one second of arc is defined as being at a distance of one parsec
(pc). Therefore:
$$\text{distance, } d \text{ (pc)} = \frac{1}{\text{parallax, } p \text{ (seconds of arc)}}$$
Using trigonometry (see Figure S35.5), we can work out how the parsec is related to the
astronomical unit:
$$\tan(p) = \frac{1\ \text{AU}}{d}$$
$$1\ \text{pc} = \frac{1\ \text{AU}}{\tan(p)} = \frac{1\ \text{AU}}{\tan(1'')} = 2.06 \times 10^{5}\ \text{AU} = 3.09 \times 10^{16}\ \text{m} = 3.26\ \text{light years (ly for short)}$$
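The conversion factors above can be reproduced numerically. The sketch below is illustrative: the length of a light year (9.461 × 10¹⁵ m) is a standard value assumed here, and the example parallax of 0.1 seconds of arc is hypothetical.

```python
import math

AU = 1.496e11           # m, mean Earth-Sun distance
LIGHT_YEAR = 9.461e15   # m (standard value, assumed)

# One second of arc, converted to radians for math.tan
ARCSEC = math.radians(1.0 / 3600.0)

parsec_in_au = 1.0 / math.tan(ARCSEC)
parsec_in_m = parsec_in_au * AU
parsec_in_ly = parsec_in_m / LIGHT_YEAR
print(parsec_in_au, parsec_in_m, parsec_in_ly)

def distance_pc(parallax_arcsec):
    """Distance in parsecs from a parallax in seconds of arc."""
    return 1.0 / parallax_arcsec

# Hypothetical star with a parallax of 0.1 seconds of arc
print(distance_pc(0.1), "pc")
```

Because the angles involved are so small, tan(p) is very nearly equal to p in radians, which is why the distance in parsecs is simply the reciprocal of the parallax in seconds of arc.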
The first successful use of this measurement to measure the distance to a star was by German
astronomer Friedrich Bessel in 1838, when he determined the distance to 61 Cygni to be 10.4 ly.
Current estimates place it at 11.4 ly. This method is limited to relatively close stars (up to 100 pc),
since as the stars get further away, the parallax gets too small to measure accurately (although it
can be averaged for clusters of stars that are close together).
question
35.8 Calculate the distance to the following stars, given their trigonometric parallax:
a Proxima Centauri (our nearest star): 0.772 arc seconds
b Wolf 359: 0.419 arc seconds
c Alpha Cephei (Alderamin): 0.067 arc seconds
Standard candles
As we mentioned earlier, if we know the luminosity of an object and can measure its
luminous flux, then we can calculate the distance it is away from us. Astronomical objects
for which the luminosity is well known are described as standard candles, and they can be
used for distance measurements.
By examining the shape of a star’s spectrum, we can determine the surface temperature
(using Wien’s displacement law). The width of the spectral lines gives us information
that means we can set limits on the luminosity. We can use this information to compare
the star to known stars and determine its luminosity (if you have read the extension
box entitled Hertzsprung-Russell diagram, you might be interested to know that we use
this to determine its luminosity). Then by measuring its luminous flux, we can work out
the distance to the star. This technique is, confusingly, called spectroscopic parallax
(confusing because there is no parallax involved!).
If we have a cluster of stars, we can plot their apparent magnitude against surface temperature.
Assuming the cluster contains typical stars, we know the distribution of luminosities that we
might expect. We can therefore use their measured luminous flux to work out the distance to the
stars (if you have read the extension box entitled Hertzsprung-Russell diagram, we in fact compare
the main sequence of the cluster to the main sequence on the Hertzsprung–Russell diagram). This
technique is called main sequence fitting. We can use spectroscopic parallax and main sequence
fitting to get the distances to stars within our galaxy, the Milky Way.
Another commonly used standard candle is the Cepheid variable, a type of star named
after Delta Cephei in the constellation Cepheus. These have a periodic luminosity – the
luminosity increases and decreases in a regular pattern over time. Astronomers discovered
that there is a direct relationship between the luminosity of a Cepheid variable and the period
over which it oscillates. Therefore, by measuring the period, we can determine the luminosity
(Figure S35.6). From the luminosity and measured luminous flux, we can calculate the
distance to the star. Of course, in order to calibrate our scale, we need some nearby Cepheids
for which we can use parallax to determine the luminosity. The relationship between period
and luminosity for Cepheid variables was first recognised by Henrietta Leavitt in 1912, and
this was later calibrated by Harlow Shapley. Cepheid variables were used as standard candles
by Edwin Hubble to find the distances to nearby galaxies (see section S35.5). In the 1950s, it
was determined that there is more than one type of Cepheid variable – and so the distance
scale had to be recalibrated. Cepheids are used as a standard candle for nearby galaxies.
At greater distances (to more distant clusters of galaxies), Type Ia supernovae can be used
as standard candles. A supernova is a violently exploding star. Such an explosion can briefly emit
as much power as an entire galaxy, so supernovae can be
spotted at great distances. A Type Ia supernova is thought to have a consistent luminosity
and is therefore useful as a standard candle. However, as they are relatively rare and short-lived,
we have to spot one before we can use it as a standard candle. Type Ia supernovae
involve a white dwarf star in what is called a binary system with another star close by.
There is an upper mass limit for a white dwarf star, of 1.44 times the mass of the Sun. This
is known as Chandrasekhar’s limit, and it is the point at which the gravity of the star can
[Graphs: a absolute magnitude against period (days, logarithmic scale) for type I (classical) Cepheids, type II Cepheids and RR Lyrae stars; b apparent magnitude of delta Cephei against time (days).]
Figure S35.6 a The relationship between luminosity and period for variable stars. Three classes
of variable star are shown, type I Cepheids, type II Cepheids and RR Lyrae stars. The scale here is
given as absolute magnitude – the lower the magnitude (more negative) the more luminous the
star. b The periodic variation in luminosity of a Cepheid variable star.
$$z = \frac{\delta\lambda}{\lambda} = \frac{\delta f}{f} = \frac{v}{c}$$
$$v = H_0 d$$
where v is the recession velocity of the galaxy, d is the distance to the galaxy and H0 is known
as Hubble’s constant. H0 is usually measured in km s⁻¹ Mpc⁻¹, so distance d is given in Mpc
(megaparsecs), and the equation will give us a recession velocity in km s⁻¹. Figure S35.7 shows
a recent plot of velocity vs distance for Type Ia supernovae, which fit Hubble’s law.
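A short numerical sketch shows how the units work in practice. The value of H0 used here (70 km s⁻¹ Mpc⁻¹) is an assumed mid-range figure, the example distance of 100 Mpc is hypothetical, and the function names are illustrative.

```python
C_KM_S = 2.998e5   # speed of light in km/s
H0 = 70.0          # Hubble's constant in km s^-1 Mpc^-1 (assumed mid-range value)

def recession_velocity(distance_mpc, h0=H0):
    """Hubble's law v = H0 d, giving v in km/s for d in Mpc."""
    return h0 * distance_mpc

def redshift(velocity_km_s):
    """Low-speed redshift z = v / c (only valid for v much less than c)."""
    return velocity_km_s / C_KM_S

v = recession_velocity(100.0)   # hypothetical galaxy at 100 Mpc
z = redshift(v)
print(v, "km/s", z)
```

For this hypothetical galaxy the recession velocity is 7000 km s⁻¹, which corresponds to a redshift of a little over 0.02.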
[Graph: Hubble diagram for Type Ia supernovae, with velocity (km s⁻¹, 0 to 4 × 10⁴) plotted against distance (Mpc, 0 to 700).]
Figure S35.7 Hubble’s law for Type Ia supernovae.
The value of Hubble’s constant lies between 60 and 80 km s⁻¹ Mpc⁻¹. A number of recent
measurements of Hubble’s constant are shown in Figure S35.8.
question
[Chart: Hubble constant calculated using different survey methods: Hubble (2011), Spitzer (2012), WMAP9 (2012) and Planck (2013), on a vertical scale from 64 to 78.]
Figure S35.8 Values of Hubble’s constant from different experiments.
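[Diagram: a dots 1 and 2 at distances d and 2d from the observer; b after the expansion, the same dots at distances 2d and 4d.]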
Figure S35.9 The expansion of the Universe, modelled in one dimension. The red dot represents
our observer, the black dots other galaxies. a An observed configuration. b A Universe that is
expanding at a constant rate has a ‘scale factor’ that is increasing with time.
If the Universe doubles in scale compared to its initial configuration in Figure S35.9a, over a time
interval Δt, then the distance between each black dot and the red dot doubles (Figure S35.9b).
This means the distance to dots that were initially further away has increased by a larger amount,
and those dots appear to be moving away faster (as the rate of motion is the distance divided
by the time taken for the expansion to happen, Δt). For instance, the dot (labelled 2) that was
initially a distance 2d away moved a distance 2d in time Δt (so that it is now 4d away from us). A
dot which was initially a distance d away from us (labelled 1) has moved an additional distance d
away in time Δt. So the recession velocity of dot 2 is twice that of dot 1.
During every time interval Δt, the scale of the Universe increases by a constant factor (a
factor of 2 is used in Figure S35.9). It is a bit like using the ‘enlarge’ setting repeatedly on a
‘cosmic photocopier’: during each time interval, the scale is increased in all directions by the
same amount. The result of this is that galaxies that are further away (from us, i.e. the red
dot) appear to be moving away from us faster. In other words, this theory of expansion from
the Big Bang matches Hubble’s law, and the observed redshifts of galaxies.
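The one-dimensional model can be reproduced in a few lines. This is a minimal sketch: the function name, the choice of units (d = 1, Δt = 1) and the extra dot at 3d are illustrative assumptions.

```python
def recession_velocities(distances, scale_factor, dt):
    """Apparent speed of each dot if every distance is multiplied by
    scale_factor during a time interval dt."""
    return [(scale_factor * d - d) / dt for d in distances]

# Hypothetical units: d = 1 and dt = 1
distances = [1.0, 2.0, 3.0]   # dots at d, 2d and 3d from the observer
velocities = recession_velocities(distances, scale_factor=2.0, dt=1.0)
ratios = [v / d for v, d in zip(velocities, distances)]

print(velocities)  # each velocity is proportional to the starting distance
print(ratios)      # the common ratio v/d plays the role of Hubble's constant
```

Every dot has the same ratio of velocity to distance, which is exactly the proportionality that Hubble's law describes.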
There are a number of things to point out about this idea. Firstly, the galaxies need not
be moving through space as they recede from us – the space itself expands and the gaps
between the galaxies therefore get bigger. This means that the observed redshift is actually
not due to a Doppler shift from classical physics, but in fact because the scale of the Universe
itself has increased since the light was emitted. The redshift, z, is directly linked to this
change in scale by the equation
$$z = \frac{R_2}{R_1} - 1$$
where R1 was the scale factor of the Universe when the light was emitted, and R2 is the scale
factor when it is received. If you look back to the formula for redshift in terms of recession
velocity, a redshift greater than 1 would imply v > c, but in fact this formula breaks down
close to the speed of light, and should not be used for redshifts greater than about 0.1.
However, the new interpretation of space itself expanding gives us an interpretation of
redshifts greater than 1 – for example, a redshift of 3 means that the Universe is now 4 times
larger than it was when the light was emitted. (It does not imply something is travelling
through space faster than light!)
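The link between redshift and scale factor is simple enough to check directly. The function names below are illustrative assumptions.

```python
def scale_ratio(z):
    """Ratio R2 / R1 of the scale factor now to the scale factor at emission."""
    return 1.0 + z

def redshift_from_scale(r1, r2):
    """Redshift z = R2 / R1 - 1 for light emitted at scale R1 and received at R2."""
    return r2 / r1 - 1.0

print(scale_ratio(3.0))               # the Universe has grown by this factor
print(redshift_from_scale(1.0, 4.0))  # the redshift for a fourfold expansion
```

A redshift of 3 corresponds to a fourfold increase in scale, in agreement with the example in the text.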
Secondly, imagine that our red dot in Figure S35.9 was in a different position. The same
result would apply, that the recession velocity was proportional to the distance between that
position and the object whose recession velocity was being measured. This means that the
Earth is not in a ‘special place’ at the centre of the Universe in order for Hubble’s law to apply
– the same expansion can be observed at any other point in the Universe. The idea that the
Universe should look the same if viewed from any other point in the Universe (apart from
local small-scale structure) is known as the cosmological principle. We can write this as a
formal definition:
Viewed at a sufficiently large scale, the properties of the Universe are the same for all
observers.
The theory that the Universe is expanding, and that it originated in a Big Bang, does
not imply that there has to be anything for it to expand into. As we discussed above, the
expansion can be viewed as an expansion of space-time: the scale factor of the Universe is
increasing with time. Similarly, the theory does not imply that there was a time before the
Big Bang.
$$t = \frac{d}{v} = \frac{d}{H_0 d} = \frac{1}{H_0}$$
This time is known as the Hubble period. With Hubble’s constant being in the range 60
to 80 km s⁻¹ Mpc⁻¹, we can calculate a range of possible ages for the Universe. We need to
convert H0 to s⁻¹ first, and then we find that the Hubble period is between 12 and 16 billion
years. This is a very approximate figure for the age of the Universe.
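The unit conversion is the only awkward part of this calculation, so a sketch may help. It assumes 1 pc = 3.09 × 10¹⁶ m, as derived earlier in the chapter, and a year of 3.156 × 10⁷ s; the function name is illustrative.

```python
MPC_IN_KM = 3.09e19         # 1 Mpc = 3.09e22 m = 3.09e19 km
SECONDS_PER_YEAR = 3.156e7  # length of a year in seconds (assumed value)

def hubble_time_years(h0_km_s_mpc):
    """Hubble time 1 / H0 in years, for H0 given in km s^-1 Mpc^-1."""
    h0_per_second = h0_km_s_mpc / MPC_IN_KM  # the km and Mpc cancel, leaving s^-1
    return 1.0 / h0_per_second / SECONDS_PER_YEAR

for h0 in (60.0, 80.0):
    print(h0, hubble_time_years(h0) / 1e9, "billion years")
```

The larger value of H0 gives the shorter time, so the range of 60 to 80 km s⁻¹ Mpc⁻¹ maps onto roughly 16 down to 12 billion years.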
question
35.10 Show that the range of Hubble’s constant, between 60 and 80 km s⁻¹ Mpc⁻¹, leads to a
Hubble period of between 12 and 16 billion years.
There are some reasons why we might not expect the rate of expansion to be constant,
though. For example, gravity, as an attractive force between all objects with mass, would
slow down the expansion. This suggests our estimate of the age of the Universe from the
present value of Hubble’s constant would be an overestimate. Hubble’s constant may not in
fact be a constant for all of time; it is possible that it has changed as the Universe has evolved.
Therefore, cosmologists often refer to it now as the Hubble parameter, and define it as the
fractional rate of change of the scale factor of the Universe (the rate of change of the scale factor divided by the scale factor itself).
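[Graphs: a expansion velocity against distance, with lines labelled Hpast, Hnow and Hfuture; b size of the Universe against time for an accelerating universe, an empty universe, low density, critical density and high density (closed universe). All lines above the critical density curve correspond to an open universe, and the time 1/H0 before the present is marked.]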
Figure S35.10 a The expansion velocity of the Universe vs. time. If we assume that the current
expansion rate is constant, we get an estimate for the age of the Universe from Hubble’s law. This
estimate is an overestimate if the rate of expansion has been decreasing. b Possibilities for the
expansion rate of the Universe. If the density is higher than the critical density, then the final fate
of the Universe is to collapse in on itself. If the density is lower than the so-called ‘critical density’,
then the Universe will continue expanding forever. An empty Universe would continue expanding
at the current rate forever. Extrapolating this rate backwards in time would give us an upper limit
on the age of the Universe: this age is given by 1/H0.
There are a number of possibilities for the rate of expansion of the Universe (see Figure S35.10b).
• If there is sufficient mass in the Universe, i.e. the density of the Universe is high enough,
then gravitation will eventually cause the Universe to collapse in on itself. The rate of
expansion decreases and then becomes negative – the Universe is described as ‘closed’,
and will end in a ‘Big Crunch.’
• There is a density of matter, known as the critical density, at which the Universe’s
expansion will slow to zero after an infinite amount of time.
• If there is insufficient matter in the Universe (less than the critical density), then the
gravitational attraction will never cause the rate of expansion to reach zero, and the
Universe will continue expanding forever – an ‘open’ Universe.
Note that we may not be able to ‘see’ all the matter in the Universe. We can estimate the
masses of galaxies and other very large objects in the Universe in two different ways.
• We can examine the amount of light and other electromagnetic radiation given off by
an object, and estimate the mass of the object based on what we know of the physical
processes that produce the radiation.
• We can measure the gravitational effects of our target object on other large objects.
When we compare the masses of galaxies and clusters of galaxies produced using these
methods, we find they are significantly different. Cosmologists have therefore proposed the
existence of dark matter, which does not produce or interact with electromagnetic radiation,
but which does create gravitational effects. Estimates based on all the observations we have
made so far suggest that there is not sufficient mass (either normal matter or ‘dark matter’) to
cause the Universe to be closed.
Perhaps surprisingly, recent observations suggest that the rate of expansion of the
Universe is in fact increasing. For this to be the case, there must be something driving the
expansion. After all, we know that gravitation is an attractive force, so gravitation would
tend to slow the expansion down, not increase it. Cosmologists have therefore proposed the
existence of dark energy – a type of energy that again does not produce or interact with
electromagnetic radiation. It is thought to be present everywhere in space, and is estimated
to provide the majority (70%) of all mass–energy in the Universe. Note that dark energy and
dark matter are not the same thing.
The development of cosmological ideas provides many examples of the scientific approach
in action. For example, when Einstein was developing his general theory of relativity in
the years leading up to 1915, the equations he used predicted an expanding Universe. He
felt that this must be an error in his formula, so in order to make the Universe static (not
expanding), he added a term which he called the ‘cosmological constant’. Many years later,
when observations confirmed that the Universe was expanding, Einstein referred to his
suggestion of a cosmological constant as his ‘greatest blunder.’ However, a modified form of
the cosmological constant may now be needed to model the accelerating expansion of the
Universe. We still have many observations to make and theories to develop in order to find
all the answers.
astrophysicists at Princeton University (Dicke, Peebles and Wilkinson), in the United States,
were preparing to search for microwave radiation from the Big Bang. News of their work
reached Penzias via a friend, and the two radio astronomers realised the significance of
their discovery. They published a joint paper with the Princeton astrophysicists. Penzias and
Wilson won the 1978 Nobel Prize in Physics for their discovery.
More recently, NASA (the United States’ space agency) has launched two missions to
study the CMB. The first was the Cosmic Background Explorer (COBE), the results of which
were published in 1992. They mapped variations in the CMB, which are related to the
gravitational fields present in the early Universe. These variations are thought to be evidence
for the gravitational forces that eventually drew together the galaxies and clusters of galaxies
that we observe today. The second experiment was the Wilkinson Microwave Anisotropy
Probe (WMAP). This had greater resolution, and surveyed the entire sky. Figure S35.12
shows the results of the WMAP mission.
A third mission, Planck, led by the European Space Agency with participation from
NASA, was launched in 2009 and has made even more accurate maps of the CMB. It has
mapped the polarisation of the CMB, and results from this suggest that the first stars formed
much later than was previously thought.
Figure S35.11 The horn antenna with which Penzias and Wilson first detected the cosmic
microwave background radiation.
Figure S35.12 WMAP’s map of the temperature of the cosmic microwave background radiation.
Hot spots show as red, and cold spots as dark blue. The variation in the temperature of the CMB is
only over a range of a few microdegrees ($10^{-6}$ K); the equipment used to map these variations has
to be very sensitive and placed in space.
Further evidence for the Big Bang theory comes from the composition of very distant
galaxies and old stars. The amount of each element in these astronomical objects matches
the predictions of the Big Bang theory. However, we can only make this
comparison by looking at very old objects, which tend to be very far away. Stars that formed
more recently have a different composition, because they contain elements that were made by
nuclear fusion in previous generations of stars.
Unanswered questions
Cosmology is still an active field of research, and there are many unanswered questions. One
is the question of why there is an imbalance between matter and antimatter in the Universe.
The Big Bang theory predicts that there should have been equal amounts of matter and
antimatter produced, but our Universe is dominated by matter. There are various proposed
theories to explain this imbalance, but as yet no scientific consensus.
Another interesting question is related to the CMB. The variations in temperature across
the sky are remarkably small: although the variations exist and have been mapped, it is
surprising that regions of the Universe that have apparently never been in contact with each
other have come into thermal equilibrium at very nearly the same temperature. This,
and some other cosmological problems, can be solved by postulating a very short period after
the Big Bang in which there was a huge burst of expansion, called ‘inflation’. For this inflation
to happen, there would have had to have been an unknown form of energy present, which
has so far not been detected. This energy would have been unevenly distributed in space due
to quantum fluctuations when the Universe was very small, and it is thought that this should
give rise to the patterns that are seen in the COBE and WMAP images of the Universe.
We cannot see back beyond the time of recombination by observing photons, as the
Universe was opaque before then. One possibility for investigating the Universe at the time
when inflation was taking place is by detecting gravitational waves. These were predicted as
part of Einstein’s general theory of relativity, but they have proven to be extremely difficult
to observe. On 11th February 2016, physicists at the Laser Interferometer Gravitational-Wave
Observatory (LIGO) in the United States announced the first observation of a gravitational
wave, which was produced by a collision between two black holes.
Summary
■ Luminosity is defined as the total power emitted by a star. Luminous flux is the
power per unit area of surface perpendicular to the radiation at a distance d from the
star, and is given by the equation $F = \dfrac{L}{4\pi d^2}$.
■ The peak of the spectrum of a star allows us to estimate its surface temperature,
using Wien's displacement law.
■ Stefan's law $L = 4\pi r^2 \sigma T^4$ allows us to calculate the luminosity of a spherical body.
By using Stefan's law and Wien's displacement law together, we can estimate the
radius of a star based on the peak wavelength in its spectrum and its luminosity.
■ A source of electromagnetic radiation moving relative to an observer undergoes
a shift in wavelength (and therefore frequency) given by the equation
$z = \dfrac{\delta\lambda}{\lambda} = \dfrac{\delta f}{f} = \dfrac{v}{c}$.
■ The vast majority of galaxies in our Universe are observed to be 'redshifted', which
implies that they are moving away from us. This leads to the idea that the Universe
is expanding. The fact that it is expanding leads to the idea that it originated in a
singularity known as the Big Bang. An expanding Universe does not imply that there
is pre-existing empty space for the Universe to expand into: space itself was created
in the Big Bang.
■ There is other evidence to support the Big Bang theory, such as the detection of the
cosmic microwave background radiation.
■ The equation $v = H_0 d$ can be used to relate the speed of recession of distant objects
to their distance from us.
■ The constant $H_0$ in Hubble's equation gives rise to the Hubble time $\dfrac{1}{H_0}$, which gives
us a first estimate for the age of the Universe.
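The relations in this summary can be tried out numerically. The following is a minimal Python sketch, not part of the original text. The function names are illustrative, the values of the Stefan-Boltzmann constant, Wien's constant and the speed of light are standard rounded values that the summary itself does not quote, and the star in the usage example is hypothetical. The redshift relation is used in its low-speed form, so the recession speed is only a reasonable estimate when v is much smaller than c.

```python
import math

# Standard rounded constants (assumed here; not quoted in the summary above)
SIGMA = 5.67e-8             # Stefan-Boltzmann constant, W m^-2 K^-4
WIEN_B = 2.898e-3           # Wien's displacement constant, m K
C = 3.00e8                  # speed of light, m s^-1
SECONDS_PER_YEAR = 3.156e7  # approximate length of one year in seconds


def luminosity(radius_m, temperature_k):
    """Stefan's law for a spherical body: L = 4 pi r^2 sigma T^4."""
    return 4 * math.pi * radius_m**2 * SIGMA * temperature_k**4


def flux(luminosity_w, distance_m):
    """Luminous flux at distance d: F = L / (4 pi d^2)."""
    return luminosity_w / (4 * math.pi * distance_m**2)


def temperature_from_peak(peak_wavelength_m):
    """Wien's displacement law: T = b / lambda_max."""
    return WIEN_B / peak_wavelength_m


def radius_from_peak_and_luminosity(peak_wavelength_m, luminosity_w):
    """Combine Wien's law and Stefan's law to estimate a star's radius."""
    t = temperature_from_peak(peak_wavelength_m)
    return math.sqrt(luminosity_w / (4 * math.pi * SIGMA * t**4))


def recession_speed(lab_wavelength_m, observed_wavelength_m):
    """Low-speed redshift: z = (change in wavelength) / wavelength = v / c."""
    z = (observed_wavelength_m - lab_wavelength_m) / lab_wavelength_m
    return z * C


def hubble_distance(speed_m_per_s, h0_per_s):
    """Hubble's equation v = H0 d, rearranged for d."""
    return speed_m_per_s / h0_per_s


def hubble_time_years(h0_per_s):
    """Hubble time 1 / H0, converted from seconds to years."""
    return 1 / h0_per_s / SECONDS_PER_YEAR


if __name__ == "__main__":
    # Hypothetical star: radius 7.0e8 m, surface temperature 6000 K
    L = luminosity(7.0e8, 6000.0)
    print("Luminosity / W:", L)
    print("Flux at 1.5e11 m / W m^-2:", flux(L, 1.5e11))
    print("Hubble time / years:", hubble_time_years(2.3e-18))
```

With the value of the Hubble constant used in this chapter, 2.3 × 10⁻¹⁸ s⁻¹, the Hubble time comes out at roughly 1.4 × 10¹⁰ years.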
End-of-chapter questions
S35.1
a The Sun has a surface temperature of 5700 K. It has a radius of $6.96 \times 10^8$ m. Use Stefan’s law to find
the luminosity of the Sun. [2]
b Use your answer to (a) to estimate the luminous flux at the radius of the Earth’s orbit,
$1.496 \times 10^8$ km from the Sun. [2]
c Use Wien’s law to calculate the peak wavelength of the electromagnetic radiation from the Sun. [2]
S35.2
a Define, for a star, the following terms:
i Luminosity [1]
ii Luminous flux. [1]
b Explain carefully how astronomers can estimate the luminosity of a star from its colour. [3]
c What information can be gained from the absorption spectrum of a star? [2]
d An ultraviolet line from the hydrogen spectrum has a wavelength of 121.6 nm when measured
in the laboratory. The same line measured in the radiation from a distant galaxy has a wavelength
of 130.5 nm.
i Calculate the velocity of recession of the galaxy. [2]
ii Estimate the distance of the galaxy from the Earth. The Hubble constant is approximately
$2.3 \times 10^{-18}\ \mathrm{s}^{-1}$. [2]
S35.3
The binary system of stars 61 Cygni is observed to have a parallax of 0.286 arcseconds.
S35.4
The table below shows the distance to a number of galaxies and their speeds as used by Hubble in 1929.
S35.5
S35.6
Explain how redshift leads to the ideas of the expanding Universe and to the Big Bang theory.
S35.7
Explain the origin of the Cosmic Microwave Background Radiation, and how it provides significant
evidence for the Big Bang theory.