Documente Academic
Documente Profesional
Documente Cultură
Contents
1
Periodic Table
1 18
H 1 He 2
1 Hydrogen 2 13 14 15 16 17 Helium
1.008 4.003
Li 3 Be 4 B 5 C 6 N 7 O 8 F 9 Ne 10
2 Lithium Beryllium Boron Carbon Nitrogen Oxygen Fluorine Neon
6.941 9.012 10.811 12.011 14.007 15.999 18.998 20.180
Na 11 Mg 12 Al 13 Si 14 P 15 S 16 Cl 17 Ar 18
3 Sodium Magnesium 3 4 5 6 7 8 9 10 11 12 Aluminum Silicon Phosphorous Sulfur Chlorine Argon
22.990 24.305 26.982 28.086 30.974 32.066 35.453 39.948
K 19 Ca 20 Sc 21 Ti 22 V 23 Cr 24 Mn 25 Fe 26 Co 27 Ni 28 Cu 29 Zn 30 Ga 31 Ge 32 As 33 Se 34 Br 35 Kr 36
4 Potassium Calcium Scandium Titanium Vanadium Chromium Manganese Iron Cobalt Nickel Copper Zinc Gallium Germanium Arsenic Selenium Bromine Krypton
39.098 40.078 44.956 47.88 50.942 51.996 54.938 55.847 58.933 58.69 63.546 65.39 69.723 72.61 74.922 78.96 79.904 83.80
2
Rb 37 Sr 38 Y 39 Zr 40 Nb 41 Mo 42 Tc 43 Ru 44 Rh 45 Pd 46 Ag 47 Cd 48 In 49 Sn 50 Sb 51 Te 52 I 53 Xe 54
5 Rubidium Strontium Yttrium Zirconium Niobium Molybdenum Technetium Ruthenium Rhodium Palladium Silver Cadmium Indium Tin Antimony Tellurium Iodine Xenon
85.468 87.62 88.906 91.224 92.906 95.94 (98) 101.07 102.906 106.42 107.868 112.411 114.82 118.71 121.75 127.60 126.905 131.29
Cs 55 Ba 56 Lu 71 Hf 72 Ta 73 W 74 Re 75 Os 76 Ir 77 Pt 78 Au 79 Hg 80 Tl 81 Pb 82 Bi 83 Po 84 At 85 Rn 86
6 Caesium Barium Lutetium Halfnium Tantalum Tungsten Rhenium Osmium Iridium Platinum Gold Mercury Thallium Lead Bismuth Polonium Astatine Radon
132.905 137.327 174.967 178.49 180.948 183.85 186.207 190.2 192.22 195.08 196.967 200.59 204.383 207.2 208.980 (209) (210) (222)
Fr 87 Ra 88 Lr 103
7 Francium Radium Lawrencium
(223) 226.025 (260)
La 57 Ce 58 Pr 59 Nd 60 Pm 61 Sm 62 Eu 63 Gd 64 Tb 65 Dy 66 Ho 67 Er 68 Tm 69 Yb 70
Lanthanum Cerium Praseodymium Neodymium Promethium Samarium Europium Gadolinium Terbium Dysprosium Holmium Erbium Thulium Ytterbium
138.906 140.115 140.908 144.24 (145) 150.36 151.965 157.25 158.925 162.50 164.93 167.26 168.934 173.04
Figure 1: The simplest model of a solid is a periodic array of valance orbitals embedded
in a matrix of atomic cores.
3
1 The development of Bands and their filling
nl elemental solid
1s H,He
2s Li,Be
2p B→Ne
3s Na,Mg
3p Al→Ar
4s K,Ca
3d transition metals Sc→Zn
4p Ga→Kr
5s Rb,Sr
4d transition metals Y→Cd
5p In-Xe
6s Cs,Ba
4f Rare Earths (Lanthanides) Ce→Lu
5d Transition metals La→Hg
6p Tl→Rn
Table 1: Orbital filling scheme for the first few atomic orbitals
We will imagine that each atom (cf. one of the spheres in Fig. 1)
is composed of Hydrogenic orbitals which we describe by a screened
4
Coulomb potential
−Znl e2
V (r) = (1)
r
where Znl describes the effective charge seen by each electron (in prin-
ciple, it will then be a function of n and l). As electrons are added to
the solid, they then fill up the one-electron states 1s 2s 3s 3p 3d 4s 4p
4d 4f· · ·, where the correspondence between spdf and l is s → l = 0,
p → l = 1, etc. The elemental solids are then made up by filling these
orbitals systematically (as shown in Table 1) starting with the lowest
me4 Znl
2
energy states (where Enl = 2 2
2h̄ n
Note that for large n, the orbitals do not fill up simply as a func-
tion of n as we would expect from a simple Hydrogenic model with
mZ 2 e4
En = 2h̄2 n2
(with all electrons seeing the same nuclear charge Z). For
example, the 5s orbitals fill before the 4d! This is because the situation
is complicated by atomic screening. I.e. s-electrons can sample the core
and so are not very well screened whereas d and f states face the an-
gular momentum barrier which keeps them away from the atomic core
so that they feel a potential that is screened by the electrons of smaller
n and l. To put is another way, the effective Z5s is larger than Z4d . A
schematic atomic level structure, accounting for screening, is shown in
Fig. 2.
Now let’s consider the process of constructing a periodic solid. The
simplest model of a solid is a periodic array of valence orbitals embed-
ded in a matrix of atomic cores (Fig. 1). As a simple model of how
5
5d
V(r) z/r
4f atom
6s
6spdf
5p
4d
d
5s 5spdf
4p
3d
Ce Valence Shell
4s
3p
4spdf
+
3s
3spd 6s
2p
2s 4f 5d s
2sp
1s
1s
2
V(r) + C l (l+1)/r
Figure 2: Level crossings due to atomic screening. The potential felt by states with
large l are screened since they cannot access the nucleus. Thus, orbitals of different
principle quantum numbers can be close in energy. I.e., in elemental Ce, (4f 1 5d1 6s2 )
both the 5d and 4f orbitals may be considered to be in the valence shell, and form
metallic bands. However, the 5d orbitals are much larger and of higher symmetry than
the 4f ones. Thus, electrons tend to hybridize (move on or off ) with the 5d orbitals
more effectively. The Coulomb repulsion between electrons on the same 4f orbital will
be strong, so these electrons on these orbitals tend to form magnetic moments.
the eigenstates of the individual atoms are modified when brought to-
gether to form a solid, consider a pair of isolated orbitals. If they are far
apart, each orbital has a Hamiltonian H0 = ²n, where n is the orbital
occupancy and we have ignored the effects of electronic correlations
(which would contribute terms proportional to n↑ n↓ ). If we bring them
together so that they can exchange electrons, i.e. hybridize, then the
degeneracy of the two orbitals is lifted. Suppose the system can gain
6
Figure 3: Two isolated orbitals. If they are far apart, each has a Hamiltonian H 0 = ²n,
where n is the orbital occupancy.
Figure 4: Two orbitals close enough to gain energy by hybridization. The hybridization
lifts the degeneracy of the orbitals, creating bonding and antibonding states.
7
If we continue this process of bringing in more isolated orbitals into
the region where they can hybridize with the others, then a band of
states is formed, again with width proportional to t, and centered
around ² (cf. Fig. 5). This, of course, is an oversimplification. Real
+ + + + . . . = Band
Figure 5: If we bring many orbitals into proximity so that they may exchange electrons
(hybridize), then a band is formed centered around the location of the isolated orbital,
and with width proportional to the strength of the hybridization
8
point we fill the states 2 particles per, until we run out of electrons.
Electronic correlations complicate this simple picture of band forma-
tion since they tend to try to keep the orbitals from being multiply
occupied.
9
2.1 Covalent Bonding
Note that 1 > Sr > 0, and that Habr < 0 since ψa and ψb are bound
states [where Sr = ReS and Habr = ReHab ]b. With these definitions,
(c2a + c2b )Haa + 2ca cb Habr
E0 = (8)
c2a + c2b + 2ca cb Sr
10
∂E 0 ∂E 0 0
and we search for an extremum ∂ca = ∂cb = 0. From the first condition, ∂E
∂ca =
∂E 0
The second condition, ∂cb = 0, gives
11
e
ra
rb
R
Ze Ze
Ψanti-bonding
spin triplet
Ψbonding
spin singlet
Figure 6: Two oxygen ions, each with charge Ze, bind and electron with charge e.
The electron, which is bound in the oxygen valence orbitals will form a covalent bond
between the oxygens
12
whereas an S and a Pπ orbital cannot. I.e., covalent bonds are di-
rectional! An excellent example of this is diamond (C) in which the
- Pσ
S Pπ S + -
+
No bonding Bonding
Figure 7: A bond between an S and a P orbital can only happen if the P-orbital is
oriented with either its plus or minus lobe closer to the S-orbital. I.e., covalent bonds
are directional!
13
as shown in Fig. 8. The xy, xz, and yz orbitals all face towards a
neighboring site, and can thus form bonds with these sites; however, the
x2 − y 2 and 3z 2 − r2 orbitals do not point towards neighboring sites and
therefore do not participate in bonding. If the metal had a simple cubic
structure, the situation would be reversed and the x2 − y 2 and 3z 2 − r2
orbitals, but not the xy, xz, and yz orbitals, would participate much
in the bonding. Since energy is gained from bonding, this energetically
favors an fcc lattice in the transition metals (although this may not be
the dominant factor determining lattice structure).
z z z
d y d y d y
xz xy yz
x x x
Face Centered
z z Cubic Structure
d 2 2 y y
x-y d 2 2
3z - r
x x
Figure 8: In the fcc structure, the xy, xz, and yz orbitals all face towards a neighboring
site, and can thus form bonds with these sites; however, the x2 −y 2 and 3z 2 −r2 orbitals
do not point towards neighboring sites and therefore do not participate in bonding
One can also form covalent bonds from dissimilar atoms, but these
will also have some ionic character, since the bonding electron will no
14
longer be shared equally by the bonding atoms.
15
Na + 5.14 eV e-
Na+ +
e-
Cl + Cl- + 3.61 eV
r Cl = 1.81
r Na = 0.97
16
bonding states would form. As discussed before, Coulomb repulsion
makes this energetically unfavorable.
Body Centered Face Centered
Cubic
Cubic Cubic
Cl
c
Cl c
b b
a a
Na Cs
Figure 10: Possible salt lattice structures. In the simple cubic and bcc lattices all the
nearest neighbors are of a different species than the element on the site. These ionic
lattices are unfrustrated. However, it not possible to make an unfrustrated fcc lattice
using like amounts of each element.
17
lap of the electronic clouds, and contains two free parameters n and B
(Kittel, pp. 66–71, approximates this heuristic term with an exponen-
tial, B exp (−rij /ρ), also with two free parameters). These are usually
determined from fits to experiment. If a is the separation of nearest
neighbors, rij = apij , and their are N sites in the system, then the total
potential energy may be written as
e2 X ± B X 1
Φ = N Φi = N −
+ n . (15)
a i6=j pij a i6=j pnij
P ±
The quantity A = i6=j pij , is known as the Madelung constant. A
depends upon the type of lattice only (not its size). For example
AN aCl = 1.748, and ACsCl = 1.763. Due to the short range of the
potential 1/pn , the second term may be approximated by its nearest
neighbor sum.
18
3d x2 - y2
4S
Figure 11: In metallic Ni (fcc, 3d8 4s2 ), the 4s and 3d bands (orbitals) are almost
degenerate (cf. Fig. 2) and thus, both participate in the bonding. However, the 4s
orbitals are so large compared to the 3d orbitals that they encompass many other
lattice sites, forming non-directional bonds. In addition, they hybridize weakly with
the d-orbitals (the different symmetries of the orbitals causes their overlap to almost
cancel) which in turn hybridize weakly with each other. Thus, whereas the s orbitals
form a broad metallic band, the d orbitals form a narrow one.
19
2.4 Van der Waals Bonds
n
P1 P2
x1
+ - - +
R x2
Figure 12: Noble gasses and molecules with saturated orbitals can form short ranged
van der Waals bonds by inducing fluctuating electric dipole moments in each other.
This may be modeled by two harmonic oscillators binding a positive and negative
charge each.
20
pose a quantum fluctuation on 1 induces a dipole moment p1 . Then
dipole 1 exerts a field
3n(p1 · n) − p1
E1 = (16)
r3
which is felt by 2, which in turn induces a dipole moment p2 ∝ E1 ∝
1/r3 . This in turn, generates a dipole field E2 felt by 1 ∝ p2 /r3 ∝ 1/r6 .
Thus, the energy of the interaction is very small and short ranged.
21
If we approximate each pair of charges as point dipoles, then they
interact with a Hamiltonian
−3(p2 · n)(p1 · n) + p1 · p2 −2p1 p2 2e2 x1 x2
H1 ≈ = − = − . (19)
|x1 + R − x2 |3 R3 R3
The total Hamiltonian H0 + H1 can be diagonalized a normal mode
transformation that isolates the the symmetric mode (where both os-
cillators move together) from the antisymmetric one where they move
in opposition
√ √
xs = (x1 + x2 )/ 2 xa = (x1 − x2 )/ 2 (20)
√ √
Ps = (P1 + P2 )/ 2 Pa = (P1 − P2 )/ 2 (21)
22
which is typically a small fraction of an electron volt.
This is called the Van der Waals interaction, known also as the
London interaction or the induced dipole-dipole interaction. It is the
principal attractive interaction in crystals of inert gases and also in
crystals of many organic molecules. The interaction is a quantum effect,
in the sense that ∆U → 0 as h̄ → 0.
23
Chapter 2: Crystal Structures and Symmetry
Laue, Bravais
Contents
2 Point-Group Symmetry 6
2.1 Reduction of Quantum Complexity . . . . . . . . . . . . 6
2.2 Symmetry in Lattice Summations . . . . . . . . . . . . . 7
2.3 Group designations . . . . . . . . . . . . . . . . . . . . . 11
1
A theory of the physical properties of solids would be practically
impossible if the most stable elements were not regular crystal lattices.
The N-body problem is reduced to manageable proportions by the ex-
istence of translational symmetry. This means that there exist a set
of basis vectors (a,b,c) such that the atomic structure remains invari-
ant under translations through any vector which is the sum of integral
multiples of these vectors. As shown in Fig. 1 this means that one may
go from any location in the lattice to an identical location by following
path composed of integral multiples of these vectors.
a α
Figure 1: One may go from any location in the lattice to an identical location by
following path composed of integral multiples of the vectors a and b.
Thus, one may label the locations of the ”atoms”1 . which compose
the lattice with
rn = n 1 a + n 2 b + n 3 c (1)
1
we will see that the basic building blocks of periodic structures can be more complicated than
a single atom. For example in NaCl, the basic building block is composed of one Na and one Cl ion
which is repeated in a cubic pattern to make the NaCl structure
2
where n1 , n2 , n3 are integers. In this way we may construct any periodic
structure.
b b
a a
Hexangonal Centered
b
a
a
b
Figure 2: Two dimensional lattice types of higher symmetry. These have higher
symmetry since some are invariant under rotations of 2π/3, or 2π/6, or 2π/4, etc.
The centered lattice is special since it may also be considered as lattice composed of a
two-component basis, and a rectangular unit cell (shown with a dashed rectangle).
3
ant only under rotation of π and 2π. Four other lattices, shown in
Fig. 2 of higher symmetry are also possible, and called special lattice
types (square, rectangular, centered, hexagonal). A Bravais lattice is
the common name for a distinct lattice type. The primitive cell is the
parallel piped (in 3d) formed by the primitive lattice vectors which are
defined as the lattice vectors which produce the primitive cell with the
smallest volume (a · (c × c)). Notice that the primitive cell does not
always capture the symmetry as well as a larger cell, as is the case with
the centered lattice type. The centered lattice is special since it may
also be considered as lattice composed of a two-component basis on a
rectangular unit cell (shown with a dashed rectangle).
Cu O Cu O Cu O Cu O Cu O
O O O O O Basis
Cu O Cu O Cu O Cu O
Primitive
O O O O
cell and
lattice
Cu O Cu O Cu O Cu O vectors
b
O O O O
a
Cu O Cu O Cu O Cu O
O O O O
Figure 3: A square lattice with a complex basis composed of one Cu and two O atoms
(c.f. cuprate high-temperature superconductors).
4
To account for more complex structures like molecular solids, salts,
etc., one also allows each lattice point to have structure in the form of
a basis. A good example of this in two dimensions is the CuO2 planes
which characterize the cuprate high temperature superconductors (cf.
Fig. 3). Here the basis is composed of two oxygens and one copper
atom laid down on a simple square lattice with the Cu atom centered
on the lattice points.
c
c
b b
a a
Figure 4: Three-dimensional cubic lattices. The primitive lattice vectors (a,b,c) are
also indicated. Note that the primitive cells of the centered lattice is not the unit cell
commonly drawn.
5
are 3 cubic structures, shown in Fig. 4. Note that the primitive cells of
the centered lattice is not the unit cell commonly drawn. In addition,
there are triclinic, 2 monoclinic, 4 orthorhombic ... Bravais lattices, for
a total of 14 in three dimensions.
2 Point-Group Symmetry
If we recall that Rik =< i|R|k >= Rii δik since |k > are eigenstates of
6
R, then Eq. 3 becomes
7
(a)
(b)
O 4
8
1
4
Figure 5: Equivalent points and irreducible wedge for the 2-d square lattice. Due to
the symmetry of the 2-d square lattice, the eight patterned lattice sites all contribute
an identical amount to the Madelung sum calculated around the solid black site. Due
to this symmetry, the sum can be reduced to the irreducible wedge (b) if the result at
each point is multiplied by the degeneracy factors indicated.
four) have lower degeneracies. Thus, the sum may be restricted to the
irreducible wedge, so long as the corresponding terms in the sum are
multiplied by the appropriate degeneracy factors, shown in Fig. 5(b).
An appropriate algorithm to calculate both the degeneracy table, and
the sum 5 itself are:
8
deg(i,j)=4
else
deg(i,j)=8
end if
end do
end do
deg(0,0)=1
c
c Now calculate the Madelung sum
c
sum=0.0
do i=1,n
do j=0,i
p=sqrt(i**2+j**2)
sum=sum+((-1)**(i+j))*deg(i,j)/p
end do
end do
9
classifying the symmetries of the system that we want to apply. Group
theory allows us to learn the consequences of the symmetry in much
more complicated systems.
A group S is defined as a set {E, A, B, C · · ·} which is closed under
a binary operation ∗ (ie. A ∗ B ∈ S) and:
10
Operation Identification
(x, y) → (x, y) Identity
(x, y) → (x, −y) reflection about x axis
(x, y) → (−x, y) reflection about y axis
(x, y) → (−x, −y) inversion
(x, y) → (y, x) reflection about x = y
(x, y) → (y, −x) rotation by π/2 about z
(x, y) → (−y, −x) inversion-reflection
(x, y) → (−y, x) inversion-rotation
Table 1: Point group symmetry operations for the two-dimensional square lattice.
All of the group elements are self-inverting except for the sixth and eight, which are
inverses of each other.
11
Symbol Meaning
Cj (j=2,3,4, 6) j-fold rotation axis
Sj j-fold rotation-inversion axis
Dj j 2-fold rotation axes ⊥ to a j-fold principle rotation axis
T 4 three-and 3 two-fold rotation axes, as in a tetrahedron
O 4 three-and 3 four-fold rotation axes, as in a octahedron
Ci a center of inversion
Cs a mirror plane
Table 2: The Schönflies point group symbols. These give the classification according
to rotation axes and principle mirror planes. In addition, their are suffixes for mirror
planes (h: horizontal=perpendicular to the rotation axis, v: vertical=parallel to the
main rotation axis in the plane, d: diagonal=parallel to the main rotation axis in the
plane bisecting the two-fold rotation axes).
The mirror planes are parallel to the main rotation axis which is itself
a 4-fold axis and thus the group for the square lattice is C4v .
12
3 Simple Crystal Structures
3.1 FCC
b c y
a
x
a = (x+y)/2
b = (x+z)/2 3-fold axes
c = (y+z)/2
4-fold axes
13
nearest neighbors to each site in the plane)
Metals often form into an fcc structure. There are two reasons for
this. First, as discussed before, the s and p bonding is typically very
long-ranged and therefore rather non-directional. (In fact, when the
p-bonding is short ranged, the bcc structure is favored.) This naturally
leads to a close packed structure. Second, to whatever degree there is a
d-electron overlap in the transition metals, they prefer the fcc structure.
To see this, consider the d-orbitals shown in Fig. 7 centered on one of
the face centers with the face the xy plane. Each lobe of the dxy , dyz ,
and dxz orbitals points to a near neighbor. The xz,xy,yz triplet form
rather strong bonds. The dx2 −y2 and d3z 2 −r2 orbitals do not since they
point away from the nearest neighbors. Thus the triplet of states form
strong bonding and anti-bonding bands, while the doublet states do
not split. The system can gain energy by occupying the triplet bonding
states, thus many metals form fcc structures. According to Ashcroft
and Mermin, these include Ca, Sr, Rh, Ir, Ni, Pd, Pt, Cu, Ag, Au, Al,
and Pb.
The fcc structure also explains why metals are ductile since adjacent
planes can slide past one another. In addition each plane has a 6-fold
rotation axis perpendicular to it, but since 2 adjacent planes are shifted
relative to another, the rotation axes perpendicular to the planes are
3-fold, with one along the each main diagonal of the unit cell. There
are also 4-fold axes through each center of the cube with mirror planes
14
z z z
d y d y d y
xz xy yz
x x x
z z
dx2- y2 y d 2 2 y
3z - r
x x
Figure 7: The d-orbitals. In an fcc structure, the triplet of orbitals shown on top all
point towards nearest neighbors; whereas, the bottom doublet point away. Thus the
triplet can form bonding and antibonding states.
perpendicular to it. Thus the fcc point group is Oh . In fact, this same
argument also applies to the bcc and sc lattices, so Oh is the appropriate
group for all cubic Bravais lattices and is often called the cubic group.
3.2 HCP
15
3-fold
axis
mirror plane
three 2-fold
axes in plane
Figure 8: The symmetry of the HCP lattice. The principle rotation axis is perpen-
dicular to the two-dimensional hexagonal lattices which are stacked to form the hcp
structure. In addition, there is a mirror plane centered within one of these hexagonal
2d structures, which contains three 2-fold axes. Thus the point group is D 3h .
of the planes. This shifting of the planes clearly disrupts the d-orbital
bonding advantage gained in fcc, nevertheless many metals form this
structure including Be, Mg, Sc, Y, La, Ti, Zr, Hf, Tc, Re, Ru, Os, Co,
Zn, Cd, and Tl.
3.3 BCC
Just like the simple cubic and fcc lattices, the body-centered cubic
(BCC) lattice (cf. Fig. 4) has four 3-fold axes, 3 4-fold axes, with mirror
planes perpendicular to the 4-fold axes, and therefore belongs to the
Oh point group.
The body centered cubic structure only has a coordination number
of 8. Nevertheless some metals form into a BCC lattice (Ba V Nb, Ta
W M, in addition Cr and Fe have bcc phases.) Bonding of p-orbitals is
16
FCC HCP
A A A A A A A A A A
B B B B B B B B B B
C C C C C
A A A A A A A A A A
B B B B B B B B B B
C C C C C
A A A A A A A A A A
B B B B B B B B B B
C C C C C
These spaces unfilled
Figure 9: A comparison of the FCC (left) and HCP (right) close packed structures.
The HCP structure does not have a simple Bravais unit cell, but may be constructed
by alternately stacking two-dimensional hexagonal lattices. In contract, the FCC
structure may be constructed by sequentially stacking three shifted hexagonal two-
dimensional lattices.
ideal in a BCC lattice since the nnn lattice is simply composed of two
interpenetrating cubic lattices. This structure allows the next-nearest
neighbor p-orbitals to overlap more significantly than an fcc (or hcp)
structure would. This increases the effective coordination number by
including the next nearest neighbor shell in the bonding (cf. Fig. 10).
17
2
12 6 24
fcc
1s
R(r) 8 6 12
1 bcc
2s,2p
0 1 2 3
o
r(A)
Figure 10: Absolute square of the radial part of the electronic wavefunction. For the
bcc lattice, both the 8 nearest, and 6 next nearest neighbors lie in a region of relatively
high electronic density. This favors the formation of a bcc over fcc lattice for some
elemental metals (This figure was lifted from I&L).
18
Chapter 3: The Classical Theory of Crystal
Diffraction
Bragg
Contents
1
In the last two chapters, we learned that solids generally form pe-
riodic structures of different symmetries and bases. However, given a
solid material, how do we learn what its periodic structure is? Typ-
ically, this is done by diffraction, where we project a beam (of either
particles or radiation) at a solid with a wavelength λ ≈ the characteris-
tic length scale of the lattice ( a ≈ twice the atomic or molecular radii
of the constituents). Diffraction of waves and particles (with de Broglie
incident waves k0
or particles
K
λ ≈ | a 1 | or |a 2 | θ
k0 k
k
K = k - k0
a1
θ d
a2
d sin(θ)
Figure 1: Scattering of waves or particles with wavelength of roughly the same size
as the lattice repeat distance allows us to learn about the lattice structure. Coherent
addition of two particles or waves requires that 2d sin θ = λ (the Bragg condition),
and yields a scattering maximum on a distant screen.
2
multiple planes within the bulk of the solid.
However, not all particles with de Broglie wavelength λ ≈ a will
work for this application. For example, most charged particles cannot
probe the bulk properties of the crystal, since they lose energy to the
scatterer very quickly. Recall, from classical electrodynamics, the rate
at which particles of charge q, mass M , and velocity v lose energy to
the electrons of charge e and mass m in the crystal is given roughly by
dE 4πnq 2 e2 mγ 2 v 3 q 2
≈− ln ∼ 2. (1)
dx mv 2 qeω0 v
As an example, consider a non-relativistic electron scattering into a
√
solid with a ≈ 2A. If we require that a = λ = h/p = 12.3×10−8 cm/ E
when E is measured in electron volts, then E ≈ 50eV. If we solve
Eq. 1 for the distance δx where the initial energy of the incident is lost
requiring that δE = E, when n ≈ 1023 /cm3 we find that δx ≈ 100A.
Thus, if λ ≈ a, the electrons do not penetrate into the bulk of the
sample (typically the first few hundred A of most materials are oxidized,
or distorted by surface reconstruction of the dangling bonds at the
surface, etc. See Fig. 2) Thus, electrons do not make a very good probe
of the bulk properties of a crystal (instead in a process call low-energy
electron diffraction, LEED, they may be used to study the surface of
especially clean samples. I.e. to study things like surface reconstruction
of the dangling bonds, etc.). Thus although they are obviously easier
to accelerate (electrons or ion beams), they generally do not penetrate
into the bulk and so tell us more about the surface properties of solids
3
v
e-
Oxygen
Figure 2: An electron about to scatter from a typical material. However, at the surface
of the material, oxidation and surface reconstruction distort the lattice. If the electron
scatters from this region, we cannot learn about the structure of the bulk.
4
commutes with the Hamiltonian. Thus, this will be a classical
theory.
5
R’ - r
source r B
R
R’ observer
Q P or screen
target
|AO |2 ¯¯¯Z 3 ¯2
−i(k−ko )·r ¯¯
IB ∝ ¯ d rρ(r)e ¯ . (6)
R02
The scattering intensity is just the absolute square of the Fourier trans-
form of the density of scatterers. If we let K = k − k0 (cf. Fig. 1), then
we get
|AO |2 ¯¯¯Z 3 ¯2
−iK·r ¯¯ |AO |2
IB (K) ∝ 02 ¯ d rρ(r)e ¯ = 02
|ρ(K)|2 . (7)
R R
6
From the associated Fourier uncertainty principle ∆k∆x ≈ π, we can
see that the resolution of smaller structures requires larger values of K
(some combination of large scattering angles and short wavelength of
the incident light), consistent with the discussion at the beginning of
this chapter.
I(K) ρ(r)
Figure 4: Since the measured scattering intensity I(K) ∝ |ρ(K)|2 the complex phase
information is lost. Thus, a scattering experiment does not provide enough informa-
R d3 r
tion to invert the transform ρ(r) = (2π)3
ρ(K)e+iK·r .
7
mation to get a unique ρ(r) by inverting the Fourier transform. Instead
experimentalists analyze their data by proposing a feasible model struc-
ture (i.e. a ρ(r) corresponding to some guess of which of one the 14 the
Bravais lattice and the basis), Fourier transform this, and compare it to
the experimental data. The parameters of the model are then adjusted
to obtain a best fit.
8
so that
X X
ρ(x + ma) = ρn eiGn (x+ma) = ρn eiGn (x) eiGn ma
n n
X iGn (x)
= ρn e = ρ(x) , (12)
n
G · rn = 2πm m ∈ Z (14)
g1 · a1 = 2π g2 · a1 = g3 · a1 = 0 (17)
9
or some other permutation of 1 2 and 3, which would just amount to
a renaming of g1 , g2 , and g3 . The set (g1 , g2 , g3 ) are called the basis
set for the reciprocal lattice. They may be constructed from
a2 × a 3
g1 = 2π plus cyclic permutations . (18)
a1 · (a2 × a3 )
It is easy to see that this construction satisfies Eq. 17, and that there
is a one to one correspondence between the lattice and its reciprocal
lattice. So, the reciprocal lattice belongs to the same point group as
the real-space lattice1 .
10
where V is the lattice volume, so
|AO |2
IB (K) ∝ 02
|ρG |2 V 2 δG,K (22)
R
This is called the Laue condition for scattering. The fact that this is
proportional to V 2 rather than V just indicates that the diffractions
spots, in this approximation, are infinitely bright (for a sample in the
thermodynamic limit). Of course, this is because the spots are infinitely
narrow or fine. When real broadening is taken into account, IB (K) ∝ V
as expected.
Then as G = hg1 + kg2 + lg3 , we can label the spots with the three
integers (h, k, l ), or
Ihkl ∝ |ρhkl |2 . (23)
11
|k0 |, if the crystal is rotated by π in some axis perpendicular to the
three-fold axis. In fact, single-crystal experiments are usually done
either by mounting the crystal on a precession stage (essentially
like an automotive universal joint, with the drive shaft held fixed,
and the joint rotated over all angles), or by holding the crystal fixed
and moving the source and diffraction screen around the crystal.
Since these typically are not integers, they are multiplied by p, the
12
h’k’l’ plane
d h’k’l’ Ghkl
γ
a1
a3
a2
O a1
(211) plane
13
by two non-parallel vectors v1 and v2 within the plane. Let
which is the Bragg condition. Note that the Laue condition is more
Ghkl
k θ
k
d hkl
k θ k0
k0 hkl plane
Figure 6: Comparison of the Bragg λ = 2dhkl sin θ and Laue Ghkl = Khkl conditions
for scattering.
restrictive than the Bragg condition; it requires that both the magnitude
and the direction of G and K be the same. However, there is no
inconsistency here, since whenever we apply the Bragg condition, we
assume that the plane defined by k and k0 is perpendicular to the
scattering planes (cf. Fig. 6).
15
O
O
Cu Cu Cu
Body Centered
Cubic
O O O
O
O
Cu Cu Cu
basis
O O O
r
O
O
Cu Cu
rα O
O O O
rn
Figure 7: Examples of lattices with non-trivial bases. The CuO2 lattice (left) is char-
acteristic of the cuprate high-temperature superconductors. It has a basis composed of
one Cu and two O atoms imposed on a simple cubic lattice. The BCC lattice(right)
can be considered as a cubic lattice with a basis including an atom at the corner and
one at the center of the cube.
16
scattering from different elements of the basis. The structure factors
account for these interference effects. The information about this in-
terference, and the basis structure is contained in the atomic scattering
factor f and the structure factor S.
O
Cu
R
O
Figure 8: Rays scattered from different elements of the basis, and from different places
on the atom, interfere giving the scattered intensity additional structure described by
the form factor S and the atomic form factor f , respectively.
17
where the location of each cell is given by rn = n1 a1 + n2 a2 + a3 a3 .
Then since Ghkl · rn = 2πm, m ∈ Z,
N Z
ρhkl = d3 rρ(r)e−iGhkl ·r (36)
V cell
N
where N is the number of cells and V = 1/Vc , Vc the volume of a
cell. This integral may be further subdivided into an integral over the
atomic density of each atom in the unit cell. If α labeles the different
elements of the basis, each with density ρα (r0 )
1 X −iGhkl ·rα Z 3 0 0
ρhkl = e d r ρα (r0 )e−iGhkl ·r (37)
Vc α
The atomic scattering factor f and the structure factor may then be
defined as parts of this integral
Z
0
fα = d3 r0 ρα (r0 )e−iGhkl ·r (38)
so
1 X −iGhkl ·rα Shkl
e
ρhkl = fα = (39)
Vc α Vc
fα describes the interference of spherical waves emanating from different
points within the atom, and Shkl is called the structure factor. Note
that for lattices with an elemental basis S = f .
If we imagine the crystal to be made up of isolated atoms like that
shown on the right in Fig. 8 (which is perhaps accurate for an ionic
crystal) then, since the atomic charge density is spherically symmetric
about the atom
Z Z
3 0 0 −iGhkl ·r0 0
fα = d r ρα (r )e =− r02 dr0 d(cos θ)dφ ρα (r0 )e−Ghkl r cos θ
18
Z
02 0 sin Ghkl r0
0
= 4π r dr ρ(r ) . (40)
Ghkl r0
As an example, consider a spherical atom of charge Ze− , radius R, and
charge density
3Z
ρα (r0 ) = 3
θ(R − r0 ) (41)
4πR
then
Z R 0
3Z 02 0 sin Ghkl r
fα = r dr
R3 0 Ghkl r0
Z
= − (sin (Ghkl R) − (Ghkl R) cos (Ghkl R)) (42)
(Ghkl R)3
This has zeroes whenever tan (Ghkl R) = Ghkl R and a maximum when
Ghkl = 0, or in terms of the scattering angle 2k0 sin θ = Ghkl , when
θ = 0, π. In fact, we have that fα (θ = 0) = Z. This is true in general,
since
Z
fα (θ = 0) = fα (Ghkl = 0) = d3 r0 ρα (r0 ) = Z . (43)
19
(α = 0, 1) contain the same atom with the same scattering factor f ,
then
so that
X
Shkl = f e−iπα(h+k+l)
α=0,1
0 if h + k + l is odd
= f (1 + e−iπ(h+k+l) ) =
2f if h + k + l is even
(45)
This lattice gives rise to extinctions (lines, which appear in a cubic lat-
tice, but which are missing here)! If both atoms of the basis are identi-
cal (like bcc iron), then the bcc structure leads to extinctions; however,
consider CsCl. It does not have these extinctions since fCs+ 6= fCl− . In
fact, to a good approximation fCs+ ≈ fXe and fCl− ≈ fAr . However CsI
(also a bcc structure) comes pretty close to having complete extinctions
since both the Cs and I ions take on the Xe electronic shell. Thus, in
the scattering pattern of CsI, the odd h + k + l peaks are much smaller
than the even ones. Other centered lattices also lead to extinctions.
In fact, this leads us to a rather general conclusion. The shape
and dimensions of the unit cell determines the location of the Bragg
peaks; however, the content of the unit cell helps determine the relative
intensities of the peaks.
20
Unit Cell of BCC ordered FeCo
Fe
Co
However, since one has a closed nuclear shell and the other doesn’t,
their neutron scattering factors will be quite different
neutron neutron
fFe 6= fCo (47)
21
2.3.2 Powdered x-ray Diffraction
22
Ghkl
Ewald Sphere
k
k0
O
Figure 10: The Ewald Construction to determine if the conditions are correct for
obtaining a Bragg peak: Select a point in k-space as the origin. Draw the incident
wave vector k0 to the origin. From the base of k0 , spin k (remember, that for elastic
scattering |k| = |k0 |) in all possible directions to form a sphere. At each point where
this sphere intersects a lattice point in k-space, there will be a Bragg peak with G =
k − k0 . In the example above we find 8 Bragg peaks. If however, we change k0 by a
small amount, then we have none!
23
Chapter 4: Crystal Lattice Dynamics
Debye
Contents
1
4 Theory of Neutron Scattering 26
4.1 Classical Theory of Neutron Scattering . . . . . . . . . . 27
4.2 Quantum Theory of Neutron Scattering . . . . . . . . . . 30
4.2.1 The Debye-Waller Factor . . . . . . . . . . . . . . 35
4.2.2 Zero-phonon Elastic Scattering . . . . . . . . . . 36
4.2.3 One-Phonon Inelastic Scattering . . . . . . . . . . 37
2
A crystal lattice is special due to its long range order. As you ex-
plored in the homework, this yields a sharp diffraction pattern, espe-
cially in 3-d. However, lattice vibrations are important. Among other
things, they contribute to
3
ionic mass:
m
¿1 (1)
M
which allows us to derive an accurate theory.
Due to Newton’s third law, the forces on the ions and electrons are
comparable F ∼ e2 /a2 , where a is the lattice constant. If we imagine
that, at least for small excursions, the forces binding the electrons and
the ions to the lattice may be modeled as harmonic oscillators, then
F ∼ e2 /a2 ∼ mωelectron
2 2
a ∼ M ωion a (2)
4
n’th unit cell
atom α
rα sα
rn
rnα Rnα = r n + rα + sα
O
Figure 1: Nomenclature for the lattice vibration problem. sn,α is the displacement of
the atom α within the n-th unit cell from its equilibrium position, given by rn,α =
rn + rα , where as usual, rn = n1 a1 + n2 a2 + n3 a3 .
We will define the zero potential such that when all Rn are at their
equilibrium positions, φ = 0. Then
X Pn2
H= + φ(R1 , · · · RN ) (5)
n 2M
Typical lattice vibrations involve small atomic excursions of the or-
der 0.1A or smaller, thus we may expand about the equilibrium position
of the ions.
∂φ 1 ∂ 2φ
φ({rnαi + snαi }) = φ({rnαi }) + snαi + snαi smβj (6)
∂rnαi 2 ∂rnαi ∂rmβj
The first two terms in the sum are zero; the first by definition, and
the second is zero since it is the first derivative of a potential being
5
evaluated at the equilibrium position. We will define the matrix
∂ 2φ
Φmβj
nαi = (7)
∂rnαi ∂rmβj
From the different conservation laws (related to symmetries) of the
system one may derive some simple relationships for Φ. We will discuss
these in detail later. However, one must be introduced now, that is,
rα sα
Figure 2: Since the coefficients of potential between the atoms linked by the blue lines
(m−n)βj
(or the red lines) must be identical, Φmβj
nαi = Φ0αi .
(m−n)βj ∂ 2φ
Φmβj
nαi = Φ0αi = (8)
∂r0αi ∂r(n−m)βj
ie, it can only depend upon the distance. This is important for the next
subsection.
6
1.1 The Equation of Motion
From the derivative of the potential, we can calculate the force on each
site
∂φ({rmβj + smβj })
Fnαi = − (9)
∂snαi
so that the equation of motion is
−Φmβj
nαi smβj = Mα s̈nαi (10)
If there are N unit cells, each with r atoms, then this gives 3N r equa-
tions of motion. We will take advantage of the periodicity of the lattice
by using Fourier transforms to achieve a significant decoupling of these
equations. Imagine that the coordinate s of each site is decomposed
into its Fourier components. Since the equations are linear, we may just
consider one of these components to derive our equations of motion in
Fourier space
1
snαi = √ uαi (q)ei(q·rn −ωt) (11)
Mα
where the first two terms on the rhs serve as the polarization vector
for the oscillation, uαi (q) is independent of n due to the translational
invariance of the system. In a real system the real s would be composed
of a sum over all q and polarizations. With this substitution, the
equations of motion become
1
ω 2 uαi (q) = q Φmβj
nαi e
iq·(rm −rn )
uβj (q) sum repeated indices .
Mα Mβ
(12)
7
t=0
t = ∆t
t = 2 ∆t
Figure 3: uαi (q) is independent of n so that a lattice vibration can propagate and
respect the translational invariant of the lattice.
(m−n)βj
Recall that Φmβj
nαi = Φ0αi so that if we identify
βj 1 1
Dαi = q Φmβj
nαi e
iq·(rm −rn )
= q Φpβj
0αi e
iq·(rp )
(13)
Mα Mβ Mα Mβ
where rp = rm − rn , then the equation of motion becomes
βj
ω 2 uαi (q) = Dαi uβj (q) (14)
or
µ ¶
βj βj
Dαi − ω 2 δαi uβj (q) = 0 (15)
³ ´
which only has nontrivial (u 6= 0) solutions if det D(q) − ω 2 I = 0.
For each q there are 3r different solutions (branches) with eigenvalues
ω (n) (q) (or rather ω (n) (q) are the root of the eigenvalues). The de-
pendence of these eigenvalues ω (n) (q) on q is known as the dispersion
relation.
8
basis
M1 M2 α=1 α=2 f
Φn,1 n,2
n,1 = Φn,2 = 2f (20)
9
Φn,2 n,1 n−1,2
n,1 = Φn,2 = Φn,1
n+1,1
= Φn,2 = −f . (21)
10
2.0
optical mode
1.5
acoustic mode ω+
1.0
ω
ω-
0.5 ω ~ ck
0.0
-4 -2 0 2 4
q
Figure 5: Dispersion of the linear chain of oscillators shown in Fig. 4 when M 1 = 1,
M2 = 2 and f = 1. The upper branch ω+ is called the optical and the lower branch is
the acoustic mode.
As a result, the + mode is quite flat; whereas the − mode varies from
zero at the Brillouin zone center q = 0 to a flat value at the edge of the
zone. This behavior is plotted in Fig. 5.
It is also instructive to look at the eigenvectors, since they will tell
us how the atoms vibrate. Let’s look at the optical mode at q = 0,
q
ω+ (0) = 2f /µ. Here,
√
2f /M1 −2f / M1 M2
D=
√
. (27)
−2f / M1 M2 2f /M2
11
Eigenvectors are non-trivial solutions to (ω 2 I − D)u = 0, or
√
2f /µ − 2f /M1 2f / M1 M2 u1
0=
√
. (28)
2f / M1 M2 2f /µ − 2f /M2 u2
q
with the solution u1 = − M2 /M1 u2 . In terms of the actual displace-
ments Eqs.11 v
u
sn1 u M 2 u1
=t (29)
sn2 M 1 u2
or sn1 /sn2 = −M2 /M1 so that the two atoms in the basis are moving
out of phase with amplitudes of motion inversely proportional to their
masses. These modes are described as optical modes since these atoms,
12
number of atoms in the basis). We also expect (and implicitly assumed
above) that the allowed frequencies are real and positive. However,
from simple mathematical identities, the point-group and translational
symmetries of the lattice, and its time-reversal invariance, we can learn
more about the dispersion without solving any particular problem.
The basic symmetries that we will employ are
• Time-reversal invariance.
∗βj 1
Dαi = q Φpβj
0αi e
−iq·(rp )
(30)
Mα Mβ
1 −p,β,j iq·(rp )
= q Φ0,α,i e (31)
Mα Mβ
Then, due to the symmetric properties of the second derivative
∗βj 1 0,α,i 1
Dαi = q Φ−p,β,j eiq·(rp ) = q Φp,α,i
0,β,j e
iq·(rp ) αi
= Dβj (32)
Mα Mβ Mα Mβ
13
Thus, DT ∗ = D† = D so D is hermitian and its eigenvalues ω 2 are
real. This means that either ω are real or they are pure imaginary. We
will assume the former. The latter yields pure exponential growth of
our Fourier solution, indicating an instability of the lattice to a second-
order structural phase transition.
Time-reversal invariance allows us to show related results. We as-
sume a solution of the form
1
snαi = √ uαi (q)ei(q·rn −ωt) (33)
Mα
which is a plane wave. Suppose that the plane wave is moving to the
right so that q = x̂qx , then the plane of stationary phase travels to the
right with
ω
x=t. (34)
qx
Clearly then changing the sign of qx is equivalent to taking t → −t.
If the system is to display proper time-reversal invariance, so that the
plane wave retraces its path under time-reversal, it must have the same
frequency when time, and hence q, is reversed, so
αi
Note that this is fully equivalent to the statement that Dβj (q) =
∗αi
Dβj (−q) which is clear from the definition of D.
Now, return to the secular equation, Eq. 15.
µ ¶
βj 2 βj
Dαi (q) −ω (q)δαi ²βj (q) = 0 (36)
14
Lets call the (normalized) eigenvectors of this equation ². They are the
elements of a unitary matrix which diagonalizes D. As a result, they
have orthogonality and completeness relations
X ∗(n) (m)
²α,i (q)²α,i (q) = δm,n orthogonality (37)
α,i
X ∗(n) ∗(n)
²α,i (q)²β,j (q) = δα,β δi,j (38)
n
If we now take the complex conjugate of the secular equation
µ ¶
βj βj
Dαi (−q) − ω 2 (−q)δαi ²∗βj (q) = 0 (39)
15
βj βj
it is easy to see that Dαi (q + G) = Dαi (q) (since G · rp = 2πn, where
n is an integer). I.e., D is periodic in k-space, and so its eigenvalues
(and eigenvectors) must also be periodic.
s 11
Figure 7: If each ion is shifted by s1,1,i , then the lattice energy is unchanged.
16
ions, the energy of the system should remain unchanged.
1 X m−n,β,j
δE = Φ0,α,i sn,α,i sm,β,j = 0 (45)
2 m,n,α,β,i,j
1 X m−n,β,j
= Φ0,α,i s1,1,i s1,1,j (46)
2 mnα,β,i,j
1X X m−n,β,j
= s1,1,i s1,1,j Φ0,α,i (47)
2 i,j mnα,β
After the stress has been applied, the atoms in the bulk of the sample
Figure 8: After a stress is applied to a lattice, the movement of each ion (strain) is
not only in the direction of the applied stress. The response of the lattice to an applied
stress is described by the strain matrix.
17
the central (n = 0) atom this means that
X
0 = F0,α,i = − Φm,β,j γ,k
0,α,i mβ,j sm,γ,k (50)
m,β,j,γ,k
X
Φm,β,j
0,α,i sm,γ,k = 0 (51)
m
whereas, sm,γ,k is odd in m. Thus the sum over all m yields zero.
Now let’s apply these constraints to D for an elemental lattice where
r = 1, and we may suppress the basis indices α.
1 X
Dij (q) = Φp,j
0,i e
iq·(rp )
(52)
M p
Thus, the leading order (small q) eigenvalues ω 2 (q) ∼ q 2 . I.e. they are
acoustic modes. We have shown that all elemental lattices must have
acoustic modes for small q.
18
In fact, one may show that all harmonic lattices in which the energy
is invariant under a rigid translation of the entire lattice must have at
least one acoustic mode. We will not prove this, but rather make a
simple argument. The rigid translation of the lattice corresponds to a
q = 0 translational mode, since no energy is gained by this translation,
it must be that ωs (q = 0) = 0 for the branch s which contains this
mode. The acoustic mode may be obtained by perturbing (in q) around
this point. Physically this mode corresponds to all of the elements of
the basis moving together so as to emulate the motion in the elemental
basis.
19
2.1 Periodicity and the Quantization of States
or
q(n + N )a = qna + 2πm where m is an integer (57)
20
G vector
Bisector
Figure 9: The First Brillouin Zone. The end points of all vector pairs that satisfy
the Bragg condition k − k0 = Ghkl lie on the perpendicular bisector of Ghkl . The
smallest polyhedron centered at the origin of the reciprocal lattice and enclosed by
perpendicular bisectors of the G’s is called the first Brillouin zone.
21
2.3 Point Group Symmetry and Density of States
Two other tricks to reduce the complexity of these sums are worth
mentioning here although they are discussed in detail elswhere.
The first is the use of the point group symmetry of the system. It
is clear from their definition in chapter 3, the reciprocal lattice vectors
have the same point group symmetry as the lattice. As we discussed
in chapter 2, the knowledge of the group elements and corresponding
degeneracies may be used to reduce the sums over k to the irreducible
wedge within the the First Brillioun zone. For example, for a cubic
system, this wedge is only 1/23 3! or 1/48th of the the FBZ!
The second is to introduce a phonon density of states to reduce the
multidimensional sum over k to a one-dimensional integral over energy.
This will be discussed in chapter 5.
In this section we will derive the equations of motion for the lattice, de-
termine the canonically conjugate variables (the the sense of Lagrangian
mechanics), and use this information to both first and second quantize
the system.
Any lattice displacement may be expressed as a sum over the eigen-
22
vectors of the dynamical matrix D.
1 X
sn,α,i = √ Qs (q, t)²sα,i (q)eiq·rn (58)
Mα N q,s
Recall that ²sα,i (q) are distinguished from usα,i (q) only in that they are
normalized. Also since q + G is equivalent to q, we need sum only over
the first Brillouin zone. Finally we will assume that Qs (q, t) contains
the harmonic time dependence and since sn,α,i is real Q∗s (q) = Qs (−q).
We may rewrite both the kinetic and potential energy of the system
as sums over Q. For example, the kinetic energy of the lattice
1 X
T = Mα (ṡnα,i )2 (59)
2 n,α,i
1 X X
= Q̇r (q)²rα,i (q)eiq·rn Q̇s (k)²sα,i (k)eik·rn (60)
2N n,α,i q,k,r,s
Then as
1 X i(k+q)·r X r ∗s
e n
= δk,−q and ²α,i ²α,i = δrs (61)
N n α,i
the kinetic energy may be reduced to
1 X ¯¯ ¯2
¯
T = ¯Q̇r (q)¯ (62)
2 q,r
The potential energy may be rewritten in a similar fashion
1 X
V = Φm,β,j sn,α,i sm,β,j
2 n,m,α,β,i,j n,α,i
m−n,β,j
1 X Φ0,α,i
= q
2 n,m,α,β,i,j N Mα Mβ
X
Qs (q, t)²sα,i (q)eiq·rn Qr (k, t)²rβ,j (k)eik·rm (63)
q,k,s,r
23
Let rl = rm − rn
l,β,j
1 X Φ0,α,i
V = q
2 n,l,α,β,i,j N Mα Mβ
X
Qs (q, t)²sα,i (q)eiq·rn Qr (k, t)²rβ,j (k)eik·(rl +rn ) (64)
q,k,s,r
1 X r
V = ²α,i (k)²∗s 2 ∗
α,i (k)ωr (k)Qs (k)Qr (k) (67)
2 α,i,k,r,s
P r ∗s
Finally, since α,i ²α,i (k)²α,i (k) = δr,s
1X 2
V = ωs (k) |Qs (k)|2 (68)
2 k,s
Thus we may write the Lagrangian of the ionic system as
1 X µ¯¯ ¯2
¯ 2 2
¶
L=T −V = ¯Q̇s (k)¯ − ωs (k) |Qs (k)| , (69)
2 k,s
where the Qs (k) may be regarded as canonical coordinates, and
∂L
Pr∗ (k) = = Q̇∗s (k) (70)
∂Qr (k)
(no factor of 1/2 since Q∗s (k) = Qs (−k)) are the canonically conjugate
momenta.
24
The equations of motion are
d ∂L ∂L
− ∗
or Q̈s (k) + ωs2 (k)Qs (k) = 0 (71)
dt ∂ Q̇s (k)
∗ ∂Qs (k)
for each k, s. These are the equations of motion for 3rN independent
harmonic oscillators. Since going to the Q-coordinates accomplishes the
decoupling of these equations, the {Qs (k)} are referred to as normal
coordinates.
P.A.M. Dirac laid down the rules of quantization, from Classical Hamilton-
Jacobi classical mechanics to Hamiltonian-based quantum mechanics
following the path (Dirac p.84-89):
25
Thus, following Dirac, we may now quantize the normal coordinates
[Q∗r (k), Ps (q)] = ih̄δk,q δr,s where the other commutators vanish .
(75)
Furthermore, since we have a system of 3rN uncoupled harmonic os-
cillators we may immediately second quantize by introducing
1 q i
as (k) = √ ωs (k)Qs (k) + q Ps (k) (76)
2h̄ ωs (k)
1 q i
a†s (k) = √ ωs (k)Q∗s (k) − q Ps∗ (k) , (77)
2h̄ ωs (k)
or
v
u
u h̄ ³ ´
Qs (k) = t as (k) + a†s (−k) (78)
2ωs (k)
v
u
u h̄ωs (k) ³ ´
t
Ps (k) = −i as (k) − a†s (−k) (79)
2
Where
[as (k), a†r (q)] = δr,s δq,k [as (k), ar (q)] = [a†s (k), a†r (q)] = 0 (80)
26
and a†s (k) and as (k) create and destroy phonons respectively, in the
state (k, s)
q
a†s (k) |ns (k)i = ns (k) + 1 |ns (k) + 1i (83)
q
as (k) |ns (k)i = ns (k) |ns (k) − 1i (84)
If |0i is the normalized state with no phonons present, then the state
with {ns (k)} phonons in each state (k, s) is
1
Y 1 2 Y ³ † ´ns (k)
|{ns (k)}i = as (k) |0i (85)
k,s ns (k)! k,s
27
Source of thermal
n λ ≈ | a 1 | or |a 2 |
neutrons a1
a2
Figure 10: Neutron Scattering. The De Broglie wavelength of the neutrons must
be roughly the same size as the lattice constants in order to learn about the lattice
structure and its vibrational modes from the experiment. This dictates the use of
thermal neutrons.
Thus the neutron cannot ”see” the detailed structure of the nucleus,
and so we may approximate the neutron-ion interaction potential as a
contact interaction
X
V (r) = Vn δ(r − rn ) (89)
rn
28
theory of diffraction. Nevertheless it is useful to compare the classical
result to what we will develop for the quantum problems.
For the classical problem we will assume that the lattice is elemental
(r = 1) and start with a generalization of the formalism developed in
the last chapter
I ∝ |ρ(K, Ω)|2 (90)
where
1
rn (t) = rn + sn (t) and sn (t) = √ u(q)ei(q·rn −ω(q)t) (92)
M
describes the harmonic motion of the s-mode with wave-vector q.
XZ
ρ(K, Ω) ∝ dtei[K·(rn +sn (t))−Ωt] . (93)
n
K = k0 − k = G and Ω = ω0 − ω = 0 (95)
29
When multiplied by h̄ , these can be interpreted as conditions for
the conservation of (crystal) momentum and energy when the scat-
tering event involves the creation (destruction) of a lattice excitation
(phonon). These processes are called Stokes and antistokes processes,
respectively, and are illustrated in Fig. 11.
k = k 0- q , ω = ω 0 - ω q k = k 0+ q , ω = ω 0 + ω q
q ,ω q q ,ω q
n n
k 0, ω0 k 0, ω0
Stokes Process Anti-Stokes Process
(phonon creation) (phonon absorbtion)
Figure 11: Stokes and antistokes processes in inelastic neutron scattering involving
the creation or absorption of a lattice phonon.
30
4.2 Quantum Theory of Neutron Scattering
• Callaway, p. 36–.
Initial Final
ψ 0 = 1− e i( k 0•⋅ r - ω 0 t ) ψ f = 1− i( k f •⋅ r - ω f t )
V
√ V e
√
n
k 0 ⋅ω 0
n k ⋅ω
φ0 E0 φf Ef
Figure 12: The initial (left) and final (right) states of the neutron and lattice during
a scattering event. The initial system state is given by Ψ0 = φ0 ψ0 , with energy
²0 = E0 + h̄ω0 where ω0 = k02 /2M . The final system state is given by Ψf = φf ψf ,
with energy ²f = Ef + h̄ωf where ωf = kf2 /2M .
31
interaction to be local
X 1 X Z d3 q X iq·(r−rn )
V (r) = V (r − rn ) = V (q)eiq·(r−rn ) = V0 e (97)
rn N q,n V n
If we now substitute in the ion-neutron potential Eq. 98, then the inte-
gral over r will yield a delta function V δ(q + k − k0 ) which allows the
q integral to be evaluated
¯ ¯
(2πh̄)3 X
2
¯X D
¯
¯ ¯
¯ −iK·rn ¯
E ¯2
¯
P =a 2
δ(E0 − Ef + h̄Ω) ¯¯ φ 0 ¯e ¯ φf ¯
¯
(101)
(M V ) f rn
32
Now, before proceeding to a calculation of the differential cross sec-
dσ
tion dΩdE we must be able to convert this probability (rate) for eigen-
states into a flux of neutrons of energy E and momentum p. A differ-
ential volume element of momentum space d3 p contains V d3 p/(2πh̄)3
neutron states. While this is a natural consequence of the uncertainty
principle, it is useful to show this in a more quantitative sense: Imagine
a cubic volume V = L3 with periodic boundary conditions so that for
any state Ψ in V ,
1 P iq·r
If we write Ψ(r) = N qe Ψ(q), then it must be that
with similar relations for the y and z components. So for each volume
³ ´
2π 3
element of q-space L there is one such state. In terms of states p =
h̄q, the volume of a state is (2πh̄/L)3 . Thus d3 p contains V d3 p/(2πh̄)3
states.
The incident neutron flux of states (velocity times density) is
¯ ¯
h̄k0 2 h̄k0 ¯¯ 1 ik0 ·r ¯¯2 h̄k0
j= |Ψ0 | = ¯√ e ¯ = (104)
M M ¯ V ¯ MV
Then since the number of neutrons is conserved
dσ h̄k0 dσ d3 p p2 dpdΩ
j dEdΩ = dEdΩ = P V = PV (105)
dEdΩ M V dEdΩ (2πh̄)3 (2πh̄)3
33
And for thermal (non-relativistic) neutrons E = p2 /2M , so dE =
pdp/M , and
h̄k0 dσ h̄kM dEdΩ
dEdΩ = P V (106)
M V dEdΩ (2πh̄)3
or
dσ k (M V )2
=P . (107)
dEdΩ k0 (2πh̄)3
Substituting in the previous result for P
¯ ¯
dσ k (M V )2 2 (2πh̄)3 X ¯X D
¯
¯ ¯
¯ −iK·rn ¯
E ¯2
¯
= 3
a 2
δ(E0 − Ef + h̄Ω) ¯¯ φ 0 ¯e ¯ φf ¯
¯
dEdΩ k0 (2πh̄) (M V ) f rn
(108)
or
dσ k N a2
= S(K, Ω) (109)
dEdΩ k0 h̄
where
¯ ¯
1 X ¯X D ¯ ¯ E ¯2
¯ ¯ −iK·rn ¯ ¯
S(K, Ω) = δ(E0 − Ef + h̄Ω) ¯¯ φ 0 ¯e ¯ φf ¯
¯
. (110)
N f rn
so that
1 XZ ∞ dt i((E0 −Ef )/h̄+Ω)t
S(K, Ω) = e
N f
−∞ 2π
X ¯¯D ¯ ¯
¯ iK·rn ¯
E¯ ¯D
¯¯
¯ ¯
¯ −iK·rm ¯
E¯
¯
¯ φ 0 ¯e ¯ φ f ¯ ¯ φ f ¯e ¯ φ0 ¯ . (112)
rn ,rm
then as
e−iHt/h̄ |φl i = e−iEl t/h̄ |φl i (113)
34
where H is the lattice Hamiltonian, we can write this as
1 XZ ∞ dt iΩt X D ¯¯ iHt/h̄ iK·rn −iHt/h̄ ¯¯ E
S(K, Ω) = e φ 0 ¯e e e ¯ φf
N f
−∞ 2π rn ,rm
D ¯ ¯ E
φf ¯¯e−iK·rm ¯¯ φ0 , (114)
Thus,
1 XZ ∞ dt iΩt X D ¯¯ iK·rn (t) ¯¯ E D ¯¯ −iK·rm ¯¯ E
S(K, Ω) = e φ 0 ¯e ¯ φf φ f ¯e ¯ φ0
N f
−∞ 2π rn ,rm
1 Z ∞ dt X D ¯ ¯ E
= eiΩt φ0 ¯¯eiK·rn (t) e−iK·rm ¯¯ φ0 . (116)
N −∞ 2π rn ,rm
35
4.2.1 The Debye-Waller Factor
This relation is in fact true to all orders, as long as A and B are linear
functions of a† and a . (c.f. Ashcroft and Mermin, p. 792, Callaway pp.
41-48). Thus
D E 2 2
eiK·sn (t) e−iK·sm = e− 2 h(K·sn (t)) i e− 2 h(K·sm ) i ehK·sn (t)K·sm i .
1 1
(122)
Since the Hamiltonian has no time dependence, and the lattice is in-
variant under translations rn
D E 2
eiK·sn (t) e−iK·sm = e−h(K·sn ) i ehK·sn−m (t)K·s0 i , (123)
36
where the first term is called the Debye-Waller factor e−2W .
2
e−2W = e−h(K·sn ) i . (124)
Thus letting l = n − m
Z dt i(K·rl +Ωt) hK·sl (t)K·s0 i
−2W X ∞
S(K, Ω) = e e e . (125)
l
−∞ 2π
Here the Debye-Waller factor contains much of the crucial quantum
physics. It is finite, even at T = 0 due to zero-point fluctuations, and
since hK · sn i2 will increase with temperature, the total strength of the
Bragg peaks will diminish with increasing T . However, as long as a
crystal has long-ranged order, it will remain finite.
37
dσ0 k N a2 −2W X
= e δ(Ω)N δK,G (129)
dEdΩ k0 h̄ G
However, now the scattering intensity is reduced by the Debye-
Waller factor e−2W , which accounts for zero-point motion and thermal
fluctuations.
= a(q)e−iω(q)t (132)
where we have used the fact that (a† a)n a = (a† a)n−1 a† aa = (a† a)n−1 (aa† −
1)a = (a† a)n−1 a(a† a−1) = a(a† a−1)n . Similarly a† (q, t) = a† (q)eiω(q)t .
Thus,
√ iq·r
1 X h̄e n s ³ ´
sn,α (t) = √ q ²α (q) as (q)e−iωs (q)t + a†s (−q)eiωs (q)t
Mα N q,s 2ωs (q)
(133)
38
and
v
u
1 Xu h̄ ³ ´
s0,α (0) = √ t ²rα (p) ar (p) + a†r (−p) (134)
Mα N p,r 2ωr (p)
−2W
Z ∞dt iΩt X h̄ |K · ²(K)|2
S1 (K, Ω) = e e δK+q,G (136)
−∞ 2π q,G,s 2ωs (q)M
h D E D Ei
e−iωs (q)t as (−K)a†s (−K) + eiωs (q)t a†s (−K)as (−K)
39
this can happen at any temperature, since (1 + ns (K)) 6= 0 at any T .
The second term is only finite when Ω + ωs (K) = ω0 − ωf + ωs (K) = 0;
ie., the final energy of the neutron is larger than the initial energy. The
additional energy comes from the absorption of a phonon. Thus phonon
absorption is only allowed at finite temperatures, and in fact, the factor
ns (K) = 0 at zero temperature. These terms correspond to the Stokes
and anti-Stokes processes, respectively, illustrated in Fig. 13.
k = k 0- q , ω = ω 0 - ω q k = k 0+ q , ω = ω 0 + ω q
1 + n (K) n (K)
s s
q ,ω q q ,ω q
n n
k 0, ω0 k 0, ω0
Stokes Process Anti-Stokes Process
(phonon creation) (phonon absorbtion)
Figure 13: Stokes and antistokes processes in inelastic neutron scattering involving
the creation or absorption of a lattice phonon. The antistokes process can only occur
at finite-T, when ns (K) 6= 0.
40
Chapter 5: Thermal Properties of Crystal Lattices
Debye
Contents
1 Formalism 2
1.1 The Virial Theorem . . . . . . . . . . . . . . . . . . . . . 3
1.2 The Phonon Density of States . . . . . . . . . . . . . . . 5
1
In the previous chapter, we have shown that the motion of
a harmonic crystal can be described by a set of decoupled har-
monic oscillators.
1X
H= |Ps(k)|2 + ωs2(k) |Qs(k)|2 (1)
2 k,s
At a given temperature T, the occupancy of a given mode is
1
hns(k)i = (2)
eβωs(k) − 1
In this chapter, we will apply this information to calculate the
thermodynamic properties of the ionic lattice, in addition to
addressing questions regarding its long-range order in the pres-
ence of lattice vibrations (i.e. do phonons destroy the order).
In order to evaluate the different formulas for these quantities,
we will first discuss two matters of formal convenience.
1 Formalism
2
1.1 The Virial Theorem
4
¿ À 1 X h̄ 1
s2 = ns(q) + (9)
NM q,s ωs (q) 2
This integral, which must be finite in order for the system to
have long-range order, is still difficult to perform. However, the
integral may be written as a function of ωs(q) only.
¿
2
À 1 X h̄ 1 1
s = + (10)
NM q,s ωs (q) eβωs (q) − 1 2
It would be convenient therefore, to introduce a density of
phonon states
1 X
Z(ω) = δ (ω − ωs(q)) (11)
N q,s
so that
¿ À h̄ Z 1 1
s2 = dωZ(ω) n(ω) + (12)
M ω 2
D E
In addition to the calculation of s2 , the density of states Z(ω)
is also useful in the calculation of E =< H >, the partition
function and the related thermodynamic properties
In order to calculate
1 X
Z(ω) = δ (ω − ωs(q)) (13)
N q,s
5
we must first better define the sum over q. As we discussed last
chapter for a 3-d cubic system, we will assume that we have a
periodic finite lattice of N basis points, or N 1/3 in each of the
principle lattice directions a1, a2, and a3. Then, we want the
Fourier representation to respect the periodic boundary condi-
tions (pbc), so
1/3 (a +a +a )
eiq·(r+N 1 2 3 ) = eiq·r (14)
G vector
Bisector
want unique values of qi, so we will choose those within the first
6
Brillouin zone, so that
1
G · q ≤ G2 (15)
2
The size of this region is the same as that of a unit cell of the
reciprocal lattice g1 · (g2 × g3). Since there are N states in this
region, the density of q states is
N N Vc V
= = (16)
g1 · (g2 × g3) (2π)3 (2π)3
where Vc is the volume of a Bravais lattice cell (a1 · a2 × a3),
and V is the lattice volume. Clearly as N → ∞ the density
increases until a continuum of states is formed (all that we need
here is that the spacing between q-states be much smaller than
any physically relevant value of q). The number of states in
a frequency interval dω is then given by the volume of q-space
between the surfaces defined by ω = ωs(q) and ω = ωs(q)+dω
multiplied by V /(2π)3
V Z ω+dω 3
Z(ω)dω = dq
(2π)3 ω
V XZ 3
= dω d qδ (ω − ωs(q)) (17)
(2π)3 s
7
qy
dq ⊥ Surface
dS ω ω = ωs (q)
qx
8
M
ω(q) flat
ω0
ω
-π/a π/a q ω0
linear
Figure 3: Linear harmonic chain. The phonon dispersion of this chain must in-
clude an acoustic mode, so ω(q) will be linear near q = 0, and it must be symmetric
about q = 0 and q = π/q due to the point-group symmetry. Thus, the density of
states (DOS) will be flat near ω = 0 corresponding to the acoustic mode (for which
∇q ω(q) =constant), and will be divergent near ω = ω0 corresponding to the peak of
the dispersion (where ∇q ω(q) = 0).
9
2 Models of Lattice Dispersion
0.5
0.0
-4 -2 0 2 4
q
Figure 4: Dispersion for the diatomic linear chain. In the Debye model, we replace
the acoustic mode by a purely linear mode with the same initial dispersion and ignore
any optical modes.
10
ωs(q) = cs|q|. Thus, for the thermodynamic properties of the
lattice, we care predominantly about the limit ω(q) → 0. This
physics is rather accurately described by the Debye model.
In the Debye model, we will assume that all modes are acous-
tic (elastic), so that ωs(q) = cs|q| for all s and q, then ∇q ωs(q) =
cs for all s and q, and
V X Z dSω
Z(ω) =
(2π)3 s ∇q ωs(q)
V X Z dSω
= (20)
(2π)3 s cs
The surface integral may be evaluated, and yields a constant
R
dSω = Ss for each branch. Typically cs is different for different
modes. However, we will assume that the system is isotropic, so
cs = c. If the dispersion is isotopic, then the surface of constant
ωs(q) is just a sphere, so the surface integral is trivial
2 for d = 1
Ss(ω = ω(q)) =
2πq = 2πω/c for d = 2 (21)
4πq 2 = 4πω 2/c2 for d = 3
11
then since the number of modes = d
2/c for d = 1
V
Z(ω) = 3 2πq = 4πω/c2 for d = 2 0 < ω < ωD
(2π)
2 2 3
4πq = 12πω /c
for d = 3
(22)
Note that since the total number of states is finite, we have
introduced a cutoff ωD on the frequency.
12
Figure 5: Helium adsorbed on a Vycor surface. Each He atom is attracted weakly to
the surface by a van der Waals attraction and sits in a local minimum of the surface
lattice potential.
13
3.1 Long-Range Order
14
where Z(ω) is the same as was defined above in Eq. 22.
2/c for d = 1
V
Z(ω) = 3 2πq = 4πω/c2 for d = 2 0 < ω < ωD
(2π)
2 2 3
4πq = 12πω /c
for d = 3
(26)
Thus in 3-d the DOS always cancels the 1/ω singularity but
in two dimensions the singularity is only cancelled when T = 0
(β = ∞), and in one dimension hs2i = ∞ for all T . This
Table 1: hs2 i for lattices of different dimension, assuming the presence of an acoustic
mode.
15
tems do not have long range order, since as you go along the
chain, the displacements of the atoms can accumulate to pro-
duce a very large rms displacement. In higher dimensional sys-
tems, the displacements in any direction are constrained by
the neighbors in orthogonal directions. “Real” two dimensional
time
3.2 Thermodynamics
17
where Zs(k) is the partition function for the mode s, k; i.e. the
modes are independent and decouple.
1
e−βh̄ωs(k)(ns(k)+ 2 )
X
Zs(k) =
n
X
= e−βh̄ωs(k)/2 e−βh̄ωs(k)(ns(k))
n
−βh̄ωs (k)/2
e
=
1 − e−βh̄ωs(k)
1
= (29)
2 sinh (βωs(k)/2)
The free energy is given by
X
F = −kB T ln (Z) = kB T ln (2 sinh (βh̄ωs(k)/2)) (30)
k,s
18
The specific heat is then given by
dE X
C = = kB (βh̄ωs(k))2 csch2 (βωs(k)/2) (34)
dT V k,s
20
3.3 Thermal Expansion, the Gruneisen Parameter
21
and let’s see if any of these terms will produce a temperature
dependent displacement. The last term is the usual harmonic
term, which we have already shown does not produce a T-
dependent < x >. Also the first term does not have the desired
effect! It does correspond to a temperature-independent shift
in the oscillator, as can be seen by completing the square
1 2 2 1 b 2 b2
mω x + bx = x+ − (46)
2 2 mω 2 2mω 2
Clearly < x >= − mωb 2 , independent of the temperature; that
is, assuming that b is temperature-independent. What we need
is a temperature dependent coefficient b!
The cubic term has the desired effect. As can be seen in
Fig 7, as the average energy (temperature) of a particle trapped
in a cubic potential increases, the mean position of the particle
shifts. However, it also destroys the solubility of the model. To
get around this, approximate the cubic term with a mean-field
decomposition.
¿ À
3 2
cx ≈ cηx x + c(1 − η)x2 hxi (47)
c=0.0
1
0
-2 -1 0 1 2
x
Figure 7: Plot of the potential V (x) = 12 mx2 + cx3 when m = ω = 1 and c = 0.0, 0.1.
The average position of a particle < x > in the anharmonic potential, c = 0.1, will
shift to the left as the energy (temperature) is increased; whereas, that in the harmonic
potential, c = 0, is fixed < x >= 0.
23
yields a shift in the frequency ω → ω 0
1
0 2(1 − η) < x > 2
ω = ω 1 + (49)
mω 2
which is a function of the equilibrium position. Thus a mean-
field description of the cubic term is consistent with the observed
physics.
In what follows, we will approximate the effect of the anhar-
monic cubic term as a shift in the equilibrium position of the
lattice (and hence the lattice potential) and a change of ω to
ω 0; however, we imagine that the energy levels remain of the
form
1
En = h̄ω 0(< x >)(n + ), (50)
2
and that < x > varies with temperature, consistent with the
mean-field approximation just described.
To proceed, imagine the cube of cubic system to be made up
of oscillators which are independent. Since the final result can
be formulated as a sum over these independent modes, consider
µ ¶
dF
only one. In equilibrium, where P = − dV T = 0, the free
24
energy of one of the modes is
1 µ
−βh̄ω
¶
F = Φ + h̄ω + kB T ln 1 − e (51)
2
and (following the notation of Ibach and Lüth), let the lattice
potential
1
Φ = Φ0 + f (a − a0)2 + · · · (52)
2
where f is the spring constant. Then
dF 1 ∂ω 1 h̄ω
0 = P = = f (a − a0) + h̄ω − −βh̄ω
,
da T ω ∂a 2 1−e
(53)
If we identify the last term in parenthesis as ²(ω, T ), and solve
for a, then
1 ∂ω
a = a0 −
²(ω, T ) (54)
ωf ∂a
Since we now know a(T ) for a single mode, we may calculate
the linear expansion coefficient for this mode
1 da 1 ∂ ln w ∂²(ω, T )
αL = =− 2 (55)
a0 dT a0f ∂ ln a ∂T
To generalize this to a solid let αL → αV (as discussed above)
dP
and a20f → V 2 dV = V κ (κ is the bulk modulus) and sum over
25
all modes the modes k, s
1 dV 1 X ∂ ln ωs(k) ∂² (ωs(k), T )
αV = = − . (56)
V dT κV k,s ∂ ln V ∂T
∂²
Clearly (due to the factor of ∂T ), αV will have a behavior sim-
ilar to that of the specific heat (αV ∼ T 3 for low T , and
αV =constant for high T ). In addition, for many lattices, the
Gruneisen number
∂ ln ωs(k)
γ= (57)
∂ ln V
shows a weak dependence upon s, k, and may be replaced by
its average, called the Gruneisen parameter
*
∂ ln ωs(k) +
hγi = , (58)
∂ ln V
typically on the order of two.
Before proceeding to the next section, I would like to reex-
amine the cubic term in a crystal where
X 1 X h̄3/2
s3l = r e i(p+q+k−G)·rl
(59)
l (2M N )3/2 l,k,q,p ω(q)ω(k)ω(p)
µ ¶µ ¶µ ¶
† † †
a(k) + a (−k) a(p) + a (−p) a(q) + a (−q) .
The sum over l yields a delta function δp+q+k,G (ie., crystal mo-
mentum conservation). Physically, these processes correspond
26
k (+ G) k (+ G)
q-k q-k
q -q
Figure 8: Three-phonon processes resulting from cubic terms in the inter-ion potential.
Six other three-phonon processes are possible.
27
material/T 273.2K 298.2K
C 26.2 23.2
Cu 4.03 4.01
Q = −κ∇T (60)
we get
1 X
Qx ≈ h̄ωs(q)hns(q)i1vsx(q) (65)
V q,s
since we presumably already know ωs(k), the calculation of Q
and hence κ reduces to the evaluation of the linear change in
< n >.
Within a region, < n > can change in two ways. Either
phonons can diffuse into the region, or they can decay through
an anharmonic (cubic) term into other modes. so
¯ ¯
¯ ¯
dhni ∂hni ¯
¯ ∂hni ¯
¯
= +
¯
¯
¯
¯ (66)
dt ∂t diffusion ¯ ∂t decay ¯
29
decay diffusion
diffusion
Figure 9: Change of phonon density within a trapazoidal region. hns (q)i can change
either by phonon decay or by phonon diffusion into and out of the region.
dhni
However dt = 0 since we are in a steady state. The decay
process is usually described by a relaxation time τ (or a mean-
free path l = vτ )
¯
∂hni ¯¯¯ < n > − < n >0 < n >1
¯ =− =≈ − (67)
∂t ¯¯decay τ τ
dhni
The diffusion part of dt is addressed pictorially in Fig. 10.
Formally,
¯
∂hni ¯¯¯ hn(x − vx∆t)i − hn(x)i ∂hn(x)i
¯ ≈ ≈ − vx (68)
∂t ¯¯diffusion ∆t ∂x
∂hn(x)i ∂T ∂hn(x)i0 + hn(x)i1 ∂T
≈ −vx ≈ −vx
∂T ∂x ∂T ∂x
Keeping only the lowest order term,
¯
∂hni ¯¯¯ ∂hn(x)i0 ∂T
¯ ≈ −vx (69)
∂t ¯¯diffusion ∂T ∂x
30
x
v region of
source region interest
v ∆t
Figure 10: Phonon diffusion. In time ∆t, all the phonons in the left, source region,
will travel into the region of interest on the right, while those on the right region will
all travel out in time ∆t. Thus, ∆n/∆t = (nlef t − nright )/∆t.
dhni
Then as dt = 0,
¯ ¯
¯ ¯
∂hni ¯
¯ ∂hni ¯
¯
¯
¯ =− ¯
¯ (70)
∂t diffusion
¯ ∂t decay ¯
or
1 ∂hn(x)i0 ∂T
hni = −vxτs(q) . (71)
∂T ∂x
Thus
1 X 2 ∂hn(x)i0 ∂T
Qx ≈ − h̄ωs(q)vsx(q)τs(q) , (72)
V q,s ∂T ∂x
and since Q = −κ∇T
1 X 2 ∂hn(x)i0
κ≈ h̄ωs(q)vsx(q)τ . (73)
V q,s ∂T
From this relationship we can learn several things. First since
2
κ ∼ vsx (q), phonons near the zone boundary or optical modes
31
with small vs(q) = ∇q ωs(q) contribute little to the thermal
conductivity. Also, stiff materials, with very fast speed of the
acoustic modes vsx(q) ≈ c will have a large κ. Second, since
κ ∼ τs(q), and ls(q) = vsx(q)τs(q), κ will be small for materi-
als with short mean-free paths. The mean-free path is effected
by defects, anharmonic Umklapp processes, etc. We will explore
this effect, especially its temperature dependence, in more de-
tail.
At low T , only low-energy, acoustic, modes can be excited
(those with h̄ωs(q) ∼ kB T ). These modes have
vs(q) = cs (74)
32
do not affect the mean-free path, so the thermal resistivity (the
inverse of the conductivity) is dominated by scattering from
impurities in the bulk and surface imperfections at low temper-
atures.
At high T momentum conservation in an anharmonic process
may involve a reciprocal lattice vector G if the q1 of an excited
mode is large enough and there exists a sufficiently small G so
that q1 > G/2 (c.f. Fig. 11). This is called an Umklapp process,
q1 = q2 + q3+ G
q3 G q3
Qout
q
1
q2 q2
q1 Q in
first Brillouin zone
Figure 11: Umklapp processes involve a reciprocal lattice vector G in lattice momen-
tum conservation. They are possible whenever q1 > G/2, for some G, and involve a
virtual reversal of the momentum and heat carried by the phonons (far right).
33
smaller for high temperatures where q1 can be larger than half
the smallest G.
So what about diamond? It is very hard and very stiff, so the
sound velocities cs are large, and so thermally excited modes for
which kB T ∼ h̄ω ∼ h̄cq involve small q1 for which Umklapp
processes are irrelevant. Second κ ∼ c2 which is large. Thus κ
for diamond is huge!
34
Chapter 6: The Fermi Liquid
L.D. Landau
Contents
1 introduction: The Electronic Fermi Liquid 3
1
4.2.1 Equilibrium Distribution of Quasiparticles at Finite T . . . . . 48
4.3 Effective Mass m∗ of Quasiparticles . . . . . . . . . . . . . . . . . . . 50
2
1 introduction: The Electronic Fermi Liquid
4
from these sources to be similar (Celectron ≈ Cphonon ≈ 3N rkB ).
This puzzle is resolved, at the simplest level: that of the non-
interacting Fermi gas.
a V(r+a) = V(r)
Figure 1: Schematic core potential (solid line) for a one-dimensional lattice with
lattice constant a.
For the moment, ignore the core potential, then the electronic
wave functions are plane waves ψ ∼ eik·r . Now consider the
core potential as a perturbation. The electrons will be strongly
effected by the periodicity of the potential when λ = 2π/k ∼ a
1
. However, when k is small so that λ À a (or when k is large,
so λ ¿ a) the structure of the potential may be neglected, or
we can assume V (r) = V0 anywhere within the material. The
1
Interestingly, when λ ∼ a, the Bragg condition 2d sin θ ≈ a ≈ λ may easily be satisfied, so the
electrons, which may be though of as DeBroglie waves, scatter off of the lattice. Consequently states
for which λ = 2π/k ∼ a are often forbidden. This is the source of gaps in the band structure, to be
discussed in the next chapter.
6
potential still acts to confine the electrons (and so maintain
charge neutrality), so V (r) = ∞ anywhere outside the material.
Figure 2: Infinite square-well potential. V (r) = V0 within the well, and V (r) = ∞
outside to confine the electrons and maintain charge neutrality.
7
The normalize plane wave solution to this model is
1/2
23
Y
ψ(r) = sin kixi where i = x, y, or z (5)
i=1 L
ky
3
(π/L)
kx
π/L
Figure 3: Allowed k-states for an electron confined by a infinite-square potential. Each
state has a volume of (π/L)3 in k-space.
8
and as a result of these restrictions, states in k-space are con-
fined to the first quadrant (c.f. Fig. 3). Each state has a volume
(π/L)3 of k-space. Thus as L → ∞, the number of states with
energies E(k) < E < E(k) + dE is
(4πk 2 dk)/8
0
dZ = . (7)
(π/L)3
r
h̄2 k 2
Then, since E = 2m ,
2
so k dk = m
h̄2
2mE/h̄2dE
0 3 1 2m 3/2 1/2
dZ = dZ /L = 2 E dE . (8)
4π h̄2
or, the density of state per unit volume is
dZ 1 2m 3/2 1/2
D(E) = = E . (9)
dE 4π 2 h̄2
Up until now, we have ignored the properties of electrons. How-
ever, for the DOS, it is useful to recall that the electrons are
spin-1/2 thus 2S + 1 = 2 electrons can fill each orbital or k-
state, one of spin up the other spin down. If we account for this
spin degeneracy in D, then
1 2m 3/2 1/2
D(E) = 2 E . (10)
2π h̄2
9
2.2 The Fermi Gas
10
k-states until we reach some Fermi radius kF , corresponding to
some Fermi Energy EF
kz
ky 2 2
kf h kf
= Ef
2m
occupied
states
kx
kf D(E)
Figure 4: Due to the Pauli principle, all k-states up to kF , and all states with energies
up to Ef are filled at zero temperature.
h̄2kF2
EF = , (13)
2m
thus,
f (E, T = 0) = θ(EF − E) (14)
and
Z ∞ Z E
F
n = 0
D(E)f (E, T )dE = 0
D(E)DE
2m 3/2 1 Z EF 1/2
= E DE
h̄2 2π 2 0
3/2
2m 1 2 2/3
= 2 E , (15)
h̄ 2π 2 3 F
11
or
h̄2 µ 2 ¶3/2
EF = 3π n = k B TF (16)
2m
which also defines the Fermi temperature TF . Thus for metals,
in which n ≈ 1023/cm3, EF ≈ 10−11 erg ≈ 10eV ≈ kB 105K.
Notice that due to the Pauli principle, the average energy of
the electrons will be finite, even at T = 0!
Z E
F 3
E= 0
D(E)EdE = nEF . (17)
5
However, it is the electrons near EF in energy which may be
excited and are therefore important. These have a DeBroglie
wavelength of roughly
◦
12.3 A ◦
λe = 1/2
≈4A (18)
(E(eV))
thus our original approximation of a square well potential, ig-
noring the lattice structure, is questionable for electrons near
the Fermi surface, and should be regarded as yielding only qual-
itative results.
12
2.2.2 T 6= 0, Fermi Statistics
1 2
δ n1= -1 δ n1= +1
Figure 5: Exchange of electrons in a solid composed of two orbitals.
13
this system, in equilibrium,
X ∂F
0 = δF = δni (19)
i ∂ni
P
electrons are conserved so i δni = 0. Thus, for our two orbital
system
∂F ∂F
δn1 + δn2 = 0 and δn1 + δn2 = 0 (20)
∂n1 ∂n2
or
∂F ∂F
= (21)
∂n1 ∂n2
A similar relation holds for an arbitrary number of particles.
Apparently this quantity, the increased free energy needed to
add a particle to the system, is a constant
∂F
=µ (22)
∂ni
for all i. µ is called the chemical potential.
Now consider an ensemble of orbitals. We will treat the ther-
modynamics of this system within the canonical ensemble (i.e.
the system is in contact with a thermal bath, and the particle
number is conserved) for which F = E − T S is the appropriate
potential. The system energy E and Entropy S may be written
14
as functions of the orbital energies Ei and occupancies ni and
the degeneracy gi of the state of energy Ei. For example,
E4 g=4 n=2
4 4
E3 g=4 n=4
3 3
E2 g=2 n=1
2 2
E1 g=2 n=2
1 1
X
E= ni Ei . (23)
i
S = kB ln P . (24)
1.0
+1)
β(ω−µ)
0.5
1/(e
0.0
0.0 0.5 1.0 1.5 2.0
ω
³ ´
Figure 7: Plot of the Fermi function 1/ e−β(ω−µ) + 1 when β = 1/kB T = 20 and
µ = 1. Not that at energies ω ≈ µ the Fermi function displays a smooth step of width
≈ kB T = 0.05. This allows thermal excitations of particles near the Fermi surface.
17
Specific Heat The form of f (E, T ) also clarifies why the elec-
tronic specific heat of metals is so small compared to the clas-
sical result Cclassical = 23 nkB T . The reason is simple: only the
electrons with energies within about kB T of the Fermi surface
kB T
may be excited (about EF of the electron density) each with
excitation energy of about kB T . Therefore,
kB T T
Uexcitation ≈ kB T n = nkB T (33)
EF TF
so
T
C ≈ nkB (34)
TF
Then as T ¿ TF (TF is typically about 105K in most metals2 )
C ≈ nkB TT ¿ Cclassical ≈ nkB . Thus at temperatures where
F
18
Specific Heat Calculation Of course, since we know the free energy
of the non-interacting Fermi gas, we can calculate the form of
the specific heat. Here we will follow Ibach and Lüth and Kittel;
however, since the chemical potential does depend upon the
temperature, I would like to make the approximations we make
a bit more explicit.
Upon heating from T = 0 to finite T , the Fermi gas will gain
energy
Z ∞ Z E
F
U (T ) = 0
dE ED(E)f (E, T ) − 0
dE ED(E) (35)
so
dU Z ∞ df (E, T )
CV = = 0 dE ED(E) . (36)
dT dT
Then since at constant volume the electronic density is constant,
dn R∞
so dT = 0, and n = 0 dED(E)f (E, T ),
dn Z ∞ df (E, T )
0 = EF = 0 dE EF D(E) (37)
dT dT
so we may write
Z ∞ df
CV = 0
dE (E − EF ) D(E) . (38)
dT
In f , the temperature T enters through both β = 1/kB T and
19
µ
df ∂f ∂β ∂f ∂µ
= +
dT ∂β ∂T ∂µ ∂T
βeβ(E−µ) E − µ ∂µ
= ³ ´2 − (39)
e β(E−µ) +1 T ∂T
∂µ
However, ∂T depends upon the details of the density of states
near the Fermi surface, which can differ greatly from material
∂µ
to material. Furthermore, ∂T < 1 especially in common metals
E−µ
at temperatures T ¿ TF , and the first term T is of order
∂µ
one (c.f. Fig. 7). Thus, for now we will neglect ∂T relative to
E−µ
T (you will explore the validity of this approximation in your
homework), and, consistent with this approximation, replace µ
by EF , so
df βeβ(E−EF ) E − EF
≈³ ´2 , (40)
dT e β(E−E F ) +1 T
and
1 Z∞ (E − EF )2 D(E)βeβ(E−EF )
CV ≈ dE ³ ´2 let x = β(E − EF )
kB T 2 0 e β(E−E F ) +1
x
Z ∞ x 2 e
≈ kB T −βE dx D + EF x x (41)
F β (e + 1)2
x
As shown in Fig. 8, the function x2 (exe+1)2 is only large in the
region −10 < x < 10. In this region, and for temperatures
20
0.5
0.4
2
x e /(e +1)
0.3
x
0.2
2 x
0.1
0.0
-10 -5 0 5 10
x
x
Figure 8: Plot of x2 (exe+1)2 vs. x. Note that this function is only finite for roughly
−10 < x < 10. Thus, at temperatures T ¿ TF ∼ 105 K, we can approximate
³ ´
x
D β
+ EF ≈ D (EF ) in Eq. 42.
µ ¶
x
T ¿ TF , D β + EF ≈ D (EF ), since the density of states
usually does not have features which are sharp on the energy
scale of 10kB T . Thus
Z ∞
2 ex
CV ≈ kB T D (EF ) dx x x
−βEF (e + 1)2
π2 2
≈ k T D(EF ) . (42)
3 B
Note that no assumption about the form of D(E) was made
other than the assumption that it is smooth within kB T of the
Fermi surface. Thus, experimental measurements of the specific
heat at constant volume of the electrons, gives us information
21
about the density of electronic states at the Fermi surface.
Now let’s reconsider the DOS for the 3-D box potential.
1/2
1 2m 3/2 1/2 E
D(E) = 2 E = D(EF ) (43)
2π h̄ EF
R EF
For which n = 0 D(E)dE = D(EF ) 23 EF , so
π2 T 3
CV = nkB ¿ nkB (44)
2 TF 2
where the last term on the right is the classical result. For
room temperatures T ∼ 300K, which is also of the same order
of magnitude as the Debye temperatures θD ,
3
CV phonon ∼ nkB À CV electron (45)
2
So, the only way to measure the electronic specific heat in most
materials is to go to very low temperatures T ¿ θD , for which
CV phonon ∼ T 3. Here the total specific heat
CV ≈ γT + βT 3 (46)
Cu O Cu O Cu O Cu O Cu O Cu O
O O O O O O
Cu O Cu O Cu O Cu O Cu O Cu O
q=2e-
O O O O O O O
Cu O Cu O Cu O Cu O Cu O Cu O
O O O O O O
Figure 9: A charged oxygen defect is introduced into one of the copper-oxygen planes of
a cuprate superconductor. The oxygen defect captures two electrons from the metallic
band, going from a 2s2 2p4 to a 2s2 2p6 configuration.
23
trostatic potential and the electronic charge density will be re-
duced.
If we model the electronic density of states in this material
with our box-potential DOS, we can think of this reduction in
the local charge density in terms of raising the DOS parabola
near the defect (cf. Fig. 10). This will cause the free electronic
near charged Away from
defect charged defect
e
EF
-eδU
Figure 10: The shift in the DOS parabola near a charged defect.
charge to flow away from the defect. Near the defect (since
e < 0 and hence eδU (rnear ) < 0)
Z E +eδU (r
F near )
n(rnear ) ≈ 0
D(E)DE (47)
24
While away from the defect, δU (raway ) = 0, so
Z E
F
n(raway ) ≈ 0
D(E)DE (48)
or
Z E +eδU (r) Z E
F F
δn(r) ≈ 0
D(E)DE − 0
D(E)DE (49)
If |eδU | ¿ EF , then
25
Lets estimate this distance for our square-well model,
a0 π a0
rT2 F = ≈
3(3π 2n)1/3 4n1/3
1 n −1/6
rT F ≈ (54)
2 a30
◦
In Cu, for which n ≈ 1023 cm−3 (and since a0 = 0.53 A)
³ ´−1/6
1 1023 −8 ◦
rT F Cu ≈ ≈ 0.5 × 10 cm = 0.5 A (55)
2 (0.5 × 10−8)−1/2
Thus, if we add a charge defect to Cu metal, the effect of the
1 ◦
defect’s ionic potential is screened away for distances r > 2 A.
r r
/r
/r
-r/rTF
rTF=1/4
-r/rTF
rTF=1
-1/6
-e
rTF= n
-e
bound states
free states
Figure 11: Screened defect potentials. As the screening length increases, states that
were free, become bound.
27
3.2 Fermi liquids
28
conductivity and super fluidity.
One may construct Fermi liquid theory either starting from a
many-body diagrammatic or phenomenological viewpoint. We,
as Landau, will choose the latter. Fermi liquid theory has 3
basic tenants:
3.3 Quasi-particles
30
quantified by δnp = np − n0p
δp,p0 for a particle p0 > pF
δnp =
. (62)
0
−δp,p0 for a hole p < pF
If we consider excitations created by thermal fluctuations, then
E particle
excitation
δn p’ = 1
EF
δn p= -1
hole
excitation
D(E)
Now lets take our system and place it in contact with a par-
ticle bath. Then the appropriate potential is the free energy,
31
E
2
δF = p’ /2m - µ
2
EF = µ = p /2m
2 F
δF = µ - p /2m
2
δF = | µ - p /2m |
D(E)
Figure 13: Since µ = p2F /2m, the free energy of a particle or a hole is δF =
|p2 /2m − µ| > 0, so the system is stable to these excitations.
2 -a/r TF
U ≈ ea e
a
Figure 14: Model for a fermi liquid: a set of interacting particles an average distance
a apart bound within an infinite square-well potential.
33
remains in its ground state, then n0p = np. However, clearly in
some situations (superconductivity, magnetism) we will neglect
some eigenstates of the interacting system in this way.
Now let’s add a particle of momentum p to the non-interacting
ideal system, and slowly turn on the interaction. As U is
p p p
time = 0 time = t
2
U=0 U = (e /a) exp(-a/rTF)
34
if we had introduced a hole of momentum p below the Fermi
surface, and slowly turned on the interaction, we would have
produced a quasihole.
Note that this adiabatic switching on procedure will have
difficulties if the lifetime of the quasi-particle τ < t. If so,
then the process is not reversible. If we shorten t so that again
τ À t, then the switching on of U may not be adiabatic (ie., we
will evolve to a system which is not in its ground state). Such
difficulties do not arise so long as the energy of the particle is
close to the Fermi energy. Here there are few states accessible
for creating particle-hole excitations. In fact, one can formulate
a perturbative argument that the lifetime of a quasi-particle is
proportional to the square of its excitation energy above the
p2 p2F
Fermi energy ε = 2m − 2m ≈ v(p − pF ).
To estimate this lifetime consider the following argument
from AGD: A particle with momentum p1 above the Fermi
surface (p1 > pF ) interacts with one of the particles below the
Fermi surface with momentum p2. As a result, two new par-
ticles appear above the Fermi surface (all other states are full)
35
p4
p3
p1 p2
Figure 16: A particle with momentum p1 above the Fermi surface (p1 > pF ) interacts
with one of the particles below the Fermi surface with momentum p2 . As a result, two
new particles appear above the Fermi surface (all other states are full) with momenta
p3 and p4 ..
k1 − k 3 = k 4 − k 2 . (68)
N(E)
µ ¶
ε1 2
Thus, the scattering rate 1/τ is proportional to EF so that
excitations of sufficiently small energy will always be sufficiently
long lived to satisfy the constraints of reversibility. Finally, the
fact that the quasiparticle only interacts with a small number of
other particles due to Thomas-Fermi screening (i.e. those within
a distance ≈ RT F ), also significantly reduces the scattering rate.
38
3.4 Energy of Quasiparticles.
39
where ²p = δE/δnp. Note that ²p is intensive (ie. it is indepen-
dent of the system volume). If δnp = δp,p0 , then E ≈ E0 + ²p0 ;
i.e., the energy of the quasiparticle of momentum p0 is ²p0 .
In practice we will only need ²p near the Fermi surface where
δnp is finite. So we may approximate
²p ≈ µ + (p − pF ) · ∇p ²p|pF (72)
40
define m∗ as the constant of proportionality at the fermi surface
41
4 Interactions between Particles: Landau Fermi
Liquid
and since
X
E − E0 ≈ ²pδnp , (80)
p
we get
X
F − F0 ≈ (²p − µ) δnp . (81)
p
have
1 X
|δnp| ¿ 1 . (82)
N p
6 0, ²p − µ will also be of order δ. Thus,
Where δnp =
X
(²p − µ) δnp ∼ O(δ 2) , (83)
p
44
reversal, so
fpσ,p0 σ0 = f−p−σ,−p0 −σ0 , (88)
Then
fpσ,p0 σ0 = fp−σ,p0 −σ0 . (90)
a 1³ ´
s 1³ ´
fp,p 0 = fp↑,p0 ↑ − fp↑,p0 ↓ fp,p 0 = fp↑,p0↑ + fp↑,p0 ↓ .
2 2
(91)
a
fp,p 0 may be interpreted as an exchange interaction, or
s 0 a
fpσ,p0 σ0 = fp,p 0 + σ · σ fp,p0 (92)
s 0
fp,p 0 will only depend upon the angle θ between p and p , and
45
a s
so we may expand either fp,p 0 and fp,p0
∞
X
α
fp,p 0 = flα Pl (cos θ) . (93)
l=0
Figure 19: The addition of another particle to a homogeneous system will yeilds in
forces on the quasiparticle which tend to restore equilibrium.
(recall that fp,p0 = fp0 ,p ). Both terms here are O(δ). The
second term describes the free energy of a quasiparticle due to
the other quasiparticles in the system (some sort of Hartree-like
term).
The term ²̃p plays the part of the local energy of a quasi-
particle. For example, the gradient of ²̃p is the force the system
exerts on the additional quasiparticle. When the quasiparticle
is added to the system, the system is inhomogeneous so that
δnp0 = δnp0 (r). The system will react to this inhomogeneity
by minimizing its free energy so that ∇r F = 0. However, only
the additional free energy due the added particle (Eq. 95) is
inhomogeneous, and has a non-zero gradient. Thus, the system
will exert a force
X
−∇r ²̃ = −∇r fp0,p δnp0 (r) (96)
p0
where the first term is O(δ), and the second O(δ 2). Thus,
X 1 X
≈ h²̃piδnp − fp0,p hδnpihδnp0 i + O(δ 4) (100)
p 2 p,p0
At this point, we may repeat the arguments made earlier to de-
termine the fermion occupation probability for non-interacting
48
Fermions (the constant factor on the right hand-side has no
effect). We will obtain
1
np(T, µ) = , (101)
1 + exp β(h²̃pi − µ)
or
1
δnp(T, µ) = − θ(pf − p) . (102)
1 + exp β(h²̃p i − µ)
However, at least for an isotropic system, this expression bears
closer investigation. Here, the molecular field (evaluated within
kB T of the Fermi surface)
X
h²̃p − ²pi = fp0 ,phδnp0 i (103)
p0
49
In going from the second to the third line above, we made use of
the isotropy of the system, so that hδnp0 i is independent of the
angle θ. The evaluation in the third line, follows from particle
number conservation. Thus, to lowest order in δ
1
np(T, µ) = + O(δ 3) (105)
1 + exp β(²p − µ)
51
or integrating by parts (and renaming p → p0 in the last part),
we get
Z d3 p p Z d3 p
δnp = ∇p²̃pδnp (113)
(2πh̄)3 m (2πh̄)3
XZ d 3 p0 Z d 3 p
−V 3 3
δnpfp,p0 ∇p0 np0 ,
σ 0 (2πh̄) (2πh̄)
Then, since δnp is arbitrary, it must be that the integrands
themselves are equal
p X Z d 3 p0
= ∇p²̃p − V fp,p0 ∇p0 np0 (114)
m σ0 (2πh̄)3
0
The factor ∇p0 np0 = − pp0 δ(p0 − pF ). The integral may be eval-
uated by taking advantage of the system isotropy, and setting
p parallel to the z-axis, since we mostly interested in the prop-
erties of the system on the Fermi surface we take p = pF , let θ
be the angle between p (or the z-axis) and p0, and finally note
¯ ¯
¯ ¯
that on the Fermi surface ∇ ²̃ |
¯
¯
¯
p p p=pF ¯ = vF = pF /m∗ . Thus,
pF pF X Z p02 dpdΩ p0 0
= ∗+ 3
fpσ,p0 σ0 0 δ(p − pF ) (115)
m m σ 0 (2πh̄) p
52
However, since both p and p0 are restricted to the Fermi surface
p0
p0 = cos θ, and evaluating the integral over p, we get
1 1 V pF X Z dΩ
= ∗+ fpσ,p0 σ0 cos θ , (116)
m m 2 σ,σ 0 (2πh̄)3
1
where the additional factor of 2 compensates for the additional
spin sum. If we now sum over both spins, σ and σ 0, only the
symmetric part of f survives (the sum yields 4f s), so
1 1 4πV pF Z
= ∗+ d (cos θ) f s(θ) cos θ , (117)
m m (2πh̄)3
We now expand f in a Legendre polynomial series
X
f α (θ) = flα Pl (cos θ) , (118)
l
or m∗/m = 1 + F1s/3.
The effective mass cannot be experimentally measured di-
rectly; however, it appears in many physically relevant measur-
able quantities, including the specific heat
∂E/V 1 ∂ X
CV =
= ²̃pnp. (122)
∂T V N V ∂T p
55
Chapter 7: The Electronic Band Structure of Solids
April 2, 2001
Contents
1 Symmetry of ψ(r) 3
4 Photo-Emission Spectroscopy 24
1
Free electrons -FLT Band Structure
V(r)
V(r) = V
0
E E
Ef metal
E insulator
E f
f
Ef "heavy" metal
D(E) D(E)
Figure 1: The additional effects of the lattice potential can have a profound effect on
the electronic density of states (RIGHT) compared to the free-electron result (LEFT).
1 Symmetry of ψ(r)
3
Schroedinger equation becomes
2
h̄ 2
Hψ(r) = − ∇ + V (r) ψ = Eψ (3)
2m
X h̄2k 2 X 0 X
⇒ Ckeik·r + Ck0 VGei(k +G)·r = E Ckeik·r , k0 → k−G
k 2m k0 G k
(4)
or
2 2
X h̄ k
ik·r X
e − E
Ck + VGCk−G = 0∀r (5)
k 2m
G
Thus the potential acts to couple each Ck only with its recip-
rocal space translations Ck+G and the problem decouples in to
N independent problems for each k in the first BZ. Ie., each of
the N problems has a solution which is a sum over plane waves
whos’ wave vectors differ only by G’s. Thus the eigenvalues
may be indexed by k.
4
X X X
X X X
First B.Z.
Figure 2: The potential acts to couple each Ck with its reciprocal space translations
Ck+G (i.e. x → x, • → •, and ° → °) and the problem decouples into N indepen-
dent problems for each k in the first BZ.
5
Ie., ψk+G(r) = ψk(r) and as a result
First BZ 2π/a
7
First B.Z.
-π⁄a π⁄a
−π⁄a π⁄a kx
Figure 4: The situation becomes more complicated in three dimensions since there are
many more bands and so they can cross the first zone at lower energies. For example
in a 3-D cubic lattice the energy band structure along kx (ky = kz = 0) is already
rather complicated within the first zone.
We can generally take V0 = 0 since this just sets a zero for the
h̄2 k2
potential. Then, those G for which Ek = Ek−G ≈ 2m are
going to give the largest contribution since
X Ck−G
Ck = VG h̄2k2 (16)
2m − Ek−G
G
Ck−G1
Ck ∼ VG1 h̄2k2 (17)
2m − Ek−G1
8
X Ck−G1−G
Ck−G1 = VG h̄2k2 (18)
2m − Ek−G−G1
G
Ck
Ck−G1 ∼ V−G1 h̄2k2 (19)
2m − Ek
Thus to a first approximation, we may neglect the other Ck−G,
and since VG = V−G (so that V (r) is real) |Ck| ≈ |Ck−G1 | À
other Gk−G
X
(eiGx/2 + e−iGx/2 ) ∼ cos πx
a
ψk(r) = Ck−G ei(k−G)·r ∼
G
(eiGx/2 − e−iGx/2 ) ∼ sin πx
a
(20)
The corresponding electron densities are sketched in Fig. 5.
Clearly ρ+ has higher density near the ionic cores, and will
be more tightly bound, thus E+ < E−. Thus a gap opens in
Ek near k = G2 .
Gap!
ρ− (x)
V(x)
D(E)
Figure 5: ρ+ ∼ cos2 (πx/a) has higher density near the ionic cores, and will be more
G
tightly bound, thus E+ < E− . Thus a gap opens in Ek near k = 2
.
P
G0 VG0−GCk−G0
Ck−G = Ã ! (22)
h̄2
Ek − 2m |k − G|2
h̄2 k2
To a first approximation (VG ' 0) let’s set E = 2m (a free-
electron energy) and ignore all but the largest Ck−G ; ie., those
for which the denominator vanishes.
k2 = |k − G|2 , (23)
10
or in 1-d
2π 2 π
k2 = (k −) or k = − (24)
a a
This is just the Laue condition, which was shown to be equiv-
alent to the Bragg condition. Ie., the strongest perturbation
to the free-electron picture occurs for states with energies at
the edge of the first B.Z. Thus the equation above also tells us
highly perturbed
that Ck and Ck−G1 are the most important coefficients (if this
electronic state was unperturbed, only Ck would be important).
Thus approximately for VG ∼ 0, V0 ≡ 0 and for k near the
11
zone boundary
h̄2k 2
G=0 Ck E− = VG1 Ck−G1 (25)
2m
2 2
h̄ |k − G1|
G = G1 Ck−G1 E− = V−G1 Ck, (26)
2m
or ¯
¯
¯
¯
¯ ¯
¯
¯ Ek0 −E V G1 ¯
¯
¯
¯
¯
¯
¯
¯
=0 (28)
0
¯
¯ V−G1 Ek−G 1
−E ¯
¯
∆E = E+ − E− = 2|VG1 | (32)
12
E
k
e−
2 VG k
-π/a 0 π/a k
Figure 7:
And the band structure looks something like Fig. 7. Within this
approximation, the gap, or forbidden regions in which there are
no electronic states arise when the Bragg condition (kf − k0 =
G) is satisfied.
| − k| ≈ |k + G| (33)
13
ever, in chapter one, we considered band structure qualitatively
and determined that gaps could arise from perturbing about
the atomic limit. This in fact, is another natural way of con-
State
Energies
Separation
Figure 8: Band gaps in the electronic DOS naturally emerge when perturbing around
the atomic limit. As we bring more atoms together (left) or bring the atoms in the
lattice closer together (right), bands form from mixing of the orbital states. If the
band broadening is small enough, gaps remain between the bands.
14
V (r)
A
E i ψi
r r
16
This is called the Raleigh-Ritz variational principle.
Consistent with our original motivation, we will approximate
ψk with a sum over atomic states.
X X
ψk ' φ k = anφi(r − rn) = eik·rn φi(r − rn) (41)
n n
17
φ (r-r )
i 1 φi (r-r2 )
Figure 10: In the tight binding approximation, we assume that the atomic orbitals
of adjacent sites have a very small overlap with each other.
18
B
i
Bi
B A
i i
B
i
Figure 11: A simple cubic tight binding lattice composed of s-orbitals, with overlap
integral Bi .
Z
Bi = − φ∗i (r − rm)v(r − rn)φi(r − rn)d3 r (46)
Thus
X
E(k) ' Ei − Ai − Bi eik(rn−rm) sum over m n.n. to n
m
(48)
Now, if we have a cubic lattice, then
The electrons near the zone center act as if they were free with
a renormalized mass.
h̄2k 2 2 2 1
= B i a k , i.e. ∝ curvature of band (52)
2m∗ m∗
For this reason, the hybridization term Bi is often associated
with kinetic energy. This makes sense, from its origins of wave
function overlap and thus electronic transfer.
The width of the band, 12 Bi, will increase as the electronic
overlap increases and the interatomic orbitals (core orbitals or
valance f and d orbitals) will tend to form narrow bands with
high effective masses (small Bi).
20
First B.Z.
Fermi surface
Figure 12: Electronic states for a cubic lattice near the center of the B.Z. act like free
electrons with a renormalized mass. Hence, if the band is partially filled, the Fermi
surface will be spherical.
21
Atomic Potential Tight Binding Bands
E Ek
2 111
E 111
1
A2
k 12 B 2
k=0
E 1k
111
E
2
a A1
12 B1
k
- π/a π/a
Figure 13: In the tight-binding approximation, band form from overlapping orbitals
states (states of the atomic potential). The bandwidth is proportional to the hy-
bridization B (12B for a SC lattice). More localized, compact, atomic states tend to
form narrower bands.
bonding band. (See Fig.14). Here, the gap is not tied to the
periodicity of the lattice, and so an amorphous material of C
may also display a gap.
The tight-binding picture can also explain the variety of fea-
tures seen in the DOS of real materials. For example, in Cu
(Ar)3d10 4s the d-orbitals are rather small whereas the valence
s-orbitals have a large extent . As a result the s-s hybridization
22
E
sp3antibonding
sp3 bonding
ra a
Figure 14: C (diamond) with atomic configuration of 1s2 2s2 2p2 . Its valance s and p
states form a strong sp3 hybrid band which is split into a bonding and anti-bonding
band.
d
1 +
− −
2
+ + + +
− −
−
s
+ +
−
Figure 15: Schematic DOS of Cu 3d10 4s1 . The narrow d-band feature is split due to
crystal fields.
4 Photo-Emission Spectroscopy
24
synchrotron
d
hω r
detector
α e
material
sample
Figure 16: XPS Experiment: By varying the voltage one may select the kinetic
energy of the electrons reaching the counting detector.
mass also varies inversely with the curvature of the bands. The
density of states away from the Fermi surface can allow us to
predict the properties of the material upon doping, or it can
yield information about core-level states. Thus it is important
to be able to measure D(E). This may be done by x-ray pho-
toemission (XPS), UPS or PS in general. The band dispersion
E(k) may also be measured using angle-resolved photoemission
(ARPES) where angle between the incident radiation and the
25
detector is also measured.
The basic idea is that a photon (usually an x-ray) is used
to knock an electron out of the system (See figure 17.) Of
hω - Eb -φ
X-ray
E
hω - Eb
Eb
Intensity(E)
D(E)
Kinetic Energy of electrons hω - φ
Figure 17: Let the binding energy be defined so that Eb > 0, φ = work function, then
the detected electron intensity I(Ekin − h̄ω − φ) ∝ D(−Eb )f (−Eb )
26
h̄ω −φ. From Fermi’s golden rule, we know that the probability
per unit time of an electron being ejected is proportional to
the density of occupied electronic states times the probability
(Fermi function) that the electronic state is occupied
1
I(Ekin ) = ∝ D(−Eb)f (−Eb)
τ (Ekin )
∝ D(Ekin + φ − h̄ω)f (Ekin + φ − h̄ω) (55)
phonon Coulombic
interation
e
Figure 18: Left: Origin of the background in I(Ekin . Right: Electrons excited deep
within the bulk scatter so often that they rarely escape. Thus, most of the signal I
originates at the surface, which must be clean and representative of the bulk.
6
background subtracted
background
’’raw’’ data
4
I(Ekin)
0
0 1 2 3 4
Ekin hω-φ
28
e
measure
E kin
hω
the material. Thus, we only learn about D(E) near the surface
of the material. Therefore it is important for this surface to be
“clean” so that it is representative of the bulk. For this reason
these experiments are often carried out in ultra-high vacuum
conditions.
We can also learn about the electronic states D(E) above the
Fermi surface, E > FF , using Inverse Photoemmision. Here,
an electron beam is focussed on the surface and the outgoing
flus of photons are measured.
29
Chapter 8: Magnetism
Contents
1 Introduction 2
1.1 The Relevance of Magnetostatics . . . . . . . . . . . . . . . . . . . . 3
1.2 Non-interacting Magnetic Systems . . . . . . . . . . . . . . . . . . . . 4
1
5 Spin Waves 43
5.1 Quantization of Ferromagnetic Spin Waves . . . . . . . . . . . . . . . 49
5.2 Antiferromagnetic Spin Waves . . . . . . . . . . . . . . . . . . . . . . 56
2
1 Introduction
Figure 1: Both moment formulation and the correlation between these moments (J)
are due to Coulombic effects
3
1.1 The Relevance of Magnetostatics
4
Or roughly one degree Kelvin! Magnetic correlations due to
magnetic interactions would be destroyed by thermal fluctua-
tions at very low temperatures.
5
at LANL by blowing things up), thus
kB T > µ B B (7)
µ ¶
−β 12 gµB B β 12 gµB B
1 1 e −e
↑s= hmi = gµB µ ¶ (8)
2 2 e−β 12 gµB B + eβ 12 gµB B
µB µB B
'g tanh(βµB B) ≈ µB (g ∼ 2)
2 T
2
∂ hmi µ
χ = ≈ B (9)
∂B kB T
6
Once again, the energy of the moment-field interaction is roughly
µ ¶
−8 eV 2
µ2B 10 Te
E∼ B2 ∼ µ
−4
2
¶2 B , (k = 1) (10)
T T 10◦KeV
When E ∼ T , thermal fluctuations destroy the orientation of
the moments with the external field. If B ∼ 10T e
100 ◦
E∼ K (11)
T
or E ∼ T at 10◦K! However, we know that systems such as
E
∼ kT/EF
D(E)
Figure 2: In a metal, only the electrons near the Fermi surface, which are not paired
2
kT µB
into singlets, contribute to the bulk susceptibility χ ∼ EF k B T
∼ µ2B D(EF )
iron exist for which a small field can induce a relatively large
moment at room temperature. This is surprising since for a
metal, or a free electron gas, the susceptibility is much smaller
7
than the free electron result, since only the spins near the Fermi
µ2B
surface can participate, χ = EF . Note that this is even smaller
kB T
than the free electron result by a factor of EF ¿ 1!
8
Max L
Max S
ψ(x)
e- e-
Figure 3: Hunds rules, Maximize S and L, both result from minimizing the Coulomb
energy.
2ε+U
moment forms
µ Γ
a
e2 − r T F
Figure 5: A moment forms on an orbital provided that Γ À U, |²|. U ∼ a
e
rT F is small for a metal, large for an insulator
ξ n
e
m
Figure 6: Here ξ = − 1c ∂φ
∂t
, m = −µB L and φ = Bπa2
11
⇒ diamagnetism with χ ∝ a2. In the free electron limit (see
J.M. Ziman, )
à !2
2 1 m
χ = µB D(EF ) 1 − ∗
(13)
3 m
2
eV
∆E ∼ 10−8 a10−1 eV −1B 2 ∼ 10−9 eV B 2(T ) (14)
T
m∗
For insulators, often with m < 1 the diamagnetism wins;
m∗
whereas for metals, generally with m ≥ 1, the Pauli para-
magnetism wins.
12
ξ
Figure 7: If the magnetic moments in a small volume are correlated, then the magnetic
susceptibility is strongly enhanced.
(mµB B)2
E ≈ (16)
kB T
For magnetism to be significant at room temperature (300K)
we must increase the energy of our system in a field. This
may be accomplished by increasing the effective moment m by
correlating adjacent moments. If the range of this correlation is
4πξ 3 ξ
ξ, so that roughly 3a3
moments are correlated, then let a =3
so that ∼ 102 moments are correlated and m ∼ 102. This
increases E by about 104 , so that E ∼ kbT at T ∼ 103 K.
The observed (measured) susceptibility also then increases by
about 104, all by only correlating moments in a range of 3 lattice
13
spacings.
Clearly correlations between adjacent spins can make mag-
netism in materials relevant. Such correlations are due to elec-
tronic effects and are hence usually short ranged due to elec-
1
tronic (Thomas-Fermi) screening. If we consider two s = 2
H = −2Jσ1 · σ2 (17)
Et − Es = −J (20)
14
1 2
r12
e e
r1B r2A
r1A r
2B
+ +
e R e
AB
A B
H = H1 + H2 + H12 (21)
h̄2 2 e2 e2
H1 = − ∇ − − (22)
2m r1A r1B
e2 e2
H12 = + (23)
r12 RAB
As we did in Chap. 1 to describe binding, we will use the atomic
wave functions to approximate the molecular wavefunction ψ 12.
or
ψ12 = φA(1)φB (2) ± φB (1)φA (2) ⊗ spin part (27)
The energy of these states may then be calculated by evaluating
hψ12 |H|ψ12 i
hψ12 |ψ12 i .
hψ12|H|ψ12i C± A
E= = 2EI + , + singlet , − triplet
hψ12|ψ12i 1±S
(28)
16
where
Z
2 ∗
h̄2 2 e2
EI = d r1φA(1) − ∇ 1 − φA(1) < 0 (29)
2m r1A
All EI , C, A, S ∈ <. So
C −A C + A
−J = Et − Es = 2EI + − 2EI +
1−S 1+S
C −A C +A
−J = − >0
1−S 1+S
A − SC
J= 2 <0 (33)
1 − S2
where the inequality follows since the last two terms in the {}
dominate the integral for A and in the Heitler-London approx-
17
Figure 9:
18
2.2.2 Exchange Interaction for Delocalized Spins
e
ri
e
V r
j
+ +
Ze Ze
Figure 11: Electron a screens the potential seen by electron b, raising its energy. Any-
thing which keeps pairs of electrons apart, but costs no energy like the exchange hole
for the electronic triplet, will lower the energy of the system. Thus, triplet formation
is favored thermodynamically.
21
i.e. ri ≡ 0
e
O r=rj
e
Figure 12: Geometry to calculate the exchange interaction.
1 1 # electrons
n↑ = n = (39)
2 2 volume
In terms of an electronic charge density, this is
en
ρex (r) = (1 − cos [(ki − kj ) · r])
2
en
1 Z kF 3 3 1 µ ı(ki−kj )·r −ı(ki −kj )·r
¶
= 1 − d k i d k j e + e
2
( 34 kF3 )2 o 2
en 4 3 −2 Z kF 3 ıki·r Z kF 3 ıkj ·r
= 1 − ( kF ) 0 d ki e d kj e
2
3 0
2
en
(sin kF r − kF r cos kF r)
= 1−9 (40)
2
(kF r)6
Note that both of the exponential terms in the second line are
the same, since we integrate over all ki → −ki & kj → −kj .
Since we have only been considering Pauli-principle effects, the
electronic density of spin down electrons remains unchanged.
Thus, the total charge density around the up spin electron fixed
22
ρeff
en
1/2
2 4 kF r
Figure 13: Electron density near an electron fixed at the origin. Coulomb effects would
reduce the density for small r further, but would not significantly effect the size of the
exchange hole or the range of the corresponding potential, both ∼ 1/kF .
at the origin is
9 (sin kF r − kF r cos kF r)2
ρef f (r) = en 1 − (41)
2 (kF r)6
The size of the exchange hole, and the range of the correspond-
1
ing ferromagnetic exchange potential, is ∼ kF ∼ a which is
rather short.
23
theory of FM where the mean effect of a spin-up electron is to
lower the energy of all other band states of spin up electrons by
a small amount, independent of k.
IN↑ < 1eV
E↑(k) = E(k) − ; I∼ (42)
N
Likewise for spin down
IN↓
E↓(k) = E(k) − . (43)
N
Where I, the stoner parameter, quantifies the exchange hole
energy. The relative spin occupation R is related to the bulk
moment
(N↑ − N↓) N
R= , M = µB R (44)
N V
Then
I(N↑ +N↓ )
Eσ (k) = E(k) − 2N − σIR
2 , (σ = ±) (45)
≡ Ẽ(k) − σIR
2 . (46)
E
F
E E E
F
f′′ f′′′
E E
Figure 14:
1 X ∂f Z V V
T = 0, − = dẼ D(Ẽ)δ(Ẽ−EF ) = D(EF ) = D̃(EF )
N k ∂Ek 2N 2N
(52)
So, the condition for FM at T = 0 is I D̃(EF ) > 1. This is
known as the Stoner criterion. I is essentially flat as a function
I (eV)
Ni
∼
1.0 D(E F) (eV-1)
Fe
Co
1.0 Na
Li
Z 50 Z
Figure 15:
26
of the atomic number, thus materials such as Fe, Co, & Ni with
a large D̃(EF ) are favored to be FM.
3.1 Enhancement of χ
or as M = µB N
V R, we get
N D̃(EF )
M = 2µ2B B (55)
V 1 − I D̃(EF )
or
∂M χ0
χ = = (56)
∂B 1 − I D̃(EF )
27
χ0 = 2µ2B N
V D̃(EF ) (57)
= µ2B D(EF ) (58)
Figure 16: Spin fluctuations can reduce the total moment within the correlated region,
and even reduce ξ itself. Both effects lead to a reduction in the bulk susceptibility
χ ∼ moment2
28
3.2 Finite T Behavior of a Band Ferromagnet
correlated
Figure 17: In metallic Ni, the d-orbitals are compact and hybridize weakly due to
low overlap with the s-orbitals (due to symmetry) and with each other (due to low
overlap). Thus, moments tend to form on the d-orbitals and they contribute narrow
features in the electronic density of states. The s-orbitals hybridize strongly and form
a broad metallic band.
1 1 R̃Tc
R̃ = Ã ! − Ã ! = tanh (62)
exp −2R̃Tc
+1 exp 2R̃Tc
+1 T
T T
R 1 n↑ −n↓
If T = 0, then R̃ = 1 = C = C N . For Ni, the measured
µef f n↑ −n↓
ground state magnetization per Ni atom is µB = 0.54 = N .
µef f
Therefore, C = 0.54 = µB .
30
For small x, tanh x ' x − 31 x3 , and for large x
sinh x ex − e−x 1 − e−2x
tanh x = = =
cosh x ex + e−x 1 + e−2x
= (1 − e−2x )(1 − e−2x ) ' 1 − 2e−2x (63)
Thus,
2Tc
R̃ = 1 − 2e− T , for T ¿ Tc (64)
√ 1
R̃ = 3(1 − TT ) 2 , < Tc
for small R̃ or T ∼ (65)
C
∼
R
Eq(65)
T/Tc
Figure 18:
1
critical exponent β = 2 in Eq. 65 is found to be reduced to
1
≈ 3, and Eq. 64 loses its exponential form, in real systems.
Using more realistic D̃(E) or values of I will not correct these
problems. Clearly something fundamental is missing from this
31
model (spin waves).
Figure 19: Local spin-flip excitations, left, due to thermal fluctuations are properly
treated by mean-field like theories such as the one discussed in Secs. 3 and 4. However,
non-local spin fluctuations due to intersite correlations between the spins are neglected
in mean-field theories. These low-energy excitations can fundamentally change the
nature of the transition.
3.2.1 Effect of B
δ=2
δ=3 i δ=1
δ=4
P P
Figure 20: Terms in the Heisenberg Hamiltonian H = − iδ Jiδ Si · Siδ − gµB B0 i Si
Here i refers to the sites and δ refers to the neighbors of site i.
33
systems with localized spins are described by the Heisenberg
Hamiltonian.
X X
H=− Jiδ Si · Siδ − gµB B0 Si (69)
iδ i
Figure 21: The mean or average field felt by a spin Si at site i, due to both its neighbors
1 P P
and the external magnetic field, is gµB
h δ Jiδ Siδ i + B0 = Bief f . Where h δ Jiδ Siδ i
is the internal field, due to the neighbors of site i.
X X X
H≈ − gµB Bief f · Si = − Si · Jiδ hSiδ i + gµB B0
i i δ
(70)
If Jiδ = J is a constant (independent of i and δ) describing the
34
exchange between the spin at site i and its ν nearest neighbors,
then
P
J iδ hSiδ i + gµB B0 Jν
Bief f = = hSi + B0 (71)
gµB gµB
N
M = gµB hSi ; ν = #nn . (72)
V
For a homogeneous, ordered system,
V
Bef f = νJM + B0 = BM F + B0 (73)
N g 2µ2B
and
X
H ≈ −gµB Bef f · Si (74)
i
and
1
P↓ ∝ e−β (+gµB Bef f 2 ) (76)
so, on average
N↓
= e−βgµB Bef f (77)
N↑
35
and, since N↑ + N↓ = N
1 N↑ − N ↓ 1 N β
M = gµB = gµB tanh gµB Bef f (78)
2 V 2 V 2
Since tanh is odd and Bef f ∝ M , this will only have non-
trivial solutions if J > 0 (if B0 = 0). If we identify
N gµB 1 J
Ms = ; Tc = ν (79)
V 2 4 k
M Tc M
= tanh (80)
Ms T Ms
< Tc
and again for T = 0, M (T = 0) = Ms, and for T ∼
a = tanh (ba)
y=a
y = tanh (ba)
initial slope = b
Figure 22: Equations of the form a = tanh(ba), i.e. Eq. 80, have nontrivial solutions
(a 6= 0) solutions for all b > 1.
1
M √ T 2
' 3 1−
(81)
Ms Tc
36
Again, we get the same (wrong) exponent β = 12 .
When is this approximation good? When each spin really
feels an ”average” field. Suppose we have an ordered solid, so
that
Jν
BM F == Breal (82)
2gµB
Now, consider one spin flip excitation adjacent to site i only,
Figure 23: The flip of a single spin adjacent to site i makes a significant change in
the effective exchange field, felt by spin Si , if the site has few nearest neighbors.
Fig. 23.
If there are an infinite # of spins then BM F remains un-
changed but for ν < ∞
Jν ν − 2 Jν
Breal = 6= BM F = . (83)
2gµB ν 2gµB
Clearly, for this approximation to remain valid, we need B real =
37
ν −2
BM F , which will only happen if ν = 1 or ν À 2. The
more nearest neighbors to each spin, the better MFT is! (This
remains true even when we consider other lower energy excita-
tions, other than a local spin flip, such as spin waves).
J<0
"down" sublattice
"up" sublattice
Figure 24: Antiferromagnetism (the Neel state) on a bcc lattice is composed of two
interpenetrating sc sublattices lattices.
38
example, the central site shown in Fig. 24 feels a mean field
from the ν = 8 near-neighbor spins on the “down” sublattice.
so
− 2 2 −
⁄ N g µB )νJM
B = V/(
MF
Figure 25:
+
1 N gµB
V
M+ = gµB tanh − 2 2 νJM − (84)
2 V 2kT N g µB
M− = (+ ↔ −) . . . (85)
− +
M = −M (87)
39
Again, these equations will saturate at
1 N+
Ms+ = −Ms− = gµB (88)
2 V
so
M+ TN M +
= tanh T M
(89)
Ms+ s
where TN = − 14 kνJ
B
M − + ∆M − = (+ ↔ −) . . . (90)
( )
d 1 ∂M
Or, since dx tanh x = cosh2 x
, then ∆M = ∂Bef f ∆Bef f .
+ − 1 N + gµB 1 V νJ
∆M = ∆M +∆M = gµB 2 B 0 + ∆M
2 V 2kT cosh x 2N −g 2µ2B
(91)
TN M +
where x = T Ms+ . For T > TN , M + = 0 and so x = 0, and
g 2µ2B N 4kB TN V
∆M = 2 B 0 −
2 2 ∆M (92)
8V kB T N g µB
g 2µ2B N
T ∆M = B0 − ∆M TN (93)
4V kB
40
g 2µ2B N
∆M = B0 (94)
4V kB (T + TN )
g 2µ2B N
χ = (95)
4V kB (T + TN )
T
0 T
−TN N
Figure 26: Sketch of χ = Const/(T + TN ). Unlike the ferromagnetic case, the bulk
susceptibility χ does not diverge at the transition. However, as we will see, this
equation only applies for the paramagnetic state (T > TN ), and even here, there are
important corrections.
41
B0
Figure 27: When T ¿ TN , a weak field applied parallel to the sublattice magnetization
TN
axis only weakly perturbs the spins. Here M + (T ) ≈ Ms+ and x ≈ T
g 2µ2B N −2 TN
χ' e T (97)
4V kB
Now consider the case where B0 is perpendicular to the mag-
netic axis. The external field will cause each spin to rotate a
B
0
B0
BMF
small angle α. (See Fig. 28) The energy of each spin in this
42
external field and the mean field ∝ ν gµJ to the first order in
B
B0 is
1 1
E = − gµB B0 sin ∝ + νJ cos α (98)
2 2
Equilibrium is obtained when ∂E
∂α = 0. Since B0 is taken as
small, α ¿ 1.
1 1 1
E ∼ − gµB B0α + νJ 1 − α2 (99)
2 2 2
or
∂E 1 1 gµB B0
= 0 = gµB B0 + να ⇒ α = − (100)
∂α 2 2 νJ
The induced magnetization is then
1 gµB B0 g 2µ2B N B0
∆M = α =− (101)
2 V 2νJV
so
g 2µ2B N
χ⊥ = = constant (102)
2ν |J| V
Of course, in general, in a powdered sample, the susceptibility
will reflect an average of the two forms, see for example Fig. 30.
43
χ
χ powdered sample
χ⊥
χ
χ
T T T
N
Figure 29: Below the Neel transition, the lattice responds very differently to a field
applied parallel or perpendicular to the sublattice magnetization. However, in a pow-
dered sample, or for a field applied in an arbitrary direction, the susceptibility looks
something like the sketch on the right.
5 Spin Waves
44
1.2e-5
1.1e-5
1.0e-5
9.0e-6
8.0e-6
7.0e-6
6.0e-6
5.0e-6
4.0e-6
3.0e-6
2.0e-6
1.0e-6
0.0e+0
0 5 10 15 20 25 30 35 40 45 50 55 60 65 70 75 80 85 90 95
T (K)
Figure 30: High temperature superconductor Y123 with 25% Fe substituted on Cu,
courtesy W. Joiner, data from a SQUID magnetometer.
tion.
Consider a ferromagnetic Heisenberg Model
X
H = −J Si · Si+δ (103)
iδ
1
where Si = x̂Six + ŷSiy + ẑSiz . If we define |αi =
(i.e.
0
45
0
|↑i), β =
(i.e. |↓i) so that
1
0
z 1 0
S
=−
··· (104)
1 2 1
and
1 0 −i 1 0 1 1 1 0
Sx =
Sy =
Sz =
2 i 0 2 1 0 2 0 −1
(105)
· ¸
α β
S ,S = i²αβγ S γ (106)
− x y
0 0
·
2 ±
¸
S = S − iS =
S ,S =0 (108)
1 0
0
1
0
S+
=
, S−
= 0... (109)
1 0 1
They allow is to rewrite H as
X 1³ + − + ´
H = −J z
Siz Si+δ + Si Si+δ + S−Si+δ (110)
iδ 2
46
Since J > 0, the ground state is composed of all spins oriented,
for example
|0i = Πi |αii (i.e. all up) (111)
−
Si+Si+δ |0i = 0 (112)
and
1
Siz Si+δ
z
|0i = |0i (113)
4
so
1
H |0i = − JνN |0i ≡ E0 |0i (114)
4
where N is the number of spins each with ν nearest neighbors.
Now consider a spin-flip excitation. (See Fig. 31)
j
Figure 31: A single local spin-flip excitation of a ferromagnetic system. The resulting
state is not an eigenstate of the Heisenberg Hamiltonian.
47
−
This is not an eigenstate since the Hamiltonian operator Sj+Sj+δ
will move the flipped spin to an adjacent site, and hence create
another state.
However, if we delocalize this spin-flip excitation, then we can
create a lower energy excitation (due to the non-linear nature
of the inter-spin potential) which is an eigenstate. Consider the
j
Figure 32: If we spread out the spin-flip over a wider region then we can create a
lower energy excitation. A spin-wave is the completely delocalized analog of this with
one net spin flip.
state
1 X
|ki = √ eik·rj |↓j i . (116)
N j
It is an eigenstate. Consider:
1 X ik·rj 1
H |ki = √ e − νJ(N − 2) |↓j i
N j 4
1 1 X
+ νJ |↓j i − J (|↓j+δ i + |↓j−δ i) (117)
2 2 δ
where the sum in the last two terms on the right is over the near-
48
neighbors δ to site j. The last two terms may be rewritten:
1 X ik·rj √ X ik·(r −r )
√ e |↓j+δ i = 1 N e m δ |↓mi (118)
N j m
where
rj+δ = rj + rδ = rm (119)
so
1 1 X µ ik·rδ −ik·rδ
¶ 1 X
H |ki = − νJN + νJ − J e +e
√ eik·rj |↓j i
4 2 δ N j
(120)
Thus |ki is an eigenstate with an eigenvalue
1X
E = E0 + Jν 1 − cos k · rδ (121)
ν δ
Apparently the energy of the excitation described by |ki van-
ishes as k → 0.
What is |ki? First, consider
X X 1 X
S z |ki = Siz |ki = Siz √ eik· rj
|↓j i
i i N j
1 X ik·rj X z
=√ e Si |↓j i = (SN − 1) |ki (122)
N j i
In the ground state all of the spins are up. If we flip a spin,
using a spin-wave excitation, then
Siz = S − a†i ai
Si+ ∝ ai
Si− ∝ a†i (124)
50
Sz n
3
2 0
1
2 1
− 12 2
− 32 3
Table 1: The correspondence between S z and the number of spin-wave excitations on
a site with S = 3/2.
√
ai |ni = ni |n − 1i (126)
√
a†i |ni = ni + 1 |n + 1i (127)
consider
n
+ − − +o
S S −S S |ni = 2S z |ni = 2(S − n) |ni (129)
51
If S + = a, and S − = a†, then the left-hand side of the above
equation would be {(n + 1) − n} |ni = |ni 6= 2(S − n) |ni.
In order to maintain the commutators, we need
+
√ − †
√
S = 2S − n a S = a 2S − n (130)
Then
n ≤ 2S (132)
This transformation
r r
Si+ = 2S − a†i aiai Si− = a†i 2S − a†i ai Siz = S − a†i ai
(133)
52
is called the Holstein Primakoff transformation.
If we Fourier transform these operators,
1 X 1 X
a†i = √ eik·Ri a†k ; ai = √ e−ik·Ri ak (134)
N k N k
Of course this is only exact for ni ¿ 2S, i.e. for low T where
there are few spin excitations, and large S (the classical spin
limit).
In this limit
v
u
2S X ik·Ri
u
u
Si+ ' t e ak (137)
v
N k
u
u 2S X
u
Si− ' t e−ik·Ri a†k (138)
N k
53
1 X 0
Siz = S − ei(k−k )·Ri a†k ak0 , (139)
N kk 0
the Hamiltonian
X z z 1 + − − +
H = −J S S
i i+δ + (S S + S S ) (140)
iδ 2 i i+δ i i+δ
may be approximated as
X
H ' −N JνS 2 + 2JνS a†k ak
k
X 1 X ik· Rδ †
−2JνS e ak ak + O(a4k ) (141)
k ν δ
X
H ' E0 + 2JνS(1 − γk )a†k ak + O(a4k ) (142)
k
1 P ikRδ
where γk = ν δe . This is the Hamiltonian of a collection of
k-q
k′+q k k′
k k′
Figure 33: The fourth order correction to Eq. 142 corresponds to interactions between
the spin waves, giving them a finite lifetime
54
corresponds to interactions between the spin waves. These in-
teractions are a result of our definition of a spin-wave as an
itinerant spin flip in an otherwise perfect ferromagnet. Once
we have one magnon, another cannot be created in a “perfect”
ferromagnetic background.
Clearly if the number of such excitations is small (T small)
and S is large, then our approximation should be valid. Fur-
thermore, since these are the lowest energy excitations of our
spin system, they should dominate the low-T thermodynamic
properties of the system such as the specific heat and the mag-
netization. Consider
X h̄ωk
hEi = . (143)
k eβh̄ωk − 1
For small k, h̄ωk = 2JνS(1 − γk ) = 2JνSk 2 on a cubic lattice.
Then let’s assume that the k-space is isotropic, so that d3k ∼
k 2dk, then
2 k2
γk = (cos kx + cos ky + · · ·) = 1 − (144)
ν ν
and
X 2JνSk 2 Z ∞ k 4dk
hEi ≈ ∝ (145)
k eβ2JνSk2 − 1 0 eβαk2 − 1
55
1 1
x 2 1 1 2 1
x = βαk 2 k= dk = x− 2 dx (146)
βα 2 βα
so that
−2 −1/2 x3/2Z ∞
hEi ∝ β β 0
dx x ∼ T 5/2 (147)
e −1
Thus, the specific heat at constant volume CV ∼ T 3/2, which
is in agreement with experiment.
M/M(0)
1 - T 3/2
56
since each magnon corresponds to spin flip. Thus
k 2dk Z 3
M (T ) − M (0) ∼ − βαk2 ∼ T2 (149)
e −1
which clearly dominates the exponential form found in MFT
2Tc
(1 − 2e T ). This is also consistent with experiment!
down sublattice
up sublattice
where v
u
u ni
fi(S) = 1 − t and ni = a†i ai (151)
2S
Again this transformation is exact (canonical) within the man-
ifold of allowed states
0 ≤ ni ≤ 2S ⇔ −S ≤ Sz ≤ S . (152)
The Hamiltonian
X 1³ + − + ´
H = −J z
Siz Si+δ = Si Si+δ + Si−Si+δ (153)
iδ 2
may be rewritten in terms of Boson operators as
X
H = +JS 2N ν + J a†i aia†i+δ ai+δ
iδ
X½ †
− JS ai ai + a†i+δ ai+δ
iδ ¾
+ fi(S)aifi+δ (S)ai+δ + a†i fi(S)a†i+δ fi(S) (154)
58
Once again, we will expand
v
u
u ni ni n2i
fi(S) = 1 −
t =1− − − ··· (155)
2S 4S 32S 2
and include terms in H only to O(a2)
X½ † ¾
2
H ' JS N ν − JS ai ai + a†i+δ ai+δ + aiai+δ + a†i a†i+δ
iδ
(156)
This Hamiltonian may be diagonalized using a Fourier trans-
form
1 X −ik·Ri
ai = √ e ak (157)
N k
and the Bogoliubov transform
†
ak = αk cosh uk − α−k sinh uk (158)
a†k = αk† cosh uk − α−k sinh uk (159)
Figure 36: The Neel state of an antiferromagnetic lattice. Due to zero point motion,
this is not the ground state of the Heisenberg Hamiltonian when J < 0 and S is finite.
61
E
f dθ ∝ S(k, ω) ∝ I {F(-i〈[a(t),a†(0)]〉)}
dΩdω ∼ n
n
Ei θ
sample
2θ ∝ k = k i − k j
thermal
spin- h ω = E i − Ef
polarized
neutrons
Figure 37: Polarized neutrons are used for two reasons. First if we look at only spin
flip events, then we can discriminate between phonon and magnon contributions to
S(k, ω). Second the dispersion may be anisotropic, so excitations with orthogonal
polarizations may disperse differently.
62
S(k,ω)
n↓ γ 〈ω k 〉
magnon k
n↑
ω ω
phonon junk k 2θ
subtracted off
Figure 38: Sketch of neutron structure factor from scattering off of a magnetic system.
The spin-wave peak is centered on the magnon dispersion. It has a width due to the
finite lifetime of magnon excitations.
†
a i a i a†i+δ a ⇒
i+δ
Figure 39:
63
Chapter 9: Electronic Transport
Onsager
Contents
1 Quasiparticle Propagation 2
1.1 Quasiparticle Equation of Motion and Effective Mass . . . . . . . . . 5
2 Currents in Bands 8
2.1 Current in an Insulator . . . . . . . . . . . . . . . . . . . . . . . . . . 8
2.2 Currents in a Metal . . . . . . . . . . . . . . . . . . . . . . . . . . . . 11
5 Conductivity of Metals 24
5.1 Drude Approximation . . . . . . . . . . . . . . . . . . . . . . . . . . 24
5.2 Conductivity Using the Linear Boltzmann Equation . . . . . . . . . . 25
6 Thermoelectric Effects 30
6.1 Linearized Boltzmann Equation . . . . . . . . . . . . . . . . . . . . . 31
1
6.2 Electric Current . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 32
6.3 Thermal and Energy Currents . . . . . . . . . . . . . . . . . . . . . . 34
6.4 Seebeck Effect, Thermocouples . . . . . . . . . . . . . . . . . . . . . 39
6.5 Peltier Effect . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 40
2
As we have seen, transport in insulators (of heat mostly)
is dominated by phonons. The thermal conductivity of some
insulators can be quite large (cf. diamond). However most
insulators have small and uninteresting transport properties.
Metals, on the other hand, with transport dominated by elec-
trons generally conduct both heat and charge quite well. In
addition the ability to conduct thermal, charge, and entropy
currents leads to interesting phenomena such as thermoelectric
effects.
1 Quasiparticle Propagation
3
V
e
V(r)
Figure 1: The dispersion is obtained from band structure E(k) = h̄ω(k) in which the
metal is approximated as an almost free gas of electrons interacting weakly with a
lattice potential V (r), but not with each other.
∆k ∆x ∼ 1 or ∆p ∆x ∼ h̄ , (3)
4
Re ψ(x)
Figure 2: If U (k) = cδ(k − k0 ) then ψ(x, t) ∝ ei(k0 ·x−ωt) and the quasiparticle is
delocalized.
then
1 Z∞ i(k·x−ω0 t− ∇k ω(k)|k (k−k0 )t)
ψ(x, t) ' √ dkU (k)e 0 (6)
2π −∞
5
U(k)
k0 k
6
it gains energy.
So
h̄δk = −eEδt (11)
h̄k̇ = −eE E.O.M (12)
This equation of motion is identical to that for free electrons
(c.f. Jackson); however, it may be shown to be applicable to
general Bloch states provided that E is smaller than the atomic
fields, and it must not vary in space or time too fast.
We may put this EOM in a more familiar form
1d 1 X ∂ 2E dkj
v̇i = (∇k E(k))i =
h̄ dt h̄ j ∂ki∂kj dt
1 X ∂ 2E
= (−eEj ) (13)
h̄ j ∂ki∂kj
This will have the form F = ma, if we define the mass tensor
1 =
1 =
1 ∂E
symmetric & real (14)
m∗ ij m∗ ji h̄2 ∂ki∂kj
7
which may be diagonalized to define three principle axes. In
the simple cubic case, the matrix will have the same element
along each principle direction and
∗ h̄2
m = d2 E
. (15)
dk 2
2
d E <0
dk2
2
d E =0
dk2
k
2
-π/a d E >0 π/a
dk2
m*
Figure 4:
8
2 Currents in Bands
Fermi surface
Figure 5: Different occupied states make different contributions to the current density.
E x
kx kx
Figure 6: The Fermi sea of a partially filled band will shift under the influence of an
applied field E. This destroys the inversion symmetry of the Fermi sea, causing a net
current.
11
2.2 Currents in a Metal
Now imagine that the band is almost full. Near the top of the
Ek E
E
E
holes
k D(E)
full states
Figure 7: Left: A nearly full simple band. States near the Fermi surface that can be
thermally excited have negative mass. Right: Density of states with holes at the top
which have positive charge and mass.
d2 E
band dk 2
< 0, so the mass is negative and the dispersion at the
top of the band is always also parabolic, so
h̄2k 2
E(k) = E0 − ¯¯ ∗ ¯¯ k = deviation from top! (24)
2 ¯¯mˆ¯¯
or
1d h̄k̇ eE
v̇ =∇k Ek = − ¯¯ ∗ ¯¯ = ¯¯ ∗ ¯¯ (25)
h̄ dt ¯m ¯
¯
¯m ¯
ˆ¯ ¯ ˆ¯
This is the EOM of a positively charged particle with positive
mass in an electric field E. I.e., holes at the top of the band
have positive mass.
12
We have just shown that a material with full bands is an
insulator (See Fig. 8 left). Ie., it carries no current, as least at
E E
T=0 T≠0
D(E) D(E)
Figure 8: An insulator form when the fermi energy falls in a gap of D(E). As the
temperature is raised, electrons are promoted over the gap, and both the electrons and
holes contribute to the conductivity which increases with temperature.
13
3 Scattering of Electrons in Bands
Figure 9: Drude thought that electrons scatter off the lattice yielding resistivity. Bloch
showed this to be wrong.
particle state may be defined from a sum over Bloch waves (de-
scribed by k) each of which is a stationary state and describe
14
the unperturbed propagation of electrons. Thus a perfect lat-
tice yields no resistivity. We can get resistivity in two ways.
(a) (b)
e
e
† k+q k+q
c † c a -q or c † c a q
k+q k k+q k -q
q
⇓
c †
c (a q + a†-q )
k+q k k k
Figure 10: Electrons do scatter from defects in the lattice or lattice vibrations. They
contribute to the resistivity, with the phonon contribution increasing with temperature,
and the defect contribution more-or-less constant.
15
Due to the strength of the electron-electron interaction and the
density of electrons, (2) should dominate. However, it is easy
to show, using the Pauli principle, that effect of (2) is quite
often negligible, so that we may return to regarding the pure
electronic system as a (perhaps renormalized) non-interacting
Fermi gas.
According to momentum and energy conservation Fig 11
E1 + E 2 = E 3 + E 4 k 1 + k 2 = k 3 + k 4 . (27)
E E2
1
1 2
k1 k2
k1 − k 3 = k 4 − k 2 (30)
17
E
k1 ky
k4
k3
EF 3
k2 k 1- k3
2 1
k 4- k2
4 k
x
D(E)
Figure 12: Momentum and energy conservation severley restrict the states that can
an electron can scatter with and into.
18
4 The Boltzmann Equation
V
e j defect
Figure 13: Electronic transport due to an applied field E, is limited by inelastic colli-
sions with lattice defects and phonons.
19
in V to return to
1
f0(k) = f (r, k, t)|E=0 = (32)
eβ(E(k)−EF ) + 1
As indicated,
To derive a form for f (r, k, t), we will consider length scales
◦
larger than atomic distances A, but smaller than distances in
which the field changes significantly. In this way the system
is considered essentially homogeneous with any inhomogeneity
driven by the external field. Now imagine that there is no scat-
tering (no defects, phonons), then since electrons are conserved
dr
r- dt
dt r
·
k - dk dt k hk = -eE
dt
t - dt t
dt ∂f
f (r, k, t) = f r − v dt, k + eE , t − dt + dt
h̄ ∂t S
(34)
For small dt we may expand
f ∂f ∂f
f (r, k, t) = f (r, k, t)−v · ∇r f +eE · ∇k − + (35)
h̄ ∂t ∂t S
or
∂f e ∂f
+ v · ∇ r f − E · ∇k f = Boltzmann Equation
∂t h̄ ∂t S
(36)
If the phonon and defect perturbations are small, time-independent,
and described by H, then the scattering rate from a Bloch state
k to k0 (occupied to unoccupied) is wk0k = 2π
h̄ |hk0 |H| ki|2.
Then
3Z
∂f (k) L
= d3 k 0 {(1 − f (k)) wkk0 f (k0)
∂t S 2π
− (1 − f (k0)) wk0k f (k)} (37)
21
Needless to say it is extremely difficult to solve these last two
coupled equations.
∇r f = 0 (42)
then
e ∂f f (k) − f0(k)
− E · ∇k f (k) = = − (43)
h̄ ∂t S τ (k)
ie
e
f (k) = f0(k) + τ (k)E · ∇k f (k) (44)
h̄
which may be solved iteratively, generating a power series in E
(or Ex).
k
1ST BZ y
δk x
k
x
Figure 16: According to the linear Boltzmann equation, the effect of a field E x is to
shift the Fermi surface by δkx = −eτ Eh̄x
Ex
δkx = −eτ (47)
h̄
From the discussion in Sec.??, it is clear that a finite current
results.
Interesting! Note that elastic scattering |k| = |k0| cannot
restore equilibrium. Rather they would only cause the Fermi
24
surface to expand. Inelastic scattering (i.e. from phonons) is
needed to explain relaxation.
Figure 17: Note that elastic scattering |k| = |k0 | cannot restore equilibrium. Rather
they would only cause the Fermi surface to expand.
5 Conductivity of Metals
26
V
E = E x̂
Figure 18: To calculate the conductivity, we apply a field in the x-direction only and
use the linearized Boltzmann Eqn.
k⊥ ∇k E
dk⊥ = dE
∇k E
ky
constant energy
surface
kx
Figure 19:
jx e2 Z vx2 (k)
σ = = dSE dE τ (k)δ(E − EF ) (58)
Ex 8π 3h̄ v(k)
e2 Z vx2 (k)
= dSE τ (k) . (59)
8π 3h̄ E=EF v(k)
As expected only the properties of the electrons on the Fermi
surface are relevant.
E=0
E≠0
Figure 20: Only the electrons near the fermi surface participate in the transport. Far
below the Fermi surface, pairs of states k and −k are occupied. Their contribution
to the conductivity cancels, leaving contributions to only the occupied states near the
fermi surface.
29
that the two mechanisms are independent, they must add
1 1 1
= + (63)
τ τph τdefect
i.e.
ρ = ρph + ρdefect Matthiesen’s Rule (64)
30
1
scattering to τ . Therefore, at T À θD (θD = debye temper-
ph
ature)
ρ = aT + ρdefect (67)
Ni
. 3 2%
+3 i
ρ Cu %N R
2 .16 i αT
Cu
+ %N
1 .12
+
Cu
Cu
5
αT
T
0.1
T Θ
D
Figure 21: The phonon and defect contributions to the resistivity add (left), and the
phonon contribution is linear at high temperatures T À θD .
6 Thermoelectric Effects
31
T1 T2
T1 ≠ T
x̂ 2
Figure 22: Thermoelctric effects are important in systems with both electric potential
and thermal gradients. We will assume both are in the x-direction
32
∂f
where in steady state ∂t → 0. After linearizing (replacing f
by f0 in the left-hand side), we get
( )
e
f (k) ' f0(k) − τ (k) v · ∇r f0 − E · ∇k f0 (69)
h̄
Then, as before
e e ∂f0 e ∂f0
E · ∇ k f0 = E · h̄v = Ex h̄vx (70)
h̄ h̄ ∂E h̄ ∂E
The spatial inhomogeneity is through ∇T , and in a semicon-
ductor for which EF depends strongly upon T , through ∇EF
∂f0 ∂f0 ∂f0 ∂f0
v· ∇r f0 = v· ∇T + ∇EF = v· ∇T − ∇EF
∂T ∂EF ∂T ∂E
(71)
Apparently ∇EF only contributes a term which modifies the
electric field dependence
∂f0 ∂f0 0
vx {eEx + (∇EF )x} ≡ vx eE (72)
∂E ∂E x
Of course, in a metal E 0 = E.
f
0
T=0
T≠0
E E
f
∂f0
Figure 23: The derivative ∂E
is only significant near the fermi surface.
34
Assuming that τ (E) ∼ τ (EF ), we get
2 e ∂T Z ∂f0
jx = σEx0+ τ (E F ) dEE D(E) (77)
3 m∗ ∂x ∂T
2 e ∂T
jx = σEx0 + τ (E F ) c v (T ) c v ∝ m ∗
(78)
3 m∗ ∂x
In general, this intuitive form is rewritten as
∂T
jx = σEx0 + L12
xx
− (79)
∂x
and from it we see that both an electric field (or the generalized
field strength E 0), and the thermal gradient contribute to the
electron current, jx.
35
so that
jQ = j E − E F jn − ejn = j (81)
36
T1 T1 ρ = σ-1
T1 = T
2 11
I≠0 j = L E′ j=σE
V≠0 12
j = L E′ 11
Q L =σ
V
I
Figure 24: Here, there is a potential gradient (V 6= 0) but no thermal gradient since
T1 = T2 . The electric field drives both electric and thermal currents. Thus, a heat
bath is required to keep both sides of the sample at the same temperature.
j = L11E
jQ = L12E (85)
V
I
³ ³ ´ ´
L12
Figure 25: Here j = 0 = L11 E 0 + L12 (−∇T ) and jQ = −L12 L11
+ L22 (−∇T )
³ ´
L12
where −L12 L11
+ L22 = κT
and
12
21 0 21 L
jQ = L E + L (−∇T ) = −L 11 + L22
22
(−∇T )
L
(88)
and since (you will show)
2eτ
L12 = − ∗
cv = L21 (89)
3m
³ ´
12 2
L
κ = L22 − (90)
L11
38
We could also measure the thermal conductivity by driving
a heat current through the sample, maintaining the ends at the
same potential (see Fig. 26 right). Here, we would find
T1 ≠ T
2
V1 V2 V2
V1
j
j Q
Q
V1 ≠ V V1 = V2 I
2
12
j = L (-∇T)
12 22
22 (L ) j = L (-∇T)
κ=L - 11
Q
L
22
κ=L
κ = L22 . (91)
39
David Mast measures κ by the method of the left of Fig. 26.
This yields the more conventional definition of κ.
metal A
T1 1 2 T
2
metal B T0 metal B
12
L dT
j = 0 = L11Ex + L22 (−∇T ) ⇒ Ex =
(93)
L11 dx
40
L12
where S = L11
is called the thermopower and is a property of
a material.
The potential measured around the loop is given by
Z 1 Z 2 Z 0
V = 0
EB dx + 1
EAdx + 2
EB dx (94)
or
∂T
Z 1 Z 0 ∂T Z 2 ∂T
V = SB 0 dx + 2 dx + SA 1 dx (95)
∂x ∂x ∂x
Z 1 ∂T Z 2 ∂T
= SB 2 dx + SA 1 dx (96)
∂x Z ∂x
T
= (SA − SB ) T 2 dT = (SA − SB ) (T2 − T1) (97)
1
R T2
Or if SA and SB are not T -independent V = T1 dT (SA − SB ).
So if T1 6= T2 and SA 6= SB , then the heat current in-
duces an emf! This is called the Seebeck effect ⇒ (solid state
thermometer with ice H2O as a reference).
41
metal A
1 2
j T0 metal B
metal B
0
Figure 28: An electrical current j is driven through the loop which is held at a fixed
temperature
µ ¶
∂T
∂x = 0 . Then
42
π j
j = A
Q A
j = (π A - π )j (π - π )j = j
Q B j A B Q
Q
=
π j
π Bj B
B T0 B
0
V1 V2
∂T
thermal field -
∂x
Figure 30: The thermal and electrical conductivities may be measured independently.
jQ = j E − E F jn (101)
1 Z 3
= d k (E − EF )v(k)f (k) (102)
8π 3
In the linear approximation to the Boltzmann equation for E x0 =
0, we get
1 Z 3 ∂f0 2 ∂T
jQ = 3 d k (E − EF ) v τ − (103)
8π ∂T x ∂x
∂f0
where E −EF and ∂T are odd in (E −EF ), for the Fermi liquid
∂T 1 Z d3 k ∂f0
2
j Q ' − v F τF (E − E F ) (104)
∂x 3 3π 3 ∂T
∂T 1
jQ = − vF3 τF cv (105)
∂x 3
44
∂f 0
(E - E )
F ∂T
EF E
1
κ = vF3 τF cv (106)
3
2
Now recall that for the Fermi liquid cv = kB π2 nkB k TT so that
B F
1 m∗vF2 π2 T π2 kB2 T
κ= τF kB nkB = τF n ∗ (107)
3 m∗ 2 EF 3 m
However, also for the Fermi liquid, we found that σ = e2τF mn∗ ,
so
κ π 2 kB 2
= T (108)
σ 3 e
Of course this relationship only holds in a good metal. There
are two reasons for this. First we are neglecting terms like
2
(L12)
σ in κ which are unimportant for a good metal (or if we
electrically short the sample.) Second, we are assuming that κ
is dominated by electronic transport.
45
Chapter 10: Superconductivity
May 9, 2001
Contents
1 Introduction 2
1.1 Evidence of a Phase Transition . . . . . . . . . . . . . . . . . . . . . 2
1.2 Meissner Effect . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3
3 Cooper Pairing 10
3.1 The Retarded Pairing Potential . . . . . . . . . . . . . . . . . . . . . 11
3.2 Scattering of Cooper Pairs . . . . . . . . . . . . . . . . . . . . . . . . 12
3.3 The Cooper Instability of the Fermi Sea . . . . . . . . . . . . . . . . 14
1
6 BCS ⇒ Superconducting Phenomenology 32
9 Tunnel Junctions 43
2
1 Introduction
3
C (J/mol°K)
Cn ∼ γT
CS
T T
c
Figure 1: The specific heat of a superconductor CS and and normal metal Cn . Below
the transition, the superconductor specific heat shows activated behavior, as if there is
a minimum energy for thermal excitations.
Cs ∼ e−β∆ (1)
I Z 1 Z ∂B
0 = IR = V = E · dl = S ∇ × E · dS = − S · dS , (2)
c ∂t
or, since S and C are arbitrary
1
0 = − Ḃ · S ⇒ Ḃ = 0 (3)
c
Thus, for an ideal conductor, it matters if it is field cooled or
zero field cooled. Where as for a superconductor, regardless
of the external field and its history, if T < Tc, then B = 0
inside the bulk. This effect, which uniquely distinguishes an
5
Ideal Conductor
Zero-Field Cooled Field Cooled
T > Tc T > Tc
B=0 B≠0
T < Tc T < Tc
B=0 B≠0
T<T T < Tc
c
B≠0 B=0
Figure 3: For an ideal conductor, flux penetration in the ground state depends on
whether the sample was cooled in a field through the transition.
µ−1
B = µH = 0 ⇒ µ = 0 M = χH = H (4)
4π
6
1
χSC = − (5)
4π
Ie., the measured χ, Fig. 4, in a superconducting metal is very
large and negative (diamagnetic). This can also be interpreted
χ ∝ D(E )
Pauli F
Tc
0
T
js
χ M
∼
∼ ∼
∼
H
-1
4π
Normal
Hc
S.C.
Tc T
8
mv̇ = −eE (7)
∂j
or, since ∂t = −ensv̇,
∂js e2ns
= E (First London Eqn.) (8)
∂t m
Then, using the Maxwell equation
1 ∂B m ∂js 1 ∂B
∇×E =− ⇒ ∇ × + =0 (9)
c ∂t ns e 2 ∂t c ∂t
or
∂ m 1
∇ × j s + B =0 (10)
∂t nse2 c
This described the behavior of an ideal conductor (for which
ρ = 0), but not the Meissner effect. To describe this, the
constant of integration must be chosen to be zero. Then
ns e 2
∇ × js = − B (Second London Eqn.) (11)
mc
m
or defining λL = ns e 2
, the London Equations become
B ∂js
= −λL∇ × js E = λL (12)
c ∂t
9
4π
If we now apply the Maxwell equation ∇×H = c j ⇒ ∇×B =
4π
c µj then we get
4π 4πµ
∇ × (∇ × B) = µ∇ × j = − 2 B (13)
c c λL
and
1 4πµ
∇ × (∇ × j) = − ∇×B=− 2 j (14)
λL c c λL
or since ∇ · B = 0, ∇ · j = 1c ∂ρ
∂t = 0 and ∇ × (∇ × a) =
4πµ 4πµ
∇2 B − B=0 ∇2 j − j=0 (15)
c2 λL c2 λL
x
SC
^ ^
∂Bx
B j ∝∇×B∝z×x
s ∂z
y z
j
Figure 6: A superconducting slab in an external field. The field penetrates into the
q
mc2
slab a distance ΛL = 4πne2 µ
.
10
Now consider a the superconductor in an external field shown
in Fig. 6. The field is only in the x-direction, and can vary in
4π
space only in the z-direction, then since ∇ × B = c µj, the
current is in the y-direction, so
∂ 2Bx 4πµ ∂ 2jsy 4πµ
− 2 Bx = 0 − jsy = 0 (16)
∂z 2 c λL ∂z 2 c2 λL
with the solutions
− z − Λz
Bx = B0xe ΛL jsy = jsy e L (17)
s s
c 2 λL mc2
ΛL = 4πµ = 4πne2 µ
is the penetration depth.
3 Cooper Pairing
11
3.1 The Retarded Pairing Potential
+
e-
ions
e- + + + + + +
region of + +
8 + + + +
vF ∼ 10 cm/s positive charge
attracts a second
+
electron
Figure 7: Origin of the retarded attractive potential. Electrons at the Fermi surface
travel with a high velocity vF . As they pass through the lattice (left), the positive ions
respond slowly. By the time they have reached their maximum excursion, the first
electron is far away, leaving behind a region of positive charge which attracts a second
electron.
ξ ∼ 1000Α°
e
- k↓
Figure 8: To take full advantage of the attractive potential illustrated in Fig. 7, the
spatial part of the electronic pair wave function is symmetric and hence nodeless. To
obey the Pauli principle, the spin part must then be antisymmetric or a singlet.
13
are attracted by the exchange of phonons. However, the max-
imum energy which may be exchanged in this way is ∼ h̄ωD .
Thus the scattering in phase space is restricted to a narrow
shell of energy width h̄ωD . Furthermore, the momentum in
k1
2
Ek ∼ k
k’1
k’ k’
1
2
ω
D
k’ k2
2 k2 k1
Figure 9: Pair states scattered by the exchange of phonons are restricted to a narrow
scattering shell of width h̄ωD around the Fermi surface.
scattering shell
k1 -k
2
K
Figure 10: If the pair has a finite center of mass momentum, so that k1 + k2 = K,
then there are few states which it can scatter into through the exchange of a phonon.
Now consider these two electrons above the Fermi surface. They
will obey the Schroedinger equation.
h̄2 2
− (∇1 + ∇22)ψ(r1r2) + V (r1r2)ψ(r1r2) = (² + 2EF )ψ(r1r2)
2m
(19)
If V = 0, then ² = 0, and
1 ik1·r1 1 ik2·r2 1 ik(r1−r2)
ψV =0 = e e = e , (20)
L3/2 L3/2 L3
15
where we assume that k1 = −k2 = k. For small V, we will
perturb around the V = 0 state, so that
1 X ik·(r1 −r2 )
ψ(r1r2) = g(k)e (21)
L3 k
The sum must be restricted so that
h̄2k2
EF < < EF + h̄ωD (22)
2m
this may be imposed by g(k), since |g(k)|2 is the probability of
finding an electron in a state k and the other in −k. Thus we
take
k < kF
g(k) = 0 for
√ (23)
2m(EF +h̄ωD )
k >
h̄
The Schroedinger equations may be converted to a k-space
equation by multiplying it by
1 Z 3 −ik0· r
dre ⇒ S.E. (24)
L3
so that
h̄2k 2 1 X
g(k) + 3 g(k0)Vkk0 = (² + 2EF )g(k) (25)
m L k0
where
Z
−k0 )·r 3
Vkk0 = V (r)e−i(k dr (26)
16
now describes the scattering from (k, −k) to (k0, −k0). It is
usually approximated as a constant for all k and k0 which obey
the Pauli-principle and scattering shell restrictions
2
h̄2 k2 h̄2 k0
−V0 EF < 2m , 2m < EF + h̄ωD
Vkk0 =
. (27)
0 otherwise
so
2 2
h̄ k V0 X
− + ² + 2EF
g(k) = −
3
g(k0) ≡ −A (28)
m L k0
or
−A h̄2 k2
g(k) = 2 2 (i.e. for EF < 2m < EF + h̄ωD )
− h̄ mk + ² + 2EF
(29)
Summing over k
V0 X A
= +A (30)
L3 k h̄2k2 − ² − 2EF
m
or
V0 X 1
1= (31)
L3 k h̄2k2 − ² − 2EF
m
This may be converted to a density of states integral on E =
h̄2 k2
2m
17
Z E +h̄ω
F D dE
1 = V0 EF
Z(EF ) (32)
2E − ² − 2EF
1 ² − 2h̄ω D
1 = V0Z(EF ) ln (33)
2 ²
2h̄ωD −2/(V0 Z(EF )) V0
² = ' −2h̄ω D e < 0, as → 0
1 − e2/(V0 Z(EF )) EF
(34)
18
N
2² less than that of the old state, since the Fermi surface is
renormalized by the formation of each Cooper pair.
or
|ψk i = uk |0ik + vk |1ik (39)
19
where vk2 = wk and u2k = 1 − wk . Then the BCS state, which
is a collection of these pairs, may be written as
Y
|φBCS i ' {uk |0ik + vk |1ik } . (40)
k
We will assume that uk , vk ∈ <. Physically this amounts to
taking the phase of the order parameter to be zero (or π), so
that it is real. However the validity of this assumption can only
be verified for a more microscopically based theory.
By the Pauli principle, the state (k ↑, −k ↓) can be, at most,
singly occupied, thus a (s = 21 ) Pauli representation is possible
1
0
|1ik =
|0ik =
(41)
0 k 1 k
Where σk+ and σk−, describe the creation and anhialation of the
state (k ↑, −k ↓)
0 1
σk+ = 12 (σk1 + iσk2 ) =
(42)
0 0
0 0
σk− = 12 (σk1 − iσk2 ) =
(43)
1 0
0
1
Of course σk+
=
1 k 0
σk+ |1ik = 0 σk+ |0ik = |1ik (44)
20
σk− |1ik = |0ik σk+ |0ik = 0 (45)
2 2
h k
0 ξ k = -E +
F 2m
23
4.2 The BCS Gap
h̄2 k 02
Then since ξk0 = 2m − EF , the smallest such excitation is just
∆Emin = 2∆ (64)
e
-k′↓
2
w = v2 = 1 vk′ =0
k′ k′
q
Figure 12: Breaking a pair requires an energy 2 ξk2 + ∆2 ≥ 2∆
superconductor normal
metal
Figure 13:
25
above. Since it is a single electron, its energy will be
r
Ek = ξk2 + ∆2 (65)
h̄2 k 02
For ξk2 À ∆, Ek = ξk = 2m − EF , which is just the energy of
a normal metal state. Thus for energies well above the gap, the
normal metal continuum is recovered for unpaired electrons.
To calculate the density of unpaired electron states, recall
that the density of states was determined by counting k-states.
These are unaffected by any phase transition. Thus it must be
that the number of states in d3k is equal.
3
kz d k
ky
3
π
L
k
x
Density of additional
∆ electron states only!
1 Ds Dn
Figure 15:
27
h̄2 k2
ξ < 0 are occupied since ξk = 2m − EF ).
V0 Z h̄ω
D Z(EF + ξ)dξ
∆ = ∆ √ 2 (70)
2 −h̄ωD ξ + ∆2
1 Z h̄ω dξ
= 0 D√ 2 (71)
V0Z(EF ) ξ + ∆2
1 h̄ω D
= sinh−1 (72)
V0Z(EF ) ∆
For small ∆,
h̄ωD 1
∼ e V0 Z(EF )
(73)
∆
1
− V Z(E
∆ ' h̄ωD e 0 F) (74)
sinh x
∼ ex
Figure 16:
28
5 Consequences of BCS and Experiment
∆E ∼ 2∆e−β2∆ T ¿ Tc (75)
∂∆E ∂β ∆2 −β2∆
C∼ ∼ 2e (76)
∂β ∂T T
29
superconductor I s - In
In B=0
cavity
10
hω
microwave
hω
hω = 2∆
B
Figure 17: If B > Bc or h̄ω > 2∆, then absorption reduces the intensity to the
normal-state value I = In . For B = 0 the microwave intensity within the cavity is
large so long as h̄ω < 2∆
30
k′↑ e
kT ∼ 2∆
e
-k′↓
Figure 18:
√ 2µ
2
¶
1 − 2f ξ + ∆ + EF , T . Thus for T 6= 0
( Ãr !)
1 Z h̄ω
D dξ
= 0 √ 2 2 2
1 − 2f ξ + ∆ + EF , T
V0Z(EF ) ξ + ∆2
(77)
√
Note that as ξ 2 + ∆2 ≥ 0, when β → ∞ we recover the
T = 0 result.
This equation may be solved for ∆(T ) and for Tc. To find Tc
In Pb
∆(T) Sn
∆(0)
Real SC data (reflectivity)
T/Tc
1
Figure 19: The evolution of the gap (as measured by reflectivity) as a function of tem-
perature. The BCS approximation is in reasonably good agreement with experiment.
31
T
consider this equation as Tc → 1, the first solution to the gap
equation, with ∆ = 0+, occurs at T = Tc. Here
1 Z h̄ω dξ ξ
= 0 D tanh (78)
V0Z(EF ) ξ 2kB Tc
which may be solved numerically to yield
1.14h̄ωD
1 = V0Z(EF ) ln (79)
k B Tc
kB Tc = 1.14h̄ωD e−1/{V0Z(EF )} (80)
32
5.3 The Isotope Effect
33
(r - r0 )/ΛL
j = j0 e
H H
• ⊗
S r0 j
0
dl
Λ
L
Figure 20: Integration contour within a long thick superconducting wire perpendicular
to a circulating magnetic field. The field only penetrates into the wire a distance Λ L .
4π
ΛL jc Hc = (86)
c
Since both Hc and jc ∝ ∆, they will share the temperature-
dependence of ∆.
At T = 0, we could also get an expression for Hc by noting
34
that, since the superconducting state excludes all flux,
1 1 2
(W n − W BCS ) = H (87)
L3 8π c
However, since we have earlier
1 1 2
(W n − W BCS ) = N (0)∆ , (88)
L3 2
we get
r
Hc = 2∆ πN (0) (89)
c
We can use this, and the relation derived above jc = 4πΛL Hc ,
35
This gives a similar result to what Ibach and Lüth get, but
for a completely different reason. Their argument is similar to
one originally proposed by Landau. Imagine that you have a
fluid which must flow around an obstacle of mass M . From the
perspective of the fluid, this is the same as an obstacle moving
in it. Suppose the obstacle makes an excitation of energy ² and
v M vP M
E
Figure 21: A superconducting fluid which must flow around an obstacle of mass M .
From the perspective of the fluid, this is the same as an obstacle, with a velocity equal
and opposite the fluids, moving in it.
E0 = E − ² P0 = P − p (94)
Figure 22: A large mass M moving with momentum P in a superfluid (a), creates an
excitation (b) of the fluid of energy ² and momentum p
36
P 02 P2 P·p p2
− =− + = E0 − E = ² (95)
2M 2M M 2M
P θ
v = P/M
P′
Figure 23:
pP cos θ p2
² = − (96)
M 2M
p2
² = pv cos θ − (97)
2M
If M → ∞ (a defect in the tube which carries the fluid could
have essentially an infinite mass) then
²
= v cos θ (98)
p
Then since cos θ ≤ 1
²
v ≥ (99)
p
Thus, if there is some minimum ²,then there is also a mini-
mum velocity below which such excitations of the fluid cannot
37
happen. For the superconductor
²min 2∆
vc = = (100)
p 2h̄kF
Or
ne
jc = envc = ∆ (101)
h̄kF
This is the same relation as we obtained with the previous
√
thermodynamic argument (within a factor 2). However, the
former argument is more proper, since it would apply even for
gapless superconductors, and it takes into account the fact that
the S.C. state is a collective phenomena ie., a minuet, not a
waltz of electric pairs.
4πn ξcp 3
∼ 108 (105)
3 2
other pairs have their center of mass.
Figure 24: Many electron pairs fall within the volume of a Cooper wavefunction.
This leads to a degree of correlation between the pairs and to rigidity of the pair
wavefunction.
39
The pairs are thus not independent of each other (regardless
of the BCS wave function approximation). In fact they are
specifically anchored to each other; ie., they maintain coherence
over a length scale of at least ξcp.
Normal Metal
SC 2
φBCS
ξ coh > ξ cp
Figure 25:
2e
j=− {ψp∗ψ ∗ + ψ ∗pψ} (106)
4m
where pair mass = 2m and pair charge = −2e.
2e
p = −ih̄∇ − A (107)
c
A current, or a CM momentum K, modifies the single pair state
1 X (r1 +r2 )/2 ik· (r1 −r2 )
ψ(r1, r2) = 3
g(k)eiK· e (108)
L k
ψ(K, r1, r2) = ψ(K = 0, r1, r2)eiK·R (109)
40
r1 +r2
where R = 2 is the cm coordinate and h̄K is the cm mo-
mentum. Thus
φ = K · (R1 + R2 + · · ·) (111)
∇ = ∇R + ∇r ≈ ∇R (112)
Thus
2e X ∗ 2eA
js ≈ Φ −ih̄∇ R + ΦBCS
4m ν BCS ν
c
∗
2eA ∗
+ΦBCS ih̄∇Rν +
ΦBCS (113)
c
or
2e 2 4eA 2X
js = − |Φ(0)| + 2h̄ |Φ(0)| ∇ Rν φ (114)
2m c ν
ne2
∇×j=− B (116)
mc
which is the second London equation which as we saw in Sec.??
leads to the Meissner effect. Thus the second London equation
can only be derived from the BCS theory by assuming that the
BCS state is spatially homogeneous.
42
superconducting loop
C
X
X
X
X
X
X X X
B
X X X X ΛL
X X
X
X X
Figure 26: Magnetic flux penetrating a superconducting loop is quantized. This may
be seen by integrating Eq. 114 along a contour within the superconducting bulk (a
distance ΛL from the surface).
single valued, so
XZ
∇Rν φ · dl = 2πN N ∈Z (119)
ν
so
e 2 ns Z e 2 ns Z eh̄ns
− A · dl = − B · ds = 2N π (121)
ms ms 2m
Ie., the flux in the loop is quantized.
43
9 Tunnel Junctions
V0
Figure 27:
0 0
ψa = A1eikx + B1e−ikx ψb = A2eik x + B2e−ik x
ψc = B3e−ikx (122)
√
2mE
k = in a & c (123)
r
h̄
2m(E − V0)
k0 = in b (124)
h̄
44
The coefficients are determined by the BC of continuity of ψ
and ψ 0 at the barriers x = 0 and x = d. If we take B3 = 1
and E < V0, so that
r
0 2m(E − V0)
k = iκ = (125)
h̄
then, the probability of having a particle tunnel from left to
right is
−1
|B3|2 1 1
1 k κ 2 1 k κ 2
Pl→r ∝ = = − − + + cosh 2κd
|B1|2 |B1|2 2 8 κ k 8 κ k
(126)
For large κd
−2
k κ
Pl→r ∝ 8 + e−2κd (127)
κ k r
−2
k κ
2d 2m(V 0 − E)
∝ 8 + exp − (128)
κ k
h̄
45
filled on the left-hand side and available on the right-hand side.
This, as can be seen in Fig. 28, is not always the case, es-
pecially in a conductor. Here, we must take into account the
densities of states and their occupation probabilities f . We will
be interested in applied voltages V which will shift the chemical
potential eV . To study the gap we will apply
S I N
E
eV
X N(E)
Figure 28: Electrons cannot tunnel accross the barrier since no unoccupied states are
available on the left with correspond in energy to occupied states on the right (and
vice-versa). However, the application of an appropriate bias voltage will promote the
state on the right in energy, inducing a current.
eV ∼ ∆ (129)
2∆ 4kB Tc
We know that k B Tc ∼ 4, ∆ ∼ 2 ∼ 10◦K. However typical
metallic densities of states have features on the scale of electron-
46
volts ∼ 104◦K. Thus, on this energy scale we may approximate
the metallic density of states as featureless.
EF
Figure 29: If eV= 0, but there is a small overlap of occupied and unoccupied states on
the left and right sides, then there still will be no current due to a balance of particle
hopping.
and
dI Z ∂f (² − eV )
∼ P Nr (EF ) d² Nl (²) (133)
dV ∂V
∂f
∼ eδ(² − eV − EF ) (T ¿ EF ) (134)
∂V
dI
' P Nr (EF )Nl (eV + EF ) (135)
dV
dI
Thus the low temperature differential conductance dV is a mea-
sure of the superconducting density of states.
dI
I
dV
V
∆/e V ∆/e
48
Chapter 11: Dielectric Properties of Materials
Lindhardt
May 8, 2002
Contents
1 Classical Dielectric Response of Materials 2
1.1 Conditions on ² . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4
1.2 Kramer’s Kronig Relations . . . . . . . . . . . . . . . . . . . . . . . 6
4 Excitons 19
1
Electromagnetic fields are essential probes of material prop-
erties
• IR absorption
• Spectroscopy
x << λ
x
0
E(x,t) ≅ E(x0 ,t)
Figure 1: If the average excursion of the electron is small compared to the wavelength
of the radiation < x >¿ λ, then we may ignore the wave-vector dependence of the
radiation so that ²(k, ω) ≈ ²(ω).
have
D(ω) = ²(ω)E(ω) (3)
where
Z
R
E(t) = dωe −iωt
E(ω) H(t) = dωe−iωt H(ω) (4)
Z
R
D(t) = dωe−iωt D(ω) B(t) = dωe−iωt B(ω) (5)
3
Z
−iωt 4π Z −iωt 1∂ Z
∇ × dωe H(ω) = dωe j(ω) + dωe−iωt D(ω)
c c ∂t
(7)
Z 4π 1
dωe−iωt ∇ × H(ω) − j(ω) − (−iω)D(ω) = 0 (8)
c c
4π ω
∇×H = j − i D(ω)
c c
4πσ ω
= E − i ²E
c µ
c
¶
ω c 4πσ
−i E ² −
c ω c = − iωc E ²̃
=
µ ¶ (9)
4π c iω 4π
c E σ − 4π c ² = c E σ̃
1.1 Conditions on ²
From the reality of D(t) and E(t), one has that E(+ω) =
E ∗(−ω) and D(ω) = D∗(−ω), hence for D = ²E,
4
Additional constraints are obtained from causality
Z
D(t) = dω²(ω)E(ω)e−iωt
0
−iωt dt iωt0
Z Z
= dω²(ω)e e E(t0)
2π
1 Z 0
= dtdω (²(ω) − 1 + 1) E(t0)e−iω(t−t ) (11)
2π
then we make the substitution
²−1
χ(ω) ≡ (12)
4π
µ ¶
R 1 0
D(t) = 2 dtdω χ(ω) + 4π E(t0)e−iω(t−t )
R 0
D(t) = E(t) + 2 dtdωχ(ω)E(t0 )e−ω(t−t ) (13)
R
Define G(t) ≡ 2 dωχ(ω)e−iωt , then
Z ∞
D(t) = E(t) + −∞
dt0G(t − t0)E(t0 ) (14)
5
no poles
τ<0
X X
poles
Figure 2: If χ(ω) is analytic in the upper half plane, then causality is assured.
contour in the upper half plane and obtain zero for the integral
since χ is analytic within and on the contour.
6
analytic inside and on the contour C, then
1 Z χ(ω 0) 0
χ(z) = ◦ dω (17)
2πi C z − ω 0
Then taking the contour shown in Fig. 3 and assuming that
ω’
ω.
Figure 3: The contour (left) used to demonstrate the Kramer’s Kronig Relations.
Since the contour must contain ω, we must deform the contour so that it avoids the
pole 1/(ω − ω 0 ).
<
χ(ω) ∼ 1 <
for large ω (in fact Reχ ∼ 1 <
and Imχ ∼ 1
) we
ω ω ω2
2.1 Transmission
~
ξ = ξ e-iω(t-nx/c)
0
~=1 ~=1
n
n
~n = n + iκ = √ε(ω)
~n = n2 +2inκ - κ 2 = ε + iε
1 2
2
ε = n - κ2
ε 1
ε = 2nκ
2
8
Imagine that a laser beam of known frequency is normally
incident upon a thin slab of some material (see Fig. 4), and we
are able to measure its transmitted intensity.
Upon passing through a boundary, part of the beam is re-
flected and part is transmitted (see Fig. 5). In the case of normal
incidence, it is easy to calculate the related coefficients from the
conditions of continuity of E⊥ and H⊥ = B⊥(µ = 1)
µ=1 µ=1
B0
B′
X
ξ
0 X
ξ″ ξ′
X
B″
1 ∂B iω
∇×E =− , ik × E = B (24)
c ∂t c
|nE⊥| = |B⊥| (25)
00 0
E0 + E = E
2
t= (26)
(E0 − E ) = nE 00 0
n+1
9
In this way the other coefficients may be calculated (see Fig. 6).
Accounting for multiple reflections the total transmitted field is
t = ~n 2+ 1
1
r ~
2n
t 1
1
t =
2 ~
n+1
r ~n - 1
t 2
2
r = 2
~n + 1
Figure 6: Multiple events contribute the the radiation transmitted through and reflected
from a thin slab.
ñω
E = E0t1t2eikd + E0t1r2r2t2e3ikd + · · · , where k =
c
(27)
E0t1t2eidñω/c
E= (28)
1 − r22e2idñω/c
If ñ ∼ 1 and d is small, then
2.2 Reflectivity
depends upon both ²1 and ²2(ω), and is hence much more dif-
ficult to analyze [See Frederick Wooten, Optical Properties
of Solids, (Academic Press, San Diego, 1972)].
11
2
~
n=1 R= n~ - 1
I ~
n +1
~
n
Figure 7: The coefficient of reflectivity (the ratio of the reflected to the incident inten-
sities) R/I depends upon both ²1 (ω) and ²2 (ω), making the analysis more complicated
than in the transmission experiment.
13
In order to learn more from such measurements, we need to
have detailed models of the materials and their corresponding
dielectric properties. For example, the electric field will interact
with the moving charges associated with lattice vibrations. At
the simplest level, we can model this as the interaction of iso-
lated dipoles composed of bound damped charge e∗ and length
s
p = e∗ s (40)
where the first term on the right-hand side is the damping force,
the second term is the restoring force, and the third term is the
external field. Furthermore, the polarization P is
N ∗ N
P = e s + αE (42)
V V
where α is the polarizability of the different molecules which
make up the material. It is to represent the polarizability of
rigid bodies. For example, (see Fig. 8) a metallic sphere has
α = a3 (43)
14
a
E=0
Figure 8: We may model our material as a system harmonically bound charge and of
metallic spheres with polarizablity α = a3 .
or
∗2
ne /µ
P (ω) = E(ω) + nα (47)
ω02 − ω 2 − iωγ
²∞ = 1 + 4πnα (50)
ωp2 ωp2
²0 = 1 + 4πnα + 2 = ²∞ + 2 (51)
ω0 ω0
ω02(²0 − ²∞)
²(ω) = ²∞ + 2 (52)
ω0 − ω 2 − iγω
For our causality arguments we must have no poles in the upper
complex half plane. This is satisfied since γ > 0. In addition,
we need
1
χ = (²(ω) − 1) → 0, as ω → ∞ (53)
4π
This means that ²∞ = 1 + 4πnα = 1. Of course α is finite.
The problem is that in making α = constant, we neglected the
electron mass. Ie., for our example of a metallic sphere, α < a3
for very high ω! (See Fig. 9) due to the finite electronic mass.
To analyze experiments ² is separated into real ²1 and imag-
inary parts ²2
4π (²0 − ²∞)ω02(ω02 − ω 2)
²1(ω) = − σ2 = ²∞ + (54)
ω (ω02 − ω 2)2 + γ 2ω 2
4π (²0 − ²∞)ω02γω
²2(ω) = σ1 = 2 (55)
ω (ω0 − ω 2)2 + γ 2ω 2
16
a
--
- - -
↑ ξ(ω)
Figure 9: For very high ω, α < a3 since the electrons have a finite mass and hence
cannot respond instantaneously to changes in the field.
ε1 (ω)
ε γ
0
ε∝
ω0 ω0 ω
Figure 10: Sketch of the real and imaginary parts of the dielectric response of the
harmonically bound charge model.
ωp2 1/τ π
σ1(ω) = . (65)
4 ω 2 + 1/τ 2
4 Excitons
∫ σ (ω) dω = ω2p /4
1
1/τ
phonons
Figure 11: A sketch of the optical conductivity of a metal at finite temperatures. The
low-frequency Drude peak is due to the coherent transport of electrons. The higher
frequency peak is due to incoherent scattering of electrons from phonons or electron-
electron interactions.
electron
Eg r
hole R
Figure 12: Excitons are hydrogenic bound states of an electron and a hole. In semi-
conductors with an indirect gap, such as Si, they can be very long lived, since a
phonon or defect must be involved in the recombination process to conserve momen-
tum. For example, in ultra-pure Si (≈ 1012 impurities per cc) the lifetime can exceed
τSi ≈ 10−5 s; whereas in direct-gap GaAs, the lifetime is much shorter τGaAs ≈ 10−9 s
and is generally limited by surface states. On the right, the coordinates of the exciton
in the center of mass are shown.
21
proximated as a hydrogenic atom.
+
e
°
∼ 10A
Figure 13: An exciton may be approximated as a hydrogenic atom if its radius is large
compared to the lattice spacing. In Si, the radius is large due to the reduced hole and
electron masses and the enhanced dielectric constant ² ≈ 10.
22
Chapter 12: Semiconductors
Contents
1 Band Structure 4
3 Doping of Semiconductors 12
1
Semiconductors are of obvious technological importance - so
much so, that a whole chapter will be dedicated to them.
Semiconductors are distinguished from metals in that they
have a gap at the Fermi surface, and are distinguished from
< 1eV . Most condensed
insulators in that the gap is small ∼
metal semiconductor insulator
Figure 1: There is no band gap at the Fermi energy in a metal, while there is a band
gap in an insulator. Semiconductors on the other hand have a band gap, but it is
much smaller than those found in insulators.
-Eg /2kT
Eg ∼e ∼n
Figure 2: This shows the temperature dependence for the excitation of electrons, thus
allowing the number of free carriers to vary with changes in temperature.
1 Band Structure
AB
Eg P
Eg ↓ T↓
S
sp3 ⇒ 4 electrons
per band
B valence band
r0 r
K
111
L Eg
110
K
000
Γ X
100
Si
L Γ X K Γ
Figure 4: Sketch of quasiparticle bands in Si (right) along the high symmetry directions
(left).
1 1 ∂ 2E(k)
= (5)
m∗ij h̄2 ∂ki∂kj
The situation in III-V semiconductors such as GaAs is sim-
ilar, in that covalent sp3 bands still form. However, the gap
is direct. For this reason GaAs makes more efficient optical
devices than does either Si or Ge. A particle-hole excitation
across the gap can readily recombine, emit a photon (which has
essentially no momentum) and conserve momentum in GaAs;
whereas, in an indirect gap semiconductor, this recombination
requires the addition creation or absorption of a phonon or some
5
Eg
Ge
L Γ X K Γ
material τexciton
GaAs 1ns(10−9 s)
Si 19µs(10−5 s)
Ge 1ms(10−3 s)
Table 1:
6
E g ∼ 1.5eV
GaAs
L Γ X K Γ
Figure 6: Sketch of quasiparticle bands in GaAs along the high symmetry directions
of the Brillouin zone. Note the direct, Γ → Γ, minimum gap energy. The nature of
the gap can be tuned with Al doping.
7
conduction
hω ∼ Eg
phonon vs k ≅ ω
photon ck ≅ ω ≅ E g/ h
valence
Figure 7: A particle-hole excitation across the gap can readily recombine, emit a
photon (which has essentially no momentum) and conserve momentum in a direct
gap semiconductor (left) such as GaAs. Whereas, in an indirect gap semiconductor
(right), this recombination requires the addition creation or absorption of a phonon
or some other lattice excitation to conserve momentum.
e+ v j n = # electrons
volume
E e-
p = # holes
v j volume
σ = e(nµ n + pµ r )
8
2 2
h k
∼E k ∼
Ec 2m*
µ ∼ eτ* Eg n
m
E 2 2
v ∼E k ∼ h k*
2m p
Figure 9:
h̄2 k2
the parabolic approximation Ek ' 2m∗ we found that D(E) =
3√
(2m∗ ) 2
2π 2 h̄3
E. Thus,
3
(2m∗n) 2 √
DC (E) = 2 3
E − EC (8)
2π h̄
µ ¶3
2m∗p 2 √
DV (E) = EV −E (9)
2π 2h̄3
for E > EC and E < EV respectively, and zero otherwise
EV < E < E C .
In an intrinsic (undoped) semiconductor n = p, and so EF
must lie in the band gap. However, if m∗n 6= m∗p (ie. DC 6=
DV ), then the chemical potential, EF , must be adjusted up or
down from the center of the gap so that n = p.
Furthermore, the carriers which are induced across the gap
are relatively high in energy, compared to kB T , since typically
9
E g = E C − E V À kB T .
1eV
∼ 10000◦ K À 300◦K ∼ T (10)
kB
>
Thus, assuming that E − EF ∼
Eg
À kB T
2
1 1
' = e−(E−EF )/kB T (11)
e(E−EF )/kB T + 1 e(E−EF )/kB T
ie., Boltzmann statistics. A similar relationship holds for holes
where −(E − EF ) ∼ > E g À kB T
2
1 ½
−(E−EF )/kB T
¾
1− ' 1− 1 − e = e−(E−EF )/kB T
e(E−EF )/kB T + 1
(12)
since (1 − f (E)) = f (−E) and e(E−EF )/kB T is small. Thus, the
concentration of electrons n
3
(2m∗n) 2 EF /kB T Z ∞ √ −E/kB T
n ' 3 e E − E C e dE (13)
2π 2h̄ EC
10
3
(2m∗n) 2 3 Z
−β(EC −EF ) ∞ 21 −x
= (kB T ) e
2 x e dx (14)
2π 2h̄3 0
∗ 3
2πm k T
n B
2
= 2 2
e−β(EC −EF ) = Nef
C −β(EC −EF )
fe (15)
h
Similarly
3
2πm∗p kB T 2 −β(EV −EF ) V −β(EV −EF )
p = 2 e = N ef f e (16)
h2
C V
where Nef f and Nef f are the partition functions for a classical
11
gives us further information.
3
kB T 2 µ ∗ ∗¶ 43 −βEg /2
ni = p i = 2 mn mp e (18)
2πh̄2
However, we already have relationships for n and p involving
EC and EV
C −β(EC −EF ) V β(EV −EF )
n = p = Nef fe = Nef fe (19)
V
Nef
e 2βEF
= C f eβ(EV +EC ) (20)
Nef f
or
1 1 NV
ef f
EF = (EV + EC ) + kB T ln C (21)
2 2 Nef f
∗
1 3 m p
EF = (EV + EC ) + kB T ln
∗
(22)
2 4 mn
Thus if m∗p 6= m∗n, the chemical potential EF in a semiconductor
is temperature dependent. Recall that this T -dependence was
important for the transport of a semiconductor in the presence
of a thermal gradient ∇T .
3 Doping of Semiconductors
Figure 10:
EC EF
EC
ED unoccupied at T= 0
(occupied by holes at T = 0)
occupied at T= 0 EA
EV EV EF
Figure 11:
14
4 Carrier Densities in Doped semiconductor
C V −βEg
np = Nef f Nef f e = n2i = p2i (27)
# ionized
EC
+ 0
ED ND = N D + ND
EF
# un-ionized
+ 0
EA NA = NA + NA
EV
Figure 12:
15
The probability that a donor/acceptor is occupied by an elec-
tron is determined by Fermi statistics
1
nD = ND0 = ND (29)
1 + eβ(ED −EF )
1
pA = NA0 = NA(1 − f (EA )) = NA (30)
1 + eβ(EF −EA)
To provide a solvable example, imagine that we have an n-type
semiconductor (no p-type dopants) so that NA = NA0 = NA+ =
0, then
C −β(EC −EF )
n = Nef fe (31)
ND = ND0 + ND+ (32)
1
ND0 = ND (33)
eβ(ED −EF ) + 1
Furthermore, charge neutrality requires that
n = p + ND+ (34)
ND+ À ni (35)
ie., many more carriers are provided by dopants than are ther-
mally excited over the entire gap, then as np = n2i , it must be
16
that ND+ À p so that
Ed
At low T ¿ kB
r
n' C e−βEd
ND Nef (41)
f
Ed
at higher T À kB
n = ND (42)