Sunteți pe pagina 1din 15

Fortschritte

der Physik
Fortschr. Phys. 63, No. 910, 644658 (2015) / DOI 10.1002/prop.201500023
Progress

Review Paper
of Physics

Functional analysis and quantum mechanics: an introduction


for physicists
Kedar S. Ranade

Received 4 May 2015, revised 4 May 2015, accepted 8 June 2015


Published online 20 August 2015

It is the intention of this tutorial to introduce basic


We give an introduction to certain topics from functional anal- concepts of functional analysis to physicists and to point
ysis which are relevant for physics in general and in particular out their influence on quantum mechanics. We aim at
for quantum mechanics. Starting from some examples, we dis- mathematical rigour in terminology, but we will leave out
cuss the theory of Hilbert spaces, spectral theory of unbounded proofs. Our approach is not necessarily consistent with a
mathematics textbook building up from bottom to top by
operators, distributions and their applications and present
proving every theorem from some basic axioms. Rather,
some facts from operator algebras. We do not give proofs, but
we strive at an intuitive understanding from concepts
present examples and analogies from physics which should be and analogies known either from basic linear algebra or
useful to get a feeling for the topics considered. used in basic quantum mechanics.
Prerequisites for reading this tutorial are a knowledge
of textbook quantum mechanics (such as wave mechan-
ics and Dirac notation) including some basic quantum
1 Introduction optics (harmonic oscillator, ladder operators, coherent
states etc.), but excluding quantum field theory; from
It is well-known that physics and mathematics are mathematics, knowledge of analysis and linear algebra is
closely interconnected sciences. New results in physics required.1
lead to new branches of mathematics, and new concepts
from mathematics are used to describe physical situa-
tions and to predict new physical results. In the study 2 Hilbert space theory
of physics quantum mechanics plays a fundamental
role for almost all fields of modern physics: from quan- The formalism of quantum mechanics uses the concept
tum optics, quantum information to solid-state physics, of a Hilbert space. In this section we will elaborate on the
high-energy physics, particle physics etc. A similar role theory and give a full classification of Hilbert spaces, i. e.
is enjoyed in mathematics by functional analysis in the we list (in some sense) all possible Hilbert spaces. The
theory of differential equations, numerical methods and mathematics covered here can be found in several text-
elsewhere. books on functional analysis, e. g. [13].
While every physicist knows quantum mechanics and
every mathematician knows functional analysis, they of-
ten do not know the connection between these two
fieldsboth of which were founded at the beginning
of the 20th century. While every student of physics at- 1 This manuscript is partially based on a series of lectures enti-
tends lectures on higher mathematics, such as analysis of tled Mathematische Aspekte der Quantenmechanik presented
one or several variables, linear algebra, differential equa- (in german) at the Institut fur Quantenphysik, Universitat Ulm,
tions and complex analysis, it is not so common that from April to August 2014.
he gets to know functional analysis. Though it is possi- Institut fur Quantenphysik, Universitat Ulm, and Center for Inte-
ble to understand the basics of quantum mechanics with grated Quantum Science and Technology (IQST ), Albert-Einstein-
pure linear algebra, a deeper understanding is gained Allee 11, D-89081 Ulm, Deutschland, Germany
by knowing at least the very structure of functional Corresponding author: E-mail: Kedar.Ranade@uni-ulm.de,
analysis. Phone: +49/731/50-22783

644 Wiley Online Library 


C 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim
Fortschritte
der Physik
Fortschr. Phys. 63, No. 910 (2015)
Progress

Review Paper
of Physics

2.1 Introductory examples function on [0; ) with a possibly complex eigenvalue ,


provided that Im > 0. How is this possible?
We start this tutorial by presenting four examples, which
show that the nave use of linear algebra fails in certain
situations.2 2.1.4 Eigenvalues of hermitian operators II

In the previous example, one may argue that the radius


2.1.1 Commutators and traces
cannot be negative and some part of the eigenfunction
is cut off. Now we give a more dramatic example. First
It is a simple exercise in linear algebra to show that
we consider an abstract hermitian operator A with eigen-
there holds Tr AB = Tr BA for any two matrices A and B
vector v and eigenvalue : Av = v. Hermiticity of A is
(prove it by coordinate representation), which may be
defined by Ax|y = x|Ay for all x, y H, and textbook
rewritten as a trace of a commutator: Tr [A, B] = 0. In
quantum mechanics (cf. e. g. Schwabl [10]) tells us that
quantum mechanics, one requires, by analogy to the
eigenvalues of hermitian operators are real by the follow-
Poisson bracket of classical mechanics, [x, p] = i1H for
ing simple argument: we calculate
position and momentum operators x and p, respectively
(known as canonical quantisation). Taking the trace on
v|v = v|v = Av|v = v|Av = v|v = v|v, (1)
both sides of this equation yields 0 = i dim H.3 So does
quantum mechanics really exist? so v|v = 0 yields = , and this implies R.
Now take a spinless pointlike one-dimensional par-
2.1.2 Distributions ticle in position space, which is described by a square-
integrable function on R. Consider the hermitian opera-
Consider a particle in a box (square-well potential) with tor A = x3 p + px3 with the usual x = x and p = i x

and
take the function (plotted in the diagram)
= 0, if |x| a and V (x) =
infinitely high walls: V (x)
otherwise. Let (x) = 4a15 5/2 (a x ) be the wavefunc-
2 2
1
f (x) = |x|3/2 e 4x2 .
1
tion in the interior part which shall vanish outside (2)
2
of the box. If we want to calculate the variance of
the energy H = H 2  H2 , we need the expectation Setting f (0) := 0, the function f is everywhere defined
4 4 
value of H 2 . From H 2  = 4m 2 x4 = 0, we have H  =
2
on R and smooth.4 Furtherunlike plane waves,
a
x=a  (x)H (x) dx = 0. Since H is strictly positive,
2 2
delta functions and the likeit is square-integrable
  2 
  dx = + x3 e 2x2 dx =
1
the variance H is negative, which obviously is impossi- and normalised: xR f (x)
ble. What is wrong here?  1 + x=0
e 2x2 x=0 = 1. For the derivative of g(x) := e 4x2 we find
1

g (x) = ( 14 x2 ) g(x) = 12 x3 g(x), thus for x > 0, there


2.1.3 Eigenvalues of hermitian operators I holds
 
 3 x3
Consider a radially-symmetric potential V ( r ) in three x3 px3/2 g(x) = x3 x5/2 + x3/2 g(x) and
i 2 2
space-dimensions. Using the ansatz ( r ) = R(r)Ylm (, )
(3)
in spherical coordinates we get the spherical harmon-
ics Ylm and by substituting R(r) = u(r) we get a radial  d  3/2 
r px3 x3/2 g(x) = x g(x)
Schrodinger equation for u(r) by adding a centrifugal i dx
 
potential  2mr
2
l(l+1)
to V (r). We thus have a radial position  3 1/2 x3
operator r and a corresponding momentum pr = i r
, = x + x3/2 g(x). (4)
i 2 2
which fulfil the canonical commutator relation. Now, pr
is hermitian, but (r) = e  r is a normalisable eigen-
i
As f is even, and each power of x and p changes sym-
metries once, there holds Af = i f for the function as
a whole. Altogether, A is a hermitian operator with an
2 For these and other related examples see the articles by Gieres eigenfunction f , but its eigenvalue is not a real number.
[4] and Bonneau et al. [5], which are very much recommended Where is the error?
to the reader.
3 For a real commutator (without the imaginary prefactor i), 4 It can be arbitrarily often differentiated. It should not bother us
one can use the ladder operators in a harmonic oscillator with that as a complex function there is an essential singularity in
[a, a ] = 1 in a similar fashion. the origin, since we only consider real functions here.


C 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim Wiley Online Library 645
Fortschritte
der Physik
K. S. Ranade: Functional analysis and quantum mechanicsan introduction
Progress
Review Paper

of Physics

sum is, i. e. in the course of calculation we secretly leave


0.7 the Hilbert space for some time, which is not allowed.
0.6

0.5
2.3 Hilbert spaces

0.4 We start with the definition of a Hilbert space. We will use


0.3
Diracs bra-ket notation when we find it appropriate.

0.2
Definition 1 (Hilbert space). A pre-Hilbert space is a vec-
tor space over some field K equipped with a scalar product
0.1 (or inner product), i. e. a function  |  : V V K with
the following three properties:
2 4
1. sesquilinearity: x|v + w = x|v + x|w and
Figure 1 A wavefunction with imaginary eigenvalue. v + w|x = v|x + w|x,
2. anti-symmetry: w|v = v|w and
2.2 Linear algebra and functional analysis 3. positive definiteness: v|v > 0 for v = 0.

In the previous examples we have used concepts A Hilbert space is a pre-Hilbert space which is com-
familiar from linear algebra, which work well in plete (in the sense of metric spaces, i. e. every Cauchy se-
finite-dimensional systems. But very often quantum- quence converges with a limit in the space itself). A real or
mechanical systems need infinite-dimensional Hilbert complex Hilbert space is a Hilbert space, where K = R or
spaces (e. g. for a particle in space). Let us summarise: K = C, respectively.5
Though completeness is not explicitly used in
1. We often use terminology and methods of linear al-
physics, it is necessary for certain mathematical the-
gebra, such as vector spaces, bases, scalar products,
orems. From now on, we shall only consider complex
orthogonality, matrices and their diagonalisation,
Hilbert spaces, and H shall always denote such a com-
operators etc.
plex Hilbert space. Hilbert spaces are essentially defined
2. Most quantum systemsnotable exceptions are spin
by their bases.
systems have infinite dimension: a free particle, a
particle in a box, a particle in a harmonic oscillator, Definition 2 (Bases of Hilbert spaces). A basis (more pre-
the Hydrogen atom etc. cisely, an orthonormal basis) of H is a set B of vectors in H,
3. A mathematically rigorous description thus must such that there holds (i) v|w = v,w for all v, w B and

make use of functional analysis, in some sense lin- (ii) x = vB xx|v for all x H.
ear algebra in infinite-dimensional vector spaces. A Hilbert space which is very important for quantum
4. Many aspects of linear algebra extend to functional mechanics is the space L2 (R) of square-integrable func-
analysis, but not all, and even if they do, several the- tions.6 Note that the uncountable position and momen-
orems and statements are much more complicated tum bases are not true bases, since neither plane waves
than is expected at first sight. nor delta functions are square-integrable functions.
It is very easy to give a formal classification of Hilbert
At this point we cannot show how to resolve the prob-
spaces in purely mathematical terms. However, it will be
lems posed in detail, for which we refer to the references
already mentioned, but we can give some brief explana-
tion: In the first example the trace is simply not defined 5 Note that the notation in mathematics and physics differs: the
on all operators on a Hilbert space, but only on a subset
complex conjugate of z C is denoted z in mathematics and
called the trace-class operators. In the second example
z in physics, the adjoint A and A , respectively. The scalar
we get by differentiating delta distributions at a and a, product in usually linear in the left component in mathemat-
which we cannot ignore in our calculation. The third ex- ics and in the right component in physics. In older mathemat-
ample show, that on infinite-dimensional spaces we have ics literature and sometimes in physics Hilbert spaces are occa-
to consider domains of definition. In the fourth exam- sionally required to be infinite-dimensional and/or separable;
ple, we presented the applications of x3 p and px3 on the cf. e. g. von Neumann [11] or Scheck [12].
1/4x2
function separately; these map f to e 22 i (x3/2 3x1/2 ), 6 This is strictly speaking not true; a precise definition will follow,
respectively, which are not square-integrable, but their when we discuss measure and integration theory.

646 www.fp-journal.org 
C 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim
Fortschritte
der Physik
Fortschr. Phys. 63, No. 910 (2015)
Progress

Review Paper
of Physics

seen that it is not the different Hilbert spaces which dis- zero (otherwise divergent) and moreover, any series con-
tinguishes different physical systems, but the operators structed out of these non-zero elements converges abso-
on these Hilbert spaces. lutely. The dimension of 2 (I) then is |I|, the scalar prod-

uct x|y := iI xi yi . For example, the set of reals R is
Theorem 1 (Classification of Hilbert spaces). For any pos-
uncountable, so that 2 (R) is inseparable.
sible cardinality (i. e. number of elements) of a set there
To conclude this section: In quantum mechanics we
exists up to an isomorphism precisely one Hilbert space; in
essentially deal with a single Hilbert space, the separa-
particular, two Hilbert spaces are isomorphic, if and only
ble, infinite-dimensional Hilbert spacethough it is not
if their dimensions coincide.7
the Hilbert space itself, but rather the structure of opera-
The isomorphism concept can be understood as in- tors on this space which distinguishes different physical
distinguishable by inner structure, i. e. by Hilbert space situations.
operations only; in the well-known setting of square-
integrable functions we may always think of all func-
tions, some of which lie inside Hilbert space and others 3 Spectral theory
outside (are non-normalisable), but in a pure Hilbert-
space setting such considerations are not allowed. For a Spectral theory most prominently deals with the general-
positive integer d, the Hilbert space Cd of column vec- isation of the spectral theorem from linear algebrathe
tors with d complex entries is essentially the only Hilbert diagonalisation of hermitian (or normal) matricesto
space with dim H = d. A Hilbert space is called separa- infinite-dimensional spaces. To understand these things,
ble, if its dimension is countable, i. e. less or equal of the a basic knowledge of measure and integration theory is
cardinality of the integers.8 Thus, up to an isomorphism necessary, which will be introduced in the first two sub-
there exists precisely one infinite-dimensional separable sections. Then we can finally state the precise meaning
Hilbert space. of the Hilbert space L2 (R).
Consider for example a single particle in one space di-
mension: the Hilbert space is the set of square-integrable
3.1 Measure theory
functions L2 (R). Now change to the Fock basis, the eigen-
states of the harmonic oscillator {|n| n N0 }, which is
Measure theory deals with the problem of associating a
a countable and infinite basis. But obviously we have
measure to certain sets.9 Measures can be thought of the
by this an isomorphism to the Hilbert space 2 (N0 ) of
length of a distance, the area of a surface or the volume of
square-integrable sequences. Note that this is still a ba-
some space. Thus, given a set (think of Rn ), one could
sis regardless of the potential (though not necessarily an
envision a function : P( ) [0; ] (where P( ) is the
eigenbasis), so that L2 (R) is separable. Further, finite or
power set, the set of all subsets of ) which maps to each
countable unions and cartesian products of countable
set its volumewhich is non-negative. However, there
sets are still countable; thus direct sums and tensor prod-
are certain pitfalls, which are best illustrated by the fol-
ucts of finitely and countably many separable Hilbert
lowing paradox (in mathematics, this is related to the ax-
spaces are separable.
iom of choice).
Although inseparable Hilbert spaces are not so com-
mon in physics, it is very easy to construct them. To con- Theorem 2 (Banach-Tarski paradox). Let A, B Rn be
struct a Hilbert space for an arbitrary given dimension, arbitrary sets with non-empty interior. Then, there ex-
choose an arbitrary index set I of that cardinality and let ist countably many sets A1 , A2 , . . . and rigid mo-


tions10 m1 , m2 , . . . , such that A = nN An and B =

2 (I) := (xi )iI CI  |xi |2 < . (5) nN mn (An ), where both decompositions are disjoint, i. e.
iI
Ai Aj = and mi (Ai ) mj (Aj ) = for i = j. For A, B
The sum of possibly uncountably many terms is to be Rn bounded and n 3, this is even possible with finitely
understood in the following sense of summability: it many sets and rigid motions.
is defined only if at most countably many xi are non-

7 The proof is rather simple, it essentially reads Map one basis bi- 9 Measure theory is not to be confused with the theory of mea-
jectively onto another. surements in quantum physics.
8 This notion of separability is completely unrelated to the same 10 These are bijective mappings m on Rn which preserve
i
term from entanglement theory. Euclidean distances, i. e. d(mi (x), mi (y)) = d(x, y).


C 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim Wiley Online Library 647
Fortschritte
der Physik
K. S. Ranade: Functional analysis and quantum mechanicsan introduction
Progress
Review Paper

of Physics

It is actually possible to cut a sphere in just five are orthogonal (these measures will be used in the
parts, which can be reassembled to two spheres both spectral theorem below).
of the same size as the original sphere. To avoid such
seemingly absurd effects, one restricts the measure func-
tion to a class of measurable sets and calls ( , ) 3.2 Integration theory
a measurable space, if P( ) forms a -algebra. A
-algebra must fulfil three conditions: (i) it contains the Usually while learning mathematics in school one starts
basic set: , (ii) it contains every complement: A with the Riemann integral of a function. The preimage
\ A and (iii) it contains countable(!) unions: [a, b] of a bounded function to be integrated, e. g. f :

A1 , A2 , nN An . [a, b] R is partitioned into smaller and smaller inter-
A measure now is a function : [0; ], which vals, and the limit of upper and lower sums, if they both
maps to each measurable set a non-negative number exist and coincide, is called the Riemann integral of f . We
or infinity and fulfils the following axioms: (i) () = 0 shall now introduce the Lebesgue integral.
and (ii) for countably many disjoint sets (An )nN there The guiding principle of the Lebesgue integral is to

holds -additivity: ( n=1 An ) = n=1 (An ). The triple partition the image instead of the preimage space. For
( , , ) is called a measure space. a measurable set M the characteristic function (or
Examples of measures are: indicator function) M is defined by M (x) = 1, if x
M, and zero otherwise. The integral for such a func-
r 
the counting measure | | on arbitrary sets, i. e. the car- tion is then defined as M (x) d(x) := (M) and for
n
dinality (number of elements) of a set; a step function f (x) = i=1 ai Ai (x) with positive ai ac-
r  
the Lebesgue measure on Rn is defined by
  cordingly by f (x) d(x) := ni=1 ai (Ai ). For a general
[a1 ; b1 ] [an ; bn ] = ni=1 (bi ai ) and ex- non-negative function f the integral is defined by ap-
tending it (as a so-called regular measure) to other proximation from below as the supremum of all non-
Lebesgue-measurable sets by exhaustion. negative step functions majorised by f . General real-
valued functions are split into positive and negative part
 
The Lebesgue measure is very important, and the set by f = f+ f with f (x) := max f (x), 0 ), and the inte-
  
of measurable sets is very large; it is rather difficult to find gral is given by f := f+ f , if both parts exist sep-
a set which is not measurable (indeed, without invoking arately; complex-valued functions are further split into
the axiom of choice, it is not possible to construct a non- real an imaginary part.
measurable set). Important are null sets, i. e. sets of mea-
sure zero and their subsets; a property is said to hold al- Theorem 3 (Riemann-integrable functions). A bounded
most everywhere, if it holds everywhere with the possible function f : [a; b] R is integrable in the sense of
exception of a nullset. Finally, note that every countable Riemann, if and only if the set of discontinuities has van-
set is a Lebesgue nullset, but not vice versa (e. g. the Can- ishing Lebesgue measure.
tor set). An example of a function, which is integrable accord-
Measures for which there hold ( ) = 1 are called ing to Lebesgue, but not in the sense of Riemann, is the
probability measures and are the foundation of Kolmogo- Dirichlet function Q on R. However, there may exist
roffs probability theory. There are some generalisations improper integrals in the Riemann sense for functions
of the notion of a measure to be informally mentioned which do not possess a Lebesgue integral, e. g. sinc(x) =
here: for a signed measure, the measure of a set may be sin x
on R; here, positive and negative part diverge, but
x
negative (in R), and for a complex measure, the measure the limit x is possible (compare the conditionally
of a set may be complex (in C). More important in quan- convergent series an := (1)
n
).
n
tum physics, in particular in quantum information, are
operator-valued measures, where the measure of a set is Definition 3 (Lp spaces).  For p [1; )
 p and a measur-
not a number, but an operator on a Hilbert space: able functions f , define f p := [ f (x) d)]1/p . The set
Lp( , , ) is the set of functions f : C, for which

r a positive operator-valued measure (POVM) maps f  is finite. If N ( , , ) denotes the set of all func-
p
a set to a positive operator, such that ( ) = 1 tions which are zero almost everywhere (in the sense men-
(this may be thought of as a generalised probability tioned above), we can identify two functions f and g,
measure); if f g N ( , , ). The set of equivalence classes of
r a spectral measure is a POVM with the additional functions which are identical almost everywhere is the set
property that E1 E2 = implies that (E1 ) and (E2 ) Lp ( , , ) := Lp ( , , )/N ( , , ).

648 www.fp-journal.org 
C 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim
Fortschritte
der Physik
Fortschr. Phys. 63, No. 910 (2015)
Progress

Review Paper
of Physics

Instead of always writing Lp ( , , ), one uses short- of Hilbert space, i. e. x


/ H. At least such cases must
hand notations like Lp ( ). Note that the Hilbert space be excluded by restricting to an appropriate domain of
L2 ( ) is different from the set of square-integrable func- definition D(A).
tions L2 ( ). In particular, since a set with one element An operator A : D(A) H with domain of definition
{x} has Lebesgue measure zero, for a function f Lp ( ), D(A) Hwhich always shall be a dense subset of H
f (x) is strictly speaking not defined. Considering that is called hermitian (in mathematics the term symmetric
in physics we only deal with integrals of L2 functions, is more common), if
it is also resonable not to distinguish between func-
tions which integrated over every subset give the same w|Av = Aw|v for all v, w D(A). (7)
value.
Due to the Hellinger-Toeplitz theorem, D(A) = H is in
this case only possible, if A is bounded. The operators
3.3 Operators and their domains from quantum mechanics (x, p, H etc.) are mostly un-
bounded, and their domains are dense subsets of H.
After introducing the Lebesgue integral, we can come The adjoint operator of A has (precisely) the domain of
back to spectral theory. In the examples we presented a definition
hermitian operator with an imaginary eigenvalue. On the
other hand, one requires physical quantities like energy
D(A ) := {w H| v  w|Av is continuous on D(A)} ,
etc. to be real. The crucial point is that the terms her-
mitian and self-adjoint are different for unbounded (8)
operators, while they coincide for matrices and, more
general, bounded operators. Only for the self-adjoint op- and is on this domain defined as follows: For w D(A ),
erators the spectral theorem holds. the map v  w|Av can uniquely be extended to H and
A linear mapping A : H H on a Hilbert space can by the Riesz-Frechet theorem (see below), there exists a
either be bounded or unbounded. Such mapping is z H, such that z|v = w|Av; we then define A w := z.
bounded, if the operator norm An operator is self-adjoint (rarely also hypermaximal her-
mitian), if A = A , where in particular the domains of def-
Ax inition coincide, i. e. D(A) = D(A ); only for these opera-
A := sup (6)
x =0 x tors the spectral theorem holds.
The (algebraic) dual of a vector space E over some
exists, i. e. it is finite. (For example, the operator norm of a field K is defined as the space of linear functionals E
hermitian matrix is the maximum of the absolute values K ; for a normed space the (topological) dual E of E con-
of the eigenvalues.). sists of the functionals which are additionally required to
It can be shown that linear operators are bounded, be continuous.
if and only if they are continuous. In particular, all
Theorem 4 (Riesz-Frechet theorem). Let H be a Hilbert
linear operators on finite-dimensional Hilbert spaces
space and consider some H. The mapping  |
essentially matricesare bounded and thus continuous.
is in H , and, on the other hand, every element of H can
Examples of unbounded operators from physics are po-
be written in this form by some H.
sition and momentum operator and many Hamiltonians
(free particle, harmonic oscillator etc.). In physics, one denotes elements of H as ket vec-
Several theorems of linear algebra (in particular, the tors | and elements of the dual H as bra vectors |;
spectral theorem) can be extended to the set of bounded the stated theorem is the mathematical reason for Diracs
operators (sometimes to a subset known as compact op- formalism.
erators) and in the mathematical theory one only later
discusses unbounded operators. We shall not present
proofs here and thus deal only with the general case. 3.4 Multiplication operators
We consider an unbounded linear operator A. In gen-
eral, such an operator is not defined on the Hilbert Given an operator A, it is not always easy to determine
space, but only on a subset, its domain D(A). For ex- D(A ). An example, where the domain of definition of the
ample, the position operator in positions representa- adjoint is straightforward to specify, is the case of mul-
tion (x)(x) = x(x) may (by multiplication) throw some tiplication operators. Let ( , , ) be a measure space,
L2 function with asymptotics (x) |x| 1
for |x|  1 out e. g. = R with the usual Lebesgue measure, and let


C 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim Wiley Online Library 649
Fortschritte
der Physik
K. S. Ranade: Functional analysis and quantum mechanicsan introduction
Progress
Review Paper

of Physics

f : R be a measurable function. The Hilbert space On this set, the resolvent (mapping) is RA () := (1
again is L2 ( ) := L2 ( , , ). A)1 . The spectrum (A) := C \ (A) is the complement
We can define a multiplication operator Af : D(Af ) of the resolvent set; it is closed and can be separated into
H by (Af g)(x) := f (x) g(x) on the domain D(Af ) := three disjoint parts:
 
g L2 ( )| f g L2 ( ) . Obviously, this is the largest
1. the point spectrum: 1 A is non-injective (i. e. there
possible domain of definition for Af , since we require the
are eigenvectors);
image of Af to lie in L2 ( ). It can be shown that this op-
2. the continuous spectrum: 1 A is injective, but not
erator is selfadjoint with D(A ) = D(A), and the spectral
surjective, and possesses a dense image;
theorem will tell us, that every selfadjoint operator can
3. the residual spectrum: 1 A is injective, but not sur-
be written in this fashion. One example for such an oper-
jective and its image is not dense.
ator is the position operator x.
The residual spectrum of a selfadjoint operator is
empty, but e. g. zero is in the residual spectrum of the
3.5 Extensions of operators raising operator a+ . The spectrum of the lowering opera-
tor a is C and a pure point spectrum (cf. coherent states).
Sometimes, for non-selfadjoint operators, there exist The principal property of selfadjoint operators is that
selfadjoint extensions. An operator B is called an exten- for these the spectral theorem holds, the diagonalisa-
sion of another operator A, if D(A) D(B) and A = B|D(A) tion of operators. There are several forms of the spectral
(B restricted to the domain of A); this is usually denoted theorem, but we shall discuss only two of them. Let
as A B. The question of whether selfadjoint extensions A be a selfadjoint operator on an appropriate domain
exist, is answered by the theory of defect indices. Let z D(A):11
be arbitrary complex numbers with positive and nega-
1. There exists a measure space ( , , ), a measurable
tive imaginary part, respectively (one usually takes z =
function f : R and a unitary operator U : H
i). The defect indices areindependent of the choice
  L2 (), such that (i) x D(A) f Ux L2 () and (ii)
of z given by n := dim D(A )| A = z N0 .
UAU = f for L2 ( ) with some f L2 ( ).
An operator is selfadjoint, if and only if n+ = n = 0;
2. There exists a unique spectral measure E : B(H)
there exist selfadjoint extensions, if and only if n+ =
(i.e. the bounded operators on H) such that y|Ax =
n > 0, but not otherwise. 
R dy|E x for x D(A) and y H; here, a spec-
A semibounded operator is an operator which ful-
tral measure is a mapping from subsets of R (the
fils v|Av C v2 (and therefore is hermitian) for all
so-called Borel- -algebra), where EA are orthogonal
v D(A) (or, similarly ). These operators possess self-
projections, E = 0, ER = 1 and for pairwise disjoint
adjoint extensions with the same constant C (Friedrichs 
A1 , A2 , . . . there holds
i=1 EAi = E i=1 Ai point-
extension). Note that Hamiltonian operators are usually
wise.
semibounded because there should exist a ground-state
energy. The first version informally says that every selfadjoint
operator can be understood as multiplication operator
on some Hilbert space. The second version allows us to
3.6 Spectral theorem apply a measurable function f : R R to A: in the spec-

tral integral replace by f (), i. e. R f () dy|E x.
For a matrix A, the spectrum is the set of eigenvalues, and
C is an eigenvalue of A, if and only if det(1 A) = 0,
i. e., if 1 A is not invertible. For operators on infinite- 4 Distributions
dimensional Hilbert spaces, there are different classes of
non-invertibility defining different parts of the spec- In classical physics concepts like point charges
trum. We shall start with the complement of the spec- ( r ) = q ( r ) in electrodynamics, line and area charges,
trum, the resolvent set. For an operator A : D(A) H the currents etc. are commonly used. In quantum mechan-
resolvent set is defined by ics the eigenstates of the position operator x (in position

 11 This and related statements usually also hold for normal oper-
(A) := C| 1 A : D(A) H is bijective and
 ators, i. e. operators N, for which NN = N N, but where the
(1 A)1 is bounded . (9) spectrum does not need to be real.

650 www.fp-journal.org 
C 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim
Fortschritte
der Physik
Fortschr. Phys. 63, No. 910 (2015)
Progress

Review Paper
of Physics

representation) are said to be delta functions, because vice versa, for every T H there exists some , such that

x(x) = x(x), those of the momentum operator p = i x T = T .
ikx
plane waves e . In both cases, these eigenfunctions are
non-normalisable or not even true functions and thus
To each element of a Hilbert space there is associated
lie outside of the Hilbert space H = L2 (R). Nevertheless,
a unique element of its dual and vice versa, i. e. there is
one regularly writes for position and momentum the
some type of symmetry between H and H ; they are iso-
equalities
morphic, H = H , and of the same size.
 
In case of the Hilbert space L2 (R), the explicit form
|xx| dx = |pp| dp = 1H (10) 
xR pR
of the functional Tf is  f (x) (x) dx. Now, if we
choose a subset  Hlimiting the elements allowed
and in the domain of definition, the number of admissible
  functionals  increases. The space  is in general not
x |xx| dx = x, p |pp| dp = p, (11) a Hilbert space any more, and not all functionals can be
xR pR
written in the form of an integral. By an appropriately
which do work well in calculations. Similarly, for a limited choice of  (the test functions), we get a very
potential well V ( r ), one usually distinguishes between large set  (the distributions, which test the test
bound discrete and free continuous eigenstates and functions). A second aspect is that the functionals in
often writes the identities  need to be continuous; (sequential) continuity of f



 is the property that x x0 in  implies f (x) f (x0 )
|EE| dE = 1H and E|EE| dE = H. (12) in C. If one would like to have many continuous func-
tionals, there must be not too many convergent series
But how to mathematically understand these things? In in . This will be achieved by an appropriate notion of
the 1930s Dirac postulated a delta function with the continuity.

property that (x) = 0 for x = 0, but (x) dx = 1; for Before going into details, we shall give an overview on

some other function f , this implies f (x)(x) dx = f (0). various mathematical structures in functional analysis in
It is obvious that strictly speaking no such func- the following in Table 1; usually a set with some function
tion can exist, but the concept is nevertheless has a specific name (such as Hilbert space).
very useful in physics. The mathematical theory of For the details we must refer to the usual textbooks,
such objects as the delta function is the theory of but we give some notes here:
distributions (or generalised functions), which was devel-
oped already in the 1940s by S. L. Sobolev and Laurent
Schwartz.
1. A structure in an upper row induces a structure in
a lower below; there sometimes is the possibility to
go from a lower row to a higher one (parallelogram
4.1 Introduction
equality, metrisability).
2. Scalar products and norms need a vector space in or-
The basic idea of distributions in mathematics is, not
der to be defined, metrics and topologies can be de-
to consider the delta function and similar objects as
fined on sets in general.
functions but as functionalsobjects mapping vectors
3. Weakening the axioms of a scalar product yields no-
to scalars, as e. g. the delta function mapping the func-
tions of bilinear and sesquilinear forms; without defi-
tion f to a complex number f (0); in physics one would
niteness norms and metrics become semi-norms and
say that a delta function is defined only inside an inte-
semi-metrics. (Note that the term metric is used in a
gral. Mathematically, one can give a one-sentence defini-
different fashion in differential geometry.)
tion of distributions: they are the continuous linear func-
4. Completeness is a notion in metric spaces (because
tionals on the space of test functions. Before explaining
of Cauchy sequences). In topological spaces other no-
these terms in detail, we shall rephrase the Riesz-Frechet
tions of convergence (net or filter convergence, which
theorem in a slightly different fashion.
are equivalent) can be used.
Theorem 5 (Riesz-Frechet theorem). Let H be a Hilbert 5. Characteristic notions for Hilbert spaces are orthogo-
space and let H be its dual, i. e. the set of continuous lin- nality, for normed spaces duality (algebraic, topolog-
ear functionals H C. For every H, the functional ical), for metric spaces sequences, Cauchy sequences
T : H C with T () = | is an element of H , and, etc.


C 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim Wiley Online Library 651
Fortschritte
der Physik
K. S. Ranade: Functional analysis and quantum mechanicsan introduction
Progress
Review Paper

of Physics

Table 1 Mathematical structures.

Function Name of the space Name, if complete



scalar product | : V V K pre-Hilbert space Hilbert space
vector space V
norm  : V R+
0 normed space Banach space

metric d : X X R+
0 metric space complete metric space
general set X
topology system of open sets topological space

4.2 Notions of convergence set of C -functions with compact support in :

In a very general sense, continuity in mathematics is a D( ) := Cc ( )


 
property of topological spaces (Pre-images of open sets = f : C| f C ( ), supp f compact . (14)
are open), but in metric spaces the sequential conti-
nuity is more intuitive. In normed spaces, continuity is Compact support here is the property that the function
induced by the norm: a sequence (xn )nN converges to vanishes outside of a certain bounded set.12 An exam-
n
x, if xn x 0. In the theory of distributions, we ple of such a functions, which may be adapted in var-
consider so-called topological vector spaces, i. e. vector ious ways, is (x) := exp( x211 ), if |x| < 1, and (x) = 0
spaces equipped with a topology, such that the vector otherwise.
operations are continuous, or in particular locally con- The notion of continuity on D( ) is derived from the
vex spaces which are characterised by a family of semi- topology of the strict inductive limit, which is rather com-
norms (pi )iI : a sequence (xn )nN converges to x, if for all plicated. However, it is possible to describe sequences
n
i I there holds pi (xn x) 0. converging to zero in this topology: a sequence (n )nN
As an example consider the vector space C[a; b] of of test functions does so, if
continuous functions on an interval [a; b]. There are two
1. there is a single compact set K , such that
well-known notions of convergence here: pointwise and
supp n K holds for all n N, and
uniform convergence. The latter is defined by the norm
2. for all m N and all Nm 0 the sequence D n con-
verges uniformly to zero. 13
    
f  := max f (x) | x [a; b] ; (13) The distributions D ( ) now are the continuous lin-

ear functionals on D( ). Examples are the delta distribu-
tion: () := (0). Many functions can be interpreted as
1
a sequence of functions distributions. Let Lloc ( ) be the set of locally integrable
  (fn )nN converges to f uniformly, functions f : C, i. e. for every point in there ex-
if and only if fn f  converges to zero in R. Point-
wise convergence cannot be described by a single norm; ists a neighbourhood on which f is integrable (i. e. es-
therefore, for every x [a; b] the seminorm sentially excluding non-integrable poles). For each f
 consider
 1
Lloc ( ) the associated regular distribution is defined by
px (f ) := f (x): the sequence converges to f pointwise, 
if every (of the uncountably many) sequences px (fn ) Tf () := x f (x)(x) dx.
converges to px (f ) in R. On D ( ) one uses the weak*-convergence: a sequence
(Tn )nN of distributions converges to zero, if Tn () does

4.3 Test functions and distributions 12 I. e., the functions must be exactly zero, not just approaching
zero. In complex analysis, this is not possible for holomorphic
We shall start by defining a certain class of functions, the functions C C because of the identity theorem, except for
so-called test functions. To this aim, consider an open set 0.
Rn . The support of a function f is the closure of the 13 D is a shorthand notation for derivatives: for a function
 
set, where f does not vanish: supp f := x | f (x) = 0 . f : Rn C higher-order and higher-dimensional derivatives
Further, define by C k ( ) the set of functions which are are written in by a multi-index = (1 , . . . , n ) Nn0 : there
1 n
k times differentiable with a continuous derivative holds D f := x 1 . . .
xnn
f , and || := 1 + + n is
1
(where k may be infinite). The test functions now are the called the order of .

652 www.fp-journal.org 
C 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim
Fortschritte
der Physik
Fortschr. Phys. 63, No. 910 (2015)
Progress

Review Paper
of Physics

so for all D( ); the set D ( ) is then complete (in the 4.6 Tempered distributions and the Fourier transform
sense of sequences).
For test functions (or more generally for L1 and L2 func-
tions) on Rn one can define the Fourier transform. To de-
4.4 Structure of distributions fine the Fourier transform for distributions, note that for
L2 (Rn ) functions, there holds
The set of distributions D ( ) obviously is a vector space.  
Moreover, every distribution can be multiplied with a (Ff )(x)g(x) dx = f (x)(Fg)(x) dx, (15)
C -function f : (f T )() := T (f ), D( ), but two dis-
tributions cannot be multiplied.14 It is also possible to and one can, in the same spirit as in the case of deriva-
define the convolution of two distributions, and distri- tives, try to define the Fourier transform by acting on the
butions can be approximated by C -functions (e. g. the test function. However, F is never a test function, except
delta distribution by Gaussians). for 0. Therefore, it is necessary to enlarge the space of
Distributions can be differentiated arbitrarily: in test functions, which results in a smaller dual. The space

analogy to the product rule for a C 1 (R) functions, f = which one considers is known as the space of rapidly de-

f (the boundary terms vanish), on defines the creasing functions (or Schwartz functions):
derivative of a distribution T by T () := T ( ). Higher-
order and higher-dimensional derivatives are accord- S(Rn ) := f : Rn C| f C (Rn ),
ingly defined by (D T )() := (1)|| T (D ).  
(, N0 ) lim x D f (x) = 0 . (16)
n

4.5 Sobolev spaces These are the functions which are arbitrarily often dif-
ferentiable and go to zero faster than every polynomial,
Given a differentiable function f and any D( ), there and their derivatives should do the same. Here the stan-
 
holds that f = f (the boundary terms are zero). dard example is a Gaussian (x) = exp( x), > 0. The
Even if f is non-differentiable, there may exist some important property with respect to Fourier transforms
 
function g, such that f = g for all D( ). In is that F(S(Rn )) S(Rn ). There holds D(Rn ) S(Rn ) and
contrast to the common (strong) derivative, g is called the thus S (Rn ) D (Rn ) we shall not mention the topology
weak derivative of f . The derivative in the sense of dis- here. The space S (Rn ) is called the space of tempered dis-
tributions is still weaker. Altogether there are three no- tributions.
tions of derivatives: the common (strong), the weak and The Fourier transform now is defined by (FT )() :=
the distributional derivative. T (F). There holds e. g. (FD() T )() = (i/)|| x FT ,
F = exp[ i ax]/(2)n/2 and F1 = (2)n/2 0 (in Rn ),
Example 1 (Derivatives). The absolute-value function where a is the delta function at some point a Rn , i. e.
| | : R R is continuous for x = 0, but not differentiable. 
a (x) = (x a). For a polynomial f (x) = nk=1 ak xk there
Nevertheless, there is a weak derivative, the sign function n
follows (Ff )(p) = k=1 ak ( i p k
) . There is a regularisa-
sgn, which is +1 for x > 0, 0 for x = 0 and 1 for x < 0. tion of distributions: for f S (Rn ) there exists a function
This function does not possess a weak derivative, but 2 is g and some Nn0 , such that f = D g.
its derivative as a distribution.

The space of m-times weakly differentiable Lp -


functions (i. e. all D f exists for all with || m), where 4.7 Applications
all derivatives again lie in Lp , is called the Sobolev space
W m,p ( ) (or H m,p ( )), and p = 2 is often omitted. An Having discussed the guiding ideas in the definition
interesting connection of weak and strong derivative is of distributions, we shall present some applications
given by Sobolevs lemma (or embedding theorem): Let of the theory of distributions, which are relevant to
Rn be an open set, k N0 and m > k + np . Then, physics.
there holds W m,p ( ) C k ( ).
4.7.1 Differential operators

Consider a linear partial differential operator L : S(Rn )


14 In physics, however, this is done in some instances. 
S(Rn ) of order m N given by T  ||m c T with


C 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim Wiley Online Library 653
Fortschritte
der Physik
K. S. Ranade: Functional analysis and quantum mechanicsan introduction
Progress
Review Paper

of Physics

c C (Rn ) being smooth functions and the linear par- 5 Further topics
tial differential equation Lu = f with f D (Rn ). We look
for a fundamental solution for L, i. e. a distribution E, for In this final section of this tutorial, we discuss some fur-
which there holds LE = . If this E is known, u := f E ther topics from functional analysis and also from alge-
solves the original equation, because of Lu = L(f E) = bra. However, a fully rigorous treatment of these topics
f LE = f = f ; note that this is essentially the method requires in general much more terminology and nota-
of Greens functions. In particular, the delta function is tion, which would render it too long for this introductory
the identity with respect to the convolution. text. Nevertheless, we want to include these topics in or-
The standard example is the Poisson equation from der to show that mathematics has many aspects relevant
electrostatics: ( r ) = 4( r ). A fundamental solution to physics, which are somewhat less known to physicists.
 r )
of the Laplacian  is E = 41|r | and ( r ) = |( r r |
d r .
(Note that for convolutions there holds Tf1 Tf2 = Tf1 f2 )
The basic idea is to use the Fourier transform: E = 5.1 Operator algebras
becomes (x2 y 2 z2 ) FE = r 2 FE = 1 with r =

x2 + y 2 + z2 , so FE = 1/ r 2 ; the inverse transform of Operator algebras in some sense make abstract the no-
1/ r 2 is 1/(4r) which gives the desired result. tions of operators on a Hilbert space. The theory of op-
erator algebras is relevant to physics, since there is quite
a lot of mathematics literature dealing with topics from
4.7.2 Rigged Hilbert spaces quantum mechanics, but in general, difficult to read for
physicists.
At the end of this chapter, we would like to mention The term operator is as before a linear mapping on a
that there indeed exists a possibility to mathematically Hilbert space. The term algebra appears in two notions:
explain generalised eigenfunctions of position and mo- on the one hand as the name of a mathematical field of
mentum and of other operators, although this is rather the study of algebraic structures such as groups, rings,
complicated. Note first, that in the continuous spec- fields etc., on the other handas we use it hereas
trum of a selfadjoint (or more generally normal) oper- one of these algebraic structures, which can informally
ator there a no true eigenfunctions, but approximate be described as a vector space with multiplication.
eigenfunctions. Axiomatically, it is a quadruple (V, K , +, ) consisting
of a vector space V over a scalar field K , on which
Theorem 6 (Approximate eigenvectors). Let T be a
there is additionally defined a multiplication of two
bounded selfadjoint operator on H and (T ). Then,
vectors : V V which fulfils the following conditions
there exists a sequence (xn )nN in H with xn  = 1 for all
(for all a, b, c V and K ); as is customary, we
n N, such that limn (1 T )xn = 0.
suppress the multiplication symbol:
For example, in case of the position operator consider
Gaussians of decreasing width approach a delta function 1. associativity: (ab)c = a(bc);15
(up to normalisation). 2. distributivity: a(b + c) = ab + ac and (a + b)c = ac +
For a generalised version of the spectral theorem one bc;
must choose a space  on which there is defined a scalar 3. scalar multiplicativity: (ab) = (a)b = a(b).
product and which additionally is nuclear (a property The algebra is commutative, if there holds ab = ba,
which cannot be explained here); possible choices for and it is called unital (or algebra with identity) if there is a
 are the test functions or the Schwartz functions. The neutral element e V , such that there holds ae = ea = a.
completion of  is a Hilbert space H and dual of  is The principal examples of such algebras are n n
 . With the appropriate embeddings, the triplet  matrices Mn (K ) over a field K . Moreover, the bounded
H  is called rigged Hilbert space or Gelfand triplet. linear operators on a Hilbert space H, written as B(H) (or
There holds the Gelfand-Kostyuchenko theorem [1, 8, 9], L(H)) form such an algebra as well as their subalgebras,
which we cannot give in rigorous form here. But infor-
mally spoken, it states that for an operator T under cer-
tain assumptions, there exist enough generalised eigen- 15 Note that in Lie theory a Lie algebra with Lie bracket [ , ] :

vectors |x , such that we may write 1 = |x x | d V V V is in general non-associative, and algebras in

and T = |x x | d . Note that these operators can- our sense are called associative algebras there. (The Lie bracket
not act on the whole Hilbert space, but only on the can be thought of an abstract version of the commutator of
subset . operators.)

654 www.fp-journal.org 
C 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim
Fortschritte
der Physik
Fortschr. Phys. 63, No. 910 (2015)
Progress

Review Paper
of Physics

like block matrices. Commutative algebras are e. g. the Cauchy sequence), but weakly: for every bra-vector |,
set of diagonal matrices or continuous functions C[a; b] |n converges to zero.
on an interval or more generally on a compact set. An al- Given a sequence not of vectors but of operators,
gebra which does not possess an identity is the continu- there are even more notions of convergence; we shall
ous functions on an interval vanishing at the boundary consider only the three most important. Let (An )nN be a
C0 [a; b]. sequence in B(H), the bounded operators, and A B(H)
An involution is a function : A A on an algebra A be an operator.
fulfilling
1. The sequence (An )nN converges in the norm (or uni-
1. involutivity: x = x; formly) to A, if limn An A = 0 in R. Here, A is
2. anti-linearity: (x + y) = x + y ; the as usual operator norm of eq. (6).
3. anti-multiplicativity (xy) = y x . 2. It converges strongly (more precisely: with respect to
the strong operator topology) to x, if for every x
An involutive algebra is an algebra with an involution.
H, the sequence (Axn )nN in H converges strongly to
In mathematics, usually a star is used instead of a dag-
Ax H.
ger (A instead of A ), and such algebras are also called
3. It converges weakly (more precisely: with respect to
-algebra; this is the star in C -algebra. The standard ex-
the weak operator topology) to x, if for every x H, the
ample of involutive algebras are complex square matri-
sequence (Axn )nN in H converges weakly to Ax H.
ces or bounded operators with their adjoints, i. e the
transposed and complex conjugated matrix. The guid- Convergence in the norm implies strong, and this im-
ing idea is to make terminology from spectral theory ab- plies weak convergence. To give counterexamples for the
stract, i. e. to characterise them by their inner properties. opposite, consider An := |nn| and Bn := |n0|. Both se-
The topic of operator algebras is connected to func- quences do not converge in the norm, but weakly to the
tional analysis by normed algebras, i. e. algebras with a null operator; the first one does so also strongly, the lat-
norm. A Banach algebra (similarly
  to a Banach
 space) is ter does not. In the mathematical-physics literature there
a normed algebra, where xy  x y  for all x, y A; often appear notions for which we shall give just the def-
this condition implies continuity of the multiplication. initions, in order that the reader may grasp the idea, if he
If there holds the
 (at first sight somewhat odd-looking) finds them somewhere (usually it is always good to think
condition x x = x2 , the algebra is called C -algebra. of square matrices):
(Using the operator norm this holds e. g. for square
1. A C -algebra is a a norm-closed involutive subalgebra
matrices.)
of B(H);
2. a von-Neumann algebra is a strongly (or equivalently
weakly) closed involutive subalgebra of B(H).
5.2 Notions of convergence
It can be shown that every C -algebra in this sense ful-
In the preceding sections we already have mentioned the fils the C -property from above. The finite-dimensional
possibility to use different notions of convergence. Here C -algebraswhich always contain the identityare
we should consider in more detail some notions of con- easy to classify; they are the direct sums of simple
vergence one usually encounters in mathematics litera- C -algebras, which again are the full matrix algebras
ture. Let (xn )nN be a sequence and x be an element in Mn (C) of complex n n-matrices.
some Hilbert space H: The commutant of a set M B(H) is the subalge-
bra M := {A B(H)| (B M)(AB = BA)} of all elements
1. The sequence (xn )nN converges strongly or in the
in B(H) commuting with every element in M (the nota-
norm to x, if limn xn x = 0 in R.
tion should not be confused with the dual); for exam-
2. It converges weakly to x, if for every y H the se-
ple, as an instance of Schurs lemma, if A = Mn (C), then
quence y|xn x converges to zero in C.
A = C1. The bicommutant theorem says that M is a von-
In case of doubt, on usually takes strong conver- Neumann algebra, if and only if M = M . von-Neumann
gence; more generally, weak convergence can be defined algebras can be decomposed into so-called factors (of
on normed vector spaces and their dual. Strong conver- types In , I , II1 , II and III), where a factor is defined
gence implies weak, but in infinite-dimensional spaces by M M = C1.
not vice versa: consider an infinite-dimensional Hilbert Coming back to quantum mechanics: the postulates
space, e. g. Fock space {|0, |1, |2, . . . }, and a sequence of quantum mechanics say that there is a Hamiltonian
xn = |n. This does not converge strongly (e. g. it is not a operator H, which generates the dynamics of a state


C 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim Wiley Online Library 655
Fortschritte
der Physik
K. S. Ranade: Functional analysis and quantum mechanicsan introduction
Progress
Review Paper

of Physics

vector by the Schrodinger equation i dt d


| = H|. If the eigenvalues of |A| and their p norm is the norm of
H is time-independent, the solution is the unitary time the operator
evolution |(t) = eiHt/ |(0). So for every time t there 1/p
exist a unitary operator U(t) mapping |(0) to |(t). Ap := Tr |A|p . (17)
In mathematical terms, the function U(t) = eiHt/ is a
unitary-operator-valued function of the (time) parame- Those operators, for which the norm is finite, form the
ter t R which fulfils the following conditions: Schatten classes Sp . Special cases are the trace class op-
erators S1 (such as density operators), Hilbert-Schmidt
1. Semigroup property: U(t + s) = U(t)U(s) for t, s 0 operators S2 and the (non-compact) bounded opera-
and tors S = B(H). As is usually the case for p-norms, the
2. Strong continuity: for t t0 , there holds U(t)| 2-norm is induced by a scalar product, A|B := Tr A B,
U(t0 )| (it suffices to show this for t = 0). and Holders inequality, ABr Ap Bq with 1r =
1
p
+ q1 and r, p, q [1; ], holds.
On the other hand, given such a function U, the
generator H can be recovered through formal deriva-
tion; Stones theorem tells us that this is indeed math- 5.5 Commutators of position and momentum
ematically possible. The one-parameter group is not
necessarily norm-continuous: take the harmonic oscil- Concepts and methods from the theory of operators may
lator Hamiltonian H = (|nn| + 1/2), where U(t) = be used to give a mathematical formulation of quantum
 (n+1/2)t
n=0 e
 |nn|. The n + 1/2 term in the exponent mechanics. This shall be presented here in a short and
oscillates arbitrarily fast and therefore is not continuous simplified way. In general, one starts from the Heisen-
(e. g. at t = 0). berg commutator relations [xi , pj ] = iij . Now consider
two cases of these operators in one mode:
r a free particle: Hilbert space L2 (R) and
5.3 Polar decomposition
r a particle in a box: Hilbert space L2 [a; b].
A complex number z = a + b i C may be decomposed
The spectrum of x and p is both R in the first case,
into polar representation: there holds z = rei for r =
but [a; b] and discrete in the second case. This shows
|z| R+
0 and [0; 2). This is generalised to the po- that the Heisenberg commutator relations do not specify
lar decomposition of an operator. The absolute
value of
the spectrum uniquely. On the other hand, we can (for-
an operators A B(H) is defined as |A| := A A, and this
mally) use the Baker-Campbell-Hausdorff identity to get
operator is always positive semidefinite. In the finite-
the commutator relations in the Weyl form:
dimensional case A is a matrix, and x|A Ax = Ax|Ax
0, so we can diagonalise it and take the square root; by
eit xi eispj = eistij eispj eit xi for t, s R. (18)
the spectral theorem, we can do the same for all bounded
operators B(H).
Note that this is a special case of the displacement oper-
The polar decomposition now states that there exist
ators D() for C, usually used in discussing the har-
partial isometries U and V in B(H) (i. e. U U and UU
monic oscillator and coherent states. Further, the opera-
are projections on H), such that A = U |A| = |A| V . If the
tors appearing in this equation are bounded, thus easier
Hilbert space is finite-dimensional, U and V can be cho-
to handle.
sen to be unitary.
The commutators used in the Weyl form are not
the usual commutators of operators [A, B] = AB BA
(in mathematics also known as Lie bracket in terms of
5.4 Classes of operators Lie algebras) but a related concept, the group-theoretic
commutator ABA1 B1 , which can be defined on any
An operator is called compact, if the image of a com-
group and gives identity, if AB = BA. The following
pact set is relatively compact, i. e., the closure of the im-
theorem says that these operators are essentially unique
age is compact; on a Hilbert space they are the uniform
[13].
limit of finite-rank operators. The compact operators in
some sense behave like matrices, e. g. the set of non- Theorem 7 (Stone-von Neumann theorem). Let Ai and
zero eigenvalues is discrete, so that we can define the Bi for i {1, . . . , n} be operators on a Hilbert space H,
Schatten-p-norms as follows: the singular values of A are such that for all i, j {1, . . . , n} the Weyl commutator

656 www.fp-journal.org 
C 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim
Fortschritte
der Physik
Fortschr. Phys. 63, No. 910 (2015)
Progress

Review Paper
of Physics

relations eitAi eisAj = eisAj eitAi and eitBi eisBj = eisBj eitBi and the mathematical sense. However, not all mathematical
states can be described this way, and the notion there is
eitAi eisBj = eistij eisBj eitAi somewhat more general, but under some continuity as-
sumptions, the converse also holds.
are fulfilled. If the operators act irreducibly16 on H, The states moreover form a convex and compact
then there exists a unitary mapping U : H L2 (Rn ), set. Within these sets there exist extremal points, i. e.
unique up to a phase ei , such that UeitAi U = eit xi and points which cannot be written as true convex combi-
UeitBj U = eit pj where xi = xi and pj = i x i are the usual nations of other states. Formally: a state is extremal, if
position and momentum operators on Rn . from = 1 + 2 with , > 0 and + = 1, it fol-
The intuitive reason behind this is that the Heisen- lows that 1 = 2 = . Extremal points in the states space
berg commutators exhibit the local properties (at every are the pure states, other states are mixed states. Accord-
point x), but not the global ones. The Weyl commutators ing to the Krein-Milman theorem, a convex and com-
take into account global properties; remember that the pact set (such as the states) is the closed convex span of
exponentiated operators are translations of momentum its extremal states. For a two-dimensional Hilbert spaces
and position, respectively, and thus connect different this is very well illustrated by the Bloch sphere. The in-
pointsin the case of the particle in a box, one would finitely many points on the surface correspond to the
hit the wall if the shift distance is too large. (This is pure states, the convex hull (the interior) to the mixed
related to the theory of Lie groups and Lie algebras.) states. (One can compare the example of a probabil-
ity distribution with finitely many outcomes: there the
extremal points are the probability distributions with
5.6 Mathematical structure of quantum mechanics
one certain outcome.).

Thinking of the postulates of quantum mechanics


(Hilbert space, state vector or density matrix etc.), one 5.7 More examples
may ask what precise reason there is to use these pos-
tulates. There is a mathematical modelling of quantum It is beyond the scope of this introduction to present
mechanics, which could explain this. Starting from the more than an overview of all the applications of modern
assumption that quantum mechanics is a theory of what mathematics, i. e. still mathematics of the 20th century,
is measurable, we must first state the measurable quanti- to physics. But at the end of this introduction we would
ties (observables). A state of a system then is an object (a like to give some examples, where theorems from math-
function) which maps to every observable its expectation ematics have direct application in physics:
value. r The Schmidt decomposition was derived by Erhard
The crucial assumption now is to describe observ-
Schmidt in his 1906 doctoral thesis [14] and reap-
ables by elements of an operator algebra with certain
peared in Schrodingers papers on entanglement in
commutation relations. A state is a functional on the al-
1936 [15].
gebra of these operators, which should be continuous, r The idea of realising quantum operations as unitary
linear, positive and normalised. A mathematical theo-
operators on larger Hilbert spaces and Kraus opera-
rem, the Gelfand, Neumark and Segal (GNS) represen-
tors [16] is related to Stinesprings dilation theorem
tation, then tells us that, if these operators form a C -
from 1955 [17].
Algebra and we have such a state, these operators may r The quantum de-Finetti theorem was derived by
be represented by a subalgebra of the bounded linear
Strmer in 1969 [18], then by Hudson and Moody [19]
operators A = B(H) on some Hilbert space H. A state in
and in physics by Caves, Fuchs and Schack [20].
mathematics is a functional : A C which is positive r The proof of entropic uncertainty relations for Renyi
(maps positive operators to non-negative numbers) and
entropies by Maassen and Uffink [21] essentially re-
which fulfils  = (1) = 1 (the name of such a func-
lies on the Riesz-Thorin interpolation theorem found
tional even in mathematics is state). Comparing with
in several textbooks.
physics, a state is a density operator ; this operator gen-
erates a functional by (A) := Tr A, which is a state in
6 Summary
16 That is, if only {0} and H are invariant closed subspaces of
all eitAi and eitBi . If this is not the case, the theorem can be It could not have been the aim of this introduction to give
adapted accordingly. a complete overview of functional analysis or its use in


C 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim Wiley Online Library 657
Fortschritte
der Physik
K. S. Ranade: Functional analysis and quantum mechanicsan introduction
Progress
Review Paper

of Physics

physics and quantum mechanics, but to give the reader M. Gelfand, Collected papers, Vol. I (Springer 1987),
examples of mathematical thinking in a language appro- 505509.
priate for physicists. In this spirit we discussed some top- [8] I. M. Gelfand and N. J. Wilenkin, Verallgemeinerte
Funktionen (Distributionen) Vol. IV (VEB Deutscher
ics of functional analysis which are relevant for physics.
Verlag der Wissenschaften, 1964).
It is clear that there is a vast amount of mathematical [9] G. G. Gould, J. London Math. Soc. 43, 745754
knowledge relevant to quantum mechanics and physics (1968).
in general, but which is unfortunately not so widely [10] F. Schwabl, Quantenmechanik (Springer, 6th ed.,
known by physicists. 2002).
[11] J. von Neumann, Mathematische Grundlagen der
Key words. Functional analysis, quantum mechanics, tutorial. Quantenmechanik (Springer, 1932).
[12] F. Scheck, Theoretische Physik 2 (Springer 2nd ed.,
2006).
References [13] B. C. Hall, Quantum Theory for Mathematicians
(Springer, 2013).
[1] D. Werner, Funktionalanalysis (Springer 5th ed., 2005). [14] E. Schmidt, Math. Ann. 63, 433476 (1907).
[2] M. Reed and B. Simon, Methods of modern mathe- [15] E. Schrodinger, Proc. Cambridge Phil. Soc. 31, 555563
matical physics, Vol. I: Functional Analysis (Revised (1935), and 32, 446452 (1936).
and enlarged edition 1980), Vol. II: Fourier Analysis, [16] K. Kraus, States, effects, and operations (Springer,
Self-Adjointness (1975). 1983).
[3] J. B. Conway, A Course in Functional Analysis [17] W. F. Stinespring, Proc. Amer. Math. Soc. 6, 211216
(Springer 2nd ed., 1990). (1955).
[4] F. Gieres, Reports on Progress in Physics, 63, 18931931 [18] E. Strmer, J. Funct. Anal. 3, 4868 (1969).
(2000). [19] R. L. Hudson and G. R. Moody, Z. Wahrschein-
[5] G. Bonneau, J. Faraut, and G. Valent, American Journal lichkeitstheorie verw. Gebiete 33, 343351
of Physics 69, 322331 (2001). (1976).
[6] J. Elstrodt, Ma- und Integrationstheorie (Springer 4th [20] C. M. Caves, C. A. Fuchs, and R. Schack, J. Math. Phys.
ed., 2005). 43, 45374559 (2002).
[7] I. M. Gelfand and A. G. Kostyuchenko, Dokl. Akad. [21] H. Maassen and J. B. M. Uffink, Phys. Rev. Lett. 60,
Nauk SSSR 103, 349352 (1955), translated in Izrail 11031106 (1988).

658 www.fp-journal.org 
C 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim

S-ar putea să vă placă și