Documente Academic
Documente Profesional
Documente Cultură
Dragt
Alex J. Dragt
Department of Physics and Astronomy, University of Maryland, College Park, Maryland 20742
Here each function fn is a homogeneous polynomial of degree :f2:Pa' = {(n2 - nl)/(2r))[(qi) 2 , pai] = I(n2- nl)/rlqai,
n in the components of w. Moreover, the map is symplectic (4.6b)
for any set of polynomials. Finally, if the product is truncated
at any stage, the result is still a symplectic map. f2:2pai = t(n2 - n1)/rI:f2:qai= 0, etc. (4.6c)
When applied to optics, the factorization theorem indicates Proceeding as before, one finds the net result
that the effect of any collection of lenses, prisms, and mirrors
can be characterized by a set of homogeneous polynomials. qaf = Jfqai = exp(:f 2 :)qai = qail
It will be shown in Section 4 that the polynomials f2 reproduce Paf = Jhpia = exp(/f2:)pai = pai + J(n 2 - nl)/rqai.
Gaussian optics, and the higher-order polynomials f3, /A,etc. (4.7)
describe departures from Gaussian optics and are related to Comparison of the far left-hand and right-hand sides of Eqs.
aberrations in the case of an imaging system. Thus, from a (4.7) in this case shows that the relation described by st is that
Lie algebraic perspective, the fundamental problem of geo- predicted by Gaussian optics for refraction at a spherical in-
metrical optics is to study what polynomials correspond to terface of radius r when traversed from a medium having
various optical elements, to study what polynomials result index of refraction n, to a second medium having index n2*
from concatenating various optical elements, and to study Consequently, there is also the correspondence
what polynomials correspond to various desired optical
properties."1 Finally, it should be noted, at least in passing, At = expl(n2 - nl)/(2r):(qi) 2 :1 - Refraction in the Gauss-
that Eq. (3.7) provides a complete solution to the lens equa- ian approximation by a
tion. spherical surface of radius
r separating media having
refractive indices n1 and
4. SIMPLE APPLICATIONS
n2.
In this section some simple Lie transformations are studied,
At this point it should be evident that all Gaussian optics
and a correspondence will be made between them and various
can be represented by maps or products of maps of the form
idealized optical elements.
A = exp(:f 2 :). As an exercise, the reader is invited to work
To begin such a study, consider the case in which
out the effect of si when f2 is given by the expression
At = exp(:f2:), (4.1)
with f2 =-a(pi qi), (4.8)
where a is some parameter. She or he should find the result
f2 = -{1/( 2 n)J(pi)2. (4.2) that
The effect of :f2: and its powers on each component qai and qf = (ea)qi,
Pai of wi is easily worked out by using definitions (3.1) and
pf = (e--T)pi. (4.9)
(3.2). One finds the results:
These are the equations for a system that is both imagingand
f2 Pai = -Jl/(2n)1[(pi)2, pai] = 0, (4.3a) telescopic.
:f2:qai = -Il/(2n)}[(pi)2, qai] = (I/n)pi, (4.3b) The next question to address is that of departures from
Gaussian optics. Suppose, as is often the case, that the optical
:f2:2qi = (l/n):f 2:pai = 0, etc. (4.3c) system under consideration is axially symmetric about some
Now substitute these results in Eq. (2.9) with the aid of Eq. axis and is alsosymmetric with respect to reflections through
(3.6). The net result is the relations some plane containing the axis of symmetry. Then axial
symmetry requires that the various fn be functions only of the
Paf = itpai = exp(:f2:)Pa, = Pai, variables p 2 , q 2 , p *q, and p X q; reflection symmetry rules out
qaf = Atqai = exp(:f2:)qci = qai + (I/n)pai. the variable p X q. It follows that all fn with odd n must
(4.4)
vanish, since it is impossible to construct an odd-order ho-
Comparison of the far left-hand and right-hand sides of Eqs. mogeneous polynomial by using only the variables p 2 , p * q,
(4.4) shows that the relation described by At in this case is just and q2 . Consequently, in any case having the assumed sym-
that predicted by Gaussian optics for transit by a distance 1 metries, the optical symplectic map At must be of the general
through a medium of refractive index n. Consequently, the form
Alex J. Dragt Vol. 72, No. 3/March 1982/J. Opt. Soc. Am. 375
A = exp(:f 2 :)exp(:f 4 :)exp(:f 6 :).... (4.10) rations.2 Finally, the last term F(q2 )2 has no effect on the
quality of an image since [F(q 2 )2 , qj] = 0. (It does, however
Of course in the general case of no particular symmetries, the
affect the arrival direction p of a ray at the image plane and
odd-degree polynomials f3, /A,etc. can also occur, and all the
therefore may be important if the optical system under study
polynomials fn can, in principle, depend on the components
is to be used for some other purpose as part of a larger optical
of the vectors p and q in an unrestricted fashion.
system.) Specifically, one finds in the imaging case the fol-
The burden of the remainder of this section is to show that
lowing one-to-one correspondence between the terms in ex-
the polynomials f4, /6, etc. in the case of an imaging system are
pansion (4.13) and the classical Seidel third-order mono-
related to third-order and fifth-order aberrations, etc., re-
chromatic aberrations:
spectively. In particular, specific attention will be devoted
to third-order aberrations. They are completely described
Table of Aberrations
by f4. That is, attention will be restricted to maps of the
Term Seidel Aberration
form
A (p 2 )2 Spherical Aberration
A = exp(:f 2 :)exp(:f 4 :). (4.11)
Bp 2 (p *q) Coma
The higher-order cases can be treated in a similar fashion. C(p . q) 2 Astigmatism
Suppose that the Gaussian portion of a map [Eq. (4.11)], Dp2 q2 Curvature of Field
exp(:f 2 :), has been selected to produce imaging. Also, let w E(p . q)q2 Distortion
denote the location and direction of a ray at the image plane
In reflecting on what has been accomplished so far, it is
as predicted by the Gaussian approximation. Then it can
evident that all that has been assumed is that the optical map
be shown that the actual ray four-vector, which will be denoted
A is symplectic (a consequence of Fermat's principle) and has
by W, is affected by the term exp(:f 4 :) and is given by the ex-
certain symmetries. As a consequence of these assumptions,
pression
Gaussian optics was obtained as a first approximation, and
w = exp(:f 4 :)w = w + [V4,
wI + [f4, [f4,w]]/2!.... (4.12) it was found that for an imaging system only five well-defined
kinds of monochromatic aberration can occur in third order.
Next observe that the operation of Poisson bracketing involves The use of Lie algebraic methods seems to be an optimal way
multiplication and two differentiations. Thus the term [f4, of arriving at and understanding these basic results.
w] is a polynomial of order 3, and [/4,[V4,
w]] is a polynomial
of order 5, etc. Consequently, since only third-order effects
are of interest at this point, attention will be devoted only to 5. BEGINNING OF A CATALOG
the third-order term [V4, w].
According to the previous discussion of symmetry, f4, in the The purpose of this section is to present the beginning of a
cases of present interest, can depend only on the variables p 2 , catalog of Lie operators corresponding to various simple op-
p *q, and q 2 . Consequently, /4 must be of the general form tical elements. Results will be presented for axially sym-
metric systems through order 4. A more detailed catalog for
A = A(p2 ) 2 + Bp2 (p - q) + C(p *q)2 axially symmetric systems including terms through order 6
+ Dp2 q2 + E(p - q)q2 + F(q2 )2 . (4.13) is currently in preparation. When the catalog is complete,
as this section will show, it should be possible with sufficient
Here the quantities A-F are arbitrary coefficients whose algebraic effort to compute manually explicit formulas for
values depend on the particular optical system under con- both third- and fifth-order aberrations. If the catalog is
sideration. further enlarged to include general elements without any
The significance for an imaging system of the various terms special symmetry, then the resulting even-order aberrations
in /h as given by expansion (4.13) can be evaluated by using can also be computed.
Eq. (4.12). Suppose that an image is to be produced in the The first simple optical element to be considered is transit
image plane and that the effect of the term A(p2)2 on this through a slab of thickness 1 composed of a homogeneous
image is to be studied. Then use of Eq. (4.12) through third medium having constant refractive index n. In this case, the
order gives the result that associated symplectic map A is found to be given by the ex-
pression
qa = q. + [A(p 2 ) 2 , q]. (4.14)
it = expll:(n2 - p2) 1 / 2:1 = exp{(-l)/(2n):p2 :j
Explicit evaluation of the Poisson bracket following Eq. (3.1)
gives the result that Xexp{(-l)/(8n3):(p2)2:}
.... (5.1)
q= q - = -Aa(p 2 ) 2 /ap. = -4Ap 2 p. (4.15) Next consider a lens. Figure 2 shows a lens with planar
entrance and exit faces. It is composed of two media having
Evidently, the effect of the term A(p2)2 is that the deviations indices of refraction n1 and n2, respectively, separated by a
in the actual arrival point of a ray at the image plane depend curved interface. Suppose a Cartesian coordinate system is
on its Gaussian arrival direction p in the manner indicated in located in the exit face (right face) of the lens, with the z axis
Eq. (4.15)and do not depend on its Gaussian arrival point q. along the optical axis and the x and y axes lying in the face of
Those familiar with aberration effects will recognize that this the lens. Then the shape of the curved interface is taken to
effect is exactly what is usually referred to as spherical aber- be given by the equation
ration. z = -a(X 2 + y 2 ) + /3(X2 + y2 )2 + y(X 2 + y 2 )3 + .... (5.2)
Similarly, the other terms involving the coefficients B-E
can be shown to describe the other well-known Seidel aber- For the case of a spherical lens with radius of curvature r, the
376 J. Opt. Soc. Am./Vol. 72, No. 3/March 1982 Alex J. Dragt
as would be expected from Gaussian optics. The transfor- tion. The first term in A* arises from transit in air over the
mation matrix corresponding to AtG is given by the distance d1 and transit through the thickness of the lens, the
product second set of terms is produced by the lens itself, and the last
term is caused by the final transit over the distance d 2 . From
(01I) (-1/f I) (01i) (1 d) (5 ) this result it is clear that simple transit, when combined with,
even perfect Gaussian refraction, is itself a source of aberra-
Here use has been made of relations (4.4) and (4.7) and defi- tion.
nition (5.9). When the indicated matrix multiplication is Since all the terms A *-F* depend on F, and F depends on
carried out and use is made of Eq. (5.8), the result is the ma- the asphericity parameter f, it is evident that any one of the
trix terms A*-F* and its corresponding aberration can be made
to vanish by suitably shaping the lens surfate. However,the
(?1/f 1/m) (5.13) Petzval linear combination of astigmatism and curvature of
field given by the relation
where m is the magnification defined by the ratio
2D* -C* = 2D = cx(n2 - nl)/(nin 2) (5.18)
m = -d 2 /(d 1 + t/n). (5.14)
Observe that the matrices appearing in Eq. (5.12) occur in is independent of the asphericity of the lens.
the opposite order of their corresponding Lie transformations The statements of the paragraphs above concerning aber-
in Eq. (5.11). That this should be the case can be seen as rations hold even in the thin-lens limit t = 0. At first reading
follows: Suppose the Lie transformations exp(:f 2 :) and this may seem strange to those familiar with the usual aber-
exp(:g2 :) have the corresponding transformation matrices Mf ration theory since it is usually stated that a thin lens has no
and Mg, respectively. That is, there are the relations distortion and that all its other aberrations except spherical
aberration are independent of the asphericity parameter j.
exp(:f2:)wi = E Mfjwj, etc. (5.15) However, in the usual discussion of aberrations it is customary
to express the deviation Aq of a ray in the image plane as a
Then one finds that power series in the components of the coordinate qi in the
object 'lane and of a second coordinate Q that describes the
exp(:f2:)exp(:g 2 :)wi = exp(:f2 :) E Mg-Wi position of the ray as it passes through some reference aper-
2
ture. With this method, the size of a given aberration coef-
ficient generally depends on the choice of the reference ap-
= E Mzgexp(:f 2:)w1 erture plane, and the aberration coefficients for one choice of
reference aperture plane are related to those for a different
choice by stop-shift formulas. However, as long as the ref-
Ik
=E Mzgj E Mjf Wk
erence and other apertures are centered on the optical axis,
the aberration coefficients are independent of the actual sizes
= E (MgMf)ikWk- (5.16) of the apertures. The effect of the apertures is simply to limit
k
the range of the variable quantities q and Q by removing rays
The determination of the function f4 requires somewhat that exceed desirable limits. By contrast, the Lie algebraic
more effort. In general, f4 has an expansion of the same form
description makes no reference to apertures since the image
as that in Eq. (4.13). The explicit determination of the quantities qf, pf are expressed directly in terms of the object
coefficients can be carried out using the Campbell-Baker- quantities qi, pi. Apertures still play a role, of course, in
Hausdorff formula. 1 0 By using this formula, which is a limiting the range of the quantities qi, pi, qf, pf by removing
standard calculational tool in the theory of Lie algebras, and the same undesirable rays.
denoting the coefficients occurring in f4 by the symbols Suppose the case of a thin lens is to be treated by using the
A*-F*, one finds the results that usual aberration theory and that the reference aperture is
*=-m4(d1 +tn3)/8 +(d2D
- d2E
+d2F)- d2/8, centered on the optical axis and is placed in the plane of the
lens. Then Lie algebraic methods employing coefficients
B* = -m3 (d1 + tn- 3 )/(2f) + (-2d 2D + 3d E- 4d3F),
(5.17) evaluated for t = 0 can be used to produce an expansion
C* = -m 2 (di + tn- 3 )/(2f2 ) + (-2d 2E + 4d2F), of Aq in terms of qi and Q. One finds that the coefficients
D* = -m 2 (di + tn- 3 )/(4f2 ) + (D-d 2E + 2d2F), appearing in the expansion are linear combinations of the
E* = -m(di + tnr 3)/(2f 3 ) + (E - 4d2F), coefficients A*-E*. In particular, one finds that the coeffi-
F* = -(di.+ tn- 3)/(8f 4) + F. (5.17) cient of (qi2)qi, which describes distortion in the usual
treatment, does indeed vanish. Also, the coefficient of (Q2 )Q,
Here the quantities D, E, and F are those given by Eq. which describes spherical aberration in the usual treatment,
(5.6). is equal to the quantity -4A*/(mdl) 3 . Finally, all other
Evidently all the coefficients A*-F* are nonzero in the coefficients are found to be specific linear combinations of
general case. Thus an imaging system made with a single lens A*-E*, that work out to be independent of F, and therefore
having one flat face generally suffers from all five Seidel ab- they do not depend on the asphericity.
errations. In general, the coefficients used to describe aberrations in
By looking at the terms appearing in the coefficients A*-F*, the Lie algebraic method are linear combinations of those used
it is possible to determine the various sources of aberration. in the conventional method, and vice versa. In the case in
For example, consider A*, which describes spherical aberra- which all the aberrations listed above a given aberration in the
378 J. Opt. Soc. Am./Vol. 72, No. 3/March 1982 Alex J. Dragt
Table of Aberrations vanish, then the coefficientused to de- methods provides an operator extension of the matrix meth-
scribe the given aberration in the Lie algebraic method and ods of Gaussian optics to the general case.
that used in the conventional method are directly proportional The preparation of a complete catalog of Lie transforma-
to each other. tions for common optical elements through order 6 seems to
The last example to be considered in this section is that of be feasible by hand calculation. This catalog, along with
propagation in a graded-index medium.12 Suppose that the standard Lie algebraic procedures for handling noncom-
medium is translationally invariant in the z direction. Then, muting operators, should make it possible to compute man-
taking the z axis to the optical axis, the symplectic map A ually explicit formulas for both third- and fifth-order aber-
describing transit over a distance d in this medium is given rations for any axially symmetric optical system. Since the
by the expression procedures for developing a catalog and manipulating non-
...].
2 commuting operators can be stated in explicit terms, it is also
A = expid:(n - p 2 )1 /2: . (5.19) possible to program a computer to carry out the required al-
Suppose further that the medium is axially symmetric. gebraic operations. With the aid of symbolic manipulation
More specifically, assume that the square of the index of re- programs such as MACSYMA,it should be possible to compute
fraction, n 2 , can be expanded in the form routinely explicit formulas for aberrations of seventh and
ninth order, and probably beyond. The catalog could also be
n2= n[1 - a(x 2 + y2 ) extended to include more general elements without axial
+ /3(x2 + y2 )2 + y(X2 + y2)3 + (5.20) symmetries, such as cylindrical lenses, and the resulting
Then one finds that expression (5.19) for A can be expanded even-order aberrations could then also be computed.
and written as a factored product in the form With the existence of high-speed computer ray-tracing
programs, there may be less interest in explicit formulas for
A = exp(:f2:)exp(:f 4 :).... (5.21) aberrations in some cases. Indeed, the commercially available
Moreover, in this case /2 is given explicitly by the expres-
program ACCOS V was used to make an indirect and partial
check on the algebra employed to derive Eqs. (5.6) and
sion
(5.17).13 The simple imaging system described in Section 5
f2 = -d~p2 /(2no) + noaq2/21. (5.22) with the values
As might be expected, the expression for f4 is more com- d- = 19.0, t = 1.5,
plicated. With the aid of the Campbell-Baker-Hausdorff
n = 1.5, r = 5.0, (6.1)
formula, one finds the result that
d2 = 20.0
f4 = -(f 2 )2 /(2nod) + dh4. (5.23)
was considered, and the program ACCOS V was instructed to
The function h4 can be written most compactly as an integral search for the asphericity / required to remove spherical ab-
in the form erration. The computer found for the quantity Ad, defined
by the relation
h4 = (noB/2) r dXfqcos(XdV4) Ad = / + 1/(8r3), (6.2)
2 2
+ [p/(n o/)]sin(Xdv/a)]
0 1. (5.24) the value
If desired, the expressions for (f2)2 and h 4 can easily be ex- Ad = 1.1450X 10-3. (6.3)
panded to give a representation of the form [Eq. (4.13)] for f4
with explicit values for all the coefficients A-F. Observe that If Eqs. (5.6) and (5.17) are used to solve for the value of / and
only the h4 term in Eq. (5.23)involves1 and that none of the hence of Ad, which results in the condition that A* = 0, one
terms in Eq. (5.23) depends on terms in Eq. (5.20) beyond finds the result that
fourth order. Thus departures from Gaussian optics again Ad = 1.1449653X 10-3. (6.4)
occur, and there is again relatively little freedom to adjust
them. The values of Eqs. (6.3) and (6.4) agree to within the accuracy
of the ACCOSV output format.
However, despite their usefulness, computer codes tend to
6. CONCLUDING DISCUSSION run rather blindly. One of the major advantages of the Lie
A new method has been presented for characterizing optical algebraic approach may be in providing additional insight
systems and computing aberrations. In Section 2 it was concerning the sources of various aberrations and to what
shown that every optical system gives rise to, and is charac- extent they may be best corrected by bending lenses, by the
terized by, an optical symplectic map. Section 3 described use of special elements, and by the imposition "of various
how symplectic maps can be written as products of Lie symmetries.
transformations, and Section 4 showed how these Lie trans-
formations can be related to Gaussian optics and the de-
ACKNOWLEDGMENTS
scription, of aberrations. A final section presented the be-
ginning of a catalog of Lie transformations through fourth The author thanks John A. Farrell of the Los Alamos National
order for various common optical elements. It also illustrated Laboratory for many helpful comments. He also is grateful
briefly how the Lie transformations for these elements can be for correspondence and useful criticism from H. Buchdahl,
concatenated, by using mathematical tools already available R. Hopkins, D. Shealy, 0. Stavroudis, and A. Vander Lugt.
from the theory of Lie algebras, to provide a description of This research was supported in part by U.S. Department
compound optical systems. Thus the use of Lie algebraic of Energy contract no. DE-AS05-80ER10666.AOOO.
Alex J. Dragt Vol. 72, No. 3/March 1982/J. Opt. Soc. Am. 379