Documente Academic
Documente Profesional
Documente Cultură
Fluorescence Spectroscopy
Volume 2
Principles
Topics in Fluorescence Spectroscopy
Edited by JOSEPH R. LAKOWICZ
Volume 1: Techniques
Volume 2: Principles
Volume 3: Biochemical Applications
Topics in
Fluorescence
Spectroscopy
Volume 2
Principles
Edited by
JOSEPH R. LAKOWICZ
Center for Fluorescence Spectroscopy
Department of Biological Chemistry
University of Maryland School of Medicine
Baltimore, Maryland
No part of this eBook may be reproduced or transmitted in any form or by any means, electronic,
mechanical, recording, or otherwise, without written consent from the Publisher
v
vi Contributors
s
Fluorescence spectroscopy and its applications to the physical and life sciences
have evolved rapidly during the past decade. The increased interest in
fluorescence appears to be due to advances in time resolution, methods of
data analysis, and improved instrumentation. With these advances, it is now
practical to perform time-resolved measurements with enough resolution to
compare the results with the structural and dynamic features of macro-
molecules, to probe the structures of proteins, membranes, and nucleic acids,
and to acquire two-dimensional microscopic images of chemical or protein
distributions in cell cultures. Advances in laser and detector technology have
also resulted in renewed interest in fluorescence for clinical and analytical
chemistry.
Because of these numerous developments and the rapid appearance of
new methods, it has become difficult to remain current on the science of
fluorescence and its many applications. Consequently, I have asked the
experts in particular areas of fluorescence to summarize their knowledge and
the current state of the art. This has resulted in the initial two volumes of
Topics in Fluorescence Spectroscopy, which is intended to be an ongoing
series which summarizes, in one location, the vast literature on fluorescence
spectroscopy. The third volume will appear shortly.
The first three volumes are designed to serve as an advanced text. These
volumes describe the more recent techniques and technologies (Volume 1),
the principles governing fluorescence and the experimental observables
(Volume 2), and applications in biochemistry and biophysics (Volume 3).
Additional volumes will be published as warranted by further advances in
this field. I welcome your suggestions for future topics or volumes, offers to
contribute chapters on specific topics, or comments on the present volumes.
Finally, I thank all the authors for their patience with the delays incurred
in release of the first three volumes.
Joseph R. Lakowicz
Baltimore, Maryland
vii
!"#$%&'()%#*+)*+#,*'--.%-)/+%0-'*1
Contents
ix
x Contents
Index . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 429
!"#$%&'()%#*+)*+#,*'--.%-)/+%0-'*1
1
Fluorescence Anisotropy:
Theory and Applications
Robert F. Steiner
1.1. Introduction
1
2 Robert F. Steiner
viscosity of the solvent. The theory of Perrin was tested using fluorophores of
known molecular volume and yielded reasonably self-consistent results.(6)
Subsequently, the use of the polarization of extrinsic fluorescent labels to
study proteins was introduced by Weber and applied to the characterization
of a number of proteins by Weber and others.(10, 11) The mathematical for-
mulation was substantially simplified by the adoption, first suggested by
Jablonski, of a linear parameter, the fluorescence anisotropy. (12) The use of
anisotropy is particularly advantageous for the study of heterogeneous
systems.
The development of the single-photon counting technique (13–15) made it
feasible to examine the time dependence of anisotropy directly. The basic
theory of anisotropy decay was described by Wahl(16, l7) and applied by Wahl
and co-workers to the study of the rotational dynamics of biopolymers(17–19).
The basic theory was expanded by Gottlieb and Wahl, (20) who con-
sidered the effects of internal rotations, and by Belford et al., who presented
a description of the anisotropy decay of ellipsoidal particles.(21) Kinosita et al.,
have extended the theory to the hindered rotation of fluorophores embedded
in membranes.(22)
The last decade has witnessed a progressive refinement of the technique
of time-domain measurements of anisotropy decay and the application of this
method to a wide variety of biopolymer and membrane systems. In recent
years frequency-domain determinations of anisotropy decay have been
developed as a useful alternative to time-domain measurements. The present
chapter will describe the current status of the technique and present some
selected applications.
1.2. Theory
When the exciting beam is polarized along the Oz axis, symmetry now
dictates that
and
and
4 Robert F. Steiner
and
where
Employing Eqs. (1.5) and (1.10), we obtain, for vertically polarized exciting
radiation,
It may be shown that the emission anisotropy for the two cases are related by
and
Fluorescence Anisotropy: Theory and Applications 5
which would be observed if the excitation pulse were infinitely sharp by the
convolution integral (25) :
Here, t is the time and E(u) is the time profile of excitation. (In this chapter
the convention will be followed of denoting experimentally observed quan-
tities, such as i, s, d, etc., by lowercase letters, while capital letters will be used
for the corresponding quantities I, S, D, etc., which have been corrected for
convolution effects so as to correspond to the behavior expected if the excita-
tion pulse were infinitely sharp.) The mathematical procedures which have
been developed for analyzing the time decay of fluorescence intensity in terms
of discrete decay times also provide for the deconvolution of the experimental
decay curve so as to remove the distortion caused by the finite duration of the
excitation pulse.
defined direction.(1) This model has proved adequate for the interpretation
of nearly all the available experimental data and will be assumed in the
discussion to follow. In this case,
and
Here,
where are the rotary diffusion coefficients for rotation about the
axis of symmetry and about either equatorial axis, respectively, and
where are the angles formed by the absorption and emission tran-
sition moments, respectively, with the axis of symmetry of the ellipsoid, and
is the angle formed by the projections of the two moments in the plane
perpendicular to the axis of symmetry.
For low values of corresponding to the initial slope of the
anisotropy decay curve (1) ,
Here, is the harmonic mean of the three correlation times and is defined,
in the general case, by:
For the special case where the transition moments are randomly oriented with
respect to the axes of the particle so that
then
12 Robert F. Steiner
In the case where the moments of absorption and emission coincide, then
so that Eqs. (1.36) become
Here,
where is the angle formed by the emission moment with the z axis.
The simulation of Brownian rotation was done by increasing the time in
a series of small increments and rotating the molecule at each step about
each of its axes by angular increments (/= 1, 2, 3) with the sign of
chosen randomly. The magnitude of each rotational step was determined by
the rotary diffusion coefficient about the corresponding axis, as predicted by
the classical theory of Brownian rotation:
After a set of time increments, the anisotropy was computed using Eqs.
(1.45)–(1.47). This was repeated for a series of such data sets, and the resultant
anisotropies were plotted as a function of time. This was done for oblate and
prolate ellipsoids of revolution with varying axial ratios and orientations of
the transition moments. In all cases the simulated anisotropy decay corre-
sponded closely to the predictions of the theory of Belford et al.(21)
A surprising prediction of the theory of Belford et al., which was verified
by the simulations of Harvey and Cheung, is that, for certain orientations of
the transition moments, the anisotropy actually increases with time for sort
intervals after excitation, so as to pass through a maximum before decaying
to zero at long times (Figure 1.4).
14 Robert F. Steiner
the plane of the bilayer. In this case, rotational diffusion cannot depolarize the
fluorescence entirely, so that the anisotropy approaches a finite value at long
times. The time decay of anisotropy is given by, for a spherical particle(22, 34):
where is the correlation time for rotation of the particle about the fixed
axis, and The most general expression for the limiting
anisotropy at long times, is
where is the average value of the cosine squared for all possible angles
between the direction of the absorption transition moment at the time of
absorption and the emission transition moment at the time of emission.
If rotational wobble of the fluorophore is absent, so that rotation is
strictly confined to a single axis of the labeled biopolymer, then the motion
of the emission moment will be effectively confined to the surface of a cone of
semiangle is the angle formed by the emission dipole with the
direction of the axis about which rotation of the particle occurs. (1, 22, 34)
If the absorption and emission moments are parallel, then
Kinosita et al.(22) and Lipari and Szabo(34) have considered the parallel
case for which rotational wobble of the fluorophore is allowed. The rotating
unit, which may be either an unattached fluorescent probe or a labeled
biopolymer, is assumed to have cylindrical symmetry and the medium to have
uniaxial symmetry. The orientations of the transition dipoles of absorption
and emission with respect to the symmetry axis of the rotating unit are
assumed to be invariant, so that the equilibrium orientational distribution
of the two dipoles depends solely upon the angle between the axis of the
rotating unit and that of the medium (Figure 1.5). Since, for this case also, the
membrane-embedded fluorophore cannot assume all possible orientations
with equal probability, the anisotropy does not decay to zero at long times
but instead approaches a finite value,
A simple solution is possible if either the absorption dipole or the
emission dipole is parallel to the direction of the unique symmetry axis
of the rotating unit. If are unit vectors in the directions of the
absorption and emission dipoles and the axis of symmetry, respectively, then
A(t) may be expressed in terms of a correlation function as
and
Fluorescence Anisotropy: Theory and Applications 17
where are the angles between the symmetry axis of the probe and
respectively.
Equation (1.54) relates the limiting fluorescence anisotropy to the order
parameter S of the probe, defined as
The order parameter governs the magnitude of the first nontrivial term in
the series expansion of the orientational distribution function in terms of
Legendre polynomials(34):
To the extent that the diffusion in a cone model is valid, Eq. (1.59) permits
evaluation of the semiangle of the cone.
In order to evaluate the time-dependence of A(t) according to equation
(1.51), it is necessary to evaluate the correlation function
This requires the choice of a dynamic model. For the model corresponding to
diffusion in a cone, Kinosita et al. showed that (22)
It may be shown (34) that is equal to the area under the curve of
where are the correlation time of the particle and the effective
correlation time of the label, respectively, and is given by Eq. (1.59). If
then Eq. (1.62) reduces to the following relationship:
Fluorescence Anisotropy: Theory and Applications 19
Here, depends solely on the localized rotation of the label, and reflects
the rotation of the entire macromolecule. The sum of is equal to the
anisotropy at zero time, A0.
For the case where the bond linking the fluorophore to the macro-
molecule has a sufficient degree of rotational wobble so that the fluorophore
may rotate freely within a cone, the magnitudes of a1 and a2 are given by an
equation analogous to Eq. (1.59)(22,34).
where is the semi-apex angle of the cone. Lipari and Szabo have shown
that Eq. (1.65) is valid for a probe with cylindrical symmetry, for which either
the absorption or emission moment is parallel to the cylindrical axis.(34) This
model implies the existence of a square-well potential which restricts rotation.
If wobble is absent, so that the emission moment is confined to the sur-
20 Robert F. Steiner
face of a cone, as would be the case if the emission moment makes a constant
angle with the axis of rotation, then we have
The preceding treatment is strictly valid only for the case when either
is parallel to the axis of the probe. If this is not the case, rotational
diffusion about the axis of the probe, as well as the rotational wobble of the
axis itself, can contribute to the time decay of anisotropy. The theory now
becomes considerably more complex. Lipari and Szabo have derived the
following approximate expression for the time dependence of anisotropy(34):
the time spent in transit between positions. In this case the rate-limiting factor
for rotation is the probability of a jump between specific orientations. The
following equation was derived by Gottlieb and Wahl for this model(20):
favorable cases where the probe is firmly immobilized in the tertiary structure,
the detection of a correlation time close to that expected for a well-defined
molecular domain is evidence for the free rotation of the domain.
where g is a factor correcting for the (usually imperfect) optical system and
is given by
where are the amplitude and decay time, respectively, of the ith
decay component.
The values of the may be determined from the experimental data
by a least-squares fitting procedure.(36) Trial values of the are used to
generate the intensity decay function S(t). For each set of the values
of the are obtained by solution of the corresponding set of equations linear
in the which are obtained for different times. The computed curve of
Fluorescence Anisotropy: Theory and Applications 23
Here, Ai and are the amplitude and rotational correlation time, respec-
tively, corresponding to the ith rotational mode, and is the limiting value
of the anisotropy attained at very long times after excitation. For fluorescent
molecules which are not partially, or wholly, immobilized in a matrix,
In evaluating the the previously computed values of the and
may be used to generate an impulse response function which represents
S(t), according to Eq. (1.77).(37) Trial values of the Ai and are used to
compute an analogous function for the anisotropy, A(t), according to
Eq. (1.79).(37) A(t) is then multiplied by S(t) to yield a trial representation of
the deconvoluted difference decay function D(t). D(t) is next convolved with
the time profile, E(t), of the excitation pulse, according to Eq. (1.18), to
generate a computed difference decay function is compared
with the experimental difference decay curve and a value of is
computed.
24 Robert F. Steiner
For each set of trial values of the Ai and the corresponding value of
is computed from
where are the values of these quantities for the jth data
point. The precision is taken as equal to where V(j) is the
associated variance. The optimum values of the Ai and are identified by an
iterative procedure similar to that used to determine the decay times.
The ultimate criterion of the quality of fit associated with a particular set
of parameters is the normalized value of This is equal to where
F is the number of degrees of freedom and is equal to the number of data
points (time channels) minus the number of determined parameters. For a
perfect fit, would have a value of unity.
A second useful criterion of quality of fit for both intensity and
anisotropy decay data is the distribution of residuals, that is, the difference
between observed and computed values. For an optimum fit the distribution
of residuals plotted as a function of time should be essentially random.
An alternative criterion of quality of fit is the Durbin–Watson parameter,
which is defined by (38)
If only a single intensity decay mode is present, the phase and demodulation
factor are related to the decay time by
and
If multiple decay modes are present, the apparent values of computed from
Eqs. (1.86) and (1.87) will correspond to averages. Ih this case measurements
over a range of frequencies are necessary in order to obtain well-defined
physical parameters. The phase angle and modulation values may be pre-
dicted as a function of frequency for any postulated decay law. The assumed
decay law contains the information necessary for this prediction, just as it also
contains the information required to predict the directly observed time
dependence of fluorescence.
For frequency-domain measurements, sine and cosine transformations of
Fluorescence Anisotropy: Theory and Applications 27
the assumed decay function (Eq. 1.77) are used instead of convolution with
the excitation lamp profile. If the quantities are defined by
then the phase angle and modulation may be predicted for each frequency
from
and
where are the computed values of the phase angle and demodula-
tion factor, respectively, for frequency
The quantities may be related directly to the intensity decay
parameters by
The goodness of fit between the computed and observed values of phase
angle and modulation may be assessed from the value of
where
The quantities N and D may be evaluated for the vertically and horizon-
tally polarized components, using Eqs. (1.88) and (1.89). The quantities which
28 Robert F. Steiner
are actually measured are the phase angle difference between the two
components and the ratio of the modulated amplitudes:
and
where n is the number of degrees of freedom (the number of data points minus
the number of variable parameters), and are the estimated
uncertainties at frequency respectively.
In fitting actual data, trial values of the correlation times, are used to
compute the anisotropy as a function of time. The previously determined set
of intensity decay times, are used to compute the intensity, i(t), as a func-
tion of time. The substitution of A(t) and i(t) in Eqs. (1.95) yields computed
values of introduction of these into Eqs. (1.88) and (1.89)
provides values of as a function of frequency. Finally,
Eqs. (1.97) and (1.98) are used to generate trial values of
these are compared with the observed values, and a value of is calculated.
The set of assumed correlation times is varied systematically by an iterative
process until the set yielding a minimum value of is identified.
and
and
where now refers to the jth time window, which extends from time
channel aj to bj for the jth time-resolved emission spectrum. Since convolution
affects solely the time axis, the matrix elements now are
Also,
transition between different potential energy minima and the rate-limiting step
is the probability of release from a position of minimum energy. Alternatively,
if the tryptophan rotates freely within a cone, the solvent composition and
effective viscosity in its vicinity may be different from those of the bulk
solution.
Lakowicz et al. have subsequently reexamined the ACTH system using
frequency-domain measurements.(46) There was qualitative agreement with the
time-domain results in that the time decays of both fluorescence intensity and
anisotropy were found to be biexponential (Table 1.1). As in the earlier study,
the anisotropy decay could be described by a subnanosecond correlation time,
reflecting the localized motion of the tryptophan, and a longer correlation
time corresponding to the motion of all, or a major portion of, the molecule.
While the values of the correlation times are smaller than those found by Ross
et al., a quantitative comparison is difficult in view of the different experi-
mental conditions. However, the magnitude of the shorter correlation time for
36 Robert F. Steiner
both sets of data is such as to suggest that it reflects the motion of several
amino acids.
It remains uncertain whether the rapid rotational mode reflects solely the
motion of the tryptophan with respect to the balance of the protein or
whether neighboring residues are involved. It also remains to be seen whether
only two rotational modes are strictly present or whether additional modes
may be detected with the development of techniques of higher resolution.
The amphipathic peptide melittin, which is isolated from bee venom,
consists of 26 amino acids.(48) The sole aromatic chromophore is a tryptophan
residue at position 19. In solution at low electrolyte concentration, melittin is
believed to exist as a largely structureless monomer. At high ionic strengths, self-
association occurs to form a tetrameric species, which is mostly
Lakowicz et al. have employed frequency-domain measurements to
examine the anisotropy decay of monomeric and tetrameric melittin. (49) In
each case, determinations were made in the absence and presence of the
quencher acrylamide. Dynamic quenching by acrylamide increases the fraction
of the total emission which occurs on the subnanosecond time scale, thereby
providing increased information about rotational motions in the picosecond
range. This is of particular usefulness in studying the internal motions of
proteins and peptides.
In 0.01 M Tris, pH 7, at 20 °C, where melittin is monomeric, the intensity
decay was found to be multiexponential and characterized by decay times of
0.2, 2, and 4 ns in the absence of quenching. The addition of increasing levels
of acrylamide resulted in a progressive reduction of the magnitude of the
decay times, which were equal to 20 ps, 260 ps, and 1.39 ns in 2 M acrylamide.
The anisotropy decay of monomeric melittin could be analyzed
acceptably in terms of two rotational modes (Figure 1.7). The corresponding
correlation times were essentially independent of the degree of quenching by
acrylamide, indicating that the decays of intensity and anisotropy were not
coupled and that acrylamide does not modify the rotational modes. Almost
60% of the anisotropy decay was associated with the more rapid rotational
mode, with a correlation time of about 160ps, while the remainder decayed
with a correlation time near 1.7 ns.
In the presence of 2 M NaCl, in which case melittin is tetrameric, the
intensity decay was also multiexponential and required the assumption of
three decay times for acceptable fitting. The decay times decreased from 0.2,
2, and 5 ns in the absence of quencher to 0.04, 0.3, and 1.2 ns in 2 M
acrylamide.
As in the case of monomeric melittin, the anisotropy decay was
biexponential. In this case the dominant rotational mode, which accounted for
about two-thirds of the anisotropy decay, corresponded to a correlation time
near 3.5 ns, while the balance of the anisotropy decayed with a correlation
time close to 60 ps.
Fluorescence Anisotropy: Theory and Applications 37
The longer correlation times observed for both monomeric and tetrameric
melittin are in the range expected for rotation of the entire molecules, to
which they can probably be attributed. The more rapid rotational mode must
arise from some form of internal rotation involving tryptophan. It is of interest
that the correlation time associated with the more rapid rotational mode is
longer for monomeric than for tetrameric melittin (160ps versus 60 ps),
presumably reflecting the differing contribution of segmental motion involving
tryptophan in the two cases. It is also worthy of mention that no convincing
evidence was found for the presence of a correlation time of magnitude 1–2 ps,
which would arise from the unhindered rotation of a single indole group.
Time-domain measurements of the anisotropy decay of melittin have been
made by Tran and Beddard.(50) Their findings agree qualitatively with those
of Lakowicz et al. in that short and long correlation times were observed; the
magnitude of the latter was in the range expected for rotation of the
entire molecule. The principal difference between the two studies was in the
magnitude of the shorter correlation time, for which Tran and Beddard found
a value of 600–700 ps, which is substantially larger than that reported by
Lakowicz et al. An anomalously low value of 0.14 was found for A0 for
excitation at 300 nm. It is possible that the time resolution of this study was
insufficient to recover the early portion of the anisotropy decay, resulting in
a low value of A0 and an elevated correlation time.(49)
reduced the decay time of fluorescence intensity to 590 ps. If the aromatic
portion of the molecule is approximated by an oblate ellipsoid with semiaxes
of 4 and 2 an average correlation time close to 40 ps may be computed for
the unhydrated molecule. Since the ribityl side chain, as well as bound water,
would be expected to increase the observed correlation time, the measured
value is in reasonable agreement with that expected. This study, which
probably approaches the practical limits of time-domain measurements of
anisotropy decay, underlines the importance of rigorous deconvolution from
the excitation profile, especially for short correlation times. In the present
case, simulated data indicated that, in the absence of deconvolution, the
observed time profile of anisotropy is severely distorted by the finite duration
of the excitation pulse.
The anisotropy decay of the lumazine-containing protein was also
examined. It could be adequately fitted in terms of a single correlation time
of 9 ns (19 °C). The addition of sufficient KI to reduce the decay time of
fluorescence intensity from 14 to 3 ns did not alter significantly the computed
correlation time, although the shape of the observed anisotropy decay curve
was considerably distorted. This provided a confirmation of the adequacy of
the deconvolution procedure.
The lumazine-containing protein has a molecular weight of 2.1 x 104. An
anhydrous particle of this molecular weight with spherical symmetry would
have a predicted correlation time of about 6 ns. It is probable that the
difference between the predicted and observed values may be attributed to the
combined effects of hydration and of deviation from a strictly spherical shape.
The lumazine-containing protein can associate with luciferase to form a
protein complex of total molecular weight 1.0 × 105. The quantum yield of the
lumazine fluorophore is not altered by the complex formation. The dissocia-
tion constant may be controlled by varying such parameters as ionic strength
and temperature. In view of the constancy of the quantum yield and the
observation that the anisotropy decay of the lumazine protein in both the free
and complexed states can be adequately described by a single correlation
time, the preexponential factors ai in the equation
quantum yield of the Y base. Beardsley et al. found that the intensity decay
could be described by a single decay time of 4.3 ns, which increased to 6.3 ns
in the presence of 10mM Mg 2 + . However, the quality of the fits was poor,
as judged from the values, suggesting the presence of unresolved multi-
exponential decay.
Parallel time-domain measurements of anisotropy decay were analyzed in
terms of a single rotational mode. Correlation times of 9.2 and 9.8 ns were
found in the absence and presence of Mg 2 + , respectively. The predicted
correlation time for a rigid anhydrous spherical molecule of the same
molecular weight is about 15 ns. Since the effect of hydration, or any deviation
from spherical symmetry, would be to increase the average correlation time,
the above value is a lower limit. It was accordingly concluded that internal
rotational modes involving the Y base were present and that tRNA P h e pos-
sesses a significant degree of flexibility.
Wells and Lakowicz have recently reexamined this system by making
frequency-domain measurements of intensity and anisotropy decay.(53) With
the improved resolution of the current instrumentation, it is evident that
at least two decay times are required to fit the intensity decay (Table 1.3).
In both the absence and presence of Mg 2+ , the intensity decay could be
adequately described by a short decay time of about 2 ns and a longer time
near 6 ns. However, the relative contributions of the two depend on the Mg 2 +
level. In the absence of Mg2 + , the amplitudes associated with the two decay
times are approximately equal; the addition of Mg 2 + increases the relative
amplitude corresponding to the 6-ns decay time to over five times that for the
shorter decay time.
A plausible explanation for the above observations is that the Y base
exists in two microenvironments which result in different decay times. A Mg 2 + -
induced conformational change favors the microenvironment associated with
the 6-ns decay time.
Anisotropy decay studies also indicated a major influence of Mg 2 +
Fluorescence Anisotropy: Theory and Applications 41
(Table 1.4). In both the absence and presence of Mg 2+ , the anisotropy decay
could be accounted for in terms of two rotational modes, corresponding to
correlation times near 18ns and 0.3–0.4 ns. The presence of Mg 2+ substan-
tially increased the relative contribution of the slower rotational mode, which
now dominated the anisotropy decay. It is probably that the longer correla-
tion time represents the global rotation of the molecule, while the shorter
arises from a localized motion of the fluorophore. An obious explanation is
that the anticodon loop, which contains the Y base, is somewhat flexible in
the absence of Mg2 + , with a significant degree of mobility of the bases.
The binding of Mg 2+ constrains the anticodon loop into a relatively rigid
conformation in which the mobility of the bases is substantially reduced.
The muscle protein myosin is roughly Y-shaped, with a rodlike stem and
two globular (S-l) units at the head. A requirement of currently popular
models for the process whereby a myosin “cross-bridge” is linked to the thin
actin filaments within a muscle fiber and causes a mechanical thrust is that a
flexible hinge point be present within the myosin molecule.(54) Mendelson
et al. have employed anisotropy decay to obtain direct evidence for such a
hinge point.(54)
There is present in each S-l unit a single highly reactive sulfhydryl group,
which is convenient for the selective attachment of iodoacetamide derivatives.
Mendelson et al. employed the iodoacetamide derivative 1,5-IAEDANS
[N-iodoacetyl-N 1-(5-sulfoamino-l-naphthyl)ethylenediamine], which reacts
with sulfhydryl groups and whose fluorescence properties resemble those of
42 Robert F. Steiner
time reverted to that of free F-actin, thereby ruling out an artifact arising from
the denaturation of F-actin.
In the presence of the anisotropy decay parameters of
F-actin were substantially altered from their values in A biexponential
fit yielded in this case a1 =0.04, a2 = 0.25, =5.8ns, and = 682ns. The
longer correlation time thus undergoes a major increase in the presence of
The addition of HMM in the presence of resulted in a biphasic
response of the magnitude of the longer correlation time, which passed
through a minimum at a mole ratio of HMM to actin of 0.02, followed by an
increase. The addition of S-l under these conditions produced a similar
biphasic pattern.
These findings suggest that some kind of segmental flexibility exists in
F-actin; the degree of flexibility is dependent upon conditions. The shorter
correlation time, presumably arises from some form of localized rotational
motion, while the longer time, corresponds to the concerted motion
of a larger unit, probably a set of actin monomers. The initial decrease in
resulting from complex formation with HMM or S-l perhaps arises from an
increase in the flexibility of the links joining actin monomers, while the
increase observed at higher levels of HMM in the presence of and at
all levels of HMM or S-l in the presence of may reflect a stiffening of
these contacts.
The molecular dynamics of F-actin have also been studied by other
physical techniques, with results which differ quantitatively from those
summarized above. Thus, correlation times of and have been
computed from quasi-elastic light scattering and from saturation transfer
electron spin resonance, respectively.(58,59) In both cases a substantial increase
in correlation time occurred upon complex formation with HMM. While the
reasons for the varying results obtained with the different techniques remain
obscure, it seems clear that different molecular motions are being sensed by
these methods.
The independent rotation of the Fab units may be a significant factor in the
antibody function, facilitating the combination with antigen.
More recently, these studies have been continued by Lovejoy et al.,(63)
who utilized rabbit antibodies directed against the hapten pyrenebutyrate
(PBA). This fluorophore has a much longer average lifetime than dansyl;
the value for the free hapten is about 100 ns for an air-saturated solution and
somewhat larger for an O2-free solution. The binding of PBA by the anti-
body resulted in a significant red shift of the primary excitation maxima
(from 326.5, 341.5 nm to 330.5, 347 nm) and of the emission maxima (from
375, 395 nm to 376, 396 nm). The average binding constant was sufficiently
high to permit virtually quantitative binding of hapten.
The time profile of fluorescence intensity decay was generally hetero-
geneous, except in the case of antibody produced with very long (11-month)
immunization times, for which a single fluorescence decay time of 157 ns was
found. The time decay of fluorescence anisotropy, which was similar for two
different preparations, was fitted to Eq. (1.64) by a least-squares procedure.
The values obtained were
The magnitudes of the correlation times are similar to those reported for a
different hapten by Yguerabide et al. Inasmuch as the time decay for PBA
could be monitored over a sufficient time interval to make possible the
accurate determination of the longer correlation time, these observations
strengthen the conclusion that mobility of the Fab units is a general property
of IgG molecules.
Holowka and Cathou have made analogous fluorescence dynamics
studies on the macroglobulin (IgM) class of antibodies.(64) Immunoglobulins
of the IgM class generally occur in animal sera as disulfide-linked pentamers
of total molecular weight near 900,000. Each monomer unit is somewhat
similar in structure to an IgG molecule, containing two light (L) and two
heavy chains linked by disulfide bonds and noncovalent interactions. A
third unrelated (J) chain is also present and may be involved in the assembly
of IgM from its subunits. A total of ten antigen-combining sites occur on its
ten Fab units.
Holowka and Cathou prepared IgM antibodies directed against
from horse, pig, and shark antisera obtained by immunization with a
dansyllysine streptococcal conjugate, which favors the formation of IgM
antibodies in these species.(64) The time decay of fluorescence intensity of
complexes with purified IgM molecules varied with the
species. For horse and pig IgM, the dominant component had a decay time
near 24 ns, while a secondary component had a decay time of 8–12 ns. In the
case of shark IgM the pattern was reversed, with the major component having
a short (4 ns) decay time. However, the contribution of the long decay time
was sufficient in all three cases to permit the monitoring of anisotropy for
times up to 200 ns.
48 Robert F. Steiner
References
1. P. Wahl, in: Biochemical Fluorescence (R. F. Chen and H. Edelhoch, eds.), Vol. 1, p. 1,
Plenum, New York (1975).
2. R. F. Steiner, in: Excited Stales of Biopolymers (R. F. Steiner, ed.), p. 117, Plenum, New York
(1983).
3. J. R. Lakowicz, Principles of Fluorescence Spectroscopy, p. 155, Plenum, New York (1983).
4. I. Munro, I. Pecht, and L. Stryer, Proc. Natl. Acad. Sci. U.S.A. 76, 56 (1979).
5. F. Perrin, J. Phys. 7, 390 (1926).
6. F. Perrin, Ann. Phys. (Paris) 12, 169 (1929).
7. F. Perrin, J. Phys. (Paris) 5, 497 (1934).
8. F. Perrin, J. Phys. (Paris) 7, 1 (1936).
9. F. Perrin, Acta. Phys. Pot. 5, 335 (1936).
10. G. Weber, Biochem. J. 51, 145, 165 (1952).
11. R. F. Steiner and A. McAlister, J. Polym. Sci. 24, 107 (1957).
12. A. Jablonski, Bull. Acad. Pol. Sci. Ser. Sci. Math. Astron. Phys. 8, 259 (1960).
13. W. R. Bennett, in: Advanced Quantum Electronics (J. Singer, ed.), Columbia University Press,
New York (1961).
14. L. M. Bollinger and G. E. Thomas, Rev. Sci. Instrum. 32, 1044 (1961).
15. Y. Koechlin, C.R. Acad. Sci. 252, 391 (1961).
16. P. Wahl, C.R. Acad. Sci. 260, 6891 (1965).
17. P. Wahl, C.R. Acad. Sci. 263, 1525 (1966).
18. P. Wahl and S. N. Timasheff, Biochemistry 8, 2945 (1969).
19. P. Wahl, J. Paoletti, and J. B. LePecq, Proc. Natl. Acad. Sci. U.S.A. 65, 417 (1970).
20. Y. Gottlieb and P. Wahl, J. Chim. Phys. 60, 849 (1963).
21. C. G. Belford, R. L. Belford, and G. Weber, Proc. Natl. Acad. Sci. U.S.A. 69, 1392 (1972).
22. K. Kinosita, S. Kawato, and A. Ikegami, Biophys. J. 20, 289 (1977).
23. I. Isenberg, in: Biochemical Fluorescence (R. F. Chen and H. Edelhoch, eds.), Vol. 1, p. 43,
Plenum, New York (1975).
24. V. J. Koester and R. M. Dowben, Rev. Sci. Instrum. 49, 1186 (1978).
25. W. R. Ware, in: Creation and Detection of the Excited States (A. A. Lamola, ed.), p. 213,
Dekker, New York (1971).
26. A. Jablonski, Z. Phys. 94, 38 (1935).
27. P. Soleillet, Ann. Phys. (Paris) 12, 23 (1929).
28. P. Wahl, G. Meyer, and J. Parrod, Eur. Polym. J. 6, 585 (1970).
29. R. Memming, Z. Phys. Chem. 28, 168 (1961).
30. J. Y. Yguerabide, Methods Enzymol. 26, 498 (1972).
31. T. Tao, Biopolymers 8, 609 (1969).
32. S. C. Harvey and H. C. Cheung, Proc. Natl. Acad. Sci. U.S.A. 69, 3670 (1972).
33. R. D. Dale and J. Eisinger, Biopolymers 13, 1573 (1974).
52 Robert F. Steiner
Fluorescence Quenching:
Theory and Applications
Maurice R. Eftink
2.1. Introduction
53
54 Maurice R. Eftink
The inverse of Eq. (2.2) is the classic form of the Stern–Volmer equation,
a relationship which describes the effect of quencher on the steady-state
fluorescence of a sample(13–15):
where D and are the sum of the diffusion coefficients and molecular radii,
respectively, of the quencher and fluorophore, and N' is Avogadro’s number
divided by 1000. The diffusion coefficient for each species can be predicted by
the Stokes–Einstein equation:
Fluorescence Quenching: Theory and Applications 57
where is the dynamic quenching constant for the ith species and
is the fractional contribution of the ith species to the total fluorescence in
the selected excitation and emission wavelength regions. If there are two
components I and the for one is times larger than that for the
second, the Stern–Volmer plot of versus [Q] will be downward curving.
By fitting Eq. (2.8) to the data, the values of and can be determined
(see Section 2.6.2). In Figure 2.3B are shown data for the acrylamide
quenching of a mixture of 3-methylindole and 2-methylindole
Data for three emission wavelengths are shown, for reasons
which will be presented in Section 2.6.1. For each wavelength the plots curve
downward very slightly at low quencher concentration.
Scheme 1
is the rate constant for the internal quenching process. The electronic
mechanism for the internal quenching process is thought to be different for
different quenchers. Molecular oxygen and paramagnetic species, for example,
are thought to quench aromatic fluorophores by an electron spin exchange
process (leading to rapid intersystem crossing and thus facilitating conversion
through the triplet manifold to the ground state). (15,18,19) Acrylamide, other
amides, and amines appear to quench via an electron transfer process, that is,
transfer of an electron from the excited singlet state to the quencher to form
a transient charge transfer complex [in this case (A ... Q) above may be
written as For other quenchers, the quencher may be
the electron donor, and the excited state may be the acceptor.(23) Quenchers
that possess halogens or other heavy atoms appear to quench by enhancing
intersystem crossing via a spin–orbital coupling mechanism. (24,25) Still other
quenchers may act by a resonance energy transfer mechanism, when spectral
overlap exists. Except for energy transfer, the quenching mechanisms appear
to involve close contact between the excited state and the quencher, and thus
the quenchers may be considered to be contact quenchers (see below). Orbital
overlap is thought to be necessary for quenching by oxygen. Electron transfer
quenching may show a very slight distance dependence of exp
where is a measure of the size of the molecular orbitals, r is the actual
separation distance, and is a constant that is near unity. (243)
It is difficult to experimentally determine the quenching mechanism for a
particular quencher. This is especially true for cases in which the value of
is very large compared to that of and that is, when the efficiency is
near unity. The quenching efficiency, for the reaction in Scheme 1 is equal
to
This expression for should apply for most situations; see Ref. 9 for further
discussion and alternate expressions for then
This means that every time an encounter complex forms, quenching
follows. When will be equal to [see Eq. 2.5)], the rate constant
for collision between the quencher and fluorophore. If ,
then and the encounter complexes may dissociate before quenching
occurs. For such inefficient cases, will be equal to which of course will
be less than When fluorescence quenching reactions are applied to bio-
chemical systems, it is desirable to employ an efficient quencher–fluorophore
pair, so that the interpretation of values will be more straightforward.
Molecular oxygen seems to be efficient for virtually all aromatic fluoro-
phores,(26) but acrylamide and iodide(22) are not efficient for all common
fluorophores. Inefficient systems need not be completely avoided, but more
60 Maurice R. Eftink
This complete version of Eq. 2.6 includes the transient term in the square
brackets; the symbols are defined above. In principle, is the sum of the van
der Waals radii for the two reactants, but in reality it may be slightly
larger.(32) Sveshnik off (35) was the first to apply this rate expression to solute
fluorescence quenching reactions, and, in doing so, he introduced a proba-
bility factor (i.e., an efficiency term) to account for the possibility that only
a fraction of the collisions may be effective in quenching. Collins and
Kimball(36) modified this theory to include the possibility that not every
approach to results in quenching. Instead, Collins and Kimball used the
Fluorescence Quenching: Theory and Applications 61
Due to the transient term, the decay time will be time-dependent. At times
immediately following an excitation pulse, the decay will be rapid, as a result
of the term. For typical values of and
the transient term will be larger in magnitude than the steady-state term
until At the transient term will be only 5 % of
the steady-state term, and the transient term can be neglected at times longer
than this. This transient term will be folded into the steady-state decay rate
and will often be difficult to observe. For smaller D values, or for fluorophores
with smaller the transient term will be of greater significance.
Several workers have experimentally demonstrated the existence of such
a transient term in fluorescence quenching reactions. Nemzek and Ware (32)
62 Maurice R. Eftink
indole ring. The fitted D values were somewhat larger than expected with
Eq. (2.11), but were reasonable for Eq. (2.10). Lakowicz and co-workers have
also recently extended this treatment to the analysis of lifetime fluorescence
quenching data with proteins, and they believe that in some cases the
transient term makes a significant contribution to the quenching rate.(41)
More on this will be presented in Section 2.3.8.
where is the association constant for the one-to-one complex. (In fact,
there are some cases in which complex formation is the dominant quenching
process; see, for example, Ref. 244 and references therein.) For the probability
of the nonspecific occurrence of quencher–chromophore neighbors, one can
define an “active volume” element, V, surrounding the chromophore. If a
quencher (one or more molecules) exists within this volume at the instant that
the chromophore becomes excited, “static” quenching is assumed to occur
instantaneously. In this case, the modified Stern–Volmer equation is (40)
Note that Eqs. (2.13) and (2.14) are similar, since is simply the expan-
sion of exp(x) when x is small. Either modified Stern–Volmer equation has
been used; we prefer the former only when a one-to-one complex is believed
to form.
Yguerabide et al.(37) and Nemzek and Ware (32) have shown that the
Fluorescence Quenching: Theory and Applications 65
transient effect discussed above leads to the following modified form of the
steady-state Stern–Volmer equation, which is similar in form to Eq. (2.14):
where
The factor embodies the quenching that is caused by the transient term.
The term will lead to a slight upward curvature in a steady-state
Stern–Vomer plot. Furthermore, Andre et al.(33) included a factor for true
static quenching to give the following complete form of the Stern–Volmer
equation:
demonstrate that Eq. (2.14) can describe the data, regardless of the cause of
the apparent static effect. I also show calculated lifetime Stern–Volmer plots,
for lifetime values that would be measured by the phase lag method. Note that
the lifetime plots are also predicted to curve upward, although to a lesser
degree than the intensity plot, due to the transient effect. The dashed line
gives the dynamic quenching component, as given by the steady-state
Fluorescence Quenching: Theory and Applications 67
the exposure of Trp residues in proteins. Figure 2.9 also contains several
points for the iodide quenching of single-Trp proteins. The slope is larger
(~1.6), and the plot indicates, as expected, that iodide is a more selective
quencher of surface Trp residues than is acrylamide. (Of course, electrostatic
effects play a role in the selectivity of iodide quenching.)
Static quenching is sometimes seen for the quenching of these single-Trp
proteins. This is most often seen for acrylamide as quencher, but some
examples with oxygen have also been reported.(57) Figure 2.10 shows data for
the oxygen quenching of asparaginase and ribonuclease These plots
show a comparison of and Stern–Volmer plots. The larger slope (and
upward curvature that is sometimes discernible) in the former is indicative of
a contribution from static quenching. Generally, such static contributions are
found to be smaller than the dynamic contribution. Human serum albumin
has an exceptionally large degree of static quenching by acrylamide (51) and
oxygen,(52) and the static component for oxygen quenching of asparaginase
(Figure 2.10), while small in magnitude, is relatively large when compared to
the dynamic quenching component.
In simplest terms, such static components must mean that there is a finite
probability that the quencher exists near a Trp residue at the instance of
excitation. One must keep in mind that the static quenching constants seen in
proteins are generally smaller in magnitude than those seen for quenching of
indole or tryptophan in water. Thus, the static quenching for proteins
apparently does not represent a large partitioning of quenchers into the
Fluorescence Quenching: Theory and Applications 75
protein matrix next to the Trp residues, but it also indicates that quenchers
are often not excluded from a steady-state existence near Trp residues inside
proteins. In a following section, I will comment on transient quenching effects
in proteins. This phenomenon may also contribute to an apparent static
quenching, particularly when one compares intensity data and average
lifetime data that are measured at a single frequency (via phase fluorometry).
azurin, and phospholipase and the interaction of these with small ligands,
to very large lipoprotein and nucleoprotein complexes.
An elegant application of solute quenching, and other fluorescence
methods, is the study by O’Neil et al.(91) of complexes between calmodulin
and a series of basic, amphiphilic, peptides. These peptides contained
a single Trp residue, which was systematically positioned throughout the
sequence. In the resulting complexes, the accessibility of the Trp residues to
acrylamide was found to vary in a periodic manner (repeat unit of 3 to 4
residues), consistent with the periodicity of the
to be larger than the van der Waals In fact, for efficient quenchers such
as acryfamide, an of about 7 is calculated for the quenching of indole in
water. (40) This is close to the sum of the molecular radii of indole
and acrylamide Also, analysis of multifrequency phase/modulation
lifetime data for the indole–acrylamide reaction, in terms of the radiation
boundary form of the time-dependent Smoluchowski equation, yields
reasonable values.(39) Of course, the calculated value may be com-
promised by a slight degree of inefficiency, but the point is that model system
studies are consistent with requirement of contact for the quenching reaction.
Recently, we have prepared covalent adducts containing an indole ring
and an acrylamide moiety, which are separated by one or two bonds. We
find that intramolecular quenching occurs. Further study is needed to
evaluate the extent to which this represents quenching over a distance, or
quenching by intramolecular collisions between the groups.
At this time we cannot eliminate the possibility that some electron transfer
over a distance occurs in the acrylamide quenching of Trp fluorescence in
proteins, but it seems likely that quenching must involve very close approach,
if not contact, between the reactants.
The question of an unfolding process versus an inward penetration
process can be expressed by the two following kinetic schemes(84,117,123).
for the penetration model are taken from Gratton et al.,(117) who provided a
thorough kinetic description or this model. The apparent rate constants for
solute quenching will be, for the penetration model,
Thus, expressions for the rate constant are of slightly different form. (These
models are analogous to the unfolding and penetration kinetic models for
hydrogen exchange in proteins.) (112,118) If the term in the denominator
of Eq. (2.20) is larger than then the two rate expressions are dis-
tinguishable, for Eq. (2.20) predicts a downward-curving Stern–Volmer plot
(for a single type of fluorophore!). If, however, in Eq. (2.20), then
the two rate expressions are not easily distinguished. In Eq. (2.19), may be
limited either by diffusion of the quencher through the solvent (when
or by penetration through the protein matrix (when In Eq. (2.20),
will be the product of the rate constant for diffusion through the solvent times
the equilibrium constant for the segmental unfolding transition.
The two models represent extreme kinetic mechanisms (in one the
quencher goes in, in the other the fluorophore come out), and in reality the
difference may be subtle. We suspect that for some buried fluorophores a
penetration mechanism may be a better model, while for others an unfolding
mechanism may be required. For some of the proteins with single internal Trp
Fluorescence Quenching: Theory and Applications 81
In studies with different quencher types, one can vary the charge, size,
and efficiency of the quencher. One should, of course, avoid comparing
“contact” quenchers (see Section 2.2.2 for qualification of the term “contact”)
quenchers, such as oxygen, acrylamide, and iodide, with those which probably
quench over a distance (i.e., via resonance energy transfer), such as nitrite and
methyl vinyl ketone.(111) Figure 2.9 shows a comparison, for several single-Trp
proteins, of the quenching rate constants for oxygen, acrylamide, and iodide.
For proteins with buried Trp residues, such as ribonuclease and par-
valbumin, the rate constants vary in the order > acrylamide > iodide. Here
again, the pattern is indicative of a penetration quenching process, with
oxygen being very effective (large ) and the iodide ion being very poor (small
) at penetrating into the globular structures. If an unfolding mechanism were
to hold for all quenchers except oxygen, one would expect to see a similar
for most quenchers. This is because the would be the same for all
quenchers. This clearly is not the case for most proteins.
In studies with the slightly inefficient quencher succinimide, we obtained
what may be the strongest evidence for a penetration model. (17) Succinimide
has a quenching efficiency of about 0.7 in aqueous solution and is about 20%
larger in diameter than acrylamide. In comparative studies with several single-
Trp proteins, the ratio of the apparent quenching constants for succinimide
and acrylamide, was found to range from ~ 0.1 to
~0.7 (see Figure 2.15). Proteins having relatively buried Trp residues were
found to have small values of That is, succinimide quenches these with
a smaller rate constant than does acrylamide. Proteins with relatively solvent-
exposed Trp residues, such as glucagon and adrenocorticotropin, were found
to have larger values. This wide range of values could be due to the
critical size dependence of the dynamic penetration of quencher through a
protein matrix. However, we also discovered another explanation, that being
the inherent dependence of succinimide quenching on the microenvironment
of the indole ring. Whereas acrylamide is found to be ~100% efficient at
quenching the fluorescence of indole in all solvent ( being dependent only
on the inverse of the solvent viscosity, as shown in Figure 2.2), we found that
succinimide is a relatively inefficient quencher in aprotic solvents. For example,
succinimide is 70% as efficient as acrylamide in water, but is only 15% as
efficient in dioxane, 10% as efficient in acetonitrile, and 1% as efficient in
dimethylformamide. Thus, a low for a Trp residue in a protein could be
due to the aprotic microenvironment in which the quenching takes place.
While there are two possible explanations (i.e., a critical size dependence or
an aprotic microenvironment dependence) for the small for buried Trp
residues in proteins, both explanations are consistent with the dynamic
penetration model for solute quenching reactions. That is, low for buried
Trp residues can be explained as being due to the fact that succinimide is
slightly larger than acrylamide and thus will not penetrate as well or as being
Fluorescence Quenching: Theory and Applications 85
due to the fact that succinimide, upon penetrating to reach an internal Trp,
experiences an aprotic microenvironment and thus will not quench well. The
unfolding mechanism, on the other hand, offers no means of explaining the
wide range of since in this mechanism the quenching reaction is assumed
to occur when a Trp is periodically exposed to the aqueous environment and
thus no dependence on the size of the quencher is predicted.
then the apparent values (obtained with reference to the bulk quencher
concentration) will give an overestimate of the kinetic exposure of the
fluorophore. Blatt et al.(114) have recently emphasized this point, in com-
parison with quenching studies with micelles (see Section 2.4.1). Based on
measurements of the acrylamide quenching of proteins as a function of
protein concentration, they calculated partition coefficients in the range of 30
to 100 for the interaction of acrylamide with serum albumin, monellin, and
-lactoglobulin. We have investigated this matter using both fluorescence
lifetime and intensity measurements and find no significant dependence of the
acrylamide quenching of serum albumin and monellin on protein concentra-
tion. ( 1 2 7 ) Furthermore, equilibrium dialysis measurements show no significant
interaction of acrylamide with serum albumin. Thus, we do not find evidence
for a high local concentration of this quencher in the globular structure of
these proteins.
The evidence presented above for static quenching suggests that there is
a certain probability that a quencher molecule (particularly neutral quenchers
like acrylamide and oxygen) can exist adjacent to a Trp residue at the instant
that excitation occurs. The magnitude of static quenching constants for the
quenching of Trp residues in proteins is less than that for aqueous indole,
however, for all proteins studied. Thus, the small degree of static quenching
in proteins does not indicate a strong binding of the quencher to the Trp
residues. The observed static quenching may be more correctly attributed to
transient effects as discussed below.
There is some evidence that certain quenchers may interact specifically
with certain proteins. For example, acrylamide is an inhibitor of the enzymatic
activity of alcohol dehydrogenase, which is not surprising in view of its
structural similarity to another strong inhibitor, isobutyramide. (81) Acrylamide
is also a weak competitive inhibitor of chymotrypsin, (127) a weak activator of
trypsin, (127) and a weak competitive inhibitor of cytochrome P450C-21.(128) For
several other enzymes, there is little or no effect of acrylamide on their
activity. (51,104,110) Acrylamide will covalently react with lysine and cysteine
side chains at high pH, (129,130) but there is no indication that the adducts
produced will act as quenchers.
The specific interaction of the charged quencher iodide with serum
albumin is well known, (67,131) and nonspecific electrostatic interactions
between a quencher and a macromolecule-associated fluorophore must always
be considered (see Section 2.3.3).
It is reasonable to suspect that the nonpolar quencher oxygen will parti-
tion weakly into the oily core of proteins. However, the degree of static
quenching by oxygen is not unusually large for most proteins.(52,71) Jameson
et al.(132) for example, fitted their data for the oxygen quenching of the
porphyrin (iron-free) fluorescence of myoglobin and hemoglobin with modest
partition coefficients of 0.3 to 0.6. Trichloroethanol is another nonpolar
Fluorescence Quenching: Theory and Applications 87
quencher which one would expect to show a tendency to interact with oily
regions in proteins. This interaction may occur with human serum albumin,
but, for most proteins, quenching by trichloroethanol (at low quencher
concentration) shows a pattern similar to that by acrylamide.(133) At high
(0.2 to 0.5 M) concentrations, trichloroethanol appears to induce a change in
the conformation of some proteins. This solvent-induced transition is thought
to be similar to that induced by 2-chloroethanol, which involves interaction
with the nonpolar side chains of the protein and the subsequent unfolding of
the globular structure.(135)
It is always necessary to consider the possibility that a quencher interacts
with the system being studied. However, we believe that the evidence indicates
that the commonly used neutral quenchers, oxygen and acrylamide, do not
partition into proteins to a significant degree.
In cases where specific quencher–protein binding occurs, it does not
necessarily follow that this leads to enhanced quenching. If the quencher inter-
acts near a Trp residue, this would probably produce quenching, but interac-
tion at a remote site may not cause a change in fluorescence. An example
in which quenching can be attributed to the specific binding of quencher
is cytochrome P450C-21.(128) The binding of acrylamide results in a static
quenching of the Trp fluorescence of this protein, and the association constant
and static quenching constant are both found to be about 10
There are data that demonstrate that the transient term of the
Smoluchowski equation (Eq. 2.10) must be included to fit time- and fre-
quency-domain measurements of the solute quenching of indole and other
simple fluorophores in isotropic solution.(32,33,37,39) Lakowicz et al.(39) have
studied whether such transient effects can be observed in the solute quenching
of Trp fluorescence in proteins. One protein that they studied was nuclease
from Staphylococcus aureus.(41) This single-Trp protein shows a fluorescence
decay which is nearly a single exponential. When acrylamide or oxygen is
added, the fluorescence decay becomes more nonexponential. This is illustrated
(for oxygen) by the phase–modulation data in Figure 2.16A. Notice that
Lakowicz et al. were able to use a modulation frequency as high as 2000 MHz
(as in Figure 2.4A) due to their novel application of a microchannel plate
detection system.(134) Such high frequencies are necessary to enable the short
lifetime contributions to be revealed.
The phase–modulation data for acrylamide-quenched nuclease can be
fitted by a double-exponential decay law. However, Lakowicz et al.(41)
demonstrated that the data can also be fitted by a transient effects model,
using either the time-dependent Smoluchowski equation (Eq. 2.10) or the
“radiation boundary” form of the Smoluchowski equation (Eq. 2.11). In fact,
88 Maurice R. Eftink
Lakowicz et al. found that the radiation boundary transient equation provides
a superior fit for the solute quenching of nuclease by both oxygen and
acrylamide. This is illustrated in Figure 2.16B by the lower and the more
uniform deviation plots for the fits of the radiation boundary model. The
fitting parameters obtained for this model were D, the effective diffusion coef-
ficient for the quencher–fluorophore reaction, and k, the intrinsic quenching
Fluorescence Quenching: Theory and Applications 89
Both the rate constant for solute quenching and the fluorescence life-
time of a fluorophore in a protein may not be discrete values. Instead
there may be multiple or even pseudo-continuous distributions of or
values. (137,138,141) Consider the model for a protein in Figure 2.17. Three routes
are shown for the penetration of a quencher to an internal fluorophore. If this
is a reasonable model, then the apparent will have contributions form the
three routes. Furthermore, if the different routes have different activation
energies, then one would expect to see a curved Arrhenius plot. We have
carefully studied the temperature dependence of the acrylamide quenching of
ribonuclease (Figure 2.12), and we find a linear Arrhenius plot from 10 to
45°C. This indicates that either (a) there is one dominant rate for the collision
of the quencher with Trp-59 of this protein, or (b) that all routes have about
the same thermal activation energy. Figure 2.17 gives a model with multiple
but one could also argue for multiple values by consideration of the
existence of multiple conformational states of a protein.
There has also been much discussion recently about the fact that the
fluorescence decay of individual Trp residues in proteins is not usually mono-
exponential (see Refs. 66, 74, 137, and 138-143). This again may be due to the
existence of multiple conformations of proteins and may be best described as
a pseudo-continuous distribution of decay times. (140,141) In Section 2.6.3 I will
discuss the consequences of such nonexponential lifetimes on Stern–Volmer
quenching plots.
92 Maurice R. Eftink
Solute quenching reactions have been used quite often in studies with
micelles and membrane systems. Much less has been done with nucleic acids.
Here we will focus on the significant differences in the application of the
solute quenching method to such structures. With micelles and membranes,
quenching reactions are controlled by the extent to which the quencher enters
the hydrocarbon-like subphase. Polar or charged quenchers do not enter, to
a significant extent, into most lipid subphases. As demonstrated by the work
of Shinitzky and Rivnay (44) or Pownall and Smith, (145) charged quenchers,
such as N-methylpicolinium, iodide, or cesium ions, can be used to determine
the aqueous surface accessibility of fluorophores in lipid assemblies. (Many
similar applications of charged and polar quenchers to assess surface
accessibility are given as entries 1–17 in Table 2.6.) Nonpolar quenchers can
partition into the lipid subphases, as discussed in Section 2.4.1 below, and this
can lead to enhanced quenching. Upward-curving Stern–Volmer plots are
often seen when nonpolar quenchers are used with micelles and membranes,
and this is probably due to the transient term in the Smoluchowski equation,
as well as true static quenching. Since diffusion is often limited to two dimen-
sions, a different form of the Smoluchowski equation must be considered
(Section 2.4.2). Some quenchers have been made to include a quencher moiety
as part of a fatty acid or phospholipid molecule. These “quencher lipids”
provide advantage in assessing the location and lateral mobility of membrane-
associated fluorophores and the quencher itself (Section 2.4.3).
Below we will expand on these aspects of quenching reactions applied to
micelles and membranes.
(Eq. 2.22) applies, but the quencher concentration term will be that in the
lipid phase,
where is the bimolecular rate constant in the lipid phase, and V is a static
quenching constant. If one defines a partition coefficient as
where is the quencher concentration in the aqueous solvent phase, then,
from Eq. (2.23), which is a conservation of mass relationship, one can derive
an expression (Eq. 2.24) for
the fluorophore and may involve jumping of the quencher from its binding
site to strike the fluorophore. Blatt et al.(149) discussed the possibility that
bound quenchers may quench only by a static mechanism. Regardless, the
resulting Stern–Volmer equation will have a maximum of four unknown
parameters ( and ), and simultaneous nonlinear least-squares
analysis of data sets at different ratios may enable fits to be obtained.
Again, a graphical procedure is described by Blatt et al.(149) for determining
the fitting parameters. These researchers have also pointed out that both
partitioning and binding of the quencher may occur together in a system, and
they have provided some interesting simulations. It should also be pointed out
that Eq. (2.25), and the discussion that follows, assumes that quenching
occurs only by lipid-associated quencher. If some quenching occurs by
quencher from the aqueous phase, an extra term must be added to
Eq.(2.25).
The advantage of a complete analysis of quenching data as a function
of the ratio is that one can (a) obtain intrinsic rate constants for
quenching, and (b) determine the way in which the quencher associates with
the lipid phase. If one were to work at a single ratio, only an apparent
value could be obtained, and it would be a function of (or n
and ), and that is, it would be incorrect to interpret such (app)
in terms of the microviscosity of the lipid phase. By resolving and
parameters are obtained that can be related to the physical characteristics of
the quencher and the lipid phase.(151) When values are large, they may be
difficult to determine. Omann and Glaser(148) have developed a protocol,
in which excess nonfiuorescent membrane vesicles are added, to aid in the
determination of large values.
In Figure 2.18 are shown typical data for the quenching in a compart-
96 Maurice R. Eftink
where is the vertical distance between the bilayer center and the more
shallow quencher, and is the vertical distance between the quenchers
on the two quencher lipid molecules. Using pairs of nitroxide-labeled
phospholipids, Chattopadhyay and London(250) employed this parallax
method to determine the penetration depth of various membrane-bound
fluorophores.
The above type of lipid quenchers are also useful because they can par-
ticipate in phase transitions and phase separations like other phospholipids.
Measurement of the quenching by these agents as a function of temperature
can reveal differences in the fluidity of the gel and liquid-crystalline states.(166)
Also, London and Feigenson(159) have shown that the relative affinity of other
types of phospholipids for proteins can be measured via their displacement of
such lipid quenchers from the boundary region about embedded proteins.
These workers used this method to study the interaction of phospholipids
with -ATPase from sarcoplasmic reticulum (159) also see Ref. 165 for an
earlier, less specific application).
Another interesting strategy is to selectively incorporate a quencher lipid
molecule into one monolayer of a vesicle and to then observe the degree of
quenching of a fluorophore (i.e., protein) that is incorporated into the
opposite monolayer.(167)
as a quencher and transport species. has about the same ionic radius
as , and they found that the acetylcholine receptor, embedded in mem-
brane vesicles, will facilitate the uptake of into the vesicles. A fluorescent
probe (8-amino-l,3,6-naphthalenetrisulfonate) was loaded into the inner
aqueous volume of the vesicles, and the inward flux of was then
monitored by quenching of the fluorophore, following stopped-flow mixing.
The inward flux was described by the relationship
The value of k, determined by analysis with the above equation, was further
related to a transport number per channel per second, from knowledge of the
vesicle size and the number of receptors per vesicle.
The electrostatic potential on a membrane surface can be estimated using
ionic quenchers, as demonstrated by Winiski et al.(182) These workers used
and tempamine (4-amino-2,2,6,6-tetramethylpiperidine-l-oxyl) as cationic
quenchers of the fluorescence of 2-(N-hexadecylamino)-naphtalene-6-sulfonate
incorporated into phospholipid vesicles (which were neutral or negatively
charged by inclusion of phosphatidylglycerols). The apparent for this
reaction will be
where . is the quenching constant when charge effects are absent (i.e.,
neutral vesicles), z is the valency of the quencher, F is Faraday's constant, and
is the electrostatic potential sensed by the quencher adjacent to the
fluorophore.
Most nucleic acids do not have intrinsic fluorophores (at room tem-
perature), and very few solute quenching studies have been performed with this
class of biomolecules. The Y base of phenylalanine tRNA does fluoresce,(183)
and the quenching of this Y base by acrylamide has been studied.(246)
Fluorescence Quenching: Theory and Applications 101
allows the emission of the two residues to be separated. One of the best
applications of this use of quenchers to resolve spectra is the iodide quenching
of horse liver alcohol dehydrogenase. This protein also possesses only two
types of Trp residues; one type (Trp-15) lies on the surface of this dimeric
protein, and the other type (Trp-314) is buried at the intersubunit interface.
Laws and Shore(80) and Abdallah et al.(79) have shown that iodide selectively
quenches Trp-15, allowing the emission spectrum of the two residues to be
resolved.
As mentioned in Sections 2.2.1 and 2.6.1, solute quenching can be used
to determine the relative contribution (in terms of fractional fluorescence
intensities) of components (two or, at most, three) at any choice of excitation
and emission wavelengths, provided that the components have different
accessibilities to quencher. Recently, we have compared the dissection of
component spectra by solute quenching with that obtained by phase-resolved
spectral measurements for various two-Trp proteins.(59)
If one can selectively quench the emission of one of the components, the
component values can be obtained. For example, by selectively quenching
Trp-15 of alcohol dehydrogenase with acrylamide, a limiting anisotropy of
about 0.265 is reached at high [Q] (at =300nm and 20°C). This can be
assigned to the anisotropy of the inaccessible Trp-314 residue. From the r
value at [Q] = 0 and the values, the anisotropy for Trp-15 (r = 0.210) can
also be calculated.(193)
For systems in which there is a single fluorophore, measurement of r as
a function of [Q] can be used to construct a Perrin plot. From the slope of
this plot one can determine the rotational correlation time, for the emitting
center. This is because a dynamic solute quencher will cause a lowering of the
fluorescence lifetime. The r value is dependent on the ratio as given by the
Perrin equation:
In Figure 2,22 are shown plots of Eq. (2.35b) for the acrylamide
quenching of several single-Trp proteins. (60) Lakowicz and co-workers (56,58)
have used oxygen as a quencher of a large number of peptides and proteins.
In Figure 2.23 is an example of their use of oxygen quenching and anisotropy
measurements to study the monomer tetramer equilibrium in melittin.
The ^'-intercept in Figures 2.22 and 2.23 is The limiting
anisotropy, of the fluorescence of the fluorophore will usually depend on
excitation wavelength, when there is more than one absorption oscillator. For
tryptophan and indole in a low-temperature, vitrified solvent, for example,
the anisotropy shows a distinct dependence on and the reaches a
plateau at 300 nm due to selective absorption into the band.(194,195) The
value at 300 nm is about 0.31 ± 0.01 for immobilized indole.(194–196) If the
Fluorescence Quenching: Theory and Applications 107
that is found from a modified Perrin plot (Figures 2.22 and 2.23)
is much less than the limiting value of this indicates the occurrence
of very rapid motion of the fluorophore that is independent of global rotation
of the macromolecule. One can reasonably assume that this rapid motion
will be limited within a cone.(56) The cone angle, can be calculated
as For the data in Figure 2.22, the N form of
human serum albumin and the -deficient form of parvalbumin are found
to have much smaller than the other forms of these respective
proteins. In Table 2.7 are summarized anisotropy data obtained for several
single-Trp proteins, by use of oxygen and acrylamide as solute quencher.
Solute quenching can also aid in the analysis of anisotropy decay
measurements. This has been demonstrated by Lakowicz et al.,(197) who
progressively quenched samples of the peptide melittin with acrylamide and
measured the resulting intensity and anisotropy decays (frequency-domain
measurements). The dynamic quencher reduces the mean lifetime of the single
Trp residue in melittin. By enabling fluorescence data to be collected at
shorter times (higher frequencies), the contribution to the anisotropy decay
from rapid, picosecond motion becomes enhanced. In the study with melittin,
in both its monomeric and tetrameric forms, Lakowicz et al. were able to
resolve 60- and 160-ps rotational correlation times, in addition to the longer
108 Maurice R. Eftink
transfer efficiency. Let be located well beyond the distance so that its
transfer efficiency is <10%. Further, let A be fluorescent so that sensitized
emission can be observed, and let A not be directly quenched by a particular
solute quencher. If one were to excite exclusively into the donor fluorophores,
the following patterns would result when the solute quenching of sensitized A
emission is observed. Energy that is transferred from to A would not be
significantly affected by the quencher. Since transfers little energy to A,
solute quenching of would cause only a small, indirect quenching
of A. Solute quenching of would, however, lead to a significant quenching
of A. The point is that a solute quencher will most effectively quench the
sensitized emission of an acceptor by quenching those donors that are at a
distance of from the acceptor (not by quenching those donors that are
closest to the acceptor, as has been claimed by some).
The analysis of the time- (or frequency-) domain kinetics of resonance
energy transfer reactions can, in some instances, be aided by employing
solute quenching. To illustrate this usage, consider the frequency-domain
data in Figure 2.25 for the Trp-to-NADH energy transfer in an alcohol
dehydrogenase–NADH complex. There are two Trp residues that can transfer
energy to bound NADH. The surface Trp residue, Trp-15, can be selectively
quenched by iodide, leaving only Trp-314 as an energy donor. Figure 2.25
shows that the multifrequency phase and modulation data are altered by the
quenching of Trp-15. In another example of the use of solute quenching to aid
in the analysis of energy transfer reactions, Gryczynski et al.(218) have used
steady-state solute quenching to evaluate the distribution of end-to-end
distances of flexible molecules having a fluorescence donor and acceptor at
either end.
Stern-Volmer constant for the protein–ligand complex to that for the protein
alone and R is the ratio of the fluorescence intensity of the
complex to that of the protein alone. The apparent for this system
will be
If R = l.0 (i.e., the binding of ligand does not quench or enhance the
fluorescence of the protein) and at the ratio of the initial
in the presence of L to that in the absence is
the data sets to obtain quenching constants plus other constants that depend
on the nature of the linkage (i.e., when emission wavelength is varied, the
other fitting parameters are the wavelength dependence of the fractional inten-
sities of the component spectra). Beechem and Grattan (215) have generalized
a global fitting routine that will apply to many types of fluorescence data,
including solute quenching. We believe that in the near future global analysis
of solute quenching data sets will become a standard procedure for extracting
quenching constants and other fitting parameters. This will particularly be so
with the expanding use of on-line data acquisition and other equipment, such
as photodiode array detectors, which will enable large bodies of data to be
rapidly acquired.
The three data sets in Figure 2.3B (the acrylamide quenching of a
mixture of 2-methyl- and 3-methylindole) were actually fitted simultaneously
by Eq. (2.17), using global dynamic and static quenching constants and the
fractional contributions of the components at each emission wavelength. If
only a single data set (single emission wavelength) had been analyzed alone,
the heterogeneity could easily have been missed. This is because the downward
curvature is slight, since the difference between the is not great, and since
static quenching further masks the downward curvature. By linking the three
data sets, the components are easily recovered. We have also applied such
simultaneous analysis of quenching data to a two-Trp-containing protein.(219)
2.8. Conclusion
Acknowledgments
References
16. A. H. Alwatter, M. D. Lumb, and J. B. Birks, in: Organic Molecular Photophysics (J. B.
Birks, ed.), Vol. 1, Chapter 8, John Wiley & Sons, New York (1973).
17. M. R. Eftink and C. A. Ghiron, Biochemistry 23, 3891-3899 (1984).
18. G. Porter and M. W. Windsor, Proc. Roy. Soc. A 245, 238 (1958).
19. J. A. Green, L. A. Singer, and J. H. Parks, J. Chem. Phys. 58, 2690–2695 (1973).
20. R. F. Steiner and E. P. Kirby, J. Phys. Chem. 73, 4130–413 (1969).
21. P. M. Froehlich and K. Nelson, J. Phys. Chem. 82, 2401-2403 (1978).
22. M. R. Eftink, T. Selva, and Z. Wasylewski, Photochem. Photobiol. 46, 23-30 (1987).
23. D. A. Labiana, G. N. Taylor, and G. S. Hammond, J. Am. Chem. Soc. 94, 3679-3683 (1972).
24. D. O. Cowan and R. L. E. Drisko, J. Am. Chem. Soc. 92, 6281–6285 (1970).
25. A. R. Watkins, J. Phys. Chem. 78, 2555-2558 (1974).
26. J. R. Lakowicz and G. Weber, Biochemistry 12, 4161–4170 (1973).
27. M. Jullien, J.-R. Garel, F. Merola, and J.-C. Brochon, Eur. Biophys. J. 13, 131–137
(1986).
28. N. Mafaga and M. Ottolenghi, in: Molecular Associations (R. Foster, ed.), Vol. 2, pp. 17-78,
Academic Press, New York (1979).
29. E. E. Johnson, J. Phys. Chem. 84, 2940–2946 (1980).
30. A. Ahmad and G. Durocher, Photochem. Photobiol. 34, 573–578 (1981).
31. W. R. Ware and J. S. Novros, J. Phys. Chem. 70, 2346–3253 (1966).
32. T. L. Nemzek and W. R. Ware, J. Chem. Phys. 62, 477–489 (1975).
33. J. C. Andre, M. Niclause, and W. R. Ware, Chem. Phys. 28, 371–377 (1978).
34. M. V. Smoluchowski, Z. Phys. Chem. 92, 129–168 (1917).
35. B. Sveshnikoff, Acta Physicochim. U.R.S.S. 3, 257 (1935).
36. F. C. Collins and G. E. Kimball, J. Colloid Sci. 4, 425–439 (1949).
37. J. Yguerabide, M. A. Dillon, and M. Burton, J. Chem. Phys. 40, 3040–3052 (1964);
J. Yguerabide, J. Chem. Phys. 47, 3049-3061 (1967).
38. R. W. W. van Resandt, Chem. Phys. Lett. 95, 205–208 (1983).
39. J. R. Lakowicz, M. L. Johnson, I. Gryczynski, N. Joshi, and G. Laczko, J. Phys. Chem. 91,
3277-3284 (1987).
40. M. R. Eftink and C. A. Ghiron, J. Phys. Chem. 80, 486–493 (1976).
41. J. R. Lakowicz, N. B. Joshi, M. L. Johnson, H. Szmacinski, and I. Gryczynski, J. Biol. Chem.
262, 10907-10910 (1987).
42. M. L. Johnson, J. R. Lakowicz, N. Joshi, and I. Gryczynski, Biophys. J. 51, 286a (1987).
43. D. Peak, T. C. Werner, R. M. Dennin, Jr., and J. R. Baird, J. Chem. Phys. 79, 3328–3335
(1983).
44. J. Keizer, J. Phys. Chem. 85, 940–941 (1981).
45. J. Keizer, Acc. Chem. Res. 18, 235-241 (1985).
46. R. I. Cukier, J. Am. Chem. Soc. 107, 4115–4117 (1985).
47. P. Midoux, P. Wahl, J.-C. Auchet, and M. Monsigny, Biochim. Biophys. Acta 801, 16–25
(1984).
48. T. Ando and H. Asai, J. Biochem. (Tokyo) 88, 255-264 (1980).
49. T. Ando, H. Fujisak, and H. Asai, J. Biochem. (Tokyo) 88, 265–276 (1980).
50. A. R. Horrocks, A. Kearvell, K. Tickle, and F. Wilkinson, Trans. Faraday Soc. 62,
3393-3399 (1966).
51. M. R. Eftink and C. A. Ghiron, Biochemistry 15, 672–680 (1976).
52. J. R. Lakowicz and G. Weber, Biochemistry 12, 4171–4179 (1973).
53. A. Follenius and D. Gerard, Photochem. Photobiol. 38, 373–376 (1983).
54. L. Stryer, D. D. Thomas, and C. F. Meares, Anna. Rev. Biophys. Bioeng. 11, 203–222 (1982).
55. A. M. Kleinfeld, Biochemistry 24, 1874–1882 (1985).
56. J. R. Lakowicz, B. Maliwal, H. Cherek, and A. Baiter, Biochemistry 22, 1741-1752 (1983).
57. K. A. Hagaman and M. R. Eftink, Biophys. Chem. 20, 201-207 (1984).
122 Maurice R. Eftink
142. J. M. Beechem and L. Brand, Annu. Rev. Biochem. 54, 43–71 (1985).
143. S. A. Cockle and A. G. Szabo, Photochem. Photobiol. 34, 23–27 (1981).
144. M. Shinitzky and B. Rivnay, Biochemistry 16, 982–986 (1977).
145. H. J. Pownall and L. C. Smith, Biochemistry 13, 2594–2597 (1974).
146. J. R. Lakowicz, D. Hogen, and G. Omann, Biochim. Biophys. Acta 471, 401–411 (1977).
147. M. V. Encinas and E. A. Lissi, Chem. Phys. Lett. 91, 55–57 (1982).
148. G. M. Omann and M. Glaser, Biophys. J. 47, 623–627 (1985).
149. E. Blatt, R. C. Chatelier, and W. H. Sawyer, Biophys. J. 50, 349–356 (1986).
150. E. Blatt and W. H. Sawyer, Biochim. Biophys. Acta 822, 43–62 (1985).
151. K. R. Thulborn and W. H. Sawyer, Biochim. Biophys. Acta 511, 125–140 (1978).
152. E. A. Haigh, K. R. Thulborn, and W. H. Sawyer, Biochemistry 18, 3535–3532 (1979).
153. D. B. Chalpin and A. M. Kleinfeld, Biochim. Biophys. Acta 731, 465–474 (1983).
154. W. W. Mantulin, H. J. Pownall, and D. M. Jameson, Biochemistry 25, 8034–8042 (1986).
155. C. S. Owen, J. Chem. Phys. 62, 3204–3207 (1975).
156. M. F. Blackwell, K. Gounaris, S. J. Zara, and J. Barber, Biophys. J. 51, 735–744 (1987).
157. R. Fato, M. Battino, M. D. Esposti, G. P. Castelli, and G. Lenaz, Biochemistry 25,
3378–3390 (1986).
158. S. S. Atik, C. L. Kwan, and L. A. Singer, J. Am. Chem. Soc. 101, 5696–5702 (1979).
159. E. London and G. W. Feigenson, Biochemistry 20, 1932-1938, 1939-1948, and 1949-1961
(1981).
160. K. I. Florine and G. W. Feigenson, Biochemistry 26, 2978–2983 (1987).
161. R. C. Chatelier, P. J. Rogers, K. P. Ghiggino, and W. H. Sawyer, Biochim. Biophys. Acta
776, 75-82 (1984).
162. T. Markello, A. Zlotnick, E. James, J. Tennyson, and P. W. Holloway, Biochemistry 24,
2895–2901 (1985).
163. M. J. S. DeWolf, G. A. F. Van Dessel, A. R. Largrou, H. J. J. Hilderson, and W. S. H.
Dierick, Biochemistry 26, 3799–3806 (1987).
164. J. Luisetti, H. Mohwald, and H.-J. Galla, Biochim. Biophys. Acta 552, 519–530 (1979).
165. V. G. Bieri and D. F. H. Wallach, Biochim. Biophys. Acta 406, 415–423 (1975).
166. V. G. Bieri and D. F. H. Wallach, Biochim. Biophys. Acta 443, 198–205 1976).
167. J. Everett, A. Zlotnick, J. Tennyson, and P. W. Holloway, J. Biol. Chem. 261, 6725–6729
(1986).
168. M. Esfahani and T. M. Devlin, J. Biol. Chem. 257, 9919–9921 (1982).
169. J. C. Gomez-Fernandez, M. D. Baena, J. A. Teruel, J. Villalain, and C. J. Vidal, J. Biol.
Chem. 260, 7168–7170 (1985).
170. R. A. Badley, Biochim. Biophys. Acta 379, 517–528 (1975).
171. A. Jonas, J.-P. Privat, P. Wahl, and J. C. Osborne, Jr., Biochemistry 24, 6205–6221 (1982).
172. K. A. Muczynski and W. L. Stahl, Biochemistry 22, 6037–6048 (1983).
173. I. Kahan, R. M. Epand, and M. A. Moscarello, Biochemistry 25, 562–566 (1986).
174. A. M. Batenburg, P. E. Bougis, H. Rochat, A. J. Verkleij, and B. de Kruijff, Biochemistry 24,
7101–7110 (1985).
175. P. Cavatorta, A. Spisni, E. Casali, L. Lindner, L. Masotti, and D. W. Urry, Biochim.
Biophys. Acta 689, 113–120 (1982).
176. J. R. Lakowicz and J. R. Knutson, Biochemistry 19, 905–911 (1980).
177. S. Cheng and J. K. Thomas, Radiat. Res. 60, 268–279 (1974).
178. B. C. Hill, P. M. Horowitz, and N. C. Robinson, Biochemistry 25, 2287-2292 (1986).
179. J. C. Mclntyre, P. Hundley, and W. D. Behnke, Biochem. J. 245, 281–829 (1987).
180. L. R. McLean, K. A. Hagaman, J. L. Krstenansky, T. J. Owen, and D. J. Sprinkle,
Biochemistry 28, 8403–8410 (1989).
181. H. P. H. Moore and M. A. Raftery, Proc. Natl. Acad. Sci. U.S.A. 77, 4509–4513 (1980).
Fluorescence Quenching: Theory and Applications 125
127
128 Herbert C. Cheung
estimate a range of the orientation factor, (2) choosing donor and acceptor
chromophores that have mixed polarizations and exhibit small limiting
polarization properties, and (3) using statistical interpretations of energy
transfer data to define the limits for the donor–acceptor distance and the most
probable distance.
The Förster formalism of FRET is based on the assumption that the
donor and acceptor chromophores are stationary on a time scale comparable
to the lifetime of the excited state and, as a consequence, the donoracceptor
separation is static and can be described by a single distance. The dynamic
nature of globular proteins and polymers has long been recognized, and the
possible existence of a distribution of the donor–acceptor distances in such
systems was first explored over 15 years ago. Very little progress was made
in the experimental investigation of such distributions for specific donor–
acceptor probes attached to proteins until 1987. The recent progress in this
direction indicates that FRET should be useful in monitoring structural tran-
sitions. Only a few physical methods can yield direct structural information on
macromolecules in terms of molecular distances, and FRET is one of them.
The distribution of energy transfer distances offers a potential experimental
approach to the investigation of the mechanism of protein folding and unfolding
and the intermediates involved, flexibility of nucleic acids, and distribution of
lipids in membrane.
The transfer efficiency is defined as the ratio of the transfer rate to the
sum of the rates of all deexcitation processes and is given by
Resonance Energy Transfer 131
where kf is the rate of fluorescence decay, and k' represents the sum of the
rates of all other deexcitation processes. Experimentally, E can be calculated
from either the relative quantum yields (or fluorescence intensities) or
lifetimes of the donor determined in the presence and absence of the acceptor:
where is the angle between the emission dipole of the donor and the
absorption dipole of the acceptor, and are the angles between the
vector joining the donor and acceptor and the emission and absorption
dipoles, respectively. k2 can range from 0 to 4.0. The minimum value obtains
when the donor emission and acceptor absorption dipoles are perpendicular
132 Herbert C. Cheung
to each other, and the maximum value corresponds to parallel and aligned
dipoles. If both dipoles sample all orientations during the interval of the
excited state (isotropic, dynamic averaging),
which was modified with the donor at the donor site and labeled at the
acceptor site with the nonabsorbing sulfhydryl reagent N-ethylmaleimide
(NEM). Sample B was an acceptor-labeled protein which was also labeled at
the donor site with NEM. Curve B is the spectrum of this sample. Curve C is
the spectrum of S-1 doubly labeled with the donor and the acceptor. Curve D
is the spectrum of a mixture of samples A and B. This spectrum agrees well
with the sum of curves A and B. The quenching of the donor fluorescence
in the donor–acceptor-labeled protein was accompanied by a concomitant
enhancement of the acceptor fluorescence. These reciprocal spectral changes
are qualitative evidence of resonance energy transfer. Since the labeling of
donor and acceptor was not stoichiometric, Eq. (3.16) was used to determine
the transfer efficiency:
Resonance Energy Transfer 135
where and are the axial depolarization factors for the donor and
acceptor, respectively, and is the axial transfer depolarization factor
associated with the average axial orientations of donor and acceptor. The
axial depolarization factors are related to observable depolarizations by
Since the theory developed by Dale and co-workers assumes single transition
moments, it may be appropriate to use 0.4 for for both donor and acceptor.
However, absorption and emission transition dipoles are in general not
collinear, and the experimentally observed values for a number of
chromophores are indeed considerably less than 0.4. A second origin for
is mixed polarizations; that is, the absorption or emission across a
spectral region is characterized not by a single transition dipole moment,
but by a combination of two or more incoherent dipole moments.(11) For
chromophores with observed it is not entirely clear whether the
observed value or 0.4 is more appropriate for calculation of depolarization
factor. Both values have been used in published studies. (8,12) The use of the
observed value offers a more conservative estimate of because this choice
yields a slightly wider range in than the use of 0.4.
Probes that possess spherical symmetry with triply degenerate transitions
polarized along mutually perpendicular directions approximate isotropic
oscillators. Chromophoric metal ions such as and the lanthanide ions
such as belong to this group. If such a metal ion is
used as either an energy donor or energy acceptor, the uncertainty in is
reduced to 12% or better. If these isotropic probes are used as both the donor
and the acceptor, the uncertainty in is completely removed and
The lanthanides compete for sites in several -binding proteins and
in Ca-ATPase. Their use can yield highly accurate intersite distances for these
systems.
It has been frequently suggested that if both attached donor and attached
acceptor have unconstrained, isotropic, and rapid motions a reversal
of the points of their attachment should have little effect on the isotropic and
rapid motions. To obtain an “experimental” verification of the validity of
the assumption of many investigators in the past performed reverse
labeling with the same donor–acceptor pair in which the donor and acceptor
sites were interchanged. If the interchange did not result in a gross alteration
of the observed value of E, the assumption was then considered valid. This
Resonance Energy Transfer 139
protocol was used in our previous study (8) of the distance in S-1.
A reversal of the labeling sites produced a very similar transfer efficiency, but
the depolarization factors were large, clearly indicating that the donor and
acceptor were not in randomizing motions. In another study, (12) essentially
identical transfer efficiencies were obtained between two sites in the actin .S-l
complex when the donor and acceptor sites were interchanged. Yet the axial
depolarization factors were above 0.5, leading to a range of 0.155 to 2.999,
again showing that the attached fluorophores were in restricted, not ran-
domizing, motions. For these systems (and possibly some other systems that
have been studied with reverse labeling), the assumption of isotropic and
dynamic averaging was clearly not valid, and it was necessary to describe
the donor–acceptor separation not by a single distance, but by a range of
distances.
from Eqs. (3.21) and (3.22). The presence of actin shortened the distance by
whereas MgADP had no effect on the distance. A different acceptor,
N-(4-dimethylamino-3,5-dinitrophenyl)maleimide (DDPM), was also used to
modify The distance from 1,5-IAEDANS at to DDPM at
was determined by the lifetime technique using phase fluorometry (19) and
pulse fluorometry.(20) This second donor–acceptor pair yielded a value of
27–28 for shorter than from the 1,5-IAEDANS–IAF pair. The
1,5-IAEDANS–DDPM distance in S-l decreased by 7-8 upon addition of
MgADP, demonstrating a perturbation of the localized structure by
nucleotide binding. The difference in structure between the two acceptors may
be responsible for the different results.
Of the five cysteines in actin, the penultimate residue (Cys-374) is
accessible for chemical modification under mild conditions. In the monomeric
form (G-actin) the protein contains a single bound ATP. Upon polymeriza-
tion to F-actin, the bound ATP is split, converting the bound nucleotide to
ADP. We previously reported that replacement of bound ATP in G-actin by
1, -ethenoadenosine triphosphate did not impair the ability of actin
to polymerize (21) and determined the distance from bound (donor) to
Cys-374 modified with 4-[N-(iodoacetoxy)ethyl-N-methyl]amino-7-nitro-
benz-2-oxa-l,3-diazole (IANBD) as the acceptor to be has
also been used by other investigators to determine the separation between
the nucleotide site and Cys-10 and between the nucleotide site and two
specific points on the myosin S-l heavy chain in the acto · S-1 complex.
Figure 3.2 is a space lattice for S-l and acto · S-1 constructed from FRET
distances.(23) The seven specific points in S-l include three residues on the
heavy chain, one residue on each of the two light chains, and two points
within the nucleotide binding site that is involved in ATP hydrolysis. Ten
distances have been reported between these seven points. Two distances in
actin and eight distances across actin and S-l in acto · S-1 are also included.
The distances shown in Figure 3.2 are far short of the 4N – 10 requirement for
complete spatial specification of acto · S-l or the individual proteins, but have
already provided some important insight into the transduction mechanism.
Filamentous actin (F-actin) is comprised of two strands of actin polymer
that are assembled into a suprahelical structure. Individual actin monomer
(G-actin) is a single-polypeptide protein which is bilobar. The orientation of the
monomer in the filamentous structure based on FRET was first investigated
by Taylor et al.,(38) who measured the transfer between chemically equivalent
sites located on different monomeric units within the suprahelical structure.
In this study two preparations of labeled G-actin were used: one preparation
in which Cys-374 was labeled with IAF serving as the energy donor, and the
other in which the same residue was labeled with iodoacetamidoeosin or
tetramethylrhodamine serving as the energy acceptor. Copolymers of pure
actin filaments were obtained by using various molar ratios of the donor-
142 Herbert C. Cheung
from a theoretical analysis that, within certain symmetry limits, four radial
coordinates (N = 4) and six intramolecular distances (4N – 10 = 6) are
required to completely define the orientation of the actin monomer in the
filament. In the same study Kasprakz et al.(41) also found that myosin S-l
binding to F-actin increased the radial distance of Gln-41 from
but had negligible effect on the radial distances of Cys-374 and the nucleotide-
binding site. These results do not rule out conformational changes which may
be induced on actin monomer in the region of the nucleotide site or Cys-374
by interaction with S-l, but probably can eliminate certain models of inter-
action which would require global rotation of actin monomer with large
amplitudes. Of special interest is the effect of binding of activator calcium to
regulated actin filament (actin filament decorated with a full complement of
the regulatory proteins troponin and tropomyosin) on the radial coordinates.
We (Censullo and Cheung, unpublished results) have recently shown that
the radial coordinate of Cys-374 decreases by 9% when the
regulatory proteins are incorporated into the actin filament. In response to
calcium binding the coordinate decreases further to suggesting a
compression of the actin filament. While four radial coordinates have been
reported, complete characterization of the orientation of actin monomer
in the filament must await additional intramolecular distances because
4N – 10 = 6 for this system and only two such distances are at hand (see
Figure 3.2).
The method developed by Taylor et al.(38) was intended for the use of
FRET to investigate actin assembly and disassembly during motile events in
nonmuscle cells. The authors also showed that the emission decay of the
labeled actin microinjected into living Chaos carolinensis was essentially
unaltered and unquenched by the cytoplasm. This observation suggested the
possibility of quantitative measurements of energy transfer in living cells.
FRET was also used recently to follow assembly of synthetic myosin thick
filaments and demonstrate exchange of myosin molecules between the
filaments.(42) Donor (IAEDANS)-labeled and acceptor (IAF)-labeled myosins
were preincubated in 0.5 M KCI. The assembly into filaments was initiated by
reduction of ionic strength by dilution and monitored by following the
decrease of donor intensity. From these data the concentration of myosin that
remained unassembled (critical concentration) was calculated and found to be
40 nM in the range of myosin concentration This determination
was possible because of the high sensitivity of the FRET assay and the
measurement was considerably easier to carry out than that based on analyti-
cal ultracentrifugation. Dynamic exchange of myosin molecules between thick
filaments was demonstrated by following the quenching of donor intensity
resulting from donor-labeled thick filaments and acceptor-labeled thick
filaments. The extent of exchange was found to be 75% after 180 min and was
independently confirmed by using 125I-labeled myosin and ultracentrifugation.
144 Herbert C. Cheung
where A is actin and A · S-1 (acto · S-1) is the complex formed between actin
and the myosin heads (S-l). The question of which chemical step is coupled
to force generation during muscular contraction has been extensively
investigated. It is believed that S-l attaches to actin in two states. Weakly
bound states (e.g., A · S-l · ATP, A · S-1 · ADP · Pi ) are associated with the
beginning of the power stroke, and the strongly bound states (A · S-1,
A · S-1 · ADP ) are associated with the end of the power stroke. Transition
from the weakly bound states to the strongly bound states may be accom-
panied by large-scale structural changes within the acto · S-1 complex. These
changes are particularly relevant in deciding whether the myosin head or a
portion of the myosin rod is the energy-transducing element that is involved
in converting chemical free energy to mechanical work. The structures of the
strongly bound states have been extensively studied by various techniques,
but not much is known about the weakly bound states. To investigate this
second attached state requires a method to arrest the contractile process at
the pre-power stroke, where S-l is weakly attached to actin. One way to
obtain stable analogous of the weakly bound state (A · S-1 · ATP) is through
trapping of ATP at the active site by cross-linking the thiols
When distance 14 between actin Cys-374 and S-1 light-chain Cys-177
(Figure 3.2) was measured with non-cross-linked S-l (A · S-1, A · S-1 · ADP),
was found to be in the range When cross-linked S-l was used
to mimic the weakly bound species (A · S-1 · ATP), This
large decrease has been taken as direct evidence for a change in the structure
of myosin heads that could account for tension generation. Qualitatively
similar, but less pronounced, results have also been reported recently on
distance 10 between actin Cys-374 and S-1 Cys-707 by using acto · S-1
that was cross-linked by a carbodiimide in the presence of ATP.(31) was
for non-cross-linked acto S-1 (rigor) and increased to 54 Å for cross-
linked acto · S-l. Not resolved in these studies, however, is to what extent the
observed distance change occurs within S-l as opposed to being due to
reorientation of S-l relative to the actin filament. There is no information on
Resonance Energy Transfer 145
the kinetics of the distance change that occurs during the transition from
species associated with the pre-power stroke to those associated with the end
of the stroke.
Skeletal TnC has four sites. Two sites bind with a high
affinity also binds competitively at these sites.
The other two sites bind specifically with a low affinity
Since the intracellular concentration is in the millimolar
range, the two high-affinity sites are likely occupied by in relaxed
muscle, where the cytosolic level is in the submicromolar range.
activation involves binding of to the two low-affinity sites. The FRET
distances determined in the presence of and provide a basis for
understanding structural changes that are induced when muscle is activated
by With the exception of the intersubunit distance C(25)-I(158), the
other five distances were significantly perturbed when the ionic medium was
changed from to The changes were in both directions in the
range of 5–10Å . A decrease of 5 Å in C(98)–I(133) in fully reconstituted
troponin was also observed(44) when the medium was changed from to
Although there are inherent uncertainties in the individual measured
distances, large changes in the distances must reflect substantial structural
perturbations. These results suggest large-scale movements of two regions in
Resonance Energy Transfer 147
each protein toward or away from each other when muscle is switched on.
These movements must play a role in the transmission of the signal.
What is not resolved at this time is whether these changes occur with time
constants that are compatible with the known kinetics of force generation.
The crystal structure of TnC is of considerable interest. It shows the
protein to be dumbbell-shaped,(45) with the N- and C-terminal regions folded
into two globular domains. The two domains are connected by a nine-turn
(32 residues). The middle third of this long helix is not in contact with
other regions of the molecule and is exposed to solvent. Since the protein
crystals were obtained at the question arises as to whether the protein
is also dumbbell-shaped at neutral Our previous study(46) showed that
for the distance between Met-25 (labeled with DNZ) and Cys-98 (labeled
with IAE) was 39 A The actual separation between the points of
attachment is likely smaller because of the size of the probes. When the
was reduced from 7.5 to 5.0, the transfer efficiency was reduced 13-fold,
corresponding to an increase of the donor–acceptor separation by a factor of
2.(47) These results are interpreted in terms of an acid-induced dimerization of
the protein,(48) and suggest that the conformation of TnC in neutral solution
may be different from that predicted by crystallography.
From these distances and the assumption of the transfer efficiency was
predicted to be 83%, (66) in reasonable agreement with the value observed in
solution. The difference almost certainly arises from the possibility that some,
if not many, of the donor and acceptor chromophores have limited motional
freedom. Since only the short distances contribute significantly to E because
of the dependence, the question is whether the side chains of the residues
that are separated by short distances are highly polarized or depolarized. This
question cannot be answered without other types of information.
A final example is the C-terminal octapeptide of cholecystokinin,
(Asp-Tyr-Met-Gly-Trp-Met-Asp-Phe- ). NMR studies(67) showed that
exists preferentially in a folded conformation with and turns
around the sequence Gly-Trp-Met-Asp and Met-Asp..Phe- . This folded
conformation results in a structure in which the side chains of Tyr and Trp
must be pushed away from each other, resulting in a relatively large separa-
tion between them. This structural feature was confirmed by energy transfer
measurements,(67) which yielded The choice of for this
case is reasonable because indole (acceptor) is characterized by two linear
transition moments and the Trp in the peptide is relatively free to rotate, as
demonstrated by NMR results.
bifunctional reagents. The FRET distance between the two thiols and
of myosin subfragment-1 (distance 6, Figure 3.2) is in the range of
30–45 Å, dependent upon which acceptor is used. It has been known that the
same two thiols can be cross-linked by a variety of bifunctional reagents with
widely different spanning lengths (2–14 Å). The segment of the polypeptide
chain between the two thiol groups is thought to be flexible. There must exist
a wide distribution in the separations between the two –SH groups. FRET
measurements yield an average of these distances. The average contains not
only a population weighting, but also weighting by the inverse sixth power of
the distances, which favors shorter distances. The cross-linking experiments
investigate instantaneous separations that fall into a certain narrow range,
and the method has a very narrow distance window, whereas FRET results
reflect the entire distance range.
Another example is the TnI • TnC complex. The transfer distance in the
presence of between Cys-133 of TnI and Cys-98 of TnC is in the range
of 41–74 Å.(43) These two groups have been cross-linked by l,3-difluoro-4,6-
dinitrobenzene in the presence of , (69) providing evidence for their very
close proximity. Even allowing for the finite size of the donor and acceptor
probes, it is unlikely that the average transfer distance could be as small as
10 Å. The binding constant(70) of the labeled TnC for unlabeled TnI or of the
labeled TnI for unlabeled TnC is in the presence of
The proteins are not very tightly held together. Some conformational fluctua-
tions within each protein and in the complex as a whole can be expected.
In consequence, the FRET distance is an average value. These examples
illustrate the inadequacy of using a single average distance to describe a
proximity relationship. Section 3.9 will address the experimental determina-
tion of distribution of FRET distances from both lifetime and steady-state
data.
rate (turnover number), The reaction velocity during the steady state is
related to and by where is the initial substrate
concentration. The Michaelis–Menten equation can be expressed in terms of
the two stopped–flow parameters:
The initial increase in fluorescence to signals the formation of ES, and the
exponential increase to reflects conversion of ES to the acyl intermediate
EA. This example illustrates how both pre-steady-state and steady-state
kinetic parameters can be extracted from one experiment on the basis of
energy transfer.
Several anions including chloride, sulfide, and carboxylate have been
known for over 30 years to inhibit carboxypeptidase A (CPA). The nature of
anion inhibition has been difficult to investigate partly because of the limited
solubility of CPA in the absence of salt and hence the necessity to work with
fairly high concentrations of anions. This difficulty has recently been over-
come(74) by using dansylated di- and tripeptides. Chloride was found to
be a competitive inhibitor as qualitatively indicated by the changes of the
shape of the stopped-flow kinetic traces and by detailed analysis of the traces.
Since for chloride inhibition was strong (40–120 mM) and not strongly
pH-dependent, the investigators concluded that the site of anion interaction is
unlikely to be the metal atom.
3.9.1. Theory
where is the donor intensity at zero time, and is the donor lifetime. In
the presence of a single acceptor located at a unique distance R from the
donor, the donor decay is given by
Resonance Energy Transfer 159
with as the Förster critical distance. The donor still decays monoexpo-
nentially, but with a composite rate constant of corre-
sponding to a lifetime The second term in the
exponent of Eq. (3.28), is the rate constant of energy transfer
to the acceptor.
In a system containing donor–acceptor pairs of different distances, the
donor decay will not be single exponential, but will be dependent upon the
probability distribution of donor–acceptor distances, P(R). Equation (3.28)
becomes
where the are the i donor lifetimes. Thus, direct determination of the donor
decay curve will allow extraction of P(R). Equations (3.29) and (3.30) are
valid for a distribution that remains unchanged during the lifetime of the
donor. Perturbation of the distribution by diffusion may become significant if
the donor lifetime is long and if the donor and acceptor can diffuse relatively
unconstrainedly toward or away from each other.
Fluorescence lifetimes can now be determined with high accuracy by
multifrequency phase fluorometry. In the frequency domain, one measures the
phase shift and the modulation as a function of modulation
frequency The sine and cosine transforms of Eq. (3.30) yield the following
frequency response functions (84,85) :
where is the ith donor lifetime observed in the presence of the acceptor.
If the donor emission in the absence of the acceptor is single exponential, no
summation is needed under the integral sign in Eqs. (3.31) and (3.32).
To evaluate P(R) from time-domain decay data, one selects an
appropriate distribution function P(R) that will yield the best fit between the
calculated according to Eq. (3.29) or (3.30) and the experimental decay
The constants and are obtained from independent experiments.
160 Herbert C. Cheung
When frequency-domain data are used, the chosen P(R) is used to calculate
the frequency response function, Eqs. (3.31) and (3.32), for comparison with
experimental and
where and are the dynamic and static quenching constants, respec-
tively. A plot of versus [Q] should yield a straight line and
allow determination of both and Alternatively, can be determined
from a plot of versus [Q], where and are the donor lifetimes in
the absence and presence of quenching, respectively. The dynamic quenching
constant is then used to obtain the quenched donor quantum yield at
various values of [Q]:
symbol is used here for quantum yield to avoid confusion with that for
quencher.) Once a range of values are obtained, they can be used together
with the Förster critical distance determined in the absence of quenching
to calculate the corresponding range of quenched Förster critical distances
3.9.2. Examples
distances. The data were then analyzed by using Eqs. (3.31) and (3.32) and a
Gaussian distribution function
164 Herbert C. Cheung
instrumentation and analysis methods that are essential with the frequency-
domain or time-domain lifetime techniques.
TnC in the complex. This finding differs from previous results based on a
single distance. The reason for the discrepancy is unclear at this time. It is of
interest to note that binding to TnC reduced the half-width of the
distance distribution in Tnl, and the decrease was shown to be statistically
significant. This narrower distribution shows an effect of binding on the
dynamics of TnI, and this effect may be increased dynamic fluctuations. The
altered dynamics may play an important role in the transmission of the
signal from TnC via TnI to activate actomyosin ATPase.
Simulation studies and examination of the ratio surfaces for the
simulated frequency-domain data indicated that values of and HW can be
recovered to within 2 Å.(88) Probe mobility and the uncertainty in the orienta-
tion factor can contribute to the observed distribution width. These
problems were considered in detail, and the possible effect of on HW was
investigated by measurements of anisotropy decay.(85,88) While it was not yet
possible to assign unequivocally the various factors contributing to any
observed distribution width, it was concluded that experimental half-widths in
excess of 10 Å, such as those observed with TnI in the native state, must
reflect to a certain extent local dynamic fluctuations of the protein. The dis-
tribution was used to follow structural changes as a function of denaturation
by guanidine hydrochloride (GuHCl). The half-width increased progressively
and dramatically with increasing denaturant concentration from about 12 Å
to a final level of about 50 Å, which was observed at The
midpoint of the transition was near 1.5 M GuHCl. The change in followed
a similar pattern. TnI is known to be stabilized against denaturation by
complexation with TnC in the presence of This protection was also
evident from the recovered distributions determined under these conditions.
The large HW observed in the presence of GuHCl is compatible with the
expectation of a wide range of distances and decreased dynamic fluctuations
in the denatured protein. These results demonstrate that the distributions of
FRET distances in protein can be recovered with good accuracy and are use-
ful in studies of protein folding and unfolding.
168 Herbert C. Cheung
domain data has also been accomplished in several studies. Amir and Haas(91)
determined the distribution of several intramolecular distances in bovine
pancreatic trypsin inhibitor (BPTI). They labeled the N-terminal amino
group with the energy donor (2-methoxy-l-naphthyl)methyl (MNA) and the
group of one of the four lysine residues (positions 15, 26, 41, and 46)
with the acceptor [7-(dimethylamino)coumarin-4-yl]acetyl (DA-coum). For
each of the four derivatives with attached donor and acceptor, the
monoexponential decay curve of the donor emission was used to
recover the distance distribution. The half-widths of these distributions were
in the range 9–11 Å, and the average distances ranged from 21 to 34 Å. When
the sizes of the donor and acceptor probes were estimated and taken into
account, the recovered average distances were very close to those expected
from the crystal structure of BPTI.
In another study of the distribution of distances based on time-domain
data, Haas et al.(92) labeled Glu-49 and Asp-53 of bovine pancreatic RNase
with ENA (ethylenediamine monoamide of 2-naphthoxyacetic acid) as energy
donor and the N-terminal amino group with l-fluoro-2,4-dinitrobenzene as
energy acceptor. The attached ENA decayed monoexponentially in the
absence of the acceptor. The distribution of distances recoverd from data
obtained in 50% glycerol showed an average distance of and a half-
width of The average distance compares favorably with the value
previously determined in aqueous solution on the basis of a single donor–
acceptor separation.(68) Upon denaturation by 6 M guanidine hydrochloride
in 26% glycerol the average distance increased to and the half-width
to These authors also discussed in detail possible contributions
to the observed half-width. Since the attached donor probe used in this study
has a considerable length (eight or nine single bonds and two amide bonds),
probe flexibility (as opposed to protein dynamics) may have contributed
significantly to the observed distribution recovered for the protein in the
native state. Nevertheless, these results qualitatively corroborate those we
have obtained with TnI in that the parameters of the distribution of FRET
distances are sensitive to transition from an ordered state to a partially dis-
ordered or completely disordered state.
In two recent reports Brand and co-workers reported the distributions of
the distances between two specific residues of a staphylococcal nuclease
mutant that were recovered from time-domain data as a function of tem-
perature(93) and guanidine hydrochloride concentration.(94) Bimodal distribu-
tions were needed to fit the data obtained at high temperature or in high
guanidinium concentration, providing evidence for two denatured states of the
mutant protein consistent with a circular dichroism study (95) that
staphylococcal nuclease mutants can exist in two denatured states. These
results clearly suggest potential applications of the distribution of FRET
distances to protein folding/unfolding problems.
170 Herbert C. Cheung
Acknowledgment
References
1. T. Förster, Intermolecular energy migration and fluorescence, Ann. Physik. (Leipzig) 2, 55–75
(1948). Translated by R. S. Knox.
2. T. Förster, Mechanism of energy transfer, in: Comprehensive Biochemistry (M. Florkin and
E. H. Statz, eds.), Vol. 22, pp. 61–77, Elsevier, New York (1967).
3. S. Latt, H. T. Cheung, and E. R. Blout, Energy transfer. A system with relatively fixed
donor–acceptor separation, J. Am. Chem. Soc. 87, 995–1003 (1965).
172 Herbert C. Cheung
4. L. Stryer and R. P. Haugland, Energy transfer: A spectroscopic ruler, Proc. Natl. Acad. Sci.
U.S.A. 58, 719–726 (1967).
5. H. Buecher, K. H. Drexhage, M. Fleck, H. Kuhn, D. Mobius, F. Schafer, J. Sondermann,
W. Sperling, P. Tillmann, and J. Weigand, Controlled transfer of excitation energy through
thin layers, Mol. Cryst. 2, 199–230 (1967).
6. R. P. Haugland, G. J. Yguerabide, and L. Stryer, Dependence of the kinetics of singlet–singlet
energy transfer on the spectral overlap integral, Proc. Natl. Acad. Sci. U.S.A. 63, 23–30
(1969).
7. P. Schiller, Intermolecular distances: Energy transfer in: Biochemical Fluorescence: Concepts
(R. F. Chen and H. Edelhoch, eds.), Vol. 1, pp. 285–303, Marcel Dekker, New York (1975).
8. H. C. Cheung, F. Gonsoulin, and F. Garland, Fluorescence energy transfer studies on the
proximity of the two essential thiols of myosin subfragmcnt-1, J. Biol. Chem. 258, 5775–5786
(1983).
9. R. E. Dale and J. Eisinger, Polarized excitation energy transfer, in: Biochemical Fluorescence:
Concepts (R. F. Chen and H. Edelhoch, eds.), Vol. 1, pp. 115–284, Marcel Dekker, New York
(1975).
10. R. E. Dale, J. Eisinger, and W. E. Blumberg, The orientation freedom of molecular probes,
Biophys. J. 26, 161–193 (1979).
11. E. E. Haas, E. Katchalsky-Katzir, and I. Z. Steinberg, Effect of the orientation of donor and
acceptor on the probability of energy transfer involving electronic transitions of mixed
polarizations, Biochemistry 17, 5064–5070 (1978).
12. H. R. Trayer and I. P. Trayer, Fluorescence energy transfer between the myosin subfragment-1
isoenzymes and F-actin in the absence and presence of nucleotides, Eur. J. Biochem. 135,
47–59 (1983).
13. W. D. Horrocks, Jr., B. Holmquist, and B. L. Vallee, Energy transfer between terbium (III)
and cobalt (II) in thermolysin: A new class of metal–metal distance probes, Proc. Natl. Acad.
Sci. U.S.A. 72, 4764–4768 (1975).
14. P. M. Tongerson and M. F. Morales, Application of the Dale–Eisinger analysis to proximity
mapping in the contractile system, Proc. Natl. Acad. Sci. U.S.A. 81, 3723–3727 (1984).
15. R. E. Jones, Nanosecond fluorimetry, Ph.D. thesis, Stanford University (1970).
16. Z. Hillel and C.-W. Wu, Statistical interpretation of fluorescence energy transfer measurements
in macromolecular systems, Biochemistry 15, 2105–2113 (1976).
17. H. C. Cheung and M. F. Morales, Studies of myosin conformation by fluorescent techniques,
Biochemistry 8, 2177–2182 (1969).
18. H. C. Cheung, Conformation of myosin: Effects of substrate and modifiers, Biochim. Biophys.
Acta 194, 478–485 (1969).
19. R. E. Dalbey, J. Weiel, and R. G. Yount, Förster energy transfer measurements of thiol 1 to
thiol 2 distances in myosin subfragment-1, Biochemistry 22, 4696–4706 (1983).
20. H. C. Cheung, F. Gonsoulin, and F. Garland, An investigation of the and
-ATPase distances in myosin subfragment-1 by resonance energy transfer using
nanosecond fluorimetry, Biochim. Biophys. Acta 832, 52–62 (1985).
21. K. E. Thames, H. C. Cheung, and S. C. Harvey, Binding of 1, -ethenoadenosine tri-
phosphate to actin, Biochem. Biophys. Res. Commun. 60, 1252-1261 (1974).
22. H. C. Cheung and B. M. Liu, Distance between nucleotide site and cysteine-373 by resonance
energy transfer measurements, J. Muscle Res. Cell Motil. 5, 65–80 (1984).
23. J. Bolts, R. Takashi, P. Torgerson, T. Hozumi, A. Muhlrad, D. Mornet, and M. F. Morales,
On the mechanism of energy transduction in myosin subfragment-1, Proc. Natl. Acad. Sci.
U.S.A. 81, 2060–2064 (1984).
24. D. J. Moss and D. R. Trentham, Distance measurement between the active site and
cysteine-177 of the alkali one light chain of subfragment-1 from rabbit skeletal muscle,
Biochemistry 22, 5261–5270 (1983).
Resonance Energy Transfer 173
25. T. Tao and M. Lamkin, Excitation energy transfer studies on the proximity between and
the adenosinetriphosphatase site in myosin subfragment-1, Biochemistry 20, 5051–5055
(1981).
26. J. Perkins, J. A. Weiel, J. Grammer, and R. G. Yount, Introduction of a donor–acceptor pair
by a single protein modification, J. Biol. Chem. 259, 8786–8793 (1984).
27. D. J. Marsh and S. Lowey, Fluorescence energy transfer in myosin subfragment-1,
Biochemistry 19, 774–784 (1980).
28. R. Takashi, A. Muhlrad, and J. Botts, Spatial relationship between a fast-reacting thiol and
a reactive lysine residue of myosin subfragment-1, Biochemistry 21, 5661–5668 (1982).
29. R. Takashi, P. Torgerson, and J. Duke, Proximity of LC3, thiol to the reactive lysine of
myosin heavy chain, Biophys. J. 45 (2, Part 2), 223 (Abstract) (1984).
30. R. Takashi, Fluorescence energy transfer between subfragment-1 and actin in the rigor
complex of actosubfragment-1, Biochemistry 23, 5164–5169 (1979).
31. T. Arata, Structure of the actin–myosin complex produced by crosslinking in the presence of
ATP, J. Mol. Biol. 191, 107–116 (1986).
32. M. Miki and P. Wahl, Fluorescence energy transfer in labeled G-actin and F-actin, Biochim.
Biophys. Ada 786, 188–196 (1984).
33. D. G. Bhandari, H. R. Trayer, and I. P. Tray, Resonance energy transfer evidence for two
attached states of the actomyosin complex, FEBS Lett. 187, 160–166 (1985).
34. C. G. dos Remedios and R. Cooke, Fluorescence energy transfer between probes on actin and
probes on myosin, Biochim. Biophys. Ada 788, 193–205 (1984).
35. M. Miki, J. A. Barden, and C. G. dos Remedios, Fluorescence resonance energy transfer
between the nucleotide binding site and Cys-10 in G-actin and F-actin, Biochim. Biophys.
Ada 872, 76–82 (1986).
36. R. Aguirre, S. S. Lin, F. Gonsoulin, C. K. Wang, and H. C. Cheung, Characterization of
the ethenoadenosine diphosphate-binding site of myosin subfragment 1. Energetics of the
equilibrium between two states of nucleotide · S1 and vanadate-induced global conformation
changes detected by energy transfer, Biochemistry 28, 799–807 (1988).
37. R. Takashi and A. A. Kasprazk, Measurement of interprobe distances in the acto-subfragment
1 rigor complex, Biochemistry 26, 7471–7477 (1987).
38. D. L. Taylor, J. Reidler, J. A. Spudich, and L. Stryer, Detection of actin assembly by
fluorescence energy transfer, J. Cell Biol. 89, 362–367 (1981).
39. M. Miki, J. A. Barden, B. D. Hambly, and C. G. dos Remedios, Fluorescence energy transfer
between Cys-10 residues in F-actin filaments, Biochem. Int. 12, 725–731 (1986).
40. M. Miki, B. D. Hambly, and C. G. dos Remedios, Fluorescence energy transfer between
nucleotide binding sites in F-actin filament, Biochim. Biophys. Ada 871, 137–141 (1986).
41. A. A. Kasprzak, R. Takashi, and M. F. Morales, Orientation of actin monomer in the F-actin
filament: Radial coordinate of glutamine-41 and effect of myosin subfragment 1 binding on
the monomer orientation, Biochemistry 27, 4512–4522 (1988).
42. A. D. Saad, J. D. Pardee, and D. A. Fischman, Dynamic exchange of myosin molecules
between thick filaments, Proc. Natl. Acad. Sci. U.S.A. 83, 9483–9487 (1986).
43. C. K. Wang and H. C. Cheung, Proximity relationship in the binary complex formed between
troponin I and troponin C, J. Mol. Biol. 191, 509–521 (1986).
44. T. Tao, G. Strasburg, E. Gowell, and P. C. Leavis, Excitation energy transfer measurements
of the distance between Cys-98 of TnC and Cys-133 of TnI in reconstituted rabbit skeletal
troponin, Biophys. J. 47, (2, Part 2) 509 (Abstract) (1985).
45. O. Herzberg and M. N. G. James, Structure of the calcium regulatory muscle protein
troponin-C at 2.8 Å resolution, Nature 313, 653–659 (1985).
46. H. C. Cheung, C. K. Wang, and F. Garland, Fluorescence energy transfer studies of skeletal
troponin C: Proximity between methionine-25 and cysteine-98, Biochemistry 21, 2135–2142
(1982).
174 Herbert C. Cheung
47. C. K. Wang and H. C. Cheung, Effect of pH on the distance between Met-25 and Cys-98 of
troponin C, Biochemistry 24, 3364 (Abstract) (1985).
48. C. K. Wang, J. Lebowitz, and H. C. Cheung, Acid dimerization of skeletal troponin C
Proteins: Structure, Function and Genetics 6, 424–430 (1989).
49. K.-H. Huang, R. H. Fairclough, and C. R. Cantor, Singlet energy transfer studies of the
arrangement of proteins in the 30S Escherichia coli ribosome, J. Mol. Biol. 97, 443–470
(1975).
50. J. A. Langer, D. M. Engelman, and P. B. Moore, Neutron-scattering studies of the ribosome
of Escherichia coli: A provisional map of the locations of S3, S4, S5, S7, S8, and S9 in the
30S subunit, J. Mol. Biol. 119, 463–485 (1978).
51. W. Rychlik, W. Odom, and B. Hardesty, Localization of the elongation factor Tu binding site
on Escherichia coli ribosomes, Biochemistry 22, 85–93 (1983).
52. O. W. Odom, E. R. Dabbs, C. Dionne, M. Muller, and B. Hardesty, The distance between
S1, S2, and the 3´ end of 16S RNA in the 30S ribosomal subunits, Eur. J. Biochem. 142,
261–267 (1984).
53. K. B. Steinhauser, P. Wooley, J. Dijk, and B. Epe, Distance measurement by energy transfer.
Ribosomal proteins L6, L10, and L11 of Escherichia coli, Eur. J. Biochem. 156, 497–503
(1986).
54. H.-Y. Deng, O. W. Odom, and B. Hardesty, Localization of L11 on the Escherichia coli
ribosome by singlet–singlet energy transfer, Eur. J. Biochem. 156, 497–503 (1986).
55. M. J. Rhee, D. R. Sudnick, V. K. Arkle, and W. D. Horrocks, Jr., Lanthanide ion
luminescence probes. Characterization of metal ion binding sites and intermetal energy
transfer distance measurements in calcium-binding proteins. 1. Parvalbumin, Biochemistry 20,
3328–3334 (1980).
56. A. P. Snyder, D. R. Sudnick, V. K. Arkle, and W. D. Horrocks, Jr., Lanthanide ion
luminescence probes. Characterization of metal ion binding sites and intermetal energy
transfer distance measurements in calcium binding proteins. 2. Thermolysin, Biochemistry 20,
3334–3339 (1980).
57. C.-L. A. Wang, T. Tao, and J. Gergely, The distance between the high affinity sites of
troponin C measured by interlanthanide ion energy transfer, J. Biol. Chem. 257, 8372–8375
(1982).
58. P. Mulqueen, J. M. Tingey, and W. D. Horrocks, Jr., Characterization of lanthanide (III) ion
binding to calmodulin using luminescence spectroscopy. Biochemistry 24, 6639–6645 (1985).
59. C.-L. A. Wang, Distance measurements between metal-binding sites of calmodulin and from
these sites to cys-133 of troponin I in the binary complex, J. Biol. Chem. 261, 11106–11109
(1986).
60. R. H. Kretsinger and C. E. Nockholds, Carp muscle calcium-binding protein. II. Structure
determination and general description, J. Biol. Chem. 248, 3313–3326 (1973).
61. B. W. Mathews, L. H. Weaver, and W. R. Kester, The conformation of thermolysin, J. Biol.
Chem. 249, 8039–8044 (1974).
62. Y. S. Babu, private communication.
63. O. Herzberg and M. N. G. James, Crystallography determination of lanthanide ion binding
to troponin C, FEBS Lett. 199, 279–282 (1986).
64. L. Stryer, D. D. Thomas, and C. F. Mears, Diffusion-enhanced fluorescence energy transfer,
Annu. Rev. Biophys. Bioeng. 11, 203–222 (1982).
65. R. F. Steiner and M. Motevalli-Alibadi, The determination of the separation of tyrosine-99
and tyrosine-138 in calmodulin: Radiationless energy transfer, Arch. Biochem. Biophys. 234,
522–530 (1984).
66. R. A. Borkman and S. R. Phillips, Tyrosine-to-tryptophan energy transfer and the structure
of calf gamma-II crystallin, Exp. Eye Res. 40, 819–826 (1985).
67. M. C. Fournie-Zaluski, J. Belleney, B. Lux, C. Durieux, D. Gerard, G. Gacel, B. Maigret, and
Resonance Energy Transfer 175
Least-Squares Analysis of
Fluorescence Data
Martin Straume, Susan G. Frasier-Cadoret,
and Michael L. Johnson
4.1. Introduction
177
178 Martin Straume et al.
squares analysis were developed for, and are commonly applied to, a wide
range of types of experimental data in fields such as physics, chemistry,
biology, sociology, and economics.
considering each of the as a vector of length two, the first element being
the amplitude modulation of the emitted light and the second being the phase
shift of the emitted light.
The mathematical relationship between the independent and dependent
variables is referred to as the fitting function:
and
the perpendicular or horizontal distance from each data point to the best-fit
curve which is minimized. Therefore, in order for a least-squares analysis to
correctly predict the maximum-likelihood parameter values, the analytical
problem must be formulated, and the data collected, such that the X axis
(abscissa) is known to much greater precision than the Y axis (ordinate).
Experimental uncertainties in the independent variables cannot be correctly
compensated for by “appropriate weighting factors.” In general, there is no
way to circumvent this requirement and still correctly utilize a least-squares
procedure. A different method of analysis which does allow Gaussian
distributed experimental uncertainty on both the dependent and independent
variables is presented elsewhere. (2)
The first assumption allows transformations of the independent variables,
the X axis or abscissa. For example, in frequency-domain fluorescence lifetime
experiments the independent variable is frequency, but it is usually more
convenient to graphically represent the experimental data as a function of the
logarithm of frequency. Such a logarithmic transformation of the independent
variables does not invalidate the least-squares procedure.
The data obtained from fluorescence experiments are usually consistent
with the first assumption. For time-domain fluorescence lifetime measurements
the uncertainty in the time measurement (the X axis or abscissa) is small.
For frequency-domain fluorescence lifetime measurements the modulation
frequency of the excitation light (the X axis or abscissa) is known to excellent
presicion. The precision of the wavelength (the X axis or abscissa) for the
steady-state fluorescence case is also sufficient for the least-squares method of
analysis.
The second assumption states that experimental uncertainty of the
dependent variables, must follow a Gaussian distribution. This means
that if a particular data point was measured an infinite number of times,
a frequency histogram of the observed values for that point would follow
a Gaussian distribution. Figure 4.2A depicts a distribution of this form.
This assumption is met by many, but not all, experimental procedures.
In particular, the validity of this assumption, and the consequences of
Least-Squares Analysis of Fluorescence Data 183
violating this assumption, have been discussed with reference to the use of
the method of moments for the analysis of time-domain fluorescence lifetime
measurements.(3)
Nonlinear transformations of the dependent variables, the Y axis or
ordinate, violate the second assumption. While Figure 4.2A shows a Gaussian
frequency distribution of a dependent variable, Figure 4.2B illustrates the
perturbation of the same frequency distribution introduced by performing a
logarithmic transformation on the dependent variable. It is clear that the
frequency distribution in Figure 4.2B is not Gaussian and thus violates the
second assumption of least-squares parameter estimation. For example, in
time-domain fluorescence lifetime experiments the counting uncertainty in the
dependent variable, the frequency of counts within a given time range, follows
a Poisson distribution. When the frequency of counts is large, these Poisson
distributions can be approximated by a Gaussian distribution with a standard
deviation equal to the square root of the mean. Consequently, the second
assumption is usually reasonable for time-domain fluorescence experiments
when the data are represented as counts as a function of time. A common
method employed to analyze this type of data, which violates this assumption
of least-squares analysis is to plot the logarithm of the number of counts
as a function of time. For the case of a single exponential decay this plot
should be a straight line. However, such a logarithmic transformation of
the dependent variables produces a skewed distribution of experimental
uncertainties which is no longer a Poisson distribution and cannot be
approximated by a Gaussian distribution, as shown in Figure 4.2B, and thus
contradicts the assumptions of the least-squares procedure. Consequently, for
this example, it is not valid to use any least-squares procedure to determine
the slope of a plot of the logarithm of counts as a function of time.
The third assumption states that the experimental data have no systematic
uncertainties. All parameter estimation procedures make this assumption.
Systematic errors are a problem that must be corrected in the experimental
protocol rather than by the analysis procedure. The only way to circumvent
this assumption in the analysis procedure is to include a series of terms into
the fitting function, to describe all of the systematic uncertainties. The
184 Martin Straume et al.
vector of fitting parameters, will then include parameters which describe the
systematic uncertainties. A possible systematic uncertainty for time-domain
fluorescence lifetime measurements might be a slight difference in the initial
time value between the lamp intensity profile, L(t), and the observed fluo-
rescence intensities, If such a difference exists, and it cannot be removed
by instrument calibration, then a time offset term, could be included in
Eq. (4.3) such that
The exact number of data points required to satisfy this assumption is difficult
to specify and differs for each set of experimental conditions. Most time-
domain fluorescence lifetime instruments collect a minimum of several hundred
data points, which is usually sufficient for this type of experiment. However,
some of the older frequency-domain fluorescence lifetime instruments do
not produce an adequate number of data points for a proper least-squares
analysis since they collect data at only three fixed modulation frequencies.
The newer multi-frequency fluorometers can collect data at any number of
frequencies and can thus provide ample data for the analysis.
The last assumption states that each experimental observation,
is independent of any other data point. This means that the concomitant
experimental uncertainties in a particular data point do not influence the
experimental uncertainties in any other data point. For example, if the multi-
channel analyzer used for time-domain fluorescence lifetime experiments did
not provide stable time windows, then the uncertainties in the data points
would appear to be correlated with each other; that is, too many counts in
one channel would probably be correlated with too few counts in an adjacent
channel. Error dependence between data points might cause the analysis
procedure to produce erroneous parameter values.
Utilizing these six basic assumptions, it is relatively easy to demonstrate
that the method of least squares does produce values for the parameters,
with the highest probability, that is, maximum likelihood, of being correct. If
the parameters, are correct and all six assumptions are met, we can write
the probability of observing a particular dependent variable, at a
particular independent variable, as being equal to
The total probability that the values of the fitting parameters, are correct
for a complete data set is the product of the probabilities for the individual
data points:
186 Martin Straume et al.
where the product, and summation, apply to each of the n data points
with subscript i. The maximum probability that the fitting parameters are
correct, will occur when the summation in the exponential of Eq. (4.13)
is a minimum. Consequently, a procedure which will minimize this summation
will yield the set of fitting parameters which have the maximum likelihood of
being correct. We can thus define a NORM of the data to be minimized to
yield the maximum-likelihood parameter values as
where the i subscripts refer to a particular data point, the j subscripts refer to
a particular fitting parameter, and the k and superscripts refer to the
iteration number. Equation (4.16) is actually a series of equations, one for
each data point. This series of equations can be used to evaluate the values
of the fitting parameters for the iteration, because the form of the
fitting function, is known and we know the values of the fitting
parameters from the previous iteration, For the first iteration we must
arbitrarily assign the initial values of the fitting parameters, The procedure
is repeated until the fitting parameters, do not change significantly for
several iterations.
188 Martin Straume et al.
It should be noted that the only difference between linear and nonlinear
least-squares analysis is the number of significant terms in Eq. (4.16). If all the
second and higher order derivatives of the function with respect to the
Fitting parameters are zero, then the first-order series expansion, Eq. (4.16), is
exact. If Eq. (4.16) is exact, then the solution for the fitting parameters,
will be exact, and the process requires only a single iteration to reach stable
values of the fitting parameters, If the second and higher derivatives are not
zero, then Eq. (4.16) is only approximate, the parameter values produced will
be only approximately correct, and the process will require multiple iterations
before stable parameter values are attained.
Some authors have claimed that linear least-squares analysis does not
require an initial estimate, or guess, of the values of the fitting parameters,
This is not strictly correct. All of the standard equations for the solution of
linear least-squares problems can be derived from the Gauss-Newton non-
linear least-squares procedure by a careful selection of the initial values of the
fitting parameters and the use of a single iteration. When zero is used as the
initial value for the fitting parameters, then all terms which are explicitly
multiplied by the fitting parameters become zero and can be eliminated. So,
although it seems that no initial estimate of the parameter values is required
for the linear case of least-squares analysis, actually the initial estimate has
been assumed to be zero.
The evaluation of the fitting parameters at the next iteration, is
most easily explained in matrix notation such that Eq. (4.16) can be written
as
NORM (Eqs. 4.11, 4.13, and 4.14). If the value of the dependent variable,
represents the mean of a group of observations, then is the standard error
of the mean (SEM) of that group of observations, not the standard deviation
(SD) of the group of observations.
The matrix form of the series expansion, Eq. (4.17), can be solved for the
correction vector, if both sides of Eq. (4.17) are multiplied by the transpose,
of the partial derivative matrix, P, as in Eq. (4.21). Standard matrix
inversion techniques can then be applied to evaluate the correction vector,
as in Eq. (4.22):
by one part per thousand then the process has converged. A much better
criterion is that both the values of the fitting parameters, and the variance
change by less than one part per hundred thousand between successive
iterations. It is realistic to apply this more stringent criterion to the Gauss–
Newton procedure because usually only a few additional iterations are required
to produce the more rigorous convergence.
The derivation of the Gauss–Newton procedure presented in Eqs.
(4.15)–(4.23) has not been based on the minimization of the least-squares
NORM (i.e., sample variance ) in Eq. (4.14). However, from a careful
examination of Eq. (4.21), it can be demonstrated that the Gauss–Newton
procedure is a least-squares minimization procedure. It can be shown that the
elements of the vector on the left-hand side of Eq. (4.21), are propor-
tional to the derivative of the least-squares NORM, with respect to the
corresponding fitting parameter, The Gauss–Newton process is repeated
until the correction vector, on the right-hand side of Eq. (4.21) is equal to
zero. Thus, at convergence the terms on the left-hand side of Eq. (4.21), the
elements of the vector, are proportional to the respective first derivatives
of the least-squares NORM, and the right-hand side of Eq. (4.21), is
equal to zero. The standard method to locate a minimum of any function,
such as the least-squares NORM, is to set the first derivative of the function
equal to zero and to solve for the variable parameters, The Gauss–Newton
procedure is, consequently, a least-squares minimization procedure since the
first derivatives of the function are equal to zero at convergence.
For typical parameter estimation problems to which the Gauss–Newton
procedure is applied, the matrix (P'P) in Eq. (4.21) is difficult to invert
because it is almost singular. Consequently, the numerical procedure is very
sensitive to numerical truncation and round-off errors. For example, a typical
set of fitting parameters for time-domain fluorescence lifetime experiments is
composed of an amplitude of tens of thousands of counts per time channel
Least-Squares Analysis of Fluorescence Data 191
The LINPACK software package is a library of FORTRAN subroutines to perform linear algebra
manipulations. It is available from the International Mathematical and Statistical Library
(IMSL) Distribution Service, P.O. Box 4605, Houston, Texas 77210-4605, for a nominal charge.
192 Martin Straume et al.
where the k and j subscripts refer to elements of the correlation matrix and
the matrix.
The cross-correlation between fitting parameters, is particularly
useful to test the reliability of the determined parameters, If any of the
values of approach plus or minus one, then any variation in can be
almost totally compensated for by a variation of As a consequence, unique
values of and cannot be evaluated without additional information.
Unfortunately, the degree to which the can approach plus or minus one
without indicating a serious problem with the parameter estimation is not well
defined for nonlinear least-squares problems. The definition of the is
based on a number of limiting assumptions, such as the assumption that the
fitting equation was linear in the fitting parameters. Consequently, the use of
Least-Squares Analysis of Fluorescence Data 193
where is the reflection of the current simplex vertex with the highest
variance, relative to the centroid of the remaining vertices, and is the
reflection coefficient The reflected vector is thus on the line joining
Least-Squares Analysis of Fluorescence Data 195
determine the enzyme’s catalytic velocity for each set of conditions. The
Nelder–Mead parameter estimation algorithm can then be applied to arrive
at the next set of conditions (pH, temperature, and salt concentration) for
an experimental determination of the catalytic velocity. The Nelder–Mead
process is continued with actual experimental measurements as the method of
evaluation of the NORM, catalytic velocity, to be maximized. Convergence is
achieved when the observed catalytic activity of the enzyme no longer changes
within the precision of the experimental observations. Such an approach will
permit identification of the desired optimal conditions with much less effort
than that involved in performing the assay at all of the possible one thousand
sets of conditions.
The Hessian matrix of second derivatives of the NORM with respect to
the parameters being estimated is analogous to the (P´P) matrix of the
Gauss–Newton procedure. This matrix can be utilized to evaluate the same
variance–covariance matrix and the cross-correlation coefficients of the
estimated parameters as with the Gauss–Newton procedure. A convenient
method for estimating the shape of the variance surface near the minimum
involves using a quadratic approximation.(14) The n + 1 vertices characteristic
of the final, converged simplex are used to create “half-way
points” defined as A quadratic surface is then estimated
from this combined set of (n + 1)(n + 2)/2 points. The original vertices of the
final, converged simplex are used in defining a set of oblique
axes with coordinates thus permitting the points to be represented as:
where the are elements of the vector A and the are the elements of
the matrix B, which is analogous to the (P´P) matrix of the Gauss–Newton
procedure. Equation (4.32) can be written in matrix notation, as
Least-Squares Analysis of Fluorescence Data 199
where the prime refers to the transpose of the matrix. The coefficients are
estimated as:
was used to compare the classical square root of time decay law, Eq. (4.9),
with the more complete radiation boundary condition decay law. For more
details about the experimental protocol, refer to Joshi et al.(4) In the present
example we will analyze these data by the classical square root of time decay
law and the more complete radiation boundary condition formulation of the
decay law.(4)
The independent variable for this example is excitation frequency,
Each of the 21 data points has two different dependent variables,
amplitude modulation and phase shift, for a total of 42 experimental observa-
tions. The form of the fitting equations for these two dependent variables,
and is given by Eqs. (4.4)–(4.7) in terms of the intensity
decay law,
The fluorescence intensity decay, for the square root of time
approximation of the decay law is given by Eqs. (4.9) where
where
where x is defined by Eq. (4.46). A series of values of f(x) were evaluated once
and stored for 12 logarithmically spaced values of x. The error function
complement, erfc( ), was evaluated by the method of Hastings et al.(15) The
evaluation of Eqs. (4.44)–(4.47) utilized a cubic spline interpolation (Ref. 7,
p. 70) of the values f(x) and x for each iteration of the parameter estimation
process. The integrations in Eqs. (4.6) and (4.7) were performed by an 8-panel
Newton–Cotes adaptive quadrature routine, DQUANC (Ref. 7, p. 97).
The choice of units for the fitting parameters, is important to avoid
truncation and round-off errors during the calculations. Consequently, the
intrinsic fluorophore lifetime, was calculated in nanoseconds, the unit for
the interaction radius, R, was angstroms, the sum of the donor and acceptor
diffusion coefficients was expressed as the logarithm (base ten) of D in cm2/s,
and the specific rate constant, K, was expressed as cm/s.
For the purposes of this example we have actually only performed the
parameter estimation process on two parameters, the interaction radius, R,
and the log of the diffusion coefficient, log D. This was done to simplify the
problem so that two-dimensional graphs of the process could be presented.
The donor lifetime was assigned the value 38.8 ns and the quenching rate
constant was assumed to be 150 cm/s, as in the paper by Joshi et al.(4) Given
sufficient data, the least-squares procedures would, of course, be capable of an
evaluation of more than two parameters. The number of degrees of freedom
for this analysis is 40, that is, 21 data points times 2 dependent variables per
data point minus 2 parameters being estimated.
In Table 4.3 the values of the interaction radius, R, and the logarithm of
the sum of the donor and acceptor diffusion coefficients, log D, are presented
for successive iterations of the Gauss–Newton procedure applied to the data in
Least-Squares Analysis of Fluorescence Data 203
Table 4.2 and the radiation boundary intensity decay law, Eqs. (4.44)–(4.46).
These results were presented in Table 2 of Joshi et al.(4) The rapid, quadratic
convergence properties of the Gauss–Newton procedure are clearly shown
in Table 4.3. The sample variance, decreased from 5605 to 98 in the first
iteration. By the fourth iteration the procedure has converged to where the
fractional changes in the parameters were 0.00005 for log D and 0.00018 for
R while the variance decreased by a fractional change of only 0.00018. The
changes for the fifth iteration were below the level of the single-precision
arithmetic of the computer. At each iteration the derivatives of the fitting
parameters were evaluated by a three-point Lagrange differentiation for-
mula, (12) so each iteration required five evaluations of the fitting function at
each data point.
Once the Gauss–Newton procedure has converged, the (P'P) matrix
can be used to evaluate several measures of the reliability of the parameter
estimation procedure. The cross-correlation coefficient between R and log D,
for this example, is –0.90280. This value is within an acceptable range,
indicating that the parameter estimation procedure has worked reasonably
well. The asymptotic variance of the estimate of the interaction radius, R, is
which corresponds to an asymptotic standard error of
The asymptotic variance of the estimate of the logarithm of the sum of the
donor and acceptor diffusion coefficients, log D, is which
corresponds to an asymptotic standard error of The asymptotic
covariance between R and log D was
The eigenvalues of the correlation matrix were 1.90280 and 0.09719545.
Neither eigenvalue is zero, indicating that the analysis procedure has
produced reasonable values. The largest eigenvalue is proportional to the
error in the asymptotic standard errors of the fitted parameters when cross-
204 Martin Straume et al.
correlation with other fitted parameters is neglected. In this case, the actual
standard errors of the determined parameters will be about 1.9 times the
asymptotic standard errors.
Table 4.4 presents a comparison of the values of R, D, and for the
square root of time and the radiation boundary intensity decay laws. (4) It is
clear that the values of the interaction radius and the total diffusion coefficient
as obtained from the two fitting functions, Eqs. (4.44)–(4.46) as compared to
Eqs. (4.9) and (4.42)–(4.43), are quite different. This is an example of a conse-
quence of the fourth assumption of least-squares parameter estimation. The
physical meaning of the determined parameters is based on an assumption
that the fitting equation, is correct. This assumption applies to all
parameter estimation procedures, not just least-squares. Consequently, the
determined mechanistic parameters should be reported only with reference to
the assumed mechanism.
It is also clear from this example that the variance of fit, is sub-
stantially larger for the square root of time intensity decay law as compared
to the radiation boundary condition intensity decay law. Thus, the radiation
boundary condition intensity decay law appears to be a better description of
the actual experimental data. If the two sample variances, in Table 4.4,
were only different by 10% instead of 480%, the statistical validity of this
statement would be questionable. An F test(16) can be applied to evaluate the
probability that the radiation boundary condition intensity decay law is a
better description of the actual experimental data than the square root of time
intensity decay law.
In the present example the F test is used as a measure of the probability
that the data analysis by the two independent intensity decay laws yields
residuals—the differences between the calculated values and the actual data
points—which are different. The F statistic is defined as the ratio of the two
variances:
Least-Squares Analysis of Fluorescence Data 205
Table 4.6 and Figure 4.4. demonstrate the use of the Nelder–Mead simplex
procedure for the same radiation boundary condition example as used for the
Gauss–Newton procedure above. Points A, B, and C are the initial simplex
†
The IMSL is the International Mathematical and Statistical Library. It is the most complete
library of subroutines of its type and was written in standard FORTRAN. It is available from the
International Mathematical and Statistical Library (IMSL) Distribution Service, P.O. Box
4605, Houston, Texas 77210-4605.
206 Martin Straume et al.
the estimated values. The joint confidence interval for a determined parameter
includes the effects of variations of all of the other parameters, within their
respective joint confidence intervals, while simultaneously maintaining a
statistically valid fit of the data. The maximum-likelihood parameter values
have little meaning without a reasonable estimate of their precision. Conse-
quently, the maximum-likelihood parameter values and the associated joint
confidence intervals should always be reported as a single entity; that is, never
report the parameter values without a reasonable estimate of their uncertainty.
Eq. (4.14), and the entire least-squares procedure need not be repeated. Thus,
if the joint variation in the fitting parameters for a minimum variance can be
predicted a priori, then the support plane method for the evaluation of the
confidence intervals is practical for multiparameter problems.
The advantage of the linear joint confidence interval method is that, for
a linear fitting equation, it provides a method of predicting the shape of
the variance space for any variation of the parameters. The major axis of
the multidimensional elliptically shaped solution of Eq. (4.49) is a good
approximation, even for most nonlinear problems, of how the variation
of a single fitting parameter will induce a variation in the remaining fitting
parameters and still maintain the lowest possible variance. Consequently, the
solutions of Eq. (4.49) can be used to vary the parameters jointly in the
support plane method for multiparameter problems. The reader is cautioned
that this approach is only an approximation to the complete support plane
method, but our experience has been that it is almost always a very good
approximation.
The solutions, of Eq. (4.49) define a new set of fitting parameters.
These new fitting parameters, NEW i , are formed as
where the summation is over each of the n fitting parameters, and the SFij are
a set of scaling factors. These scaling factors are chosen such that the new
parameters, NEW i , are all orthogonal. Since these new parameters are all
orthogonal to each other, the variation of one of these new parameters,
NEW i , will not affect the values of the other new parameters, NEW j , for
which correspond to a minimum variance.
The geometric interpretation of this new set of parameters is relatively
straightforward. Sets of parameter values which can be considered as “correct”
answers to within some statistical probability lie within the multidimensional
Least-Squares Analysis of Fluorescence Data 213
elliptical region (Figures 4.4 and 4.6) while sets of parameter values out-
side this elliptical region are considered to be “incorrect.” The axes of this
multidimensional ellipse do not, in general, coincide with the axes of our
coordinate system. The process of choosing a new coordinate system is simply
a rotation of the coordinate system such that the new coordinate system,
NEW i , coincides with the major and minor axes of the multidimensional
ellipse (as in Figures 4.4 and 4.6).
The support plane method can be applied to the variation of each of
these new parameters, NEW i , in order to find the value of each of the new
parameters which corresponds to the critical variance ratio (Eq. 4.52). This
is equivalent to searching each of the major and minor axes of the multi-
dimensional elliptically shaped confidence region (see Figures 4.4 and 4.6) for
the critical variance ratio. The values of the original fitting parameters, at
the ends of the axes of the elliptically shaped region can be evaluated from the
scaling factors, SFij, and the new parameters, NEW i , which correspond to
the critical variance ratio defined in Eq. (4.52). The maximum variations of
the original fitting parameters, are then taken as an approximation to the
nonlinear joint confidence intervals.
Figure 4.6 graphically depicts an example of this search procedure. It
corresponds to the analysis of the confidence intervals for the example which
was previously used to demonstrate the Gauss-Newton and the Nelder–Mead
procedures (see Figure 4.4 and Tables 4.2–4.6). The example involved the
evaluation of log D and R by the radiation boundary model based on the
data presented in Table 4.2. The elliptical region in Figure 4.6 is the same as
the region in Figure 4.4 and corresponds to a one-standard deviation, 67%
confidence region. The diamonds are the results of the search of the new
parameters NEW i which corresponded to the major axis of the solution of
Eq. (4.49). The triangles are the results of the search of the new parameters
NEW i which corresponded to the minor axis of the solution of Eq. (6.49). The
plus signs correspond to the values found by searching the original fitting
parameters, independently of each other, that is, without repeating the
fitting procedure at each of the values of currently being evaluated.
It is recommended that confidence regions be defined by the evaluation
of all of the points shown in Figure 4.6. This will require searches for the
critical variance ratio in 4n directions, where n is the number of parameters
being estimated, that is, each in both directions and each NEW i in both
directions. The reader is reminded that these 4n searches involve only the
evaluation of the variance; that is, they do not require additional least-squares
parameter estimation at every point in the search. Consequently, even though
this represents twice as many searches as the support plane method, each of
the searches is much faster. It is important to perform each of the searches in
both directions, because for nonlinear functions the joint confidence interval
is usually asymmetrical.(5, 20–22) The net result of performing all 4n searches is
214 Martin Straume et al.
that the entire shape of the joint confidence region is characterized, even when
it is asymmetrical.
The evaluation of the scaling factors, SFij, is relatively easy. The eigen-
vectors of the correlation matrix, Eq. (4.26), define the directions of the axes
of the multidimensional elliptical region in the cross-correlation space. By
multiplying the elements of these eigenvectors times the square roots of the
respective diagonal elements of the variance–covariance matrix, the eigen-
vectors become direction vectors in the original parameter space. It is these
direction vectors starting with the maximum-likelihood values of the fitting
parameters, which are to be searched for a critical variance ratio. The
eigenvalues provide the relative lengths of the axes. Thus, in order to search
the major axis for a critical variance ratio Eq. (4.52), the following steps need
to be performed. First, find the eigenvector of the correlation matrix that
corresponds to the largest eigenvalue of the correlation matrix. Second,
convert the eigenvector into the parameter space by multiplying its elements
by the particular eigenvalue and by the square roots of the respective diagonal
elements of the variance–covariance matrix. Third, search along this eigen-
vector for the set of parameter values which yield the critical variance ratio
calculated from Eq. (4.52). It should be noted that the shape of the variance
space for this search is proportional to the square of the distance along the
eigenvector. Be sure to search the eigenvector in both directions since the
variance space will, in general, not be symmetrical. Furthermore, it is
important to search all of the eigenvectors, that is, the eigenvectors which
correspond to all of the eigenvalues of the correlation matrix (see Section
4.6.6). The reader is referred to the EISPACK software package(8,9) for a group
of FORTRAN subroutines, such as TQL2 and TRED2, which can be used to
evaluate the eigenvalues and corresponding eigenvectors of the correlation
matrix.
In recapitulation, this approximate method for the evaluation of the
confidence intervals of determined parameters has the advantages of being
relatively fast, of considering the nonlinear nature of the fitting function,
and of considering the cross-correlation between the fitting parameters.
Although it is only an approximation of the support plane method, it is highly
recommended.
The last method for the evaluation of confidence intervals which we will
present is a Monte Carlo method. This method has the advantage that it
directly generates the entire probability distribution for each of the determined
parameters without making an assumption about the shape of the probability
distribution. However, this method requires a knowledge of the shape of the
distribution of the experimental uncertainties associated with each of the data
Least-Squares Analysis of Fluorescence Data 215
points. It should also be noted that this method requires a large amount of
computer time and, as a consequence, probably should only be used in cases
where the entire probability distribution of the fitted parameters is required.
The basic method is very simple. The first step is to analyze the data by
any method, such as least-squares, which will yield a set of fitted parameters,
which have the maximum likelihood of being correct. Given this set of
answers, and the original distribution of independent variables, Xi, a set of
synthetic data is generated. This set of data simply consists of the evaluation
of the fitting function at and each of the Xi, The next step is to take this
perfect simulated data set and add to it realistic simulated experimental
uncertainties such that we have a data set which is based on a known set
of parameters and contains realistic experimental uncertainties. The con-
sideration of uncertainties requires the generation of pseudo-random noise
to be superimposed upon the simulated dependent variable values, Yi. The
particular distribution of this pseudo-random noise should be consistent with
the experimental uncertainty distribution of the actual experimental data. This
distribution of experimental uncertainties is required to follow a Gaussian
distribution by the second basic assumption of the least-squares method.
For a discussion of how to generate Gaussian distributed pseudo-random
numbers, see Ref. 7, p. 240. This last step is repeated a few hundred times such
that a few hundred simulated data sets are generated, each with a different set
of random uncertainties included. The least-squares analysis is then performed
on each of the few hundred simulated data sets, and frequency histograms of
each of the parameters, are generated from the results of the few hundred
least-squares parameter estimations. These histograms will represent a
reasonable approximation of the probability distributions for the determined
parameters based on an analysis of the original experimental data.
In effect this process has asked the question, “How does a particular dis-
tribution of experimental uncertainties affect the analysis of a particular data
set?” The results are dependent on the assumed magnitude and distribution of
experimental uncertainties. The results are also dependent on the particular
distribution of independent variables, Xi. As the number of simulated data
sets is increased, the precision of the resulting probability histograms will
increase. The exact number of simulated data sets which are required is
impossible to predict a priori, but a reasonable number seems to be a few
hundred. This, of course, means that the computer time required to perform
this process will be a few hundred times what was required for the original
least-squares analysis.
other derived quantity. For example, for the square root of time description
of collisional quenching the investigator may wish to calculate the value of
from Eq. ( 4.42 ) or the value of b from Eq. (4.43). For the radiation boundary
example the investigator might want the value of the quenching rate at time
zero, k(0), from Eq. (4.45). It is relatively easy to calculate the maximum-
likelihood values of these quantities from the maximum-likelihood values of
the fitting parameters, However, the method for the propagation of the
confidence intervals is not obvious since the individual fitting parameters,
are expected to be correlated with each other and their respective confidence
intervals are expected to be asymmetrical.
The confidence intervals of the fitting parameters can be evaluated
relatively simply by the method outlined in Section 4.6.4.(22) The main
objective of this method is to characterize the shape of the one-standard
deviation joint confidence intervals by evaluating 4n points on the elliptically
shaped confidence region. A similar confidence region exists for any derived
quantity. All that is required to determine confidence intervals for any derived
parameter is to map each of the 4n points which characterize the original
confidence ellipse into a parameter space for each of the derived quantities.
This is done by evaluating the derived quantity at each of the 4n points which
describe the original confidence contour. The extreme spread of the resulting
4n values of each of the derived quantities is simply the one-standard
deviation confidence interval for that derived parameter.
Evaluating confidence intervals of derived parameters from the proba-
bility distributions obtained by the Monte Carlo method is also simple. The
derived parameters are evaluated from each set of the maximum-likelihood
fitted values, obtained from each of the few hundred sets of synthetic data.
A complete probability distribution of the resulting derived parameters can
then be constructed.
4.7.1. Plots
two distributions. The point distribution in the scatter diagram of the upper
panel of Figure 4.7 shows no obvious non-Gaussian behavior, whereas the
lower panel of Figure 4.7 clearly exhibits an upward trend in the residual
values with increasing abscissa (X axis). If the variability in the residuals is
not constant, the experimenter is readily made aware of the presence of some
potential systematic behavior in the observed data. Some inadequacy of the
analytical model is implied if the variance of the residuals (proportional to the
width of the scatter band) remains approximately constant but the mean
residuals appear to show some trend with or This phenomenon is
shown in the lower panel of Figure 4.7. Differences in the variance of residuals
as a function of or on the other hand, imply that some potential
systematic behavior is arising from the measuring process itself rather than
from an inappropriate model used in analysis. Of course, combinations of
both of these effects may be present. This very simple, visual tool may there-
fore yield substantial insight into difficulties with the experimental procedure
and/or the fitting function.
Plots of the residuals versus [case (c) above] offer a test of the
hypothesis that the functional relationship(s), of one or more independent
variables, represents implied, or necessary, functional behavior that
should have been, but was not, included in the analytical model currently
employed in estimating the observed behavior. For example, suppose our
model is given by
4.7.2. Distributions
with
where is the mean value (in this case, zero), s is the standard deviation of
the residuals (the square root of the sample variance), and X is one end of the
interval being considered. The absolute value is used to eliminate the
occurrence of negative Z values which would arise when For example,
suppose that the relative frequency of occurrence is desired within the interval
that is, for residuals with values greater than and less than
Values for and are calculated as per Eq. (4.59) by substituting or
for X. The relative frequency of occurrence over the interval will then
be defined by the expression where freq corresponds
to the frequency found from a table of areas under standard normal curves at
The absolute value is again used to eliminate the occurrence of
negative values. This process is carried out for all intervals, and the relative
expected frequencies are multiplied by the total number of residuals to yield
the absolute expected frequencies.
The next step involves quantitatively characterizing the discrepancies
between observed and expected frequencies by calculating a chi-square statistic
as
Least-Squares Analysis of Fluorescence Data 225
A point must be made about small expected frequencies, that is, those
less than one. This approximation of is not strictly valid when the expected
frequencies are small. Generally, adjacent intervals should be combined to
achieve a desired minimum frequency of not less than one.
4.7.3. Trends
where is the actual number of runs observed. The value of Z will be dis-
tributed approximately as a standard normal deviate if both and are
greater than 10. The value of 0.5 is a continuity correction which partially
compensates for the approximation of a discrete distribution by a continuous
distribution. If too many runs, instead of too few, are being tested for, a value
of –0.5 is used. This approach therefore permits one to estimate the statistical
probability that the number of runs actually encountered is different from that
expected if the residuals had occurred randomly with respect to the variable
under consideration. In other words, the greater the value of the greater
Least-Squares Analysis of Fluorescence Data 227
the probability that some correlation exists in the residuals relative to the
variable being considered.
Table 4.10 demonstrates an application of the runs test to the residual
distributions presented in Table 4.7 and Figure 4.7. The derived Z values
(between 0.40 and 1.24) are not sufficiently large to necessarily conclude that
either of these two distributions is not Gaussian distributed, based on this test
alone. On the other hand, values of Z greater than approximately 2.5 will
correspond to probabilities of less than about 1 % that the distributions are
actually random. It is important to note that a Z value greater than 2.5
implies that one may justifiably reject the hypothesis that the residuals are
Gaussian distributed. However, if a Z value less than or equal to 2.5 is
obtained with the runs test, one cannot necessarily conclude Gaussian
distribution of residuals without further confirmation.
Results of an analysis of the number of runs may indicate that too few
runs are present (implying possible positive serial correlation) or that too
many runs are present (implying possible negative serial correlation). The
term “serial correlation” implies a repeated pattern among residuals. Serial
correlation is most frequently applied to time series experiments. Variables
other than time, however, may conceivably offer practical significance with
regard to the concept of serial correlation under appropriate conditions.
Correlation among residuals may be determined 1,2,..., n units apart,
thus characterizing the lag1, lag2,..., lagn serial correlations, respectively.(26)
An examination of the lagn serial correlation of residuals is accomplished by
generating a plot of the value of each residual versus the value of the residual
occurring n units before the one currently being considered. Of course, the
first n residuals therefore cannot be plotted. Positive lagn serial correlation will
228 Martin Straume et al..
4.7.4. Outliers
where and are the ith and (i – l)th residuals, respectively, and and
are parameters defining the form of serial correlation hypothesized to be
present in the residuals. The are assumed to follow a Gaussian distribution
with some constant variance, and are independent of all and when
It is also assumed that the have a constant mean and variance and are
independent of when Therefore, the follow a Gaussian distribution
with the variance given by When no serial correlation is apparent,
and the situation reduces to that stated in the null hypothesis.
To decide whether or not any serial correlation is present in the residuals,
a parameter, d, is calculated as
alternative to the more familiar chi-square procedure and does not suffer from
the aforementioned limitations. This test permits a statistically quantifiable
comparison to be made between a sample cumulative distribution and
some theoretical cumulative distribution function. A cumulative distribution
function is that which gives the probability that a value less than or equal to
some specified value will occur. The statistic calculated in the Kolmogorov–
Smirnov test, D, is the maximum difference between the observed and
theoretical cumulative distribution functions over all values of the variable
under consideration, X (Ref. 25, p. 387):
The optimal choice of a language is one which meets all of the investigator’s
absolute requirements and is available on almost every computer system.
For example, ten years ago when at the National Institutes of Health, one
of us (MLJ) asked their computer specialists which language should be used.
He was told that if he wanted to use their Digital Equipment Corporation
computer (DEC System 10), he should use the SAIL language, and if he wanted
to use their International Business Machines computer (IBM 360), he should
use PL/1. This investigator instead chose FORTRAN. Later, after moving
to the University of Virginia, he discovered that SAIL programs can only
run on DEC System 10/20 computers. Furthermore, the University of
Virginia’s Control Data Corporation computer (CDC 855) did not have a
PL/1 compiler. However, all three computers had FORTRAN compilers, and,
consequently, there was no need to translate any programs. Translations
can be very time-consuming and expensive, especially when the computer
programs are long and complex. Almost every computer currently in use has
a FORTRAN-77 compiler available. Most computers now also have Pascal, C,
and BASIC available. Other languages like ALGOL, BLISS, FOCAL, and Modula-2
are not commonly available. Therefore, in this discussion of relative merits we
will consider only the most popular languages: FORTRAN-77, Pascal, C, and
BASIC.
The reader is cautioned not to use the convenient language extensions
which most manufacturers have included in their compilers. For example,
some versions of BASIC allow the direct manipulation of matrices while others
and the language standard do not. Consequently, if a BASIC program which
uses these matrix features on one computer is transfered to another computer
whose compiler does not have these extensions, the program will not run.
Another example is the number of attractive services provided by the UNIX
operating system which are exclusive to particular versions of UNIX. If a
computer program is developed which relies on operating system features on
one computer, it will be difficult or impossible to transfer it to a different
computer system! To make matters more confusing, a number of computer
languages, such as Pascal and BASIC, have such poorly defined “standards”
that they vary substantially among computers from different manufacturers.
A computer program should use only those features of a language which
are the same on all computers which implement that language, that is, the
minimal language standard.
There are a number of computer language features that are required for
the development of least-squares analysis programs which are not common
among FORTRAN -77, Pascal, C, and BASIC. BASIC has a number of unattractive
Least-Squares Analysis of Fluorescence Data 237
features for this application. It does not always permit the easy use of sub-
routines (or subprocedures), and it does not always permit more than single-
character variable names. A potential problem with Pascal relates to the
“Pascal standard,”(27) which does not provide for double-precision floating-
point variables. Matrix manipulations, such as dot products, must be performed
with greater precision than that offered by single-precision floating-point
calculations, which typically store variables as 32-bit numbers. Furthermore,
the “Pascal standard” also does not allow subprocedures to be compiled
separately from the main program. It is inefficient and time-consuming to
recompile thousands of lines of Pascal code in order to make a simple change
in a ten-line subprocedure. The Pascal enthusiast can argue that most Pascal
compilers are not restricted by the “Pascal standard.” However, beware
that different compilers implement these features in different ways and thus
translation from one computer system to another may be difficult.
The efficiency of a computer program is important for some of the
procedures which we have presented, which require hundreds of evaluations
of complex functions. Inefficiencies can be grouped into two categories:
(a) the inherent slowness of a language, for example, BASIC, which is usually
interpreted† rather than compiled, and (b) inherent inefficiencies of some
compilers. For example, we have two different Pascal compilers for our
DEC PDP-11/73 running the operating system. We used a prime
number generation program to compare our Pascal-2 compiler with our
NBS-Pascal compiler†† and found that the NBS-Pascal compiler required
almost five times as much computer time for the same calculation. However,
the Pascal-2 compiler was substantially more expensive.
Of less importance, but still worth considering, are the optional features
which the languages do not have in common. None of these are required for
the least-squares application but are needed in some other applications. The
FORTRAN “labeled common blocks” are very useful and are not allowed by the
Pascal, C, and BASIC standards. The FORTRAN “equivalence statements” can
also be very useful. Recursion, the ability of a procedure, or subprocedure, to
†
An “interpreted” computer language is one in which code is translated to machine instructions
line by line. Each translated line is executed and discarded before the next line of the computer
program is compiled. This process is usually substantially slower than that followed in the case
of a “compiled” computer language, whereby the entire computer program is translated into
machine instructions, then executed in a separate subsequent step.
is a time-sharing operating system for DEC PDP-11 computers which is available
f r o m S and H Computer Systems, Inc., 1027 I7th Avenue South, Nashvill, Tennessee 37211.
This operating system is essentially an extension of the DEC RT-11 operating system.
††
Pascal-2 is a commercial Pascal compiler which is available from Oregon Software, 2340 S.W.
Canyon Road, Portland, Oregon 97201. NBS-Pascal is available from the Digital Equipment
Computer Users Society (DECUS), 219 Boston Post Road, Marlboro, Massachusetts 01752,
for a nominal fee. The NBS-Pascal compiler is included in the Spring 198S RT-11 Special
Interest Group Distribution Tape.
238 Martin Straume et al.
either directly or indirectly call itself, can also be useful. The and
BASIC standards do not allow recursion, but some implementations of these
languages do allow recursion. Another language feature which is attractive,
but not essential, is the ability to write “structured” programs,
and allow structured programming, while Pascal and C require struc-
tured programming. It should ne noted that it is sometimes very convenient
not to be required to use structured programs, even though it is usually an
excellent way to program.
The last important point is to avoid writing routines which have already
been programmed. Subroutine libraries exist to perform most of the linear
algebra and eigensystem operations required for least-squares parameter
estimation. Examples are the and
libraries of subroutines. These are all written in and/or
Some computers will allow linking a routine written in one
language to a program written in another language if certain specific condi-
tions are met, but this is a difficult process at best. A better way to approach
the encoding of an analysis procedure is to begin coding in the language of
choice, that is, the standard, so that implementing preexisting
routines to handle complicated, but standard operations is straightforward.
There is little justification for ever converting a functional computer program
from one language to another unless the computer system being used does not
have a compiler for the first language. Again, we recommend that computer
programs be written in either the standard (28) or, sometimes,
if the computer is running the operating system, the C language
standard. (29)
4.9. In Summary
Computers are not oracles. The results of any computer analysis are
dependent on the quality of the data, on the assumptions of the analysis
method, and on the expertise of the computer programmer who generated the
software. Consequently, computer users should always be aware that com-
puter programs might give them wrong answers. Computer programs should
always be tested with various types of “test” data and the possibility that
answers derived from the analysis of actual experimental data may have no
physical meaning should always be considered.
Two of the cited references are excellent entry-level numerical methods
References
1. J. R. Lakowicz, Principles of Fluorescence Spectroscopy, pp. 65–82, Plenum Press, New York
(1983).
2. M. L. Johnson, The analysis of ligand binding data with experimental uncertainties in the
independent variables, Anal. Biochem. 148, 471–478 (1985).
3. I. Isenberg, Robust estimation in pulse fluorometry: A study of the method of moments and
least squares, Biophys. J. 43, 141 (1983).
4. N. Joshi, M. L. Johnson, I. Gryczynski, and J. Lakowicz, Radiation boundary conditions in
collisional quenching of fluorescence: determination by frequency domain fluorometry, Chem.
Phys. Lett. 13S, 200–207 (1987).
5. M. L. Johnson and S. G. Frasier, Nonlinear least-squares analysis, Methods Enzymol. 117,
301–342 (1985).
6. J. A. Nelder and R. Mead, A simplex method for function minimization, Comput. J. 7,
308–313 (1965).
7. G. E. Forsythe, M. A. Malcolm, and C. B. Moler, Computer Methods for Mathematical
Computations, Prentice-Hall, Englewood Cliffs, New Jersey (1977).
8. B. T. Smith, J. M. Boyle, J. J. Dongarra, B. S. Garbow, Y. Ikebe, V. C. Klema, and
C. B. Moler, in: Matrix Eigensystem Routines—EISPACK Guide, Second Ed. (G. Goos and
J. Hartmanis, eds.), Springer-Verlag, New York (1976).
9. B. S. Garbow, J. M. Boyle, J. J. Dongarra, and C. B. Moler, in: Matrix Eigensystem
Routines—EISPACK Guide Extensions, (G. Goos and J. Hartmanis, eds.), p. 69, Springer-
Verlag, New York (1977).
10. V. N. Faddeeva, Computational Methods of Linear Algebra, p. 81, Dover, New York (1959).
11. J. J. Dongarra, C. B. Moler, J. R. Bunch, and G. W. Stewart, LINPACK Users' Guide, Society
for Industrial and Applied Mathematics, Philadelphia (1979).
12. F. B. Hildebrand, Introduction to Numerical Analysis, p. 82, McGraw-Hill, New York (1956).
13. M. S. Caceci and W. P. Cacheris, Fitting curves to data, Byte Magazine 9(5), 340–362
(1984).
14. W. Spendley, G. R. Hext, and F. R. Himsworth, Sequential application of simplex designs in
optimization and evolutionary operation, Technometrics 4, 441 (1962).
15. C. Hastings, J. T. Hayward, and J. P. Wong, Approximations for Digital Computers, p. 187,
Princeton University Press, Princeton, New Jersey (1955).
240 Martin Straume et at.
16. P. R. Bevington, Data Reduction and Error Analysis for the Physical Sciences, pp. 187–203,
McGraw-Hill, New York (1969).
17. G. E. P. Box, W. G. Hunter, and J. S. Hunter, Statistics for Experimenters, p. 636, Wiley-
Interscience, New York (1978).
18. M. Zelen, and N. C. Severo, Probability functions, in: Handbook of Mathematical Functions
with Formulas, Graphs, and Mathematical Tables, Ninth Printing (M. Abramowitz and
I. A. Stegun, eds.), pp. 925–997, National Bureau of Standards Applied Mathematics Series
#55, Washington, D. C. (1970).
19. G. E. P. Box, Fitting empirical data, Ann. N.Y. Acad, Sci. 86, 792–816 (1960).
20. M. L. Johnson, H. R. Halvorson, and G. K.. Ackers, Oxygenation-linked subunit interactions
in human hemoglobin: Analysis of the linkage functions for constituent energy terms,
Biochemistry 25, 5363–5371 (1976).
21. M. L. Johnson, J. J. Correia, D. A. Yphantis, and H. R. Halvorson, Analysis of data from the
analytical ultracentrifuge by nonlinear least-squares techniques, Biophys. J. 36, 575-588
(1981).
22. M. L. Johnson, Evaluation and propagation of confidence intervals in nonlinear, asymme-
trical variance spaces: Analysis of ligand binding data, Biophys. J. 44, 101–106 (1983).
23. P. Armitage, Statistical Methods in Medical Research, Fourth Printing, pp. 316–319,
Blackwell Scientific Publications, Oxford (1977).
24. Y. Bard, Nonlinear Parameter Estimation, pp. 201–202, Academic Press, New York (1974).
25. W. W. Daniel, Biostatistics: A Foundation for Analysis in the Health Science, Second Ed.,
John Wiley & Sons, New York (1978).
26. N. R. Draper and H. Smith, Applied Regression Analysis, Second Ed., pp. 153–174,
John Wiley & Sons, New York (1981).
27. K. Jensen and N. Wirth, Pascal User Manual and Report, Second Ed., Springer-Verlag,
New York (1978).
28. L. P. Meissner and E. I. Organick, FORTRAN 77: Featuring Structured Programming,
pp. 427–482, Addison-Wesley, Reading, Massachusetts (1980).
29. B. W. Kernighan and D. M. Ritchie, The C Programming Language, Prentice-Hall,
Englewood Cliffs, New Jersey (1978).
5
5.1. Introduction
241
242 Joseph M. Beechem et al.
obtain time-resolved fluorescence data during the past ten years. The use of
high-repetition-rate picosecond dye lasers and fast detectors (such as micro-
channel plate photomultipliers) has had a major impact on both time- and
frequency-domain measurements. Data acquisition time has been greatly
decreased, especially for single-photon counting experiments, which used to be
limited by the repetition rate of the excitation light source. Accurate single-
photon counting decay data may now be collected in under a second(1–3) and
will probably approach the millisecond time scales very soon. These fast
collection rates are beginning to make it possible to perform “double-kinetic”
experiments: multiple picosecond-microsecond fluorescence experiments per-
formed on the millesecond time scale. Reports of similar efforts to decrease
data acquisition time in the frequency domain have also appeared.(4) Parallel
(“multifrequency”) data collection(5) (i.e., where many frequencies are acquired
simultaneously) should soon result in a drastic reduction in data acquisition
time in the frequency domain. Alternative stroboscopic data acquisition
schemes in the time domain are also being developed.(6)
Many numerical techniques, such as nonlinear least squares,(7–9) method
of moments,(10,11) Laplace transforms, (12,13) and modulating functions, (14)
have been used to analyze fluorescence decay data. It has been found that
for the accurate recovery of closely spaced exponentials (or distributions
of exponentials), the analysis of individual fluorescence decay experiments
is usually insufficient. For the accurate recovery of complex fluorescence
decay phenomena, it is advantageous to combine more than one fluorescence
decay experiment into a single analysis.(15–20) A number of nonlinear last-
squares(19–26) and Laplace transform (27) software routines have been developed
with multiexperiment capabilities. The simultaneous analysis of multiple
fluorescence decay experiments is frequently referred to as “global” analysis
and has proven useful for both time- and frequency-domain data.
Global analysis procedures are of significant advantage when some
unknown parameter(s) of interest is linked between two or more fluorescence
decay experiments obtained under different conditions. A “condition” in this
context refers to temperature, pressure, excitation/emission wavelengths, or
any other independent variable which can be experimentally manipulated. In
many situations, the parameters are not linked directly between experiments,
but rather through a mathematical function. The overall strategy behind a
typical global analysis can best be illustrated with a few examples.
lifetimes, closely spaced quenching decay constants are more easily resolved
by global multiwavelength analysis (either steady state or time-resolved).
The global analysis examples described above (and many others) have
proven very useful for many different studies. However, these examples mainly
represent an adaptation of classical single-curve analysis algorithms to perform
a global analysis. The global analysis approach has since been expanded, in
a manner that is farther removed from simple curve fitting, to an emphasis on
physical-model evaluation.
For instance, in the rotational diffusion global analysis example (see Sec-
tion 5.1.1), the quantities of physical interest are not the preexponential factors
and characteristic rotational correlation times, but rather the shape and size
of the molecule and its interactions with the solvent. There is ample theory
which describes the relationships between the size and shape of molecules and
the exponential relaxation times which are observed. An important result
which often arises from theory is the fact that the observed set of rotational
correlation time(s) and preexponential factor(s) are in general not
independent of each other. If the data analysis is performed in space,
then the recovered values for these parameters may mathematically represent
the data very well, but may not have any physical analogue. Abondoning this
type of curve fitting and directly analyzing the rotational data in terms
Global Analysis of Fluorescence Intensity and Anisotropy Decay Data 245
of the physical parameters of the system has been termed “target analysis”
( physical model fitting) by Arcioni and Zannoni(36) and has been rigorously
adopted by the current global analysis programs. See Ref. 37 for a recent
study of anisotropic rotations using a “global-target” analysis of differential
phase/modulation data obtained at multiple temperatures.
In the current global analysis programs, every effort has been put forth
to allow the user to select the physical invariants of the system as fitting
parameters, obviating multistep analyses. In the case of rotational diffusion,
the physical invariants of the system are the lengths of the major axes of the
molecule, the set of angles which describe the orientation of the absorption/
emission oscillators and the principal diffusion axes of the molecule, and the
“slip” or “sticking” boundary conditions. Of course, from a single anisotropy
experiment, one should not expect to recover all the values of the physical
invariants of a system, and, generally, a global analysis proceeds using the
following approach.
Initially, the emission anisotropy may be fit to a series of exponential
terms. From these results, some of the possible physical models for the
molecular shape can be disregarded. For example, if the data fit very well to
only a single rotational correlation time, then there is little reason to perform
an analysis in terms of lengths and multiple diffusion coefficients. However,
upon excitation of the molecule into different absorption oscillators, it may be
found that the single recovered rotational correlation time varies. One should
then consider performing a global analysis of the two data sets in terms of an
internally consistent molecular shape with varying angles describing the
relative orientation of the absorption and emission oscillators and the principal
diffusion axes of the molecule.
Experiments performed at multiple temperatures and viscosities can
further assist in unraveling complex decay. In this case, one needs to utilize
a physical model which will relate how these variables affect rotational
behavior. For instance, one can model rotational diffusion as an activated
process and directly analyze the data in terms of the energy of activation for
rotational diffusion (i.e., the physical invariant of the system).
From this example, a general pattern for performing a global analysis
emerges:
†
To be described in Section 5.7.3.
248 Joseph M. Beechem et al.
There are three basic methods with which one can implement a global
analysis program:
1. Specific case programs: If the number and type of experiments which
are going to be examined are relatively limited, one may wish to
directly hardwire specific global linkages directly into the software.
For instance, combining multiple excitation/emission wavelengths in
terms of an internally consistent set of lifetimes is a specific case, and
several special-purpose routines have been written for these types of
experiments.
2. User subroutines: Nonlinear minimization routines can be written
which only contain the logic needed to perform a global minimization
over multiple experiments, but there is no explicit linking mechanism
within the analysis program. In this case, each particular user (of
the program) would be required to write a subroutine, which would
contain all of the logic required to link the various experiments
together.
3. General-purpose linking algorithms: In a general-purpose linking
nonlinear least-squares algorithm, a mechanism within the program
allows the user to explicitly enter the number and types of linkages
desired over the entire data surface. A general-purpose numerical
algorithm is utilized to calculate the observed fluorescence response;
all derivatives are calculated numerically, so that a wide variety of
complex models can be examined.
Of the above-described methods, no single methodology is advantageous
in all cases. Specific-case algorithms suffer from a lack of flexibility, but are
often very fast and produce relatively compact programs. This type of
methodology should definitely be pursued if only a single type of global
analysis needs to be performed. Programs which rely on external user-written
subroutines are more useful, but incur the added expense of requiring that
the program be recompiled for each new type of analysis. The user must
also supply the necessary linking logic, which usually requires a reasonably
Global Analysis of Fluorescence Intensity and Anisotropy Decay Data 249
of this diagram is also allowed. The excited state species may emit fluorescence
with a functional form depicted as These emitted photons are observed by
a detector through an emission monochromator/filter which operates as a
“gain-discrimination” device, weighting the photon from each particular
species by a factor proportional to the emission spectra of that state. These
gain factors are denoted as The fluorescence actually detected will then
simply be the weighted sum of the individual emissions.
Consider the case of a simple two-state excited state reaction. These
schemes have been extensively utilized to model excited state proton transfer,
exciplex and excimer formation, etc. Historically, these types of systems have
been analyzed utilizing the simple solution to the coupled two-compartment
linear differential equations, which yields
Global Analysis of Fluorescence Intensity and Anisotropy Decay Data 255
Naturally, the individual and are functions of all the rate constants of the
system and the initial boundary conditions. Previously, systems of this type
were first analyzed in terms of individual and as a function of pH,
concentration of species, etc. These results were then analyzed graphically, to
obtain the rate constants of the system (and spectra).
With the systems theory formulation, this sequential type of analysis is
no longer needed. The kinetic system described above is fit directly in terms
of the rate constants and spectra associated with each particular system. For
this simple case, analysis with either methodology may yield comparable
results. However, in more complicated systems (e.g., many more states,
distributed or time-dependent rate constants, etc.), the analytical forms of the
impulse response functions may be very difficult (impossible?) to obtain, and,
hence, analysis using impulse response equations [e.g., Eqs. (5.6) and (5.7)]
is not possible.
The kinetics of systems shown in Figure 5.5 can be written in terms of the
standard systems theory equations as(43, 44)
where is the jth eigenvalue of the matrix T (Eq. 5.8), V is the matrix of
eigenvectors of T (modal matrix), n is the number of excited state species, and
is the emission spectral contour of the kth species at the ith emission
wavelength.
Note that the final amplitude associated with a particular decay rate
(commonly denoted as is actually formed as an inner product of the “s”
256 Joseph M. Beechem et al.
vector and the eigenvector associated with that particular eigenvalue. In the
current global analysis programs, the individual can be actual fitting
parameters. The are referred to as because they represent a
particular chemical’s “species-associated spectrum” (i.e., the emission spectrum
of this species if it could be observed separately). This is in contrast to
(decay-associated spectra), a term which has come to represent the-
set of amplitudes associated with a particular decay time. † In the case of strict
ground state heterogeneity, the eigenvectors are just the normal basis vectors,
and therefore the amplitude associated with a particular lifetime is directly
proportional to its emission spectrum (i.e., for ground state heterogeneity,
DAS = SAS). However, for excited state reactions, the final observed
amplitude is a linear combination of the SAS and each particular eigenvector
(Eq. 5.9) and hence the DAS are not the same as the SAS. The advantage of
being able to fit in an “SAS” space is that, since the represent species
emission spectra (a physical invariant), they can be linked over experiments
where the kinetics of interconversion have been altered (e.g., by changing pH,
concentration of quenchers, etc.) whereas simple amplitude (DAS) terms
cannot.
The “distributed-case” solution to Eq. (5.8) [i.e., where the parameter
vector is distributed according to can be written as
Note that the final represent the “end result” of a complex set of
operations: the inner product of the eigenvectors of and the emission
spectral contours of each state (Eq. 5.9) weighted by the multidimensional
†
Knutson et al.(30) originally defined DAS to encompass both ground state and excited state
systems.
Global Analysis of Fluorescence Intensity and Anisotropy Decay Data 257
probability function Although Eqs. (5.12) and (5.13) are of the correct
mathematical form for fitting fluorescence data, Eqs. (5.10) (convoluted) and
(5.11) are the form actually utilized in the global analysis because it allows
one to “link” specific subsets of the “lumped” parameters between
experiments.
In the case of excited state reactions, the set of all and
are not independent of each other (due to eigenvector–eigenvalue system-
wide relationships) so that these distributed cases are very difficult (if not
impossible) to solve analytically, and a numerical integration procedure must
be performed. However, for cases where there are no excited state interactions
(i.e., the T matrix is diagonal), then the complete set of and
decouple, and one can rewrite Eqs. (5.12) and (5.13) as
The integrals given by Eqs. (5.14) and (5.15) need to be calculated only over
the parameters used to characterize each individual distribution, because they
are not a function of the entire set of constants and distributions describing
the total kinetic system
Equation (5.14) is the definition of the Laplace transform of the
distribution function, Within the context of fluorescence decay experi-
ments, this integral is also similar to what is known in various other fields
as the “moment-generating function” or “characteristic function” of the
distribution. (46) Since there exist large tables of Laplace transforms, moment-
generating functions, and characteristic functions, a wide variety of analytical
forms now exist to analyze time- and frequency-domain data given a specific
analytical form for the distribution function (in addition to standard sums-of-
exponentials models).
An approach which can be taken to analyze fluorescence decay data
is to perform an inverse Laplace transform of the data to directly recover
the probability distribution associated with each lifetime. Many different
“regularized” solutions of the inverse transform have been applied with varying
amounts of success.(47, 48) In general, an entire family of distributions will be
consistent (i.e., have the same value) for any given data set. To choose
among these different distributions, one can analyze for the distribution
consistent with the data that has the largest “entropy.”(49–5l) An important
advantage of these techniques is that no analytical form needs to be assumed
to describe the distribution function itself. However, an important point to
consider is the fact that the intrinsic distribution of the amplitudes as a
258 Joseph M. Beechem et al.
function of lifetime generally does not represent the final parameters of interest
in a global analysis. These methods do not currently take advantage of multiple
axes of overdetermination.
For example, consider the case in which a particular fluorescence sample
is measured as a function of some independent variable and the distribution
of versus versus independent variable is recovered perfectly. Is this result,
in itself, of any use without a particular model which can predict such shapes?
For this reason, regardless of which analysis methodology is employed to
recover a set of versus eventually it is desirable to decompose the
recovered versus in terms of an internally consistent physical model.
It is advantageous not to operate on the versus themselves, but rather
to examine the original data surface in terms of an internally consistent
physical model. The amplitudes and lifetimes should be utilized as a guide to
performing physical modeling.
where N is the number of data points in the entire data surface, is the
standard deviation of the ith data point in the qth experiment, m is the
number of total fitting parameters, is the total number of experiments,
and n(q) is the total number of data points taken in the qth experiment.
Division by the degrees of freedom (N — m—1) does not vary during each fit,
so that minimization of a nonreduced will produce exactly the same result.
The reduction process, however, allows for some preliminary model testing.
Using the definition of and assuming a Gaussian error distribution, one
may assemble F tests (or related criteria) to judge the “quality” of the fit. A
reduced that significantly differs from unity calls into question the validity
of either the model or the weighting estimates In the time domain, one
can use single-photon statistics and show that the variance in each channel
is directly proportional to the number of counts. In the frequency domain,
no simple relationships exist to predict analytically what the error in the
measured phase and modulation should be. Instead, measurements are
repeated and an experimental variance is determined. Improperly weighted
data can have drastic effects on the final recovered parameters, especially for
anisotropy decay.
The mechanics of minimization are many and varied. We have adopted
Global Analysis of Fluorescence Intensity and Anisotropy Decay Data 259
where
In relation to Figure 5.3, there are three structures which must be estab-
lished before the global optimization can be performed:
• The physical model/solution process/experimental conditions matrix
• The linking connectivity matrix
• The data matrix.
260 Joseph M. Beechem et al.
5.4.2. Flow Chart for the LFD Global Analysis Program "Global"
performing a global type of analysis is that one must provide some type of
supporting logic to tell the fitting program which elements of the fitting vector
are appropriate for a particular experiment. What is often implemented in
single-experiment analysis programs is some type of standard “ordering” of
the parameters within the fitting vector so that the program knows what type
of fitting parameter is in each element of the vector. For instance, when fitting
discrete exponentials to a single experiment, the parameters could be entered
into the fitting vector as
The logic required to inform the program which elements of the fitting vector
are appropriate for a particular experiment can be presented using a matrix
of integer values. This matrix ( M ) (mapping matrix) is dimensioned with its
first index equal to the maximum number of experiments which can be
simultaneously analyzed and its second index equal to the maximum number
of fitting parameters which can be local to a single experiment, and a third
index (which is either 1 or 2) describes whether this number points to a
parameter-vector element or a parameter-descriptor element (to be described
shortly). The third index will be set to 1 when pointing to the fitting
parameter array element and 2 when pointing to a parameter descriptor.
When performing a loop over the entire data surface, the current experiment
index counter (nc) is used as the row index of M [M(nc,·,·)]. The numbers
which are then placed along a particular row of M will then be the indices
of the total parameter fitting vector P which are appropriate for the ncth
experiment. For instance, if the ncth experiment utilizes elements 5, 6, 8, and
10 of the main fitting parameter vector, then M(nc, 1, 1) = 5, M(nc, 2, 1) = 6,
M(nc, 3, 1) = 8, M(nc, 4, 1) = 10. Utilizing the logic in the M matrix, there-
fore, allows one to pass to a subroutine only those vales of the fitting
parameter vector which are appropriate for the jth experiment. The local
fitting terms [LT(i)] for the jth experiment can be extracted from the total
surface fitting parameter vector (P) by using the logic:
Global Analysis of Fluorescence Intensity and Anisotropy Decay Data 263
In this way the first level of support logic [item (a) above] has been
accomplished in a completely general way. However, one additional logical
step remains. Now that the elements of the total fitting vector appropriate for
any single experiment within the data surface have been obtained, one must
determine what type of fitting parameter this element represents (e.g., is the
fitting parameter a simple rate or perhaps a length distribution parameter or
an activation energy, etc.). Again, there are many types of logic which can be
implemented to perform this function, but we have found the following
methodology very flexible.
Again, an indexed vector can perform this function. One can define a
vector dimensioned to the maximum number of fitting parameter “descriptors”
which can be used to define the entire data surface [D(np)]. Each element of
this vector (D) (descriptor vector) is a string of ASCII keywords which
uniquely describes the functional characteristics of this fitting parameter.
Obviously, many different types of descriptor vectors could be used. We
currently utilize the above formalism because it assists both in making
the source code of the program easy to read (and hence to modify) and
establishes a “natural-language interface” for describing the fitting functions
desired. For instance, a typical keyword descriptor for a simple transfer rate
between compartment 1 and 2 would appear as D(1) = “rate 21.” If that rate
were distributed as a Gaussian, then D(2) = “rate 21 distributed Gaussian.”
If the transfer between compartments involved an energy transfer process,
then D(3) = “length distributed Gaussian 21.” These descriptor terms would
then be “decoded” by the parser subroutine to map to the following fitting
functions:
This type of methodology has proven very useful and flexible and allows
the establishment of a “vocabulary” of keywords which can continually
expand and grow. Eventually, the program evolves to the extent that it
becomes essentially an “expert system” for the analysis of fluorescence decay
264 Joseph M. Beechem et al.
models. One of the advantages for an indexed list of fitting functions is that
one does not need to expend a large amount of programming effort to incor-
porate new models into the analysis. In addition, since all of the old model
“keywords” are maintained, a very general purpose nonlinear analysis fitting
environment develops, whereby a wide variety of different models and linkage
patterns can be applied to a particular data surface. There is no longer any
need to stop and redevelop the analysis programs each time one desires to
analyze the data using a new model.
3. Particular Forms for the Functions Linking the Compartments Are Chosen
The appropriate form for the function linking the compartments together
is usually decided by choosing that functional form in which the physical
quantity of interest appears explicitly (eliminate multistep analysis). For
instance, if a multiple-temperature study is being performed, instead of a
rate-space type of analysis, one may wish to globally analyze the data directly
in terms of activation energies and frequency factors. In an energy space,
one useful functional form to describe the excited state interconversion from
compartment 1 to 2 would be
If the activation energy linking the two compartments was distributed, one
would have
instance, within the analysis program there are characteristic data matrices
of the form data(x, nc), where nc represents the experimental curve number
index and x follows the number of “channels” of data collected. Data(5, 10)
would therefore represent the fifth data point in the tenth experiment. One
can now set up an index variable (nc) to keep track of the current experiment
which is being examined. This index will therefore be incremented from one
to the total number of experiments which are being combined in the analysis.
c
c Section of LFD global analysis FORTRAN 77 code which
c assembles the normal least-squares equations using the
c mapping matrix to index the various elements.
c
c DEFINITIONS OF TERMS
c
c nlocterms=the number of fitting parameters for the ncth
c experiment
c derphase (i, j)=the derivative of the calculated ith
c phase data with respect to the jth local
c fitting parameter
c dermod (i, j)=the derivative of the calculated ith
c demodulation data with respect to the jth
c local fitting parameter
c residphase (i)=the ith residual in the phase of the ncth
c experiment
c residmod (i)=the ith residual in the modulation of the
c ncth experiment
c errorphase (i, nc)=the variance of the experimental
c measurement of the ith phase in the
c ncth experiment
c errormod (i, nc)=the variance of the experimental
c measurement of the ith modulation in the
c ncth experiment
c varphase=the variance of the ith phase data point
Global Analysis of Fluorescence Intensity and Anisotropy Decay Data 271
As an aside, one should note that Eq. (5.18) is seldom solved by matrix
inversion. More efficient equation solvers are available for such symmetric
systems. We have found that the use of the Cholesky decomposition or
“square-root” method (52) is sufficient for almost all data sets. The most robust
numerical method of performing nonlinear least squares is by by passing the
formation of Eq. (5.18) altogether and using the singular-value decomposition
technique (see, for example, Refs. 53 and 54). However, this technique
requires the formation of a matrix of dimension as compared to
matrices with dimension in Eq. (5.18) (N represents the total number
of data points used in the analysis, and m the total number of fitting
parameters). For many global analyses, the data densities are simply too great
(often as many as a million total data points) for the use of the singular-value
decomposition technique for solving the nonlinear least-squares problem.
However, in cases where the analysis has been performed using both techni-
ques (when applied to typical fluorescence decay data/models), identical
results are usually obtained (J. M. Beechem, unpublished results). It should be
emphasized that it is almost always not the mechanics of the minimization
that causes problems in the data analysis, but rather the limited “information
content” of the experiments.
which fitting space will minimize the number of fitting parameters neccessary
to describe the fluorescence surface and provide numerical values (and
associated error bounds) of the primary parameters of interest?
Consider the global analysis of two pH experiments with four different
emission wavelengths. This data surface can be represented in terms of
impulse response parameters as:
space can result in a transformation of the error surface from a rather ill-
defined flat surface to a very well defined, nearly quadratic error surface.
This type of analysis was applied to the excited state proton transfer of
-naphthol. (24) Time-resolved data were collected at pH 2.15 and 3.0 at 75
emission wavelengths in each case (this number of emission wavelengths is
only necessary if high-resolution species-associated spectra are desired). A
global analysis was now performed by simultaneous analysis of a data surface
consisting of fluorescence versus time versus emission wavelength versus pH.
The total number of fitting parameters are the four rate constants plus two
species-associated spectra (a total of 154 fitting parameters!). It should be
noted that the rate constants can be completely determined using only two
emission wavelengths, so that only six fitting parameters are needed for that
analysis. However, it was desired to attempt to recover the high-resolution
SAS for this system so that all of the emission wavelengths were utilized. The
rate constants recovered from this system are shown in Table 5.2.
One can see that the simultaneous analysis of only two pH values was
sufficient to resolve the rate constants of this system, compared to the
extensive pH study performed previously.(56) The global analysis results, of
course, only strictly pertain to the pH region in which the study was actually
performed.
One reason the 2-naphthol case was reexamined was because the species-
associated spectra can be experimentally determined, and therefore the SAS
recovered from the analysis (using 154 fitting parameters) can be directly
compared with the known spectra. The results of this comparison are shown
in Figure 5.7. The spectrum associated with each emitting state recovered
from the analysis program is plotted along with the experimentally determined
SAS (pH = 0 yields only naphthol, pH = 13 yields only naphtholate). From
these results, it appears that not only all the rate constants for the reaction
can be recovered, but also each individual SAS. Therefore, all 154 fitting
parameters are uniquely recoverable and can be shown to faithfully represent
physical quantities.
For many biochemical systems of interest, it will not be possible to alter
Global Analysis of Fluorescence Intensity and Anisotropy Decay Data 275
276 Joseph M. Beechem et al.
This type of analysis result clearly represents the philosophy behind the
global analysis step: transform a series of empirical fittings into an
internally consistent physical result. The unlinked empirical analyses
represent the best model-independent representation of a given set of experi-
ments. However, the solutions, although mathematically correct, may have
no physical analogue. The global analysis procedures do not compete with
the empirical single-experiment analysis approaches; they simply represent
the logical next step in a multistep process which transforms data into
“biochemical information.”
These expressions reveal that both and are determined by the same
set of fitting functions [i.e., f(t) and r(t)] and hence the same set of fitting
parameters. This statement can be made for any polarized intensity obtained
under any combination of polarization directions in the excitation and the
emission. The intensity corresponding to excitation and emission polarizers at
angles and with respect to the normal to the excitation–emission plane
is given by
From the general expression for the decay of the emission anisotropy
(Eq. 5.37), various specific cases can be calculated. The anisotropy decay
of a general rigid asymmetric body in an isotropic environment is described
by a sum of five exponentials.(66–68) In many cases, these five exponentials can
be approximated by a sum of three exponentials.(69) When approximating
the hydrodynamic shape of a fluorescent system in terms of an ellipsoid of
revolution, one has(28):
where
where and are the angles made by the absorption and emission dipoles
with respect to the symmetry axis and where is the angle between their
projections in the plane perpendicular to the symmetry axis. The corre-
sponding rotational correlation times are given by:
with
where l(t) is the measured excitation function, and denotes the convolution
product.
The sum curve is proportional to the decay of the total intensity and
contains no fitting parameters for the anisotropy. In the sequential analysis,
the total fluorescence decay parameters are determined first from nonlinear
minimization of S(t). The results from the S(t) analysis are used as fixed
parameters in the subsequent analysis of the D(t) curve.
As an alternative to this “sum-and-difference” analysis, and can be
simultaneously analyzed for the parameters of S(t) and r(t) using complete
linking of parameters. (70,71,23,25) This method has certain advantages with
respect to the “sums-and-difference” analysis approach and has been adopted
for all of the current global analysis programs. What will be discussed in
the following sections will be the global analysis of multiple anisotropy
experiments, keeping in mind that each individual experiment requires
simultaneous analysis of a single and measured decay.
The actual fitting programs for both empirical and model-dependent
global analysis can be implemented using the general methodologies described
in detail in Section 5.2. One merely defines an additional set of anisotropy-
specific “keywords.” A description of global anisotropy analysis and essential
program code are also given elsewhere.(23)
In the following sections, we present a series of examples to illustrate how
global analysis has been used to study some complex rotational behavior.
Some of these cases are of general applicability, whereas other cases are
specific to the study of particular biological systems.
Global Analysis of Fluorescence Intensity and Anisotropy Decay Data 283
global analysis simply provides a rigorous way of recovering the most inter-
nally consistent ratio of diffusion coefficients for the molecule
In the case of 9-aminoacridine,(28,23) however, it was impossible to determine from
single-experiment analysis whether this molecule was an anisotropic rotator.
However, with global analysis, one could determine, in a statistically signifi-
cant manner, that this molecule was not an isotropic rotator and recover
Recent work (performed in the frequency domain) suggests that
the rotational behavior of perylene may be even more complicated.(80)
where and
When is large, can take the place of found in previous models.
One may extract and terms as a function of temperature. Trends for
the and terms can be analyzed in a sequential manner or by global
analysis. Examination of these intermediate parameters revealed behavior
not consistent with this model framework. Thus, an alternative framework
was developed which predicts a distribution of melting rates. Jahnig (84) has
discussed the use of a Landau model for packing fluctuations. A model based
on rotation “gated” by such packing fluctuations was proposed. In this model,
288 Joseph M. Beechem et al.
the free energy potential shapes for dipalmitoyllecithin (85) were used by
Davenport et al.(81) to model the emission anisotropy behavior of coronene in
liposomes as a function of temperature. A target parameter space, consisting
of free energy profiles, a gating factor, and a diffusion coefficient, was estab-
lished for the analysis. Simulation of the observed emission anisotropy of
coronene as a function of temperature was performed. This use of Landau
theory provided a method whereby functional “linkages” could be constructed
in a physically meaningful manner over the temperature axis.
The first item is directly related to the identifiability of the model, while the
second addresses error analysis within a model. A description of both the
identifiability problem and the error analysis problem will be discussed in
this section. Although the error discussion will be completely general, the
identifiability study will focus only on nondistributed compartmental models.
made concerning how “closely” the impulse responses resemble one another,
only that they are not identical. From this figure, it is apparent that one does
not need to discuss the concept of identifiability if one only fits each experi-
ment in terms of the parameters of an impulse response function ( and ).
However, with the use of global analysis, problems are often reparameterized
so that the actual fitting parameters are no longer a direct description of the
impulse response function. In these cases, it is important to determine whether
the recovered parameters provide a unique mapping to a set of impulse
response functions.
Consider the following simple two-state excited state reaction with
and allow only
compartment 1 to be populated in the ground state (can be generalized to any
combination of initial ground state populations). The above system is
examined at two different emission wavelengths toward the blue and red edges
of the emission spectrum The impulse
response functions for these two experiments are
[see Eq. (5.8) for definition of terms]. The observed fluorescence can be
written as
where s is the Laplace transform parameter. Solving for the Laplace transform
of the concentration of each individual species yields
(given that the inverse exists). The Laplace transform of the fluorescence can
therefore be written:
Expanding out this equation and substituting the explicit rates into yields
Therefore, from our model (two-state excited state reaction, known input
and output vectors) one can derive only three independent relationships (Eqs.
5.65–5.67), but there exist four independent fitting parameters (the four rate
constants). Therefore, one has determined a priori that a global target analysis
of this particular experiment directly in terms of the rate constants of the
system is not possible. This result is completely independent of the accuracy
of the collected data.
Global Analysis of Fluorescence Intensity and Anisotropy Decay Data 293
One can proceed with this type of approach to examine the identifiability
of the two-state excited state reaction, but in this case consider the combina-
tion of two fluorescence experiments performed by altering one of the rate
constants for interconversion (e.g., changing the pH for excited state proton
transfer). In this case, an additional transfer function is generated, with
altered to where [M] is the concentration of interactant. This
additional transfer function generates a new set of independent equations
(similar to those above). By combining the number of independent equations
from experiments 1 and 2, one can determine that the number of unknowns
in the system is now five. † The number of independent equations generated
from this combination of experiments is also five. From these two experiments,
then, one has a set of five nonlinear equations in five unknowns. Since these
equations are nonlinear, one is not guaranteed that there is a solution;
however, the possibility exists that this system is identifiable. In fact, we know
from the study of the excited state reaction of (see Section 5.5.1)
that this type of system can be uniquely recovered.
More complex models with additional compartments are even less likely
to be identifiable than the two-compartmental model described above, as
they allow for many different connectivities between the compartments. The
number of compartments used in the analysis, of course, cannot exceed the
number of relaxation times which can be resolved by the global analysis.‡
If the dependencies of the rate constants on concentration are unknown, this
will create additional identifiability problems.
The structural identifiability problem is well known in the field of com-
partmental modeling(86,44,87,88) and is still a subject of intensive study. For a
detailed example of the use of identifiability analysis applied to fluorescence
decay data, see the study of Ameloot et al.(27) From this study, no single set
of experimental conditions was found to uniquely determine the system
parameters for all types of two-state excited state reactions. In all cases,
at least two different concentrations of interactants are needed. This is a
necessary (but not a sufficient) condition for identifiability. The set of fluo-
rescence lifetimes alone may, in some cases, resolve all of the rate constants.
In these special cases, neither normalization between experiments nor know-
ledge of the absorption vector is required. In most cases, the information
content of the preexponentials associated with each particular lifetime must be
incorporated. Multi-emission or multi-excitation wavelengths may also be
essential to obtain complete identifiability. Identifiability may also depend on
the particular values of the system parameters.
Fluorescent systems may also be “partially” identifiable. When only a
†
If the effect of [M] on the rate constant is known, then there will be only four unknowns.
‡
Hence, simple linkages over emission wavelength are still useful to identify the dimensionality
of the system.
294 Joseph M. Beechem et al.
which the error analysis is performed depends on only a single factor: the size
of the region of the error surface which is explored.† One can classify the
various error analysis algorithms as follows:
1. Linear approximation: None of the error surface is directly explored.
The information contained in the curvature matrix at the chi-square
minimum is utilized to calculate the uncertainties in the recovered
parameters, assuming no correlation between the various fitting
parameters.
2. Unidimensional search: Directed searches along each parameter axis,
not allowing any of the other fitting parameters to vary.
3. Directed search: Directed searches along the eigenvectors of the
curvature matrix at the minimum.
4. Exhaustive search: Directed searches along each parameter axis,
allowing all other parameters to vary so as to obtain a minimum
chi-square at each point.
Each of these error analysis methods is diagrammatically depicted in
Figure 5.10 (for a two-parameter analysis of “P” and “Q”).
In the linear approximation, the errors on the recovered parameters are
estimated utilizing the square root of the diagonal elements of the inverse of
the curvature matrix see Eq. (5.18)] at the chi-square
minimum. This type of error analysis is strictly valid only for linear models
and assumes that there is no correlation between the individual fitting
parameters. This methodology always produces a symmetric error result
and should always be regarded with a large amount of skepticism. These error
estimates will always predict the smallest amount of uncertainty (see shaded
region in Figures 5.10 and 5.11).
The unidimensional search algorithm simply calculates the observed chi-
square by sequentially altering the recovered fitting parameters along each
independent axis. None of the other fitting parameters are allowed to vary
during this process (no correlations between the parameters are taken into
account). The region of the error surface which is examined by this approach
is schematically represented as the lines A–B and C–D in Figure 5.10.
The error bars recovered from this analysis may be asymmetric. The major
limitation in this error analysis technique is that no correlations between the
fitting parameters are allowed.
In the directed search algorithms, an attempt is made to take into account
some of the correlation between the fitting parameters. By calculating the
†
Of course, knowledge of the chi-square surface over the entire domain of the fitting space would
provide all of the information necessary to establish the error bounds and identifiability.
However, this type of error analysis is usually not possible (too computationally intensive), and
one needs to resort to examination of the chi-square surface along particular axes.
296 Joseph M. Beechem et al.
Global Analysis of Fluorescence Intensity and Anisotropy Decay Data 297
5.8. Conclusions
One of the chief “complaints” concerning the use of global analysis is that
it is model dependent. This is certainly true, and application of a physical
model which is inappropriate for a given data type will certainly yield
questionable results. However, as far as the analysis is concerned, the global
application of a particular model over a multidimensional data surface is as
Global Analysis of Fluorescence Intensity and Anisotropy Decay Data 299
data types. By specializing in particular data types, one may obtain both a
reduction in computational time and, in some cases, additional information
which might not be immediately apparent when performing a global analysis.
The global analysis methodology, as described in this chapter, does not have
any qualifiers attached to the types of data which can be combined or the
models which can be applied (other than those specific to all nonlinear least-
squares analyses). As such, it is a general-purpose analysis methodology,
which can be applied to a wide variety of fluorescence data. These features
make global analysis, with the “target” approach, an ideal vehicle for testing
models and recovering fitting parameters. An important advantage of the
numerical procedures discussed here is that the original data are tested
directly against the final model of interest. Multistep analysis is completely
eliminated.
Do the global analysis procedures described here provide a “black box”
that can always be used to obtain system parameters of interest? Certainly
this is not the case. A reasonably good global fit over a large numbr of
different experimental data sets may obscure a poor “local” fit for a particular
experiment(s). The regions where a global fit is not satisfactory can provide as
much useful information as those areas in which the fits are very good. These
regions should not be overlooked. Once trivial instrumental artifacts are
eliminated as a source of the mismatch, the globally applied models should be
reevaluated. The ability to disprove a particular model is often as important
as the determination of the recovered fitting parameters.
The exploration and development of new physical models (and theories)
which will allow additional sets of overdetermination axes to be used in a
global analysis is an important direction for future application. Determining
the “information content” of these high-dimension data surfaces using the
identifiability approach may help in establishing which sets of experimental
axes are worth investigating. The application of the widest possible variety of
physical models to the highest dimensional data set that can be obtained still
provides the experimenter with the best overall chance of understanding the
observed fluorescent system.
Acknowledgments
Much of the inspiration and many of the ideas in this work evolved
through interactions with scientists who have spent time in (or around) the
laboratories of Ludwig Brand and Enrico Gratton. These scientists include
(from the Brand lab) Drs. D. W. Walbridge, R. P. DeToma, G. Ackers,
B. Turner, L. Davenport, R. Dale, M. Barkley, A. Kowalczyk, J. B. A. Ross,
W. W. Laws, M. K. Han, and M. Pritt and (from the Gratton lab) Drs. R.
Global Analysis of Fluorescence Intensity and Anisotropy Decay Data 301
References
39. Matrix Eigensystem Routines, Eispack Guide, Lecture Notes in Computer Science, Vol. 6,
Springer, Berlin (1976).
40. D. W. Marquardt, An algorithm for least squares estimation of nonlinear parameters, Society
for Industrial and Applied Mathematics 11, 431–441 (1963).
41. K. Levenberg, A method for the solution of certain nonlinear problems in least squares,
Quart. Appl. Math. 2, 164–168 (1944).
42. J. R. Knutson, Global analysis of fluorescence data: Some extensions, Biophys. J. 51, 285a
(1987).
43. C. Chen, Linear System Theory and Design, Holt, Rinehart & Winston, New York (1984).
44. J. A. Jacquez, Compartmental Analysis in Biology and Medicine, Elsevier, Amsterdam (1984).
45. J. E. Löfroth, Time-resolved emission spectra, decay-associated spectra, and species-associated
spectra, J. Phys. Chem. 90, 1160–1168 (1986).
46. W. C. Giffin, Transform Techniques for Probability Modeling, Academic Press, New York
(1975).
47. S. W. Provencher, A constrained regularization method for inverting data represented by
linear algebraic or integral equations, Comp. Phys. Commun. 27, 213–227 (1982).
48. S. W. Provencher, CONTIN: A general purpose constrained regularization program for
inverting noisy linear algebraic and integral equations, Comp. Phys. Commun. 27, 229–242
(1982).
49. J. Skilling and R. K. Bryan, Maximum entropy image reconstruction, general algorithm,
Mon. Not. R. Astron. Soc. 211, 111-124 (1984).
50. A. K. Livesey, M. Delaye, P. Licinio, and J. C. Brochon, Maximum entropy analysis of
dynamic parameters via the Laplace transform, Faraday Discuss. Chem. Soc. 83, 1–12 (1987).
51. A. K. Livesey and J. C. Brochon, Analyzing the distribution of decay constants in pulse-
fluorimetry using the maximum entropy method, Biophys. J. 52, 693-706 (1987).
52. D. K. Faddeev and V. N. Faddeeva, Computational Methods of Linear Algebra, W. H.
Freeman, San Francisco (1963).
53. G. H. Golub and C. F. Van Loan, Matrix Computations, The Johns Hopkins University
Press, Baltimore (1983).
54. P. E. Gill, W. Murray, and M. H. Wright, Practical Optimization, Academic Press, New York
(1981).
55. M. Ameloot, J. M. Beechem, and L. Brand, Compartmental modeling of excited-state
reactions: Identifiability of the rate constants from fluorescence decay curves, Chem. Phys.
Lett. 129, 211–219(1986).
56. W. R. Laws and L. Brand, Analysis of two state excited state reactions. The fluorescence
decay of 2-naphthol, J. Phys. Chem. 83, 795–802 (1979).
57. L. Davenport, J. R. Knutson, and L. Brand, Excited-state proton transfer of equilinin and
dihydroequilinin: Interaction with bilayer vesicles, Biochemistry 25, 1186–1195 (1986).
58. E. Haas, M. Wilchek, E. Katchalski-Katzir, and I. Z. Steinberg, Distribution of end-to-end
distances of oligopeptides in solution as estimated by energy transfer, Proc. Natl. Acad. Sci.
U.S.A. 72, 1807-1811 (1975).
59. E. Haas and I. Z. Steinberg, Intramolecular dynamics of chain molecules monitored by
fluctuations in efficiency of excitation energy transfer, Biophys. J. 46, 429–437 (1984).
60. A. Grinvald, E. Haas, and I. Z. Steinberg, Evaluation of the distribution of distances between
energy donors and acceptors by fluorescence decay, Proc. Natl. Acad. Sci. U.S.A. 69,
2273–2277 (1972).
61. J. R. Lakowicz, M. L. Johnson, W. Wiczk, A. Bhat, and R. F. Steiner, Resolution of a
distribution of distances by fluorescence energy transfer and frequency-domain fluorometry,
Chem. Phys. Lett. 138, 587–593 (1987).
62. H. C. Cheung, C. Wang, I. Gryczynski, M. L. Johnson, and J. R. Lakowicz, Distribution of
distances in native and denatured troponin I, from frequency-domain measurements of
304 Joseph M. Beechem et al.
heterogeneity, in: Subcellular Biochemistry (J. R. Harris, ed.), Vol. 14, Plenum, New York
(1988).
83. J. R. Knutson and J. R. Lakowicz, Studies on the correlation between fluorophore rotation
and solvent relaxation in bilayers, Biophys. J. 36, 80a (1980).
84. F. Jahnig, Critical effects from lipid-protein interaction in membranes I., Biophys. J. 36,
329–345 (1981).
85. S. Mitaku, T. Jippo, and R. Kataoka, Thermodynamic properties of the lipid bilayer trans-
ition: Pseudocritical phenomena, Biophys. J. 42, 137–144 (1983).
86. G. L. Atkins, Multicompartment Models in Biological Systems, Methuen, London (1969).
87. D. H. Anderson, Compartmental Modeling and Tracer Kinetics, Lecture Notes in Bio-
mathematics, Vol. 50, Springer, Berlin (1983).
88. K. Godfrey, Compartmental Models and Their Application, Academic Press, New York
(1983).
89. H. van Langen, Y. K. Levine, M. Ameloot, and H. Pottel, Ambiguities in the interpretation
of time-resolved fluorescence anisotropy measurements on lipid vesicle systems, Chem. Phys.
Lett. 140, 394 (1987).
90. J. Eisenfeld, A simple solution to the compartmental structural-identifiability problem, Math.
Biosci. 79, 209–220 (1986).
91. J. A. Jacquez and P. Grief, Numerical parameter identifiability and estimability: Integrating
identifiability, estimability and optimal sampling design, Math. Biosci. 77, 201–227 (1985).
92. W. H. Press, B. Flannery, S. Teukolsky, and W. T. Vetterling, Numerical Recipes: The Art of
Scientific Programming, Cambridge University Press, Cambridge (1986).
93. R. T. Ross, C. Lee, and S. Leurgans, Multilinear analysis of biomolecular fluorescence,
Biophys, J. 55, 191a (1989).
This page intentionally left blank.
6
6.1. Overview
307
308 Thomas P. Burghardt and Katalin Ajtai
the molecular frames have the unique polar angle and are randomly
distributed in the other Euler angles and The normalized angular
probability density for such a system is
electron spin resonance) in the summation of Eq. (6.1). The plots demonstrate
both the best estimate obtainable by fluorescence polarization when describing
the probability density of Eq. (6.3) and the relationship between angular
resolution and order parameter rank.
In the next section we derive the relationship between the order
parameters and the measurable fluorescence polarization signal.
Substituting Eqs. (6.1) and (6.5) into Eq. (6.6) and using Eq. (6.2), we find
where c is a constant, and are the unit electric dipole moments of the
fluorophore for absorption and emission, respectively, E is the electric field
vector of the excitation light, and v is the unit vector of polarization of the
emitted light that is observed (using a polarizer).
312 Thomas P. Burghardt and Katalin Ajtai
The are calculated using Eqs. (6.2), (6.5), and (6.8) and then
substituted into Eq. (6.7), and the summation is carried out. This calculation
of F, using Eq. (6.7), is straightforward but tedious and has been done for the
general case, including corrections for high-aperture optics.(8) The reader
should consult Ref. 8 for a slightly more detailed description of the calcula-
tion. An important and general result for fluorescence polarization, however,
is derived in this calculation where it is shown that the summation on j
for is limited to This result holds generally for fluorescence
polarization in homogeneous space and, as discussed in Section 6.2.1, sets
a theoretical upper limit on the angular resolution obtainable from the
technique.
We performed the summation of Eq. (6.7) for specific applications using
a symbolic manipulation computer program (SMP, Inference Corp.,
Pasadena, California). This latter procedure is more convenient for all but the
most simplified applications for the analysis of fluorecence polarization data.
Below we apply this formalism to the study of
ethylenediamine (l,5-IAEDANS)-labeled myosin cross-bridges in
muscle fibers.
To solve Eqs. (6.18)–(6.20) for the unknown order parameters, we must also
know the optical correction factors and and the emission dipole
polar angle, The correction factors are computed from formulas derived
previously (see Eq. 20 in Ref. 9) and are and
(for glycerol immersion quartz objective with a numerical aperture
of 1.25 and quartz interfaces). The angle between the absorption and emission
dipoles was estimated by measuring from 1,5-IAEDANS-labeled S-l in
buffer + 50% glycerol (1:1 buffer to glycerol by volume, pH 7.0) solution
cooled to –20°C and solving Eq. (6.9) with We
found for the excitation wavelength of 364 nm and filter block cutoff
wavelength at 420 nm that
The restrictions on the order parameters for muscle fibers, shown pre-
viously to require with j = 0, 2, and 4 and n = 0, cause
Eqs. (6.18)–(6.20) to depend only on the four parameters
and Because there are four unknowns with three
equations to constrain them, we must supply one new constraint. We show in
Section 6.2.4 that we can obtain the additional constraint from the model-
independent analysis of ESR spectra from muscle fibers.(2)
Fluorescence polarization experiments on 1,5-IAEDANS-labeled S-l
bound to muscle fibers were performed on two states of the muscle fiber. One
state, called MgADP, occurs when the fiber is bathed in buffer containing
MgADP. In this state both the nucleotide and actin are bound to S-l. The
second state, rigor, occurs when the nucleotide is removed. With the help of
the ESR data, as described in Section 6.2.4, we find for rigor fibers:
Fluorescence Polarization from Oriented Systems 317
Plots of for fibers in rigor and in the presence of MgADP are shown in
Figure 6.4. These plots of the probe orientation probability density indicate
that the probe rotates upon the binding of MgADP. To ascertain if this result
also indicates that S-l rotates upon MgADP binding, we must undertake
control experiments to determine whether or not the probe remains rigidly
fixed in the S-l under these conditions. Generally, two time-resolved fluores-
cence experiments are helpful as controls. First, the probe lifetime is measured
under appropriate conditions since the lifetime is often environment sensitive
and may change if the probe rotates locally.(11, 12) Second, if possible, the time-
resolved fluorescence depolarization from the labeled, isolated protein element
from the ordered ensemble (S-l in this case) tumbling by Brownian motion
in solution is measured. This measurement senses motion of the probe or
a domain of the protein containing the probe, if the motion is a rotation
that changes the probe orientation relative to the hydrodynamic frame of
the protein element.(13, 14) In this experiment (in agreement with previous
318 Thomas P. Burghardt and Katalin Ajtai
studies (15, 16 ) ) both controls indicate that the 1,5-IAEDANS remained fixed in
the hydrodynamic frame of S-l in the presence and absence of MgADP,
suggesting that the probe rotation observed with fluorescence polarization
was due to S-1 rotation.
MgADP and for fibers in rigor. (2) For a random distribution of probes
at any These data indicate that the probe angular probability
densities for these two states are unmistakably different since the addition of
a random probability density to either probe density will not transform one
into the other. As described in Section 6.2.2.1, the controls indicate that the
1,5-IAEDANS remained fixed in the hydrodynamic frame of S-l in the
presence and absence of MgADP, suggesting that the probe rotation observed
was due to S-1 rotation.
where G(t) contains the time dependence of the signal due to the fluorescence
lifetime of the probe. The explicit computation of F(t) using Eq. (6.34) was
described in detail previously.(31) This calculation is performed using
Eqs. (6.2) and (6.32) and the expression for in terms of Wigner
functions that has been derived in several papers (e.g., Ref. 14). We find
322 Thomas P. Burghardt and Katalin Ajtai
and
where and are spherical unit vectors along the x, y, and z axes in
the laboratory frame. T is the time development operator relating and
through the linear operation such that
where
and
where is the shape of the lamp pulse. Using Eq. (6.43), we find
or
where
expanded. Unlike the of Eq. (6.41), the are not orthogonal. The
departure of the from orthogonality depends on the shape of the lamp pulse
such that when the pulse is infinitesimal in duration, that is, it is a function
in time, Using a general numerical method described for an ESR
application, we calculate the coefficients from F(t). The procedure is
described briefly below.
We define a projection operator by the equation
Then,
where
Higher order matrix elements are calculated from Eq. (6.65) using the com-
pleteness of the Wigner functions on the interval . Completeness requires
where is a Dirac function. With Eq. (6.67) we can show that the
second-order matrix element, is
axes other than the fiber axis. In the horizontal configuration the fiber is
rotated by 90° so that the excitation light polarization is perpendicular to the
fiber axis, and its light propagation vector makes an angle of 45° with the
fiber axis (see Figure 6.5). Depolarization curves from the horizontal fibers are
sensitive to probe motion about axes parallel to the fiber axis. By requiring
our model of Brownian motion in an angular potential to account for the
relaxation rates of the cross-bridges in both fiber configurations, we acquire
sufficient constraints on the free parameters to estimate the rank 6 order
parameters of N.
With the available steady-state fluorescence polarization data,(40) we
showed that in the vertical fiber configuration, N is described by the order
parameters
For the horizontal fiber configuration we can readily compute the new
horizontal order parameters, denoted by from the vertical order
parameters, using the relation
A similar relation holds for in Eq. (6.66). With the rank order
parameters given as known quantities, we estimate the rank 6 order
parameters using the model.
An interesting new feature of the proposed polar steady-state angular
probability density computed using Eq. (6.29) is introduced by the addition
of the rank 6 order parameters. Shown in Figure 6.7 is a comparison of proba-
bility densities with and without the inclusion of the rank 6 parameters. The
higher resolution plot has a bimodal feature not observed at lower resolution.
If we can neglect the width of the lamp pulse, then the observed signal is given
by Eq. (6.71). F(t) is Laplace transformed to give the quantity such that
so that
(see, for example, Ref. 33). We then solve Eqs. (6.18)–(6.20) for
and (fluorescence) and compare these values with the values of
and (fluorescence) computed directly from Eq. (6.76) and (ESR).
On the basis of this comparison, an improved estimate of is made, and
we repeat the above steps until the difference between the fluorescence order
parameters computed by the two methods is minimized. By this method, infor-
mation from ESR can augment fluorescence data to provide a more complete
description of the fluorescent probe angular probability density.
ester) derivatives. Examples of these probes are drawn in Figure 6.8, where it
is shown that tetramethylrhodamine can be used as both a sulfhydryl- and an
amino-selective probe.
In most cases, the amino acid side chains of the protein that are involved
in the molecular aspects of biological activity have altered reactivities due to
their irregular values. This gives one the possibility to introduce labels
exclusively at these points. By lowering the pH of the reaction medium (to
within pH 6–7) far below the of the majority of the lysine groups
the ionized form of these groups becomes unfavorable for
covalent reactions while cysteinyls, especially those having altered values,
can still react. The sulfhydryl (Cys-707) of myosin is an example of this
phenomenon. reacts specifically with iodoacetamide derivatives of dif-
ferent fluorophores near pH 7 under mild conditions. (45,46) The remaining SH
groups available to the dye react slowly. The reactivity of is so enhanced
that it can serve as a single labeling point in a whole muscle fiber system. (47,48)
Using sulfhydryl-specific labels, we can also change the possible target side
chain by prereacting the system with certain reversible SH-blocking com-
pounds. The reactive thiols are reversibly blocked, and the chemically less
334 Thomas P. Burghardt and Katalin Ajtai
reactive groups are exposed to the covalent by binding probe. After the
covalent binding procedure is completed, the blocking is reversed, restoring
reactive thiols to their native state.(46,49)
The chemical characteristics of the probes themselves also affect the
binding of the label to the target molecule. Rings with apolar character react
preferentially with lipids, membranes, or protein side chains in hydrophobic
crevices. Polar probes label groups on the more hydrophilic surfaces of the
protein.
We see that the reaction of a specific fluorescent probe with a protein
depends on the nature of the reactive group attached to the chromophore
and on the interaction between the dye and the environment of the side
chain to be labeled. The three-dimensional fit of an optical probe also could
be influenced by other factors. The best fit of the chromophore ring on the
protein surface could vary for different stereoisomers of the label, leading to
nonhomogeneous labeling. This would be detected by nonhomogeneity in the
fluorescence lifetime and would cause difficulties in the interpretation of
the polarized fluorescence. The length of the spacer group from the dye to the
reactive group also is an important factor with regard to the specificity and
rigidity of the probe. With short spacers, dyes are introduced at the protein
surface while longer side chains allow the probes to penetrate into proteins
and membranes. This latter property is facilitated by an apolar spacer group.
An example of this is provided by the derivatives of pyrene shown in
Figure 6.9. An increasing number of apolar groups allows the label to
modify points deep inside a folded macromolecule.
The affinity probes are a rather heterogeneous group varying from the
fluorescent lanthanides (which substitute for calcium in calcium-binding
proteins such as actin (52) and calmodulin) to fluorescent derivatives of
nucleotides, coenzymes, enzyme substrate inhibitors, and other proteins.
Fluorescent derivatives of adenine nucleotides were used for orientation
studies in muscle fibers,(53) as was rhodamine-labeled phalloidin, a molecule
that binds very tightly to actin.(54,55) Fluorescent-labeled antibodies provide
a whole class of probes that can be targeted to a particular sequence of
polypeptides with three-dimensional structure.(56)
of the dye was incorporated into the heavy polypeptide chain of myosin while
18 % of the total fluorescence is associated with the troponin I protein.
The localization of the label in the primary sequence of the myosin is
carried out on myosin purified from labeled muscle fibers. The labeled protein
is proteolytically cut to yield S-l, and further tryptic digestion of S-l yields
three peptides of 20, 50, and 27 kilodaltons (kDa). Following the fluorescence
on the gel as shown in Figure 6.12 allows us to identify the labeled peptide as
the 20-kDa fragment, which contains the two fast-reacting sulfhydryls
(57)
and More specialized methods are required to biochemically separate
and These methods are described elsewhere.(58)
6.3. Discussion
Acknowledgment
The writing of this chapter was supported by a grant from the Mayo
Foundation.
340 Thomas P. Burghardt and Katalin Ajtai
References
24. A. Szabo, Theory of polarized fluorescent emission in uniaxial liquid crystals, J. Chem. Phys.
72, 4620–4626 (1980).
25. A. Szabo, Theory of fluorescence depolarization in macromolecules and membranes, J. Chem,
Phys. 81, 150–167 (1984).
26. C. Zannoni, A. Arcioni, and P. Cavatorta, Fluorescence depolarization in liquid crystals and
membrane bilayers, Chem. Phys. Lipids 32, 179–250 (1983).
27. T. P. Burghardt, Time-resolved fluorescence polarization from ordered biological assemblies,
Biophys. J. 48, 623–631 (1985).
28. W. L. Hubbell and H. M. McConnell, Orientation and motion of amphiphilic spin labels in
membranes, Proc. Natl. Acad. Sci. U.S.A. 64, 20–27 (1969).
29. S. A. Goldman, G. W. Bruno, C. F. Polnaszek, and J. H. Freed, An ESR study of anisotropic
rotational reorientation and slow tumbling in liquid and frozen media, J. Chem. Phys. 56,
716–735 (1972).
30. C. F. Polnaszek and J. Freed, Electron spin resonance studies of anisotropic ordering, spin
relaxation, and slow tumbling in liquid crystalline solvents, J. Phys. Chem. 79, 2283–2306
(1975).
31. T. P. Burghardt and K. Ajtai, Model-independent time-resolved fluorescence depolarization
from ordered biological assemblies applied to restricted motion of myosin cross-bridges in
muscle fibers, Biochemistry 25, 3469–3478 (1986).
32. M. Abramowitz and I. E. Stegun (eds.) Handbook of Mathematical Functions, National
Bureau of Standards, Washington, D.C. (1970).
33. T. P. Burghardt and N. L. Thompson, Model-independent electron spin resonance for
measuring order of immobile components in a biological assembly, Biophys. J. 48, 401–409
(1985).
34. B. Brenner, M. Schoenberg, J. M. Chalovich, L. E. Green, and E. Eisenberg, Evidence for
cross-bridge attachment in relaxed muscle at low ionic strength, Proc. Natl. Acad. Sci. U.S.A.
79, 7288–7291 (1982).
35. B. Brenner, J. M. Chalovich, L. E. Green, E. Eisenberg, and M. Schoenberg, Stiffness of
skinned rabbit psoas fibers in MgATP and MgPP i solution, Biophys. J. 50, 685–691 (1986).
36. J. Borejdo and S. Putnam, Polarization of fluorescence from single skinned glycerinated
rabbit psoas fibers in rigor and relaxation, Biochim. Biophys. Acta 459, 578–595 (1977).
37. J. Duke, R. Takashi, K. Ue, and M. F. Morales, Reciprocal reactivities of specific thiols when
actin binds to myosin, Proc. Natl. Acad. Sci. U.S.A. 73, 302–306 (1976).
38. T. Nihei, R. A. Mendelson, and J. Botts, The site of force generation in muscle contraction
as deduced from fluorescence polarization studies, Proc. Natl. Acad. Sci. U.S.A. 71, 274–277
(1974).
39. R. A. Mendelson, M. F. Morales, and J. Botts, Segmental fiexibility of the S-1 moeity of
myosin, Biochemistry 12, 2250–2255 (1973).
40. M. G. A. Wilson and R. A. Mendelson, A comparison of order and orientation of cross-
bridges in rigor and relaxed muscle fibres using fluorescence polarization, J. Muscle Res. Cell
Motil. 4, 671–693 (1983).
41. K. Ajtai, A. R. French and T. P. Burghardt, Myosin cross-bridge orientation in rigor and in
the presence of nucleotide studied by electron spin resonance, Biophys. J. 56, 535–542 (1989).
42. R. Hentschel, J. Schlitter, H. Sillescu, and H. W. Speiss, Orientational distributions in
partially ordered solids as determined from NMR and ESR line shapes, J. Chem. Phys. 68,
56–66 (1978).
43. R. Friesner, J. A. Nairn, and K. Sauer, Direct calculation of the orientational distribution
function of partially ordered ensembles from the EPR line shape, J. Chem. Phys. 71, 358–365
(1979).
44. R. P. Haugland, Handbook of Fluorescent Probes and Research Chemicals, Molecular Probes,
Inc., Eugene, Oregon (1985).
342 Thomas P. Burghardt and Katalin Ajtai
67. T. P. Burghardt and D. Axelrod, Total internal reflection fluorescence study of energy transfer
in surface-adsorbed and dissolved bovine serum albumin, Biochemistry 22, 979–985 (1983).
68. T. P. Burghardt and D. Axelrod, Total internal reflection/fluorescence photobleaching
recovery study of serum albumin adsorption dynamics, Biophys. J. 33, 455–468 (1981).
69. N. L. Thompson, T. P. Burghardt, and D. Axelrod, Measuring surface dynamics of
biomolecules by total internal reflection fluorescence photobleaching recovery or correlation
spectroscopy, Biophys. J. 33, 435–454 (1981).
70. N. L. Thompson, H. M. McConnell, and T. P. Burghardt, Order in supported phospholipid
monolayers detected by the dichroism of fluorescence excited with polarized evanescent
illumination, Biophys. J. 46, 739–747 (1984).
71. W. Lukosz and R. E. Kunz, Light emission by magnetic and electric dipoles close to a plane
interface. I. Total radiated power, J. Opt. Soc. Am. 67, 1607–1614 (1977).
72. W. Lukosz and R. E. Kunz, Light emission by magnetic and electric dipoles close to a plane
dielectric interface. II. Radiation patterns of perpendicular oriented dipoles, J. Opt. Soc. Am.
67, 1615–1619 (1977).
73. C. K. Carniglia, L. Mandel, and K. H. Drexhage, Absorption and emission of evanescent
photons, J. Opt. Soc. Am. 62, 479–486 (1972).
74. C. K. Carniglia and L. Mandel, Quantization of evanescent electromagnetic waves, Phys. Rev.
D 3, 280–296 (1971).
75. E. H. Hellen and D. Axelrod, Fluorescence emission at dielectric and metal film interfaces,
J. Opt. Soc. Am., B4, 337–350 (1987).
76. D. Axelrod, R. M. Fulbright, and E. H. Hellen, Adsorption kinetics on biological membranes:
Measurement by total internal reflection fluorescence, in: Application of Fluorescence in the
Biological Sciences (L. Taylor, A. S. Waggoner, F. Lanni, R. F. Murphy and R. Birge eds.),
pp. 461–476, Alan R. Liss, New York (1986).
77. N. L. Thompson and T. P. Burghardt, Total internal reflection fluorescence: Measurement of
spatial and orientational distributions of fluorophores near planar dielectric interfaces,
Biophys. Chem. 25, 91–97 (1986).
78. J. D. Jackson, Classical Electrodynamics, John Wiley & Sons, New York (1975).
79. P. M. Rentzepis, Advances in picosecond spectroscopy, Science 218, 1183–1189 (1982).
80. T. M. Nordlund and D. A. Podolski, Streak camera measurement of tryptophan and
rhodamine motions with picosecond time resolution, Photochem. Photobiol. 38, 665–669
(1983).
This page intentionally left blank.
7
Fluorescence- Based
Fiber-Optic Sensors
Richard B. Thompson
7.1. Introduction
345
346 Richard B. Thompson
their ability to propagate particular modes of light; that is, the optical power
in the waveguide will have particular spatial, temporal, and polarization
properties, which are determined by the geometry of the waveguide in a
fashion analogous to modes in a laser cavity or microwave waveguide. For
a few ideal cases, the mode distribution can be calculated exactly,(6) but in
many cases the modes a fiber will propagate are determined empirically. The
reader is referred to recent texts(6–8) for details, but some important concepts
are described below.
Some fibers are constructed with thin cores and a very small
difference in refractive index between the core and cladding, so as to conduct
only a single mode (the mode). These singlemode fibers have a number
of characteristics potentially important in fluorescence sensors. They have the
lowest possible temporal dispersion; that is, pulse spreading is minimized,
increasing the effective bandwidth (useful for telecommunications) and maxi-
mizing the resolution of time-resolved techniques (see below). Because they
are singlemode, such fibers are insensitive to influences such as temperature
or bending, which change modal distribution, and therefore propagated
intensity. Some fiber-optic properties, such as polarization preservation, are
only available in singlemode fibers.
By comparison, fibers with larger cores and/or larger refractive index
differences are termed multimode since they will propagate several modes at
once. The physical properties which determine how many modes a fiber will
accommodate are expressed in the waveguide parameter, or V-number:
348 Richard B. Thompson
where is the wavelength, is the core diameter, and and are the core
and cladding indices, respectively; a V-number of 2.405 or less ensures that
the fiber is singlemode. The waveguide parameter is wavelength-dependent,
and thus a fiber that is singlemode in the infrared will typically carry a few
modes at visible wavelengths. Among the advantages of multimode fibers is
the ease of coupling light into the fiber, since they have both larger cores and
(because of the larger difference in refractive indices) higher numerical aper-
tures (see Figure 7.2). While singlemode fibers have both core and cladding
made of glass or fused silica, plastic-clad fused-silica (multimode) fibers are
available; the plastic cladding is easy to remove, permitting access to the
core (see below). Multimode fibers with large cores are also better suited to
propagating high-powered laser (excitation) beams; a multiwatt argon laser
beam might create some interesting but unproductive effects if focused to a
micron-sized spot on the end of a singlemode fiber.
Another important phenomenon not described by Snell’s law is the
evanescent wave. When total internal reflection takes place at an interface, the
electric field strength of the incident light beam does not drop abruptly to
zero at the interface; rather, it decreases exponentially from the interface into
the lower refractive index medium over a distance of a few hundred
nanometers. The electric field present in the lower refractive index medium
can be absorbed by fluorophores near the interface and thus permits surface-
specific fluorescence spectroscopy. This phenomenon has been extensively
exploited by Harrick (9, 10) and Axelrod and Burghardt.(11, 12) Others (13, 14) have
made use of the surface specificity to devise immunoassays. As a first
approximation for multimode waveguides, the depth of penetration of the
excitation into the surrounding medium (if the cladding has been removed) is
a function of wavelength, angle of incidence and refractive index ratio of the
media (where ):
The polarization of the incident light also is a factor, and for waveguides
propagating only a few modes, the above equation is inexact. As a rule
of thumb, however, the fewer modes a fiber optic carries, the greater the
proportion of power propagated in the cladding. The evanescent wave is
important because it represents an alternate means of coupling power out of
the fiber to excite fluorescence, in addition to the distal tip.
From the standpoint of performing fluorescence experiments, perhaps the
most important property of the fiber is its attenuation at the wavelengths of
interest. Remember that in a fiber optic, one is essentially looking through a
window of glass several meters thick; the achievement of ultralow-loss fiber
optics is all the more impressive considering that exceptionally clear water is
Fluorescence-Based Fiber-Optic Sensors 349
optic sensors are well known: fluorescence is intrinsically more sensitive than
absorbance due to the Stokes’ shift and is more flexible in that a great variety
of analytes and influences are known to change the emission of particular
fluorophores. The operating principles (observables) used in several sensors
will be discussed, followed by design considerations of sensing tips, fibers
themselves, optical components of the system, and light sources.
The essence of fluorescence analysis is to understand and exploit changes
in fluorescence. These changes may be in the intensity, lifetime, color,
and/or polarization of the emission. Indeed, any interpretable observation
of fluorescence might be made the basis of a sensor. Thus, temperature
sensors available commercially are based on changes in relative intensity of
emission lines of europium: gadolinium oxysulfide or changes in lifetime of a
phosphor.(15) The former illustrates an important point, namely, that simply
measuring changes in intensity is unsatisfactory for many applications. This is
because interfering factors such as microbending, temperature, source fluctua-
tion, fluorophore degradation, and detector aging affect accuracy. Moreover,
recalibrating a sensor that may be in a remote or hostile environment could
prove difficult. Thus, several pioneering sensors that measured changes in
fluorescence intensity due to oxygen quenching, (16) binding of metal ions, (17)
or pH changes(18) have yet to come into wide use. It is unfortunate that many
of the analytical applications of fluorescence(19–21) are based strictly on
intensity changes and thus are less useful than they might be. By comparison,
spectral shifts can be measured ratiometrically, thus minimizing some of
the above problems. Use of external standards or monitoring scattered
excitation in a “double-beam” arrangement can also improve accuracy.(22, 23)
Fluorescence lifetimes(15, 24, 25) are also largely intensity-independent and there-
fore are increasingly used.
How exciting light is coupled into and fluorescence collected from the
fiber is also an important design question. The simplest solution is to employ
different fibers for guiding excitation to and fluorescence from the sensing tip.
This has the advantage of simplicity in design and permits optimizing the fiber
for each wavelength. However, it also has drawbacks: the receiving fiber easily
collects scattered excitation, it precludes the use of the waveguide binding
configuration, and it multiplies the engineering problems of the distal cuvette
configuration. These last include the difficulty of registration: exciting the
same volume of sample that the emission fiber is capable of viewing, and
attaching and sealing the sensing tip. Thus, this geometry is less favored by
current workers in the field.
By contrast, use of a single fiber to guide excitation and emission
simultaneously avoids these problems and makes use of the fiber’s natural
advantages. In this case, it is necessary to separate the excitation and emission
based on their differing wavelengths or spatial characteristics; a number of
configurations have been described for doing this (Figure 7.10A, B). The two
most common make use of multilayer dielectric-coated mirrors or interference
filters. In Figure 7.10A the interference filter (with the depicted spectrum)
passes the exciting light, while reflecting the emission; the dielectric-coated
mirror in Figure 7.10B does the converse. The configuration in Figure 7.10A
suffers some loss because the interference filter (which may be angle-tuned to
admit the desired wavelength) seldom passes more than 30% of the light at
the peak. The configuration in Figure 7.10B takes advantage of the fact that
mirrors with multilayer dielectric coatings can be outstanding reflectors, with
coatings available that reflect 99 + % of particular laser lines, while passing
longer wavelengths. Problems can arise with this configuration, however,
when powerful (laser) or short-wavelength sources are used. In both con-
figurations the excitation hits the filter directly and often causes it to fluoresce
itself. The importance of this problem varies with the particular filter, the
wavelength of the signal fluorescence, and the sensitivity required. It is of course
generally true that the greater the sensitivity required, the more important
obscure sources of background fluorescence become. The configurations in
Figure 7.10C (3) and D use a spatial filtering method for separating (laser)
excitation from emission coming back out of the fiber. In both cases the
highly collimated laser beam passes through the hole in the mirror into the
fiber; the fluorescence comes out with a conical distribution determined by
the fiber’s numerical aperture and is reflected by the mirror into the detector.
The drawback to this method is that such mirrors must often be custom-made.
The diameter of the laser beam, the numerical aperture of the fiber, and the
magnification of the coupling lens all affect the placement of the mirror and
the size of the hole. Thus, a fiber with a low numerical aperture would need
358 Richard B. Thompson
Fluorescence-Based Fiber-Optic Sensors 359
the mirror placed far from the coupling lens to maximize the fluorescence
collected. The configuration in Figure 7.10D is simply an extension of that in
Figure 7.10C which eliminates the collecting lens; we have found that a small
aspheric mirror produced by Polaroid(41) is satisfactory for this purpose.
Up to this point, we have neglected the lens(es) used to couple light into
and out of the fiber. It is fortunate that microscope objectives often fulfill most
of the desiderata for such lenses, being highly corrected, compact, widely
available with a variety of characteristics, and relatively inexpensive, con-
sidering their quality. Design, construction, and alignment of comparable
optical systems using discrete lenses would be slow and costly. In fact,
microscope objectives of low power are well suited to coupling light
into multi- or singlemode fibers. Practical concerns here include matching the
angle of launching and spot size to the fiber numerical aperture and core size
or matching mode distributions in singlemode fibers.(7) Most microscope
objectives have low loss in the visible region of the spectrum, but transmission
falls off abruptly in the near UV; also, the glass in the objective can fluoresce
as well. Quartz objectives that minimize these problems are available, but are
very expensive. Reflective objectives are also expensive and are tricky to use
with lasers, but are largely wavelength-independent (see Figure 7.7). Other
optics used for coupling light into fibers include spherical lenses and gradient-
index (GRIN) rods,(7) which are akin to gradient-index fibers. These small
devices are best suited for use with laser sources and are less convenient to use
than microscope objectives; they are widely used to couple diode laser output
into fibers.
tuned. Most gas lasers also produce plasma lines at wavelengths differing from
the laser wavelength, which may mimic the fluorescence to be detected
(Figure 7.7). These plasma lines are incoherent and uncollimated, however,
and can be eliminated by a monochromator or spatial filter; the perforated
mirrors in Figure 7.10C and D are well suited for this.
Several attributes of a laser must be taken into account for fluorescence
sensors. Foremost is the wavelength of the laser, which is constrained by (or
constrains) the choice of fluorophores. In particular, a probe having a lower
peak extinction coefficient or quantum yield might still provide more signal,
if it could be excited more effectively by a particular laser line. (5,44) There
are several lasers available with output at wavelengths above 500 nm, but
comparatively few with output in the blue and UV regions. The paucity of
UV laser types, along with the high attenuation of fibers in the UV (see
Figure 7.3), is unfortunate, given that many interesting and useful fluorescent
probes are excited in this wavelength range. Future probe development will
likely be at wavelengths ranging into the near IR, where attenuation is low,
bandwidths are high, and a variety of sources are available.
In addition to wavelength, other attributes of the laser are important: its
power, stability, spatial and temporal characteristics, cost, and reliability are
all factors in the laboratory. For laser-based sensors employed in the field or
harsh environments, the cooling, space, and electrical power requirements may
also be important. Output power must be great enough to provide adequate
sensitivity, yet not so great as to damage components of the system or create
nonlinear effects. Damage is wavelength-dependent: picosecond microjoule
pulses (having very high peak powers) in the infrared may be compressed
in singlemode optical fibers,(40) but coupling even modest output from an
excimer laser in the far UV into a large-core silica fiber remains difficult.
Stability, both in intensity and beam pointing, is a concern, but most lasers
are adequate in this respect. The temporal characteristics are primarily of
interest for time-resolved experiments, and they are dealt with in the next
section.
The recent trend toward development of low-power solid-state lasers
promises to have a large impact on fluorescence-based fiber-optic sensors. The
diode-pumped YAG lasers with output frequency-doubled in the green† are
indicative of this trend, being all solid state, small, lightweight, and rugged,
and having low power consumption. At present, they remain expensive and of
modest (< 5 mW) output, but both characteristics are certain to improve.
Prototype continuous-wave (cw) diode lasers emitting frequency-doubled
(blue) output have been shown.(45) The commercial availability of gas lasers
†
Diode-pumped YAG lasers with frequency-doubled output in the green are currently available
from Spectra-Physics, Moutain View, California; Amoco Laser, Inc., Naperville, Illinois, and
AB Lasers, Concord, Massachusetts.
Fluorescence-Based Fiber-Optic Sensors 361
7.10. Polarization
7.11. Conclusion
Acknowledgments
I would like to thank Susan McBee and Lynne Kondracki for their help
in preparing the figures; Frances Ligler, David Kidwell, and Carl Villarruel
for helpful discussions and careful reading of the manuscript; Michael P. Eden
of Polaroid Corporation for the gift of an aspheric mirror; and the Office of
Naval Technology for support.
References
16. O. S. Wolfbeis, H. E. Posch, and H. W. Kroneis, Fiber optical fluorosensor for determination
of halothane and/or oxygen, Anal. Chem. 57, 2556–2561 (1985).
17. L. A. Saari and W. R. Seitz, Immobilized morin as fluorescence sensor for determination of
Al(III), Anal. Chem. 55, 667–670 (1983).
18. L. A. Saari and W. R. Seitz, pH sensor based on immobilized fluorcsceinamine, Anal. Chem.
54, 821–823 (1982).
19. D. M. Hercules (ed.), Fluorescence and Phosphorescence Analysis, Wiley-Interscience,
New York (1965).
20. S. Udenfriend, Fluorescence Assay in Biology and Medicine, Vols. 1 and 2, Academic Press,
New York (1962, 1969).
21. C. E. White and R. J. Argauer, Fluorescence Analysis: A Practical Approach, Marcel Dekker,
New York (1970).
22. J. I. Peterson, R. V. Fitzgerald, and D. K. Buckhold, Fiber optic probe for in vivo measure-
ment of oxygen partial pressure, Anal. Chem. 56, 62–67 (1984).
23. E. D. Lee, T. C. Werner, and W. R. Seitz, Luminescence ratio indicators for oxygen, Anal.
Chem. 59, 279–283 (1987).
24. F. V. Bright, Remote sensing with a multifrequency phase-modulation fluorometer in: Time-
Resolved Laser Spectroscopy in Biochemistry (J. R. Lakowicz, ed.), Proc. SPIE 909, pp. 23–28
(1988).
25. O. S. Wolfbeis and M. J. P. Leiner, Recent progress in optical oxygen sensing, in: Proceedings
of the SPIE Conference on Optical Fibers in Medicine III, Proc. SPIE 906, pp. 42–48
(1988).
26. D. M. Jordan, D. R. Walt, and F. P. Milanovich, Physiological pH fiber-optic chemical
sensor based on energy transfer, Anal. Chem. 59, 437–439 (1987).
27. J. L. Gehrich, D. W. Lubbers, N. Opitz, D. R. Hansmann, W. W. Miller, J. K. Tusa, and
M. Yafuso, Optical fluorescence and its application to an intravascular blood gas monitoring
system, IEEE Trans. Biomed. Eng. BME-33, 117–132 (1986).
28. J. I. Peterson, S. R. Goldstein, R. V. Fitzgerald, and D. K. Buckhold, Fiber optic pH probe
for physiological use, Anal. Chem. 52, 864–869 (1980).
29. W. A. Wyatt, F. V. Bright, and G. M. Hieftje, Characterization and comparison of three fiber-
optic sensors for iodide determination based on dynamic fluorescence quenching of
rhodamine 6G, Anal. Chem. 59, 2272–2276 (1987).
30. C. Dahne, R. M. Sutherland, J. F. Place, and A. S. Ringrose, Detection of antibody–antigen
reactions at a glass–liquid interface: A novel fibre-optic sensor concept, in: Proceedings of the
Second International Conference on Optical Fiber Sensors (R. T. Kersten and R. Kist, eds.),
pp. 75–79, VDE-Verlag, Berlin (1984).
31. J. D. Andrade, R. A. Vanwagenen, D. E. Gregonis, K. Newby, and J.-N. Lin, Remote fiber-
optic biosensors based on evanescent-excited fluoroimmunoassay: Concept and progress,
IEEE Trans. Electron Devices, ED-32, 1175–1179 (1985).
32. C. A. Villarruel, D. D. Dominguez, and A. Dandridge, Evanescent wave fiber optic chemical
sensor, in: Proceedings of the SPIE Conference on Fiber Optic Sensors II, Proc. SPIE 798,
225–229 (1987).
33. W. F. Love and R. E. Slovacek, Fiber optic evanescent sensor for fluoroimmunoassay, in:
Proceedings of the Fourth International Conference on Optical Fiber Sensors, Institute of
Electronics and Communication Engineers of Japan, Tokyo (1986).
34. T. P. Burghardt and N. L. Thompson, Effect of planar dielectric interfaces on fluorescence
emission and detection, Biophys. J. 46, 729–737 (1984),
35. E.-H. Lee, R. E. Benner, J. B. Fenn, and R. K. Chang, Angular distribution of fluorescence
from liquids and monodispersed spheres by evanescent wave excitation, Appl. Opt. 18,
862–868 (1979).
36. J. P. Dakin and A. J. King, Limitations of a single optical fibre fluorimeter system due to
Fluorescence-Based Fiber-Optic Sensors 365
Inhomogeneous Broadening of
Electronic Spectra of
Dye Molecules in Solutions
Nicolai A. Nemkovich, Anatolyi N. Rubinov, and
Vladimir I. Tomin
8.1. Introduction
367
368 Nicolai A. Nemkovich et al.
due to the presence of a constant dipole moment in the dye molecule and
induces the reactive field R in the cell; and (2) the restoring force which is due
to the action of the reactive field on the dipolar molecules of the solvation shell.
The action of the restoring force becomes evident if we consider what would
happen if the dye molecule were extracted from the cell. In this case the cell
would tend to reach its initial state at under the action of the restoring
force. The cell is in its equilibrium state when the two forces cancel each
other. This state is the most stable one, and it is characterized by the reactive
field Any deviation from this state requires some work to be done. The
latter will just be the measure of the potential energy of the nonequilibrium
solvate configurational interaction. Obviously, an absolute thermodynamic
equilibrium must lead to a Boltzmann equilibrium distribution of cells with
respect to the energy of the configurational interaction. Thus, the problem
is reduced to determining the configurational energy of the cell versus its
reactive field R.
Taking into account that the reactive field R is parallel to the dipole
moment inducing this field, the dipole–field interaction energy can be
represented as
This energy is essentially the work done by the dipole to reorient (polarize)
an elementary cell of the solution. The work necessary to perform such
reorientation of the solvation shell and to change the reactive field by dR can
be written in differential form as
It can easily be seen that in the solvate and R may be treated as a polarizing
force and the generalized coordinate for the solvation shell configuration,
respectively. The restoring force acts along with the polarization force
in the solvate. Thus, the total force affecting the solvation shell is
Assuming that at the total force is zero and taking into account
Eqs. (8.1) and (8.4), we obtain the proportionality factor
Inhomogeneous Broadening of Electronic Spectra of Dye Molecules 371
Thus, according to Eqs. (8.1) and (8.5), the total force acting upon the
solvation shell is
It can be seen from Eq. (8.8) that the total force is zero only for cells with
Such cells are the most stable ones and possess minimum configura-
tional interaction energy.
The deviation of the cell configuration from the equilibrium configuration
involves an increase in energy as compared to the minimum value. The
magnitude of this excess may be considered as a measure of the potential or
free energy of the nonequilibrium solvate. For a cell with the reactive field R,
this energy can be determined as the work required to restructure the solvate
from its equilibrium state with the field to the state with the field R:
As can be seen from Eq. (8.9), a plot of the potential energy of the solvate
configurational interaction versus the internal reactive force has a parabolic
shape with its minimum at
†
In thermodynamics it may be called “the free energy of the system.”
‡
For simplicity, we will consider the electronic transition with a constant dipole moment of the
dye molecule changing its magnitude rather than orientation. We will also neglect the quasi-
continuum of vibrational states of a complex molecule in this section.
372 Nicolai A. Nemkovich et al.
This equation takes into account that the reactive field for an equilibrium
solvate in its excited state is equal to
As seen from Eq. (8.11), the potential energy of the excited solvate is repre-
sented by the same parabola as for the unexcited state, but raised by
The field diagram in Figure 8.1a corresponds to the case in which the
dipole moment of the molecule increases on excitation
In this most common case, the solvent causes both inhomogeneous broaden-
ing and a general spectral shift to ward longer wavelengths (Figure 8.10)
(negative solvatochromism). When the solvent causes a spectral shift
to ward shorter wavelengths (positive solvatochromism), and the field
diagram of such a solution has the form shown in Figure 8.1b. The following
analysis of the inhomogeneous broadening characteristics holds equally for
both cases.
374 Nicolai A. Nemkovich et al.
Substitution of Eq. (8.18) into Eq. (8.17) gives the susceptibility fluctuation
distribution function for the cells
where
where is the dielectric constant of the medium, and a is the Onsager sphere
radius. Using this expression, Eq. (8.22) may be written in the form
where
The local fluctuation field R acting on the dye molecule will polarize it, thus
creating an induced dipole moment:
Taking into account that unexcited cells obey the Boltzmann law and using
Eq. (8.28), we obtain the solvate distribution as a function of local field
fluctuations:
where
(8.32)
As seen, in this case also, the cell distribution over local field fluctuations is
of a Gaussian form. The distribution over electronic transition frequencies is,
however, of a different form. We obtain from Eqs. (8.30) and (8.31)
378 Nicolai A. Nemkovich et al.
where
The values of and can be found from the balance equations if the
Boltzmann distribution of the configurational and vibrational energies of the
particles is taken into account:
where is the fluorescent lifetime of a molecule in the solvate with the type
I configuration, and is the Einstein coefficient defining the absorption of
such solvates at a frequency v. Using Eqs. (8.35) and (8.37), we obtain, from
Eq. (8.38),
form
where is the 0–0 transition frequency for ith type solvates, and is the
width of the homogeneous absorption band. For ith type solvates resonantly
excited by frequency v, the condition is fulfilled, and hence
For the type I solvates, and the Einstein coefficient for the exciting
light frequency is equal to
Substituting these values of and into Eq. (8.42), we obtain for the
spectral excitation range (corresponding to the long-wavelength slope of
the total absorption band) the following expression for the degree of selectivity:
A similar analysis can be made for the exciting light frequency range
(corresponding to the short-wavelength slope of the total absorption band).
The difference between these two cases is that radiation of frequency
can be absorbed by all the equilibrium solvates irrespective of the vibrational
energy stored by the dye molecule. Integration of the expression analogous
Inhomogeneous Broadening of Electronic Spectra of Dye Molecules 381
to Eq. (8.38) will therefore cover the range from 0 to For excitation
frequencies this results in
It follows from Eqs. (8.45) and (8.46) that the excitation selectivities for the
blue and the red edge of the total absorption band of a solution are essentially
different. Excitation can be regarded to be selective when since only in
this case is the number of selectively excited nonequilibrium solvates greater
than that of nonselectively excited equilibrium solvates.
Following Eq. (8.46), in the case where and is assumed
to increase with blue-edge selective excitation is only possible for
that is, under the trivial condition that the width of the inhomo-
geneous spectrum is wider than that of the homogeneous spectrum. For
complex molecules, the width of the vibrational (homogeneous) spectrum
amounts to hundreds of wave numbers; that is, it is of the same order of
magnitude as the width of the inhomogeneous spectrum. Therefore, no
selectivity will be observed with blue-edge excitation, as a rule.
The situation is, however, different for the long-wavelength, red-edge
absorption. According to Eq. (8.45), in this case the degree of selectivity can
be greater than 1 even at In other words, if the incident light
frequency is localized at the long-wavelength edge of the absorption band,
selective excitation of nonequilibrium solvates will be ensured even when
the width of the homogeneous spectrum exceeds that of the inhomogeneous
spectrum. As seen from Eq. (8.45), however, this is true provided the
excitation frequency is within the frequency range given below:
calculated using Eqs. (8.21) and (8.22) for polar molecules and Eqs. (8.33)
and (8.34) for nonpolar molecules if the solution melting point is taken as the
value of T. As shown above (see Table 8.1), the width of the inhomogeneous
function has been estimated as for polar molecules and
for nonpolar molecules. These results are of the same order of
magnitude as measurements made at liquid helium temperature.(23, 26)
Thus,
for the polar 1-iodonaphthalene molecule in butyl bromide at
an inhomogeneous spectral width of was reported,(26) whereas, as
mentioned above, for the nonpolar perylene molecule in n-hexane at the same
temperature, was measured to be
Thus, the characteristics of inhomogeneous spectral broadening of
complex molecules at liquid helium and liquid nitrogen temperatures prove to
be the same. However, the homogeneous spectra of molecules are essentially
different at these temperatures. As shown in Refs. 7 and 27 for amorphous
matrices, fine-structure, rather than diffuse, absorption and fluoresencee
spectra consisting of a set of narrow nonphonon lines are observed at liquid
helium temperature. Superposition of such spectra belonging to cells of
different configurations leads to a structureless spectrum. Therefore, it is the
inhomogeneous broadening effect that plays the determining role in the
commonly observed structureless appearance of absorption and fluorescence
bands. If the homogeneous linewidth is rather small (experiments(28, 29) show
Inhomogeneous Broadening of Electronic Spectra of Dye Molecules 383
Also, for simplicity, let the configurational interaction be weak compared to the
intramolecular one. Then may be assumed to be independent
of the stored vibrational energy that is, the shape of the spectrum is the
384 Nicolai A. Nemkovich et al.
same for all the solvates. Thus, the configurationally broadened absorption
spectrum can be represented as
is fulfilled, the solvate distribution in the ground state should differ from the
equilibrium one. In this case, the solution becomes more transparent and
behaves as a spectrally inhomogeneous system.
With knowledge of the shape of the homogeneous absorption spectrum
the absorption band of the solution can be calculated from Eq.
(8.52). As shown in Ref. 30, for complex organic molecules the absorption
band can be nicely approximated by the function
The matching coefficients a and b are related to the width of the absorp-
tion band (for the level determined by the pure electronic transition
frequency ) and the distance between the absorption maximum and the
frequency by the relations
where
where is the excited state lifetime for molecules with electronic transi-
tion frequency is the mean lifetime of excited molecules, and is
determined from Eq. (8.21) and from Eq. (8.53) with the excitation
frequency substituted for v.
The solution fluorescence spectrum changes with the distribution function
Figure 8.4 shows the fluorescence spectra calculated from Eqs.
(8.55)–(8.60) as a function of the exciting light frequency. Upon excitation at
the long-wavelength slope of the absorption band, the fluorescence spectrum
depends on the excitation frequency; it is shifted toward longer wavelengths
as decreases. Initially, the change in does not alter the shape of the
fluorescence band. Only for rather small incident light frequencies, that is,
upon excitation of essentially nonequilibrium solvates, is spectral broadening
observed owing to the above-mentioned decrease in the excitation selectivity
also triplet states of dyes were broadened configurationally and, under the
condition of slow reorientation of molecules in a solution, this broadening
was inhomogeneous.(37)
The application of tunable dye lasers for excitation opens up new
possibilities of studying configurationally nonequilibrium solvates in a
solution. Figure 8.5 gives BL measurement data for a frozen ethanol solution
of the well-known dye rhodamine 6G under fluorescence excitation by a
continuous-wave (cw) tunable dye laser. The solid curve (I) represents the
absorption spectrum. At the bottom of the figure, a set of luminescence curves
corresponding to excitation at different wavelengths (indicated by the arrows)
in the far anti-Stokes spectral region is given. Absorption in this region is
already very small due to the small population of the upper configurational
states. Nevertheless, the excitation of luminescence proved to be possible due
to the high laser radiation intensity employed. Excitation at frequency
near the absorption maximum yielded the fluorescence spectrum labeled 1.
The position and the shape of this spectrum are practically independent of
the frequency for varying within the main part of the absorption
band, where the broadening is largely due to the vibrational sublevels of the
equilibrium unexcited configurational state. The situation changes, however,
when is shifted to the anti-Stokes region. Because of the abrupt break in
the long-wavelength slope of the absorption spectrum of complex molecules in
390 Nicolai A. Nemkovich et al.
It has been known for many years that the influence of inductive-
resonance energy transfer between similar molecules on the luminescence
properties of rigid solutions is restricted by the fact that the degree of
polarization, the quantum yield, and the decay time decrease with increasing
concentration. (39) The last two effects are believed to be caused by energy
migration to low-luminescence centers(39); that is, they would not be observed
if luminescence centers of only one type were present in the solution. The
majority of previous theories which described the influence of energy transfer
on the luminescence characteristics of solutions (see Refs. 39–41) did not
predict the existence of any concentrational effects besides those mentioned
above.
The presence of a continuous set of configurational solvate states presup-
poses the possibility of inductive–resonance energy transfer between different
states provided that the concentration of dye molecules in the solution is high
enough. If such a transfer does occur, it will apparently be directed from the
solvates with a high frequency of the pure electronic transition to those with
Inhomogeneous Broadening of Electronic Spectra of Dye Molecules 391
lower values of this frequency, that is, from the “blue” solvates to the “red”
ones (the reverse transfer must be hindered due to its low probability).
All this is illustrated by the diagram in Figure 8.6 of electronic–configura-
tional states in solution. For simplicity, the diagram depicts a discrete, rather
than a continuous, set of electronic–configurational states.
When the solution concentration is increased, the sublevel may be deac-
tivated by both spontaneous emission and nonradiative energy transfer
(dashed lines in Figure 8.6) to solvates with lower frequencies of
the pure electronic transition, since the probability of energy transfer in this
direction is higher than in the reverse direction because of different overlap
392 Nicolai A. Nemkovich et al.
integrals of the absorption and emission spectra of “blue” and “red” solvates.
For rigid solutions, gradual passage to excitation on the low-frequency slope
of the absorption spectrum, that is, decreasing the excitation frequency,
results in selective excitation to the first singlet state of the solvates that, in
their ground state, populate configurational sublevels higher than the equi-
librium sublevel (Figure 8.6). When excited, these solvates will, accordingly,
populate the configurational sublevels lying below the Franck–Condon sub-
level I* (up to the equilibrium sublevel II*). At the same time, the efficiency
of nonradiative energy transfer decreases, since energy transfer to more
“red” solvates practically does not occur due to their insignificant proportion
in the solution, and the reverse energy transfer to the sublevel I* is of low
probability.
Directed nonradiative energy transfer (DNRET) in rigid polar solutions
and polymer matrices is responsible for a number of peculiar concentrational
effects. These include the long-wavelength shift of luminescence spectra with
increasing luminophore concentration(13) and the dependence of emission
anisotropy on the luminescence and absorption spectra.(42) For the emission
spectra, luminescence anisotropy is higher on the blue slope and drops toward
longer wavelengths. In the case of red-edge excitation, the emission anisotropy
values increase to close to the limiting values.
It was also found that in solutions of complex molecules with inhomo-
geneously broadened spectra, at high concentrations some unusual time
characteristics of the luminescence were observed: the luminescence decay
time increased toward longer wavelengths, and the luminescence decay on the
slopes differed appreciably from single exponential.(38, 42) Finally, it has been
shown (43) that for red-edge excitation the concentrational effects disappear in
systems with inhomogeneously broadened spectra, as follows from the
diagram in Figure 8.6.
DNRET was directly observed for the first time using nanosecond
spectrofluorimetry. This technique makes it possible to visualize the dynamics
of the process and measure the required kinetic parameters. (44) In Ref. 45,
DNRET was observed between different configurational states of solvated
molecules in a rigid solution. Poly(vinyl alcohol) was used as the solvent.
At a low dye concentration, steady-state inhomogeneous broadening was
observed: for 3-aminophthalimide molecules the luminescence spectrum
shifted as the exciting light frequency was varied. In other words, in this case
ordinary bathochromic luminescence was observed, consistent with selective
excitation of various configurational states of solvated molecules. The batho-
chromic luminescence spectrum was time-independent. At high concentrations
(Figure 8.7) the situation was, however, different. The instantaneous emission
spectrum shifted gradually to the red. In this case, each instantaneous
spectrum corresponds to a specific solvate state whereas the entire sequence
of spectra represents the energy transfer dynamics for a set of configurational
Inhomogeneous Broadening of Electronic Spectra of Dye Molecules 393
states. The higher the dye molecule concentration, the faster is the energy
transfer. As would be expected, the energy transfer direction is always the
same, from the “blue” to the “red” solvates. The time shift of the luminescence
spectrum is responsible for the nonexponential luminescence decay in the
emission spectrum, being most distinct on its slopes. In addition, the lumines-
cence decay time depends on the recording wavelength (Figure 8.7, curve 5).
Figure 8.8 shows the position of the luminescence spectrum maximum versus
time for solutions with different concentrations of 3-aminophthalimide. In line
with theory, the rate at which the spectrum shifts is seen to depend on time.
It is maximum just upon excitation and then decreases. With decreasing
concentration, the time shift amplitude decreases.
The unique feature of directed energy transfer in the systems involved is
that it occurs between chemically identical molecules with different shells
rather than between chemically different molecules with differently located
energy levels. This means that in this case the directed transfer results in
spontaneous concentration of excitation energy on molecules with a specific
surrounding structure, that is, in some structurally specified local regions of the
rigid solution. Of particular interest is the fact that energy is transferred from
structures with a low degree of molecular orientation to those characterized
by a high degree of orientation. Indeed, the dependence of the electronic
transition frequency of a molecule on the local field intensity is defined by
Eq. (8.10). This suggests that the higher the solvate reactive field intensity,
the lower is the transition frequency. In other words, the “blue” centers
correspond to solvates with low R, that is, with a more chaotic orientation of
394 Nicolai A. Nemkovich et al.
Inhomogeneous Broadening of Electronic Spectra of Dye Molecules 395
solvent molecules in the solvate shell, whereas the “red” centers represent
solvates with a high-intensity reactive field and, therefore, with a higher
degree of orientation of their molecules.
Thus, the migration of electronic excitation from “blue” to “red” centers
can be regarded as directed energy transfer from structures of low order to
highly ordered structures in the solution. Such a process can naturally be
expected to play a certain part in the mechanism of directed energy transfer
in biological systems, in particular, in the transfer of electronic excitation
energy from the antenna chlorophyll molecules to the reactive center in the
photosynthetic system of plants. In Refs. 19 and 20 energy exchange between
molecules of the photosynthetic pigments chlorophyll a and pheophytin a was
studied experimentally under model conditions with pigments introduced into
the polar matrix. Data for chlorophyll a are presented in Figure 8.9. Both the
instantaneous spectrum and the spectrum provide evidence for the existence
of DNRET in this case.
We conclude by noting that the theory of inductive–resonance energy
transfer in solutions with inhomogeneous spectral broadening is given in Refs.
17 and 42.
Equation (8.62) is rather general as it takes into account not only the
396 Nicolai A. Nemkovich et al.
change with time and the dependence of the excited state population on the
experimental conditions, but also the dependence of the spontaneous emission
spectrum on these conditions.
Analysis of Eq. (8.62) reveals all the experimental features of lumines-
cence of polar solutions including the kinetics of the instantaneous emission
spectra [at a given /, we have the instantaneous spectrum for a given instant
of time, and the decay law if the frequency is fixed
and t is variable.
If
substituting Eqs. (8.4) and (8.64) into Eq. (8.65) and integrating over t, we
obtain
and taking into account the relationship between the frequencies and
according to Eq. (8.15), one can obtain the molecular electronic transition
frequency as a function of time and exciting light frequency:
Assuming for simplicity that the absorption and fluorescence spectra are
mirror images and taking into account that for the cells with field R0 to be
selectively excited it is necessary to irradiate the solution with radiation of
frequency rather than determined from Eq. (8.67), we obtain
the explicit form of the dependence of the solvate fluorescence maximum on
time and the exciting light frequency
As seen from Eq. (8.69), the relaxation rate of the fluorescence spectrum
is determined by the value of If the friction coefficient is a constant,
then the decay obeys a simple exponential law. In a more general case, of
course, the coefficient must depend on the rate of change of the reactive
field, dR/dt, and increase with it. This, in turn, must inevitably lead to
more complicated decay laws which can be described by assuming to be
a function of R or by replacing the exponent in Eq. (8.69) by a sum of
exponents with corresponding coefficients, as was done in
Ref. 46.
Note that the nature of all the implications of Eq. (8.69) can be
qualitatively explained using the field diagram of a solvate.
The relationship is calculated from Eq. (8.69) for a typical
phthalimide molecule in n-propanol at different temperatures in Ref. 11.
Inhomogeneous Broadening of Electronic Spectra of Dye Molecules 399
that is, the decay is faster than without relaxation. In the long-wavelength
spectral range,
that is, the fluorescence decay time does not vary due to the configurational
relaxation. The above types of behavior are shown in Figure 8.10a (curve 1).
It is quite clear, from the physical point of view, that the drift of the
fluorescence spectrum away from the recording frequencies is equivalent to
a decrease in whereas the shift of the spectrum toward the recording
400 Nicolai A. Nemkovich et al.
frequency, the fluorescence decay time will be described not by Eq. (8.73), but
by Eq. (8.72), whereas for the low-frequency portion of the spectrum,
is specified by Eq. (8.72) rather than by Eq. (8.73). However, in the region of
the maximum of the equilibrium spectrum the decay law is not distorted by
relaxation, as before. These results are illustrated by curve 3 in Figure 8.10a.
Light pumping at the equilibrium configuration excitation frequency
yields a purely exponential decay law (Eq. 8.74) for all regions of the
fluorescence spectrum, since in this case the fluorescence spectrum does not
undergo a shift with time (Figure 8.10a, curve 2).
Summarizing the results of the analysis of the spectral behavior, it is
necessary to describe three different types of dependences of the fluorescence
decay for variation of the excitation frequency over a wide range. Such a
diversity of spectra is due to the existence in the solution of different
types of solvates, each having its own dynamic luminescence spectral
behavior, and the possibility of their selective excitation by monochromatic
laser light. If this fact is neglected, the picture obtained is poor and describes
the solvation shell reorientation kinetics for only one type of centers. This
was the situation that existed before the concepts of the configurational
broadening of polar solution spectra were introduced. The fluorescence decay
behavior in a viscous homogeneous solution was considered for the first
time in Ref. 48 in terms of the Onsage model under the assumption that
the maximum shift of the fluorescence spectrum due to the reorientation
of the molecules of the solvation shell followed an exponential law. The
authors calculated the curves from the emission spectrum, assuming
instantaneous excitation, and showed their essential dependence on the
recording frequency.
Somewhat later, identical results were obtained using both a discrete and
a continuous model of the solution.(49)
All the theoretical dependences considered in this section have been
verified and supported by the methods of time-resolved spectroscopy. The
results are given in the next section.
The study of the time shift of the spontaneous emission spectrum with
varying energy of the exciting radiation is of indisputable interest, since it
enables the conclusions about the process of intermolecular relaxation that
have been made on the basis of the model presented here to be verified.
The first comprehensive measurements in both the Stokes and anti-
Stokes regions were reported in Refs. 10 and 11. The studies were performed
with a nanosecond spectrofluorimeter (time resolution about 1 ns) using a
tunable pulsed dye laser as the excitation source.
402 Nicolai A. Nemkovich et al.
where is a constant.
Inhomogeneous Broadening of Electronic Spectra of Dye Molecules 405
Here the subscripts g and e indicate the ground and the excited singlet state,
respectively; and are the root-mean-square deviation of this
distribution and the position of its maximum during relaxation, respectively.
Since the width of the homogeneous absorption spectrum is large due to
vibrational broadening, it is impossible to excite selectively only one type of
solvates. Therefore, immediately after optical excitation the instantaneous
luminescence spectrum is inhomogeneously broadened. Since the number of
solvates with the 0–0 transition frequency at the zeroth instant of time in the
excited state is equal to the product of the number of solvates in the ground
state with the same frequency times the absorption coefficient of this type of
solvate, the inhomogeneous broadening function at time t = 0 is equal to
these same times; has been calculated from Eq. (8.77), and
was found by deconvolution of instantaneous luminescence spectra recorded
on a laser spectrometer.
The steady-state luminescence and absorption spectra of 3-ANMF in
nonpolar n-hexane have been taken as the homogeneous spectra. The calcula-
tions show that, with the assumptions of a Gaussian shape, is about
at t = 0, then increases to (t = 3 ns), and decreases again
to (t =16 ns)
As follows from Eq. (8.75), the fluorescence kinetics at a fixed frequency
are determined by the variation of the inhomogeneous broadening function
with time, which, in turn, depends on the functions and Calcula-
tions using Eq. (8.75) have shown that experimental and calculated decay
curves are well matched (Figure 8.13) if is specified as
noted that this additional kinetic energy imparted to the molecules turns out
to be significant and is sufficient to rotate them by a particular angle. The
value of this angle depends on the intermolecular solvate energy after optical
excitation. Such an effect, as mentioned above, can be regarded as light-
induced molecular rotation.
The emission anisotropy additivity rule has been used to estimate the
angle of rotation of the luminophore molecule.(53) According to this rule, if
there are several types of luminescent centers in the system, the emission
anisotropy of the whole system is equal to the sum of the values of the
emission anisotropy of the individual components. If the emission only from
the Franck–Condon and equilibrium sublevels is taken into account (Figure
8.la), this rule can be written as
where the characterize the emission anisotropy kinetics for the Franck–
Condon (i = 1) and equilibrium (i = 2) states, and is the parameter charac-
terizing the contribution to the total fluorescence of the ith state, which is
determined by the expression
where the characterize the decay kinetics for the Franck–Condon and
equilibrium states calculated from the differential balance equations for the
populations of the states, the are the spectra of the states, and
is the integrated luminescence intensity of both types of states.
412 Nicolai A. Nemkovich et al.
Using the Levshin-Perrin formula (54) for the limiting value of the
emission anisotropy,
where is the angle between the absorption and emission oscillators, one can
determine the angle of light-induced rotation of the luminophore molecule
during configurational relaxation. For 3-ANMP in glycerin, this value was
at the excitation frequency
therefore, be static, as in the case of rigid polar solutions. The broadening due
to fluctuations in the probe–environment interaction energy at a given depth
will be dynamic, as in the case of viscous solutions.
Let us construct, by analogy with polar solutions, a diagram of electronic-
configurational states of the system consisting of the probe molecule and its
immediate surroundings in the membrane (we shall call this system the
solvate). Let us plot on the abscissa the local electric field intensity R and on
the ordinate the solvate potential energy, which includes the electronic energy
of the probe and the probe–environment interaction energy as well as the
interaction energy of the solvate membrane segments.
Such a diagram for two different probe localization depths is shown in
Figure 8.18, where the left-hand pair of curves corresponds to the most
probable localization. On each curve the energy minimum corresponds to the
Inhomogenaous Broadening of Electronic Spectra of Dye Molecules 415
most stable configuration of the probe and the solvate membrane segments,
for which all the types of interaction are in relative equilibrium. Thermal
motion can also give rise to other solvate configurations with lower or higher
values of R. In this case, however, the relative equilibrium of intermolecular
forces is disturbed and the potential energy of the solvate increases.
As the solvate configuration and, therefore, the field do not change
during the transition of the probe molecule to another electronic state, the
electronic transitions in Figure 8.18 are represented by vertical lines (inter-
molecular treatment of the Franck–Condon principle). In the excited state
the electric dipole moment generally increases. Thus, in the upper state
the potential energy minima are shifted toward higher R as compared to in
the ground state. It can be shown that, as in the case of solutions, the pure
electronic transition frequency of the probe in the membrane is unambiguously
416 Nicolai A. Nemkovich et al.
probe in mixing states which differ in depth within the membrane is not
significant.
spectrum with time is the dependence of the fluorescence lifetime on the wave-
length (see Figures 8.22 and 8.23). The parallel processes of configurational
relaxation and deactivation of the excited state lead to essentially non-
exponential decay kinetics for the luminescence at the red and the blue edge
of the spectrum.
422 Nicolai A. Nemkovich et al.
Let us now direct our attention to the diagram of energy levels of fluo-
rescent centers in a membrane (Figure 8.18). In the case of excitation such that
(the subscript i indicates that the frequency of zero configurational
excitation is different for centers with different depths of localization), energy
relaxation begins upon excitation. During relaxation, the excess configura-
tional energy is expended in other degrees of freedom of the fluorescent
center. As in the case of LIR in solution, this must lead to an increase in the
fluctuation force acting on the probe and acceleration of the process of
fluorescence depolarization. It is clear that the effect is the stronger, the
greater the intermolecular interaction energy released in the course of
relaxation. This conclusion is confirmed by experiment.
The studies reported in Ref. 58 have shown that the emission anisotropy
kinetics of 1-PNA in MPB essentially depend on the excitation and recording
wavelengths. For excitation within the long-wavelength absorption band the
kinetics are nonexponential, and the depolarization rate decreases with time
(Figure 8.23). The red-edge fluorescence depolarization is always faster than
the depolarization at the blue edge.
With excitation in the region of the maximum of the long-wavelength
absorption band (337 nm), the moment the excitation pulse causes, the
emission anisotropy values are significantly lower than the limiting value. This
points to the existence of a subnanosecond component of the depolarization
that is invisible under the experimental conditions. With excitation at the red
edge of the absorption band, (390 nm, Figure 8.23), the values of the emission
anisotropy increase practically up to the limiting value; that is, the subnano-
second component of the depolarization disappears.
The data indicate that for 1-PNA molecules in phospholipid membranes
the process of configurational relaxation of the luminescence spectrum,
which follows the process of excitation, accelerates the probe fluorescence
depolarization. This depolarization is most pronounced when the recording
frequency is at the red edge of the fluorescence spectrum, where the greatest
contribution to the luminescence is made by solvates that have undergone
relaxation.
The increase in the fluorescence anisotropy on going to longer excitation
wavelengths (Figure 8.23) may be attributed to the lower configurational
energy of the solvates absorbing the lower frequency radiation.
The dependence of the luminescence depolarization rate on the recording
wavelength influences the character of the steady-state polarization spectrum;
that is, as the wavelength increases, the depolarization monotonically
decreases. The wavelength dependence of the fluorescence decay time also
essentially contributes to this effect.
As indicated above, light-induced rotation was also observed in polar
Inhomogeneous Broadening of Electronic Spectra of Dye Molecules 423
8.6. Conclusions
We will summarize here the results presented in this chapter. The data
that have been obtained to date enable us to draw the following conclusions:
1. Polar solutions of complex molecules are systems with inhomo-
geneous configurational broadening. The differences in the spectra
of the fluorescent centers in a polar solution are caused by the
variability in structure of the solvation shells surrounding the fluo-
rescent molecule owing to the thermal motion of the molecules in the
solution.
2. In rigid solutions, inhomogeneous broadening is static in nature and
shows up in steady-state spectra. Static inhomogeneous broadening
is observed in liquid solutions, too, when the condition of slow
molecular reorientation is satisfied. This condition can be
424 Nicolai A. Nemkovich et al.
Acknowledgment
References