Documente Academic
Documente Profesional
Documente Cultură
4, APRIL 1975
Invited Paper
L I
hlt) ;H (1)
n (1)
Fig. 1. A recording setup typical of those before the mid 1920’6.
Fig. 2. A signal block diagram for the recording setup of FW.1 includ-
ing the effect of additive surface noise.
produced. One can firmly convince oneself of this by taking a
modem recording and passing it through sharp cutoff filters Stated in mathematical terms, one is given the result of hav-
(digitally realized if one desires) with cutoff frequencies of the ing convolved two unknown signals, s ( t ) and h ( 0 , and from
kind mentioned. One can even add some artificially generated that combination is asked to estimate one of them. It has been
surface noise if one wishes and still the listening quality will our philosophy from thevery beginning t o approach this prob-
far exceed any available on old acoustic disks. Indeed, mea- lem as a filtering problem in which one is required to separate
surementsmade on old recording equipment indicate that two signals. Thedifference from conventionalfiltering,such
sharp resonant amplitude distortions with variations from 10 as that used to separate two television channels or two radio
t o 20 dB or more are commonly encountered in the frequency stations, is that in this case the undesirable component, h ( t ) ,
range between 100 and 1000 Hz on such recordings. This ef- has not been added t o but instead convolved with the desired
fect not only produces a megaphone quality (which the reader component, s(t).
might simulate by cupping his hands in front of his mouth and Following the homomorphic theory of Oppenheim [ 101, we
speaking) but also producesa very unpleasanteffect in the started our work by assuming that we could ignore the effect
form of loud bursts of soundwhencertainfrequencies are of surface noise on our process [ 11I , map the process of con-
played or sung. volution into one of addition, and treat the transformed prob-
Unfortunately, the amplitude distortionsproduced by acous- lem by conventional filtering techniques. While this approach
tic recording equipment were not of fixed character from re- led to interesting results from the outset, we have been con-
cording to recording. A typical setup, depicted in Fig. 1, in- stantly refining our attitude and approach to theproblem to a
volved five basic mechanisms (91. They were a hornfor point far beyond our initial understanding. As we shall see,
gathering acoustic energy, a tube for conducting that energy there is a close relationship between the problem of adapting
to a device called a sound box, a sound-boxdiaphram mecha- power spectrum estimation techniques to nonstationarysignals
nism, a lever mechanism and associated stylus for cutting the and the homomorphic deconvolution filtering idea we just d e
groove, and a turntable driving a wax disk into which the re- scribed. As aresult, we have beenforced to gain a better
cording was cut. As shown in thefigure, the singing which ap- understanding of both subjects.
peared at the mouthof the recording horn as an acoustic signal
s ( t ) was delivered to the disk after having been modified by
HOMOMORPHIC
DECONVOLUTION
the horn and its associated mechanism. The recording setup depicted in Fig. 1 is represented in Fig.
Surprisingly enough, the engineers of that day were excellent 2 in terms of a signal block diagram. In addition, we have in-
craftsmen and managed t o avoid what we presently call non- cluded the effects of surface noise which becomes combined
linear distortion quite well. The major effect of the recording withthe recorded waveform u ( t ) to produce the playback
horn system is, therefore, the same as that of a linear filter. In waveform p ( t ) that is available to us today when we play back
terms of functions, the singing waveform s ( t ) was convolved an old disk on modem reproductionequipment. Although the
with the impulse response of the recording mechanism h ( t ) t o surface noise on all records tends to become louder as the r e
produce the recorded waveform u ( t ) . Except for surface noise, corded signal increases in volume, this effect is minor in high-
it is this latter waveform which is available to us today by play- quality copies. As a result, we consider the effects of surface
ing an old recording. noise as additive. Since the recorded signal u ( t ) is related to
Unfortunately, the recording horn impulse response h ( t ) was the original acoustic waveform by
varied from one recording t o another. The primary cause of u(t) =s(t) 0h(t) (1)
this effect was probably due to the fact that the sound box
mechanism played more the role of a resonant amplifier than and the playback waveform to therecorded signal by
that of an acoustic to mechanical transducer. The recording
At)=W )+ (2)
engineers knew this and were constantly making an effort to
tune their sound boxes for maximum efficiency. Such tuning, the overall relatiodhip is
coupled with the placement of recording horns, the length of p ( t ) = ( s ( t ) 0 h ( t ) )+ no).
connecting tubes, and the shape of the horns, provided marked
variation in the characteristics of h ( t ) from recording to re- The objecthere is, given the playback waveform p ( t ) , t o
cording even for the same performer from day t o day. Thus reproduce the original acoustic waveform s ( t ) as closely as
it is that in restoring these old recordings, we are faced with possible. In this form, the problem sounds very much like a
a blind deconvolution problem since we know neither the sing- classical Wiener filtering problem. Unfortunately, such prob-
ing signal s ( t ) nor the impulse response h ( t ) involved in any lems usually assume that thedetails of the system H are known.
particular recording under consideration. Of course,the major difficulty of the old recordingsrestoration
680 PROCEEDINGS OF THE IEEE, APRIL 1975
problem is that the recording system frequency response H ( f ) Again, taking the complex logarithm,we obtain
is unknown. A second difference encountered here is that in
classical filtering problems, the signals are assumed to be sta-
tionary.Acoustic waveforms of singing such as s ( t ) arefar If windows whose length is on the order of half-second are
from stationary as can be readily appreciated by the fact that used and a 50-percent overlap from window to window is em-
the energies in such signals possess strongvariationsfrom ployed (e.g., as is commonlydonewitha simplehanning
moment to moment. window), and since a typical old recording might be between
Noting these issues and assuming for the time being that the 3 and 5 min in duration, then from 300 to 600 intervals for
noise n ( t ) is negligible, one is faced with the problem: given which (9) holdswouldbe available.Averaging over all of
u ( t ) as in (l), find s ( t ) . Following the homomorphic filtering these intervals yieldsthe relationship
concept, the first stepin the process isto transform the convo-
lution equation of (1) into a linear equation in the hope that
the welldeveloped discipline of linear signal processing might
be applied. This can be done by first taking the Fourier trans-
form of both sides of (l), thus producing We would at this point hope that the first term in the right-
hand side of (10) would converge to zero for large' N . Un-
= S c f ) H(f) - (4 1 fortunately, this does not happen. Thisfact is made clearer
which is simply the familiar frequency response relationship by writing (10) as two separate equations, the first equating
between the input and the output spectra for a linear system. the real parts and the second equating the imaginary parts of
By further taking the complex logarithmsof both sides of (4), the complex quantities involved. This is done in
we obtain
log V ( f l =log S(fl + log H c f ) (5 1
which possesses the additive propertywe wish. and
At this juncture, one approach might be t o take several re-
cordings made with the same recording equipment, put each
one in the form of (51, and average both sides of these equa-
tions across all of the recordings. If there were enough record-
ings and the singing on each were sufficiently different, one which descrii the approximateattenuationand phase rela-
might expect, according to the central limit theorem, that the tionships between the original s i n g i n g and the recorded wave
right-hand side of ( 5 ) would converge t o log HV). forms, respectively.
There are difficulties with this idea. The most important is Concentrating first on the log amplitudes of (1 la), itwould
that there are not a multiplicity of recordings available. There be our hope that the first term in the right-hand side would
is only one, because h ( t ) was varied from one recording to converge to zero for large N. Theoretical considerations would
another. This problem is overcome by chopping that recording predict that this does not happen since, crudely speaking, this
into many intervals of moderate length (e.g., one-half-second termrepresents in decibels the average distribution in fre-
intervals).Each of theseintervals,which we will call ui(t), quency of the energies constituting musical signals. Practical
represents a slightly differentmusical passage perturbed by the experiencebears out thisexpectation.Indeed, aswe shall
same recording mechanism. A small difficulty arises, however. show shortly, if the singing waveform were a stationary pro-
Each of these intervals is not given precisely as a convolution cess, this term would be onehalf of the logarithmof the power
of the corresponding interval of the original acousticwave and spectrum of that signal except for asmall additive constant. It
the impulse response of the recording horn. The problem d e is common knowledge that even if one were to assume that a
tails have to do with edge effects at the ends of the chopped music waveform were stationary, its spectrum would be cer-
intervals. Nevertheless, if the intervals are long compared to tainly far from white (i.e., constant or flat). In pursuing the
the temporal extent of the impulse response h ( t ) , the approxi- blind deconvolution objective, it remains to remove this term
mate relationship so that log IH(fll may be revealed.
This goal is reached by processing a modem recording of a
ui ( t )N si ( t ) 8 h ( t ) (6)
musical selection similar t o the one tobe restored in a manner
results. identical to thatjustdescribed. The recordingmechanisms
Furthermore,bymultiplyingthechoppedintervalsbya used to make the modem recordings, hereafter called proto-
smooth window function, we obtain a slightly modified set of types, possess virtuallyflatfrequency responses. Forthem,
vi's which are related closely t o the corresponding modified set the second term in the right-hand side of (1 1) reducesto zero.
of SI'S by Thus, for the modern recording, the averaging process yields
an isolated version of the first term in the right-hand side of
-
Ui(t) = Wi(t) * u ( t ) = Wi(t) [ s ( t ) 0 h ( t ) ] (1 1a) which can then be removed by subtraction.' The basic
N [ w i ( t )* s(t)l 8 h ( t ) = si(t) 0 h ( t ) . (7)
If the windows wi(t) are smooth and long compared to the 'T h e requirement for large N, and that the windowsbe long compared
temporal extent of the impulse response h ( t ) ,the approximate to the temporal extent of h(t), is the essential motivation for the as-
equality of (6) holds very closely. Taking the Fourier trans- sumption made in the introduction that one unknown signal be of con-
aidesably d e r extent than the other.
form of both sides of ( 6 ) ,we obtain 'Claims for the subjective importance of this atep were fnst made by
N. J. Miller who also selected the data and made th6 f i estimates that
Vi (f)N Si (f)* H(fl. (8) were used.
STOCKHAM e? 01.: BLIND DECONVOLUTION 68 1
In Figs. 3 through 14, inclusive, the abscissa represents frequency in the original acoustic disk by high-speed convolution processing.
hertz on a logarithmic scale. The ordinate represents decibels on a scale ~ind the
~resulting
, samplesare convert& to an =dogsignal
of 10 dB per main division. The equations in this paper corresponding
to these fiiures have been formulated using the units of nepers because and presented for listening.
thesimplicity ofthe M t U d logarithm was required to produce qua- The results of this processing are very striking, especially
tions of manageable complexity. In all cases, the results obtained from
the rmry be from n e ~ to s decibels simply by from an artistic
point
of view where the emphasis is not so
multiplying
them
by 8.686. - . much on producing
recording
a of modem
quality as it is on
682 PROCEEDINGS OF THE IEEE, APRIL 1975
SActually, the Wiener filter would involve the phase of HCf) in its
formulation, but recall that we have assumed that phase compensation
'The a's are of course functions of frequencyf. is completely neglected in this application at this time.
684 PROCEEDINGS OF THE IEEE,APRIL 1975
4 0 of (14):
A 30
M
p 20
L
I 1 0 Following our desire to keep ourresults in termsof attenuation
T
u oo and to allow comparison with the homomorphic approach,we
take the logarithm of both sides of (241, divide by two, and
- 1 0 obtain
E
n
- 20
d
6 -30
U
-40
1 0 1 0 0 1000 5000
F R E Q U E N C Y [HZ1
Fig. 7. Empirically determined bass boost. Retaining our supposition about the stationary nature of s ( t ) ,
as shown inAppendixes A and C, the left-hand side of (25) is a
second sample mean estimate of half of the logarithm of the
power spectrum of the playback waveform. Considering the
expected values and variances of both sides of (251, we obtain
l , l '1
p 20 1 ! , .,
L
I 1 0
T
u oo
- 1 0
E
n
-20
d
B -30
U
-40
= $'(N)/4 = 1/(4N)
1 0 1 0 0 1000 5 0 0 0
F R E Q U E N C Y [HZ]
A -10 A 30
M M
-20 p 20
L
I -30
T
u -40
-50
E
~ -60
d
I 1
-70 -30
U U
-80 I 1 , , :I - 40
1 0 100 1000 5000 10 100 1000 5000
F R E Q U E N C Y [HZ1
Fig. 9. The averages of (25) for the same C m s o recording as used t o
produce Fig. 3.
A -IO
M
P -20
,
L
-30
O - 1 0
T E
-4 0
~ -20
E
-50
-60
d
6
U -30- 1
n
-40 1 1 1 , ,I
d
8 -70
U 10 1 0 0 1000 5000
- 80 F R E Q U E N C Y [HZ1
1 0 1 0 0 1000 5000 (b)
F R E CJ U E N C Y [HZ1 Fig. 11. Thedifferencebetween Figs. 9 and lo@). (a)Estimate of
amplitudedistortions. (b) Inverteddataproducingcompensation
(a) frequency response.
0
4 0
A - 1 0
M A 30
p -20 M
L p 20
I -30 L
T I 1 0
u - 4 0 T
U O0
-50
E
-10
~ -60 E
d
n
-20
u
e -70 d
8 -30
- 80 U
1 0 1 0 0 1000 5000
- 40
10 100 1000 5 0 0 0
F R E Q U E N C Y [HZ1
(b) F R E Q U E N C Y [HZ1
Fig. 10. The averagesof (25) for the sameBjoerlingrecordingused Fig. 12. Thefrequencyresponseof Fig. ll(b) truncated to avoid ill-
to produce Fig. 4. (a)Therawdata. (b) Thedatasmoothed in conditioning due t o surface noise.
frequencies.
being about 22 percent more stable (i.e., having smaller stan-
through (22), except that (1 9b) becomes darddeviation or being smoother)thanthehomomorphic
estimate.
v u { A (f))= 1/ ( u v ) . ( 1 9 ~ ) Nevertheless, as impliedbefore, significant differences be-
tween this power spectrum approach and the homomorphic
In other words,these two system estimators are equivalent, approach are confirmed experimentally. Specifically,
Figs. 9-1 2
(19), with the onebased upon direct power spectrum estimates correspond to Figs. 3-6 except forthis change in strategy. What
686 APRIL IEEE,PROCEEDINGS OF THE 1975
is more, notice the similarity between Figs. 8 and 12. In Fig. where &(flis the frequency response of the slowlyvarying
12, the restoration frequencyresponse agrees very closely with linear system during the ith interval. Adding noise as before,
the empirically bass boosted homomorphic estimate.Also,upon we get
listening, the two sound very much alike. Furthermore, even
though (19b) and (19c) predict that the results of Fig. 5(a) PAfl= GAfl P A f l *H ( f l + NAfl (31)
ought to be less smooththanthose of Fig. ll(a), just the instead of (12). Computing averages according to the homo-
reverse appears to be true. morphic theory leads to
At this point it is only natural to wonder about the cause of
1 N 1 N
theexperimentaldifferencesbetweenthetwoapproaches. - log IfKfll N ; log IGAfl PAfl * H(fl +NAfll. (32)
Considering the physical nature of audio signals, especially N i=1 i=1
music signals, it is somewhat obvious that the stationary as-
Proceeding as motivated by the power spectrum approach, we
sumption for the singing signal is at the heart of the issue. In
have
addition, since the power spectrum estimatesgive more natural
sounding results directly, a shadow of doubt is cast across the
advisabilityofusing homomorphic blind deconvolution for
noisy signals inspite of its attractive theoretical motivation in
the noise-free case. Somehowthismakes no sense, espe-
cially since the homomorphic theory makes no assumptions
about the stationarity of the signals involved and the power
spectral approach does. As a result, let us set out to determine
the effects of nonstationarity on bothmethods. It should be noted that (32) and (33)are no longer estimators
of true log powerspectra.This is so because the pi's are
THE EFFECTOF NONSTATIONARY SIGNALS UPON different for eachinterval.
HOMOMORPHICVERSUS POWERSPECTRUM ESTIMATESOF In the noise-free case, as is shown in Appendixes D and E ,
AMPLITUDE DISTORTIONS the expected value and variances for these expressions are
As is well known, the application of power spectrum estima-
tion techniques is permissible in the strict sense only when the
signals involved are stationary. In addition, estimates involving
time averages are possible only if the signals are ergodic. If, on
the other hand, the timevariations which characterize the non-
stationary signals change slowly enough with time, powerspec-
tralestimatesapplied to such signals maystillmake sense.
Since all power spectrum estimates used in this work involve
averages over finite lengths of time, the requirement for this
to be true is that the statistics of the signal change so slowly
in each time interval to be analyzed that the estimation calcu-
lations are virtually unaffected.
In thislight,let us modelthe singing signal s ( t ) as being
formed by passing a stationary random signal through a time-
varying linear system. Also let the variations of this system be
so slow that we can consider the system to be time-invariant
over any interval for which a Fourier analysis is to be applied. (35b)
This is equivalent to modelingthe singingwaveform as if it h
were produced by a speech synthesis system suchas commonly where L,(n represents the right-hand side of (32) and P,(fl
used in vocoders subject to the constraint that only unvoiced represents the right-hand side of (33).Comparetheseequa-
(i.e., hiss) [ 161 excitation to be used and that theparameters de- tionswith(16a),(16b),(27a),and(27b), respectively. As-
termining the frequency response of the vocal tract be slowly suming no noise (is., @N = 0 ) and that all of the pi's are
varying. Admittedly, this is a relatively simple model. It does identical, the equations become identical, and
not permit coherent components such as those produced by
the very nearly periodic vibration of the vocal chords. None- 4 s = @G * 8'(fl (36)
theless, it has served us very well in this analysis and to pro-
duce a more sophisticated working model begins to approach as is expected in the stationary case. ^Permitting the pi's to
the complexity of creating an automated singing music box. vary, we see that both E {L,} and E {P,} will in general be
Proceeding with this model and paying attention to the con- modifiedfrom thestationary result,eachbyadifferent
siderations with respect to slow variations and windows, (7) amoxnt. In
addition, while
var{Lp) remains the same,
becomes var {P,} is increased, the increase being larger the greater the
variability.
v t W = W t ) 0 b,<t) 0 h ( t ) (29) Proceeding similarly for a modemrecording, we obtain
where bt<t) is the impulse response of the slowly varying linear
system during the ith interval. Taking the Fourier transform
of both sides of (29), we obtain
K W = G,U) PXfl M f )
* (30)
STOCKHAM et al.: BLIND
ECONVOLUTION 681
where a,{n is the frequency response of the slowly varying for the homomorphic approach,and
linear systemduring theith interval of theprototype.The
assumption that the prototype and the signal to be restored
have the same statistics requires, forthehomomorphic ap-
proach, that
Fig. 1 5 . Blind deconvolution of image motion blur. (a) Photograph of Fig. 16. Blind deconvolutionof lens defocus blur. (a) Photograph of
a sign blurred in a real camera. (b) Restoration of (a) achieved through bunding blurred in a real camera.(b). Restoration of (a) achieved
homomorphic blind deconvolution. through blind homomorphic deconvolutlon.
of the cepstrum has yielded the three parameters.Once the For image deblurring, the choice between the homomorphic
parameters are found, we form the phase compensation func- approachandthe powerspectrumapproach is based upon
tion from the appropriate known analytic expressions, we as- differentconsiderationsthan for dereverberatingsound. The
sociate that phase with the amplitude compensation of (21), reason is that the noise is signal dependent.
and we proceed with the digital filtering. Consider a multiplicative noise model. Also note that image
Figs. 15 and 16 demonstrate some results of o w experiments signals arenonstationaryinmuchthesame way as sound
with blurred images. In both examples, the blurring degrada- signals. For example, in one area, an image may be very dark
tions were produced through deliberate misuse of a real cam- and shadowy, resultingin low signal amplitudes, and in another
era. The restorations shown were then produced by the blind area, the image may be well illuminated and bright, resulting
deconvolution methods just outlined. in high s i g n a l amplitudes. The result of all this is that the noise
690 PROCEEDINGS OF THE IEEE, APRIL 1975
amplitudes will also be small in the dark areas and high in the Using (A3), it can be shown [ 181 that
bright areas.
These effects reduce the perturbations of the pi's in (43) and
E {z} = E {log ( y ) }= log p,, -C (A44
(44) caused by the presence of Ni(f) in (31). The result is that var{z} = n2/6 = 1.6449341 * - (A4b)
(43) and (44) possess more nearly equal biases than in the case
where Cis Euler's constant ( C = 0.57721 e).
of additive noise and produce results closer to those of (1 9a).
This fact reduces the bias problems previously encountered
with the homomorphic approach.
we now let y - -
Generalizing for sums of independent random variables,
2~and z log &. It has been shown by
Bartlett and Kendall[22] that
In contrast, the smoothness of the estimatesproduced by
the two approaches remains much the same as in the additive E{z} = log py + $(N) - log ( N ) log py (A5a)
noise case. Specifically, for multiplicative noise, the dynamic
range of the image signals is greater and (42b) deviates more var {z} = $ '(N) = 1/N (A5b)
from (1 9c). Thus, in the case of images, the two attenuation where $ ( N ) is the digamma function.' The derivation of (A5)
estimates are both biased by the same amount, and inthe involves development of theappropriate characteristicfunc-
direction away from ill-conditioning, while for large dynamic tion and is beyond the scope of this paper. The reader is
range signals, the homomorphic approach provides smoother referred t o [221 for details.
results. Using a Euler-Maclaurin expansion [23],
CONCLUSIONS $(N)=log(N)- 1/(2N)- 1/(12W2+ l/(120N)4- * a * .
y -
Consider, f i t , two random variables, y and z = log ( y ) with
d (x' with2 degrees of freedom),E { y } = py , and
var{y} = u$ = p$. Noting that a d
distribution is equivalent
var{ '2
N i=1
log&(f)12} =n2/6N= 1.6449341 - * - / N (Blb)
to an exponential distribution,
it follows that p y ( y ) =
k * exp (-ky),where p y ( y ) is the probability density distribu- where #p = E{Pt{f)12} is thetrue powerspectrum of p ( t )
tion associated withy and k = l/py. underthe usual assumptions of powerspectralestimation.
2
We will now derive pz(z) (called a log distribution with 2 Noting that
degrees of freedom and denoted log d). Recall that z = f l y )=
log ( y ) . Under appropriate conditions [ 2 1 1,
PAZ) = Py(f-l(Z)) * I(d/dz)f-'(z)l. (AI)
Since df-'(z)/dz = f'(z) = exp (z), itfollows that we have, as desired,
APPENDIX C
Equations (26a) and (26b) may be derived as follows. v u { t , ( f ) ) = (i)
{2
var
1 N
log IGi(f) * ~ ( f ) 1 2 }
As in Appendix B, we will consider IPiffl12to be distributed
as X:. Accordingl , ( 1 / N ) Zgl CPi(f)l' is distributed as X ~ N(Wb)
, = nZ/(24N)
K
and log ((l/N) CZz11pz&fl12) as log 2 ~ Again, . using the where q 5 ~= E{IG&fll'} is the true power spectrum of g(f) under
results of Appendix A, we have
the appropriate conditions.
= );( 1% $ (C 1a)
Assuming the process is noiseless, we may write
1 N
{):( [f 2 Pt091'] /
p p ~=)- IGi(f) * ~ ( f Pi(n12
) (El)
N i=l
vu log = $'(N)/4 zz 1/(4N) (C1 b)
and
where 6 = E {lPt&fl12} is the true power spectrumof p ( f )under
the usual assumptions of power spectral estimation. The ap-
proximations of (Cl) are valid for largeN(e.g., N > , 20). It is
interesting to note that as N -P 00,
Note that if a l l the &(f)’s are identical, then (E6) becomes [ 5 1 R. B. Smith and R. M. Otis, “Homomorphic deconvolution by log
spectral averaging,” submitted for publication t o Geophysics.
K = N as expected for a stationary process. [ 6 ] T. G. Stockham, Jr., “A-D and D-A converters: Their effect on
Using this approximation with the results of Appendix A, digital audio fidelity,” in DiB’ral Signal Processing, L. R. Rabiner
we have, as desired, and C. M. Rader, Eds. New York:IEEE Press, 1972, pp.
484-496.
[ 7 ] S. Kriz, “A 16-bit A-D-A conversionsystemfor high fidelity
audio research,” in Proc. ZEEE Symp. Speech Recognition, pp.
278-282, Apr. 1974.
[ 8 ] C.-M. Tsai, “A digital techniquefortestingA-Dand D A con-
verters,” M.S. thesis, Univ. Utah, Salt Lake City, June 1973.
[ 9 ] 0. Read and W.L. Welch, From Tin Foil to Stereo. Indianapolis,
Ind.: Howard W. Sams, 1959.
[ l o ] A. V. Oppenheim, R. W. Schafer, and T. G. Stockham, Jr., “Non-
linear fdtering of multiplied and convolved signals,” h o c . ZEEE,
VOl. 56, pp. 1264-1291, Aug. 1968.
[ 11] M. Medress, “Noise analysis of a homomorphic automatic volume
control,” S.M. thesis, Dep. Elec. Eng.,M.I.T., Cambridge, Mass.,
Jan. 1968.
[ 121 R. W. Schafer, ‘‘Echo removalbydiscrete generalized linear
fdtering,” Tech. Rep. 466, M.I.T. Res. Lab. Electron.,Cambridge,
var{iiP(f)}
(-3 $’(K) 2: 1/(4K) Mass., Feb. 1969.
[ 131 J. L. Goldstein, “Auditory spectral fdtering and monaural phase
perception,” J. Acoust. Soc. Amer., vol. 41, no. 2, pp. 458-479,
1967.
[ 141 B. Gold and C. M. Rader, Digital ProcesPing of Signals. New
York: McGraw-Hill, 1969, pp. 203-232.
[ 1 5 ] H. D. Helms, “Nonrecursive digital fdters: Design methodsfor
where the last terms ineach equation are valid for large N . achieving specifications on frequencyresponse,” ZEEE Trans.
Audio Elecrroacoust.,vol. AU-16, pp. 336-342, Sept. 1968.
[ 161 J. L. Flanapn, Speech Analysis Synthesis and Perception, 2nd
ACKNOWLEDGMENT Ed. New York: Springer-Verlag, 1972, pp. 321-395.
The authors wish t o thank the people who have helped along [ 171 R. B. Ingebretsen, “Log spectralestimationforstationaryand
nonstationary processes,’’ M.S. thesis, Comput. Sci. Dep.,Univ.
the course of research leading to the ideas presentedhere. Utah, Salt Lake City, 1975.
They are grateful for the contributions of G.Randall, R. B. [ 181 E. R. C d e , “The removal of unfiown image blurs by homomor-
phic filtering,” Comput. Sci. Dep.,Univ. Utah, Salt Lake City,
Warnock, M. Milochik, K. Gerber, N. J. Miller, R. Rom, E. UTECCSc-74-029, June 1973.
Ferretti,and many others who have given encouragement, [ 191 T. M. Cannon, “Digital image deblurring by nonlinear homomor-
interest, criticism, and ideas. Special thanks are due D. W. phic filtering,” Comput. Sci. Dep.,Univ. Utah, Salt Lake City,
UTECCSC-74-091,AUg. 1974.
Evansof Salt Lake City, Utah, for the acoustic restoration [20] J. F. Walkup and R.C. Choens, “Image processing in signal-
idea, and S. B. Fassett of Boston, Mass., for his deep insight dependent noise,” Opr. Eng., vol. 13, no. 3, May/June, 1974.
into and interest in the technical, musical, and historic aspects [ 2 11 E. Parzen, Modem Probability Theory and Its Applications. New
York: Wdey, 1960, p. 312.
of acoustic recordings. [22] M. S. Bartlettand D.G. Kendall, “Thestatisticalanalysisof
variance heterogeneityandthelogarithmictransformation,” J.
REFERENCES Res. Statist. Soc. (Suppl.),vol. 8, pp. 128-138, 1946.
[23] CRC Standard Mathematical Tables, 22nd ed., Samuel M. Selby,
J. Makhoul, “Linear prediction: A tutorial review,” this issue, pp. Ed. Cleveland, Ohio: CRC Press, 1973, p. 483.
561-580. [24] R.B. Blackman and J. W. Tukey, TheMeasurement of Power
R. W. Schaferand L. R. Rabiner, “Digital representationsof Spectra, New York: Dover,-1958, p. 22.
speech signals,” this issue, pp. 662-677. (251 P. D. Welch, “The use of fast Fourier transform for the estimation
L. C. Wood and S. Treitel, “Seismic signal processing,” this issue, of power spectra: A method based on time averages over short,
pp. 6 4 9 6 6 1 . modified periodograms,” IEEE Trans. Audio Elecrroacousr., vol.
B. R. Hunt, “Digital image processing,” this issue, pp. 693-708. AU-15, pp. 70-73, June 1967.