Documente Academic
Documente Profesional
Documente Cultură
3, JUNE 2009
Authorized licensed use limited to: University of Ottawa. Downloaded on January 7, 2010 at 20:30 from IEEE Xplore. Restrictions apply.
LI et al.: A FAST DIGITAL PD ALGORITHM FOR RF PA LINEARIZATION 375
Authorized licensed use limited to: University of Ottawa. Downloaded on January 7, 2010 at 20:30 from IEEE Xplore. Restrictions apply.
376 IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, VOL. 3, NO. 3, JUNE 2009
where
(7)
Authorized licensed use limited to: University of Ottawa. Downloaded on January 7, 2010 at 20:30 from IEEE Xplore. Restrictions apply.
LI et al.: A FAST DIGITAL PD ALGORITHM FOR RF PA LINEARIZATION 377
Fig. 5. Performance comparison between conventional LUT and ML-LUT Fig. 6. Evaluation of the amplitude-difference correlation function.
with uniformly distributed random input signal: (1) convergence time for
LUT, (2) convergence time for ML-LUT, (3) MSE for LUT, and (4) MSE for
ML-LUT.
Note that the feedback signal is a severely distorted (stretched
and rotated) version of the input signal initially. However, the
word-length effect—the coefficient update stops when the fol- AM-AM distortion curve is almost monotonic for the input sig-
lowing condition holds nals below the saturation level, shown in Fig. 2, especially for
OFDM signals, most of which are located far away from the sat-
uration region. This fact guarantees that larger input amplitude
(9) always results in a larger feedback signal; thus, the polarity of
the amplitude-difference between neighboring samples will be
Note that the step size for each table in the -level ML-LUT is retained even with the PA’s distortion, justifying the use of the
only of that in the -LUT; and stalling is more signif- amplitude-difference correlation to determine the integer loop
icant in the ML-LUT case when the word lengths are the same. delay. The delay that maximizes the correlation function is
Further experiments reveal that, with larger step sizes, the MSE the closest integer delay of the loop. Fig. 6 shows the ampli-
difference between the two methods becomes increasingly neg- tude-difference correlation function with under var-
ligible. ious estimated delay (horizontal axis), where the actual in-
A similar exploitation of the features of coarse and fine tables teger delay is set to 3 clock cycles or unit intervals (UIs) with
has been reported in the broadcasting technique [21]. However, a fractional delay of 0, 0.5 UI and 0.9 UI, respectively. For the
there the characteristic is temporal and only exists in the initial- case of 3.5-UI delay, the integer part is estimated to be 3 UIs
ization phase. The ML-LUT method proposed here retains the and the residual fractional part is 0.5 UI; while for the case of
interdependence between multi-tables in a hardwired configu- 3.9-UI delay, the integer part is estimated to be 4 UIs and the
ration, thereby enabling the scheme to track time-varying PA residual part is 0.1 UI. Also note that the multiplication in
characteristics at all times without losing compensation accu- (10) can be replaced by an XOR function, and (11) can be real-
racy. ized by a comparator. The architecture proposed here not only
significantly simplifies the hardware implementation, but also
enhances the estimation robustness over the PA’s gross nonlin-
B. Integer Loop Delay Estimation earity.
Fig. 7 illustrates the implementation of the integer delay es-
The loop delay compensation is accomplished in two steps. timator, which searches the delay from 0 up to 7 UIs. When
In the first step, an integer delay is estimated from the ampli- the peak of the correlation function is found, the Delay Locked
tude-difference correlation function of the input signal and the signal is asserted, which stops the counter and subsequently out-
feedback signal: puts . The decision threshold is set at to desensitize the
algorithm to the effect of random noise.
(10) C. Fractional Loop Delay Estimation and Compensation
The residual fractional loop delay is located in the range
where is the sequence length to calculate the correlation, of (-UI, UI) after the integer delay has been corrected, and can
is the estimated integer delay, and the amplitude-difference be compensated by a 4-tap FIR interpolation filter with a modi-
function is defined as fied Farrow structure [30]. The Farrow FIR filter that produces a
positive delay is revised here to accommodate both the positive
(11) and negative fractional delays (shown in Fig. 8). In either case,
Authorized licensed use limited to: University of Ottawa. Downloaded on January 7, 2010 at 20:30 from IEEE Xplore. Restrictions apply.
378 IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, VOL. 3, NO. 3, JUNE 2009
(13)
(14)
(16)
(12)
where (19)
Authorized licensed use limited to: University of Ottawa. Downloaded on January 7, 2010 at 20:30 from IEEE Xplore. Restrictions apply.
LI et al.: A FAST DIGITAL PD ALGORITHM FOR RF PA LINEARIZATION 379
TABLE I
HARDWARE COMPLEXITY OF VARIOUS PD TREATMENTS
Following the same procedure, we can obtain a similar itera-
tive equation for the case of . The overall fractional delay
estimation is summarized as
.
(21) A. Emulation Platform
Note that a larger block length will improve the stability of the
algorithm, however at the cost of a slow convergence and a de- In order to evaluate the proposed ML-LUT scheme with
graded tracking performance. Fig. 9 shows the learning curve loop delay compensation and to compare its performance with
with a block length of 32. other PD approaches, a hardware emulation platform was
Fig. 10 shows the implementation of the fractional delay es- constructed using an Altera Stratix II FPGA, which includes a
timator with a block length of 32. Fig. 11 illustrates the revised 7-level ML-LUT PD with loop delay compensation, a conven-
4-tap Farrow FIR filter, where the multiplexers are controlled tional 64-LUT PD, and a 5th-order polynomial PD. Fig. 12 is
by the sign from the fractional delay estimator. The parameter the block diagram of the FPGA emulation platform, including
is set to 0.25 for both hardware simplicity and interpolation a baseband signal generator, a PA model, an MSE calculator,
accuracy in this work. Hence, there are only two real multipliers a readout FIFO, and some control logics. Table I lists the
required for each of the I- and Q-channel. hardware costs of the three PD approaches.
Authorized licensed use limited to: University of Ottawa. Downloaded on January 7, 2010 at 20:30 from IEEE Xplore. Restrictions apply.
380 IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, VOL. 3, NO. 3, JUNE 2009
TABLE II
MSE AND ACPR PERFORMANCE
(22) Fig. 14. AM-AM curves of the Class-B PA w/ and w/o ML-LUT PD.
where and .
In the experiment, a 64-QAM OFDM signal was adopted
as the baseband input signal, which consists of 64 subcarriers C. Steady-State Performance
with a 20-MHz bandwidth, an 11-dB peak-to-average power
ratio (PAPR), and a 0-dB peak back-off (PBO). A typical 4 Table II summarizes the steady-state MSE and adjacent
oversampling, i.e., a sample rate of 80 MHz, was assumed with channel power ratio (ACPR) performance of the three PD algo-
10-bit DAC and ADC in the TX and RX, respectively. The pre- rithms upon training. It is apparent that the two LUT schemes
distorter is initialized as “transparent,” i.e., the output equals the exhibit comparable steady-state performance, and both are
input at the beginning. The emulation runs at an actual clock fre- better than that of the polynomial approach. Fig. 14 shows the
quency of 50 MHz. Some experimental results are discussed in PA transfer curve with and without the ML-LUT PD. Note
detailss in this section. that the compensated curve is drawn with data from the actual
emulation; hence, the data points of large amplitude are rare
due to the 11-dB PAPR of the OFDM signal.
B. Convergence
D. Tracking Performance
The learning curves of three adaptive predistorters, i.e., the
fifth-order polynomial, 64-LUT, and 7-level ML-LUT, during A simplified time-varying PA was modeled as follows:
initialization are shown in Fig. 13, where each iteration con-
sists of 256 samples. The step sizes for the LUT methods are
7/32 as before, while the step size for the polynomial PD is set (23)
to 0.05, nearly the maximum value for an acceptable MSE in (24)
steady state. The emulation results indicate that the proposed
ML-LUT scheme converges significantly faster than the conven- where the PA’s AM-AM and AM-PM responses are assumed to
tional LUT PD and exhibits lower steady-state errors than the vary with time in a sinusoidal fashion— denotes the variation
polynomial PD. In addition, the conventional LUT curve shows frequency, is the peak AM-AM variation, which is set to
occasional large error spikes that are mainly attributable to the 10%, and is the peak AM-PM variation, which is also set to
rarely updated LUT cells residing at the upper end. These spikes 10% of the maximum phase shift around 20 .
severely degrade the performance of the algorithm in the steady Experimental results demonstrate that the MSE rises with
state. Note that this phenomenon largely disappears in the pro- the increase of for all PD algorithms (Fig. 15). The proposed
posed ML-LUT approach. ML-LUT is most insensitive to fast variations—capable of
Authorized licensed use limited to: University of Ottawa. Downloaded on January 7, 2010 at 20:30 from IEEE Xplore. Restrictions apply.
LI et al.: A FAST DIGITAL PD ALGORITHM FOR RF PA LINEARIZATION 381
Fig. 17. PA output spectra with a 0.5-UI loop delay: (1) without PD, (2) with
Fig. 15. Tracking performance of the three PD algorithms.
ML-LUT PD alone, (3) with ML-LUT PD and loop-delay compensation, and
(4) with ideal PA.
TABLE III
WORD-LENGTH EFFECT ON MSE
TABLE IV
STEP-SIZE EFFECT ON MSE
Authorized licensed use limited to: University of Ottawa. Downloaded on January 7, 2010 at 20:30 from IEEE Xplore. Restrictions apply.
382 IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, VOL. 3, NO. 3, JUNE 2009
Authorized licensed use limited to: University of Ottawa. Downloaded on January 7, 2010 at 20:30 from IEEE Xplore. Restrictions apply.
LI et al.: A FAST DIGITAL PD ALGORITHM FOR RF PA LINEARIZATION 383
Dae Hyun Kwon (S’08) received the B.S. degree for Excellent Teamwork from Aplus Design Technologies in 2001, the Arnold
in electronics engineering from Korea University O. Beckman Research Award from UIUC in 2007, the National Science
in 2002 and the M.S. degree from the School of Foundation CAREER Award in 2008, and the ASPDAC Best Paper Award in
Electrical Engineering and Computer Science, Seoul 2009. He was included in the List of Teachers Ranked as Excellent in 2008.
National University, Seoul, Korea, in 2004, focusing
on offset-PLL RF transmitter for GSM wireless
system. He is currently pursuing the Ph.D. degree in
electrical and computer engineering at the University Yun Chiu (S’97–M’04) received the B.S. degree in
of Illinois at Urbana-Champaign in the area of physics from the University of Science and Tech-
CMOS RF circuits and systems with emphasis on nology of China, Hefei, the M.S. degree in electrical
RF power amplifier and its efficiency enhancement engineering from the University of California at Los
techniques. Angeles, and the Ph.D. degree in electrical engi-
neering and computer sciences from the University
of California at Berkeley.
From 1997 to 1999, he was with CondorVision
Deming Chen (M’01) received the B.S. degree from Technology Inc. (later Pixart Technology Inc.),
the University of Pittsburgh, Pittsburgh, PA, in 1995 Fremont, CA, where he was a Senior Staff Member
and the Ph.D. degree from the University of Cali- in charge of developing data converters for CMOS
fornia at Los Angeles in 2005, all in computer sci- digital imaging products. In 2004, he joined the Department of Electrical and
ence. Computer Engineering, University of Illinois at Urbana-Champaign, where he
He was a Software Engineer between 1995-1999 is now an Assistant Professor. He holds one U.S. patent.
and 2001–2002. He joined the Electrical and Dr. Chiu is has received many awards and honors from academia and in-
Computer Engineering Department, University of dustry. At UCLA, he was the recipient of the Foreign Scholar Award in 1994.
Illinois at Urbana-Champaign (UIUC), as a Faculty At Berkeley, he received the Regents’ Fellowship (1999), the Intel Fellowship
Member in 2005. His current research interests (2001), the Cal View Teaching Fellow Award (2003), and the Outstanding Over-
include nano-systems design and nano-centric seas Student Award from the Ministry of Education of China (2005). In addi-
CAD techniques, FPGA synthesis and physical design, high-level synthesis, tion, he received the Jack Kilby Award from the International Solid-State Cir-
microprocessor architecture design under process/parameter variation, and cuits Conference (ISSCC) in 2005, was a co-recipient of the 46th DAC/ISSCC
reconfigurable computing. Student Design Contest Award in 2009, and recipient of the Chun-Hui Award
Dr. Chen is a Technical Committee Member for a series of conferences and for foreign visiting scholars from the MOE of China in 2006. He served on the
symposia, including FPGA, ASPDAC, ICCD, ISCAS, RAW, FPL, VLSI-DAT, Technical Program Committees of the Custom Integrated Circuits Conference
ISQED, DAC, and SASP. He also served as a Session Chair for some of these (CICC), the Asian Solid-State Circuits Conference (ASSCC), the International
and other conferences and symposia. He is a Technical Program Committee Symposium on VLSI Design, Automation, and Test (VLSI-DAT), and the In-
Subcommittee Chair for ASPDAC’09-10 and a CAD Track Co-Chair for ternational Conference on Solid-State and Integrated-Circuit Technology (IC-
ISVLSI’09. He is an Associate Editor for the IEEE TRANSACTIONS ON VERY SICT).
LARGE SCALE INTEGRATION SYSTEMS. He received the Achievement Award
Authorized licensed use limited to: University of Ottawa. Downloaded on January 7, 2010 at 20:30 from IEEE Xplore. Restrictions apply.