Documente Academic
Documente Profesional
Documente Cultură
A. Astarloa
ABSTRACT
This paper presents a new Single Event Upset (SEU),
Multiple Bit Upset (MBU) and Single Hardware Error (SHE)
mitigation strategy to be used in Virtex-4 FPGAs. This strategy aims to increase not only the effectiveness of traditional
Triple Module Redundancy (TMR), but also the overall system availability. Frame readback with ECC detection and
frame scrubbing are combined in a dynamically reconfigurable TMR architecture, designed under both spatial and implementation diversification premises. Moreover, since the
strategy works on the devices bitstream domain, the basis
for Virtex-4 FPGAs bitstream definition are also shown.
1. INTRODUCTION
Since the introduction of SRAM-based Field Programmable
Gate Arrays (FPGAs) around two decades ago, this technology has progressively consolidated a place in the current
competitive electronic market. Nowadays they are systematically being used for embedding complex digital systems in
a single chip (SoC). In this way, the experienced increase
in density brought about by smaller SRAM cells and logic
structures, makes these state-of-the-art reconfigurable logic
devices very attractive for modern applications.
However, SRAM technology is specially sensitive to radiation induced SEUs. According to released data from the
last Rosetta experiment [1], Virtex-4 FPGA devices present
238 FIT/Mb1 in the configuration memory and 379 FIT/Mb
in Block RAM memories due to SEU effect. Moreover,
MBUs are starting to be also a matter to be addressed in
very deep sub-micron process technologies such as those
used for FPGAs fabrication. The NASA Jet Propulsion Laboratory (JPL) points out in [2] that MBUs are nearly three
times more likely to occur in the Virtex-4 family (90 nm)
than in the Virtex-II family (130 nm), and 69 times more
1 One
569
570
VWU 9X
:9 89: ;
Y 7Z; 8 [
:
\7Z]
^
<=>?>@ABC =DEF / KLL ?MND@I =DEIF O,0
!"
,
O/.
P
4
QRS22T
/ -1
112
+++
/2,
-,3H
P
44
QRS//T
H/1
-/00
+++
#$%
#$%
+++
#$%
#$%
&'%
#$%
(# $)
#$%
#$%
&'%
&'%
&'%
#$%
#$%
#$%
#$%
#$%
#$%
&'%
&'%
&'%
#$%
+++
,
#$%
45 677 89: ;
<=>?>@ABC =DEF GH. - II H0, J
KLL ?MND@I =DEIF G-/1HII-0,OJ
5U \7Z] 89:;
&'%
&'%
#$%
#$%
+++
+++
#$%
+++
+++
-,
&'%
&'%
#$%
#$%
*
#
$
)
&'%
-/
VWU 9X
:9 89:;
Y 7Z; 8
^ :
\7Z]
^
\7Z] 9X ^
#$%
#$%
+++
+++
&'%
&'%
#$%
#$%
#$%
&'%
#$%
#$%
#$%
#$%
#$%
#$%
&'%
&'%
#$%
-0
#$%
#$%
+++
+++
-.
#$%
-1
+++
#$%
2,
2.
,
+++
&'%
23
IOB
CLB
IOB
VCLK
CLB
DSP48
CLB
IOB
BRAM Conn.
BRAM Data
Major addr.
Minor addr.
0
029
112
021
13
029
14
02
1518
021
19
020
2027
021
28
029
02
019
02
063
571
t
oq
oq
nopo
qrstuv PPC qrstuv
wxyz{
wyz{
405
|}}~
|}}~
pq
qn
rsr
s
pq
qn
s
rsr
pq
qn
rsr
:;
.
/01234156
-
" $$ %%",(
"!#%%"$%&'( 7$+7*+89#8&+&*
!
)
*
&+
,
one TMR module, next section describing SHE recovery actions are undergone, or else the ICAP port access is disabled
in order to prevent a potential self-corruption of the configuration memory content due to system malfunctioning.
-
572
PRSEUMBU
BM Scr
z }| {
M T T R = Nf T W b f
M T BE [in hours] =
Nf T
Nf
| {z }
Mf Rb
}|
{ z
FITRAD
Nf
+(
FITRAD + FITSHE
2
109
4
(
+
FITRAD
3
|{z}
Spatial Diversity
Imp. Diversity
}|
PRSHE
Mi Rp
}|
{
FITSHE
FITRAD + FITSHE
109
Nf TRb f
)
FITSHE Nf TRb f 2 MTTM
{z
}
|
z
z }| {
TRb f + Nf TW b f )
(1)
(2)
PRFE
6. STRATEGY CHARACTERIZATION
7. CONCLUSIONS
In this paper a novel strategy for SEU, MBU and SHE handling in Virtex-4 FPGAs has been described, as well as the
basis used for its development. The strategy has been firstly
validated in a self-reconfigurable TMR scheme and then characterized by means of a single analytical model which incorporates experimentally obtained results.
8. REFERENCES
[1] Xilinx Inc., Reliability report, 4th quarter 2008, UG116.
[2] M. Berg, Assessing and mitigating radiation effects in
Xilinx FPGAs, JPL Publication, 2008.
[3] C. Constantinescu, Trends and challenges in VLSI circuit
reliability, IEEE Micro, vol. 23, no. 4, pp. 1419, 2003.
[4] S. Suresh, M. Prasanth, X. Yuan, N. Vijaykrishnan, and
S. Karthik, FLAW: FPGA lifetime awareness, in IEEE
Design Automation Conference, 2006, pp. 630635.
[5] Xilinx Inc., Xilinx TMRTool user guide, UG156, 2006.
[6] K. Chapman and L. Jones, SEU strategies for Virtex-5 devices, Xilinx Inc. XAPP864, 2009.
[7] A. Lesea and P. Alfke, Xilinx FPGAs overcome the side
effects of sub-90 nm technology, Xilinx Inc. WP256, 2007.
[8] M. Berg, C. Poivey, D. Petrick, D. Espinosa, A. Lesea, K. A.
Label, M. Friendlich, H. Kim, and A. Phan, Effectiveness of
internal versus external SEU scrubbing mitigation strategies
in a Xilinx FPGA: Design, test, and analysis, IEEE Trans.
on Nuclear Science, vol. 55, no. 4, pp. 22592266, 2008.
[9] S. Pontarelli, M. Ottavi, V. Vankamamidi, G. Cardarilli,
F. Lombardi, and A. Salsano, Analysis and evaluations of reliability of reconfigurable FPGAs, in Journal of Electronic
Testing, vol. 24, no. 1, 2008, pp. 105116.
[10] W. E. Cory, D. P. Schultz, and S. P. Young, Error checking
parity and syndrome of a block of data with relocated parity
bits, U.S. Patent 7 426 678, Sept. 16, 2008.
[11] A. Avizienis, J. C. Laprie, B. Randell, and C. Landwehr, Basic concepts and taxonomy of dependable and secure computing, IEEE Trans. on Dependable Secure Comput., vol. 1,
no. 1, pp. 1133, 2004.
573