Documente Academic
Documente Profesional
Documente Cultură
AND PERFORMANCE
ESTIMATION CONTD
Prof. N.S.Murthy,
PPKKP/UNIMAP
murthy@unimap.edu.my
02/09/16
EMT251_NSM_09
V in
V out
CL
CMOS Inverter
N Well
VDD
VDD
PMOS
Contacts
PMOS
In
Out
In
NMOS
Out
Metal 1
Polysilicon
NMOS
GND
Two Inverters
Share power and ground
Abut cells
VDD
Connect in Metal
CMOS Inverter
First-Order DC Analysis
V DD
V DD
Rp
V out
V out
Rn
V in 5 V DD
V in 5 0
VOL = 0
VOH = VDD
VM = f(Rn, Rp)
PMOS
Vin = 0
Vin = 2.5
Vin = 0.5
Vin = 2
Vin = 1
Vin = 1.5
Vin = 1.5
Vin = 2
Vin = 2.5
NMOS
Vin = 1
Vin = 1.5
Vin = 1
Vin = 0.5
Vin = 0
Vout
2.5
Vout
NMOS s at
PMOS res
1.5
NMOS sat
PMOS sat
0.5
NMOS res
PMOS sat
0 .5
1 .5
NMOS res
PMOS off
2 .5
Vin
VM
V in
V OL
V IL
V IH
A simplified approach
0.2
0.15
Vout (V)
Vout (V)
1.5
0.1
1
0.05
0.5
0
0
Gain=-1
0.5
Vin (V)
1.5
2.5
0
0
0.05
0.1
Vin (V)
0.15
0.2
Simulated VTC
2.5
Vout (V)
1.5
0.5
0.5
Vin (V)
1.5
2.5
Good PMOS
Bad NMOS
Vout(V)
1.5
Nominal
1
Good NMOS
Bad PMOS
0.5
0
0
0.5
Vin (V)
1.5
2.5
Propagation
Delay
tpHL = f(Ron.CL)
= 0.69 RonCL
Vout
CL
Ron
ln(0.5)
Vout
1
VDD
0.5
0.36
Vin = V DD
RonCL
CMOS Inverters
VDD
PMOS
1.2m
=2
In
Out
Metal1
Polysilicon
NMOS
GND
Transient Response
3
2.5
tp = 0.69 CL
(Reqn+Reqp)/2
Vout(V)
1.5
1
tpHL
tpLH
0.5
0
-0.5
0
0.5
t (sec)
1.5
2.5
-10
x 10
tp(normalized)
4.5
4
3.5
3
2.5
2
1.5
1
0.8
1.2
1.4
1.6
VDD(V)
1.8
2.2
2.4
Device Sizing
-11
3.8
x 10
3.6
3.4
tp(sec)
3.2
3
2.8
Self-loading effect:
Intrinsic capacitances
dominate
2.6
2.4
2.2
2
8
S
10
12
14
NMOS/PMOS ratio
-11
x 10
tpHL
tpLH
tp(sec)
4.5
= Wp/Wn
tp
4
3.5
3
1
1.5
2.5
3.5
4.5
Inverter
Sizing
Inverter Chain
In
Out
CL
If CL is given:
- How many stages are needed to minimize the delay?
- How to size the inverters?
May need some additional constraints.
Inverter Delay
Minimum length devices, L=0.25m
Assume that for WP = 2WN =2W
same pull-up and pull-down currents
approx. equal resistances RN = RP
approx. equal rise tpLH and fall tpHL delays
Analyze as an RC network
1
1
WP
WN
RP Runit
Runit
RN RW
Wunit
Wunit
Delay (D): tpHL = (ln 2) RNCL
Load for the next stage:
C gin
W
3
Cunit
Wunit
2W
RW
CL
RW
Load (CL)
tp = k RWCL
k is a constant, equal to 0.69
Assumptions: no load -> zero delay
Wunit = 1
Delay
2W
W
CN = Cunit
Cint
CL
Load
Example
In
C1
Out
1
f2
CL= 8 C1
f 38 2
Buffer Design
1
16
2.8
22.6
tp
64
64
65
64
18
64
15
64
2.8
15.3
Power
Dissipation
Leakage
Leaking diodes and transistors
Vin
Vout
CL
Energy/transition = CL * Vdd2
Power = Energy/transition * f = CL * Vdd2 * f
E 0 1 = CL Vdd V dd Vt
Vin
Vout
CL
IVDD (mA)
0.15
0.10
0.05
0.0
1.0
2.0
3.0
Vin (V)
4.0
5.0
Vdd =3.3
Pnorm
5
4
Vdd =2.5
3
2
1
0
Vdd =1.5
0
tsin/tsout
Leakage
Vdd
Vout
Drain Junction
Leakage
Sub-Threshold
Current
p+
p+
ReverseLeakageCurrent
+
V
dd
IDL=JSA
2
fora1.2mCMOStechnology
JS = 10-100 pA/m2
at 25 deg C for 0.25m CMOS
JS =15pA/m
Jsdoublewithevery9oCincreaseintemperature
Istat
Vin =5V
Vout
CL
Wasted energy
Impact of
Technology
Scaling
Technology Scaling
Goals of scaling the dimensions by 30%:
Reduce gate delay by 30% (increase operating
frequency by 43%)
Double transistor density
Reduce energy per transition by 65% (50% power
savings @ 43% increase in frequency
1999
Technology node
[nm]
180
Supply [V]
2000
2001
2004
2008
2011
2014
130
90
60
40
30
0.6-0.9
0.5-0.6
0.3-0.6
9-10
10
3.5-2
7.1-2.5
11-3
14.9
-3.6
Wiring levels
6-7
6-7
Max frequency
[GHz],Local-Global
1.2
90
106
130
160
171
177
186
1.4
1.7
2.0
2.4
2.1
2.3
2.5
1.6-1.4 2.1-1.6
10
10
10
10
-1
-2
10
1960
1970
1980
1990
Year
2000
2010
Propagation Delay
10
1
0.1
0.01
80
MPU
DSP
85
Year
90
95
100
100
10
1
1
Scaling Factor
normalized by 4 m design rule
(b) Power density vs. scaling factor.
From Kuroda
10
General Scaling
most realistic for todays situation
voltages and dimensions scale with different factors
Processor Scaling
Processor Power
Processor Performance
2010 Outlook
Performance 2X/16 months
1 TIP (terra instructions/s)
30 GHz clock
Size
No of transistors: 2 Billion
Die: 40*40 mm
Power
10kW!!
Leakage: 1/3 active Power
P.Gelsinger: Processors for the New Millenium, ISSCC 2001