Documente Academic
Documente Profesional
Documente Cultură
Effect of increasing chip density on the evolution of computer architectures, IBM J. Res & Dev, Vol. 46. Brendan Repeater scaling and its impact on CAD, IEEE Trans. on CAD, Vol. 23(4) Elif SOI technology for the GHz era, IBM J. Res. & Dev., Vol. 46. Cesare Turning Silicon on Its Edge, IEEE Circuits & Devices Magazine, Jan/Feb04. Yiwen
Processor Evolution New generations depending on prediction algorithms Performance benefit decreasing Sometimes simpler is better!
5
The Current Techniques (Benchmarks) Increasing pipeline depth and frequency Fewer applications responding well
Cellular Architectures Little communication overhead between threads Connectionist architecture large number of processors with little memory Advantages
Off-the-shelf commodity parts Use existing compilers Possibilities of redundancy
7
System-on-a-Chip Integrate functions that are outside processor Reduce communication costs between elements Current state performance decrease when combining technologies on one die Help with clock skew
Conclusions Convergence of processors Less focus on more computation power Scalable, distributed computing
10
Introduction:
FROM LAST LECTURE: Cmos scaling in VLSI chips bring new design concerns:
increasing dominance of interconnects leakage
In this paper : Results of scaling studies in the context of typical block level wiring distributions, and study the impact of the identified trends on post-RTL design process. Goal of the paper: To show how does exponentially increasing repeater and clocked repeater count will effect logic synthesis, technology mapping, layout and new research problems relevant to future designs.
11
CRL for M3 & M6 shrink at the rate of 0.57x per generation ~ faster than normal scaling of 0.7x
CSL shrink at a rate of 0.43x per generation ~ faster than normal scaling and the rate of decrease in CRL
Additional repeaters need to be added during a shrink of an optimal repeated interconnect from one process generation to next.
Ideally shrink interconnects wont only require additional repeaters but many of them need to be clocked
12
# of nets requiring repeaters ~ area under histogram curve to right of line representing critical length left migration of critical length exponentially increasing # of nets
13
Increasingly steep slope of curve (log scale on y-axis) => # impacted nets exploding!
14
15
Gate count metric can lead to wrong heuristic choices during early stage of synthesis due to more delay migration to repeated interconnects. Amount of logic available in a single pipeline stage shrinks
16
Block level placement algorithms at any level have to deal with the complications that arising from repeater requirements for nets at any other levels of hierarchy. When CSL shrink below to the dimensions of synthesizable block placement algorithms has to handle clocked repeater insertion which is not as straight forward as buffering. Routers cant operate in a purely geometric world, it must understand buffering Complications due to large number of via blockage Complications due to the multi pin nets
Routing:
17
SOI technology for the GHz era (by G. G. Shahidi 2002 IBM) Presented by Cesare Ferri
18
Silicon-on-Insulator (SOI) : Technology Introduction Brief History SOI vs. Bulk (power, performance, scaling) Applications Future Trends
19
SOI - Introduction
G
SOI : Process Technology Basic Idea : placing a thin layer of insulator upon the substrate
Less area junction capacitance smaller Capacitance of the switch
S n+ CSB
Si-poly SiO2
n+
CDB
p-substrate
B
n+
CDB
CSB
p-substrate
B
Faster Transistor!
21
22
Cons:
History-dependent timing (floating body) Floating Body (Vsb) Reduced effective VT (=F(Vsb)) higher off current, Ioff (OSS: on the other hand, we are decreasing VDD Ioff is the same than in Bulk..) Self heating (the channel is isolated from the bulk)
23
25
Lower threshold voltage for a given off-current Higher drive current at lower power-supply voltage
26
Halo doping
Two gate electrodes of differing work functions e.g. degenerately doped n+ & p+ polysilicon
Double-gate Taxonomy
Planar DG
Vertical DG
FinFET
28
Demonstration of DGCMOS
static operation: to prove the device parametrics can all be centered to the practical values demanded for VLSI (W,L,T) transient operation: to prove the numerous parasitic elements that can degrade circuit performance can be tamed (inverter delay)
Potential for double-gate applications a. Low-power design b. Variable threshold CMOS c. Simplified logic gates