Documente Academic
Documente Profesional
Documente Cultură
33 32 31
23 22 21
13 12 11
33 32 31
23 22 21
13 12 11
33 32 31
23 22 21
13 12 11
c c c
c c c
c c c
b b b
b b b
b b b
a a a
a a a
a a a
31 13 21 12 11 11 11
b a b a b a c + + =
b a c c + =
31 13 21 12 11 11 11 11
b a b a b a c c + + + =
C
11
= 0
Computer System Architecture Dept. of Info. Of Computer Chap. 9 Pipeline and Vector Processing
9-13
Pipeline for calculating an inner product : Fig. 9-12
Floating point multiplier pipeline : 4 segment
Floating point adder pipeline : 4 segment
)
after 1st clock input
after 8th clock input
Four section summation
k k
B A B A B A B A C + + + + =
3 3 2 2 1 1
Source
A
Source
B
Multiplier
pipeline
Adder
pipeline
after 4th clock input
A
1
B
1
Source
A
Source
B
Multiplier
pipeline
Adder
pipeline
A
4
B
4
A
3
B
3
A
2
B
2
A
1
B
1
Source
A
Source
B
Multiplier
pipeline
Adder
pipeline
Source
A
Source
B
Multiplier
pipeline
Adder
pipeline
after 9th, 10th, 11th ,...
A
8
B
8
A
7
B
7
A
6
B
6
A
5
B
5
A
4
B
4
A
3
B
3
A
2
B
2
A
1
B
1
A
8
B
8
A
7
B
7
A
6
B
6
A
5
B
5
A
4
B
4
A
3
B
3
A
2
B
2
A
1
B
1
5 5 1 1
B A B A +
, , ,
6 6 2 2
B A B A +
+ + + + +
+ + + + +
+ + + + +
+ + + + =
16 16 12 12 8 8 4 4
15 15 11 11 7 7 3 3
14 14 10 10 6 6 2 2
13 13 9 9 5 5 1 1
B A B A B A B A
B A B A B A B A
B A B A B A B A
B A B A B A B A C
Computer System Architecture Dept. of Info. Of Computer Chap. 9 Pipeline and Vector Processing
9-14
Memory Interleaving : Fig. 9-13
Simultaneous access to memory from two or
more source using one memory bus system
AR 2 bit 4 1
memory module
) Even / Odd Address Memory Access
AR
Memory
array
DR
AR
Memory
array
DR
AR
Memory
array
DR
AR
Memory
array
DR
Address bus
Data bus
Supercomputer
Supercomputer = Vector Instruction + Pipelined floating-point arithmetic
Performance Evaluation Index
MIPS : Million Instruction Per Second
FLOPS : Floating-point Operation Per Second
megaflops : 10
6
, gigaflops : 10
9
Cray supercomputer : Cray Research
Clay-1 : 80 megaflops, 4 million 64 bit words memory
Clay-2 : 12 times more powerful than the clay-1
VP supercomputer : Fujitsu
VP-200 : 300 megaflops, 32 million memory, 83 vector instruction, 195 scalar instruction
VP-2600 : 5 gigaflops
Computer System Architecture Dept. of Info. Of Computer Chap. 9 Pipeline and Vector Processing
9-15
9-7 Array Processors
Performs computations on large arrays of data
Array Processing
Attached array processor : Fig. 9-14
Auxiliary processor attached to a general purpose computer
SIMD array processor : Fig. 9-15
Computer with multiple processing units operating in parallel
Vector C = A + B c
i
= a
i
+ b
i
PE
i
Vector processing : Adder/Multiplier pipeline
Array processing : array processor
General-purpose
computer
Input-Output
interface
Attached array
Processor
Main memory Local memory
High-speed memory to-
memory bus
PE1
PE3
PE2
M1
M3
M2
PEn Mn
Master control
unit
Main memory