
Comparison of Multimedia SIMD, GPUs and Vector Architectures

(Data Parallelism, Hennessy Section 4.4) By Harsh Prasad, 2008CS50210


CSL718 05-Apr-12

Introduction

A common way to increase parallelism among instructions is to exploit data parallelism among independent iterations of a loop. SIMD architectures can exploit significant data-level parallelism for:
matrix-oriented scientific computing
media-oriented image and sound processors

SIMD is more energy-efficient than MIMD.

SIMD Parallelism

Vector architectures
SIMD extensions
Graphics Processing Units (GPUs)

These architectures are designed to execute data-level parallel programs.
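A data-parallel loop whose iterations are all independent, such as the DAXPY kernel (Y = a*X + Y), is the canonical target for all three approaches; a minimal scalar sketch of such a loop:

```python
# DAXPY: Y = a*X + Y. Each iteration is independent of the others,
# so the loop can be executed by vector hardware, SIMD extensions,
# or GPU threads in parallel.
def daxpy(a, x, y):
    return [a * xi + yi for xi, yi in zip(x, y)]

print(daxpy(2.0, [1.0, 2.0, 3.0], [10.0, 20.0, 30.0]))  # [12.0, 24.0, 36.0]
```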



Vector Architectures

Read sets of data elements into vector registers
Operate on those registers
Disperse the results back into memory

Example: VMIPS

Improvements:
Multiple lanes
Gather-scatter memory addressing
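Gather-scatter addressing lets a vector machine handle sparse accesses of the form A[K[i]]; a minimal Python sketch of the idea (the function names are illustrative, not the actual VMIPS mnemonics):

```python
def gather(memory, indices):
    # Indexed load: fetch memory[indices[i]] into a vector register.
    return [memory[i] for i in indices]

def scatter(memory, indices, vreg):
    # Indexed store: write vreg[i] back to memory[indices[i]].
    for i, v in zip(indices, vreg):
        memory[i] = v

mem = [0.0, 1.0, 2.0, 3.0, 4.0, 5.0]
idx = [4, 0, 2]
v = gather(mem, idx)                    # [4.0, 0.0, 2.0]
scatter(mem, idx, [x * 10 for x in v])
print(mem)                              # [0.0, 1.0, 20.0, 3.0, 40.0, 5.0]
```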


Basic Structure of Vector Register Architecture (Vector MIPS)


Multi-banked memory for bandwidth and latency hiding

Pipelined Vector Functional Units

Vector Load/Store Units (LSUs)

Each vector register holds MVL elements (each 64 bits), where MVL = Maximum Vector Length.

Vector Control Registers:
VLR - Vector Length Register
VM - Vector Mask Register
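The effect of the two control registers can be sketched in Python: VLR limits how many of the MVL elements a vector instruction touches, and VM disables individual elements (a simplified model, not VMIPS semantics in full detail):

```python
MVL = 8  # maximum vector length (elements per vector register)

def vector_add(vlr, vm, va, vb):
    # Add the first `vlr` elements of va and vb; elements whose mask
    # bit is 0 are left unchanged (here, copied from va).
    out = list(va)
    for i in range(vlr):
        if vm[i]:
            out[i] = va[i] + vb[i]
    return out

va = [1, 2, 3, 4, 0, 0, 0, 0]
vb = [10, 10, 10, 10, 10, 10, 10, 10]
vm = [1, 0, 1, 0, 1, 1, 1, 1]
print(vector_add(4, vm, va, vb))  # [11, 2, 13, 4, 0, 0, 0, 0]
```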

SIMD Extensions

Media applications operate on data types narrower than the native word size.

Limitations compared to vector instructions:
The number of data operands is encoded into the opcode
No sophisticated addressing modes (strided, scatter-gather)
No mask registers
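Because the operand count is fixed by the opcode (e.g. eight 8-bit adds packed in one 64-bit word), code must be written for one specific width; a sketch of the consequence, assuming the array length is a multiple of the width:

```python
WIDTH = 4  # the SIMD width is baked into the opcode, not into a register

def simd_add(a, b):
    # One "instruction": add exactly WIDTH packed elements.
    assert len(a) == len(b) == WIDTH
    return [x + y for x, y in zip(a, b)]

def add_arrays(a, b):
    # Unlike VLR-controlled vector code, the loop must be written for
    # one fixed width; a different width would mean different opcodes.
    out = []
    for i in range(0, len(a), WIDTH):
        out += simd_add(a[i:i + WIDTH], b[i:i + WIDTH])
    return out

print(add_arrays([1, 2, 3, 4, 5, 6, 7, 8], [10] * 8))
```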


Graphics Processing Unit


Offers higher potential performance than traditional multicore computers.

Heterogeneous execution model: the CPU is the host, the GPU is the device.

NVIDIA developed a C-like programming language for the GPU and unified all forms of GPU parallelism as the CUDA (Compute Unified Device Architecture) thread.

The programming model is Single Instruction, Multiple Thread (SIMT).
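The SIMT model launches one thread per data element, organized as a grid of thread blocks; a Python sketch of the CUDA-style global index computation (blockIdx * blockDim + threadIdx), where the names mirror CUDA but the launch function itself is illustrative:

```python
def launch(kernel, n, block_dim, *args):
    # Emulate a 1-D grid: every (block, thread) pair runs the same kernel.
    grid_dim = (n + block_dim - 1) // block_dim
    for block_idx in range(grid_dim):
        for thread_idx in range(block_dim):
            i = block_idx * block_dim + thread_idx  # global element index
            if i < n:                               # guard the tail threads
                kernel(i, *args)

def daxpy_kernel(i, a, x, y):
    # Each "thread" handles exactly one element.
    y[i] = a * x[i] + y[i]

x = [1.0, 2.0, 3.0, 4.0, 5.0]
y = [0.0] * 5
launch(daxpy_kernel, len(x), 2, 2.0, x, y)
print(y)  # [2.0, 4.0, 6.0, 8.0, 10.0]
```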


Comparison: Vector Architectures and GPUs


A GPU has many lanes, so GPU chimes are shorter.

A vector compiler manages the mask register explicitly in software; a GPU handles masks implicitly, using branch synchronization markers and an internal stack to save, complement, and restore masks.
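A sketch of how the save/complement/restore discipline might apply to an if-then-else across SIMD lanes, assuming a simple one-level mask stack (a simplified model of the hardware mechanism, not NVIDIA's actual implementation):

```python
def run_if_else(cond, then_op, else_op, data):
    # Per-lane evaluation of the condition produces the mask.
    mask = [cond(x) for x in data]
    stack = [mask]                      # save the mask at the branch marker
    # Then-side: only lanes whose mask bit is 1 execute.
    data = [then_op(x) if m else x for x, m in zip(data, mask)]
    mask = [not m for m in stack[-1]]   # complement for the else-side
    data = [else_op(x) if m else x for x, m in zip(data, mask)]
    stack.pop()                         # restore at the reconvergence marker
    return data

# Lanes with even values are doubled, lanes with odd values are negated.
print(run_if_else(lambda x: x % 2 == 0,
                  lambda x: 2 * x,
                  lambda x: -x,
                  [1, 2, 3, 4]))        # [-1, 4, -3, 8]
```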

Vector processor and a multithreaded SIMD Processor of a GPU


Supplies scalar operands for scalar-vector operations and increments addressing for unit- and non-unit-stride accesses to memory

One PC per SIMD thread

Ensures High Memory Bandwidth



GPUs have hardware support for multithreading.

A VMIPS register holds the entire vector; in a GPU, the vector is spread across the registers of the SIMD lanes.


In a vector architecture, memory latency is hidden by paying the latency once per vector load/store instruction; a GPU hides it using multithreading. The conditional branch mechanism of the GPU handles the strip-mining problem of vector architectures by iterating the loop until all the SIMD lanes reach the loop bound.
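Strip-mining on a vector machine splits an n-element loop into pieces of at most MVL elements, setting VLR to the odd-sized first piece; a sketch of what a vector compiler generates:

```python
MVL = 64  # maximum vector length of the machine

def strip_mined_sum(a, b):
    # Process n elements in strips of at most MVL, the way a vector
    # compiler would: the first strip handles the n mod MVL leftover.
    n = len(a)
    out = [0] * n
    start = 0
    vlr = n % MVL or min(n, MVL)         # VLR for the first strip
    while start < n:
        for i in range(start, start + vlr):  # one vector instruction
            out[i] = a[i] + b[i]
        start += vlr
        vlr = min(MVL, n - start)        # full-length strips afterwards
    return out
```

For n = 130 and MVL = 64 this executes strips of 2, 64, and 64 elements.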


Comparison: Multimedia SIMD Computers and GPUs


Unlike multimedia SIMD extensions, where scalar and SIMD instructions execute on the same processor and share one memory, in GPUs the scalar (host) processor and the SIMD processors are separated by an I/O bus and have separate main memories.


Also, multimedia SIMD instructions do not support scatter-gather memory accesses. In short, GPUs are multithreaded SIMD processors with more lanes, more SIMD processors, and better hardware support for multithreading.


Thank You

