
ADVANCED DIGITAL SIGNAL PROCESSING
Dr Mohammad Faiz Liew bin Abdullah

Department of Communication
Faculty of Electrical & Electronic Engineering
University Tun Hussein Onn Malaysia

Semester I 10/11

Slide 1

Contents
What is DSP?
What is DSP used for?
Speech & Audio processing
Image & Video processing
Adaptive filtering

DSP Devices and Architectures


Summary & Conclusions


Slide 2

INTRODUCTION


Slide 3

What is DSP?
Digital Signal Processing: the processing or manipulation of signals using digital techniques.

Input Signal -> ADC (Analogue to Digital Converter) -> Digital Signal Processor -> DAC (Digital to Analogue Converter) -> Output Signal

Slide 4

DSP


Slide 5

WHY PROCESS SIGNALS?

Signals are corrupted by noise and interference.
Signals are distorted by the transmission medium.
To recover information present in the signal.
To apply algorithms such as compression.


Slide 6

HISTORICAL CONTRIBUTORS


Slide 7

SIGNAL PROCESSING
[Figure: signal processing tree with Digital and Analog branches; labels V1(t), V2(t), i(t)]

Slide 8

ADVANTAGES & DISADVANTAGES OF DSP
Advantages:
Programmability.
Stability.
Repeatability.
Easier implementation of adaptive algorithms.
Disadvantage:
Bandwidth limitation related to the processor cycle and algorithm complexity.

Slide 9

PROCESSING METHODOLOGY
Approaches: HARDWARE, SOFTWARE
Platforms: PC, DSP PROCESSOR, FPGA, ASIC, MICROPROCESSOR, MICROCONTROLLER

Slide 10


Slide 11

What is DSP Used For?

And much more!



Slide 12

APPLICATIONS OF DSP
MULTIMEDIA
SPEECH
COMMUNICATION
POWER
BIOMEDICAL
OCEANOGRAPHIC
SEISMIC APPLICATION
IMAGE/VIDEO
SONAR
AEROSPACE
RADAR
INDUSTRIAL

Slide 13


Slide 14

Speech Processing

Speech coding/compression
Speech synthesis
Speech recognition

Slide 15

Some Properties of Speech

[Figure: speech waveform of the utterance "The blue spot is on the key again", with segments marked: "oo" in blue, "o" in spot, "ee" in key, "e" in again, "s" in spot, "k" in key]

Slide 16

Some Properties of Speech


Vowels:
"oo" in blue, "o" in spot, "ee" in key, "e" in again
Quasi-periodic
Relatively high signal power
Consonants:
"s" in spot, "k" in key
Non-periodic (random)
Relatively low signal power
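
The power difference between vowels and consonants can be measured frame by frame. The sketch below is only an illustration: the function names, the frame layout and the threshold are assumptions, not part of the slides, and a power threshold alone is a crude discriminator.

```c
#include <assert.h>
#include <stddef.h>

/* Mean power of one frame: (1/len) * sum of x[n]^2. */
double frame_power(const double *x, size_t len)
{
    assert(x && len > 0);
    double sum = 0.0;
    for (size_t n = 0; n < len; n++)
        sum += x[n] * x[n];
    return sum / (double)len;
}

/* Crude split into high-power (vowel-like) and low-power (consonant-like)
 * frames. The threshold is a hypothetical tuning value chosen by the caller. */
int is_high_power(const double *x, size_t len, double threshold)
{
    return frame_power(x, len) > threshold;
}
```

In practice the frame would be a short window of speech samples (a few tens of milliseconds is a common choice), and the threshold would be tuned to the recording level.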

Slide 17

Speech Coding
[Figure: speech path through the network elements MSC, TRAU, BSC and BTS, with bit rates of 64 kbits/s, 22.8 kbits/s and 13 kbits/s marked]

Slide 18

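Speech Coding: Linear Prediction

Try to predict the current sample value;
Transmit the prediction error.
Encoder: the predictor A(z) forms the estimate se(n) from s(n); d(n) = s(n) - se(n) is transmitted.
Decoder: sr(n) = d(n) + se(n), with se(n) formed by the same predictor A(z).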
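
A minimal sketch of the predict-and-transmit-the-error idea, using the slide's symbols s(n), d(n) and sr(n). The integer predictor coefficients, the function names and the choice to treat samples before n = 0 as zero are assumptions made for illustration; a real speech coder would also quantize d(n) and adapt the predictor.

```c
#include <assert.h>
#include <stddef.h>

/* Estimate se(n) = sum over k = 1..order of a[k-1] * past[n-k].
 * Samples before n = 0 are treated as zero (assumption). */
static int predict(const int *a, size_t order, const int *past, size_t n)
{
    int se = 0;
    for (size_t k = 1; k <= order && k <= n; k++)
        se += a[k - 1] * past[n - k];
    return se;
}

/* Encoder: d(n) = s(n) - se(n). */
void lp_encode(const int *a, size_t order, const int *s, int *d, size_t len)
{
    assert(a && s && d);
    for (size_t n = 0; n < len; n++)
        d[n] = s[n] - predict(a, order, s, n);
}

/* Decoder: sr(n) = d(n) + se(n), predicting from already rebuilt samples. */
void lp_decode(const int *a, size_t order, const int *d, int *sr, size_t len)
{
    assert(a && d && sr);
    for (size_t n = 0; n < len; n++)
        sr[n] = d[n] + predict(a, order, sr, n);
}
```

With the hypothetical second-order predictor a = {2, -1} and a steadily rising input such as 10, 12, 14, 16, the error d(n) is zero from the third sample onward, which is why the error is cheaper to transmit than the signal itself.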

Slide 19

Speech Coding: Vocoder

Encoder
Original Speech -> Analysis:
Voiced/Unvoiced decision
Pitch Period (voiced only)
Signal power (Gain)

Decoder
Pulse Train (set by Pitch Period) or Random Noise, selected by V/U -> scaled by Signal Power -> Vocal Tract Model -> Synthesized Speech

LPC-10:
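
The decoder path can be sketched as an excitation generator followed by a recursive filter. This is an illustration under stated assumptions, not the LPC-10 algorithm itself: the function names, the unit-height pulses, the uniform noise from `rand()` and the sign convention of the all-pole filter are all hypothetical choices.

```c
#include <assert.h>
#include <stddef.h>
#include <stdlib.h>

/* Excitation: a pulse every pitch_period samples when voiced,
 * uniform noise in [-1, 1] when unvoiced, both scaled by gain. */
void vocoder_excitation(double *exc, size_t len, int voiced,
                        size_t pitch_period, double gain)
{
    assert(exc && (!voiced || pitch_period > 0));
    for (size_t n = 0; n < len; n++) {
        double v;
        if (voiced)
            v = (n % pitch_period == 0) ? 1.0 : 0.0;
        else
            v = 2.0 * rand() / RAND_MAX - 1.0;
        exc[n] = gain * v;
    }
}

/* Assumed all-pole vocal tract model:
 * out(n) = exc(n) + sum over k = 1..order of a[k-1] * out(n-k). */
void vocal_tract_filter(const double *a, size_t order,
                        const double *exc, double *out, size_t len)
{
    assert(a && exc && out);
    for (size_t n = 0; n < len; n++) {
        double acc = exc[n];
        for (size_t k = 1; k <= order && k <= n; k++)
            acc += a[k - 1] * out[n - k];
        out[n] = acc;
    }
}
```

Only the voiced/unvoiced flag, the pitch period, the gain and the filter coefficients need to be transmitted per frame, which is where the low bit rate of a vocoder comes from.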

Slide 20

Text-to-Speech Synthesis
Input text: "To be or not to be that is the question"
Phonetic form: "Tu bee awr nawt tu bee dhat iz dhe kwestchun"

Stages, from input text to synthesized speech:
Text normalization: expands abbreviations, dates, times, money, etc.
Parsing: syntactic & semantic parts-of-speech analysis of text
Pronunciation: phonetic description of each word; dictionary with letter-to-sound rules as a backup
Prosody rules: apply word stress, duration and pitch
Waveform generation: phonetic-to-acoustic transformation

Text-to-speech synthesis sounds very natural these days.



Slide 21

Speech Synthesis Applications

Speaking clocks
Spoken (variable) announcements
Talking emails + talking heads for mobile
Synthesis of location-based information
(e.g. traffic information)
Interactive systems (e.g. catalogue ordering,
Yellow Pages, ...)

Slide 22

Speech/Speaker Recognition
Speech Recognition: What has been spoken?
Speaker dependent: Recognition system trained for a particular person's voice.
Speaker independent: Recognition system expected to deal with a wide variety of speakers.

Speaker Recognition: Who has spoken?


Not easy
Sometimestherearenogapsbetweenwords.
Sometim esthereareg aps inthe mid dleofwords.

Accents, dialects and Stress eggsist.



Slide 23

Speech Recognition System

speech -> Feature extraction -> Phoneme recognition -> Word recognition -> Sentence recognition -> decision
Knowledge sources feeding the recognition stages: Phoneme models, Word pronunciation, Syntactic knowledge, Semantic knowledge, Dialogue knowledge

Slide 24

Digital Audio
Standard music CD:

Sampling Rate: 44.1 kHz


16-bit samples
2-channel stereo
Data transfer rate = 2 × 16 × 44,100 ≈ 1.41 Mbits/s
1 hour of music = 1.41 Mbits/s × 3,600 s ÷ 8 ≈ 635 MB
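
The arithmetic above can be checked with a short sketch. The function names are illustrative assumptions; the figures are the ones given on the slide.

```c
#include <assert.h>
#include <stdint.h>

/* Raw PCM bit rate in bits per second. */
uint64_t pcm_bit_rate(uint32_t channels, uint32_t bits_per_sample,
                      uint32_t sample_rate_hz)
{
    assert(channels > 0 && bits_per_sample > 0 && sample_rate_hz > 0);
    return (uint64_t)channels * bits_per_sample * sample_rate_hz;
}

/* Storage in bytes needed for `seconds` of audio at `bit_rate` bits/s. */
uint64_t pcm_storage_bytes(uint64_t bit_rate, uint32_t seconds)
{
    return bit_rate * seconds / 8u;
}
```

For the CD parameters, `pcm_bit_rate(2, 16, 44100)` gives 1,411,200 bits/s, and one hour at that rate is 635,040,000 bytes, which matches the 635 MB quoted when 1 MB is taken as 10^6 bytes.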


Slide 25

Audio Coding (Cont'd)

Key standards:
MPEG: Layers I, II, and III (MP3); AAC. Used in DAB, DVD.
Dolby AC3, Dolby Digital, Dolby Surround.

Typical bit rates for 2-channel stereo:
64 kbits/s to 384 kbits/s.

Subband- or transform-based, making use of perceptual masking properties.

Slide 26

Audio Coding (Cont'd)

Typical 3/2 multichannel stereo configuration:
[Figure: loudspeaker layout showing Left, Centre, Right, Surround Left, Surround Right]

5.1 channels (3/2) with LFE channel:
Left, Right, Centre,
Left Surround, Right Surround,
Low Frequency Effects (LFE) (reduced bandwidth).
The LFE loudspeaker can, in general, be placed anywhere in the listening room.

Slide 27

Audio Coding: Masking

Auditory Masking:
Spectral: Strong frequency components mask weaker neighbouring frequency components.
Temporal: Strong temporal events mask recent and future events.
[Figure: Spectral Masking, SPL/dB against freq/kHz]
[Figure: Temporal Masking, SPL/dB against time, with 10 ms and 160 ms marked]

Slide 28

Masking Example

[Figure: masking example, level in dB (10 to 60) against frequency in Hz (200 to 800)]

Slide 29

Image/Video
Still Image Coding:
JPEG (Joint Photographic Experts Group):
Discrete Cosine Transform (DCT) based

JPEG2000: Wavelet Transform based

Video Coding:
MPEG (Moving Picture Experts Group):
DCT-based,
Interframe and intraframe prediction,
Motion estimation.

Applications: Digital TV, DVD, etc.



Slide 30

JPEG Example
[Figure: the same image shown as Original, JPEG (4:1) and JPEG (100:1)]

Slide 31

Adaptive Filtering
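Self-learning: Filter coefficients adapt in response to a training signal.
[Figure: the input x(n) passes through the adaptive filter W(z) to give y(n); the error e(n) = d(n) - y(n) against the training signal d(n) drives the coefficient update]

Filter update: Least Mean Squares (LMS) algorithm

w(n+1) = w(n) + 2μ e(n) x(n), where μ is the step size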
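
A minimal sketch of one LMS iteration for an FIR filter, assuming the usual form of the update with a step size (called `mu` here). The function name, the argument layout and the ordering of the input buffer are assumptions for illustration.

```c
#include <assert.h>
#include <stddef.h>

/* One LMS iteration for an FIR filter with `taps` coefficients.
 * x holds the most recent inputs: x[0] = x(n), x[1] = x(n-1), ...
 * d is the training (desired) sample d(n); mu is the step size.
 * Updates w in place and returns the error e(n) = d(n) - y(n). */
double lms_step(double *w, const double *x, size_t taps, double d, double mu)
{
    assert(w && x && taps > 0);
    double y = 0.0;
    for (size_t i = 0; i < taps; i++)
        y += w[i] * x[i];              /* filter output y(n) */
    double e = d - y;
    for (size_t i = 0; i < taps; i++)
        w[i] += 2.0 * mu * e * x[i];   /* w(n+1) = w(n) + 2 mu e(n) x(n) */
    return e;
}
```

Called once per sample, the coefficients drift toward the values that minimise the mean squared error. The step size trades convergence speed against stability: too large a value makes the update diverge.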

Slide 32

Adaptive Filtering Applications


Echo cancellation (telephone lines)
Used in modems (making Internet access possible!!)

Acoustic echo cancellation


Hands-free telephony

Adaptive equalization
Active noise control
Medical signal processing
e.g. foetal heart beat monitoring


Slide 33

Some Other Application Areas


Image analysis, e.g.:
Face recognition,
Optical Character Recognition (OCR);

Restoration of old image, video, and audio signals;


Analysis of RADAR data;
Analysis of SONAR data;
Data transmission (modems, radio, echo
cancellation, channel equalization, etc.);
Storage and archiving;
Control of electric motors.

Slide 34

DSP Devices & Architectures


Selecting a DSP involves several choices:
Fixed-point;
Floating-point;
Application-specific devices (e.g. FFT processors, speech recognizers, etc.).

Main DSP Manufacturers:


Texas Instruments (http://www.ti.com)
Motorola (http://www.motorola.com)
Analog Devices (http://www.analog.com)


Slide 35

Typical DSP Operations


Filtering
Energy of Signal
Frequency transforms

y(n) = \sum_{i=0}^{L-1} a_i \, x(n-i)

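Pseudo C code
/* FIR filter: y[n] = sum of a[i] * x[n-i], with x[k] taken as 0 for k < 0 */
for (n = 0; n < N; n++)
{
    s = 0;
    /* stop at i == n so that x[n-i] never reads before the start of x */
    for (i = 0; i < L && i <= n; i++)
    {
        s += a[i] * x[n - i];
    }
    y[n] = s;
}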


Slide 36

Traditional DSP Architecture


[Figure: X RAM and Y RAM supply a_i and x(n-i) to a Multiply/Accumulate unit; the Accumulator output is y(n)]

N.B. Most modern DSPs have more advanced features.
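
On a fixed-point device the multiply/accumulate loop might look like the sketch below. The Q15 number format, the 64-bit accumulator, the saturation step and the function name are assumptions chosen for illustration; real devices differ in accumulator width and rounding behaviour.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Dot product of two Q15 vectors (values in [-1, 1) stored as int16_t).
 * Products are summed in a wide accumulator so that intermediate sums
 * cannot overflow, then scaled back to Q15 and saturated. */
int16_t mac_q15(const int16_t *a, const int16_t *x, size_t len)
{
    assert(a && x);
    int64_t acc = 0;
    for (size_t i = 0; i < len; i++)
        acc += (int32_t)a[i] * x[i];   /* Q15 * Q15 gives a Q30 product */
    acc /= 32768;                      /* back to Q15, truncating toward zero */
    if (acc > INT16_MAX) acc = INT16_MAX;
    if (acc < INT16_MIN) acc = INT16_MIN;
    return (int16_t)acc;
}
```

For example, 0.5 × 0.5 is 16384 × 16384 in Q15 and comes back as 8192, which is 0.25. The wide accumulator is what lets many products be summed before any precision is thrown away.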



Slide 37

SDC Core Skills


DSP
Speech
Audio
Mobile
Services
Administration
System modelling
Speech compression
MP3
Baseband processing
Firmware design
Speech Recognition
Other digital audio
Channel coding
CAD Tools
System Integration
Speech synthesis
Performance Assessment
AMR Coding
Computer & Networking
CPU (Oak, ARM)
Speech enhancement
H/w & S/w Co-design
Speech Testing
System on Chip (SoC)

Slide 38

SDC Firmware Development


Algorithm Definition
Floating-point and Fixed-point
Co-Simulation
Co-Design
Implementation
COSSAP, Matlab ...
Behavioural, RTL, Logic ...
Co-Verification: MCU, DSP ...

Product Development With Barcelona and Tokyo Design Centres

Slide 39

Summary & Conclusions


DSP is used in a wide range of everyday applications.
Looked at:
Speech coding; Speech synthesis & recognition;
Image/Video;
Adaptive filtering.

Other areas include:
Image analysis (e.g. face recognition, OCR, etc.);
RADAR/SONAR;
Data transmission and reception;
And many more!


Slide 40
