DIGITAL SIGNAL PROCESSING
Dr Mohammad Faiz Liew bin Abdullah
Department of Communication
Faculty of Electrical & Electronic Engineering
University Tun Hussein Onn Malaysia
Semester I 10/11
Contents
What is DSP?
What is DSP used for?
Speech & Audio processing
Image & Video processing
Adaptive filtering
INTRODUCTION
What is DSP?
Digital Signal Processing: the processing or manipulation of signals using digital techniques.
Input Signal -> ADC (Analogue to Digital Converter) -> DSP (Digital Signal Processor) -> DAC (Digital to Analogue Converter) -> Output Signal
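The ADC, processor and DAC stages can be sketched in C as follows. This is only an illustration, not code from the slides: the function names `adc_quantize`, `dsp_process` and `dac_reconstruct`, the signed integer code format, and the input range of -1.0 to +1.0 are all assumptions made for the example.

```c
/* Hypothetical ADC model: map a voltage in [-1.0, +1.0) to a signed
 * integer code of the given word length, rounding to the nearest level. */
int adc_quantize(double v, int bits)
{
    int levels = 1 << (bits - 1);              /* e.g. 128 for 8 bits */
    double scaled = v * levels;
    int code = (int)(scaled >= 0.0 ? scaled + 0.5 : scaled - 0.5);
    if (code > levels - 1) code = levels - 1;  /* clip at positive full scale */
    if (code < -levels)    code = -levels;     /* clip at negative full scale */
    return code;
}

/* Hypothetical processing step: halve the amplitude of each sample. */
int dsp_process(int code)
{
    return code / 2;
}

/* Hypothetical DAC model: map the integer code back to a voltage. */
double dac_reconstruct(int code, int bits)
{
    return (double)code / (1 << (bits - 1));
}
```

With 8-bit words, an input of 0.5 becomes code 64, the processing step halves it to 32, and the DAC returns 0.25.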
HISTORICAL CONTRIBUTORS
SIGNAL PROCESSING
Signal processing: digital or analog
[Figure labels: V1(t), V2(t), i(t)]
ADVANTAGES & DISADVANTAGES OF DSP
Advantages:
Programmability.
Stability.
Repeatability.
Easier implementation of adaptive algorithms.
Disadvantage:
Bandwidth limitation related to the processor cycle
and algorithm complexity.
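The bandwidth limitation can be made concrete with a rough calculation. The sketch below assumes a processor that must finish all the work for one sample before the next one arrives, and uses the Nyquist relation (usable bandwidth is at most half the sampling rate). The function names and the example figures are hypothetical.

```c
/* Highest sampling rate the processor can sustain if every output
 * sample costs cycles_per_sample clock cycles (assumed constant). */
double max_sample_rate_hz(double clock_hz, double cycles_per_sample)
{
    return clock_hz / cycles_per_sample;
}

/* Usable signal bandwidth: at most half the sampling rate (Nyquist). */
double max_bandwidth_hz(double clock_hz, double cycles_per_sample)
{
    return max_sample_rate_hz(clock_hz, cycles_per_sample) / 2.0;
}
```

For example, a 100 MHz processor running an algorithm that needs 100 cycles per sample can sustain at most 1 MHz sampling, so at most 500 kHz of signal bandwidth. Doubling the algorithm's complexity halves both figures.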
PROCESSING METHODOLOGY
HARDWARE
SOFTWARE
PC
DSP PROCESSOR
FPGA
ASIC
MICROPROCESSOR
MICROCONTROLLER
APPLICATIONS OF DSP
MULTIMEDIA
SPEECH
COMMUNICATION
POWER
BIOMEDICAL
OCEANOGRAPHIC
SEISMIC
IMAGE/VIDEO
SONAR
AEROSPACE
RADAR
INDUSTRIAL
Speech Processing
Speech coding/compression
Speech synthesis
Speech recognition
Vowels
oo in blue
o in spot
ee in key
e in again
Quasi-periodic
Relatively high signal power
Consonants
s in spot
k in key
Non-periodic (random)
Relatively low signal power
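The two properties listed for each class of sound (signal power and periodicity) can be measured per frame. The sketch below computes mean-square power and, as a rough stand-in for "noise-like" behaviour, the number of zero crossings. The zero-crossing count is a common heuristic and is an assumption here, not something the slides state. The function names are hypothetical.

```c
#include <stddef.h>

/* Mean-square power of one frame of samples. */
double frame_power(const double *s, size_t n)
{
    double sum = 0.0;
    if (n == 0) return 0.0;
    for (size_t i = 0; i < n; i++)
        sum += s[i] * s[i];
    return sum / (double)n;
}

/* Number of sign changes between consecutive samples.
 * Noise-like (consonant) frames tend to give higher counts. */
size_t zero_crossings(const double *s, size_t n)
{
    size_t count = 0;
    for (size_t i = 1; i < n; i++)
        if ((s[i - 1] >= 0.0) != (s[i] >= 0.0))
            count++;
    return count;
}
```

A frame with high power and few zero crossings would lean towards a vowel, and one with low power and many zero crossings towards a consonant. Any thresholds would have to be tuned on real speech.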
Speech Coding
[Figure: speech path through the network elements MSC, TRAU, BSC and BTS, with bit rates of 64 kbit/s, 22.8 kbit/s and 13 kbit/s marked]
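The three bit rates can be compared with some simple arithmetic. The sketch below assumes a 20 ms speech frame, which is not stated in the slide text, and the function names are hypothetical. Note that 64 kbit/s is what 8000 samples per second at 8 bits per sample produces.

```c
/* Bits carried in one frame at the given bit rate.
 * frame_ms is the frame length in milliseconds (20 ms assumed below). */
long bits_per_frame(long bit_rate_bps, long frame_ms)
{
    return bit_rate_bps * frame_ms / 1000;
}

/* Ratio between an uncoded rate and a coded rate. */
double compression_ratio(double uncoded_bps, double coded_bps)
{
    return uncoded_bps / coded_bps;
}
```

Under the 20 ms assumption, 13 kbit/s gives 260 bits per frame and 22.8 kbit/s gives 456 bits per frame. Going from 64 kbit/s to 13 kbit/s is a compression ratio of roughly 4.9:1.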
[Figure: predictive speech coder block diagram with signals se(n), d(n), sr(n), two summing nodes and predictor A(z)]
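A predictive coder of this general kind can be sketched as below. This is a generic linear-prediction example, not a reconstruction of the slide's diagram: reading d(n) as the prediction difference and sr(n) as the reconstructed signal is an assumption, the difference signal is left unquantized for simplicity, and all function names are hypothetical.

```c
#include <stddef.h>

/* Predict sample n from the previous `order` samples of sig:
 * sum over k of a[k] * sig[n-1-k]. Missing history counts as zero. */
static double predict(const double *a, size_t order,
                      const double *sig, size_t n)
{
    double p = 0.0;
    for (size_t k = 0; k < order && k < n; k++)
        p += a[k] * sig[n - 1 - k];
    return p;
}

/* Encoder: d(n) = s(n) - prediction from past input samples. */
void lp_encode(const double *a, size_t order,
               const double *s, double *d, size_t len)
{
    for (size_t n = 0; n < len; n++)
        d[n] = s[n] - predict(a, order, s, n);
}

/* Decoder: sr(n) = d(n) + prediction from past reconstructed samples. */
void lp_decode(const double *a, size_t order,
               const double *d, double *sr, size_t len)
{
    for (size_t n = 0; n < len; n++)
        sr[n] = d[n] + predict(a, order, sr, n);
}
```

With a single coefficient of 1.0 the encoder reduces to simple difference coding: the input 1, 3, 6, 10 becomes 1, 2, 3, 4, and the decoder recovers the original sequence.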
LPC-10 Decoder
[Figure: LPC-10 decoder. Blocks and labels: Pitch Period, Pulse Train, Random Noise, V/U switch, Signal Power, Vocal Tract Model, Synthesized Speech]
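The decoder's signal flow can be illustrated with a short sketch: an excitation (pulse train for voiced frames, pseudo-random noise for unvoiced frames) is scaled by a gain and passed through an all-pole filter standing in for the vocal tract model. This is not the LPC-10 standard itself. The function name, the sign convention of the coefficients, the unit-amplitude excitation and the simple noise generator are all assumptions made for the example.

```c
#include <stddef.h>

/* Synthesize len samples into out.
 * a[0..order-1]: assumed all-pole coefficients, out(n) += a[k] * out(n-1-k)
 * voiced:        non-zero selects the pulse train, zero selects noise
 * pitch_period:  spacing of pulses in samples (voiced only)
 * gain:          scales the excitation (stands in for signal power) */
void lpc_synthesize(const double *a, size_t order, int voiced,
                    size_t pitch_period, double gain,
                    double *out, size_t len)
{
    unsigned int seed = 1u;                    /* state of the noise generator */
    for (size_t n = 0; n < len; n++) {
        double e;
        if (voiced) {
            /* unit pulse once per pitch period */
            e = (pitch_period > 0 && n % pitch_period == 0) ? 1.0 : 0.0;
        } else {
            /* crude pseudo-random +1/-1 sequence */
            seed = seed * 1103515245u + 12345u;
            e = (seed & 0x10000u) ? 1.0 : -1.0;
        }
        double y = gain * e;
        for (size_t k = 0; k < order && k < n; k++)
            y += a[k] * out[n - 1 - k];        /* vocal tract model (all-pole) */
        out[n] = y;
    }
}
```

For a voiced frame with one coefficient of 0.5, a pitch period of 4 and unit gain, each pulse produces a decaying response (1, 0.5, 0.25, 0.125) and the next pulse adds on top of what remains of the previous one.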
Text-to-Speech Synthesis
Input text: "To be or not to be that is the question"
Text normalization: expands abbreviations, dates, times, money, etc.
Parsing: semantic & syntactic parts-of-speech analysis of text
Pronunciation: phonetic description of each word, dictionary with letter-to-sound rules as a back-up
Phonetic form: "Tu bee awr nawt tu bee dhat iz dhe kwestchun"
Prosody rules: apply word stress, duration and pitch
Waveform generation: phonetic-to-acoustic transformation
Synthesized speech
Speaking clocks
Spoken (variable) announcements
Talking emails + talking heads for mobile
Synthesis of location-based information
(e.g. traffic information)
Interactive systems (e.g. catalogue ordering,
Yellow Pages, ...)
Speech/Speaker Recognition
Speech Recognition: What has been spoken?
Speaker dependent: Recognition system trained for a particular person's voice.
Speaker independent: Recognition system expected to deal with a wide variety of speakers.
[Figure: speech recognition system. Processing stages: speech -> feature extraction -> phoneme recognition -> word recognition -> sentence recognition -> decision. Knowledge sources: phoneme models, word pronunciation, syntactic knowledge, semantic knowledge, dialogue knowledge]
Digital Audio
Standard music CD:
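The raw bit rate of uncompressed audio follows directly from the sampling parameters. The sketch below uses the usual CD audio values (44.1 kHz sampling, 16 bits per sample, two channels). These values are supplied here as an assumption, since they do not appear in the slide text, and the function name is hypothetical.

```c
/* Raw PCM bit rate in bits per second. */
long pcm_bit_rate(long sample_rate_hz, int bits_per_sample, int channels)
{
    return sample_rate_hz * bits_per_sample * channels;
}
```

`pcm_bit_rate(44100, 16, 2)` gives 1,411,200 bit/s, about 1.41 Mbit/s. For comparison, `pcm_bit_rate(8000, 8, 1)` gives 64,000 bit/s.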
[Figure: surround-sound loudspeaker layout with Left, Centre, Right and Surround channels]
Temporal Masking
[Figure: two masking plots of SPL/dB, one against freq/kHz and one against time, with intervals of 10 ms and 160 ms marked on the time axis]
Masking Example
[Figure: masking example, level in dB (10 to 60) against frequency in Hz (200 to 800)]
Image/Video
Still Image Coding:
JPEG (Joint Photographic Experts Group):
Discrete Cosine Transform (DCT) based
Video Coding:
MPEG (Moving Picture Experts Group):
DCT-based,
Interframe and intraframe prediction,
Motion estimation.
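Motion estimation is often done by block matching. The sketch below is one simple form of it: an exhaustive search that compares a block of the current frame with displaced blocks in a reference frame, using the sum of absolute differences (SAD). It is not taken from the MPEG standards, which leave the search method to the encoder. The type and function names are hypothetical, and frames are assumed to be 8-bit greyscale stored row by row.

```c
#include <limits.h>
#include <stdlib.h>

typedef struct { int dx, dy; } MotionVector;

/* Sum of absolute differences between a bs x bs block at (cx, cy) in cur
 * and a bs x bs block at (rx, ry) in ref. Both frames have width w. */
static long block_sad(const unsigned char *cur, const unsigned char *ref,
                      int w, int cx, int cy, int rx, int ry, int bs)
{
    long sad = 0;
    for (int y = 0; y < bs; y++)
        for (int x = 0; x < bs; x++)
            sad += labs((long)cur[(cy + y) * w + cx + x] -
                        (long)ref[(ry + y) * w + rx + x]);
    return sad;
}

/* Full search over displacements in [-range, +range] for the block whose
 * top-left corner is (bx, by). Returns the displacement with lowest SAD. */
MotionVector estimate_motion(const unsigned char *cur, const unsigned char *ref,
                             int w, int h, int bx, int by, int bs, int range)
{
    MotionVector best = {0, 0};
    long best_sad = LONG_MAX;
    for (int dy = -range; dy <= range; dy++) {
        for (int dx = -range; dx <= range; dx++) {
            int rx = bx + dx, ry = by + dy;
            if (rx < 0 || ry < 0 || rx + bs > w || ry + bs > h)
                continue;                      /* candidate leaves the frame */
            long sad = block_sad(cur, ref, w, bx, by, rx, ry, bs);
            if (sad < best_sad) {
                best_sad = sad;
                best.dx = dx;
                best.dy = dy;
            }
        }
    }
    return best;
}
```

An encoder would then transmit the motion vector plus the (usually small) difference between the block and its best match, rather than the block itself.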
JPEG Example
[Figure: original image compared with JPEG compression at 4:1 and at 100:1]
Adaptive Filtering
Self-learning: Filter coefficients adapt in response
to training signal.
[Figure: adaptive filter W(z) with input x(n), output y(n), training signal d(n) and error e(n)]
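One widely used way to adapt the coefficients is the least-mean-squares (LMS) algorithm. The slides do not name a specific algorithm, so LMS is used here only as an example. The sketch assumes the error is e(n) = d(n) - y(n) and that W(z) is an FIR filter. The function name and the step size `mu` are hypothetical.

```c
#include <stddef.h>

/* One LMS update.
 * w:      the L filter coefficients, updated in place
 * x_hist: x_hist[i] holds x(n-i), so x_hist[0] is the newest input
 * d:      training (desired) sample d(n)
 * mu:     step size controlling how fast the filter adapts
 * Returns the error e(n) = d(n) - y(n) computed before the update. */
double lms_step(double *w, const double *x_hist, size_t L,
                double d, double mu)
{
    double y = 0.0;
    for (size_t i = 0; i < L; i++)
        y += w[i] * x_hist[i];                 /* filter output y(n) */

    double e = d - y;                          /* error signal e(n) */

    for (size_t i = 0; i < L; i++)
        w[i] += mu * e * x_hist[i];            /* move w to reduce the error */

    return e;
}
```

Calling `lms_step` once per sample drives the error towards zero when the training signal is related to the input. If `mu` is too large the adaptation becomes unstable, and if it is too small convergence is slow.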
Adaptive equalization
Active noise control
Medical signal processing
e.g. foetal heart beat monitoring
$$y(n) = \sum_{i=0}^{L-1} a_i \, x(n-i)$$
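C code
/* FIR filter: y[n] = sum of a[i] * x[n-i] for i = 0 .. L-1.          */
/* Assumption: samples before the start of x (n - i < 0) are zero,    */
/* so the inner loop stops at i = n to avoid reading outside x.       */
void fir(const float *a, int L, const float *x, float *y, int N)
{
    for (int n = 0; n < N; n++)
    {
        float s = 0.0f;                        /* accumulator */
        for (int i = 0; i < L && i <= n; i++)
        {
            s += a[i] * x[n - i];              /* multiply/accumulate */
        }
        y[n] = s;
    }
}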
[Figure: DSP multiply/accumulate datapath. Labels: ai, x(n-i), Y RAM, Multiply/Accumulate, Accumulator, y(n)]
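On a DSP processor the multiply/accumulate loop typically runs once per incoming sample, with past samples held in a circular buffer. The sketch below imitates that in plain C. It is not tied to any particular processor: the names `FirState`, `fir_push` and `FIR_TAPS`, and the choice of three taps, are hypothetical.

```c
#include <stddef.h>

#define FIR_TAPS 3                             /* hypothetical filter length L */

typedef struct {
    double delay[FIR_TAPS];                    /* circular buffer of x(n-i) */
    size_t head;                               /* index of the newest sample */
} FirState;

/* Store one new input sample and return the output y(n).
 * a[i] are the coefficients ai. The state must start zero-initialised. */
double fir_push(FirState *st, const double *a, double sample)
{
    /* step back one slot (with wrap-around) and overwrite the oldest sample */
    st->head = (st->head + FIR_TAPS - 1) % FIR_TAPS;
    st->delay[st->head] = sample;

    double acc = 0.0;                          /* the accumulator */
    size_t idx = st->head;
    for (size_t i = 0; i < FIR_TAPS; i++) {
        acc += a[i] * st->delay[idx];          /* ai * x(n-i) */
        idx = (idx + 1) % FIR_TAPS;            /* move to the next older sample */
    }
    return acc;
}
```

Feeding a unit impulse (1, 0, 0, 0) into a filter with coefficients 1, 2, 3 returns 1, 2, 3, 0, which is the impulse response, as expected for an FIR filter.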
[Figure: overview of DSP-related work areas. Labels: Speech, Audio, Mobile, Services, Administration, System modelling, Speech compression, MP3, Baseband processing, Firmware design, Speech recognition, Channel coding, CAD Tools, System Integration, Speech synthesis, Performance Assessment, AMR Coding, Computer & Networking, Speech enhancement, Speech Testing]
COSSAP
Matlab ...
Behavioural, RTL, Logic ...