Sunteți pe pagina 1din 15

www.jntuworld.

com

Audio Coding
Introduction
S. R. M. Prasanna
Dept of ECE,
IIT Guwahati,
prasanna@iitg.ernet.in

Audio Coding p. 1/1

www.jntuworld.com

Goal of Audio Coding


Terms Coding and Compression are used
interchangeably.
Goal of audio coding is to develop methods for compact
digital representation of audio signals.
Efficient transmission or storage.
Minimum number of bits with transparent perceptual
quality.

Audio Coding p. 2/1

www.jntuworld.com

First Generation Audio Coders


Digital representation of audio signals.
Compact Disc (CD) is the digital storage medium.
Sampling frequency is 44.1 kHz and Bit rate 16
bits/sample
20 kHz audio spectrum + 2.05 guard band = 22.05
kHz
Sampling freq = 22.05 2 = 44.1kHz .
Data rate:
44100 16 = 705.6 kb/s for mono
705.6 2 = 1.41 Mb/s for stereo

Audio Coding p. 3/1

For network and wireless multimedia digital audio.

www.jntuworld.com

Second Generation Audio Coders


Bandwidth is the severe constraint.
At the same time, end-users need CD quality.
Conflicting requirements.
Goal is to reduce data rate without compromising on
the perceptual quality.
Led to several audio compression algorithms.
Exploit both perceptual irrelevancies and statistical
redundancies.

Audio Coding p. 4/1

Lossless audio

www.jntuworld.com

Third Generation Audio Coders


Spatial audio
Real-time source localization
Head related transfer function (HRTF)
Immersive audio

Audio Coding p. 5/1

www.jntuworld.com

Audio Coding Methods


PCM (1.41 Mb/s).
DPCM (0.75 x PCM data rate).
ADPCM (0.5 x PCM data rate).
Not much data rate reduction.
Need for high compression methods driven by potential
applications.
New approaches for audio coding based on the
principles of psychoacoustics.

Audio Coding p. 6/1

www.jntuworld.com

Psychoacoustics
Characterizing human auditory perception.
Time-frequency analysis capabilities of the inner ear.
Perceptually irrelevant audio signal information.
Contributions from psychoacoustics:
Perceptual entropy
Auditory filter bank
Perceptual entropy deals with estimate of the
fundamental limit of transparent audio signal
compression.
Auditory filter bank based on the time-frequency
analysis capabilities of the inner ear.

Audio Coding p. 7/1

MPEG-1 Audio (1992).

www.jntuworld.com

Some Audio Coding Standards


MPEG-2 Audio (1996).
MPEG-4 Audio v1 (1999).
MPEG-4 Audio v2 (2000)

Audio Coding p. 8/1

www.jntuworld.com

Block Diagram of Generic Audio Coder

Audio Coding p. 9/1

Segment input signals into quasi-stationary frames of


2-50 ms.

www.jntuworld.com

Principle of Generic Audio Coder

Time-frequency analysis estimates the temporal and


spectral components of each frame.
TFA approach employed is based on human auditory
system.
Objective is to extract a set of time-frequency
parameters that are robust to quantization according to
a perceptual distortion metric.
Perceptual distortion control is achieved by a
psychoacoustic signal analysis section that estimates
signal masking power based on psychoacoustic
principles.

Audio Coding p. 10/1

Psychoacoustic model delivers masking thresholds that


quantify the maximum amount of distortion at each
point in the time-frequency plane such that quantization
of the time-frequency parameters does not introduce
audible artifacts.

www.jntuworld.com

Principle of AC (contd.)

Psychoacoustic model allows the quantization section


to exploit perceptual irrelevancies.
Final redundancy removal based on the perceptual
entropy coding scheme.

Audio Coding p. 11/1

www.jntuworld.com

Audio Coder Attributes


Audio reproduction quality.
Operating bit rates.
Computational complexity.
Codec delay.
Channel error robustness.
High quality audio at low bit rates (< 32 kb/s) with an
acceptable algorithm delay (5-20 ms), and with low
computational complexity (1-10 MIPS).

Audio Coding p. 12/1

Based on the signal model or analysis-synthesis


technique.

www.jntuworld.com

Types of Audio Coders

LP
Transform
Subband
Sinusoidal

Audio Coding p. 13/1

www.jntuworld.com

AC-Expt.1
Effect of Sampling Frequency and Bit Resolution
Objective is to analyze the effect of sampling frequency
and bit resolution on the perceptual quality of audio.
Take a CD quality music signal of 1 sec, sampled at
44.1 kHz with 16 bits/sample and perform the following.
Change its sampling frequency to 16, 8 and 4 kHz.
Keep bit resolution constant at 16 bits/sample.
Consider about 50 ms segment in a high energy
region.
Plot the time domain and DFT spectra for all the
four cases.
Comment on the effect of different sampling
frequency.
Comment also on the perceptual quality of the
audio.

Audio Coding p. 14/1

www.jntuworld.com

AC-Expt.1
Effect of Sampling Frequency and Bit Resolution
Change its bit resolution to 8, 4 and 1 bits/sample.
Keep sampling frequency constant at 44.1 kHz.
Consider the same 50 ms segment in a high energy
region.
Plot the time domain and DFT spectra for all the
four cases.
Comment on the effect of different bit resolutions.
Comment also on the perceptual quality of the
audio.

Audio Coding p. 15/1

S-ar putea să vă placă și