Acoustic Modeling
Introduction and Methodology
Fellowship in collaboration with Prof. Carlos Teixeira, FCUL
Carla Simões
t-carlas@microsoft.com
Overview
• Introduction
– Speech Components
– What are Acoustic Models?
– Why use them?
• Methodology
– Training Acoustic Models
• Modelling
– English Spoken by Portuguese speakers
(Diagram: Speech Waveform → Front End → sequence of symbols S1 S2 S3 …)
• Corpus description
– 4689 utterances covering a vocabulary of 227 words
– Audio files sampled at 8 kHz, 16-bit linear
– 11 male speakers
• Model settings
– 3468 utterances for training, 1221 utterances for testing
– 43 minutes of speech
– 1200 senones
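The training and test counts above add up to the full corpus (3468 + 1221 = 4689), which puts roughly 74% of the utterances in training. A minimal sketch of such a partition follows. It assumes a seeded random shuffle and made-up utterance IDs; the slides do not say how the split was actually drawn (for example, whether it was done per speaker), so treat the function and names as hypothetical.

```python
import random

# Counts taken from the slides.
TOTAL_UTTERANCES = 4689
TRAIN_COUNT = 3468
TEST_COUNT = 1221


def split_corpus(utterance_ids, train_count, seed=0):
    """Shuffle the IDs with a fixed seed and cut them into train/test lists.

    Assumption: a plain random split. The original split method is not
    described in the slides.
    """
    ids = list(utterance_ids)
    random.Random(seed).shuffle(ids)
    return ids[:train_count], ids[train_count:]


# Hypothetical utterance IDs, used only for illustration.
utterance_ids = [f"utt{i:04d}" for i in range(TOTAL_UTTERANCES)]
train_ids, test_ids = split_corpus(utterance_ids, TRAIN_COUNT)

# Share of the corpus used for training.
train_fraction = len(train_ids) / TOTAL_UTTERANCES
```

With only 11 speakers, a per-speaker split may be preferable when the goal is to measure performance on unseen voices; the sketch does not attempt that.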
Modelling
English Spoken by Portuguese Speakers
(Diagram 1: English spoken by Portuguese corpus → Training → New Model; Test corpus → Testing)
(Diagram 2: English spoken by Portuguese corpus → Update of ENU Model → New Model; Test corpus → Testing)
Modelling
English Spoken by Portuguese Speakers
(Diagram: English spoken by Portuguese corpus, with English to Portuguese mapped phoneset, plus PTG corpus → Training → New Model; Test corpus → Testing)
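The English to Portuguese mapped phoneset on this slide can be pictured as a lookup table applied to each pronunciation in the lexicon. The sketch below is illustrative only: the phone symbols and the individual mappings are invented placeholders, not the mapping used in this work, and the function names are hypothetical.

```python
# Hypothetical English-to-Portuguese phone mapping.
# These entries are placeholders for illustration, not the real phoneset.
EN_TO_PT_PHONES = {
    "th": "t",
    "dh": "d",
    "ih": "i",
}


def map_pronunciation(phones, mapping=EN_TO_PT_PHONES):
    """Replace each English phone with its mapped Portuguese phone.

    Phones without an entry in the mapping are passed through unchanged.
    """
    return [mapping.get(phone, phone) for phone in phones]


def map_lexicon(lexicon, mapping=EN_TO_PT_PHONES):
    """Apply the phone mapping to every word in a pronunciation lexicon."""
    return {word: map_pronunciation(phones, mapping)
            for word, phones in lexicon.items()}


# Toy lexicon with made-up transcriptions.
example_lexicon = {
    "this": ["dh", "ih", "s"],
    "three": ["th", "r", "iy"],
}
mapped_lexicon = map_lexicon(example_lexicon)
```

Passing unmapped phones through unchanged is a design choice of this sketch; a real mapping would need an explicit decision for every English phone.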
Future Work
• The improvement of Acoustic Models requires gathering
hundreds of hours of speech data
Possible solutions:
• Define new phonesets, which implies a phonetic study of how
Portuguese speakers pronounce English
• Train the native models with the English spoken by
Portuguese corpus
www.microsoft.com/portugal/mldc