Spoken Language Technology. Speaker Recognition

Course in the BIOSINF Master's programme (1st year)

Teachers

Teacher: Prof. Corneliu Burileanu
Teaching Assistant: Assoc.Prof. Horia Cucu

Course Description

The “Spoken Language Technology. Speaker Recognition” course presents the key concepts of speech analysis (production and modeling, perception, feature extraction, etc.) and speech synthesis. It also approaches speech analysis and synthesis from a hardware point of view, discussing speech signal processors. Finally, the course covers the main concepts of speaker recognition, such as speaker modeling, classification and decision.

The laboratory familiarizes students with the properties of speech signals. It starts with speech analysis in the time and frequency domains and continues with the configuration of several feature extraction methods. It then covers several speech processing techniques: pitch estimation and speaker recognition with Dynamic Time Warping (DTW) and Gaussian Mixture Models (GMMs).
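As an illustration of the pitch estimation topic, the following is a minimal sketch of one common approach, the autocorrelation method. It is not taken from the laboratory materials; the function name `estimate_pitch`, the search range of 50 to 400 Hz and the single-frame interface are assumptions made for this example.

```python
import math


def estimate_pitch(frame, sample_rate, f_min=50.0, f_max=400.0):
    """Estimate the pitch (Hz) of one voiced frame by autocorrelation.

    Hypothetical helper for illustration: searches lags that correspond
    to fundamental frequencies between f_min and f_max.
    """
    lag_min = max(1, int(sample_rate / f_max))
    lag_max = min(int(sample_rate / f_min), len(frame) - 1)

    # Remove the DC component so it does not dominate the autocorrelation.
    mean = sum(frame) / len(frame)
    x = [s - mean for s in frame]

    best_lag, best_value = None, 0.0
    for lag in range(lag_min, lag_max + 1):
        # Unnormalized autocorrelation at this lag.
        r = sum(x[i] * x[i + lag] for i in range(len(x) - lag))
        if r > best_value:
            best_lag, best_value = lag, r

    # No positive peak found (e.g. silence): report no pitch.
    if best_lag is None:
        return None
    return sample_rate / best_lag


# Usage: a synthetic 100 Hz sine sampled at 8 kHz (50 ms frame).
sample_rate = 8000
frame = [math.sin(2 * math.pi * 100 * n / sample_rate) for n in range(400)]
pitch = estimate_pitch(frame, sample_rate)
```

A practical estimator would also decide whether the frame is voiced at all and smooth the estimates across frames; this sketch handles only a single clean frame.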

Course

The importance of speech analysis and synthesis systems
Speech recognition strategy
Acoustic and phonetic processor structures
Speech signal analysis techniques
Speech recognition techniques
Artificial Neural Networks – ANN
Speaker recognition

Laboratory

Speech processing in the time domain and the frequency domain
Speech feature extraction methods
Pitch estimation
Speaker recognition with Dynamic Time Warping (DTW)
Speaker recognition with Gaussian Mixture Models (GMMs)
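
To give an idea of how speaker recognition with DTW can work, here is a minimal sketch, assuming each utterance is already represented as a sequence of feature vectors and each enrolled speaker has one reference template. It is not the laboratory's implementation; the names `dtw_distance` and `identify_speaker`, the Euclidean local distance and the toy one-dimensional features are assumptions made for this example.

```python
import math


def dtw_distance(a, b):
    """Return the accumulated DTW cost between two feature-vector sequences."""
    n, m = len(a), len(b)
    # cost[i][j] = best accumulated cost aligning a[:i] with b[:j].
    cost = [[math.inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            local = math.dist(a[i - 1], b[j - 1])  # Euclidean distance
            cost[i][j] = local + min(
                cost[i - 1][j],      # a advances, b repeats a frame
                cost[i][j - 1],      # b advances, a repeats a frame
                cost[i - 1][j - 1],  # both advance
            )
    return cost[n][m]


def identify_speaker(test, templates):
    """Return the name of the template with the lowest DTW cost to `test`."""
    return min(templates, key=lambda name: dtw_distance(test, templates[name]))


# Usage with toy one-dimensional "features" (hypothetical data).
templates = {
    "speaker_a": [(0.0,), (1.0,), (2.0,)],
    "speaker_b": [(5.0,), (6.0,), (7.0,)],
}
test_utterance = [(0.0,), (0.0,), (1.0,), (2.0,)]
best_match = identify_speaker(test_utterance, templates)
```

DTW absorbs differences in speaking rate by allowing frames to be repeated during alignment, which is why the stretched test utterance still matches its template at zero cost.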

Download

The course slides and the laboratory papers are available on Moodle.

Grading

Laboratory (semester project + oral evaluation): 50%
Course final exam (oral evaluation): 50%