CZECH TECHNICAL UNIVERSITY IN PRAGUE
STUDY PLANS
2018/2019

Phonetic signals and their coding

Code: XP31FSK
Completion: ZK (exam)
Credits: 4
Range: 2+2s
Language: Czech
Lecturer:
Jan Uhlíř (guarantor), Petr Pollák
Tutor:
Jan Uhlíř (guarantor), Petr Pollák
Supervisor:
Department of Circuit Theory
Synopsis:

The course introduces speech signal processing. Students will master basic through advanced, modern algorithms for speech analysis, synthesis, coding, and enhancement. A substantial part of the course focuses on speech recognition, where students become familiar with modern, advanced techniques for tasks such as small- and large-vocabulary speech recognition and speaker recognition. Special attention is devoted to classification techniques based on GMM, DTW, HMM, ANN/DNN, WFST, JFA, i-vectors, etc.

Requirements:

Knowledge of digital signal processing is assumed as a prerequisite.

Syllabus of lectures:

1. Speech production and perception model, phonetic description of speech

2. Spectral characteristics of speech (DFT, LPC, filter banks)

3. Cepstral representation of speech and possible applications

4. Voice activity detection and speech enhancement

5. Speech synthesis

6. Speech coding

7. Basic and advanced feature extraction techniques (PCA, LDA)

8. Classification approaches for particular ASR tasks (GMM, HMM, VQ)

9. Modern methods of speaker verification and identification (UBM-GMM, JFA, i-vectors)

10. DTW- and HMM-based speech recognition

11. Continuous speech recognition, language modelling, WFST

12. Adaptation techniques in speech recognition

13. Modern ASR systems based on ANN/DNN, methods of deep learning

14. Reserve

Syllabus of tutorials:

Seminars are organized as joint consultations for registered students. The main focus is on students' individual work during the semester on individually chosen topics. Solutions to the individual projects are discussed at the joint consultations.

Study Objective:
Study materials:

[1] Deller, J.R. - Hansen, J.H.L. - Proakis, J.G.: Discrete-time processing of speech signals, New York: IEEE Press 2000, 908 p., ISBN 0-7803-5386-2

[2] http://www.ee.ic.ac.uk/hp/staff/dmb/courses/speech/speech.htm

Note:
Time-table for winter semester 2018/2019:
Time-table is not available yet
Time-table for summer semester 2018/2019:
Tue 10:00–11:45, Uhlíř J., room not listed (lecture parallel 1)
Tue 11:45–13:30, Uhlíř J., room not listed (lecture parallel 1, parallel nr. 101)
The course is a part of the following study plans:
Data valid to 2019-02-22
For updated information see http://bilakniha.cvut.cz/en/predmet11845104.html