Thesis Details

Robustní rozpoznávání mluvčího

Bachelor's Thesis Student: Profant Ján Academic Year: 2015/2016 Supervisor: Matějka Pavel, Ing., Ph.D.
English title
Robust Speaker Verification
Language
Czech
Abstract

The goal of this paper is to analyze the impact of codec degraded speech on a state-ofthe-art speaker recognition system. Two feature extraction techniques are analyzed - Mel Frequency Cepstral Coefficients (MFCC) and the state-of-the-art system using Bottleneck features together with MFCC. Speaker recognition system is based on i-vector and Probabilistic Linear Discriminant Analysis (PLDA). We compared scenarios where PLDA is trained only on clean data, then system where we added also noise and reverberant data, and at last, codec degraded speech. We evaluated the systems on the matched conditions (data from the same codec are seen with PLDA) and also mismatched conditions (PLDA does not see any data from the tested codec). We experimented also with recently introduced technique for channel adaptation - Within-class Covariance Correction (WCC). We can see clear benefit of adding transcoded data to PLDA or WCC (with approximately same gain) for both tested conditions (matched and mismatched).

Keywords

speaker verification, Probabilistic Linear Discriminant Analysis, Within-class Covariance Correction, i-vector

Department
Degree Programme
Information Technology
Files
Status
defended, grade B
Date
15 June 2016
Reviewer
Committee
Černocký Jan, prof. Dr. Ing. (DCGM FIT BUT), předseda
Bidlo Michal, doc. Ing., Ph.D. (DCSY FIT BUT), člen
Drahanský Martin, prof. Ing., Dipl.-Ing., Ph.D. (DITS FIT BUT), člen
Rychlý Marek, RNDr., Ph.D. (DIFS FIT BUT), člen
Španěl Michal, Ing., Ph.D. (DCGM FIT BUT), člen
Citation
PROFANT, Ján. Robustní rozpoznávání mluvčího. Brno, 2016. Bachelor's Thesis. Brno University of Technology, Faculty of Information Technology. 2016-06-15. Supervised by Matějka Pavel. Available from: https://www.fit.vut.cz/study/thesis/18679/
BibTeX
@bachelorsthesis{FITBT18679,
    author = "J\'{a}n Profant",
    type = "Bachelor's thesis",
    title = "Robustn\'{i} rozpozn\'{a}v\'{a}n\'{i} mluv\v{c}\'{i}ho",
    school = "Brno University of Technology, Faculty of Information Technology",
    year = 2016,
    location = "Brno, CZ",
    language = "czech",
    url = "https://www.fit.vut.cz/study/thesis/18679/"
}
Back to top