Thesis Details

Microphone Arrays for Speaker Recognition

Master's Thesis Student: Mošner Ladislav Academic Year: 2016/2017 Supervisor: Černocký Jan, prof. Dr. Ing.
Czech title
Microphone Arrays for Speaker Recognition
Language
English
Abstract

This thesis addresses the problem of remote speaker recognition. The accuracy of standard speaker recognition decreases considerably in the presence of far-field data, therefore, we devised two strategies to improve the results. First, we employed a microphone array (purposely positioned set of microphones) that is able to steer a virtual "beam" to the position of the speaker. We also performed system adaptation of different parts of the system (PLDA scoring and i-vector extraction). We have synthesized our training and test data from the standard NIST 2010 data by room simulation and we have shown that both techniques and their combination significantly improve the results. We have also dealt with joint speaker identity and position estimation. While the results in simulated outdoor environment (reverberation-free) are encouraging, the results from interiors (with reverberation) are mixed and require further investigation. Finally, we were able to test our system on a limited amount of real re-transmitted data. While the results for male speakers match the simulation, the results for females are not convincing and need further analysis.

Keywords

Speaker recognition, microphone arrays, beamforming, speaker localization, i-vector, room impulse response

Department
Degree Programme
Information Technology, Field of Study Computer Graphics and Multimedia
Files
Status
defended, grade A
Date
22 June 2017
Reviewer
Committee
Zemčík Pavel, prof. Dr. Ing. (DCGM FIT BUT), předseda
Beran Vítězslav, doc. Ing., Ph.D. (DCGM FIT BUT), člen
Herout Adam, prof. Ing., Ph.D. (DCGM FIT BUT), člen
Rychlý Marek, RNDr., Ph.D. (DIFS FIT BUT), člen
Sochor Jiří, prof. Ing., CSc. (FI MUNI), člen
Szőke Igor, Ing., Ph.D. (DCGM FIT BUT), člen
Citation
MOŠNER, Ladislav. Microphone Arrays for Speaker Recognition. Brno, 2017. Master's Thesis. Brno University of Technology, Faculty of Information Technology. 2017-06-22. Supervised by Černocký Jan. Available from: https://www.fit.vut.cz/study/thesis/19199/
BibTeX
@mastersthesis{FITMT19199,
    author = "Ladislav Mo\v{s}ner",
    type = "Master's thesis",
    title = "Microphone Arrays for Speaker Recognition",
    school = "Brno University of Technology, Faculty of Information Technology",
    year = 2017,
    location = "Brno, CZ",
    language = "english",
    url = "https://www.fit.vut.cz/study/thesis/19199/"
}
Back to top