Improving Robustnes in Automatic Speaker Recognition

Czech title:Zvýšení spolehlivosti v automatickém rozpoznávání řečníka
Research leader:Glembek Ondřej
Team leaders:Fér Radek, Novotný Ondřej
Agency:Czech Science Foundation
Code:GJ17-23870Y
Start:2017-01-01
End:2019-12-31
Keywords:automatic speaker recognition;robustness;adaptation;speech
Annotation:
Speaker recognition systems have gained very high recognition performance in the recent years. However, it has been shown that system performance degrades when the recognition data domain differs from the one used for system parameter training. Also, introducing additive noise (e.g. background traffic noise), convolutive noise (e.g. reverb of the room), or channel noise (e.g. telephone codec) to the recording further degrades the performance. The solutions to these issues are to a) seek for techniques for robust modeling, and b) to develop methods for system adaptation. In this project, we want to focus on both of these approaches.

Project description:
The goals of this project are research, development and analysis of universal adaptation techniques and techniques for increasing robustness in automatic speaker recognition.

Publications

2018DIEZ Sánchez Mireia, LANDINI Federico Nicolás, BURGET Lukáš, ROHDIN Johan A., SILNOVA Anna, ŽMOLÍKOVÁ Kateřina, NOVOTNÝ Ondřej, VESELÝ Karel, GLEMBEK Ondřej, PLCHOT Oldřich, MOŠNER Ladislav and MATĚJKA Pavel. BUT system for DIHARD Speech Diarization Challenge 2018. In: Proceedings of Interspeech 2018. Hyderabad: International Speech Communication Association, 2018, pp. 2798-2802. ISSN 1990-9770.
 MOŠNER Ladislav, MATĚJKA Pavel, NOVOTNÝ Ondřej and ČERNOCKÝ Jan. Dereverberation and Beamforming in Far-Field Speaker Recognition. In: Proceedings of ICASSP 2018. Calgary: IEEE Signal Processing Society, 2018, pp. 5254-5258. ISBN 978-1-5386-4658-8.
 MOŠNER Ladislav, PLCHOT Oldřich, MATĚJKA Pavel, NOVOTNÝ Ondřej and ČERNOCKÝ Jan. Dereverberation and Beamforming in Robust Far-Field Speaker Recognition. In: Proceedings of Interspeech 2018. Hyderabad: International Speech Communication Association, 2018, pp. 1334-1338. ISSN 1990-9770.
 NOVOTNÝ Ondřej, MATĚJKA Pavel, PLCHOT Oldřich and GLEMBEK Ondřej. On the use of DNN Autoencoder for Robust Speaker Recognition. Brno: Faculty of Information Technology BUT, 2018.
 NOVOTNÝ Ondřej, PLCHOT Oldřich, MATĚJKA Pavel, MOŠNER Ladislav and GLEMBEK Ondřej. On the use of X-vectors for Robust Speaker Recognition. In: Proceedings of Odyssey 2018. Les Sables d´Olonne: International Speech Communication Association, 2018, pp. 168-175. ISSN 2312-2846.
 PLCHOT Oldřich, MATĚJKA Pavel, NOVOTNÝ Ondřej, CUMANI Sandro, LOZANO-DIEZ Alicia, SLAVÍČEK Josef, DIEZ Sánchez Mireia, GRÉZL František, GLEMBEK Ondřej, KAMSALI Veera Mounika, SILNOVA Anna, BURGET Lukáš, ONDEL Lucas, KESIRAJU Santosh and ROHDIN Johan A. Analysis of BUT-PT Submission for NIST LRE 2017. In: Proceedings of Odyssey 2018 The Speaker and Language Recognition Workshop. Les Sables d'Olonne: International Speech Communication Association, 2018, pp. 47-53. ISSN 2312-2846.
 ROHDIN Johan A., SILNOVA Anna, DIEZ Sánchez Mireia, PLCHOT Oldřich, MATĚJKA Pavel and BURGET Lukáš. End-to-End DNN Based Speaker Recognition Inspired by i-Vector and PLDA. In: Proceedings of ICASSP. Calgary: IEEE Signal Processing Society, 2018, pp. 4874-4878. ISBN 978-1-5386-4658-8.
 SILNOVA Anna, MATĚJKA Pavel, GLEMBEK Ondřej, PLCHOT Oldřich, NOVOTNÝ Ondřej, GRÉZL František, SCHWARZ Petr and ČERNOCKÝ Jan. BUT/Phonexia Bottleneck Feature Extractor. In: Proceedings of Odyssey 2018. Les Sables d´Olonne: International Speech Communication Association, 2018, pp. 283-287. ISSN 2312-2846.
2017PLCHOT Oldřich, MATĚJKA Pavel, SILNOVA Anna, NOVOTNÝ Ondřej, DIEZ Sánchez Mireia, ROHDIN Johan A., GLEMBEK Ondřej, BRÜMMER Niko, SWART Albert du Preez, PRIETO Jesús J., GARCIA Perera Leibny Paola, BUERA Luis, KENNY Patrick, ALAM Jahangir and BHATTACHARYA Gautam. Analysis and Description of ABC Submission to NIST SRE 2016. In: Proceedings of Interspeech 2017. Stockholm: International Speech Communication Association, 2017, pp. 1348-1352. ISSN 1990-9772.
 SILNOVA Anna, BURGET Lukáš and ČERNOCKÝ Jan. Alternative Approaches to Neural Network based Speaker Verification. In: Proceedings of Interspeech 2017. Stockholm: International Speech Communication Association, 2017, pp. 1572-1575. ISSN 1990-9772.

Your IPv4 address: 54.82.73.21
Switch to IPv6 connection

DNSSEC [dnssec]