Conference paper

NOVOTNÝ Ondřej, MATĚJKA Pavel, PLCHOT Oldřich, GLEMBEK Ondřej, BURGET Lukáš and ČERNOCKÝ Jan. Analysis of Speaker Recognition Systems in Realistic Scenarios of the SITW 2016 Challenge. In: Proceedings of Interspeech 2016. San Francisco: International Speech Communication Association, 2016, pp. 828-832. ISBN 978-1-5108-3313-5. Available from: https://www.researchgate.net/publication/307889224_Analysis_of_Speaker_Recognition_Systems_in_Realistic_Scenarios_of_the_SITW_2016_Challenge
Publication language:english
Original title:Analysis of Speaker Recognition Systems in Realistic Scenarios of the SITW 2016 Challenge
Title (cs):Analýza systémů pro ověřování mluvčího v realistických podmínkách SITW 2016 Challenge
Pages:828-832
Proceedings:Proceedings of Interspeech 2016
Conference:Interspeech 2016
Place:San Francisco, US
Year:2016
URL:https://www.researchgate.net/publication/307889224_Analysis_of_Speaker_Recognition_Systems_in_Realistic_Scenarios_of_the_SITW_2016_Challenge
ISBN:978-1-5108-3313-5
Publisher:International Speech Communication Association
URL:http://www.fit.vutbr.cz/research/groups/speech/publi/2016/novotny_interspeech2016_IS160981.pdf [PDF]
Files: 
+Type Name Title Size Last modified
iconnovotny_interspeech2016_IS160981.pdf393 KB2016-09-19 17:02:51
^ Select all
With selected:
Keywords
speaker recognition, SRE systems, diarization
Annotation
In this paper, we summarize our efforts for the Speakers In The Wild (SITW) challenge, and we present our findings with this new dataset for speaker recognition. Apart from the standard comparison of different SRE systems, we analyze the use of diarization for dealing with audio segments containing multiple speakers, as in part of the newly introduced enrollment and test protocols, diarization is a necessary system component. Our state-of-the-art systems used in this work utilize both cepstral and DNN-based bottleneck features and are based on i-vectors followed by Probabilistic Linear Discriminant Analysis (PLDA) classifier and logistic regression calibration/fusion. We present both narrow-band (8 kHz) and wide-band (16 kHz) systems together with their fusions.
Abstract
In this paper, we summarize our efforts for the Speakers In The Wild (SITW) challenge, and we present our findings with this new dataset for speaker recognition. Apart from the standard comparison of different SRE systems, we analyze the use of diarization for dealing with audio segments containing multiple speakers, as in part of the newly introduced enrollment and test protocols, diarization is a necessary system component. Our state-of-the-art systems used in this work utilize both cepstral and DNN-based bottleneck features and are based on i-vectors followed by Probabilistic Linear Discriminant Analysis (PLDA) classifier and logistic regression calibration/fusion. We present both narrow-band (8 kHz) and wide-band (16 kHz) systems together with their fusions.
BibTeX:
@INPROCEEDINGS{
   author = {Ond{\v{r}}ej Novotn{\'{y}} and Pavel Mat{\v{e}}jka and
	Old{\v{r}}ich Plchot and Ond{\v{r}}ej Glembek and
	Luk{\'{a}}{\v{s}} Burget and Jan {\v{C}}ernock{\'{y}}},
   title = {Analysis of Speaker Recognition Systems in Realistic
	Scenarios of the SITW 2016 Challenge},
   pages = {828--832},
   booktitle = {Proceedings of Interspeech 2016},
   year = {2016},
   location = {San Francisco, US},
   publisher = {International Speech Communication Association},
   ISBN = {978-1-5108-3313-5},
   language = {english},
   url = {http://www.fit.vutbr.cz/research/view_pub.php?id=11270}
}

Your IPv4 address: 54.158.212.93
Switch to IPv6 connection

DNSSEC [dnssec]