Conference paper

PLCHOT Oldřich, MATSOUKAS Spyros, MATĚJKA Pavel, DEHAK Najim, MA Jeff, CUMANI Sandro, GLEMBEK Ondřej, HEŘMANSKÝ Hynek, MESGARANI Nima, SOUFIFAR Mehdi Mohammad, THOMAS Samuel, ZHANG Bing and ZHOU Xinhui et al. Developing A Speaker Identification System For The DARPA RATS Project. In: Proceedings of ICASSP 2013. Vancouver: IEEE Signal Processing Society, 2013, pp. 6768-6772. ISBN 978-1-4799-0355-9.
Publication language:English
Original title:Developing A Speaker Identification System For The DARPA RATS Project
Title (cs):Vývoj systému identifikace řečníka pro DARPA RATS projekt
Pages:6768-6772
Proceedings:Proceedings of ICASSP 2013
Conference:38th International Conference on Acoustics, Speech, and Signal Processing
Place:Vancouver, CA
Year:2013
ISBN:978-1-4799-0355-9
Publisher:IEEE Signal Processing Society
URL:http://www.fit.vutbr.cz/research/groups/speech/publi/2013/plchot_icassp2013_0006768.pdf [PDF]
Keywords
speaker identification, noisy speech processing
Annotation
This paper focuses on the development of a speaker identification system for the DARPA RATS project.
Abstract
This paper describes the speaker identification (SID) system developed by the Patrol team for the first phase of the DARPA RATS (Robust Automatic Transcription of Speech) program, which seeks to advance state-of-the-art detection capabilities on audio from highly degraded communication channels. We present results using multiple SID systems differing mainly in the algorithm used for voice activity detection (VAD) and feature extraction. We show that (a) unsupervised VAD performs as well as supervised methods in terms of downstream SID performance, (b) noise-robust feature extraction methods such as CFCCs outperform MFCC front-ends on noisy audio, and (c) fusion of multiple systems provides a 24% relative improvement in EER compared to the single best system when using a novel SVM-based fusion algorithm that uses side information such as gender, language, and channel ID.
BibTeX:
@INPROCEEDINGS{
   author = {Old{\v{r}}ich Plchot and Spyros Matsoukas and Pavel
	Mat{\v{e}}jka and Najim Dehak and Jeff Ma and Sandro Cumani
	and Ond{\v{r}}ej Glembek and Hynek He{\v{r}}mansk{\'{y}} and
	Nima Mesgarani and Mohammad Mehdi Soufifar and Samuel Thomas
	and Bing Zhang and Xinhui Zhou},
   title = {Developing A Speaker Identification System For The DARPA
	RATS Project},
   pages = {6768--6772},
   booktitle = {Proceedings of ICASSP 2013},
   year = {2013},
   location = {Vancouver, CA},
   publisher = {IEEE Signal Processing Society},
   ISBN = {978-1-4799-0355-9},
   language = {english},
   url = {http://www.fit.vutbr.cz/research/view_pub.php?id=10313}
}
