Department of Computer Graphics and Multimedia

RATS Patrol - Robust Automatic Transcription of Speech Patrol

Czech title:RATS Patrol - Robustní automatická transkripce řeči
Reseach leader:Matějka Pavel
Team leaders:Andrla Petr, Cipr Tomáš, Černocký Jan, Grézl František, Chalupníček Kamil, Otáhalová Sylva, Szőke Igor
Agency:Raytheon BBN Technologies
Code:Contract D10PC20015
Start:2010-09-23
End:2014-06-30
Keywords:speech recognition, speaker recognition, language recognition, keyword spotting, robustness, noise, transmission channels
Annotation:
Existing speech signal processing technologies are inadequate for most noisy or degraded speech signals that are important to military intelligence. The Robust Automatic Transcription of Speech (RATS) program is creating algorithms and software for performing the following tasks on potentially speech-containing signals received over communication channels that are extremely noisy and/or highly distorted: Speech Activity Detection, Language Identification, Speaker Identification and Key Word Spotting.

Publications

2014BAHARI Mohamad H., DEHAK Najim, VAN hamme Hugo, BURGET Lukáš, ALI Ahmed M. and GLASS Jim. Non-Negative Factor Analysis of Gaussian Mixture Model Weight Adaptation for Language and Dialect Recognition. IEEE/ACM TRANSACTIONS ON AUDIO, SPEECH AND LANGUAGE PROCESSING. New York City: IEEE Signal Processing Society, 2014, vol. 2014, no. 7, pp. 1117-1129. ISSN 2329-9290.
 CUMANI Sandro, LAFACE Pietro and PLCHOT Oldřich. On the use of i-vector posterior distributions in Probabilistic Linear Discriminant Analysis. IEEE/ACM TRANSACTIONS ON AUDIO, SPEECH AND LANGUAGE PROCESSING. New York City: IEEE Signal Processing Society, 2014, vol. 22, no. 4, pp. 846-857. ISSN 2329-9290.
 GLEMBEK Ondřej, MA Jeff, MATĚJKA Pavel, ZHANG Bing, PLCHOT Oldřich, BURGET Lukáš and MATSOUKAS Spyros. Domain Adaptation Via Within-class Covariance Correction in I-Vector Based Speaker Recognition Systerms. In: Proceedings of ICASSP 2014. Florencie: IEEE Signal Processing Society, 2014, pp. 4060-4064. ISBN 978-1-4799-2892-7.
 MARTÍNEZ González David, BURGET Lukáš, STAFYLAKIS Themos, LEI Yun, KENNY Patrick and LLEIDA Eduardo. Unscented Transform For Ivector-based Noisy Speaker Recognition. In: Proceedings of ICASSP 2014. Florencie: IEEE Signal Processing Society, 2014, pp. 4070-4074. ISBN 978-1-4799-2892-7.
 MATĚJKA Pavel, ZHANG Le, NG Tim, MALLIDI Sri Harish, GLEMBEK Ondřej, MA Jeff and ZHANG Bing. Neural Network Bottleneck Features for Language Identification. In: Proceedings of Odyssey 2014. Joensuu: International Speech Communication Association, 2014, pp. 299-304. ISSN 2312-2846.
 NG Tim, HSIAO Roger, ZHANG Le, KARAKOS Damianos, MALLIDI Sri Harish, KARAFIÁT Martin, VESELÝ Karel, SZŐKE Igor, ZHANG Bing, NGUYEN Long and SCHWARTZ Richard. Progress in the BBN Keyword Search System for the DARPA RATS Program. In: Proceedings of Interspeech 2014. Singapore: International Speech Communication Association, 2014, pp. 959-963. ISBN 978-1-63439-435-2.
 PLCHOT Oldřich, DIEZ Sánchez Mireia, SOUFIFAR Mehdi and BURGET Lukáš. PLLR Features in Language Recognition System for RATS. In: Proceedings of Interspeech 2014. Singapore: International Speech Communication Association, 2014, pp. 3048-3051. ISBN 978-1-63439-435-2.
2013CUMANI Sandro, BRUMMER Niko, BURGET Lukáš, LAFACE Pietro, PLCHOT Oldřich and VASILAKAKIS Vasileios. Pairwise Discriminative Speaker Verification in the I -Vector Space. IEEE Transactions on Audio, Speech, and Language Processing. 2013, vol. 2013, no. 6, pp. 1217-1227. ISSN 1558-7916.
 CUMANI Sandro, PLCHOT Oldřich and LAFACE Pietro. Probabilistic Linear Discriminant Analysis Of I-Vector Posterior Distributions. In: Proceedings of ICASSP 2013. Vancouver: IEEE Signal Processing Society, 2013, pp. 7644-7648. ISBN 978-1-4799-0355-9.
 PLCHOT Oldřich, MATSOUKAS Spyros, MATĚJKA Pavel, DEHAK Najim, MA Jeff, CUMANI Sandro, GLEMBEK Ondřej, HEŘMANSKÝ Hynek, MESGARANI Nima, SOUFIFAR Mehdi Mohammad, THOMAS Samuel, ZHANG Bing and ZHOU Xinhui et al. Developing A Speaker Identification System For The DARPA RATS Project. In: Proceedings of ICASSP 2013. Vancouver: IEEE Signal Processing Society, 2013, pp. 6768-6772. ISBN 978-1-4799-0355-9.
 SOUFIFAR Mehdi Mohammad, BURGET Lukáš, PLCHOT Oldřich, CUMANI Sandro and ČERNOCKÝ Jan. Regularized Subspace n-Gram Model for Phonotactic iVector Extraction. In: Proceedings of Interspeech 2013. Lyon: International Speech Communication Association, 2013, pp. 74-78. ISBN 978-1-62993-443-3. ISSN 2308-457X.
2012BRUMMER Niko, CUMANI Sandro, GLEMBEK Ondřej, KARAFIÁT Martin, MATĚJKA Pavel, PEŠÁN Jan, PLCHOT Oldřich, SOUFIFAR Mehdi Mohammad, DE Villiers Edward and ČERNOCKÝ Jan. Description and analysis of the Brno276 system for LRE2011. In: Proceedings of Odyssey 2012: The Speaker and Language Recognition Workshop. Singapur: International Speech Communication Association, 2012, pp. 216-223. ISBN 978-981-07-3093-2.
 D'HARO Luis Fernando, GLEMBEK Ondřej, PLCHOT Oldřich, MATĚJKA Pavel, SOUFIFAR Mehdi Mohammad, CORDOBA Ricardo and ČERNOCKÝ Jan. Phonotactic Language Recognition using i-vectors and Phoneme Posteriogram Counts. In: Proceedings of Interspeech 2012. Portland, Oregon: International Speech Communication Association, 2012, pp. 1-4. ISBN 978-1-62276-759-5. ISSN 1990-9772.
 LEI Yun, BURGET Lukáš and SCHEFFER Nicolas. Bilinear Factor Analysis for iVector Based Speaker Verification. In: Proceedings of Interspeech. Portland, Oregon: International Speech Communication Association, 2012, pp. 1-4. ISBN 978-1-62276-759-5.
 MATĚJKA Pavel, PLCHOT Oldřich, SOUFIFAR Mehdi Mohammad, GLEMBEK Ondřej, D'HARO Luis Fernando, VESELÝ Karel, GRÉZL František, MA Jeff, MATSOUKAS Spyros and DEHAK Najim. Patrol Team Language Identification System for DARPA RATS P1 Evaluation. In: Proceedings of Interspeech 2012. Portland, Oregon: International Speech Communication Association, 2012, pp. 1-4. ISBN 978-1-62276-759-5. ISSN 1990-9772.
 NG Tim, ZHANG Bing, NGUYEN Long, MATSOUKAS Spyros, ZHOU Xinhui, MESGARANI Nima, VESELÝ Karel and MATĚJKA Pavel. Developing a Speech Activity Detection System for the DARPA RATS Program. In: Proceedings of Interspeech 2012. Portland, Oregon: International Speech Communication Association, 2012, pp. 1-4. ISBN 978-1-62276-759-5. ISSN 1990-9772.
 PLCHOT Oldřich, KARAFIÁT Martin, BRUMMER Niko, GLEMBEK Ondřej, MATĚJKA Pavel, DE Villiers Edward and ČERNOCKÝ Jan. Speaker vectors from Subspace Gaussian Mixture Model as complementary features for Language Identification. In: Proceedings of Odyssey 2012, The Speaker and Language Recognition Workshop. Singapur: International Speech Communication Association, 2012, pp. 330-333. ISBN 978-981-07-3093-2.
2011MARTÍNEZ González David, PLCHOT Oldřich, BURGET Lukáš, GLEMBEK Ondřej and MATĚJKA Pavel. Language Recognition in iVectors Space. In: Proceedings of Interspeech 2011. Florence: International Speech Communication Association, 2011, pp. 861-864. ISBN 978-1-61839-270-1. ISSN 1990-9772.
 SOUFIFAR Mehdi, KOCKMANN Marcel, BURGET Lukáš, PLCHOT Oldřich, GLEMBEK Ondřej and SVENDSEN Torbjorn. iVector Approach to Phonotactic Language Recognition. In: Proceedings of Interspeech 2011. Florence: International Speech Communication Association, 2011, pp. 2913-2916. ISBN 978-1-61839-270-1. ISSN 1990-9772.

Your IPv4 address: 54.198.35.26
Switch to IPv6 connection

DNSSEC [dnssec]