Doc. Dr. Ing. Jan Černocký

Advancing the automatic language recognition using streamed audio media

Reseach leader:Černocký Jan
Team leaders:Kašpárek Tomáš, Matějka Pavel, Schwarz Petr
Agency:CESNET
Code:162/2005
Start:2006
End:2007
Files: 
+Type Name Title Size +Modified
iconprojekt-blade.pdf159 KB2006-02-20 07:57:04
iconform-blade.pdf88,1 KB2006-02-20 07:57:11
iconrozpocet-upravy.pdf37,5 KB2006-02-20 08:18:21
^ Select all
With selected:
Keywords:speech processing, language identification, parallel computing, unsupervised acquisition of speech data, streaming
Annotation:
The projects aims at massive usage of streamed audio for a qualitative improvement of LID (automatic language identification) system accuracy. The speech processing research group at Faculty of Information Technology, Brno University of Technology (Speech@FIT) disposes of a state-of-the-art LID system based on acoustic and phonotactic modeling. For further improvement of its accuracy, it is crucial to gather huge amounts of language-specific data. In the framework of this project, such data will be collected from available streamed sources (Internet radios), on-line stored, parameterized and processed. We will develop software for training of LID models. Resulting models and algorithms will be evaluated in international evaluation campaigns organized by NIST and in cooperation with Czech law enforcement forces.

Publications

2008Burget Lukáš, Schwarz Petr, Matějka Pavel, Hannemann Mirko, Rastrow Ariya, White Christopher, Khudanpur Sanjeev, Heřmanský Hynek, Černocký Jan: Combination of strongly and weakly constrained recognizers for reliable detection of OOVs, In: Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Las Vegas, US, IEEESP, 2008, p. 4, ISBN 1-4244-1484-9
 Plchot Oldřich, Hubeika Valiantsina, Burget Lukáš, Schwarz Petr, Matějka Pavel: Acquisition of Telephone Data from Radio Broadcasts with Applications to Language Recognition, In: Proc. 11th International Conference on Text, Speech and Dialogue, Berlin, DE, Springer, 2008, p. 477-483, ISBN 978-3-540-87390-7
2007Burget Lukáš, Matějka Pavel, Schwarz Petr, Glembek Ondřej, Černocký Jan: Analysis of feature extraction and channel compensation in GMM speaker recognition system, In: IEEE Transactions on Audio, Speech, and Language Processing, Vol. 15, No. 7, 2007, US, p. 1979-1986, ISSN 1558-7916
 Černocký Jan, Burget Lukáš, Schwarz Petr, Matějka Pavel, Karafiát Martin, Glembek Ondřej, Kopecký Jiří, Szőke Igor, Fapšo Michal, Grézl František, Hubeika Valiantsina, Oparin Ilya: Search in speech, language identification and speaker recognition in Speech@FIT, In: Proc. 17th International Conference Radioelektronika, 2007, Brno, CZ, UREL FEKT VUT, 2007, p. 1-6, ISBN 978-80-214-3390-8
 Černocký Jan, Szőke Igor, Fapšo Michal, Karafiát Martin, Burget Lukáš, Kopecký Jiří, Grézl František, Schwarz Petr, Glembek Ondřej, Oparin Ilya, Smrž Pavel, Matějka Pavel: Search in speech for public security and defense, In: Proc. IEEE Workshop on Signal Processing Applications for Public Security and Forensics, 2007 (SAFE '07), Washington D.C., US, IEEESP, 2007, p. 1-7, ISBN 1-4244-1226-9
 Fapšo Michal: Search in speech records, In: Proc. 13th Conference STUDENT EEICT 2007, Brno, CZ, FEKT VUT, 2007, p. 1-3, ISBN 978-80-214-3410-3
 Grézl František, Černocký Jan: TRAP-based Techniques for Recognition of Noisy Speech, In: Proc. 10th International Conference on Text Speech and Dialogue (TSD 2007), Berlin, DE, Springer, 2007, p. 270-277, ISBN 978-3-540-74627-0
 Grézl František, Karafiát Martin, Kontár Stanislav, Černocký Jan: Probabilistic and bottle-neck features for LVCSR of meetings, In: Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2007), Hononulu, US, IEEESP, 2007, p. 757-760, ISBN 1-4244-0728-1
 Hubeika Valiantsina, Burget Lukáš, Matějka Pavel, Černocký Jan: Channel Compensation for Speaker Recognition, Brno, CZ, 2007, p. 1-1
 Hubeika Valiantsina, Szőke Igor, Burget Lukáš, Černocký Jan: Maximum Likelihood and Maximum Mutual Information Training in Gender and Age Recognition System, In: Proc. 10th International Conference on Text Speech and Dialogue (TSD 2007), Pilsen, CZ, Springer, 2007, p. 1-6, ISBN 978-3-540-74627-0
 Matějka Pavel, Burget Lukáš, Schwarz Petr, Glembek Ondřej, Karafiát Martin, Grézl František, Černocký Jan, van Leeuwen David, Brümmer Niko, Strasheim Albert: STBU system for the NIST 2006 speaker recognition evaluation, In: Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2007), Honolulu, US, IEEESP, 2007, p. 221-224, ISBN 1-4244-0728-1
 Mikolov Tomáš, Oparin Ilya, Glembek Ondřej, Burget Lukáš, Karafiát Martin, Černocký Jan: Použití mluvených korpusů ve vývoji systému pro rozpoznávání českých přednášek, Praha, CZ, UK, 2007, p. 1-5
 Szőke Igor, Burget Lukáš, Karafiát Martin: Combination of Word and Phoneme Approach for Spoken Term Detection, Brno, CZ, 2007, p. 1-1
2006Burget Lukáš, Matějka Pavel, Černocký Jan: Discriminative Training Techniques for Acoustic Language Identification, In: Proceedings of ICASSP 2006, Toulouse, FR, 2006, p. 209-212
 Matějka Pavel, Burget Lukáš, Schwarz Petr, Černocký Jan: Brno University of Technology System for NIST 2005 Language Recognition Evaluation, In: Proceedings of Odyssey 2006: The Speaker and Language Recognition Workshop, San Juan, PR, 2006, p. 57-64, ISBN 1-4244-0472-X
 Matějka Pavel, Burget Lukáš, Schwarz Petr, Černocký Jan: NIST 2005 Language Recognition Evaluation, In: Proceedings of NIST LRE 2005, Washington DC, US, NIST, 2006, p. 1-37
 Matějka Pavel, Schwarz Petr, Burget Lukáš, Černocký Jan: Use of anti-models to furher improve state-of-the-art PRLM language recognition system, In: Proceedings of ICASSP 2006, Toulouse, FR, 2006, p. 197-200
 Schwarz Petr, Matějka Pavel, Černocký Jan: Hierarchical structures of neural networks for phoneme recognition, In: Proceedings of ICASSP 2006, Toulouse, FR, 2006, p. 325-328

Your IPv4 address: 184.72.197.101
Switch to IPv6 connection

DNSSEC [dnssec]