Kvalitativní posun v automatickém rozpoznávání jazyků s využitím streamovaných audio-médií

English title

Advancing the automatic language recognition using streamed audio media

Type

grant

Keywords

speech processing, language identification, parallel computing, unsupervised acquisition of speech data, streaming

Abstract

The projects aims at massive usage of streamed audio for a qualitative improvement of LID (automatic language identification) system accuracy. The speech processing research group at Faculty of Information Technology, Brno University of Technology (Speech@FIT) disposes of a state-of-the-art LID system based on acoustic and phonotactic modeling. For further improvement of its accuracy, it is crucial to gather huge amounts of language-specific data. In the framework of this project, such data will be collected from available streamed sources (Internet radios), on-line stored, parameterized and processed. We will develop software for training of LID models. Resulting models and algorithms will be evaluated in international evaluation campaigns organized by NIST and in cooperation with Czech law enforcement forces.

Team members

Černocký Jan, prof. Dr. Ing. (UPGM FIT VUT) , research leader
Kašpárek Tomáš, Ing. (CVT FIT VUT) , team leader
Matějka Pavel, Ing., Ph.D. (UPGM FIT VUT) , team leader
Schwarz Petr, Ing., Ph.D. (UPGM FIT VUT) , team leader

Publications

2008

PLCHOT Oldřich, HUBEIKA Valiantsina, BURGET Lukáš, SCHWARZ Petr and MATĚJKA Pavel. Acquisition of Telephone Data from Radio Broadcasts with Applications to Language Recognition. In: Proc. 11th International Conference on Text, Speech and Dialogue. Berlin: Springer Verlag, 2008, pp. 477-483. ISBN 978-3-540-87390-7. Detail
BURGET Lukáš, SCHWARZ Petr, MATĚJKA Pavel, HANNEMANN Mirko, RASTROW Ariya, WHITE Christopher, KHUDANPUR Sanjeev, HEŘMANSKÝ Hynek and ČERNOCKÝ Jan. Combination of strongly and weakly constrained recognizers for reliable detection of OOVs. In: Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP). Las Vegas: IEEE Signal Processing Society, 2008, p. 4. ISBN 1-4244-1484-9. Detail

2007

BURGET Lukáš, MATĚJKA Pavel, SCHWARZ Petr, GLEMBEK Ondřej and ČERNOCKÝ Jan. Analysis of feature extraction and channel compensation in GMM speaker recognition system. IEEE Transactions on Audio, Speech, and Language Processing, vol. 15, no. 7, 2007, pp. 1979-1986. ISSN 1558-7916. Detail
SZŐKE Igor, BURGET Lukáš and KARAFIÁT Martin. Combination of Word and Phoneme Approach for Spoken Term Detection. Brno, 2007. Detail
HUBEIKA Valiantsina, BURGET Lukáš, MATĚJKA Pavel and ČERNOCKÝ Jan. Channel Compensation for Speaker Recognition. Brno, 2007. Detail
HUBEIKA Valiantsina, SZŐKE Igor, BURGET Lukáš and ČERNOCKÝ Jan. Maximum Likelihood and Maximum Mutual Information Training in Gender and Age Recognition System. In: Proc. 10th International Conference on Text Speech and Dialogue (TSD 2007). Pilsen: Springer Verlag, 2007, pp. 1-6. ISBN 978-3-540-74627-0. Detail
MIKOLOV Tomáš, OPARIN Ilya, GLEMBEK Ondřej, BURGET Lukáš, KARAFIÁT Martin and ČERNOCKÝ Jan. Použití mluvených korpusů ve vývoji systému pro rozpoznávání českých přednášek. Praha: Charles University, 2007. Detail
GRÉZL František, KARAFIÁT Martin, KONTÁR Stanislav and ČERNOCKÝ Jan. Probabilistic and bottle-neck features for LVCSR of meetings. In: Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2007). Hononulu: IEEE Signal Processing Society, 2007, pp. 757-760. ISBN 1-4244-0728-1. Detail
ČERNOCKÝ Jan, SZŐKE Igor, FAPŠO Michal, KARAFIÁT Martin, BURGET Lukáš, KOPECKÝ Jiří, GRÉZL František, SCHWARZ Petr, GLEMBEK Ondřej, OPARIN Ilya, SMRŽ Pavel and MATĚJKA Pavel. Search in speech for public security and defense. In: Proc. IEEE Workshop on Signal Processing Applications for Public Security and Forensics, 2007 (SAFE '07). Washington D.C.: IEEE Signal Processing Society, 2007, pp. 1-7. ISBN 1-4244-1226-9. Detail
FAPŠO Michal. Search in speech records. In: Proc. 13th Conference STUDENT EEICT 2007. Brno: Faculty of Electrical Engineering and Communication BUT, 2007, pp. 1-3. ISBN 978-80-214-3410-3. Detail
ČERNOCKÝ Jan, BURGET Lukáš, SCHWARZ Petr, MATĚJKA Pavel, KARAFIÁT Martin, GLEMBEK Ondřej, KOPECKÝ Jiří, SZŐKE Igor, FAPŠO Michal, GRÉZL František, HUBEIKA Valiantsina and OPARIN Ilya. Search in speech, language identification and speaker recognition in Speech@FIT. In: Proc. 17th International Conference Radioelektronika, 2007. Brno: Department of Radioelectronics FEEC BUT, 2007, pp. 1-6. ISBN 978-80-214-3390-8. Detail
MATĚJKA Pavel, BURGET Lukáš, SCHWARZ Petr, GLEMBEK Ondřej, KARAFIÁT Martin, GRÉZL František, ČERNOCKÝ Jan, VAN Leeuwen David, BRÜMMER Niko and STRASHEIM Albert. STBU system for the NIST 2006 speaker recognition evaluation. In: Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2007). Honolulu: IEEE Signal Processing Society, 2007, pp. 221-224. ISBN 1-4244-0728-1. Detail
GRÉZL František and ČERNOCKÝ Jan. TRAP-based Techniques for Recognition of Noisy Speech. In: Proc. 10th International Conference on Text Speech and Dialogue (TSD 2007). LNCS. Berlin: Springer Verlag, 2007, pp. 270-277. ISBN 978-3-540-74627-0. Detail

2006

MATĚJKA Pavel, BURGET Lukáš, SCHWARZ Petr and ČERNOCKÝ Jan. Brno University of Technology System for NIST 2005 Language Recognition Evaluation. In: Proceedings of Odyssey 2006: The Speaker and Language Recognition Workshop. San Juan, 2006, pp. 57-64. ISBN 1-4244-0472-X. Detail
BURGET Lukáš, MATĚJKA Pavel and ČERNOCKÝ Jan. Discriminative Training Techniques for Acoustic Language Identification. In: Proceedings of ICASSP 2006. Toulouse, 2006, pp. 209-212. Detail
SCHWARZ Petr, MATĚJKA Pavel and ČERNOCKÝ Jan. Hierarchical structures of neural networks for phoneme recognition. In: Proceedings of ICASSP 2006. Toulouse, 2006, pp. 325-328. Detail
MATĚJKA Pavel, BURGET Lukáš, SCHWARZ Petr and ČERNOCKÝ Jan. NIST 2005 Language Recognition Evaluation. In: Proceedings of NIST LRE 2005. Washington DC: National Institute of Standards and Technology, 2006, pp. 1-37. Detail
MATĚJKA Pavel, SCHWARZ Petr, BURGET Lukáš and ČERNOCKÝ Jan. Use of anti-models to furher improve state-of-the-art PRLM language recognition system. In: Proceedings of ICASSP 2006. Toulouse, 2006, pp. 197-200. Detail