Doc. Dr. Ing. Jan Černocký
Advancing the automatic language recognition using streamed audio media |
| Research leader: | Černocký Jan |
| Team leaders: | Kašpárek Tomáš, Matějka Pavel, Schwarz Petr |
| Agency: | CESNET |
| Code: | 162/2005 |
| Start: | 2006 |
| End: | 2007 |
| Keywords: | speech processing, language identification, parallel computing, unsupervised acquisition of speech data, streaming |
| Annotation: |
The project aims to use streamed audio on a massive scale to achieve a qualitative improvement in the
accuracy of LID (automatic language identification) systems. The speech processing research group at the
Faculty of Information Technology, Brno University of Technology (Speech@FIT) has a state-of-the-art LID
system based on acoustic and phonotactic modeling. To improve its accuracy further, it is crucial to gather huge
amounts of language-specific data. Within this project, such data will be collected from available
streamed sources (Internet radio stations), stored on-line, parameterized and processed. We will develop software
for training LID models. The resulting models and algorithms will be evaluated in international evaluation
campaigns organized by NIST and in cooperation with Czech law enforcement agencies.
|
Publications
| 2008 | Burget Lukáš, Schwarz Petr, Matějka Pavel, Hannemann Mirko, Rastrow Ariya, White Christopher, Khudanpur Sanjeev, Heřmanský Hynek, Černocký Jan: Combination of strongly and weakly constrained recognizers for reliable detection of OOVs, In: Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Las Vegas, US, IEEESP, 2008, p. 4, ISBN 1-4244-1484-9 |
| | Plchot Oldřich, Hubeika Valiantsina, Burget Lukáš, Schwarz Petr, Matějka Pavel: Acquisition of Telephone Data from Radio Broadcasts with Applications to Language Recognition, In: Proc. 11th International Conference on Text, Speech and Dialogue, Berlin, DE, Springer, 2008, p. 477-483, ISBN 978-3-540-87390-7 |
| 2007 | Burget Lukáš, Matějka Pavel, Schwarz Petr, Glembek Ondřej, Černocký Jan: Analysis of feature extraction and channel compensation in GMM speaker recognition system, In: IEEE Transactions on Audio, Speech, and Language Processing, Vol. 15, No. 7, 2007, US, p. 1979-1986, ISSN 1558-7916 |
| | Černocký Jan, Burget Lukáš, Schwarz Petr, Matějka Pavel, Karafiát Martin, Glembek Ondřej, Kopecký Jiří, Szőke Igor, Fapšo Michal, Grézl František, Hubeika Valiantsina, Oparin Ilya: Search in speech, language identification and speaker recognition in Speech@FIT, In: Proc. 17th International Conference Radioelektronika, 2007, Brno, CZ, UREL FEKT VUT, 2007, p. 1-6, ISBN 978-80-214-3390-8 |
| | Černocký Jan, Szőke Igor, Fapšo Michal, Karafiát Martin, Burget Lukáš, Kopecký Jiří, Grézl František, Schwarz Petr, Glembek Ondřej, Oparin Ilya, Smrž Pavel, Matějka Pavel: Search in speech for public security and defense, In: Proc. IEEE Workshop on Signal Processing Applications for Public Security and Forensics, 2007 (SAFE '07), Washington D.C., US, IEEESP, 2007, p. 1-7, ISBN 1-4244-1226-9 |
| | Fapšo Michal: Search in speech records, In: Proc. 13th Conference STUDENT EEICT 2007, Brno, CZ, FEKT VUT, 2007, p. 1-3, ISBN 978-80-214-3410-3 |
| | Grézl František, Černocký Jan: TRAP-based Techniques for Recognition of Noisy Speech, In: Proc. 10th International Conference on Text Speech and Dialogue (TSD 2007), Berlin, DE, Springer, 2007, p. 270-277, ISBN 978-3-540-74627-0 |
| | Grézl František, Karafiát Martin, Kontár Stanislav, Černocký Jan: Probabilistic and bottle-neck features for LVCSR of meetings, In: Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2007), Honolulu, US, IEEESP, 2007, p. 757-760, ISBN 1-4244-0728-1 |
| | Hubeika Valiantsina, Burget Lukáš, Matějka Pavel, Černocký Jan: Channel Compensation for Speaker Recognition, Brno, CZ, 2007, p. 1-1 |
| | Hubeika Valiantsina, Szőke Igor, Burget Lukáš, Černocký Jan: Maximum Likelihood and Maximum Mutual Information Training in Gender and Age Recognition System, In: Proc. 10th International Conference on Text Speech and Dialogue (TSD 2007), Pilsen, CZ, Springer, 2007, p. 1-6, ISBN 978-3-540-74627-0 |
| | Matějka Pavel, Burget Lukáš, Schwarz Petr, Glembek Ondřej, Karafiát Martin, Grézl František, Černocký Jan, van Leeuwen David, Brümmer Niko, Strasheim Albert: STBU system for the NIST 2006 speaker recognition evaluation, In: Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2007), Honolulu, US, IEEESP, 2007, p. 221-224, ISBN 1-4244-0728-1 |
| | Mikolov Tomáš, Oparin Ilya, Glembek Ondřej, Burget Lukáš, Karafiát Martin, Černocký Jan: Použití mluvených korpusů ve vývoji systému pro rozpoznávání českých přednášek [Use of spoken corpora in the development of a system for recognition of Czech lectures], Praha, CZ, UK, 2007, p. 1-5 |
| | Szőke Igor, Burget Lukáš, Karafiát Martin: Combination of Word and Phoneme Approach for Spoken Term Detection, Brno, CZ, 2007, p. 1-1 |
| 2006 | Burget Lukáš, Matějka Pavel, Černocký Jan: Discriminative Training Techniques for Acoustic Language Identification, In: Proceedings of ICASSP 2006, Toulouse, FR, 2006, p. 209-212 |
| | Matějka Pavel, Burget Lukáš, Schwarz Petr, Černocký Jan: Brno University of Technology System for NIST 2005 Language Recognition Evaluation, In: Proceedings of Odyssey 2006: The Speaker and Language Recognition Workshop, San Juan, PR, 2006, p. 57-64, ISBN 1-4244-0472-X |
| | Matějka Pavel, Burget Lukáš, Schwarz Petr, Černocký Jan: NIST 2005 Language Recognition Evaluation, In: Proceedings of NIST LRE 2005, Washington DC, US, NIST, 2006, p. 1-37 |
| | Matějka Pavel, Schwarz Petr, Burget Lukáš, Černocký Jan: Use of anti-models to further improve state-of-the-art PRLM language recognition system, In: Proceedings of ICASSP 2006, Toulouse, FR, 2006, p. 197-200 |
| | Schwarz Petr, Matějka Pavel, Černocký Jan: Hierarchical structures of neural networks for phoneme recognition, In: Proceedings of ICASSP 2006, Toulouse, FR, 2006, p. 325-328 |