Department of Computer Graphics and Multimedia
|
| Reseach leader: | Matějka Pavel |
| Team leaders: | Černocký Jan, Grézl František, Hannemann Mirko, Karafiát Martin, Kohlová Renata, Mikolov Tomáš, Otáhalová Sylva, Szőke Igor, Veselý Karel |
| Agency: | IARPA |
| Code: | P11168-BBN (Babelon) |
| Start: | 2012 |
| End: | 2016 |
| Keywords: | speech recognition, speaker recognition, language recognition, LVCSR, feature extraction, acoustic modeling, neural-network |
| Annotation: |
| The Babel Program will develop agile and robust speech recognition technology that can be rapidly applied to any human language in order to provide effective search capability for analysts to efficiently process massive amounts of real-world recorded speech. Today’s transcription systems are built on technology that was originally developed for English, with markedly lower performance on non-English languages. These systems have often taken years to develop and cover only a small subset of the languages of the world. Babel intends to demonstrate the ability to generate a speech transcription system for any new language within one week to support keyword search performance for effective triage of massive amounts of speech recorded in challenging real-world situations. |
Publications
| 2012 | Veselý Karel, Karafiát Martin, Grézl František, Janda Miloš, Egorova Ekaterina: The Language-Independent Bottleneck Features, In: Proceedings of IEEE 2012 Workshop on Spoken Language Technology, Miami, US, IEEESP, 2012, p. 336-341, ISBN 978-1-4673-5124-9 |
|
|