Conference paper

KARAFIÁT Martin, BASKAR Murali K., SZŐKE Igor, MALENOVSKÝ Vladimír, VESELÝ Karel, GRÉZL František, BURGET Lukáš and ČERNOCKÝ Jan. BUT OpenSAT 2017 speech recognition system. In: Proceedings of Interspeech 2018. Hyderabad: International Speech Communication Association, 2018, pp. 2638-2642. ISSN 1990-9770. Available from: https://www.isca-speech.org/archive/Interspeech_2018/abstracts/2457.html
Publication language:english
Original title:BUT OpenSAT 2017 speech recognition system
Title (cs):VUT systém rozpoznávání řeči pro OpenSAT 2017
Pages:2638-2642
Proceedings:Proceedings of Interspeech 2018
Conference:Interspeech 2018
Place:Hyderabad, IN
Year:2018
URL:https://www.isca-speech.org/archive/Interspeech_2018/abstracts/2457.html
Journal:Proceedings of Interspeech - on line, Vol. 2018, No. 9, BAIXAS, FR
ISSN:1990-9770
DOI:10.21437/Interspeech.2018-2457
Publisher:International Speech Communication Association
URL:http://www.fit.vutbr.cz/research/groups/speech/publi/2018/karafiat_interspeech2018_2457.pdf [PDF]
Keywords
speech recognition, multilingual training, BLSTM, data augmentation, robustness
Annotation
(ASR) systems for two domains in OpenSAT evaluations: Low Resourced Languages and Public Safety Communications. The first was challenging due to lack of training data, therefore multilingual approaches for BLSTM training were employed and recently published Residual Memory Networks requiring less training data were used. Combination of both approaches led to superior performance. The second domain was challenging due to recording in extreme conditions: specific channel, speaker under stress, high levels of noise. A data augmentation process was very important to get reasonably good performance.
BibTeX:
@INPROCEEDINGS{
   author = {Martin Karafi{\'{a}}t and K. Murali Baskar and
	Igor Sz{\H{o}}ke and Vladim{\'{i}}r
	Malenovsk{\'{y}} and Karel Vesel{\'{y}} and
	Franti{\v{s}}ek Gr{\'{e}}zl and Luk{\'{a}}{\v{s}}
	Burget and Jan {\v{C}}ernock{\'{y}}},
   title = {BUT OpenSAT 2017 speech recognition system},
   pages = {2638--2642},
   booktitle = {Proceedings of Interspeech 2018},
   journal = {Proceedings of Interspeech - on line},
   volume = {2018},
   number = {9},
   year = {2018},
   location = {Hyderabad, IN},
   publisher = {International Speech Communication Association},
   ISSN = {1990-9770},
   doi = {10.21437/Interspeech.2018-2457},
   language = {english},
   url = {http://www.fit.vutbr.cz/research/view_pub.php?id=11838}
}

Your IPv4 address: 54.227.31.145
Switch to IPv6 connection

DNSSEC [dnssec]