KARAFIÁT Martin, GRÉZL František, VESELÝ Karel, HANNEMANN Mirko, SZŐKE Igor and ČERNOCKÝ Jan. BUT 2014 Babel System: Analysis of adaptation in NN based systems. In: Proceedings of Interspeech 2014. Singapore: International Speech Communication Association, 2014, pp. 3002-3006. ISBN 978-1-63439-435-2. Available from:
Publication language:English
Original title:BUT 2014 Babel System: Analysis of adaptation in NN based systems
Title (cs):BUT 2014 Babel systém: Analýza adaptace v systémech založených na neuronových sítích
Proceedings:Proceedings of Interspeech 2014
Conference:Interspeech 2014
Place:Singapore, SG
Publisher:International Speech Communication Association
speech recognition, discriminative training, bottle-neck neural networks, deep neural networks, adaptation of neural networks, fundamental frequency
Features based on a hierarchy of neural networks with compressive layers, known as Stacked Bottle-Neck (SBN) features, were recently shown to provide excellent performance in LVCSR systems. This paper summarizes several techniques investigated in our work towards the Babel 2014 evaluations: (1) using several versions of fundamental frequency (F0) estimates, (2) semi-supervised training on untranscribed data, and, above all, (3) adapting the NN structure at different levels. The techniques are tested on three 2014 Babel languages with full GMM- and DNN-based systems. Separately and in combination, they are shown to outperform the baselines and confirm the usefulness of bottle-neck features in current ASR systems.
