Conference paper

KARAFIÁT Martin, GRÉZL František, HANNEMANN Mirko, VESELÝ Karel and ČERNOCKÝ Jan. BUT BABEL System for Spontaneous Cantonese. In: Proceedings of Interspeech 2013. Lyon: International Speech Communication Association, 2013, pp. 2589-2593. ISBN 978-1-62993-443-3. ISSN 2308-457X.
Publication language:english
Original title:BUT BABEL System for Spontaneous Cantonese
Title (cs):BUT BABEL systém pro spontání kantonštinu
Pages:2589-2593
Proceedings:Proceedings of Interspeech 2013
Conference:Interspeech 2013
Place:Lyon, FR
Year:2013
ISBN:978-1-62993-443-3
Journal:Proceedings of the 14th Annual Conference of the International Speech Communication Association (Interspeech 2013)., No. 8, Lyon, FR
ISSN:2308-457X
Publisher:International Speech Communication Association
URL:http://www.fit.vutbr.cz/research/groups/speech/publi/2013/karafiat_interspeech2013_IS131522.pdf [PDF]
Keywords
speech recognition, discriminative training, bottle-neck neural networks, region-dependent transforms
Annotation
This article describes the novel things we have brought to our BABEL Cantonese system include 6-layer Stacked Bottle-Neck features and using f0 at the input of this NN. We have also investigated into robustness of SBN training (silence, normalization) and shown an efficient combination with PLP and (again!) F0 features using Region-Dependent transforms. Last by not least, a combination of RDT with another popular adaptation technique (SAT) was shown beneficial.
Abstract
This paper presents our work on speech recognition of Cantonese spontaneous telephone conversations. The key-points include feature extraction by 6-layer Stacked Bottle-Neck neural network and using fundamental frequency information at its input. We have also investigated into robustness of SBN training (silence, normalization) and shown an efficient combination with PLP using Region-Dependent transforms. A combination of RDT with another popular adaptation technique (SAT) was shown beneficial. The results are reported on BABEL Cantonese data.
BibTeX:
@INPROCEEDINGS{
   author = {Martin Karafi{\'{a}}t and Franti{\v{s}}ek Gr{\'{e}}zl and
	Mirko Hannemann and Karel Vesel{\'{y}} and Jan
	{\v{C}}ernock{\'{y}}},
   title = {BUT BABEL System for Spontaneous Cantonese},
   pages = {2589--2593},
   booktitle = {Proceedings of Interspeech 2013},
   journal = {Proceedings of the 14th Annual Conference of the
	International Speech Communication Association (Interspeech
	2013).},
   number = {8},
   year = {2013},
   location = {Lyon, FR},
   publisher = {International Speech Communication Association},
   ISBN = {978-1-62993-443-3},
   ISSN = {2308-457X},
   language = {english},
   url = {http://www.fit.vutbr.cz/research/view_pub.php?id=10423}
}

Your IPv4 address: 54.159.252.103
Switch to IPv6 connection

DNSSEC [dnssec]