Conference paper

JANDA Miloš, KARAFIÁT Martin and ČERNOCKÝ Jan. Dealing with Numbers in Grapheme-Based Speech Recognition. In: Proceedings of 15th International Conference on Text, Speech and Dialogue. Springer-Verlag Berlin Heidelberg 2012: Springer Verlag, 2012, pp. 438-445. ISBN 978-3-642-32789-6. ISSN 0302-9743. Available from: http://www.springerlink.com/content/yx9807202033v381/
Publication language:english
Original title:Dealing with Numbers in Grapheme-Based Speech Recognition
Title (cs):Rozpoznávání řeči založené na grafémech s důrazem na zpracování číslovek.
Pages:438-445
Proceedings:Proceedings of 15th International Conference on Text, Speech and Dialogue
Conference:15th International Conference on Text, Speech and Dialogue
Series:Lecture Notes in Computer Science, 2012, Volume 7499
Place:Springer-Verlag Berlin Heidelberg 2012, DE
Year:2012
URL:http://www.springerlink.com/content/yx9807202033v381/
ISBN:978-3-642-32789-6
Journal:Lecture Notes in Computer Science, Vol. 2012, No. 9, DE
ISSN:0302-9743
Publisher:Springer Verlag
URL:http://www.fit.vutbr.cz/research/groups/speech/publi/2012/janda_TSD2012_LNSC%20744_pp438_445.pdf [PDF]
Keywords
LVCSR, ASR, grapheme, phoneme, speech recognition.
Annotation
Grapheme-based speech recognition approach is suitable in situation of low resource languages, where obtaining of pronunciation dictionary is time- and cost-consuming. The paper describes the process of automatic generation of pronunciation dictionaries with emphasis on the expansion of numbers and presents results on GlobalPhone database.
Abstract
This article presents the results of grapheme-based speech recognition for eight languages. The need for this approach arises in situation of low resource languages, where obtaining a pronunciation dictionary is time- and cost-consuming or impossible. In such scenarios, usage of grapheme dictionaries is the most simplest and straight-forward. The paper describes the process of automatic generation of pronunciation dictionaries with emphasis on the expansion of numbers. Experiments on GlobalPhone database show that grapheme-based systems have results comparable to the phoneme-based ones, especially for phonetic languages.
BibTeX:
@INPROCEEDINGS{
   author = {Milo{\v{s}} Janda and Martin Karafi{\'{a}}t and Jan
	{\v{C}}ernock{\'{y}}},
   title = {Dealing with Numbers in Grapheme-Based Speech Recognition},
   pages = {438--445},
   booktitle = {Proceedings of 15th International Conference on Text, Speech
	and Dialogue},
   series = {Lecture Notes in Computer Science, 2012, Volume 7499},
   journal = {Lecture Notes in Computer Science},
   volume = {2012},
   number = {9},
   year = {2012},
   location = {Springer-Verlag Berlin Heidelberg 2012, DE},
   publisher = {Springer Verlag},
   ISBN = {978-3-642-32789-6},
   ISSN = {0302-9743},
   language = {english},
   url = {http://www.fit.vutbr.cz/research/view_pub.php.en.iso-8859-2?id=10129}
}

Your IPv4 address: 54.161.73.123
Switch to IPv6 connection

DNSSEC [dnssec]