Conference paper

GOEL Nagendra K., THOMAS Samuel, AGARWAL Mohit, AKYAZI Pinar, BURGET Lukáš, FENG Kai, GHOSHAL Arnab, GLEMBEK Ondřej, KARAFIÁT Martin, POVEY Daniel, RASTROW Ariya, ROSE Richard and SCHWARZ Petr. Approaches to automatic lexicon learning with limited training examples. In: Proc. International Conference on Acoustics, Speech, and Signal Processing. Dallas: IEEE Signal Processing Society, 2010, pp. 5094-5097. ISBN 978-1-4244-4296-6. ISSN 1520-6149.
Publication language:english
Original title:Approaches to automatic LEXICON learning with limited training examples
Title (cs):Přístupy k automatickému učení slovníku s omezenými trénovacími daty
Pages:5094-5097
Proceedings:Proc. International Conference on Acoustics, Speech, and Signal Processing
Conference:International Conference on Acoustics, Speech, and Signal Processing 2010
Place:Dallas, US
Year:2010
ISBN:978-1-4244-4296-6
Journal:Proc. International Conference on Acoustics, Speech, and Signal Processing, Vol. 2010, No. 3, Piscataway, US
ISSN:1520-6149
Publisher:IEEE Signal Processing Society
URL:http://www.fit.vutbr.cz/research/groups/speech/publi/2010/goel_icassp2010_0005094.pdf [PDF]
Keywords
Lexicon Learning, LVCSR
Annotation
The paper is on approaches to automatic lexicon learning with limited training examples. We use a combination of lexicon learning techniques.
Abstract
Preparation of a lexicon for speech recognition systems can be a significant effort in languages where the written form is not exactly phonetic. On the other hand, in languages where the written form is quite phonetic, some common words are often mispronounced. In this paper, we use a combination of lexicon learning techniques to explore whether a lexicon can be learned when only a small lexicon is available for boot-strapping. We discover that for a phonetic language such as Spanish, it is possible to do that better than what is possible from generic rules or hand-crafted pronunciations. For a more complex language such as English, we find that it is still possible but with some loss of accuracy.
BibTeX:
@INPROCEEDINGS{
   author = {K. Nagendra Goel and Samuel Thomas and Mohit Agarwal and
	Pinar Akyazi and Luk{\'{a}}{\v{s}} Burget and Kai Feng and
	Arnab Ghoshal and Ond{\v{r}}ej Glembek and Martin
	Karafi{\'{a}}t and Daniel Povey and Ariya Rastrow and
	Richard Rose and Petr Schwarz},
   title = {Approaches to automatic LEXICON learning with limited
	training examples},
   pages = {5094--5097},
   booktitle = {Proc. International Conference on Acoustics, Speech, and
	Signal Processing},
   journal = {Proc. International Conference on Acoustics, Speech, and
	Signal Processing},
   volume = {2010},
   number = {3},
   year = {2010},
   location = {Dallas, US},
   publisher = {IEEE Signal Processing Society},
   ISBN = {978-1-4244-4296-6},
   ISSN = {1520-6149},
   language = {english},
   url = {http://www.fit.vutbr.cz/research/view_pub.php?id=9309}
}

Your IPv4 address: 54.156.92.243
Switch to IPv6 connection

DNSSEC [dnssec]