Conference paper

MIKOLOV Tomáš. Language modeling of Czech using neural networks. In: Proc. 13th Conference STUDENT EEICT 2007. Brno: Faculty of Electrical Engineering and Communication BUT, 2007, pp. 1-3. ISBN 9788021434103.
Publication language: English
Original title: Language modeling of Czech using neural networks
Title (cs): Jazykové modelování češtiny s využitím neuronových sítí
Pages: 1-3
Proceedings: Proc. 13th Conference STUDENT EEICT 2007
Conference: Student EEICT 2007
Place: Brno, CZ
Year: 2007
ISBN: 9788021434103
Publisher: Faculty of Electrical Engineering and Communication BUT
URL: http://www.fit.vutbr.cz/research/groups/speech/publi/2007/mikolov_eeict_2007.pdf [PDF]
Keywords
language modeling
Annotation
The work concentrates on language modeling of Czech using neural networks.
Abstract
Language models are used in many natural language processing systems, such as speech and handwriting recognition. The most widely used techniques are based on back-off n-grams. However, it is commonly believed that this approach is insufficient. One of the best improvements over back-off language models has been achieved by neural networks that project words onto a continuous space. This work compares a standard 4-gram language model with modified Kneser-Ney smoothing and a neural network language model, both trained on spoken corpora of 1M words. Significant improvements in perplexity are reported.
BibTeX:
@INPROCEEDINGS{
   author = {Tom{\'{a}}{\v{s}} Mikolov},
   title = {Language modeling of Czech using neural networks},
   pages = {1--3},
   booktitle = {Proc. 13th Conference STUDENT EEICT 2007},
   year = {2007},
   location = {Brno, CZ},
   publisher = {Faculty of Electrical Engineering and Communication BUT},
   ISBN = {9788021434103},
   language = {english},
   url = {http://www.fit.vutbr.cz/research/view_pub.php?id=8476}
}
