Článek ve sborníku konference

DEORAS Anoop, MIKOLOV Tomáš a CHURCH Kenneth. A Fast Re-scoring Strategy to Capture Long-Distance Dependencies. In: Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing July 2011 Edinburgh, Scotland, UK. Edinburgh: Association for Computational Linguistics, 2011, s. 1116-1127. ISBN 978-1-937284-11-4.
Jazyk publikace:angličtina
Název publikace:A Fast Re-scoring Strategy to Capture Long-Distance Dependencies
Název (cs):Strategie pro rychlé reskórování se závislostmi přes dlouhé kontexty
Strany:1116-1127
Sborník:Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing July 2011 Edinburgh, Scotland, UK
Konference:Conference on Empirical Methods in Natural Language Processing
Místo vydání:Edinburgh, GB
Rok:2011
ISBN:978-1-937284-11-4
Vydavatel:Association for Computational Linguistics
URL:http://www.fit.vutbr.cz/research/groups/speech/publi/2011/deoras_emnlp2011_D11-1103.pdf [PDF]
URL:http://www.aclweb.org/anthology-new/D/D11/D11-1103.pdf [PDF]
Klíčová slova
jazkový model, strategie pro reskórování, rekurentní neuronová síť
Anotace
Článek popisuje nový přístup k reskórování latic s komplikovanými jazykovými modely, jako je například jazykový model založený na rekurentních neuronový sítích.
Abstrakt
A re-scoring strategy is proposed that makes it feasible to capture more long-distance dependencies in the natural language. Two pass strategies have become popular in a number of recognition tasks such as ASR (automatic speech recognition), MT (machine translation) and OCR (optical character recognition). The first pass typically applies a weak language model (n-grams) to a lattice and the second pass applies a stronger language model to N-best lists. The stronger language model is intended to capture more longdistance dependencies. The proposed method uses RNN-LM (recurrent neural network language model), which is a long span LM, to rescore word lattices in the second pass. A hill climbing method (iterative decoding) is proposed to search over islands of confusability in the word lattice. An evaluation based on Broadcast News shows speedups of 20 over basic N-best re-scoring, and word error rate reduction of 8% (relative) on a highly competitive setup.
BibTeX:
@INPROCEEDINGS{
   author = {Anoop Deoras and Tom{\'{a}}{\v{s}} Mikolov and Kenneth
	Church},
   title = {A Fast Re-scoring Strategy to Capture Long-Distance
	Dependencies},
   pages = {1116--1127},
   booktitle = {Proceedings of the 2011 Conference on Empirical Methods in
	Natural Language Processing July 2011 Edinburgh, Scotland,
	UK},
   year = {2011},
   location = {Edinburgh, GB},
   publisher = {Association for Computational Linguistics},
   ISBN = {978-1-937284-11-4},
   language = {english},
   url = {http://www.fit.vutbr.cz/research/view_pub.php.cs?id=9687}
}

Vaše IPv4 adresa: 54.225.17.239
Přepnout na IPv6 spojení

DNSSEC [dnssec]