Článek ve sborníku konference | |
| Szőke, I., Fapšo, M., Burget, L., Černocký, J.: Hybrid word-subword decoding for spoken term detection, In: Proc. SSCS 2008: Speech search workshop at SIGIR, Singapore, SG, ACM, 2008, s. 4, ISBN 978-90-365-2697-5 | | Jazyk publikace: | angličtina |
|---|
| Název publikace: | Hybrid word-subword decoding for spoken term detection |
|---|
| Název (cs): | Hybridní slovní a podslovní dekóodování pro detekci klíčových frází v řeči |
|---|
| Strany: | 4 |
|---|
| Sborník: | Proc. SSCS 2008: Speech search workshop at SIGIR |
|---|
| Konference: | 31st International ACM SIGIR Conference |
|---|
| Místo vydání: | Singapore, SG |
|---|
| Rok: | 2008 |
|---|
| ISBN: | 978-90-365-2697-5 |
|---|
| Vydavatel: | Association for Computing Machinery |
|---|
| URL: | http://www.fit.vutbr.cz/research/groups/speech/publi/2008/szoke_sigir2008.pdf [PDF] |
|---|
| Klíčová slova |
|---|
spoken term detection
|
| Anotace |
|---|
Článek pojednává o hybridním slovním a podslovním dekódování pro detekci klíčových frází v řeči
|
| Abstrakt |
|---|
| This paper deals with a hybrid word-subword recognition
system for spoken term detection. The decoding is driven
by a hybrid recognition network and the decoder directly
produces hybrid word-subword lattices. One phone and two
multigram models were tested to represent sub-word units.
The systems were evaluated in terms of spoken term detection
accuracy and the size of index. We concluded that
the best subword model for hybrid word-subword recognition
is the multigram model trained on the word recognizer
vocabulary. We achieved an improvement in word
recognition accuracy, and in spoken term detection accuracy
when in-vocabulary and out-of-vocabulary terms are
searched separately. Spoken term detection accuracy with
the full (in-vocabulary and out-of-vocabulary) term set was
slightly worse but the required index size was significantly
reduced. |
| BibTeX: |
|---|
@INPROCEEDINGS{
author = {Igor Szőke and Michal Fapšo and Lukáš Burget and Jan
Černocký},
title = {Hybrid word-subword decoding for spoken term detection},
pages = {4},
booktitle = {Proc. SSCS 2008: Speech search workshop at SIGIR},
year = {2008},
location = {Singapore, SG},
publisher = {Association for Computing Machinery},
ISBN = {978-90-365-2697-5},
language = {english},
url = {http://www.fit.vutbr.cz/research/view_pub.php?id=8729}
} |
|