Conference proceedings paper

SZŐKE Igor, FAPŠO Michal, BURGET Lukáš and ČERNOCKÝ Jan. Hybrid word-subword decoding for spoken term detection. In: Proc. SSCS 2008: Speech search workshop at SIGIR. Singapore: Association for Computing Machinery, 2008, p. 4. ISBN 978-90-365-2697-5.
Publication language: English
Publication title: Hybrid word-subword decoding for spoken term detection
Title (cs): Hybridní slovní a podslovní dekódování pro detekci klíčových frází v řeči
Proceedings: Proc. SSCS 2008: Speech search workshop at SIGIR
Conference: 31st International ACM SIGIR Conference
Place of publication: Singapore, SG
Publisher: Association for Computing Machinery
URL: http://www.fit.vutbr.cz/research/groups/speech/publi/2008/szoke_sigir2008.pdf [PDF]
Keywords
spoken term detection
The paper deals with hybrid word-subword decoding for spoken term detection.
This paper deals with a hybrid word-subword recognition system for spoken term detection. The decoding is driven by a hybrid recognition network, and the decoder directly produces hybrid word-subword lattices. One phone model and two multigram models were tested to represent subword units. The systems were evaluated in terms of spoken term detection accuracy and index size. We concluded that the best subword model for hybrid word-subword recognition is the multigram model trained on the word recognizer vocabulary. We achieved an improvement in word recognition accuracy, and in spoken term detection accuracy when in-vocabulary and out-of-vocabulary terms are searched separately. Spoken term detection accuracy with the full (in-vocabulary and out-of-vocabulary) term set was slightly worse, but the required index size was significantly reduced.
