Článek ve sborníku konference

HANNEMANN Mirko, KOMBRINK Stefan, KARAFIÁT Martin a BURGET Lukáš. Similarity Scoring for Recognizing Repeated Out-of-VocabularyWords. In: Proceedings of the 11th Annual Conference of the International Speech Communication Association (INTERSPEECH 2010). Makuhari, Chiba: International Speech Communication Association, 2010, s. 897-900. ISBN 978-1-61782-123-3. ISSN 1990-9772.
Jazyk publikace:angličtina
Název publikace:Similarity Scoring for Recognizing Repeated Out-of-VocabularyWords
Název (cs):Skórování podobnosti pro rozpoznávání opakovaných výskytů slov mimo slovník
Strany:897-900
Sborník:Proceedings of the 11th Annual Conference of the International Speech Communication Association (INTERSPEECH 2010)
Konference:Interspeech 2010
Místo vydání:Makuhari, Chiba, JP
Rok:2010
ISBN:978-1-61782-123-3
Časopis:Proceedings of Interspeech, roč. 2010, č. 9, FR
ISSN:1990-9772
Vydavatel:International Speech Communication Association
URL:http://www.fit.vutbr.cz/research/groups/speech/publi/2010/hanneman_interspeech2010_IS100358.pdf [PDF]
Klíčová slova
out-of-vocabulary, OOV, hybrid word/sub-word recognizer, similarity measure, alignment error model
Anotace
Článek pojednává o vývoji měření podobnosti za účelem odhalení pravidelně se opakujících slov, vyskytujících se mimo slovník, protože tato jsou nositelem důležitých informací.
Abstrakt
We develop a similarity measure to detect repeatedly occurring Out-of-Vocabulary words (OOV), since these carry important information. Sub-word sequences in the recognition output from a hybrid word/sub-word recognizer are taken as detected OOVs and are aligned to each other with the help of an alignment error model. This model is able to deal with partial OOV detections and tries to reveal more complex word relations such as compound words. We apply the model to a selection of conversational phone calls to retrieve other examples of the same OOV, and to obtain a higher-level description of it such as being a derivation of a known word.
BibTeX:
@INPROCEEDINGS{
   author = {Mirko Hannemann and Stefan Kombrink and Martin Karafiát and
	Lukáš Burget},
   title = {Similarity Scoring for Recognizing Repeated
	Out-of-VocabularyWords},
   pages = {897--900},
   booktitle = {Proceedings of the 11th Annual Conference of the
	International Speech Communication Association (INTERSPEECH
	2010)},
   journal = {Proceedings of Interspeech},
   volume = {2010},
   number = {9},
   year = {2010},
   location = {Makuhari, Chiba, JP},
   publisher = {International Speech Communication Association},
   ISBN = {978-1-61782-123-3},
   ISSN = {1990-9772},
   language = {english},
   url = {http://www.fit.vutbr.cz/research/view_pub.php.cs?id=9358}
}

Vaše IPv4 adresa: 54.234.225.23
Přepnout na IPv6 spojení

DNSSEC [dnssec]