Conference paper

SOUFIFAR Mehdi, KOCKMANN Marcel, BURGET Lukáš, PLCHOT Oldřich, GLEMBEK Ondřej and SVENDSEN Torbjorn. iVector Approach to Phonotactic Language Recognition. In: Proceedings of Interspeech 2011. Florence: International Speech Communication Association, 2011, pp. 2913-2916. ISBN 978-1-61839-270-1. ISSN 1990-9772.
Publication language:english
Original title:iVector Approach to Phonotactic Language Recognition
Title (cs):iVektorový přístup k fonotaktickému rozpoznávání jazyka
Pages:2913-2916
Proceedings:Proceedings of Interspeech 2011
Conference:Interspeech 2011
Place:Florence, IT
Year:2011
ISBN:978-1-61839-270-1
Journal:Proceedings of Interspeech, Vol. 2011, No. 8, FR
ISSN:1990-9772
Publisher:International Speech Communication Association
URL:http://www.fit.vutbr.cz/research/groups/speech/publi/2011/soufifar_interspeech2011_703.pdf [PDF]
Keywords
language recognition, subspace modeling, multinomial distribution
Annotation
We proposed a novel method to extract the iVectors by means of subspace multinomial modelling of the n-gram counts. Using the proposed subspace model, the huge vector of the n-gram counts are represented by the low-dimensional iVector while preserving the discriminative power of the vector.
Abstract
This paper addresses a novel technique for representation and processing of n-gram counts in phonotactic language recognition (LRE): subspace multinomial modelling represents the vectors of n-gram counts by low dimensional vectors of coordinates in total variability subspace, called iVector. Two techniques for iVector scoring are tested: support vector machines (SVM), and logistic regression (LR). Using standard NIST LRE 2009 task as our evaluation set, the latter scoring approach was shown to outperform phonotactic LRE system based on direct SVM classification of n-gram count vectors. The proposed iVector paradigm also shows comparable results to previously proposed PCA-based phonotactic feature extraction.
BibTeX:
@INPROCEEDINGS{
   author = {Mehdi Soufifar and Marcel Kockmann and Luk{\'{a}}{\v{s}}
	Burget and Old{\v{r}}ich Plchot and Ond{\v{r}}ej Glembek and
	Torbjorn Svendsen},
   title = {iVector Approach to Phonotactic Language Recognition},
   pages = {2913--2916},
   booktitle = {Proceedings of Interspeech 2011},
   journal = {Proceedings of Interspeech},
   volume = {2011},
   number = {8},
   year = {2011},
   location = {Florence, IT},
   publisher = {International Speech Communication Association},
   ISBN = {978-1-61839-270-1},
   ISSN = {1990-9772},
   language = {english},
   url = {http://www.fit.vutbr.cz/research/view_pub.php?id=9758}
}

Your IPv4 address: 54.161.25.213
Switch to IPv6 connection

DNSSEC [dnssec]