Conference paper

MARTÍNEZ González David, PLCHOT Oldřich, BURGET Lukáš, GLEMBEK Ondřej and MATĚJKA Pavel. Language Recognition in iVectors Space. In: Proceedings of Interspeech 2011. Florence: International Speech Communication Association, 2011, pp. 861-864. ISBN 978-1-61839-270-1. ISSN 1990-9772.
Publication language:english
Original title:Language Recognition in iVectors Space
Title (cs):Rozpoznávání jazyka v prostoru iVektorů
Pages:861-864
Proceedings:Proceedings of Interspeech 2011
Conference:Interspeech 2011
Place:Florence, IT
Year:2011
ISBN:978-1-61839-270-1
Journal:Proceedings of Interspeech, Vol. 2011, No. 8, FR
ISSN:1990-9772
Publisher:International Speech Communication Association
URL:http://www.fit.vutbr.cz/research/groups/speech/publi/2011/martinez_interspeech2011_291.pdf [PDF]
Keywords
Acoustic Language Recognition, iVectors, Joint Factor Analysis
Annotation
We have introduced a novel approach for language recognition. Three classifiers (linear generative model, SVM and logistic regression) have been tested in the iVector space, and all outperform the state-of-the-art JFA system. Very simple and fast classifier based on linear generative model provides excellent performance over all conditions. The advantage of this classifier is also its scalability: addition of a new language only requires estimating the mean over the corresponding iVectors. Most of the computational load is in the iVector generation. Hence, as a next step, we will try to obtain iVectors from the utterances and the corresponding sufficient statistics in a more direct way.
Abstract
The concept of so called iVectors, where each utterance is represented by fixed-length low-dimensional feature vector, has recently become very successfully in speaker verification. In this work, we apply the same idea in the context of Language Recognition (LR). To recognize language in the iVector space, we experiment with three different linear classifiers: one based on a generative model, where classes are modeled by Gaussian distributions with shared covariance matrix, and two discriminative classifiers, namely linear Support Vector Machine and Logistic Regression. The tests were performed on the NIST LRE 2009 dataset and the results were compared with stateof- the-art LR based on Joint Factor Analysis (JFA). While the iVector system offers better performance, it also seems to be complementary to JFA, as their fusion shows another improvement.
BibTeX:
@INPROCEEDINGS{
   author = {David Gonz{\'{a}}lez Mart{\'{i}}nez and Old{\v{r}}ich Plchot
	and Luk{\'{a}}{\v{s}} Burget and Ond{\v{r}}ej Glembek and
	Pavel Mat{\v{e}}jka},
   title = {Language Recognition in iVectors Space},
   pages = {861--864},
   booktitle = {Proceedings of Interspeech 2011},
   journal = {Proceedings of Interspeech},
   volume = {2011},
   number = {8},
   year = {2011},
   location = {Florence, IT},
   publisher = {International Speech Communication Association},
   ISBN = {978-1-61839-270-1},
   ISSN = {1990-9772},
   language = {english},
   url = {http://www.fit.vutbr.cz/research/view_pub.php?id=9754}
}

Your IPv4 address: 54.225.53.253
Switch to IPv6 connection

DNSSEC [dnssec]