All types of publications
| Martínez, G., D., Plchot, O., Burget, L., Glembek, O., Matějka, P.: Language Recognition in iVectors Space, In: Proceedings of Interspeech 2011, Florence, IT, ISCA, 2011, p. 861-864, ISBN 978-1-61839-270-1, ISSN 1990-9772 | | Publication language: | english |
|---|
| Original title: | Language Recognition in iVectors Space |
|---|
| Title (cs): | Rozpoznávání jazyka v prostoru iVektorů |
|---|
| Pages: | 861-864 |
|---|
| Proceedings: | Proceedings of Interspeech 2011 |
|---|
| Conference: | Interspeech 2011 |
|---|
| Place: | Florence, IT |
|---|
| Year: | 2011 |
|---|
| ISBN: | 978-1-61839-270-1 |
|---|
| Journal: | Proceedings of Interspeech, Vol. 2011, No. 8, FR |
|---|
| ISSN: | 1990-9772 |
|---|
| Publisher: | International Speech Communication Association |
|---|
| URL: | http://www.fit.vutbr.cz/research/groups/speech/publi/2011/martinez_interspeech2011_291.pdf [PDF] |
|---|
| Keywords |
|---|
| Acoustic Language Recognition, iVectors, Joint
Factor Analysis |
| Annotation |
|---|
| We have introduced a novel approach for language recognition.
Three classifiers (linear generative model, SVM and logistic regression)
have been tested in the iVector space, and all outperform
the state-of-the-art JFA system. Very simple and fast classifier
based on linear generative model provides excellent performance
over all conditions. The advantage of this classifier
is also its scalability: addition of a new language only requires
estimating the mean over the corresponding iVectors. Most of
the computational load is in the iVector generation. Hence, as a
next step, we will try to obtain iVectors from the utterances and
the corresponding sufficient statistics in a more direct way. |
| Abstract |
|---|
| The concept of so called iVectors, where each utterance is represented
by fixed-length low-dimensional feature vector, has
recently become very successfully in speaker verification. In
this work, we apply the same idea in the context of Language
Recognition (LR). To recognize language in the iVector space,
we experiment with three different linear classifiers: one based
on a generative model, where classes are modeled by Gaussian
distributions with shared covariance matrix, and two discriminative
classifiers, namely linear Support Vector Machine and
Logistic Regression. The tests were performed on the NIST
LRE 2009 dataset and the results were compared with stateof-
the-art LR based on Joint Factor Analysis (JFA). While the
iVector system offers better performance, it also seems to be
complementary to JFA, as their fusion shows another improvement. |
| BibTeX: |
|---|
@INPROCEEDINGS{
author = {David González Martínez and Oldřich Plchot and Lukáš Burget
and Ondřej Glembek and Pavel Matějka},
title = {Language Recognition in iVectors Space},
pages = {861--864},
booktitle = {Proceedings of Interspeech 2011},
journal = {Proceedings of Interspeech},
volume = {2011},
number = {8},
year = {2011},
location = {Florence, IT},
publisher = {International Speech Communication Association},
ISBN = {978-1-61839-270-1},
ISSN = {1990-9772},
language = {english},
url = {http://www.fit.vutbr.cz/research/view_pub.php?id=9754}
} |
|