Publication Details

PCA-based Feature Extraction for Phonotactic Language Recognition

MIKOLOV Tomáš, PLCHOT Oldřich, GLEMBEK Ondřej, MATĚJKA Pavel, BURGET Lukáš and ČERNOCKÝ Jan. PCA-based Feature Extraction for Phonotactic Language Recognition. In: Proc. Odyssey 2010 - The Speaker and Language Recognition Workshop. Brno: International Speech Communication Association, 2010, pp. 251-255. ISBN 978-80-214-4114-9.
Czech title
Extrakce parametrů pro fonotaktické rozpoznávání jazyka založená na PCA
Type
conference paper
Language
English
Keywords

speech, language recognition, automatic recognition, large amounts of data.

Abstract

This paper presents a PCA-based feature extraction technique for phonotactic language recognition. The technique speeds up training, in some cases by a factor of more than 1000.

Annotation

Phonotactic language recognition is one of the major techniques used for automatic recognition of spoken languages. We propose a feature extraction technique based on PCA for use with SVM-based systems. The technique speeds up training, in some cases by a factor of more than 1000, which allows systems to be trained effectively on much larger data sets. The speed-up of the test phase can be even greater, which makes the resulting systems much more useful for processing large amounts of data. We report our results on the NIST LRE 2009 task.
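
The general idea of PCA-based feature extraction can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: this record does not give the paper's normalization, dimensionality, or SVM setup, so the synthetic n-gram counts, the relative-frequency normalization, the target dimension of 20, and the helper names `fit_pca` and `transform` are all assumptions made for illustration.

```python
import numpy as np

def fit_pca(X, n_components):
    """Estimate the mean and the top principal directions of X (rows are samples)."""
    mean = X.mean(axis=0)
    # SVD of the centered data; the rows of Vt are orthonormal principal directions.
    _, _, Vt = np.linalg.svd(X - mean, full_matrices=False)
    return mean, Vt[:n_components]

def transform(X, mean, components):
    """Project samples onto the retained principal directions."""
    return (X - mean) @ components.T

rng = np.random.default_rng(0)

# Hypothetical stand-in for phone n-gram statistics:
# 200 utterances, 1000 n-gram types (real systems use far larger vectors).
counts = rng.poisson(lam=1.0, size=(200, 1000)).astype(float)

# Assumed normalization: turn counts into per-utterance relative frequencies.
freqs = counts / counts.sum(axis=1, keepdims=True)

mean, components = fit_pca(freqs, n_components=20)
reduced = transform(freqs, mean, components)

print(reduced.shape)  # (200, 20)
```

In a setup like this, the low-dimensional `reduced` vectors rather than the original high-dimensional n-gram vectors would be passed to the SVM, which is where a saving in training and test time would be expected to come from.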

Published
2010
Pages
251-255
Proceedings
Proc. Odyssey 2010 - The Speaker and Language Recognition Workshop
Conference
The Speaker and Language Recognition Workshop, Brno, Czech Republic
ISBN
978-80-214-4114-9
Publisher
International Speech Communication Association
Place
Brno, CZ
BibTeX
@INPROCEEDINGS{FITPUB9317,
   author = "Tom\'{a}\v{s} Mikolov and Old\v{r}ich Plchot and Ond\v{r}ej Glembek and Pavel Mat\v{e}jka and Luk\'{a}\v{s} Burget and Jan \v{C}ernock\'{y}",
   title = "PCA-based Feature Extraction for Phonotactic Language Recognition",
   pages = "251--255",
   booktitle = "Proc. Odyssey 2010 - The Speaker and Language Recognition Workshop",
   year = 2010,
   location = "Brno, CZ",
   publisher = "International Speech Communication Association",
   ISBN = "978-80-214-4114-9",
   language = "english",
   url = "https://www.fit.vut.cz/research/publication/9317"
}