Ing. Pavel Matějka, Ph.D.
| Brümmer, N., Burget, L., Černocký, J., Glembek, O., Grézl, F., Karafiát, M., van, L., D., Matějka, P., Schwarz, P., Strasheim, A.: Fusion of heterogeneous speaker recognition systems in the STBU submission for the NIST speaker recognition evaluation 2006, In: IEEE Transactions on Audio, Speech, and Language Processing, roč. 15, č. 7, 2007, US, s. 2072-2084, ISSN 1558-7916 | | Jazyk publikace: | angličtina |
|---|
| Název publikace: | Fusion of heterogeneous speaker recognition systems in the STBU submission for the NIST speaker recognition evaluation 2006 |
|---|
| Název (cs): | Fúze heterogeeních systémů pro rozpoznávání mluvčího v STBU systému pro NIST evaluace v rozpoznávání mluvčího 2006 |
|---|
| Strany: | 2072-2084 |
|---|
| Místo vydání: | US |
|---|
| Rok: | 2007 |
|---|
| Časopis: | IEEE Transactions on Audio, Speech, and Language Processing, roč. 15, č. 7, US |
|---|
| ISSN: | 1558-7916 |
|---|
| URL: | http://www.fit.vutbr.cz/research/groups/speech/publi/2007/brummer_stbu_t-asl_2007.pdf [PDF] |
|---|
| Klíčová slova |
|---|
speaker recognition
|
| Anotace |
|---|
Článek pojednává o fúzi heterogeeních systémů pro rozpoznávání mluvčího v STBU systému pro NIST evaluace v rozpoznávání mluvčího 2006.
|
| Abstrakt |
|---|
This paper describes and discusses the `STBU' speaker recognition system, which performed well in the NIST Speaker Recognition Evaluation 2006 (SRE). STBU is a consortium of 4 partners: Spescom DataVoice (South Africa), TNO (The Netherlands), BUT (Czech Republic) and University of Stellenbosch (South Africa). The STBU system was a combination of three main kinds of sub-systems: (1) GMM, with shorttime MFCC or PLP features, (2) GMM-SVM, using GMM mean supervectors as input to an SVM, and (3) MLLR-SVM, using MLLR speaker adaptation coefficients derived from an English LVCSR system. All sub-systems made use of supervector subspace channel compensation methodsóeither eigenchannel adaptation or nuisance attribute projection. We document the design and performance of all sub-systems, as well as their fusion and calibration via logistic regression. Finally, we also present a cross-site fusion that was done with several additional systems from other NIST SRE-2006 participants.
|
| BibTeX: |
|---|
@ARTICLE{
author = {Niko Brümmer and Lukáš Burget and Jan Černocký and Ondřej
Glembek and František Grézl and Martin Karafiát and David
Leeuwen van and Pavel Matějka and Petr Schwarz and Albert
Strasheim},
title = {Fusion of heterogeneous speaker recognition systems in the
STBU submission for the NIST speaker recognition evaluation
2006},
pages = {2072--2084},
journal = {IEEE Transactions on Audio, Speech, and Language Processing},
volume = {15},
number = {7},
year = {2007},
ISSN = {1558-7916},
language = {english},
url = {http://www.fit.vutbr.cz/research/view_pub.php?id=8470}
} |
|