Conference proceedings article | |
| Kockmann, M., Ferrer, L., Burget, L., Shriberg, E., Černocký, J.: Recent Progress in Prosodic Speaker Verification, In: Proceedings of the 2011 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2011, Praha, CZ, IEEESP, 2011, pp. 4556-4559, ISBN 978-1-4577-0537-3 | | Publication language: | English |
|---|
| Title: | Recent Progress in Prosodic Speaker Verification |
|---|---|
| Title (cs): | Aktuální pokrok v prosodickém ověřování mluvčího |
| Pages: | 4556-4559 |
| Proceedings: | Proceedings of the 2011 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2011 |
| Conference: | International Conference on Acoustics, Speech and Signal Processing 2011 |
| Place of publication: | Praha, CZ |
| Year: | 2011 |
| ISBN: | 978-1-4577-0537-3 |
| Publisher: | IEEE Signal Processing Society |
| URL: | http://www.fit.vutbr.cz/research/groups/speech/publi/2011/kockmann_icassp2011_4556.pdf [PDF] |
| Keywords |
|---|
| Prosodic speaker verification, SNERFs, MSM, iVector, PLDA |
| Annotation |
|---|
The publication discusses recent progress in prosodic speaker verification. The authors proposed techniques for modeling complex prosodic features.
|
| Abstract |
|---|
| We describe recent progress in the field of prosodic modeling for speaker verification. In a previous paper, we proposed a technique for modeling syllable-based prosodic features that uses a multinomial subspace model for feature extraction and within-class covariance normalization or linear discriminant analysis for session variability compensation. In this paper, we show that performance can be significantly improved with the use of probabilistic linear discriminant analysis (PLDA) for session variability compensation. This system does not require score normalization. We report an equal error rate below 7% on a NIST 2008 task. To our knowledge, this is the best reported result to date for a prosodic system for speaker recognition. Fusion of this system with a state-of-the-art acoustic baseline system yields 10% relative improvement in the new detection cost function (DCF) as defined by NIST. |
| BibTeX: |
|---|
@INPROCEEDINGS{kockmann_icassp2011_4556,
   author = {Marcel Kockmann and Luciana Ferrer and Lukáš Burget and
             Elisabeth Shriberg and Jan Černocký},
   title = {Recent Progress in Prosodic Speaker Verification},
   pages = {4556--4559},
   booktitle = {Proceedings of the 2011 IEEE International Conference on
                Acoustics, Speech, and Signal Processing, ICASSP 2011},
   year = {2011},
   location = {Praha, CZ},
   publisher = {IEEE Signal Processing Society},
   ISBN = {978-1-4577-0537-3},
   language = {english},
   url = {http://www.fit.vutbr.cz/research/view_pub.php?id=9656}
} |
|