Publication Details

Prosodic Speaker Verification using Subspace Multinomial Models with Intersession Compensation

KOCKMANN Marcel, BURGET Lukáš, GLEMBEK Ondřej, FERRER Luciana and ČERNOCKÝ Jan. Prosodic Speaker Verification using Subspace Multinomial Models with Intersession Compensation. In: Proceedings of the 11th Annual Conference of the International Speech Communication Association (INTERSPEECH 2010). Makuhari, Chiba, Japan: International Speech Communication Association, 2010, pp. 1061-1064. ISBN 978-1-61782-123-3. ISSN 1990-9772.

Czech title

Prosodické ověřování mluvčího pomocí multinomiálních modelů v podprostorech s kompensací variability mezi nahrávkami

Type

conference paper

Language

English

Authors

Kockmann Marcel, Dipl.-Ing. (DCGM FIT BUT)
Burget Lukáš, doc. Ing., Ph.D. (DCGM FIT BUT)
Glembek Ondřej, Ing., Ph.D. (DCGM FIT BUT)
Ferrer Luciana (SRI)
Černocký Jan, prof. Dr. Ing. (DCGM FIT BUT)

URL

http://www.fit.vutbr.cz/research/groups/speech/publi/2010/kockmann_interspeech2010_IS100048.pdf

Keywords

speaker verification, prosody, JFA, multinomial model

Abstract

This paper proposes a novel approach to modeling prosodic features. The model is based on the idea of introducing a subspace of model parameters.

Annotation

We propose a novel approach to modeling prosodic features. Inspired by the Joint Factor Analysis (JFA) model, our model is based on the same idea of introducing a subspace of model parameters. However, the underlying Gaussian mixture distribution of JFA is replaced by a multinomial distribution in order to model sequences of discrete units rather than continuous features. In this work, we use the subspace model as a feature extractor for support vector machines (SVMs), similar to the recently proposed JFA in total variability space. We show that high-dimensional count vectors can be reduced to a low dimension while keeping system performance stable. With additional intersession compensation, we achieve a 30% relative improvement over the baseline system and reach an equal error rate of 8.8% on the NIST 2006 SRE dataset.

Published

2010

Pages

1061-1064

Journal

Proceedings of Interspeech - on-line, vol. 2010, no. 9, ISSN 1990-9772

Proceedings

Proceedings of the 11th Annual Conference of the International Speech Communication Association (INTERSPEECH 2010)

Conference

Interspeech Conference, Makuhari, Chiba, JP

ISBN

978-1-61782-123-3

Publisher

International Speech Communication Association

Place

Makuhari, Chiba, Japan

BibTeX

@INPROCEEDINGS{FITPUB9360,
   author = "Marcel Kockmann and Luk\'{a}\v{s} Burget and Ond\v{r}ej Glembek and Luciana Ferrer and Jan \v{C}ernock\'{y}",
   title = "Prosodic Speaker Verification using Subspace Multinomial Models with Intersession Compensation",
   pages = "1061--1064",
   booktitle = "Proceedings of the 11th Annual Conference of the International Speech Communication Association (INTERSPEECH 2010)",
   journal = "Proceedings of Interspeech - on-line",
   volume = 2010,
   number = 9,
   year = 2010,
   location = "Makuhari, Chiba, Japan, JP",
   publisher = "International Speech Communication Association",
   ISBN = "978-1-61782-123-3",
   ISSN = "1990-9772",
   language = "english",
   url = "https://www.fit.vut.cz/research/publication/9360"
}