Conference paperKOCKMANN Marcel, BURGET Lukáš and ČERNOCKÝ Jan. Investigations into prosodic syllable contour features for speaker recognition. In: Proc. International Conference on Acoustics, Speech, and Signal Processing. Dallas: IEEE Signal Processing Society, 2010, pp. 4418-4421. ISBN 978-1-4244-4296-6. ISSN 1520-6149. | Publication language: | english |
---|
Original title: | Investigations into prosodic syllable contour features for speaker recognition |
---|
Title (cs): | Výzkum prosodických parametrů založených na slabikových konturách pro rozpoznávání mluvčího |
---|
Pages: | 4418-4421 |
---|
Proceedings: | Proc. International Conference on Acoustics, Speech, and Signal Processing |
---|
Conference: | International Conference on Acoustics, Speech, and Signal Processing 2010 |
---|
Place: | Dallas, US |
---|
Year: | 2010 |
---|
ISBN: | 978-1-4244-4296-6 |
---|
Journal: | Proc. International Conference on Acoustics, Speech, and Signal Processing, Vol. 2010, No. 3, Piscataway, US |
---|
ISSN: | 1520-6149 |
---|
Publisher: | IEEE Signal Processing Society |
---|
URL: | http://www.fit.vutbr.cz/research/groups/speech/publi/2010/kockmann_icassp2010_4418.pdf [PDF] |
---|
Keywords |
---|
Speaker recognition, prosodic features, syllable contours |
Annotation |
---|
The paper is on investigation into prosodic syllable contour features for speaker recognition. We investigate various ways of generating prosodic syllable contour features. |
Abstract |
---|
We investigate various ways of generating prosodic syllable contour features that have recently been applied to enhance systems for speaker recognition. We compare different approaches for segmentation of speech into syllable-like units, techniques for contour modeling and the extraction of pitch and energy, taking into account the computational complexity and gender dependence. We show that the performance is especially affected by the segmentation and the quality of the pitch tracking algorithm and that the features are highly gender dependent. Still, computationally simple ways of segmentation of speech can be used to achieve good results, as experiments on 2006 NIST speaker recognition evaluation task indicate. |
BibTeX: |
---|
@INPROCEEDINGS{
author = {Marcel Kockmann and Luk{\'{a}}{\v{s}} Burget and Jan
{\v{C}}ernock{\'{y}}},
title = {Investigations into prosodic syllable contour features for
speaker recognition},
pages = {4418--4421},
booktitle = {Proc. International Conference on Acoustics, Speech, and
Signal Processing},
journal = {Proc. International Conference on Acoustics, Speech, and
Signal Processing},
volume = {2010},
number = {3},
year = {2010},
location = {Dallas, US},
publisher = {IEEE Signal Processing Society},
ISBN = {978-1-4244-4296-6},
ISSN = {1520-6149},
language = {english},
url = {http://www.fit.vutbr.cz/research/view_pub.php.en.iso-8859-2?id=9310}
} |
|