Conference proceedings paper

PLCHOT Oldřich, BURGET Lukáš, ARONOWITZ Hagai and MATĚJKA Pavel. Audio Enhancing With DNN Autoencoder For Speaker Recognition. In: Proceedings of the 41st IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2016), 2016. Shanghai: IEEE Signal Processing Society, 2016, pp. 5090-5094. ISBN 978-1-4799-9988-0.
Publication language: English
Publication title: Audio Enhancing With DNN Autoencoder For Speaker Recognition
Title (cs): Obohacování audia pomocí DNN autoenkodéru pro rozpoznávání mluvčího
Pages: 5090-5094
Proceedings: Proceedings of the 41st IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2016), 2016
Conference: 41st IEEE International Conference on Acoustics, Speech and Signal Processing
Place of publication: Shanghai, CN
Year: 2016
ISBN: 978-1-4799-9988-0
Publisher: IEEE Signal Processing Society
URL: http://www.fit.vutbr.cz/research/groups/speech/publi/2016/plchot_icassp2016_0005090.pdf [PDF]
Keywords
speaker recognition, denoising, de-reverberation, neural networks, DNN
Annotation
The paper deals with audio enhancement using a deep neural network (DNN) autoencoder for speaker recognition.
Abstract
In this paper we present a design of a DNN-based autoencoder for speech enhancement and its use for speaker recognition systems for distant microphones and noisy data. We started with augmenting the Fisher database with artificially noised and reverberated data and trained the autoencoder to map noisy and reverberated speech to its clean version. We use the autoencoder as a preprocessing step in the later stage of modelling in state-of-the-art text-dependent and text-independent speaker recognition systems. We report relative improvements of up to 50% for the text-dependent system and up to 48% for the text-independent one. For the text-independent system, we present a more detailed analysis on various conditions of NIST SRE 2010 and PRISM, suggesting that the proposed preprocessing is a promising and efficient way to build a robust speaker recognition system for distant microphone and noisy data.
BibTeX:
@INPROCEEDINGS{
   author = {Old{\v{r}}ich Plchot and Luk{\'{a}}{\v{s}} Burget
	and Hagai Aronowitz and Pavel Mat{\v{e}}jka},
   title = {Audio Enhancing With DNN Autoencoder For Speaker
	Recognition},
   pages = {5090--5094},
   booktitle = {Proceedings of the 41st IEEE International Conference on
	Acoustics, Speech and Signal Processing (ICASSP 2016), 2016},
   year = {2016},
   location = {Shanghai, CN},
   publisher = {IEEE Signal Processing Society},
   ISBN = {978-1-4799-9988-0},
   language = {english},
   url = {http://www.fit.vutbr.cz/research/view_pub.php.cs?id=11139}
}
