Ing. Ladislav Mošner

MOŠNER Ladislav, MATĚJKA Pavel, NOVOTNÝ Ondřej and ČERNOCKÝ Jan. Dereverberation and Beamforming in Far-Field Speaker Recognition. In: Proceedings of ICASSP 2018. Calgary: IEEE Signal Processing Society, 2018, pp. 5254-5258. ISBN 978-1-5386-4658-8.
Publication language:english
Original title:Dereverberation and Beamforming in Far-Field Speaker Recognition
Title (cs):Odstranění dozvuku a směrování paprsku pro rozpoznávání mluvčího ze vzdálených mikrofonů
Pages:5254-5258
Proceedings:Proceedings of ICASSP 2018
Conference:IEEE International Conference on Acoustics, Speech and Signal Processing
Place:Calgary, CA
Year:2018
ISBN:978-1-5386-4658-8
Publisher:IEEE Signal Processing Society
URL:http://www.fit.vutbr.cz/research/groups/speech/publi/2018/mosner_icassp2018_0005254.pdf [PDF]
Keywords
Speaker recognition, microphone array, beamforming, dereverberation, audio retransmission
Annotation
This paper deals with far-field speaker recognition. On a corpus of NIST SRE 2010 data retransmitted in a real room with multiple microphones, we first demonstrate how room acoustics cause significant degradation of state-of-the-art ivector based speaker recognition system. We then investigate several techniques to improve the performances ranging from probabilistic linear discriminant analysis (PLDA) re-training, through dereverberation, to beamforming. We found that weighted prediction error (WPE) based dereverberation combined with generalized eigenvalue beamformer with powerspectral density (PSD) weighting masks generated by neural networks (NN) provides results approaching the clean closemicrophone setup. Further improvement was obtained by re-training PLDA or the mask-generating NNs on simulated target data. The work shows that a speaker recognition system working robustly in the far-field scenario can be developed.
BibTeX:
@INPROCEEDINGS{
   author = {Ladislav Mo{\v{s}}ner and Pavel Mat{\v{e}}jka and
	Ond{\v{r}}ej Novotn{\'{y}} and Jan
	{\v{C}}ernock{\'{y}}},
   title = {Dereverberation and Beamforming in Far-Field
	Speaker Recognition},
   pages = {5254--5258},
   booktitle = {Proceedings of ICASSP 2018},
   year = {2018},
   location = {Calgary, CA},
   publisher = {IEEE Signal Processing Society},
   ISBN = {978-1-5386-4658-8},
   language = {english},
   url = {http://www.fit.vutbr.cz/research/view_pub.php?id=11717}
}

Your IPv4 address: 54.82.10.219
Switch to IPv6 connection

DNSSEC [dnssec]