Ing. Martin Karafiát, Ph.D.

Povey, D., Karafiát, M., Ghoshal, A., Schwarz, P.: A Symmetrization of the Subspace Gaussian Mixture Model, In: Proceedings of 2011 IEEE International Conference on Acoustics, Speech, and Signal Processing, Praha, CZ, IEEESP, 2011, s. 4504-4507, ISBN 978-1-4577-0537-3
Jazyk publikace:angličtina
Název publikace:A Symmetrization of the Subspace Gaussian Mixture Model
Název (cs):Symetrizace Subspace Gaussian Mixture Modelů
Strany:4504-4507
Sborník:Proceedings of 2011 IEEE International Conference on Acoustics, Speech, and Signal Processing
Konference:International Conference on Acoustics, Speech and Signal Processing 2011
Místo vydání:Praha, CZ
Rok:2011
ISBN:978-1-4577-0537-3
Vydavatel:IEEE Signal Processing Society
URL:http://www.fit.vutbr.cz/research/groups/speech/publi/2011/povey_icassp2011_4504.pdf [PDF]
Klíčová slova
Speech Recognition, Hidden Markov Models, Subspace Gaussian Mixture Models
Anotace
Publikace pojednává o symetrizaci Subspace Gaussian Mixture Modelu. Autoři článku popisují modifikaci SGMM modelu, kterou nazývají Symetric SGMM.
Abstrakt
Last year we introduced the Subspace Gaussian Mixture Model (SGMM), and we demonstrated Word Error Rate improvements on a fairly small-scale task. Here we describe an extension to the SGMM, which we call the symmetric SGMM. It makes the model fully symmetric between the "speech-state vectors" and "speaker vectors" by making the mixture weights depend on the speaker as well as the speech state. We had previously avoided this as it introduces difficulties for efficient likelihood evaluation and parameter estimation, but we have found a way to overcome those difficulties. We find that the symmetric SGMM can give a very worthwhile improvement over the previously described model. We will also describe some larger-scale experiments with the SGMM, and report on progress toward releasing open-source software that supports SGMMs.
BibTeX:
@INPROCEEDINGS{
   author = {Daniel Povey and Martin Karafiát and Arnab Ghoshal and Petr
	Schwarz},
   title = {A Symmetrization of the Subspace Gaussian Mixture Model},
   pages = {4504--4507},
   booktitle = {Proceedings of 2011 IEEE International Conference on
	Acoustics, Speech, and Signal Processing},
   year = {2011},
   location = {Praha, CZ},
   publisher = {IEEE Signal Processing Society},
   ISBN = {978-1-4577-0537-3},
   language = {english},
   url = {http://www.fit.vutbr.cz/research/view_pub.php?id=9652}
}

Vaše IPv4 adresa: 54.235.20.17
Přepnout na IPv6 spojení

DNSSEC [dnssec]