Conference proceedings paper

PEŠÁN Jan, BURGET Lukáš, HEŘMANSKÝ Hynek and VESELÝ Karel. DNN derived filters for processing of modulation spectrum of speech. In: Proceedings of Interspeech 2015. Dresden: International Speech Communication Association, 2015, pp. 1908-1911. ISBN 978-1-5108-1790-6. ISSN 1990-9772.
Publication language: English
Original title: DNN derived filters for processing of modulation spectrum of speech
Title (cs): Filtry získané pomocí DNN pro zpracování modulačního spektra řeči
Pages: 1908-1911
Proceedings: Proceedings of Interspeech 2015
Conference: INTERSPEECH 2015
Place of publication: Dresden, DE
Year: 2015
ISBN: 978-1-5108-1790-6
Journal: Proceedings of Interspeech, vol. 2015, no. 09, FR
ISSN: 1990-9772
Publisher: International Speech Communication Association
URL: http://www.fit.vutbr.cz/research/groups/speech/publi/2015/pesan_interspeech2015_IS150892.pdf [PDF]
Keywords
deep neural network, convolutive layer, modulation filters, mammalian auditory processing
Annotation
The paper discusses filters derived using a deep neural network (DNN) for processing the modulation spectrum of speech.
Abstract
We propose a novel approach to designing modulation frequency filters for the first-stage processing of the critical band spectrum of speech using a deep neural network (DNN). These filters replace the conventional modulation frequency filters currently used in the state-of-the-art BUT speech recognition system and yield about 10% relative improvement in phoneme recognition accuracy. The resulting filters are consistent with some known temporal properties of higher levels of mammalian auditory processing and suggest a more efficient scheme for pre-processing of speech for ASR.
BibTeX:
@INPROCEEDINGS{
  author = {Jan Pe{\v{s}}{\'{a}}n and Luk{\'{a}}{\v{s}} Burget
	and Hynek He{\v{r}}mansk{\'{y}} and Karel
	Vesel{\'{y}}},
  title = {DNN derived filters for processing of modulation
	spectrum of speech},
  pages = {1908--1911},
  booktitle = {Proceedings of Interspeech 2015},
  journal = {Proceedings of Interspeech},
  volume = 2015,
  number = 09,
  year = 2015,
  location = {Dresden, DE},
  publisher = {International Speech Communication Association},
  ISBN = {978-1-5108-1790-6},
  ISSN = {1990-9772},
  language = {english},
  url = {http://www.fit.vutbr.cz/research/view_pub.php.cs?id=10969}
}
