Conference paper

PEŠÁN Jan, BURGET Lukáš, HEŘMANSKÝ Hynek and VESELÝ Karel. DNN derived filters for processing of modulation spectrum of speech. In: Proceedings of Interspeech 2015. Dresden: International Speech Communication Association, 2015, pp. 1908-1911. ISBN 978-1-5108-1790-6. ISSN 1990-9772.
Publication language:english
Original title:DNN derived filters for processing of modulation spectrum of speech
Title (cs):Filtry získané pomocí DNN pro zpracování modulačního spektra řeči
Pages:1908-1911
Proceedings:Proceedings of Interspeech 2015
Conference:INTERSPEECH 2015
Place:Dresden, DE
Year:2015
ISBN:978-1-5108-1790-6
Journal:Proceedings of Interspeech, Vol. 2015, No. 09, FR
ISSN:1990-9772
Publisher:International Speech Communication Association
URL:http://www.fit.vutbr.cz/research/groups/speech/publi/2015/pesan_interspeech2015_IS150892.pdf [PDF]
Files: 
+Type Name Title Size Last modified
iconpesan_interspeech2015_IS150892.pdf305 KB2017-03-01 18:29:42
^ Select all
With selected:
Keywords
deep neural network, convolutive layer, modulation filters, mammalian auditory processing
Annotation
In this paper DNN paradigm was successfully used for design of modulation frequency FIR filters. This technique optimized the whole process of deriving posterior probabilities of speech sound classes
(three-state phonemes).
Abstract
We propose a novel approach to design modulation frequency filters for the first stage processing of critical band spectrum of speech using deep neural network (DNN). These filters replace conventional modulation frequency filters currently used in state-of-the-art BUT speech recognition system and yield about 10% relative improvement in phoneme recognition accuracy. The resulting filters are consistent with some known temporal properties of higher levels of mammalian auditory processing and suggest more efficient scheme for pre-processing of speech for ASR.
BibTeX:
@INPROCEEDINGS{
   author = {Jan Pe{\v{s}}{\'{a}}n and Luk{\'{a}}{\v{s}} Burget and Hynek
	He{\v{r}}mansk{\'{y}} and Karel Vesel{\'{y}}},
   title = {DNN derived filters for processing of modulation spectrum of
	speech},
   pages = {1908--1911},
   booktitle = {Proceedings of Interspeech 2015},
   journal = {Proceedings of Interspeech},
   volume = {2015},
   number = {09},
   year = {2015},
   location = {Dresden, DE},
   publisher = {International Speech Communication Association},
   ISBN = {978-1-5108-1790-6},
   ISSN = {1990-9772},
   language = {english},
   url = {http://www.fit.vutbr.cz/research/view_pub.php?id=10969}
}

Your IPv4 address: 54.166.19.237
Switch to IPv6 connection

DNSSEC [dnssec]