Conference paper

RYANT Neville, BERGELSON Elika, CHURCH Kenneth, CRISTIA Alejandrina, DU Jun, GANAPATHY Sriram, KHUDANPUR Sanjeev, KOWALSKI Diana, KRISHNAMOORTHY Mahesh, KULSHRESHTA Rajat, LIBERMAN Mark, LU Yu-Ding, MACIEJEWSKI Matthew, METZE Florian, PROFANT Ján, SUN Lei, TSAO Yu and YU Zhou. Enhancement and Analysis of Conversational Speech: JSALT 2017. In: Proceedings of ICASSP 2018. Calgary: IEEE Signal Processing Society, 2018, pp. 5154-5158. ISBN 978-1-5386-4658-8.
Publication language:english
Original title:Enhancement and Analysis of Conversational Speech: JSALT 2017
Title (cs):Zvýrazňování a analýza konverzační řeči: JSALT 2017
Pages:5154-5158
Proceedings:Proceedings of ICASSP 2018
Conference:2018 IEEE International Conference on Acoustics, Speech and Signal Processing
Place:Calgary, CA
Year:2018
ISBN:978-1-5386-4658-8
Publisher:IEEE Signal Processing Society
URL:http://www.fit.vutbr.cz/research/groups/speech/publi/2018/profant_icassp2018_0005154.pdf [PDF]
Keywords
diarization, overlap detection, speech enhancement, automatic speech recognition
Annotation
Automatic speech recognition is more and more widely and effectively used. Nevertheless, in some automatic speech analysis tasks the state of the art is surprisingly poor. One of these is "diarization", the task of determining who spoke when. Diarization is key to processing meeting audio and clinical interviews, extended recordings such as police body cam or child language acquisition data, and any other speech data involving multiple speakers whose voices are not cleanly separated into individual channels. Overlapping speech, environmental noise and suboptimal recording techniques make the problem harder. During the JSALT Summer Workshop at CMU in 2017, an international team of researchers worked on several aspects of this problem, including calibration of the state of the art, detection of overlaps, enhancement of noisy recordings, and classification of shorter speech segments. This paper sketches the workshops results, and announces plans for a "Diarization Challenge" to encourage further progress.
BibTeX:
@INPROCEEDINGS{
   author = {Neville Ryant and Elika Bergelson and Kenneth
	Church and Alejandrina Cristia and Jun Du and
	Sriram Ganapathy and Sanjeev Khudanpur and Diana
	Kowalski and Mahesh Krishnamoorthy and Rajat
	Kulshreshta and Mark Liberman and Yu-Ding Lu and
	Matthew Maciejewski and Florian Metze and
	J{\'{a}}n Profant and Lei Sun and Yu Tsao and Zhou
	Yu},
   title = {Enhancement and Analysis of Conversational 
	Speech: JSALT 2017},
   pages = {5154--5158},
   booktitle = {Proceedings of ICASSP 2018},
   year = {2018},
   location = {Calgary, CA},
   publisher = {IEEE Signal Processing Society},
   ISBN = {978-1-5386-4658-8},
   language = {english},
   url = {http://www.fit.vutbr.cz/research/view_pub.php?id=11730}
}

Your IPv4 address: 54.221.9.6
Switch to IPv6 connection

DNSSEC [dnssec]