Publication Details

Real-Time ASR from Meetings

GARNER Phillip N., DINES John, HAIN Thomas, EL Hannani Asmaa, KARAFIÁT Martin, KORCHAGIN Danil, LINCOLN Mike, WAN Vincent and ZHANG Le. Real-Time ASR from Meetings. In: Proc. Interspeech 2009. Brighton: International Speech Communication Association, 2009, pp. 2119-2122. ISSN 1990-9772.
Czech title
Real-Time rozpoznávání řeči pro meetingy
Type
conference paper
Language
english
Authors
Garner Phillip N. (IDIAP)
Dines John (IDIAP)
Hain Thomas (USF)
El Hannani Asmaa (USF)
Karafiát Martin, Ing., Ph.D. (DCGM FIT BUT)
Korchagin Danil (IDIAP)
Lincoln Mike (IDIAP)
Wan Vincent (USF)
Zhang Le (UEDIN)
URL
Keywords

real-time speech recognition, meeting ASR, beam-forming, speech meta-data

Abstract

The paper deals with Real-Time ASR from Meetings

Annotation

The AMI(DA) system is a meeting room speech recognition system that has been developed and evaluated in the context of the NIST Rich Text (RT) evaluations. Recently, the "Distant Access" requirements of the AMIDA project have necessitated that the system operate in real-time. Another more difficult requirement is that the system fit into a live meeting transcription scenario. We describe an infrastructure that has allowed the AMI(DA) system to evolve into one that fulfils these extra requirements. We emphasise the components that address the live and real-time aspects.

Published
2009
Pages
2119-2122
Journal
Proceedings of Interspeech - on-line, no. 9, ISSN 1990-9772
Proceedings
Proc. Interspeech 2009
Conference
Interspeech Conference, Brighton, GB
Publisher
International Speech Communication Association
Place
Brighton, GB
BibTeX
@INPROCEEDINGS{FITPUB9039,
   author = "N. Phillip Garner and John Dines and Thomas Hain and Asmaa Hannani El and Martin Karafi\'{a}t and Danil Korchagin and Mike Lincoln and Vincent Wan and Le Zhang",
   title = "Real-Time ASR from Meetings",
   pages = "2119--2122",
   booktitle = "Proc. Interspeech 2009",
   journal = "Proceedings of Interspeech - on-line",
   number = 9,
   year = 2009,
   location = "Brighton, GB",
   publisher = "International Speech Communication Association",
   ISSN = "1990-9772",
   language = "english",
   url = "https://www.fit.vut.cz/research/publication/9039"
}
Back to top