GARNER Phillip N., DINES John, HAIN Thomas, EL HANNANI Asmaa, KARAFIÁT Martin, KORCHAGIN Danil, LINCOLN Mike, WAN Vincent and ZHANG Le. Real-Time ASR from Meetings. In: Proc. Interspeech 2009. Brighton: International Speech Communication Association, 2009, pp. 2119-2122. ISSN 1990-9772.
Publication language:English
Original title:Real-Time ASR from Meetings
Title (cs):Real-Time rozpoznávání řeči pro meetingy
Proceedings:Proc. Interspeech 2009
Conference:Interspeech 2009
Place:Brighton, GB
Journal:Proceedings of Interspeech, No. 9, FR
Publisher:International Speech Communication Association
Keywords:real-time speech recognition, meeting ASR, beam-forming, speech meta-data
The AMI(DA) system is a meeting room speech recognition system that has been developed and evaluated in the context of the NIST Rich Transcription (RT) evaluations. Recently, the "Distant Access" requirements of the AMIDA project have required the system to operate in real time. Another, more difficult, requirement is that the system fit into a live meeting transcription scenario. We describe an infrastructure that has allowed the AMI(DA) system to evolve into one that fulfils these extra requirements. We emphasise the components that address the live and real-time aspects.
