Článek ve sborníku konference | |
| Hain, T., Wan, V., Burget, L., Karafiát, M., Dines, J., Vepa, J., Garau, G., Lincoln, M.: The AMI System for the Transcription of Speech in Meetings, In: Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2007), Hononulu, US, IEEESP, 2007, s. 357-360, ISBN 1-4244-0728-1 | | Jazyk publikace: | angličtina |
|---|
| Název publikace: | The AMI System for the Transcription of Speech in Meetings |
|---|
| Název (cs): | AMI systém pro přepis řeči v meetinzích |
|---|
| Strany: | 357-360 |
|---|
| Sborník: | Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2007) |
|---|
| Konference: | 32nd IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) |
|---|
| Místo vydání: | Hononulu, US |
|---|
| Rok: | 2007 |
|---|
| ISBN: | 1-4244-0728-1 |
|---|
| Vydavatel: | IEEE Signal Processing Society |
|---|
| URL: | http://www.fit.vutbr.cz/research/groups/speech/publi/2007/hain_icassp07.pdf [PDF] |
|---|
| Klíčová slova |
|---|
rozpoznávání řeči
|
| Anotace |
|---|
Článek je o AMI systému pro přepis řeči v meetinzích, který byl postaven ve spolupráci 5 výzkumných skupin. Zahrnuje generické i nově vyvinuté techniky.
|
| Abstrakt |
|---|
| This paper describes the AMI transcription system for speech in
meetings developed in collaboration by five research groups. The system
includes generic techniques such as discriminative and speaker adaptive
training, vocal tract length normalisation, heteroscedastic linear
discriminant analysis, maximum likelihood linear regression, and phone
posterior based features, as well as techniques specifically designed
for meeting data. These include segmentation and cross-talk
suppression, beam-forming, domain adaptation, Web-data collection, and
channel adaptive training. The system was improved by more than 20%
relative in word error rate compared to our previous system and was
used in the NIST RT106 evaluations where it was found to yield
competitive performance |
| BibTeX: |
|---|
@INPROCEEDINGS{
author = {Thomas Hain and Vincent Wan and Lukáš Burget and Martin
Karafiát and John Dines and Jithendra Vepa and Giulia Garau
and Mike Lincoln},
title = {The AMI System for the Transcription of Speech in Meetings},
pages = {357--360},
booktitle = {Proc. IEEE International Conference on Acoustics, Speech and
Signal Processing (ICASSP 2007)},
year = {2007},
location = {Hononulu, US},
publisher = {IEEE Signal Processing Society},
ISBN = {1-4244-0728-1},
language = {english},
url = {http://www.fit.vutbr.cz/research/view_pub.php?id=8463}
} |
|