Publication Details

Manual and Semi-Automatic Approaches to Building a Multilingual Phoneme Set

EGOROVA Ekaterina, VESELÝ Karel, KARAFIÁT Martin, JANDA Miloš and ČERNOCKÝ Jan. Manual and Semi-Automatic Approaches to Building a Multilingual Phoneme Set. In: Proceedings of ICASSP 2013. Vancouver: IEEE Signal Processing Society, 2013, pp. 7324-7328. ISBN 978-1-4799-0355-9.
Czech title
Manuální a poloautomatické přístupy k tvorbě multilingvální fonémové sady
Type
conference paper
Language
english
Authors
Egorova Ekaterina, Ing., Ph.D. (FIT BUT)
Veselý Karel, Ing., Ph.D. (DCGM FIT BUT)
Karafiát Martin, Ing., Ph.D. (DCGM FIT BUT)
Janda Miloš, Ing. (FIT BUT)
Černocký Jan, prof. Dr. Ing. (DCGM FIT BUT)
URL
Keywords

multilingual speech recognition, phoneme set mapping, phoneme confusion matrix

Abstract

This articles describes a comparison between manual and semi-automatic approaches to building a multilingual phoneme set. The two approaches were compared in cases of 1) a multilingual system with abundant data for all the languages, 2) multilingual systems excluding target language 3) multilingual systems with small amount of data for target languages. The work shows that careful choice of merging methods can help improve recognition of languages with no or little training data and reasonably reduce multilingual phoneme set without losing a lot of accuracy.

Annotation

The paper addresses manual and semi-automatic approaches to building a multilingual phoneme set for automatic speech recognition. The first approach involves mapping and reduction of the phoneme set based on IPA and expert knowledge, the later one involves phoneme confusion matrix generated by a neural network. The comparison is done for 8 languages selected from GlobalPhone on three scenarios: 1) multilingual system with abundant data for all the languages, 2) multilingual systems excluding target language 3) multilingual systems with small amount of data for target languages. For 3), the multilingual system brought improvement for languages close enough to the others in the set.

Published
2013
Pages
7324-7328
Proceedings
Proceedings of ICASSP 2013
Conference
38th International Conference on Acoustics, Speech, and Signal Processing, Vancouver, CA
ISBN
978-1-4799-0355-9
Publisher
IEEE Signal Processing Society
Place
Vancouver, CA
UT WoS
000329611507098
BibTeX
@INPROCEEDINGS{FITPUB10323,
   author = "Ekaterina Egorova and Karel Vesel\'{y} and Martin Karafi\'{a}t and Milo\v{s} Janda and Jan \v{C}ernock\'{y}",
   title = "Manual and Semi-Automatic Approaches to Building a Multilingual Phoneme Set",
   pages = "7324--7328",
   booktitle = "Proceedings of ICASSP 2013",
   year = 2013,
   location = "Vancouver, CA",
   publisher = "IEEE Signal Processing Society",
   ISBN = "978-1-4799-0355-9",
   language = "english",
   url = "https://www.fit.vut.cz/research/publication/10323"
}
Back to top