Conference paper

SCHARENBORG Odette, BESACIER Laurent, BLACK Alan, HASEGAWA-JOHNSON Mark, METZE Florian, NEUBIG Graham, STÜKER Sebastian, GODARD Pierre, MÜLLER Markus, ONDEL Lucas, PALASKAR Shruti, ARTHUR Philip, CIANNELLA Francesco, DU Mingxing, LARSEN Elin, MERKX Danny, RIAD Rachid, WANG Liming and DUPOUX Emmanuel. Linguistic Unit Discovery from Multi-Modal Inputs in Unwritten Languages: Summary of the 'Speaking Rosetta' JSALT 2017 Workshop. In: Proceedings of ICASSP 2018. Calgary: IEEE Signal Processing Society, 2018, pp. 4980-4984. ISBN 978-1-5386-4658-8.
Publication language: English
Original title: Linguistic Unit Discovery from Multi-Modal Inputs in Unwritten Languages: Summary of the 'Speaking Rosetta' JSALT 2017 Workshop
Title (cs): Objevování lingvistických jednotek z multi-modálních vstupů v nepsaných jazycích - souhrn JSALT 2017 workshopu "řečová Rosettská deska"
Pages: 4980-4984
Proceedings: Proceedings of ICASSP 2018
Conference: IEEE International Conference on Acoustics, Speech and Signal Processing
Place: Calgary, CA
Year: 2018
ISBN: 978-1-5386-4658-8
DOI: 10.1109/ICASSP.2018.8461761
Publisher: IEEE Signal Processing Society
URL: http://www.fit.vutbr.cz/research/groups/speech/publi/2018/scharenborg_icassp2018_0004979.pdf [PDF]
Keywords
unwritten languages, multi-modal data, unsupervised unit discovery, image retrieval, machine translation.
Annotation
We summarize the accomplishments of a multi-disciplinary workshop exploring the computational and scientific issues surrounding the discovery of linguistic units (subwords and words) in a language without orthography. We study the replacement of orthographic transcriptions by images and/or translated text in a well-resourced language to help unsupervised discovery from raw speech.
BibTeX:
@INPROCEEDINGS{
   author = {Odette Scharenborg and Laurent Besacier and Alan
	Black and Mark Hasegawa-Johnson and Florian Metze
	and Graham Neubig and Sebastian St{\"{u}}ker and
	Pierre Godard and Markus M{\"{u}}ller and Lucas
	Ondel and Shruti Palaskar and Philip Arthur and
	Francesco Ciannella and Mingxing Du and Elin
	Larsen and Danny Merkx and Rachid Riad and Liming
	Wang and Emmanuel Dupoux},
   title = {Linguistic Unit Discovery from Multi-Modal Inputs
	in Unwritten Languages: Summary of the 'Speaking
	Rosetta' JSALT 2017 Workshop},
   pages = {4980--4984},
   booktitle = {Proceedings of ICASSP 2018},
   year = {2018},
   location = {Calgary, CA},
   publisher = {IEEE Signal Processing Society},
   ISBN = {978-1-5386-4658-8},
   doi = {10.1109/ICASSP.2018.8461761},
   language = {english},
   url = {http://www.fit.vutbr.cz/research/view_pub.php?id=11718}
}
