Department of Computer Graphics and Multimedia
Papers
| Hradiš Michal, Eivazi Shahram, Bednařík Roman: Voice activity detection in video mediated communication from gaze, In: ETRA '12 Proceedings of the Symposium on Eye Tracking Research and Applications, Santa Barbara, US, ACM, 2012, p. 329-332, ISBN 978-1-4503-1221-9 | | Publication language: | english |
|---|
| Original title: | Voice activity detection in video mediated communication from gaze |
|---|
| Title (cs): | Detekce mluvčího z pohledu při videokonferencích |
|---|
| Pages: | 329-332 |
|---|
| Proceedings: | ETRA '12 Proceedings of the Symposium on Eye Tracking Research and Applications |
|---|
| Conference: | Eye Tracking Research & Applications |
|---|
| Place: | Santa Barbara, US |
|---|
| Year: | 2012 |
|---|
| ISBN: | 978-1-4503-1221-9 |
|---|
| Publisher: | Association for Computing Machinery |
|---|
| Files: | |
|---|
|
| | Keywords |
|---|
| gaze tracking, voice activity detection, speaker recog-nition, machine learning, Support Vector Machines |
| Annotation |
|---|
| This
paper discuses prediction of active speaker in multi-party video
mediated communication from gaze data. In the explored setting, we
predict voice activity of participants in one room based on gaze
recordings of a single participant in another room. The two rooms were
connected by high definition and low delay audio and video links and the
participants engaged in different activities ranging from casual
discussion to simple casual games. We treat the task as classification
problem. We evaluate different types of features and parameter setting
in the context of Support Vector Machine classification framework. The
results show that the speaker activity can be correctly predicted with
the proposed approach in 90 % of the time for which the gaze data are
available. |
| BibTeX: |
|---|
@INPROCEEDINGS{
author = {Michal Hradiš and Shahram Eivazi and Roman Bednařík},
title = {Voice activity detection in video mediated communication
from gaze},
pages = {329--332},
booktitle = {ETRA '12 Proceedings of the Symposium on Eye Tracking
Research and Applications},
year = {2012},
location = {Santa Barbara, US},
publisher = {Association for Computing Machinery},
ISBN = {978-1-4503-1221-9},
language = {english},
url = {http://www.fit.vutbr.cz/research/view_pub.php?id=9861}
} |
|