Thesis Details
Odhad emocí řečníka z mluvené řeči
This Bachelor Thesis deals with research in the field of emotion recognition mainly from speech and marginally from other modalities (video and physiological data). It closely describes the topology of the systems built specifically for the subject of this work. Moreover, it describes experiments leading to optimized pre-processing, regressor training and post-processing. Data used for these research origins from evaluation AV+EC 2015. Results of fusion systems producing the most precise prediction were sent to this evaluation. The Bottle-Neck features are newly tested and combined favorably with commonly used eGeMAPS features for the recognition of arousal. For valence, two kinds of video features are used. Muli-task system (recognizing both valence and arousal) using Bottle-Neck features produces competitive results and is only 13 % relatively behind the mentioned fusion system. This is especially appealing for applications where only audio is available.
Emotion recognition, speech, fusion, context, Bottle-Neck features.
Bidlo Michal, doc. Ing., Ph.D. (DCSY FIT BUT), člen
Drahanský Martin, prof. Ing., Dipl.-Ing., Ph.D. (DITS FIT BUT), člen
Rychlý Marek, RNDr., Ph.D. (DIFS FIT BUT), člen
Španěl Michal, Ing., Ph.D. (DCGM FIT BUT), člen
@bachelorsthesis{FITBT18675, author = "Anna Popkov\'{a}", type = "Bachelor's thesis", title = "Odhad emoc\'{i} \v{r}e\v{c}n\'{i}ka z mluven\'{e} \v{r}e\v{c}i", school = "Brno University of Technology, Faculty of Information Technology", year = 2016, location = "Brno, CZ", language = "czech", url = "https://www.fit.vut.cz/study/thesis/18675/" }