Department of Computer Graphics and Multimedia
|
| Principal investigator: | Smrž Pavel |
| Co-investigators: | Černocký Jan, Grézl František, Kombrink Stefan |
| Agency: | EU-7FP-IST |
| Code: | 248984 |
| Start: | 2011 |
| End: | 2012 |
| Keywords: | event analysis, speech processing
|
| Abstract: |
| The key idea underlying the GLOCAL project is to use events as the primary
means for organizing and indexing media. Events have a local and a
global dimension. The local dimension involves the assignment of tags
(conceptualizations) to media (personal experiences). The global
dimension involves the sharing of general event structures and specific
event descriptions, which enables social sharing and networking of
events, tags and media. Within networked communities, common (global)
descriptions of the world can be built and continuously enriched by a
steady flow of individual (local) descriptions. One of the project's
enabling technologies is the analysis of multimedia content. The
ongoing GLOCAL project focuses mainly on text, image and video
analysis. A recent study of user requirements showed that the project
would benefit from advanced speech recognition components. The proposed
GLOCAL project extension will bring additional functionality in the area
of context-aware semantic annotation of multimedia files based on
sophisticated audio and video processing. Search over multimedia
files will overcome the limits of current "one-best" voice transcription
systems by employing a graph-based representation of all probable
alternative speech transcripts. Special attention will be paid to the
detection of OOV (out-of-vocabulary) words. In contrast to existing
state-of-the-art speech processing systems, GLOCAL-EEU will also
provide sufficient accuracy when searching for new names of persons,
places, companies, etc. To achieve this, the project will combine a
vocabulary-based speech recognizer with a keyword spotting module
that implements phonetic search. The addition of BUT
to the consortium will help lessen the impact of the semantic gap by
exploiting local and global (GLOCAL) knowledge about events and
related content to index media and experiences locally and later
share them globally within networked communities. This will improve the
end-user experience with GLOCAL-enhanced technologies and hence make
the project more effective and more efficient. |
Publications
| 2012 | Hradiš Michal, Kolář Martin, Král Jiří, Láník Aleš, Zemčík Pavel, Smrž Pavel: Annotating images with suggestions - user study of a tagging system, In: Advanced Concepts for Intelligent Vision Systems, Brno, CZ, Springer, 2012, pp. 155-166, ISBN 978-3-642-33139-8, ISSN 0302-9743 |
| | Král Jiří, Hradiš Michal: Restricted Boltzman Machines for Image Tag Suggestion, In: Proceedings of the 19th Conference STUDENT EEICT 2012, Brno, CZ, VUT v Brně, 2012, p. 5 |
|
|