Department of Computer Graphics and Multimedia

AMIDA - Augmented Multi-party Interaction with Distance Access

Czech title:AMIDA - Posílená skupinová interakce s dálkovým přístupem
Reseach leader:Zemčík Pavel
Team leaders:Burget Lukáš, Černocký Jan
Agency:The Information Society Technologies (IST) 6th Framework programme
Code:IST-033812-AMIDA
Start:2006-10-01
End:2009-12-31
Keywords:speech recognition, video processing, teleconference
Annotation:
AMIDA will develop and expand the research vision that we initiated in the previous (still ongoing) EU-IST AMI Integrated Project, to understand better and build new support for human communication. The ground-breaking research that we shall undertake in AMIDA will span several traditionally separate disciplines, including:
  • Qualitative human analysis and human factors;
  • Audio-video processing, including unconstrained speech recognition and natural scene analysis;
  • Multimodal structure and content analysis, including the modelling of individuals and groups, through the joint processing of multiple (multimodal) information channels (audio, visual, slides, handwriting, and white board activity);
  • HCI, application prototyping, evaluation, and system integration.
The AMIDA research work will directly build upon the recognized achievements and large multimodal corpora (becoming a standard reference in the area of multimodal processing) resulting from AMI. However, there will also be a very challenging shift in emphasis to live meetings with remote participants, using affordable commodity sensors (such as webcams and cheaper microphones), and targeting the development of advanced videoconferencing systems featuring new functionalities such as (1) filtering, searching and browsing; (2) remote monitoring; (3) interactive accelerated playback; (4) meeting support; and (5) shared context and presence. While addressing additional scientific challenges (such as real-time processing and processing of lower quality audio and visual signals), AMIDA has also raised the exploitation transfer potential through genuine integration of the AMIDA industrial partners collaborating on common prototypes and applications. Finally, through its "Community of Interest" (CoI)1, AMIDA will also actively engage beyond the consortium to spread awareness and knowledge.

Products

2009A Compact Speech Recognition System for lectures in English, software, 2009
Authors: Karafiát Martin, Burget Lukáš, Glembek Ondřej
 Automatic SVM creator in SGE environment, software, 2009
Authors: Řezníček Ivo, Zemčík Pavel
 Automatic Video Editing Software, software, 2009
Authors: Sumec Stanislav, Zemčík Pavel, Kubíček Radek, Žák Pavel, Hradiš Michal, Navrátil Jan, Kajan Rudolf
 Camera Localization using RANSAC, software, 2009
Authors: Potúček Igor, Beran Vítězslav, Zemčík Pavel
 Lattice Spoken Term Detection toolkit (LatticeSTD), software, 2009
Authors: Szőke Igor, Fapšo Michal
 Object Detection Framework, software, 2009
Authors: Beran Vítězslav, Havel Jiří, Herout Adam, Hradiš Michal, Jošth Radovan, Juránek Roman, Polok Lukáš, Zemčík Pavel
 OmniView system, software, 2009
Authors: Potúček Igor, Sumec Stanislav, Polok Lukáš, Zemčík Pavel
 Video and Feature processing, software, 2009
Authors: Beran Vítězslav, Chmelař Petr, Řezníček Ivo, Herout Adam, Zemčík Pavel, Hradiš Michal, Juránek Roman, Bařina David
2007CVE Library, software, 2007
Authors: Pečiva Jan
2006System for Collaborative Data Sharing, software, 2006
Authors: Pečiva Jan

Preceding projects

2004Augmented Multi-party Interaction, EU-6FP-IST, 506811-AMI, 2004-2006, completed
Research leader: Heřmanský Hynek
Team leaders: Burget Lukáš, Černocký Jan, Grézl František, Kadlec Jaroslav, Karafiát Martin, Matějka Pavel, Motlíček Petr, Pečiva Jan, Potúček Igor, Schwarz Petr, Sumec Stanislav, Španěl Michal, Zemčík Pavel

Publications

2010BERAN Vítězslav, HEROUT Adam and ZEMČÍK Pavel. On-line Video Synchronization Based on Visual Vocabularies. In: Proceedings of WSCG'10. Plzeň: University of West Bohemia in Pilsen, 2010, p. 7. ISBN 978-80-86943-86-2.
 HAIN Thomas, BURGET Lukáš, DINES John, GARNER Phillip N., EL Hannani Asmaa, HUIJBREGTS Marijn, KARAFIÁT Martin, LINCOLN Mike and WAN Vincent. The AMIDA 2009 Meeting Transcription System. In: Proceedings of the 11th Annual Conference of the International Speech Communication Association (INTERSPEECH 2010). Makuhari, Chiba: International Speech Communication Association, 2010, pp. 358-361. ISBN 978-1-61782-123-3. ISSN 1990-9772.
 ROSE Richard, NOROUZIAN Atta, REDDY Aarthi, COY Andre, GUPTA Vishwa and KARAFIÁT Martin. Subword-based spoken term detection in audio course lectures. In: Proc. International Conference on Acoustics, Speech, and Signal Processing. Dallas: IEEE Signal Processing Society, 2010, pp. 5282-5285. ISBN 978-1-4244-4296-6. ISSN 1520-6149.
 SANTHOSH Kumar Chellappan Pillai, LI Haizhou, TONG Rong, MATĚJKA Pavel, BURGET Lukáš and ČERNOCKÝ Jan. Tuning phone decoders for language identification. In: Proc. International Conference on Acoustics, Speech, and Signal Processing 2010. Dallas: IEEE Signal Processing Society, 2010, pp. 5010-5013. ISBN 978-1-4244-4296-6. ISSN 1520-6149.
2009BERAN Vítězslav, JURÁNEK Roman, MLÍCH Jozef, ŽÁK Pavel, HEROUT Adam and ZEMČÍK Pavel. On-Line Object Behaviour Analysis for Surveillance Systems. In: 10th Annual ICT Conference. Nairobi, 2009, p. 5.
 BRÜMMER Niko, BURGET Lukáš, GLEMBEK Ondřej, HUBEIKA Valiantsina, JANČÍK Zdeněk, KARAFIÁT Martin, MATĚJKA Pavel, MIKOLOV Tomáš, PLCHOT Oldřich and STRASHEIM Albert. BUT-AGNITIO System Description for NIST Language Recognition Evaluation 2009. In: Proceedings NIST 2009 Language Recognition Evaluation Workshop. Baltimore, Maryland, USA: National Institute of Standards and Technology, 2009, pp. 1-7.
 BURGET Lukáš, FAPŠO Michal, HUBEIKA Valiantsina, GLEMBEK Ondřej, KARAFIÁT Martin, KOCKMANN Marcel, MATĚJKA Pavel, SCHWARZ Petr and ČERNOCKÝ Jan. BUT system for NIST 2008 speaker recognition evaluation. In: Proc. Interspeech 2009. Brighton: International Speech Communication Association, 2009, pp. 2335-2338. ISBN 978-1-61567-692-7. ISSN 1990-9772.
 BURGET Lukáš, MATĚJKA Pavel, HUBEIKA Valiantsina and ČERNOCKÝ Jan. Investigation into variants of Joint Factor Analysis for speaker recognition. In: Proc. Interspeech 2009. Brighton: International Speech Communication Association, 2009, pp. 1263-1266. ISBN 978-1-61567-692-7. ISSN 1990-9772.
 CHMELAŘ Petr, BERAN Vítězslav, HEROUT Adam, HRADIŠ Michal, ŘEZNÍČEK Ivo and ZEMČÍK Pavel. Brno University of Technology at TRECVid 2009. In: TRECVID 2009: Participant Notebook Papers and Slides. Gaithersburg, MD: National Institute of Standards and Technology, 2009, pp. 1-11.
 GARNER Phillip N., DINES John, HAIN Thomas, EL Hannani Asmaa, KARAFIÁT Martin, KORCHAGIN Danil, LINCOLN Mike, WAN Vincent and ZHANG Le. Real-Time ASR from Meetings. In: Proc. Interspeech 2009. Brighton: International Speech Communication Association, 2009, pp. 2119-2122. ISSN 1990-9772.
 GLEMBEK Ondřej, BURGET Lukáš, DEHAK Najim, BRÜMMER Niko and KENNY Patrick. Comparison of Scoring Methods used in Speaker Recognition with Joint Factor Analysis. In: Proc. ICASSP 2009. Taipei: IEEE Signal Processing Society, 2009, p. 4. ISBN 978-1-4244-2354-5.
 GRÉZL František, KARAFIÁT Martin and BURGET Lukáš. Investigation into bottle-neck features for meeting speech recognition. In: Proc. Interspeech 2009. Brighton: International Speech Communication Association, 2009, pp. 2947-2950. ISBN 978-1-61567-692-7. ISSN 1990-9772.
 KOCKMANN Marcel, BURGET Lukáš and ČERNOCKÝ Jan. Brno University of Technology System for Interspeech 2009 Emotion Challenge. In: Proc. Interspeech 2009. Brighton: International Speech Communication Association, 2009, pp. 348-351. ISSN 1990-9772.
 KOMBRINK Stefan, BURGET Lukáš, MATĚJKA Pavel, KARAFIÁT Martin and HEŘMANSKÝ Hynek. Posterior-based Out of Vocabulary Word Detection in Telephone Speech. In: Proc. Interspeech 2009. Brighton: International Speech Communication Association, 2009, pp. 80-83. ISSN 1990-9772.
 MLÍCH Jozef, ZEMČÍK Pavel and JIŘÍK Leoš. Trajectory classification using HMMs. In: WSCG 2009 Communication Papers. Plzeň: University of West Bohemia in Pilsen, 2009, pp. 67-72. ISBN 978-80-86943-94-7.
 MLÍCH Jozef. Wiimote Gesture Recognition. In: Proceedings of the 15th Conference and Competition STUDENT EEICT 2009 Volume 4. Brno: Faculty of Electrical Engineering and Communication BUT, 2009, pp. 344-349. ISBN 978-80-214-3870-5.
 NIJHOLT Anton, ZWIERS Job and PEČIVA Jan. Mixed reality participants in smart meeting rooms and smart home environments. Personal and Ubiquitous Computing. London: Springer London, 2009, vol. 2009, no. 1, pp. 85-94. ISSN 1617-4909.
2008BURGET Lukáš, FAPŠO Michal, HUBEIKA Valiantsina, GLEMBEK Ondřej, KARAFIÁT Martin, KOCKMANN Marcel, MATĚJKA Pavel, SCHWARZ Petr and ČERNOCKÝ Jan. BUT system description: NIST SRE 2008. In: Proc. 2008 NIST Speaker Recognition Evaluation Workshop. Montreal: National Institute of Standards and Technology, 2008, pp. 1-4.
 BURGET Lukáš, SCHWARZ Petr, MATĚJKA Pavel, HANNEMANN Mirko, RASTROW Ariya, WHITE Christopher, KHUDANPUR Sanjeev, HEŘMANSKÝ Hynek and ČERNOCKÝ Jan. Combination of strongly and weakly constrained recognizers for reliable detection of OOVs. In: Proc. International Conference on Acoustics, Speech, and Signal Processing (ICASSP). Las Vegas: IEEE Signal Processing Society, 2008, p. 4. ISBN 1-4244-1484-9.
 CHMELAŘ Petr, BERAN Vítězslav, HEROUT Adam, HRADIŠ Michal, JURÁNEK Roman, LÁNÍK Aleš, MLÍCH Jozef, NAVRÁTIL Jan, ŘEZNÍČEK Ivo, ŽÁK Pavel and ZEMČÍK Pavel. Brno University of Technology at TRECVid 2008. In: Proceedings of TRECVID 2008. Gaithersburg: National Institute of Standards and Technology, 2008, pp. 1-16.
 GLEMBEK Ondřej, MATĚJKA Pavel, BURGET Lukáš and MIKOLOV Tomáš. Advances in Phonotactic Language Recognition. In: Proc. Interspeech 2008. Brisbane: International Speech Communication Association, 2008, p. 4. ISSN 1990-9772.
 HEROUT Adam, BERAN Vítězslav, HRADIŠ Michal, POTÚČEK Igor, ZEMČÍK Pavel and CHMELAŘ Petr. TRECVID 2007 by the Brno Group. In: Proceedings of TRECVID 2007. Gaithersburg: National Institute of Standards and Technology, 2008, pp. 1-6. ISBN 978-1-59593-780-3.
 HEROUT Adam, KUBÍČEK Radek, ZEMČÍK Pavel and ŽÁK Pavel. Automatic Video Editing for Multimodal Meetings. In: Proceedings of International Conference on Computer Vision and Graphics 2008. Heidelberg: Springer Verlag, 2008, pp. 1-12. ISSN 0302-9743.
 HUBEIKA Valiantsina, BURGET Lukáš, MATĚJKA Pavel and SCHWARZ Petr. Discriminative Training and Channel Compensation for Acoustic Language Recognition. In: Proc. Interspeech 2008. Brisbane: International Speech Communication Association, 2008, p. 4. ISSN 1990-9772.
 KARAFIÁT Martin, BURGET Lukáš, HAIN Thomas and ČERNOCKÝ Jan. Discrimininative training of narrow band - wide band adaptated systems for meeting recognition. In: Proc. Interspeech 2008. Brisbane: International Speech Communication Association, 2008, p. 4. ISSN 1990-9772.
 KOCKMANN Marcel and BURGET Lukáš. Contour modeling of prosodic and acoustic features for speaker recognition. In: Proc. 2008 IEEE Workshop on Spoken Language Technology. Goa: IEEE Signal Processing Society, 2008, p. 4. ISBN 978-1-4244-3472-5.
 KOCKMANN Marcel and BURGET Lukáš. Syllable based Feature-Contours for Speaker Recognition. In: Proc. 14th International Workshop on Advances in Speech Technology. Maribor, 2008, p. 4.
 MATĚJKA Pavel, BURGET Lukáš, GLEMBEK Ondřej, SCHWARZ Petr, HUBEIKA Valiantsina, FAPŠO Michal, MIKOLOV Tomáš, PLCHOT Oldřich and ČERNOCKÝ Jan. BUT language recognition system for NIST 2007 evaluations. In: Proc. Interspeech 2008. Brisbane, Australia: International Speech Communication Association, 2008, p. 4. ISSN 1990-9772.
 PLCHOT Oldřich, HUBEIKA Valiantsina, BURGET Lukáš, SCHWARZ Petr and MATĚJKA Pavel. Acquisition of Telephone Data from Radio Broadcasts with Applications to Language Recognition. In: Proc. 11th International Conference on Text, Speech and Dialogue. Berlin: Springer Verlag, 2008, pp. 477-483. ISBN 978-3-540-87390-7.
 SZŐKE Igor, BURGET Lukáš, ČERNOCKÝ Jan and FAPŠO Michal. Sub-word modeling of out of vocabulary words in spoken term detection. In: Proc. 2008 IEEE Workshop on Spoken Language Technology. Goa: IEEE Signal Processing Society, 2008, p. 4. ISBN 978-1-4244-3472-5.
 SZŐKE Igor, FAPŠO Michal, BURGET Lukáš and ČERNOCKÝ Jan. Hybrid word-subword decoding for spoken term detection. In: Proc. SSCS 2008: Speech search workshop at SIGIR. Singapore: Association for Computing Machinery, 2008, p. 4. ISBN 978-90-365-2697-5.
2007BRÜMMER Niko, BURGET Lukáš, ČERNOCKÝ Jan, GLEMBEK Ondřej, GRÉZL František, KARAFIÁT Martin, VAN Leeuwen David, MATĚJKA Pavel, SCHWARZ Petr and STRASHEIM Albert. Fusion of heterogeneous speaker recognition systems in the STBU submission for the NIST speaker recognition evaluation 2006. IEEE Transactions on Audio, Speech, and Language Processing. 2007, vol. 15, no. 7, pp. 2072-2084. ISSN 1558-7916.
 BURGET Lukáš, MATĚJKA Pavel, SCHWARZ Petr, GLEMBEK Ondřej and ČERNOCKÝ Jan. Analysis of feature extraction and channel compensation in GMM speaker recognition system. IEEE Transactions on Audio, Speech, and Language Processing. 2007, vol. 15, no. 7, pp. 1979-1986. ISSN 1558-7916.
 FAPŠO Michal. Search in speech records. In: Proc. 13th Conference STUDENT EEICT 2007. Brno: Faculty of Electrical Engineering and Communication BUT, 2007, pp. 1-3. ISBN 978-80-214-3410-3.
 GRANÁT Jiří, HEROUT Adam, HRADIŠ Michal and ZEMČÍK Pavel. Hardware Acceleration of AdaBoost Classifier. In: Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI). Brno, 2007, pp. 1-12.
 GRÉZL František, KARAFIÁT Martin and ČERNOCKÝ Jan. Neural network topologies and bottle neck features in speech recognition. Brno, 2007.
 GRÉZL František, KARAFIÁT Martin, KONTÁR Stanislav and ČERNOCKÝ Jan. Probabilistic and bottle-neck features for LVCSR of meetings. In: Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2007). Hononulu: IEEE Signal Processing Society, 2007, pp. 757-760. ISBN 1-4244-0728-1.
 HAIN Thomas, WAN Vincent, BURGET Lukáš, KARAFIÁT Martin, DINES John, VEPA Jithendra, GARAU Giulia and LINCOLN Mike. The AMI System for the Transcription of Speech in Meetings. In: Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2007). Hononulu: IEEE Signal Processing Society, 2007, pp. 357-360. ISBN 1-4244-0728-1.
 HUBEIKA Valiantsina, BURGET Lukáš, MATĚJKA Pavel and ČERNOCKÝ Jan. Channel Compensation for Speaker Recognition. Brno, 2007.
 HUBEIKA Valiantsina, SZŐKE Igor, BURGET Lukáš and ČERNOCKÝ Jan. Maximum Likelihood and Maximum Mutual Information Training in Gender and Age Recognition System. In: Proc. 10th International Conference on Text Speech and Dialogue (TSD 2007). Pilsen: Springer Verlag, 2007, pp. 1-6. ISBN 978-3-540-74627-0.
 KARAFIÁT Martin, BURGET Lukáš, ČERNOCKÝ Jan and HAIN Thomas. Real-Time ASR from Meetings. In: Proc. INTERSPEECH 2007. Antwerpen: International Speech Communication Association, 2007, p. 4. ISSN 1990-9772.
 MATĚJKA Pavel, BURGET Lukáš, GLEMBEK Ondřej, SCHWARZ Petr, HUBEIKA Valiantsina, FAPŠO Michal, MIKOLOV Tomáš and PLCHOT Oldřich. BUT system description for NIST LRE 2007. In: Proc. 2007 NIST Language Recognition Evaluation Workshop. Orlando: National Institute of Standards and Technology, 2007, pp. 1-5.
 MATĚJKA Pavel, BURGET Lukáš, SCHWARZ Petr, GLEMBEK Ondřej, KARAFIÁT Martin, GRÉZL František, ČERNOCKÝ Jan, VAN Leeuwen David, BRÜMMER Niko and STRASHEIM Albert. STBU system for the NIST 2006 speaker recognition evaluation. In: Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2007). Honolulu: IEEE Signal Processing Society, 2007, pp. 221-224. ISBN 1-4244-0728-1.
 POTÚČEK Igor, BERAN Vítězslav, SUMEC Stanislav and ZEMČÍK Pavel. Evaluation and comparison of tracking methods using meeting omnidirectional images. In: Workshop on Multimodal Interaction and Related Machine Learning Algorithms (MLMI). Brno, 2007, p. 12.
 SZŐKE Igor, BURGET Lukáš and KARAFIÁT Martin. Combination of Word and Phoneme Approach for Spoken Term Detection. Brno, 2007.
 SZŐKE Igor, FAPŠO Michal, KARAFIÁT Martin, BURGET Lukáš, GRÉZL František, SCHWARZ Petr, GLEMBEK Ondřej, MATĚJKA Pavel, KOPECKÝ Jiří and ČERNOCKÝ Jan. Spoken Term Detection System Based on a Combination of LVCSR and Phonetic Search. Brno, 2007.
 ČERNOCKÝ Jan, BURGET Lukáš, SCHWARZ Petr, MATĚJKA Pavel, KARAFIÁT Martin, GLEMBEK Ondřej, KOPECKÝ Jiří, SZŐKE Igor, FAPŠO Michal, GRÉZL František, HUBEIKA Valiantsina and OPARIN Ilya. Search in speech, language identification and speaker recognition in Speech@FIT. In: Proc. 17th International Conference Radioelektronika, 2007. Brno: Department of Radioelectronics FEEC BUT, 2007, pp. 1-6. ISBN 978-80-214-3390-8.
 ČERNOCKÝ Jan, SZŐKE Igor, FAPŠO Michal, KARAFIÁT Martin, BURGET Lukáš, KOPECKÝ Jiří, GRÉZL František, SCHWARZ Petr, GLEMBEK Ondřej, OPARIN Ilya, SMRŽ Pavel and MATĚJKA Pavel. Search in speech for public security and defense. In: Proc. IEEE Workshop on Signal Processing Applications for Public Security and Forensics, 2007 (SAFE '07). Washington D.C.: IEEE Signal Processing Society, 2007, pp. 1-7. ISBN 1-4244-1226-9.

Your IPv4 address: 54.198.35.26
Switch to IPv6 connection

DNSSEC [dnssec]