Processing, recognition and imaging of multimeadia and 3D data

Czech title:Zpracování, rozpoznávání a zobrazování multimediálních a 3D dat
Reseach leader:Zemčík Pavel
Team leaders:Černocký Jan, Herout Adam, Chudý Peter, Smrž Pavel
Team members:Adámek Jakub (FIT VUT), Behúň Kamil, Beran Vítězslav, Brejcha Jan, Burget Lukáš, Dittrich Petr, Dytrych Jaroslav, Kapinus Michal, Klíma Ondřej, Kobrtek Jozef, Koplík Karel, Lysek Tomáš, Maršík Lukáš, Materna Zdeněk, Matýšek Michal, Milet Tomáš, Mlích Jozef, Musil Martin, Musil Petr, Najman Pavel, Nosko Svetozár, Ondel Lucas, Pavelková Alena, Pešán Jan, Polok Lukáš, Přibyl Bronislav, Rydlo Karol, Svoboda Pavel, Široký Adam, Škoda Petr, Šolony Marek, Španěl Michal, Veľas Martin, Veselý Karel, Vlk Jan
Agency:Brno University of Technology
Code:FIT-S-17-3984
Start:2017-03-01
End:2019-12-31
Keywords:ecognition, processing, multimedia, image processing, feature extraction, data mining, 3D data
Annotation:
Multimodal and 3D data are very important and useful kind of data processed by nowadays computers. Information processing of such data is though very difficult and computationally expensive and the same applies to recognition and imaging of such data. Therefore the research in this field of information technologies is one of the most difficult ones and also looking into its application potencial, it is one of the very important research direction. This project is a follow-up to an earlier one, called: "Advanced recognition and presentation of multimedia data".
Project description:
Cíle navrhovaného projektu: (i) zkoumat moderní možnosti zpracování a rozpoznávání multimediálních a 3D dat a návaznosti na aplikace a souvislosti s dalším výzkumem (ii) zkoumat a zlepšovat možnosti efektivní realizace výpočtů s výše uvedenými daty v mobilních zařízeních, na superpočítači, apod.

Products

2017HIP 1.1 - High-sensitive Innominate Processing, software, 2017
Authors: Králík Miroslav, Urbanová Petra, Klíma Ondřej, Mikešová Tereza, Wagenknechtová Martina, Jungerová Jana
 Non-Separable Schemes for Discrete Wavelet Transform for Multi-Core CPUs, software, 2017
Authors: Najman Pavel, Klepárník Petr, Bařina David
 Non-Separable Schemes for Discrete Wavelet Transform in Pixel Shaders, software, 2017
Authors: Matýšek Michal, Bařina David, Zemčík Pavel
 uFFT, software, 2017
Authors: Bařina David

Publications

2018BARTOS Anthony L., CIPR Tomáš, NELSON Douglas J., SCHWARZ Petr, BANOWETZ John and JERABEK Ladislav. Noise-robust speech triage. The Journal of the Acoustical Society of America. 2018, vol. 143, no. 4, pp. 2313-2320. ISSN 1520-8524.
 KLÍMA Ondřej, MADEJA Roman, ŠPANĚL Michal, ČUTA Martin, ZEMČÍK Pavel, STOKLÁSEK Pavel and MIZERA Aleš. Virtual 2D-3D Fracture Reduction with Bone Length Recovery Using Statistical Shape Models. In: ShapeMI MICCAI 2018: Workshop on Shape in Medical Imaging Proceedings. Granada: Springer International Publishing, 2018, pp. 1-12. ISBN 978-3-642-33463-4.
 KRÁLÍK Miroslav, KLÍMA Ondřej, POLCEROVÁ Lenka, URBANOVÁ Petra and ČUTA Martin. Morphometric Sex Estimation from the Hip Bone by Means of the HIP 1.1 Software. In: ShapeMI MICCAI 2018: Workshop on Shape in Medical Imaging Proceedings. Granada: Springer International Publishing, 2018, pp. 1-12. ISBN 978-3-642-33463-4.
 ONDEL Lucas, GODARD Pierre, BESACIER Laurent, LARSEN Elin, HASEGAWA-JOHNSON Mark, SCHARENBORG Odette, DUPOUX Emmanuel, BURGET Lukáš, YVON Francois and KHUDANPUR Sanjeev. Bayesian Models for Unit Discovery on a Very Low Resource Language. In: Proceedings of ICASSP 2018. Calgary: IEEE Signal Processing Society, 2018, pp. 5939-5943. ISBN 978-1-5386-4658-8.
 RYANT Neville, BERGELSON Elika, CHURCH Kenneth, CRISTIA Alejandrina, DU Jun, GANAPATHY Sriram, KHUDANPUR Sanjeev, KOWALSKI Diana, KRISHNAMOORTHY Mahesh, KULSHRESHTA Rajat, LIBERMAN Mark, LU Yu-Ding, MACIEJEWSKI Matthew, METZE Florian, PROFANT Ján, SUN Lei, TSAO Yu and YU Zhou. Enhancement and Analysis of Conversational Speech: JSALT 2017. In: Proceedings of ICASSP 2018. Calgary: IEEE Signal Processing Society, 2018, pp. 5154-5158. ISBN 978-1-5386-4658-8.
 SCHARENBORG Odette, BESACIER Laurent, BLACK Alan, HASEGAWA-JOHNSON Mark, METZE Florian, NEUBIG Graham, STÜKER Sebastian, GODARD Pierre, MÜLLER Markus, ONDEL Lucas, PALASKAR Shruti, ARTHUR Philip, CIANNELLA Francesco, DU Mingxing, LARSEN Elin, MERKX Danny, RIAD Rachid, WANG Liming and DUPOUX Emmanuel. Liguistic Unit Discovery from Multi-modal Inputs in Unwritten Languages: Summary of the "Speaking Rosetta" JSALT 2017 Workshop. In: Proceedings of ICASSP 2018. Calgary: IEEE Signal Processing Society, 2018, pp. 4980-4984. ISBN 978-1-5386-4658-8.
2017BAŘINA David, KULA Michal, MATÝŠEK Michal and ZEMČÍK Pavel. Accelerating Discrete Wavelet Transforms on GPUs. In: International Conference on Image Processing (ICIP). Beijing: IEEE Signal Processing Society, 2017, pp. 2707-2710. ISBN 978-1-5090-2175-8.
 BAŘINA David, KULA Michal, MATÝŠEK Michal and ZEMČÍK Pavel. Accelerating Discrete Wavelet Transforms on Parallel Architectures. Journal of WSCG. Plzeň: 2017, vol. 25, no. 2, pp. 77-85. ISBN 978-80-86943-43-5. ISSN 1213-6972.
 BAŘINA David, NAJMAN Pavel, KLEPÁRNÍK Petr, KULA Michal and ZEMČÍK Pavel. The Parallel Algorithm for the 2-D Discrete Wavelet Transform. In: Ninth International Conference on Graphic and Image Processing (ICGIP 2017). Qingdao: SPIE - the international society for optics and photonics, 2017, pp. 1-6. ISBN 978-1-5106-1741-4. ISSN 0277-786X.
 BREJCHA Jan and ČADÍK Martin. GeoPose3K: Mountain Landscape Dataset for Camera Pose Estimation in Outdoor Environments. Image and Vision Computing. Washington: Elsevier Science, 2017, vol. 2017, no. 1, pp. 1-41. ISSN 0262-8856.
 BREJCHA Jan and ČADÍK Martin. State-of-the-art in Visual Geo-localization. Pattern Analysis and Applications. 2017, vol. 2017, no. 3, pp. 1-25. ISSN 1433-7541.
 DAS Amit, HASEGAWA-JOHNSON Mark and VESELÝ Karel. Deep Auto-encoder Based Multi-task Learning Using Probabilistic Transcriptions. In: Proceedings of Interspeech 2017. Stockholm: International Speech Communication Association, 2017, pp. 2073-2077. ISSN 1990-9772.
 HIGUCHI Takuya, KINOSHITA Keisuke, DELCROIX Marc, ŽMOLÍKOVÁ Kateřina and NAKATANI Tomohiro. Deep clustering-based beamforming for separation with unknown number of sources. In: Proceedings of Interspeech 2017. Stockholm: International Speech Communication Association, 2017, pp. 1183-1187. ISSN 1990-9772.
 MATERNA Zdeněk, ŠPANĚL Michal, MAST Marcus, BERAN Vítězslav, WEISSHARDT Florian, BURMESTER Michael and SMRŽ Pavel. A Remote User Interface for Semi-Autonomous Robots for Elderly Care. Journal of Robotics and Mechatronics. 2017, vol. 29, no. 2, pp. 381-394. ISSN 0915-3942.
 PAPADOPOULOS Pavlos, TRAVADI Ruchir, VAZ Colin, MALANDRAKIS Nikolaos, HERMJAKOB Ulf, POURDAMGHANI Nima, PUST Michael, ZHANG Boliang, PAN Xiaoman, LU Di, LIN Ying, GLEMBEK Ondřej, BASKAR Murali K., KARAFIÁT Martin, BURGET Lukáš, HASEGAWA-JOHNSON Mark, JI Heng, MAY Jonathan, KNIGHT Kevin and NARAYANAN Shrikanth. Team ELISA System for DARPA LORELEI Speech Evaluation 2016. In: Proceedings of Interspeech 2017. Stockholm: International Speech Communication Association, 2017, pp. 2053-2057. ISSN 1990-9772.
 PRIVALOV Vladimir, BERAN Vítězslav and SMRŽ Pavel. Effectiveness of the Bag-of-Words approach on the object search problem in 3D domain. In: Proceedings of SCCG 2017. New York City, NY: Association for Computing Machinery, 2017, pp. 138-145. ISBN 978-1-4503-5107-2.
 PŘIBYL Bronislav, ZEMČÍK Pavel and ČADÍK Martin. Absolute Pose Estimation from Line Correspondences using Direct Linear Transformation. Computer Vision and Image Understanding. 2017, vol. 161, no. 1, pp. 130-144. ISSN 1077-3142.
 SVOBODA Stanislav and BAŘINA David. New Transforms for JPEG Format. In: Conference Materials and Posters of Spring Conference on Computer Graphics SCCG 2017. Mikulov: Brno University of Technology, 2017, pp. 25-30. ISSN 1335-5694.
 ŠPAŇHEL Jakub, SOCHOR Jakub, JURÁNEK Roman, HEROUT Adam, MARŠÍK Lukáš and ZEMČÍK Pavel. Holistic Recognition of Low Quality License Plates by CNN using Track Annotated Data. In: International Workshop on Traffic and Street Surveillance for Safety and Security (AVSS 2017). Lecce: IEEE Computer Society, 2017, pp. 1-6. ISBN 978-1-5386-2939-0.
 VESELÝ Karel, BASKAR Murali K., DIEZ Sánchez Mireia and BENEŠ Karel. MGB-3 BUT System: Low-resource ASR on Egyptian YOUTUBE data. In: Proceedings of ASRU 2017. Okinawa: IEEE Signal Processing Society, 2017, pp. 368-373. ISBN 978-1-5090-4788-8.
 VESELÝ Karel, BURGET Lukáš and ČERNOCKÝ Jan. Semi-supervised DNN training with word selection for ASR. In: Proceedings of Interspeech 2017. Stockholm: International Speech Communication Association, 2017, pp. 3687-3691. ISSN 1990-9772.
 VLK Jan and CHUDÝ Peter. General Aviation Digital Autopilot Design Based on LQR/LQG Control Strategy. In: Proceedings of 36th Digital Avionics Systems Conference. St. Petersburg, FL: IEEE Computer Society, 2017, pp. 1-9. ISBN 978-1-5386-0365-9.
 ZEINALI Hossein, SAMETI Hossein and BURGET Lukáš. HMM-Based Phrase-Independent i-Vector Extractor for Text-Dependent Speaker Verification. IEEE/ACM TRANSACTIONS ON AUDIO, SPEECH AND LANGUAGE PROCESSING. New York City: IEEE Signal Processing Society, 2017, vol. 25, no. 7, pp. 1421-1435. ISSN 2329-9290.
 ŽMOLÍKOVÁ Kateřina, DELCROIX Marc, KINOSHITA Keisuke, HIGUCHI Takuya, OGAWA Atsunori and NAKATANI Tomohiro. Learning Speaker Representation for Neural Network Based Multichannel Speaker Extraction. In: Proceedings of ASRU 2017. Okinawa: IEEE Signal Processing Society, 2017, pp. 8-15. ISBN 978-1-5090-4788-8.

Your IPv4 address: 54.158.199.217
Switch to IPv6 connection

DNSSEC [dnssec]