Dolování infoRmAcí z řeči Pořízené vzdÁlenými miKrofony

English title

Information mining in speech acquired by distant microphones

Type

grant

Keywords

speech data mining, speech recognition, speaker recognition, language identification, keyword spotting, distant microphones

Abstract

Speech data mining is becoming indispensable for units fighting crime and terrorism. Current tools can be deployed successfully on data acquired from close-talk microphones. The goal of DRAPÁK is to improve the performance of speech data mining on data from distant microphones in real environments and to generate relevant information in the corresponding operational scenarios. The output is a set of software tools to be tested by the Police of the Czech Republic and other state agencies.

Team members

Černocký Jan, prof. Dr. Ing. (UPGM FIT VUT), research leader
Malenovský Vladimír, Ing., Ph.D. (UPGM FIT VUT), team leader
Matějka Pavel, Ing., Ph.D. (UPGM FIT VUT), team leader
Plchot Oldřich, Ing., Ph.D. (UPGM FIT VUT), team leader
Veselý Karel, Ing., Ph.D. (UPGM FIT VUT), team leader
Kesiraju Santosh (UPGM FIT VUT)
Ondel Yang Lucas Antoine Francois, Mgr., Ph.D. (UPGM FIT VUT)

Publications

2020

MATĚJKA Pavel, PLCHOT Oldřich, GLEMBEK Ondřej, BURGET Lukáš, ROHDIN Johan A., ZEINALI Hossein, MOŠNER Ladislav, SILNOVA Anna, NOVOTNÝ Ondřej, DIEZ Sánchez Mireia and ČERNOCKÝ Jan. 13 years of speaker recognition research at BUT, with longitudinal analysis of NIST SRE. Computer Speech and Language, vol. 2020, no. 63, pp. 1-15. ISSN 0885-2308.
ALAM Jahangir, BOULIANNE Gilles, BURGET Lukáš, DAHMANE Mohamed, DIEZ Sánchez Mireia, GLEMBEK Ondřej, LALONDE Marc, LOZANO Díez Alicia, MATĚJKA Pavel, MIZERA Petr, MOŠNER Ladislav, NOISEUX Cédric, MONTEIRO Joao, NOVOTNÝ Ondřej, PLCHOT Oldřich, ROHDIN Johan A., SILNOVA Anna, SLAVÍČEK Josef, STAFYLAKIS Themos, ST-CHARLES Pierre-Luc, WANG Shuai and ZEINALI Hossein. Analysis of ABC Submission to NIST SRE 2019 CMN and VAST Challenge. In: Proceedings of Odyssey 2020 The Speaker and Language Recognition Workshop. Tokyo: International Speech Communication Association, 2020, pp. 289-295. ISSN 2312-2846.
DIEZ Sánchez Mireia, BURGET Lukáš, LANDINI Federico Nicolás and ČERNOCKÝ Jan. Analysis of Speaker Diarization based on Bayesian HMM with Eigenvoice Priors. IEEE/ACM TRANSACTIONS ON AUDIO, SPEECH AND LANGUAGE PROCESSING, vol. 28, no. 1, 2020, pp. 355-368. ISSN 2329-9290.
BURGET Lukáš, GLEMBEK Ondřej, LOZANO Díez Alicia, MATĚJKA Pavel, NOVOTNÝ Ondřej, PLCHOT Oldřich, PULUGUNDLA Bhargav, ROHDIN Johan A., SILNOVA Anna and VESELÝ Karel. BUT System Description to SdSV Challenge 2020. In: Proceedings of Short-duration Speaker Verification Challenge 2020 Workshop. Shanghai, on-line event of Interspeech 2020 Conference, 2020, pp. 1-5.
LOZANO Díez Alicia, SILNOVA Anna, PULUGUNDLA Bhargav, ROHDIN Johan A., VESELÝ Karel, BURGET Lukáš, PLCHOT Oldřich, GLEMBEK Ondřej, NOVOTNÝ Ondřej and MATĚJKA Pavel. BUT Text-Dependent Speaker Verification System for SdSV Challenge 2020. In: Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH. Shanghai: International Speech Communication Association, 2020, pp. 761-765. ISSN 1990-9772.
WANG Shuai, ROHDIN Johan A., PLCHOT Oldřich, BURGET Lukáš, YU Kai and ČERNOCKÝ Jan. Investigation of Specaugment for Deep Speaker Embedding Learning. In: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. Barcelona: IEEE Signal Processing Society, 2020, pp. 7139-7143. ISBN 978-1-5090-6631-5.
MOŠNER Ladislav, PLCHOT Oldřich, ROHDIN Johan A. and ČERNOCKÝ Jan. Utilizing VOiCES dataset for multichannel speaker verification with beamforming. In: Proceedings of Odyssey 2020 The Speaker and Language Recognition Workshop. Tokyo: International Speech Communication Association, 2020, pp. 187-193. ISSN 2312-2846.

2019

ALAM Jahangir, BOULIANNE Gilles, GLEMBEK Ondřej, LOZANO Díez Alicia, MATĚJKA Pavel, MIZERA Petr, MONTEIRO Joao, MOŠNER Ladislav, NOVOTNÝ Ondřej, PLCHOT Oldřich, ROHDIN Johan A., SILNOVA Anna, SLAVÍČEK Josef, STAFYLAKIS Themos, WANG Shuai and ZEINALI Hossein. ABC NIST SRE 2019 CTS System Description. In: Proceedings of NIST. Sentosa, Singapore: National Institute of Standards and Technology, 2019, pp. 1-6.
ALAM Jahangir, BOULIANNE Gilles, BURGET Lukáš, GLEMBEK Ondřej, LOZANO Díez Alicia, MATĚJKA Pavel, MIZERA Petr, MOŠNER Ladislav, NOVOTNÝ Ondřej, PLCHOT Oldřich, ROHDIN Johan A., SILNOVA Anna, SLAVÍČEK Josef, STAFYLAKIS Themos, WANG Shuai, ZEINALI Hossein, DAHMANE Mohamed, ST-CHARLES Pierre-Luc, LALONDE Marc, NOISEUX Cédric and MONTEIRO Joao. ABC System Description for NIST Multimedia Speaker Recognition Evaluation 2019. In: Proceedings of NIST 2019 SRE Workshop. Sentosa, Singapore: National Institute of Standards and Technology, 2019, pp. 1-7.
MATĚJKA Pavel, PLCHOT Oldřich, ZEINALI Hossein, MOŠNER Ladislav, SILNOVA Anna, BURGET Lukáš, NOVOTNÝ Ondřej and GLEMBEK Ondřej. Analysis of BUT Submission in Far-Field Scenarios of VOiCES 2019 Challenge. In: Proceedings of Interspeech. Graz: International Speech Communication Association, 2019, pp. 2448-2452. ISSN 1990-9772.
NOVOTNÝ Ondřej, PLCHOT Oldřich, GLEMBEK Ondřej, ČERNOCKÝ Jan and BURGET Lukáš. Analysis of DNN Speech Signal Enhancement for Robust Speaker Recognition. Computer Speech and Language, vol. 2019, no. 58, pp. 403-421. ISSN 0885-2308.
DIEZ Sánchez Mireia, BURGET Lukáš, WANG Shuai, ROHDIN Johan A. and ČERNOCKÝ Jan. Bayesian HMM based x-vector clustering for Speaker Diarization. In: Proceedings of Interspeech. Graz: International Speech Communication Association, 2019, pp. 346-350. ISSN 1990-9772.
ONDEL Yang Lucas Antoine Francois, VYDANA Hari K., BURGET Lukáš and ČERNOCKÝ Jan. Bayesian Subspace Hidden Markov Model for Acoustic Unit Discovery. In: Proceedings of Interspeech 2019. Graz: International Speech Communication Association, 2019, pp. 261-265. ISSN 1990-9772.
SZŐKE Igor, SKÁCEL Miroslav, MOŠNER Ladislav, PALIESEK Jakub and ČERNOCKÝ Jan. Building and Evaluation of a Real Room Impulse Response Dataset. IEEE Journal of Selected Topics in Signal Processing, vol. 13, no. 4, 2019, pp. 863-876. ISSN 1932-4553.
ZEINALI Hossein, WANG Shuai, SILNOVA Anna, MATĚJKA Pavel and PLCHOT Oldřich. BUT System Description to VoxCeleb Speaker Recognition Challenge 2019. In: Proceedings of The VoxCeleb Challenge Workshop 2019. Graz, 2019, pp. 1-4.
ZEINALI Hossein, STAFYLAKIS Themos, ATHANASOPOULOU Georgia, ROHDIN Johan A., GKINIS Ioanis, BURGET Lukáš and ČERNOCKÝ Jan. Detecting Spoofing Attacks Using VGG and SincNet: BUT-Omilia Submission to ASVspoof 2019 Challenge. In: Proceedings of Interspeech. Graz: International Speech Communication Association, 2019, pp. 1073-1077. ISSN 1990-9772.
NOVOTNÝ Ondřej, PLCHOT Oldřich, GLEMBEK Ondřej, BURGET Lukáš and MATĚJKA Pavel. Discriminatively Re-trained i-Vector Extractor For Speaker Recognition. In: Proceedings of 2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP). Brighton: IEEE Signal Processing Society, 2019, pp. 6031-6035. ISBN 978-1-5386-4658-8.
NOVOTNÝ Ondřej, PLCHOT Oldřich, GLEMBEK Ondřej and BURGET Lukáš. Factorization of Discriminatively Trained i-Vector Extractor for Speaker Recognition. In: Proceedings of Interspeech. Graz: International Speech Communication Association, 2019, pp. 4330-4334. ISSN 1990-9772.
ZEINALI Hossein, BURGET Lukáš, ROHDIN Johan A., STAFYLAKIS Themos and ČERNOCKÝ Jan. How To Improve Your Speaker Embeddings Extractor in Generic Toolkits. In: Proceedings of 2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP). Brighton: IEEE Signal Processing Society, 2019, pp. 6141-6145. ISBN 978-1-5386-4658-8.
WANG Shuai, ROHDIN Johan A., BURGET Lukáš, PLCHOT Oldřich, QIAN Yanmin, YU Kai and ČERNOCKÝ Jan. On the Usage of Phonetic Information for Text-independent Speaker Embedding Extraction. In: Proceedings of Interspeech. Graz: International Speech Communication Association, 2019, pp. 1148-1152. ISSN 1990-9772.
STAFYLAKIS Themos, ROHDIN Johan A., PLCHOT Oldřich, MIZERA Petr and BURGET Lukáš. Self-supervised speaker embeddings. In: Proceedings of Interspeech. Graz: International Speech Communication Association, 2019, pp. 2863-2867. ISSN 1990-9772.
MOŠNER Ladislav, PLCHOT Oldřich, ROHDIN Johan A., BURGET Lukáš and ČERNOCKÝ Jan. Speaker Verification with Application-Aware Beamforming. In: IEEE Automatic Speech Recognition and Understanding Workshop - Proceedings (ASRU). Sentosa, Singapore: IEEE Signal Processing Society, 2019, pp. 411-418. ISBN 978-1-7281-0306-8.

2018

ALAM Jahangir, BHATTACHARYA Gautam, BRUMMER Johan Nikolaas Langenhoven, BURGET Lukáš, DIEZ Sánchez Mireia, GLEMBEK Ondřej, KENNY Patrick, KLČO Michal, LANDINI Federico Nicolás, LOZANO Díez Alicia, MATĚJKA Pavel, MONTEIRO Joao, MOŠNER Ladislav, NOVOTNÝ Ondřej, PLCHOT Oldřich, PROFANT Ján, ROHDIN Johan A., SILNOVA Anna, SLAVÍČEK Josef, STAFYLAKIS Themos and ZEINALI Hossein. ABC NIST SRE 2018 SYSTEM DESCRIPTION. In: Proceedings of 2018 NIST SRE Workshop. Athens: National Institute of Standards and Technology, 2018, pp. 1-10.
PLCHOT Oldřich, MATĚJKA Pavel, NOVOTNÝ Ondřej, CUMANI Sandro, LOZANO Díez Alicia, SLAVÍČEK Josef, DIEZ Sánchez Mireia, GRÉZL František, GLEMBEK Ondřej, KAMSALI Veera Mounika, SILNOVA Anna, BURGET Lukáš, ONDEL Yang Lucas Antoine Francois, KESIRAJU Santosh and ROHDIN Johan A. Analysis of BUT-PT Submission for NIST LRE 2017. In: Proceedings of Odyssey 2018 The Speaker and Language Recognition Workshop. Les Sables d'Olonne: International Speech Communication Association, 2018, pp. 47-53. ISSN 2312-2846.
LOZANO Díez Alicia, PLCHOT Oldřich, MATĚJKA Pavel, NOVOTNÝ Ondřej and GONZALEZ-RODRIGUEZ Joaquin. Analysis of DNN-based Embeddings for Language Recognition on the NIST LRE 2017. In: Proceedings of Odyssey 2018 The Speaker and Language Recognition Workshop. Les Sables d'Olonne: International Speech Communication Association, 2018, pp. 39-46. ISSN 2312-2846.
KARAFIÁT Martin, BASKAR Murali K., SZŐKE Igor, MALENOVSKÝ Vladimír, VESELÝ Karel, GRÉZL František, BURGET Lukáš and ČERNOCKÝ Jan. BUT OpenSAT 2017 speech recognition system. In: Proceedings of Interspeech 2018. Hyderabad: International Speech Communication Association, 2018, pp. 2638-2642. ISSN 1990-9772.
DIEZ Sánchez Mireia, LANDINI Federico Nicolás, BURGET Lukáš, ROHDIN Johan A., SILNOVA Anna, ŽMOLÍKOVÁ Kateřina, NOVOTNÝ Ondřej, VESELÝ Karel, GLEMBEK Ondřej, PLCHOT Oldřich, MOŠNER Ladislav and MATĚJKA Pavel. BUT system for DIHARD Speech Diarization Challenge 2018. In: Proceedings of Interspeech 2018. Hyderabad: International Speech Communication Association, 2018, pp. 2798-2802. ISSN 1990-9772.
SILNOVA Anna, MATĚJKA Pavel, GLEMBEK Ondřej, PLCHOT Oldřich, NOVOTNÝ Ondřej, GRÉZL František, SCHWARZ Petr and ČERNOCKÝ Jan. BUT/Phonexia Bottleneck Feature Extractor. In: Proceedings of Odyssey 2018. Les Sables d'Olonne: International Speech Communication Association, 2018, pp. 283-287. ISSN 2312-2846.
ZEINALI Hossein, BURGET Lukáš and ČERNOCKÝ Jan. Convolutional Neural Networks and X-Vector Embedding for DCASE2018 Acoustic Scene Classification Challenge. In: Proceedings of DCASE 2018 Workshop. Surrey: Tampere University of Technology, 2018, pp. 1-5. ISBN 978-952-15-4262-6.
MOŠNER Ladislav, MATĚJKA Pavel, NOVOTNÝ Ondřej and ČERNOCKÝ Jan. Dereverberation and Beamforming in Far-Field Speaker Recognition. In: Proceedings of ICASSP 2018. Calgary: IEEE Signal Processing Society, 2018, pp. 5254-5258. ISBN 978-1-5386-4658-8.
MOŠNER Ladislav, PLCHOT Oldřich, MATĚJKA Pavel, NOVOTNÝ Ondřej and ČERNOCKÝ Jan. Dereverberation and Beamforming in Robust Far-Field Speaker Recognition. In: Proceedings of Interspeech 2018. Hyderabad: International Speech Communication Association, 2018, pp. 1334-1338. ISSN 1990-9772.
LOZANO Díez Alicia, PLCHOT Oldřich, MATĚJKA Pavel and GONZALEZ-RODRIGUEZ Joaquin. DNN Based Embeddings for Language Recognition. In: Proceedings of ICASSP 2018. Calgary: IEEE Signal Processing Society, 2018, pp. 5184-5188. ISBN 978-1-5386-4658-8.
BRUMMER Johan Nikolaas Langenhoven, SILNOVA Anna, BURGET Lukáš and STAFYLAKIS Themos. Gaussian meta-embeddings for efficient scoring of a heavy-tailed PLDA model. In: Proceedings of Odyssey 2018. Les Sables d'Olonne: International Speech Communication Association, 2018, pp. 349-356. ISSN 2312-2846.
BARTOS Anthony L., CIPR Tomáš, NELSON Douglas J., SCHWARZ Petr, BANOWETZ John and JERABEK Ladislav. Noise-robust speech triage. Journal of the Acoustical Society of America, vol. 143, no. 4, 2018, pp. 2313-2320. ISSN 1520-8524.
NOVOTNÝ Ondřej, MATĚJKA Pavel, PLCHOT Oldřich and GLEMBEK Ondřej. On the use of DNN Autoencoder for Robust Speaker Recognition. Brno: Faculty of Information Technology BUT, 2018.
NOVOTNÝ Ondřej, PLCHOT Oldřich, MATĚJKA Pavel, MOŠNER Ladislav and GLEMBEK Ondřej. On the use of X-vectors for Robust Speaker Recognition. In: Proceedings of Odyssey 2018. Les Sables d'Olonne: International Speech Communication Association, 2018, pp. 168-175. ISSN 2312-2846.

2017

SILNOVA Anna, BURGET Lukáš and ČERNOCKÝ Jan. Alternative Approaches to Neural Network based Speaker Verification. In: Proceedings of Interspeech 2017. Stockholm: International Speech Communication Association, 2017, pp. 1572-1575. ISSN 1990-9772.
PLCHOT Oldřich, MATĚJKA Pavel, SILNOVA Anna, NOVOTNÝ Ondřej, DIEZ Sánchez Mireia, ROHDIN Johan A., GLEMBEK Ondřej, BRÜMMER Niko, SWART Albert du Preez, PRIETO Jesús J., GARCIA Perera Leibny Paola, BUERA Luis, KENNY Patrick, ALAM Jahangir and BHATTACHARYA Gautam. Analysis and Description of ABC Submission to NIST SRE 2016. In: Proceedings of Interspeech 2017. Stockholm: International Speech Communication Association, 2017, pp. 1348-1352. ISSN 1990-9772.
MATĚJKA Pavel, NOVOTNÝ Ondřej, PLCHOT Oldřich, BURGET Lukáš, DIEZ Sánchez Mireia and ČERNOCKÝ Jan. Analysis of Score Normalization in Multilingual Speaker Recognition. In: Proceedings of Interspeech 2017. Stockholm: International Speech Communication Association, 2017, pp. 1567-1571. ISSN 1990-9772.
MATĚJKA Pavel, PLCHOT Oldřich, NOVOTNÝ Ondřej, CUMANI Sandro, LOZANO Díez Alicia, SLAVÍČEK Josef, DIEZ Sánchez Mireia, GRÉZL František, GLEMBEK Ondřej, KAMSALI Veera Mounika, SILNOVA Anna, BURGET Lukáš, ONDEL Yang Lucas Antoine Francois, KESIRAJU Santosh and ROHDIN Johan A. BUT-PT System Description for NIST LRE 2017. In: Proceedings of NIST Language Recognition Workshop 2017. Orlando, Florida: National Institute of Standards and Technology, 2017, pp. 1-6.
ZEINALI Hossein, SAMETI Hossein and BURGET Lukáš. HMM-Based Phrase-Independent i-Vector Extractor for Text-Dependent Speaker Verification. IEEE/ACM TRANSACTIONS ON AUDIO, SPEECH AND LANGUAGE PROCESSING, vol. 25, no. 7, 2017, pp. 1421-1435. ISSN 2329-9290.
VESELÝ Karel, BASKAR Murali K., DIEZ Sánchez Mireia and BENEŠ Karel. MGB-3 BUT System: Low-resource ASR on Egyptian YOUTUBE data. In: Proceedings of ASRU 2017. Okinawa: IEEE Signal Processing Society, 2017, pp. 368-373. ISBN 978-1-5090-4788-8.
ŽMOLÍKOVÁ Kateřina, DELCROIX Marc, KINOSHITA Keisuke, HIGUCHI Takuya, OGAWA Atsunori and NAKATANI Tomohiro. Speaker-aware neural network based beamformer for speaker extraction in speech mixtures. In: Proceedings of Interspeech 2017. Stockholm: International Speech Communication Association, 2017, pp. 2655-2659. ISSN 1990-9772.
ZEINALI Hossein, SAMETI Hossein, BURGET Lukáš and ČERNOCKÝ Jan. Text-dependent speaker verification based on i-vectors, Neural Networks and Hidden Markov Models. Computer Speech and Language, vol. 2017, no. 46, pp. 53-71. ISSN 0885-2308.
KARAFIÁT Martin, VESELÝ Karel, ŽMOLÍKOVÁ Kateřina, DELCROIX Marc, WATANABE Shinji, BURGET Lukáš, ČERNOCKÝ Jan and SZŐKE Igor. Training Data Augmentation and Data Selection. New Era for Robust Speech Recognition: Exploiting Deep Learning. Computer Science, Artificial Intelligence. Heidelberg: Springer International Publishing, 2017, pp. 245-260. ISBN 978-3-319-64679-4.

2016

BRUMMER Johan Nikolaas Langenhoven, SWART Albert du Preez, PRIETO Jesús J., GARCIA Perera Leibny Paola, MATĚJKA Pavel, PLCHOT Oldřich, DIEZ Sánchez Mireia, SILNOVA Anna, JIANG Xiaowei, NOVOTNÝ Ondřej, ROHDIN Johan A., GLEMBEK Ondřej, GRÉZL František, BURGET Lukáš, ONDEL Yang Lucas Antoine Francois, PEŠÁN Jan, ČERNOCKÝ Jan, KENNY Patrick, ALAM Jahangir, BHATTACHARYA Gautam and ZEINALI Hossein et al. ABC NIST SRE 2016 SYSTEM DESCRIPTION. San Diego: National Institute of Standards and Technology, 2016.
LOZANO Díez Alicia, SILNOVA Anna, MATĚJKA Pavel, GLEMBEK Ondřej, PLCHOT Oldřich, PEŠÁN Jan, BURGET Lukáš and GONZALEZ-RODRIGUEZ Joaquin. Analysis and Optimization of Bottleneck Features for Speaker Recognition. In: Proceedings of Odyssey 2016. Bilbao: International Speech Communication Association, 2016, pp. 352-357. ISSN 2312-2846.
MATĚJKA Pavel, GLEMBEK Ondřej, NOVOTNÝ Ondřej, PLCHOT Oldřich, GRÉZL František, BURGET Lukáš and ČERNOCKÝ Jan. Analysis Of DNN Approaches To Speaker Identification. In: Proceedings of the 41st IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2016), 2016. Shanghai: IEEE Signal Processing Society, 2016, pp. 5100-5104. ISBN 978-1-4799-9988-0.
NOVOTNÝ Ondřej, MATĚJKA Pavel, PLCHOT Oldřich, GLEMBEK Ondřej, BURGET Lukáš and ČERNOCKÝ Jan. Analysis of Speaker Recognition Systems in Realistic Scenarios of the SITW 2016 Challenge. In: Proceedings of Interspeech 2016. San Francisco: International Speech Communication Association, 2016, pp. 828-832. ISBN 978-1-5108-3313-5.
NOVOTNÝ Ondřej, MATĚJKA Pavel, GLEMBEK Ondřej, PLCHOT Oldřich, GRÉZL František, BURGET Lukáš and ČERNOCKÝ Jan. Analysis of the DNN-Based SRE Systems in Multi-language Conditions. In: Proceedings of SLT 2016. San Diego: IEEE Signal Processing Society, 2016, pp. 199-204. ISBN 978-1-5090-4903-5.
PLCHOT Oldřich, BURGET Lukáš, ARONOWITZ Hagai and MATĚJKA Pavel. Audio Enhancing With DNN Autoencoder For Speaker Recognition. In: Proceedings of the 41st IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2016), 2016. Shanghai: IEEE Signal Processing Society, 2016, pp. 5090-5094. ISBN 978-1-4799-9988-0.
PLCHOT Oldřich, MATĚJKA Pavel, FÉR Radek, GLEMBEK Ondřej, NOVOTNÝ Ondřej, PEŠÁN Jan, VESELÝ Karel, ONDEL Yang Lucas Antoine Francois, KARAFIÁT Martin, GRÉZL František, KESIRAJU Santosh, BURGET Lukáš, BRUMMER Johan Nikolaas Langenhoven, SWART Albert du Preez, CUMANI Sandro, MALLIDI Sri Harish and LI Ruizhi. BAT System Description for NIST LRE 2015. In: Proceedings of Odyssey 2016, The Speaker and Language Recognition Workshop. Bilbao: International Speech Communication Association, 2016, pp. 166-173. ISSN 2312-2846.
SAGHA Hesam, MATĚJKA Pavel, GAVRYUOKOVA Maryna, POVOLNÝ Filip, MARCHI Erik and SCHULLER Björn W. Enhancing multilingual recognition of emotion in speech by language identification. In: 17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION - Proceedings (INTERSPEECH 2016). San Francisco: International Speech Communication Association, 2016, pp. 2949-2953. ISSN 1990-9772.
ZEINALI Hossein, SAMETI Hossein, BURGET Lukáš, ČERNOCKÝ Jan, MAGHSOODI Nooshin and MATĚJKA Pavel. i-vector/HMM Based Text-dependent Speaker Verification System for RedDots Challenge. In: Proceedings of Interspeech 2016. San Francisco: International Speech Communication Association, 2016, pp. 440-444. ISBN 978-1-5108-3313-5.
VESELÝ Karel, WATANABE Shinji, ŽMOLÍKOVÁ Kateřina, KARAFIÁT Martin, BURGET Lukáš and ČERNOCKÝ Jan. Sequence Summarizing Neural Network for Speaker Adaptation. In: Proceedings of the 41st IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2016), 2016. Shanghai: IEEE Signal Processing Society, 2016, pp. 5315-5319. ISBN 978-1-4799-9988-0.
PEŠÁN Jan, BURGET Lukáš and ČERNOCKÝ Jan. Sequence Summarizing Neural Networks for Spoken Language Recognition. In: Proceedings of Interspeech 2016. San Francisco: International Speech Communication Association, 2016, pp. 3285-3289. ISBN 978-1-5108-3313-5.

2015

HSIAO Roger, MA Jeff, HARTMANN William, KARAFIÁT Martin, GRÉZL František, BURGET Lukáš, SZŐKE Igor, ČERNOCKÝ Jan, WATANABE Shinji, CHEN Zhuo, MALLIDI Sri Harish, HEŘMANSKÝ Hynek, TSAKALIDIS Stavros and SCHWARTZ Richard. Robust Speech Recognition in Unknown Reverberant and Noisy Conditions. In: Proceedings of 2015 IEEE Automatic Speech Recognition and Understanding Workshop. Scottsdale, Arizona: IEEE Signal Processing Society, 2015, pp. 533-538. ISBN 978-1-4799-7291-3.

Products

2019

Tensorflow implementation of speaker recognition with x-vector topology, software, 2019
Authors: Zeinali Hossein, Burget Lukáš, Rohdin Johan A., Stafylakis Themos, Černocký Jan

2016

Software for artificial noising and reverberation of speech data, software, 2016
Authors: Szőke Igor, Malenovský Vladimír, Novotný Ondřej
