Kateřina Žmolíková
Industrial postdoc, Demant
Verified email address at demant.com
Title
Cited by
Year
Single channel target speaker extraction and recognition with speaker beam
M Delcroix, K Zmolikova, K Kinoshita, A Ogawa, T Nakatani
2018 IEEE international conference on acoustics, speech and signal …, 2018
Cited by: 157, Year: 2018
Speakerbeam: Speaker aware neural network for target speaker extraction in speech mixtures
K Žmolíková, M Delcroix, K Kinoshita, T Ochiai, T Nakatani, L Burget, ...
IEEE Journal of Selected Topics in Signal Processing 13 (4), 800-814, 2019
Cited by: 115, Year: 2019
Speaker-aware neural network based beamformer for speaker extraction in speech mixtures
K Zmolikova, M Delcroix, K Kinoshita, T Higuchi, A Ogawa, T Nakatani
Proc. Interspeech 2017, 2655-2659, 2017
Cited by: 101, Year: 2017
Improving speaker discrimination of target speech extraction with time-domain speakerbeam
M Delcroix, T Ochiai, K Zmolikova, K Kinoshita, N Tawara, T Nakatani, ...
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
Cited by: 69, Year: 2020
Sequence summarizing neural network for speaker adaptation
K Veselý, S Watanabe, K Žmolíková, M Karafiát, L Burget, J Černocký
2016 IEEE International Conference on Acoustics, Speech and Signal …, 2016
Cited by: 68, Year: 2016
BUT System for DIHARD Speech Diarization Challenge 2018.
M Diez, F Landini, L Burget, J Rohdin, A Silnova, K Zmolíková, O Novotný, ...
Interspeech, 2798-2802, 2018
Cited by: 50, Year: 2018
BUT system for the second DIHARD speech diarization challenge
F Landini, S Wang, M Diez, L Burget, P Matějka, K Žmolíková, L Mošner, ...
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
Cited by: 46, Year: 2020
Learning speaker representation for neural network based multichannel speaker extraction
K Zmolikova, M Delcroix, K Kinoshita, T Higuchi, A Ogawa, T Nakatani
2017 IEEE Automatic Speech Recognition and Understanding Workshop, 8-15, 2017
Cited by: 42, Year: 2017
Deep Clustering-Based Beamforming for Separation with Unknown Number of Sources.
T Higuchi, K Kinoshita, M Delcroix, K Zmolíková, T Nakatani
Interspeech, 1183-1187, 2017
Cited by: 39, Year: 2017
Compact network for speakerbeam target speaker extraction
M Delcroix, K Zmolikova, T Ochiai, K Kinoshita, S Araki, T Nakatani
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
Cited by: 34, Year: 2019
BUT system description for DIHARD speech diarization challenge 2019
F Landini, S Wang, M Diez, L Burget, P Matějka, K Žmolíková, L Mošner, ...
arXiv preprint arXiv:1910.08847, 2019
Cited by: 22, Year: 2019
Speaker activity driven neural speech extraction
M Delcroix, K Zmolikova, T Ochiai, K Kinoshita, T Nakatani
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
Cited by: 15, Year: 2021
Jointly trained transformers models for spoken language translation
HK Vydana, M Karafiát, K Zmolikova, L Burget, H Černocký
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
Cited by: 11, Year: 2021
Optimization of speaker-aware multichannel speech extraction with ASR criterion
K Zmolikova, M Delcroix, K Kinoshita, T Higuchi, T Nakatani, J Cernocky
Cited by: 10*
Auxiliary Loss Function for Target Speech Extraction and Recognition with Weak Supervision Based on Speaker Characteristics.
K Zmolikova, M Delcroix, D Raj, S Watanabe, J Cernocký
Interspeech, 1464-1468, 2021
Cited by: 5, Year: 2021
Data selection by sequence summarizing neural network in mismatch condition training
K Zmolikova, M Karafiát, K Veselý, M Delcroix, S Watanabe, L Burget, ...
Interspeech 2016, 2354-2358, 2016
Cited by: 5, Year: 2016
Listen only to me! How well can target speech extraction handle false alarms?
M Delcroix, K Kinoshita, T Ochiai, K Zmolikova, H Sato, T Nakatani
arXiv preprint arXiv:2204.04811, 2022
Cited by: 4, Year: 2022
Training data augmentation and data selection
M Karafiát, K Veselý, K Žmolíková, M Delcroix, S Watanabe, L Burget, ...
New Era for Robust Speech Recognition: Exploiting Deep Learning, 245-260, 2017
Cited by: 4, Year: 2017
Integration of variational autoencoder and spatial clustering for adaptive multi-channel neural speech separation
K Zmolikova, M Delcroix, L Burget, T Nakatani, JH Černocky
2021 IEEE Spoken Language Technology Workshop (SLT), 889-896, 2021
Cited by: 3, Year: 2021
Far-field speech recognition
K Žmolíková
Master's thesis, Brno University of Technology, Faculty of Information …, 2016
Cited by: 2, Year: 2016