Zero-shot multi-speaker text-to-speech with state-of-the-art neural speaker embeddings E Cooper, CI Lai, Y Yasuda, F Fang, X Wang, N Chen, J Yamagishi ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 211 | 2020 |
Generalization ability of MOS prediction networks E Cooper, WC Huang, T Toda, J Yamagishi ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 142 | 2022 |
The voicemos challenge 2022 WC Huang, E Cooper, Y Tsao, HM Wang, T Toda, J Yamagishi arXiv preprint arXiv:2203.11389, 2022 | 112 | 2022 |
Effect of pronounciations on OOV queries in spoken term detection D Can, E Cooper, A Sethy, C White, B Ramabhadran, M Saraclar 2009 IEEE International Conference on Acoustics, Speech and Signal …, 2009 | 75 | 2009 |
Ldnet: Unified listener dependent modeling in mos prediction for synthetic speech WC Huang, E Cooper, J Yamagishi, T Toda ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 68 | 2022 |
How do voices from past speech synthesis challenges compare today? E Cooper, J Yamagishi arXiv preprint arXiv:2105.02373, 2021 | 58 | 2021 |
Improving speech recognition and keyword search for low resource languages using web data G Mendels, E Cooper, V Soto, J Hirschberg, MJF Gales, KM Knill, A Ragni, ... INTERSPEECH 2015: 16th Annual Conference of the International Speech …, 2015 | 49 | 2015 |
The partialspoof database and countermeasures for the detection of short fake speech segments embedded in an utterance L Zhang, X Wang, E Cooper, N Evans, J Yamagishi IEEE/ACM Transactions on Audio, Speech, and Language Processing 31, 813-825, 2022 | 44 | 2022 |
An initial investigation for detecting partially spoofed audio L Zhang, X Wang, E Cooper, J Yamagishi, J Patino, N Evans arXiv preprint arXiv:2104.02518, 2021 | 40 | 2021 |
Cross-language prominence detection A Rosenberg, EL Cooper, R Levitan, JB Hirschberg Speech Prosody 2012, 2012 | 37 | 2012 |
Can speaker augmentation improve multi-speaker end-to-end TTS? E Cooper, CI Lai, Y Yasuda, J Yamagishi arXiv preprint arXiv:2005.01245, 2020 | 27 | 2020 |
Attention back-end for automatic speaker verification with multiple enrollment utterances C Zeng, X Wang, E Cooper, X Miao, J Yamagishi ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 26 | 2022 |
Utterance selection for optimizing intelligibility of tts voices trained on asr data E Cooper, X Wang Interspeech 2017 1, 2017 | 26 | 2017 |
Multi-task learning in utterance-level and segmental-level spoof detection L Zhang, X Wang, E Cooper, J Yamagishi arXiv preprint arXiv:2107.14132, 2021 | 25 | 2021 |
Text-to-speech synthesis using found data for low-resource languages E Cooper Columbia University, 2019 | 25 | 2019 |
Cross-language phrase boundary detection V Soto, E Cooper, A Rosenberg, J Hirschberg 2013 IEEE International Conference on Acoustics, Speech and Signal …, 2013 | 23 | 2013 |
The VoiceMOS Challenge 2023: zero-shot subjective speech quality prediction for multiple domains E Cooper, WC Huang, Y Tsao, HM Wang, T Toda, J Yamagishi 2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-7, 2023 | 22 | 2023 |
Language-independent speaker anonymization approach using self-supervised pre-trained models X Miao, X Wang, E Cooper, J Yamagishi, N Tomashenko arXiv preprint arXiv:2202.13097, 2022 | 22 | 2022 |
Learning disentangled phone and speaker representations in a semi-supervised VQ-VAE paradigm J Williams, Y Zhao, E Cooper, J Yamagishi ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 21 | 2021 |
Improved prosody from learned f0 codebook representations for vq-vae speech waveform reconstruction Y Zhao, H Li, CI Lai, J Williams, E Cooper, J Yamagishi arXiv preprint arXiv:2005.07884, 2020 | 19 | 2020 |