Ryuichi Yamamoto

Cited by

	All	Since 2019
Citations	2651	2560
h-index	15	14
i10-index	23	20

Citations per year

Year	Citations
2015	7
2016	16
2017	31
2018	25
2019	63
2020	284
2021	614
2022	603
2023	652
2024	320

Public access

1 article available, 0 articles not available (based on funding mandates)

Co-authors

Eunwoo Song, Voice, Naver Cloud (verified email at navercorp.com)
Tomoki Hayashi, Human Dataware Lab. Co., Ltd., Nagoya University (verified email at g.sp.m.is.nagoya-u.ac.jp)
Shinji Watanabe, Carnegie Mellon University (verified email at cmu.edu)
Takenori Yoshimura, Nagoya Institute of Technology (verified email at nitech.ac.jp)
Tomoki Toda, Nagoya University (verified email at icts.nagoya-u.ac.jp)
Min-Jae Hwang, Meta AI (verified email at meta.com)
Brian McFee, Music and Performing Arts Professions / Center for Data Science, New York University (verified email at nyu.edu)
Shigeki Karita, Google (verified email at google.com)
Hirofumi Inaguma, Fundamental AI Research (FAIR) at Meta (verified email at meta.com)
Jiatong Shi (史嘉彤), Carnegie Mellon University (verified email at andrew.cmu.edu)
Takaaki Saeki, Google (verified email at google.com)


Ryuichi Yamamoto

LY Corporation

Verified email at lycorp.co.jp

Speech Synthesis, Voice Conversion, Speech Recognition, Machine Learning, Singing Voice Synthesis


Title	Cited by	Year
Parallel WaveGAN: A fast waveform generation model based on generative adversarial networks with multi-resolution spectrogram. R Yamamoto, E Song, JM Kim. ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and …, 2020	855	2020
A comparative study on Transformer vs RNN in speech applications. S Karita, N Chen, T Hayashi, T Hori, H Inaguma, Z Jiang, M Someki, ... 2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2019	802	2019
librosa/librosa: 0.6.3. B McFee, M McVicar, S Balke, V Lostanlen, C Thom, C Raffel, D Lee, ... URL: https://doi.org/10.5281/zenodo.2564164, 2019	360*	2019
ESPnet-TTS: Unified, reproducible, and integratable open source end-to-end text-to-speech toolkit. T Hayashi, R Yamamoto, K Inoue, T Yoshimura, S Watanabe, T Toda, ... ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and …, 2020	216	2020
Probability density distillation with generative adversarial networks for high-quality parallel waveform generation. R Yamamoto, E Song, JM Kim. arXiv preprint arXiv:1904.04472, 2019	57	2019
ESPnet2-TTS: Extending the edge of TTS research. T Hayashi, R Yamamoto, T Yoshimura, P Wu, J Shi, T Saeki, Y Ju, ... arXiv preprint arXiv:2110.07840, 2021	51	2021
TTS-by-TTS: TTS-driven data augmentation for fast and high-quality speech synthesis. MJ Hwang, R Yamamoto, E Song, JM Kim. ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and …, 2021	37	2021
Parallel waveform synthesis based on generative adversarial networks with voicing-aware conditional discriminators. R Yamamoto, E Song, MJ Hwang, JM Kim. ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and …, 2021	21	2021
Improved Parallel WaveGAN vocoder with perceptually weighted spectrogram loss. E Song, R Yamamoto, MJ Hwang, JS Kim, O Kwon, JM Kim. 2021 IEEE Spoken Language Technology Workshop (SLT), 470-476, 2021	21	2021
Semi-supervised speaker adaptation for end-to-end speech synthesis with pretrained models. K Inoue, S Hara, M Abe, T Hayashi, R Yamamoto, S Watanabe. ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and …, 2020	19	2020
Cross-speaker emotion transfer for low-resource text-to-speech using non-parallel voice conversion with pitch-shift data augmentation. R Terashima, R Yamamoto, E Song, Y Shirahata, HW Yoon, JM Kim, ... arXiv preprint arXiv:2204.10020, 2022	17	2022
Ryry: A real-time score-following automatic accompaniment playback system capable of real performances with errors, repeats and jumps. S Sako, R Yamamoto, T Kitamura. Active Media Technology: 10th International Conference, AMT 2014, Warsaw …, 2014	17	2014
Phrase break prediction with bidirectional encoder representations in Japanese text-to-speech synthesis. K Futamata, B Park, R Yamamoto, K Tachibana. arXiv preprint arXiv:2104.12395, 2021	16	2021
High-Fidelity Parallel WaveGAN with Multi-Band Harmonic-Plus-Noise Model. MJ Hwang, R Yamamoto, E Song, JM Kim. Interspeech, 2227-2231, 2021	15	2021
Improving LPCNet-based text-to-speech with linear prediction-structured mixture density network. MJ Hwang, E Song, R Yamamoto, F Soong, HG Kang. ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and …, 2020	15	2020
Score following handling performances with arbitrary repeats and skips and automatic accompaniment. E Nakamura, H Takeda, R Yamamoto, Y Saito, S Sako, S Sagayama. IPSJ Journal 54 (4), 1338-1349, 2013	15	2013
Period VITS: Variational inference with explicit pitch modeling for end-to-end emotional speech synthesis. Y Shirahata, R Yamamoto, E Song, R Terashima, JM Kim, K Tachibana. ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and …, 2023	12	2023
Language model-based emotion prediction methods for emotional speech synthesis systems. HW Yoon, O Kwon, H Lee, R Yamamoto, E Song, JM Kim, MJ Hwang. arXiv preprint arXiv:2206.15067, 2022	12	2022
Neural text-to-speech with a modeling-by-generation excitation vocoder. E Song, MJ Hwang, R Yamamoto, JS Kim, O Kwon, JM Kim. arXiv preprint arXiv:2008.00132, 2020	11	2020
Lightweight and high-fidelity end-to-end text-to-speech with multi-band generation and inverse short-time Fourier transform. M Kawamura, Y Shirahata, R Yamamoto, K Tachibana. ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and …, 2023	10	2023
