Yu Zhang
Google
Verified email at csail.mit.edu
Title
Cited by
Year
SpecAugment: A simple data augmentation method for automatic speech recognition
DS Park, W Chan, Y Zhang, CC Chiu, B Zoph, ED Cubuk, QV Le
arXiv preprint arXiv:1904.08779, 2019
Cited by: 2826; Year: 2019
Natural TTS synthesis by conditioning WaveNet on mel spectrogram predictions
J Shen, R Pang, RJ Weiss, M Schuster, N Jaitly, Z Yang, Z Chen, Y Zhang, ...
2018 IEEE international conference on acoustics, speech and signal …, 2018
Cited by: 2385; Year: 2018
Conformer: Convolution-augmented transformer for speech recognition
A Gulati, J Qin, CC Chiu, N Parmar, Y Zhang, J Yu, W Han, S Wang, ...
arXiv preprint arXiv:2005.08100, 2020
Cited by: 1550; Year: 2020
Style tokens: Unsupervised style modeling, control and transfer in end-to-end speech synthesis
Y Wang, D Stanton, Y Zhang, RJS Ryan, E Battenberg, J Shor, Y Xiao, ...
International Conference on Machine Learning, 5180-5189, 2018
Cited by: 701; Year: 2018
Transfer learning from speaker verification to multispeaker text-to-speech synthesis
Y Jia, Y Zhang, R Weiss, Q Wang, J Shen, F Ren, P Nguyen, R Pang, ...
Advances in neural information processing systems 31, 2018
Cited by: 694; Year: 2018
Very deep convolutional networks for end-to-end speech recognition
Y Zhang, W Chan, N Jaitly
2017 IEEE international conference on acoustics, speech and signal …, 2017
Cited by: 504; Year: 2017
LibriTTS: A corpus derived from LibriSpeech for text-to-speech
H Zen, V Dang, R Clark, Y Zhang, RJ Weiss, Y Jia, Z Chen, Y Wu
arXiv preprint arXiv:1904.02882, 2019
Cited by: 468; Year: 2019
An introduction to computational networks and the computational network toolkit
MS Dong Yu, Adam Eversole, Mike Seltzer, Kaisheng Yao, Zhiheng Huang, Brian ...
Tech. Rep. MSR, Microsoft Research, 2014, http://codebox/cntk, 2014
Cited by: 464*; Year: 2014
Spoken language understanding using long short-term memory neural networks
K Yao, B Peng, Y Zhang, D Yu, G Zweig, Y Shi
IEEE SLT, 2014
Cited by: 366; Year: 2014
Unsupervised learning of disentangled and interpretable representations from sequential data
WN Hsu, Y Zhang, J Glass
Advances in neural information processing systems 30, 2017
Cited by: 357; Year: 2017
Highway long short-term memory RNNs for distant speech recognition
Y Zhang, G Chen, D Yu, K Yao, S Khudanpur, J Glass
2016 IEEE international conference on acoustics, speech and signal …, 2016
Cited by: 339; Year: 2016
WaveGrad: Estimating gradients for waveform generation
N Chen, Y Zhang, H Zen, RJ Weiss, M Norouzi, W Chan
arXiv preprint arXiv:2009.00713, 2020
Cited by: 334; Year: 2020
Advances in joint CTC-attention based end-to-end speech recognition with a deep CNN encoder and RNN-LM
T Hori, S Watanabe, Y Zhang, W Chan
arXiv preprint arXiv:1706.02737, 2017
Cited by: 316; Year: 2017
Simple recurrent units for highly parallelizable recurrence
T Lei, Y Zhang, SI Wang, H Dai, Y Artzi
arXiv preprint arXiv:1709.02755, 2017
Cited by: 259; Year: 2017
Pushing the limits of semi-supervised learning for automatic speech recognition
Y Zhang, J Qin, DS Park, W Han, CC Chiu, R Pang, QV Le, Y Wu
arXiv preprint arXiv:2010.10504, 2020
Cited by: 238; Year: 2020
Hierarchical generative modeling for controllable speech synthesis
WN Hsu, Y Zhang, RJ Weiss, H Zen, Y Wu, Y Wang, Y Cao, Y Jia, Z Chen, ...
arXiv preprint arXiv:1810.07217, 2018
Cited by: 223; Year: 2018
Improved noisy student training for automatic speech recognition
DS Park, Y Zhang, Y Jia, W Han, CC Chiu, B Li, Y Wu, QV Le
arXiv preprint arXiv:2005.09629, 2020
Cited by: 200; Year: 2020
ContextNet: Improving convolutional neural networks for automatic speech recognition with global context
W Han, Z Zhang, Y Zhang, J Yu, CC Chiu, J Qin, A Gulati, R Pang, Y Wu
arXiv preprint arXiv:2005.03191, 2020
Cited by: 196; Year: 2020
Deep beamforming networks for multi-channel speech recognition
X Xiao, S Watanabe, H Erdogan, L Lu, J Hershey, ML Seltzer, G Chen, ...
2016 IEEE International Conference on Acoustics, Speech and Signal …, 2016
Cited by: 191; Year: 2016
I-Vector Based Clustering Training Data in Speech Recognition
Q Huo, ZJ Yan, Y Zhang, J Xu
US Patent App. 13/640,804, 2015
Cited by: 189; Year: 2015