Folgen
Xiaodong Cui
Xiaodong Cui
Principal Research Scientist, IBM T. J. Watson Research Center
Bestätigte E-Mail-Adresse bei us.ibm.com - Startseite
Titel
Zitiert von
Zitiert von
Jahr
Data augmentation for deep neural network acoustic modeling
X Cui, V Goel, B Kingsbury
IEEE/ACM Transactions on Audio, Speech, and Language Processing 23 (9), 1469 …, 2015
4742015
English conversational telephone speech recognition by humans and machines
G Saon, G Kurata, T Sercu, K Audhkhasi, S Thomas, D Dimitriadis, X Cui, ...
arXiv preprint arXiv:1703.02136, 2017
4312017
Dilated recurrent neural networks
S Chang, Y Zhang, W Han, M Yu, X Guo, W Tan, X Cui, M Witbrock, ...
Advances in neural information processing systems 30, 2017
2792017
Hybrid 8-bit floating point (HFP8) training and inference for deep neural networks
X Sun, J Choi, CY Chen, N Wang, S Venkataramani, VV Srinivasan, X Cui, ...
Advances in neural information processing systems 32, 2019
1432019
A database of vocal tract resonance trajectories for research in speech processing
L Deng, X Cui, R Pruvenok, J Huang, S Momen, Y Chen, A Alwan
2006 IEEE International Conference on Acoustics Speech and Signal Processing …, 2006
1242006
Ultra-low precision 4-bit training of deep neural networks
X Sun, N Wang, CY Chen, J Ni, A Agrawal, X Cui, S Venkataramani, ...
Advances in Neural Information Processing Systems 33, 1796-1807, 2020
1112020
Multilingual representations for low resource speech recognition and keyword search
J Cui, B Kingsbury, B Ramabhadran, A Sethy, K Audhkhasi, X Cui, ...
2015 IEEE workshop on automatic speech recognition and understanding (ASRU …, 2015
1002015
System combination and score normalization for spoken term detection
J Mamou, J Cui, X Cui, MJF Gales, B Kingsbury, K Knill, L Mangu, ...
2013 IEEE International Conference on Acoustics, Speech and Signal …, 2013
952013
Noise robust speech recognition using feature compensation based on polynomial regression of utterance SNR
X Cui, A Alwan
IEEE Transactions on Speech and Audio Processing 13 (6), 1161-1172, 2005
892005
Evolutionary stochastic gradient descent for optimization of deep neural networks
X Cui, W Zhang, Z Tüske, M Picheny
Advances in neural information processing systems 31, 2018
852018
Stereo-based stochastic mapping for robust speech recognition
M Afify, X Cui, Y Gao
IEEE transactions on audio, speech, and language processing 17 (7), 1325-1334, 2009
732009
A high-performance Cantonese keyword search system
B Kingsbury, J Cui, X Cui, MJF Gales, K Knill, J Mamou, L Mangu, ...
2013 IEEE International Conference on Acoustics, Speech and Signal …, 2013
612013
Data augmentation for deep convolutional neural network acoustic modeling
X Cui, V Goel, B Kingsbury
2015 IEEE International Conference on Acoustics, Speech and Signal …, 2015
602015
Tball data collection: the making of a young children's speech corpus
A Kazemzadeh, H You, M Iseli, B Jones, X Cui, M Heritage, P Price, ...
Ninth European Conference on Speech Communication and Technology, 2005
542005
Towards better understanding of adaptive gradient algorithms in generative adversarial nets
M Liu, Y Mroueh, J Ross, W Zhang, X Cui, P Das, T Yang
arXiv preprint arXiv:1912.11940, 2019
532019
Developing speech recognition systems for corpus indexing under the IARPA Babel program
J Cui, X Cui, B Ramabhadran, J Kim, B Kingsbury, J Mamou, L Mangu, ...
2013 IEEE International Conference on Acoustics, Speech and Signal …, 2013
502013
A study of variable-parameter Gaussian mixture hidden Markov modeling for noisy speech recognition
X Cui, Y Gong
IEEE transactions on audio, speech, and language processing 15 (4), 1366-1376, 2007
502007
A decentralized parallel algorithm for training generative adversarial nets
M Liu, W Zhang, Y Mroueh, X Cui, J Ross, T Yang, P Das
Advances in Neural Information Processing Systems 33, 11056-11070, 2020
452020
An empirical study of confusion modeling in keyword search for low resource languages
M Saraclar, A Sethy, B Ramabhadran, L Mangu, J Cui, X Cui, B Kingsbury, ...
2013 IEEE Workshop on Automatic Speech Recognition and Understanding, 464-469, 2013
442013
Speech emotion recognition with multiscale area attention and data augmentation
M Xu, F Zhang, X Cui, W Zhang
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
422021
Das System kann den Vorgang jetzt nicht ausführen. Versuchen Sie es später erneut.
Artikel 1–20