Folgen
Shansong Liu
Shansong Liu
Applied Research Center (ARC), PCG, Tencent
Bestätigte E-Mail-Adresse bei tencent.com
Titel
Zitiert von
Zitiert von
Jahr
Audio-visual recognition of overlapped speech for the lrs2 dataset
J Yu, SX Zhang, J Wu, S Ghorbani, B Wu, S Kang, S Liu, X Liu, H Meng, ...
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
912020
Investigation of data augmentation techniques for disordered speech recognition
M Geng, X Xie, S Liu, J Yu, S Hu, X Liu, H Meng
arXiv preprint arXiv:2201.05562, 2022
572022
Development of the CUHK Dysarthric Speech Recognition System for the UA Speech Corpus.
J Yu, X Xie, S Liu, S Hu, MWY Lam, X Wu, KH Wong, X Liu, H Meng
Interspeech, 2938-2942, 2018
532018
Recent progress in the cuhk dysarthric speech recognition system
S Liu, M Geng, S Hu, X Xie, M Cui, J Yu, X Liu, H Meng
IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 2267-2281, 2021
452021
Adversarial data augmentation for disordered speech recognition
Z Jin, M Geng, X Xie, J Yu, S Liu, X Liu, H Meng
arXiv preprint arXiv:2108.00899, 2021
332021
Development of the cuhk elderly speech recognition system for neurocognitive disorder detection using the dementiabank corpus
Z Ye, S Hu, J Li, X Xie, M Geng, J Yu, J Xu, B Xue, S Liu, X Liu, H Meng
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
302021
Neural architecture search for LF-MMI trained time delay neural networks
S Hu, X Xie, M Cui, J Deng, S Liu, J Yu, M Geng, X Liu, H Meng
IEEE/ACM Transactions on Audio, Speech, and Language Processing 30, 1093-1107, 2022
282022
Bayesian transformer language models for speech recognition
B Xue, J Yu, J Xu, S Liu, S Hu, Z Ye, M Geng, X Liu, H Meng
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
232021
Audio-visual multi-channel integration and recognition of overlapped speech
J Yu, SX Zhang, B Wu, S Liu, S Hu, M Geng, X Liu, H Meng, D Yu
IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 2067-2082, 2021
232021
Spectro-temporal deep features for disordered speech assessment and recognition
M Geng, S Liu, J Yu, X Xie, S Hu, Z Ye, Z Jin, X Liu, H Meng
arXiv preprint arXiv:2201.05554, 2022
192022
Exploiting Cross-Domain Visual Feature Generation for Disordered Speech Recognition.
S Liu, X Xie, J Yu, S Hu, M Geng, R Su, SX Zhang, X Liu, H Meng
Interspeech, 711-715, 2020
192020
The CUHK Dysarthric Speech Recognition Systems for English and Cantonese.
S Hu, S Liu, HF Chang, M Geng, J Chen, LW Chung, TK Hei, J Yu, ...
INTERSPEECH, 3669-3670, 2019
182019
Exploiting Visual Features Using Bayesian Gated Neural Networks for Disordered Speech Recognition.
S Liu, S Hu, Y Wang, J Yu, R Su, X Liu, H Meng
INTERSPEECH (Best Student Paper Nomination), 4120-4124, 2019
172019
On the Use of Pitch Features for Disordered Speech Recognition.
S Liu, S Hu, X Liu, H Meng
Interspeech, 4130-4134, 2019
162019
Bayesian and gaussian process neural networks for large vocabulary continuous speech recognition
S Hu, MWY Lam, X Xie, S Liu, J Yu, X Wu, X Liu, H Meng
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
152019
Limited-memory bfgs optimization of recurrent neural network language models for speech recognition
X Liu, S Liu, J Sha, J Yu, Z Xu, X Chen, H Meng
2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018
152018
Bayesian learning of LF-MMI trained time delay neural networks for speech recognition
S Hu, X Xie, S Liu, J Yu, Z Ye, M Geng, X Liu, H Meng
IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 1514-1529, 2021
142021
Exploiting cross domain acoustic-to-articulatory inverted features for disordered speech recognition
S Hu, S Liu, X Xie, M Geng, T Wang, S Hu, M Cui, X Liu, H Meng
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
122022
Bayesian Parametric and Architectural Domain Adaptation of LF-MMI Trained TDNNs for Elderly and Dysarthric Speech Recognition.
J Deng, FR Gutierrez, S Hu, M Geng, X Xie, Z Ye, S Liu, J Yu, X Liu, ...
Interspeech, 4818-4822, 2021
122021
Music understanding llama: Advancing text-to-music generation with question answering and captioning
S Liu, AS Hussain, C Sun, Y Shan
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024
112024
Das System kann den Vorgang jetzt nicht ausführen. Versuchen Sie es später erneut.
Artikel 1–20