Minglun Han
ByteDance Inc.; Previously CASIA.
Verified email at bytedance.com
Title
Cited by
Year
VLP: A Survey on Vision-language Pre-training
F Chen, D Zhang, M Han, X Chen, J Shi, S Xu, B Xu
Machine Intelligence Research 20 (1), 38-56, 2023
Cited by: 131 | Year: 2023
X-LLM: Bootstrapping Advanced Large Language Models by Treating Multi-Modalities as Foreign Languages
F Chen, M Han, H Zhao, Q Zhang, J Shi, S Xu, B Xu
arXiv preprint arXiv:2305.04160, 2023
Cited by: 50 | Year: 2023
Improving End-to-End Contextual Speech Recognition with Fine-Grained Contextual Knowledge Selection
M Han, L Dong, Z Liang, M Cai, S Zhou, Z Ma, B Xu
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
Cited by: 31 | Year: 2022
Complex Dynamic Neurons Improved Spiking Transformer Network for Efficient Automatic Speech Recognition
Q Wang, T Zhang, M Han, Y Wang, D Zhang, B Xu
Proceedings of the AAAI Conference on Artificial Intelligence 37 (1), 102-109, 2023
Cited by: 20 | Year: 2023
CIF-based Collaborative Decoding for End-to-End Contextual Speech Recognition
M Han, L Dong, S Zhou, B Xu
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
Cited by: 20 | Year: 2021
Knowledge Transfer from Pre-trained Language Models to Cif-based Speech Recognizers via Hierarchical Distillation
M Han, F Chen, J Shi, S Xu, B Xu
INTERSPEECH 2023, 2023
Cited by: 9 | Year: 2023
Matching-based Term Semantics Pre-training for Spoken Patient Query Understanding
Z Hu, X Chen, H Wu, M Han, Z Ni, J Shi, S Xu, B Xu
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
Cited by: 2 | Year: 2023
VILAS: Exploring the Effects of Vision and Language Context in Automatic Speech Recognition
Z Ni, M Han, F Chen, L Meng, J Shi, S Xu, B Xu
ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2023
Cited by: 1 | Year: 2023
Enhancing Visual Question Answering via Deconstructing Questions and Explicating Answers
F Chen, M Han, J Shi, S Xu, B Xu
INTERSPEECH 2023, 2023
Year: 2023