Minglun Han

Zitiert von

	Alle	Seit 2019
Zitate	264	264
h-index	6	6
i10-index	5	5

180

135

20222023202420 166 78

Öffentlicher Zugriff

Alle anzeigen

5 Artikel

0 Artikel

verfügbar

nicht verfügbar

Basierend auf Fördermandaten

Koautoren

Jing ShiInstitute of Automation Chinese Academy of SciencesBestätigte E-Mail-Adresse bei ia.ac.cn
Feilong ChenHuawei Inc.; Previously CASIABestätigte E-Mail-Adresse bei huawei.com
Duzhen ZhangInstitute of Automation, Chinese Academy of SciencesBestätigte E-Mail-Adresse bei ia.ac.cn
Xiuyi ChenBaidu << CASIABestätigte E-Mail-Adresse bei ia.ac.cn
Linhao DongBytedance AI-LabBestätigte E-Mail-Adresse bei bytedance.com
zhenlin liang东南大学Bestätigte E-Mail-Adresse bei seu.edu.cn
Meng CaiMicrosoft Research AsiaBestätigte E-Mail-Adresse bei microsoft.com
Zejun MaBytedanceBestätigte E-Mail-Adresse bei bytedance.com
Tielin ZhangChinese Academy of SciencesBestätigte E-Mail-Adresse bei ia.ac.cn
Ziyi NiInstitute of Automation，Chinese Academy of SciencesBestätigte E-Mail-Adresse bei ia.ac.cn
Linghui MengInstitute of Automation, Chinese Academy of Sciences, ChinaBestätigte E-Mail-Adresse bei ia.ac.cn

Folgen

Minglun Han

ByteDance Inc.; Previously CASIA.

Bestätigte E-Mail-Adresse bei bytedance.com

Speech Recognition Large Language Model Multimodal LLM


Titel Nach Zitationen sortieren Nach Jahr sortieren Nach Titel sortieren	Zitiert von Zitiert von	Jahr
VLP: A Survey on Vision-language Pre-training F Chen, D Zhang, M Han, X Chen, J Shi, S Xu, B Xu Machine Intelligence Research 20 (1), 38-56, 2023	131	2023
X-LLM: Bootstrapping Advanced Large Language Models by Treating Multi-Modalities as Foreign Languages F Chen, M Han, H Zhao, Q Zhang, J Shi, S Xu, B Xu arXiv preprint arXiv:2305.04160, 2023	50	2023
Improving End-to-End Contextual Speech Recognition with Fine-Grained Contextual Knowledge Selection M Han, L Dong, Z Liang, M Cai, S Zhou, Z Ma, B Xu ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022	31	2022
Complex Dynamic Neurons Improved Spiking Transformer Network for Efficient Automatic Speech Recognition Q Wang, T Zhang, M Han, Y Wang, D Zhang, B Xu Proceedings of the AAAI Conference on Artificial Intelligence 37 (1), 102-109, 2023	20	2023
CIF-based Collaborative Decoding for End-to-End Contextual Speech Recognition M Han, L Dong, S Zhou, B Xu ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021	20	2021
Knowledge Transfer from Pre-trained Language Models to Cif-based Speech Recognizers via Hierarchical Distillation M Han, F Chen, J Shi, S Xu, B Xu INTERSPEECH 2023, 2023	9	2023
Matching-based Term Semantics Pre-training for Spoken Patient Query Understanding Z Hu, X Chen, H Wu, M Han, Z Ni, J Shi, S Xu, B Xu ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023	2	2023
VILAS: Exploring the Effects of Vision and Language Context in Automatic Speech Recognition Z Ni, M Han, F Chen, L Meng, J Shi, S Xu, B Xu ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2023	1	2023
Enhancing Visual Question Answering via Deconstructing Questions and Explicating Answers F Chen, M Han, J Shi, S Xu, B Xu INTERSPEECH 2023, 2023		2023

Das System kann den Vorgang jetzt nicht ausführen. Versuchen Sie es später erneut.

Artikel 1–9

Zitate pro Jahr

Doppelte Zitate

Zusammengeführte Zitate

Koautor hinzufügenKoautoren

Folgen

Zitiert von

Koautoren