Shihan Dou

Zitiert von

	Alle	Seit 2019
Zitate	563	563
h-index	9	9
i10-index	9	9

300

150

225

202120222023202410 40 224 287

Öffentlicher Zugriff

Alle anzeigen

5 Artikel

0 Artikel

verfügbar

nicht verfügbar

Basierend auf Fördermandaten

Koautoren

Huang Xuanjing (黄萱菁)Professor of Computer Science, Fudan UniversityBestätigte E-Mail-Adresse bei fudan.edu.cn
Qi Zhang (张奇)Professor of Computer Science, Fudan UniversityBestätigte E-Mail-Adresse bei fudan.edu.cn
Yueming WuNanyang Technological UniversityBestätigte E-Mail-Adresse bei ntu.edu.sg
Tao Gui （桂韬）复旦大学Bestätigte E-Mail-Adresse bei fudan.edu.cn
Hai JinHuazhong University of Science and TechnologyBestätigte E-Mail-Adresse bei hust.edu.cn
Rui ZhengFudan UniversityBestätigte E-Mail-Adresse bei fudan.edu.cn
Xipeng Qiu（邱锡鹏）Professor of Computer Science, Fudan UniversityBestätigte E-Mail-Adresse bei fudan.edu.cn

Folgen

Shihan Dou

Fudan University

Bestätigte E-Mail-Adresse bei m.fudan.edu.cn

Alignment RLHF Natural Language Processing Security


Titel Nach Zitationen sortieren Nach Jahr sortieren Nach Titel sortieren	Zitiert von Zitiert von	Jahr
The rise and potential of large language model based agents: A survey Z Xi, W Chen, X Guo, W He, Y Ding, B Hong, M Zhang, J Wang, S Jin, ... arXiv preprint arXiv:2309.07864, 2023	241	2023
Vulcnn: An image-inspired scalable vulnerability detection system Y Wu, D Zou, S Dou, W Yang, D Xu, H Jin Proceedings of the 44th International Conference on Software Engineering …, 2022	65	2022
SCDetector: Software functional clone detection based on semantic tokens analysis Y Wu, D Zou, S Dou, S Yang, W Yang, F Cheng, H Liang, H Jin Proceedings of the 35th IEEE/ACM international conference on automated …, 2020	53	2020
Secrets of RLHF in large language models part I: PPO R Zheng, S Dou, S Gao, Y Hua, W Shen, B Wang, Y Liu, S Jin, Q Liu, ... arXiv preprint arXiv:2307.04964, 2023	48*	2023
IntDroid: Android malware detection based on API intimacy analysis D Zou, Y Wu, S Yang, A Chauhan, W Yang, J Zhong, S Dou, H Jin ACM Transactions on Software Engineering and Methodology (TOSEM) 30 (3), 1-32, 2021	33	2021
MINER: Improving out-of-vocabulary named entity recognition from an information theoretic perspective X Wang, S Dou, L Xiong, Y Zou, Q Zhang, T Gui, L Qiao, Z Cheng, ... arXiv preprint arXiv:2204.04391, 2022	24	2022
Obfuscation-resilient android malware analysis based on contrastive learning Y Wu, S Dou, D Zou, W Yang, W Qiang, H Jin arXiv preprint arXiv:2107.03799, 2021	17	2021
Secrets of rlhf in large language models part ii: Reward modeling B Wang, R Zheng, L Chen, Y Liu, S Dou, C Huang, W Shen, S Jin, E Zhou, ... arXiv preprint arXiv:2401.06080, 2024	13*	2024
LoRAMoE: Revolutionizing Mixture of Experts for Maintaining World Knowledge in Language Model Alignment S Dou, E Zhou, Y Liu, S Gao, J Zhao, W Shen, Y Zhou, Z Xi, X Wang, ... arXiv preprint arXiv:2312.09979, 2023	12*	2023
Contrastive learning for robust android malware familial classification Y Wu, S Dou, D Zou, W Yang, W Qiang, H Jin IEEE Transactions on Dependable and Secure Computing, 2022	9	2022
Loose lips sink ships: Mitigating length bias in reinforcement learning from human feedback W Shen, R Zheng, W Zhan, J Zhao, S Dou, T Gui, Q Zhang, X Huang arXiv preprint arXiv:2310.05199, 2023	8	2023
Kernel-whitening: Overcome dataset bias with isotropic sentence embedding S Gao, S Dou, Q Zhang, X Huang arXiv preprint arXiv:2210.07547, 2022	6	2022
Towards understanding the capability of large language models on code clone detection: a survey S Dou, J Shan, H Jia, W Deng, Z Xi, W He, Y Wu, T Gui, Y Liu, X Huang arXiv preprint arXiv:2308.01191, 2023	5	2023
Decorrelate irrelevant, purify relevant: Overcome textual spurious correlations from a feature perspective S Dou, R Zheng, T Wu, S Gao, J Shan, Q Zhang, Y Wu, X Huang arXiv preprint arXiv:2202.08048, 2022	5	2022
Tailoring Personality Traits in Large Language Models via Unsupervisedly-Built Personalized Lexicons T Li, S Dou, C Lv, W Liu, J Xu, M Wu, Z Ling, Z Xiaoqing, X Huang arXiv preprint arXiv:2310.16582, 2024	4	2024
Tooleyes: Fine-grained evaluation for tool learning capabilities of large language models in real-world scenarios J Ye, G Li, S Gao, C Huang, Y Wu, S Li, X Fan, S Dou, Q Zhang, T Gui, ... arXiv preprint arXiv:2401.00741, 2024	3	2024
Open the Pandora's Box of LLMs: Jailbreaking LLMs through Representation Engineering T Li, S Dou, W Liu, M Wu, C Lv, X Zheng, X Huang arXiv preprint arXiv:2401.06824, 2024	2	2024
Delve into ppo: Implementation matters for stable rlhf R Zheng, S Dou, S Gao, Y Hua, W Shen, B Wang, Y Liu, S Jin, Y Zhou, ... NeurIPS 2023 Workshop on Instruction Tuning and Instruction Following, 2023	2	2023
Improving generalization of alignment with human preferences through group invariant learning R Zheng, W Shen, Y Hua, W Lai, S Dou, Y Zhou, Z Xi, X Wang, H Huang, ... arXiv preprint arXiv:2310.11971, 2023	2	2023
On the Universal Adversarial Perturbations for Efficient Data-free Adversarial Detection S Gao, S Dou, Q Zhang12, X Huang12, J Ma, Y Shan Proceedings of the 61st Annual Meeting of the Association for Computational …, 2023	2	2023

Das System kann den Vorgang jetzt nicht ausführen. Versuchen Sie es später erneut.

Artikel 1–20

Zitate pro Jahr

Doppelte Zitate

Zusammengeführte Zitate

Koautor hinzufügenKoautoren

Folgen

Zitiert von

Koautoren