Is ChatGPT A Good Translator? Yes With GPT-4 As The Engine W Jiao, W Wang, J Huang, X Wang, S Shi, Z Tu arXiv preprint: 2301.08745, 2023 | 519* | 2023 |
Improving Adversarial Transferability via Neuron Attribution-Based Attacks J Zhang, W Wu, J Huang, Y Huang, W Wang, Y Su, MR Lyu CVPR'22, 14993-15002, 2022 | 104 | 2022 |
GPT-4 Is Too Smart To Be Safe: Stealthy Chat with LLMs via Cipher Y Yuan, W Jiao, W Wang, J Huang, P He, S Shi, Z Tu ICLR'24, 2024 | 42 | 2024 |
ParroT: Translating during Chat using Large Language Models tuned with Human Translation and Feedback W Jiao, J Huang, W Wang, Z He, T Liang, X Wang, S Shi, Z Tu EMNLP'23 Findings, 15009-15020, 2023 | 39* | 2023 |
Improving the Transferability of Adversarial Samples by Path-Augmented Method J Zhang, J Huang, W Wang, Y Li, W Wu, X Wang, Y Su, MR Lyu CVPR'23, 8173-8182, 2023 | 26 | 2023 |
Revisiting the Reliability of Psychological Scales on Large Language Models J Huang, W Wang, MH Lam, EJ Li, W Jiao, MR Lyu arXiv preprint: 2305.19926, 2023 | 20* | 2023 |
InCharacter: Evaluating Personality Fidelity in Role-Playing Agents through Psychological Interviews X Wang, Y Xiao, J Huang, S Yuan, R Xu, H Guo, Q Tu, Y Fei, Z Leng, ... arXiv preprint: 2310.17976, 2023 | 18* | 2023 |
Emotionally Numb or Empathetic? Evaluating How LLMs Feel Using EmotionBench J Huang, MH Lam, EJ Li, S Ren, W Wang, W Jiao, Z Tu, MR Lyu arXiv preprint: 2308.03656, 2023 | 18 | 2023 |
AEON: A Method for Automatic Evaluation of NLP Test Cases J Huang, J Zhang, W Wang, P He, Y Su, MR Lyu ISSTA'22, 202-214, 2022 | 16 | 2022 |
All Languages Matter: On the Multilingual Safety of Large Language Models W Wang, Z Tu, C Chen, Y Yuan, J Huang, W Jiao, MR Lyu arXiv preprint: 2310.00905, 2023 | 15 | 2023 |
MTTM: Metamorphic Testing for Textual Content Moderation Software W Wang, J Huang, W Wu, J Zhang, Y Huang, S Li, P He, MR Lyu ICSE'23, 2387-2399, 2023 | 14* | 2023 |
On the Humanity of Conversational AI: Evaluating the Psychological Portrayal of LLMs J Huang, W Wang, EJ Li, MH Lam, S Ren, Y Yuan, W Jiao, Z Tu, MR Lyu ICLR'24, 2024 | 13* | 2024 |
Tencent's Multilingual Machine Translation System for WMT22 Large-Scale African Languages W Jiao, Z Tu, J Li, W Wang, J Huang, S Shi WMT'22, 1049-1056, 2022 | 13 | 2022 |
Not All Countries Celebrate Thanksgiving: On the Cultural Dominance in Large Language Models W Wang, W Jiao, J Huang, R Dai, J Huang, Z Tu, MR Lyu arXiv preprint: 2310.12481, 2023 | 8 | 2023 |
An Image is Worth a Thousand Toxic Words: A Metamorphic Testing Framework for Content Moderation Software W Wang, J Huang, J Huang, C Chen, J Gu, P He, MR Lyu ASE'23, 1339-1351, 2023 | 6 | 2023 |
Leveraging Word Guessing Games to Assess the Intelligence of Large Language Models T Liang, Z He, J Huang, W Wang, W Jiao, R Wang, Y Yang, Z Tu, S Shi, ... arXiv preprint: 2310.20499, 2023 | 3 | 2023 |
How Far Are We on the Decision-Making of LLMs? Evaluating LLMs' Gaming Ability in Multi-Agent Environments J Huang, EJ Li, MH Lam, T Liang, W Wang, Y Yuan, W Jiao, X Wang, Z Tu, ... arXiv preprint: 2403.11807, 2024 | 2 | 2024 |
A & B == B & A: Triggering Logical Reasoning Failures in Large Language Models Y Wan, W Wang, Y Yang, Y Yuan, J Huang, P He, W Jiao, MR Lyu arXiv preprint: 2401.00757, 2024 | 2 | 2024 |
The Earth is Flat? Unveiling Factual Errors in Large Language Models W Wang, J Shi, Z Tu, Y Yuan, J Huang, W Jiao, MR Lyu arXiv preprint: 2401.00761, 2024 | 1 | 2024 |
A Unified Debugging Approach via LLM-Based Multi-Agent Synergy C Lee, CS Xia, J Huang, Z Zhu, L Zhang, MR Lyu arXiv preprint: 2404.17153, 2024 | | 2024 |