Jiaxuan Gao
Institute for Interdisciplinary Information Sciences, Tsinghua University
Verified email at mails.tsinghua.edu.cn
Title
Cited by
Year
The surprising effectiveness of PPO in cooperative multi-agent games
C Yu, A Velu, E Vinitsky, J Gao, Y Wang, A Bayen, Y Wu
Advances in Neural Information Processing Systems 35, 24611-24624, 2022
Cited by 1405, 2022
Is DPO superior to PPO for LLM alignment? A comprehensive study
S Xu, W Fu, J Gao, W Ye, W Liu, Z Mei, G Wang, C Yu, Y Wu
arXiv preprint arXiv:2404.10719, 2024
Cited by 48, 2024
Asynchronous multi-agent reinforcement learning for efficient real-time multi-robot cooperative exploration
C Yu, X Yang, J Gao, J Chen, Y Li, J Liu, Y Xiang, R Huang, H Yang, ...
arXiv preprint arXiv:2301.03398, 2023
Cited by 35, 2023
Learning zero-shot cooperation with humans, assuming humans are biased
C Yu, J Gao, W Liu, B Xu, H Tang, J Yang, Y Wang, Y Wu
arXiv preprint arXiv:2302.01605, 2023
Cited by 34, 2023
LLM-powered hierarchical language agent for real-time human-AI coordination
J Liu, C Yu, J Gao, Y Xie, Q Liao, Y Wu, Y Wang
arXiv preprint arXiv:2312.15224, 2023
Cited by 25, 2023
Learning efficient multi-agent cooperative visual exploration
C Yu, X Yang, J Gao, H Yang, Y Wang, Y Wu
European Conference on Computer Vision, 497-515, 2022
Cited by 18, 2022
Learning efficient multi-agent cooperative visual exploration
C Yu, X Yang, J Gao, H Yang, Y Wang, Y Wu
arXiv preprint arXiv:2110.05734, 2021
Cited by 8, 2021
Language-guided generation of physically realistic robot motion and control
S Xu, H Wang, J Gao, Y Ouyang, C Yu, Y Wu
arXiv preprint arXiv:2306.10518, 2023
Cited by 3, 2023
SAVE: Spatial-attention visual exploration
X Yang, C Yu, J Gao, Y Wang, H Yang
2022 IEEE International Conference on Image Processing (ICIP), 1356-1360, 2022
Cited by 3, 2022
SRL: Scaling distributed reinforcement learning to over ten thousand cores
Z Mei, W Fu, J Gao, G Wang, H Zhang, Y Wu
arXiv preprint arXiv:2306.16688, 2023
Cited by 2, 2023
Few-shot In-Context Preference Learning Using Large Language Models
C Yu, H Lu, J Gao, Q Tan, X Yang, Y Wang, Y Wu, E Vinitsky
arXiv preprint arXiv:2410.17233, 2024
Cited by 1, 2024
On Designing Effective RL Reward at Training Time for LLM Reasoning
J Gao, S Xu, W Ye, W Liu, C He, W Fu, Z Mei, G Wang, Y Wu
arXiv preprint arXiv:2410.15115, 2024
2024
LAGOON: Language-Guided Motion Control
S Xu, H Wang, Y Ouyang, J Gao, Z Mei, C Yu, Y Wu
2024 IEEE International Conference on Robotics and Automation (ICRA), 9743-9750, 2024
2024
A Benchmark of Planning-based Exploration Methods in Photo-Realistic 3D Simulator
X Du, X Yang, C Yu, J Gao, H Yang, Y Wang, Q Liao
2022 IEEE International Conference on Robotics and Biomimetics (ROBIO), 1562 …, 2022
2022
Robot Generating Data for Learning Generalizable Visual Robotic Manipulation
Y Li, Y Yuan, J Cui, H Huan, W Fu, J Gao, Z Xu, Y Wu
Supplementary Materials of Learning Efficient Multi-Agent Cooperative Visual Exploration
C Yu, X Yang, J Gao, H Yang, Y Wang, Y Wu23