The surprising effectiveness of ppo in cooperative multi-agent games C Yu, A Velu, E Vinitsky, J Gao, Y Wang, A Bayen, Y Wu Advances in Neural Information Processing Systems 35, 24611-24624, 2022 | 1405 | 2022 |
Is dpo superior to ppo for llm alignment? a comprehensive study S Xu, W Fu, J Gao, W Ye, W Liu, Z Mei, G Wang, C Yu, Y Wu arXiv preprint arXiv:2404.10719, 2024 | 48 | 2024 |
Asynchronous multi-agent reinforcement learning for efficient real-time multi-robot cooperative exploration C Yu, X Yang, J Gao, J Chen, Y Li, J Liu, Y Xiang, R Huang, H Yang, ... arXiv preprint arXiv:2301.03398, 2023 | 35 | 2023 |
Learning zero-shot cooperation with humans, assuming humans are biased C Yu, J Gao, W Liu, B Xu, H Tang, J Yang, Y Wang, Y Wu arXiv preprint arXiv:2302.01605, 2023 | 34 | 2023 |
Llm-powered hierarchical language agent for real-time human-ai coordination J Liu, C Yu, J Gao, Y Xie, Q Liao, Y Wu, Y Wang arXiv preprint arXiv:2312.15224, 2023 | 25 | 2023 |
Learning efficient multi-agent cooperative visual exploration C Yu, X Yang, J Gao, H Yang, Y Wang, Y Wu European Conference on Computer Vision, 497-515, 2022 | 18 | 2022 |
Learning efficient multi-agent cooperative visual exploration C Yu, X Yang, J Gao, H Yang, Y Wang, Y Wu arXiv preprint arXiv:2110.05734, 2021 | 8 | 2021 |
Language-guided generation of physically realistic robot motion and control S Xu, H Wang, J Gao, Y Ouyang, C Yu, Y Wu arXiv preprint arXiv:2306.10518, 2023 | 3 | 2023 |
Save: Spatial-attention visual exploration X Yang, C Yu, J Gao, Y Wang, H Yang 2022 IEEE International Conference on Image Processing (ICIP), 1356-1360, 2022 | 3 | 2022 |
Srl: Scaling distributed reinforcement learning to over ten thousand cores Z Mei, W Fu, J Gao, G Wang, H Zhang, Y Wu arXiv preprint arXiv:2306.16688, 2023 | 2 | 2023 |
Few-shot In-Context Preference Learning Using Large Language Models C Yu, H Lu, J Gao, Q Tan, X Yang, Y Wang, Y Wu, E Vinitsky arXiv preprint arXiv:2410.17233, 2024 | 1 | 2024 |
On Designing Effective RL Reward at Training Time for LLM Reasoning J Gao, S Xu, W Ye, W Liu, C He, W Fu, Z Mei, G Wang, Y Wu arXiv preprint arXiv:2410.15115, 2024 | | 2024 |
LAGOON: Language-Guided Motion Control S Xu, H Wang, Y Ouyang, J Gao, Z Mei, C Yu, Y Wu 2024 IEEE International Conference on Robotics and Automation (ICRA), 9743-9750, 2024 | | 2024 |
A Benchmark of Planning-based Exploration Methods in Photo-Realistic 3D Simulator X Du, X Yang, C Yu, J Gao, H Yang, Y Wang, Q Liao 2022 IEEE International Conference on Robotics and Biomimetics (ROBIO), 1562 …, 2022 | | 2022 |
Robot Generating Data for Learning Generalizable Visual Robotic Manipulation Y Li, Y Yuan, J Cui, H Huan, W Fu, J Gao, Z Xu, Y Wu | | |
Supplementary Materials of Learning Efficient Multi-Agent Cooperative Visual Exploration C Yu, X Yang, J Gao, H Yang, Y Wang, Y Wu23 | | |