Zeyu Zheng
Title
Cited by
Year
Poseidon: An efficient communication architecture for distributed deep learning on GPU clusters
H Zhang, Z Zheng, S Xu, W Dai, Q Ho, X Liang, Z Hu, J Wei, P Xie, ...
2017 USENIX Annual Technical Conference (USENIX ATC 17), 181-193, 2017
Cited by: 392, Year: 2017
On learning intrinsic rewards for policy gradient methods
Z Zheng, J Oh, S Singh
Advances in Neural Information Processing Systems, 4644-4654, 2018
Cited by: 192, Year: 2018
Gemini: a family of highly capable multimodal models
G Team, R Anil, S Borgeaud, Y Wu, JB Alayrac, J Yu, R Soricut, ...
arXiv preprint arXiv:2312.11805, 2023
Cited by: 181, Year: 2023
Parallelizing sequential graph computations
W Fan, J Xu, Y Wu, W Yu, J Jiang, Z Zheng, B Zhang, Y Cao, C Tian
Proceedings of the 2017 ACM International Conference on Management of Data …, 2017
Cited by: 115, Year: 2017
What Can Learned Intrinsic Rewards Capture?
Z Zheng, J Oh, M Hessel, Z Xu, M Kroiss, H Van Hasselt, D Silver, S Singh
International Conference on Machine Learning, 11436-11446, 2020
Cited by: 83, Year: 2020
Automated multi-layer optical design via deep reinforcement learning
H Wang, Z Zheng, C Ji, LJ Guo
Machine Learning: Science and Technology 2 (2), 025013, 2021
Cited by: 54, Year: 2021
Understanding plasticity in neural networks
C Lyle, Z Zheng, E Nikishin, BA Pires, R Pascanu, W Dabney
arXiv preprint arXiv:2303.01486, 2023
Cited by: 21, Year: 2023
Adaptive Pairwise Weights for Temporal Credit Assignment
Z Zheng, R Vuorio, R Lewis, S Singh
Proceedings of the AAAI Conference on Artificial Intelligence 36 (8), 9225-9232, 2022
Cited by: 6*, Year: 2022
Learning State Representations from Random Deep Action-conditional Predictions
Z Zheng, V Veeriah, R Vuorio, RL Lewis, S Singh
Advances in Neural Information Processing Systems 34, 23679-23691, 2021
Cited by: 5, Year: 2021
Towards Multi-Agent Reinforcement Learning driven Over-The-Counter Market Simulations
N Vadori, L Ardon, S Ganesh, T Spooner, S Amrouni, J Vann, M Xu, ...
arXiv preprint arXiv:2210.07184, 2022
Cited by: 3, Year: 2022
GrASP: Gradient-Based Affordance Selection for Planning
V Veeriah, Z Zheng, R Lewis, S Singh
arXiv preprint arXiv:2202.04772, 2022
Cited by: 3, Year: 2022
Generalized Preference Optimization: A Unified Approach to Offline Alignment
Y Tang, ZD Guo, Z Zheng, D Calandriello, R Munos, M Rowland, ...
arXiv preprint arXiv:2402.05749, 2024
2024
Towards Perpetually Trainable Neural Networks
C Lyle, Z Zheng, K Khetarpal, R Pascanu, J Martens, H van Hasselt, ...
2023
Advances in Deep Reinforcement Learning: Intrinsic Rewards, Temporal Credit Assignment, State Representations, and Value-equivalent Models
Z Zheng
2022
Reinforcement learning using meta-learned intrinsic rewards
Z Zheng, J Oh, SS Baveja
US Patent App. 17/033,410, 2021
2021
Supplementary Material: On Learning Intrinsic Rewards for Policy Gradient Methods
Z Zheng, J Oh, S Singh