Shangtong Zhang

Zitiert von

	Alle	Seit 2019
Zitate	1152	1115
h-index	16	16
i10-index	23	21

300

150

225

201720182019202020212022202320245 26 63 158 231 275 298 87

Öffentlicher Zugriff

Alle anzeigen

10 Artikel

0 Artikel

verfügbar

nicht verfügbar

Basierend auf Fördermandaten

Koautoren

Shimon WhitesonProfessor of Computer Science, University of Oxford / Senior Staff Research Scientist, WaymoBestätigte E-Mail-Adresse bei cs.ox.ac.uk
Hengshuai YaoSony AIBestätigte E-Mail-Adresse bei ualberta.ca
Richard S. SuttonKeen, Amii, and University of AlbertaBestätigte E-Mail-Adresse bei richsutton.com
Bo LiuAAAI SM, IEEE SMBestätigte E-Mail-Adresse bei cs.umass.edu
Linglong KongProfessor, Canada Research Chair in Statistical Learning, UAlberta, and Canada CIFAR AI Chair, AmiiBestätigte E-Mail-Adresse bei ualberta.ca
Wendelin BöhmerSequential Decision Making Group, Delft University of TechnologyBestätigte E-Mail-Adresse bei tudelft.nl
Ray JiangResearch Scientist, DeepMindBestätigte E-Mail-Adresse bei google.com
Remi Tachet des CombesBestätigte E-Mail-Adresse bei alpacaml.com
Romain LarocheMicrosoft ResearchBestätigte E-Mail-Adresse bei polytechnique.org
Marcus EdelComputer Science, Free University of BerlinBestätigte E-Mail-Adresse bei fu-berlin.de
Ryan R. CurtinFree agentBestätigte E-Mail-Adresse bei ratml.org
Nando de FreitasCIFAR & DeepMindBestätigte E-Mail-Adresse bei google.com
Tom Le PaineStaff Research Scientist at Google DeepMindBestätigte E-Mail-Adresse bei google.com
Julian SchrittwieserDeepMindBestätigte E-Mail-Adresse bei furidamu.org
Roman RingDeepMindBestätigte E-Mail-Adresse bei deepmind.com
Petko GeorgievGoogle DeepMind, University of CambridgeBestätigte E-Mail-Adresse bei cam.ac.uk
Michael MathieuDeepMindBestätigte E-Mail-Adresse bei google.com
Aäron van den OordGoogle DeepMindBestätigte E-Mail-Adresse bei google.com
Sergio Gómez ColmenarejoResearch Engineer, DeepMindBestätigte E-Mail-Adresse bei google.com
Caglar GulcehreProf at EPFL, Consultant@Google DeepMind, ex-Staff Research Scientist@Google DeepMind, PhD@MILABestätigte E-Mail-Adresse bei google.com

Folgen

Shangtong Zhang

University of Virginia

Bestätigte E-Mail-Adresse bei virginia.edu - Startseite

reinforcement learning


Titel Nach Zitationen sortieren Nach Jahr sortieren Nach Titel sortieren	Zitiert von Zitiert von	Jahr
A Deeper Look at Experience Replay S Zhang, RS Sutton Deep Reinforcement Learning Symposium, NIPS 2017, 2017	334	2017
GradientDICE: Rethinking Generalized Offline Estimation of Stationary Values S Zhang, B Liu, S Whiteson ICML 2020, 2020	94	2020
Distributional Reinforcement Learning for Efficient Exploration B Mavrin, S Zhang, H Yao, L Kong, K Wu, Y Yu ICML 2019, 2019	87	2019
mlpack 3: a fast, flexible machine learning library R Curtin, M Edel, M Lozhnikov, Y Mentekidis, S Ghaisas, S Zhang Journal of Open Source Software 3 (26), 726, 2018	85	2018
DAC: The Double Actor-Critic Architecture for Learning Options S Zhang, S Whiteson NeurIPS 2019, 2019	77	2019
Provably Convergent Two-Timescale Off-Policy Actor-Critic with Function Approximation S Zhang, B Liu, H Yao, S Whiteson ICML 2020, 2019	53	2019
Generalized Off-Policy Actor-Critic S Zhang, W Boehmer, S Whiteson NeurIPS 2019, 2019	51	2019
Breaking the Deadly Triad with a Target Network S Zhang, H Yao, S Whiteson ICML 2021, 2021	40	2021
Mean-variance policy iteration for risk-averse reinforcement learning S Zhang, B Liu, S Whiteson Proceedings of the AAAI Conference on Artificial Intelligence 35 (12), 10905 …, 2021	36	2021
QUOTA: The Quantile Option Architecture for Reinforcement Learning S Zhang, B Mavrin, L Kong, B Liu, H Yao AAAI 2019, 2018	32	2018
Average-Reward Off-Policy Policy Evaluation with Function Approximation S Zhang, Y Wan, RS Sutton, S Whiteson ICML 2021, 2021	31	2021
ACE: An Actor Ensemble Algorithm for Continuous Control with Tree Search S Zhang, H Chen, H Yao AAAI 2019, 2018	31	2018
Modularized Implementation of Deep RL Algorithms in PyTorch S Zhang	29*	2018
A deep neural network for modeling music P Zhang, X Zheng, W Zhang, S Li, S Qian, W He, S Zhang, Z Wang Proceedings of the 5th ACM on International Conference on Multimedia …, 2015	27	2015
Deep Residual Reinforcement Learning S Zhang, W Boehmer, S Whiteson AAMAS 2020, 2019	22	2019
AlphaStar Unplugged: Large-Scale Offline Reinforcement Learning M Mathieu, S Ozair, S Srinivasan, C Gulcehre, S Zhang, R Jiang, ... arXiv preprint arXiv:2308.03526, 2023	19*	2023
Learning expected emphatic traces for deep RL R Jiang, S Zhang, V Chelu, A White, H van Hasselt Proceedings of the AAAI Conference on Artificial Intelligence 36 (6), 7015-7023, 2022	13	2022
Learning Retrospective Knowledge with Reverse Reinforcement Learning S Zhang, V Veeriah, S Whiteson NeurIPS 2020, 2020	13	2020
Mega-Reward: Achieving Human-Level Play without Extrinsic Rewards Y Song, J Wang, T Lukasiewicz, Z Xu, S Zhang, M Xu AAAI 2020, 2019	13	2019
Comparing Deep Reinforcement Learning and Evolutionary Methods in Continuous Control S Zhang, OR Zaiane Deep Reinforcement Learning Symposium, NIPS 2017, 2017	12	2017

Das System kann den Vorgang jetzt nicht ausführen. Versuchen Sie es später erneut.

Artikel 1–20

Zitate pro Jahr

Doppelte Zitate

Zusammengeführte Zitate

Koautor hinzufügenKoautoren

Folgen

Zitiert von

Koautoren