Tom Zahavy

Zitiert von

	Alle	Seit 2019
Zitate	2008	1763
h-index	20	19
i10-index	31	31

460

230

115

345

20162017201820192020202120222023202434 58 143 171 257 309 401 455 168

Öffentlicher Zugriff

Alle anzeigen

3 Artikel

0 Artikel

verfügbar

nicht verfügbar

Basierend auf Fördermandaten

Koautoren

Shie MannorProfessor of Electrical Engineering @ Technion & Researcher @ Nvidia ResearchBestätigte E-Mail-Adresse bei technion.ac.il
Daniel J. MankowitzGoogle DeepmindBestätigte E-Mail-Adresse bei google.com
Satinder SinghGoogle DeepMind / U. of MichiganBestätigte E-Mail-Adresse bei umich.edu
Sebastian FlennerhagResearch Scientist at DeepMindBestätigte E-Mail-Adresse bei google.com
Chen TesslerResearch Scientist, NVIDIA ResearchBestätigte E-Mail-Adresse bei nvidia.com
Hado van HasseltResearch Scientist, DeepMind; Honorary Professor, UCLBestätigte E-Mail-Adresse bei google.com
Mordechai SegevSolid State Institute, Physics Department and Electrical Engineering Department Technion - IsraelBestätigte E-Mail-Adresse bei technion.ac.il
Alex DikopoltsevQuantum Optoelectronics Group, Department of Physics, ETHBestätigte E-Mail-Adresse bei phys.ethz.ch
Brendan O'DonoghueStanford University, Google DeepMindBestätigte E-Mail-Adresse bei alumni.stanford.edu
Zhongwen XuTencentBestätigte E-Mail-Adresse bei tencent.com
Oren CohenProfessor of Physics, Technion, IsraelBestätigte E-Mail-Adresse bei technion.ac.il
Vivek VeeriahGoogle DeepMindBestätigte E-Mail-Adresse bei google.com
David SilverDeepMind, UCLBestätigte E-Mail-Adresse bei google.com
Matteo HesselResearch Engineer, Google DeepMindBestätigte E-Mail-Adresse bei google.com
Junhyuk OhResearch Scientist, DeepMindBestätigte E-Mail-Adresse bei google.com
Nadav MerlisPostdoctoral Fellow @ CREST, ENSAE ParisBestätigte E-Mail-Adresse bei ensae.fr
Alessandro MagnaniWalmartlabsBestätigte E-Mail-Adresse bei walmartlabs.com
Robert Tjarko LangeSakana AI, TU BerlinBestätigte E-Mail-Adresse bei tu-berlin.de
Tom SchaulSenior Staff Scientist, DeepMindBestätigte E-Mail-Adresse bei nyu.edu
Valentin DalibardUniversity of CambridgeBestätigte E-Mail-Adresse bei cl.cam.ac.uk

Folgen

Tom Zahavy

Sonstige NamenTom Ben Zion Zahavy

Staff Research Scientist, Google DeepMind

Bestätigte E-Mail-Adresse bei deepmind.com - Startseite

Reinforcement Learning


Titel Nach Zitationen sortieren Nach Jahr sortieren Nach Titel sortieren	Zitiert von Zitiert von	Jahr
A deep hierarchical approach to lifelong learning in minecraft C Tessler, S Givony, T Zahavy, D Mankowitz, S Mannor Proceedings of the AAAI conference on artificial intelligence 31 (1), 2017	433	2017
Graying the black box: Understanding dqns T Zahavy, N Ben-Zrihem, S Mannor International conference on machine learning (ICML), 1899-1908, 2016	320	2016
Learn what not to learn: Action elimination with deep reinforcement learning T Zahavy, M Haroush, N Merlis, DJ Mankowitz, S Mannor Advances in neural information processing systems 31, 2018	234	2018
Deep learning reconstruction of ultrashort pulses T Zahavy, A Dikopoltsev, D Moss, GI Haham, O Cohen, S Mannor, ... Optica 5 (5), 666-673, 2018	163	2018
Is a picture worth a thousand words? A deep multi-modal architecture for product classification in e-commerce T Zahavy, A Krishnan, A Magnani, S Mannor Proceedings of the AAAI conference on artificial intelligence 32 (1), 2018	105*	2018
A self-tuning actor-critic algorithm T Zahavy, Z Xu, V Veeriah, M Hessel, J Oh, HP van Hasselt, D Silver, ... Advances in neural information processing systems 33, 20913-20924, 2020	79	2020
Bootstrapped meta-learning S Flennerhag, Y Schroecker, T Zahavy, H van Hasselt, D Silver, S Singh International Conference on Learning Representations (ICLR) 2022, 2021	66	2021
Reward is enough for convex mdps T Zahavy, B O'Donoghue, G Desjardins, S Singh Advances in Neural Information Processing Systems 34, 25746-25759, 2021	52	2021
Shallow updates for deep reinforcement learning N Levine, T Zahavy, DJ Mankowitz, A Tamar, S Mannor Advances in Neural Information Processing Systems 30, 2017	52	2017
Online limited memory neural-linear bandits with likelihood matching O Nabati, T Zahavy, S Mannor International Conference on Machine Learning, 7905-7915, 2021	37*	2021
Discovery of options via meta-learned subgoals V Veeriah, T Zahavy, M Hessel, Z Xu, J Oh, I Kemaev, HP van Hasselt, ... Advances in Neural Information Processing Systems 34, 29861-29873, 2021	35	2021
Ensemble robustness and generalization of stochastic deep learning algorithms T Zahavy, B Kang, A Sivak, J Feng, H Xu, S Mannor arXiv preprint arXiv:1602.02389, 2016	34*	2016
Discovering Evolution Strategies via Meta-Black-Box Optimization R Tjarko Lange, T Schaul, Y Chen, T Zahavy, V Dallibard, C Lu, S Singh, ... International Conference on Learning Representations (ICLR) 2023, 2022	30*	2022
Discovering Policies with DOMiNO: Diversity Optimization Maintaining Near Optimality T Zahavy, Y Schroecker, F Behbahani, K Baumli, S Flennerhag, S Hou, ... International Conference on Learning Representations (ICLR) 2023, 2022	28	2022
Deep learning reconstruction of ultrashort pulses from 2D spatial intensity patterns recorded by an all-in-line system in a single-shot R Ziv, A Dikopoltsev, T Zahavy, I Rubinstein, P Sidorenko, O Cohen, ... Optics express 28 (5), 7528-7538, 2020	25	2020
Emphatic algorithms for deep reinforcement learning R Jiang, T Zahavy, Z Xu, A White, M Hessel, C Blundell, H Van Hasselt International Conference on Machine Learning (ICML), 5023-5033, 2021	22	2021
Online Apprenticeship Learning L Shani, T Zahavy, S Mannor Proceedings of the AAAI Conference on Artificial Intelligence, 2021	22	2021
Discovering a set of policies for the worst case reward T Zahavy, A Barreto, DJ Mankowitz, S Hou, B O'Donoghue, I Kemaev, ... International Conference on Learning Representations (ICLR) 2021, 2021	22	2021
Balancing constraints and rewards with meta-gradient d4pg DA Calian, DJ Mankowitz, T Zahavy, Z Xu, J Oh, N Levine, T Mann International Conference on Learning Representations (ICLR) 2021, 2020	21	2020
Visualizing dynamics: from t-sne to semi-mdps NB Zrihem, T Zahavy, S Mannor Workshop on Human Interpretability in Machine Learning, ICML (WHI 2016), 2016	21*	2016

Das System kann den Vorgang jetzt nicht ausführen. Versuchen Sie es später erneut.

Artikel 1–20

Zitate pro Jahr

Doppelte Zitate

Zusammengeführte Zitate

Koautor hinzufügenKoautoren

Folgen

Zitiert von

Koautoren