Folgen
Tom Zahavy
Tom Zahavy
Sonstige NamenTom Ben Zion Zahavy
Senior Research Scientist, DeepMind
Bestätigte E-Mail-Adresse bei deepmind.com - Startseite
Titel
Zitiert von
Zitiert von
Jahr
A deep hierarchical approach to lifelong learning in minecraft
C Tessler, S Givony, T Zahavy, D Mankowitz, S Mannor
Proceedings of the AAAI conference on artificial intelligence 31 (1), 2017
3712017
Graying the black box: Understanding dqns
T Zahavy, N Ben-Zrihem, S Mannor
International conference on machine learning, 1899-1908, 2016
2622016
Learn what not to learn: Action elimination with deep reinforcement learning
T Zahavy, M Haroush, N Merlis, DJ Mankowitz, S Mannor
Advances in neural information processing systems 31, 2018
1872018
Deep learning reconstruction of ultrashort pulses
T Zahavy, A Dikopoltsev, D Moss, GI Haham, O Cohen, S Mannor, ...
Optica 5 (5), 666-673, 2018
1192018
Is a picture worth a thousand words? A deep multi-modal architecture for product classification in e-commerce
T Zahavy, A Krishnan, A Magnani, S Mannor
Proceedings of the AAAI Conference on Artificial Intelligence 32 (1), 2018
85*2018
A self-tuning actor-critic algorithm
T Zahavy, Z Xu, V Veeriah, M Hessel, J Oh, HP van Hasselt, D Silver, ...
Advances in neural information processing systems 33, 20913-20924, 2020
532020
Shallow updates for deep reinforcement learning
N Levine, T Zahavy, DJ Mankowitz, A Tamar, S Mannor
Advances in Neural Information Processing Systems 30, 2017
452017
Bootstrapped meta-learning
S Flennerhag, Y Schroecker, T Zahavy, H van Hasselt, D Silver, S Singh
International Conference on Learning Representations (ICLR) 2022, 2021
342021
Ensemble robustness and generalization of stochastic deep learning algorithms
T Zahavy, B Kang, A Sivak, J Feng, H Xu, S Mannor
arXiv preprint arXiv:1602.02389, 2016
31*2016
Online limited memory neural-linear bandits with likelihood matching
O Nabati, T Zahavy, S Mannor
International Conference on Machine Learning, 7905-7915, 2021
29*2021
Reward is enough for convex MDPs
T Zahavy, B O'Donoghue, G Desjardins, S Singh
Advances in Neural Information Processing Systems 34, 25746-25759, 2021
232021
Discovery of options via meta-learned subgoals
V Veeriah, T Zahavy, M Hessel, Z Xu, J Oh, I Kemaev, HP van Hasselt, ...
Advances in Neural Information Processing Systems 34, 29861-29873, 2021
202021
Deep learning reconstruction of ultrashort pulses from 2D spatial intensity patterns recorded by an all-in-line system in a single-shot
R Ziv, A Dikopoltsev, T Zahavy, I Rubinstein, P Sidorenko, O Cohen, ...
Optics express 28 (5), 7528-7538, 2020
182020
Action assembly: Sparse imitation learning for text based games with combinatorial action spaces
C Tessler, T Zahavy, D Cohen, DJ Mankowitz, S Mannor
The Multidisciplinary Conference on Reinforcement Learning and Decision …, 2019
172019
Balancing constraints and rewards with meta-gradient d4pg
DA Calian, DJ Mankowitz, T Zahavy, Z Xu, J Oh, N Levine, T Mann
International Conference on Learning Representations (ICLR) 2021, 2020
162020
Visualizing dynamics: from t-sne to semi-mdps
NB Zrihem, T Zahavy, S Mannor
Workshop on Human Interpretability in Machine Learning, ICML (WHI 2016), 2016
15*2016
Discovering a set of policies for the worst case reward
T Zahavy, A Barreto, DJ Mankowitz, S Hou, B O'Donoghue, I Kemaev, ...
International Conference on Learning Representations (ICLR) 2021, 2021
142021
Deep neural networks in single-shot ptychography
O Wengrowicz, O Peleg, T Zahavy, B Loevsky, O Cohen
Optics Express 28 (12), 17511-17520, 2020
142020
Discovering diverse nearly optimal policies with successor features
T Zahavy, B O'Donoghue, A Barreto, V Mnih, S Flennerhag, S Singh
arXiv preprint arXiv:2106.00669, 2021
122021
Online Apprenticeship Learning
L Shani, T Zahavy, S Mannor
Proceedings of the AAAI Conference on Artificial Intelligence, 2021
122021
Das System kann den Vorgang jetzt nicht ausführen. Versuchen Sie es später erneut.
Artikel 1–20