Robert Dadashi

Zitiert von

	Alle	Seit 2019
Zitate	1360	1358
h-index	14	14
i10-index	14	14

600

300

150

450

20192020202120222023202425 56 157 228 301 590

Öffentlicher Zugriff

Alle anzeigen

1 Artikel

0 Artikel

verfügbar

nicht verfügbar

Basierend auf Fördermandaten

Koautoren

Léonard HussenotGoogle DeepMindBestätigte E-Mail-Adresse bei google.com
Matthieu GeistCohere (ex Google, on leave of Professor, Université de Lorraine)Bestätigte E-Mail-Adresse bei univ-lorraine.fr
Olivier PietquinCohere | ex Google DeepMind (On leave - Professor at University of Lille)Bestätigte E-Mail-Adresse bei univ-lille.fr
Marc G. BellemareGoogle BrainBestätigte E-Mail-Adresse bei google.com
Dale SchuurmansUniversity of Alberta, Google DeepMindBestätigte E-Mail-Adresse bei cs.ualberta.ca
Nicolas Le RouxMicrosoft Research, McGill, UdeMBestätigte E-Mail-Adresse bei le-roux.name
Saurabh KumarStanfordBestätigte E-Mail-Adresse bei stanford.edu

Folgen

Robert Dadashi

Google DeepMind

Bestätigte E-Mail-Adresse bei google.com - Startseite

Reinforcement Learning


Titel Nach Zitationen sortieren Nach Jahr sortieren Nach Titel sortieren	Zitiert von Zitiert von	Jahr
Gemini: A family of highly capable multimodal models G Team, R Anil, S Borgeaud, Y Wu, JB Alayrac, J Yu, R Soricut, ... arXiv preprint arXiv:2312.11805, 2023	463	2023
Acme: A research framework for distributed reinforcement learning MW Hoffman, B Shahriari, J Aslanides, G Barth-Maron, N Momchev, ... arXiv preprint arXiv:2006.00979, 2020	231	2020
Primal Wasserstein Imitation Learning R Dadashi, L Hussenot, M Geist, O Pietquin International Conference on Learning Representations (ICLR), 2021	120	2021
A Geometric Perspective on Optimal Representations for Reinforcement Learning M Bellemare, W Dabney, R Dadashi, A Ali Taiga, PS Castro, N Le Roux, ... Neural Information Processing Systems (NeurIPS), 2019	95	2019
Statistics and Samples in Distributional Reinforcement Learning M Rowland, R Dadashi, S Kumar, R Munos, MG Bellemare, W Dabney International Conference on Machine Learning (ICML), 2019	90	2019
What Matters for Adversarial Imitation Learning? M Orsini, A Raichuk, L Hussenot, D Vincent, R Dadashi, S Girgin, M Geist, ... Neural Information Processing Systems (NeurIPS), 2021	64	2021
The Value-Improvement Path: Towards Better Representations for Reinforcement Learning W Dabney, A Barreto, M Rowland, R Dadashi, J Quan, MG Bellemare, ... AAAI Conference on Artificial Intelligence, 2021	64	2021
Offline Reinforcement Learning as Anti-Exploration S Rezaeifar, R Dadashi, N Vieillard, L Hussenot, O Bachem, O Pietquin, ... AAAI Conference on Artificial Intelligence, 2022	41	2022
The Value Function Polytope in Reinforcement Learning R Dadashi, AA Taïga, NL Roux, D Schuurmans, MG Bellemare International Conference on Machine Learning (ICML), 2019	37	2019
Offline Reinforcement Learning with Pseudometric Learning R Dadashi, S Rezaeifar, N Vieillard, L Hussenot, O Pietquin, M Geist International Conference on Machine Learning (ICML), 2021	35	2021
Gemma: Open Models Based on Gemini Research and Technology G Team, T Mesnard, C Hardin, R Dadashi, S Bhupatiraju, S Pathak, ... arXiv preprint arXiv:2403.08295, 2024	30	2024
Factually Consistent Summarization via Reinforcement Learning with Textual Entailment Feedback P Roit, J Ferret, L Shani, R Aharoni, G Cideron, R Dadashi, M Geist, ... Annual Meeting of the Association for Computational Linguistics (ACL), 2023	27	2023
Continuous Control with Action Quantization from Demonstrations R Dadashi, L Hussenot, D Vincent, S Girgin, A Raichuk, M Geist, ... International Conference on Machine Learning (ICML), 2022	21	2022
Hyperparameter Selection for Imitation Learning L Hussenot, M Andrychowicz, D Vincent, R Dadashi, A Raichuk, ... International Conference on Machine Learning (ICML), 2021	16	2021
Show me the Way: Intrinsic Motivation from Demonstrations L Hussenot, R Dadashi, M Geist, O Pietquin International Conference on Autonomous Agents and Multiagent Systems (AAMAS …, 2020	9	2020
WARM: On the Benefits of Weight Averaged Reward Models A Ramé, N Vieillard, L Hussenot, R Dadashi, G Cideron, O Bachem, ... arXiv preprint arXiv:2401.12187, 2024	8	2024
Learning Energy Networks with Generalized Fenchel-Young Losses M Blondel, F Llinares-López, R Dadashi, L Hussenot, M Geist Neural Information Processing Systems (NeurIPS), 2022	5	2022
Generalized Policy Updates for Policy Optimization S Kumar, Z Ahmed, R Dadashi, D Schuurmans, MG Bellemare NeurIPS 2019 Optimization Foundations for Reinforcement Learning Workshop, 2019	2	2019
Get Back Here: Robust Imitation by Return-to-Distribution Planning G Cideron, B Tabanpour, S Curi, S Girgin, L Hussenot, G Dulac-Arnold, ... arXiv preprint arXiv:2305.01400, 2023	1	2023
Offline Reinforcement Learning with On-Policy Q-Function Regularization L Shi, R Dadashi, Y Chi, PS Castro, M Geist	1	2023

Das System kann den Vorgang jetzt nicht ausführen. Versuchen Sie es später erneut.

Artikel 1–20

Zitate pro Jahr

Doppelte Zitate

Zusammengeführte Zitate

Koautor hinzufügenKoautoren

Folgen

Zitiert von

Koautoren