David Brandfonbrener

Cited by

	All	Since 2019
Citations	499	499
h-index	8	8
i10-index	8	8

Citations per year

Year	2019	2020	2021	2022	2023	2024
Citations	4	20	70	128	202	75

Public access

5 articles available
0 articles not available

Based on funding mandates

Co-authors

Joan Bruna: Courant Institute of Mathematical Sciences, New York University (verified email at cims.nyu.edu)
William Whitney: DeepMind (verified email at deepmind.com)
Rajesh Ranganath: Assistant Professor, NYU (verified email at cs.princeton.edu)
Alessandro Lazaric: Research Scientist, Facebook Artificial Intelligence Research (verified email at inria.fr)
Matteo Pirotta: Research Scientist, Meta (FAIR) (verified email at fb.com)
Andrea Zanette: Assistant Professor, Carnegie Mellon University (verified email at andrew.cmu.edu)
Romain Laroche: Microsoft Research (verified email at polytechnique.org)
Denis Yarats: Cofounder and CTO, Perplexity AI (verified email at perplexity.ai)
Alberto Bietti: Flatiron Institute, Simons Foundation (verified email at nyu.edu)
Jacob Buckman: PhD Student, Mila (verified email at mail.mcgill.ca)
Sham M Kakade: Harvard University (verified email at seas.harvard.edu)
Samy Jelassi: Harvard University (verified email at fas.harvard.edu)
Ofir Nachum: OpenAI (verified email at openai.com)
Stephen Tu: Google (verified email at google.com)
Jacob Varley: Robotics at Google DeepMind (verified email at google.com)
Eran Malach: Kempner Institute, Harvard (verified email at fas.harvard.edu)

David Brandfonbrener

Kempner Institute at Harvard University

Verified email at g.harvard.edu

machine learning, reinforcement learning


Title	Cited by	Year
Frequentist regret bounds for randomized least-squares value iteration. A Zanette, D Brandfonbrener, E Brunskill, M Pirotta, A Lazaric. International Conference on Artificial Intelligence and Statistics, 1954-1964, 2020	135	2020
Offline RL without off-policy evaluation. D Brandfonbrener, W Whitney, R Ranganath, J Bruna. Advances in Neural Information Processing Systems 34, 4933-4946, 2021	122	2021
Don't Change the Algorithm, Change the Data: Exploratory Data for Offline Reinforcement Learning. D Yarats, D Brandfonbrener, H Liu, M Laskin, P Abbeel, A Lazaric, ... arXiv preprint arXiv:2201.13425, 2022	76	2022
When does return-conditioned supervised learning work for offline reinforcement learning? D Brandfonbrener, A Bietti, J Buckman, R Laroche, J Bruna. Advances in Neural Information Processing Systems 35, 1542-1553, 2022	53	2022
PsychRNN: An accessible and flexible Python package for training recurrent neural network models on cognitive tasks. DB Ehrlich, JT Stone, D Brandfonbrener, A Atanasov, JD Murray. eNeuro 8 (1), 2021	24	2021
Evaluating representations by the complexity of learning low-loss predictors. WF Whitney, MJ Song, D Brandfonbrener, J Altosaar, K Cho. arXiv preprint arXiv:2009.07368, 2020	24	2020
Geometric insights into the convergence of nonlinear TD learning. D Brandfonbrener, J Bruna. International Conference on Learning Representations (ICLR), 2020	21*	2020
Offline Contextual Bandits with Overparameterized Models. D Brandfonbrener, WF Whitney, R Ranganath, J Bruna. International Conference on Machine Learning (ICML), 2021, 2020	15*	2020
Inverse dynamics pretraining learns good representations for multitask imitation. D Brandfonbrener, O Nachum, J Bruna. Advances in Neural Information Processing Systems 36, 2023	8	2023
Visual backtracking teleoperation: A data collection protocol for offline image-based reinforcement learning. D Brandfonbrener, S Tu, A Singh, S Welker, C Boodoo, N Matni, J Varley. 2023 IEEE International Conference on Robotics and Automation (ICRA), 11336 …, 2023	6	2023
Repeat after me: Transformers are better than state space models at copying. S Jelassi, D Brandfonbrener, SM Kakade, E Malach. arXiv preprint arXiv:2402.01032, 2024	5	2024
Quantile filtered imitation learning. D Brandfonbrener, WF Whitney, R Ranganath, J Bruna. arXiv preprint arXiv:2112.00950, 2021	4	2021
Incorporating explicit uncertainty estimates into deep offline reinforcement learning. D Brandfonbrener, RT Combes, R Laroche. arXiv preprint arXiv:2206.01085, 2022	3	2022
Two-vertex generators of Jacobians of graphs. D Brandfonbrener, P Devlin, N Friedenberg, Y Ke, S Marcus, H Reichard, ... The Electronic Journal of Combinatorics 25 (1), 2018	2	2018
Verified Multi-Step Synthesis using Large Language Models and Monte Carlo Tree Search. D Brandfonbrener, S Raja, T Prasad, C Loughridge, J Yang, S Henniger, ... arXiv preprint arXiv:2402.08147, 2024	1	2024
Q-Probe: A Lightweight Approach to Reward Maximization for Language Models. K Li, S Jelassi, H Zhang, S Kakade, M Wattenberg, D Brandfonbrener. arXiv preprint arXiv:2402.14688, 2024		2024
Bridging the Gap from Supervised Learning to Control. D Brandfonbrener. New York University, 2023		2023

Articles 1–17
