David Brandfonbrener
Verified email at nyu.edu
Title
Cited by
Year
Frequentist regret bounds for randomized least-squares value iteration
A Zanette*, D Brandfonbrener*, E Brunskill, M Pirotta, A Lazaric
International Conference on Artificial Intelligence and Statistics, 1954-1964, 2020
Cited by: 104 | Year: 2020
Offline RL without off-policy evaluation
D Brandfonbrener, W Whitney, R Ranganath, J Bruna
Advances in neural information processing systems 34, 4933-4946, 2021
Cited by: 52 | Year: 2021
Don't Change the Algorithm, Change the Data: Exploratory Data for Offline Reinforcement Learning
D Yarats*, D Brandfonbrener*, H Liu, M Laskin, P Abbeel, A Lazaric, ...
arXiv preprint arXiv:2201.13425, 2022
Cited by: 25 | Year: 2022
PsychRNN: an accessible and flexible Python package for training recurrent neural network models on cognitive tasks
DB Ehrlich, JT Stone, D Brandfonbrener, A Atanasov, JD Murray
eNeuro 8 (1), 2021
Cited by: 18 | Year: 2021
Geometric insights into the convergence of nonlinear TD learning
D Brandfonbrener, J Bruna
International Conference on Learning Representations (ICLR), 2020
Cited by: 17* | Year: 2020
Evaluating representations by the complexity of learning low-loss predictors
WF Whitney, MJ Song, D Brandfonbrener, J Altosaar, K Cho
arXiv preprint arXiv:2009.07368, 2020
Cited by: 16 | Year: 2020
Offline Contextual Bandits with Overparameterized Models
D Brandfonbrener, WF Whitney, R Ranganath, J Bruna
International Conference on Machine Learning (ICML), 2021, 2020
Cited by: 9* | Year: 2020
When does return-conditioned supervised learning work for offline reinforcement learning?
D Brandfonbrener, A Bietti, J Buckman, R Laroche, J Bruna
arXiv preprint arXiv:2206.01079, 2022
Cited by: 8 | Year: 2022
Quantile filtered imitation learning
D Brandfonbrener, WF Whitney, R Ranganath, J Bruna
arXiv preprint arXiv:2112.00950, 2021
Cited by: 3 | Year: 2021
Two-vertex generators of Jacobians of graphs
D Brandfonbrener, P Devlin, N Friedenberg, Y Ke, S Marcus, H Reichard, ...
The Electronic Journal of Combinatorics 25 (1), 2018
Cited by: 2 | Year: 2018
Incorporating explicit uncertainty estimates into deep offline reinforcement learning
D Brandfonbrener, RT Combes, R Laroche
arXiv preprint arXiv:2206.01085, 2022
Cited by: 1 | Year: 2022
Visual Backtracking Teleoperation: A Data Collection Protocol for Offline Image-Based Reinforcement Learning
D Brandfonbrener, S Tu, A Singh, S Welker, C Boodoo, N Matni, J Varley
arXiv preprint arXiv:2210.02343, 2022
Year: 2022