JB Lanier
Other names: John Lanier, John Banister Lanier
Verified email at uci.edu
Title
Cited by
Year
Pipeline PSRO: A scalable approach for finding approximate Nash equilibria in large games
S McAleer, JB Lanier, R Fox, P Baldi
Advances in Neural Information Processing Systems 33, 2020
Cited by: 82, Year: 2020
XDO: A double oracle algorithm for extensive-form games
S McAleer, JB Lanier, KA Wang, P Baldi, R Fox
Advances in Neural Information Processing Systems 34, 23128-23139, 2021
Cited by: 55, Year: 2021
Curiosity-Driven Multi-Criteria Hindsight Experience Replay
JB Lanier, S McAleer, P Baldi
arXiv preprint arXiv:1906.03710, 2019
Cited by: 23, Year: 2019
Anytime PSRO for two-player zero-sum games
S McAleer, K Wang, J Lanier, M Lanctot, P Baldi, T Sandholm, R Fox
arXiv preprint arXiv:2201.07700, 2022
Cited by: 22*, Year: 2022
OffWorld gym: Open-access physical robotics environment for real-world reinforcement learning benchmark and research
A Kumar, T Buckley, JB Lanier, Q Wang, A Kavelaars, I Kuzovkin
arXiv preprint arXiv:1910.08639, 2019
Cited by: 13, Year: 2019
Self-Play PSRO: Toward Optimal Populations in Two-Player Zero-Sum Games
S McAleer, JB Lanier, K Wang, P Baldi, R Fox, T Sandholm
arXiv preprint arXiv:2207.06541, 2022
Cited by: 12, Year: 2022
Selective Perception: Optimizing State Descriptions with Reinforcement Learning for Language Model Actors
K Nottingham, Y Razeghi, K Kim, JB Lanier, P Baldi, R Fox, S Singh
arXiv preprint arXiv:2307.11922, 2023
Cited by: 8, Year: 2023
Feasible Adversarial Robust Reinforcement Learning for Underspecified Environments
JB Lanier, S McAleer, P Baldi, R Fox
arXiv preprint arXiv:2207.09597, 2022
Cited by: 6, Year: 2022
Improving Social Welfare While Preserving Autonomy via a Pareto Mediator
S McAleer, J Lanier, M Dennis, P Baldi, R Fox
arXiv preprint arXiv:2106.03927, 2021
Cited by: 6, Year: 2021
Selective Perception: Learning Concise State Descriptions for Language Model Actors
K Nottingham, Y Razeghi, K Kim, JB Lanier, P Baldi, R Fox, S Singh
Proceedings of the 2024 Conference of the North American Chapter of the …, 2024
Cited by: 3, Year: 2024
ColosseumRL: A Framework for Multiagent Reinforcement Learning in N-Player Games
A Shmakov, J Lanier, S McAleer, R Achar, C Lopes, P Baldi
arXiv preprint arXiv:1912.04451, 2019
Cited by: 2, Year: 2019
CFR-DO: A Double Oracle Algorithm for Extensive-Form Games
S McAleer, J Lanier, P Baldi, R Fox
AAAI-21 Workshop on Reinforcement Learning in Games, 2021
Cited by: 1, Year: 2021
Make the Pertinent Salient: Task-Relevant Reconstruction for Visual Control with Distractions
K Kim, JB Lanier, P Baldi, C Fowlkes, R Fox
arXiv preprint arXiv:2410.09972, 2024
Year: 2024
Realizable Continuous-Space Shields for Safe Reinforcement Learning
K Kim, D Corsi, A Rodriguez, JB Lanier, B Parellada, P Baldi, C Sanchez, ...
arXiv preprint arXiv:2410.02038, 2024
Year: 2024
Anytime Optimal PSRO for Two-Player Zero-Sum Games
S McAleer, K Wang, J Lanier, M Lanctot, P Baldi, T Sandholm, R Fox
Year: 2021
OffWorld Gym: open-access physical lunar analog environment for reinforcement learning and robotics research
I Kuzovkin, J Lanier, A Kumar, Q Wang
43rd COSPAR Scientific Assembly. Held 28 January-4 February 43, 164, 2021
Year: 2021
Toward Optimal Policy Population Growth in Two-Player Zero-Sum Games
SM McAleer, JB Lanier, KA Wang, P Baldi, T Sandholm, R Fox
The Twelfth International Conference on Learning Representations