Konrad Żołna
Research Scientist, DeepMind
A Generalist Agent
S Reed*, K Zolna*, E Parisotto*, SG Colmenarejo, A Novikov, ...
Transactions on Machine Learning Research, 2022
Critic Regularized Regression
Z Wang, A Novikov, K Zolna, JT Springenberg, S Reed, B Shahriari, ...
Advances in Neural Information Processing Systems 33, 2020
RL Unplugged: A Suite of Benchmarks for Offline Reinforcement Learning
C Gulcehre, Z Wang, A Novikov, T Le Paine, S Gomez Colmenarejo, ...
Advances in Neural Information Processing Systems 33, 2020
Scaling data-driven robotics with reward sketching and batch reinforcement learning
S Cabi, SG Colmenarejo, A Novikov, K Konyushkova, S Reed, R Jeong, ...
Robotics: Science and Systems, 2020
Hyperparameter selection for offline reinforcement learning
TL Paine, C Paduraru, A Michi, C Gulcehre, K Zolna, A Novikov, Z Wang, ...
arXiv preprint arXiv:2007.09055, 2020
Offline learning from demonstrations and unlabeled experience
K Zolna, A Novikov, K Konyushkova, C Gulcehre, Z Wang, Y Aytar, ...
arXiv preprint arXiv:2011.13885, 2020
Adversarial framing for image and video classification
M Zajac*, K Zolna*, N Rostamzadeh, PO Pinheiro
Proceedings of the AAAI Conference on Artificial Intelligence 33, 10077-10078, 2019
RoboCat: A self-improving generalist agent for robotic manipulation
K Bousmalis, G Vezzani, D Rao, CM Devin, AX Lee, MB Villalonga, ...
Transactions on Machine Learning Research, 2023
Fraternal dropout
K Zolna, D Arpit, D Suhubdy, Y Bengio
Proceedings of the 6th International Conference on Learning Representations, 2017
Task-relevant adversarial imitation learning
K Zolna*, S Reed*, A Novikov, SG Colmenarejo, D Budden, S Cabi, ...
Conference on Robot Learning, 247-263, 2021
Regularized behavior value estimation
C Gulcehre, SG Colmenarejo, Z Wang, J Sygnowski, T Paine, K Zolna, ...
arXiv preprint arXiv:2103.09575, 2021
Semi-supervised reward learning for offline reinforcement learning
K Konyushkova, K Zolna, Y Aytar, A Novikov, S Reed, S Cabi, ...
arXiv preprint arXiv:2012.06899, 2020
Towards homoscedastic nonlinear cointegration for structural health monitoring
K Zolna, PB Dao, WJ Staszewski, T Barszcz
Mechanical Systems and Signal Processing 75, 94-108, 2016
Classifier-agnostic saliency map extraction
K Zolna, KJ Geras, K Cho
Computer Vision and Image Understanding, 102969, 2020
Nonlinear cointegration approach for condition monitoring of wind turbines
K Zolna, PB Dao, WJ Staszewski, T Barszcz
Mathematical Problems in Engineering 2015 (1), 978156, 2015
Focused Hierarchical RNNs for Conditional Sequence Processing
NR Ke, K Zolna, A Sordoni, Z Lin, A Trischler, Y Bengio, J Pineau, ...
Proceedings of the 35th International Conference on Machine Learning, 2018
Genie: Generative interactive environments
J Bruce, MD Dennis, A Edwards, J Parker-Holder, Y Shi, E Hughes, M Lai, ...
Forty-first International Conference on Machine Learning, 2024
The dynamics of handwriting improves the automated diagnosis of dysgraphia
K Zolna*, T Asselborn*, C Jolly, L Casteran, W Johal, P Dillenbourg
arXiv preprint arXiv:1906.07576, 2019
StarCraft II Unplugged: Large Scale Offline Reinforcement Learning
M Mathieu, S Ozair, S Srinivasan, C Gulcehre, S Zhang, R Jiang, ...
Deep RL Workshop NeurIPS 2021, 2021
Reinforced Imitation in Heterogeneous Action Space
K Zolna, N Rostamzadeh, Y Bengio, S Ahn, PO Pinheiro
arXiv preprint arXiv:1904.03438, 2019