Ronald Ortner

Cited by

	All	Since 2019
Citations	3749	2473
h-index	24	18
i10-index	32	27

Citations per year

Year	Citations
2007	11
2008	36
2009	35
2010	69
2011	75
2012	86
2013	136
2014	135
2015	145
2016	138
2017	176
2018	205
2019	332
2020	415
2021	511
2022	521
2023	536
2024	158

Public access

28 articles available
0 articles not available

Based on funding mandates

Montanuniversität Leoben

Verified email at unileoben.ac.at


Title	Authors	Venue	Cited by	Year
Near-optimal regret bounds for reinforcement learning	P Auer, T Jaksch, R Ortner	Advances in Neural Information Processing Systems 21, 2008	1449	2008
UCB revisited: Improved regret bounds for the stochastic multi-armed bandit problem	P Auer, R Ortner	Periodica Mathematica Hungarica 61 (1-2), 55-65, 2010	366	2010
Logarithmic online regret bounds for undiscounted reinforcement learning	P Auer, R Ortner	Advances in Neural Information Processing Systems 19, 2006	286	2006
Improved rates for the stochastic continuum-armed bandit problem	P Auer, R Ortner, C Szepesvári	International Conference on Computational Learning Theory, 454-468, 2007	258	2007
Adaptively tracking the best bandit arm with an unknown number of distribution changes	P Auer, P Gajane, R Ortner	Conference on Learning Theory, 138-158, 2019	141*	2019
Efficient bias-span-constrained exploration-exploitation in reinforcement learning	R Fruit, M Pirotta, A Lazaric, R Ortner	International Conference on Machine Learning, 1578-1586, 2018	106	2018
A boosting approach to multiple instance learning	P Auer, R Ortner	European Conference on Machine Learning, 63-74, 2004	106	2004
Online regret bounds for undiscounted continuous reinforcement learning	R Ortner, D Ryabko	Advances in Neural Information Processing Systems 25, 2012	88	2012
Regret bounds for restless Markov bandits	R Ortner, D Ryabko, P Auer, R Munos	International Conference on Algorithmic Learning Theory, 214-228, 2012	82	2012
Variational regret bounds for reinforcement learning	R Ortner, P Gajane, P Auer	Uncertainty in Artificial Intelligence, 81-90, 2020	65	2020
PAC-Bayesian analysis of contextual bandits	Y Seldin, P Auer, J Shawe-Taylor, R Ortner, F Laviolette	Advances in Neural Information Processing Systems 24, 2011	57	2011
Regret bounds for restless Markov bandits	R Ortner, D Ryabko, P Auer, R Munos	Theoretical Computer Science 558, 62-76, 2014	54	2014
Regret bounds for reinforcement learning via Markov chain concentration	R Ortner	Journal of Artificial Intelligence Research 67, 115-128, 2020	47	2020
Non-backtracking random walks and cogrowth of graphs	R Ortner, W Woess	Canadian Journal of Mathematics 59 (4), 828-844, 2007	46	2007
A sliding-window algorithm for Markov decision processes with arbitrarily changing rewards and transitions	P Gajane, R Ortner, P Auer	arXiv preprint arXiv:1805.10066, 2018	45	2018
Improved learning complexity in combinatorial pure exploration bandits	V Gabillon, A Lazaric, M Ghavamzadeh, R Ortner, P Bartlett	Artificial Intelligence and Statistics, 1004-1012, 2016	44	2016
Improved regret bounds for undiscounted continuous reinforcement learning	K Lakshmanan, R Ortner, D Ryabko	International Conference on Machine Learning, 524-532, 2015	44	2015
Pareto front identification from stochastic bandit feedback	P Auer, CK Chiang, R Ortner, M Drugan	Artificial Intelligence and Statistics, 939-947, 2016	43	2016
Pseudometrics for state aggregation in average reward Markov decision processes	R Ortner	Algorithmic Learning Theory: 18th International Conference, ALT 2007, Sendai …, 2007	38	2007
Adaptive aggregation for reinforcement learning in average reward Markov decision processes	R Ortner	Annals of Operations Research 208, 321-336, 2013	35	2013

Articles 1–20