Folgen
Rui Yuan
Rui Yuan
AI Research Scientist, Stellantis
Bestätigte E-Mail-Adresse bei stellantis.com - Startseite
Titel
Zitiert von
Zitiert von
Jahr
A general sample complexity analysis of vanilla policy gradient
R Yuan, RM Gower, A Lazaric
International Conference on Artificial Intelligence and Statistics (AISTATS …, 2022
392022
Sketched Newton-Raphson
R Yuan, A Lazaric, RM Gower
SIAM Journal on Optimization 32 (3), 1555-1583, 2022
26*2022
Linear convergence of natural policy gradient methods with log-linear policies
R Yuan, SS Du, RM Gower, A Lazaric, L Xiao
The Eleventh International Conference on Learning Representations, 2022
252022
A Novel Framework for Policy Mirror Descent with General Parameterization and Linear Convergence
C Alfano, R Yuan, P Rebeschini
Advances in Neural Information Processing Systems 36, 2024
92024
SAN: Stochastic Average Newton Algorithm for Minimizing Finite Sums
J Chen, R Yuan, G Garrigos, RM Gower
International Conference on Artificial Intelligence and Statistics (AISTATS …, 2022
52022
Enhancing Policy Gradient with the Polyak Step-Size Adaption
Y Li, R Yuan, C Fan, M Schmidt, S Horváth, RM Gower, M Takáč
arXiv preprint arXiv:2404.07525, 2024
2024
Understanding in-context learning in transformers
S Rossi, R Yuan, T Hannagan
The Third Blogpost Track at ICLR 2024, 2024
2024
Méthodes du second d'ordre stochastiques et analysis de temps fini des méthodes de policie gradient
R Yuan
2023
Stochastic Second Order Methods and Finite Time Analysis of Policy Gradient Methods
R Yuan
Institut polytechnique de Paris, 2023
2023
Das System kann den Vorgang jetzt nicht ausführen. Versuchen Sie es später erneut.
Artikel 1–9