Romain Laroche

Cited by

	All	Since 2019
Citations	1786	1337
h-index	22	18
i10-index	44	28

Citations per year

Year	Citations
2010	12
2011	16
2012	3
2013	8
2014	34
2015	34
2016	104
2017	110
2018	123
2019	160
2020	213
2021	231
2022	287
2023	336
2024	109

Public access

8 articles available, 0 articles not available

Based on funding mandates

Co-authors

Remi Tachet des Combes: verified email at alpacaml.com
Olivier Pietquin: Cohere | ex Google DeepMind (On leave - Professor at University of Lille); verified email at univ-lille.fr
Harm van Seijen: Sony AI; verified email at sony.com
Layla El Asri: Research Lead at Borealis AI; verified email at borealisai.com
Raphaël Féraud: Orange Labs; verified email at orange.com
Steve Young: Professor of Information Engineering; verified email at eng.cam.ac.uk
Oliver Lemon: Professor of Artificial Intelligence, Heriot-Watt University, Edinburgh, Director of Interaction Lab; verified email at hw.ac.uk
Matthieu Geist: Cohere (ex Google, on leave of Professor, Université de Lorraine); verified email at univ-lorraine.fr
Bilal Piot: Google DeepMind; verified email at google.com
Julien Perolat: DeepMind; verified email at google.com
Julia Velkovska: Vanderbilt University; verified email at vanderbilt.edu

Romain Laroche

Microsoft Research

Verified email at polytechnique.org

Reinforcement Learning, Dialogue Systems


Title	Cited by	Year
Hybrid reward architecture for reinforcement learning. H Van Seijen, M Fatemi, J Romoff, R Laroche, T Barnes, J Tsang. Advances in Neural Information Processing Systems 30, 2017	256	2017
Safe policy improvement with baseline bootstrapping. R Laroche, P Trichelair, RT Des Combes. International conference on machine learning, 3652-3661, 2019	214	2019
Learning dynamic belief graphs to generalize on text-based games. A Adhikari, X Yuan, MA Côté, M Zelinka, MA Rondeau, R Laroche, ... Advances in Neural Information Processing Systems 33, 3045-3057, 2020	102	2020
Contextual bandit for active learning: Active thompson sampling. D Bouneffouf, R Laroche, T Urvoy, R Féraud, R Allesiardo. Neural Information Processing: 21st International Conference, ICONIP 2014 …, 2014	93	2014
Counting to explore and generalize in text-based games. X Yuan, MA Côté, A Sordoni, R Laroche, RT Combes, M Hausknecht, ... arXiv preprint arXiv:1806.11525, 2018	60	2018
Transfer reinforcement learning with shared dynamics. R Laroche, M Barlier. Proceedings of the AAAI conference on artificial intelligence 31 (1), 2017	59	2017
When does return-conditioned supervised learning work for offline reinforcement learning? D Brandfonbrener, A Bietti, J Buckman, R Laroche, J Bruna. Advances in Neural Information Processing Systems 35, 1542-1553, 2022	53	2022
Score-based inverse reinforcement learning. L El Asri, B Piot, M Geist, R Laroche, O Pietquin. International Conference on Autonomous Agents and Multiagent Systems (AAMAS …, 2016	45	2016
Reinforcement learning algorithm selection. R Laroche, R Feraud. ICLR, 2018	39	2018
Hybrid reward architecture for reinforcement learning. HH Van Seijen, SMF Booshehri, RMH Laroche, JS Romoff. US Patent 10,977,551, 2021	38	2021
Safe policy improvement with soft baseline bootstrapping. K Nadjahi, R Laroche, R Tachet des Combes. Machine Learning and Knowledge Discovery in Databases: European Conference …, 2020	35	2020
Transfer Learning for User Adaptation in Spoken Dialogue Systems. A Genevay, R Laroche. AAMAS, 975-983, 2016	33	2016
Human-machine dialogue as a stochastic game. M Barlier, J Perolat, R Laroche, O Pietquin. 16th Annual SIGdial Meeting on Discourse and Dialogue (SIGDIAL 2015), 2015	31	2015
NASTIA: Negotiating Appointment Setting Interface. L El Asri, R Lemonnier, R Laroche, O Pietquin, H Khouzaimi. LREC, 266-271, 2014	30	2014
Reward function learning for dialogue management. L El Asri, R Laroche, O Pietquin. STAIRS 2012, 95-106, 2012	29	2012
Reward shaping for statistical optimisation of dialogue management. L El Asri, R Laroche, O Pietquin. Statistical Language and Speech Processing: First International Conference …, 2013	28	2013
Decentralized exploration in multi-armed bandits. R Féraud, R Alami, R Laroche. International Conference on Machine Learning, 1901-1909, 2019	27	2019
Safe policy improvement with an estimated baseline policy. TD Simão, R Laroche, RT Combes. International Foundation for Autonomous Agents and Multi-Agent Systems, 2019	26	2019
On value function representation of long horizon problems. L Lehnert, R Laroche, H van Seijen. Proceedings of the AAAI Conference on Artificial Intelligence 32 (1), 2018	25	2018
Multi-advisor reinforcement learning. R Laroche, M Fatemi, J Romoff, H van Seijen. arXiv preprint arXiv:1704.00756, 2017	25	2017
