The hanabi challenge: A new frontier for ai research N Bard, JN Foerster, S Chandar, N Burch, M Lanctot, HF Song, E Parisotto, ... Artificial Intelligence 280, 103216, 2020 | 444 | 2020 |
The option keyboard: Combining skills in reinforcement learning A Barreto, D Borsa, S Hou, G Comanici, E Aygün, P Hamel, D Toyama, ... Advances in Neural Information Processing Systems 32, 2019 | 109 | 2019 |
Task switching or task launching based on a ranked list of tasks PJ Beaudoin, TC Huang, R Lee, PA Manzagol, RDP McFarlane, ... US Patent App. 14/446,760, 2018 | 90 | 2018 |
Androidenv: A reinforcement learning platform for android D Toyama, P Hamel, A Gergely, G Comanici, A Glaese, Z Ahmed, ... arXiv preprint arXiv:2105.13231, 2021 | 64 | 2021 |
Shaping representations through communication: community size effect in artificial learning systems O Tieleman, A Lazaridou, S Mourad, C Blundell, D Precup arXiv preprint arXiv:1912.06208, 2019 | 26 | 2019 |
Learning to prove from synthetic theorems E Aygün, Z Ahmed, A Anand, V Firoiu, X Glorot, L Orseau, D Precup, ... arXiv preprint arXiv:2006.11259, 2020 | 21 | 2020 |
The barbados 2018 list of open issues in continual learning T Schaul, H van Hasselt, J Modayil, M White, A White, PL Bacon, J Harb, ... arXiv preprint arXiv:1811.07004, 2018 | 18 | 2018 |
Proving theorems using incremental learning and hindsight experience replay E Aygün, A Anand, L Orseau, X Glorot, SM Mcaleer, V Firoiu, LM Zhang, ... International Conference on Machine Learning, 1198-1210, 2022 | 17 | 2022 |
Training a first-order theorem prover from synthetic data V Firoiu, E Aygun, A Anand, Z Ahmed, X Glorot, L Orseau, L Zhang, ... arXiv preprint arXiv:2103.03798, 2021 | 16 | 2021 |
Learning representations of logical formulae using graph neural networks X Glorot, A Anand, E Aygun, S Mourad, P Kohli, D Precup Neural Information Processing Systems, Workshop on Graph Representation Learning, 2019 | 14 | 2019 |
Knowledge representation for reinforcement learning using general value functions G Comanici, D Precup, A Barreto, DK Toyama, E Aygün, P Hamel, ... | 11 | 2018 |
The Hanabi challenge: a new frontier for AI research. CoRR abs/1902.00506 (2019) N Bard, JN Foerster, S Chandar, N Burch, M Lanctot, HF Song, E Parisotto, ... arXiv preprint arXiv:1902.00506, 2019 | 8 | 2019 |
Shaping representations through communication O Tieleman, A Lazaridou, S Mourad, C Blundell, D Precup | 8 | 2018 |
Anonymous personalized recommendation method S Mourad, CK Phillips, MA Courteau, P Beaudoin US Patent 8,745,049, 2014 | 5 | 2014 |
Anonymous personalized recommendation method S Mourad, CK Phillips, MA Courteau, P Beaudoin US Patent 8,521,735, 2013 | 5 | 2013 |
Agents Thinking Fast and Slow: A Talker-Reasoner Architecture K Christakopoulou, S Mourad, M Matarić arXiv preprint arXiv:2410.08328, 2024 | 2 | 2024 |
Community size effect in artificial learning systems. O Tieleman, A Lazaridou, S Mourad, C Blundell, D Precup ViGIL@ NeurIPS, 2019 | 1 | 2019 |