Mohammadhosein Hasanbeig
Mohammadhosein Hasanbeig
Bestätigte E-Mail-Adresse bei cs.ox.ac.uk - Startseite
Titel
Zitiert von
Zitiert von
Jahr
Logically-Constrained Reinforcement Learning
M Hasanbeig, A Abate, D Kroening
arXiv preprint arXiv:1801.08099, 2018
462018
Reinforcement Learning for Temporal Logic Control Synthesis with Probabilistic Satisfaction Guarantees
M Hasanbeig, Y Kantaros, A Abate, D Kroening, GJ Pappas, I Lee
IEEE Conference on Decision and Control (CDC), 2019
352019
Certified Reinforcement Learning with Logic Guidance
M Hasanbeig, A Abate, D Kroening
arXiv preprint arXiv:1902.00778, 2019
222019
Logically-Constrained Neural Fitted Q-iteration
M Hasanbeig, A Abate, D Kroening
AAMAS, 2012-2014, 2019
212019
Cautious Reinforcement Learning with Logical Constraints
M Hasanbeig, A Abate, D Kroening
AAMAS, 483-491, 2020
122020
Modular deep reinforcement learning with temporal logic specifications
LZ Yuan, M Hasanbeig, A Abate, D Kroening
arXiv preprint arXiv:1909.11591, 2019
122019
On Synchronous Binary Log-Linear Learning and Second Order Q-learning
M Hasanbeig, L Pavel
IFAC World Congress 50 (1), 8987-8992, 2017
82017
Deep reinforcement learning with temporal logics
M Hasanbeig, D Kroening, A Abate
International Conference on Formal Modeling and Analysis of Timed Systems, 1-22, 2020
62020
Deepsynth: Program synthesis for automatic task segmentation in deep reinforcement learning
M Hasanbeig, NY Jeppu, A Abate, T Melham, D Kroening
arXiv preprint arXiv:1911.10244, 2019
62019
Towards Verifiable and Safe Model-free Reinforcement Learning
M Hasanbeig, D Kroening, A Abate
Workshop on Artificial Intelligence and Formal Verification, Logics …, 2020
32020
From game-theoretic multi-agent log linear learning to reinforcement learning
M Hasanbeig, L Pavel
arXiv preprint arXiv:1802.02277, 2018
32018
Distributed coverage control by robot networks in unknown environments using a modified EM algorithm
M Hasanbeig, L Pavel
International Journal of Computer and Information Engineering 11 (7), 815-823, 2017
32017
Multi-agent Learning in Coverage Control Games
M Hasanbeig
University of Toronto, 2016
22016
Modular Deep Reinforcement Learning for Continuous Motion Planning with Temporal Logic
M Cai, M Hasanbeig, S Xiao, A Abate, Z Kan
arXiv preprint arXiv:2102.12855, 2021
12021
Shielding Atari Games with Bounded Prescience
M Giacobbe, M Hasanbeig, D Kroening, H Wijk
arXiv preprint arXiv:2101.08153, 2021
2021
Shielding Atari Games with Bounded Prescience Code Repository
HW Mirco Giacobbe, Mohammadhosein Hasanbeig, Daniel Kroening
https://github.com/HjalmarWijk/bounded-prescience, 2021
2021
DeepSynth: Automata Synthesis for Automatic Task Segmentation in Deep Reinforcement Learning Code Repository
M Hasanbeig, NY Jeppu, A Abate, T Melham, D Kroening
https://github.com/grockious/deepsynth, 2020
2020
Jump Operator Planning: Goal-Conditioned Policy Ensembles and Zero-Shot Transfer
TJ Ringstrom, M Hasanbeig, A Abate
arXiv preprint arXiv:2007.02527, 2020
2020
Logically-Constrained Reinforcement Learning Code Repository
M Hasanbeig, A Abate, D Kroening
https://github.com/grockious/lcrl, 2020
2020
Das System kann den Vorgang jetzt nicht ausführen. Versuchen Sie es später erneut.
Artikel 1–19