Folgen
Satinder Singh
Satinder Singh
DeepMind / U. of Michigan
Bestätigte E-Mail-Adresse bei umich.edu - Startseite
Titel
Zitiert von
Zitiert von
Jahr
Policy Gradient Methods for Reinforcement Learning with Function Approximation
R Sutton, D McAllester, S Singh, Y Mansour
Neural Information Processing Systems, 1999
62591999
Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning
RS Sutton, D Precup, S Singh
Artificial intelligence 112 (1-2), 181-211, 1999
36851999
Learning to act using real-time dynamic programming
AG Barto, SJ Bradtke, SP Singh
Artificial intelligence 72 (1-2), 81-138, 1995
15841995
Near-optimal reinforcement learning in polynomial time
M Kearns, S Singh
Machine learning 49, 209-232, 2002
11762002
Convergence of stochastic iterative dynamic programming algorithms
T Jaakkola, M Jordan, S Singh
Advances in neural information processing systems 6, 1993
11651993
Reinforcement learning with replacing eligibility traces
SP Singh, RS Sutton
Machine learning 22 (1-3), 123-158, 1996
9631996
Intrinsically motivated reinforcement learning
N Chentanez, A Barto, S Singh
Advances in neural information processing systems 17, 2004
9102004
Action-conditional video prediction using deep networks in atari games
J Oh, X Guo, H Lee, RL Lewis, S Singh
Advances in neural information processing systems 28, 2015
8932015
Convergence results for single-step on-policy reinforcement-learning algorithms
S Singh, T Jaakkola, ML Littman, C Szepesvári
Machine learning 38, 287-308, 2000
8892000
Between MDPs and semi-MDPs: Learning, planning, learning and sequential decision making
RS Sutton, D Precup, SP Singh
Technical Report COINS 89-95, University of Massachusetts, Amherst, 1998
810*1998
Graphical models for game theory
M Kearns, ML Littman, S Singh
arXiv preprint arXiv:1301.2281, 2013
7752013
Eligibility traces for off-policy policy evaluation
D Precup, R Sutton, S Singh
Computer Science Department Faculty Publication Series, 80, 2000
7352000
Predictive representations of state
ML Littman, RS Sutton, S Singh
Advances in neural information processing systems, 1555-1561, 2002
6702002
Learning without state-estimation in partially observable Markovian decision processes
SP Singh, T Jaakkola, MI Jordan
Machine Learning Proceedings 1994, 284-292, 1994
5681994
Intrinsically motivated learning of hierarchical collections of skills
AG Barto, S Singh, N Chentanez
Proceedings of the 3rd International Conference on Development and Learning …, 2004
5232004
Optimizing dialogue management with reinforcement learning: Experiments with the NJFun system
S Singh, D Litman, M Kearns, M Walker
Journal of Artificial Intelligence Research 16, 105-133, 2002
4972002
Intrinsically motivated reinforcement learning: An evolutionary perspective
S Singh, RL Lewis, AG Barto, J Sorg
IEEE Transactions on Autonomous Mental Development 2 (2), 70-82, 2010
4792010
Transfer of learning by composing solutions of elemental sequential tasks
SP Singh
Machine learning 8, 323-339, 1992
4781992
Reinforcement Learning with Soft State Aggregation
S Singh, T Jaakkola, M Jordan
Neural Information Processing Systems, 1995
4261995
Deep learning for real-time Atari game play using offline Monte-Carlo tree search planning
X Guo, S Singh, H Lee, RL Lewis, X Wang
Advances in neural information processing systems 27, 2014
4102014
Das System kann den Vorgang jetzt nicht ausführen. Versuchen Sie es später erneut.
Artikel 1–20