Gerald Tesauro

Zitiert von

	Alle	Seit 2019
Zitate	19135	7105
h-index	62	38
i10-index	116	73

1400

700

350

1050

19891990199119921993199419951996199719981999200020012002200320042005200620072008200920102011201220132014201520162017201820192020202120222023202470 105 95 86 152 174 137 177 178 219 192 225 304 280 411 359 445 524 605 596 670 626 663 612 606 609 526 590 611 845 1033 1179 1289 1334 1358 912

Folgen

Gerald Tesauro

IBM Research

Bestätigte E-Mail-Adresse bei us.ibm.com - Startseite

Machine Learning Reinforcement Learning Multi-Agent Learning


Titel Nach Zitationen sortieren Nach Jahr sortieren Nach Titel sortieren	Zitiert von Zitiert von	Jahr
Temporal difference learning and TD-Gammon G Tesauro Communications of the ACM 38 (3), 58-68, 1995	3084	1995
Practical issues in temporal difference learning G Tesauro Advances in neural information processing systems 4, 1991	1472	1991
TD-Gammon, a self-teaching backgammon program, achieves master-level play G Tesauro Neural computation 6 (2), 215-219, 1994	1331	1994
Learning to learn without forgetting by maximizing transfer and minimizing interference M Riemer, I Cases, R Ajemian, M Liu, I Rish, Y Tu, G Tesauro arXiv preprint arXiv:1810.11910, 2018	814	2018
Utility functions in autonomic systems WE Walsh, G Tesauro, JO Kephart, R Das International Conference on Autonomic Computing, 2004. Proceedings., 70-77, 2004	619	2004
A hybrid reinforcement learning approach to autonomic resource allocation G Tesauro, NK Jong, R Das, MN Bennani 2006 IEEE International Conference on Autonomic Computing, 65-73, 2006	486	2006
R³: Reinforced Ranker-Reader for Open-Domain Question Answering S Wang, M Yu, X Guo, Z Wang, T Klinger, W Zhang, S Chang, G Tesauro, ... Proceedings of the AAAI Conference on Artificial Intelligence 32 (1), 2018	397	2018
Agent-human interactions in the continuous double auction R Das, JE Hanson, JO Kephart, G Tesauro International joint conference on artificial intelligence 17 (1), 1169-1178, 2001	381	2001
On-line policy improvement using Monte-Carlo search G Tesauro, G Galperin Advances in neural information processing systems 9, 1996	368	1996
A multi-agent systems approach to autonomic computing G Tesauro, DM Chess, WE Walsh, R Das, A Segal, I Whalley, JO Kephart, ... Proceedings of the Third International Joint Conference on Autonomous Agents …, 2004	365	2004
Programming backgammon using self-teaching neural nets G Tesauro Artificial Intelligence 134 (1-2), 181-199, 2002	307	2002
Diverse few-shot text classification with multiple metrics M Yu, X Guo, J Yi, S Chang, S Potdar, Y Cheng, G Tesauro, H Wang, ... arXiv preprint arXiv:1805.07513, 2018	300	2018
Extending Q-learning to general adaptive multi-agent systems G Tesauro Advances in neural information processing systems 16, 2003	300	2003
Neural networks for computer virus recognition GJ Tesauro, JO Kephart, GB Sorkin IEEE expert 11 (4), 5-6, 1996	265	1996
Multiresolution recurrent neural networks: An application to dialogue response generation I Serban, T Klinger, G Tesauro, K Talamadupula, B Zhou, Y Bengio, ... Proceedings of the AAAI Conference on Artificial Intelligence 31 (1), 2017	248	2017
Metric learning for kernel regression KQ Weinberger, G Tesauro Artificial intelligence and statistics, 612-619, 2007	228	2007
Analyzing complex strategic interactions in multi-agent systems WE Walsh, R Das, G Tesauro, JO Kephart AAAI-02 Workshop on Game-Theoretic and Decision-Theoretic Agents, 109-118, 2002	227	2002
Biologically inspired defenses against computer viruses JO Kephart, GB Sorkin, WC Arnold, DM Chess, GJ Tesauro, SR White, ... IJCAI (1), 985-996, 1995	225	1995
Pricing in agent economies using multi-agent Q-learning G Tesauro, JO Kephart Autonomous agents and multi-agent systems 5, 289-304, 2002	215	2002
Reinforcement learning in autonomic computing: A manifesto and case studies G Tesauro IEEE Internet Computing 11 (1), 22-30, 2007	208	2007

Das System kann den Vorgang jetzt nicht ausführen. Versuchen Sie es später erneut.

Artikel 1–20

Zitate pro Jahr

Doppelte Zitate

Zusammengeführte Zitate

Koautor hinzufügenKoautoren

Folgen

Zitiert von