Gerald Tesauro

Cited by

	All	Since 2019
Citations	18670	6604
h-index	61	37
i10-index	118	74

Year	Citations
1989	71
1990	104
1991	95
1992	84
1993	152
1994	175
1995	135
1996	178
1997	178
1998	219
1999	195
2000	224
2001	300
2002	281
2003	406
2004	354
2005	451
2006	532
2007	610
2008	602
2009	677
2010	622
2011	670
2012	610
2013	611
2014	618
2015	539
2016	591
2017	613
2018	845
2019	1040
2020	1173
2021	1299
2022	1340
2023	1370
2024	382

IBM Research

Verified email at us.ibm.com

Machine Learning, Reinforcement Learning, Multi-Agent Learning


Title	Authors	Venue	Cited by	Year
Temporal difference learning and TD-Gammon	G Tesauro	Communications of the ACM 38 (3), 58-68, 1995	2968	1995
Practical issues in temporal difference learning	G Tesauro	Advances in neural information processing systems 4, 1991	1467	1991
TD-Gammon, a self-teaching backgammon program, achieves master-level play	G Tesauro	Neural computation 6 (2), 215-219, 1994	1287	1994
Learning to learn without forgetting by maximizing transfer and minimizing interference	M Riemer, I Cases, R Ajemian, M Liu, I Rish, Y Tu, G Tesauro	arXiv preprint arXiv:1810.11910, 2018	719	2018
Utility functions in autonomic systems	WE Walsh, G Tesauro, JO Kephart, R Das	International Conference on Autonomic Computing, 2004. Proceedings., 70-77, 2004	614	2004
A hybrid reinforcement learning approach to autonomic resource allocation	G Tesauro, NK Jong, R Das, MN Bennani	2006 IEEE International Conference on Autonomic Computing, 65-73, 2006	483	2006
Agent-human interactions in the continuous double auction	R Das, JE Hanson, JO Kephart, G Tesauro	International joint conference on artificial intelligence 17 (1), 1169-1178, 2001	378	2001
R³: Reinforced Ranker-Reader for Open-Domain Question Answering	S Wang, M Yu, X Guo, Z Wang, T Klinger, W Zhang, S Chang, G Tesauro, ...	Proceedings of the AAAI Conference on Artificial Intelligence 32 (1), 2018	377	2018
A multi-agent systems approach to autonomic computing	G Tesauro, DM Chess, WE Walsh, R Das, A Segal, I Whalley, JO Kephart, ...	Proceedings of the Third International Joint Conference on Autonomous Agents …, 2004	364	2004
On-line policy improvement using Monte-Carlo search	G Tesauro, G Galperin	Advances in neural information processing systems 9, 1996	354	1996
Programming backgammon using self-teaching neural nets	G Tesauro	Artificial Intelligence 134 (1-2), 181-199, 2002	295	2002
Extending Q-learning to general adaptive multi-agent systems	G Tesauro	Advances in neural information processing systems 16, 2003	289	2003
Diverse few-shot text classification with multiple metrics	M Yu, X Guo, J Yi, S Chang, S Potdar, Y Cheng, G Tesauro, H Wang, ...	arXiv preprint arXiv:1805.07513, 2018	273	2018
Neural networks for computer virus recognition	GJ Tesauro, JO Kephart, GB Sorkin	IEEE expert 11 (4), 5-6, 1996	264	1996
Multiresolution recurrent neural networks: An application to dialogue response generation	I Serban, T Klinger, G Tesauro, K Talamadupula, B Zhou, Y Bengio, ...	Proceedings of the AAAI Conference on Artificial Intelligence 31 (1), 2017	241	2017
Metric learning for kernel regression	KQ Weinberger, G Tesauro	Artificial intelligence and statistics, 612-619, 2007	225	2007
Analyzing complex strategic interactions in multi-agent systems	WE Walsh, R Das, G Tesauro, JO Kephart	AAAI-02 Workshop on Game-Theoretic and Decision-Theoretic Agents, 109-118, 2002	220	2002
Biologically inspired defenses against computer viruses	JO Kephart, GB Sorkin, WC Arnold, DM Chess, GJ Tesauro, SR White, ...	IJCAI (1), 985-996, 1995	218	1995
Proceedings of the 6th International Conference on Neural Information Processing Systems	JD Cowan, G Tesauro, J Alspector	Morgan Kaufmann Publishers Inc., 1993	216	1993
Pricing in agent economies using multi-agent Q-learning	G Tesauro, JO Kephart	Autonomous agents and multi-agent systems 5, 289-304, 2002	211	2002
