Universal Option Models
暂无分享,去创建一个
Shalabh Bhatnagar | Richard S. Sutton | Csaba Szepesvári | Hengshuai Yao | Joseph Modayil | R. Sutton | Csaba Szepesvari | S. Bhatnagar | Hengshuai Yao | Joseph Modayil
[1] Andrew G. Barto,et al. Monte Carlo Matrix Inversion and Reinforcement Learning , 1993, NIPS.
[2] Richard S. Sutton,et al. Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.
[3] Richard S. Sutton,et al. Introduction to Reinforcement Learning , 1998 .
[4] Michael I. Jordan,et al. MASSACHUSETTS INSTITUTE OF TECHNOLOGY ARTIFICIAL INTELLIGENCE LABORATORY and CENTER FOR BIOLOGICAL AND COMPUTATIONAL LEARNING DEPARTMENT OF BRAIN AND COGNITIVE SCIENCES , 1996 .
[5] Jie Tang,et al. ArnetMiner: extraction and mining of academic social networks , 2008, KDD.
[6] Matthew Richardson,et al. The Intelligent surfer: Probabilistic Combination of Link and Content Information in PageRank , 2001, NIPS.
[7] Doina Precup,et al. Between MDPs and Semi-MDPs: A Framework for Temporal Abstraction in Reinforcement Learning , 1999, Artif. Intell..
[8] Doina Precup,et al. Temporal abstraction in reinforcement learning , 2000, ICML 2000.
[9] John N. Tsitsiklis,et al. Neuro-Dynamic Programming , 1996, Encyclopedia of Machine Learning.
[10] Robert E. Schapire,et al. Reinforcement learning without rewards , 2010 .
[11] Rajeev Motwani,et al. The PageRank Citation Ranking : Bringing Order to the Web , 1999, WWW 1999.
[12] Andrew Y. Ng,et al. Pharmacokinetics of a novel formulation of ivermectin after administration to goats , 2000, ICML.
[13] Satinder P. Singh,et al. Linear options , 2010, AAMAS.
[14] Pieter Abbeel,et al. Autonomous Helicopter Aerobatics through Apprenticeship Learning , 2010, Int. J. Robotics Res..