On Ensemble Techniques for AIXI Approximation
暂无分享,去创建一个
[1] Mark Herbster,et al. Tracking the Best Expert , 1995, Machine-mediated learning.
[2] Frans M. J. Willems,et al. Switching between two universal source coding algorithms , 1998, Proceedings DCC '98 Data Compression Conference (Cat. No.98TB100225).
[3] Richard S. Sutton,et al. Introduction to Reinforcement Learning , 1998 .
[4] Martin Zinkevich,et al. Online Convex Programming and Generalized Infinitesimal Gradient Ascent , 2003, ICML.
[5] Y. Shtarkov,et al. The context-tree weighting method: basic properties , 1995, IEEE Trans. Inf. Theory.
[6] Dr. Marcus Hutter,et al. Universal artificial intelligence , 2004 .
[7] Richard S. Sutton,et al. Reinforcement Learning: An Introduction , 1998, IEEE Trans. Neural Networks.
[8] Matthew V. Mahoney,et al. Adaptive weighing of context models for lossless data compression , 2005 .
[9] Marcus Hutter,et al. Universal Artificial Intelligence: Sequential Decisions Based on Algorithmic Probability (Texts in Theoretical Computer Science. An EATCS Series) , 2006 .
[10] Stephen P. Boyd,et al. Convex Optimization , 2004, Algorithms and Theory of Computation Handbook.
[11] Sanjeev Arora,et al. Efficient algorithms for online convex optimization and their applications , 2006 .
[12] Steven de Rooij,et al. Catching Up Faster in Bayesian Model Selection and Model Averaging , 2007, NIPS.
[13] Elad Hazan,et al. Logarithmic regret algorithms for online convex optimization , 2006, Machine Learning.
[14] Joel Veness,et al. Reinforcement Learning via AIXI Approximation , 2010, AAAI.
[15] Joel Veness,et al. A Monte-Carlo AIXI Approximation , 2009, J. Artif. Intell. Res..
[16] Yunmei Chen,et al. Projection Onto A Simplex , 2011, 1101.6081.
[17] Christopher Mattern. Mixing Strategies in Data Compression , 2012, 2012 Data Compression Conference.
[18] Joel Veness,et al. Context Tree Switching , 2011, 2012 Data Compression Conference.