Jürgen Schmidhuber,et al. Curious model-building control systems , 1991, [Proceedings] 1991 IEEE International Joint Conference on Neural Networks.
 J. Diamond. Zebras and the Anna Karenina Principle , 1994 .
 Martin A. Riedmiller,et al. Rprop - Description and Implementation Details , 1994 .
 David Andre,et al. Generalized Prioritized Sweeping , 1997, NIPS.
 Richard K. Belew,et al. New Methods for Competitive Coevolution , 1997, Evolutionary Computation.
 Yoshua Bengio,et al. Gradient-based learning applied to document recognition , 1998, Proc. IEEE.
 Andrew W. Moore,et al. Prioritized sweeping: Reinforcement learning with less data and less time , 2004, Machine Learning.
 Long Ji Lin,et al. Self-improving reactive agents based on reinforcement learning, planning and teaching , 1992, Machine Learning.
 Yann LeCun,et al. The mnist database of handwritten digits , 2005 .
 David J. Foster,et al. Reverse replay of behavioural sequences in hippocampal place cells during the awake state , 2006, Nature.
 Geoffrey E. Hinton,et al. To recognize shapes, first learn to generate images. , 2007, Progress in Brain Research.
 David A. McAllester,et al. A discriminatively trained, multiscale, deformable part model , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.
 L. Frank,et al. Rewarded Outcomes Enhance Reactivation of Experience in the Hippocampus , 2009, Neuron.
 Hado van Hasselt,et al. Double Q-learning , 2010, NIPS.
 Clément Farabet,et al. Torch7: A Matlab-like Environment for Machine Learning , 2011, NIPS 2011.
 Yi Sun,et al. Incremental Basis Construction from Temporal Difference Error , 2011, ICML.
 Alborz Geramifard,et al. Online Discovery of Feature Dependencies , 2011, ICML.
 Francisco Herrera,et al. A Review on Ensembles for the Class Imbalance Problem: Bagging-, Boosting-, and Hybrid-Based Approaches , 2012, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).
 Alex Graves,et al. Playing Atari with Deep Reinforcement Learning , 2013, ArXiv.
 Richard S. Sutton,et al. Planning by Prioritized Sweeping with Small Backups , 2013, ICML.
 Tom Schaul,et al. No more pesky learning rates , 2012, ICML.
 R. Sutton,et al. Surprise and Curiosity for Big Data Robotics , 2014 .
 Richard S. Sutton,et al. Weighted importance sampling for off-policy learning with linear function approximation , 2014, NIPS.
 D. Dupret,et al. Dopaminergic neurons promote hippocampal reactivation and spatial memory persistence , 2014, Nature Neuroscience.
 Honglak Lee,et al. Deep Learning for Real-Time Atari Game Play Using Offline Monte-Carlo Tree Search Planning , 2014, NIPS.
 D. Hassabis,et al. Hippocampal place cells construct reward related sequences through unexplored space , 2015, eLife.
 Sergey Levine,et al. Incentivizing Exploration In Reinforcement Learning With Deep Predictive Models , 2015, ArXiv.
 Laura A. Atherton,et al. Memory trace replay: the shaping of memory consolidation by neuromodulation , 2015, Trends in Neurosciences.
 Jimmy Ba,et al. Adam: A Method for Stochastic Optimization , 2014, ICLR.
 Shane Legg,et al. Massively Parallel Methods for Deep Reinforcement Learning , 2015, ArXiv.
 Shane Legg,et al. Human-level control through deep reinforcement learning , 2015, Nature.
 Marc G. Bellemare,et al. The Arcade Learning Environment: An Evaluation Platform for General Agents (Extended Abstract) , 2012, IJCAI.
 Tom Schaul,et al. Dueling Network Architectures for Deep Reinforcement Learning , 2015, ICML.
 Marc G. Bellemare,et al. Increasing the Action Gap: New Operators for Reinforcement Learning , 2015, AAAI.