Minimizing data consumption with sequential online feature selection

In most real-world information processing problems, data is not a free resource; its acquisition is often expensive and time-consuming. We investigate how such cost factors can be included in supervised classification tasks by framing classification as a sequential decision process and making it accessible to reinforcement learning. Depending on the previously selected features and the internal belief of the classifier, the next feature is chosen by a sequential online feature selection method that learns which features are most informative at each time step. Experiments on toy datasets and a handwritten-digit classification task show a significant reduction in the data required for correct classification, while a medical diabetes prediction task demonstrates the minimization of variable feature costs as a further property of our algorithm.
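
To make the framing concrete, the sketch below casts feature acquisition as a small Markov decision process: the state is the set of features acquired so far, each action either buys one more feature at a fixed cost or stops and classifies from the current belief, and the reward trades classification accuracy against acquisition cost. This is a minimal illustration, not the authors' implementation: their setting suggests fitted Q-iteration with function approximation, whereas this stand-in uses a plain tabular epsilon-greedy Q-learner on a synthetic Gaussian task, and every name and constant here (sample, belief, FEATURE_COST, the class means in MU) is a hypothetical choice.

import numpy as np

rng = np.random.default_rng(0)

# Hypothetical toy task (a stand-in for the paper's toy datasets): two classes,
# four features, of which only features 0 and 1 carry class information.
N_FEATURES, FEATURE_COST = 4, 0.05
MU = np.array([[0.0, 0.0, 0.0, 0.0],    # class-conditional feature means,
               [2.0, -2.0, 0.0, 0.0]])  # assumed known to the belief model

def sample():
    y = int(rng.integers(0, 2))
    return MU[y] + rng.normal(0.0, 1.0, N_FEATURES), y

def belief(x, mask):
    # Naive-Bayes posterior over the two classes, using only acquired features.
    logp = np.zeros(2)
    for c in (0, 1):
        for i in range(N_FEATURES):
            if mask & (1 << i):
                logp[c] -= 0.5 * (x[i] - MU[c, i]) ** 2
    p = np.exp(logp - logp.max())
    return p / p.sum()

# Tabular Q-function over (acquired-feature bitmask, action). Actions
# 0..N_FEATURES-1 acquire that feature; action N_FEATURES classifies and stops.
Q = np.zeros((1 << N_FEATURES, N_FEATURES + 1))
ALPHA, GAMMA, EPS = 0.1, 0.99, 0.1

def episode(x, y, learn=True):
    mask = 0
    while True:
        valid = [i for i in range(N_FEATURES) if not mask & (1 << i)]
        valid.append(N_FEATURES)                 # classifying is always allowed
        if learn and rng.random() < EPS:
            a = valid[rng.integers(len(valid))]       # explore
        else:
            a = max(valid, key=lambda v: Q[mask, v])  # exploit
        if a == N_FEATURES:                      # terminal action: classify now
            pred = int(np.argmax(belief(x, mask)))
            r = 1.0 if pred == y else -1.0
            if learn:
                Q[mask, a] += ALPHA * (r - Q[mask, a])
            return r, bin(mask).count("1")
        nxt = mask | (1 << a)                    # acquire feature a, pay its cost
        if learn:
            Q[mask, a] += ALPHA * (-FEATURE_COST + GAMMA * Q[nxt].max()
                                   - Q[mask, a])
        mask = nxt

for _ in range(20000):                           # train
    episode(*sample())

results = [episode(*sample(), learn=False) for _ in range(1000)]
acc = np.mean([r > 0 for r, _ in results])
used = np.mean([k for _, k in results])
print(f"accuracy {acc:.2f} using {used:.2f} of {N_FEATURES} features on average")

Run as-is, the printed statistics show how many of the four features the greedy policy buys per episode; a policy that has learned the task should favor the two informative features and classify once its belief is confident, which is the data-saving behavior the abstract describes.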
