Dynamic Sensing: Better Classification under Acquisition Constraints

In many machine learning applications the quality of the data is limited by resource constraints (may it be power, bandwidth, memory, ...). In such cases, the constraints are on the average resources allocated, therefore there is some control over each sample's quality. In most cases this option remains unused and the data's quality is uniform over the samples. In this paper we propose to actively allocate resources to each sample such that resources are used optimally overall. We propose a method to compute the optimal resource allocation. We further derive generalization bounds for the case where the problem's model is unknown. We demonstrate the potential benefit of this approach on both simulated and real-life problems.

[1]  Matt J. Kusner,et al.  Classifier cascades and trees for minimizing feature evaluation cost , 2014, J. Mach. Learn. Res..

[2]  Foster J. Provost,et al.  Active feature-value acquisition for classifier induction , 2004, Fourth IEEE International Conference on Data Mining (ICDM'04).

[3]  Daphne Koller,et al.  Active Classification based on Value of Classifier , 2011, NIPS.

[4]  Burr Settles,et al.  Active Learning Literature Survey , 2009 .

[5]  Theodore B. Trafalis,et al.  Robust support vector machines for classification and computational issues , 2007, Optim. Methods Softw..

[6]  Andrew McCallum,et al.  Selecting actions for resource-bounded information extraction using reinforcement learning , 2012, WSDM '12.

[7]  Lawrence Carin,et al.  Cost-sensitive feature acquisition and classification , 2007, Pattern Recognit..

[8]  Nuno Vasconcelos,et al.  Boosting Classifier Cascades , 2010, NIPS.

[9]  Alex Park,et al.  The MIT Mobile Device Speaker Verification Corpus: Data Collection and Preliminary Experiments , 2006, 2006 IEEE Odyssey - The Speaker and Language Recognition Workshop.

[10]  Dan Roth,et al.  Learning Active Classifiers , 1996, ICML.

[11]  Carla E. Brodley,et al.  Active Class Selection , 2007, ECML.

[12]  Wang Bingxi,et al.  Text-independent speaker recognition using support vector machine , 2001, 2001 International Conferences on Info-Tech and Info-Net. Proceedings (Cat. No.01EX479).

[13]  H. Sebastian Seung,et al.  Selective Sampling Using the Query by Committee Algorithm , 1997, Machine Learning.

[14]  Robert M. Gray,et al.  An Algorithm for Vector Quantizer Design , 1980, IEEE Trans. Commun..

[15]  S. Brendle,et al.  Calculus of Variations , 1927, Nature.

[16]  Douglas A. Reynolds,et al.  A Tutorial on Text-Independent Speaker Verification , 2004, EURASIP J. Adv. Signal Process..

[17]  T. Cipra Statistical Analysis of Time Series , 2010 .

[18]  Panagiotis G. Ipeirotis,et al.  Get another label? improving data quality and data mining using multiple, noisy labelers , 2008, KDD.

[19]  Shie Mannor,et al.  Robustness and Regularization of Support Vector Machines , 2008, J. Mach. Learn. Res..

[20]  David Heckerman,et al.  Troubleshooting Under Uncertainty , 1994 .

[21]  Foster J. Provost,et al.  Active Feature-Value Acquisition , 2009, Manag. Sci..

[22]  Laurent El Ghaoui,et al.  Robust Solutions to Least-Squares Problems with Uncertain Data , 1997, SIAM J. Matrix Anal. Appl..