Object tracking using discriminative sparse appearance model

Object tracking based on sparse representation formulates tracking as searching for the candidate with minimal reconstruction error in the target template subspace. The key problem lies in modeling the target so that the model is robust to varying appearances. The appearance model in most sparsity-based trackers has two main problems. The first is that global structural information and local features are insufficiently combined because the appearance is modeled separately by holistic and local sparse representations. The second is that the discriminative information between the target and the background is not fully utilized because the background is rarely considered in modeling. In this study, we develop a robust visual tracking algorithm that models the target with a discriminative sparse appearance model. A discriminative dictionary is trained from the local target patches and the background. The patches capture the local features, while their position distribution implies the global structure of the target. Thus, the learned dictionary can represent both the local and the global appearance of the target. Incorporating the background into dictionary learning also enhances the discriminative capability of the dictionary. After modeling the target as a sparse coding histogram based on this learned dictionary, we embed the tracker into a Bayesian state inference framework to locate the target. We also present a model update scheme in which the update rate is adjusted automatically. In conjunction with this update strategy, the proposed tracker can handle occlusion and alleviate drifting. Comparative results on challenging benchmark image sequences show that the proposed tracking method performs favorably against several state-of-the-art algorithms.
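As an illustration of the sparse coding histogram idea, the following is a minimal sketch and not the implementation used in this study. It assumes a dictionary `D` that has already been learned, computes the sparse codes with plain ISTA (a generic L1 solver chosen here for brevity), and compares histograms with histogram intersection as a stand-in for the similarity coefficient. The function names, the parameter `lam`, and the random data are all hypothetical.

```python
import numpy as np

def ista_sparse_codes(D, X, lam=0.1, n_iter=100):
    """Approximately solve min_A 0.5*||X - D A||_F^2 + lam*||A||_1 with ISTA.

    D: (d, K) dictionary, X: (d, N) vectorized patches, returns A: (K, N).
    """
    L = np.linalg.norm(D, 2) ** 2          # Lipschitz constant of the gradient
    A = np.zeros((D.shape[1], X.shape[1]))
    for _ in range(n_iter):
        grad = D.T @ (D @ A - X)           # gradient of the quadratic term
        Z = A - grad / L                   # gradient step
        A = np.sign(Z) * np.maximum(np.abs(Z) - lam / L, 0.0)  # soft threshold
    return A

def coding_histogram(D, patches, lam=0.1):
    """Pool absolute sparse codes over all patches into a normalized histogram."""
    A = ista_sparse_codes(D, patches, lam)
    h = np.abs(A).sum(axis=1)
    total = h.sum()
    if total == 0:                         # all codes zero: fall back to uniform
        return np.full(h.shape, 1.0 / h.size)
    return h / total

def histogram_intersection(h1, h2):
    """Assumed similarity measure in [0, 1] for two normalized histograms."""
    return float(np.minimum(h1, h2).sum())

# Hypothetical data: 32 atoms of dimension 16, 9 patches per region.
rng = np.random.default_rng(0)
D = rng.standard_normal((16, 32))
D /= np.linalg.norm(D, axis=0)             # unit-norm atoms
target_hist = coding_histogram(D, rng.standard_normal((16, 9)))
candidate_hist = coding_histogram(D, rng.standard_normal((16, 9)))
score = histogram_intersection(target_hist, candidate_hist)
```

In a particle-filter style tracker, a score of this kind could serve as the observation likelihood of each candidate state, with the highest-scoring candidate taken as the tracking result.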
Highlights
- A dictionary is learned by comprehensively considering both representation capability and discriminative strength.
- The target appearance is modeled as a discriminative sparse model.
- A similarity coefficient is defined to measure the similarity between the target model and the candidate.
- An update strategy is proposed to reduce the adverse effects caused by appearance changes.
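One way an automatically adjusted update rate might look is sketched below. This is only an assumed rule for illustration, since the update scheme is not specified here: the rate grows linearly with the similarity between the model and the tracked result, and drops to zero below a threshold so that a likely occluded result does not contaminate the model. The names `adaptive_update`, `low`, and `max_rate` are hypothetical.

```python
import numpy as np

def adaptive_update(model, observed, similarity, low=0.3, max_rate=0.2):
    """Blend the observed histogram into the model with a similarity-driven rate.

    Assumed rule: no update when similarity < low (possible occlusion);
    otherwise the rate rises linearly from 0 up to max_rate at similarity 1.
    Returns the renormalized model and the rate that was used.
    """
    if not 0.0 <= similarity <= 1.0:
        raise ValueError("similarity must lie in [0, 1]")
    if similarity < low:
        rate = 0.0
    else:
        rate = max_rate * (similarity - low) / (1.0 - low)
    updated = (1.0 - rate) * model + rate * observed
    return updated / updated.sum(), rate

# Hypothetical three-bin histograms.
model = np.array([0.5, 0.5, 0.0])
observed = np.array([0.0, 0.5, 0.5])
kept, rate_low = adaptive_update(model, observed, similarity=0.1)    # model kept
blended, rate_high = adaptive_update(model, observed, similarity=1.0)
```

Freezing the model at low similarity is what limits drift in this sketch; the threshold and the maximum rate would need tuning for each sequence.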
