KL based data fusion for target tracking

Visual object tracking in video can be formulated as a time varying appearance-based binary classification problem. Tracking algorithms need to adapt to changes in both foreground object appearance as well as varying scene backgrounds. Fusing information from multimodal features (views or representations) typically enhances classification performance without increasing classifier complexity when image features are concatenated to form a high-dimensional vector. Combining these representative views to effectively exploit multimodal information for classification becomes a key issue. We show that the Kullback-Leibler (KL) divergence measure provides a framework that leads to family of techniques for fusing representations including Cher-noff distance and variance ratio that is the same as linear discriminant analysis. We provide experimental results that corroborate well with our theoretical analysis.

[1]  R. Collins,et al.  On-line selection of discriminative tracking features , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[2]  Pavel Pudil,et al.  Introduction to Statistical Pattern Recognition , 2006 .

[3]  Yanxi Liu,et al.  Online Selection of Discriminative Tracking Features , 2005, IEEE Trans. Pattern Anal. Mach. Intell..

[4]  Robert T. Collins,et al.  Likelihood Map Fusion for Visual Object Tracking , 2008, 2008 IEEE Workshop on Applications of Computer Vision.

[5]  Fei Wang,et al.  Multi-View Local Learning , 2008, AAAI.

[6]  Steven P. Abney,et al.  Bootstrapping , 2002, ACL.

[7]  Robert P. W. Duin,et al.  Linear dimensionality reduction via a heteroscedastic extension of LDA: the Chernoff criterion , 2004, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[8]  Avrim Blum,et al.  The Bottleneck , 2021, Monopsony Capitalism.

[9]  Josef Kittler,et al.  Combining classifiers: A theoretical framework , 1998, Pattern Analysis and Applications.

[10]  James C. Bezdek,et al.  Decision templates for multiple classifier fusion: an experimental comparison , 2001, Pattern Recognit..

[11]  Pierre Isabelle,et al.  Proceedings of the 40th Annual Meeting on Association for Computational Linguistics , 2002, ACL 2002.

[12]  Nello Cristianini,et al.  Composite Kernels for Hypertext Categorisation , 2001, ICML.

[13]  Dmitrij Frishman,et al.  MIPS: a database for genomes and protein sequences , 1999, Nucleic Acids Res..

[14]  Mark Herbster,et al.  Combining Graph Laplacians for Semi-Supervised Learning , 2005, NIPS.

[15]  Mikhail Belkin,et al.  A Co-Regularization Approach to Semi-supervised Learning with Multiple Views , 2005 .

[16]  Tong Zhang,et al.  Linear prediction models with graph regularization for web-page categorization , 2006, KDD '06.

[17]  Chris H. Q. Ding,et al.  Linear Discriminant Analysis: New Formulations and Overfit Analysis , 2011, AAAI.

[18]  Huaiyu Zhu On Information and Sufficiency , 1997 .

[19]  Nello Cristianini,et al.  Kernel-Based Data Fusion and Its Application to Protein Function Prediction in Yeast , 2003, Pacific Symposium on Biocomputing.

[20]  David H. Wolpert,et al.  Stacked generalization , 1992, Neural Networks.