论文信息 - Multi-Label Classification: Inconsistency and Class Balanced K-Nearest Neighbor

Multi-Label Classification: Inconsistency and Class Balanced K-Nearest Neighbor

Many existing approaches employ one-vs-rest method to decompose a multi-label classification problem into a set of 2-class classification problems, one for each class. This method is valid in traditional single-label classification, it, however, incurs training inconsistency in multi-label classification, because in the latter a data point could belong to more than one class. In order to deal with this problem, in this work, we further develop classical K-Nearest Neighbor classifier and propose a novel Class Balanced K-Nearest Neighbor approach for multi-label classification by emphasizing balanced usage of data from all the classes. In addition, we also propose a Class Balanced Linear Discriminant Analysis approach to address high-dimensional multi-label input data. Promising experimental results on three broadly used multi-label data sets demonstrate the effectiveness of our approach.

[1] Jieping Ye,et al. Extracting shared subspace for multi-label classification , 2008, KDD.

[2] Naonori Ueda,et al. Single-shot detection of multiple categories of text using parametric mixture models , 2002, KDD.

[3] Chris H. Q. Ding,et al. Image annotation using multi-label correlated Green's function , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[4] Paul Over,et al. Evaluation campaigns and TRECVid , 2006, MIR '06.

[5] Pavel Pudil,et al. Introduction to Statistical Pattern Recognition , 2006 .

[6] Gang Chen,et al. Semi-supervised Multi-label Learning by Solving a Sylvester Equation , 2008, SDM.

[7] Grigorios Tsoumakas,et al. Multi-Label Classification of Music into Emotions , 2008, ISMIR.

[8] Keinosuke Fukunaga,et al. Introduction to statistical pattern recognition (2nd ed.) , 1990 .