Enhanced neural gas network for prototype-based clustering

In practical cluster analysis tasks, an efficient clustering algorithm should be less sensitive to parameter configurations and tolerate the existence of outliers. Based on the neural gas (NG) network framework, we propose an efficient prototype-based clustering (PBC) algorithm called enhanced neural gas (ENG) network. Several problems associated with the traditional PBC algorithms and original NG algorithm such as sensitivity to initialization, sensitivity to input sequence ordering and the adverse influence from outliers can be effectively tackled in our new scheme. In addition, our new algorithm can establish the topology relationships among the prototypes and all topology-wise badly located prototypes can be relocated to represent more meaningful regions. Experimental resultson synthetic and UCI datasets show that our algorithm possesses superior performance in comparison to several PBC algorithms and their improved variants, such as hard c-means, fuzzy c-means, NG, fuzzy possibilistic c-means, credibilistic fuzzy c-means, hard/fuzzy robust clustering and alternative hard/fuzzy c-means, in static data clustering tasks with a fixed number of prototypes.

[1]  Dong-Sik Jang,et al.  Extraction of major object features using VQ clustering for content-based image retrieval , 2002, Pattern Recognit..

[2]  Luigi Cinque,et al.  A clustering fuzzy approach for image segmentation , 2004, Pattern Recognit..

[3]  S. Haykin,et al.  Adaptive Filter Theory , 1986 .

[4]  James C. Bezdek,et al.  A mixed c-means clustering model , 1997, Proceedings of 6th International Fuzzy Systems Conference.

[5]  Hichem Frigui,et al.  A Robust Competitive Clustering Algorithm With Applications in Computer Vision , 1999, IEEE Trans. Pattern Anal. Mach. Intell..

[6]  Pavel Berkhin,et al.  A Survey of Clustering Data Mining Techniques , 2006, Grouping Multidimensional Data.

[7]  Douglas A. Baxter,et al.  Small networks of empirically derived adaptive elements simulate some higher-order features of classical conditioning , 1990, Neural Networks.

[8]  Xiaomin Liu,et al.  A Least Biased Fuzzy Clustering Method , 1994, IEEE Trans. Pattern Anal. Mach. Intell..

[9]  Hong Yan,et al.  A hybrid method for unconstrained handwritten numeral recognition by combining structural and neural "gas" classifiers , 2000, Pattern Recognit. Lett..

[10]  Paul S. Bradley,et al.  Refining Initial Points for K-Means Clustering , 1998, ICML.

[11]  Stanley C. Ahalt,et al.  Competitive learning algorithms for vector quantization , 1990, Neural Networks.

[12]  Catherine Blake,et al.  UCI Repository of machine learning databases , 1998 .

[13]  David G. Stork,et al.  Pattern Classification , 1973 .

[14]  Korris Fu-Lai Chung,et al.  Fuzzy competitive learning , 1994, Neural Networks.

[15]  Lakhmi C. Jain,et al.  Bioinformatics using computational intelligence paradigms , 2005 .

[16]  James C. Bezdek,et al.  Pattern Recognition with Fuzzy Objective Function Algorithms , 1981, Advanced Applications in Pattern Recognition.

[17]  Giulio Sandini,et al.  Neural-gas for function approximation: a heuristic for minimizing the local estimation error , 2000, Proceedings of the IEEE-INNS-ENNS International Joint Conference on Neural Networks. IJCNN 2000. Neural Computing: New Challenges and Perspectives for the New Millennium.

[18]  Thomas Martinetz,et al.  'Neural-gas' network for vector quantization and its application to time-series prediction , 1993, IEEE Trans. Neural Networks.

[19]  Hong Yan,et al.  A nonlinear neural network model of mixture of local principal component analysis: application to handwritten digits recognition , 2001, Pattern Recognit..

[20]  James M. Keller,et al.  Fuzzy Models and Algorithms for Pattern Recognition and Image Processing , 1999 .

[21]  Thomas Villmann,et al.  Supervised Neural Gas with General Similarity Measure , 2005, Neural Processing Letters.

[22]  Miin-Shen Yang,et al.  Alternative c-means clustering algorithms , 2002, Pattern Recognit..

[23]  H. L. Le Roy,et al.  Proceedings of the Fifth Berkeley Symposium on Mathematical Statistics and Probability; Vol. IV , 1969 .

[24]  James M. Keller,et al.  A possibilistic approach to clustering , 1993, IEEE Trans. Fuzzy Syst..

[25]  Jacek M. Leski,et al.  Towards a robust fuzzy clustering , 2003, Fuzzy Sets Syst..

[26]  Jongwoo Kim,et al.  Application of the least trimmed squares technique to prototype-based clustering , 1996, Pattern Recognit. Lett..

[27]  Anil K. Jain,et al.  Data clustering: a review , 1999, CSUR.

[28]  Jung-Hua Wang,et al.  A new robust clustering algorithm-density-weighted fuzzy c-means , 1999, IEEE SMC'99 Conference Proceedings. 1999 IEEE International Conference on Systems, Man, and Cybernetics (Cat. No.99CH37028).

[29]  Ponnuthurai N. Suganthan,et al.  Hierarchical overlapped neural gas network with application to pattern classification , 2000, Neurocomputing.

[30]  Frederik Questier,et al.  The Neural-Gas network for classifying analytical data , 2002 .

[31]  R. Zemel A minimum description length framework for unsupervised learning , 1994 .

[32]  M.C. Clark,et al.  MRI segmentation using fuzzy clustering techniques , 1994, IEEE Engineering in Medicine and Biology Magazine.

[33]  Mineichi Kudo,et al.  MDL-Based Selection of the Number of Components in Mixture Models for Pattern Classification , 1998, SSPR/SPR.

[34]  James C. Bezdek,et al.  Generalized fuzzy c-means clustering strategies using Lp norm distances , 2000, IEEE Trans. Fuzzy Syst..

[35]  Thomas Martinetz,et al.  Topology representing networks , 1994, Neural Networks.

[36]  Moshe Kam,et al.  The Credibilistic Fuzzy C Means Algorithm , 1998 .

[37]  Palma Blonda,et al.  A survey of fuzzy clustering algorithms for pattern recognition. I , 1999, IEEE Trans. Syst. Man Cybern. Part B.

[38]  Greg Hamerly,et al.  Learning the k in k-means , 2003, NIPS.

[39]  Jiang-She Zhang,et al.  Robust clustering by pruning outliers , 2003, IEEE Trans. Syst. Man Cybern. Part B.

[40]  J. MacQueen Some methods for classification and analysis of multivariate observations , 1967 .

[41]  Jorma Rissanen,et al.  Stochastic Complexity in Statistical Inquiry , 1989, World Scientific Series in Computer Science.

[42]  Barbara Hammer,et al.  Prototype based recognition of splice sites , 2005 .

[43]  Rajesh N. Davé,et al.  Characterization and detection of noise in clustering , 1991, Pattern Recognit. Lett..

[44]  Simon Haykin,et al.  Neural Networks: A Comprehensive Foundation , 1998 .

[45]  Teuvo Kohonen,et al.  Self-Organizing Maps , 2010 .