Automatic Hard Clustering Using Improved Differential Evolution Algorithm

This chapter describes a Differential Evolution (DE) based algorithm for the automatic clustering of large unlabeled datasets. In contrast to most of the existing clustering techniques, the proposed algorithm requires no prior knowledge of the data to be classified. Rather, it determines the optimal number of clusters in the data ‘on the run’. Superiority of the new method has been demonstrated by comparing it with two recently developed partitional clustering techniques and one popular hierarchical clustering algorithm. The partitional clustering algorithms are based on Genetic Algorithm (GA) and the Particle Swarm Optimization (PSO) algorithm. An interesting practical application of the proposed method to automatic segmentation of images is also illustrated.

[1]  Anil K. Jain,et al.  Data clustering: a review , 1999, CSUR.

[2]  Swagatam Das,et al.  Automatic Clustering Using an Improved Differential Evolution Algorithm , 2007 .

[3]  Andries Petrus Engelbrecht,et al.  A Cooperative approach to particle swarm optimization , 2004, IEEE Transactions on Evolutionary Computation.

[4]  J. Dunn Well-Separated Clusters and Optimal Fuzzy Partitions , 1974 .

[5]  Ravi Kothari,et al.  On finding the number of clusters , 1999, Pattern Recognit. Lett..

[6]  Catherine Blake,et al.  UCI Repository of machine learning databases , 1998 .

[7]  Mohan M. Trivedi,et al.  Low-Level Segmentation of Aerial Images with Fuzzy Clustering , 1986, IEEE Transactions on Systems, Man, and Cybernetics.

[8]  Donald Gustafson,et al.  Fuzzy clustering with a fuzzy covariance matrix , 1978, 1978 IEEE Conference on Decision and Control including the 17th Symposium on Adaptive Processes.

[9]  H. Edelsbrunner,et al.  Efficient algorithms for agglomerative hierarchical clustering methods , 1984 .

[10]  Andries Petrus Engelbrecht,et al.  Dynamic clustering using particle swarm optimization with application in image segmentation , 2006, Pattern Analysis and Applications.

[11]  Donald W. Bouldin,et al.  A Cluster Separation Measure , 1979, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[12]  R. Storn,et al.  Differential Evolution: A Practical Approach to Global Optimization (Natural Computing Series) , 2005 .

[13]  Amit Konar,et al.  Two improved differential evolution schemes for faster global search , 2005, GECCO '05.

[14]  Clark F. Olson,et al.  Parallel Algorithms for Hierarchical Clustering , 1995, Parallel Comput..

[15]  D. J. Newman,et al.  UCI Repository of Machine Learning Database , 1998 .

[16]  J. Bezdek Numerical taxonomy with fuzzy sets , 1974 .

[17]  Steven M. Lalonde,et al.  A First Course in Multivariate Statistics , 1997, Technometrics.

[18]  J. Bezdek Cluster Validity with Fuzzy Sets , 1973 .

[19]  C. S. Wallace,et al.  An Information Measure for Classification , 1968, Comput. J..

[20]  Amit Konar,et al.  Automatic image pixel clustering with an improved differential evolution , 2009, Appl. Soft Comput..

[21]  Andries P. Engelbrecht,et al.  Dynamic Clustering using Particle Swarm Optimization with Application in Unsupervised Image Classification , 2007 .

[22]  Julius T. Tou,et al.  Pattern Recognition Principles , 1974 .

[23]  G H Ball,et al.  A clustering technique for summarizing multivariate data. , 1967, Behavioral science.

[24]  Andries Petrus Engelbrecht,et al.  Particle swarm optimization method for image clustering , 2005, Int. J. Pattern Recognit. Artif. Intell..

[25]  Gerardo Beni,et al.  A Validity Measure for Fuzzy Clustering , 1991, IEEE Trans. Pattern Anal. Mach. Intell..

[26]  James C. Bezdek,et al.  Some new indexes of cluster validity , 1998, IEEE Trans. Syst. Man Cybern. Part B.

[27]  Ujjwal Maulik,et al.  Genetic clustering for automatic evolution of clusters and application to image classification , 2002, Pattern Recognit..

[28]  John H. Holland,et al.  Adaptation in Natural and Artificial Systems: An Introductory Analysis with Applications to Biology, Control, and Artificial Intelligence , 1992 .

[29]  William H. Press,et al.  Numerical Recipes in C The Art of Scientific Computing , 1995 .

[30]  M.-C. Su,et al.  A new cluster validity measure and its application to image compression , 2004, Pattern Analysis and Applications.

[31]  Sankar K. Pal,et al.  Fuzzy sets and decisionmaking approaches in vowel and speaker recognition , 1977 .