Data clustering using particle swarm optimization

This paper proposes two new approaches to using PSO to cluster data. It is shown how PSO can be used to find the centroids of a user specified number of clusters. The algorithm is then extended to use K-means clustering to seed the initial swarm. This second algorithm basically uses PSO to refine the clusters formed by K-means. The new PSO algorithms are evaluated on six data sets, and compared to the performance of K-means clustering. Results show that both PSO clustering techniques have much potential.

[1]  Frans van den Bergh,et al.  An analysis of particle swarm optimizers , 2002 .

[2]  Ali Zilouchian,et al.  FUNDAMENTALS OF NEURAL NETWORKS , 2001 .

[3]  Douglas H. Fisher,et al.  Knowledge Acquisition Via Incremental Conceptual Clustering , 1987, Machine Learning.

[4]  M. Rao Cluster Analysis and Mathematical Programming , 1971 .

[5]  B. Jaumard,et al.  Cluster Analysis and Mathematical Programming , 2003 .

[6]  Andries P. Engelbrecht,et al.  Image Classification using Particle Swarm Optimization , 2002, SEAL.

[7]  Vincent Kanade,et al.  Clustering Algorithms , 2021, Wireless RF Energy Transfer in the Massive IoT Era.

[8]  John A. Hartigan,et al.  Clustering Algorithms , 1975 .

[9]  D. E. Barton,et al.  Introduction to Mathematical Techniques in Pattern Recognition , 1973 .

[10]  Gavin Potgieter Mining continuous classes using evolutionary computing , 2003 .

[11]  E. Forgy Cluster analysis of multivariate data : efficiency versus interpretability of classifications , 1965 .

[12]  James Kennedy,et al.  Particle swarm optimization , 1995, Proceedings of ICNN'95 - International Conference on Neural Networks.

[13]  I. Evangelou,et al.  Data Mining and Knowledge Discovery in Complex Image Data using Artificial Neural Networks , 2001 .

[14]  Andries P. Engelbrecht Sensitivity analysis of multilayer neural networks , 1999 .

[15]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[16]  G H Ball,et al.  A clustering technique for summarizing multivariate data. , 1967, Behavioral science.