Automatic shape independent clustering inspired by ant dynamics

Abstract This article describes a simple heuristic algorithm that can automatically detect any number of well-separated clusters, which may be of any shape e.g. convex and/or non-convex. This is in contrast to most of the existing clustering algorithms that assume a value for the number of clusters and/or a particular cluster structure. The algorithm draws inspiration from the dynamics of ants and iteratively partitions the dataset based on its proximity matrix. A runtime complexity analysis shows that the complexity of the algorithm is either quadratic or cubic with respect to the size of the dataset. It can detect outliers from the data and is also able to identify the situation when the data do not have any natural clusters at all. Promising results on both real and artificial datasets have been included to show the effectiveness of the proposed technique.

[1]  Weng-Kin Lai,et al.  Homogeneous Ants for Web Document Similarity Modeling and Categorization , 2002, Ant Algorithms.

[2]  Swagatam Das,et al.  Automatic Clustering Using an Improved Differential Evolution Algorithm , 2007 .

[3]  Thomas Stützle,et al.  Ant Colony Optimization , 2009, EMO.

[4]  James C. Bezdek,et al.  Pattern Recognition with Fuzzy Objective Function Algorithms , 1981, Advanced Applications in Pattern Recognition.

[5]  Pierre Hansen,et al.  Cluster analysis and mathematical programming , 1997, Math. Program..

[6]  Anil K. Jain,et al.  Artificial neural networks for feature extraction and multivariate data projection , 1995, IEEE Trans. Neural Networks.

[7]  Charles T. Zahn,et al.  Graph-Theoretical Methods for Detecting and Describing Gestalt Clusters , 1971, IEEE Transactions on Computers.

[8]  Giandomenico Spezzano,et al.  So-Grid: A self-organizing Grid featuring bio-inspired algorithms , 2008, TAAS.

[9]  Ujjwal Maulik,et al.  Fuzzy partitioning using a real-coded variable-length genetic algorithm for pixel classification , 2003, IEEE Trans. Geosci. Remote. Sens..

[10]  J. MacQueen Some methods for classification and analysis of multivariate observations , 1967 .

[11]  Sandra Paterlini,et al.  Evolutionary Approaches for Cluster Analysis , 2003 .

[12]  Luca Maria Gambardella,et al.  Ant colony system: a cooperative learning approach to the traveling salesman problem , 1997, IEEE Trans. Evol. Comput..

[13]  Emanuel Falkenauer,et al.  Genetic Algorithms and Grouping Problems , 1998 .

[14]  Ajith Abraham,et al.  Swarm Intelligence Algorithms for Data Clustering , 2008, Soft Computing for Knowledge Discovery and Data Mining.

[15]  Anil K. Jain,et al.  Data clustering: a review , 1999, CSUR.

[16]  Amit Konar,et al.  Metaheuristic Clustering , 2009, Studies in Computational Intelligence.

[17]  Marco Dorigo,et al.  Ant-based clustering: a comparative study of its relative performance with respect to k-means, average link and 1d-som , 2003 .

[18]  Pedro Pina,et al.  Self-Organized Data and Image Retrieval as a Consequence of Inter-Dynamic Synergistic Relationships in Artificial Ant Colonies , 2002, HIS.

[19]  Jean-Louis Deneubourg,et al.  The dynamics of collective sorting robot-like ants and ant-like robots , 1991 .

[20]  Michael K. Ng,et al.  A fuzzy k-modes algorithm for clustering categorical data , 1999, IEEE Trans. Fuzzy Syst..

[21]  F. Wilcoxon Individual Comparisons by Ranking Methods , 1945 .

[22]  Andries P. Engelbrecht,et al.  Dynamic Clustering using Particle Swarm Optimization with Application in Unsupervised Image Classification , 2007 .

[23]  James C. Bezdek,et al.  Generalized clustering networks and Kohonen's self-organizing scheme , 1993, IEEE Trans. Neural Networks.

[24]  David S. Johnson,et al.  The Traveling Salesman Problem: A Case Study in Local Optimization , 2008 .

[25]  Andries Petrus Engelbrecht,et al.  Particle swarm optimization method for image clustering , 2005, Int. J. Pattern Recognit. Artif. Intell..

[26]  Earl E. Swartzlander,et al.  Introduction to Mathematical Techniques in Pattern Recognition , 1973 .

[27]  Ujjwal Maulik,et al.  Genetic clustering for automatic evolution of clusters and application to image classification , 2002, Pattern Recognit..

[28]  Marcus Randall,et al.  Anti-pheromone as a Tool for Better Exploration of Search Space , 2002, Ant Algorithms.

[29]  Lawrence O. Hall,et al.  Fuzzy Ants and Clustering , 2007, IEEE Transactions on Systems, Man, and Cybernetics - Part A: Systems and Humans.

[30]  D. Snyers,et al.  New results on an ant-based heuristic for highlighting the organization of large graphs , 1999, Proceedings of the 1999 Congress on Evolutionary Computation-CEC99 (Cat. No. 99TH8406).

[31]  Francisco Herrera,et al.  A practical tutorial on the use of nonparametric statistical tests as a methodology for comparing evolutionary and swarm intelligence algorithms , 2011, Swarm Evol. Comput..

[32]  T. M. Lillesand,et al.  Remote Sensing and Image Interpretation , 1980 .

[33]  W. Peizhuang Pattern Recognition with Fuzzy Objective Function Algorithms (James C. Bezdek) , 1983 .

[34]  B. Jaumard,et al.  Cluster Analysis and Mathematical Programming , 2003 .

[35]  Marco Dorigo,et al.  Optimization, Learning and Natural Algorithms , 1992 .

[36]  Pascale Kuntz,et al.  A Stochastic Heuristic for Visualising Graph Clusters in a Bi-Dimensional Space Prior to Partitioning , 1999, J. Heuristics.

[37]  Brian Everitt,et al.  Cluster analysis , 1974 .

[38]  Baldo Faieta,et al.  Diversity and adaptation in populations of clustering ants , 1994 .

[39]  Julia Handl,et al.  Improved Ant-Based Clustering and Sorting , 2002, PPSN.

[40]  Francesco Masulli,et al.  Soft Computing Applications , 2003 .

[41]  E. Forgy Cluster analysis of multivariate data : efficiency versus interpretability of classifications , 1965 .

[42]  Catherine Blake,et al.  UCI Repository of machine learning databases , 1998 .

[43]  Richard O. Duda,et al.  Pattern classification and scene analysis , 1974, A Wiley-Interscience publication.

[44]  Anil K. Jain,et al.  Algorithms for Clustering Data , 1988 .

[45]  Marco Dorigo,et al.  Ant system: optimization by a colony of cooperating agents , 1996, IEEE Trans. Syst. Man Cybern. Part B.