An investigation into the effect of unlabeled neurons on Self-Organizing Maps

Self-Organizing Maps (SOMs) are unsupervised neural networks that build data models. Neuron labeling attaches descriptive textual labels to the neurons making up a SOM, and is an important component of SOM-based exploratory data analysis (EDA) and data mining (DM). Several neuron labeling approaches tend to leave some neurons unlabeled. The interaction between unlabeled neurons and SOM model accuracy affect the choice of labeling algorithm for SOM-based EDA and DM, but has not been previously investigated. This paper applies the widely used example-centric neuron labeling algorithm to several classification problems, and empirically investigates the relationship between the percentage of neurons left unlabeled and classification accuracy. Practical recommendations are also presented, which address the treatment of unlabeled neurons and the selection of an appropriate neuron labeling algorithm.

[1]  Andreas Rauber,et al.  Automatic Labeling of Self-Organizing Maps: Making a Treasure-Map Reveal Its Secrets , 1999, PAKDD.

[2]  Shan Ling Pan,et al.  Improved SOM Labeling Methodology for Data Mining Applications , 2008, Soft Computing for Knowledge Discovery and Data Mining.

[3]  Samuel Kaski,et al.  Bibliography of Self-Organizing Map (SOM) Papers: 1981-1997 , 1998 .

[4]  Ben Shneiderman,et al.  Readings in information visualization - using vision to think , 1999 .

[5]  Teuvo Kohonen,et al.  Self-Organizing Maps , 2010 .

[6]  Alfred Ultsch,et al.  Automatic Acquisition of Symbolic Knowledge from Subsymbolic Neural Networks , 1993 .

[7]  Timo Honkela,et al.  BIBLIOGRAPHY OF SELF-ORGANIZING MAP (SOM) PAPERS: 2002-2005 ADDENDUM , 2009 .

[8]  Teuvo Kohonen,et al.  Self-organization and associative memory: 3rd edition , 1989 .

[9]  C. Spearman The proof and measurement of association between two things. , 2015, International journal of epidemiology.

[10]  Andries Petrus Engelbrecht,et al.  A comparison of map neuron labeling approaches for unsupervised self-organizing feature maps , 2008, 2008 IEEE International Joint Conference on Neural Networks (IEEE World Congress on Computational Intelligence).

[11]  T. Kohonen Self-organized formation of topology correct feature maps , 1982 .

[12]  T. Kohonen,et al.  Visual Explorations in Finance with Self-Organizing Maps , 1998 .

[13]  Anil K. Jain,et al.  Data clustering: a review , 1999, CSUR.

[14]  Andries Petrus Engelbrecht,et al.  HybridSOM: A generic rule extraction framework for self-organizing feature maps , 2009, 2009 IEEE Symposium on Computational Intelligence and Data Mining.

[15]  Esa Alhoniemi Analysis of pulping data using the self-organizing map , 2000 .

[16]  Carlos Serrano-Cinca,et al.  Self organizing neural networks for financial diagnosis , 1996, Decision Support Systems.

[17]  T. Kohonen,et al.  Bibliography of Self-Organizing Map SOM) Papers: 1998-2001 Addendum , 2003 .

[18]  Andries Petrus Engelbrecht,et al.  Unsupervised Weight-Based Cluster Labeling for Self-Organizing Maps , 2012, WSOM.

[19]  Teuvo Kohonen,et al.  Visual Explorations in Finance , 1998 .

[20]  Stefan Wermter,et al.  Data mining using rule extraction from Kohonen self-organising maps , 2006, Neural Computing & Applications.

[21]  Lemuel A Moyé,et al.  Multiple Analyses in Clinical Trials: Fundamentals for Investigators , 2003 .

[22]  Nelis Franken,et al.  Visual exploration of algorithm parameter space , 2009, 2009 IEEE Congress on Evolutionary Computation.

[23]  Jana Tuckova The possibility of kohonen self-organizing map applications in medicine , 2013, 2013 IEEE 11th International Workshop of Electronics, Control, Measurement, Signals and their application to Mechatronics.

[24]  Sandhya Samarasinghe,et al.  Neural Networks for Applied Sciences and Engineering: From Fundamentals to Complex Pattern Recognition , 2006 .

[25]  Aaas News,et al.  Book Reviews , 1893, Buffalo Medical and Surgical Journal.

[26]  I. Sobol On the distribution of points in a cube and the approximate evaluation of integrals , 1967 .

[27]  Juha Vesanto,et al.  Data exploration process based on the self-organizing map , 2002 .

[28]  Catherine Blake,et al.  UCI Repository of machine learning databases , 1998 .

[29]  Jan D. Miller,et al.  Agglomeration and magnetic deinking for office paper , 2000 .

[30]  A. Corradini,et al.  A hybrid stochastic-connectionist architecture for gesture recognition , 1999, Proceedings 1999 International Conference on Information Intelligence and Systems (Cat. No.PR00446).