论文信息 - Structure of hierarchic clusterings: implications for information retrieval and for multivariate data analysis

Structure of hierarchic clusterings: implications for information retrieval and for multivariate data analysis

Abstract Hierarchic clustering methods may be used to condense information for a user, as they are in multivariate data analysis, or to achieve computational advantages, as they are in information retrieval. The structure of the hierarchic classification produced has a direct bearing on the effectiveness and utility of using cluster analysis, yet this important feature of the classification has only been implicitly referred to in the literature to date. In this study, three different coefficients are defined, each of which quantify the symmetry-asymmetry (balancedness-unbalancedness) of hierarchic clusterings on a scale from 0 to 1. Using examples of data from the areas of information retrieval and of multivariate data analysis, a number of hierarchic clustering methods are discussed in terms of the hierarchies they produce.

Fionn Murtagh | F. Murtagh

[1] Fionn Murtagh,et al. A Survey of Recent Advances in Hierarchical Clustering Algorithms , 1983, Comput. J..

[2] C. J. van Rijsbergen,et al. The use of hierarchic clustering in information retrieval , 1971, Inf. Storage Retr..

[3] Richard C. T. Lee. Clustering Analysis and Its Applications , 1981 .

[4] Karen Spärck Jones. Collection properties influencing automatic term classification performance , 1973, Inf. Storage Retr..

[5] Anil K. Jain,et al. Clustering Methodologies in Exploratory Data Analysis , 1980, Adv. Comput..

[6] R K Blashfield,et al. The Growth Of Cluster Analysis: Tryon, Ward, And Johnson. , 1980, Multivariate behavioral research.