Name: | Description: | Size: | Format: | |
---|---|---|---|---|
527.48 KB | Adobe PDF |
Advisor(s)
Abstract(s)
The discovery of knowledge in the case of Hierarchical Cluster Analysis (HCA) depends on many factors, such as the clustering algorithms applied and the strategies developed in the initialstage of Cluster Analysis. We present a global approach for evaluating the quality of clustering results and making a comparison among different clustering algorithms using the relevant information available (e.g. the stability, isolation and homogeneity of the clusters). In addition, we present a visual method to facilitate evaluation of the quality of the partitions, allowing identification of the similarities and differences between partitions, as well as the behaviour of the elements in the partitions. We illustrate our approach using a complex and heterogeneous dataset (real horse data) taken from the literature. We apply HCA based on the generalized affinity coefficient (similarity coefficient) to the case of complex data (symbolic data), combined with 26 (classic and probabilistic) clustering algorithms. Finally, we discuss the obtained results and the contribution of this approach to gaining better knowledge of the structure of data.
Description
Copyright © 2012 Walter de Gruyter GmbH.
Keywords
Cluster Analysis VL Methodology Affinity Coefficient Comparing Partitions Cluster Stability Cluster Validation
Pedagogical Context
Citation
Silva, Osvaldo; Bacelar-Nicolau, Helena; Nicolau, Fernando, C. (2012). "A global Approach to the Comparison of Clustering Results", Biometrical Letters, 49(2), 135-147. ISSN (Print) 1896-3811, DOI: 10.2478/bile-2013-0010.
Publisher
Walter de Gruyter