Repository logo
 
Loading...
Thumbnail Image
Publication

On clustering interval data with different scales of measures : experimental results

Use this identifier to reference this record.

Advisor(s)

Abstract(s)

Symbolic Data Analysis can be defined as the extension of standard data analysis to more complex data tables. We illustrate the application of the Ascendant Hierarchical Cluster Analysis (AHCA) to a symbolic data set (with a known structure) in the field of the automobile industry (car data set), in which objects are described by variables whose values are intervals of the real data set (interval variables). The AHCA of thirty-three car models, described by eight interval variables (with different scales of measure), was based on the standardized weighted generalized affinity coefficient, by the method of Wald and Wolfowitz. We applied three probabilistic aggregation criteria in the scope of the VL methodology (V for Validity, L for Linkage). Moreover, we compare the achieved results with those obtained by other authors, and with a priori partition into four clusters defined by the category (Utilitarian, Berlina, Sporting and Luxury) to which the car belong. We used the global statistics of levels (STAT) to evaluate the obtained partitions.

Description

This article is is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License. Attribution-NonCommercial (CC BY-NC) license lets others remix, tweak, and build upon work non-commercially, and although the new works must also acknowledge & be non-commercial.

Keywords

Ascendant Hierarchical Cluster Analysis Interval Data VL Methodology

Pedagogical Context

Citation

Sousa Á.; Bacelar-Nicolau H.; Nicolau F.C.; Silva O. (2015). On Clustering Interval Data with Different Scales of Measures: Experimental Results. "Asian Journal of Applied Science and Engineering", Vol. 4, Nº 1, pp. 17-25.

Research Projects

Research ProjectShow more

Organizational Units

Journal Issue