CICS/A - Artigos em Revistas Internacionais / Articles in International Journals
An article or editorial published in a scientific journal.
(Accepted; Published; Updated).
Browsing CICS/A - Artigos em Revistas Internacionais / Articles in International Journals by Author "Bacelar-Nicolau, Helena"
- Clustering an interval data set : are the main partitions similar to a priori partition?
  Publication. Sousa, Áurea; Bacelar-Nicolau, Helena; Nicolau, Fernando C.; Silva, Osvaldo
  In this paper we compare the best partitions of data units (cities) obtained from different algorithms of Ascendant Hierarchical Cluster Analysis (AHCA) of a well-known data set from the literature on symbolic data analysis (the “city temperature interval data set”) with an a priori partition of the cities given by a panel of human observers. The AHCA was based on the weighted generalized affinity coefficient with equal weights, and on the probabilistic coefficient associated with the asymptotic standardized weighted generalized affinity coefficient by the method of Wald and Wolfowitz. These similarity coefficients between elements were combined with three aggregation criteria: one classical, Single Linkage (SL), and two probabilistic, AV1 and AVB, the latter two within the scope of the VL methodology. The partitions were evaluated, in order to find the partitioning that best fits the underlying data, using validation measures based on the similarity matrices. In general, globally satisfactory results were obtained with our methods, with the best partitions quite close to (or even coinciding with) the a priori partition provided by the panel of human observers.
- On clustering interval data with different scales of measures : experimental results
  Publication. Sousa, Áurea; Bacelar-Nicolau, Helena; Nicolau, Fernando C.; Silva, Osvaldo
  Symbolic Data Analysis can be defined as the extension of standard data analysis to more complex data tables. We illustrate the application of Ascendant Hierarchical Cluster Analysis (AHCA) to a symbolic data set with a known structure from the automobile industry (the car data set), in which objects are described by variables whose values are intervals of real numbers (interval variables). The AHCA of thirty-three car models, described by eight interval variables with different scales of measure, was based on the standardized weighted generalized affinity coefficient, by the method of Wald and Wolfowitz. We applied three probabilistic aggregation criteria within the scope of the VL methodology (V for Validity, L for Linkage). Moreover, we compare the results achieved with those obtained by other authors, and with an a priori partition into four clusters defined by the category (Utilitarian, Berlina, Sporting and Luxury) to which each car belongs. We used the global statistics of levels (STAT) to evaluate the partitions obtained.
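
The general workflow described in the first abstract (agglomerative clustering of interval-valued units with Single Linkage, then comparison with an a priori partition) can be illustrated with a minimal sketch. It rests on several assumptions that are not taken from the papers: the Hausdorff distance between intervals is used as a simple stand-in dissimilarity instead of the weighted generalized affinity coefficient, the four units and their intervals are invented for illustration, and the function names `hausdorff`, `dissimilarity` and `single_linkage` are hypothetical.

```python
from itertools import combinations


def hausdorff(a, b):
    """Hausdorff distance between two closed intervals given as (low, high)."""
    return max(abs(a[0] - b[0]), abs(a[1] - b[1]))


def dissimilarity(x, y):
    """Stand-in dissimilarity (assumption): sum of per-variable Hausdorff distances."""
    return sum(hausdorff(a, b) for a, b in zip(x, y))


def single_linkage(units, n_clusters):
    """Merge the two closest clusters until n_clusters remain.

    Under Single Linkage, the distance between two clusters is the
    smallest dissimilarity between any pair of their members.
    """
    clusters = [{i} for i in range(len(units))]
    while len(clusters) > n_clusters:
        a, b = min(
            combinations(range(len(clusters)), 2),
            key=lambda p: min(
                dissimilarity(units[i], units[j])
                for i in clusters[p[0]]
                for j in clusters[p[1]]
            ),
        )
        clusters[a] |= clusters[b]  # a < b, so deleting b keeps index a valid
        del clusters[b]
    return sorted(sorted(c) for c in clusters)


# Invented data: four units, each described by two interval variables.
units = [
    [(-2, 4), (15, 25)],
    [(-1, 5), (16, 26)],
    [(10, 16), (24, 33)],
    [(11, 17), (25, 34)],
]
a_priori = [[0, 1], [2, 3]]  # hypothetical reference partition

partition = single_linkage(units, n_clusters=2)
print(partition, partition == a_priori)
```

Replacing `dissimilarity` with another coefficient, or the inner `min` with a different aggregation criterion, changes the hierarchy while leaving the merge loop unchanged, which is why the abstracts can combine several coefficients with several criteria.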