Repository logo
 
Thumbnail Image
Publication

Measuring similarity of complex and heterogeneous data in clustering of large data sets

Use this identifier to reference this record.
Name:Description:Size:Format: 
BBE_29_2_009_FT (1).pdf143.7 KBAdobe PDF Download

Advisor(s)

Abstract(s)

Cluster analysis or classification usually concerns a set of exploratory multivariate data analysis methods and techniques for finding a clustering structure on a dataset. That may refer either to groups of statistical data units or to groups of variables. In this work we deal with a generalization of this paradigm concerning clustering of complex data described by three different types of variables, frequently present in a three-way context. We obtain compatible versions of the same affinity coefficient for measuring similarity between statistical data units described by those three types of variables. A global generalized similarity coefficient is analyzed for such kind of mixed data, often arising in data mining or knowledge mining.

Description

Copyright © 2009 Polish Academy of Sciences.

Keywords

Cluster Analysis Different Type Variables Similarity Coefficient Three-way Data

Citation

Bacelar-Nicolau, Helena; Nicolau, Fernando C.; Sousa, Áurea; Bacelar-Nicolau, Leonor (2009). "Measuring similarity of complex and heterogeneous data in clustering of large data sets", Biocybernetics and Biomedical Engineering, 29(2), 9-18. ISSN 0208-5216.

Research Projects

Organizational Units

Journal Issue

Publisher

Polish Academy of Sciences

CC License