Clusteringn objects intok groups under optimal scaling of variables |
| |
Authors: | Stef van Buuren Willem J. Heiser |
| |
Affiliation: | (1) Department of Psychonomy, University of Utrecht, The Netherlands;(2) Department of Data Theory, University of Leiden, The Netherlands |
| |
Abstract: | We propose a method to reduce many categorical variables to one variable withk categories, or stated otherwise, to classifyn objects intok groups. Objects are measured on a set of nominal, ordinal or numerical variables or any mix of these, and they are represented asn points inp-dimensional Euclidean space. Starting from homogeneity analysis, also called multiple correspondence analysis, the essential feature of our approach is that these object points are restricted to lie at only one ofk locations. It follows that thesek locations must be equal to the centroids of all objects belonging to the same group, which corresponds to a sum of squared distances clustering criterion. The problem is not only to estimate the group allocation, but also to obtain an optimal transformation of the data matrix. An alternating least squares algorithm and an example are given.The authors thank Eveline Kroezen and Teije Euverman for their comments on a previous draft of this paper. |
| |
Keywords: | homogeneity analysis cluster analysis variable importance Groupals |
本文献已被 SpringerLink 等数据库收录! |
|