Instability of hierarchical cluster analysis due to input order of the data: the PermuCLUSTER solution |
| |
Authors: | Willem A. van der Kloot, Alexander M. J. Spaans, Willem J. Heiser |
| |
Affiliation: | Department of Psychology, Faculty of Social and Behavioural Sciences, Leiden University, Leiden, Netherlands. vanderkloot@fsw.leidenuniv.nl |
| |
Abstract: | Hierarchical agglomerative cluster analysis (HACA) may yield different solutions under permutations of the input order of the data. This instability is caused by ties, either in the initial proximity matrix or arising during agglomeration. The authors recommend repeating the analysis on a large number of random permutations of the rows and columns of the proximity matrix and selecting a solution with the highest goodness of fit. This approach was implemented in an SPSS add-in, PermuCLUSTER, which can perform all HACA methods of SPSS. Analyses of two data sets show that (a) results are affected by input order, (b) instability in one method co-occurs with instability in other methods, and (c) some instability effects are more dramatic because they occur at higher agglomeration levels. |
| |
This article is indexed in PubMed and other databases. |
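The permutation procedure described in the abstract can be illustrated with a small sketch. This is not the PermuCLUSTER code: it assumes complete linkage as the HACA method and the cophenetic correlation as the goodness-of-fit measure (the abstract names neither), and the function names `complete_linkage`, `cophenetic_correlation`, and `permuted_search` are hypothetical. The toy data are four points on a line, chosen so that the initial proximity matrix contains a tie.

```python
import itertools
import random


def complete_linkage(dist, order):
    """Naive complete-linkage HACA. Pairs are scanned in input order and the
    first minimum wins, so tied distances are resolved by position."""
    clusters = [frozenset([i]) for i in order]
    coph = {}    # cophenetic distance: level at which two objects first join
    merges = []  # every cluster formed along the way
    while len(clusters) > 1:
        best = None
        for a, b in itertools.combinations(range(len(clusters)), 2):
            d = max(dist[i][j] for i in clusters[a] for j in clusters[b])
            if best is None or d < best[0]:  # strict '<': ties keep first pair
                best = (d, a, b)
        d, a, b = best
        for i in clusters[a]:
            for j in clusters[b]:
                coph[frozenset((i, j))] = d
        merged = clusters[a] | clusters[b]
        merges.append(merged)
        clusters = [c for k, c in enumerate(clusters) if k not in (a, b)]
        clusters.append(merged)
    return frozenset(merges), coph


def cophenetic_correlation(dist, coph):
    """Pearson correlation between input and cophenetic distances
    (assumed goodness-of-fit measure)."""
    pairs = list(itertools.combinations(range(len(dist)), 2))
    x = [dist[i][j] for i, j in pairs]
    y = [coph[frozenset(p)] for p in pairs]
    mx, my = sum(x) / len(x), sum(y) / len(y)
    sxy = sum((u - mx) * (v - my) for u, v in zip(x, y))
    sxx = sum((u - mx) ** 2 for u in x)
    syy = sum((v - my) ** 2 for v in y)
    return sxy / (sxx * syy) ** 0.5


def permuted_search(dist, n_perm=200, seed=0):
    """Cluster under many random input orders; return each distinct
    solution with its fit, plus a best-fitting solution."""
    rng = random.Random(seed)
    order = list(range(len(dist)))
    solutions = {}
    for _ in range(n_perm):
        rng.shuffle(order)
        tree, coph = complete_linkage(dist, order)
        solutions[tree] = cophenetic_correlation(dist, coph)
    best = max(solutions, key=solutions.get)
    return solutions, best


# Toy data: points on a line, so d(0,1) == d(1,2) is a tie.
points = [0.0, 1.0, 2.0, 4.0]
dist = [[abs(p - q) for q in points] for p in points]

if __name__ == "__main__":
    solutions, best = permuted_search(dist)
    for tree, fit in solutions.items():
        print(sorted(sorted(c) for c in tree), round(fit, 3))
    print("selected:", sorted(sorted(c) for c in best))
```

In this sketch the input order alone decides whether objects 0 and 1 or objects 1 and 2 are merged first, which leads to different hierarchies with different fit values. Keeping every distinct solution, rather than only the best one, makes the extent of the instability visible.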
|