首页 | 本学科首页   官方微博 | 高级检索  
   检索      


An examination of indexes for determining the number of clusters in binary data sets
Authors:Evgenia Dimitriadou  Sara Dolničar  Andreas Weingessel
Institution:(1) Institut für Statistik und Wahrscheinlichkeitstheorie, Technische Universität Wien, Austria;(2) Institut für Tourismus und Freizeitwirtschaft, Wirtschaftsuniversität wien, Austria
Abstract:The problem of choosing the correct number of clusters is as old as cluster analysis itself. A number of authors have suggested various indexes to facilitate this crucial decision. One of the most extensive comparative studies of indexes was conducted by Milligan and Cooper (1985). The present piece of work pursues the same goal under different conditions. In contrast to Milligan and Cooper's work, the emphasis here is on high-dimensional empirical binary data. Binary artificial data sets are constructed to reflect features typically encountered in real-world data situations in the field of marketing research. The simulation includes 162 binary data sets that are clustered by two different algorithms and lead to recommendations on the number of clusters for each index under consideration. Index results are evaluated and their performance is compared and analyzed.Author names are listed in alphabetical order.This piece of research was supported by the Austrian Science Foundation (FWF) under grant SFB#010 (ldquoAdaptive Information Systems and Modeling in Economics and Management Sciencerdquo).The authors would like to thank the anonymous reviewers and especially the associate editor for their helpful comments and suggestions.
Keywords:number of clusters  clustering indexes  binary data  artificial data sets  market segmentation
本文献已被 SpringerLink 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号