Gaussian model‐based partitioning using iterated local search |
| |
Authors: | Michael J. Brusco Emilie Shireman Douglas Steinley Susan Brudvig J. Dennis Cradit |
| |
Affiliation: | 1. Florida State University, Tallahassee, Florida, USA;2. University of Missouri, Columbia, Missouri, USA;3. Indiana University East, Richmond, Indiana, USA |
| |
Abstract: | The emergence of Gaussian model‐based partitioning as a viable alternative to K‐means clustering fosters a need for discrete optimization methods that can be efficiently implemented using model‐based criteria. A variety of alternative partitioning criteria have been proposed for more general data conditions that permit elliptical clusters, different spatial orientations for the clusters, and unequal cluster sizes. Unfortunately, many of these partitioning criteria are computationally demanding, which makes the multiple‐restart (multistart) approach commonly used for K‐means partitioning less effective as a heuristic solution strategy. As an alternative, we propose an approach based on iterated local search (ILS), which has proved effective in previous combinatorial data analysis contexts. We compared multistart, ILS and hybrid multistart–ILS procedures for minimizing a very general model‐based criterion that assumes no restrictions on cluster size or within‐group covariance structure. This comparison, which used 23 data sets from the classification literature, revealed that the ILS and hybrid heuristics generally provided better criterion function values than the multistart approach when all three methods were constrained to the same 10‐min time limit. In many instances, these differences in criterion function values reflected profound differences in the partitions obtained. |
| |
Keywords: | clustering model‐based partitioning heuristics |
|
|