Abstract: | It is common practice in psychology to devise “measurement” procedures by imposing rating scales (e.g., Likert items) onto phenomena and treating the values they produce as quantities. The validity of these procedures goes untested. Validity checks are instead performed on sets of these measurement procedures (i.e., multi-item scales). We present results from three studies suggesting that people cannot be assumed to preserve transitivity when comparing themselves and others on NEO Neuroticism-domain trait items. As transitivity is one of the fundamental axioms of quantitative measurement, these studies challenge the validity of Neuroticism scales at the level of individual scale items. |