Automatic measurement of propositional idea density from part-of-speech tagging |
| |
Authors: | Cati Brown Tony Snodgrass Susan J. Kemper Ruth Herman Michael A. Covington |
| |
Affiliation: | H5, San Francisco, California, USA. |
| |
Abstract: | The Computerized Propositional Idea Density Rater (CPIDR, pronounced "spider") is a computer program that determines the propositional idea density (P-density) of an English text automatically on the basis of part-of-speech tags. The key idea is that propositions correspond roughly to verbs, adjectives, adverbs, prepositions, and conjunctions. After tagging the parts of speech using MontyLingua (Liu, 2004), CPIDR applies numerous rules to adjust the count, such as combining auxiliary verbs with the main verb. A "speech mode" is provided in which CPIDR rejects repetitions and a wider range of fillers. CPIDR is a user-friendly Windows .NET application distributed as open-source freeware under GPL. Tested against human raters, it agrees with the consensus of two human raters better than the team of five raters agree with each other [r(80) = .97 vs. r(10) = .82, respectively]. |
| |
Keywords: | |
本文献已被 PubMed SpringerLink 等数据库收录! |
|