首页 | 本学科首页   官方微博 | 高级检索  
     


Wide-Coverage Probabilistic Sentence Processing
Authors:Matthew W. Crocker  Thorsten Brants
Affiliation:(1) Department of Computational Linguistics, Universität des Saarlandes, Saarbücken, Germany;(2) Department of Computational Linguistics, Universität des Saarlandes, Saarbücken, Germany
Abstract:This paper describes a fully implemented, broad-coverage model of human syntactic processing. The model uses probabilistic parsing techniques, which combine phrase structure, lexical category, and limited subcategory probabilities with an incremental, left-to-right ldquopruningrdquo mechanism based on cascaded Markov models. The parameters of the system are established through a uniform training algorithm, which determines maximum-likelihood estimates from a parsed corpus. The probabilistic parsing mechanism enables the system to achieve good accuracy on typical, ldquogarden-varietyrdquo language (i.e., when tested on corpora). Furthermore, the incremental probabilistic ranking of the preferred analyses during parsing also naturally explains observed human behavior for a range of garden-path structures. We do not make strong psychological claims about the specific probabilistic mechanism discussed here, which is limited by a number of practical considerations. Rather, we argue incremental probabilistic parsing models are, in general, extremely well suited to explaining this dual nature—generally good and occasionally pathological—of human linguistic performance.
Keywords:probabilistic parsing  frequency  Markov models
本文献已被 PubMed SpringerLink 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号