Bootstrapping the lexicon: a computational model of infant speech segmentation |
| |
Authors: | Batchelder Eleanor Olds |
| |
Affiliation: | The Graduate Center of the City University of New York, New York, USA. eob@post.harvard.edu |
| |
Abstract: | Prelinguistic infants must find a way to isolate meaningful chunks from the continuous streams of speech that they hear. BootLex, a new model which uses distributional cues to build a lexicon, demonstrates how much can be accomplished using this single source of information. This conceptually simple probabilistic algorithm achieves significant segmentation results on various kinds of language corpora - English, Japanese, and Spanish; child- and adult-directed speech, and written texts; and several variations in coding structure - and reveals which statistical characteristics of the input have an influence on segmentation performance. BootLex is then compared, quantitatively and qualitatively, with three other groups of computational models of the same infant segmentation process, paying particular attention to functional characteristics of the models and their similarity to human cognition. Commonalities and contrasts among the models are discussed, as well as their implications both for theories of the cognitive problem of segmentation itself, and for the general enterprise of computational cognitive modeling. |
| |
Keywords: | Speech segmentation Infants Computational algorithm Probabilistic cues Cognitive models |
本文献已被 ScienceDirect PubMed 等数据库收录! |
|