A computational model of word segmentation from continuous speech using transitional probabilities of atomic acoustic events |
| |
Authors: | Räsänen Okko |
| |
Affiliation: | Department of Signal Processing and Acoustics, Aalto University, School of Electrical Engineering, Espoo, Finland |
| |
Abstract: | Word segmentation from continuous speech is a difficult task that is faced by human infants when they start to learn their native language. Several studies indicate that infants might use several different cues to solve this problem, including intonation, linguistic stress, and transitional probabilities between subsequent speech sounds. In this work, a computational model for word segmentation and learning of primitive lexical items from continuous speech is presented. The model does not utilize any a priori linguistic or phonemic knowledge such as phones, phonemes or articulatory gestures, but computes transitional probabilities between atomic acoustic events in order to detect recurring patterns in speech. Experiments with the model show that word segmentation is possible without any knowledge of linguistically relevant structures, and that the learned ungrounded word models show a relatively high selectivity towards specific words or frequently co-occurring combinations of short words. |
| |
Keywords: | Unsupervised learning Language acquisition Word segmentation Distributional learning |
本文献已被 ScienceDirect PubMed 等数据库收录! |
|