首页 | 本学科首页   官方微博 | 高级检索  
     


Fluency Profiling System: An automated system for analyzing the temporal properties of speech
Authors:Daniel R. Little  Raoul Oehmen  John Dunn  Kathryn Hird  Kim Kirsner
Affiliation:1. Psychological Sciences, University of Melbourne, Parkville, Victoria, 3010, Australia
2. University of Western Australia, Crawley, Western Australia
3. University of Adelaide, Adelaide, South Australia
4. University of Notre Dame, Fremantle, Western Australia
Abstract:The temporal characteristics of speech can be captured by examining the distributions of the durations of measurable speech components, namely speech segment durations and pause durations. However, several barriers prevent the easy analysis of pause durations: The first problem is that natural speech is noisy, and although recording contrived speech minimizes this problem, it also discards diagnostic information about cognitive processes inherent in the longer pauses associated with natural speech. The second issue concerns setting the distribution threshold, and consists of the problem of appropriately classifying pause segments as either short pauses reflecting articulation or long pauses reflecting cognitive processing, while minimizing the overall classification error rate. This article describes a fully automated system for determining the locations of speech–pause transitions and estimating the temporal parameters of both speech and pause distributions in natural speech. We use the properties of Gaussian mixture models at several stages of the analysis, in order to identify theoretical components of the data distributions, to classify speech components, to compute durations, and to calculate the relevant statistics.
Keywords:
本文献已被 SpringerLink 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号