首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
A continuous speech message alternated between the left and right ears retains generally good intelligibility, except at certain critical rates of alternation of about 3–4 switching cycles/sec. In the present experiment, subjects heard speech alternated between the two ears at eight different switching frequencies, and at four different speech rates. Results support an earlier contention that the critical intelligibility parameter in alternated speech is average speech content per ear segment, rather than absolute time per ear. Implications are discussed both in terms of critical speech segments in auditory analysis and in neural processing of binaural auditory information.  相似文献   

2.
When speech is rapidly alternated between the two ears, intelligibility declines as the rate of alternation approaches 3 to 5 switching cycles per second, and then, paradoxically, returns to a good level beyond that point. We tested intelligibility when shadowing was used as a response measure (Experiment 1), when recall was used as a response measure (Experiment 2), and when time-compression was used to vary the speech rate of the presented materials (Experiment 3). In spite of claims that older adults are generally slower in switching attention, younger and older adults did not differ in the critical alternation rates producing minimal intelligibility. We suggest that the point of minimal intelligibility in alternated speech reflects an interaction between (1) the rate of disruption induced by breaking the speech stream between two sound sources, (2) the amount of contextual information per ear, and (3) the size of the silent gaps separating the speech elements that must be perceptually bridged.  相似文献   

3.
An automated threshold method has been developed for determining the maximum rate of speech understood by individual listeners. Two experiments were undertaken to determine whether the threshold was related to the comprehension of speech or to speech intelligibility. The first experiment compared thresholds of two types of rapid speech reportedly different in intelligibility: simple speeded speech and speech compressed by the sampling method. The second experiment sought to determine the relationship of the threshold to traditional comprehension measures. The results are discussed in terms of the intelligibility and comprehensibility of speech.  相似文献   

4.
In order to function effectively as a means of communication, speech must be intelligible under the noisy conditions encountered in everyday life. Two types of perceptual synthesis have been reported that can reduce or cancel the effects of masking by extraneous sounds: Phonemic restoration can enhance intelligibility when segments are replaced or masked by noise, and contralateral induction can prevent mislateralization by effectively restoring speech masked at one ear when it is heard in the other. The present study reports a third type of perceptual synthesis induced by noise: enhancement of intelligibility produced by adding noise to spectral gaps. In most of the experiments, the speech stimuli consisted of two widely separated narrow bands of speech (center frequencies of 370 and 6000 Hz, each band having high-pass and low-pass slopes of 115 dB/octave meeting at the center frequency). These very narrow bands effectively reduced the available information to frequency-limited patterns of amplitude fluctuation lacking information concerning formant structure and frequency transitions. When stochastic noise was introduced into the gap separating the two speech bands, intelligibility increased for “everyday” sentences, for sentences that varied in the transitional probability of keywords, and for monosyllabic word lists. Effects produced by systematically varying noise amplitude and noise bandwidth are reported, and the implications of some of the novel effects observed are discussed.  相似文献   

5.
Kim J  Sironic A  Davis C 《Perception》2011,40(7):853-862
Seeing the talker improves the intelligibility of speech degraded by noise (a visual speech benefit). Given that talkers exaggerate spoken articulation in noise, this set of two experiments examined whether the visual speech benefit was greater for speech produced in noise than in quiet. We first examined the extent to which spoken articulation was exaggerated in noise by measuring the motion of face markers as four people uttered 10 sentences either in quiet or in babble-speech noise (these renditions were also filmed). The tracking results showed that articulated motion in speech produced in noise was greater than that produced in quiet and was more highly correlated with speech acoustics. Speech intelligibility was tested in a second experiment using a speech-perception-in-noise task under auditory-visual and auditory-only conditions. The results showed that the visual speech benefit was greater for speech recorded in noise than for speech recorded in quiet. Furthermore, the amount of articulatory movement was related to performance on the perception task, indicating that the enhanced gestures made when speaking in noise function to make speech more intelligible.  相似文献   

6.
The relationship between subjective estimates of the comprehensibility of connected, freerunning speech and rate of speech was investigated for each of two types of time-compressed speech: pitch-varying speeded speech and pitch-normalized compressed speech. The midpoints of the resulting functions approximated the values obtained by a previously described speechrate tracking method. For equivalent degrees of comprehensibility, rates were higher for compressed speech than for speeded speech, indicating that estimates are sensitive to the intelligibility of speech. Subjective estimates of comprehensibility of time-compressed speech provide a means of assessing the itelligibility of connected speech.  相似文献   

7.
Understanding low-intelligibility speech is effortful. In three experiments, we examined the effects of intelligibility on working memory (WM) demands imposed by perception of synthetic speech. In all three experiments, a primary speeded word recognition task was paired with a secondary WM-load task designed to vary the availability of WM capacity during speech perception. Speech intelligibility was varied either by training listeners to use available acoustic cues in a more diagnostic manner (as in Experiment 1) or by providing listeners with more informative acoustic cues (i.e., better speech quality, as in Experiments 2 and 3). In the first experiment, training significantly improved intelligibility and recognition speed; increasing WM load significantly slowed recognition. A significant interaction between training and load indicated that the benefit of training on recognition speed was observed only under low memory load. In subsequent experiments, listeners received no training; intelligibility was manipulated by changing synthesizers. Improving intelligibility without training improved recognition accuracy, and increasing memory load still decreased it, but more intelligible speech did not produce more efficient use of available WM capacity. This suggests that perceptual learning modifies the way available capacity is used, perhaps by increasing the use of more phonetically informative features and/or by decreasing use of less informative ones.  相似文献   

8.
This study assessed intelligibility in a dysarthric patient with Parkinson's disease (PD) across five speech production tasks: spontaneous speech, repetition, reading, repeated singing, and spontaneous singing, using the same phrases for all but spontaneous singing. The results show that this speaker was significantly less intelligible when speaking spontaneously than in the other tasks. Acoustic analysis suggested that relative intensity and word duration were not independently linked to intelligibility, but dysfluencies (from perceptual analysis) and articulatory/resonance patterns (from acoustic records) were related to intelligibility in predictable ways. These data indicate that speech production task may be an important variable to consider during the evaluation of dysarthria. As speech production efficiency was found to vary with task in a patient with Parkinson's disease, these results can be related to recent models of basal ganglia function in motor performance.  相似文献   

9.
When deleted segments of speech are replaced by extraneous sounds rather than silence, the missing speech fragments may be perceptually restored and intelligibility improved. This phonemic restoration (PhR) effect has been used to measure various aspects of speech processing, with deleted portions of speech typically being replaced by stochastic noise. However, several recent studies of PhR have used speech-modulated noise, which may provide amplitude-envelope cues concerning the replaced speech. The present study compared the effects upon intelligibility of replacing regularly spaced portions of speech with stochastic (white) noise versus speech-modulated noise. In Experiment 1, filling periodic gaps in sentences with noise modulated by the amplitude envelope of the deleted speech fragments produced twice the intelligibility increase obtained with interpolated stochastic noise. Moreover, when lists of isolated monosyllables were interrupted in Experiment 2, interpolation of speech-modulated noise increased intelligibility whereas stochastic noise reduced intelligibility. The augmentation of PhR produced by modulated noise appeared without practice, suggesting that speech processing normally involves not only a narrowband analysis of spectral information but also a wideband integration of amplitude levels across critical bands. This is of considerable theoretical interest, but it also suggests that since PhRs produced by speech-modulated noise utilize potent bottom-up cues provided by the noise, they differ from the PhRs produced by extraneous sounds, such as coughs and stochastic noise.  相似文献   

10.
14 mothers of children who were deaf or hard of hearing provided magnitude estimation scaling responses for the speech intelligibility and speech annoyance of narrative speech samples produced by children who were deaf or hard of hearing. Analysis indicated that listeners scaled intelligibility and annoyance the same. As samples became more difficult to understand, they also became more annoying to these listeners. Implications for further research are discussed.  相似文献   

11.
The purpose of this investigation was to judge whether the Lombard effect, a characteristic change in the acoustical properties of speech produced in noise, existed in adductor spasmodic dysphonia speech, and if so, whether the effect added to or detracted from speaker intelligibility. Intelligibility, as described by Duffy, is the extent to which the acoustic signal produced by a speaker is understood by a listener based on the auditory signal alone. Four speakers with adductor spasmodic dysphonia provided speech samples consisting of low probability sentences from the Speech Perception in Noise test to use as stimuli. The speakers were first tape-recorded as they read the sentences in a quiet speaking condition and were later tape-recorded as they read the same sentences while exposed to background noise. The listeners used as subjects in this study were 50 undergraduate university students. The results of the statistical analysis indicated a significant difference between the intelligibility of the speech recorded in the quiet versus noise conditions (F(1,49) = 57.80, p < or = .001). It was concluded that a deleterious Lombard effect existed for the adductor spasmodic dysphonia speaker group, with the premise that the activation of a Lombard effect in such patients may detract from their overall speech intelligibility.  相似文献   

12.
We investigate the hypothesis that infant-directed speech is a form of hyperspeech, optimized for intelligibility, by focusing on vowel devoicing in Japanese. Using a corpus of infant-directed and adult-directed Japanese, we show that speakers implement high vowel devoicing less often when speaking to infants than when speaking to adults, consistent with the hyperspeech hypothesis. The same speakers, however, increase vowel devoicing in careful, read speech, a speech style which might be expected to pattern similarly to infant-directed speech. We argue that both infant-directed and read speech can be considered listener-oriented speech styles—each is optimized for the specific needs of its intended listener. We further show that in non-high vowels, this trend is reversed: speakers devoice more often in infant-directed speech and less often in read speech, suggesting that devoicing in the two types of vowels is driven by separate mechanisms in Japanese.  相似文献   

13.
The insertion of noise in the silent intervals of interrupted speech has a very striking perceptual effect if a certain signal-to-noise ratio is used. Conflicting reports have been published as to whether the inserted noise improves speech intelligibility or not. The major difference between studies was the level of redundancy in the speech material. We show in the present paper that the noise leads to a better intelligibility of interrupted speech. The redundancy level determines the possible amount of improvement. The consequences of our findings are discussed. in relation to such phenomena as continuity perception and pulsation threshold measurement. A hypothesis is formulated for the processing of interrupted stimuli with and without intervening noise: for stimuli presented with intervening noise, the presence in the auditory system of an automatic interpolation mechanism is assumed. The mechanism operates only if the noise makes it impossible to perceive the interruption.  相似文献   

14.
The role of rhythm in the speech intelligibility of 18 hearing-impaired children, aged 15 years with hearing losses from 40 to 108 db, was investigated. Their perceptual judgement of visual rhythm sequences was superior to that of the hearing controls, but their scores were not correlated with their speech intelligibility.  相似文献   

15.
Speech intelligibility performance with an in-the-ear microphone embedded in a custom-molded deep-insertion earplug was compared with results obtained using a free-field microphone. Intelligibility differences between microphones were further analyzed to assess whether reduced intelligibility was specific to certain sound classes. 36 participants completed the Modified Rhyme Test using recordings made with each microphone. While speech intelligibility for both microphones was highly accurate, intelligibility with the free-field microphone was significantly better than with the in-the-ear microphone. There were significant effects of place and manner of sound production. Significant differences in recognition among specific phonemes were also revealed. Implications included modifying the in-the-ear microphone to transmit more high frequency energy. Use of the in-the-ear microphone was limited by significant loss of high-frequency energy of the speech signal which resulted in reduced intelligibility for some sounds; however, the in-the-ear microphone is a promising technology for effective communication in military environments.  相似文献   

16.
Lip reading is the ability to partially understand speech by looking at the speaker's lips. It improves the intelligibility of speech in noise when audio-visual perception is compared with audio-only perception. A recent set of experiments showed that seeing the speaker's lips also enhances sensitivity to acoustic information, decreasing the auditory detection threshold of speech embedded in noise [J. Acoust. Soc. Am. 109 (2001) 2272; J. Acoust. Soc. Am. 108 (2000) 1197]. However, detection is different from comprehension, and it remains to be seen whether improved sensitivity also results in an intelligibility gain in audio-visual speech perception. In this work, we use an original paradigm to show that seeing the speaker's lips enables the listener to hear better and hence to understand better. The audio-visual stimuli used here could not be differentiated by lip reading per se since they contained exactly the same lip gesture matched with different compatible speech sounds. Nevertheless, the noise-masked stimuli were more intelligible in the audio-visual condition than in the audio-only condition due to the contribution of visual information to the extraction of acoustic cues. Replacing the lip gesture by a non-speech visual input with exactly the same time course, providing the same temporal cues for extraction, removed the intelligibility benefit. This early contribution to audio-visual speech identification is discussed in relationships with recent neurophysiological data on audio-visual perception.  相似文献   

17.
We studied speech intelligibility and memory performance for speech material heard under different signal‐to‐noise (S/N) ratios. Pre‐experimental measures of working memory capacity (WMC) were taken to explore individual susceptibility to the disruptive effects of noise. Thirty‐five participants first completed a WMC‐operation span task in quiet and later listened to spoken word lists containing 11 one‐syllable phonetically balanced words presented at four different S/N ratios (+12, +9, +6, and +3). Participants repeated each word aloud immediately after its presentation, to establish speech intelligibility and later on performed a free recall task for those words. The speech intelligibility function decreased linearly with increasing S/N levels for both the high‐WMC and low‐WMC groups. However, only the low‐WMC group had decreasing memory performance with increasing S/N levels. The memory of the high‐WMC individuals was not affected by increased S/N levels. Our results suggest that individual differences in WMC counteract some of the negative effects of speech noise. Copyright © 2012 John Wiley & Sons, Ltd.  相似文献   

18.
The effect of stimulus repetition was investigated in a speech identification task where intelligibility was lowered not by white noise added to the audio waveform but by ‘structural’ noise added to the spectrum parameters. Several aspects of the results argue for the improvement in intelligibility between one and two presentations being due not to statistical averaging over internal or external noise, but to increased perceptual selectivity under the influence of the first presentation's stimulus properties.  相似文献   

19.
Most scholars agree that meaning and intelligibility are central to Heidegger’s account of Dasein and Being-in-the-world, but there is some confusion about the nature of this intelligibility. In his debate with McDowell, Dreyfus draws on phenomenologists like Heidegger to argue that there are two kinds of intelligibility: a basic, nonconceptual, practical intelligibility found in practical comportment and a conceptual, discursive intelligibility. I explore two possible ways that Dreyfus might ground this twofold account of intelligibility in Heidegger: first in the distinction between the hermeneutic and apophantic “as”, and second in the presence and absence of the as-structure. I argue that neither approach succeeds because practical intelligibility is always already discursive and discursive articulation is a condition of practical comportment.  相似文献   

20.
Speech intelligibility during the performance of a second task (sorting of small plates), and the frequency of sorting in dependence of the phases of speech processing (input-processing-output) are investigated. A fixed speech level (65 dB) is combined with 5 different noise levels (55, 60, 65, 70, 75 dB). The speech material and the sorting task vary in difficulty (words, sentences, small texts; simple and complicated sorting). By rating 3 questions the subjective quality of both tasks is inquired. Main results: speech intelligibility and frequency of sorting vary in dependence of noise level; frequency of sorting varies in dependence of the phases of speech processing and speech material; subjective ratings are corresponding with the performance of both tasks.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号