首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 31 毫秒
1.
Several recent studies by Lehiste have reported that changes in fundamental frequency (F0) can serve as a cue to perceived vowel length and, furthermore, that the perceived lengthening of the vowel can influence perception of the voicing feature of stop consonants in syllable-final position. In Experiment 1, we replicated Lehiste’s basic results for stop consonants in final position. Experiment 2 extended these results to postvocalic fricatives. The final consonant in syllables of intermediate vowel duration was more often perceived as voiced when F0 was falling than when F0 was monotone. In Experiment 3, we examined the F0 contours produced by eight talkers before postvocalic stop consonants and fricatives in natural speech for minimal pairs of words differing in voicing. The amount of change of F0 over the vowel was no greater before voiced than voiceless consonants, suggesting that the earlier perceptual effects cannot be explained by appealing to regularities observed in the production of F0 contours in vowels preceding postvocalic consonants.  相似文献   

2.
This study assessed whether 3 children, 5 to 6 years old, who deleted word-final fricatives preserve the voicing contrast for those fricatives by producing dif ferential duration of the preceding vowel. The children's CV syllable productions were compared to their CV(C) syllable productions in which the final consonant was intended but actually deleted. Analysis indicated that all 3 children exhibited significantly longer vowel duration, in CV syllables than in CV(C) syllables. This differential duration of the preceding vowel was shown in both isolation and carrier phrase conditions. Of the 3 children 2 preserved the voicing contrast by showing significantly longer vowels preceding voiced consonants as compared to voiceless consonants. One child did this in both isolation and carrier phrase conditions and the other child did this in isolation only. All 3 children manipulated vowel duration to signify the linguis tic contrast, therefore these findings support a linguistic perspective of speech development which focuses on the acquisition and knowledge of the linguistic rules of the language. In addition, the large amount of variability in vowel duration for CV and CV(C) syllables and the marked variability in performance across children support a biological view of a developing vocal tract undergoing structural and physiological changes.  相似文献   

3.
For native speakers of English and several other languages, preceding vocalic duration andFi offset frequency are two of the cues that convey the stop consonant voicing distinction in wordfinal position. For speakers learning English as a second language, there are indications that use of vocalic duration, but notFl offset frequency, may be hindered by a lack of experience with phonemic (i.e., lexical) vowel length (the “phonemic vowel length account”: Crowther & Mann, 1992). In this study, native speakers of Arabic, a language that includes a phonemic vowel length distinction, were tested for their use of vocalic duration andF1 offset in production and perception of the English consonant-vowel-consonant forms pod and pot. The phonemic vowel length hypothesis predicts that Arabic speakers should use vocalic duration extensively in production and perception. On the contrary, experiment l repealed that, consistent with Flege and Port’s (1981) findings, they produced only slightly (but significantly) longer vocalic segments in their pod tokens. It further indicated that their productions showed a significant variation inFl offset as a function of final stop voicing. Perceptual sensitivity to vocalic duration andFl offset as voicing cues was tested in two experiments. In experiment 2, we employed a factorial combination of these two cues and a finely spaced vocalic duration continuum. Arabic speakers did not appear to be very sensitive to vocalic duration, but they were abort as sensitive as native English speakers toF1 offset frequency. In Experiment 3, we employed a one-dimensional continuum of more widely spaced stimuli that varied only vocalic duration. Arabic speakers showed native-English-like sensitivity to vocalic duration- Anexplanation based on tie perceptual anchor theory of context coding (Braida et al., 1984; Macmillan, 1987; Macmillan, Braida, & Goldberg, 1987) and phoneme perception theory (Schouten & Van Hessen, 2992) is offered to reconcile the apparently contradictory perceptual findings. The explanation does not attribute native-English-like voicing perception to the Ambit subjects. The findings in this study call fox a modification of the phonemic vowel length hypothesis.  相似文献   

4.
Five-year-old children were tested for perceptual trading relations between a temporal cue (silence duration) and a spectral cue (F1 onset frequency) for the “say-stay” distinction. Identification functions were obtained for two synthetic “say-stay” continua, each containing systematic variations in the amount of silence following the /s/ noise. In one continuum, the vocalic portion had a lower F1 onset than in the other continuum. Children showed a smaller trading relation than has been found with adults. They did not differ from adults, however, in their perception of an “ay-day” continuum formed by varying F1 onset frequency only. The results of a discrimination task in which the two acoustic cues were made to “cooperate” or “conflict” phonetically supported the notion of perceptual equivalence of the temporal and spectral cues along a single phonetic dimension. The results indicate that young children, like adults, perceptually integrate multiple cues to a speech contrast in a phonetically relevant manner, but that they may not give the same perceptual weights to the various cues as do adults.  相似文献   

5.
Traunmüller (1981) suggested that the tonotopic distance between the first formant (F 1) and the fundamental frequency (F 0) is a major determinant of perceived vowel height. In the present study, subjects identified a vowel-height continuum ranging in formant pattern from /?/ to /ε/, at fiveF 0 values. IncreasingF 0 led to an increased probability of /I/ responses (i.e., the phoneme boundary shifted toward the /ε/ end of the continuum). Various conditions of filtering out the lower harmonics of the stimuli caused only marginal shifts of the phoneme boundary. The experiments provide evidence against interpretations of Traunmüller’s (1981) results that claim that vowel height is determined by the distance betweenF 1 and the lowest harmonic that is present in the basilar membrane excitation pattern.  相似文献   

6.
Several experiments investigate voicing judgments in minimal pairs likerabid-rapid when the duration of the first vowel and the medial stop are varied factorially and other cues for voicing remain ambiguous. In Experiments 1 and 2, in which synthetic labial and velar-stop voicing pairs are investigated, the perceptual boundary along a continuum of silent consonant durations varies in constant proportion to increases in the duration of the preceding vocalic interval. In Experiment 3, it is shown that speaking tempo external to the test word has far smaller effects on a closure duration boundary for voicing than does the tempo within the test word. Experiment 4 shows that, even within the word, it is primarily the preceding vowel that accounts for changes in the consonant duration effects. Furthermore, in Experiments 3 and 4, the effects of timing outside the vowel-consonant interval are independent of the duration of that interval itself. These findings suggest that consonant/vowel ratio serves as a primary acoustic cue for English voicing in syllable-final position and imply that this ratio possibly is directly extracted from the speech signal.  相似文献   

7.
This study was designed to explore the effects of transient attention—the stimulus-driven component of spatial attention—on the perceived duration of a brief visual event. Observers had to compare the duration of two disks presented successively within a single trial. The disks’ location and duration varied independently. One of these disks, the “attended disk”, was preceded by an attentional cue indicating the disk's location, attracting transient attention in advanced to the disk location. This attentional cue was either a typical onset cue (Experiments 1, 2, and 4) or a singleton cue (Experiment 3) that minimized the differences between the cues. The other disk, the “neutral disk”, was cued with a neutral cue that did not convey information regarding the disk location. We found that the attraction of transient attention to the location of the attended disk prolonged its perceived duration, but only when compared to brief nonattend stimuli.  相似文献   

8.
Ss gave immediate ordered recall for series of seven isolated vowel sounds from a vocabulary of three-/a, u, i/. In a 2 by 2 design, the stimulus vowels were either 50 or 300 msec in duration (though always presented at a 2/sec rate), and the seven-item series was followed by either a nonverbal recall cue or a verbal recall cue that is a stimulus suffix. There was an interaction such that the recall impairment caused by the verbal suffix was larger when the stimulus items were vowels of long duration than when they were vowels of short duration. Since short vowels tend to be more categorically perceived than long vowels, this result reinforces the view that vowels and stop consonants are differently represented in auditory short-term memory for reasons that have to do with the way m winch they are perceived or vice versa.  相似文献   

9.
Domini F  Shah R  Caudek C 《Acta psychologica》2011,138(3):359-366
The current model of three-dimensional perception hypothesizes that the brain integrates the depth cues in a statistically optimal fashion through a weighted linear combination with weights proportional to the reliabilities obtained for each cue in isolation (Landy, Maloney, Johnston, & Young, 1995). Even though many investigations support such theoretical framework, some recent empirical findings are at odds with this view (e.g., Domini, Caudek, & Tassinari, 2006). Failures of linear cue integration have been attributed to cue-conflict and to unmodelled cues to flatness present in computer-generated displays. We describe two cue-combination experiments designed to test the integration of stereo and motion cues, in the presence of consistent or conflicting blur and accommodation information (i.e., when flatness cues are either absent, with physical stimuli, or present, with computer-generated displays). In both conditions, we replicated the results of Domini et al. (2006): The amount of perceived depth increased as more cues were available, also producing an over-estimation of depth in some conditions. These results can be explained by the Intrinsic Constraint model, but not by linear cue combination.  相似文献   

10.
Speech perception requires listeners to integrate multiple cues that each contribute to judgments about a phonetic category. Classic studies of trading relations assessed the weights attached to each cue but did not explore the time course of cue integration. Here, we provide the first direct evidence that asynchronous cues to voicing (/b/ vs. /p/) and manner (/b/ vs. /w/) contrasts become available to the listener at different times during spoken word recognition. Using the visual world paradigm, we show that the probability of eye movements to pictures of target and of competitor objects diverge at different points in time after the onset of the target word. These points of divergence correspond to the availability of early (voice onset time or formant transition slope) and late (vowel length) cues to voicing and manner contrasts. These results support a model of cue integration in which phonetic cues are used for lexical access as soon as they are available.  相似文献   

11.
The perception of the distinction between /r/ and /l/ by native speakers of American English and of Japanese was studied using natural and synthetic speech. The American subjects were all nearly perfect at recognizing the natural speech sounds, whereas there was substantial variation among the Japanese subjects in their accuracy of recognizing /r/ and /l/ except in syllable-final position. A logit model, which additively combined the acoustic information conveyed byF1-transition duration and byF3-onset frequency, provided a good fit to the perception of synthetic /r/ and /l/ by the American subjects. There was substantial variation among the Japanese subjects in whether theF1 andF3 cues had a significant effect on their classifications of the synthetic speech. This variation was related to variation in accuracy of recognizing natural /r/ and /l/, such that greater use of both theF1 cue and theF3 cue in classifying the synthetic speech sounds was positively related to accuracy in recognizing the natural sounds. However, multiple regression showed that use of theF1 cue did not account for significant variance in natural speech performance beyond that accounted for by theF3 cue, indicating that theF3 cue is more important than theF1 cue for Japanese speakers learning English. The relation between performance on natural and synthetic speech also provides external validation of the logit model by showing that it predicts performance outside of the domain of data to which it was fit.  相似文献   

12.
Recent experiments showed that the perception of vowel length by German listeners exhibits the characteristics of categorical perception. The present study sought to find the neural activity reflecting categorical vowel length and the short-long boundary by examining the processing of non-contrastive durations and categorical length using MEG. Using disyllabic words with varying /a/-durations and temporally-matched nonspeech stimuli, we found that each syllable elicited an M50/M100-complex. The M50-amplitude to the second syllable varied along the durational continuum, possibly reflecting the mapping of duration onto a rhythm representation. Categorical length was reflected by an additional response elicited when vowel duration exceeded the short-long boundary. This was interpreted to reflect the integration of an additional timing unit for long in contrast to short vowels. Unlike to speech, responses to short nonspeech durations lacked a M100 to the first and M50 to the second syllable, indicating different integration windows for speech and nonspeech signals.  相似文献   

13.
ABSTRACT

This study used synthesis to manipulate vowel formant frequencies and durations to evaluate their role on foreign accent perception. Formant frequencies and durations for the vowels /æ/, /?/, and /a/ were manipulated with changes toward and away from the mean native English and Spanish-accented values from Sidaras, S. K., Alexander, J. E. D., & Nygaard, L. C. (2009. Perceptual learning of systematic variation in Spanish-accented speech. The Journal of the Acoustical Society of America, 125, 3306–3316). Native listeners rated these stimuli on degree of accentedness and comprehensibility. Gradual changes in formant frequencies from native to non-native values impacted /a/ negatively, /?/ positively, and /æ/ minimally. Effects of vowel duration on either type of ratings were small and restricted to vowel-specific interactions. The current findings suggest that vowel formant frequencies are primary cues to foreign accent. Their influence depends upon whether or not frequencies could reflect alternative vowel categories.  相似文献   

14.
We present three experiments designed to investigate the role of prosody during sentence processing. The first investigated the question of whether an utterance's prosodic contour influences its comprehension on-line. We spliced the beginning and end portions of direct object and embedded clause sentences and observed the consequent effects on comprehension using a dual-task procedure to measure processing load. Our second experiment sought to determine-whether the constituent structure of these sentences could be reliably predicted using prosodic information. We found that the duration and F0 contour associated with the main-clause verb and the following NP reliably distinguished between the direct object and embedded clause constructions. In the final experiment, we manipulated the duration of the main-clause verb and found that subjects used this information to guide their initial parse during on-line sentence comprehension. The need for a model of sentence processing that addresses the use of prosodic information is discussed.The work reported in this paper was supported in part by NTH grant DC00494.  相似文献   

15.
The hypothesis was tested that CV syllables (10 different consonants but the same vowel /?/) would show different evoked potential latencies and amplitudes. Differences were found which were dichotomised between plosives and other consonants. Although one difference between these two groups of consonants is the duration of that consonant, this duration difference did not adequately explain the prolonged evoked potential latencies and the reduced amplitudes for the non-plosives. However, no differences were found within the group of non-plosives, although they varied in consonant duration. It was suggested that N1 latency and amplitude reflected processing time at an early stage of analysis of both speech and non-speech stimuli. The results show that consunants of long duration are perceived later than plosives, but well before the onset of the vowel.  相似文献   

16.
Phonetically governed changes in the fundamental frequency (F0) of vowels that immediately precede and follow voiceless stop plosives have been found to follow consistent patterns in adults and children as young as four years of age. In the present study, F0 onset and offset patterns in 14 children who stutter (CWS) and 14 children who do not stutter (CWNS) were investigated to evaluate differences in speech production. Participants produced utterances containing two VCV sequences. F0 patterns in the last ten vocal cycles in the preceding vowel (voicing offset) and the first ten vocal cycles in the subsequent vowel (voicing onset) were analyzed. A repeated measures ANOVA revealed no group differences between the CWS and CWNS in either voicing onset or offset gestures. Both groups showed patterns of F0 onset and offset that were consistent with the mature patterns seen in children and adults in previous studies. These findings suggest that in both CWS and CWNS, a mature pattern of voicing onset and offset is present by age 3;6. This study suggests that there is no difference between CWS and CWNS in the coordination of respiratory and laryngeal systems during voicing onset or offset.Educational objectives: The reader will be able to: (a) discuss the importance of investigating children who stutter close to the onset of stuttering; (b) describe the typical change in F0 during voicing onset; (c) discuss the potential implications of these results with regard to future research.  相似文献   

17.
The acoustical spectrum of the five Spanish vowels |a, e, i, o, u| has been delimited to show the areas covered by F 1 , F 2 , and F 3 and the relative distribution energy among the formants. Through the analysis of the spectral components of vowels, isolated and in consonantal context, it is possible to estimate the different weight of each formant in vowel identification. At least for isolated vowels,F 2 andF 3 seem to be effective for the identification of [i] and [e] while theF 1 andF 2 carry the weight for the identification of [o] and [u]. The cue to differentiate [a] seems to beF 2. Spanish vowels are compared with cardinals and North American English vowels. There is no correlation with cardinal vowels while similarities are found with English vowels.  相似文献   

18.
Metrical phonology is the perceptual “strength” in language of some syllables relative to others. The ability to perceive lexical stress is important, as it can help a listener segment speech and distinguish the meaning of words and sentences. Despite this importance, there has been little comparative work on the perception of lexical stress across species. We used a go/no-go operant paradigm to train human participants and budgerigars (Melopsittacus undulatus) to distinguish trochaic (stress-initial) from iambic (stress-final) two-syllable nonsense words. Once participants learned the task, we presented both novel nonsense words, and familiar nonsense words that had certain cues removed (e.g., pitch, duration, loudness, or vowel quality) to determine which cues were most important in stress perception. Members of both species learned the task and were then able to generalize to novel exemplars, showing categorical learning rather than rote memorization. Tests using reduced stimuli showed that humans could identify stress patterns with amplitude and pitch alone, but not with only duration or vowel quality. Budgerigars required more than one cue to be present and had trouble if vowel quality or amplitude were missing as cues. The results suggest that stress patterns in human speech can be decoded by other species. Further comparative stress-perception research with more species could help to determine what species characteristics predict this ability. In addition, tests with a variety of stimuli could help to determine how much this ability depends on general pattern learning processes versus vocalization-specific cues.  相似文献   

19.
An interesting phenomenon in human speech perception is the trading relation, in which two different acoustic cues both signal the same phonetic percept. The present study compared American English, Spanish, and monkey listeners in their perception of the trading relation between gap duration andFl transition onset frequency in a syntheticsay-stay continuum. For all the subjects, increased gap duration caused perception to change fromsay tostay; however, subjects differed in the extent to which theFl cue traded with gap duration. For American English listeners, a change from a low to a highF1 onset caused a phoneme boundary shift of 26 msec toward shorter gap durations, indicating a strong trading relation. For Spanish listeners, the shift was significantly smaller at 13.7 msec, indicating a weaker trading relation. For monkeys, there was no shift at all, indicating no trading relation. These results provide evidence that thesay-stay trading relation is dependent on perceptual learning from linguistic exposure.  相似文献   

20.
This article reports three experiments in which the effects of cue duration on involuntary orienting to uninformative symbolic cues (arrows presented at fixation) were investigated. Experiment 1 showed that symbolic cues had less effect on involuntary orienting when they were presented for only 25 msec than when they were presented for 200 msec across a range of stimulus onset asynchronies. Experiment 2 suggested that the effect of cue duration on involuntary orienting was due primarily to top-down strategic factors, rather than to bottom-up stimulus factors, and Experiment 3 suggested that these strategic factors may involve differences in how the cue is processed. Altogether, the present findings are important because they emphasize the distinction between cue processing and the putative involuntary orienting that results from such processing in the symbolic-cuing paradigm. In so doing, the present results help resolve discrepant findings that have been reported across previous studies.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号