首页 | 本学科首页   官方微博 | 高级检索  
相似文献
 共查询到20条相似文献,搜索用时 15 毫秒
1.
In adult listeners, the signal-to-noise ratio at masked threshold remains constant with increases in masker level over a wide range of stimulus conditions. This relationship was examined in 7-month-old infants by obtaining masked thresholds for .5- and 4-kHz tones presented in four levels of continuous masking noise. Adults were also tested for comparison. Masker spectrum levels ranged from 5 to 35 dB/Hz for .5-kHz tones, and from -5 to 25 dB/Hz for 4-kHz stimuli. Thresholds were determined for stimuli of both 10 and 100 msec in duration. The results indicated that infants' performance was more adult-like for 4-kHz stimuli. Although mean thresholds for both 10- and 100-msec, 4-kHz tones were approximately 7 dB higher in infants than in adults, E/N0 at threshold remained essentially constant over the 30-dB range of maskers employed. By contrast, infants' thresholds for .5-kHz tones were exceptionally high at lower levels of the masker. Threshold E/N0 decreased significantly as masker level increased from 5 to 35 dB/Hz, and this decrease was significantly greater for 10- than for 100-msec stimuli. Temporal summation of .5-kHz tones, measured as the difference between thresholds obtained at the two signal durations, was greater for infants than for adults at low levels of the masker. However, because infants' thresholds improved more rapidly with level for 10- than for 100-msec tones, age differences in temporal summation were no longer significant when masker spectrum level was 35 dB/Hz. These results suggest that the relationship between signal-to-noise ratio at masked threshold and level of the masker is dependent on both signal frequency and duration during infancy.  相似文献   

2.
In adult listeners, the signal-to-noise ratio at masked threshold remains constant with increases in masker level over a wide range of stimulus conditions. This relationship was examined in 7-month-old infants by obtaining masked thresholds for .5- and 4-kHz tones presented in four levels of continuous masking noise. Adults were also tested for comparison. Masker spectrum levels ranged from 5 to 35 dB/Hz for .5-kHz tones, and from ?5 to 25 dB/Hz for 4-kHz stimuli. Thresholds were determined for stimuli of both 10 and 100 msec in duration. The results indicated that infants’ performance was more adultlike for 4-kHz stimuli. Although mean thresholds for both 10- and 100-msec, 4-kHz tones were approximately 7 dB higher in infants than in adults, E/N0 at threshold remained essentially constant over the 30-dB range of maskers employed. By contrast, infants’ thresholds for .5-kHz tones were exceptionally high at lower levels of the masker. Threshold E/N0 decreased significantly as masker level increased from 5 to 35 dB/Hz, and this decrease was significantly greater for 10- than for 100-msec stimuli. Temporal summation of .5-kHz tones, measured as the difference between thresholds obtained at the two signal durations, was greater for infants than for adults at low levels of the masker. However, because infants’ thresholds improved more rapidly with level for 10- than for 100-msec tones, age differences in temporal summation were no longer significant when masker spectrum level was 35 dB/Hz. These results suggest that the relationship between signal-to-noise ratio at masked threshold and level of the masker is dependent on both signal frequency and duration during infancy.  相似文献   

3.
The application of the power-spectrum model of masking to the detectability of a signal masked by dichotic noise was investigated in three experiments. In each experiment, the signal was a 2-kHz sinusoid of 400-msec duration, masked by either one or two 800-Hz wide bands of noise presented singly or in pairs. In Experiment 1, we compared the detectability of a diotic signal masked by dichotic noise with the detectability of a monaural signal masked by each of the noises separately. The spectrum level of the noise was 35 dB SPL. For dichotic presentations, the signal was sent to both ears while pairs of noise bands, one below and one above the signal frequency, were presented together, one band to each ear. Threshold levels with the dichotic stimuli were lower than or equal to the thresholds with either ear's stimulus on its own. Similar dichotic stimuli were used in Experiment 2, except that the signal frequency was nearer to one or the other of the bands of masking noise, and the noise had a spectrum level of 50 dB SPL. In Experiment 3, thresholds were obtained with two sets of symmetrically and asymmetrically placed notched-noise maskers. For one of these sets, the spectrum level of both noise bands was 35 dB SPL; for the other set, interaural intensity differences were introduced in the form of an inequality in the levels of the noise bands on either side of the signal. In one ear, the spectrum level of the lower frequency noise band was 35 dB SPL and the spectrum level of the higher frequency noise band was 25 dB SPL, whereas in the other ear, the allocation of noise level to noise band was reversed. The dichotic thresholds obtained with the unequal noise maskers could be predicted from the shapes of the auditory filters derived with equal noise maskers. The data from all three experiments suggest that threshold signal levels in the presence of interaural differences in masker intensity depend principally on the ear with the higher signal-to-masker ratio at the output of its auditory filter, a finding consistent with the power-spectrum model of masking.  相似文献   

4.
Size of critical band in infants, children, and adults   总被引:1,自引:0,他引:1  
Masked thresholds at two signal frequencies (0.8 and 4 kHz) were obtained from listeners aged 6.5 months, 2 years, 5 years, and 20.5 years in the presence of constant spectrum level, narrowband maskers of differing bandwidths. Consistent with the classical results of Fletcher (1940), masked threshold for all age groups increased with bandwidth up to a critical width, beyond which further increases in bandwidth were ineffective in increasing threshold. These critical widths (estimates of critical band size) did not change substantially with age (critical widths for infants were no more than 50% larger than those of adults) despite substantial changes in masked thresholds with age. Thus, contrary to previous claims, changes in auditory filter width cannot account for developmental changes in masked or absolute thresholds.  相似文献   

5.
According to the formant centre of gravity (FCOG) hypothesis, two vowel formants in close proximity are merged during perceptual analysis, and their contribution to vowel quality depends on the centre of gravity of the formant cluster. Findings consistent with this hypothesis are that two formants can be replaced by a single formant of intermediate centre frequency, provided their separation is less than 3-3.5 Bark; and that changes in their relative amplitudes produce systematic shifts in vowel quality. In Experiment 1, listeners adjusted the frequencies of F1 and F2 in a synthesized 6-formant vowel (with the F1-F2 separation fixed at 250 Hz, i.e. less than 3 Bark) to find the best phonetic match to a reference vowel with modified formant amplitudes. Contrary to FCOG predictions, F2 attenuation did not produce lower frequency matches. Raising the amplitude of F2 led to predicted upward shifts in formant frequencies of the matched vowel, but with increased variability of matches for some stimuli. In Experiment 2, listeners identified synthesized vowels with a range of separations of F1 and F2. Formant amplitude manipulations had no effect on listeners' judgements when the fundamental frequency was low (125 Hz). Small shifts in vowel quality appeared for stimuli with a high fundamental (250 Hz), but the shifts were significantly larger for F1-F2 separations greater than 3.5 Bark. These effects of formant amplitude are qualitatively different from those observed with single-formant vowels and are generally incompatible with a formant-averaging mechanism.  相似文献   

6.
A combined forward-backward masking procedure was used to investigate the threshold of a 30-msec, 500-Hz signal as a function of masker frequency. The signal thresholds were obtained in two signal conditions, diotic (So) and dichotic (Sπ), and for two different temporal separations of the maskers. The maskers were 500 msec in duration and were presented at 75 dB SPL. The function relating masked signal threshold to masker frequency was used to describe frequency selectivity in the four conditions. There were no differences in frequency selectivity measured between the diotic and dichotic signal conditions and only a small difference measured between the two intermasker interval conditions. The Sπ conditions yielded lower thresholds than did the So conditions. The change in intermasker interval from 10 to 50 msec lowered the threshold maximally 18 dB for the So condition and 13 dB for the Sπ condition. The results indicate that in this tonal temporal masking procedure there are no differences between the diotic and dichotic critical bands.  相似文献   

7.
In four experiments we investigated whether listeners can locate the formants of vowels not only from peaks, but also from spectral “shoulders”—features that give rise to zero crossings in the third, but not the first, differential of the excitation pattern—as hypothesized by Assmann and Summerfield (1989). Stimuli were steady-state approximations to the vowels [a, i, з, u, ?] created by summing the first 45 harmonics of a fundamental of 100 Hz. Thirty-nine harmonics had equal amplitudes; the other 6 formed three pairs that were raised in level to define three “formants.” An adaptive psychophysical procedure determined the minimal difference in level between the 6 harmonics and the remaining 39 at which the vowels were identifiably different from one another. These thresholds were measured through simulated communication channels, giving overall slopes to the excitation patterns of the five vowels that ranged from ?1 dB/erb to +2 dB/erb. Excitation patterns of the threshold stimuli were computed, and the locations of formants were estimated from zero crossings in the first and third differentials. With the more steeply sloping communication channels, some formants of some vowels were represented as shoulders rather than peaks, confirming the predictions of Assmann and Summerfield’s models. We discuss the limitations of the excitation pattern model and the related issue of whether the location of formants can be computed from spectral shoulders in auditory analysis.  相似文献   

8.
Three experiments were performed to examine listeners’ thresholds for identifying stimuli whose spectra were modeled after the vowels /i/ and /ε/, with the differences between these stimuli restricted to the frequency of the first formant. The stimuli were presented in a low-pass masking noise that spectrally overlapped the first formant but not the higher formants. Identification thresholds were lower when the higher formants were present than when they were not, even though the first formant contained the only distinctive information for stimulus identification. This indicates that listeners were more sensitive in identifying the first formant energy through its contribution to the vowel than as an independent percept; this effect is given the namecoherence masking protection. The first experiment showed this effect for synthetic vowels in which the distinctive first formant was supported by a series of harmonics that progressed through the higher formants. In the second two experiments, the harmonics in the first formant region were removed, and the first formant was simulated by a narrow band of noise. This was done so that harmonic relations did not provide a basis for grouping the lower formant with the higher formants; coherence masking protection was still observed. However, when the temporal alignment of the onsets and offsets of the higher and lower formants was disrupted, the effect was eliminated, although the stimuli were still perceived as vowels. These results are interpreted as indicating that general principles of auditory grouping that can exploit regularities in temporal patterns cause acoustic energy belonging to a coherent speech sound to stand out in the auditory scene.  相似文献   

9.
We conducted four experiments to investigate the specificity of perceptual adjustments made to unusual speech sounds. Dutch listeners heard a female talker produce an ambiguous fricative [?] (between [f] and [s]) in [f]- or [s]-biased lexical contexts. Listeners with [f]-biased exposure (e.g., [witlo?]; from witlof, "chicory"; witlos is meaningless) subsequently categorized more sounds on an [epsilonf]-[epsilons] continuum as [f] than did listeners with [s]-biased exposure. This occurred when the continuum was based on the exposure talker's speech (Experiment 1), and when the same test fricatives appeared after vowels spoken by novel female and male talkers (Experiments 1 and 2). When the continuum was made entirely from a novel talker's speech, there was no exposure effect (Experiment 3) unless fricatives from that talker had been spliced into the exposure talker's speech during exposure (Experiment 4). We conclude that perceptual learning about idiosyncratic speech is applied at a segmental level and is, under these exposure conditions, talker specific.  相似文献   

10.
In a variant of duplex perception with speech, phoneme perception is maintained when distinguishing components are presented below intensities required for separate detection, forming the basis for the claim that a phonetic module takes precedence over nonspeech processing. This finding is replicated with music chords (C major and minor) created by mixing a piano fifth with a sinusoidal distinguishing tone (E or E flat). Individual threshold intensities for detecting E or E flat in the context of the fixed piano tones are established. Chord discrimination thresholds defined by distinguishing tone intensity were determined. Experiment 2 verified masked detection thresholds and subliminal chord identification for experienced musicians. Accurate chord perception was maintained at distinguishing tone intensities nearly 20 dB below the threshold for separate detection. Speech and music findings are argued to demonstrate general perceptual principles.  相似文献   

11.
The question of whether sensitivity peaks at vowel boundaries (i.e., phoneme boundary effects) and sensitivity minima near excellent category exemplars (i.e., perceptual magnet effects) stem from the same stage of perceptual processing was examined in two experiments. In Experiment 1, participants gave phoneme identification and goodness ratings for 13 synthesized English /i/ and /e/ vowels. In Experiment 2, participants discriminated pairs of these vowels. Either the listeners discriminated the entire range of stimuli within each block of trials, or the range within each block was restricted to a single stimulus pair. In addition, listeners discriminated either one-step or two-step intervals along the stimulus series. The results demonstrated that sensitivity peaks at vowel boundaries were more influenced by stimulus range than were perceptual magnet effects; peaks in sensitivity near the /i/-/e/ boundary were reduced with restricted stimulus ranges and one-step intervals, but minima in discrimination near the best exemplars of /i/ were present in all conditions.  相似文献   

12.
Four experiments examined the effects of language characteristics on voice identification. In Experiment 1, monolingual English listeners identified bilinguals' voices much better when they spoke English than when they spoke German. The opposite outcome was found in Experiment 2, in which the listeners were monolingual in German. In Experiment 3, monolingual English listeners also showed better voice identification when bilinguals spoke a familiar language (English) than when they spoke an unfamiliar one (Spanish). However, English-Spanish bilinguals hearing the same voices showed a different pattern, with the English-Spanish difference being statistically eliminated. Finally, Experiment 4 demonstrated that, for English-dominant listeners, voice recognition deteriorates systematically as the passage being spoken is made less similar to English by rearranging words, rearranging syllables, and reversing normal text. Taken together, the four experiments confirm that language familiarity plays an important role in voice identification.  相似文献   

13.
Two experiments investigating the selective adaptation of vowels examined changes in listeners’ identification functions for the vowel continuum [i-I-∈] as a function of the adapting stimulus. In Experiment I, the adapting stimuli were [i], [I], and [∈]. Both the [i] and [∈] stimuli produced significant shifts in the neighboringand distant phonetic boundaries, whereas [I] did not result in any adaptation effects. In order to explore the phonetic nature of feature adaptation in vowels, a second experiment was conducted using the adapting stimuli [gig] and [g ∈ g], which differed acoustically from the [i] and [∈] vowels on the identification continuum. Only [gig] yielded reliable adaptation effects. The results of these experiments were interpreted as suggesting arelative rather than a stableauditory mode of feature analysis in vowels and a possibly more complex auditory feature analysis for the vowel [i].  相似文献   

14.
The intelligibility of word lists subjected to various types of spectral filtering has been studied extensively. Although words used for communication are usually present in sentences rather than lists, there has been no systematic report of the intelligibility of lexical components of narrowband sentences. In the present study, we found that surprisingly little spectral information is required to identify component words when sentences are heard through narrow spectral slits. Four hundred twenty listeners (21 groups of 20 subjects) were each presented with 100 bandpass filtered CID ( “everyday speech ”) sentences; separate groups received center frequencies of 370, 530, 750, 1100, 1500, 2100, 3000, 4200, and 6000 Hz at 70 dBA SPL. In Experiment 1, intelligibility of single 1/3-octave bands with steep filter slopes (96 dB/octave) averaged more than 95% for sentences centered at 1100, 1500, and 2100 Hz. In Experiment 2, we used the same center frequencies with extremely narrow bands (slopes of 115 dB/octave intersecting at the center frequency, resulting in a nominal bandwidth of l/20 octave). Despite the severe spectral tilt for all frequencies of this impoverished spectrum, intelligibility remained relatively high for most bands, with the greatest intelligibility (77%) at 1500 Hz. In Experiments 1 and 2, the bands centered at 370 and 6000 Hz provided little useful information when presented individually, but in each experiment they interacted synergistically when combined. The present findings demonstrate the adaptive flexibility of mechanisms used for speech perception and are discussed in the context of the LAME model of opportunistic multilevel processing.  相似文献   

15.
A threshold procedure using operant behavioral techniques with positive reinforcement was developed after initial efforts with avoidance behavioral procedures proved unsatisfactory. In the first of three experiments the operant threshold procedure was tested by determining masked auditory thresholds for trains of clicks. In a second experiment, similar techniques were used to measure thresholds for electrical stimulation of the brain. The last experiment, again an auditory problem, involved a determination of the absolute thresholds for trains of short noise bursts as a function of the time between bursts. Middle ear malfunction proved to be a more severe problem than had been anticipated on the basis of reports in the literature. The threshold procedure, however, seems to be adequate for determination of absolute or masked thresholds with auditory or electrical stimuli.  相似文献   

16.
This study examined the effect of native language background on listeners’ perception of native and non-native vowels spoken by native (Hong Kong Cantonese) and non-native (Mandarin and Australian English) speakers. They completed discrimination and an identification task with and without visual cues in clear and noisy conditions. Results indicated that visual cues did not facilitate perception, and performance was better in clear than in noisy conditions. More importantly, the Cantonese talker’s vowels were the easiest to discriminate, and the Mandarin talker’s vowels were as intelligible as the native talkers’ speech. These results supported the interlanguage speech native intelligibility benefit patterns proposed by Hayes-Harb et al. (J Phonetics 36:664–679, 2008). The Mandarin and English listeners’ identification patterns were similar to those of the Cantonese listeners, suggesting that they might have assimilated Cantonese vowels to their closest native vowels. In addition, listeners’ perceptual patterns were consistent with the principles of Best’s Perceptual Assimilation Model (Best in Speech perception and linguistic experience: issues in cross-language research. York Press, Timonium, 1995).  相似文献   

17.
In Experiment 1, masking-level differences (MLDs) for a 500-Hz tone at five masker levels were obtained from younger and older adults. For both age groups, there were no reliable increases in MLD once the spectrum level of the masker exceeded 27 dB SPL. MLDs were larger for younger than for older adults over the range of masker levels tested. In Experiment 2, the levels of both the signal and the masker in one ear were attenuated by either 15 or 30 dB relative to their level in the other ear, which was fixed at a spectrum level of 47 dB SPL. MLDs for both age groups declined with increasing IAA and age-related differences were observed in all conditions. The findings of these experiments indicate that (1) age-related differences in MLDs exist even when the level of the masker is sufficiently high that older adults achieve their plateau performance, and (2) older listeners are not disadvantaged more than younger listeners by interaural differences in the level of the input.  相似文献   

18.
Many perceptual categories exhibit internal structure in which category prototypes play an important role. In the four experiments reported here, the internal structure of phonetic categories was explored in studies involving adults, infants, and monkeys. In Experiment 1, adults rated the category goodness of 64 variants of the vowel i parallel on a scale from 1 to 7. The results showed that there was a certain location in vowel space where listeners rated the i parallel vowels as best instances, or prototypes. The perceived goodness of i parallel vowels declined systematically as stimuli were further removed from the prototypic i parallel vowel. Experiment 2 went beyond this initial demonstration and examined the effect of speech prototypes on perception. Either the prototypic or a nonprototypic i parallel vowels was used as the referent stimulus and adults' generalization to other members of the category was examined. Results showed that the typicality of the speech stimulus strongly affected perception. When the prototype of the category served as the referent vowel, there was significantly greater generalization to other i parallel vowels, relative to the situation in which the nonprototype served as the referent. The notion of a perceptual magnet was introduced. The prototype of the category functioned like a perceptual magnet for other category members; it assimilated neighboring stimuli, effectively pulling them toward the prototype. In Experiment 3, the ontogenetic origins of the perceptual magnet effect were explored by testing 6-month-old infants. The results showed that infants' perception of vowels was also strongly affected by speech prototypes. Infants showed significantly greater generalization when the prototype of the vowel category served as the referent; moreover, their responses were highly correlated with those of adults. In Experiment 4, Rhesus monkeys were tested to examine whether or not the prototype's magnet effect was unique to humans. The animals did not provide any evidence of speech prototypes; they did not exhibit the magnet effect. It is suggested that the internal organization of phonetic categories around prototypic members is an ontogenetically early, species-specific, aspect of the speech code.  相似文献   

19.
In the experiments reported here, we attempted to find out more about how the auditory system is able to separate two simultaneous harmonic sounds. Previous research (Halikia & Bregman, 1984a, 1984b; Scheffers, 1983a) had indicated that a difference in fundamental frequency (F0) between two simultaneous vowel sounds improves their separate identification. In the present experiments, we looked at the effect of F0s that changed as a function of time. In Experiment 1, pairs of unfiltered or filtered pulse trains were used. Some were steady-state, and others had gliding F0s; different F0 separations were also used. The subjects had to indicate whether they had heard one or two sounds. The results showed that increased F0 differences and gliding F0s facilitated the perceptual separation of simultaneous sounds. In Experiments 2 and 3, simultaneous synthesized vowels were used on frequency contours that were steady-state, gliding in parallel (parallel glides), or gliding in opposite directions (crossing glides). The results showed that crossing glides led to significantly better vowel identification than did steady-state F0s. Also, in certain cases, crossing glides were more effective than parallel glides. The superior effect of the crossing glides could be due to the common frequency modulation of the harmonics within each component of the vowel pair and the consequent decorrelation of the harmonics between the two simultaneous vowels.  相似文献   

20.
Speech reception thresholds (SRTs) were measured at a central focus of attention and at 20°, 40°, and 60° locations distant in azimuth. Measurements were taken with one target collocated with two maskers, or with maskers flanking the target by ±20°. For 80% of trials, the target was played from the attended location (“expected”), and 20% came from another (“unexpected”) location. For collocated stimuli, SRTs worsened with increasing distance from the expected location by 2.1 dB over the 60° azimuth and by 5.1 dB for spatially separated target and maskers. In spatially separated conditions, a 2.9-dB change was still found when gaze was away from the attended location. Spatial attention appears to increase speech intelligibility against interferers, with gain decreasing with distance from the focus of attention. Spatial release from masking (RFM) was only found for the attended location. Further experiments suggest that target location uncertainty prevented RFM at unattended locations.  相似文献   

设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号