Similar Articles
20 similar articles retrieved.
1.
The current study assessed the extent to which the use of referential prosody varies with communicative demand. Speaker–listener dyads completed a referential communication task during which speakers attempted to indicate one of two color swatches (one bright, one dark) to listeners. Speakers' bright sentences were reliably higher pitched than dark sentences for ambiguous (e.g., bright red versus dark red) but not unambiguous (e.g., bright red versus dark purple) trials, suggesting that speakers produced meaningful acoustic cues to brightness when the accompanying linguistic content was underspecified (e.g., “Can you get the red one?”). Listening partners reliably chose the correct corresponding swatch for ambiguous trials when lexical information was insufficient to identify the target, suggesting that listeners recruited prosody to resolve lexical ambiguity. Prosody can thus be conceptualized as a type of vocal gesture that can be recruited to resolve referential ambiguity when there is communicative demand to do so.
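To make concrete what “higher pitched” means acoustically, the following is a minimal sketch of how mean fundamental frequency (F0) could be compared across the two conditions. It assumes WAV recordings and the librosa library; the file names are hypothetical, and the sketch illustrates the general technique rather than the authors' analysis pipeline.

    import librosa
    import numpy as np

    def mean_f0(path):
        # Estimate mean fundamental frequency (Hz) over voiced frames with pYIN.
        y, sr = librosa.load(path, sr=None)
        f0, _, _ = librosa.pyin(y, fmin=librosa.note_to_hz("C2"),
                                fmax=librosa.note_to_hz("C6"), sr=sr)
        return np.nanmean(f0)  # pyin marks unvoiced frames as NaN

    # Hypothetical recordings of one speaker's bright vs. dark sentences.
    bright = [mean_f0(f) for f in ("bright_01.wav", "bright_02.wav")]
    dark = [mean_f0(f) for f in ("dark_01.wav", "dark_02.wav")]
    print(np.mean(bright) - np.mean(dark))  # positive if bright sentences are higher pitched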

2.
The minimal unit of phonological encoding: prosodic or lexical word
Wheeldon, L. R., & Lahiri, A. (2002). Cognition, 85(2), B31–B41.
Wheeldon and Lahiri (Journal of Memory and Language, 37 (1997), 356) used a prepared speech production task (Sternberg, S., Monsell, S., Knoll, R. L., & Wright, C. E. (1978). The latency and duration of rapid movement sequences: comparisons of speech and typewriting. In G. E. Stelmach (Ed.), Information processing in motor control and learning (pp. 117–152). New York: Academic Press; Sternberg, S., Wright, C. E., Knoll, R. L., & Monsell, S. (1980). Motor programs in rapid speech: additional evidence. In R. A. Cole (Ed.), The perception and production of fluent speech (pp. 507–534). Hillsdale, NJ: Erlbaum) to demonstrate that the latency to articulate a sentence is a function of the number of phonological words it comprises. Latencies for the sentence [Ik zoek het] [water] 'I seek the water', which contains two phonological words, were shorter than latencies for sentences like [Ik zoek] [vers] [water] 'I seek fresh water', which contains three. We extend this research by examining the prepared production of utterances containing phonological words that are less than a lexical word in length. Dutch compounds (e.g., ooglid 'eyelid') form a single morphosyntactic word and a single phonological word, which in turn contains two phonological words. We compare their prepared production latencies to those of syntactic phrases consisting of an adjective and a noun (e.g., oud lid 'old member'), which comprise two morphosyntactic and two phonological words, and to morphologically simple words (e.g., orgel 'organ'), which comprise one morphosyntactic and one phonological word. Our findings demonstrate that the effect is limited to phrase-level phonological words, suggesting that production models need to make a distinction between lexical and phrasal phonology.

3.
Two sentence processing experiments on a dative NP ambiguity in Korean demonstrate effects of phrase length on overt and implicit prosody. Both experiments controlled non-prosodic length factors by using long versus short proper names that occurred before the syntactically critical material. Experiment 1 found that long phrases induce different prosodic phrasing than short phrases in a read-aloud task and change the preferred interpretation of globally ambiguous sentences. It also showed that speakers who have been told of the ambiguity can provide significantly different prosody for the two interpretations, for both lengths. Experiment 2 verified that prosodic patterns found in first-pass pronunciations predict self-paced reading patterns for silent reading. The results extend the coverage of the Implicit Prosody Hypothesis (Fodor, 1998, Journal of Psycholinguistic Research, 27, 285–319; Fodor, 2002, Prosodic disambiguation in silent reading, in M. Hirotani (Ed.), NELS 32, pp. 113–132, Amherst, MA: GLSA Publications) to another construction and to Korean. They further indicate that strong syntactic biases can have rapid effects on the formulation of implicit prosody.

4.
Rhythmic structure in speech is characterized by sequences of stressed and unstressed syllables. A large body of literature suggests that speakers of English attempt to achieve rhythmic harmony by evenly distributing stressed syllables throughout prosodic phrases. The question remains as to how speakers plan metrical structure during speech production and whether it is planned independently of phonemes. To examine this, we designed a tongue twister task consisting of disyllabic word pairs with overlapping phonological segments and either matching or non-matching metrical structure. Results showed that speakers had more difficulty producing metrically regular word pairs than irregular pairs; that is, word pairs with irregular meter yielded faster productions and fewer speech errors in this production task. This finding that metrical regularity inhibits production is inconsistent with an abstract metrical structure that is planned independently of phonemes at the point of phonological encoding.

5.
Recent research with cotton-top tamarin monkeys has revealed language discrimination abilities similar to those found in human infants, demonstrating that these perceptual abilities are not unique to humans but are also present in non-human primates. Specifically, tamarins could discriminate forward but not backward sentences of Dutch from Japanese, using both natural and synthesized utterances. The present study was designed as a conceptual replication of the work on tamarins. Results show that rats trained in a discrimination learning task readily discriminate forward, but not backward, sentences of Dutch from Japanese; the results are particularly robust for synthesized utterances, a pattern that shows greater parallels with newborns than with tamarins. Our results extend the claims made in the research with tamarins that the capacity to discriminate languages from different rhythmic classes depends on general perceptual abilities that evolved at least as far back as the rodents.

6.
Emotional inferences from speech require the integration of verbal and vocal emotional expressions. We asked whether this integration is comparable when listeners are exposed to their native language and when they listen to a language learned later in life. To this end, we presented native and non-native listeners with positive, neutral and negative words that were spoken with a happy, neutral or sad tone of voice. In two separate tasks, participants judged word valence and ignored tone of voice, or judged emotional tone of voice and ignored word valence. While native listeners outperformed non-native listeners in the word valence task, performance was comparable in the voice task. More importantly, both native and non-native listeners responded faster and more accurately when verbal and vocal emotional expressions were congruent than when they were incongruent. Given that the size of the latter effect did not differ as a function of language proficiency, one can conclude that the integration of verbal and vocal emotional expressions occurs as readily in one's second language as it does in one's native language.

7.
The purpose of the present investigation was to determine whether mothers use discernible tunes (i.e., specific interval sequences) in their speech to infants and whether such tunes are individually distinctive. Mothers were recorded speaking with their infants on two occasions separated by 1 week or more. Examination of the tunes of each mother revealed discernible tunes and frequent repetitions of tunes within and across sessions. Comparisons of utterances with the most common pitch contour (i.e., rising), both within and across mothers, revealed interval patterns that were individually distinctive, or unique. The findings confirm the prominence of tunes and the presence of signature tunes in maternal speech to infants.

8.
To successfully infer a speaker's emotional state, diverse sources of emotional information need to be decoded. The present behavioural study explored the extent to which recognition of 'basic' emotions in speech (anger, disgust, fear, happiness, pleasant surprise, sadness) differs across sex (male/female) and age (young/middle-aged) groups. Participants were asked to identify the emotional prosody of a sentence as accurately as possible. As a secondary goal, the perceptual findings were examined in relation to acoustic properties of the sentences presented. Findings indicate that emotion recognition rates differ between the emotion categories tested and that these patterns varied significantly as a function of age, but not of sex.

9.
Prosody generation in speech production
This review first describes models of prosody generation and then surveys research on prosody generation in speech production, organized by research question. Most findings converge on the following conclusions: during word production, there is an abstract prosodic structure, independent of segmental content, that encodes information such as a word's stress pattern and number of syllables; during phrase and sentence production, prosodic structure and pausing patterns are not fully determined by syntactic structure but are relatively independent of it; and the optimal unit for generating sentence prosody is the prosodic word. Brain activation associated with generating linguistic prosody shows a left-lateralized trend.

10.
From birth, newborns show a preference for faces talking a native language compared to silent faces. The present study addresses two questions that remained unanswered by previous research: (a) does familiarity with the language play a role in this process, and (b) are all the linguistic and paralinguistic cues necessary? Experiment 1 extended newborns’ preference from native speakers to non-native ones. Given that fetuses and newborns are sensitive to the prosodic characteristics of speech, Experiments 2 and 3 presented faces talking native and non-native languages with the speech stream low-pass filtered. Results showed that newborns preferred looking at a person who talked to them even when only the prosodic cues were provided, for both languages. Nonetheless, a familiarity preference for the previously talking face was observed in the “normal speech” condition (Experiment 1) and a novelty preference in the “filtered speech” condition (Experiments 2 and 3). This asymmetry reveals that newborns process these two types of stimuli differently and that they may already be sensitive to a mismatch between the articulatory movements of the face and the corresponding speech sounds.

11.
We examined whether children's ability to integrate speech and gesture follows the pattern of a broader developmental shift between 3- and 5-year-old children (Ramscar & Gitcho, 2007) regarding the ability to process two pieces of information simultaneously. In Experiment 1, 3-year-olds, 5-year-olds, and adults were presented with an iconic gesture, a spoken sentence, or a combination of the two on a computer screen, and they were instructed to select a photograph that best matched the message. The 3-year-olds did not integrate information in speech and gesture, but 5-year-olds and adults did. In Experiment 2, 3-year-old children were presented with the same speech and gesture as in Experiment 1, produced live by an experimenter. When presented live, 3-year-olds could integrate speech and gesture. We conclude that development of the integration ability is part of the broader developmental shift; however, live presentation facilitates the nascent integration ability in 3-year-olds.

12.
Mitterer, H., & Ernestus, M. (2008). Cognition, 109(1), 168–173.
This study reports a shadowing experiment, in which participants must repeat a speech stimulus as quickly as possible. We tested claims about a direct link between perception and production based on speech gestures and obtained two types of counterevidence. First, shadowing is not slowed down by a gestural mismatch between stimulus and response. Second, phonetic detail is more likely to be imitated in a shadowing task if it is phonologically relevant. This is consistent with the idea that speech perception and speech production are only loosely coupled, at an abstract phonological level.

13.
Using cross-modal form priming, we compared the use of stress and lexicality in the segmentation of spoken English by native English speakers (L1) and by native Hungarian speakers of second-language English (L2). For both language groups, lexicality was found to be an effective segmentation cue. That is, spoken disyllabic word fragments were stronger primes in a subsequent visual word recognition task when preceded by meaningful words than when preceded by nonwords: for example, the first two syllables of corridor were a more effective prime for visually presented corridor when heard in the phrase anything corri than in imoshing corri. The stress pattern of the prime (strong–weak vs. weak–strong) did not affect the degree of priming. For L1 speakers, this supports previous findings about the preferential use of high-level segmentation strategies in clear speech. For L2 speakers, the lexical strategy was employed regardless of L2 proficiency level and instead of exploiting the consistent stress pattern of their native language. This is clear evidence for the primacy and robustness of segmentation by lexical subtraction even in individuals whose lexical knowledge is limited.

14.
Sensitivity to prosodic cues might be used to constrain lexical search. Indeed, the prosodic organization of speech is such that words are invariably aligned with phrasal prosodic edges, providing a cue to segmentation. In this paper we devise an experimental paradigm that allows us to investigate the interaction between statistical and prosodic cues to extract words from a speech stream. We provide evidence that statistics over the syllables are computed independently of prosody. However, we also show that trisyllabic sequences with high transition probabilities that straddle two prosodic constituents appear not to be recognized. Taken together, our findings suggest that prosody acts as a filter, suppressing possible word-like sequences that span prosodic constituents.
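As an illustration of the statistical cue at issue, here is a minimal sketch of how forward transitional probabilities over syllables can be computed, TP(A→B) = count(AB) / count(A), and how a trisyllabic candidate can be scored by its internal transitions. The syllable stream and candidate words below are toy examples, not the authors' stimuli.

    from collections import Counter

    def transitional_probabilities(syllables):
        # Forward TPs over a syllable stream: TP(A -> B) = count(AB) / count(A).
        unigrams = Counter(syllables)
        bigrams = Counter(zip(syllables, syllables[1:]))
        return {(a, b): n / unigrams[a] for (a, b), n in bigrams.items()}

    def word_score(word, tps):
        # Score a trisyllabic candidate by the weaker of its two internal TPs.
        return min(tps.get(pair, 0.0) for pair in zip(word, word[1:]))

    # Toy stream in which 'pa-bi-ku' and 'go-la-tu' recur as cohesive "words".
    stream = "pa bi ku go la tu pa bi ku go la tu pa bi ku".split()
    tps = transitional_probabilities(stream)
    print(word_score(("pa", "bi", "ku"), tps))  # internal TPs of 1.0 -> score 1.0
    print(word_score(("ku", "go", "la"), tps))  # straddles a word boundary -> ~0.67

Note that TPs alone still assign a substantial score to the straddling sequence; the finding above is that prosodic boundaries can veto even high-TP sequences as word candidates.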

15.
Listeners’ comprehension of phrase-final rising pitch on declarative utterances, or uptalk, was examined to test the hypothesis that prolongations might differentiate conflicting functions of rising pitch. In Experiment 1 we found that listeners rated prolongations as indicating more speaker uncertainty, but that rising pitch was unrelated to these ratings. In Experiment 2 we found that prolongations interacted with rising pitch when listeners monitored for words in the subsequent utterance: words preceded by prolonged uptalk were monitored faster than words preceded by non-prolonged uptalk. In Experiment 3 we found that the interaction between rising pitch and prolongations depended on listeners’ beliefs about speakers’ mental states. The results support the theory that temporal and situational context are important in determining intonational meaning.

16.
Ozdemir, R., Roelofs, A., & Levelt, W. J. (2007). Cognition, 105(2), 457–465.
Disagreement exists about how speakers monitor their internal speech. Production-based accounts assume that self-monitoring mechanisms exist within the production system, whereas comprehension-based accounts assume that monitoring is achieved through the speech comprehension system. Comprehension-based accounts predict perception-specific effects, like the perceptual uniqueness-point effect, in the monitoring of internal speech. We ran an extensive experiment testing this prediction using internal phoneme monitoring and picture naming tasks. Our results show an effect of the perceptual uniqueness point of a word in internal phoneme monitoring in the absence of such an effect in picture naming. These results support comprehension-based accounts of the monitoring of internal speech.

17.
Young and old adults’ ability to recognize emotions from vocal expressions and music performances was compared. The stimuli consisted of (a) acted speech (anger, disgust, fear, happiness, and sadness, each posed with both weak and strong emotion intensity), (b) synthesized speech (anger, fear, happiness, and sadness), and (c) short melodies played on the electric guitar (anger, fear, happiness, and sadness, each played with both weak and strong emotion intensity). The listeners’ recognition of discrete emotions and emotion intensity was assessed, and the recognition rates were controlled for various response biases. Results showed emotion-specific age-related differences in recognition accuracy. Old adults consistently received significantly lower recognition rates for negative, but not for positive, emotions for both speech and music stimuli. Some age-related differences were also evident in the listeners’ ratings of emotion intensity. The results show the importance of considering individual emotions in studies on age-related differences in emotion recognition.

18.
Different kinds of speech sounds are used to signify possible word forms in every language. For example, lexical stress is used in Spanish (/ˈbe.be/ 'he/she drinks' versus /be.ˈbe/ 'baby'), but not in French (/ˈbe.be/ and /be.ˈbe/ both mean 'baby'). Infants learn many such native language phonetic contrasts in their first year of life, likely using a number of cues from parental speech input. One such cue could be parents’ object labeling, which can explicitly highlight relevant contrasts. Here we ask whether phonetic learning from object labeling is abstract; that is, whether learning can generalize to new phonetic contexts. We investigate this issue in the prosodic domain, as the abstraction of prosodic cues (like lexical stress) has been shown to be particularly difficult. One group of 10-month-old French learners was given consistent word labels that contrasted in lexical stress (e.g., Object A was labeled /ˈma.bu/ and Object B was labeled /ma.ˈbu/). Another group of 10-month-olds was given inconsistent word labels (i.e., mixed pairings), and stress discrimination in both groups was measured in a test phase with words made up of new syllables. Infants trained with consistently contrastive labels showed an earlier effect of discrimination compared to infants trained with inconsistent labels. Results indicate that phonetic learning from object labeling can indeed generalize, and they suggest one way infants may learn the sound properties of their native language(s).

19.
We hypothesized that chimpanzees could learn to produce attention-getting (AG) sounds via positive reinforcement. We conducted a vocal assessment in 76 captive chimpanzees for their use of AG sounds to acquire the attention of an otherwise inattentive human. Fourteen individuals that did not produce AG sounds during the vocal assessment were evaluated for their ability to acquire the use of an AG sound through operant conditioning and to employ these sounds in an attention-getting context. Nine of the 14 chimpanzees were successfully shaped using positive reinforcement to produce an AG sound. In a post-training vocal assessment, eight of the nine individuals that were successfully trained to produce AG sounds generalized the use of these newly acquired signals to communicatively relevant situations. Chimpanzees possess the ability to acquire the use of a communicative signal via operant conditioning and can generalize the use of this newly acquired signal to appropriate communicative contexts.

20.
This study examined the relationships among prosodic sensitivity, morphological awareness, and reading ability in a sample of 104 8- to 13-year-olds. Using a task adapted from Carlisle (1988, Applied Psycholinguistics, 9, 247–266), we measured children’s ability to produce morphological derivations with differing levels of phonological complexity between stem and derivation: No Change, Phonemic Change, Stress Change, and Both Phonemic and Stress Change. A 3 (Grade) × 4 (Derivation Type) analysis of variance showed that children perform significantly more poorly on both types of derivations that involve stress changes than on phonemic-change and no-change derivations. Regression analyses showed that both prosodic sensitivity and morphological awareness, especially in derivations that require manipulation of stress, are significant predictors of reading ability after controlling for age, verbal and nonverbal abilities, and phonological awareness.
