Similar articles
 20 similar articles found; search time 62 ms
1.
In this article, we present a new lexical database for French: Lexique. In addition to classical word information such as gender, number, and grammatical category, Lexique includes a series of interesting new characteristics. First, word frequencies are based on two cues: a contemporary corpus of texts and the number of Web pages containing the word. Second, the database is split into a graphemic table with all the relevant frequencies, a table structured around lemmas (particularly interesting for the study of the inflectional family), and a table about surface frequency cues. Third, Lexique is distributed under a GNU-like license, allowing people to contribute to it. Finally, a metasearch engine, Open Lexique, has been developed so that new databases can be added very easily to the existing ones. Lexique can either be downloaded or interrogated freely from http://www.lexique.org.

2.
Researchers often require subjects to make judgments that call upon their knowledge of the orthographic structure of English words. Such knowledge is relevant in experiments on, for example, reading, lexical decision, and anagram solution. One common measure of orthographic structure is the sum of the frequencies of consecutive bigrams in the word. Traditionally, researchers have relied on token-based norms of bigram frequencies. These norms confound bigram frequency with word frequency because each instance (i.e., token) of a particular word in a corpus of running text increments the frequencies of the bigrams that it contains. In this article, the authors report a set of type-based bigram frequencies in which each word (i.e., type) contributes only once, thereby unconfounding bigram frequency from word frequency. The authors show that type-based bigram frequency is a better predictor of the difficulty of anagram solution than is token-based frequency. These norms can be downloaded from www.psychonomic.org/archive/.
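
The difference between token-based and type-based bigram counts can be illustrated with a minimal Python sketch. The toy corpus and the function names (bigrams, bigram_counts, summed_bigram_frequency) are hypothetical and chosen only for illustration; this is not the authors' procedure, and it assumes that a bigram occurring twice inside one word is counted twice for that word.

```python
from collections import Counter


def bigrams(word):
    """Return the consecutive letter pairs of a word, e.g. 'the' -> ['th', 'he']."""
    return [word[i:i + 2] for i in range(len(word) - 1)]


def bigram_counts(tokens, type_based):
    """Count bigrams over a corpus given as a list of word tokens.

    Token-based: every occurrence of a word adds to its bigrams.
    Type-based: each distinct word contributes only once.
    """
    words = set(tokens) if type_based else tokens
    counts = Counter()
    for word in words:
        counts.update(bigrams(word))
    return counts


def summed_bigram_frequency(word, counts):
    """Sum the frequencies of the consecutive bigrams in a word."""
    return sum(counts[b] for b in bigrams(word))


# Hypothetical toy corpus: 'the' is frequent, so it inflates token-based counts.
corpus = ["the", "the", "the", "then", "hen", "ten"]
token_counts = bigram_counts(corpus, type_based=False)
type_counts = bigram_counts(corpus, type_based=True)

print(summed_bigram_frequency("then", token_counts))
print(summed_bigram_frequency("then", type_counts))
```

In the toy corpus the repeated word "the" raises the token-based counts for "th" and "he" but leaves the type-based counts unaffected, which is the confound the abstract describes.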

3.
This article presents MANULEX, a Web-accessible database that provides grade-level word frequency lists of nonlemmatized and lemmatized words (48,886 and 23,812 entries, respectively) computed from the 1.9 million words taken from 54 French elementary school readers. Word frequencies are provided for four levels: first grade (G1), second grade (G2), third to fifth grades (G3-5), and all grades (G1-5). The frequencies were computed following the methods described by Carroll, Davies, and Richman (1971) and Zeno, Ivenz, Millard, and Duvvuri (1995), with four statistics at each level (F, overall word frequency; D, index of dispersion across the selected readers; U, estimated frequency per million words; and SFI, standard frequency index). The database also provides the number of letters in the word and syntactic category information. MANULEX is intended to be a useful tool for studying language development through the selection of stimuli based on precise frequency norms. Researchers in artificial intelligence can also use it as a source of information on natural language processing to simulate written language acquisition in children. Finally, it may serve an educational purpose by providing basic vocabulary lists.
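
As a rough illustration of how SFI relates to U, the sketch below uses the commonly cited form SFI = 10 * (log10(U) + 4). The abstract itself does not give the formula, so treat it as an assumption to be checked against Carroll, Davies, and Richman (1971); the function name is hypothetical.

```python
import math


def standard_frequency_index(u_per_million):
    """Convert U (estimated frequency per million words) to SFI.

    Assumes the commonly cited form SFI = 10 * (log10(U) + 4);
    this formula is not stated in the abstract.
    """
    if u_per_million <= 0:
        raise ValueError("U must be positive")
    return 10 * (math.log10(u_per_million) + 4)


# Under this form, a word occurring once per million words has an SFI of 40,
# and each tenfold increase in U adds 10 points.
print(standard_frequency_index(1))
print(standard_frequency_index(100))
```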

4.
A data set is described that includes eight variables gathered for 13 common superordinate natural language categories and a representative set of 338 exemplars in Dutch. The category set contains 6 animal categories (reptiles, amphibians, mammals, birds, fish, and insects), 3 artifact categories (musical instruments, tools, and vehicles), 2 borderline artifact-natural-kind categories (vegetables and fruit), and 2 activity categories (sports and professions). In an exemplar and a feature generation task for the category nouns, frequency data were collected. For each of the 13 categories, a representative sample of 5–30 exemplars was selected. For all exemplars, feature generation frequencies, typicality ratings, pairwise similarity ratings, age-of-acquisition ratings, word frequencies, and word associations were gathered. Reliability estimates and some additional measures are presented. The full set of these norms is available in Excel format at the Psychonomic Society Web archive, www.psychonomic.org/archive/.

5.
English‐learning 7.5‐month‐olds are heavily biased to perceive stressed syllables as word onsets. By 11 months, however, infants begin segmenting non‐initially stressed words from speech. Using the same artificial language methodology as Johnson and Jusczyk (2001), we explored the possibility that the emergence of this ability is linked to a decreased reliance on prosodic cues to word boundaries accompanied by an increased reliance on syllable distribution cues. In a baseline study, where only statistical cues to word boundaries were present, infants exhibited a familiarity preference for statistical words. When conflicting stress cues were added to the speech stream, infants exhibited a familiarity preference for stress as opposed to statistical words. This was interpreted as evidence that 11‐month‐olds weight stress cues to word boundaries more heavily than statistical cues. Experiment 2 further investigated these results with a language containing convergent cues to word boundaries. The results of Experiment 2 were not conclusive. A third experiment using new stimuli and a different experimental design supported the conclusion that 11‐month‐olds rely more heavily on prosodic than statistical cues to word boundaries. We conclude that the emergence of the ability to segment non‐initially stressed words from speech is not likely to be tied to an increased reliance on syllable distribution cues relative to stress cues, but instead may emerge due to an increased reliance on and integration of a broad array of segmentation cues.

6.
In two experiments, subjects were given arbitrary letter cues or meaningful word cues that specified the task to be performed on a subsequent target stimulus. Letter and word cues were presented in separate blocks. There were two cues of each type for each task. Three kinds of transitions separated tasks: cue repetitions, in which both the cue and the task repeated; task repetitions, in which the cue changed but the task repeated; and task alternations, in which both the cue and the task changed. Responses were faster for cue than for task repetitions for both cue types. With word cues, task repetitions were not reliably faster than task alternations. With letter cues, task repetitions were reliably faster than task alternations in the first block but not in the second block. The results suggest that subjects responded to the compound of the cue and the target rather than switching task set between trials.

7.
In this article, we present Procura-PALavras (P-PAL), a Web-based interface for a new European Portuguese (EP) lexical database. Based on a contemporary printed corpus of over 227 million words, P-PAL provides a broad range of word attributes and statistics, including several measures of word frequency (e.g., raw counts, per-million word frequency, logarithmic Zipf scale), morpho-syntactic information (e.g., parts of speech [PoSs], grammatical gender and number, dominant PoS, and frequency and relative frequency of the dominant PoS), as well as several lexical and sublexical orthographic (e.g., number of letters; consonant–vowel orthographic structure; density and frequency of orthographic neighbors; orthographic Levenshtein distance; orthographic uniqueness point; orthographic syllabification; and trigram, bigram, and letter type and token frequencies), and phonological measures (e.g., pronunciation, number of phonemes, stress, density and frequency of phonological neighbors, transposed and phonographic neighbors, syllabification, and biphone and phone type and token frequencies) for ~53,000 lemmatized and ~208,000 nonlemmatized EP word forms. To obtain these metrics, researchers can choose between two word queries in the application: (i) analyze words previously selected for specific attributes and/or lexical and sublexical characteristics, or (ii) generate word lists that meet word requirements defined by the user in the menu of analyses. For the measures it provides and the flexibility it allows, P-PAL will be a key resource to support research in all cognitive areas that use EP verbal stimuli. P-PAL is freely available at http://p-pal.di.uminho.pt/tools.
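
The three frequency measures named in this abstract (raw counts, per-million frequency, Zipf scale) are related by simple arithmetic. The sketch below assumes the usual definition of the Zipf scale as log10 of the frequency per billion words, i.e. log10(frequency per million) + 3, without the smoothing that published norms often apply. The corpus size of exactly 227,000,000 and the function names are illustrative assumptions, not values or code from P-PAL.

```python
import math


def per_million(raw_count, corpus_size):
    """Convert a raw corpus count to a frequency per million words."""
    return raw_count * 1_000_000 / corpus_size


def zipf(raw_count, corpus_size):
    """Zipf value as log10(frequency per million) + 3 (unsmoothed; an assumption)."""
    if raw_count <= 0:
        raise ValueError("raw_count must be positive for an unsmoothed Zipf value")
    return math.log10(per_million(raw_count, corpus_size)) + 3


# Hypothetical corpus size, rounded from the "over 227 million words" in the abstract.
CORPUS_SIZE = 227_000_000

print(per_million(227, CORPUS_SIZE))
print(zipf(227, CORPUS_SIZE))
print(zipf(22_700, CORPUS_SIZE))
```

On this scale a word occurring once per million words sits at 3, and every tenfold increase in frequency adds 1.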

8.
On the basis of calculations using the latest lexical database produced by Amano and Kondo (2000), the fourth edition of a Web-accessible database of characteristics of the 1,945 basic Japanese kanji was produced by including the mathematical concepts of entropy, redundancy, and symmetry and by replacing selected indexes found in previous editions (Tamaoka, Kirsner, Yanase, Miyaoka, & Kawakami, 2002). The kanji database in the fourth edition introduces seven new figures for kanji characteristics: (1) printed frequency, (2) lexical productivity, (3) accumulative lexical productivity, (4) symmetry for lexical productivity, (5) entropy, (6) redundancy, and (7) numbers of meanings for On-readings and Kun-readings. The file of the fourth edition of the kanji database may be downloaded from the Psychonomic Society Web archive, http://www.psychonomics.org/archive/.
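
The abstract does not say how entropy and redundancy are computed for each kanji. As a generic illustration only, the sketch below uses the standard Shannon definitions: entropy H = -sum(p * log2(p)) over a frequency distribution, and redundancy R = 1 - H / log2(n) for n nonzero outcomes. The input counts and function names are hypothetical, and the database's own definitions may differ.

```python
import math


def entropy(counts):
    """Shannon entropy in bits of a distribution given as raw counts."""
    total = sum(counts)
    probs = [c / total for c in counts if c > 0]
    return -sum(p * math.log2(p) for p in probs)


def redundancy(counts):
    """Redundancy as 1 - H / H_max, where H_max = log2(number of nonzero outcomes).

    With fewer than two nonzero outcomes H_max is 0 and the ratio is undefined;
    this sketch returns 0.0 in that case (a convention chosen here, an assumption).
    """
    n = sum(1 for c in counts if c > 0)
    if n < 2:
        return 0.0
    return 1 - entropy(counts) / math.log2(n)


# Hypothetical frequencies of the words in which one character appears.
uniform = [1, 1, 1, 1]   # evenly spread: maximal entropy, zero redundancy
skewed = [2, 1, 1]       # one word dominates: lower entropy, some redundancy

print(entropy(uniform), redundancy(uniform))
print(entropy(skewed), redundancy(skewed))
```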

9.
In this article, normative data on the familiarity and difficulty of 196 single-solution Spanish word fragments are presented. The database includes the following indices: difficulty, familiarity, frequency, number of meanings, number of letters given in the fragment, first and/or last letters given, and ratio of letters to blanks. A factor analysis was performed on difficulty, and two factors were obtained. Frequency, familiarity, and number of meanings loaded highly on the first factor, which we consider to measure lexical processes, whereas number of letters in the fragment, first and/or last letters given, and ratio of letters to blanks loaded highly on the second factor, which we judge to be determined by perceptual information. Regression analyses using factor scores as predictors showed that both factors accounted for a significant part of the completion probability scores. The full set of these norms may be downloaded from the Psychonomic Society Web archive at www.psychonomic.org/archive/.

10.
This work presents the results of a masked lexical decision experiment in which we explore the morphological parsing of Spanish suffixed or pseudosuffixed words through the suffix priming effect. Priming the bases or pseudobases with their suffixed or pseudosuffixed forms is the standard process in experiments aimed at understanding the processes underlying morphological parsing in visual word recognition with masked priming lexical decision (e.g., darkness–DARK; corner–CORN). We, however, compare the effect of suffix priming on the lexical decision of suffixed (ero–JORNALERO) and pseudosuffixed words (ero–CORDERO), as well as the effect of orthographic priming on nonsuffixed words (eba–PRUEBA). The results show that in the case of suffixed and pseudosuffixed words, related primes (ero–JORNALERO; ero–CORDERO) significantly accelerated response latencies in comparison to unrelated primes (ista–JORNALERO; ura–CORDERO), while for simple words there was no facilitation from the orthographically related prime in comparison to the unrelated prime (eba–PRUEBA; afo–PRUEBA). These results are consistent with the so-called morpho-orthographic segmentation process in the course of visual word recognition, which might also be independent of orthographic and purely semantic factors. Our results also support the view that morphological parsing takes place regardless of whether a stem is present in a word. These results complement findings from studies dealing with CORNER- and BROTHEL-like stimuli.

11.
WordNet, an electronic dictionary (or lexical database), is a valuable resource for computational and cognitive scientists. Recent work on the computing of semantic distances among nodes (synsets) in WordNet has made it possible to build a large database of semantic distances for use in selecting word pairs for psychological research. The database now contains nearly 50,000 pairs of words that have values for semantic distance, associative strength, and similarity based on co-occurrence. Semantic distance was found to correlate weakly with these other measures but to correlate more strongly with another measure of semantic relatedness, featural similarity. Hierarchical clustering analysis suggested that the knowledge structure underlying semantic distance is similar in gross form to that underlying featural similarity. In experiments in which semantic similarity ratings were used, human participants were able to discriminate semantic distance. Thus, semantic distance as derived from WordNet appears distinct from other measures of word pair relatedness and is psychologically functional. This database may be downloaded from www.psychonomic.org/archive/.

12.
The ability of second- and fifth-grade children and college adults to use “extra-list” cues to retrieve episodic information from memory was examined in four experiments. The acquisition events were word triplets characterized by category relations (i.e., Rose-Tulip-Lily), theme cooccurrence relations (i.e., Bunny-Jelly Beans-Lily), or no relevant conceptual relation (i.e., Wood-Brick-Lily) among the triplet members, and event classification was ensured by means of an acquisition orienting question. Cued recall for the target (i.e., Lily) was assessed for extra-list Class Name cues (Flower or Easter) in Experiments 1 and 3, and Class Associate cues in Experiments 2 and 4. The classifications concerned superordinate category versus theme cooccurrence relations in Experiments 1 and 2, subordinate versus superordinate category relations in Experiment 3, and prototypical versus nonprototypical associates in Experiment 4. The results showed that effective cue use varied strongly with both the “match” of cue and event class information, and with the associative structure of permanent memory. Probable differences in this structure seem to contribute to both inherent and developmental differences in using cues to retrieve theme or category episodic information.

13.
We describe a set of pictorial and auditory stimuli that we have developed for use in word learning tasks in which the participant learns pairings of novel auditory sound patterns (names) with pictorial depictions of novel objects (referents). The pictorial referents are drawings of “space aliens,” consisting of images that are variants of 144 different aliens. The auditory names are possible nonwords of English; the stimulus set consists of over 2,500 nonword stimuli recorded in a single voice, with controlled onsets, varying from one to seven syllables in length. The pictorial and nonword stimuli can also serve as independent stimulus sets for purposes other than word learning. The full set of these stimuli may be downloaded from www.psychonomic.org/archive/.

14.
The sense of smell has made a recent return to the forefront of research on episodic memory. Odour context cues can reactivate recently encoded memories during sleep-dependent memory consolidation [e.g., Rasch, B., Buchel, C., Gais, S., & Born, J. (2007). Odor cues during slow-wave sleep prompt declarative memory consolidation. Science, 315, 1426–1429], and reinstating the odour experienced during encoding at test results in superior recall and recognition [e.g., Isarida, T., Sakai, T., Kubota, T., Koga, M., Katayama, Y., & Isarida, T. K. (2014). Odor-context effects in free recall after a short retention interval: A new methodology for controlling adaptation. Memory & Cognition, 42, 421–433]. However, whether the impact of odour cues is restricted to the specific memories studied in the presence of the odour, or whether reinstating the odour also cues unstudied memories that are semantically related to the studied memories (i.e., false memories) is unknown. We used the Deese-Roediger-McDermott false memory paradigm to quantify the impact of odour cues on both veridical memory and false memory. Reinstating the odour presented during the study of the DRM word lists at the test phase resulted in better free recall of the studied words, but had no statistically significant impact on the number of false memories produced. We argue that odour cues influence recall of the memories they co-occurred with during study but potentially not semantically related memories.

15.
The LEXIN database offers psycholinguistic indexes of the 13,184 different words (types) computed from 178,839 occurrences of these words (tokens) contained in a corpus of 134 beginning readers widely used in Spain. This database provides four statistical indicators: F (overall word frequency), D (index of dispersion across selected readers), U (estimated frequency per million words), and SFI (standard frequency index). It also gives information about the number of letters, syntactic category, and syllabic structure of the words included. To facilitate comparisons, LEXIN provides data from LEXESP’s (Sebastián-Gallés, Martí, Cuetos, & Carreiras, 2000), Alameda and Cuetos’s (1995), and Martínez and García’s (2004) Spanish adult psycholinguistic frequency databases. Access to the LEXIN database is facilitated by a computer program. The LEXIN program allows for the creation of word lists by letting the user specify searching criteria. LEXIN can be useful for researchers in cognitive psychology, particularly in the areas of psycholinguistics and education.

16.
Previous work examining prosodic cues in online spoken-word recognition has focused primarily on local cues to word identity. However, recent studies have suggested that utterance-level prosodic patterns can also influence the interpretation of subsequent sequences of lexically ambiguous syllables (Dilley, Mattys, & Vinke, Journal of Memory and Language, 63:274–294, 2010; Dilley & McAuley, Journal of Memory and Language, 59:294–311, 2008). To test the hypothesis that these distal prosody effects are based on expectations about the organization of upcoming material, we conducted a visual-world experiment. We examined fixations to competing alternatives such as pan and panda upon hearing the target word panda in utterances in which the acoustic properties of the preceding sentence material had been manipulated. The proportions of fixations to the monosyllabic competitor were higher beginning 200 ms after target word onset when the preceding prosody supported a prosodic constituent boundary following pan-, rather than following panda. These findings support the hypothesis that expectations based on perceived prosodic patterns in the distal context influence lexical segmentation and word recognition.

17.

A series of four experiments was conducted to determine whether English-learning infants can use allophonic cues to word boundaries to segment words from fluent speech. Infants were familiarized with a pair of two-syllable items, such as nitrates and night rates, and then were tested on their ability to detect these same words in fluent speech passages. The presence of allophonic cues to word boundaries did not help 9-month-olds to distinguish one of the familiarized words from an acoustically similar foil. Infants familiarized with nitrates were just as likely to listen to a passage about night rates as they were to listen to one about nitrates. Nevertheless, when the passages contained distributional cues that favored the extraction of the familiarized targets, 9-month-olds were able to segment these items from fluent speech. By the age of 10.5 months, infants were able to rely solely on allophonic cues to locate the familiarized target words in passages. We consider what implications these findings have for understanding how word segmentation skills develop.


18.
To investigate the possibility that knowledge of two languages influences the nature of semantic representations, bilinguals and monolinguals were compared in a word association task. In Experiment 1, bilinguals produced less typical responses relative to monolinguals when given cues with a very common associate (e.g., given bride, bilinguals said “dress” instead of “groom”). In Experiment 2, bilinguals produced responses as typical as those of monolinguals when given cues with high-frequency associates, but not when given cues with low-frequency associates. Bilinguals’ responses were also affected, to a certain extent, by the cognate status of the stimulus word pairs: They were more similar to monolinguals’ responses when the cue and its strongest associate were both cognates (e.g., minute-second is minuto-segundo in Spanish), as opposed to both being noncognates. Experiment 3 confirmed the presence of a robust frequency effect on bilingual but not on monolingual association responses. These findings imply a lexical locus for the bilingual effect on association responses and reveal the association task to be not quite as purely semantic as was previously assumed.

19.
When children acquire language, they often learn words in the absence of direct instruction (e.g. ‘This is a ball!’) or even social cues to reference (e.g. eye gaze, pointing). However, there are few accounts of how children do this, especially in cases where the referent of a new word is ambiguous. Across two experiments, we test whether preschoolers (2‐ to 4‐year‐olds; n = 239) can learn new words by inferring the referent of a new word from the surrounding linguistic discourse. We show that children as young as 2 can learn a new word from the linguistic discourse in which it appears. This suggests that children use the linguistic discourse in which a word appears to learn new words.

20.
Visual cues to the individual segments of speech and to sentence prosody guide speech recognition. The present study tested whether visual suprasegmental cues to the stress patterns of words can also constrain recognition. Dutch listeners use acoustic suprasegmental cues to lexical stress (changes in duration, amplitude, and pitch) in spoken-word recognition. We asked here whether they can also use visual suprasegmental cues. In two categorization experiments, Dutch participants saw a speaker say fragments of word pairs that were segmentally identical but differed in their stress realization (e.g., ˈca-vi from cavia “guinea pig” vs. ˌka-vi from kaviaar “caviar”). Participants were able to distinguish between these pairs from seeing a speaker alone. Only the presence of primary stress in the fragment, not its absence, was informative. Participants were able to distinguish visually primary from secondary stress on first syllables, but only when the fragment-bearing target word carried phrase-level emphasis. Furthermore, participants distinguished fragments with primary stress on their second syllable from those with secondary stress on their first syllable (e.g., pro-ˈjec from projector “projector” vs. ˌpro-jec from projectiel “projectile”), independently of phrase-level emphasis. Seeing a speaker thus contributes to spoken-word recognition by providing suprasegmental information about the presence of primary lexical stress.
