EsPal: One-stop shopping for Spanish word properties |
| |
Authors: | Andrew Duchon Manuel Perea Nuria Sebastián-Gallés Antonia Martí Manuel Carreiras |
| |
Affiliation: | 1. Basque Center on Cognition, Brain, and Language, Donostia, Spain 2. Universitat de València, Valencia, Spain 3. Universitat Pompeu Fabra, Barcelona, Spain 4. Universitat de Barcelona, Barcelona, Spain 5. IKERBASQUE. Basque Foundation for Science, Bilbao, Spain
|
| |
Abstract: | This article introduces EsPal: a Web-accessible repository containing a comprehensive set of properties of Spanish words. EsPal is based on an extensible set of data sources, beginning with a 300 million token written database and a 460 million token subtitle database. Properties available include word frequency, orthographic structure and neighborhoods, phonological structure and neighborhoods, and subjective ratings such as imageability. Subword structure properties are also available in terms of bigrams and trigrams, biphones, and bisyllables. Lemma and part-of-speech information and their corresponding frequencies are also indexed. The website enables users either to upload a set of words to receive their properties or to receive a set of words matching constraints on the properties. The properties themselves are easily extensible and will be added over time as they become available. It is freely available from the following website: http://www.bcbl.eu/databases/espal/. |
| |
Keywords: | |
本文献已被 SpringerLink 等数据库收录! |
|