首页 | 本学科首页   官方微博 | 高级检索  
     


Aralex: A lexical database for Modern Standard Arabic
Authors:Sami Boudelaa  William D. Marslen-Wilson
Affiliation:(1) School of Education, Haddad Center for Research in Dyslexia, Bar-Ilan University, Ramat-Gan, 52900, Israel;(2) Psychology Department, Gonda Brain Research Center, Kinneret College, Bar-Ilan University, Ramat-Gan, 52900, Israel
Abstract:In this article, we present a new lexical database for Modern Standard Arabic: Aralex. Based on a contemporary text corpus of 40 million words, Aralex provides information about (1) the token frequencies of roots and word patterns, (2) the type frequency, or family size, of roots and word patterns, and (3) the frequency of bigrams, trigrams in orthographic forms, roots, and word patterns. Aralex will be a useful tool for studying the cognitive processing of Arabic through the selection of stimuli on the basis of precise frequency counts. Researchers can use it as a source of information on natural language processing, and it may serve an educational purpose by providing basic vocabulary lists. Aralex is distributed under a GNU-like license, allowing people to interrogate it freely online or to download it from www.mrc-cbu.cam.ac.uk:8081/aralex .online/login.jsp.
Keywords:
本文献已被 SpringerLink 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号