Worldlex: Twitter and blog word frequencies for 66 languages - Archive ouverte HAL Accéder directement au contenu
Article Dans Une Revue Behavior Research Methods Année : 2016

Worldlex: Twitter and blog word frequencies for 66 languages

Résumé

Lexical frequency is one of the strongest predictors of word processing time. The frequencies are often calculated from book-based corpora, or more recently from subtitle-based corpora. We present new frequencies based on Twitter, blog posts, or newspapers for 66 languages. We show that these frequencies predict lexical decision reaction times similar to the already existing frequencies, or even better than them. These new frequencies are freely available and may be downloaded from http://worldlex.lexique.org.

Dates et versions

hal-01435674 , version 1 (15-01-2017)

Identifiants

Citer

Manuel Gimenes, Boris New. Worldlex: Twitter and blog word frequencies for 66 languages. Behavior Research Methods, 2016, 48 (3), pp.963 - 972. ⟨10.3758/s13428-015-0621-0⟩. ⟨hal-01435674⟩
160 Consultations
0 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More