Developing a Large-Scale Lexicon for a Less-Resourced Language: General Methodology and Preliminary Experiments on Sorani Kurdish - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 2010

Developing a Large-Scale Lexicon for a Less-Resourced Language: General Methodology and Preliminary Experiments on Sorani Kurdish

Résumé

In this paper, we describe a general methodology for developing a large-scale lexicon for a less-resourced language, i.e., a language for which raw internet-based corpora and general-purpose grammars are virtually the only existing resources. We apply this methodology to the development of a morphological lexicon for Sorani Kurdish, an Iranian language mostly spoken in northern Iraq and north-western Iran. Although preliminary, our results demonstrate the relevance of this methodology

Mots clés

Domaines

Linguistique
Fichier principal
Vignette du fichier
saltmil10soralex_1.pdf (108.13 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

halshs-00751634 , version 1 (14-11-2012)

Identifiants

  • HAL Id : halshs-00751634 , version 1

Citer

Géraldine Walther, Benoît Sagot. Developing a Large-Scale Lexicon for a Less-Resourced Language: General Methodology and Preliminary Experiments on Sorani Kurdish. Proceedings of the 7th SaLTMiL Workshop on Creation and use of basic lexical resources for less-resourced languages (LREC 2010 Workshop), 2010, Valetta, Malta. ⟨halshs-00751634⟩
369 Consultations
384 Téléchargements

Partager

Gmail Facebook X LinkedIn More