Restructuring Lemmas in a Dictionary of Serbian - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 2004

Restructuring Lemmas in a Dictionary of Serbian

Cvetana Krstev
  • Fonction : Auteur
  • PersonId : 963257
Duško Vitas
  • Fonction : Auteur
  • PersonId : 963258

Résumé

Traditionally produced lexical resources for Serbo-Croatian are not suitable for automatic processing of contemporary Serbian. More specifically, the processes of structural derivation, although very productive in Serbian, are not presented in either monolingual or bilingual dictionaries in a systematic way. The morphological e-dictionary of Serbian was initially produced on the basis of traditional resources and as such reproduces the same flaws. In order to overcome them two solutions are possible. One is to put in the e-dictionary all the lemmas produced by structural derivation, no matter whether they are recorded in traditional dictionaries and confirmed in the corpus of contemporary Serbian, and then to assemble them explicitly in a complex lemma by an appropriate lexical graph. This approach, however, implies an overproduction of lemmas and the construction of such a graph for each complex lemma. Another approach is to extrapolate the missing lemmas using morphological grammars that model specific morphological processes and that are applied only to the entries already in the dictionary, which is checked by using the lexical constraints. It is demonstrated that such morphological grammars enable a precise classification of processes of structural derivation in a way similar to the classification of inflective phenomena.

Mots clés

Fichier principal
Vignette du fichier
sdjt04-20krstev.pdf (376.11 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)

Dates et versions

hal-01108220 , version 1 (22-01-2015)

Identifiants

  • HAL Id : hal-01108220 , version 1

Citer

Cvetana Krstev, Duško Vitas. Restructuring Lemmas in a Dictionary of Serbian. Zbornik 7. mednarodne multikonference Informacijska druzba IS 2004 Jezikovne tehnologije 9-15 Oktober 2004, Ljubljana, Slovenija, 2004, Oct 2004, Ljubljana, Slovenia. ⟨hal-01108220⟩

Collections

LIGM_LINGU_INVITE
65 Consultations
121 Téléchargements

Partager

Gmail Facebook X LinkedIn More