Treelex : a subcategorization lexicon automatically extracted from a French Treebank - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 2008

Treelex : a subcategorization lexicon automatically extracted from a French Treebank

Anna Kupść
  • Fonction : Auteur
  • PersonId : 863656

Résumé

TreeLex is a subcategorization lexicon of French verbs, automatically extracted from a syntactically annotated corpus. The lexicon comprises 1362 verbs (12353 occurrences). We present not only a list of verbs with their subcategorization frames but we also estimate the number of different verb frames available in French in general. Additionally, we estimate the average number of frames per verb. After applying various factorization techniques, we obtain 58 frames for a function-based representation (on average, 1.72 frames per verb), and 160 frames for a richer representation based on function-category information (on average, 1.91 frames per verb)

Mots clés

Fichier non déposé

Dates et versions

halshs-00761801 , version 1 (06-12-2012)

Identifiants

  • HAL Id : halshs-00761801 , version 1

Citer

Anna Kupść, Anne Abeillé. Treelex : a subcategorization lexicon automatically extracted from a French Treebank. ICGL, 2008, France. ⟨halshs-00761801⟩
28 Consultations
0 Téléchargements

Partager

Gmail Facebook X LinkedIn More