Large and noisy vs small and reliable: combining 2 types of corpora for adjective valence extraction

Cécile Fabre 1, Anna Kupsc 2, 3
3 SIGNES - Linguistic signs, grammar and meaning: computational logic for natural language
Université Sciences et Technologies - Bordeaux 1, Inria Bordeaux - Sud-Ouest, École Nationale Supérieure d'Électronique, Informatique et Radiocommunications de Bordeaux (ENSEIRB), CNRS - Centre National de la Recherche Scientifique : UMR5800
Abstract : This work investigates the possibility of combining two different types of corpora to build a valence lexicon for French adjectives. We complete adjectival frames extracted from a treebank with statistical cues computed from a large, automatically parsed corpus. The experiment shows how linguistic knowledge and large amounts of annotated data can be used in a complementary manner.
Document type :
Conference papers


https://hal.archives-ouvertes.fr/hal-00559908
Contributor : Cécile Fabre
Submitted on : Thursday, January 27, 2011 - 11:03:52 AM
Last modification on : Wednesday, July 10, 2019 - 1:33:55 AM
Long-term archiving on : Thursday, April 28, 2011 - 2:37:29 AM

File

Fabre_Kupsc_CL09.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-00559908, version 1

Citation

Cécile Fabre, Anna Kupsc. Large and noisy vs small and reliable: combining 2 types of corpora for adjective valence extraction. 5th Corpus Linguistics conference, Jul 2009, Liverpool, United Kingdom. pp.202. ⟨hal-00559908⟩
