Coreference Resolution for French Oral Data: Machine Learning Experiments with ANCOR

We present CROC (Coreference Resolution for Oral Corpus), the first machine learning system for coreference resolution in French. A distinctive aspect of the system is that it was trained exclusively on transcribed speech, namely ANCOR (ANaphora and Coreference in ORal corpus), the first large-scale French corpus annotated with anaphoric relations. In its current state, CROC requires pre-annotated mentions. We detail the features used by the learning algorithms and present a set of experiments with these features. The scores we obtain are close to those of state-of-the-art systems for written English.

Keywords

mention-pair model, dialogue corpus, machine learning, coreference resolution

Domains

Computation and Language [cs.CL]

Main file

18_Cicling.pdf (265.06 KB)

Origin: Files produced by the author(s)

Contributor: Jean-Yves Antoine

https://hal.science/hal-01889593

Submitted on: Sunday, October 7, 2018, 09:45:48

Last modified on: Friday, April 19, 2024, 16:18:57

Long-term archiving on: Tuesday, January 8, 2019, 12:34:02

Dates and versions

hal-01889593, version 1 (07-10-2018)

Identifiers

HAL Id: hal-01889593, version 1
DOI : 10.1007/978-3-319-75477-2_36

Cite

Adèle Désoyer, Frédéric Landragin, Isabelle Tellier, Anaïs Lefeuvre, Jean-Yves Antoine, et al. Coreference Resolution for French Oral Data: Machine Learning Experiments with ANCOR. Computational Linguistics and Intelligent Text Processing, 9623, Springer International Publishing, pp. 507-519, 2018, Lecture Notes in Computer Science, ⟨10.1007/978-3-319-75477-2_36⟩. ⟨hal-01889593⟩


Collections

ENS-PARIS UGA UNIV-TOURS CNRS UNIV-PARIS3 LATTICE MODYCO LIG LIG_TDCGE_GETALP LIBDTLN PSL USPC LIFAT INSA-GROUPE DEMOCRAT INSA-CVL UNIV-PARIS-LUMIERES ANR UNIV-PARIS-NANTERRE LIG_SIDCH
