Paraphrastic Reformulations in Spoken Corpora

Abstract : Our work addresses the automatic detection of paraphrastic reformulation in French spoken corpora. The proposed approach is syn-tagmatic. It is based on specific markers and the specificities of the spoken language. Manual multi-dimensional annotation performed by two annotators provides fine-grained reference data. An automatic method is proposed in order to decide whether sentences contain or not paraphras-tic relations. The obtained results show up to 66.4% precision. Analysis of the manual annotations indicates that few paraphrastic segments show morphological modifications (inflection, derivation or compounding) and that the syntactic equivalence between the segments is seldom respected, as these usually belong to different syntactic categories.
Type de document :
Article dans une revue
Advances in Natural Language Processing Lecture Notes in Computer Science, Springer, 2014, 9th International Conference on NLP, PolTAL2014, 8686, pp.425-437. 〈http://www.springer.com/fr/〉. 〈10.1007/978-3-319-10888-9_42〉
Liste complète des métadonnées

Littérature citée [40 références]  Voir  Masquer  Télécharger

https://hal.archives-ouvertes.fr/hal-01174657
Contributeur : Iris Eshkol, Eshkol-Taravella <>
Soumis le : jeudi 9 juillet 2015 - 14:24:24
Dernière modification le : mardi 3 juillet 2018 - 11:21:31
Document(s) archivé(s) le : mercredi 26 avril 2017 - 02:36:03

Fichier

para (1).pdf
Fichiers produits par l'(les) auteur(s)

Identifiants

Collections

Citation

Iris Eshkol-Taravella, Natalia Grabar. Paraphrastic Reformulations in Spoken Corpora. Advances in Natural Language Processing Lecture Notes in Computer Science, Springer, 2014, 9th International Conference on NLP, PolTAL2014, 8686, pp.425-437. 〈http://www.springer.com/fr/〉. 〈10.1007/978-3-319-10888-9_42〉. 〈hal-01174657〉

Partager

Métriques

Consultations de la notice

212

Téléchargements de fichiers

230