Paraphrastic Reformulations in Spoken Corpora

Abstract : Our work addresses the automatic detection of paraphrastic reformulations in French spoken corpora. The proposed approach is syntagmatic: it relies on specific markers and on the specificities of spoken language. Manual multi-dimensional annotation performed by two annotators provides fine-grained reference data. An automatic method is proposed to decide whether or not sentences contain paraphrastic relations. The results obtained show up to 66.4% precision. Analysis of the manual annotations indicates that few paraphrastic segments show morphological modifications (inflection, derivation or compounding) and that syntactic equivalence between the segments is seldom respected, as the segments usually belong to different syntactic categories.
Document type :
Journal articles

Cited literature [40 references]

https://hal.archives-ouvertes.fr/hal-01174657
Contributor : Iris Eshkol-Taravella
Submitted on : Thursday, July 9, 2015 - 2:24:24 PM
Last modification on : Tuesday, July 3, 2018 - 11:21:31 AM
Document(s) archived on : Wednesday, April 26, 2017 - 2:36:03 AM

File

para (1).pdf
Files produced by the author(s)

Identifiers

HAL Id : hal-01174657
DOI : 10.1007/978-3-319-10888-9_42

Citation

Iris Eshkol-Taravella, Natalia Grabar. Paraphrastic Reformulations in Spoken Corpora. Advances in Natural Language Processing, Lecture Notes in Computer Science, Springer, 2014, 9th International Conference on NLP, PolTAL2014, 8686, pp. 425-437. ⟨http://www.springer.com/fr/⟩. ⟨10.1007/978-3-319-10888-9_42⟩. ⟨hal-01174657⟩
