Paraphrastic Reformulations in Spoken Corpora
Abstract
Our work addresses the automatic detection of paraphrastic reformulation in French spoken corpora. The proposed approach is syntagmatic: it relies on specific markers and on the specificities of spoken language. Manual multi-dimensional annotation performed by two annotators provides fine-grained reference data. An automatic method is proposed to decide whether sentences contain paraphrastic relations. The results show up to 66.4% precision. Analysis of the manual annotations indicates that few paraphrastic segments show morphological modifications (inflection, derivation or compounding) and that syntactic equivalence between the segments is seldom preserved, as the segments usually belong to different syntactic categories.
Origin: Files produced by the author(s)