CompiLIG at SemEval-2017 Task 1: Cross-Language Plagiarism Detection Methods for Semantic Textual Similarity

Abstract : We present the systems we submitted to Track 4 of the Semantic Textual Similarity (STS) task at SemEval-2017. Given a Spanish-English sentence pair, each system must estimate the semantic similarity of the two sentences as a score between 0 and 5. Our submission uses syntax-based, dictionary-based, context-based, and MT-based methods, which we also combine in both unsupervised and supervised ways. Our best run ranked 1st on Track 4a, with a correlation of 83.02% with human annotations.
https://hal.archives-ouvertes.fr/hal-01531330
Contributor : Jérémy Ferrero
Submitted on : Thursday, June 1, 2017 - 3:28:26 PM
Last modification on : Tuesday, February 12, 2019 - 1:31:24 AM

Identifiers

  • HAL Id : hal-01531330, version 1

Citation

Jérémy Ferrero, Laurent Besacier, Didier Schwab, Frédéric Agnès. CompiLIG at SemEval-2017 Task 1: Cross-Language Plagiarism Detection Methods for Semantic Textual Similarity. Proceedings of the 11th International Workshop on Semantic Evaluations (SemEval-2017), Aug 2017, Vancouver, Canada. ⟨hal-01531330⟩
