Skip to Main content Skip to Navigation
Conference papers

Construction of a Multilingual Corpus Annotated with Translation Relations

Abstract : Translation relations, which distinguish literal translation from other translation techniques, constitute an important subject of study for human translators (Chuquet and Paillard, 1989). However , automatic processing techniques based on interlingual relations, such as machine translation or paraphrase generation exploiting translational equivalence, have not made use of these relations explicitly until now. In this work, we present a categorization of translation relations and then we annotate a parallel multilingual (English, French, Chinese) corpus of oral presentations, the TED Talks, with these relations. Our long-term objective will be to automatically detect these relations in order to integrate them as important characteristics for the search of monolingual segments in relation of equivalence (paraphrases) or of entailment. The annotated corpus resulting from our work will be made available to the community.
Complete list of metadatas

Cited literature [17 references]  Display  Hide  Download
Contributor : Yuming Zhai <>
Submitted on : Tuesday, September 4, 2018 - 4:56:22 PM
Last modification on : Monday, February 10, 2020 - 6:14:08 PM
Document(s) archivé(s) le : Wednesday, December 5, 2018 - 7:04:43 PM


Publisher files allowed on an open archive


  • HAL Id : hal-01867896, version 1


Yuming Zhai, Aurélien Max, Anne Vilnat. Construction of a Multilingual Corpus Annotated with Translation Relations. First Workshop on Linguistic Resources for Natural Language Processing, Aug 2018, Santa Fe, United States. pp.102 - 111. ⟨hal-01867896⟩



Record views


Files downloads