Text–sign parallel corpus study to start designing an automatic translation system
Résumé
This paper presents a new project whose goal is to design an automatic system to translate from French text to Sign Language, using a symbolic approach. After stating two essential properties of Sign Language that makes such a system different from text-to-text systems in terms of internal rep-resentational models, we present the 2006 Websourd AFP news corpus we chose for our design process. It is a parallel corpus consisting of journalistic texts in French and their video translations in Sign Language. Then we present our methodology, based on separate analyses of video description and text annotation first, and a comparison second. The idea is to annotate the entities in the texts thought to trigger some recurrent signed structures, and as a start we focused on three structures emerging from the video corpus observation: comparisons, oppositions and geographic local-isations. Inspired by Guitteny's work on how to organise the signing space in an interpreting situation [7], they were chosen because they all strongly involve use of signing space, an essential notion in Sign Language with no equivalent in a written text. Using the highly-abstract model AZee [4] for representation of Sign Language rules, the ultimate goal is to build a set of translation mechanisms from annotated text to AZee operations, usable as input to a virtual signer animation system. Prospects are given to enrole theoretical frameworks capable of describing rhetorical/discourse structure representation.
Domaines
Informatique et langage [cs.CL]
Origine : Fichiers produits par l'(les) auteur(s)
Loading...