Parallel alignment of structured documents

Laurent Romary 1 Patrice Bonhomme 1
1 LANGUE ET DIALOGUE - Human-machine dialogue with a significant language component
INRIA Lorraine, LORIA - Laboratoire Lorrain de Recherche en Informatique et ses Applications
Abstract : Classical methods for parallel text alignment consider one specific level (e.g. sentences) along which two or more versions of a text are to be synchronised. This may lead to some problems when these documents are particularly long since alignment errors at some point in the text may, in the absence of any other linguistic information, propagate for some time without any chance of recovery. In this chapter we consider how multilingual parallel alignment can be based on the fact that more and more texts are now highly structured by means of tagging languages such as SGML. In particular we will describe recent efforts in multi-level alignment for which we will present the main advances as well as some of the difficulties to be dealt with, in particular when the text and its translation are associated with different encoding schemes or different encoding practices for the same scheme.
Type de document :
Chapitre d'ouvrage
Jean Véronis. Parallel Text Processing, Kluwer Academic Publisher, pp.233-253, 2000
Liste complète des métadonnées

Littérature citée [9 références]  Voir  Masquer  Télécharger
Contributeur : Laurent Romary <>
Soumis le : mercredi 11 mars 2009 - 18:01:11
Dernière modification le : jeudi 11 janvier 2018 - 06:19:48
Document(s) archivé(s) le : samedi 26 novembre 2016 - 06:24:25


Fichiers produits par l'(les) auteur(s)


  • HAL Id : hal-00367603, version 1



Laurent Romary, Patrice Bonhomme. Parallel alignment of structured documents. Jean Véronis. Parallel Text Processing, Kluwer Academic Publisher, pp.233-253, 2000. 〈hal-00367603〉



Consultations de la notice


Téléchargements de fichiers