Skip to Main content Skip to Navigation
Book sections

Parallel alignment of structured documents

Laurent Romary 1 Patrice Bonhomme 1
1 LANGUE ET DIALOGUE - Human-machine dialogue with a significant language component
INRIA Lorraine, LORIA - Laboratoire Lorrain de Recherche en Informatique et ses Applications
Abstract : Classical methods for parallel text alignment consider one specific level (e.g. sentences) along which two or more versions of a text are to be synchronised. This may lead to some problems when these documents are particularly long since alignment errors at some point in the text may, in the absence of any other linguistic information, propagate for some time without any chance of recovery. In this chapter we consider how multilingual parallel alignment can be based on the fact that more and more texts are now highly structured by means of tagging languages such as SGML. In particular we will describe recent efforts in multi-level alignment for which we will present the main advances as well as some of the difficulties to be dealt with, in particular when the text and its translation are associated with different encoding schemes or different encoding practices for the same scheme.
Complete list of metadatas

Cited literature [9 references]  Display  Hide  Download

https://hal.archives-ouvertes.fr/hal-00367603
Contributor : Laurent Romary <>
Submitted on : Wednesday, March 11, 2009 - 6:01:11 PM
Last modification on : Friday, March 22, 2019 - 2:22:12 PM
Document(s) archivé(s) le : Saturday, November 26, 2016 - 6:24:25 AM

File

Parallel_alignment_of_structur...
Files produced by the author(s)

Identifiers

  • HAL Id : hal-00367603, version 1

Collections

Citation

Laurent Romary, Patrice Bonhomme. Parallel alignment of structured documents. Jean Véronis. Parallel Text Processing, Kluwer Academic Publisher, pp.233-253, 2000. ⟨hal-00367603⟩

Share

Metrics

Record views

420

Files downloads

315