Skip to Main content Skip to Navigation
Book sections

Encoding Syntactic Annotation

Nancy Ide 1 Laurent Romary 2
2 LANGUE ET DIALOGUE - Human-machine dialogue with a significant language component
INRIA Lorraine, LORIA - Laboratoire Lorrain de Recherche en Informatique et ses Applications
Abstract : There is a need for a general framework for linguistic annotation that is flexible and extensible enough to accommodate different annotation types and different theoretical and practical approaches, while at the same time enabling their representation in a “pivot” format that can serve as the basis for comparative evaluation, merging, and the development of reusable editing and processing tools. To answer this need, we have developed a framework comprised of an abstract model for a variety of different annotation types (e.g., morpho-syntactic tagging, syntactic annotation, co-reference annotation, etc.), which can be instantiated in different ways depending on the annotator's approach and goals. The results have been incorporated into XCES (Ide, et al., 2000a), the XML instantiation of the Corpus Encoding Standard (Ide, 1998a,b), which provides a ready-made, standard encoding format together with a data architecture designed specifically for linguistically annotated corpora.
Document type :
Book sections
Complete list of metadata

Cited literature [18 references]  Display  Hide  Download
Contributor : Laurent Romary <>
Submitted on : Friday, June 9, 2006 - 1:12:10 PM
Last modification on : Friday, February 26, 2021 - 3:28:04 PM
Long-term archiving on: : Monday, April 5, 2010 - 10:35:44 PM


  • HAL Id : hal-00079163, version 1



Nancy Ide, Laurent Romary. Encoding Syntactic Annotation. Anne Abeillée. Treebanks: Building and Using Parsed Corpora, Kluwer Academic Publisher, pp.281-296, 2003. ⟨hal-00079163⟩



Record views


Files downloads