Un modèle générique d'organisation de corpus en ligne : application à la FReeBank

Abstract: The few French resources available for evaluating linguistic models or algorithms at linguistic levels other than morpho-syntax are either insufficient, both quantitatively and qualitatively, or not freely accessible. In response, the FREEBANK project aims to create French corpora built from the manually revised output of a hybrid Constraint Grammar parser and annotated at several linguistic levels (structure, morpho-syntax, syntax, coreference), with the goal of making them available online for research purposes. To this end, we focus on the use of standard annotation schemes, the integration of existing resources, and a form of maintenance that allows continuous enrichment of the annotations. Before presenting the prototype that has been implemented, this paper describes a generic model for the organization and deployment of a linguistic resource archive, in line with the work currently under way in international standardization initiatives (TEI and ISO/TC 37/SC 4).
Document type:
Journal article
Traitement Automatique des Langues, ATALA, 2006, 45, pp.145-169

https://hal.archives-ouvertes.fr/hal-00110970
Contributor: Susanne Alt
Submitted on: Monday, 6 November 2006 - 15:18:00
Last modified on: Thursday, 11 January 2018 - 06:24:27
Document(s) archived on: Tuesday, 6 April 2010 - 21:25:41

Citation

Susanne Salmon-Alt, Laurent Romary, Jean-Marie Pierrel. Un modèle générique d'organisation de corpus en ligne : application à la FReeBank. Traitement Automatique des Langues, ATALA, 2006, 45, pp.145-169. 〈hal-00110970〉
