Collaborative Annotation for Reliable Natural Language Processing: Technical and Sociological Aspects - Archive ouverte HAL Accéder directement au contenu
Ouvrages Année : 2016

Collaborative Annotation for Reliable Natural Language Processing: Technical and Sociological Aspects

Karën Fort

Résumé

This book presents a unique opportunity for constructing a consistent image of collaborative manual annotation for Natural Language Processing (NLP). NLP has witnessed two major evolutions in the past 25 years: firstly, the extraordinary success of machine learning, which is now, for better or for worse, overwhelmingly dominant in the field, and secondly, the multiplication of evaluation campaigns or shared tasks. Both involve manually annotated corpora, for the training and evaluation of the systems. These corpora have progressively become the hidden pillars of our domain, providing food for our hungry machine learning algorithms and reference for evaluation. Annotation is now the place where linguistics hides in NLP. However, manual annotation has largely been ignored for some time, and it has taken a while even for annotation guidelines to be recognized as essential. Although some efforts have been made lately to address some of the issues presented by manual annotation, there has still been little research done on the subject. This book aims to provide some useful insights into the subject. Manual corpus annotation is now at the heart of NLP, and is still largely unexplored. There is a need for manual annotation engineering (in the sense of a precisely formalized process), and this book aims to provide a first step towards a holistic methodology, with a global view on annotation.
Fichier principal
Vignette du fichier
CollaborativeAnnotation_ExemplaireAuteurNonCorrigé.pdf (4.29 Mo) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)

Dates et versions

hal-01324322 , version 1 (19-11-2020)

Identifiants

  • HAL Id : hal-01324322 , version 1

Citer

Karën Fort. Collaborative Annotation for Reliable Natural Language Processing: Technical and Sociological Aspects. Patrick Paroubek. Wiley-ISTE, pp.196, 2016, 978-1-84821-904-5. ⟨hal-01324322⟩
251 Consultations
471 Téléchargements

Partager

Gmail Facebook X LinkedIn More