Crowdsourcing Model for Multilingual Corpus and Knowledge Construction: The Case of Transnational Mark Twain

Abstract : Purpose/Thesis: We describe a new approach that addresses key challenges to multilingual corpus by merging collective human intelligence (crowdsourcing) and automated knowledge construction and extraction methods in a symbiotic fashion. Approach/Methods: We use a crowdsourcing model to collect and annotate translations of the same literary text. Results and conclusions: The model promotes a dynamic approach to archives that increases the impact of traditional research by presenting the text from a new angle, accessible to a global public. Practical implications: The Global Huck project proposes a new paradigm to assess the contribution of crowdsourcing-based models for collection and annotation purposes. Originality/Value: Choosing the translations of a novel as a field of study is a truly transnational and multilingual collaborative endeavor allowing us to increase our capacity to collect and organize data on a broad, transnational and multilingual scale.
Type de document :
Article dans une revue
ZIN. Issues in Information Science. Information Studies, 2018, 56 (1), pp.21-32. 〈http://www.sbp.pl/en/article/?cid=3449&prev=467〉
Liste complète des métadonnées

Littérature citée [20 références]  Voir  Masquer  Télécharger

https://hal.archives-ouvertes.fr/hal-01868242
Contributeur : Quoc-Tan Tran <>
Soumis le : mardi 25 septembre 2018 - 18:42:58
Dernière modification le : jeudi 29 novembre 2018 - 01:11:42

Fichier

01_02_Fraisse_Jenn_Tran.pdf
Accord explicite pour ce dépôt

Identifiants

  • HAL Id : hal-01868242, version 1

Collections

Citation

Amel Fraisse, Ronald Jenn, Quoc-Tan Tran. Crowdsourcing Model for Multilingual Corpus and Knowledge Construction: The Case of Transnational Mark Twain. ZIN. Issues in Information Science. Information Studies, 2018, 56 (1), pp.21-32. 〈http://www.sbp.pl/en/article/?cid=3449&prev=467〉. 〈hal-01868242〉

Partager

Métriques

Consultations de la notice

121

Téléchargements de fichiers

38