Crowdsourcing Model for Multilingual Corpus and Knowledge Construction: The Case of Transnational Mark Twain

Amel Fraisse; Ronald Jenn; Quoc-Tan Tran

Article Dans Une Revue ZIN. Issues in Information Science. Information Studies Année : 2018

Crowdsourcing Model for Multilingual Corpus and Knowledge Construction: The Case of Transnational Mark Twain

(1) , (2) , (1)

1
2

Amel Fraisse

Fonction : Auteur
PersonId : 15486
IdHAL : amel-fraisse
ORCID : 0000-0002-8693-8862
IdRef : 146155580

Groupe d'Études et de Recherche Interdisciplinaire en Information et COmmunication - ULR 4073

Ronald Jenn

Fonction : Auteur
PersonId : 16625
IdHAL : ronald-jenn
IdRef : 080606695

Centre d'Études en Civilisations, Langues et Lettres Étrangères - ULR 4074

Quoc-Tan Tran

Fonction : Auteur
PersonId : 16889
IdHAL : quoc-tan-tran
ORCID : 0000-0003-3533-5858
IdRef : 253119081

Groupe d'Études et de Recherche Interdisciplinaire en Information et COmmunication - ULR 4073

Résumé

Purpose/Thesis: We describe a new approach that addresses key challenges to multilingual corpus by merging collective human intelligence (crowdsourcing) and automated knowledge construction and extraction methods in a symbiotic fashion. Approach/Methods: We use a crowdsourcing model to collect and annotate translations of the same literary text. Results and conclusions: The model promotes a dynamic approach to archives that increases the impact of traditional research by presenting the text from a new angle, accessible to a global public. Practical implications: The Global Huck project proposes a new paradigm to assess the contribution of crowdsourcing-based models for collection and annotation purposes. Originality/Value: Choosing the translations of a novel as a field of study is a truly transnational and multilingual collaborative endeavor allowing us to increase our capacity to collect and organize data on a broad, transnational and multilingual scale.

Mots clés

Deep mapping Under-resourced languages Parallel text processing Multilingual corpus Humanities crowdsourcing

Domaines

Sciences de l'information et de la communication Littératures

Fichier principal

01_02_Fraisse_Jenn_Tran.pdf (433.16 Ko)

Origine : Accord explicite pour ce dépôt

Quoc-Tan Tran : Connectez-vous pour contacter le contributeur

https://hal.science/hal-01868242

Soumis le : mardi 25 septembre 2018-18:42:58

Dernière modification le : mercredi 24 janvier 2024-09:54:20

Archivage à long terme le : mercredi 26 décembre 2018-12:28:50

Dates et versions

hal-01868242 , version 1 (25-09-2018)

Identifiants

HAL Id : hal-01868242 , version 1

Citer

Amel Fraisse, Ronald Jenn, Quoc-Tan Tran. Crowdsourcing Model for Multilingual Corpus and Knowledge Construction: The Case of Transnational Mark Twain. ZIN. Issues in Information Science. Information Studies, 2018, 56 (1), pp.21-32. ⟨hal-01868242⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

GERIICO CECILLE CAMPUS-AAR AAI UNIV-LILLE

159 Consultations

112 Téléchargements

Crowdsourcing Model for Multilingual Corpus and Knowledge Construction: The Case of Transnational Mark Twain

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager