Mining team characteristics to predict Wikipedia article quality

Abstract : In this study, we were interested in studying which characteristics of virtual teams are good predictors for the quality of their production. The experiment involved obtaining the Spanish Wikipedia database dump and applying different data mining techniques suitable for large data sets to label the whole set of articles according to their quality (comparing them with the Featured/Good Articles, or FA/GA). Then we created the attributes that describe the characteristics of the team who produced the articles and using decision tree methods, we obtained the most relevant characteristics of the teams that produced FA/GA. The team's maximum efficiency and the total length of contribution are the most important predictors. This article contributes to the literature on virtual team organization.
Liste complète des métadonnées

Cited literature [37 references]  Display  Hide  Download

https://hal.archives-ouvertes.fr/hal-01354368
Contributor : Bibliothèque Télécom Bretagne <>
Submitted on : Thursday, August 18, 2016 - 7:10:50 PM
Last modification on : Wednesday, March 20, 2019 - 11:50:09 AM
Document(s) archivé(s) le : Saturday, November 19, 2016 - 8:54:59 PM

File

mining-team-characteristics vf...
Files produced by the author(s)

Identifiers

  • HAL Id : hal-01354368, version 1

Citation

Grace Gimon Betancourt, Armando Segnini, Carlos Trabuco, Amira Rezgui, Nicolas Jullien. Mining team characteristics to predict Wikipedia article quality. OpenSym 2016 : 12th International Symposium on Open Collaboration, Aug 2016, Berlin, Germany. pp.1 - 9. ⟨hal-01354368⟩

Share

Metrics

Record views

523

Files downloads

849