Skip to Main content Skip to Navigation
Conference papers

Mining team characteristics to predict Wikipedia article quality

Abstract : In this study, we were interested in studying which characteristics of virtual teams are good predictors for the quality of their production. The experiment involved obtaining the Spanish Wikipedia database dump and applying different data mining techniques suitable for large data sets to label the whole set of articles according to their quality (comparing them with the Featured/Good Articles, or FA/GA). Then we created the attributes that describe the characteristics of the team who produced the articles and using decision tree methods, we obtained the most relevant characteristics of the teams that produced FA/GA. The team's maximum efficiency and the total length of contribution are the most important predictors. This article contributes to the literature on virtual team organization.
Complete list of metadatas

Cited literature [37 references]  Display  Hide  Download
Contributor : Bibliothèque Télécom Bretagne <>
Submitted on : Thursday, August 18, 2016 - 7:10:50 PM
Last modification on : Wednesday, June 24, 2020 - 4:18:40 PM
Long-term archiving on: : Saturday, November 19, 2016 - 8:54:59 PM


mining-team-characteristics vf...
Files produced by the author(s)


  • HAL Id : hal-01354368, version 1


Grace Gimon Betancourt, Armando Segnini, Carlos Trabuco, Amira Rezgui, Nicolas Jullien. Mining team characteristics to predict Wikipedia article quality. OpenSym 2016 : 12th International Symposium on Open Collaboration, Aug 2016, Berlin, Germany. pp.1 - 9. ⟨hal-01354368⟩



Record views


Files downloads