How to explore conflicts in French Wikipedia talk pages?

Abstract : With the exponential development of the Internet, new discourse genres and situations have expanded. These new web genres, which are still little described, are complex objects challenging our methodologies and our analysis tools: the encyclopedic project Wikipedia is one of these new objects which are part of Computer-mediated communication (CMC). The present article concentrates on the exploration of conflicts in Wikipedia talk pages, using Hyperbase Web. Wikipedia data and CMC corpora have been little studied by French linguistics so far, and are still challenging text statistics, notably because of the complexity of such data (multiple annotations, consistent metadata, references between postings and user networks). Based on the Wikiconflits corpus, which is already available and freely usable by researchers, we will propose some methodological avenues to explore Wikipedia data and CMC corpora.
Document type :
Conference papers
Liste complète des métadonnées

Cited literature [16 references]  Display  Hide  Download

https://hal.archives-ouvertes.fr/hal-01359416
Contributor : Laurent Vanni <>
Submitted on : Friday, September 2, 2016 - 12:09:38 PM
Last modification on : Thursday, February 7, 2019 - 4:18:54 PM

File

78404.pdf
Publisher files allowed on an open archive

Identifiers

  • HAL Id : hal-01359416, version 1

Collections

Citation

Céline Poudat, Laurent Vanni, Natalia Grabar. How to explore conflicts in French Wikipedia talk pages?. Statistics Analysis of Textual Data, Jun 2016, Nice, France. pp.645-656. ⟨hal-01359416⟩

Share

Metrics

Record views

250

Files downloads

252