Efficient construction of metadata-enhanced web corpora, Proceedings of the 10th Web as Corpus Workshop, Association for Computational Linguistics, pp.7-16, 2016. ,
URL : https://hal.archives-ouvertes.fr/hal-01371704
A TEI Schema for the Representation of Computer-mediated Communication, Journal of the Text Encoding Initiative, vol.3, 2012. ,
, Rechtliche Bedingungen für die Bereitstellung eines Chat-Korpus in CLARIN-D: Ergebnisse eines Rechtsgutachtens, 2017.
Closing a Gap in the Language Resources Landscape: Groundwork and Best Practices from Projects on Computer-mediated Communication in four European Countries, Selected papers from the CLARIN Annual Conference, pp.1-18, 2016. ,
Defining the online interaction space and the TEI structure for CoMeRe corpora, 2013. ,
, , 2014.
, The CoMeRe corpus for French: structuring and annotating heterogeneous CMC genres, Journal of language Technology and Computational Linguistics, vol.29, issue.2, pp.1-30
The compilation, processing and analysis of the Janes corpus of Slovene user-generated content, Corpus de Communication Médiée par les Réseaux. Construction, structuration, analyse. Paris: L'Harmattan (Humanités numériques), pp.125-138, 2017. ,
JANES v0.4: Korpus slovenskih spletnih uporabni?kih vsebin, pp.67-99, 2016. ,
The DiDi Cor-pus of South Tyr-olean CMC Data, Proceedings of the 2nd Workshop on Natural Language Processing for Computer-Mediated Communication / Social Media (NLP4CMC2015), 2015. ,
The DiDi Corpus of South Tyrolean CMC Data: A Multilingual Corpus of Facebook Texts, Proceedings of the Third Italian Conference on Computational Linguistics (CLiC-it 2016). ceur-ws.org, 2016. ,
, Die Korpusplattform des "Digitalen Wörterbuchs der deutschen Sprache" (DWDS). Zeitschrift für germanistische Linguistik, p.45, 2017.
Compilation and Annotation of the Discourse-structured Blog Corpus for German, Proceedings of the 4th Conference on CMC and Social Media Corpora for the Humanities (cmc-corpora2016), 2016. ,
Le corpus WikiDisc, une ressource pour la caractérisation des discussions en ligne, Corpus de Communication Médiée par les Réseaux. Construction, structuration, analyse. Paris: L'Harmattan (Humanités numériques), pp.107-124, 2017. ,
Structuring a CMC corpus of political tweets in TEI: corpus features, ethics and workflow. Poster at Corpus Linguistics, 2015. ,
URL : https://hal.archives-ouvertes.fr/halshs-01176061
DEREKO-Das Deutsche Referenzkorpus. Schriftkorpora der deutschen Gegenwartssprache am Institut für Deutsche Sprache in Mannheim. Zeitschrift für germanistische Linguistik, vol.45, pp.161-170, 2017. ,
Integrating corpora of computer-mediated communication in CLARIN-D: Results from the curation project ChatCorpus2CLARIN, Proceedings of the 13th Conference on Natural Language Processing, pp.156-164, 2016. ,
Anonymisation of the Dortmund Chat Corpus 2.1, Proceedings of the 5th Conference on CMC and Social Media corpora for the Humanities, 2017. ,
Building Linguistic Corpora from Wikipedia Articles and Discussions, Journal of language Technology and Computational Linguistics, vol.29, issue.2, pp.59-82, 2014. ,
The Paris corpus, Journal of French Language Studies, vol.22, pp.7-12, 2012. ,
URL : https://hal.archives-ouvertes.fr/halshs-01350592
Wikiconflits: un corpus de discussions éditoriales conflictuelles du Wikipédia francophone, Corpus de Communication Médiée par les Réseaux. Construction, structuration, analyse. Paris: L'Harmattan (Humanités numériques), pp.19-36, 2017. ,
Good practices in the compilation of FOLK, the research and teaching corpus of spoken German, International Journal of Corpus Linguistics, vol.21, issue.3, pp.396-418, 2016. ,
Building and Annotating a Corpus of German-Language Newsgroups, Proceedings of the 2nd Workshop on Natural Language Processing for Computer-Mediated Communication / Social Media (NLP4CMC2015), pp.16-26, 2015. ,
The CLARIN Language Resource Switchboard, Proceedings of the CLARIN Annual Conference, 2016. ,
DOI : 10.1162/coli_a_00329
URL : https://doi.org/10.1162/coli_a_00329