Building a treebank of noisy user-generated content: The French Social Media Bank - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 2012

Building a treebank of noisy user-generated content: The French Social Media Bank

Résumé

We introduce the French Social Media Bank, the first user-generated content treebank for French. Its first release contains 1,700 sentences from various Web 2.0 and social media sources (FACEBOOK, TWITTER, web forums), including data specifically chosen for their high noisiness.
Fichier non déposé

Dates et versions

hal-00780898 , version 1 (25-01-2013)

Identifiants

  • HAL Id : hal-00780898 , version 1

Citer

Djamé Seddah, Benoît Sagot, Marie Candito, Virginie Mouilleron, Vanessa Combet. Building a treebank of noisy user-generated content: The French Social Media Bank. TLT 11 - The 11th International Workshop on Treebanks and Linguistic Theories, Nov 2012, Lisbonne, Portugal. ⟨hal-00780898⟩
219 Consultations
0 Téléchargements

Partager

Gmail Facebook X LinkedIn More