The MoNoPoli database - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 2021

The MoNoPoli database

Résumé

In this article we present our method to build a derivational database of French deanthroponyms, which we call MoNoPoli for Mots construits sur Noms propres de personnalités Politiques, ‘complex words based on politician proper names’. MoNoPoli contains 6,545 complex words amounting to a total of 55,030 tokens and includes almost only neologistic forms. TheWeb is the only conceivable resource for collecting them: it alone gives massive access to discourse genres that contain neologisms. To feed the database, a program automatically generates the set of all possible derived words. Generated forms are then used as queries on theWeb. Attested forms are kept with their context. This method provides a potential alternative to collect data that cannot be found elsewhere. Finally, this article describes some of the remarkable results obtained with the analysis of the deanthroponyms of MoNoPoli.
Fichier principal
Vignette du fichier
DeriMo-2021-76-85.pdf (268.54 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)

Dates et versions

hal-03374883 , version 1 (12-10-2021)

Licence

Copyright (Tous droits réservés)

Identifiants

  • HAL Id : hal-03374883 , version 1

Citer

Mathilde Huguin. The MoNoPoli database: Or how to catch Macronitis. Third International Workshop on Resources and Tools for Derivational Morphology (DeriMo 2021), Fiammetta Namer; Nabil Hathout; Stéphanie Lignon; Magda Ševčíková; Zdeněk Žabokrtský, Sep 2021, Nancy, France. pp.76-85. ⟨hal-03374883⟩
56 Consultations
45 Téléchargements

Partager

Gmail Facebook X LinkedIn More