The expansion of isms, 1820-1917: Data-driven analysis of political language in digitized newspaper collections - Archive ouverte HAL Accéder directement au contenu
Article Dans Une Revue Journal of Data Mining and Digital Humanities Année : 2020

The expansion of isms, 1820-1917: Data-driven analysis of political language in digitized newspaper collections

Résumé

Words with the suffix-ism are reductionist terms that help us navigate complex social issues by using a simple one-word label for them. On the one hand they are often associated with political ideologies, but on the other they are present in many other domains of language, especially culture, science, and religion. This has not always been the case. This paper studies isms in a historical record of digitized newspapers from 1820 to 1917 published in Finland to find out how the language of isms developed historically. We use diachronic word embeddings and affinity propagation clustering to trace how new isms entered the lexicon and how they relate to one another over time. We are able to show how they became more common and entered more and more domains. Still, the uses of isms as traditions for political action and thinking stand out in our analysis.
Fichier principal
Vignette du fichier
Marjanen, Kurunmaki, Pivovarova, Zosa_The Expansion of Isms_03.pdf (1.12 Mo) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-02491304 , version 1 (25-02-2020)
hal-02491304 , version 2 (29-05-2020)
hal-02491304 , version 3 (21-08-2020)
hal-02491304 , version 4 (22-09-2020)
hal-02491304 , version 5 (14-12-2020)

Identifiants

  • HAL Id : hal-02491304 , version 3

Citer

Jani Marjanen, Jussi Kurunmäki, Lidia Pivovarova, Elaine Zosa. The expansion of isms, 1820-1917: Data-driven analysis of political language in digitized newspaper collections. Journal of Data Mining and Digital Humanities, 2020, HistoInformatics. ⟨hal-02491304v3⟩
363 Consultations
1488 Téléchargements

Partager

Gmail Facebook X LinkedIn More