Skip to Main content Skip to Navigation
Conference papers

Search for Meaning Through the Study of Co-occurrences in Texts

Abstract : In this paper, we combine several tools used in text-mining in order to study both the lexicon and the semantic structure of a set of medieval texts. On the one hand, the study of occurrences (Principal Component Analysis, Topic Models, Self-Organizing Maps, Hierarchical Cluster Analysis) allows a wide scope of tools to extract and display information from big data. On the other hand, the study of co-occurrences (words belonging to a sentence, a paragraph) allows to keep track of the structure of each text, but is more tedious to handle and often leads to messy visualizations. Here we use the SOM algorithm to reduce the size of the data (clustering, removal of fickle information) while preserving the semantic structure ; then we can rely on classical but slower algorithms (HCA, graph representation) to purpose data visualization.
Document type :
Conference papers
Complete list of metadatas

Cited literature [13 references]  Display  Hide  Download

https://hal.archives-ouvertes.fr/hal-01519217
Contributor : Nicolas Bourgeois <>
Submitted on : Saturday, May 6, 2017 - 3:45:29 PM
Last modification on : Tuesday, January 19, 2021 - 11:08:40 AM

Files

papier_iwann_2015_revised.pdf
Files produced by the author(s)

Licence


Copyright

Identifiers

  • HAL Id : hal-01519217, version 1

Citation

Nicolas Bourgeois, Marie Cottrell, Stéphane Lamasse, Madalina Olteanu. Search for Meaning Through the Study of Co-occurrences in Texts. International Work-Conference on Artificial Neural Networks, Jun 2015, Palma de Mallorca, Spain. ⟨hal-01519217⟩

Share

Metrics

Record views

227

Files downloads

890