Journal article. The European Physical Journal B: Condensed Matter and Complex Systems. Year: 2006

The network of concepts in written texts

Abstract

Complex network theory is used to investigate the structure of meaningful concepts in written texts by individual authors. Networks are constructed after a two-phase filtering, in which words that carry little meaning are eliminated and all remaining words are reduced to their canonical form, without any number, gender or tense inflection. Each sentence in the text is added to the network as a clique. A large number of written texts have been scrutinised, and it is found that the texts have small-world as well as scale-free structures. The growth process of these networks has also been investigated, and a universal evolution of network quantifiers has been found across the set of texts written by distinct authors. Further analyses, based on shuffling procedures applied either to the texts or to the constructed networks, provide hints on the role played by the word frequency and sentence length distributions in shaping the network structure.
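
As a rough illustration of the construction summarised in the abstract, the Python sketch below filters each sentence, then links every pair of remaining words so that the sentence enters the network as a clique. It is not the authors' implementation: the names `STOP_WORDS`, `canonical`, `build_concept_network` and `clustering` are hypothetical, the stop-word list is a tiny stand-in for the first filtering phase, and `canonical` only lower-cases a word rather than performing real lemmatisation.

```python
import re
from itertools import combinations

# Assumption: a deliberately tiny stop-word list standing in for the
# first filtering phase; the filter used in the paper is not given here.
STOP_WORDS = {"the", "a", "an", "of", "and", "is", "in", "to", "on"}


def canonical(word):
    """Placeholder for the second filtering phase (canonical form).

    Assumption: only lower-cases the word; a real pipeline would
    strip number, gender and tense inflection with a lemmatiser.
    """
    return word.lower()


def build_concept_network(text):
    """Return an adjacency map {word: set of neighbours}.

    Each sentence contributes a clique: every pair of distinct
    filtered words in the sentence is joined by an edge.
    """
    graph = {}
    for sentence in re.split(r"[.!?]+", text):
        words = {canonical(w) for w in re.findall(r"[A-Za-z]+", sentence)}
        words -= STOP_WORDS
        for w in words:
            graph.setdefault(w, set())
        for u, v in combinations(sorted(words), 2):
            graph[u].add(v)
            graph[v].add(u)
    return graph


def clustering(graph, node):
    """Local clustering coefficient: share of neighbour pairs that are linked."""
    neighbours = graph[node]
    k = len(neighbours)
    if k < 2:
        return 0.0
    links = sum(1 for u, v in combinations(neighbours, 2) if v in graph[u])
    return 2.0 * links / (k * (k - 1))


sample = "The cat chased the mouse. The mouse ate cheese."
net = build_concept_network(sample)
for word in sorted(net):
    print(word, len(net[word]), round(clustering(net, word), 3))
```

In this toy text the word "mouse" appears in both sentences and therefore bridges the two cliques. Words that recur across many sentences accumulate links in this way, which is how word frequency and sentence length can influence the degree distribution and clustering of the resulting network.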

Dates and versions

hal-01024896, version 1 (16-07-2014)

Cite

Silvia M.G Caldeira, Thierry C Petit Lobão, R.F.S. Andrade, Alexis Neme, J.G.V. Miranda. The network of concepts in written texts. The European Physical Journal B: Condensed Matter and Complex Systems, 2006, 49 (4), pp.523-529. ⟨10.1140/epjb/e2006-00091-3⟩. ⟨hal-01024896⟩