Institut Camille Jordan
Journal article. Scientometrics. Year: 2015

The deconstruction of a text: the permanence of the generalized Zipf law- the inter-textual relationship between entropy and effort amount

Abstract

Zipf’s law has intrigued people for a long time. This distribution models a certain type of statistical regularity observed in a text. George K. Zipf showed that, if a word is characterised by its frequency, then rank and frequency are not independent and approximately satisfy the relationship:

$$\text{rank} \times \text{frequency} = \text{constant}$$

Various explanations have been advanced for this law. In this article, we discuss the Mandelbrot process, which includes two very different approaches. In the first approach, Mandelbrot studies language generation as the transmission of a signal and bases it on information theory, using the concept of entropy. In the second, geometric approach, he draws a parallel with fractal theory, where each word of the text is a sequence of characters framed by two separators, that is, a simple geometric pattern. This leads us to hypothesise that, since the observed statistical regularities have several possible explanations, Zipf’s law carries other patterns. To verify this hypothesis, we chose a text, which we modified and degraded in several successive stages. We called $T_i$ the text degraded at step $i$. We then segmented $T_i$ into words. We found that rank and frequency were not independent and approximately satisfied the relationship:

$$\text{rank}^{\beta_i} \times \text{frequency} = \text{constant} \qquad [1]$$
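The generalized relationship above can be checked on any text by taking logarithms: $\log(\text{frequency}) = \log(\text{constant}) - \beta \log(\text{rank})$, so $\beta$ is the negated slope of a straight line in log-log coordinates. The following is a minimal sketch of that check, not the authors' procedure: the abstract does not say how words were segmented or how $\beta_i$ was estimated, so the regex tokenizer, the ordinary least-squares fit, and the function names `rank_frequency` and `fit_beta` are all assumptions made for illustration.

```python
import math
import re
from collections import Counter


def rank_frequency(text):
    """Return word frequencies in decreasing order (index 0 is rank 1).

    Assumption: a word is a maximal run of Unicode letters, lower-cased.
    """
    words = re.findall(r"[^\W\d_]+", text.lower())
    return sorted(Counter(words).values(), reverse=True)


def fit_beta(frequencies):
    """Estimate (beta, constant) in rank**beta * frequency = constant.

    Assumption: ordinary least squares on
    log(frequency) = log(constant) - beta * log(rank).
    """
    n = len(frequencies)
    if n < 2:
        raise ValueError("need at least two ranked frequencies")
    xs = [math.log(rank) for rank in range(1, n + 1)]
    ys = [math.log(freq) for freq in frequencies]
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    beta = -sxy / sxx                      # negated slope of the fitted line
    constant = math.exp(mean_y + beta * mean_x)  # exp of the intercept
    return beta, constant
```

Under these assumptions, calling `fit_beta(rank_frequency(t_i))` on the string holding a degraded text $T_i$ gives one estimate of $\beta_i$. A least-squares fit on log-log data is sensitive to the long tail of words that occur only once, so the value obtained this way should be read as indicative only.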
Main file
Zipf-Decons.pdf (13.36 MB)
Origin: Files produced by the author(s)

Dates and versions

hal-01295351 , version 1 (30-03-2016)

Cite

Thierry Lafouge, Abdellatif Agouzal, Geneviève Boidin-Lallich. The deconstruction of a text: the permanence of the generalized Zipf law- the inter-textual relationship between entropy and effort amount. Scientometrics, 2015, 104 (1), pp.193-217. ⟨10.1007/s11192-015-1600-z⟩. ⟨hal-01295351⟩
330 views
279 downloads
