Trees and after: The concept of text topology

Xuan Luong; Michel Juillard; Sylvie Mellet; Dominique Longrée

Article Dans Une Revue Literary and Linguistic Computing Année : 2007

Trees and after: The concept of text topology

(1) , (1) , (2) , (3)

1
2
3

Xuan Luong

Fonction : Auteur
PersonId : 889049

BCL, équipe Logométrie : corpus, traitements, modèles

Michel Juillard

Fonction : Auteur

BCL, équipe Logométrie : corpus, traitements, modèles

Sylvie Mellet

Fonction : Auteur
PersonId : 7609
IdHAL : sylvie-mellet
IdRef : 028264819

BCL, équipe Linguistique de l'énonciation : les connecteurs concessifs [jusqu'en 2007]

Dominique Longrée

Fonction : Auteur
PersonId : 889050

Laboratoire d'Analyse Statistique des Langues Anciennes

Résumé

The model described here relies on the key concepts of topology, i.e. neighbourhood and equivalence of shape. A linguistic object L is studied in text T by means of one or several local questions Q. The set of successive local answers is processed so as to provide a global function characterizing the textual space under scrutiny. We begin with short sequences of tenses to illustrate the way in which to explore originally Emile Benveniste's concepts of history and discourse . We then supply life-size examples of other objects selected for their heuristic value. We go on to demonstrate the model at work on the distribution of strings of finite (F) and non-finite (n) verbal forms in the LOB Corpus of English. A topological chart is produced as the synthetic image mirroring the locations of the relevant linguistic entities throughout the text. All the individual strings concatenating any number of F and n are classified in a table. Alternatively, individual full-text strings can be extracted. We then proceed to refine the notion of lexical distribution in "rafales" in a lemmatized corpus of Latin texts, the purpose being to test the stability of the distributions in individual texts of selected verbs and assess whether a verb's behaviour is related to its semantic status. The final section is devoted to other Latin texts. The use of segments of equal length makes it possible to draw up the narrative profile of each author as revealed by his handling of tenses in main clauses.

Mots clés

textual topology sequences of tenses sequences of verbal forms bursts corpus linguistics

rafales séquences de formes verbales séquences temporelles topologie textuelle linguistique de corpus

Domaines

Linguistique

Fichier principal

Trees_and_After_auteurs.pdf (232.38 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Sylvie Mellet : Connectez-vous pour contacter le contributeur

https://hal.science/hal-00555349

Soumis le : mercredi 19 janvier 2011-18:22:25

Dernière modification le : vendredi 29 mars 2024-16:00:03

Archivage à long terme le : mercredi 20 avril 2011-02:28:11

Dates et versions

hal-00555349 , version 1 (19-01-2011)

Identifiants

HAL Id : hal-00555349 , version 1

Citer

Xuan Luong, Michel Juillard, Sylvie Mellet, Dominique Longrée. Trees and after: The concept of text topology: Some applications to verb-form distributions in language corpora. Literary and Linguistic Computing, 2007, 22 (2), pp.167-186. ⟨hal-00555349⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CNRS BCL CAMPUS-AAR AAI UNIV-COTEDAZUR

101 Consultations

382 Téléchargements

Trees and after: The concept of text topology

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager