Book sections

Un modèle de partition du vocabulaire (A partition model of vocabulary)

Abstract: The model proposed here describes the vocabulary of a corpus. This vocabulary is divided into two groups: a general vocabulary, which is used whatever the circumstances, and several local (or 'specialised') vocabularies, each of which is used in only one part of the corpus. General words may appear anywhere in the text, and their increase with corpus size can be estimated with Muller's formula. A partition parameter measures the relative importance of the two types of vocabulary, so its value gives an estimate of the lexical 'specialisation' of the text. The model has been applied to Racine's plays and to TV debates (Giscard vs Mitterrand, Chirac vs Fabius). It can also be used to measure the increase of vocabulary with corpus length, to locate stylistic changes, or to compare several texts in terms of their lexical richness.
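
The abstract does not state the formulas themselves, so the following Python sketch is only one plausible reading of the model, not the authors' implementation. It assumes the binomial form of Muller's formula for the general vocabulary and, as a further assumption, linear growth for the specialised vocabularies. The function names and the symbols `u` (fraction of the corpus read so far) and `p` (partition parameter, the share of specialised vocabulary) are illustrative choices.

```python
from collections import Counter


def frequency_spectrum(tokens):
    """Map each frequency i to V_i, the number of distinct words seen exactly i times."""
    word_counts = Counter(tokens)
    return Counter(word_counts.values())


def muller_expected_vocabulary(spectrum, u):
    """Expected number of distinct words in a random excerpt covering a
    fraction u of the corpus (assumed binomial form of Muller's formula):
    V - sum_i V_i * (1 - u)^i
    """
    total_types = sum(spectrum.values())
    missing = sum(v_i * (1.0 - u) ** i for i, v_i in spectrum.items())
    return total_types - missing


def partition_expected_vocabulary(spectrum, u, p):
    """Expected vocabulary under the assumed partition model.

    A share p of the vocabulary is treated as specialised and assumed to
    grow linearly with u; the remaining share (1 - p) is treated as general
    and follows the Muller estimate above.
    """
    total_types = sum(spectrum.values())
    specialised = p * u * total_types
    general = (1.0 - p) * muller_expected_vocabulary(spectrum, u)
    return specialised + general


# Toy corpus: "a" occurs twice, "b" and "c" once each.
spectrum = frequency_spectrum("a a b c".split())
halfway_general_only = partition_expected_vocabulary(spectrum, u=0.5, p=0.0)
halfway_mixed = partition_expected_vocabulary(spectrum, u=0.5, p=0.5)
```

Under these assumptions, a larger `p` pushes the vocabulary growth curve toward a straight line, which is one way to read the abstract's claim that the parameter estimates lexical specialisation.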

Contributor: Dominique Labbé
Submitted on: Wednesday, November 28, 2012 - 8:59:14 AM
Last modification on: Wednesday, November 17, 2021 - 12:31:17 PM
Long-term archiving on: Saturday, December 17, 2016 - 4:21:16 PM




  • HAL Id : hal-00758061, version 1


Pierre Hubert, Dominique Labbé. Un modèle de partition du vocabulaire. Dominique Labbé, Philippe Thoiron, Daniel Serant. Etudes sur la richesse et la structure lexicales, Slatkine-Champion, pp.93-114, 1988. ⟨hal-00758061⟩


