Skip to Main content Skip to Navigation
Journal articles

A Model of Vocabulary Partition

Abstract : The model proposed here is used to describe the vocabulary of a corpus. It is divided into two groups: general vocabulary which is used whatever the circumstances and several local (or 'specialized') vocabularies, each of which is used in only one part of the corpus. General words may appear everywhere in the text and their increase with corpus length can be estimated with Muller's formula. In this model, a partition parameter measures the relative importance of both types of vocabularies: so the value of this parameter gives an estimation of the lexical 'specialization' in the text. This model has been applied to Racine's plays and can also be used to measure the increase of vocabulary with corpus length, to locate stylistic changes or to compare several texts from the point of view of their lexical richness.
Complete list of metadata

Cited literature [7 references]  Display  Hide  Download

https://hal.archives-ouvertes.fr/hal-00763209
Contributor : Dominique Labbé Connect in order to contact the contributor
Submitted on : Monday, December 10, 2012 - 12:11:13 PM
Last modification on : Wednesday, October 20, 2021 - 1:52:05 AM
Long-term archiving on: : Monday, March 11, 2013 - 12:20:30 PM

File

HubertLabbe1988c.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-00763209, version 1

Collections

CNRS | PACTE | UGA

Citation

Pierre Hubert, Dominique Labbé. A Model of Vocabulary Partition. Literary and Linguistic Computing, Oxford University Press (OUP), 1988, 3 (4), pp.223-225. ⟨hal-00763209⟩

Share

Metrics

Record views

447

Files downloads

502