Beautiful Trees on Unstable Ground: Notes on the Data Problem in Lexicostatistics

Abstract: Lexicostatistics and glottochronology long suffered from a lack of prestige, but the integration of stochastic methods taken from genetics has initiated an unexpected revival of these scorned disciplines. The proponents of these "new quantitative methods" in historical linguistics claim that the procedures are relatively robust with regard to errors in the data (wrong cognate judgments, undetected borrowings, or wrong translations). To check this claim, we investigated the differences and errors in two large lexicostatistical datasets and tested their influence on the topologies of the computed family trees. Our results show clearly that the new computational methods have not overcome the shortcomings of lexicostatistics and glottochronology: the main problems, namely the translation of basic concepts into individual languages and the execution of cognate judgments, remain so grave that no reliable conclusions can be drawn from these methods.
Document type: Preprints, Working Papers, ...

Cited literature: 20 references
Contributor: Johann-Mattis List
Submitted on: Wednesday, April 6, 2016 - 8:24:06 AM
Last modification on: Monday, March 19, 2018 - 10:04:52 AM
Document(s) archived on: Thursday, July 7, 2016 - 4:42:24 PM


Files produced by the author(s)


Distributed under a Creative Commons Attribution - NonCommercial 4.0 International License


  • HAL Id: hal-01298493, version 1



Hans Geisler, Johann-Mattis List. Beautiful Trees on Unstable Ground: Notes on the Data Problem in Lexicostatistics. 2010. ⟨hal-01298493⟩


