HAL will be down for maintenance from Friday, June 10 at 4pm through Monday, June 13 at 9am. More information
Skip to Main content Skip to Navigation
Journal articles

Large-scale diversity estimation through surname origin inference

Abstract : The study of surnames as both linguistic and geographical markers of the past has proven valuable in several research fields spanning from biology and genetics to demography and social mobility. This article builds upon the existing literature to conceive and develop a surname origin classifier based on a data-driven typology. This enables us to explore a methodology to describe large-scale estimates of the relative diversity of social groups, especially when such data is scarcely available. We subsequently analyze the representativeness of surname origins for 15 socio-professional groups in France.
Complete list of metadata

Cited literature [33 references]  Display  Hide  Download

Contributor : Antoine Mazieres Connect in order to contact the contributor
Submitted on : Friday, April 13, 2018 - 6:48:59 PM
Last modification on : Wednesday, May 25, 2022 - 9:50:16 AM


Files produced by the author(s)


Distributed under a Creative Commons Attribution - NonCommercial - NoDerivatives 4.0 International License



Antoine Mazieres, Camille Roth. Large-scale diversity estimation through surname origin inference. Bulletin de Méthodologie Sociologique / Bulletin of Sociological Methodology, SAGE Publications, 2018, 139 (1), pp.59-73. ⟨10.1177/0759106318778828⟩. ⟨hal-01766665⟩



Record views


Files downloads