A. Ambekar, C. Ward, J. Mohammed, S. Male, and S. Skiena, Name-ethnicity classification from open sources, Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining, KDD '09, pp.49-58, 2009.
DOI : 10.1145/1557019.1557032
URL : http://www.cs.sunysb.edu/~skiena/lydia/names.pdf

F. Barth, Ethnic groups and boundaries: The social organization of culture difference, 1998.

C. Beauchemin, C. Hamel, and P. Simon, Trajectoires et origines: enquête sur la diversité des populations en France, 2016.

C. Brutel, La localisation géographique des immigrés: Une forte concentration dans l'aire urbaine de Paris, 2016.

E. Cediey and F. Foroni, Les discriminations à raison de «l'origine» dans les embauches en France, 2007.

J. Chang, I. Rosenn, L. Backstrom, and C. Marlow, ePluribus: Ethnicity on social networks, ICWSM, vol.10, pp.18-25, 2010.

B. Choi, J. Hanley, E. Holowaty, and D. Dale, Use of Surnames to Identify Individuals of Chinese Ancestry, American Journal of Epidemiology, vol.138, issue.9, pp.723-734, 1993.
DOI : 10.1093/oxfordjournals.aje.a116910

G. Clark, The son also rises: surnames and the history of social mobility, 2014.
DOI : 10.1515/9781400851096

É. Delattre, N. Leandri, D. Meurs, and R. Rathelot, Introduction - Trois approches de la discrimination : évaluations indirectes, expérimentation, discriminations ressenties, Economie et statistique, vol.464, issue.1, pp.7-13, 2013.
DOI : 10.3406/estat.2013.10225
URL : http://www.persee.fr/docAsPDF/estat_0336-1454_2013_num_464_1_10225.pdf

F. Foroni, M. Ruault, and E. Valat, Discrimination à l'embauche selon «l'origine»: que nous apprend le testing auprès de grandes entreprises?, 2016.

M. Güell, J. Mora, and C. Telmer, The Informational Content of Surnames, the Evolution of Intergenerational Mobility, and Assortative Mating, The Review of Economic Studies, vol.82, issue.2, pp.693-735, 2015.
DOI : 10.1093/restud/rdu041

O. Herfindahl, Concentration in the steel industry, 1950.

A. Hirschman, National power and the structure of foreign trade, 1945.

F. Jobard and S. Nevanen, La couleur du jugement, Revue française de sociologie, vol.48, issue.2, p.243, 2007.
DOI : 10.3917/rfs.482.0243
URL : https://hal.archives-ouvertes.fr/halshs-00443047

M. Jobling, In the name of the father: surnames and genetics, Trends in Genetics, vol.17, issue.6, pp.353-357, 2001.
DOI : 10.1016/S0168-9525(01)02284-3

T. King, S. Ballereau, K. Schürer, and M. Jobling, Genetic Signatures of Coancestry within Surnames, Current Biology, vol.16, issue.4, pp.384-388, 2006.
DOI : 10.1016/j.cub.2005.12.048

T. King and M. Jobling, What's in a name? Y chromosomes, surnames and the genetic genealogy revolution, Trends in Genetics, vol.25, issue.8, pp.351-360, 2009.
DOI : 10.1016/j.tig.2009.06.003

G. Lasker, Surnames in the Study of Human Biology, American Anthropologist, vol.82, issue.3, pp.525-538, 1980.
DOI : 10.1525/aa.1980.82.3.02a00030

G. Lasker, Surnames and genetic structure, 1985.
DOI : 10.1017/CBO9780511983351

J. Lee, H. Kim, M. Ko, D. Choi, J. Choi et al., Name Nationality Classification with Recurrent Neural Networks, Proceedings of the Twenty-Sixth International Joint Conference on Artificial Intelligence, 2017.
DOI : 10.24963/ijcai.2017/289
URL : https://www.ijcai.org/proceedings/2017/0289.pdf

P. Mateos, A review of name-based ethnicity classification methods and their potential in population studies, Population, Space and Place, vol.13, issue.4, pp.243-263, 2007.
DOI : 10.1002/psp.457

A. Mislove, S. Lehmann, Y. Ahn, J. Onnela, and J. Rosenquist, Understanding the demographics of Twitter users, ICWSM, vol.11, p.5, 2011.

P. Nieminen and M. Isohanni, Bias against European journals in medical publication databases, The Lancet, vol.353, issue.9164, p.1592, 1999.
DOI : 10.1016/S0140-6736(99)00415-8

F. Pedregosa, G. Varoquaux, A. Gramfort, V. Michel, B. Thirion et al., Scikit-learn: Machine learning in Python, Journal of Machine Learning Research, vol.12, pp.2825-2830, 2011.
URL : https://hal.archives-ouvertes.fr/hal-00650905

A. Piazza, S. Rendine, G. Zei, A. Moroni, and L. Cavalli-Sforza, Migration rates of human populations from surname distributions, Nature, vol.329, issue.6141, pp.714-716, 1987.
DOI : 10.1038/329714a0

A. Polednak, Estimating cervical cancer incidence in the Hispanic population of Connecticut by use of surnames, Cancer, vol.81, issue.11, pp.3560-3564, 1993.
DOI : 10.2105/AJPH.77.1.69

M. Rosenfeld, Racial, Educational and Religious Endogamy in the United States: A Comparative Historical Perspective, Social Forces, vol.87, issue.1, pp.1-31, 2008.
DOI : 10.1353/sof.0.0077

B. Shah, M. Chiu, S. Amin, M. Ramani, S. Sadry et al., Surname lists to identify South Asian and Chinese ethnicity from secondary data in Ontario, Canada: a validation study, BMC Medical Research Methodology, vol.10, issue.1, p.42, 2010.
DOI : 10.1186/1471-2288-10-42

E. Tonkin, M. McDonald, and M. Chapman, History and ethnicity, Routledge, vol.27, 2016.

V. Torvik and S. Agarwal, Ethnea: an instance-based ethnicity classifier based on geo-coded author names in a large-scale bibliographic database, International Symposium on Science of Science, 2016.

J. Ward Jr., Hierarchical grouping to optimize an objective function, Journal of the American Statistical Association, vol.58, issue.301, pp.236-244, 1963.

M. Weber, Economy and Society: An Outline of Interpretive Sociology, 1978.
DOI : 10.1002/9780470755679.ch3

Z. Wu, D. Yuan, P. Treeratpituk, and C. Giles, Science and ethnicity: How ethnicities shape the evolution of computer science research community. arXiv preprint, 2014.