D. Martinez, O. Plchot, L. Burget, O. Glembek, and P. Matějka, Language recognition in i-vectors space, Proc. Interspeech - Annual Conference of the International Speech Communication Association, 2011.

A. Lozano-Diez, O. Plchot, P. Matějka, and J. Gonzalez-Rodriguez, DNN based embeddings for language recognition, Proc. ICASSP - International Conference on Acoustics, Speech and Signal Processing, pp.5184-5188, 2018.

M. McLaren, M. K. Nandwana, D. Castán, and L. Ferrer, Approaches to multi-domain language recognition, Proc. Odyssey: The Speaker and Language Recognition Workshop, pp.90-97, 2018.

D. Snyder, D. Garcia-Romero, A. McCree, G. Sell, D. Povey et al., Spoken language recognition using x-vectors, Proc. Odyssey: The Speaker and Language Recognition Workshop, pp.105-111, 2018.

D. Raj, D. Snyder, D. Povey, and S. Khudanpur, Probing the information encoded in x-vectors, 2019.

S. O. Sadjadi, T. Kheyrkhah, C. S. Greenberg, E. Singer, D. A. Reynolds et al., Performance analysis of the 2017 NIST language recognition evaluation, Proc. Interspeech - Annual Conference of the International Speech Communication Association, pp.1798-1802, 2018.

O. Plchot, P. Matějka, O. Novotný, S. Cumani, A. Lozano-Diez et al., Analysis of BUT-PT submission for NIST LRE 2017, Proc. Odyssey: The Speaker and Language Recognition Workshop, pp.47-53, 2018.

F. Richardson, P. A. Torres-Carrasquillo, J. Borgstrom, D. E. Sturim, Y. Gwon et al., The MIT Lincoln Laboratory/JHU/EPITA-LSE LRE17 system, Proc. Odyssey: The Speaker and Language Recognition Workshop, pp.54-59, 2018.

P. Bousquet and M. Rouvier, On robustness of unsupervised domain adaptation for speaker recognition, Proc. Interspeech - Annual Conference of the International Speech Communication Association, pp.2958-2962, 2019.

F. Verdet, D. Matrouf, J. Bonastre, and J. Hennebert, Coping with two different transmission channels in language recognition, Proc. Odyssey: The Speaker and Language Recognition Workshop, p.39, 2010.
URL: https://hal.archives-ouvertes.fr/hal-01321165

W. Cai, J. Chen, and M. Li, Exploring the encoding layer and loss function in end-to-end speaker and language recognition system, Proc. Odyssey: The Speaker and Language Recognition Workshop, pp.74-81, 2018.

J. Villalba, N. Chen, D. Snyder, D. Garcia-Romero, A. McCree et al., State-of-the-art speaker recognition with neural network embeddings in NIST SRE18 and speakers in the wild evaluations, Computer Speech & Language, vol.60, p.101026, 2020.

J. Rohdin, T. Stafylakis, A. Silnova, H. Zeinali, L. Burget et al., Speaker verification using end-to-end adversarial language adaptation, Proc. ICASSP - International Conference on Acoustics, Speech and Signal Processing, pp.6006-6010, 2019.

G. Bhattacharya, J. Alam, and P. Kenny, Adapting end-to-end neural speaker verification to new languages and recording conditions with adversarial training, Proc. ICASSP - International Conference on Acoustics, Speech and Signal Processing, pp.6041-6045, 2019.

R. Duroselle, D. Jouvet, and I. Illina, Unsupervised regularization of the embedding extractor for robust language identification, Proc. Odyssey: The Speaker and Language Recognition Workshop, 2020.
URL: https://hal.archives-ouvertes.fr/hal-02544156

J. S. Chung, J. Huh, S. Mun, M. Lee, H. S. Heo et al., In defence of metric learning for speaker recognition, p.2003, 2020.

C. Zhang and K. Koishida, End-to-end text-independent speaker verification with triplet loss on short utterances, Proc. Interspeech - Annual Conference of the International Speech Communication Association, pp.1487-1491, 2017.

D. Snyder, P. Ghahremani, D. Povey, D. Garcia-Romero, Y. Carmiel et al., Deep neural network-based speaker embeddings for end-to-end speaker verification, Proc. Spoken Language Technology Workshop (SLT), pp.165-170, 2016.

V. Mingote, D. Castán, M. McLaren, M. K. Nandwana, E. L. Ortega et al., Language recognition using triplet neural networks, Proc. Interspeech - Annual Conference of the International Speech Communication Association, pp.4025-4029, 2019.

G. Gelly and J. Gauvain, Spoken language identification using LSTM-based angular proximity, Proc. Interspeech - Annual Conference of the International Speech Communication Association, pp.2566-2570, 2017.

C. S. Greenberg, A. F. Martin, and M. A. Przybocki, The 2011 NIST language recognition evaluation, Proc. Interspeech - Annual Conference of the International Speech Communication Association, 2012.

K. Walker and S. Strassel, The RATS radio traffic collection system, Proc. Odyssey: The Speaker and Language Recognition Workshop, pp.291-297, 2012.

R. Fér, P. Matějka, F. Grézl, O. Plchot, K. Veselý et al., Multilingually trained bottleneck features in spoken language recognition, Computer Speech & Language, vol.46, pp.252-267, 2017.

N. Brümmer and D. A. van Leeuwen, On calibration of language recognition scores, Proc. Odyssey: The Speaker and Language Recognition Workshop, pp.1-8, 2006.

E. Singer, P. Torres-Carrasquillo, D. A. Reynolds, A. McCree, F. Richardson et al., The MITLL NIST LRE 2011 language recognition system, Proc. Odyssey: The Speaker and Language Recognition Workshop, 2012.

G. Peyré and M. Cuturi, Computational optimal transport, Foundations and Trends® in Machine Learning, vol.11, pp.355-607, 2019.

J. Feydy, T. Séjourné, F. Vialard, S. Amari, A. Trouvé et al., Interpolating between optimal transport and MMD using Sinkhorn divergences, Proc. International Conference on Artificial Intelligence and Statistics, pp.2681-2690, 2019.
URL: https://hal.archives-ouvertes.fr/hal-01898858

E. Hoffer and N. Ailon, Deep metric learning using triplet network, Proc. International Workshop on Similarity-Based Pattern Recognition, pp.84-92, 2015.

K. Sohn, Improved deep metric learning with multi-class n-pair loss objective, Advances in neural information processing systems, pp.1857-1865, 2016.

A. Kulkarni, V. Colotte, and D. Jouvet, Transfer learning of the expressivity using FLOW metric learning in multispeaker text-to-speech synthesis, Proc. Interspeech - Annual Conference of the International Speech Communication Association, 2020.

J. Deng, J. Guo, N. Xue, and S. Zafeiriou, ArcFace: Additive angular margin loss for deep face recognition, Proc. CVPR - Conference on Computer Vision and Pattern Recognition, pp.4690-4699, 2019.

LRE 2011 results, NIST, 2013.