An Extensible Speaker Identification SIDEKIT in Python

Anthony Larcher; Kong Aik Lee; Sylvain Meignier

doi:10.1109/ICASSP.2016.7472648

Conference paper, year: 2016

Anthony Larcher

Role: Author
PersonId: 20105
IdHAL: anthony-larcher
ORCID: 0000-0003-4398-0224
IdRef: 139544569

Laboratoire d'Informatique de l'Université du Mans

Kong Aik Lee

Role: Author

Agency for Science, Technology and Research [Singapore]

Sylvain Meignier

Role: Author
PersonId: 11674
IdHAL: sylvain-meignier
ORCID: 0000-0001-7687-073X
IdRef: 182269086

Laboratoire d'Informatique de l'Université du Mans

Abstract

SIDEKIT is a new open-source Python toolkit that includes a large set of state-of-the-art components and allows rapid prototyping of an end-to-end speaker recognition system. For each step, from front-end feature extraction, normalization, and speech activity detection to modelling, scoring, and visualization, SIDEKIT offers a wide range of standard algorithms and flexible interfaces. The use of a single efficient programming and scripting language (Python in this case) and the limited dependencies facilitate deployment in industrial applications and extension with new algorithms as part of the whole tool-chain provided by SIDEKIT. The performance of SIDEKIT is demonstrated on two standard evaluation tasks, namely RSR2015 and NIST-SRE 2010.

Keywords

tutorials, speaker recognition, toolkit, open-source, Python

Domains

Computation and Language [cs.CL]

Main file

ICASSP2015_sidekit.pdf (371.85 KB)

Origin: Files produced by the author(s)

https://hal.science/hal-01433157

Submitted on: Friday, 24 March 2017, 00:01:48

Last modified on: Friday, 1 March 2024, 18:10:04

Long-term archiving on: Sunday, 25 June 2017, 12:13:30

Dates and versions

hal-01433157, version 1 (24-03-2017)

Identifiers

HAL Id: hal-01433157, version 1
DOI: 10.1109/ICASSP.2016.7472648

Cite

Anthony Larcher, Kong Aik Lee, Sylvain Meignier. An Extensible Speaker Identification SIDEKIT in Python. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2016), 2016, Shanghai, China. pp.5095-5099, ⟨10.1109/ICASSP.2016.7472648⟩. ⟨hal-01433157⟩

Collections

UNIV-LEMANS LIUM LIUM-LST

462 views

3147 downloads
