ASVspoof 2019: Spoofing Countermeasures for the Detection of Synthesized, Converted and Replayed Speech

Andreas Nautsch; Xin Wang; Nicholas Evans; Tomi Kinnunen; Ville Vestman; Massimiliano Todisco; Hector Delgado; Md Sahidullah; Junichi Yamagishi; Kong Aik Lee

doi:10.1109/TBIOM.2021.3059479

Journal article. IEEE Transactions on Biometrics, Behavior, and Identity Science. Year: 2021


Andreas Nautsch

Role: Author
PersonId: 795565
ORCID: 0000-0002-3405-4416

Eurecom [Sophia Antipolis]

Xin Wang

Role: Author
PersonId: 761950
ORCID: 0000-0002-3891-2684

National Institute of Informatics

Nicholas Evans

Role: Author
PersonId: 938450

Eurecom [Sophia Antipolis]

Tomi Kinnunen

Role: Author

University of Eastern Finland

Ville Vestman

Role: Author

University of Eastern Finland

Massimiliano Todisco

Role: Author
PersonId: 1199117

Eurecom [Sophia Antipolis]

Hector Delgado

Role: Author

Nuance Communications [Spain]

Md Sahidullah

Role: Author
PersonId: 737397
IdHAL: sahid

Speech Modeling for Facilitating Oral-Based Communication

Junichi Yamagishi

Role: Author

National Institute of Informatics

Kong Aik Lee

Role: Author

Institute for Infocomm Research - I²R [Singapore]

Abstract

The ASVspoof initiative was conceived to spearhead research in anti-spoofing for automatic speaker verification (ASV). This paper describes the third in a series of biennial challenges: ASVspoof 2019. Because the challenge database and protocols are described elsewhere, this paper focuses on results and on the top-performing single and ensemble system submissions from 62 teams, all of which outperform the two baseline systems, often by a substantial margin. Deeper analyses show that performance is dominated by specific conditions involving either specific spoofing attacks or specific acoustic environments. While fusion is shown to be particularly effective for the logical access scenario involving speech synthesis and voice conversion attacks, participants largely struggled to apply fusion successfully for the physical access scenario involving simulated replay attacks. This is likely the result of a lack of system complementarity, although oracle fusion experiments show clear potential to improve performance. Furthermore, while results for simulated data are promising, experiments with real replay data show a substantial gap, most likely due to the presence of additive noise in the latter. This finding, among others, leads to a number of ideas for further research and directions for future editions of the ASVspoof challenge.

Keywords

Spoofing; Countermeasures; Presentation attack detection; Speaker recognition; Automatic speaker verification

Domains

Computer Vision and Pattern Recognition [cs.CV]; Signal and Image Processing [eess.SP]; Multimedia [cs.MM]; Acoustics [physics.class-ph]

Main file

ASVspoof2019_TBIOM.pdf (5.56 MB)

Origin: Files produced by the author(s)


https://hal.science/hal-03236124

Submitted on: Wednesday, 26 May 2021, 10:30:07

Last modified on: Monday, 11 September 2023, 17:41:19

Long-term archiving on: Friday, 27 August 2021, 18:40:45

Dates and versions

hal-03236124, version 1 (26-05-2021)

Identifiers

HAL Id: hal-03236124, version 1
DOI: 10.1109/TBIOM.2021.3059479

Cite

Andreas Nautsch, Xin Wang, Nicholas Evans, Tomi Kinnunen, Ville Vestman, et al. ASVspoof 2019: Spoofing Countermeasures for the Detection of Synthesized, Converted and Replayed Speech. IEEE Transactions on Biometrics, Behavior, and Identity Science, 2021, 3 (2), pp. 252-265. ⟨10.1109/TBIOM.2021.3059479⟩. ⟨hal-03236124⟩


Collections

CNRS INRIA EURECOM UNIV-LORRAINE INRIA2 LORIA LORIA-NLPKD

98 views

309 downloads
