Skip to Main content Skip to Navigation
Journal articles

ASVspoof 2019: Spoofing Countermeasures for the Detection of Synthesized, Converted and Replayed Speech

Abstract : The ASVspoof initiative was conceived to spearhead research in anti-spoofing for automatic speaker verification (ASV). This paper describes the third in a series of biannual challenges: ASVspoof 2019. With the challenge database and protocols being described elsewhere, the focus of this paper is on results and the top performing single and ensemble system submissions from 62 teams, all of which out-perform the two baseline systems, often by a substantial margin. Deeper analyses shows that performance is dominated by specific conditions involving either specific spoofing attacks or specific acoustic environments. While fusion is shown to be particularly effective for the logical access scenario involving speech synthesis and voice conversion attacks, participants largely struggled to apply fusion successfully for the physical access scenario involving simulated replay attacks. This is likely the result of a lack of system complementarity, while oracle fusion experiments show clear potential to improve performance. Furthermore, while results for simulated data are promising, experiments with real replay data show a substantial gap, most likely due to the presence of additive noise in the latter. This finding, among others, leads to a number of ideas for further research and directions for future editions of the ASVspoof challenge.
Complete list of metadata

https://hal.archives-ouvertes.fr/hal-03236124
Contributor : Md Sahidullah Connect in order to contact the contributor
Submitted on : Wednesday, May 26, 2021 - 10:30:07 AM
Last modification on : Friday, July 8, 2022 - 10:05:35 AM
Long-term archiving on: : Friday, August 27, 2021 - 6:40:45 PM

File

ASVspoof2019_TBIOM.pdf
Files produced by the author(s)

Identifiers

Citation

Andreas Nautsch, Xin Wang, Nicholas Evans, Tomi Kinnunen, Ville Vestman, et al.. ASVspoof 2019: Spoofing Countermeasures for the Detection of Synthesized, Converted and Replayed Speech. IEEE Transactions on Biometrics, Behavior, and Identity Science, IEEE, 2021, 3 (2), pp.252-265. ⟨10.1109/TBIOM.2021.3059479⟩. ⟨hal-03236124⟩

Share

Metrics

Record views

56

Files downloads

144