Introduction to the Special Issue “Speaker and Language Characterization and Recognition: Voice Modeling, Conversion, Synthesis and Ethical Aspects”

Jean-François Bonastre; Tomi Kinnunen; Anthony Larcher; Junichi Yamagishi

doi:10.1016/j.csl.2019.101021

Article Dans Une Revue Computer Speech and Language Année : 2019

Introduction to the Special Issue “Speaker and Language Characterization and Recognition: Voice Modeling, Conversion, Synthesis and Ethical Aspects”

(1) , (2) , (3) , (4)

1
2
3
4

Jean-François Bonastre

Fonction : Auteur
PersonId : 172421
IdHAL : jean-francois-bonastre
ORCID : 0000-0001-7741-3346
IdRef : 079112978

Laboratoire Informatique d'Avignon

Tomi Kinnunen

Fonction : Auteur

University of Eastern Finland

Anthony Larcher

Fonction : Auteur
PersonId : 20105
IdHAL : anthony-larcher
ORCID : 0000-0003-4398-0224
IdRef : 139544569

Laboratoire d'Informatique de l'Université du Mans

Junichi Yamagishi

Fonction : Auteur

National Institute of Informatics

Résumé

Welcome to this special issue on Speaker and Language Characterization which features, among other contributions, some of the most remarkable ideas presented and discussed at Odyssey 2018: the Speaker and Language Recognition Workshop, held in Les Sables d'Olonne, France, in June 2018. This issue perpetuates the series proposed by ISCA Speaker and language Characterization Special Interest Group in coordination with ISCA Speaker Odyssey workshops [1, 2, 3]. Voice is one of the most casual modalities for natural and intuitive interactions between humans as well as between humans and machines. Voice is also a central part of our identity. Voice-based solutions are currently deployed in a growing variety of applications, including person authentication through automatic speaker verification (ASV). A related technology concerns digital cloning of personal voice characteristics for text-to-speech (TTS) and voice conversion (VC). In the last years, the impressive advancements of the VC/TTS field opened the way for numerous new consumer applications. Especially, VC is offering new solutions for privacy protection. However, VC/TTS also brings the possibility of misuse of the technology in order to spoof ASV systems (for example presentation attacks implemented using voice conversion). As a direct consequence, spoofing countermeasures raises a growing interest during the past years. Moreover, voice is a central part of our identity and is also bringing other

Domaines

Intelligence artificielle [cs.AI]

Fichier principal

CSL_special_issue.pdf (104.94 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

anthony larcher : Connectez-vous pour contacter le contributeur

https://hal.science/hal-02280130

Soumis le : vendredi 6 septembre 2019-09:28:32

Dernière modification le : samedi 21 mai 2022-03:52:33

Archivage à long terme le : jeudi 6 février 2020-12:46:28

Dates et versions

hal-02280130 , version 1 (06-09-2019)

Identifiants

HAL Id : hal-02280130 , version 1
DOI : 10.1016/j.csl.2019.101021

Citer

Jean-François Bonastre, Tomi Kinnunen, Anthony Larcher, Junichi Yamagishi. Introduction to the Special Issue “Speaker and Language Characterization and Recognition: Voice Modeling, Conversion, Synthesis and Ethical Aspects”. Computer Speech and Language, 2019, pp.101021. ⟨10.1016/j.csl.2019.101021⟩. ⟨hal-02280130⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UNIV-AVIGNON UNIV-LEMANS LIUM LIA

117 Consultations

202 Téléchargements

Introduction to the Special Issue “Speaker and Language Characterization and Recognition: Voice Modeling, Conversion, Synthesis and Ethical Aspects”

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager