A corpus of audio-visual Lombard speech with frontal and profile views

Najwa Alghamdi; Steve Maddock; Ricard Marxer; Jon Barker; Guy Brown

doi:10.1121/1.5042758

Article Dans Une Revue Journal of the Acoustical Society of America Année : 2018

A corpus of audio-visual Lombard speech with frontal and profile views

(1) , (1) , (1, 2, 3) , (1) , (1)

1
2
3

Najwa Alghamdi

Fonction : Auteur

Department of Computer Sciences [Scheffield]

Steve Maddock

Fonction : Auteur

Department of Computer Sciences [Scheffield]

Ricard Marxer

Fonction : Auteur
PersonId : 19391
IdHAL : ricard-marxer
ORCID : 0000-0001-5099-5059
IdRef : 240437713

Department of Computer Sciences [Scheffield]

Laboratoire d'Informatique et des Systèmes (LIS) (Marseille, Toulon)

DYNamiques de l’Information

Jon Barker

Fonction : Auteur
PersonId : 895549

Department of Computer Sciences [Scheffield]

Guy Brown

Fonction : Auteur

Department of Computer Sciences [Scheffield]

Résumé

This paper presents a bi-view (front and side) audiovisual Lombard speech corpus, which is freely available for download. It contains 5400 utterances (2700 Lombard and 2700 plain reference utterances), produced by 54 talkers, with each utterance in the dataset following the same sentence format as the audiovisual “Grid” corpus [Cooke, Barker, Cunningham, and Shao (2006). J. Acoust. Soc. Am. 120(5), 2421–2424]. Analysis of this dataset confirms previous research, showing prominent acoustic, phonetic, and articulatory speech modifications in Lombard speech. In addition, gender differences are observed in the size of Lombard effect. Specifically, female talkers exhibit a greater increase in estimated vowel duration and a greater reduction in F2 frequency.

Domaines

Intelligence artificielle [cs.AI] Informatique et langage [cs.CL] Traitement du signal et de l'image [eess.SP]

Fichier principal

jasa-el-slash.pdf (648.81 Ko)

Origine : Fichiers produits par l'(les) auteur(s)

Ricard Marxer : Connectez-vous pour contacter le contributeur

https://hal.science/hal-01867824

Soumis le : mercredi 5 septembre 2018-11:08:30

Dernière modification le : vendredi 22 mars 2024-18:24:03

Archivage à long terme le : jeudi 6 décembre 2018-14:32:00

Dates et versions

hal-01867824 , version 1 (05-09-2018)

Identifiants

HAL Id : hal-01867824 , version 1
DOI : 10.1121/1.5042758

Citer

Najwa Alghamdi, Steve Maddock, Ricard Marxer, Jon Barker, Guy Brown. A corpus of audio-visual Lombard speech with frontal and profile views. Journal of the Acoustical Society of America, 2018, 143 (6), pp.EL523-EL529. ⟨10.1121/1.5042758⟩. ⟨hal-01867824⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UNIV-TLN CNRS UNIV-AMU LIS-LAB

95 Consultations

151 Téléchargements

A corpus of audio-visual Lombard speech with frontal and profile views

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager