Phonetic corpora and big data

Martine Adda-Decker

Conference paper, Year: 2015

Role: Author
PersonId: 6743
IdHAL: martine-adda-decker
ORCID: 0000-0003-2154-7438
IdRef: 033427747

LPP - Laboratoire de Phonétique et Phonologie - UMR 7018

Laboratoire d'Informatique pour la Mécanique et les Sciences de l'Ingénieur

Abstract

In recent years, 'big data' has emerged as a trendy, highly promising catch-all term in economics and high-tech domains such as information technology and speech processing. Big data are often described using a 3V scheme of volume, variety, and velocity: a huge volume of data, a large variety of possibly unstructured, heterogeneous data sources, and a high frequency, or velocity, of data generation over time. In this Glasgow ICPhS 2015 discussant session, we will question the term 'big data' with respect to phonetics and the speech sciences at large. In this context, big data typically refer to huge, generally unstructured collections of speech or audio-visual data that exist before any phonetician has formulated research hypotheses. Can such data become beneficial to the phonetic sciences?

Domains

Linguistics

Contributor: Gwénaëlle Lo Bue

https://hal.science/hal-01251374

Submitted on: Wednesday, 6 January 2016, 10:07:41

Last modified on: Saturday, 7 October 2023, 21:36:20

Dates and versions

hal-01251374, version 1 (06-01-2016)

Identifiers

HAL Id: hal-01251374, version 1

Cite

Martine Adda-Decker. Phonetic corpora and big data. 18th International Congress of Phonetic Sciences (ICPhS'15), Aug 2015, Glasgow, United Kingdom. pp.5. ⟨hal-01251374⟩

Collections

CNRS UNIV-PARIS3 LPP LIMSI UNIV-PARIS-SACLAY SORBONNE-UNIVERSITE LISN GS-COMPUTER-SCIENCE

81 views

0 downloads
