Multiresolution Analysis for Speech Recognition

Marie-Josée Caraty; Claude Montacié

Communication Dans Un Congrès Année : 1998

Multiresolution Analysis for Speech Recognition

(1) , (1)

Marie-Josée Caraty

Fonction : Auteur
PersonId : 1014618

Apprentissage et Acquisition des connaissances

Claude Montacié

Fonction : Auteur
PersonId : 1014617

Apprentissage et Acquisition des connaissances

Résumé

In the purpose to deal with artifact on observations measurements resulting from usual speech processing, we propose to extend the representation of the speech signal by taking a sequence of sets of observations instead of a simple sequence of observations. A set of observations is computed from temporal Multi-Resolution (MR) analysis. This method is designed to be adapted to any usual mode and technique of analysis. Its originality is to take into account two main variations in the analysis, -the center of the frame and -the duration of the frame. In speech processing, multi-resolution analysis has many applications. MR analysis is a basic representation -to locate the stationary and non-stationary parts of speech from the inertia computation, -to select the best representative observation from centroid or generalized centroid. Preliminary experiments are presented. The first one consists in the MR analysis of pieces of the French and the English-American speech databases (i.e., TIMIT, BREF80) and on the inertia as a criterion of location of stationary and non-stationary parts of the speech signal. The second one is on the computation of the phoneme prototypes of the two speech databases. At last, some perspectives are discussed.

Domaines

Informatique [cs]

Lip6 Publications : Connectez-vous pour contacter le contributeur

https://hal.science/hal-01617601

Soumis le : lundi 16 octobre 2017-17:21:15

Dernière modification le : mardi 11 avril 2023-15:16:28

Dates et versions

hal-01617601 , version 1 (16-10-2017)

Identifiants

HAL Id : hal-01617601 , version 1

Citer

Marie-Josée Caraty, Claude Montacié. Multiresolution Analysis for Speech Recognition. ICSLP 1998 - 5th International Conference on Spoken Language Processing, Nov 1998, Sydney, Australia. pp.955-958. ⟨hal-01617601⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UPMC CNRS LIP6 SORBONNE-UNIVERSITE SU-SCIENCES

41 Consultations

0 Téléchargements

Multiresolution Analysis for Speech Recognition

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager