Lexical-phonetic automata for spoken utterance indexing and retrieval

Julien Fayolle 1 Murat Saraclar 2 Fabienne Moreau 1 Christian Raymond 1, * Guillaume Gravier 1
* Corresponding author
1 TEXMEX - Multimedia content-based indexing
IRISA - Institut de Recherche en Informatique et Systèmes Aléatoires, Inria Rennes – Bretagne Atlantique
2 BUSIM Speech Group
Department of Electrical and Electronic Engineering [Istanbul]
Abstract : This paper presents a method for indexing spoken utterances which combines lexical and phonetic hypotheses in a hybrid index built from automata. The retrieval is realised by a lexical-phonetic and semi-imperfect matching whose aim is to improve the recall. A feature vector, containing edit distance scores and a confidence measure, weights each transition to help the filtering of the candidate utterance list for a more precise search. Experiment results show that the lexical and phonetic representations are complementary and we compare the hybrid search with the state-of-the-art cascaded search to retrieve named entity queries.
Document type :
Conference papers
Complete list of metadatas

Cited literature [10 references]  Display  Hide  Download

https://hal.archives-ouvertes.fr/hal-00757765
Contributor : Christian Raymond <>
Submitted on : Tuesday, November 27, 2012 - 3:41:57 PM
Last modification on : Friday, November 16, 2018 - 1:24:53 AM
Long-term archiving on : Thursday, February 28, 2013 - 3:45:01 AM

File

interspeech2012.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-00757765, version 1

Citation

Julien Fayolle, Murat Saraclar, Fabienne Moreau, Christian Raymond, Guillaume Gravier. Lexical-phonetic automata for spoken utterance indexing and retrieval. International Conference on Speech Communication and Technologies, Sep 2012, Portland, United States. ⟨hal-00757765⟩

Share

Metrics

Record views

1043

Files downloads

321