Skip to Main content Skip to Navigation
Conference papers

Lexical-phonetic automata for spoken utterance indexing and retrieval

Julien Fayolle 1 Murat Saraclar 2 Fabienne Moreau 1 Christian Raymond 1, * Guillaume Gravier 1 
* Corresponding author
1 TEXMEX - Multimedia content-based indexing
IRISA - Institut de Recherche en Informatique et Systèmes Aléatoires, Inria Rennes – Bretagne Atlantique
2 BUSIM Speech Group
Department of Electrical and Electronic Engineering [Istanbul]
Abstract : This paper presents a method for indexing spoken utterances which combines lexical and phonetic hypotheses in a hybrid index built from automata. The retrieval is realised by a lexical-phonetic and semi-imperfect matching whose aim is to improve the recall. A feature vector, containing edit distance scores and a confidence measure, weights each transition to help the filtering of the candidate utterance list for a more precise search. Experiment results show that the lexical and phonetic representations are complementary and we compare the hybrid search with the state-of-the-art cascaded search to retrieve named entity queries.
Document type :
Conference papers
Complete list of metadata

Cited literature [10 references]  Display  Hide  Download
Contributor : Christian Raymond Connect in order to contact the contributor
Submitted on : Tuesday, November 27, 2012 - 3:41:57 PM
Last modification on : Friday, January 21, 2022 - 3:10:10 AM
Long-term archiving on: : Thursday, February 28, 2013 - 3:45:01 AM


Files produced by the author(s)


  • HAL Id : hal-00757765, version 1


Julien Fayolle, Murat Saraclar, Fabienne Moreau, Christian Raymond, Guillaume Gravier. Lexical-phonetic automata for spoken utterance indexing and retrieval. International Conference on Speech Communication and Technologies, Sep 2012, Portland, United States. ⟨hal-00757765⟩



Record views


Files downloads