Content-based search in multilingual audiovisual documents using the International Phonetic Alphabet - Archive ouverte HAL Accéder directement au contenu
Article Dans Une Revue Multimedia Tools and Applications Année : 2010

Content-based search in multilingual audiovisual documents using the International Phonetic Alphabet

Résumé

We present in this paper an approach based on the use of the International Phonetic Alphabet (IPA) for content-based indexing and retrieval of multilingual audiovisual documents. The approach works even if the languages of the document are unknown. It has been validated in the context of the ''Star Challenge'' search engine competition organized by the Agency for Science, Technology and Research (A*STAR) of Singapore. Our approach includes the building of an IPA-based multilingual acoustic model and a dynamic programming based method for searching document segments by ''IPA string spotting''. Dynamic programming allows for retrieving the query string in the document string even with a significant transcription error rate at the phone level. The methods that we developed ranked us as first and third on the monolingual (English) search task, as fifth on the multilingual search task and as first on the multimodal (audio and image) search task.

Dates et versions

hal-00953696 , version 1 (28-02-2014)

Identifiants

Citer

Georges Quénot, Tien-Ping Tan, Viet-Bac Le, Stéphane Ayache, Laurent Besacier, et al.. Content-based search in multilingual audiovisual documents using the International Phonetic Alphabet. Multimedia Tools and Applications, 2010, 48 (1), pp.123-140. ⟨10.1007/s11042-009-0377-6⟩. ⟨hal-00953696⟩
173 Consultations
0 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More