From text vocabularies to visual vocabularies: what basis?

Jean Martinet

Conference paper, Year: 2014


Jean Martinet

Role: Author
PersonId: 1821
IdHAL: martinej
ORCID: 0000-0001-8821-5556
IdRef: 086873172

Université de Lille, Sciences et Technologies

Centre de Recherche en Informatique, Signal et Automatique de Lille - UMR 9189

Abstract

The popular "bag-of-visual-words" approach for representing and searching visual documents consists in describing images (or video keyframes) using a set of descriptors that correspond to quantized low-level features. Most existing approaches to visual words are inspired by work in text indexing, based on the implicit assumption that visual words can be handled the same way as text words. More specifically, these techniques implicitly rely on the same postulate as text information retrieval, namely that the word distribution of a natural language globally follows Zipf's law: words from a natural language appear in a corpus with a frequency inversely proportional to their rank. However, our study shows that the distribution of visual words depends on the choice of low-level features and, especially, on the choice of clustering method. We also show that when the visual word distribution is close to that of text words, the results of an image retrieval system improve. To the best of our knowledge, no prior study has compared the distributions of text words and visual words with the objective of establishing the theoretical foundations of visual vocabularies.
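As an illustration of the Zipf's law statement in the abstract, the following minimal sketch estimates how closely a rank-frequency distribution follows the law by fitting a line to log(frequency) against log(rank); a slope near -1 indicates a Zipf-like distribution. This is not the author's method: the function name `zipf_slope` and the synthetic corpus are assumptions made for the example. For visual words, the tokens would be the cluster indices assigned to quantized descriptors.

```python
import math
from collections import Counter


def zipf_slope(tokens):
    """Least-squares slope of log(frequency) against log(rank).

    Hypothetical helper for illustration: a slope close to -1 means
    frequency is roughly inversely proportional to rank.
    """
    # Frequencies sorted from most to least frequent; rank starts at 1.
    freqs = sorted(Counter(tokens).values(), reverse=True)
    if len(freqs) < 2:
        raise ValueError("need at least two distinct tokens")
    xs = [math.log(rank) for rank in range(1, len(freqs) + 1)]
    ys = [math.log(f) for f in freqs]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    return sxy / sxx


# Synthetic corpus (assumption): the word of rank r occurs round(1200 / r) times.
corpus = [f"w{r}" for r in range(1, 51) for _ in range(round(1200 / r))]
slope = zipf_slope(corpus)
print(f"fitted slope: {slope:.3f}")
```

The same function can be applied to a list of visual word identifiers to compare the fitted slope with that of a text corpus; a log-log line fit is only a rough diagnostic and does not replace a proper goodness-of-fit test.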

Keywords

Bag-of-features, quantization, vocabulary, Zipf's law, evaluation

Domains

Computer Vision and Pattern Recognition [cs.CV]


https://hal.science/hal-01532717

Submitted on: Friday, June 2, 2017, 21:31:59

Last modified on: Wednesday, January 24, 2024, 09:54:23

Dates and versions

hal-01532717, version 1 (June 2, 2017)

Identifiers

HAL Id: hal-01532717, version 1

Cite

Jean Martinet. From text vocabularies to visual vocabularies: what basis? International Conference on Computer Vision Theory and Applications, Jan 2014, Lisbon, Portugal. pp. 668-675. ⟨hal-01532717⟩


Collections

CNRS CRISTAL UNIV-LILLE

