Visual and interactive exploration of a large collection of Open Datasets

T. Liu; D. Bangash Ahmed; Fatma Bouali; Gilles Venturini

Communication Dans Un Congrès Année : 2013

Visual and interactive exploration of a large collection of Open Datasets

(1) , (1) , , (1)

T. Liu

Fonction : Auteur

Laboratoire d'Informatique Fondamentale et Appliquée de Tours

D. Bangash Ahmed

Fonction : Auteur

Laboratoire d'Informatique Fondamentale et Appliquée de Tours

Fatma Bouali

Fonction : Auteur
PersonId : 958234

Gilles Venturini

Fonction : Auteur
PersonId : 4368
IdHAL : gilles-venturini
ORCID : 0000-0002-8112-2418
IdRef : 050802666

Laboratoire d'Informatique Fondamentale et Appliquée de Tours

Résumé

We deal in this paper with the problem of creating an interactive and visual map for a large collection of Open datasets. We first describe how to define a representation space for such data, using text mining techniques to create features. Then, with a similarity measure between Open datasets, we use the k-nearest neighbors method for building a proximity graph between datasets. We use a force-directed layout method to visualize the graph (Tulip Software). We present the results with a collection of 293,000 datasets from the French Open data web site, in which the display of the graph is limited to 151,000 datasets. We study the discovered clusters and we show how they can be used to browse this large collection.

Domaines

Traitement des images [eess.IV] Traitement du texte et du document Vision par ordinateur et reconnaissance de formes [cs.CV]

Denis Maurel : Connectez-vous pour contacter le contributeur

https://hal.science/hal-01027490

Soumis le : mardi 22 juillet 2014-10:06:31

Dernière modification le : vendredi 16 février 2024-18:16:04

Dates et versions

hal-01027490 , version 1 (22-07-2014)

Identifiants

HAL Id : hal-01027490 , version 1

Citer

T. Liu, D. Bangash Ahmed, Fatma Bouali, Gilles Venturini. Visual and interactive exploration of a large collection of Open Datasets. 17th International Conference on Information Visualisation, Jul 2013, Londres, United Kingdom. pp.285-290. ⟨hal-01027490⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UNIV-TOURS CNRS LIRFAI LIFAT INSA-GROUPE INSA-CVL

43 Consultations

0 Téléchargements

Visual and interactive exploration of a large collection of Open Datasets

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager