Visualizing a large collection of Open datasets: an experiment with proximity graphs

T. Liu; D. Bangash Ahmed; Fatma Bouali; Gilles Venturini

Conference paper, Year: 2013


T. Liu

Role: Author

Laboratoire d'Informatique Fondamentale et Appliquée de Tours

D. Bangash Ahmed

Role: Author

Laboratoire d'Informatique Fondamentale et Appliquée de Tours

Fatma Bouali

Role: Author
PersonId: 958234

Gilles Venturini

Role: Author
PersonId: 4368
IdHAL: gilles-venturini
ORCID: 0000-0002-8112-2418
IdRef: 050802666

Laboratoire d'Informatique Fondamentale et Appliquée de Tours

Abstract

This paper addresses the problem of creating an interactive visual map for a large collection of Open datasets. We first describe how to define a representation space for such data, using text mining techniques to create features. Then, given a similarity measure between Open datasets, we use the K-nearest neighbors method to build a proximity graph between datasets. We visualize the graph with a force-directed layout method (Tulip software). We present results on a collection of 300,000 datasets from the French Open data web site, in which the display of the graph is limited to 150,000 datasets. We study the discovered clusters and show how they can be used to browse this large collection.
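The graph-building step described in the abstract can be illustrated with a minimal sketch. The abstract does not specify the features or the similarity measure, so this example assumes TF-IDF weights over whitespace-split words and cosine similarity; the function names (`tfidf_vectors`, `cosine`, `knn_graph`) and the four dataset descriptions are hypothetical and are not taken from the paper. The force-directed layout, which the authors perform with Tulip, is not reproduced here.

```python
import math
from collections import Counter


def tfidf_vectors(texts):
    """Turn each text into a sparse TF-IDF vector (dict: term -> weight).

    Assumption: whitespace tokenization and smoothed IDF; the paper's
    actual text mining features are not given in the abstract.
    """
    tokenized = [text.lower().split() for text in texts]
    n_docs = len(tokenized)
    # Document frequency: number of texts in which each term appears.
    doc_freq = Counter(term for tokens in tokenized for term in set(tokens))
    vectors = []
    for tokens in tokenized:
        counts = Counter(tokens)
        vectors.append({
            term: count * math.log((1 + n_docs) / (1 + doc_freq[term]))
            for term, count in counts.items()
        })
    return vectors


def cosine(u, v):
    """Cosine similarity between two sparse vectors (0.0 if either is empty)."""
    dot = sum(w * v.get(term, 0.0) for term, w in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def knn_graph(vectors, k):
    """Return undirected edges (i, j) with i < j, linking each item
    to its k most similar items (brute force, quadratic in item count)."""
    edges = set()
    for i, u in enumerate(vectors):
        sims = [(cosine(u, v), j) for j, v in enumerate(vectors) if j != i]
        # Highest similarity first; ties broken by lowest index.
        sims.sort(key=lambda s: (-s[0], s[1]))
        for _, j in sims[:k]:
            edges.add((min(i, j), max(i, j)))
    return edges


# Hypothetical dataset descriptions, for illustration only.
descriptions = [
    "bus stops public transport timetable",
    "public transport bus lines network",
    "municipal budget expenses 2012",
    "budget municipal revenue expenses",
]
edges = knn_graph(tfidf_vectors(descriptions), k=1)
print(sorted(edges))  # [(0, 1), (2, 3)]
```

The brute-force neighbor search compares every pair of items, so it only suits small collections; a collection of the size reported in the abstract would likely need an indexed or approximate neighbor search. The resulting edge list could then be exported to a graph tool for layout.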

Domains

Image processing [eess.IV]; Text and document processing; Computer vision and pattern recognition [cs.CV]

Contributor: Denis Maurel

https://hal.science/hal-01027485

Submitted on: Tuesday, July 22, 2014, 10:01:50

Last modified on: Friday, February 16, 2024, 18:16:04

Dates and versions

hal-01027485, version 1 (22-07-2014)

Identifiers

HAL Id: hal-01027485, version 1

Cite

T. Liu, D. Bangash Ahmed, Fatma Bouali, Gilles Venturini. Visualizing a large collection of Open datasets: an experiment with proximity graphs. Second International Workshop on Open Data, 2013, Paris, France. ⟨hal-01027485⟩


Collections

UNIV-TOURS CNRS LIRFAI LIFAT INSA-GROUPE INSA-CVL

35 views

0 downloads
