Who Tags What? An Analysis Framework

Abstract : The rise of Web 2.0 is signaled by sites such as Flickr, del.icio.us, and YouTube, and social tagging is essential to their success. A typical tagging action involves three components, user, item (e.g., photos in Flickr), and tags (i.e., words or phrases). Analyzing how tags are assigned by certain users to certain items has important implications in helping users search for desired information. In this paper, we explore common analysis tasks and propose a dual mining framework for social tagging behavior mining. This framework is centered around two opposing measures, similarity and diversity, being applied to one or more tagging components, and therefore enables a wide range of analysis scenarios such as characterizing similar users tagging diverse items with similar tags, or diverse users tagging similar items with diverse tags, etc. By adopting different concrete measures for similarity and diversity in the framework, we show that a wide range of concrete analysis problems can be defined and they are NP-Complete in general. We design efficient algorithms for solving many of those problems and demonstrate, through comprehensive experiments over real data, that our algorithms significantly out-perform the exact brute-force approach without compromising analysis result quality.
Type de document :
Article dans une revue
Proceedings of the VLDB Endowment (PVLDB), VLDB Endowment, 2012, 5 (11), pp.1567-1578
Liste complète des métadonnées


https://hal.archives-ouvertes.fr/hal-00922883
Contributeur : Valérie Samper <>
Soumis le : mardi 31 décembre 2013 - 22:26:28
Dernière modification le : mardi 28 octobre 2014 - 18:33:48
Document(s) archivé(s) le : lundi 31 mars 2014 - 22:07:35

Fichier

p1567_mahashwetadas_vldb2012.p...
Fichiers éditeurs autorisés sur une archive ouverte

Identifiants

  • HAL Id : hal-00922883, version 1

Collections

Citation

Mahashweta Das, Saravanan Thirumuruganathan, Sihem Amer-Yahia, Gautam Das, Cong Yu. Who Tags What? An Analysis Framework. Proceedings of the VLDB Endowment (PVLDB), VLDB Endowment, 2012, 5 (11), pp.1567-1578. <hal-00922883>

Partager

Métriques

Consultations de
la notice

204

Téléchargements du document

69