Extracting People's Hobby and Interest Information from Social Media Content - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 2014

Extracting People's Hobby and Interest Information from Social Media Content

Kaj-Mikael Bjork
  • Fonction : Auteur
  • PersonId : 957462
yes

Résumé

In this study we investigate how to analyze people's social media profiles to extract hobby and interest information. We developed a baseline system that applies heuristic rules and TF-IDF term weighting method in determining the most representative terms indicating hobbies and interests. A pilot test was done to collect feedback from users concerning the perceived usefulness of the extracted tags. The baseline system was then extended to include new functionality to help set limits on the scope of relevant content, extract Named Entities, use of predefined dictionaries to identify even lowscoring hobbies and interests, and use of machine translation to handle content in multiple languages.
Fichier principal
Vignette du fichier
TDE_Berlin_TSK_final_version.pdf (245.26 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-01005875 , version 1 (13-06-2014)

Licence

Paternité

Identifiants

  • HAL Id : hal-01005875 , version 1

Citer

Thomas Forss, Shuhua Liu, Kaj-Mikael Bjork. Extracting People's Hobby and Interest Information from Social Media Content. Terminology and Knowledge Engineering 2014, Jun 2014, Berlin, Germany. 9 p. ⟨hal-01005875⟩

Collections

TKE2014
304 Consultations
1286 Téléchargements

Partager

Gmail Facebook X LinkedIn More