Predicting Locations in Tweets - Archive ouverte HAL Accéder directement au contenu
Communication Dans Un Congrès Année : 2017

Predicting Locations in Tweets

Résumé

Five hundred millions of tweets are posted daily, making Twitter a major social media from which topical information on events can be extracted. Events are represented by time, location and entity-related information. This paper focuses on location which is an important clue for both users and geo-spatial applications. We address the problem of predicting whether a tweet contains a location or not, as location prediction is a useful pre-processing step for location extraction, by defining a number of features to represent tweets and conducting intensive evaluation of machine learning parameters. We found that: (1) not only words appearing in a geography gazetteer are important but the occurrence of a preposition right before a proper noun also is. (2) it is possible to improve precision on location extraction if the occurrence of a location is predicted.
Fichier principal
Vignette du fichier
hoang_22055.pdf (281.08 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-02624131 , version 1 (26-05-2020)

Identifiants

  • HAL Id : hal-02624131 , version 1
  • OATAO : 22055

Citer

Thi Bich Ngoc Hoang, Véronique Moriceau, Josiane Mothe. Predicting Locations in Tweets. CINCLing 2017 : 18th International Conference on Intelligent Text Processing and Computational Linguistics, Apr 2017, Budapest, Hungary. ⟨hal-02624131⟩
47 Consultations
24 Téléchargements

Partager

Gmail Facebook X LinkedIn More