Geocoding for texts with fine-grain toponyms: an experiment on a geoparsed hiking descriptions corpus

Abstract : Geoparsing and geocoding are two essential middleware services to facilitate final user applications such as location-aware searching or different types of location-based services. The objective of this work is to propose a method for establishing a processing chain to support the geoparsing and geocoding of text documents describing events strongly linked with space and with a frequent use of fine-grain toponyms. The geoparsing part is a Natural Language Processing approach which combines the use of part of speech and syntactico-semantic combined patterns (cascade of transducers). However, the real novelty of this work lies in the geocoding method. The geocoding algorithm is unsupervised and takes profit of clustering techniques to provide a solution for disambiguating the toponyms found in gazetteers, and at the same time estimating the spatial footprint of those other fine-grain toponyms not found in gazetteers. The feasibility of the proposal has been tested with a corpus of hiking descriptions in French, Spanish and Italian.
Complete list of metadatas

Cited literature [32 references]  Display  Hide  Download

https://hal.archives-ouvertes.fr/hal-01069625
Contributor : Ludovic Moncla <>
Submitted on : Wednesday, November 12, 2014 - 4:56:23 PM
Last modification on : Wednesday, May 15, 2019 - 2:18:03 PM

File

Geocoding for texts with fine-...
Files produced by the author(s)

Identifiers

Collections

Citation

Ludovic Moncla, Walter Renteria-Agualimpia, Javier Nogueras-Iso, Mauro Gaio. Geocoding for texts with fine-grain toponyms: an experiment on a geoparsed hiking descriptions corpus. ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems (ACM SIGSPATIAL 2014), Nov 2014, Dallas, Texas, United States. ⟨10.1145/2666310.2666386⟩. ⟨hal-01069625v2⟩

Share

Metrics

Record views

315

Files downloads

595