Journal articles

Deep Learning for Toponym Resolution: Geocoding Based on Pairs of Toponyms

Abstract : Geocoding aims to assign unambiguous locations (i.e., geographic coordinates) to place names (i.e., toponyms) referenced within documents (e.g., within spreadsheet tables or textual paragraphs). This task comes with multiple challenges, such as dealing with referent ambiguity (multiple places with the same name) or reference database completeness. In this work, we propose a geocoding approach based on modeling pairs of toponyms, which returns latitude-longitude coordinates. One of the input toponyms is geocoded, and the second is used as context to reduce ambiguity. The proposed approach is based on a deep neural network that uses Long Short-Term Memory (LSTM) units to produce representations from sequences of character n-grams. To train our model, we use toponym co-occurrences collected from different contexts, namely textual (i.e., co-occurrences of toponyms in Wikipedia articles) and geographical (i.e., inclusion and proximity of places based on Geonames data). Experiments were conducted on multiple geographical areas of interest (France, United States, Great Britain, Nigeria, Argentina, and Japan). Results show that models trained with co-occurrence data obtained a higher geocoding accuracy, and that proximity relations in combination with co-occurrences can help to obtain a slightly higher accuracy in geographical areas with fewer places in the data sources.
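
As an illustration of the input representation described in the abstract, the following minimal sketch turns a pair of toponyms into two sequences of character n-gram indices, the kind of integer sequences an LSTM-based model could consume. It is not the authors' implementation: the function names `char_ngrams` and `encode_pair`, the n-gram size of 4, the lowercasing, and the `^`/`$` boundary markers are all assumptions made for this example.

```python
from typing import Dict, List, Tuple


def char_ngrams(toponym: str, n: int = 4) -> List[str]:
    """Split a toponym into overlapping character n-grams.

    Assumption: the name is lowercased and wrapped in "^" and "$"
    so that n-grams at the start and end of the name are distinguishable.
    """
    padded = "^" + toponym.lower() + "$"
    if len(padded) <= n:
        # Names shorter than one n-gram are kept as a single token.
        return [padded]
    return [padded[i:i + n] for i in range(len(padded) - n + 1)]


def encode_pair(
    target: str, context: str, vocab: Dict[str, int], n: int = 4
) -> Tuple[List[int], List[int]]:
    """Map both toponyms of a pair to sequences of integer n-gram indices.

    `target` is the toponym to geocode; `context` is the second toponym
    used to reduce ambiguity. Unseen n-grams are added to `vocab`;
    index 0 is left free (for example, for sequence padding).
    """
    def encode(name: str) -> List[int]:
        return [vocab.setdefault(g, len(vocab) + 1) for g in char_ngrams(name, n)]

    return encode(target), encode(context)


# Hypothetical usage: the same target name paired with two different contexts.
vocab: Dict[str, int] = {}
paris_texas = encode_pair("Paris", "Texas", vocab)
paris_france = encode_pair("Paris", "France", vocab)
```

In this sketch the target sequence is identical for both pairs, so only the context sequence can distinguish Paris, Texas from Paris, France, which mirrors the role the abstract assigns to the second toponym.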

https://hal.archives-ouvertes.fr/hal-03464000
Contributor : Ludovic Moncla
Submitted on : Thursday, December 2, 2021 - 7:35:35 PM
Last modification on : Friday, September 30, 2022 - 11:34:16 AM
Long-term archiving on : Thursday, March 3, 2022 - 8:28:54 PM

File

ijgi-10-00818.pdf
Publisher files allowed on an open archive

Citation

Jacques Fize, Ludovic Moncla, Bruno Martins. Deep Learning for Toponym Resolution: Geocoding Based on Pairs of Toponyms. ISPRS International Journal of Geo-Information, 2021, Deep Learning Meets GIR: Recent Advances in Geographic Information Retrieval, 10 (12), pp.818. ⟨10.3390/ijgi10120818⟩. ⟨hal-03464000⟩
