Search Results

Documents authored by Das, Pragyan P.


Document
Predicting Distance and Direction from Text Locality Descriptions for Biological Specimen Collections

Authors: Ruoxuan Liao, Pragyan P. Das, Christopher B. Jones, Niloofar Aflaki, and Kristin Stock

Published in: LIPIcs, Volume 240, 15th International Conference on Spatial Information Theory (COSIT 2022)


Abstract
A considerable proportion of records that describe biological specimens (flora, soil, invertebrates), and especially those that were collected decades ago, are not attached to corresponding geographical coordinates, but rather have their location described only through textual descriptions (e.g. North Canterbury, Selwyn River near bridge on Springston-Leeston Rd). Without geographical coordinates, millions of records stored in museum collections around the world cannot be mapped. We present a method for predicting the distance and direction associated with human language location descriptions which focuses on the interpretation of geospatial prepositions and the way in which they modify the location represented by an associated reference place name (e.g. near the Manawatu River). We study eight distance-oriented prepositions and eight direction-oriented prepositions and use machine learning regression to predict distance or direction, relative to the reference place name, from a collection of training data. The results show that, compared with a simple baseline, our model improved distance predictions by up to 60% and direction predictions by up to 31%.

Cite as

Ruoxuan Liao, Pragyan P. Das, Christopher B. Jones, Niloofar Aflaki, and Kristin Stock. Predicting Distance and Direction from Text Locality Descriptions for Biological Specimen Collections. In 15th International Conference on Spatial Information Theory (COSIT 2022). Leibniz International Proceedings in Informatics (LIPIcs), Volume 240, pp. 4:1-4:15, Schloss Dagstuhl - Leibniz-Zentrum für Informatik (2022)


Copy BibTex To Clipboard

@InProceedings{liao_et_al:LIPIcs.COSIT.2022.4,
  author =	{Liao, Ruoxuan and Das, Pragyan P. and Jones, Christopher B. and Aflaki, Niloofar and Stock, Kristin},
  title =	{{Predicting Distance and Direction from Text Locality Descriptions for Biological Specimen Collections}},
  booktitle =	{15th International Conference on Spatial Information Theory (COSIT 2022)},
  pages =	{4:1--4:15},
  series =	{Leibniz International Proceedings in Informatics (LIPIcs)},
  ISBN =	{978-3-95977-257-0},
  ISSN =	{1868-8969},
  year =	{2022},
  volume =	{240},
  editor =	{Ishikawa, Toru and Fabrikant, Sara Irina and Winter, Stephan},
  publisher =	{Schloss Dagstuhl -- Leibniz-Zentrum f{\"u}r Informatik},
  address =	{Dagstuhl, Germany},
  URL =		{https://drops.dagstuhl.de/entities/document/10.4230/LIPIcs.COSIT.2022.4},
  URN =		{urn:nbn:de:0030-drops-168892},
  doi =		{10.4230/LIPIcs.COSIT.2022.4},
  annote =	{Keywords: geospatial prepositions, biological specimen collections, georeferencing, natural language processing, locative expressions, locality descriptions, geoparsing, geocoding, geographic information retrieval, regression, machine learning}
}
Questions / Remarks / Feedback
X

Feedback for Dagstuhl Publishing


Thanks for your feedback!

Feedback submitted

Could not send message

Please try again later or send an E-mail