License: Creative Commons Attribution 3.0 Unported license (CC BY 3.0)
When quoting this document, please refer to the following
DOI: 10.4230/LIPIcs.GIScience.2021.I.5
URN: urn:nbn:de:0030-drops-130405
URL: https://drops.dagstuhl.de/opus/volltexte/2020/13040/


Hervey, Thomas ; Lafia, Sara ; Kuhn, Werner

Search Facets and Ranking in Geospatial Dataset Search



Abstract

This study surveys the state of search on open geospatial data portals. We seek to understand 1) what users are able to control when searching for geospatial data, 2) how these portals process and interpret a user’s query, and 3) whether and how user query reformulations alter search results. We find that most users initiate a search using a text input and several pre-created facets (such as a filter for tags or format). Some portals supply a map view of data or topic explorers. To process and interpret queries, most portals use a vertical full-text search engine like Apache Solr to query data from a content-management system like CKAN. When processing queries, most portals first filter results and then rank the remaining results using a common keyword-frequency relevance metric (e.g., TF-IDF). Some portals use query expansion. We identify and discuss several recurring usability constraints across portals. For example, users are typically given only text lists for interacting with search results. Furthermore, ranking rarely extends beyond syntactic comparison of keyword similarity. We discuss several avenues for improving search for geospatial data, including alternative interfaces and query processing pipelines.
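
The filter-then-rank pattern described in the abstract can be illustrated with a minimal sketch. It is not the implementation used by any surveyed portal, nor the scoring function of Apache Solr or CKAN. The catalog records, the field names (`text`, `format`), and the helper names (`facet_filter`, `tfidf_rank`, `search`) are all hypothetical, and the smoothed IDF formula is one common variant chosen here for simplicity.

```python
import math
from collections import Counter

# Hypothetical catalog records, invented for illustration only.
DATASETS = [
    {"id": "a", "text": "city road network roads", "format": "shp"},
    {"id": "b", "text": "county parcel boundaries", "format": "shp"},
    {"id": "c", "text": "road traffic counts", "format": "csv"},
    {"id": "d", "text": "road road road centerlines", "format": "shp"},
]


def tokenize(text):
    """Lowercase and split on whitespace (no stemming, so 'roads' != 'road')."""
    return text.lower().split()


def facet_filter(records, **facets):
    """Step 1: keep only records that match every requested facet value."""
    return [r for r in records if all(r.get(k) == v for k, v in facets.items())]


def tfidf_rank(records, query):
    """Step 2: rank records by the summed TF-IDF weight of the query terms.

    Assumption: IDF is computed over the filtered subset with add-one
    smoothing; production engines typically use index-wide statistics.
    """
    docs = [tokenize(r["text"]) for r in records]
    n = len(docs)
    # Document frequency: number of documents containing each term.
    df = Counter(term for doc in docs for term in set(doc))
    scored = []
    for record, doc in zip(records, docs):
        if not doc:
            continue
        tf = Counter(doc)
        score = 0.0
        for term in tokenize(query):
            if term in tf:
                idf = math.log((1 + n) / (1 + df[term])) + 1
                score += (tf[term] / len(doc)) * idf
        if score > 0:
            scored.append((score, record["id"]))
    # Highest score first; ties keep their original catalog order.
    scored.sort(key=lambda pair: -pair[0])
    return [record_id for _, record_id in scored]


def search(records, query, **facets):
    """Filter by facets first, then rank what remains by keyword relevance."""
    return tfidf_rank(facet_filter(records, **facets), query)


if __name__ == "__main__":
    print(search(DATASETS, "road", format="shp"))
```

Because the comparison is purely syntactic, record `a` matches the query `road` on only one of its tokens (`roads` is treated as a different term), which reflects the limitation the abstract notes about ranking that rarely extends beyond keyword similarity.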

BibTeX - Entry

@InProceedings{hervey_et_al:LIPIcs:2020:13040,
  author =	{Thomas Hervey and Sara Lafia and Werner Kuhn},
  title =	{{Search Facets and Ranking in Geospatial Dataset Search}},
  booktitle =	{11th International Conference on Geographic Information Science (GIScience 2021) - Part I},
  pages =	{5:1--5:15},
  series =	{Leibniz International Proceedings in Informatics (LIPIcs)},
  ISBN =	{978-3-95977-166-5},
  ISSN =	{1868-8969},
  year =	{2020},
  volume =	{177},
  editor =	{Krzysztof Janowicz and Judith A. Verstegen},
  publisher =	{Schloss Dagstuhl--Leibniz-Zentrum f{\"u}r Informatik},
  address =	{Dagstuhl, Germany},
  URL =		{https://drops.dagstuhl.de/opus/volltexte/2020/13040},
  URN =		{urn:nbn:de:0030-drops-130405},
  doi =		{10.4230/LIPIcs.GIScience.2021.I.5},
  annote =	{Keywords: search, portal, discovery, GIR, facet, relevance, ranking}
}

Keywords: search, portal, discovery, GIR, facet, relevance, ranking
Collection: 11th International Conference on Geographic Information Science (GIScience 2021) - Part I
Issue Date: 2020
Date of publication: 25.09.2020

