AOSM2022: Integration of Text and Geospatial Search for Hydrographic Datasets Using the Lucene Search Library
Related Information
Publication
Abstract
Miscellany
Download
Section 1: Publication
Authorship or Presenters
Matthew Yang, Siwen Yang, Jimmy Lin
Title
Integration of Text and Geospatial Search for Hydrographic Datasets Using the Lucene Search Library
Year
2022
Conference
AOSM2022
Theme
Hydrology and Terrestrial Ecosystems
Format
10-minute oral presentation
DOI
Citation
Matthew Yang, Siwen Yang, Jimmy Lin (2022). Integration of Text and Geospatial Search for Hydrographic Datasets Using the Lucene Search Library. Proceedings of the GWF Annual Open Science Meeting, May 16-18, 2022.
Additional Information
AOSM2022 core Computer Science
Section 2: Abstract
Plain Language Summary
Abstract
We present a hybrid text and geospatial search application for hydrographic datasets built on the open-source Lucene search library. Our goal is to demonstrate that it is possible to build custom GIS applications by integrating existing open-source components and data sources, which contrasts with existing approaches based on monolithic platforms such as ArcGIS and QGIS.
Lucene provides rich index structures and search capabilities for both free text and geometries; the former has already been integrated and exposed via our group's Anserini and Pyserini IR toolkits. In this work, we extend these toolkits to include geospatial capabilities as well. Combining knowledge extracted from Wikidata with the HydroSHEDS dataset, our application enables text and geospatial search of rivers around the world.
Section 3: Miscellany
Submitters
Matthew Yang | Submitter/Presenter | m259yang@uwaterloo.ca | University of Waterloo |
Miscellaneous Information
First Author: Matthew Yang, University of Waterloo
Additional Authors: Siwen Yang, University of Waterloo; Jimmy Lin, University of Waterloo
Section 4: Download
Download Links