Articles | Volume 3
https://doi.org/10.5194/agile-giss-3-59-2022
https://doi.org/10.5194/agile-giss-3-59-2022
11 Jun 2022
 | 11 Jun 2022

Modelling Inhomogeneous Geodata Quality in a Dataset’s Metadata

Arne Rümmler, Christin Henzen, and Heiko Figgemeier

Keywords: geodata quality, metadata, linked data, reusability, DQV

Abstract. Extensive data quality descriptions as a vital part of a dataset’s metadata are widely accepted, albeit their provision in a formalized manner is often lacking. This is due to a number of problems that are frequently encountered by geodata producing scientists. As one of these problems, we identified missing, unknown or unused options to model inhomogeneity of data quality across space, time, and theme in a dataset’s metadata. Detailed information of inhomogeneous geodata quality beyond dataset-wide statistical measures (variance, min, max, etc.) is often only described in dataset accompanying papers or quality reports. These text-based approaches prevent precise querying and hinder the development of advanced data discovery tools that could make valuable use of inhomogeneous data quality information. We propose a profile for the data quality vocabulary (DQV) that allows to model inhomogeneous geodata quality. Considering established vocabularies typically used to describe geographic metadata, as well as ensuring compatibility with the default version of DQV, enhances the usability and thus, minimizes the effort for data producers to provide formalized descriptions of inhomogeneous data quality.