Geodata source retrieval by multilingual/semantic query expansion: the Case of Google Translate and WordNet version 3.1
- Department of Human Geography and Planning, Faculty of Geoscience, Utrecht University, Utrecht, Netherlands
Keywords: semantic, query expansion, information retrieval metrics
Abstract. In this article, we examined the potential of the current version of WordNet and Google Translate API to enhance the quality of geodata source retrieval in the Dutch geoinformation portal (PDOK) using semantic keywords for the geographic phenomena requested. Keywords gathered from real users’ questions in natural language extracted in an English corpus. Then, these keywords were expanded using WordNet and Google Translate API. Lastly, the results of query expansion were evaluated compared to a manual gold standard and based on information retrieval metrics. Our study shows that the results of query expansion help users by reformulating good alternative queries.