CASQAD – A New Dataset for Context-Aware Spatial Question Answering

Author: J. Rose, J. Lehmann
Journal: ISWC 2020: The Semantic Web
Year: 2020

Citation information

J. Rose, J. Lehmann,
ISWC 2020: The Semantic Web,
2020,
3–17,
Springer, Cham,
https://link.springer.com/chapter/10.1007%2F978-3-030-62466-8_1

The task of factoid question answering (QA) faces new challenges when applied in scenarios with rapidly changing context information, for example on smartphones. Instead of asking who the architect of the “Holocaust Memorial” in Berlin was, the same question could be phrased as “Who was the architect of the many stelae in front of me?” presuming the user is standing in front of it. While traditional QA systems rely on static information from knowledge bases and the analysis of named entities and predicates in the input, question answering for temporal and spatial questions imposes new challenges to the underlying methods. To tackle these challenges, we present the Context-aware Spatial QA Dataset (CASQAD) with over 5,000 annotated questions containing visual and spatial references that require information about the user’s location and moving direction to compose a suitable query. These questions were collected in a large scale user study and annotated semi-automatically, with appropriate measures to ensure the quality.