A Fully Automated Approach to a Complete Semantic Table Interpretation

Author: M. Cremaschi, F. De Paoli, A. Rula, B. Spahiu
Journal: Future Gener. Comput. Syst.
Year: 2020

Citation information

M. Cremaschi, F. De Paoli, A. Rula, B. Spahiu,
Future Gener. Comput. Syst.,
2020,
112,
478-500,
https://www.sciencedirect.com/science/article/abs/pii/S0167739X19302663?via%3Dihub

In recent years, there has been an increasing interest in extracting and annotating tables on the Web. This activity allows the transformation of text data into machine-readable formats to enable the execution of various artificial intelligence tasks, e.g. semantic search and dataset extension. Semantic Table Interpretation is the process of annotating elements in a table. Current approaches are mainly based on lexical matching algorithms that rely on metadata associated with tables or custom Knowledge Graphs. Their main limitations are due to the lack of metadata, the little use of contextual semantics, and the incompleteness of the proposed methods that do not include all the necessary steps. In this paper, we propose a comprehensive approach and a tool that provides an unsupervised method to annotate independent tables, possibly without header row or other external information. The approach is based on the definition of a context created from the elements within the table in order to discriminate among matching entities found in shared Knowledge Graphs and create high-quality annotations. The approach has achieved excellent results in an international challenge, thus proving its effectiveness.