Extracting information from court process documents to populate a knowledge base produces data that is valuable to legal faculties, publishers and law firms. The challenge is that the relevant information is interdependent and structured by numerous semantic constraints of the legal domain. Ignoring these dependencies leads to inferior solutions. Hence, the objective of this paper is to demonstrate how the extraction pipeline can be improved by the use of probabilistic soft logic rules that reflect both legal and linguistic knowledge. We propose a probabilistic rule model for the overall extraction pipeline, which makes it possible both to map dependencies between local extraction models and to integrate additional domain knowledge in the form of logical constraints. We evaluate the performance of the model on a corpus of German court sentences.
Using Probabilistic Soft Logic to Improve Information Extraction in the Legal Domain
Type: Inproceedings
Author: B. Kirsch, S. Giesselbach, T. Schmude, M. Völkening, F. Rostalsko, S. Rüping
Journal: LWDA
Year: 2020
Citation information
B. Kirsch, S. Giesselbach, T. Schmude, M. Völkening, F. Rostalsko, S. Rüping:
Using Probabilistic Soft Logic to Improve Information Extraction in the Legal Domain.
LWDA,
2020,
https://www.semanticscholar.org/paper/Using-Probabilistic-Soft-Logic-to-Improve-in-the-Kirsch-Giesselbach/b1d66b35b2a1aa746a443348d27757b04bc24894