Using Probabilistic Soft Logic to Improve Information Extraction in the Legal Domain

Extracting information from court process documents to populate a knowledge base produces data valuable to legal faculties, publishers and law firms. A challenge lies in the fact that the relevant information is interdependent and structured by numerous semantic constraints of the legal domain. Ignoring these dependencies leads to inferior solutions. Hence, the objective of this paper is to demonstrate how the extraction pipeline can be improved by the use of probabilistic soft logic rules that reflect both legal and linguistic knowledge. We propose a probabilistic rule model for the overall extraction pipeline, which enables to both map dependencies between local extraction models and to integrate additional domain knowledge in the form of logical constraints. We evaluate the performance of the model on a German court sentences corpus.

  • Published in:
    LWDA Lernen. Wissen. Daten. Analysen. (LWDA)
  • Type:
    Inproceedings
  • Authors:
    B. Kirsch, S. Giesselbach, T. Schmude, M. Völkening, F. Rostalsko, S. Rüping
  • Year:
    2020

Citation information

B. Kirsch, S. Giesselbach, T. Schmude, M. Völkening, F. Rostalsko, S. Rüping: Using Probabilistic Soft Logic to Improve Information Extraction in the Legal Domain, Lernen. Wissen. Daten. Analysen. (LWDA), LWDA, 2020, https://www.semanticscholar.org/paper/Using-Probabilistic-Soft-Logic-to-Improve-in-the-Kirsch-Giesselbach/b1d66b35b2a1aa746a443348d27757b04bc24894, Kirsch.etal.2020,