The Anatomy of Evidence: An Investigation Into Explainable ICD Coding

Automatic medical coding has the potential to ease documentation and billing processes. For this task, transparency plays an important role for medical coders and regulatory bodies, which can be achieved using explainability methods. However, the evaluation of these approaches has been mostly limited to short text and binary settings due to a scarcity of annotated data. Recent efforts by Cheng et al. (2023) have introduced the MDACE dataset, which provides a valuable resource containing code evidence in clinical records. In this work, we conduct an in-depth analysis of the MDACE dataset and perform plausibility evaluation of current explainable medical coding systems from an applied perspective. With this, we contribute to a deeper understanding of automatic medical coding and evidence extraction. Our findings reveal that ground truth evidence aligns with code descriptions to a certain degree. An investigation into state-of-the-art approaches shows a high overlap with ground truth evidence. We propose match measures and highlight success and failure cases. Based on our findings, we provide recommendations for developing and evaluating explainable medical coding systems.

  • Published in:
    Findings of the Association for Computational Linguistics: ACL 2025
  • Type:
    Inproceedings
  • Authors:
    Beckh, Katharina; Studeny, Elisa; Gannamaneni, Sujan Sai; Antweiler, Dario; Rüping, Stefan
  • Year:
    2025
  • Source:
    https://aclanthology.org/2025.findings-acl.864/

Citation information

Beckh, Katharina; Studeny, Elisa; Gannamaneni, Sujan Sai; Antweiler, Dario; Rüping, Stefan: The Anatomy of Evidence: An Investigation Into Explainable ICD Coding, Findings of the Association for Computational Linguistics: ACL 2025, 2025, https://aclanthology.org/2025.findings-acl.864/, Beckh.etal.2025a,

Associated Lamarr Researchers

Portrait of Katharina Beckh.

Katharina Beckh

Author to the profile
lamarr institute person Gannamaneni Sujan Sai e1663925008286 - Lamarr Institute for Machine Learning (ML) and Artificial Intelligence (AI)

Sujan Sai Gannamaneni

Author to the profile
Dario Antweiler

Dario Antweiler

Autor to the profile