Weakly Supervised Segmentation of Hyper-Reflective Foci with Compact Convolutional Transformers and {SAM} 2
Weakly supervised segmentation has the potential to greatly reduce the annotation effort for training segmentation models for small structures such as hyper-reflective foci (HRF) in optical coherence tomography (OCT). However, most weakly supervised methods either involve a strong downsampling of input images, or only achieve localization at a coarse resolution, both of which are unsatisfactory for small structures. We propose a novel framework that increases the spatial resolution of a traditional attention-based Multiple Instance Learning (MIL) approach by using Layer-wise Relevance Propagation (LRP) to prompt the Segment Anything Model (SAM 2), and increases recall with iterative inference. Moreover, we demonstrate that replacing MIL with a Compact Convolutional Transformer (CCT), which adds a positional encoding, and permits an exchange of information between different regions of the OCT image, leads to a further and substantial increase in segmentation accuracy.
- Published in:
Bildverarbeitung für die Medizin - Type:
Inproceedings - Authors:
Morelle, Olivier; Bisten, Justus; Wintergerst, Maximilian; Finger, Robert; Schultz, Thomas - Year:
2025
Citation information
Morelle, Olivier; Bisten, Justus; Wintergerst, Maximilian; Finger, Robert; Schultz, Thomas: Weakly Supervised Segmentation of Hyper-Reflective Foci with Compact Convolutional Transformers and {SAM} 2, Bildverarbeitung für die Medizin, 2025, 101--106, Morelle.etal.2025a,
@Inproceedings{Morelle.etal.2025a,
author={Morelle, Olivier; Bisten, Justus; Wintergerst, Maximilian; Finger, Robert; Schultz, Thomas},
title={Weakly Supervised Segmentation of Hyper-Reflective Foci with Compact Convolutional Transformers and {SAM} 2},
booktitle={Bildverarbeitung für die Medizin},
pages={101--106},
year={2025},
abstract={Weakly supervised segmentation has the potential to greatly reduce the annotation effort for training segmentation models for small structures such as hyper-reflective foci (HRF) in optical coherence tomography (OCT). However, most weakly supervised methods either involve a strong downsampling of input images, or only achieve localization at a coarse resolution, both of which are unsatisfactory...}}