{SG}-{DOR}: Learning Scene Graphs with Direction-Conditioned Occlusion Reasoning for Pepper Plants
Robotic harvesting in dense crop canopies requires effective interventions that depend not only on geometry, but also on explicit, direction-conditioned relations identifying which organs obstruct a target fruit. We present {SG}-{DOR} (Scene Graphs with Direction-Conditioned Occlusion Reasoning), a relational framework that, given instance-segmented organ point clouds, infers a scene graph encoding physical attachments and direction-conditioned occlusion. We introduce an occlusion ranking task for retrieving and ranking candidate leaves for a target fruit and approach direction, and propose a direction-aware graph neural architecture with per-fruit leaf-set attention and union-level aggregation. Experiments on a multi-plant synthetic pepper dataset show improved occlusion prediction (F1=0.73, {NDCG}\@3=0.85) and attachment inference (edge F1=0.83) over strong ablations, yielding a structured relational signal for downstream intervention planning.
- Veröffentlicht in:
arXiv - Typ:
Article - Autoren:
- Jahr:
2026 - Source:
http://arxiv.org/abs/2603.06512
Informationen zur Zitierung
: {SG}-{DOR}: Learning Scene Graphs with Direction-Conditioned Occlusion Reasoning for Pepper Plants, arXiv, 2026, {arXiv}:2603.06512, March, {arXiv}, http://arxiv.org/abs/2603.06512, Menon.etal.2026a,
@Article{Menon.etal.2026a,
author={Menon, Rohit; Mueller-Goldingen, Niklas; Pan, Sicong; Chenchani, Gokul Krishna; Bennewitz, Maren},
title={{SG}-{DOR}: Learning Scene Graphs with Direction-Conditioned Occlusion Reasoning for Pepper Plants},
journal={arXiv},
number={{arXiv}:2603.06512},
month={March},
publisher={{arXiv}},
url={http://arxiv.org/abs/2603.06512},
year={2026},
abstract={Robotic harvesting in dense crop canopies requires effective interventions that depend not only on geometry, but also on explicit, direction-conditioned relations identifying which organs obstruct a target fruit. We present {SG}-{DOR} (Scene Graphs with Direction-Conditioned Occlusion Reasoning), a relational framework that, given instance-segmented organ point clouds, infers a scene graph...}}