Learning a Neural Association Network for Self-supervised Multi-Object Tracking
This paper introduces a novel framework to learn data association for multi-object tracking in a self-supervised manner. Fully-supervised learning methods are known to achieve excellent tracking performances, but acquiring identity-level annotations is tedious and time-consuming. Motivated by the fact that in real-world scenarios object motion can be usually represented by a Markov process, we present a novel expectation maximization ({EM}) algorithm that trains a neural network to associate detections for tracking, without requiring prior knowledge of their temporal correspondences. At the core of our method lies a neural Kalman filter, with an observation model conditioned on associations of detections parameterized by a neural network. Given a batch of frames as input, data associations between detections from adjacent frames are predicted by a neural network followed by a Sinkhorn normalization that determines the assignment probabilities of detections to states. Kalman smoothing is then used to obtain the marginal probability of observations given the inferred states, producing a training objective to maximize this marginal probability using gradient descent. The proposed framework is fully differentiable, allowing the underlying neural model to be trained end-to-end. We evaluate our approach on the challenging {MOT}17, {MOT}20, and {BDD}100K datasets and achieve state-of-the-art results in comparison to self-supervised trackers using public detections.
- Published in:
British Machine Vision Conference (BMVC'25) - Type:
Inproceedings - Authors:
- Year:
2025 - Source:
https://bmvc2025.bmva.org/proceedings/762/
Citation information
: Learning a Neural Association Network for Self-supervised Multi-Object Tracking, British Machine Vision Conference (BMVC'25), 2025, September, https://bmvc2025.bmva.org/proceedings/762/, Li.etal.2025a,
@Inproceedings{Li.etal.2025a,
author={Li, Shuai; Burke, Michael; Ramamoorthy, Subramanian; Gall, Juergen},
title={Learning a Neural Association Network for Self-supervised Multi-Object Tracking},
booktitle={British Machine Vision Conference (BMVC'25)},
month={September},
url={https://bmvc2025.bmva.org/proceedings/762/},
year={2025},
abstract={This paper introduces a novel framework to learn data association for multi-object tracking in a self-supervised manner. Fully-supervised learning methods are known to achieve excellent tracking performances, but acquiring identity-level annotations is tedious and time-consuming. Motivated by the fact that in real-world scenarios object motion can be usually represented by a Markov process, we...}}