Graphs offer a natural way to formulate Multiple Object Tracking (MOT) and Multiple Object Tracking and Segmentation (MOTS) within the tracking-by-detection paradigm. However, they also introduce a major challenge for learning methods, as defining a model that can operate on such structured domain is not trivial. In this work, we exploit the classical network flow formulation of MOT to define a fully differentiable framework based on Message Passing Networks. By operating directly on the graph domain, our method can reason globally over an entire set of detections and exploit contextual features. It then jointly predicts both final solutions for the data association problem and segmentation masks for all objects in the scene while exploiting synergies between the two tasks. We achieve state-of-the-art results for both tracking and segmentation in several publicly available datasets. Our code is available at https://github.com/ocetintas/MPNTrackSeg.
CITATION STYLE
Brasó, G., Cetintas, O., & Leal-Taixé, L. (2022). Multi-Object Tracking and Segmentation Via Neural Message Passing. International Journal of Computer Vision, 130(12), 3035–3053. https://doi.org/10.1007/s11263-022-01678-6
Mendeley helps you to discover research relevant for your work.