This paper aims to reduce the time to annotate images for panoptic segmentation, which requires annotating segmentation masks and class labels for all object instances and stuff regions. We formulate our approach as a collaborative process between an annotator and an automated assistant who take turns to jointly annotate an image using a predefined pool of segments. Actions performed by the annotator serve as a strong contextual signal. The assistant intelligently reacts to this signal by annotating other parts of the image on its own, which reduces the amount of work required by the annotator. We perform thorough experiments on the COCO panoptic dataset, both in simulation and with human annotators. These demonstrate that our approach is significantly faster than the recent machine-assisted interface of [Andriluka 18 ACMMM], and $2.4\times$ to $5\times$ faster than manual polygon drawing. Finally, we show on ADE20k that our method can be used to efficiently annotate new datasets, bootstrapping from a very small amount of annotated data.
CITATION STYLE
Uijlings, J. R. R., Andriluka, M., & Ferrari, V. (2020). Panoptic Image Annotation with a Collaborative Assistant. In MM 2020 - Proceedings of the 28th ACM International Conference on Multimedia (pp. 3302–3310). Association for Computing Machinery, Inc. https://doi.org/10.1145/3394171.3413812
Mendeley helps you to discover research relevant for your work.