Video Object Segmentation by Learning Location-Sensitive Embeddings

17Citations
Citations of this article
139Readers
Mendeley users who have this article in their library.

This article is free to access.

Abstract

We address the problem of video object segmentation which outputs the masks of a target object throughout a video given only a bounding box in the first frame. There are two main challenges to this task. First, the background may contain similar objects as the target. Second, the appearance of the target object may change drastically over time. To tackle these challenges, we propose an end-to-end training network which accomplishes foreground predictions by leveraging the location-sensitive embeddings which are capable to distinguish the pixels of similar objects. To deal with appearance changes, for a test video, we propose a robust model adaptation method which pre-scans the whole video, generates pseudo foreground/background labels and retrains the model based on the labels. Our method outperforms the state-of-the-art methods on the DAVIS and the SegTrack v2 datasets.

Cite

CITATION STYLE

APA

Ci, H., Wang, C., & Wang, Y. (2018). Video Object Segmentation by Learning Location-Sensitive Embeddings. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 11215 LNCS, pp. 524–539). Springer Verlag. https://doi.org/10.1007/978-3-030-01252-6_31

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free