Learning to Segment Instances in Videos with Spatial Propagation Network

Cheng, Jingchun; Liu, Sifei; Tsai, Yi-Hsuan; Hung, Wei-Chih; De Mello, Shalini; Gu, Jinwei; Kautz, Jan; Wang, Shengjin; Yang, Ming-Hsuan

Computer Science > Computer Vision and Pattern Recognition

arXiv:1709.04609 (cs)

[Submitted on 14 Sep 2017]

Title:Learning to Segment Instances in Videos with Spatial Propagation Network

Authors:Jingchun Cheng, Sifei Liu, Yi-Hsuan Tsai, Wei-Chih Hung, Shalini De Mello, Jinwei Gu, Jan Kautz, Shengjin Wang, Ming-Hsuan Yang

View PDF

Abstract:We propose a deep learning-based framework for instance-level object segmentation. Our method mainly consists of three steps. First, We train a generic model based on ResNet-101 for foreground/background segmentations. Second, based on this generic model, we fine-tune it to learn instance-level models and segment individual objects by using augmented object annotations in first frames of test videos. To distinguish different instances in the same video, we compute a pixel-level score map for each object from these instance-level models. Each score map indicates the objectness likelihood and is only computed within the foreground mask obtained in the first step. To further refine this per frame score map, we learn a spatial propagation network. This network aims to learn how to propagate a coarse segmentation mask spatially based on the pairwise similarities in each frame. In addition, we apply a filter on the refined score map that aims to recognize the best connected region using spatial and temporal consistencies in the video. Finally, we decide the instance-level object segmentation in each video by comparing score maps of different instances.

Comments:	CVPR 2017 Workshop on DAVIS Challenge. Code is available at this http URL
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1709.04609 [cs.CV]
	(or arXiv:1709.04609v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1709.04609

Submission history

From: Yi-Hsuan Tsai [view email]
[v1] Thu, 14 Sep 2017 04:15:49 UTC (1,283 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CV

< prev | next >

new | recent | 2017-09

Change to browse by:

References & Citations

DBLP - CS Bibliography

listing | bibtex

Jingchun Cheng
Sifei Liu
Yi-Hsuan Tsai
Wei-Chih Hung
Shalini De Mello

…

export BibTeX citation

Computer Science > Computer Vision and Pattern Recognition

Title:Learning to Segment Instances in Videos with Spatial Propagation Network

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Learning to Segment Instances in Videos with Spatial Propagation Network

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators