BoxSup: Exploiting Bounding Boxes to Supervise Convolutional Networks for Semantic Segmentation

Dai, Jifeng; He, Kaiming; Sun, Jian

Computer Science > Computer Vision and Pattern Recognition

arXiv:1503.01640 (cs)

[Submitted on 5 Mar 2015 (v1), last revised 18 May 2015 (this version, v2)]

Title:BoxSup: Exploiting Bounding Boxes to Supervise Convolutional Networks for Semantic Segmentation

Authors:Jifeng Dai, Kaiming He, Jian Sun

View PDF

Abstract:Recent leading approaches to semantic segmentation rely on deep convolutional networks trained with human-annotated, pixel-level segmentation masks. Such pixel-accurate supervision demands expensive labeling effort and limits the performance of deep networks that usually benefit from more training data. In this paper, we propose a method that achieves competitive accuracy but only requires easily obtained bounding box annotations. The basic idea is to iterate between automatically generating region proposals and training convolutional networks. These two steps gradually recover segmentation masks for improving the networks, and vise versa. Our method, called BoxSup, produces competitive results supervised by boxes only, on par with strong baselines fully supervised by masks under the same setting. By leveraging a large amount of bounding boxes, BoxSup further unleashes the power of deep convolutional networks and yields state-of-the-art results on PASCAL VOC 2012 and PASCAL-CONTEXT.

Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1503.01640 [cs.CV]
	(or arXiv:1503.01640v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1503.01640

Submission history

From: Jifeng Dai [view email]
[v1] Thu, 5 Mar 2015 14:06:53 UTC (1,428 KB)
[v2] Mon, 18 May 2015 09:00:40 UTC (2,269 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CV

< prev | next >

new | recent | 2015-03

Change to browse by:

References & Citations

DBLP - CS Bibliography

listing | bibtex

Jifeng Dai
Kaiming He
Jian Sun

export BibTeX citation

Computer Science > Computer Vision and Pattern Recognition

Title:BoxSup: Exploiting Bounding Boxes to Supervise Convolutional Networks for Semantic Segmentation

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:BoxSup: Exploiting Bounding Boxes to Supervise Convolutional Networks for Semantic Segmentation

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators