Multi-path feedback recurrent neural networks for scene parsing

Abstract

In this paper, we consider the scene parsing problem and propose a novel Multi-Path Feedback recurrent neural network (MPF-RNN) for parsing scene images. MPF-RNN enhances the capability of RNNs to model long-range context information at multiple levels and better distinguishes pixels that are easy to confuse. Unlike feedforward CNNs and RNNs with only a single feedback connection, MPF-RNN propagates the contextual features learned at the top layer through multiple weighted recurrent connections to learn bottom features. To train MPF-RNN more effectively, we propose a new strategy that considers the accumulative loss at multiple recurrent steps to improve the performance of MPF-RNN on parsing small objects. With these two novel components, MPF-RNN achieves significant improvement over strong baselines (VGG16 and Res101) on five challenging scene parsing benchmarks: the traditional SiftFlow, Barcelona, CamVid, and Stanford Background datasets, as well as the recently released large-scale ADE20K.
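
The two ideas named in the abstract, multiple weighted feedback paths from the top layer and a loss accumulated over recurrent steps, can be illustrated with a toy sketch. This is not the authors' implementation: the layer sizes, the additive way feedback is combined with the bottom-up input, the tanh nonlinearity, the unweighted sum of per-step losses, and all names (`make_params`, `mpf_forward`) are assumptions made for illustration, and the example works on a single feature vector rather than an image.

```python
import math
import random

def dense(W, x):
    """Matrix-vector product W @ x for plain Python lists."""
    return [sum(w * v for w, v in zip(row, x)) for row in W]

def softmax(z):
    m = max(z)
    e = [math.exp(v - m) for v in z]
    s = sum(e)
    return [v / s for v in e]

def rand_matrix(rng, rows, cols, scale=0.5):
    return [[rng.uniform(-scale, scale) for _ in range(cols)] for _ in range(rows)]

def make_params(dim=4, n_layers=3, n_classes=3, seed=0):
    """Hypothetical toy parameters: one forward and one feedback matrix per layer."""
    rng = random.Random(seed)
    return {
        "forward": [rand_matrix(rng, dim, dim) for _ in range(n_layers)],
        "feedback": [rand_matrix(rng, dim, dim) for _ in range(n_layers)],
        "classifier": rand_matrix(rng, n_classes, dim),
    }

def mpf_forward(x, params, label, steps=3):
    """Unroll the toy network for `steps` recurrent steps.

    Returns (class probabilities at each step, cross-entropy summed over steps).
    """
    dim = len(params["forward"][0])
    top = [0.0] * dim  # no top-layer context exists before the first step
    probs_per_step, total_loss = [], 0.0
    for _ in range(steps):
        h = x
        for W_fwd, W_fb in zip(params["forward"], params["feedback"]):
            # Each layer combines its bottom-up input with the previous step's
            # top-layer features, sent through that layer's own feedback weights
            # (assumption: the two contributions are simply added).
            pre = [a + b for a, b in zip(dense(W_fwd, h), dense(W_fb, top))]
            h = [math.tanh(v) for v in pre]
        top = h
        probs = softmax(dense(params["classifier"], top))
        probs_per_step.append(probs)
        # Accumulate the loss at every recurrent step, not only the last one.
        total_loss += -math.log(probs[label])
    return probs_per_step, total_loss

params = make_params()
probs, loss = mpf_forward([0.2, -0.1, 0.4, 0.3], params, label=1, steps=3)
```

In this sketch the first step reduces to a plain feedforward pass because the fed-back context starts at zero; later steps re-run the same layers with top-layer context injected at several depths. Summing the per-step losses means every unrolled step receives a direct training signal.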

Cite

Jin, X., Chen, Y., Jie, Z., Feng, J., & Yan, S. (2017). Multi-path feedback recurrent neural networks for scene parsing. In 31st AAAI Conference on Artificial Intelligence, AAAI 2017 (pp. 4096–4102). AAAI Press. https://doi.org/10.1609/aaai.v31i1.11199
