Research on multitask deep learning network for semantic segmentation and object detection

Ting Rui; Feng Xiao; Jian Tang; Fukai Zhang; Chengsong Yang; Min Liu

Conference Proceedings

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2018) 11166 LNCS 708-718

DOI: 10.1007/978-3-030-00764-5_65

Abstract

Based on an analysis of object detection methods under existing deep learning frameworks, a multitask learning model, the Fully Convolution Object Detection Network (FCDN), is proposed. It performs end-to-end semantic segmentation and object detection through deep learning without predefined default boxes. First, this paper analyzes why current mainstream object detection networks need default boxes to be delineated in advance. Second, it proposes an object detection network that needs no default boxes: the network uses semantic segmentation to detect all object boundaries and key points at the pixel level, and then obtains prediction boxes by combining them with the category information of the semantic segmentation map. Finally, the feasibility of the method is verified on the VOC 2007 dataset, and its performance is compared with that of current mainstream object detection algorithms. The results show that the new model performs semantic segmentation and object detection at the same time. Trained on the same training samples, FCDN achieves higher detection precision than the classic detection models.
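The abstract gives no implementation details for how prediction boxes are obtained from the segmentation output. As a rough illustration of the general idea only, the sketch below derives one box per connected region of a class-label map. It is not the FCDN procedure: connected-component flood fill stands in for the paper's boundary and key-point detection, and the function name `boxes_from_segmentation`, the 4-connectivity, and the background label `0` are all assumptions made for this example.

```python
from collections import deque


def boxes_from_segmentation(seg, background=0):
    """Return (class_id, x_min, y_min, x_max, y_max) per 4-connected region.

    `seg` is a list of rows of integer class labels. This is an illustrative
    stand-in (assumption), not the method described in the paper.
    """
    height = len(seg)
    width = len(seg[0]) if height else 0
    seen = [[False] * width for _ in range(height)]
    boxes = []
    for y in range(height):
        for x in range(width):
            cls = seg[y][x]
            if cls == background or seen[y][x]:
                continue
            # Flood-fill the region of same-class pixels that contains (y, x),
            # tracking the extreme coordinates as the region grows.
            queue = deque([(y, x)])
            seen[y][x] = True
            x_min = x_max = x
            y_min = y_max = y
            while queue:
                cy, cx = queue.popleft()
                x_min, x_max = min(x_min, cx), max(x_max, cx)
                y_min, y_max = min(y_min, cy), max(y_max, cy)
                for ny, nx in ((cy - 1, cx), (cy + 1, cx), (cy, cx - 1), (cy, cx + 1)):
                    if (0 <= ny < height and 0 <= nx < width
                            and not seen[ny][nx] and seg[ny][nx] == cls):
                        seen[ny][nx] = True
                        queue.append((ny, nx))
            boxes.append((cls, x_min, y_min, x_max, y_max))
    return boxes


# Toy label map: class 1 occupies a 2x2 block, class 2 a vertical strip.
toy_map = [
    [0, 1, 1, 0, 0],
    [0, 1, 1, 0, 2],
    [0, 0, 0, 0, 2],
]
toy_boxes = boxes_from_segmentation(toy_map)
```

A pure region-based approach like this merges touching objects of the same class into a single box, which is presumably one reason the paper also detects object boundaries and key points at the pixel level.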

Citation (APA)

Rui, T., Xiao, F., Tang, J., Zhang, F., Yang, C., & Liu, M. (2018). Research on multitask deep learning network for semantic segmentation and object detection. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 11166 LNCS, pp. 708–718). Springer Verlag. https://doi.org/10.1007/978-3-030-00764-5_65
