Image segmentation using encoder-decoder with deformable convolutions

Andreea Gurita; Irina Georgiana Mocanu

Journal ArticleOPEN ACCESS

Image segmentation using encoder-decoder with deformable convolutions

Sensors (2021) 21(5) 1-27

DOI: 10.3390/s21051570

18Citations

16Readers

Abstract

Image segmentation is an essential step in image analysis that brings meaning to the pixels in the image. Nevertheless, it is also a difficult task due to the lack of a general suited approach to this problem and the use of real-life pictures that can suffer from noise or object obstruction. This paper proposes an architecture for semantic segmentation using a convolutional neural network based on the Xception model, which was previously used for classification. Different experiments were made in order to find the best performances of the model (eg. different resolution and depth of the network and data augmentation techniques were applied). Additionally, the network was improved by adding a deformable convolution module. The proposed architecture obtained a 76.8 mean IoU on the Pascal VOC 2012 dataset and 58.1 on the Cityscapes dataset. It outperforms SegNet and U-Net networks, both networks having considerably more parameters and also a higher inference time.

Author supplied keywords

Cite

CITATION STYLE

APA

Gurita, A., & Mocanu, I. G. (2021). Image segmentation using encoder-decoder with deformable convolutions. Sensors, 21(5), 1–27. https://doi.org/10.3390/s21051570

Image segmentation using encoder-decoder with deformable convolutions

Abstract

Author supplied keywords

Cite

Register to see more suggestions