This paper proposes a multi-level max-margin discriminative analysis (M3DA) framework, which takes both coarse and fine semantics into consideration, for the annotation of high-resolution satellite images. In order to generate more discriminative topic-level features, the M3DA uses the maximum entropy discrimination latent Dirichlet Allocation (MedLDA) model. Moreover, for improving the spatial coherence of visual words neglected by M3DA, conditional random field (CRF) is employed to optimize the soft label field composed of multiple label posteriors. The framework of M3DA enables one to combine word-level features (generated by support vector machines) and topic-level features (generated by MedLDA) via the bag-of-words representation. The experimental results on high-resolution satellite images have demonstrated that, using the proposed method can not only obtain suitable semantic interpretation, but also improve the annotation performance by taking into account the multi-level semantics and the contextual information. © 2013 by the authors.
CITATION STYLE
Hu, F., Yang, W., Chen, J., & Sun, H. (2013). Tile-level annotation of satellite images using multi-level max-margin discriminative random field. Remote Sensing, 5(5), 2275–2291. https://doi.org/10.3390/rs5052275
Mendeley helps you to discover research relevant for your work.