Signboards are important location landmarks that provide services to a local community. Non-disabled people can easily understand the meaning of a signboard based on its special shape; however, visually impaired people who need an assistive system to guide them to destinations or to help them understand their surroundings cannot. Currently, designing accurate assistive systems remain a challenge. Computer vision struggles to recognize signboards due to the diverse designs that combine text and images. Moreover, there is a lack of datasets to train the best model and reach good results. In this paper, we propose a novel framework that can automatically detect and recognize signboard logos. In addition, we utilize Google Street View to collect signboard images from Taiwan's streets. The proposed framework consists of a domain adaptation that not only reduces the loss function between source-target datasets, but also represents important source features adopted by the target dataset. In our model, we add nonlocal blocks and attention mechanisms called deep attention networks to achieve the best final result. We perform extensive experiments on both our dataset and public datasets to demonstrate the superior performance and effectiveness of our proposed method. The experimental results show that our proposed method outperforms state-of-the-art methods across all evaluation metrics.
CITATION STYLE
Yohannes, E., Lin, C. Y., Shih, T. K., Hong, C. Y., Enkhbat, A., & Utaminingrum, F. (2021). Domain Adaptation Deep Attention Network for Automatic Logo Detection and Recognition in Google Street View. IEEE Access, 9, 102623–102635. https://doi.org/10.1109/ACCESS.2021.3098713
Mendeley helps you to discover research relevant for your work.