Noisy-LSTM: Improving Temporal Awareness for Video Semantic Segmentation

Bowen Wang; Liangzhi Li; Yuta Nakashima; Ryo Kawasaki; Hajime Nagahara; Yasushi Yagi

Journal ArticleOPEN ACCESS

Noisy-LSTM: Improving Temporal Awareness for Video Semantic Segmentation

IEEE Access (2021) 9 46810-46820

DOI: 10.1109/ACCESS.2021.3067928

16Citations

33Readers

Abstract

Semantic video segmentation is a key challenge for various applications. This paper presents a new model named Noisy-LSTM, which is trainable in an end-to-end manner, with convolutional LSTMs (ConvLSTMs) to leverage the temporal coherence in video frames, together with a simple yet effective training strategy that replaces a frame in a given video sequence with noises. Our training strategy spoils the temporal coherence in video frames and thus makes the temporal links in ConvLSTMs unreliable; this may consequently improve the ability of the model to extract features from video frames and serve as a regularizer to avoid overfitting, without requiring extra data annotations or computational costs. Experimental results demonstrate that the proposed model can achieve state-of-the-art performances on both the CityScapes and EndoVis2018 datasets. The code for the proposed method is available at https://github.com/wbw520/NoisyLSTM.

Author supplied keywords

Cite

CITATION STYLE

APA

Wang, B., Li, L., Nakashima, Y., Kawasaki, R., Nagahara, H., & Yagi, Y. (2021). Noisy-LSTM: Improving Temporal Awareness for Video Semantic Segmentation. IEEE Access, 9, 46810–46820. https://doi.org/10.1109/ACCESS.2021.3067928

Noisy-LSTM: Improving Temporal Awareness for Video Semantic Segmentation

Abstract

Author supplied keywords

Cite

Register to see more suggestions