A Video Summarization Model Based on Deep Reinforcement Learning with Long-Term Dependency

Xu Wang; Yujie Li; Haoyu Wang; Longzhao Huang; Shuxue Ding

Journal ArticleOPEN ACCESS

A Video Summarization Model Based on Deep Reinforcement Learning with Long-Term Dependency

Sensors (2022) 22(19)

DOI: 10.3390/s22197689

5Citations

14Readers

Abstract

Deep summarization models have succeeded in the video summarization field based on the development of gated recursive unit (GRU) and long and short-term memory (LSTM) technology. However, for some long videos, GRU and LSTM cannot effectively capture long-term dependencies. This paper proposes a deep summarization network with auxiliary summarization losses to address this problem. We introduce an unsupervised auxiliary summarization loss module with LSTM and a swish activation function to capture the long-term dependencies for video summarization, which can be easily integrated with various networks. The proposed model is an unsupervised framework for deep reinforcement learning that does not depend on any labels or user interactions. Additionally, we implement a reward function ((Formula presented.)) that jointly considers the consistency, diversity, and representativeness of generated summaries. Furthermore, the proposed model is lightweight and can be successfully deployed on mobile devices and enhance the experience of mobile users and reduce pressure on server operations. We conducted experiments on two benchmark datasets and the results demonstrate that our proposed unsupervised approach can obtain better summaries than existing video summarization methods. Furthermore, the proposed algorithm can generate higher F scores with a nearly 6.3% increase on the SumMe dataset and a 2.2% increase on the TVSum dataset compared to the DR-DSN model.

Author supplied keywords

Cite

CITATION STYLE

APA

Wang, X., Li, Y., Wang, H., Huang, L., & Ding, S. (2022). A Video Summarization Model Based on Deep Reinforcement Learning with Long-Term Dependency. Sensors, 22(19). https://doi.org/10.3390/s22197689

A Video Summarization Model Based on Deep Reinforcement Learning with Long-Term Dependency

Abstract

Author supplied keywords

Cite

Register to see more suggestions