Deep point-wise prediction for action temporal proposal

Abstract

Detecting actions in videos is an important yet challenging task. Previous works usually rely on (a) sliding-window paradigms or (b) per-frame action scoring and grouping to enumerate possible temporal locations, so their performance is limited by the design of the sliding windows or the grouping strategies. In this paper, we present a simple and effective method for temporal action proposal generation, named Deep Point-wise Prediction (DPP). DPP simultaneously predicts the probability that an action exists and its corresponding temporal boundaries, without relying on any handcrafted sliding windows or grouping. The whole system is trained end-to-end with a joint loss for temporal action proposal classification and location prediction. We conduct extensive experiments on the standard THUMOS14 dataset to verify its effectiveness, generality, and robustness. DPP processes more than 1000 frames per second, which largely satisfies real-time requirements. The code is available at https://github.com/liluxuan1997/DPP.
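
To make the point-wise formulation concrete, the sketch below shows one possible reading of the abstract: each temporal position predicts an actionness score plus two offsets to the proposal start and end, trained with a joint classification and regression loss. This is a minimal illustration assuming a PyTorch-style implementation; the module and function names (PointwiseProposalHead, joint_loss) and all hyperparameters are hypothetical and are not taken from the authors' repository.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class PointwiseProposalHead(nn.Module):
    """Per-point prediction head: for every temporal position, output an
    actionness logit and two non-negative offsets (distance to the proposal
    start and end)."""

    def __init__(self, in_channels=512):
        super().__init__()
        self.cls_branch = nn.Conv1d(in_channels, 1, kernel_size=3, padding=1)
        self.loc_branch = nn.Conv1d(in_channels, 2, kernel_size=3, padding=1)

    def forward(self, feats):                          # feats: (B, C, T)
        cls_logits = self.cls_branch(feats)            # (B, 1, T) actionness logits
        loc_offsets = F.relu(self.loc_branch(feats))   # (B, 2, T) start/end offsets
        return cls_logits, loc_offsets


def joint_loss(cls_logits, loc_offsets, cls_targets, loc_targets, loc_weight=1.0):
    """Joint objective: binary cross-entropy for proposal classification plus a
    smooth-L1 regression loss on the offsets, evaluated only at positive points."""
    cls_loss = F.binary_cross_entropy_with_logits(cls_logits.squeeze(1), cls_targets)
    pos = cls_targets > 0.5                            # positive temporal positions, (B, T)
    if pos.any():
        loc_loss = F.smooth_l1_loss(loc_offsets.permute(0, 2, 1)[pos],
                                    loc_targets.permute(0, 2, 1)[pos])
    else:
        loc_loss = loc_offsets.sum() * 0.0             # keep the graph valid with no positives
    return cls_loss + loc_weight * loc_loss
```

Because every temporal position directly regresses its own proposal boundaries, no sliding-window enumeration or score-grouping step is needed at inference; proposals can be read off the per-point outputs and filtered, e.g. by non-maximum suppression.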

Cite (APA)
Li, L., Kong, T., Sun, F., & Liu, H. (2019). Deep point-wise prediction for action temporal proposal. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 11955 LNCS, pp. 475–487). Springer. https://doi.org/10.1007/978-3-030-36718-3_40
