Surgical workflow analysis is an important topic of computer-assisted intervention and phase recognition is one of its important tasks. Features extracted from video frames by 2D convolutional networks were proved feasible for online phase analysis in former publications. In this paper, we propose to extract fine-level temporal features from video clips using 3D convolutional networks (CNN) and use Long Short-Term Memory (LSTM) networks to capture coarse-level information. By combining fine-level and coarse-level information, our proposed method outperforms state-of-the-art online methods without using specific knowledge of surgeries and almost reaches the state-of-the-art offline performance.
CITATION STYLE
Chen, W., Feng, J., Lu, J., & Zhou, J. (2018). Endo3D: Online workflow analysis for endoscopic surgeries based on 3D CNN and LSTM. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 11041 LNCS, pp. 97–107). Springer Verlag. https://doi.org/10.1007/978-3-030-01201-4_12
Mendeley helps you to discover research relevant for your work.