Recorded cataract surgery videos play a prominent role in training and investigating the surgery, and enhancing the surgical outcomes. Due to storage limitations in hospitals, however, the recorded cataract surgeries are deleted after a short time and this precious source of information cannot be fully utilized. Lowering the quality to reduce the required storage space is not advisable since the degraded visual quality results in the loss of relevant information that limits the usage of these videos. To address this problem, we propose a relevance-based compression technique consisting of two modules: (i) relevance detection, which uses neural networks for semantic segmentation and classification of the videos to detect relevant spatio-temporal information, and (ii) content-adaptive compression, which restricts the amount of distortion applied to the relevant content while allocating less bitrate to irrelevant content. The proposed relevance-based compression framework is implemented considering five scenarios based on the definition of relevant information from the target audience's perspective. Experimental results demonstrate the capability of the proposed approach in relevance detection. We further show that the proposed approach can achieve high compression efficiency by abstracting substantial redundant information while retaining the high quality of the relevant content.
CITATION STYLE
Ghamsarian, N., Amirpourazarian, H., Timmerer, C., Taschwer, M., & Schöffmann, K. (2020). Relevance-Based Compression of Cataract Surgery Videos Using Convolutional Neural Networks. In MM 2020 - Proceedings of the 28th ACM International Conference on Multimedia (pp. 3577–3582). Association for Computing Machinery, Inc. https://doi.org/10.1145/3394171.3413658
Mendeley helps you to discover research relevant for your work.