A Novel Self-Supervised Learning Network for Binocular Disparity Estimation

12Citations
Citations of this article
5Readers
Mendeley users who have this article in their library.

Abstract

Two-dimensional endoscopic images are susceptible to interferences such as specular reflections and monotonous texture illumination, hindering accurate three-dimensional lesion reconstruction by surgical robots. This study proposes a novel end-to-end disparity estimation model to address these challenges. Our approach combines a Pseudo-Siamese neural network architecture with pyramid dilated convolutions, integrating multi-scale image information to enhance robustness against lighting interferences. This study introduces a Pseudo-Siamese structure-based disparity regression model that simplifies left-right image comparison, improving accuracy and efficiency. The model was evaluated using a dataset of stereo endoscopic videos captured by the Da Vinci surgical robot, comprising simulated silicone heart sequences and real heart video data. Experimental results demonstrate significant improvement in the network’s resistance to lighting interference without substantially increasing parameters. Moreover, the model exhibited faster convergence during training, contributing to overall performance enhancement. This study advances endoscopic image processing accuracy and has potential implications for surgical robot applications in complex environments.

Cite

CITATION STYLE

APA

Tian, J., Zhou, Y., Chen, X., AlQahtani, S. A., Chen, H., Yang, B., … Zheng, W. (2025). A Novel Self-Supervised Learning Network for Binocular Disparity Estimation. CMES - Computer Modeling in Engineering and Sciences, 142(1), 209–229. https://doi.org/10.32604/cmes.2024.057032

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free