Binary classification of images for applications in intelligent 3D scanning



Three-dimensional (3D) scanning techniques based on photogrammetry, also known as Structure-from-Motion (SfM), require many two-dimensional (2D) images of an object, obtained from different viewpoints, in order to create its 3D reconstruction. When these images are acquired using closed-space 3D scanning rigs, which are composed of a large number of cameras fitted on multiple pods, flash photography is required and image acquisition must be well synchronized to avoid the problem of ‘misfired’ cameras. This paper presents an approach to the binary classification (as ‘good’ or ‘misfired’) of images obtained during the 3D scanning process, using four machine learning methods: support vector machines, artificial neural networks, k-nearest neighbors, and random forests. The inputs to the algorithms are histograms of image regions determined to be of interest for detecting misfires. The algorithms are evaluated by the prediction accuracy they achieve on our dataset. The best result, an average prediction accuracy of 94.19% under cross-validation, is obtained with random forests. The proposed approach therefore allows the development of an ‘intelligent’ 3D scanning system that can automatically detect camera misfiring and repeat the scanning process without human intervention.
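The idea of classifying images from region histograms can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not give the regions of interest, the histogram bin count, or any classifier settings, so the `roi` rectangle, `bins=8`, `k=3`, and the synthetic bright/dark images below are all assumptions made for illustration. Only the k-nearest neighbors method is shown, written in plain Python and scored on a simple hold-out split rather than the cross-validation used in the paper.

```python
import random
from collections import Counter


def roi_histogram(image, roi, bins=8):
    """Normalized intensity histogram of a rectangular region of interest.

    image: list of rows of integer gray levels in 0..255.
    roi:   (top, left, height, width); a hypothetical region, not from the paper.
    """
    top, left, height, width = roi
    counts = [0] * bins
    for row in image[top:top + height]:
        for pixel in row[left:left + width]:
            counts[min(pixel * bins // 256, bins - 1)] += 1
    total = height * width
    return [c / total for c in counts]


def knn_predict(train_features, train_labels, query, k=3):
    """Majority vote among the k training histograms closest to the query."""
    ranked = sorted(
        (sum((a - b) ** 2 for a, b in zip(feature, query)), label)
        for feature, label in zip(train_features, train_labels)
    )
    votes = Counter(label for _, label in ranked[:k])
    return votes.most_common(1)[0][0]


def synthetic_image(rng, misfired, size=16):
    """Stand-in data (assumption): flash-lit frames are bright, misfired ones dark."""
    low, high = (0, 80) if misfired else (120, 255)
    return [[rng.randint(low, high) for _ in range(size)] for _ in range(size)]


rng = random.Random(0)
roi = (4, 4, 8, 8)  # assumed central 8x8 region
labels = ["good", "misfired"] * 20
images = [synthetic_image(rng, label == "misfired") for label in labels]
features = [roi_histogram(image, roi) for image in images]

# Simple hold-out split: first 30 images for training, last 10 for testing.
train_x, train_y = features[:30], labels[:30]
test_x, test_y = features[30:], labels[30:]

predictions = [knn_predict(train_x, train_y, x) for x in test_x]
accuracy = sum(p == t for p, t in zip(predictions, test_y)) / len(test_y)
print(f"hold-out accuracy: {accuracy:.2f}")
```

The synthetic classes are deliberately easy to separate, so the accuracy printed here says nothing about performance on real scanner images. In a detected-misfire workflow, a `"misfired"` prediction for any camera would be the trigger for repeating the scan.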




Vezilić, B., Gajić, D. B., Dragan, D., Petrović, V., Mihić, S., Anišić, Z., & Puhalac, V. (2017). Binary classification of images for applications in intelligent 3D scanning. Studies in Computational Intelligence, 737, 199–209.
