Pose invariant object detection and classification plays a critical role in robust image recognition systems and can be applied in a multitude of applications, ranging from simple monitoring to advanced tracking. This paper analyzes the usage of the Bag of Words model for recognizing objects in different scales, orientations and perspective views within cluttered environments. The recognition system relies on image analysis techniques, such as feature detection, description and clustering along with machine learning classifiers. For pinpointing the location of the target object, it is proposed a multiscale sliding window approach followed by a dynamic thresholding segmentation. The recognition system was tested with several configurations of feature detectors, descriptors and classifiers and achieved an accuracy of 87% when recognizing cars from an annotated dataset with 177 training images and 177 testing images.
CITATION STYLE
Costa, C. M., Sousa, A., & Veiga, G. (2018). Pose Invariant Object Recognition Using a Bag of Words Approach. In Advances in Intelligent Systems and Computing (Vol. 694, pp. 153–164). Springer Verlag. https://doi.org/10.1007/978-3-319-70836-2_13
Mendeley helps you to discover research relevant for your work.