Slam-or: Simultaneous localization, mapping and object recognition using video sensors data in open environments from the Sparse points cloud

Patryk Mazurek; Tomasz Hachaj

Journal ArticleOPEN ACCESS

Slam-or: Simultaneous localization, mapping and object recognition using video sensors data in open environments from the Sparse points cloud

Sensors (2021) 21(14)

DOI: 10.3390/s21144734

13Citations

32Readers

Abstract

In this paper, we propose a novel approach that enables simultaneous localization, mapping (SLAM) and objects recognition using visual sensors data in open environments that is capable to work on sparse data point clouds. In the proposed algorithm the ORB-SLAM uses the current and previous monocular visual sensors video frame to determine observer position and to determine a cloud of points that represent objects in the environment, while the deep neural network uses the current frame to detect and recognize objects (OR). In the next step, the sparse point cloud returned from the SLAM algorithm is compared with the area recognized by the OR network. Because each point from the 3D map has its counterpart in the current frame, therefore the filtration of points matching the area recognized by the OR algorithm is performed. The clustering algorithm determines areas in which points are densely distributed in order to detect spatial positions of objects detected by OR. Then by using principal component analysis (PCA)—based heuristic we estimate bounding boxes of detected objects. The image processing pipeline that uses sparse point clouds generated by SLAM in order to determine positions of objects recognized by deep neural network and mentioned PCA heuristic are main novelties of our solution. In contrary to state-of-the-art approaches, our algorithm does not require any additional calculations like generation of dense point clouds for objects positioning, which highly simplifies the task. We have evaluated our research on large benchmark dataset using various state-of-the-art OR architectures (YOLO, MobileNet, RetinaNet) and clustering algorithms (DBSCAN and OPTICS) obtaining promising results. Both our source codes and evaluation data sets are available for download, so our results can be easily reproduced.

Author supplied keywords

References Powered by Scopus

View more at Scopus

Cited by Powered by Scopus

View more at Scopus

Cite

CITATION STYLE

APA

Mazurek, P., & Hachaj, T. (2021). Slam-or: Simultaneous localization, mapping and object recognition using video sensors data in open environments from the Sparse points cloud. Sensors, 21(14). https://doi.org/10.3390/s21144734

Readers over time

Readers' Seniority

PhD / Post grad / Masters / Doc 10

83%

Researcher 2

17%

Readers' Discipline

Engineering 7

50%

Computer Science 5

36%

Medicine and Dentistry 1

Social Sciences 1

Article Metrics

Mentions

Blog Mentions: 1

View details >

Slam-or: Simultaneous localization, mapping and object recognition using video sensors data in open environments from the Sparse points cloud

Abstract

Author supplied keywords

References Powered by Scopus

Deep residual learning for image recognition

You only look once: Unified, real-time object detection

Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks

Cited by Powered by Scopus

Efficient deep learning-based semantic mapping approach using monocular vision for resource-limited mobile robots

A Study of Drone-based AI for Enhanced Human-AI Trust and Informed Decision Making in Human-AI Interactive Virtual Environments

WPO-net: Windowed pose optimization network for monocular visual odometry estimation

Register to see more suggestions

Cite

Readers over time

Readers' Seniority

Readers' Discipline

Article Metrics