In this paper, we present a system for detecting pedestrians at long ranges using a combination of stereo-based detection, classification using deep learning, and a cascade of specialized classifiers that can reduce false positives and computational load. Specifically, we use stereo to perform detection of vertical structures which are further filtered based on edge responses. A convolutional neural network was then designed to support the classification of pedestrians using both appearance and stereo disparity-based features. A second convolutional network classifier was trained specifically for the case of long-range detections using appearance only. We further speed up the classifier using a cascade approach and multi-threading. The system was deployed on two robots, one using a high resolution stereo pair with 180 degree fisheye lenses and the other using 80 degree FOV lenses. Results are demonstrated on a large dataset captured in a variety of environments. © 2012 IEEE.
CITATION STYLE
Kira, Z., Hadsell, R., Salgian, G., & Samarasekera, S. (2012). Long-Range Pedestrian Detection using stereo and a cascade of convolutional network classifiers. In IEEE International Conference on Intelligent Robots and Systems (pp. 2396–2403). https://doi.org/10.1109/IROS.2012.6386029
Mendeley helps you to discover research relevant for your work.