Microsoft Kinect, a low‐cost motion capture device, has huge potential in applications that require machine vision, such as human‐robot interactions, home‐based rehabilitation and clinical assessments. The Kinect sensor can track 25 key three‐dimensional (3D) “skeleton” joints on the human body at 30 frames per second, and the skeleton data often have acceptable accuracy. However, the skeleton data obtained from the sensor sometimes exhibit a high level of jitter due to noise and estimation error. This jitter is worse when there is occlusion or a subject moves slightly out of the field of view of the sensor for a short period of time. Therefore, this paper proposed a novel approach to simultaneously handle the noise and error in the skeleton data derived from Kinect. Initially, we adopted classification processing to divide the skeleton data into noise data and erroneous data. Furthermore, we used a Kalman filter to smooth the noise data and correct erroneous data. We performed an occlusion experiment to prove the effectiveness of our algorithm. The proposed method outperforms existing techniques, such as the moving mean filter and traditional Kalman filter. The experimental results show an improvement of accuracy of at least 58.7%, 47.5% and 22.5% compared to the original Kinect data, moving mean filter and traditional Kalman filter, respectively. Our method provides a new perspective for Kinect data processing and a solid data foundation for subsequent research that utilizes Kinect.
CITATION STYLE
Niu, J., Wang, X., Wang, D., & Ran, L. (2020). A novel method of human joint prediction in an occlusion scene by using low‐cost motion capture technique. Sensors (Switzerland), 20(4). https://doi.org/10.3390/s20041119
Mendeley helps you to discover research relevant for your work.