Many computer vision problems and applications rely on an accurate, fast head pose estimator. We model head pose estimation as a regression problem and show that the appearance of the facial image can serve as a feature that captures pose variations. We use a parametrized Multi-Variate Relevance Vector Machine (MVRVM) to learn the three rotation angles of the face (yaw, pitch, and roll). The input to the MVRVM is the normalized mean pixel intensities of the face patches, and the output is the three head rotation angles. We evaluated our approach on the challenging YouTube Faces dataset. We achieved head pose estimation with an average error tolerance of ±6.5° in the yaw angle and less than ±2.5° in both the pitch and roll angles. A single prediction takes 2-3 milliseconds, which makes the approach suitable for real-time applications.
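As an illustration of the input feature described above, the following is a minimal sketch of computing normalized mean pixel intensities over face patches. The abstract does not specify the patch layout or the normalization, so the regular grid, the default grid size, the zero-mean/unit-variance scaling, and the function name `patch_mean_features` are all assumptions made for this example, not the authors' stated method.

```python
import numpy as np


def patch_mean_features(face, grid=(8, 8)):
    """Return normalized mean intensities of grid patches of a grayscale face crop.

    Assumptions (not taken from the paper): a regular rows x cols grid and
    zero-mean / unit-variance normalization of the patch means.
    """
    face = np.asarray(face, dtype=np.float64)
    if face.ndim != 2:
        raise ValueError("expected a 2-D grayscale image")
    rows, cols = grid
    if face.shape[0] < rows or face.shape[1] < cols:
        raise ValueError("image is smaller than the requested grid")

    # Split row and column indices into near-equal bands so any size works.
    row_bands = np.array_split(np.arange(face.shape[0]), rows)
    col_bands = np.array_split(np.arange(face.shape[1]), cols)

    # Mean intensity of each patch, in row-major patch order.
    means = np.array(
        [face[np.ix_(r, c)].mean() for r in row_bands for c in col_bands]
    )

    # Normalize across patches; a constant image maps to all zeros.
    centered = means - means.mean()
    std = means.std()
    return centered / std if std > 0 else centered
```

The resulting vector (one value per patch) would then be passed to a multi-output regressor that predicts yaw, pitch, and roll; the regressor itself is not sketched here because the abstract gives no details of its parametrization.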
Selim, M., Pagani, A., & Stricker, D. (2015). Real-time head pose estimation using multi-variate RVM on faces in the wild. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 9257, pp. 254–265). Springer Verlag. https://doi.org/10.1007/978-3-319-23117-4_22