Towards real-time head pose estimation: Exploring parameter-reduced residual networks on in-the-wild datasets

7Citations
Citations of this article
12Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Head poses are a key component of human bodily communication and thus a decisive element of human-computer interaction. Real-time head pose estimation is crucial in the context of human-robot interaction or driver assistance systems. The most promising approaches for head pose estimation are based on Convolutional Neural Networks (CNNs). However, CNN models are often too complex to achieve real-time performance. To face this challenge, we explore a popular subgroup of CNNs, the Residual Networks (ResNets) and modify them in order to reduce their number of parameters. The ResNets are modified for different image sizes including low-resolution images and combined with a varying number of layers. They are trained on in-the-wild datasets to ensure real-world applicability. As a result, we demonstrate that the performance of the ResNets can be maintained while reducing the number of parameters. The modified ResNets achieve state-of-the-art accuracy and provide fast inference for real-time applicability.

Cite

CITATION STYLE

APA

Rieger, I., Hauenstein, T., Hettenkofer, S., & Garbas, J. U. (2019). Towards real-time head pose estimation: Exploring parameter-reduced residual networks on in-the-wild datasets. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 11606 LNAI, pp. 123–134). Springer Verlag. https://doi.org/10.1007/978-3-030-22999-3_12

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free