Head pose estimation for sign language video

Abstract

We address the problem of estimating three head pose angles in sign language video, using the Pointing04 data set as training data. The proposed model employs facial landmark points and Support Vector Regression models learned from the training set to estimate the yaw and pitch angles independently. A simple geometric approach is used for the roll angle. As a novel development, we propose using the detected skin-tone areas within the face bounding box as additional features for head pose estimation. The accuracy of the estimators we obtain compares favorably with published results on the same data, although the smaller number of pose angles in our setup may explain some of the observed advantage. We also evaluated the pose angle estimators against ground-truth values from a motion capture recording of a sign language video. The correlations for the yaw and roll angles exceeded 0.9, while the pitch correlation was slightly lower. As a whole, the results are very promising from both the computer vision and linguistic points of view. © 2013 Springer-Verlag.
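To make the described pipeline concrete, below is a minimal Python sketch of the approach outlined in the abstract, assuming scikit-learn for the Support Vector Regression. The landmark feature layout, the 4x4 skin-fraction grid, and the SVR hyperparameters (C, epsilon, RBF kernel) are illustrative assumptions, not the authors' configuration; only the overall structure (independent yaw/pitch regressors, geometric roll, skin-tone features within the face bounding box) follows the abstract.

```python
import numpy as np
from sklearn.svm import SVR

def roll_from_eyes(left_eye, right_eye):
    """Roll angle in degrees from the line joining the two eye centres.

    A simple geometric estimate of roll, as the abstract describes;
    the specific landmark choice (eye centres) is an assumption.
    """
    dx = right_eye[0] - left_eye[0]
    dy = right_eye[1] - left_eye[1]
    return np.degrees(np.arctan2(dy, dx))

def skin_fraction_grid(skin_mask, grid=(4, 4)):
    """Fraction of skin-classified pixels per cell of a grid over the
    face bounding box (an assumed layout for the skin-tone features)."""
    h, w = skin_mask.shape
    gh, gw = grid
    feats = []
    for i in range(gh):
        for j in range(gw):
            cell = skin_mask[i * h // gh:(i + 1) * h // gh,
                             j * w // gw:(j + 1) * w // gw]
            feats.append(cell.mean())  # boolean mean = skin fraction
    return np.asarray(feats)

def train_pose_regressors(X, y_yaw, y_pitch):
    """Fit two independent SVRs for yaw and pitch.

    X: (n_samples, n_features) rows of flattened landmark coordinates
    concatenated with skin-fraction features; y_yaw, y_pitch: angles in
    degrees from an annotated set such as Pointing04. Hyperparameters
    are placeholders.
    """
    yaw_svr = SVR(kernel="rbf", C=10.0, epsilon=1.0).fit(X, y_yaw)
    pitch_svr = SVR(kernel="rbf", C=10.0, epsilon=1.0).fit(X, y_pitch)
    return yaw_svr, pitch_svr
```

At test time, the same features would be extracted from each video frame, yaw and pitch predicted by the two regressors, and roll computed directly from the landmarks, which matches the division of labor the abstract describes.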

Citation (APA)

Luzardo, M., Karppa, M., Laaksonen, J., & Jantunen, T. (2013). Head pose estimation for sign language video. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 7944 LNCS, pp. 349–360). https://doi.org/10.1007/978-3-642-38886-6_34
