WNet: Joint Multiple Head Detection and Head Pose Estimation from a Spectator Crowd Image

Yasir Jan; Ferdous Sohel; Mohd Fairuz Shiratuddin; Kok Wai Wong

Conference Proceedings

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2019) 11367 LNCS 484-493

DOI: 10.1007/978-3-030-21074-8_38

Abstract

Crowd image analysis has applications in areas such as surveillance, crowd management, and augmented reality. Existing techniques can detect multiple faces in a single crowd image, but the small head/face size and the additional non-facial regions in the head bounding box make head detection (HD) challenging. Additionally, existing head pose estimation (HPE) methods for multiple heads in an image pass individually cropped head images through a network one by one, instead of estimating the poses of multiple heads at the same time. The proposed WNet performs both HD and HPE jointly on multiple heads in a single crowd image, in a single pass. Experiments are conducted on the spectator crowd S-HOCK dataset, and the results are compared with the HPE benchmarks. WNet uses fewer training images than the number of cropped images used by the benchmarks, and does not use weights transferred from other networks. WNet performs not just HPE but joint HD and HPE efficiently, i.e., it reports accuracy for a larger number of heads while relying on fewer testing images than the benchmarks.

Cite

Jan, Y., Sohel, F., Shiratuddin, M. F., & Wong, K. W. (2019). WNet: Joint Multiple Head Detection and Head Pose Estimation from a Spectator Crowd Image. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 11367 LNCS, pp. 484–493). Springer Verlag. https://doi.org/10.1007/978-3-030-21074-8_38
