Recognition from hand cameras: A revisit with deep learning

Cheng Sheng Chan; Shou Zhong Chen; Pei Xuan Xie; Chiung Chih Chang; Min Sun

Conference ProceedingsOPEN ACCESS

Recognition from hand cameras: A revisit with deep learning

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2016) 9908 LNCS 505-521

DOI: 10.1007/978-3-319-46493-0_31

4Citations

11Readers

Abstract

We revisit the study of a wrist-mounted camera system (referred to as HandCam) for recognizing activities of hands. HandCam has two unique properties as compared to egocentric systems (referred to as HeadCam): (1) it avoids the need to detect hands; (2) it more consistently observes the activities of hands. By taking advantage of these properties, we propose a deep-learning-based method to recognize hand states (free vs. active hands, hand gestures, object categories), and discover object categories. Moreover, we propose a novel two-streams deep network to further take advantage of both HandCam and HeadCam. We have collected a new synchronized HandCam and HeadCam dataset with 20 videos captured in three scenes for hand states recognition. Experiments show that our HandCam system consistently outperforms a deeplearning- based HeadCam method (with estimated manipulation regions) and a dense-trajectory-based HeadCam method in all tasks. We also show that HandCam videos captured by different users can be easily aligned to improve free vs. active recognition accuracy (3.3% improvement) in across-scenes use case. Moreover, we observe that finetuning Convolutional Neural Network consistently improves accuracy. Finally, our novel two-streams deep network combining HandCam and HeadCam achieves the best performance in four out of five tasks. With more data, we believe a joint HandCam and HeadCam system can robustly log hand states in daily life.

Author supplied keywords

Cite

CITATION STYLE

APA

Chan, C. S., Chen, S. Z., Xie, P. X., Chang, C. C., & Sun, M. (2016). Recognition from hand cameras: A revisit with deep learning. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 9908 LNCS, pp. 505–521). Springer Verlag. https://doi.org/10.1007/978-3-319-46493-0_31

Recognition from hand cameras: A revisit with deep learning

Abstract

Author supplied keywords

Cite

Register to see more suggestions