Abstract
We propose a vision-based method for recognizing first-person reading activity with deep learning. It is well known that the success of deep learning depends heavily on a large amount of training data. Unlike image classification, reading activity recognition has few publicly available datasets, and collecting book images may raise copyright issues. In this paper, we develop a synthetic approach for generating positive training images. Our approach combines computer-generated images with real background images. In experiments, we show that this synthesis is effective in combination with pre-trained deep convolutional neural networks and that our trained neural network outperforms other baselines.
Cite
Segawa, Y., Kawamoto, K., & Okamoto, K. (2018). First-person reading activity recognition by deep learning with synthetically generated images. EURASIP Journal on Image and Video Processing, 2018(1). https://doi.org/10.1186/s13640-018-0272-z