We propose in this work an approach for the automatic extraction and recognition of the Italian sign language using the RGB, depth and skeletal-joint modalities offered by Microsoft’s Kinect sensor. We investigate the best modality combination that improves the humanaction spotting and recognition in a continuous stream scenario. For this purpose, we define per modality a complementary feature representation and fuse the decisions of multiple SVM classifiers with probability outputs. We contribute by proposing a multi-scale analysis approach that combines a global Fisher vector representation with a local frame-wise one. In addition we define a temporal segmentation strategy that allows the generation of multiple specialized classifiers. The final decision is obtained using the combination of their results. Our tests have been carried out on the Chalearn gesture challenge dataset, and promising results have been obtained on primary experiments.
CITATION STYLE
Seddik, B., Gazzah, S., & Ben Amara, N. E. (2015). Modalities combination for Italian sign language extraction and recognition. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 9280, pp. 710–721). Springer Verlag. https://doi.org/10.1007/978-3-319-23234-8_65
Mendeley helps you to discover research relevant for your work.