Abstract
We investigate the use of two visual descriptors: Local Binary Patterns-Three Orthogonal Planes(LBP-TOP) and Dense Trajectories for depression assessment on the AVEC 2014 challenge dataset. We encode the visual information generated by the two descriptors using Fisher Vector encoding which has been shown to be one of the best performing methods to encode visual data for image classification. We also incorporate audio features in the final system to introduce multiple input modalities. The results produced using Linear Support Vector regression outperform the baseline method[16].
Author supplied keywords
Cite
CITATION STYLE
Jain, V., Crowley, J. L., Dey, A. K., & Lux, A. (2014). Depression estimation using audiovisual features and fisher vector encoding. In AVEC 2014 - Proceedings of the 4th International Workshop on Audio/Visual Emotion Challenge, Workshop of MM 2014 (pp. 87–91). Association for Computing Machinery. https://doi.org/10.1145/2661806.2661817
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.