Facial expression recognition using facial-componentbased bag of words and PHOG descriptors

10Citations
Citations of this article
12Readers
Mendeley users who have this article in their library.

Abstract

Facial expression recognition has many potential applications in areas such as human-computer interaction (HCI), emotion analysis, and synthetic face animation. This paper proposes a novel framework of facial appearance and shape information extraction for facial expression recognition. For appearance information extraction, a facial-componentbased bag of words method is presented. We segment face images into four component regions: forehead, eye-eyebrow, nose, and mouth. We then partition them into 4 × 4 sub-regions. Dense SIFT (scale-invariant feature transform) features are calculated over the sub-regions and vector quantized into 4 × 4 sets of codeword distributions. For shape information extraction, PHOG (pyramid histogram of orientated gradient) descriptors are computed on the four facial component regions to obtain the spatial distribution of edges. Multi-class SVM classifiers are applied to classify the six basic facial expressions using the facial-component-based bag of words and PHOG descriptors respectively. Then the appearance and shape information is fused at decision level to further improve the recognition rate. Our framework provides holistic characteristics for the local texture and shape features by enhancing the structure-based spatial information, and makes it possible to use the bag of words method and the local descriptors in facial expression recognition for the first time. The recognition rate achieved by the fusion of appearance and shape features at decision level using the Cohn-Kanade database is 96.33%, which outperforms the state-of-the-art research works.

References Powered by Scopus

Distinctive image features from scale-invariant keypoints

50215Citations
N/AReaders
Get full text

Histograms of oriented gradients for human detection

30666Citations
N/AReaders
Get full text

Multiresolution gray-scale and rotation invariant texture classification with local binary patterns

13869Citations
N/AReaders
Get full text

Cited by Powered by Scopus

Facial expression recognition via learning deep sparse autoencoders

456Citations
N/AReaders
Get full text

A new facial expression recognition based on curvelet transform and online sequential extreme learning machine initialized with spherical clustering

121Citations
N/AReaders
Get full text

Multi-gradient features and elongated quinary pattern encoding for image-based facial expression recognition

39Citations
N/AReaders
Get full text

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Cite

CITATION STYLE

APA

Li, Z., Imai, J. I., & Kaneko, M. (2010). Facial expression recognition using facial-componentbased bag of words and PHOG descriptors. Kyokai Joho Imeji Zasshi/Journal of the Institute of Image Information and Television Engineers, 64(2), 230–236. https://doi.org/10.3169/itej.64.230

Readers' Seniority

Tooltip

PhD / Post grad / Masters / Doc 8

100%

Readers' Discipline

Tooltip

Computer Science 5

50%

Engineering 4

40%

Philosophy 1

10%

Save time finding and organizing research with Mendeley

Sign up for free