Utterance-level multimodal sentiment analysis

283 citations · 264 Mendeley readers

Abstract

During real-life interactions, people naturally gesture and modulate their voices to emphasize specific points or to express their emotions. With the recent growth of social websites such as YouTube, Facebook, and Amazon, video reviews are emerging as a new source of multimodal and natural opinions that has been left almost untapped by automatic opinion analysis techniques. This paper presents a method for multimodal sentiment classification that identifies the sentiment expressed in utterance-level visual datastreams. Using a new multimodal dataset consisting of sentiment-annotated utterances extracted from video reviews, we show that multimodal sentiment analysis can be performed effectively, and that the joint use of visual, acoustic, and linguistic modalities can lead to error rate reductions of up to 10.5% as compared to the best-performing individual modality. © 2013 Association for Computational Linguistics.
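The joint use of modalities described in the abstract is commonly realized as feature-level (early) fusion: per-utterance feature vectors from each modality are concatenated before classification. The sketch below illustrates that idea only; the feature values, dimensions, and the toy nearest-centroid classifier are illustrative assumptions, not the paper's actual features or classifier.

```python
# Hedged sketch of feature-level (early) fusion for utterance-level
# sentiment classification. All feature values below are made up.

def fuse(linguistic, acoustic, visual):
    """Early fusion: concatenate per-utterance feature vectors
    from the three modalities into one joint vector."""
    return linguistic + acoustic + visual

def nearest_centroid_predict(fused, centroids):
    """Toy stand-in classifier: assign the label whose class
    centroid is closest (squared Euclidean distance)."""
    def dist(a, b):
        return sum((x - y) ** 2 for x, y in zip(a, b))
    return min(centroids, key=lambda label: dist(fused, centroids[label]))

# One hypothetical utterance: tiny hand-made vectors per modality.
utt = fuse([0.8, 0.1], [0.6], [0.7])   # linguistic, acoustic, visual

# Hypothetical class centroids in the fused feature space.
centroids = {
    "positive": [0.7, 0.2, 0.5, 0.6],
    "negative": [0.2, 0.8, 0.3, 0.2],
}

print(nearest_centroid_predict(utt, centroids))  # → positive
```

In a real system each modality's vector would come from its own feature extractor (e.g. lexical features, prosodic features, facial cues), and the fused vector would feed a trained classifier rather than fixed centroids.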

Citation (APA)

Pérez-Rosas, V., Mihalcea, R., & Morency, L. P. (2013). Utterance-level multimodal sentiment analysis. In ACL 2013 - 51st Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference (Vol. 1, pp. 973–982). Association for Computational Linguistics (ACL).
