Explicit document modeling through weighted multiple-instance learning

39Citations
Citations of this article
45Readers
Mendeley users who have this article in their library.

Abstract

Representing documents is a crucial component in many NLP tasks, for instance predicting aspect ratings in reviews. Previous methods for this task treat documents globally, and do not acknowledge that target categories are often assigned by their authors with generally no indication of the specific sentences that motivate them. To address this issue, we adopt a weakly supervised learning model, which jointly learns to focus on relevant parts of a document according to the context along with a classifier for the target categories. Derived from the weighted multiple-instance regression (MIR) framework, the model learns decomposable document vectors for each individual category and thus overcomes the representational bottleneck in previous methods due to a fixed-length document vector. During prediction, the estimated relevance or saliency weights explicitly capture the contribution of each sentence to the predicted rating, thus offering an explanation of the rating. Our model achieves state-of-the-art performance on multi-aspect sentiment analysis, improving over several baselines. Moreover, the predicted saliency weights are close to human estimates obtained by crowdsourcing, and increase the performance of lexical and topical features for review segmentation and summarization.

References Powered by Scopus

Long Short-Term Memory

77222Citations
N/AReaders
Get full text

Support-Vector Networks

45914Citations
N/AReaders
Get full text

Regression Shrinkage and Selection Via the Lasso

35799Citations
N/AReaders
Get full text

Cited by Powered by Scopus

Accurate Screening of COVID-19 Using Attention-Based Deep 3D Multiple Instance Learning

268Citations
N/AReaders
Get full text

Sharp Multiple Instance Learning for DeepFake Video Detection

138Citations
N/AReaders
Get full text

A Sentiment Polarity Categorization Technique for Online Product Reviews

56Citations
N/AReaders
Get full text

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Cite

CITATION STYLE

APA

Pappas, N., & Popescu-Belis, A. (2017). Explicit document modeling through weighted multiple-instance learning. Journal of Artificial Intelligence Research, 58, 591–626. https://doi.org/10.1613/jair.5240

Readers' Seniority

Tooltip

PhD / Post grad / Masters / Doc 26

79%

Researcher 3

9%

Professor / Associate Prof. 2

6%

Lecturer / Post doc 2

6%

Readers' Discipline

Tooltip

Computer Science 22

85%

Engineering 2

8%

Decision Sciences 1

4%

Design 1

4%

Save time finding and organizing research with Mendeley

Sign up for free