A broad-coverage challenge corpus for sentence understanding through inference

2.6kCitations
Citations of this article
729Readers
Mendeley users who have this article in their library.

Abstract

This paper introduces the Multi-Genre Natural Language Inference (MultiNLI) corpus, a dataset designed for use in the development and evaluation of machine learning models for sentence understanding. At 433k examples, this resource is one of the largest corpora available for natural language inference (a.k.a. recognizing textual entailment), improving upon available resources in both its coverage and difficulty. MultiNLI accomplishes this by offering data from ten distinct genres of written and spoken English, making it possible to evaluate systems on nearly the full complexity of the language, while supplying an explicit setting for evaluating cross-genre domain adaptation. In addition, an evaluation using existing machine learning models designed for the Stanford NLI corpus shows that it represents a substantially more difficult task than does that corpus, despite the two showing similar levels of inter-Annotator agreement.

References Powered by Scopus

Long Short-Term Memory

77615Citations
N/AReaders
Get full text

GloVe: Global vectors for word representation

27039Citations
N/AReaders
Get full text

Visualizing and understanding convolutional networks

11205Citations
N/AReaders
Get full text

Cited by Powered by Scopus

Natural Questions: A Benchmark for Question Answering Research

1875Citations
N/AReaders
Get full text

SimCSE: Simple Contrastive Learning of Sentence Embeddings

1776Citations
N/AReaders
Get full text

Spanbert: Improving pre-training by representing and predicting spans

1338Citations
N/AReaders
Get full text

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Cite

CITATION STYLE

APA

Williams, A., Nangia, N., & Bowman, S. R. (2018). A broad-coverage challenge corpus for sentence understanding through inference. In NAACL HLT 2018 - 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies - Proceedings of the Conference (Vol. 1, pp. 1112–1122). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/n18-1101

Readers' Seniority

Tooltip

PhD / Post grad / Masters / Doc 282

77%

Researcher 54

15%

Lecturer / Post doc 23

6%

Professor / Associate Prof. 8

2%

Readers' Discipline

Tooltip

Computer Science 319

80%

Engineering 42

11%

Linguistics 24

6%

Social Sciences 12

3%

Article Metrics

Tooltip
Mentions
News Mentions: 3
References: 7

Save time finding and organizing research with Mendeley

Sign up for free