Automatic identification of altlexes using monolingual parallel corpora

1Citations
Citations of this article
75Readers
Mendeley users who have this article in their library.
Get full text

Abstract

The automatic identification of discourse relations is still a challenging task in natural language processing. Discourse connectives, such as since or but, are the most informative cues to identify explicit relations; however discourse parsers typically use a closed inventory of such connectives. As a result, discourse relations signaled by markers outside these inventories (i.e. AltLexes) are not detected as effectively. In this paper, we propose a novel method to leverage parallel corpora in text simplification and lexical resources to automatically identify alternative lexicalizations that signal discourse relation. When applied to the Simple Wikipedia and Newsela corpora along with WordNet and the PPDB, the method allowed the automatic discovery of 91 AltLexes.

Cite

CITATION STYLE

APA

Davoodi, E., & Kosseim, L. (2017). Automatic identification of altlexes using monolingual parallel corpora. In International Conference Recent Advances in Natural Language Processing, RANLP (Vol. 2017-September, pp. 195–200). Incoma Ltd. https://doi.org/10.26615/978-954-452-049-6_027

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free