Corpus-based Open-Domain Event Type Induction

22Citations
Citations of this article
62Readers
Mendeley users who have this article in their library.

Abstract

Traditional event extraction methods require predefined event types and their corresponding annotations to learn event extractors. These prerequisites are often hard to be satisfied in real-world applications. This work presents a corpus-based open-domain event type induction method that automatically discovers a set of event types from a given corpus. As events of the same type could be expressed in multiple ways, we propose to represent each event type as a cluster of hpredicate sense, object headi pairs. Specifically, our method (1) selects salient predicates and object heads, (2) disambiguates predicate senses using only a verb sense dictionary, and (3) obtains event types by jointly embedding and clustering hpredicate sense, object headi pairs in a latent spherical space. Our experiments, on three datasets from different domains, show our method can discover salient and high-quality event types, according to both automatic and human evaluations.

Cite

CITATION STYLE

APA

Shen, J., Zhang, Y., Ji, H., & Han, J. (2021). Corpus-based Open-Domain Event Type Induction. In EMNLP 2021 - 2021 Conference on Empirical Methods in Natural Language Processing, Proceedings (pp. 5427–5440). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2021.emnlp-main.441

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free