Traditional event extraction methods require predefined event types and their corresponding annotations to learn event extractors. These prerequisites are often hard to be satisfied in real-world applications. This work presents a corpus-based open-domain event type induction method that automatically discovers a set of event types from a given corpus. As events of the same type could be expressed in multiple ways, we propose to represent each event type as a cluster of hpredicate sense, object headi pairs. Specifically, our method (1) selects salient predicates and object heads, (2) disambiguates predicate senses using only a verb sense dictionary, and (3) obtains event types by jointly embedding and clustering hpredicate sense, object headi pairs in a latent spherical space. Our experiments, on three datasets from different domains, show our method can discover salient and high-quality event types, according to both automatic and human evaluations.
CITATION STYLE
Shen, J., Zhang, Y., Ji, H., & Han, J. (2021). Corpus-based Open-Domain Event Type Induction. In EMNLP 2021 - 2021 Conference on Empirical Methods in Natural Language Processing, Proceedings (pp. 5427–5440). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/2021.emnlp-main.441
Mendeley helps you to discover research relevant for your work.