Abstract
Labeled data is not readily available for many natural language domains, and it typically requires expensive human effort with considerable domain knowledge to produce a set of labeled data. In this paper, we propose a simple unsupervised system that helps us create a labeled resource for categorical data (e.g., a document set) us-ing only fifteen minutes of human input. We utilize the labeled resources to dis-cover important insights about the data. The entire process is domain independent, and demands no prior annotation samples, or rules specific to an annotation.
Cite
CITATION STYLE
Dasgupta, S. (2015). Painless labeling with application to text mining. In ACL-IJCNLP 2015 - 53rd Annual Meeting of the Association for Computational Linguistics and the 7th International Joint Conference on Natural Language Processing of the Asian Federation of Natural Language Processing, Proceedings of the Conference (Vol. 2, pp. 402–407). Association for Computational Linguistics (ACL). https://doi.org/10.3115/v1/p15-2066
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.