Narrowing the loop: Integration of resources and linguistic dataset development with interactive machine learning

3Citations
Citations of this article
72Readers
Mendeley users who have this article in their library.

Abstract

This thesis proposal sheds light on the role of interactive machine learning and implicit user feedback for manual annotation tasks and semantic writing aid applications. First we focus on the cost-effective annotation of training data using an interactive machine learning approach by conducting an experiment for sequence tagging of German named entity recognition. To show the effectiveness of the approach, we further carry out a sequence tagging task on Amharic part-of-speech and are able to significantly reduce time used for annotation. The second research direction is to systematically integrate different NLP resources for our new semantic writing aid tool using again an interactive machine learning approach to provide contextual paraphrase suggestions. We develop a baseline system where three lexical resources are combined to provide paraphrasing in context and show that combining resources is a promising direction.

Cite

CITATION STYLE

APA

Yimam, S. M. (2015). Narrowing the loop: Integration of resources and linguistic dataset development with interactive machine learning. In NAACL-HLT 2015 - 2015 Student Research Workshop (SRW) at the Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Proceedings (pp. 88–95). Association for Computational Linguistics (ACL). https://doi.org/10.3115/v1/n15-2012

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free