A novel approach to part name discovery in noisy text

Nobal B. Niraula; Daniel Whyatt; Anne Kao

Conference ProceedingsOPEN ACCESS

A novel approach to part name discovery in noisy text

NAACL HLT 2018 - 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies - Proceedings of the Conference (2018) 3 170-176

DOI: 10.18653/v1/n18-3021

4Citations

79Readers

Abstract

As a specialized example of information extraction, part name extraction is an area that presents unique challenges. Part names are typically multiword terms longer than two words. There is little consistency in how terms are described in noisy free text, with variations spawned by typos, ad hoc abbreviations, acronyms, and incomplete names. This makes search and analyses of parts in these data extremely challenging. In this paper, we present our algorithm, PANDA (Part Name Discovery Analytics), based on a unique method that exploits statistical, linguistic and machine learning techniques to discover part names in noisy text such as that in manufacturing quality documentation, supply chain management records, service communication logs, and maintenance reports. Experiments show that PANDA is scalable and outperforms existing techniques significantly.

Cite

CITATION STYLE

APA

Niraula, N. B., Whyatt, D., & Kao, A. (2018). A novel approach to part name discovery in noisy text. In NAACL HLT 2018 - 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies - Proceedings of the Conference (Vol. 3, pp. 170–176). Association for Computational Linguistics (ACL). https://doi.org/10.18653/v1/n18-3021

A novel approach to part name discovery in noisy text

Abstract

Cite

Register to see more suggestions