MEIM: A multi-source software knowledge entity extraction integration model

4Citations
Citations of this article
13Readers
Mendeley users who have this article in their library.

Abstract

Entity recognition and extraction are the foundations of knowledge graph construction. Entity data in the field of software engineering come from different platforms and communities, and have different formats. This paper divides multi-source software knowledge entities into unstructured data, semi-structured data and code data. For these different types of data, Bi-directional Long Short- Term Memory (Bi-LSTM) with Conditional Random Field (CRF), template matching, and abstract syntax tree are used and integrated into a multi-source software knowledge entity extraction integration model (MEIM) to extract software entities. The model can be updated continuously based on user's feedbacks to improve the accuracy. To deal with the shortage of entity annotation datasets, keyword extraction methods based on Term Frequency-Inverse Document Frequency (TF-IDF), TextRank, and K-Means are applied to annotate tasks. The proposed MEIM model is applied to the Spring Boot framework, which demonstrates good adaptability. The extracted entities are used to construct a knowledge graph, which is applied to association retrieval and association visualization.

Cite

CITATION STYLE

APA

Lv, W., Liao, Z., Liu, S., & Zhang, Y. (2021). MEIM: A multi-source software knowledge entity extraction integration model. Computers, Materials and Continua, 66(1), 1027–1042. https://doi.org/10.32604/cmc.2020.012478

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free