In this paper, we designed a knowledge supporting software system in which sentences and keywords are extracted from large scale document database. This system consists of semantic representation scheme for natural language processing of the document database. Documents originally in a form of PDF are broken into triple-store data after pre-processing. The semantic representation is a hyper-graph which consists of collections of binary relations of 'triples'. According to a certain rule based on user's interests, the system identify sentences and words of interests. The relationship of those extracted sentences is visualized in the form of network graph. A user can introduce new rules to extract additional Knowledge from the Database or paper. For practical example, we choose a set of research papers related IoT for the case study purpose. Applying several rules concerning authors' indicated keywords as well as the system's specified discourse words, significant knowledge are extracted from the papers.
CITATION STYLE
Upadhyay, R., & Fujii, A. (2016). Semantic knowledge extraction from research documents. In Proceedings of the 2016 Federated Conference on Computer Science and Information Systems, FedCSIS 2016 (pp. 439–445). Institute of Electrical and Electronics Engineers Inc. https://doi.org/10.15439/2016F221
Mendeley helps you to discover research relevant for your work.