Abstract
Background: The amount of biomedical literature available is growing at an explosive speed, but a large amount of useful information remains undiscovered in it. Researchers can make informed biomedical hypotheses through mining this literature. Unfortunately, popular mining methods based on co-occurrence produce too many target concepts, leading to the declining relevance ranking of the potential target concepts. Methods: This paper presents a new method for selecting linking concepts which exploits statistical and textual features to represent each linking concept, and then classifies them as relevant or irrelevant to the starting concepts. Relevant linking concepts are then used to discover target concepts. Results: Through an evaluation it is observed textual features improve the results obtained with only statistical features. We successfully replicate Swanson's two classic discoveries and find the rankings of potentially relevant target concepts are relatively high. Conclusions: The number of target concepts is greatly reduced and potentially relevant target concepts gain higher ranking by adopting only relevant linking concepts. Thus, the proposed method has the potential to help biomedical experts find the most useful and valuable target concepts effectively.
Cite
CITATION STYLE
Cheng, L., Lin, H., Zhou, F., Yang, Z., & Wang, J. (2014). Enhancing the accuracy of knowledge discovery: A supervised learning method. BMC Bioinformatics, 15(12). https://doi.org/10.1186/1471-2105-15-S12-S9
Register to see more suggestions
Mendeley helps you to discover research relevant for your work.