A Hybrid Approach for Tag Hierarchy Construction

2Citations
Citations of this article
7Readers
Mendeley users who have this article in their library.
Get full text

Abstract

Open source resources are playing a more and more important role in software engineering for reuse. However, the dramatically increasing scale of these resources brings great challenges for their management and location. In this study, we propose a hybrid approach for automatic tag hierarchy construction, which combines the tag co-occurrence relations and domain knowledge to build and optimize the hierarchy. We firstly calculate the generality of each tag in accordance with the co-occurrence relationship with others, and construct the hierarchy based on the generality. Then we leverage the domain knowledge of existing hierarchical categories to perform an optimization and promote the final hierarchy. We select 8064 projects in Openhub community and 10703 posts in StackOverflow community as the original data and use the information of the SourceForge community as the domain knowledge. We conduct extensive experiments and evaluate our approach by utilizing Wordnet and F-measure method. The results show that our approach exhibits better performance than others with accuracy rate and recall that exceed 90%.

Cite

CITATION STYLE

APA

Wang, S., Wang, T., Mao, X., Yin, G., & Yu, Y. (2018). A Hybrid Approach for Tag Hierarchy Construction. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 10826 LNCS, pp. 59–75). Springer Verlag. https://doi.org/10.1007/978-3-319-90421-4_4

Register to see more suggestions

Mendeley helps you to discover research relevant for your work.

Already have an account?

Save time finding and organizing research with Mendeley

Sign up for free