Much of the data about free/libre and open source software (FLOSS) development comes from studies of the code repositories used to manage projects. This paper presents a method for integrating data about open source projects by matching projects (entities) and removing duplicates across multiple code repositories. After a review of the relevant literature, a few methods are selected and applied to the FLOSS domain, including a simple scoring system for confidence in pairwise project matches. Finally, the paper describes the limitations of this approach and offers recommendations for future work. © 2007 International Federation for Information Processing.
Conklin, M. (2007). Project entity matching across FLOSS repositories. IFIP International Federation for Information Processing, 234, 45–57. https://doi.org/10.1007/978-0-387-72486-7_4