4.5 Article

Graph Convolutional Network-Based Repository Recommendation System

Journal

CMES-COMPUTER MODELING IN ENGINEERING & SCIENCES
Volume 137, Issue 1, Pages 175-196

Publisher

TECH SCIENCE PRESS
DOI: 10.32604/cmes.2023.027287

Keywords

Repository recommendation; graph convolutional network; open -source software; GitHub

Ask authors/readers for more resources

GitHub repository recommendation is a research hotspot in the field of open-source software. The current problems lie in the inadequate utilization of open-source community information and the manual development of scoring metrics, which leads to poor recommendation results. To address these issues, we propose GCNRec, a graph convolutional network-based repository recommendation system. It leverages a Developer-Repository network and developer/repository features to recommend repositories that developers would be interested in. Compared to other methods, GCNRec achieves higher precision and hit rate, as verified on the dataset.
GitHub repository recommendation is a research hotspot in the field of open-source software. The current problems with the repository recommendation system are the insufficient utilization of open-source community information and the fact that the scoring metrics used to calculate the matching degree between developers and repositories are developed manually and rely too much on human experience, leading to poor recommendation results. To address these problems, we design a questionnaire to investigate which repository information developers focus on and propose a graph convolutional network-based repository recommendation system (GCNRec). First, to solve insufficient information utilization in open-source communities, we construct a Developer-Repository network using four types of behavioral data that best reflect developers' programming preferences and extract features of developers and repositories from the repository content that developers focus on. Then, we design a repository recommendation model based on a multi-layer graph convolutional network to avoid the manual formulation of scoring metrics. This model takes the Developer-Repository network, developer features and repository features as inputs, and recommends the top-k repositories that developers are most likely to be interested in by learning their preferences. We have verified the proposed GCNRec on the dataset, and by comparing it with other open-source repository recommendation methods, GCNRec achieves higher precision and hit rate.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.5
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available