Because this research is divided into many parts, the instructions for compiling and running each part are given in the README.md inside that part's folder.
Run the code in the following order:
- Data Collection
- Data Preparation
- Network Generation
- Feature engineering
- Team Scoring Model Selection
- Team Recommendation
- Evaluation by precision (P), recall (R), F1, and mean reciprocal rank (MRR)
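
For readers unfamiliar with the metrics in the last step, the following is a minimal sketch of how precision, recall, F1, and MRR are commonly computed for a recommended list against a ground-truth set. It is not the evaluation code of this repository: the function names and the input shapes (a ranked list of recommended items and a set of relevant items per query) are assumptions made for illustration, and the project's own scripts may define the metrics at a different granularity.

```python
from typing import Hashable, Iterable, Sequence, Set, Tuple


def precision_recall_f1(
    recommended: Iterable[Hashable], relevant: Set[Hashable]
) -> Tuple[float, float, float]:
    """Set-based precision, recall, and F1 for one recommendation (hypothetical helper)."""
    rec = set(recommended)
    hits = len(rec & relevant)
    precision = hits / len(rec) if rec else 0.0
    recall = hits / len(relevant) if relevant else 0.0
    # F1 is the harmonic mean of precision and recall; define it as 0 when both are 0.
    f1 = 2 * precision * recall / (precision + recall) if precision + recall else 0.0
    return precision, recall, f1


def mean_reciprocal_rank(
    ranked_lists: Sequence[Sequence[Hashable]],
    relevant_sets: Sequence[Set[Hashable]],
) -> float:
    """Average of 1/rank of the first relevant item per query (0 if none is found)."""
    if not ranked_lists:
        return 0.0
    total = 0.0
    for ranked, relevant in zip(ranked_lists, relevant_sets):
        for rank, item in enumerate(ranked, start=1):
            if item in relevant:
                total += 1.0 / rank
                break
    return total / len(ranked_lists)
```

For example, `precision_recall_f1(["a", "b", "c", "d"], {"a", "c", "e"})` counts two hits out of four recommended and three relevant items, and `mean_reciprocal_rank([["x", "a"], ["c"]], [{"a"}, {"c"}])` averages the reciprocal ranks 1/2 and 1.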
Place the "Files" directory in the same directory as the project. To get it, download "Files.zip" from the Processed Data directory in the data download link. The filepathhelper.py file is a utility that resolves the paths of the files in the "Files" directory. In addition, edit config.json to match the dataset you want to use ("Moodle", "Apache", or "Atlassian"). Steps 1-5 produce the processed files from the raw data. If you download the processed files, you can skip those steps and run the Team Recommendation code directly.
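
If you prefer to switch datasets from a script rather than editing config.json by hand, a sketch like the one below may help. The layout of config.json is not documented here, so the key name `"dataset"` and the helper `set_dataset` are purely hypothetical; check the actual config.json in the repository and pass the real key before relying on it.

```python
import json
from pathlib import Path

# Dataset names listed in this README.
SUPPORTED_DATASETS = ("Moodle", "Apache", "Atlassian")


def set_dataset(config_path: str, dataset: str, key: str = "dataset") -> dict:
    """Write the chosen dataset name into a JSON config file (hypothetical helper).

    The default key "dataset" is an assumption, not the repository's documented key.
    """
    if dataset not in SUPPORTED_DATASETS:
        raise ValueError(
            f"Unknown dataset {dataset!r}; expected one of {SUPPORTED_DATASETS}"
        )
    path = Path(config_path)
    # Keep any existing settings and only overwrite the dataset entry.
    config = json.loads(path.read_text(encoding="utf-8")) if path.exists() else {}
    config[key] = dataset
    path.write_text(json.dumps(config, indent=2), encoding="utf-8")
    return config
```

Calling `set_dataset("config.json", "Apache")` from the project directory would then update the file in place, while an unsupported name raises a `ValueError` before anything is written.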
If you have any questions about this project or get stuck while running the code, feel free to create an issue on this repository with the "hitnothit" label.
We will use our code to generate the assignee team that fixes your issue.
The raw data, the preprocessed data, and the team recommendation results can be downloaded here.