RepoHealth
|-- data
| |-- repositories_label.csv
| |-- repositories_merged.csv
| |-- repositories_statistics.csv
| |-- Repositories_with_label.csv
| |-- Repositories_with_statistics.csv
|
|-- src
| |-- data_process.ipynb
| |-- exploratory_data_analysis.ipynb
| |-- machine_learning_models.ipynb
| |-- deep_learning.ipynb
|
|-- images
| |-- feature_importance
| |-- prediction
Repositories_with_label
Repositories_with_statistics
Dependency libraries:requirements.txt
# Install dependency libraries
pip install -r ./requirements.txt
data_process.ipynb
Process raw data and generate preprocessed datasets
exploratory_data_analysis.ipynb
Exploratory data analysis
machine_learning_models.ipynb
Apply various feature importance analysis methods and traditional machine learning models for feature analysis, health metrics, and prediction, etc.
deep_learning.ipynb
Train various deep learning models for repository lifespan prediction