Skip to content

Applying multiple techniques to evaluate and forecast the health of software repositories.

Notifications You must be signed in to change notification settings

delete3GM/Repo-Health

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Repository Health Metrics and Lifespan Prediction

File Structure

RepoHealth  
|-- data  
|   |-- repositories_label.csv  
|   |-- repositories_merged.csv  
|   |-- repositories_statistics.csv  
|   |-- Repositories_with_label.csv  
|   |-- Repositories_with_statistics.csv
|  
|-- src  
|   |-- data_process.ipynb  
|   |-- exploratory_data_analysis.ipynb  
|   |-- machine_learning_models.ipynb
|   |-- deep_learning.ipynb
|
|-- images
|   |-- feature_importance
|   |-- prediction

Datasets

Repositories_with_label
Repositories_with_statistics

Execution Environment

Dependency libraries:requirements.txt

# Install dependency libraries
pip install -r ./requirements.txt

Getting Started

data_process.ipynb

Process raw data and generate preprocessed datasets

exploratory_data_analysis.ipynb

Exploratory data analysis

machine_learning_models.ipynb

Apply various feature importance analysis methods and traditional machine learning models for feature analysis, health metrics, and prediction, etc.

deep_learning.ipynb

Train various deep learning models for repository lifespan prediction

About

Applying multiple techniques to evaluate and forecast the health of software repositories.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published