Skip to content

Latest commit

 

History

History
57 lines (47 loc) · 1.29 KB

README.md

File metadata and controls

57 lines (47 loc) · 1.29 KB

Repository Health Metrics and Lifespan Prediction

File Structure

RepoHealth  
|-- data  
|   |-- repositories_label.csv  
|   |-- repositories_merged.csv  
|   |-- repositories_statistics.csv  
|   |-- Repositories_with_label.csv  
|   |-- Repositories_with_statistics.csv
|  
|-- src  
|   |-- data_process.ipynb  
|   |-- exploratory_data_analysis.ipynb  
|   |-- machine_learning_models.ipynb
|   |-- deep_learning.ipynb
|
|-- images
|   |-- feature_importance
|   |-- prediction

Datasets

Repositories_with_label
Repositories_with_statistics

Execution Environment

Dependency libraries:requirements.txt

# Install dependency libraries
pip install -r ./requirements.txt

Getting Started

data_process.ipynb

Process raw data and generate preprocessed datasets

exploratory_data_analysis.ipynb

Exploratory data analysis

machine_learning_models.ipynb

Apply various feature importance analysis methods and traditional machine learning models for feature analysis, health metrics, and prediction, etc.

deep_learning.ipynb

Train various deep learning models for repository lifespan prediction