Loan_Approval_Prediction_Python

Objective

The objective of this project is to develop a machine learning model that predicts the loan approval status for a customer's loan application. The goal is to help financial institutions minimize the risk of potential defaults and ensure that approved loans are more likely to be repaid.

Dataset

The dataset used for this project is from one of the competition datasets on Kaggle. It contains a total of 97,742 rows and 13 dimensions. The data is split into two datasets:

Training dataset: 58,645 rows and 13 columns
Test dataset: 13 columns

Models and Results

After running 3 different machine learning models with varying hyperparameters and cross-validation, the Random Forest model was selected as the champion model. Here is a summary of the results:

Random Forest Model:
- Modified hyperparameters: {'max_depth': None, 'max_features': 0.6, 'max_samples': 0.7, 'n_estimators': 100}
- Achieved a high precision score of 92%
- Prioritized for its high precision score, indicating a higher accuracy in predicting the positive class of customers whose loans should be approved.
XGBoost Model:
- Achieved a better recall score but lower precision score compared to the Random Forest model.

Rationale for Model Selection

The context of the problem required prioritizing the precision score as the primary model evaluation metric. The cost of approving a loan that should be rejected (false positive) is comparatively higher than the cost of rejecting a loan application that should be approved (false negative). Therefore, the Random Forest model, which achieved a high precision score, was selected to minimize the approval of risky loans.

Conclusion

The Random Forest model, with a precision score of 92%, aligns with the financial institution's objective of minimizing the approval of risky loans. This model effectively helps in predicting the loan approval status, thereby reducing the potential risk of defaults and improving the overall loan approval process.

Acknowledgments

Special thanks to my professor Samiul Islam for the guidance in project report preparation and thanks to Kaggle for providing the dataset used in this project.

Author

Nibin Joseph

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
DataMining_LoanApproval.ipynb		DataMining_LoanApproval.ipynb
DataMining_LoanApproval_Notebook.pdf		DataMining_LoanApproval_Notebook.pdf
Loan Approval Prediction_Report.docx		Loan Approval Prediction_Report.docx
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Loan_Approval_Prediction_Python

Objective

Dataset

Models and Results

Rationale for Model Selection

Conclusion

Acknowledgments

Author

About

Releases

Packages

Languages

nibinjoseph/Loan_Approval_Prediction

Folders and files

Latest commit

History

Repository files navigation

Loan_Approval_Prediction_Python

Objective

Dataset

Models and Results

Rationale for Model Selection

Conclusion

Acknowledgments

Author

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages