Explainable Credit Risk Assessment: Comparing Logistic Regression, XGBoost, and LightGBM with SHAP and LIME Analysis | IJCT Volume 13 – Issue 3 | IJCT-V13I3P112

International Journal of Computer Techniques
ISSN 2394-2231
Volume 13, Issue 3  |  Published: May – June 2026

Author

Durrusadaf Imanova, Hakan Kutucu

Abstract

This study investigates whether gradient boosting models (XGBoost and LightGBM), combined with SHAP and LIME explainability, offer meaningful advantages over Logistic Regression in credit scoring. Using the German Credit dataset (n = 1,000), three models were compared through stratified five-fold cross-validation, held-out test evaluation, and pairwise McNemar’s tests. None of the three models achieved a statistically significant improvement over any other (all p > 0.05). Logistic Regression achieved the highest recall on the test set (0.800), followed by XGBoost (0.717) and LightGBM (0.567). SHAP analysis revealed consistent nonlinear risk patterns across both ensemble models, including duration threshold effects and compensatory attribute dynamics inaccessible through linear coefficients. A comparative analysis of SHAP and LIME showed broad agreement on top predictors but meaningful differences in stability and explanation structure. The findings suggest that on small, structured datasets, Logistic Regression remains the most practical production choice, while gradient boosting with SHAP serves a complementary analytical role for credit policy design.

Keywords

Explainable AI, Credit Risk, Logistic Regression, XGBoost, LightGBM, SHAP, LIME

Conclusion

This study compared Logistic Regression, XGBoost, and LightGBM within an explainable credit scoring framework using both SHAP and LIME. No model achieved a statistically significant predictive advantage. Logistic Regression achieved the highest recall (0.800) and remains the most practical production choice for small structured portfolios due to its native interpretability and regulatory compatibility. SHAP analysis retains analytical value independently of model superiority, revealing consistent nonlinear risk patterns across both ensemble models. The SHAP-LIME comparison demonstrates that explanation methods offer complementary rather than redundant perspectives, with SHAP better suited for audit documentation and LIME for interpretable customer communication. Future research should extend this comparison to larger real-world datasets, incorporate macroeconomic variables, evaluate fairness, and examine interpretable ensemble architectures such as Explainable Boosting Machines.

References

[1] B. Baesens, T. Van Gestel, S. Viaene, M. Stepanova, J. Suykens, and J. Vanthienen, “Benchmarking state-of-the-art classification algorithms for credit scoring,” J. Oper. Res. Soc., vol. 54, no. 6, pp. 627–635, 2003. [2] T. Chen and C. Guestrin, “XGBoost: A scalable tree boosting system,” in Proc. 22nd ACM SIGKDD, 2016, pp. 785–794. [3] J. N. Crook, D. B. Edelman, and L. C. Thomas, “Recent developments in consumer credit risk assessment,” Eur. J. Oper. Res., vol. 183, no. 3, pp. 1447–1465, 2007. [4] P. E. de Lange, B. Melsom, C. B. Vennerød, and S. Westgaard, “Explainable AI for credit assessment in banks,” J. Risk Financial Manag., vol. 15, no. 12, p. 556, 2022. [5] D. Dua and C. Graff, “UCI Machine Learning Repository,” Univ. California, Irvine, 2019. [6] B. Hadji Misheva, J. Osterrieder, A. Hirsa, O. Kulkarni, and S. F. Lin, “Explainable AI in credit risk management,” arXiv:2103.00949, 2021. [7] D. J. Hand and W. E. Henley, “Statistical classification methods in consumer credit scoring,” J. Roy. Stat. Soc. A, vol. 160, no. 3, pp. 523–541, 1997. [8] G. Ke, Q. Meng, T. Finley, T. Wang, W. Chen, W. Ma, Q. Ye, and T. Y. Liu, “LightGBM: A highly efficient gradient boosting decision tree,” in Adv. Neural Inf. Process. Syst., vol. 30, 2017, pp. 3146–3154. [9] S. Lessmann, B. Baesens, H. V. Seow, and L. C. Thomas, “Benchmarking state-of-the-art classification algorithms for credit scoring: An update,” Eur. J. Oper. Res., vol. 247, no. 1, pp. 124–136, 2015. [10] S. M. Lundberg and S. I. Lee, “A unified approach to interpreting model predictions,” in Adv. Neural Inf. Process. Syst., vol. 30, 2017, pp. 4765–4774. [11] C. Molnar, Interpretable Machine Learning, 2nd ed. christophm.github.io/interpretable-ml-book, 2022. [12] M. T. Ribeiro, S. Singh, and C. Guestrin, “Why should I trust you?: Explaining the predictions of any classifier,” in Proc. 22nd ACM SIGKDD, 2016, pp. 1135–1144. [13] L. C. Thomas, “A survey of credit and behavioural scoring,” Int. J. Forecasting, vol. 16, no. 2, pp. 149–172, 2000.

How to Cite This Paper

Durrusadaf Imanova, Hakan Kutucu (2026). Explainable Credit Risk Assessment: Comparing Logistic Regression, XGBoost, and LightGBM with SHAP and LIME Analysis. International Journal of Computer Techniques, 13(3). ISSN: 2394-2231.

© 2026 International Journal of Computer Techniques (IJCT). All rights reserved.

Submit Your Paper