Dr Emmanuel Ogundimu emmanuel.ogundimu@durham.ac.uk
Associate Professor
Prediction models in credit scoring are often formulated using available data on accepted applicants at the loan application stage. The use of this data to estimate probability of default (PD) may lead to bias due to non-random selection from the population of applicants. That is, the PD in the general population of applicants may not be the same with the PD in the subpopulation of the accepted applicants. A prominent model for the reduction of bias in this framework is the sample selection model, but there is no consensus on its utility yet. It is unclear if the bias-variance trade- off of regularization techniques can improve the predictions of PD in non-random sample selection setting. To address this, we propose the use of Lasso and adaptive Lasso for variable selection and optimal predictive accuracy. By appealing to the least square approximation of the likelihood function of sample selection model, we optimize the resulting function subject to L1 and adaptively weighted L1 penalties using an efficient algorithm. We evaluate the performance of the proposed approach and competing alternatives in a simulation study and applied it to the well-known American Express credit card dataset.
Ogundimu, E. O. (2024). On Lasso and adaptive Lasso for non-random sample in credit scoring. Statistical Modelling, 24(2), 115-138. https://doi.org/10.1177/1471082x221092181
Journal Article Type | Article |
---|---|
Acceptance Date | Apr 9, 2022 |
Online Publication Date | May 9, 2022 |
Publication Date | 2024-04 |
Deposit Date | May 9, 2022 |
Publicly Available Date | May 13, 2022 |
Journal | Statistical Modelling |
Print ISSN | 1471-082X |
Electronic ISSN | 1477-0342 |
Publisher | SAGE Publications |
Peer Reviewed | Peer Reviewed |
Volume | 24 |
Issue | 2 |
Pages | 115-138 |
DOI | https://doi.org/10.1177/1471082x221092181 |
Public URL | https://durham-repository.worktribe.com/output/1207860 |
Published Journal Article (Advance Online Version)
(186 Kb)
PDF
Publisher Licence URL
http://creativecommons.org/licenses/by/4.0/
Copyright Statement
Advance Online Version This article is distributed under the terms of the Creative Commons Attribution 4.0 License (https://creativecommons.org/licenses/by/4.0/) which permits any use, reproduction and distribution of the work without further permission provided the original work is attributed as specified on the SAGE and Open Access page (https://us.sagepub.com/en-us/nam/open-access-at-sage).
Published Journal Article
(651 Kb)
PDF
Licence
http://creativecommons.org/licenses/by/4.0/
Publisher Licence URL
http://creativecommons.org/licenses/by/4.0/
Developments in Statistical Modelling
(2024)
Book
About Durham Research Online (DRO)
Administrator e-mail: dro.admin@durham.ac.uk
This application uses the following open-source libraries:
Apache License Version 2.0 (http://www.apache.org/licenses/)
Apache License Version 2.0 (http://www.apache.org/licenses/)
SIL OFL 1.1 (http://scripts.sil.org/OFL)
MIT License (http://opensource.org/licenses/mit-license.html)
CC BY 3.0 ( http://creativecommons.org/licenses/by/3.0/)
Powered by Worktribe © 2025
Advanced Search