Journal of Risk Model Validation
ISSN: 1753-9579 (print), 1753-9587 (online)
Editor-in-chief: Steve Satchell
An end-to-end deep learning approach to credit scoring using CNN + XGBoost on transaction data
Lars Ole Hjelkrem, Petter Eilif de Lange and Erik Nesset
Need to know
- Open banking APIs can be an important data source for banks assessing the creditworthiness of potential customers with application credit scoring models, and they have the potential to increase banks' profitability when recruiting new customers.
- We find that traditional regression models perform poorly, while machine learning methods can generate credit score models with satisfactory performance based solely on transaction data from the 90 days before the score date.
- The best-performing machine learning models are based on an end-to-end deep learning approach, where the machine learning algorithms create the explanatory variables based on non-aggregated data.
- This result is consistent with experiments in other scientific fields, where deep learning has replaced shallow learning as the state of the art.
Abstract
The performance of credit scoring models is closely linked to a bank’s profitability. Application scoring models for potential customers usually perform worse than models for existing customers. This is due to the lender not having access to the financial behavioral data of potential customers. Access to such data about potential customers could therefore increase a bank’s profitability. Open banking application programming interfaces (APIs) provide access to 90 days of historical data on potential customers’ balances and transactions. We examine the performance of credit scoring models developed using such data from a Norwegian bank. We find that traditional regression models perform poorly, while machine learning (ML) methods can provide models that perform satisfactorily based on these data alone. Further, we find that the best-performing models are based on an end-to-end deep learning approach, where machine learning algorithms create explanatory variables based on non-aggregated data. These results indicate that data available through the open banking APIs can be an important data source when banks assess the creditworthiness of potential customers. In combination with end-to-end deep learning methods, such data have the potential to increase the performance of a bank’s application credit scoring models and thus increase the bank’s profitability.
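To make the distinction between hand-built explanatory variables and non-aggregated input concrete, the following is a minimal sketch and not the authors' pipeline. It assumes transactions arrive as hypothetical `(date, signed_amount)` pairs and lays the 90 days before the score date out as a day-by-channel grid, the kind of sequence a convolutional model could consume directly. The abstract does not specify the input layout, so the daily granularity, the three channels, and the function names `daily_sequence` and `aggregated_features` are all assumptions made for illustration.

```python
from datetime import date, timedelta

WINDOW_DAYS = 90  # history length available through the open banking APIs, per the abstract


def daily_sequence(transactions, score_date, window_days=WINDOW_DAYS):
    """Build one [inflow, outflow, count] row per day in the window before score_date.

    `transactions` is a hypothetical list of (date, signed_amount) pairs;
    positive amounts are treated as inflows, negative amounts as outflows.
    """
    start = score_date - timedelta(days=window_days)
    rows = [[0.0, 0.0, 0] for _ in range(window_days)]
    for tx_date, amount in transactions:
        idx = (tx_date - start).days
        if 0 <= idx < window_days:  # ignore anything outside the window
            if amount >= 0:
                rows[idx][0] += amount
            else:
                rows[idx][1] += -amount
            rows[idx][2] += 1
    return rows


def aggregated_features(rows):
    """Collapse the daily grid into a few window-level totals.

    This stands in for the hand-crafted variables a traditional
    scorecard would use; the timing information is lost.
    """
    return {
        "total_inflow": sum(r[0] for r in rows),
        "total_outflow": sum(r[1] for r in rows),
        "n_transactions": sum(r[2] for r in rows),
    }
```

Under these assumptions, a traditional regression model would see only the output of `aggregated_features`, while an end-to-end model would receive the full grid from `daily_sequence` and learn its own explanatory variables from it.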
Copyright Infopro Digital Limited. All rights reserved.
As outlined in our terms and conditions, https://www.infopro-digital.com/terms-and-conditions/subscriptions/ (point 2.4), printing is limited to a single copy.
If you would like to purchase additional rights please email info@risk.net