Insights in AI applied to Credit Scoring

Going beyond default prediction, a continuous system that promotes successful relationships.

Having access to liquidity has been a major issue for humankind for financing both personal life aspects (e.g., housing, cars, college) and for business initiatives (e.g., starting, growing, and expanding a business). The amount of uncertainty that both faces of this exchange – i.e., the creditor and the prospective debtor – face are paramount. On the one hand, the creditor tries to fill in all blanks about the prospect’s capacity to pay the loan requesting additional information, information that in some cases is difficult to obtain by the debtor, resulting in lost opportunities for both sides. On the other hand, the prospect faces a black box interaction, with an approval process that happens behind doors and whose criteria aren’t clear, so the prospective debtor cannot work on adjusting her score.

Artificial Intelligence has played a major role in solving the first side of the equation -the creditor side-, bringing an objective and reproducible credit score estimation of the prospective debtor, at the expense of bringing further opacity to the process for the prospective debtor. In this blog post, we will go through the current paradigm and how we could tackle it differently to perform a more comprehensive approach, identifying current limitations as well as potential solutions from an AI point of view. The ideas described in this article can be applied to both financial entities and subscription-based business models that face default as a source of involuntary churn.

Traditional credit scoring uses blackbox methods -either a human that is out of reach from the end-client or a statistical model- which weights various factors including demographic factors, payment history and other financial indicators. This traditional approach will deny credit to consumers without considering the possibility to adjust their current situation or other extenuating factors. In this sense, it’s modeled as the likelihood at the time of approval of a prospective debtor to pay in full or default. We observe two main limitations:

  1. One-side interaction: the interaction tends to be one-sided with small room for negotiation. Debtors and creditors should be able to update their terms & conditions to maximize the chance of a mutually beneficial interaction. A hard rejection will move the client to a different institution, being a fully-wasted opportunity for the creditor that could be -in some cases- easily transformed into an approval to a different kind of deal.
  2. One-shot decision: assuming the decision and conditions are a one-shot process. We should consider credit scoring strategies that continuously re-assess the default probability and adjust new conditions accordingly.

So, our view on Artificial Intelligence applied to credit scoring is a dynamic interaction, where the model suggests and re-assesses conditions to maximize the mutual benefits on a continuous basis.

Modeling and data

So, we can formalize a credit scoring model as a function that, given the prospective debtor, the creditor conditions and the request loan characteristics predicts a valuable KPI for decision making such as the default probability, the expected ROI, etc.

 f :: debtor, \: creditor, \: loan \: conditions \rightarrow KPI

Just for illustrative purposes, such information may comprise:

  • Debtor:
    • Demographic: age, profession, education, etc.
    • Financial records: income and spending, debts, mortgages, etc.
  • Creditor: portfolio, cash flow, market indicators, etc.
  • Loan conditions: the amount of money loaned, spread, annual percentage rate, timeline, guarantors, consequences in case of contract breach, etc.

We can use any model to encode the aforementioned function (e.g., scorecards, decision trees, neural networks).

As mentioned in the previous section, our view on credit scoring considers not just the end decision (i.e., approval vs. rejection) but the adjustment of the loan conditions to ensure a mutually beneficial outcome. So, an AI model should use the aforementioned function to discover the conditions that maximize your KPI

 loan \: conditions \in \, argmax \: f(debtor, \: creditor, \: loan \: conditions)

So, we can transform a rejection to an initial submission into approval of an alternative. Please note that while we are simplifying the search for the loan condition adjustment, the model could as well suggest changes to the debtor information. For instance, it could suggest that the easier way of approving the credit is to increase the debtor’s salary by a certain amount, or by reduce his monthly expenses. We prioritize changes on load conditions because they can be applied instantly while looking for approval through the client’s adjustment tend to require longer feedback loops.

However, regardless of our KPI or modeling strategy, we need to be careful about corner cases such as predicting that by lowering the interest rate to zero the chance of a client paying will maximize, leading to a zero profit for the creditor. Similarly, increasing the interest to infinity to maximize profit, but in practice, leads to prospects turning down the loan or getting into bankruptcy. If you’re facing this issue, take a look at causal modeling and monotonic models or simply embed some domain-driven constraints in the decision process.

Course, Templates

Data Ignite

Create bulletproof AI solutions following our methodology.

Learn More

Making credit approval transparent and fair

When dealing with credits, the ability to explain why a certain loan was not conceived, as well as the conditions that were not met, plays a central role. It is not only relevant for those in the receiving end, but also for those who could profit from the deal, if it had happened.

When one models the problem by changing the conditions and context of a client one can infer counterfactual explanations and come up with a hypothesis on how the client could be accepted, and what needs to be changed versus the reality.

Various models have been developed to conceptualize reasons and explanations on the decisions made, an interesting example of this is the model made by Cynthia Rudin’s team for the FICO Data Challenge. The strategy groups features by subscales and attributes to each of these subscales a miniature model (with a softmax). These miniature models are non-linear and thus, by doing so, one can build a tree-like model with machine learning under the hood. A demo of their work can be seen here.

From the picture above we can see that the delinquency is a subscale that can be given by those four features and that is contributing to the risk factor (warmer colors represent a higher risk). I would recommend the reader to read Cynthia’s newest article on black box models: “Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead

Despite all advantages that machine learning models carry, they are also associated with biases. It may be the data you’re using is already biased, the historical decisions made by humans are biased, or the sampling chosen is not representative, or you may be using the wrong algorithm or KPI for the specific problem. The fact is that there will always be some type of bias, and in credit scoring, we have to guarantee that our models are not tending to a specific race, or gender, for example. Note that biased decisions aren’t only prejudicial for the end customer but also for the financial institution. Having models that rely on spurious correlations will eventually lead to catastrophic failure.

Tools like Aequitas, AI Fairness 360, and  What-If are open-source toolkits that data scientists can use to check for bias ad discrimination in machine learning models. These tools allow for informed and equitable decisions around developing and deploying predictive tools. More on these tools here.

If you want to learn more about Fairness in AI, check our previous post.

Conclusion

A change in the actual paradigm of credit attribution could benefit both parties. More deals could be made, and, more importantly, more deals can be completely fulfilled as time goes by. We live in a fast-paced world, surrounded by continuous changes and new opportunities, why would we oversimplify and assume static solutions to dynamic environments?

Do you want to further discuss this idea?

Book a meeting with Kelwin Fernandes

Meet Kelwin Learn More

If you would like to work on credit scoring solutions, or further discuss what was exposed here, feel free to contact us!

Like this story?

Subscribe to Our Newsletter

Special offers, latest news and quality content in your inbox once per month.

Signup single post

Consent(Required)
This field is for validation purposes and should be left unchanged.

Recommended Articles

Article
How Often Should You Retrain Machine Learning Models?

A common question in the domain of AI and machine learning is: how often should you retrain machine learning models? The answer isn’t as straightforward as you might think. It’s not a one-size-fits-all solution, but rather a process that requires careful consideration and strategic planning. In this article, we’ll explore three strategies for deciding when […]

Read More
Article
Reject Option: Your AI Model Has the Right to Remain Silent

When it comes to AI models, we often expect them always to provide an answer. But what if we could trust them more when they choose to remain silent? This concept, known as the ‘Reject option‘, allows AI models to abstain from answering when they are not confident, opening up many applications in your business […]

Read More
Kelwin Fernandes at the leacture the 3 tribes of AI. UPdate2024
Article
Empowering Future Leaders with AI: NILG.AI at UPdate 2024

At NILG.AI, we were recently proud sponsors at UPdate 2024. We lead both a workshop and a lecture by our CEO, Kelwin Fernandes, who has over 15 years of experience in AI. Our goal? To empower future leaders and professionals to navigate the complex yet rewarding terrain of AI with confidence and safety. UPdate 2024, […]

Read More