Skip to content

remotain/NonPerformingLoans

Repository files navigation

Data challenge: Non-Performing Loans

This Data Challenge was part of the interview process for a position of Senior Data Scientist in an italian bank. The challenge was meant to demonstrate the required technical skills for the position. Thanks to the work performed to solve this Challenge, I obtained a job offer.

This public repository contains all the work done toward the Data Challenge. The same documents served to guide the interviewers through the general strategy, the methodology as well as the code during their review.

The results of this work are summarised in two main documents:

I prefer to not disclose the name of the company and the original dataset provided.

Executive summary

Two supervised machine learning algorithms, trained on the provided historical dataset, are employed to estimate the likelihood to repay and the recovery rate of Non-Performing Loans (NPL) in a 12 months ahead window. The model predictions are then used to re-rank NPL customers by the estimated probability to repay. Given the NPL sample available, as of September 2017, the expected 12 months recovery rate is between 1.6M€ and 6M€.

Framework

A Non-Performing Loan (NPL) is the sum of borrowed money upon which the debtor has not made his scheduled payments for at least 90 days. Once a loan is non-performing, the odds that it will be fully repaid are considered to be substantially lower. High levels of NPLs inhibit the capacity of banks to lend to the economy and take up valuable bank management time. As of the third quarter of 2016, NPLs of significant institutions in the Euro area amounted to €921bln (average NPE of 7%), therefore the ECB asks banks to devise a strategy to manage and reduce the volume of impaired loans.

Objective

The general goal of this data challenge is to help NPL portfolios managers in their daily tasks by re-ranking their customers according to the likelihood to repay and/or by suggesting new strategies to maximize the recovery rate. In particular, you are asked to perform analysis on the given data, train predictive models and present your findings in a clear and data-driven way.

About

Data Challenge: Non Performing Loans

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published