C REDIT R ISK M ODELS C ROSS - V ALIDATION – I S T HERE A NY A DDED V ALUE ? Croatian Quants Day...

CREDIT RISK MODELS CROSS-VALIDATION – IS THERE ANY ADDED VALUE?

Croatian Quants Day

Zagreb, June 6, 2014

Vili Krainz

vili.krainz@rba.hr

The views expressed during this presentation are solely

those of the author

INTRODUCTION

Credit risk – The risk that one party to a financial contract will not perform the obligation partially or entirely (default)

Example – Bank loans The need to assess the level of credit risk – credit risk

rating models (credit scorecards) Problem – to determine the functional relationship

between obligor or loan characteristics X1, X2, ... , Xn (risk drivers) and binary event of default (0/1), in a form of latent variable of probability of default (PD)

SCORECARD DEVELOPMENT PROCESS Potential risk drivers – retail application example

Sociodemographic characteristics: Age, marital status, residential status...

Economic characteristics: Level of education, profession, years of work experience...

Financial characteristics: Monthly income, monthly income averages...

Stability characteristics: Time on current address, current job...

Loan characteristics: Installment amount, approved limit amount, loan maturity...

SCORECARD DEVELOPMENT PROCESS

Univariate analysis – analysis of each individual characteristic Fine classing – division of numeric variables into a

number (e.g. 20) of subgroups, analysis of general trend Coarse classing – grouping into (2-5) larger classes to

optimize predictiveness, with certain conditions (logical, monotonic trend, robust enough...)

Age Bad rate

<30 3.47%

[30, 55] 2.86%

>55 1.73%

SCORECARD DEVELOPMENT PROCESS Multivariate analysis

Correlation between characteristics Logit model – most widely used Logistic regression (with selection process)

PDscore

iscoreiii

ePDscore

SCORECARD MODEL PREDICTIVENESS

The goal of a scorecard model is to discriminate between the good and the bad applications

Predictivity is most commonly measured by Gini index (a.k.a Accuracy Ratio, Somers’ D)

SCORECARD MODEL CROSS-VALIDATION

At model development start, the whole data sample is split randomly (70/30, 75/25, 80/20...)

The bigger sample is used for model development, while the smaller sample is used for cross-validation

Model’s predictive power (Gini index) is measured on the independent, validation sample

Done to avoid overfitting The predictive power shouldn’t be much lower on

the validation sample than it is on the development – that’s when the validation is considered successful

WHAT IF VALIDATION FAILS?

Is it possible if everything is done „by the book”? Does that mean that:

Something was done wrong in model development process?

The sample is not suitable for modeling at all?The process needs to be repeated?

MONTE CARLO SIMULATIONS Real (masked) publicly available retail application data

(Thomas, L., Edelman, D. and Crook, J., 2002. Credit Scoring and Its Applications. Philadelphia: SIAM.)

1000 simulations of model development process in R Each time stratified random sampling (75/25) was done (on

several characteristics, including the target variable – default indicator)

Fine classing for the numeric variables Coarse classing all the variables using the code that simulates

modeler’s decisions Stepwise logistic regression using AIC Measuring Gini index on development and validation sample

Pre-selection of characteristics for the business logic and correlation

One reference model was built on whole data sample9

RESULTS

In 12.5% of cases we get a difference bigger than 0.1 Pearson’s chi-square test – all characteristics of all 1000 samples

representative at 5% significance level 10

RESULTS

Idea: Compare the scores from each simulation model to reference model (on the whole sample) and relate to differences in Gini

If there is a strong connection – we strive to get a model similar to the reference model

Wilcoxon paired (signed rank) test H0: median difference between the pairs is zero

H1: median difference is not zero.

Basically, the alternative hypothesis states that one model results in systematically different (higher or lower) scores than the other

RESULTS

Correlation: 0.68

RESULTS

FROM THE SIMULATIONS...

Regardless of a modeling job done right, validation can fail by chance

We like to have Gini index on the development sample “similar” to the one on the validation sample – we tend to get the model that is more similar to the reference model – why not develop on the whole sample in the first place?

Regardless of validation results and difference in Gini, predictive power on the whole data sample does not vary too much

INSTEAD OF A CONCLUSION...

Does this method of cross-validation bring any added value?

It may be more important to check whether all the modeling steps have been performed carefully and properly, and that best practices are used, in order to avoid overfitting

Can any cross-validation method can offer real assurance or does the only real test come with future data?

THANK YOU!

vili.krainz@rba.hr

C REDIT R ISK M ODELS C ROSS - V ALIDATION – I S T HERE A NY A DDED V ALUE ? Croatian Quants Day...

Documents

Cornell Baseball...• Dale Wickham and Frankie Padulo each had two doubles among their four hits over the weekend, and Cole Rutherford had 4 RBI. Ryan Krainz worked four walks and

Data V alidation Monitoring: Student Assessment Presents…

alidation Testing of Standard Requirements for … · NIST Technical Note 1789 Validation Testing of Standard Requirements for Backpack-type Radiation Detectors L. Pibida B. Norman

dded REVISED e 9 International Business Awards · Complete instructions about how to prepare and submit nominations to the world’s premier business awards program ... to marketing

Finite Element Analysis and Experimental V alidation of ...cervera.rmee.upc.edu/papers/2018-ADDMA-LSF-pre.pdf* Corresponding author. E-mail address: xlin@nwpu.edu.cn (Xin Lin) Finite

ALIDATION A TWO-PHASE NUMERICAL OLVER FOR ...webcritech.jrc.ec.europa.eu/tsunami/Portals/_default/...• Ghobara et al. (2006) • Nistor et al. (2009) • Chock et al. (2011) •

Microstructural Characterization and Influence of ... · Microstructural Characterization and Influence of Ceramography Method on the Microhardness of Sintering 93 gents dded Silicon

CREATING VALUE DDED W - Chapters Site · points (this is where real value is added!) Risk Categories - Standard • Reputational - Potential that negative publicity regarding an the

Sterling Executive Home · 1 COVEA 09 58 00 05 0 100 0 20 00 75 V: V COVEA INSURANCE Nº dossier : 20110324E Date : 8/12/11 alidation DA/DC alidation Client Sterling Executive Home

EMAS V ERIFICATION AND V ALIDATION Impressions from an Accreditation Body M ARC WOUTERS BELAC EMAS S TAKEHOLDERS D IALOGUE B RUSSELS 04.10.2013 The views

Chromophores from hexeneuronic acids: chemical behavior under … · 2017. 8. 26. · Markus Bacher . Karin Krainz . Thomas Dietz . Ute Henniges . Antje Potthast . Thomas Rosenau

Sp ecication and V alidation of aoverturetool.org/publications/books/proof-in-vdm/Ch3.pdf · Chapter Sp ecication and V alidation of a Net w ork Securit y P olicy Mo del P eter Lindsa

Integration and automation to improve your business processesIntegration that delivers real benefits •alidation API Address V — Catch errors before they catch up with you •alidation

U SING V ALUE -A DDED V ISUALS IN E-L EARNING Using Value-Added Visuals in E-Learning 1

I NFUSING V ALUE -A DDED INTO THE SLO PROCESS August 2015

KEA | HomeKARNATAKA EXAMINATIONS AUTHORITY 06.05.2014 add): 2014d ( dded - 2014d Cd Oea-ðd e:3dodD0d 20.05.2014dodo 10.00 odöð*ðddo geodde' ddô 2014d

SRI HOMES DESIGN SHOWCASE - Winfield Home … · design showcasesri homes ddesigns for esigns for aadded curb dded curb aappealppeal sstoragetorage ssolutionsolutions ffor kitchenor

C HAPTER 11 Competitive Interaction. A DDED T OOLS FOR E XTERNAL A NALYSIS Strategic Group Map Mobility Barriers Strategy Canvas Competitive Response

ALIDATION A TWO-PHASE NUMERICAL OLVER FOR SIMULATING … · 2016-09-16 · VALIDATION A TWO-PHASE NUMERICAL SOLVER FOR SIMULATING HYDRAULIC BORE INTERACTIONS WITH NEARSHORE STRUCTURES

XML - University of Chicagopeople.cs.uchicago.edu/~ravenben/publications/msreport.pdfn umerous b ene ts in-cluding structured extensibilit y, strong data v alidation capabilities,