Upload
tanadkit-munkonghat
View
3
Download
0
Embed Size (px)
DESCRIPTION
Anderson
Citation preview
WWW.WATSONWYATT.COM
GLM III: Advanced Modeling Strategy2005CASSeminaronPredictiveModeling
DuncanAndersonMAFIA
WatsonWyattWorldwide
CopyrightWatsonWyattWorldwide.Allrightsreserved.
Agenda
l Introduction
l Testingthelinkfunction
l TheTweediedistribution
lSplines
lReferencemodels
lAliasing/nearaliasing
lCombiningmodelsacrossclaimtypes
lRestrictedmodels
CopyrightWatsonWyattWorldwide.Allrightsreserved.
"A Practitioner's Guide to GLMs"
l 2004CASDiscussionPaperProgram
lDiscusses testingthelinkfunction theTweediedistribution aliasing/nearaliasing combiningmodelsacrossclaimtypes
restrictedmodels
lCopiesavailablehere
CopyrightWatsonWyattWorldwide.Allrightsreserved.
Generalized linear models
E[Yi]= m i=g1(SXij b j+ x i)
Var[Yi]= f.V(m i)/w ilConsiderallfactorssimultaneously
lProvidestatisticaldiagnostics
lAllowfornatureofrandomprocess
lRobustandtransparent
l Increasinglyaglobalindustrystandard
CopyrightWatsonWyattWorldwide.Allrightsreserved.
Generalized linear models
E[Y]= m =g1(X.b + x)
Var[Y]= f.V(m)/w
CopyrightWatsonWyattWorldwide.Allrightsreserved.
Generalized linear models
E[Y]= m =g1(X.b + x)
Yvariate
Linkfunction
Designmatrix Parameterestimates
Offsetterm
CopyrightWatsonWyattWorldwide.Allrightsreserved.
Generalized linear models
E[Y]= m =g1(X.b + x)
Observedthing(data)
Somefunction(userdefined)
Somematrixbasedondata
(userdefined)
Parameterstobe
estimated(theanswer!)
Knowneffects
CopyrightWatsonWyattWorldwide.Allrightsreserved.
Generalized linear models
Var[Y]= f.V(m)/w
Scaleparameter
Variancefunction
Priorweights
l Usuallyassumeexponentialfamily,eg
l f = s 2 (estimated),V(x)=1 Var[Yi]= s 2 Normal
l f =1 (specified), V(x)=x Var[Yi]= m i Poisson
l f =k (estimated),V(x)=x2 Var[Yi]=km i2 Gamma
CopyrightWatsonWyattWorldwide.Allrightsreserved.
Agenda
l Introduction
l Testingthelinkfunction
l TheTweediedistribution
lSplines
lReferencemodels
lAliasing/nearaliasing
lCombiningmodelsacrossclaimtypes
lRestrictedmodels
NumericalMLE
ParameterEstimates Diagnostics
Data
LinearPredictorForm
Linkfunctiong(x) ErrorStructure V(m)
ScaleParameter
PriorWeights
f
Offset x
w Y
a, b, g, d
X.b = a i+ b j+ g k+ d l
CopyrightWatsonWyattWorldwide.Allrightsreserved.
Model testing
lUseonlythosefactorswhicharepredictive standarderrorsofparameterestimates
Ftests/ c 2testsondeviances stepwiseapproach(helpfulifusedwithcare) consistencyovertime humanintuition
lMakesurethemodelisreasonable variancefunction:residualplots
(histograms/QQ/residualvsfittedvalueetc) outliers:leverage/Cook'sdistance linkfunction:BoxCox
CopyrightWatsonWyattWorldwide.Allrightsreserved.
BoxCox link function investigation
lGLMstructureisE[Y]= m =g1(X.b + x) Var[Y]= f.V(m)/ w
lBoxCoxtransformsdefinesg(x)=(x l 1)/ l for l0,ln(x)for l=0
l l =1 g(x)=x1 additive(withbaselevelshift)
l l 0 g(x) ln(x) multiplicative(viamaths)
l l =1 g(x)=11/x inverse(withbaselevelshift)
l Trydifferentvaluesof l andmeasuregoodnessoffittoseewhichfitsexperiencebest
CopyrightWatsonWyattWorldwide.Allrightsreserved.
BoxCox link function investigation Auto third party property damage frequencies
137910
137870
137830
137790
0.6 0.4 0.2 0 0.2 0.4 0.6
l
Likelihoo
d
MultiplicativeInverse Additive
CopyrightWatsonWyattWorldwide.Allrightsreserved.
BoxCox link function investigation Auto third party property damage average amounts
268569
268569
268568
268568
268567
268567
268566
0.6 0.5 0.4 0.3 0.2 0.1 0 0.1 0.2 0.3
l
Likelihoo
d
Inverse Multiplicative Additive
CopyrightWatsonWyattWorldwide.Allrightsreserved.
BoxCox link function investigation Comparing fitted values of different link functions
0
50000
100000
150000
200000
250000
300000
350000
0.792
0.803
0.814
0.824
0.835
0.846
0.857
0.868
0.878
0.889
0.900
0.911
0.921
0.932
0.943
0.954
0.965
0.975
0.986
0.997
1.008
1.018
1.029
1.040
1.051
1.061
1.072
1.083
1.094
1.105
1.115
1.126
1.137
1.148
1.158
1.169
1.180
1.191
1.202
1.212
1.223
1.234
1.245
1.255
RatiooffittedvaluesforLambda=0(mult)toLambda=0.3
Cou
ntofrec
ords
Notmuch
CopyrightWatsonWyattWorldwide.Allrightsreserved.
Agenda
l Introduction
l Testingthelinkfunction
l TheTweediedistribution
lSplines
lReferencemodels
lAliasing/nearaliasing
lCombiningmodelsacrossclaimtypes
lRestrictedmodels
CopyrightWatsonWyattWorldwide.Allrightsreserved.
Standard approach
AmtFreq
AmtFreq
AmtFreq
AmtFreq
AmtFreq
BI x =Cost1
PD x =Cost2
COL x =Cost4
MED x =Cost3
OTC x =Cost5
CopyrightWatsonWyattWorldwide.Allrightsreserved.
Tweedie distributions
l Incurredlosseshaveapointmassatzeroandthenacontinuousdistribution
l Poissonandgammanotsuitedtothis
l TweediedistributioncanhavepointmassatzeroandparameterswhichcanaltertheshapetobelikePoissonandgammaabovezero
{ } { }
=
-
> - - G
- =
100
1
0for)]([exp.!)(
)/1()(),,(
n
n
Y yyynny
yf q k q lw a
k lw a l q a a a
0
0.2
{ })(exp)0( 0 q lwk a - = =Yp
CopyrightWatsonWyattWorldwide.Allrightsreserved.
Tweedie distributions
l p=1correspondstoPoisson,p=2togamma
l Definesavaliddistributionforp
CopyrightWatsonWyattWorldwide.Allrightsreserved.
Var[Y]= f.V(m)/ w Normal: f = s 2,V(x)=1 Var[Y]= s 2.1
Poisson: f =1,V(x)=x Var[Y]= m
Gamma: f =k,V(x)=x2 Var[Y]=km 2
Generalised linear models
Tweedie: f =k,V(x)=xp Var[Y]=km p
CopyrightWatsonWyattWorldwide.Allrightsreserved.
Example 1: frequency
ComparisonofTweediemodelwithtraditionalfrequency/amountsapproachRun7Model2Frequency
75%
17%
21%
15%23%
0%
44%
96%94%128%
156%
446%568%
1.8
1.2
0.6
0
0.6
1.2
1.8
2.4
BonusMalus
Logofm
ultip
lier
0
20000
40000
60000
80000
100000
A B C D E F G H I J K L M
Exp
osure(yea
rs)
Onew ayrelativities Approx95%confidenceinterval Unsmoothedestimate Smoothedestimate
Pvalue=0.0%Rank12/12
CopyrightWatsonWyattWorldwide.Allrightsreserved.
Example 1: amounts
ComparisonofTweediemodelwithtraditionalfrequency/amountsapproachRun7Model6Amounts
10%
4%
2%3%
2%0%
7%
3%5%
10%
6%
26%24%
0.4
0.3
0.2
0.1
0
0.1
0.2
0.3
0.4
0.5
BonusMalus
Logofm
ultip
lier
0
200
400
600
800
1000
1200
1400
1600
A B C D E F G H I J K L M
Num
berofclaim
s
Onew ayrelativities Approx95%confidenceinterval Unsmoothedestimate SmoothedestimateEXCLUDEDFACTOR
Pvalue=50.9%Rank4/12
CopyrightWatsonWyattWorldwide.Allrightsreserved.
Example 1: traditional RP vs Tweedie
ComparisonofTweediemodelwithtraditionalfrequency/amountsapproachRun11Model2TweedieModels
75%
17%
21%
15%23%
0%
44%
96%94%128%
156%
443%567%
75%
17%
21%
15%23%
0%
44%
96%94%128%
156%
446%568%
1.8
1.2
0.6
0
0.6
1.2
1.8
2.4
BonusMalus
Logofm
ultip
lier
0
20000
40000
60000
80000
100000
A B C D E F G H I J K L M
Exp
osure(yea
rs)
Tw eedieSE Tw eedie RPSE RP
CopyrightWatsonWyattWorldwide.Allrightsreserved.
Example 2: frequency
ComparisonofTweediemodelwithtraditionalfrequency/amountsapproachRun7Model1Frequency
62%
84%
69%75%
68% 67%61%
58% 58%
45% 48%54%
47%
38%
0%
0.1
0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
Ageofvehicle
Logofm
ultip
lier
0
20000
40000
60000
80000
100000
0 1 2 3 4 5 6 7 8 9 10 11 12 13 >13
Exp
osure(yea
rs)
Onew ayrelativities Approx95%confidenceinterval Unsmoothedestimate Smoothedestimate
Pvalue=0.0%Rank12/12
CopyrightWatsonWyattWorldwide.Allrightsreserved.
Example 2: amounts
ComparisonofTweediemodelwithtraditionalfrequency/amountsapproachRun7Model5Amounts
0%
5%
4%
10%
13%14%
11%8%
24%
11%
14%
17%
11%10%
13%
0.06
0
0.06
0.12
0.18
0.24
0.3
0.36
Ageofvehicle
Logofm
ultip
lier
0
500
1000
1500
2000
2500
3000
3500
4000
4500
0 1 2 3 4 5 6 7 8 9 10 11 12 13 >13
Num
berofclaim
s
Onew ayrelativities Approx95%confidenceinterval Unsmoothedestimate Smoothedestimate
Pvalue=0.0%Rank5/7
CopyrightWatsonWyattWorldwide.Allrightsreserved.
Example 2: traditional RP vs Tweedie
ComparisonofTweediemodelwithtraditionalfrequency/amountsapproachRun11Model1TweedieModels
88%
107%
90%
106%94% 90%
104%
73% 75%67% 70%
73%
55%
34%
0%
83%
101%
87%
105%
92%86%
99%
71% 75%65% 67%
70%
53%
31%
0%
0.2
0
0.2
0.4
0.6
0.8
1
Ageofvehicle
Logofm
ultip
lier
0
20000
40000
60000
80000
100000
0 1 2 3 4 5 6 7 8 9 10 11 12 13 >13
Exp
osure(yea
rs)
Tw eedieSE Tw eedie RPSE RP Numbers Amounts
CopyrightWatsonWyattWorldwide.Allrightsreserved.
Example 3: traditional RP vs Tweedie
ComparisonofTweediemodelwithtraditionalfrequency/amountsapproachRun11Model1TweedieModels
0%
9%
0%
10%
0.2
0.15
0.1
0.05
0
0.05
0.1
Occupation
Logofm
ultip
lier
0
50000
100000
150000
200000
250000
300000
350000
400000
450000
Category1 Category2
Exp
osure(yea
rs)
Tw eedieSE Tw eedie RPSE RP Numbers Amounts
CopyrightWatsonWyattWorldwide.Allrightsreserved.
Example 4: frequency
ComparisonofTweediemodelwithtraditionalfrequency/amountsapproachRun7Model1Frequency
39%
20%17%
3%
8%8%3%
8%5%
2%2%0%2%
6%
21%
0.5
0.4
0.3
0.2
0.1
0
0.1
0.2
0.3
0.4
Ageofdriver
Logofm
ultip
lier
0
10000
20000
30000
40000
50000
60000
70000
80000
CopyrightWatsonWyattWorldwide.Allrightsreserved.
Example 4: amounts
ComparisonofTweediemodelwithtraditionalfrequency/amountsapproachRun7Model5Amounts
16%
4%2%
0%
5%3%
5%
9%
6%
1%1%0%2%
3%3%
0.24
0.18
0.12
0.06
0
0.06
0.12
0.18
Ageofdriver
Logofm
ultip
lier
0
1000
2000
3000
4000
5000
6000
CopyrightWatsonWyattWorldwide.Allrightsreserved.
Example 4: traditional RP vs Tweedie
ComparisonofTweediemodelwithtraditionalfrequency/amountsapproachRun11Model1TweedieModels
28%
16%14%
2%
15%16%
1%
20%13%
0%4%
0%1%
11%
23%
39%
20%17%
3%
8%8%3%
8%5%
2%2%0%2%
6%
21%
0.6
0.4
0.2
0
0.2
0.4
0.6
Ageofdriver
Logofm
ultip
lier
0
10000
20000
30000
40000
50000
60000
70000
80000
CopyrightWatsonWyattWorldwide.Allrightsreserved.
Agenda
l Introduction
l Testingthelinkfunction
l TheTweediedistribution
lSplines
lReferencemodels
lAliasing/nearaliasing
lCombiningmodelsacrossclaimtypes
lRestrictedmodels
CopyrightWatsonWyattWorldwide.Allrightsreserved.
Spline definition
0
10
20
30
40
50
60
70
80
90
100
0 25 50 75 100 125 150 175 200
l Aseriesofpolynomialfunctions,witheachfunctiondefinedoverashortinterval
l Intervalsaredefinedbyk+2knots twoexteriorknotsatextremesofdata
variablenumber(k)ofinteriorknots
l Ateachinteriorknotthetwofunctionsmustjoin"smoothly"
CopyrightWatsonWyattWorldwide.Allrightsreserved.
Cubic splines
lEachpolynomialisacubic a+bx+cx2+dx3
l "Smoothness"atinteriorknotsisdefinedas: continuous continuousfirstderivative continuoussecondderivative
CopyrightWatsonWyattWorldwide.Allrightsreserved.
Regression splines
l Thepositionoftheknotsisspecifiedbytheuser
lStandardGLMscanbeusedbycarefuldefinitionofvariates
lPros fitseasilyintoexistingstructures
nocomplexresamplingneeded
lCons positionofknotscaneffectfinalanswer
CopyrightWatsonWyattWorldwide.Allrightsreserved.
Smoothing splines
lOneknotateachuniquedatavalue
lAdditionalcurvaturepenaltypreventsoverfitting
lCurvaturepenaltyselectedbyrepeatedlysamplingsubsetsandoptimisinggeneralisedgoodnessoffitmeasuresuchasAIC
lPros allowsdatatoguidefinalresult
lCons 100sofknotsrequired optimisationprocessistimeconsuming
difficulttoproducenewfittedvalues
CopyrightWatsonWyattWorldwide.Allrightsreserved.
"Easy" regression splines
l Fitacubicoverthewholerange simplydefinex,x2andx3asvariatesandincludeinthemodel
l Fitadditionalcubic"correction"variatesforeachinterval,definedas 0ifx
CopyrightWatsonWyattWorldwide.Allrightsreserved.
"Easy" regression splines
0.0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
1.0
0 25 50 75 100
CopyrightWatsonWyattWorldwide.Allrightsreserved.
"Easy" regression splines
0
5
10
15
20
25
0 25 50 75 100
CopyrightWatsonWyattWorldwide.Allrightsreserved.
"Easy" regression splines
0
10
20
30
40
50
60
70
0 25 50 75 100
CopyrightWatsonWyattWorldwide.Allrightsreserved.
"Easy" regression splines
l "Correction"variatesgetlargequickly
l InpracticeGLMprocesscanstrugglewiththeselargenumbers
lAlternatebasisisclearlydesirable
CopyrightWatsonWyattWorldwide.Allrightsreserved.
BSplines
l Setofbasisfunctionsusuallycoveringfoursegments(definedbyfiveknots)
l Eachfunctionisitselfacubicspline
l Eachbasisfunctionhasthesameshape,exceptforthethreebasisfunctionsateachextremewhichoccupyfewerthanfoursegments
0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
1
0 25 50 75 100
CopyrightWatsonWyattWorldwide.Allrightsreserved.
BSplines
0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
1
0 25 50 75 100
CopyrightWatsonWyattWorldwide.Allrightsreserved.
BSplines
0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
1
0 25 50 75 100
CopyrightWatsonWyattWorldwide.Allrightsreserved.
BSplines
0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
1
0 25 50 75 100
CopyrightWatsonWyattWorldwide.Allrightsreserved.
BSplines
0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
1
0 25 50 75 100
CopyrightWatsonWyattWorldwide.Allrightsreserved.
Example
0.4
0.2
0
0.2
0.4
0.6
0.8
1
1.2
1.4
18 21 24 27 30 33 36 39 42 45 48 51 54 57 60 63 66 69 72 75
Factor
CopyrightWatsonWyattWorldwide.Allrightsreserved.
Example
0.4
0.2
0
0.2
0.4
0.6
0.8
1
1.2
1.4
18 21 24 27 30 33 36 39 42 45 48 51 54 57 60 63 66 69 72 75
Factor Polynomial
CopyrightWatsonWyattWorldwide.Allrightsreserved.
Example
0.4
0.2
0
0.2
0.4
0.6
0.8
1
1.2
1.4
18 21 24 27 30 33 36 39 42 45 48 51 54 57 60 63 66 69 72 75
Spline Factor Polynomial
CopyrightWatsonWyattWorldwide.Allrightsreserved.
Example
0.4
0.2
0
0.2
0.4
0.6
0.8
1
1.2
1.4
18 21 24 27 30 33 36 39 42 45 48 51 54 57 60 63 66 69 72 75
Spline Factor SplineSE SplineSE
CopyrightWatsonWyattWorldwide.Allrightsreserved.
Extrapolation
lCancombinethethreeboundarybasisfunctionstoachievelinearextrapolation
lSumofthreefunctionstendsto1attheexteriorknot,andcontinuesas1whenextrapolated
lCancombinethelasttwofunctionstogivefunctionthattendstoastraightlineattheexteriorknot,andcanbeextrapolatedassuch
CopyrightWatsonWyattWorldwide.Allrightsreserved.
BSplines
0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
1
0 25 50 75 100
CopyrightWatsonWyattWorldwide.Allrightsreserved.
BSplines
0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
1
0 25 50 75 100
CopyrightWatsonWyattWorldwide.Allrightsreserved.
BSplines
0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
1
0 25 50 75 100
CopyrightWatsonWyattWorldwide.Allrightsreserved.
Example
0.4
0.2
0
0.2
0.4
0.6
0.8
1
1.2
1.4
15 18 21 24 27 30 33 36 39 42 45 48 51 54 57 60 63 66 69 72 75 78
Spline Factor SplineSE SplineSE Polynomial
CopyrightWatsonWyattWorldwide.Allrightsreserved.
Further example
ComparisonoffactorwithsplineUrbandensity
149%
208%
81%
53%44%
18%
1%0%2%
15%9%
28%25%28%34%35%35%34%37%
33%
0.9
0.6
0.3
0
0.3
0.6
0.9
1.2
Populationdensity
Logofm
ultip
lier
0
10000
20000
30000
40000
50000
60000
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20
Exp
osure(yea
rs)
FactorSE FactorPvalue=0.0%Rank7/11
CopyrightWatsonWyattWorldwide.Allrightsreserved.
Further example
ComparisonoffactorwithsplineUrbandensity
0.9
0.6
0.3
0
0.3
0.6
0.9
1.2
Populationdensity
Logofm
ultip
lier
0
10000
20000
30000
40000
50000
60000
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20
Exp
osure(yea
rs)
FactorSE Factor SplinePvalue=0.0%Rank7/11
CopyrightWatsonWyattWorldwide.Allrightsreserved.
Further example
ComparisonoffactorwithsplineUrbandensity
0.9
0.6
0.3
0
0.3
0.6
0.9
1.2
Populationdensity
Logofm
ultip
lier
0
10000
20000
30000
40000
50000
60000
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20
Exp
osure(yea
rs)
Spline
Pvalue=0.0%Rank7/11
CopyrightWatsonWyattWorldwide.Allrightsreserved.
Further example
ComparisonoffactorwithsplineUrbandensity
182%
139%
101%
74%
40%
17%
5%0%3%
8%15%
24%29%
31%33%35%36%37%37%38%
0.9
0.6
0.3
0
0.3
0.6
0.9
1.2
Populationdensity
Logofm
ultip
lier
0
10000
20000
30000
40000
50000
60000
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20
Exp
osure(yea
rs)
Factor SplineSE SplinePvalue=0.0%Rank7/11
CopyrightWatsonWyattWorldwide.Allrightsreserved.
Splines
lPracticalwayofmodelingcontinuousvariables
lOftenbetterthanpolynomials
l Increasescomplexity,thereforebestusedwhenitisimportantthatratesvarycontinuouslywithavariable
whenmodelingelasticitytobeusedinpriceoptimizationanalyses
CopyrightWatsonWyattWorldwide.Allrightsreserved.
Agenda
l Introduction
l Testingthelinkfunction
l TheTweediedistribution
lSplines
lReferencemodels
lAliasing/nearaliasing
lCombiningmodelsacrossclaimtypes
lRestrictedmodels
CopyrightWatsonWyattWorldwide.Allrightsreserved.
Standard approach
AmtFreq
AmtFreq
AmtFreq
AmtFreq
AmtFreq
BI x =Cost1
PD x =Cost2
COL x =Cost4
MED x =Cost3
OTC x =Cost5
CopyrightWatsonWyattWorldwide.Allrightsreserved.
Binomial reference models
Amt
Amt
AmtFreq
AmtFreq
AmtFreq
=Cost1
=Cost2
COL x =Cost4
MED x =Cost3
OTC x =Cost5
Freq BI/PD?TP
CopyrightWatsonWyattWorldwide.Allrightsreserved.
0.6
0.5
0.4
0.3
0.2
0.1
0
0.1
0.2
0.3
0.4
0.5
1 2 3 4 5
Offset reference model
CopyrightWatsonWyattWorldwide.Allrightsreserved.
0.6
0.5
0.4
0.3
0.2
0.1
0
0.1
0.2
0.3
0.4
0.5
1 2 3 4 5
Offset reference model
CopyrightWatsonWyattWorldwide.Allrightsreserved.
0.6
0.5
0.4
0.3
0.2
0.1
0
0.1
0.2
0.3
0.4
0.5
1 2 3 4 5
Offset reference model
CopyrightWatsonWyattWorldwide.Allrightsreserved.
0.6
0.5
0.4
0.3
0.2
0.1
0
0.1
0.2
0.3
0.4
0.5
1 2 3 4 5
Offset reference model
CopyrightWatsonWyattWorldwide.Allrightsreserved.
0.6
0.5
0.4
0.3
0.2
0.1
0
0.1
0.2
0.3
0.4
0.5
1 2 3 4 5
Offset reference model
CopyrightWatsonWyattWorldwide.Allrightsreserved.
0.6
0.5
0.4
0.3
0.2
0.1
0
0.1
0.2
0.3
0.4
0.5
1 2 3 4 5
Offset reference model
CopyrightWatsonWyattWorldwide.Allrightsreserved.
0.6
0.5
0.4
0.3
0.2
0.1
0
0.1
0.2
0.3
0.4
0.5
1 2 3 4 5
Offset reference model
CopyrightWatsonWyattWorldwide.Allrightsreserved.
0.6
0.5
0.4
0.3
0.2
0.1
0
0.1
0.2
0.3
0.4
0.5
1 2 3 4 5
Offset reference model
CopyrightWatsonWyattWorldwide.Allrightsreserved.
Testing the reference model approach
100%oflargecompany
(1)FittoBIclaimsonalldatathe"correctanswer"
10%Randomsampletoemulatesmallcompany
(2)ModelBIclaimswithstandardapproach
(3)ModelBIclaimsreferencingPDexperienceonthissmallsample
CopyrightWatsonWyattWorldwide.Allrightsreserved.
Example of reference model method working
ExamplePImodels
27%
14%
0%
19%
14%
0%
0.06
0
0.06
0.12
0.18
0.24
0.3
0.36
FactorX
Logofm
ultip
lier
0
200000
400000
600000
800000
1000000
1200000
1400000
1600000
Level1 Level2 Level3
Expos
ure(yea
rs)
Approx2s.e.fromestimateFullmodel UnsmoothedestimateFullmodel PDmodel
StandardBIGLMfactorrejected
Green:BI100%("correctanswer")
Blue:StandardBIGLMon10%
Purple:ReferencingPDon10%
CopyrightWatsonWyattWorldwide.Allrightsreserved.
Example of reference model method working
ExamplePImodels
0%6%
10%
1%
12%
20%
42%
54%
66%
54%
0%
38%
10%
34%
6%
34%29%
182%191%
100%
0%7%
3%
14%11%12%
24%
38%
90%
37%
0.3
0
0.3
0.6
0.9
1.2
FactorY
Logofm
ultip
lier
0
200000
400000
600000
800000
1000000
1200000
1400000
1600000
LevelA LevelB LevelC LevelD LevelE LevelF LevelG LevelH LevelI LevelJ
Exp
osure(yea
rs)
Approx2s.e.f romestimateFullmodel UnsmoothedestimateFullmodel Unsmoothedestimate10%model Approx2s.e.f romestimate10%model PDmodel
Green:BI100%("correctanswer")
Blue:StandardBIGLMon10%
Purple:ReferencingPDon10%
CopyrightWatsonWyattWorldwide.Allrightsreserved.
Agenda
l Introduction
l Testingthelinkfunction
l TheTweediedistribution
lSplines
lReferencemodels
lAliasing/nearaliasing
lCombiningmodelsacrossclaimtypes
lRestrictedmodels
CopyrightWatsonWyattWorldwide.Allrightsreserved.
Aliasing and "near aliasing"
lAliasing theremovalofunwantedredundantparameters
l Intrinsicaliasing occursbythedesignofthemodel
lExtrinsicaliasing occurs"accidentally"asaresultofthedata
CopyrightWatsonWyattWorldwide.Allrightsreserved.
Example
l Supposewewantedamodeloftheform:
m = a + b ifage40
+ g ifsexmale
+ g ifsexfemale
2
3
1
2
1 m = a + b ifage40
+ g ifsexmale
+ g ifsexfemale
2
3
1
2
1
CopyrightWatsonWyattWorldwide.Allrightsreserved.
1 010 101 100 101 100 011 001 101 010 01....................................................
Age Sex
ab b b g g
1
2
1
3
2
.
40 MF
1
2
3
4
5
Form ofX.b in this case
CopyrightWatsonWyattWorldwide.Allrightsreserved.
Example
l Supposewewantedamodeloftheform:
m = a + b ifage40
+ g ifsexmale
+ g ifsexfemale
2
3
1
2
1 m = a + b ifage40
+ g ifsexmale
+ g ifsexfemale
2
3
1
2
1
"Baselevels"
CopyrightWatsonWyattWorldwide.Allrightsreserved.
X.b having adjusted for base levels
1 010 101 100 101 100 011 001 101 010 01....................................................
Age Sex
ab b b g g
1
2
1
3
2
.
40 MF
1
2
3
4
5
CopyrightWatsonWyattWorldwide.Allrightsreserved.
X.b having adjusted for base levels
1 00 01 10 01 10 11 01 01 00 1..........................................
Age Sex
a
b 1
g 2
b 3
.
40 F
1
2
3
4
5
CopyrightWatsonWyattWorldwide.Allrightsreserved.
Intrinsic aliasing
ExamplejobRun16Model3SmallinteractionThirdpartymaterialdamage,Numbers
11%6%
19%
0%
24%28%
63%
138%155%
0.4
0.2
0
0.2
0.4
0.6
0.8
1
1.2
Ageofdriver
Logofm
ultip
lier
0
50000
100000
150000
200000
250000
1721 2224 2529 3034 3539 4049 5059 6069 70+
Exp
osure(yea
rs)
Onew ayrelativities Approx95%conf idenceinterval Unsmoothedestimate Smoothedestimate
CopyrightWatsonWyattWorldwide.Allrightsreserved.
Extrinsic aliasing
l Ifaperfectcorrelationexists,onefactorcanaliaslevelsofanother
l Egifdoorsdeclaredfirst:
l Thisistheonlyreasontheorderofdeclarationcanmatter(fittedvaluesareunaffected)
Exposure:#Doors Color
2 3 4 5Unknown
Red 13,234 12,343 13,432 13,432 0
Green 4,543 4,543 13,243 2,345 0
Blue 6,544 5,443 15,654 4,565 0
Black 4,643 1,235 14,565 4,545 0
Unknown 0 0 0 0 3,242
Selectedbase
Selectedbase
Furtheraliasing
CopyrightWatsonWyattWorldwide.Allrightsreserved.
Extrinsic aliasing
ExamplejobRun16Model3SmallinteractionThirdpartymaterialdamage,Numbers
0%
135%
103%
91%
70%72%82%
57%
45%39%
30%
0%
0.2
0
0.2
0.4
0.6
0.8
1
Noclaimdiscount
Logofm
ultip
lier
0
100000
200000
300000
400000
500000
600000
50 5559 6064 6569 7074 7579 8084 8589 9094 9599 100 Malus
Exp
osure(yea
rs)
Onew ayrelativities Approx95%conf idenceinterval Unsmoothedestimate Smoothedestimate
CopyrightWatsonWyattWorldwide.Allrightsreserved.
"Near aliasing"
l Iftwofactorsarealmostperfectly,butnotquitealiased,convergenceproblemscanresultand/orresultscanbecomehardtointerpret
l Egifthe2black,unknowndoorspolicieshadnoclaims,GLMwouldtrytoestimateaverylargenegativenumberforunknowndoors,andaverylargepositivenumberforunknowncolor
Exposure:#Doors Color
2 3 4 5Unknown
Red 13,234 12,343 13,432 13,432 0
Green 4,543 4,543 13,243 2,345 0
Blue 6,544 5,443 15,654 4,565 0
Black 4,643 1,235 14,565 4,545 2
Unknown 0 0 0 0 3,242
Selectedbase
Selectedbase
CopyrightWatsonWyattWorldwide.Allrightsreserved.
"Near aliasing" solution
1. Spotit
2. Fixthedata!
Exposure:#Doors Colour
2 3 4 5Unknown
Red 13,234 12,343 13,432 13,432 0
Green 4,543 4,543 13,243 2,345 0
Blue 6,544 5,443 15,654 4,565 0
Black 4,643 1,235 14,565 4,545 2
Unknown 0 0 0 0 3,242
CopyrightWatsonWyattWorldwide.Allrightsreserved.
Agenda
l Introduction
l Testingthelinkfunction
l TheTweediedistribution
lSplines
lReferencemodels
lAliasing/nearaliasing
lCombiningmodelsacrossclaimtypes
lRestrictedmodels
CopyrightWatsonWyattWorldwide.Allrightsreserved.
lMultiplyfactorsforfrequenciesandamounts
lCalculateriskpremiumassumofclaimelements
AmtFreqPD x =Cost2
AmtFreqMED x =Cost3
AmtFreqCOL x =Cost4
AmtFreqBI x =Cost1
AmtFreqOTC x =Cost5
Combining claim elements I
CopyrightWatsonWyattWorldwide.Allrightsreserved.
l Considercurrentexposure
l Calculateexpectedfrequencyandamountforeachclaimtypeforeachrecord
l Combinetogiveexpectedtotalcostofclaimsforeachrecord
l Fitmodeltothisexpectedvalue
Combining claim elements II
AmtFreq
AmtFreq
AmtFreq
AmtFreq
AmtFreq
BI x =Cost1
PD x =Cost2
COL x =Cost4
MED x =Cost3
OTC x =Cost5
CopyrightWatsonWyattWorldwide.Allrightsreserved.
Calculation of risk premium
Policy Sex Area WWNUM1 WWAMT1 WWNUM2 WWAMT2 WWCC1 WWCC2 WWRSKPRM
82155654 M T 32% 1000 12% 4860 320 583.20 903.2082168746 F T 24% 1200 8% 4374 288 349.92 637.9282179481 M C 40% 700 9% 4050 280 364.50 644.5082186845 F C 30% 840 6% 3645 252 218.70 470.70
TPPDNumbers
TPPDAmounts
TPBINumbers
TPBIAmounts
Intercept 32% 1000 12% 4860Sex Male 1.000 1.000 1.000 1.000
Female 0.750 1.200 0.667 0.900Area Town 1.000 1.000 1.000 1.000
Country 1.250 0.700 0.750 0.833
CopyrightWatsonWyattWorldwide.Allrightsreserved.
Risk premium standard errors
lRiskpremiummodelstandarderrorsaresmallowingtothesmoothnessoftheexpectedvalue
l Itispossibletoapproximatestandarderrorofriskpremiumparameterestimatesbasedonstandarderrorsofparameterestimatesinunderlyingmodels
lCareneededininterpretingsuchapproximationssincetheydonotreflectmodelerror,egdecidingtoexcludeamarginalfactor
CopyrightWatsonWyattWorldwide.Allrightsreserved.
Risk premium standard errors failings
Amounts
Numbers
Riskpremium
CopyrightWatsonWyattWorldwide.Allrightsreserved.
Risk premium standard errors failings
Amounts
Numbers
Riskpremium
CopyrightWatsonWyattWorldwide.Allrightsreserved.
Agenda
l Introduction
l Testingthelinkfunction
l TheTweediedistribution
lSplines
lReferencemodels
lAliasing/nearaliasing
lCombiningmodelsacrossclaimtypes
lRestrictedmodels
CopyrightWatsonWyattWorldwide.Allrightsreserved.
E[Y]= m =g(X .b + x )1
l Offsettermusedforknowneffects,egexposureinanumbersmodel
l Canalsobeusedtoconstrainmodel(egclaimfreeyears/paymentfrequency/amountofcover)
l Otherfactorsadjustedtocompensate
Offset
Restricted models
CopyrightWatsonWyattWorldwide.Allrightsreserved.
Restricted models
1 00 01 10 01 10 11 01 01 00 1..........................................
Age Sex
a
b 1 b 1
g 2
g 2
b 3
b 3
.
40 F
1
2
3
4
5
1
2
3
4
5
CopyrightWatsonWyattWorldwide.Allrightsreserved.
E[Y]= m =g(X .b )1Restricted models
1 00 01 10 01 10 11 01 01 00 1..........................................
Age Sex
a
b 1 b 1
g 2
g 2
b 3
b 3
.
40 F
1
2
3
4
5
1
2
3
4
5
CopyrightWatsonWyattWorldwide.Allrightsreserved.
E[Y]= m =g(X .b + x )1Restricted models
000.100.1......
+
1 001 101 101 011 00..........................................
Age
a
b 1
b 3
.
40
1
2
3
4
5
1
2
3
4
5
CopyrightWatsonWyattWorldwide.Allrightsreserved.
Restricted models
1 00 01 10 01 10 11 01 01 00 1..........................................
Age Sex
a
b 1
0.1
b 3
.
40 F
1
2
3
4
5
1
2
3
4
5
CopyrightWatsonWyattWorldwide.Allrightsreserved.
Restricted models
Restriction
0 1 2 3 4 5 6 7 8 9 10 11 12 13 141.5
1
0.5
0
0.5
1
1.5
2
BonusMalus
Logofm
ultip
lier
1 2 3 4 5 6 7 8 9 10 11 12 13 14 151.5
1
0.5
0
0.5
1
1.5
Vehiclegroup
Logofm
ultip
lier
GeographicalzoneA B C D E F G H I J K L M N O
2
1.5
1
0.5
0
0.5
1
1.5
Logofm
ultip
lier
18 19 20 22 24 26 28 30 35 40 45 50 55 60 651
0.5
0
0.5
1
1.5
Ageofdriver
Logofm
ultip
lier
CopyrightWatsonWyattWorldwide.Allrightsreserved.
0 1 2 3 4 5 6 7 8 9 10 11 12 13 141.5
1
0.5
0
0.5
1
1.5
2
BonusMalus
Logofm
ultip
lier
1 2 3 4 5 6 7 8 9 10 11 12 13 14 151.5
1
0.5
0
0.5
1
1.5
Vehiclegroup
Logofm
ultip
lier
GeographicalzoneA B C D E F G H I J K L M N O
2
1.5
1
0.5
0
0.5
1
1.5
Logofm
ultip
lier
18 19 20 22 24 26 28 30 35 40 45 50 55 60 651
0.5
0
0.5
1
1.5
Ageofdriver
Logofm
ultip
lier
Restricted models
Restriction
Modelcompensates(asbestitcan)
toallowforrestriction
CopyrightWatsonWyattWorldwide.Allrightsreserved.
Testing the effectiveness of restrictions
0
2000
4000
6000
8000
10000
12000
0.800
0.810
0.830
0.840
0.860
0.870
0.890
0.900
0.920
0.930
0.950
0.960
0.980
0.990
1.010
1.020
1.040
1.050
1.070
1.080
1.100
1.110
1.130
1.140
1.160
1.170
1.190
1.200
1.220
1.230
1.250
1.260
1.280
1.290
1.310
1.320
1.340
1.350
1.370
1.380
1.400
1.410
1.430
1.440
1.460
1.470
1.490
1.500
Ratio
Cou
ntofreco
rds
A B C D E F G H I J K L M N O P Q R S
l EffectofrestrictingBonusMalus
l OtherfactorsnotverycorrelatedwithBonusMalus
CopyrightWatsonWyattWorldwide.Allrightsreserved.
Testing the effectiveness of restrictions
0
2000
4000
6000
8000
10000
12000
0.800
0.810
0.830
0.840
0.860
0.870
0.890
0.900
0.920
0.930
0.950
0.960
0.980
0.990
1.010
1.020
1.040
1.050
1.070
1.080
1.100
1.110
1.130
1.140
1.160
1.170
1.190
1.200
1.220
1.230
1.250
1.260
1.280
1.290
1.310
1.320
1.340
1.350
1.370
1.380
1.400
1.410
1.430
1.440
1.460
1.470
1.490
1.500
Ratio
Cou
ntofreco
rds
A B C D E F G H I J K L M N O P Q R S
l EffectofrestrictingBonusMalus
l Addinginameasureofclaimfreeyears
CopyrightWatsonWyattWorldwide.Allrightsreserved.
Restrictions
lOnlyuseto"getaround"restrictions
lAcommercialsmoothingisacommercialsmoothing
lApplyatriskpremiumstage
WWW.WATSONWYATT.COM
GLM III: Advanced Modeling Strategy2005CASSeminaronPredictiveModeling
DuncanAndersonMAFIA
WatsonWyattWorldwide