Documente Academic
Documente Profesional
Documente Cultură
I.
INTRODUCTION
regression. Antonio and Valdez [2] point out that, after decades
dominated by statistically unsophisticated models, it is now
recognized that Generalized Linear Models (GLMs) constitute
the efficient tool for risk classification. Kaas, Goovaerts,
Dhaene and Denuit [14] state that models allow the random
deviations from the mean to have a distribution different from
the normal and the mean of the random variable may be a
linear function of the explanatory variables on some other
scale. GLMs allow modelling a non-linear behaviour and a
non-Gaussian distribution of residuals. This aspect is very
useful for the analysis of non-life insurances, where claim
frequency and costs follows an asymmetric density that is
clearly non-Gaussian. GLMs development has contributed to
quality improvement of the risk prediction models and to the
process of establishing a fair tariff or premium given the nature
of the risk.
This paper present an example based on real-life insurance
data in order to illustrate several techniques in the framework
of GLMs. These illustrations are relevant for the insurers to
implement the used techniques in practice in order to obtain
equitable and reasonable premiums. In this purpose, the
structure of the paper is as follows. Section 2 presents the basic
distributions that can be used to model the two components of
pure premium, namely the frequency and cost of claims. In this
part of paper, the reasons for using these distributions are
explained and a special test concerning the difference between
claim frequency models is also described. Section 3 is
dedicated to an empirical application using a French
automobile insurance portfolio. This is followed by a
discussion and an interpretation of the obtained results.
Concluding remarks are summarized in Section 4.
II.
METHODOLOGY APPROACH
ISBN: 978-80-554-1003-6
( )
))
( )
( )
( )
ISBN: 978-80-554-1003-6
( )
( )
( )
( )
( )
{
( )
( )
where
is the deviance function of Poisson model
determined as follows:
(
( )
( )
( | ))
{
}
( )
(
)
where n represents the number of observations and p is the
number of parameters from the regression model.
The parameter estimatates ( ) are identical to those for the
Poisson model, which shows that estimates are identical, but
the standard errors of the estimators for the quasi-Poisson
model are modified by the dispersion factor . McCullagh
and Nelder [16] demonstrate that the parameter estimators
equality of the two models derived from the shape of the
likelihood function corresponding to Poisson distribution and
ISBN: 978-80-554-1003-6
where:
1. ( ) is the deviance of the regression modell that
includes the constant term without any explicative variable,
being expressed as follows:
( )
( (
(
( )
))
and variance ( )
)
.
is the number
( )
)
or can be simplified as
Defining
ISBN: 978-80-554-1003-6
) independent of their
A. Data Used
In this paper, the data used constitute a French automobile
portfolio insured against theft of the vehicle and possibly
damage to the vehicle, comprising 50000 polices registered
during the year 2009. An insurance policy corresponds to one
policyholder and the elements included in the policies are the
analysis factors presented bellow. Hence, except the explained
variables, the frequency and cost of claims, the other ones are
considered risk factors, known a priori by the insurer and are
used to customize the profile of each insured. These exogenous
variables reflect the insured characteristics: age (18-75 years),
profession (employed, housewife, retired, self-employed,
unemployed); the vehicle features category (large, medium,
small), brand (A, B, C, D, E, F), GPS (Yes, No), purpose of
vehicle usage (private, professional); the insurance contracts
characteristics: duration (0-15 years), bonus-malus coefficient
(50-150 by 10).
Among the explanatory variables introduced in the analysis,
bonus-malus coefficient presents a particular interest, assuming
the increase or decrease of insurance premium depending on
the number of claims registered by an insured during a
reference period. Therefore, if the policyholder does not cause
any responsible accident, he receives a bonus, meaning that the
insurance premium will be reduced with 5%. Contrary, if the
insured is responsible for the accident, he is penalized by
applying a malus of 25% for a claim declared, which will have
the consequence of a premium increase. The implementation
bonus-malus system is different from one country to another,
but the principle remains the same, namely to purchase the
encouragement of prudent insured and the discouragement of
those who, for various reasons, declare many claims, and
thereby they present a high degree of risk for the insurance
company.
B. Equations
Further there are presented and interpreted the results
obtained through the application of the models mentioned,
based on which the pure premium is determined. The variables
entered previously are taken into consideration as risk factors
and the models are fitted using the SAS 9.3 software by means
of GENMOD procedure. This procedure enables the use of
Type 3 analysis that allows the contribution assessment of each
risk factor, considering all the others explanatory variables.
The type 3 analysis provides the values of Chi-Square statistics
for each variable by calculating two times the difference
between the log-likelihood of the model which includes all the
independent variables and the log-likelihood of the model
obtained by deleting one of the specified variables. This
statistics test appreciates the impact of each risk factor on the
studied phenomenon and follows the asymptotic
distribution with df degrees of freedom, representing the
number of parameters associated to the analyzed variable.
Poisson model
By employing the Poisson model to estimate the frequency
of claims, the results obtained are shown in Table 1.
ISBN: 978-80-554-1003-6
TABLE I.
Poisson Regression(*)
Source
Age
Occup
Categ
Brand
GPS
Bonus
Poldur
Use
Chi-Square
Poisson Regression(**)
Pr > ChiSq
Chi-Square
<.0001
<.0001
0.1252
<.0001
<.0001
<.0001
<.0001
0.2644
91.64
63.81
47.05
84.55
452.06
35.53
-
87.99
63.86
4.16
46.92
84.52
451.80
35.55
1.25
Pr > ChiSq
<.0001
<.0001
<.0001
<.0001
<.0001
<.0001
-
Poisson Regression
Paramete Estima Std
r
te
Error
Intercept
-0.301 0.074
Age
-0.043 0.001
Occup -0.336 0.036
employed
Occup -0.411 0.043
housewife
Occup -0.045 0.065
retired
Occup -0.015 0.038
self employed
Brand - A -0.356 0.055
Brand - B -0.357 0.057
Brand - C -0.308 0.060
Brand - D -0.112 0.056
Brand (E) -0.039 0.060
GPS - No
0.179 0.029
Bonus0.007 0.001
Malus
Poldur
-0.025 0.003
Scale
1.000 0.000
Quasi-Poisson Regression
Wald 95%
Confidence
Limits
Estima
te
Std
Error
Wald 95%
Confidence
Limits
-0.171
0.081 -0.045
0.067 -0.177
0.086
-0.091
0.060 -0.015
0.040 -0.094
0.063
-0.464
-0.468
-0.426
-0.222
-0.156
0.122
0.007
-0.248
-0.246
-0.190
-0.003
0.079
0.237
0.007
-0.356
-0.357
-0.308
-0.112
-0.039
0.179
0.007
-0.031
1.000
-0.020 -0.025
1.000 1.039
0.057
0.059
0.063
0.058
0.062
0.030
0.001
Poisson regression
Criterion
Deviance
Pearson ChiSquare
Log Likelihood
(
)*
Log Likelihood
( )**
DF
Gamma regression
Value/
DF
Value
50000 38486.723
50000 161961.992
DF
0.257
1.080
Value/
DF
Value
5500 5598.940
5500 5130.767
1.017
0.930
-24548.229
-47901.574
-25651.178
-48480.555
*the loglikelihood for the regression model including all the significant risk factors
**the loglikelihood for the regression model without explanatory variables
Gamma Regression*
Source
Age
Occup
Brand
Categ
GPS
BonusMalus
Poldur
-0.468 -0.244
-0.473 -0.242
-0.431 -0.185
-0.226 0.001
-0.160 0.083
0.120 0.239
0.007 0.008
Chi-Square
Pr > ChiSq
Gamma Regression**
Chi-Square
Pr > ChiSq
181.64
90.66
78.98
2.11
21.84
0.13
<.0001
<.0001
<.0001
0.3485
<.0001
0.7136
189.84
91.73
79.91
22.62
-
<.0001
<.0001
<.0001
<.0001
-
0.46
0.4980
ISBN: 978-80-554-1003-6
Parameter
Intercept
Age ( )
Occup - employed ( )
Occup - housewife ( )
Occup - retired ( )
Occup - self-employed
( )
Brand - A ( )
Brand - B ( )
Brand - C ( )
Brand - D ( )
Brand - E ( )
GPS - no ( )
Bonus-Malus ( )
Poldur ( )
Gamma
Regression
-0.301
-0.043
-0.336
-0.411
-0.045
-0.015
0.074
0.001
0.036
0.043
0.065
0.038
8.456
-0.012
-0.167
0.024
0.023
0.297
0.108
0.002
0.064
0.077
0.111
0.069
Pure
Premium
6.268
-0.031
-0.433
-0.394
-0.220
0.349
-0.356
-0.357
-0.308
-0.112
-0.039
0.179
0.007
-0.025
0.055
0.057
0.060
0.056
0.060
0.029
0.001
0.003
-0.421
-0.446
-0.300
-0.172
-0.203
-
0.093
0.096
0.103
0.096
0.102
-
-0.778
-0.875
-0.674
-0.359
-0.246
0.422
0.008
-0.029
Estimate
Std
Error
Estimate
Std
Error
CONCLUSIONS
ISBN: 978-80-554-1003-6
[5]
[7]
[3]
[4]
[9]
[10]
[12]
[13]
[14]
[15]
REFERENCES
[2]
[8]
[11]
ACKNOWLEDGMENT
[1]
[6]
[16]
[17]
[18]
[19]
[20]
ISBN: 978-80-554-1003-6