• No results found

Log-normal approximations in the Solvency II Standard Formula

N/A
N/A
Protected

Academic year: 2022

Share "Log-normal approximations in the Solvency II Standard Formula"

Copied!
51
0
0

Laster.... (Se fulltekst nå)

Fulltekst

(1)

Log-normal approximations in the Solvency II Standard Formula

Peder T. Haagensli

Master’s Thesis, Spring 2017

(2)

This master’s thesis is submitted under the master’s programme Modelling and Data Analysis, with programme optionFinance, Insurance and Risk, at the Department of Mathematics, University of Oslo. The scope of the thesis is 30 credits.

The front page depicts a section of the root system of the exceptional Lie group E8, projected into the plane. Lie groups were invented by the Norwegian mathematician Sophus Lie (1842–1899) to express symmetries in differential equations and today they play a central role in various parts of mathematics.

(3)

Abstract

On the 1st of January 2016, the Solvency II Directive regulating the European insurance industry came into force. Along with a host of reporting requirements, quantitative algorithms for calculating key quantities are provided in the Solvency II documentation.

One of these key quantities is the Solvency Capital Requirement (SCR) providing an insurance company with a "soft" oor in terms of capital that it needs to hold. A vital element in calculating the SCR for an insurance company is the so-called Standard formula based on Gaussian risks which is used iteratively to aggregate the dierent risk elements in the business.

An alternative to the Standard formula was proposed by Bølviken and Guillen who introduced log-normal distributions to better capture the skewness often present in in- surance risks. They used classical moment matching to approximate the distribution of a sum of log-normal risks. In this thesis we will consider another log-normal alternative making use of moment-generating functions. This method will then be shown to oer superior accuracy compared to the Standard formula and the method of Bølviken and Guillen.

Keywords: Cholesky decomposition, Gauss-Hermite quadrature, moment-generating function, principal components, Solvency II, spectral decomposition, Standard formula, sum of log-normals.

(4)

Contents

1 Introduction 1

2 The Solvency II approach 2

2.1 Introduction . . . 2

2.2 The Standard Formula . . . 3

3 A log-normal standard formula 4 3.1 Introduction . . . 4

3.2 The Fenton-Wilkinson approximation . . . 4

3.3 Matching moment-generating functions . . . 5

3.3.1 Cholesky decomposition . . . 6

3.3.2 Spectral decomposition . . . 8

3.3.3 Dimension reduction . . . 9

4 Numerical experiments 9 4.1 Accuracy . . . 9

4.2 Design of the MGF method . . . 10

4.3 Computing time experiments . . . 11

5 Conclusion 12 References 13 Appendices 14 A Figures - Numerical experiments 14 B PDF and MGF 18 B.1 PDF of our log-normal model . . . 18

B.2 The moment-generating function ofY and Gauss-Hermite . . . 18

C The Cholesky decomposition 19 C.1 Change of variable . . . 19

C.2 Applying Gauss-Hermite . . . 19

D The Spectral decomposition 20 D.1 Change of variables . . . 20

D.2 Applying Gauss-Hermite . . . 21

D.3 Dimension reduction . . . 21

E Variance relations 22 E.1 Variance ofYS . . . 22

E.2 Relationship between the correlation of and Y . . . 22

F Solvency II correlation matrices 23

(5)

R-code 25

Function library . . . 25

Program 1 - Density plots for dierent t . . . 34

Program 2a - Accuracy in theK = 6 problem . . . 35

Program 2b - Accuracy in theK = 12 problem . . . 37

Program 3 - Testingt-values for the K = 12 problem . . . 38

Program 4 - Testingt-values for the K = 7 problem . . . 40

Program 5a - Computing time experiments for the K = 12 problem . . . 42

Program 5b - Computing time experiments for the K = 7 problem . . . 44

(6)

1 Introduction

Solvency II is a set of regulations from the European Insurance And Occupational Pensions Authority (EIOPA) placed upon insurance companies conducting business in the EU. The directive institutes a collection of algorithms for calculating a set of quantities which will dictate the amount capital an insurance company will be required to hold to be allowed to oer insurance. Solvency II also issues a set of protocols with respect to reporting to national authorities and improving public disclosure in the insurance business.

With its introduction EIOPA seeks to establish an EU wide license supporting the idea of a single market in the EU.

Risk aggregation in Solvency II takes a modular approach. Referring to Figure 1 which is taken from Bølviken (2017). Consider the overall risk of a modern insurance company which may be broken down into many dierent types of risks. Take for example the risk linked with market investments. This market risk may be viewed as a sum of sub-risks associated with the interest rate, equity, property and so on. On the other hand, the overall market risk may also be viewed as a part of a larger aggregated risk along with life risk, non-life risk etc.

The Solvency Capital Requirement (SCR) is a key quantity in Solvency II. The SCR represents the amount of capital an insurance company is required to hold in order to reliably be able to meet its liabilities. The 99.5 percent principle is a key feature in Solvency II which means that there should in principle be no more than a 0.5 percent chance of the insurance company going bankrupt within one year. In mathematical terms, we may think of the SCR as a percentile of an associated risk variable. An insurance company is required to calculate its top-level company SCR regularly. This is done by starting at the lowest possible level of Figure 1. Solvency II provides us with algorithms for approximating the SCRs connected to all of the input nodes at the lowest level.

Then after obtaining all the risks making up a particular node one level up we aggregate these risks into a single new SCR for that node. For example after obtaining SCRs for Healthli, Healthnli and Catastrophe these are used to aggregate up one level, giving us the Health SCR. After doing this for the other nodes Market, Life, Non-Life etc. these SCRs are used to aggregate up one level again. This continues until all of the risks making up the business have been aggregated to one single parent node. Details beyond what is presented in this thesis regarding this process can be found in EIOPA (2014) and Commission Delegated Regulations (2015).

This aggregation process uses a mathematical formula referred to as the Standard formula which is mathematically correct if the underlying risks are Gaussian. Conse- quently, the Standard Formula may seem tting when considering for example nancial risk which often exhibits approximate normality. Insurance risk however, will generally not t any Gaussian assumption. This breach of assumption may reveal that the 99.5 percent principle in Solvency is misleading.

Bølviken and Guillen (2017) proposed an extension based on log-normal risks. They used the method of matching the moments of the sum of the underlying log-normal risk variables with that of a new log-normal variable. There is however another alternative from engineering literature presented by Methe et al. (2007). This method also creates a new log-normal variable, but through matching the moment-generating function of the new variable with that of the sum of the underlying log-normal variables. The purpose of this thesis is to develop and apply this technique to the model in Bølviken and Guillen (2017). Furthermore, the performance of the resulting method will be compared to the

(7)

method in Bølviken and Guillen (2017) and the Standard formula. The procedures will be implemented using the programming language R for which the source code will be provided on page 25. A large part of the time dedicated to this thesis was spent producing the code for these methods from scratch.

Figure 1: Risk evaluations in Solvency II and its aggregation. There is also another level below for some nodes (not in gure).

2 The Solvency II approach

2.1 Introduction

We will now try to formalize the methodology described in section 1. Consider a random variable Z representing the total company net loss for the coming year. If it is negative then the business is protable that year. The expectation E[Z] is predictable and the Solvency II approach is to examine

Y =Z−E[Z].

The Solvency Capital Requirement is then the solution of the equation

P(Y >SCR) = (1)

with = 0.5%. In order to reasonably assume the company to be able to meet its obligations Solvency II require the insurance company to hold assets valued at leastE[Z]+

SCR at the start of the year.

Consider the upper layer in Figure 1 with risk variables Z =Zoper+Zbasic+Zadjust. Subtracting the means then yields the variables of interest Y =Yoper+Ybasic+Yadjust.

(8)

Equation (1) may be solved through the use of Monte Carlo, but that involves much modelling and is not the Solvency II way. Instead, SCRs on the lowest levels are calculated and aggregated iteratively in order to end up with the SCR for the entire company.

Consider for example the top-level SCRcompany which is immediately comprised of and calculated from SCRoper, SCRbasic and some adjustment term Adj. Now, SCRbasic is computed from the SCRs from market, lif e and so on, and those are in turn computed from the SCRs of their own underlying variables. Those may have other risk variables under them and the same technique of merging risks from their percentiles are used there.

This scheme deals with many types of risk and the true distribution of these dierent risks may vary. Financial risk will often emit Gaussian traits while risk connected with say natural catastrophes in insurance will be much more skewed. With these considerations notwithstanding, the risks are handled in the same way in regards to risk aggregation.

How is this carried out?

The general case is K risk variables Yi, ...YK with mean equal to zero for which their Solvency Capital Requirements SCR1, ...SCRK are to be aggregated to the corresponding SCR for the sum

YS =Y1+...+YK.

Of course, for the bottom nodes where the scheme starts the SCRs must be found by other methods. How this is done is not relevant to the scope of this thesis, but one may consult EIOPA (2014) to see how this is done for each node. Our objective is the aggregation of percentiles from one level in Figure 1 to another.

2.2 The Standard Formula

We still assume that we have K nodes representing the dependent risk variablesY1, ..., YK making up a parent node and risk variableYS. These random variablesYkhave expectation equal to zero and 99.5 percentiles SCR1, ...,SCRK. The dependence structure is handled through the use of correlation coecients

ρij =cor(Yi, Yj)

which are supplied by Solvency II documentation. See EIOPA (2014) or some of the samples on page 23. We then seek the corresponding SCRS or 99.5 percentile of the sum YS =P

Yk that we obtain through the Standard formula SCRS =

K

X

i=1 K

X

j=1

ρij×SCRi×SCRj

!1/2

. (2)

If the risk variables Yk are Gaussian with mean zero, then (2) is mathematically exact.

To see this, observe that if we let φ be the 99.5th percentile of the standard normal distribution and denote the standard deviation of risk k = 1, ..., K as σk then

SCRk=φ×σk, k = 1, ..., K.

Now take the variance formula for the sum YS, which is

σS =

K

X

i=1 K

X

j=1

ρijσiσj

!1/2

,

(9)

and multiply both sides by φ and you arrive at (2).

The Standard formula may therefore seem tting when considering nancial risk which often exhibits approximate normality. Insurance risk, however, will generally not t any Gaussian assumption. This breach of assumption may mean that the 99.5 percent prin- ciple in Solvency II is misleading. However, in spite of this, there is no denying the simplicity of (2) in terms of implementation and its ability to quickly aggregate many sources of risks quickly and easily. There is also the question of whether or not describing the dependence through the use of correlation coecients is appropriate or not for all the dierent types of risk. We will not address that question in this paper. In the next section we will explore alternative methods to the Standard formula which will abandon any Gaussian assumption of the underlying risks Yk making up the sum. We will then compare the accuracy of these methods and eventually discuss any trade-os with these methods in regards to practical implementation.

3 A log-normal standard formula

3.1 Introduction

As noted in the previous section the mathematics behind the Standard Formula which the risk aggregation in Solvency II is built upon assumes Gaussian risks. An alternative approach is to assume the underlying risks to be log-normal. Our model (3) is taken from Bølviken and Guillen (2017) and should better accommodate non-Gaussian risks skewed to the right which is common in insurance. We will assume risks of the form

Ykke−τk2/2+τkk−1 peτk2 −1

, k = 1, ..., K (3)

where the elements in = (k) are Gaussian with mean zero, standard deviation 1 and correlation matrix Σ. Furthermore, σk is the standard deviation of Yk and τk is a shape parameter. Given the shape parameter τk the skewness γk is given by the formula

γk = (eτk2 + 2)p

eτk2 −1. (4)

One may show by application of L'Hôpital's rule that Yk would approach a normal dis- tribution if we were to let τk tend to zero. Choosing the log-normal distribution over a Gaussian represents a step towards more skewed distribution which should be closer to reality when it comes to insurance risk. The log-normal distribution in particular also has some properties that makes it easy to work with. Note that Yk has expectation zero.

Our goal is then to approximate the sum PK

k=1Yk with a new log-normal variable YSSe−τS2/2+τSS −1

peτS2 −1

for some new parameters (σS, τS). This problem has no exact solution. However, it has been studied a great deal and several methods have been developed.

3.2 The Fenton-Wilkinson approximation

Bølviken and Guillen (2017) have already studied this particular problem through the use of the so-called Fenton-Wilkinson scheme. They considered the skewness and standard

(10)

deviation of the sum of the log-normals P

Yk and selected the new parameters (σS, τS) in order to match the skewness of P

Yk and YS. They selected σS through the ordinary variance-formula for a sum of correlated variables.

σS =

n

X

i=1 n

X

j=1

ρijσiσj

!1/2

(5) The skewness coecientsγk is readily obtained through (4) and they showed the skewness of the sum P

Yk to be given by γS =X

i

α3i(eτi2 + 2)(eτj2 −1)2+ 3X

i6=j

α2iαjh

eτi2ij2 −1)−2(βij −1)i + 6 X

i<j<k

αiαjαkijβikβjk−βij −βik−βjk+ 2] (6) where

αi = σi σS

peτi2 −1 and βij = 1 +ρij r

eτi2 −1

eτj2 −1 . For their proof see the appendix of Bølviken and Guillen (2017)

They then used these results to match the skewness of the new random variableYSwith PYk. Finally, with γS in hand they then obtain τS by solving equation (4) with respect to τk. This then fully species the new random variable YS such that skew(P

Yk) = skew(YS).

This method may be generalized to distributions other than the log-normal. For eligible distributions, an exact expression for γS might not exist in which case numerical methods may be introduced in order to solve the equations. Bølviken and Guillen (2017) also demonstrate how copulas may be incorporated instead of correlation coecients to better capture dierent dependence structures. All in all this approach is a simple and exible solution to the problem and their results suggest promising accuracy for problems with K = 3 log-normal risks for dierent degrees of skewness.

3.3 Matching moment-generating functions

We will now develop another way of approximating the sum of log-normals with a new log-normals. The method will have similarities with the methods of moment-matching in Bølviken and Guillen (2017), but will rather obtain the parameters (σS, τS)by match- ing the moment-generating functions at some point t.1 This concept is introduced in Metha et al. (2007), but we will also suggest a modication in an attempt to reduce the computational burden of problems with a large number of dimensions K.

We will consider the same model as Bølviken and Guillen. The properties of the log-normal distribution are well-studied. The pdf is given by

fYk(y) = 1

˜

σfVky

˜ σ + 1

, −˜σk < y <∞

1We will more specically use the characteristic function which is a special case of the moment- generating function. However, we will not make any distinction between these two and treat them as one.

(11)

where we have dened Vk := exp(−τk2/2 +τkk) and σ˜k := σk/p

eτk2 −1 to simplify the expression. The pdf of Vk is readily known as

fVk(v) = 1 vτk

√2πexp

−(lnv−(−τk2/2))2k2

. The moment-generating function (MGF) is written as

MYk(t) =E e−tY

= Z

−˜σk

exp(−ty)fYk(y)dy. (7) We may here interpret the exponential function as a weight function with a tuning pa- rameter t. If we then set the MGF of YS equal to the MGF of the sum P

Yk we may solve for the parameters (τS, σS)at some point t.

We may also relate this to the skewness matching method in Bølviken and Guillen (2017) by noting that matching variance and skewness is equivalent to solving the equa- tions

Z

−∞

wi(y)fYS(y)dy= Z

−∞

wi(y)fPYk(y)dy (8) for the corresponding weight functions w1(y) = (y− E[Yk])2 = y2 and w2(y) = (y − E[Yk])3 =y3. Dierent weight functions will translate to dierent penalization schemes.

Using a weight function like(y−E[Yk])3 will penalize the tail portion of the density more than for example ln(y) which will penalize the head portion more. By choosing exp(−ty) as our weight function we will then be able to tune which parts of the density we want to emphasize. In particular, our focus will be the 99.5th percentile in Solvency II. This makes the method more exible than similar methods which use a xed weight function.

On Figure 2 equation (8) is solved using the weight function exp(−ty)for dierent values oft with the Cholesky approach described in section 3.3.1. Simulation of the sum directly using Monte Carlo is also included. Here we see that by taking t= 5, or weight function exp(−5y), errors in the tail of the distribution is penalized less. While with t = 0.5 we t the tail much better. For values t = 0.1 and t = 0.02 it may seem that we need to go even further along the tail to a reach an area where the scheme benets from such small values. On the other hand, t= 5 ts the head portion of the distribution quite well in comparison to the smaller values of t. The next step in the process is now to derive approximations for the MGF of the sum P

Yk and the tting variable YS. This will be done through the use of Gauss-Hermite expansions of the integrals.

3.3.1 Cholesky decomposition Now, the MGF of the sum P

Yk may be written on the following form MPYk(t) =E

e−tPYk

=EhY

exp(−t˜σk(exp(Xk)−1))i

= Z

−∞

· · · Z

−∞

K

Y

k=1

exp(−tσ˜k(exp(xk)−1)) 1

(2π)K/2|C|1/2 exp

−1

2[x−(−τ2/2)]TC−1[x−(−τ2/2)]

dx1...dxK (9)

(12)

Figure 2: Main window: CCDFs resulting from solving the problem for dierent values of t. Top right window: Corresponding CDFs. Details on page 34 - Program 1

whereτ = (τk),X= (Xk)is the Gaussian vector withXk =−τk2/2 +τkk and the matrix C is the covariance matrix of X.

There is no general closed-form expression for (7) and (9) but we may apply Gauss- Hermite approximations to the integrals for real values of t. For the 1-dimensional case of (7) we perform a standard change of variable to get an integrand compatible with Gauss-Hermite integration. We then end up with the following approximation.

MYk(t)≈ΨYk(t;σk, τk) =

N

X

n=1

wn

√π exph

−t˜σk

exp√

kan−τk2/2

−1i

(10) an and wn are the abscissas and weights of the Gauss-Hermite expansion of order N, respectively. The derivation of (10) may be found in the appendix section B on page 18.

The MGF of the sum is more complicated because of the covariance matrix C which we need to get rid of before Gauss-Hermite expansion may be applied. One way of doing this is by applying the following Cholesky transformation as done in Mehta et al. (2007)

xk=√ 2

K

X

i=1

lkizi−τk2/2, k= 1, ..., K

where L = (lki)Kk,i=1 is the Cholesky decomposition of C, i.e. C = LLT. This removes the covariance matrix from the last exponent in (9) and makes the integration variables independent in a sense. After iteratively applying Gauss-Hermite expansion with respect to the variables of integration this approach yield the following approximation to the MGF of the sum P

Yk. MPYk(t)≈ΨPYk(t;σ,τ, C)

=

N

X

nK=1

· · ·

N

X

n1=1

wn1· · ·wnK πK/2

K

Y

k=1

exp (

−t˜σk

"

exp √ 2

K

X

j=1

lkjanj−τk2/2

!

−1

#)

(11)

(13)

Details in the appendix section C on page 19. Then given parameters τ, σ, σS and covariance matrix C we may take some value of t and solve

ΨPYk(t;σ,τ, C) = ΨYS(t;σS, τS) (12) with respect to τS. σS is already given by

σ2S =VarX Yk

=

K

X

i,j=1

cov(Yi, Yj) =

K

X

i,j=1

ρijσiσj.

3.3.2 Spectral decomposition

We will now introduce an alternative to the expression in (11) based on spectral decom- position rather than Cholesky decomposition. This will open up the possibility of easing the computational burden by reducing the number of sums in an expression like (11) from K to some smaller number L. The expression in (11) sums NK products. In Solvency II we will encounter K = 12 dimensional problems and these computations will quickly become non-trivial even for a modern computer.

Consider the correlation matrix of ,Σ, and its spectral decomposition Σ = ΓΛΓT

where thei-th column vector ofΓis thei-th eigenvector ofΣwith corresponding eigenvalue λisorted by descending eigenvalue. Λis the matrix with eigenvalues along the diagonal in descending order. Instead of the Cholesky decomposition used above we instead consider the transformation

δ = ΓT or = Γδ.

We may then write the model using this transformation Yk = ˜σk exp −τk2/2 +τk

K

X

j=1

γkjδj

!

−1

!

with γkj being the element (k, j) in Γ. This is useful because since δ ∼ N(0,Λ) the elements δj are independent. We may also write the model as

Yk = ˜σk exp −τk2/2 +τk

K

X

j=1

γkjp λjηj

!

−1

!

where η∼N(0, I). Using this expression the MGF of the sum P

Yk can be written as MPYk(t) =

Z

−∞

· · · Z

−∞

K

Y

k=1

exp (

−tσ˜k exp

"

−τk2/2 +τk

K

X

j=1

γkjp λjηj

#

−1

!)

× 1

(2π)K/2 exp

−1 2ηTη

1...dηK. (13) Then utilizing a small change of variable and applying Gauss-Hermite expansion itera- tively, as done in appendix D on page 20, yields

MPYk(t)≈ΨˆPYk(t;σ,τ,Σ)

=

N

X

nK=1

· · ·

N

X

n1=1

wn1· · ·wnK πK/2

K

Y

k=1

exp (

−t˜σk exp

"

−τk2/2 +τk

K

X

j=1

γkjp 2λjanj

#

−1

!) . (14)

(14)

3.3.3 Dimension reduction

The spectral decomposition gives us the option of dropping the smallest principal com- ponents and only including the L < K largest ones. This gives us the following approxi- mation.

k=

K

X

j=1

γkjδj

L

X

j=1

γkjδj

The idea is that for L < k ≤ K, Var(δk) = λk will be close to zero and δk will be approximately N(0,0). Moving forward with this yields the approximate MGF of the sum

MPYk(t)≈ Z

−∞

· · · Z

−∞

K

Y

k=1

exp (

−t˜σk exp

"

−τk2/2 +τk L

X

j=1

γkj

jηj

#

−1

!)

× 1

(2π)L/2exp

−1 2ηTη

1...dηL (15) and the Gauss-Hermite expansion

MPYk(t)≈ΨˆPYk(t;σ,τ,Σ)

=

N

X

nL=1

· · ·

N

X

n1=1

wn1· · ·wnL πL/2

K

Y

k=1

exp (

−t˜σk exp

"

−τk2/2 +τk

L

X

j=1

γkjp 2λjanj

#

−1

!) . (16) The eciency gain in using (16) instead of (14) will be examined below.

4 Numerical experiments

4.1 Accuracy

In Figure 32 the absolute error of the dierent methods are compared in a problem with K = 6 risks and K = 12 risks. The x-axis denotes the number of large risks with a large scale (σk = 4) and heavy skewness (τk = 1.2). The rest of the K risks where smaller and with much lighter skewness (σk = 1 and τk = 0.25). For each set of parameters the resulting 99.5th percentiles where compared with that of a Monte Carlo simulation of the sum with a size of 107 simulations. For theK = 6 problem at-value of0.1where chosen, while in the K = 12 problem t was set to 0.08 for both the Cholesky and full Spectral (PC) approach. For the PC model with L=K−1 t was set equal to 0.05. These values are believed to provide good results based on analysis in section 4.2. The correlation matrices of (k) where set equal to the 6- and 12-dimensional correlation matrices from Solvency II documentation, see EIOPA (2014) or page 23. In Solvency II these matrices are meant to dene correlation on the risk-level, i.e. correlation of (Yk). However, the resulting correlation matrix of (k), given the correlation of (Yk), will not always end up being positive denite for any choice of σk and τk. We avoid this by directly dening Σ = cor(k) equal to the correlation matrix in Solvency II. By starting with a valid correlation structure of (k) the resulting correlation matrix of (Yk) will be necessarily positive denite. In this experiment cor(Yk)will change for each set of(τk)and(σk)along

2This section refers to gures placed towards the end of the document on page 14.

(15)

the x-axis. However, the resulting matrices are easily calculated through the results in the appendix on page 23, specically using result (E.8), or via the R-code on page 35.

Firstly note the poor performance of the classical Solvency II Standard formula. The Standard formula especially has problems with the heavily skewed risks. Since the risks stray further and further away from normality asτkincreases this is expected. The method introduced in Bølviken and Guillen (2017) provide very good results in the case of only lightly skewed risks. However, when larger, more heavy-tailed risks are introduced the Bølviken-Guillen (BG) method deteriorates abruptly in performance. This method along with the Standard formula seem to do worse and worse as more and more heavy-tailed risks are introduced. The BG method also seems to have trouble when introduced to a large number of risks as in the K = 12 problem.

In the K = 6 problem in Figure 3a the MGF-methods arguably perform better than BG as they handle the large risks better. Figure 3a might also suggest that the values of σk and τk plays a part in determining the value of t that minimizes the absolute error.

The K = 12 problem provides similar results. The BG method performs very well when all risks are small and lightly skewed. However, when introduced to large risks the method again immediately suers. The MGF methods also show a slightly similar tendency of performing the best when only having small risks with small skewness. Else the absolute accuracy seems relatively constant as we introduce more and more large- scaled risks. This shows considerable improvement on BG and especially the Standard formula. Using the PC-decomposition and discarding the last component also yield good results, albeit slightly worse than using the full model L = K. One can note for zero large risks the optimal t in the L = K −1 case is relatively far from the selected value t = 0.05in 3b which explains the low accuracy in that point. The sensitivity in the design parameter t will be discussed below in section 4.2.

One may dierentiate the large and small risks even more by building a new correlation structure. Consider for example letting the large risks be more strongly correlated with each other and much less correlated with the smaller ones. However, this does not seem to have a great impact on the performance of the dierent methods. This was tested, but not plotted in this document.

The MGF methods also seem to scale down to small-dimensional problems well with K as low as 3 being tested. The resulting plots showed a similar story to that of Figure 3 (Not plotted).

4.2 Design of the MGF method

When applying the MGF method to a problem with given parameters we need to make a choice of N, the Gauss-Hermite degree, and the point t for which to solve (12). We start by considering t which will dictate what part of the density we will approximate the best as discussed on page 6.

Consider theK = 12problem on Figure 4 where dierentt-values were tested and the corresponding 99.5th percentile is plotted for the MGF methods. An even mix of large and small risks were used as in section 4.1. Increasingt makes the resulting density of the new variable YS t the lower end of the distribution well, but the 99.5th percentile will be underestimated. On the other hand, a t-value that is too small will provide a good t further out than our target percentile. In which case the lower part of the distribution will be aected by this which will mean that our target 99.5th percentile will be overestimated.

For this problem we seem to hit a sweet spot somewhere between 0.05and0.1. Choice of

(16)

N seem to interact with the resulting plots when comparing theN = 4 case in Figure 4b and the N = 3 case in Figure 4a.

Discarding principal components has a noticeable eect, especially on low t-values.

However, decent accuracy is still obtainable in the K = 12 problem. Another interesting aspect is how much the Cholesky approach degrades for very high t-values compared to the PC approach. This is also observable in the K = 6 problem in Figure 3a.

For theK = 6 problem the similar balanced large-small risk distributions were tested along with letting every risk be small. For the balanced scenario, the results were similar to above with the exception of the reduced model L = 6 which performed very close to the full models. When using only light risks both the Cholesky and PC approach (with L=K) are very exible in terms oft-values that yield near-perfect results. On Figure 5b we observe that any t-value up to 0.5 yield great results, with signicant error building up only with values higher than this. This suggests that the nature of the risks has a signicant impact on how the methods react to dierent values of t.

4.3 Computing time experiments

Along with K, which is already given by the problem one will be facing, N will dictate the computational complexity of the MGF methods. By considering the rst K sums in the expression in (11) we already end up with NK terms. This is before including the product and sum in the middle and right end of the expression. With this in mind, N and K will both dramatically increase the number of operations necessary to evaluate the resulting expressions. Similar observations are valid for the Spectral approach with the expressions (14) and (16). Minimizing N as much as possible while still achieving a reasonable accuracy should therefore be a high priority.

The time needed for the dierent procedures in the K = 12 and K = 7 problems is plotted on Figure 6 both for the Cholesky approach and the Spectral approach with dierent values of L. This is shown as a function of N. For this problem an even mix of large, heavy-tailed risks and smaller, lightly-tailed risks were chosen. The details may be found in the R-code on page 42 and 44. Here,twas set equal to1. Note that changing the risk parametersσkk,Σorthave negligible eect on the time spent computing. However, it is worth mentioning that not all values of t provide the problem with a solution. Low values of Lin particular does not yield a solution to (12) for low values of t.

First we will consider theK = 12problem on Figure 6a. The computation time seems to suddenly spike for the Cholesky approach and Spectral approach for large values of L. This makes sense considering the number of terms NK (or NL) noted above. Note that the Cholesky approach seems to be more ecient than using a Spectral decomposition with L = K. However, by letting L = 11 we may potentially reduce the time required even further. For the K = 12 problem going from N = 3 toN = 4 represents a big jump in computational time for the Cholesky and Spectral approaches. Yet, we notice from section 4.2 that N = 3can achieve good results with less than 30 seconds of computation time.

Now for the smaller K = 7problem in Figure 6b we obtain reasonably small compu- tation times even for very large N. Further experiments indicate that here is little to no improvement in increasing N further than 4. For the K = 7 this problem will then be solved in 1 second. We still note that the Cholesky approach edges out the full Spectral approach in terms of time needed to solve the problem.

(17)

5 Conclusion

The MGF methods consistently provide the best results in terms of accuracy. In the previous section the MGF methods have been shown to provide good accuracy in across a number of dierent scenarios. We have looked at scenarios with only lightly skewed risks, with only heavily skewed risks and with a mixture of lightly and heavily skewed risks. The method of Bølviken and Guillen (2017) handles the rst scenario well, but its accuracy suers when introduced to more heavily skewed risks. The MGF methods presented here does not seem to have this problem but instead produce consistently good results across all of the scenarios.

The Standard formula handled the log-normal risks quite poorly, but on the other hand it is, along with the BG method, quite simple to implement. The MGF methods, on the other hand, are relatively demanding to implement and require a design parameter t to be set. The sensitivity of the choice of t is dependent on the risks and should be studied further if one were to consider implementing the MGF methods into a system like Solvency II which handles dierent types of risk. EIOPA would either have to set the value of t pre-determinately or provide some form of automatic procedure which would provide a t-value. If an insurance company could freely choose its t-values then it could knowingly underestimate the SCRs by intentionally setting them too high as shown on Figure 2.

For a problem with a high number of risks computation time for the MGF methods may be a concern for some. The time required was shown to be non-trivial in the 12- dimensional problem. A dimension reduction method was developed with such problems in mind and tested along with the full models. Combining this with reducing the Gauss- Hermite parameter N to 3 was shown to signicantly reduce computation time while still oering good accuracy mostly on par with the full-model approaches.

(18)

References

Bølviken, E., (2014) Computation and modelling in insurance and nance. Cambridge University Press, page 150.

Bølviken, E., (2017) "Solvency II quickly". In preparation.

Bølviken, E., & Guillen, M. (2017). Risk aggregation in Solvency II through recursive log-normals. Insurance: Mathematics and Economics, Vol. 73, pages 20-26.

Commission Delegated Regulation (2015). In the Ocial Journal of the European Union. Available as http://eur-lex.europa.eu/legal-content/EN/TXT/PDF/?uri=

OJ:L:2015:012:FULL&from=EN

EIOPA (2014). Technical Specication For The Preparatory Phase (Part I).

Available as https://eiopa.europa.eu/Publications/Standards/A_-_Technical_

Specification_for_the_Preparatory_Phase__Part_I_disclaimer.pdf

Mehta, N. B., Wu, J., Molisch, A. F., & Zhang, J. (2007). Approximating a sum of random variables with a lognormal. IEEE Transactions on Wireless Communications, Vol.6, No. 7, pages 2690-2699.

Press, W. H., Flannery, B. P., Teukolsky, S. A., & Vetterling, W. T. (2007). Numerical recipes: The Art of Scientic Computing. 3rd ed. Cambridge University Press, Cam- bridge.

(19)

Appendices

A Figures - Numerical experiments

(a) The K= 6 problem. N = 4.

(b) The K= 12 problem. N = 3

Figure 3: Distance between the 99.5th percentiles of the Monte Carlo simulations and for the methods in the K = 6 and K = 12 problems with increasing number of large, heavy-tailed risks.

(20)

(a) N = 3.

(b) N = 4.

Figure 4: Resulting 99.5th percentiles for dierent values of t in the K = 12 problem.

Note non-linear x-axis.

(21)

(a) 3 heavy risks with (σk, τk) = (4,1.2) and 4 light risks with (σk, τk) = (1,0.25).

(b) Only light risks

Figure 5: Resulting 99.5th percentiles for dierent values of t in the K = 7 problem.

Note non-linear x-axis.

(22)

(a) The K = 12 problem. Using 6 risks with (σk, τk) = (4,1.2) and 6 with (σk, τk) = (1,0.25).

(b) The K = 7 problem. Using 3 risks with (σk, τk) = (1.4,4) and 4 with (σk, τk) = (0.25,1).

Figure 6: Time used to solve ΨPYk = ΨYS for dierent values N. Comparing the Cholesky approach to the spectral decomposition approach for dierent values of L. Using correlation matrix from Solvency II documentation.

(23)

B PDF and MGF

B.1 PDF of our log-normal model

Our model variable Y is given as a function of a regular log-normal variable V through the transformation g

Y =σe−τ2/2+τ −1

eτ2 −1 = ˜σ(V −1) =g(V) (B.1) whereσ˜= σ

eτ2 −1 andV =e−τ2/2+τ ∼lnN(−τ2/2, τ). Then by the change of variable formula the density of Y is readily obtained through

fY(y) =

d

dyg−1(y))

fV(g−1(y))

= 1

˜ σfV y

˜ σ + 1

(B.2) where

fV(v) = 1 vτ√

2π exp

−(lnv−(−τ2/2))22

. The support of Y is (−˜σ,∞).

B.2 The moment-generating function of Y and Gauss-Hermite

The moment-generating function of a random variable is dened as MY(t) =E

e−tY

, t∈R

wherever this expectation exists. Using equation (B.2) this becomes MY(t) =E

e−tY

= Z

−˜σ

exp(−ty)fY(y)dy

= Z

−˜σ

exp(−ty)1

˜ σ

1 (y

˜

σ + 1)τ√ 2π

exp

−

(ln(y

˜

σ + 1) +τ2/2)22

dy. (B.3) In order to end up with the expression in (10) we need to use substitution to rst arrive at an integrand compatible with Gauss-Hermite integration. Set

z = ln(y/˜σ+ 1) +τ2/2

√2τ giving y= ˜σh exp(√

2τ z−τ2/2)−1i . We then get

dz

dy = 1

√2τ(˜σ+y) ordy =√

2τ(˜σ+y)dz.

Finally, consider the integration limits. If y → −˜σ then ln(y

˜

σ + 1) → −∞and z → −∞. Now if y → ∞then ln(y

˜

σ + 1)→ ∞ and z → ∞. Therefore the integral (B.3) becomes My(t) =

Z

−∞

√1

πexph

−tσ˜ exp(√

2τ z−τ2/2)−1i

exp(−z2)dz

N

X

n=1

wn

√πexp h

−tσ˜

exp(√

2τ xn−τ2/2)−1

i (B.4)

(24)

which is the expression in (10).

C The Cholesky decomposition

C.1 Change of variable

We have the full expression of the MGF of P

Yk as on page 6:

MPYk(t) = Z

−∞

· · · Z

−∞

K

Y

k=1

exp(−t˜σk(exp(xk)−1)) 1

(2π)K/2|C|1/2 exp

−1

2[x−(−τ2/2)]TC−1[x−(−τ2/2)]

dx1...dxK (C.1) We will now do two things with one change of variable. Firstly, we want to obtain inde- pendence in the integrand, i.e. we want expression like (C.1) but where x independent.

Secondly, we want to get an integrand compatible with Gauss-Hermite integration. In order to achieve the rst goal, we consider the Cholesky decomposition in order to decor- relate x. X = (Xk) is the Gaussian vector Xk = −τk2/2 +τkk and the matrix C is the covariance matrix ofX. LetC =LLT be its Cholesky decomposition. We then know that if we consider an arbitrary vector z with mean zero and covariance equal to the identity matrix that the transformation Lz will be a mean-zero vector with covariance matrix equal to C. If we furthermore multiply by √

2 and add−τ2/2 to the transformation our integrand will be Gauss-Hermite compatible. Our change of variable then becomes

x=√

2Lz−τ2/2 or

xk=√ 2

K

X

i=1

lkizi−τk2/2, k = 1, ..., K.

We then get dxk dzi =√

2lki and the Jacobian becomes J(z) =√

2Land |J(z)|= 2K/2|L|= 2K/2|C|1/2. The MGF then becomes

M(PYk)(t) =E[e−tPYk] = Z

−∞

· · · Z

−∞

K

Y

k=1

exp (

−t˜σk exp

"

√2

K

X

i=1

lkizi−τk2/2

#

−1

!)

× 1

(2π)K/2|C|1/2 exp

−1 2(√

2Lz)TC−1(√ 2Lz)

2K/2|C|1/2dz1...dzK

= Z

−∞

· · · Z

−∞

K

Y

k=1

exp (

−t˜σk exp

"

√ 2

K

X

i=1

lkizi−τk2/2

#

−1

!)

exp(−zTz)dz1...dzK. (C.2)

C.2 Applying Gauss-Hermite

We factor out z1 and apply Gauss-Hermite expansion with respect to z1:

(25)

M(PYk)(t) = Z

−∞

· · · Z

−∞

1

π(K−1)/2 exp −

K

X

i=2

zi2

! N X

n1=1

wn1

π1/2f(1)(z(−1), an1;t)dz2...dzK+R(1)N where we have some error term in the end and

f(1)(z(−1), an1;t) =

K

Y

k=1

exp −t˜σk

"

exp(√ 2

K

X

i=2

lkizi+√

2li1an1 −τk2/2)−1

#!

. Then again with z2

M(PYk)(t)

= Z

−∞

· · · Z

−∞

1

π(K−2)/2 exp −

K

X

i=3

z2i

! N X

n2=1 N

X

n1=1

wn1wn2

π2/2 f(2)(z(−2), an1, an2;t)dz3...dzK +R(2)N where

f(2)(z(−2), an1, an2;t) =

K

Y

k=1

exp −t˜σk

"

exp(√ 2

K

X

i=3

lkizi+√ 2

2

X

m=1

limanm−τk2/2)−1

#!

. By continuing this we end up the expression

M(PYk)(t)

=

N

X

nK=1

· · ·

N

X

n1=1

wn1· · ·wnK πK/2

K

Y

k=1

exp (

−t˜σk

"

exp √ 2

K

X

j=1

lkjanj−τk2/2

!

−1

#)

+RN(K). (C.3) This then justies the expression in (11) where the error term is discarded.

D The Spectral decomposition

D.1 Change of variables

The strategy and technique is the same as with the Cholesky decomposition. We start with the expression in (13) and apply a small change of variables before we iteratively apply Gauss-Hermite expansion with respect to the variables of integration. We start with

MPYk(t) = Z

−∞

· · · Z

−∞

K

Y

k=1

exp (

−t˜σk exp

"

−τk2/2 +τk

K

X

j=1

γkjp λjηj

#

−1

!)

× 1

(2π)K/2 exp

−1 2ηTη

1...dηK. Settingη=√

2zwill remove the fraction in the last exponent. This gives us the Jacobian J(z) = √

2I and |J(z)|= 2K/2 and the expression above becomes

(26)

Z

−∞

· · · Z

−∞

K

Y

k=1

exp (

−tσ˜k exp

"

−τk2/2 +τk

K

X

j=1

Γkjp 2λjzj

#

−1

!)

× 1

πK/2 exp −

K

X

k=1

zk2

!

dz1...dzK.

D.2 Applying Gauss-Hermite

First with respect to z1: MPYk(t) =

Z

−∞

· · · Z

−∞

1 π(K−1)/2

N

X

n1=1

wn1

π1/2f(1)(z(−1), an1) exp −

K

X

k=2

zk2

!

dz2...dzK +R(1)N where we have some error term in the end and

f(1)(z(−1), an1) =

K

Y

k=1

exp (

−t˜σk exp

"

−τk2/2 +τk

K

X

j=2

Γkjp

jzjk

1

X

j=1

Γkjp 2λjanj

#

−1

!) . Then again with respect z2:

MPYk(t) = Z

−∞

· · · Z

−∞

1 π(K−2)/2

N

X

n2=1 N

X

n1=1

wn1wn2

π2/2 f(2)(z(−2), an1, an2) exp −

K

X

k=3

zk2

!

dz2...dzK+R(2)N where

f(2)(z(−2), an1, an2) =

K

Y

k=1

exp (

−t˜σk exp

"

−τk2/2 +τk

K

X

j=3

Γkjp

jzjk

2

X

j=1

Γkjp 2λjanj

#

−1

!) . Continuing this yields

MPKY

k(t) =

N

X

nK=1

· · ·

N

X

n1=1

wn1· · ·wnK

πK/2

K

Y

k=1

exp (

−tσ˜k exp

"

−τk2/2 +τk

K

X

j=1

Γkjp 2λjanj

#

−1

!)

+R(K)N (D.4) which then justies the expression in (14) where the error term is discarded.

D.3 Dimension reduction

The same strategy applies in this case, but still written out here for completeness. We start with expression (15).

MPYk(t)≈ Z

−∞

· · · Z

−∞

K

Y

k=1

exp (

−t˜σk exp

"

−τk2/2 +τk

L

X

j=1

γkjp λjηj

#

−1

!)

× 1

(2π)L/2 exp

−1 2ηTη

1...dηL (D.5)

Referanser

RELATERTE DOKUMENTER

In the analyses we compare data from a new chronology of plots and attacks by jihadis in Europe in 2014-16 (Appendix 1)[5], with findings from three previous studies by FFI’s

Keywords: gender, diversity, recruitment, selection process, retention, turnover, military culture,

The system can be implemented as follows: A web-service client runs on the user device, collecting sensor data from the device and input data from the user. The client compiles

In April 2016, Ukraine’s President Petro Poroshenko, summing up the war experience thus far, said that the volunteer battalions had taken part in approximately 600 military

This report documents the experiences and lessons from the deployment of operational analysts to Afghanistan with the Norwegian Armed Forces, with regard to the concept, the main

Overall, the SAB considered 60 chemicals that included: (a) 14 declared as RCAs since entry into force of the Convention; (b) chemicals identied as potential RCAs from a list of

An abstract characterisation of reduction operators Intuitively a reduction operation, in the sense intended in the present paper, is an operation that can be applied to inter-

Risk is defined as “the product of frequency and consequence” (ibid.), and can be expressed by the formula of (Risk = probability X consequence 6 ). The risk ma- trix is the