This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Minimizing Ruin Probability Under Dependencies for Insurance Pricing

R.L. Gudmundarson 111E-mail address: rlg2000@hw.ac.uk Edinburgh Business School, Heriot-Watt University M. Guerra 222E-mail address: mguerra@iseg.ulisboa.pt ISEG-School of Economics and Management, Universidade de Lisboa; REM - Research in Economics and Mathematics, CEMAPRE A. B. de Moura 333E-mail address: amoura@iseg.ulisboa.pt ISEG-School of Economics and Management, Universidade de Lisboa; REM - Research in Economics and Mathematics, CEMAPRE
Abstract

In this work the ruin probability of the Lundberg risk process is used as a criterion for determining the optimal security loading of premia in the presence of price-sensitive demand for insurance. Both single and aggregated claim processes are considered and the independent and the dependent cases are analyzed. For the single-risk case, we show that the optimal loading does not depend on the initial reserve. In the multiple risk case we account for arbitrary dependency structures between different risks and for dependencies between the probabilities of a client acquiring policies for different risks. In this case, the optimal loadings depend on the initial reserve. In all cases the loadings minimizing the ruin probability do not coincide with the loadings maximizing the expected profit

1 Introduction

Insurance is based on the idea that society asks for protection against unforeseeable events which may cause serious financial damage. Insurance companies offer a financial protection against these events. The general idea is to build a community where everybody contributes a certain amount and those who are exposed to the damage receive financial reimbursement [35] .

When (non-life) insurers set premium prices they usually start by finding the so-called pure premium, which is the expected value of the total claims that will occur in one time unit. However, when pricing insurance policies, insurers must take into account the risk associated with the policy as well as additional costs (e.g. operational cost, capital cost, etc.). Therefore, a so-called security loading is added to cover the risk and additional costs. The security loading is often calculated using some premium calculation principle, and the insurance premium is obtained once the security loading has been determined and added to the pure premium. The main concerns are usually whether the loading is an appropriate measure of the risk and which premium principle to choose. The higher the loading the higher the premium and consequently, the underwriting risk will be lower. However, if the premium price is too high then the exposure will be too low due to competition, and the operational cost of the insurer will engulf the premium income resulting in financial instability. Therefore, insurers usually require sophisticated premium calculations in order to secure stability.

Collective risk models are fundamental in actuarial science to model the aggregate claim amount of a line of business in an insurance company. The collective risk model has two random components, the number of claims and the severity of claims, and is usually modelled with a compound process [15, Chapter 3]. The classical Lundberg risk process has been studied extensively and there exist many variations, for example including reinsurance or investments [14]. It assumes that premia come in a continuous stream while claims happen at discrete times according to a Poisson distribution.

Another common assumption is that the risk can be divided into groups of homogeneous risks such that the pure premia and security loadings can be estimated separately for each risk group. The pure premia of these individual groups are usually modelled with generalized linear models (GLM). GLM’s have been applied extensively in actuarial work and a good overview is provided in [26, 36]. Traditional risk theory has usually assumed independence between risks due to its convenience, but it is generally not very realistic. Claims in an insurer’s risk portfolio are correlated as they are subject to the same event causes [7]. Completely homogeneous risk groups are extremely rare and dependence among risks has become a flourishing topic in actuarial literature [2]. Dependence has mostly been measured through linear correlation coefficients [4]. The popularity of linear coefficient is mainly due to the ease with which dependencies can be parameterized, in terms of correlation matrices. Most random variables, however, are not jointly elliptically distributed and it could be very misleading to use linear coefficients [30]. This motivated the use of concordance measures. Two random variables are concordant when large values of one go with large values of the other [25]. The Lundberg risk model is a Lévy jump process, [8] which means that the dependency of two claim processes is best explained through their Lévy measure [32]. This study will not go into details about Lévy processes, but both [8] and [27] provide a very good introduction, and [2, 5, 28, 34, 1] are examples of applications of Lévy copulas to risk processes. For example, van Velesen [34] showed how Lévy copulas can be used in operational modelling and discussed how dependence is implied by the Lévy copula. In this work we consider bivariate claim processes, but the presented theory can be straightforwardly extended to multiple claim processes.

Ruin probability is a classical measure of risk and has been extensively studied [15, 14, 18, 33]. Although there is no absolute meaning to the probability of ruin, it still measures the stability of insurance companies. A high ruin probability indicates instability, and risk mitigation techniques should be used, like reinsurance or raising premia [15]. Most non-life insurance products have a term of one year and therefore it can be argued that the one year ruin probability should be used. The one year ruin probability is the probability that the capital of an insurance company will hit zero within one year. However, the appropriateness of risk measures defined over fixed time horizons can be questioned, since ruin in a given time span can be minimized by increasing the probability of ruin in the aftermath of that period. Lundberg concluded that the actual assumptions behind the classical collective risk model are in fact less restrictive when time-invariant quantities like the infinite time ruin probability are considered [33]. Therefore, we focus on the infinite time ruin probability in this paper.

In this work, the optimal loadings based on two strategies are derived, and compared. One strategy maximizes the profit and the other minimizes the ruin probability. We show that the two loading strategies give different results. Furthermore, we show how the optimal loading with respect to the ruin probability can be found and compare it to the one obtained when the expected profit is maximized. We consider dependencies and illustrate how Lévy copulas can be used to model claim process dependencies and how dependencies can affect the riskiness of the insurance portfolio. We take this idea further and consider dependency between the acquisition of insurance for different risks by policyholders. This is a realistic assumption as policyholders usually buy multiple insurance products from the same insurance company. We also take into account the fact that the market risk process and the company’s risk process are not the same, and how the company’s risk process depends on its exposure to the market. This is, to our knowledge, the first analysis of the interplay of the ruin probability, the dependency structure of claim, and the dependency structure of acquisition of insurance. We demonstrate that even if there is a strong dependency between insurance products within the market, small insurance companies have less dependency and therefore less risk than bigger insurance companies, provided the dependency between acquisition of insurance for different risks is not too strong.

The paper is organized as follows: Section 2 contains some background material about ruin probabilities in the Lundberg process and aggregation of compound Poisson processes. Section 3 deals with the single-risk case. We characterize the optimal loading and compare it with the loading maximizing the expected profit. Section 4 handles the multiple risks case. We show how the dependency structure existing in the market (i.e. the general population) translates into the risk exposure of the company through its market shares on different risks and the likelihood that clients acquire insurance for more than one risk. Section 5 contains a numerical illustration. A numerical scheme to compute the ruin probabilities is given in the appendix.

2 Preliminaries

2.1 Claim and Surplus Processes

The Lundberg risk model describes the evolution of the capital of an insurance company and assumes that exposure is constant in time, losses follow a compound Poisson process, and premia arrive at a fixed continuous rate:

Xt=u+cti=0NtYi=u+ctSt,Y00,X_{t}=u+ct-\sum_{i=0}^{N_{t}}Y_{i}=u+ct-S_{t},\qquad Y_{0}\coloneqq 0,

where uu is the initial surplus, cc is the risk premium rate, NtN_{t} is a time homogeneous Poisson process with intensity parameter λ\lambda, and YiY_{i} are i.i.d. random variables representing the severity of claim ii, i=0,,Nti=0,\dots,N_{t}. Here it is assumed that YiY_{i} are positive. In the following sections, YY denotes an arbitrary random variable with the same distribution as any YiY_{i}. The severity distribution is denoted as F(x)F(x) and the severity survival distribution as F¯(x)\overline{F}(x). StS_{t} is a compound Poisson process and thus XtX_{t} is a stochastic process (sometimes called the surplus process) representing the insurance wealth at time tt. XtX_{t} increases because of earned premia and decreases when claims occur. When the capital of an insurance company hits zero, the insurance company is said to be ruined. Formally, the ruin probability is defined as follows.

Definition 2.1 (Probability of Ruin).

Let (Ω,,{t}t0,)(\Omega,\mathcal{F},\{\mathcal{F}_{t}\}_{t\geq 0},\mathbbm{P}) be a filtered probability space and X=(Xt)t[0,[X=(X_{t})_{t\in[0,\infty[} a surplus process which is adapted and Markov with respect to the filtration. The state space is (,())(\mathbbm{R},\mathcal{B}(\mathbbm{R})). If X is time homogeneous, the infinite time ruin probability is the function V:[0,1]V:\mathbbm{R}\mapsto[0,1] such that

V(x)=(s[0,+[:Xs0|X0=x),x.V(x)=\operatorname{\mathbb{P}}\big{(}\exists s\in[0,+\infty[:X_{s}\leq 0\>\big{|}\>X_{0}=x\big{)},\quad x\in\mathbbm{R}.

Sometimes it is useful to use the survival (non-ruin) probability, defined as V¯(x)=1V(x)\overline{V}(x)=1-V(x). The ruin probability can be calculated using the following integro-differential equation [11].

Proposition 2.1.

Assume that XtX_{t} is defined as above and the premium rate satisfies c>λ𝔼[Y]c>\lambda\operatorname{\mathbb{E}}[Y]. If VC1(]0,[),V\in C^{1}(]0,\infty[), then the probability of ruin with infinite time horizon satisfies the following equation:

0=cddxV(x)+λ(0xV(xy)𝑑F(y)V(x)+1F(x)),x>0,\displaystyle 0=c\frac{d}{dx}V(x)+\lambda\bigg{(}\int_{0}^{x}V(x-y)dF(y)-V(x)+1-F(x)\bigg{)},\quad x>0, (2.1)

with the following boundary condition:

{V(x)=1x0,limx0+V(x)=λc𝔼[Y].\begin{cases}V(x)=1&x\leq 0,\\ \lim_{x\to 0^{+}}V(x)=\frac{\lambda}{c}\operatorname{\mathbb{E}}[Y].\\ \end{cases}

Furthermore, the probability of non-ruin satisfies the following equation:

V¯(x)V¯(ϵ)=λcϵxV¯(xy)F¯(y)𝑑y\overline{V}(x)-\overline{V}(\epsilon)=\frac{\lambda}{c}\int_{\epsilon}^{x}\overline{V}(x-y)\overline{F}(y)dy (2.2)

for 0<ϵx<+0<\epsilon\leq x<+\infty with the following boundary condition:

{V¯(x)=0x0,limx0+V¯(x)=1λc𝔼[Y].\begin{cases}\overline{V}(x)=0&x\leq 0,\\ \lim_{x\to 0^{+}}\overline{V}(x)=1-\frac{\lambda}{c}\operatorname{\mathbb{E}}[Y].\\ \end{cases}

A numerical scheme solving equation (2.1) can be found in Appendix A.

2.2 Accounting for Claim Dependencies

Consider the surplus process 𝑿=(Xt(1),,Xt(n))\bm{X}=(X_{t}^{(1)},...,X_{t}^{(n)}) where

Xt(1)=u(1)+c(1)ti=0Nt(1)Yi(1)Xt(n)=u(n)+c(n)ti=0Nt(n)Yi(n)\begin{split}X_{t}^{(1)}&=u^{(1)}+c^{(1)}t-\sum_{i=0}^{N_{t}^{(1)}}Y_{i}^{(1)}\\ \vdots&\\ X_{t}^{(n)}&=u^{(n)}+c^{(n)}t-\sum_{i=0}^{N_{t}^{(n)}}Y_{i}^{(n)}\\ \end{split} (2.3)

If these processes are independent, it is relatively easy to combine them into a single process using the aggregation property of compound Poisson processes as described in Wütrich [35]. The aggregation property allows the combination of multiple surplus processes into a single risk process as follows:

Xt=j=1nu(j)+j=1nc(j)ti=0NtYi,X_{t}=\sum_{j=1}^{n}u^{(j)}+\sum_{j=1}^{n}c^{(j)}t-\sum_{i=0}^{N_{t}}Y_{i},

where NtN_{t} is a Poisson r.v. with λ=λ1++λn\lambda=\lambda_{1}+...+\lambda_{n} and YiY_{i} are i.i.d. random variables, which follow the severity distribution F(x)=j=1nλjλFj(x)F(x)=\sum_{j=1}^{n}\frac{\lambda_{j}}{\lambda}F_{j}(x). This aggregation property allows us to use the integro-differential equation (2.1) to calculate the ruin of multiple surplus processes.

If the risks are not independent, then we can use the fact that compound Poisson processes are characterized by their Lévy measure to decompose the claim process into independent processes to which the aggregation property can be applied. In particular, for n=2n=2 risks, we obtain the decomposition:

Xt=Xt(1)+Xt(2)=u+ctSt1St2St.\begin{split}X_{t}=X_{t}^{(1)}+X_{t}^{(2)}=u+ct-S_{t}^{1\perp}-S_{t}^{2\perp}-S_{t}^{\parallel}.\end{split}

where S1S^{1\perp} and S2S^{2\perp} are compound Poisson processes accounting for events concerning only risk 1 and risk 2, respectively. SS^{\parallel} is a compound Poisson process accounting for events concerning both risks simultaneously. Furthermore, S1S^{1\perp}, S2S^{2\perp} and SS^{\parallel} are mutually independent.

In this section, we briefly explain how this can be achieved. Further details can be found in [32]. We will use the following definitions:

Definition 2.2.

The tail integral of a Lévy measure ν\nu on [0,]2[0,\infty]^{2} is given by a function U:[0,]2[0,]U:[0,\infty]^{2}\mapsto[0,\infty]

U(x1,x2)=0ifx1=orx2=,U(x1,x2)=ν([x1,[×[x2,[)for(x1,x2)]0,[2,U(0,0)=.\begin{split}&U(x_{1},x_{2})=0\quad if\quad x_{1}=\infty\quad or\quad x_{2}=\infty,\\ &U(x_{1},x_{2})=\nu\big{(}[x_{1},\infty[\times[x_{2},\infty[\big{)}\quad for\quad(x_{1},x_{2})\in]0,\infty[^{2},\\ &U(0,0)=\infty.\end{split} (2.4)
Definition 2.3 (Lévy Copula for Processes with Positive Jumps).

A two-dimensional Lévy copula for Lévy processes with positive jumps, or for short, a positive Lévy copula, is a 2-increasing grounded function 𝒞:[0,]2[0,]\mathcal{C}:[0,\infty]^{2}\to[0,\infty] with uniform margins, that is, 𝒞(x,)=𝒞(,x)=x\mathcal{C}(x,\infty)=\mathcal{C}(\infty,x)=x.

Similarly to Sklar’s theorem for ordinary copulas [25], it has been shown that the dependency structure of (Xt(1),Xt(2))(X_{t}^{(1)},X_{t}^{(2)}) can be characterized by a Levy copula 𝒞\mathcal{C} such that 𝒞(U1(x1),U2(x2))\mathcal{C}(U_{1}(x_{1}),U_{2}(x_{2})) where U1U_{1} and U2U_{2} are the marginal tail integrals for Xt(1)X_{t}^{(1)} and Xt(2)X_{t}^{(2)}. If U1U_{1} and U2U_{2} are absolutely continuous, this Lévy copula is unique, otherwise it is unique on Range(U1)×Range(U2)Range(U_{1})\times Range(U_{2}), the product of ranges of one-dimensional tail integrals, [8, Theorem 5.4]

Consider a two dimensional claim process:

St=(St(1),St(2))=i=0Nt(Yi(1),Yi(2)),S_{t}=(S_{t}^{(1)},S_{t}^{(2)})=\sum_{i=0}^{N_{t}}(Y_{i}^{(1)},Y_{i}^{(2)}), (2.5)

where NtN_{t} is a Poisson process with intensity λ\lambda and Yi=(Yi(1),Yi(2))Y_{i}=(Y_{i}^{(1)},Y_{i}^{(2)}), ii\in\mathbbm{N} are independent random variables with common joint distribution FYF_{Y}. The components of SS, S(1)S^{(1)} and S(2)S^{(2)}, are one-dimensional compound Poisson processes with intensities λ1\lambda_{1} and λ2\lambda_{2} and severity distributions FY(1)F_{Y^{(1)}} and FY(2)F_{Y^{(2)}}, respectively. We wish to obtain a decomposition:

(St(1),St(2))=i=0Nt1(Yi(1),0)+i=0Nt2(0,Yi(2))+i=0Nt(Yi(1),Yi(2)),(S_{t}^{(1)},S_{t}^{(2)})=\sum_{i=0}^{N_{t}^{1\perp}}(Y_{i}^{(1\perp)},0)+\sum_{i=0}^{N_{t}^{2\perp}}(0,Y_{i}^{(2\perp)})+\sum_{i=0}^{N_{t}^{\parallel}}(Y_{i}^{(1\parallel)},Y_{i}^{(2\parallel)}), (2.6)

where i=0Nt1Yi(1)\sum_{i=0}^{N_{t}^{1\perp}}Y_{i}^{(1\perp)},i=0Nt2Yi(2)\sum_{i=0}^{N_{t}^{2\perp}}Y_{i}^{(2\perp)} and i=0Nt(Yi(1),Yi(2))\sum_{i=0}^{N_{t}^{\parallel}}(Y_{i}^{(1\parallel)},Y_{i}^{(2\parallel)}) are independent compound Poisson processes with intensities λ1\lambda_{1}^{\perp}, λ2\lambda_{2}^{\perp}, λ\lambda^{\parallel} and severity distributions FY1F_{Y^{1\perp}},FY2F_{Y^{2\perp}},FYF_{Y^{\parallel}}, respectively. In the above setting, we consider

FY(0,0)=FY(0,0)=0,FY(1)(0)=FY1(0)=FY(2)(0)=FY2(0)F_{Y}(0,0)=F_{Y^{\parallel}}(0,0)=0,\quad F_{Y^{(1)}}(0)=F_{Y^{1\perp}}(0)=F_{Y^{(2)}}(0)=F_{Y^{2\perp}}(0) (2.7)

A compound Poisson process, SS, is a Lévy process with Lévy measure ν(dx)=λdF(x)\nu(dx)=\lambda dF(x), with tail integral

U(x1,x2)={λ(Y(1)x1,Y(2)x2)if x1>0 or x2>0+if x1=x2=0.U(x_{1},x_{2})=\begin{cases}\lambda\operatorname{\mathbb{P}}\big{(}Y^{(1)}\geq x_{1},Y^{(2)}\geq x_{2}\big{)}&\textrm{if }x_{1}>0\textrm{ or }x_{2}>0\\ +\infty&\textrm{if }x_{1}=x_{2}=0.\end{cases}

The components S(1)S^{(1)} and S(2)S^{(2)} are independent if and only if U(x1,x2)=0U(x_{1},x_{2})=0 for every (x1,x2)]0,+[(x_{1},x_{2})\in]0,+\infty[, i.e., if and only if limx10+,x20+U(x1,x2)=0\lim_{x_{1}\to 0^{+},x_{2}\to 0^{+}}U(x_{1},x_{2})=0.

The Lévy measure of the processes S(i)S^{(i)}, i=1,2i=1,2, have tail integrals

U1(x1)=λ1(Y(1)x1)=U(x1,0)U2(x2)=λ2(Y(2)x2)=U(0,x2)\begin{split}U_{1}(x_{1})&=\lambda_{1}\operatorname{\mathbb{P}}\big{(}Y^{(1)}\geq x_{1}\big{)}=U(x_{1},0)\\ U_{2}(x_{2})&=\lambda_{2}\operatorname{\mathbb{P}}\big{(}Y^{(2)}\geq x_{2}\big{)}=U(0,x_{2})\end{split}

Taking equation (2.7) into account, one obtains

λi=limxi0+Ui(xi),i=1,2\lambda_{i}=\lim_{x_{i}\to 0^{+}}U_{i}(x_{i}),\quad i=1,2
λ=limx1,x20+(U1(x1)+U2(x2)U(x1,x2))=λ1+λ2λ\lambda=\lim_{x_{1},x_{2}\to 0^{+}}\big{(}U_{1}(x_{1})+U_{2}(x_{2})-U(x_{1},x_{2})\big{)}=\lambda_{1}+\lambda_{2}-\lambda^{\parallel}
λ=limx1,x20+U(x1,x2)\lambda^{\parallel}=\lim_{x_{1},x_{2}\to 0^{+}}U(x_{1},x_{2})
λi=λiλ,i=1,2.\lambda_{i}^{\perp}=\lambda_{i}-\lambda^{\parallel},\quad i=1,2.

The severity distributions FY1F_{Y^{1\perp}}, FY2F_{Y^{2\perp}}, and FYF_{Y^{\parallel}} can be recovered from the tail integrals:

(Y1x1)=1λ1limx20+(U1(x1)U(x1,x2))\operatorname{\mathbb{P}}\big{(}Y^{1\perp}\geq x_{1}\big{)}=\frac{1}{\lambda_{1}^{\perp}}\lim_{x_{2}\to 0^{+}}\Big{(}U_{1}(x_{1})-U(x_{1},x_{2})\Big{)}
(Y2x2)=1λ2limx10+(U2(x2)U(x1,x2))\operatorname{\mathbb{P}}\big{(}Y^{2\perp}\geq x_{2}\big{)}=\frac{1}{\lambda_{2}^{\perp}}\lim_{x_{1}\to 0^{+}}\Big{(}U_{2}(x_{2})-U(x_{1},x_{2})\Big{)}
(Y1x1,Y2x2)=1λU(x1,x2).\operatorname{\mathbb{P}}\big{(}Y^{1\perp}\geq x_{1},Y^{2\perp}\geq x_{2}\big{)}=\frac{1}{\lambda^{\parallel}}U(x_{1},x_{2}).

If the dependency between S(1)S^{(1)} and S(2)S^{(2)} is characterized by a Lévy copula, 𝒞\mathcal{C}, i.e. U(x1,x2)=𝒞(U1(x1),U2(x2))U(x_{1},x_{2})=\mathcal{C}(U_{1}(x_{1}),U_{2}(x_{2})) for (x1,x2)[0,+[2(x_{1},x_{2})\in[0,+\infty[^{2}, then the relations above can be written using the Lévy copula and one-dimensional tail integrals:

λ=limu1λ1+,u2λ2+𝒞(u1,u2)\lambda^{\parallel}=\lim_{u_{1}\to\lambda_{1}^{+},u_{2}\to\lambda_{2}^{+}}\mathcal{C}(u_{1},u_{2})
(Y1x1)=1λ1limu2λ2+(U1(x1)𝒞(U1(x1),u2))\operatorname{\mathbb{P}}\big{(}Y^{1\perp}\geq x_{1}\big{)}=\frac{1}{\lambda_{1}^{\perp}}\lim_{u_{2}\to\lambda_{2}^{+}}\Big{(}U_{1}(x_{1})-\mathcal{C}(U_{1}(x_{1}),u_{2})\Big{)}
(Y2x2)=1λ2limu1λ1+(U2(x2)𝒞(u1,U2(x2)))\operatorname{\mathbb{P}}\big{(}Y^{2\perp}\geq x_{2}\big{)}=\frac{1}{\lambda_{2}^{\perp}}\lim_{u_{1}\to\lambda_{1}^{+}}\Big{(}U_{2}(x_{2})-\mathcal{C}(u_{1},U_{2}(x_{2}))\Big{)}
(Y1x1,Y2x2)=1λ𝒞(U1(x1),U2(x2)).\operatorname{\mathbb{P}}\big{(}Y^{1\perp}\geq x_{1},Y^{2\perp}\geq x_{2}\big{)}=\frac{1}{\lambda^{\parallel}}\mathcal{C}(U_{1}(x_{1}),U_{2}(x_{2})).

Using the above methodology, the surplus process can be represented as:

Xt=u+cti=0Nt1Yi1i=0Nt2Yi2i=0Nt(Yi1+Yi2)=u+cti=1NtYi,X_{t}=u+ct-\sum_{i=0}^{N_{t}^{1\perp}}Y_{i}^{1\perp}-\sum_{i=0}^{N_{t}^{2\perp}}Y_{i}^{2\perp}-\sum_{i=0}^{N_{t}^{\parallel}}(Y_{i}^{1\parallel}+Y_{i}^{2\parallel})=u+ct-\sum_{i=1}^{N_{t}^{*}}Y_{i}^{*},

where u=u1+u2u=u_{1}+u_{2}, c=c1+c2c=c_{1}+c_{2}, NN^{*} is a Poisson process with intensity λ=λ1+λ2+λ\lambda=\lambda_{1}^{\perp}+\lambda_{2}^{\perp}+\lambda^{\parallel} and YiY_{i}^{*} are i.i.d. random variables with distribution:

F=λ1λFY1+λ2λFY2+λλFY1+Y2,F^{*}=\frac{\lambda_{1}^{\perp}}{\lambda}F_{Y^{1\perp}}+\frac{\lambda_{2}^{\perp}}{\lambda}F_{Y^{2\perp}}+\frac{\lambda^{\parallel}}{\lambda}F_{Y^{1\parallel}+Y^{2\parallel}},

where

FY1+Y2(x)=x1+x2x𝑑FY(x1,x2).F_{Y^{1\parallel}+Y^{2\parallel}}(x)=\int_{x_{1}+x_{2}\leq x}dF_{Y^{\parallel}}(x_{1},x_{2}).

3 The Optimal Loading for a Single Risk

An insurer can control the volume of its business through the premium loading θ\theta. A reasonable assumption is that the higher the loading, the smaller the number of contracts in its portfolio, which means that the claim intensity (or business volume) will decrease. Therefore, both the claim intensity 𝔼θ[N1]\operatorname{\mathbb{E}}^{\theta}[N_{1}], and the premium rate c(θ)c(\theta), will depend on θ\theta. It is reasonable to assume that 𝔼[N1]=0\operatorname{\mathbb{E}}^{\infty}[N_{1}]=0, as abnormal premium rates will not attract customers [13]. To capture these concepts let 𝔼θ[N1]=λp(θ)\operatorname{\mathbb{E}}^{\theta}[N_{1}]=\lambda p(\theta). Here λ\lambda is the average number of claims per unit of time for the whole market, and p(θ)p(\theta) is the probability that a potential claim is filed as an actual claim to the particular insurer under consideration. In other words, p(θ)p(\theta) reflects the demand or the market share sensitivity to the loading parameter θ\theta. p(θ)p(\theta) can be interpreted as a probability that a customer buys an insurance product. For example, we may assume that demand of insurance contracts is described by a logit glm model as in Hardin and Tabari [12]. Thus, p(θ)p(\theta), will be :

p(θ)=11+eβ0+β1θ,p(\theta)=\frac{1}{1+e^{\beta_{0}+\beta_{1}\theta}}, (3.1)

where β0\beta_{0} and β1\beta_{1} are determined from the glm and θ\theta is the loading parameter. β1\beta_{1} will be a positive number so p0p\to 0 when θ\theta\to\infty and p1p\to 1 when θ\theta\to-\infty. Assuming that the company has some fixed costs, independent of the risk exposure, denoted by r>0r>0, the expression for the net premium income becomes:

c(θ)=(1+θ)𝔼θ[N1]𝔼[Y]r.c(\theta)=(1+\theta)\operatorname{\mathbb{E}}^{\theta}[N_{1}]\operatorname{\mathbb{E}}[Y]-r.

The following proposition characterizes the behaviour of the solution of equation (2.1) with respect to the loading θ\theta.

Proposition 3.1.

If V(x,θ)V(x,\theta) satisfies equation (2.1) then V(x,θ)V(x,\theta) is strictly increasing with respect to the parameter α=𝔼θ[N1]c(θ)\alpha=\frac{\operatorname{\mathbb{E}}^{\theta}[N_{1}]}{c(\theta)}.

Proof.

It is possible to integrate Equation (2.1) on the interval ]0,x]]0,x] to obtain:

V(x,θ)=𝔼θ[N1]c(θ)(𝔼[Y]+0x(V(z,θ)0zV(zy)𝑑F(y)+F(z)1)𝑑z).\begin{split}V(x,\theta)=\frac{\operatorname{\mathbb{E}}^{\theta}[N_{1}]}{c(\theta)}\Bigg{(}\operatorname{\mathbb{E}}[Y]+\int_{0}^{x}\Big{(}V(z,\theta)-\int_{0}^{z}V(z-y)dF(y)+F(z)-1\Big{)}dz\Bigg{)}.\end{split} (3.2)

To prove the proposition, we will study equations of the general form:

u(x)=α(g(x)+0x(u(z)0zu(zy)𝑑F(y))𝑑z).\begin{split}u(x)=\alpha\Bigg{(}g(x)+\int_{0}^{x}\Big{(}u(z)-\int_{0}^{z}u(z-y)dF(y)\Big{)}dz\Bigg{)}.\end{split} (3.3)

We introduce the operator Ψ\Psi, acting on measurable locally bounded functions h:[0,+]h:[0,+\infty]\mapsto\mathbbm{R}, as:

(Ψh)(x)=0x(h(z)0zh(zy)𝑑F(y))𝑑z,x0(\Psi h)(x)=\int_{0}^{x}\Big{(}h(z)-\int_{0}^{z}h(z-y)dF(y)\Big{)}dz,\quad x\geq 0 (3.4)

Notice that the transformation hΨhh\mapsto\Psi h is linear and for every hh, Ψh:[0,+[\Psi h:[0,+\infty[\mapsto\mathbbm{R} is continuous, hence measurable and locally bounded. Thus, powers of the operator Ψ\Psi are defined in the usual way.

Ψ0h=h,Ψnh=Ψ(Ψn1h),n.\Psi^{0}h=h,\qquad\Psi^{n}h=\Psi(\Psi^{n-1}h),\quad n\in\mathbb{N}.

Let, h[0,x]=supz[0,x]|h(z)|\left\lVert h\right\rVert_{[0,x]}=\sup_{z\in[0,x]}|h(z)|. Then:

|(Ψh)(x)|0x(|h(z)|+0z|h(zy)|𝑑F(y))𝑑z2xh[0,x].\begin{split}|(\Psi h)(x)|&\leq\int_{0}^{x}\Big{(}|h(z)|+\int_{0}^{z}|h(z-y)|dF(y)\Big{)}dz\leq 2x\left\lVert h\right\rVert_{[0,x]}.\end{split}

If the inequality

(Ψnh)[0,x]2nxnn!h[0,x]\left\lVert(\Psi^{n}h)\right\rVert_{[0,x]}\leq\frac{2^{n}x^{n}}{n!}\left\lVert h\right\rVert_{[0,x]} (3.5)

holds, for some nn\in\mathbbm{N}, then

|(Ψn+1h)(x)|0x(|(Ψnh)(z)|+0z|(Ψnh)(zy)|𝑑F(y))𝑑z0x22nznn!h[0,x]𝑑z=2n+1xn+1(n+1)!h[0,x].\begin{split}|(\Psi^{n+1}h)(x)|&\leq\int_{0}^{x}\Big{(}|(\Psi^{n}h)(z)|+\int_{0}^{z}|(\Psi^{n}h)(z-y)|dF(y)\Big{)}dz\\ &\leq\int_{0}^{x}2\frac{2^{n}z^{n}}{n!}\left\lVert h\right\rVert_{[0,x]}dz=\frac{2^{n+1}x^{n+1}}{(n+1)!}\left\lVert h\right\rVert_{[0,x]}.\end{split}

Thus, by induction, (3.5) holds for every nn\in\mathbb{N}. Therefore, for every x[0,[x\in[0,\infty[, fixed, there is some nn\in\mathbb{N} such that Ψn\Psi^{n} is a contraction in the space of measureable and bounded functions h:[0,x]h:[0,x]\mapsto\mathbbm{R}. It follows from the contraction principle that equation (3.2) has one unique solution. Further, limn(αnΨn)h=0\lim_{n\to\infty}(\alpha^{n}\Psi^{n})h=0, uniformly in [0,x][0,x] for any given hh and any fixed x[0,+[x\in[0,+\infty[.

Let uα,gu_{\alpha,g} be the solution of equation (3.3) for given gg and α\alpha. Then,

uα,g=α(g+Ψuα,g)=αg+αΨ(α(g+Ψuα,g))=αg+α2Ψg+α2Ψuα,g=αg+α2Ψg++αn+1Ψng+αn+2Ψn+1uα,g.\begin{split}u_{\alpha,g}&=\alpha(g+\Psi u_{\alpha,g})=\alpha g+\alpha\Psi(\alpha(g+\Psi u_{\alpha,g}))=\alpha g+\alpha^{2}\Psi g+\alpha^{2}\Psi u_{\alpha,g}\\ &=\alpha g+\alpha^{2}\Psi g+\dots+\alpha^{n+1}\Psi^{n}g+\alpha^{n+2}\Psi^{n+1}u_{\alpha,g}.\end{split}

Since limnαnΨnuα,g(x)=0\lim_{n\to\infty}\alpha^{n}\Psi^{n}u_{\alpha,g}(x)=0, this shows that uα,gu_{\alpha,g} admits the series representation:

uα,g=n=0αn+1Ψng,\begin{split}u_{\alpha,g}&=\sum_{n=0}^{\infty}\alpha^{n+1}\Psi^{n}g,\end{split}

which converges uniformly with respect to α\alpha on compact intervals. Thus, we can differentiate term by term and obtain

ddαuα,g(x)=n=0(n+1)αn(Ψng)(x)=n=0αn(Ψng)(x)+n=1nαn(Ψng)(x)=1αuα,g+n=1αn(Ψng)(x)+n=2(n1)αn(Ψng)(x)=1αuα,g+n=0αn+1(Ψn+1g)(x)+n=1nαn+1(Ψn+1g)(x)=1αuα,g+(Ψuα,g)(x)+n=1αn+1(Ψn+1g)(x)+n=2(n1)αn+1(Ψn+1g)(x)=1αuα,g+(Ψuα,g)(x)+n=0αn+2(Ψn+2g)(x)+n=1nαn+2(Ψn+2g)(x)=1αuα,g+(Ψuα,g)(x)+(αΨ2uα,g)(x)++(αk1Ψkuα,g)(x)+n=1nαn+k(Ψn+kg)(x)=n=0αn1(Ψnuα,g)(x)=1α2uα,uα,g.\begin{split}\frac{d}{d\alpha}u_{\alpha,g}(x)&=\sum_{n=0}^{\infty}(n+1)\alpha^{n}(\Psi^{n}g)(x)\\ &=\sum_{n=0}^{\infty}\alpha^{n}(\Psi^{n}g)(x)+\sum_{n=1}^{\infty}n\alpha^{n}(\Psi^{n}g)(x)\\ &=\frac{1}{\alpha}u_{\alpha,g}+\sum_{n=1}^{\infty}\alpha^{n}(\Psi^{n}g)(x)+\sum_{n=2}^{\infty}(n-1)\alpha^{n}(\Psi^{n}g)(x)\\ &=\frac{1}{\alpha}u_{\alpha,g}+\sum_{n=0}^{\infty}\alpha^{n+1}(\Psi^{n+1}g)(x)+\sum_{n=1}^{\infty}n\alpha^{n+1}(\Psi^{n+1}g)(x)\\ &=\frac{1}{\alpha}u_{\alpha,g}+(\Psi u_{\alpha,g})(x)+\sum_{n=1}^{\infty}\alpha^{n+1}(\Psi^{n+1}g)(x)+\sum_{n=2}^{\infty}(n-1)\alpha^{n+1}(\Psi^{n+1}g)(x)\\ &=\frac{1}{\alpha}u{\alpha,g}+(\Psi u_{\alpha,g})(x)+\sum_{n=0}^{\infty}\alpha^{n+2}(\Psi^{n+2}g)(x)+\sum_{n=1}^{\infty}n\alpha^{n+2}(\Psi^{n+2}g)(x)\\ &=\frac{1}{\alpha}u_{\alpha,g}+(\Psi u_{\alpha,g})(x)+(\alpha\Psi^{2}u_{\alpha,g})(x)+\dots+(\alpha^{k-1}\Psi^{k}u_{\alpha,g})(x)+\sum_{n=1}^{\infty}n\alpha^{n+k}(\Psi^{n+k}g)(x)\\ &=\sum_{n=0}^{\infty}\alpha^{n-1}(\Psi^{n}u_{\alpha,g})(x)=\frac{1}{\alpha^{2}}u_{\alpha,u_{\alpha,g}}.\end{split}

For any h:[0,x]h:[0,x]\mapsto\mathbbm{R} locally absolutely continuous function:

(Ψh)(x)=0x(h(z)0zh(zy)𝑑F(y))𝑑z=0x(h(z)[h(zy)F(y)]y=0y=z0zh(zy)F(y)𝑑y)𝑑z=0x(h(z)h(0)F(z))𝑑z0xyxh(zy)F(y)𝑑z𝑑y=0x(h(z)h(0)F(z))𝑑z0x(h(xy)h(0))F(y)𝑑y=0xh(z)𝑑z0xh(xy)F(y)𝑑y=0xh(z)(1+F(xz))𝑑z.\begin{split}(\Psi h)(x)&=\int_{0}^{x}\Big{(}h(z)-\int_{0}^{z}h(z-y)dF(y)\Big{)}dz\\ &=\int_{0}^{x}\Big{(}h(z)-[h(z-y)F(y)]_{y=0}^{y=z}-\int_{0}^{z}h^{\prime}(z-y)F(y)dy\Big{)}dz\\ &=\int_{0}^{x}(h(z)-h(0)F(z))dz-\int_{0}^{x}\int_{y}^{x}h^{\prime}(z-y)F(y)dzdy\\ &=\int_{0}^{x}(h(z)-h(0)F(z))dz-\int_{0}^{x}\big{(}h(x-y)-h(0)\big{)}F(y)dy\\ &=\int_{0}^{x}h(z)dz-\int_{0}^{x}h(x-y)F(y)dy=\int_{0}^{x}h(z)(1+F(x-z))dz.\\ \end{split}

Thus, h>0h>0 implies (Ψh)>0(\Psi h)>0, which implies (Ψnh)>0(\Psi^{n}h)>0, n\forall n\in\mathbbm{N}, and therefore uα,h>0u_{\alpha,h}>0 for any α>0\alpha>0. This argument shows that ddαV=1α2uα,V>0\frac{d}{d\alpha}V=\frac{1}{\alpha^{2}}u_{\alpha,V}>0 as V>0V>0. Therefore VV is strictly increasing with α\alpha. ∎

According to Proposition 3.1, in order to find θ\theta minimizing the probability of ruin, it is sufficient to find θ\theta minimizing 𝔼θ[N1]c(θ)\frac{\mathbb{E}^{\theta}[N_{1}]}{c(\theta)}. For example, using the logit demand model (3.1), the optimal loading is found with direct differentiation of α\alpha and is given by:

θruin=1β1(ln(λ𝔼[Y]rβ1)β0).\begin{split}\theta_{ruin}=\frac{1}{\beta_{1}}\Big{(}\ln\big{(}\frac{\lambda\operatorname{\mathbb{E}}[Y]}{r\beta_{1}}\big{)}-\beta_{0}\Big{)}.\end{split} (3.6)

However, the loading that maximizes the expected profit is:

θprofit=argmaxθ𝔼θ[X1|X0=x]=argmaxθ{θ𝔼θ[N1]𝔼[Y]r},\begin{split}\theta_{profit}=\operatorname*{arg\,max}_{\theta}\operatorname{\mathbb{E}}^{\theta}[X_{1}\>|\>X_{0}=x]=\operatorname*{arg\,max}_{\theta}\{\theta\operatorname{\mathbb{E}}^{\theta}[N_{1}]\operatorname{\mathbb{E}}[Y]-r\},\end{split}

which is, in the case of logit demand (3.1), the unique solution of:

1+eβ0+β1θβ1θeβ0+β1θ=0.\begin{split}1+e^{\beta_{0}+\beta_{1}\theta}-\beta_{1}\theta e^{\beta_{0}+\beta_{1}\theta}=0.\end{split} (3.7)

Thus, in general, θruin\theta_{ruin} does not coincide with θprofit\theta_{profit}.

4 The Multiple Risk Case

In this section, we explore how dependencies between risks available in an insurance market translate into risk exposure for a company through its market shares on the different risks. It turns out that this mechanism is non trivial when the risks are dependent. For the sake of simplicity, we assume that the company offers insurance for two risks in a market constituted by identical individuals, all of them exposed to both risks. Using the notation in equations (2.5) and (2.6) to denote the market claim process, St=(St(1),St(2))S_{t}=(S_{t}^{(1)},S_{t}^{(2)}) is the vector of the total (accumulated) amount of claims of each risk that occurred in the market, up to time tt. The marginal distributions of S(1)S^{(1)} and S(2)S^{(2)} are characterized by claim intensities λ1\lambda_{1} and λ2\lambda_{2} and the severity distributions FY(1)F_{Y^{(1)}}, FY(2)F_{Y^{(2)}} and their dependency structure is characterized by a parameter λ[0,min(λ1,λ2)]\lambda^{\parallel}\in[0,\min(\lambda_{1},\lambda_{2})] and a joint distribution F(Y1,Y2)F_{(Y^{1\parallel},Y^{2\parallel})}, as explained in Section 2.

4.1 Risk Exposure as a Function of Market Shares

To extend the demand model outlined in Section 3 to a market with multiple risks where the acquisition of insurance for different risks may not be independent, we propose the following interpretation for the function pp.

Let (θ1,θ2)(\theta_{1},\theta_{2}) be the loadings charged by the company for each risk. We assume that every individual in the market (a potential client) is provided with a vector of bid prices (b1b_{1}, b2b_{2}). The client acquires the insurance for risk ii if biθib_{i}\geq\theta_{i} (for convenience, we consider prices net of the pure premium). The distribution of the price vectors in the market is modelled by a random vector B=(B1,B2)B=(B_{1},B_{2}). Thus, pi(θ)=pi(θi)=(Biθi)p_{i}(\theta)=p_{i}(\theta_{i})=\operatorname{\mathbb{P}}\big{(}B_{i}\geq\theta_{i}\big{)} is the company’s market share for the insurance of risk ii at equilibrium, given the loadings θ=(θ1,θ2)\theta=(\theta_{1},\theta_{2}). Let p(1,0)p^{(1,0)} be the proportion of individuals in the market holding a policy for risk 1 and no policy for risk 2. Similarly, p(0,1)(θ)p^{(0,1)}(\theta) and p(1,1)(θ)p^{(1,1)}(\theta) denote the proportion of individuals holding a policy only for risk 2 and for both risks, respectively. If the acquisition of polices for different risks is independent, then:

p(1,1)(θ)=p1(θ1)p2(θ2),p(1,0)(θ)=p1(θ1)(1p2(θ2)),p(0,1)(θ)=p2(θ2)(1p1(θ1)).p^{(1,1)}(\theta)=p_{1}(\theta_{1})p_{2}(\theta_{2}),\quad p^{(1,0)}(\theta)=p_{1}(\theta_{1})(1-p_{2}(\theta_{2})),\quad p^{(0,1)}(\theta)=p_{2}(\theta_{2})(1-p_{1}(\theta_{1})). (4.1)

Dependency between the acquisition of different risks can be introduced by considering dependent bid prices B=(B1,B2)B=(B_{1},B_{2}). In particular, if the joint distribution of BB is characterized by an ordinary copula C:[0,1]2[0,1]C:[0,1]^{2}\mapsto[0,1], then, according to Sklar’s theorem FB(θ1,θ2)=C(FB1(θ1),FB2(θ2))F_{B}(\theta_{1},\theta_{2})=C(F_{B_{1}}(\theta_{1}),F_{B_{2}}(\theta_{2})) [25]. This gives:

p(1,0)=FB2(θ2)C(FB1(θ1),FB2(θ2)),p(0,1)=FB1(θ1)C(FB1(θ1),FB2(θ2)),p(1,1)=1FB1(θ1)FB2(θ2)+C(FB1(θ1),FB2(θ2)).\begin{split}p^{(1,0)}&=F_{B_{2}}(\theta_{2}^{-})-C(F_{B_{1}}(\theta_{1}^{-}),F_{B_{2}}(\theta_{2}^{-})),\\ p^{(0,1)}&=F_{B_{1}}(\theta_{1}^{-})-C(F_{B_{1}}(\theta_{1}^{-}),F_{B_{2}}(\theta_{2}^{-})),\\ p^{(1,1)}&=1-F_{B_{1}}(\theta_{1}^{-})-F_{B_{2}}(\theta_{2}^{-})+C(F_{B_{1}}(\theta_{1}^{-}),F_{B_{2}}(\theta_{2}^{-})).\\ \end{split}

Under this model, the company’s surplus process is:

X~t=u(1)+u(2)+(c(1)(θ1)+c(2)(θ2))ti=0N~t1Y~i1i=0N~t2Y~i2i=0N~t(Yi1+Yi2),\tilde{X}_{t}=u^{(1)}+u^{(2)}+\big{(}c^{(1)}(\theta_{1})+c^{(2)}(\theta_{2})\big{)}t-\sum_{i=0}^{\tilde{N}_{t}^{1\perp}}\tilde{Y}_{i}^{1\perp}-\sum_{i=0}^{\tilde{N}_{t}^{2\perp}}\tilde{Y}_{i}^{2\perp}-\sum_{i=0}^{\tilde{N}_{t}^{\parallel}}\Big{(}Y_{i}^{1\parallel}+Y_{i}^{2\parallel}\Big{)}, (4.2)

where N~t1\tilde{N}_{t}^{1\perp}, N~t2\tilde{N}_{t}^{2\perp} , and N~t\tilde{N}_{t}^{\parallel} count the number of claims received by the company concerning only risk 1, only risk 2, and both risks, respectively. Their intensities are, respectively,

λ~1=p(1,0)(θ)(λ1+λ)+p(1,1)(θ)λ1=p1(θ1)λ1+p(1,0)(θ)λ,λ~2=p2(θ2)λ2+p(0,1)(θ)λ,λ~=p(1,1)(θ)λ.\begin{split}\tilde{\lambda}_{1}^{\perp}&=p^{(1,0)}(\theta)\big{(}\lambda_{1}^{\perp}+\lambda^{\parallel}\big{)}+p^{(1,1)}(\theta)\lambda_{1}^{\perp}=p_{1}(\theta_{1})\lambda_{1}^{\perp}+p^{(1,0)}(\theta)\lambda^{\parallel},\\ \tilde{\lambda}_{2}^{\perp}&=p_{2}(\theta_{2})\lambda_{2}^{\perp}+p^{(0,1)}(\theta)\lambda^{\parallel},\\ \tilde{\lambda}^{\parallel}&=p^{(1,1)}(\theta)\lambda^{\parallel}.\end{split}

The distribution of the single risk claim amounts Y~1\tilde{Y}^{1\perp} (resp., Y~2\tilde{Y}^{2\perp}) is a mixture of the distributions Y1Y^{1\perp} and Y1Y^{1\parallel} (resp., Y2Y^{2\perp} and Y2Y^{2\parallel}):

FY~1=p1λ1p1λ1+p(1,0)λFY1+p(1,0)λp1λ1+p(1,0)λFY1FY~2=p2λ2p2λ2+p(0,1)λFY2+p(0,1)λp2λ2+p(0,1)λFY2\begin{split}&F_{\tilde{Y}^{1\perp}}=\frac{p_{1}\lambda_{1}^{\perp}}{p_{1}\lambda_{1}^{\perp}+p^{(1,0)}\lambda^{\parallel}}F_{Y^{1\perp}}+\frac{p^{(1,0)}\lambda^{\parallel}}{p_{1}\lambda_{1}^{\perp}+p^{(1,0)}\lambda^{\parallel}}F_{Y^{1\parallel}}\\ &F_{\tilde{Y}^{2\perp}}=\frac{p_{2}\lambda_{2}^{\perp}}{p_{2}\lambda_{2}^{\perp}+p^{(0,1)}\lambda^{\parallel}}F_{Y^{2\perp}}+\frac{p^{(0,1)}\lambda^{\parallel}}{p_{2}\lambda_{2}^{\perp}+p^{(0,1)}\lambda^{\parallel}}F_{Y^{2\parallel}}\end{split}

This is because some customers insure risk 1, but not risk 2 and vice-versa. Therefore, the aggregate process for the insurer is

X~t=u(1)+u(2)+(c(1)(θ1)+c(2)(θ2))ti=0N~tY~i,\tilde{X}_{t}=u^{(1)}+u^{(2)}+\big{(}c^{(1)}(\theta_{1})+c^{(2)}(\theta_{2})\big{)}t-\sum_{i=0}^{\tilde{N}_{t}}\tilde{Y}_{i}, (4.3)

where N~t\tilde{N}_{t} is a Poisson process with intensity

λ~=p1λ1+p2λ2+(p(1,0)+p(0,1)+p(1,1))λ=p1λ1+p2λ2p(1,1)λ,\tilde{\lambda}=p_{1}\lambda_{1}^{\perp}+p_{2}\lambda_{2}^{\perp}+\big{(}p^{(1,0)}+p^{(0,1)}+p^{(1,1)}\big{)}\lambda^{\parallel}=p_{1}\lambda_{1}+p_{2}\lambda_{2}-p^{(1,1)}\lambda^{\parallel}, (4.4)

and Y~i\tilde{Y}_{i}, ii\in\mathbb{N} are i.i.d random variables with distribution

FY~=p1λ1λ~FY1+p2λ2λ~FY2+p(1,0)λλ~FY1+p(0,1)λλ~FY2+p(1,1)λλ~FY1+Y2=1p1λ1+p2λ2p(1,1)λ(p1λ1FY1+p2λ2FY2+p(1,1)λ(FY1+Y2FY1FY2)).\begin{split}F_{\tilde{Y}}&=\frac{p_{1}\lambda_{1}^{\perp}}{\tilde{\lambda}}F_{Y^{1\perp}}+\frac{p_{2}\lambda_{2}^{\perp}}{\tilde{\lambda}}F_{Y^{2\perp}}+\frac{p^{(1,0)}\lambda^{\parallel}}{\tilde{\lambda}}F_{Y^{1\parallel}}+\frac{p^{(0,1)}\lambda^{\parallel}}{\tilde{\lambda}}F_{Y^{2\parallel}}+\frac{p^{(1,1)}\lambda^{\parallel}}{\tilde{\lambda}}F_{Y^{1\parallel}+Y^{2\parallel}}\\ &=\frac{1}{p_{1}\lambda_{1}+p_{2}\lambda_{2}-p^{(1,1)}\lambda^{\parallel}}\bigg{(}p_{1}\lambda_{1}F_{Y^{1}}+p_{2}\lambda_{2}F_{Y^{2}}+p^{(1,1)}\lambda^{\parallel}\Big{(}F_{Y^{1\parallel}+Y^{2\parallel}}-F_{Y^{1\parallel}}-F_{Y^{2\parallel}}\Big{)}\bigg{)}.\end{split} (4.5)

Thus, if the risks in the market are independent (i.e. if λ=0\lambda^{\parallel}=0), then the risk in the company’s portfolio is just a sum of the risks S(1)S^{(1)} and S(2)S^{(2)}, weighted by the respective market shares, p1p_{1} and p2p_{2}, irrespective of any dependency between sales of policies for different risks. However, if the risks in the market are dependent ( λ0\lambda^{\parallel}\neq 0), then the company’s risk is not, in general, a weighted sum of S(1)S^{(1)} and S(2)S^{(2)}. Further, this effect persists even in the case where sales of different policies are independent (i.e., p(1,1,)=p1p2p^{(1,1,)}=p_{1}p_{2}). On the other hand, equalities (4.4) and (4.5) show that in the (unlikely) situation where clients always buy insurance for only one risk, the risk exposure of the insurer is accurately computed using only the marginal distributions of each risk (i.e. assuming that the risks are independent). This is due to the static nature of our model. For example, it does not take into account the possibility of external factors changing the frequency of claim events in both risks simultaneously.

4.2 The Impact of Dependencies on Ruin Probability

From the discussion above and Proposition 3.1, it follows that the ruin probability of a company with market shares (p1p_{1}, p2p_{2}, p(1,1)p^{(1,1)}) solves the equation

dV(x)dx=\displaystyle\frac{dV(x)}{dx}= λ~c(1)+c(2)(V(x)0xV(xy)𝑑FY~(y)+FY~(x)1),\displaystyle\frac{\tilde{\lambda}}{c^{(1)}+c^{(2)}}\Big{(}V(x)-\int_{0}^{x}V(x-y)dF_{\tilde{Y}}(y)+F_{\tilde{Y}}(x)-1\Big{)}, (4.6)
V(0+)=\displaystyle V(0^{+})= λ~c(1)+c(2)𝔼[Y~],\displaystyle\frac{\tilde{\lambda}}{c^{(1)}+c^{(2)}}\operatorname{\mathbb{E}}[\tilde{Y}], (4.7)

with λ~\tilde{\lambda} and FY~F_{\tilde{Y}} given by equations (4.4) and (4.5).

Since estimating the dependency structure may pose substantial difficulties, we may wish to have an a-priori bound for the error introduced by neglecting dependencies, that is, by substituting the probability Vind(x)V_{ind}(x) for V(x)V(x), where Vind(x)V_{ind}(x) solves the equation.

dV(x)dx=λ^c(1)+c(2)(V(x)0xV(xy)𝑑FY^(y)+FY^(x)1),\frac{dV(x)}{dx}=\frac{\hat{\lambda}}{c^{(1)}+c^{(2)}}\Big{(}V(x)-\int_{0}^{x}V(x-y)dF_{\hat{Y}}(y)+F_{\hat{Y}}(x)-1\Big{)}, (4.8)

where λ^=λ1p1+λ2p2\hat{\lambda}=\lambda_{1}p_{1}+\lambda_{2}p_{2} and FY^(x)=λ1p1FY(1)+λ2p2FY(2)λ^F_{\hat{Y}}(x)=\frac{\lambda_{1}p_{1}F_{Y^{(1)}}+\lambda_{2}p_{2}F_{Y^{(2)}}}{\hat{\lambda}}. Notice that λ^𝔼[Y^]=λ~𝔼[λ~]\hat{\lambda}\operatorname{\mathbb{E}}[\hat{Y}]=\tilde{\lambda}\operatorname{\mathbb{E}}[\tilde{\lambda}] and therefore the boundary condition for (4.8) is again (4.7).

The discussion in Subsection 4.1 shows that the difference V(x)Vind(x)V(x)-V_{ind}(x) is expected to be small when p(1,1)p^{(1,1)} is small compared to p1+p2p_{1}+p_{2}. The following proposition gives a precise meaning for this statement.

Proposition 4.1.

With the notation above:

|V(x)Vind(x)|p(1,1)λe2λ~xc(1)+c(2)1λ~|V(x)-V_{ind}(x)|\leq p^{(1,1)}\lambda^{\parallel}\frac{e^{\frac{2\tilde{\lambda}x}{c^{(1)}+c^{(2)}}}-1}{\tilde{\lambda}}

for every amount of initial reserve x0x\geq 0.

Proof.

From equalities (4.6), (4.7) and (4.8), straightforward computations yield:

V(x)Vind(x)=p(1,1)λc(1)+c(1)(0xVind(z)0zVind(zy)dFY1+Y2(y)+FY1+Y2(z)1dz0xVind(z)0zVind(zy)𝑑FY1(y)+FY1(z)1dz0xVind(z)0zVind(zy)dFY2(y)+FY2(z)1dz)+λ~c(1)+c(2)0x(VVind)(z)0z(VVind)(zy)𝑑FY~(y)𝑑z\begin{split}V(x)-V_{ind}(x)&=\frac{p^{(1,1)}\lambda^{\parallel}}{c^{(1)}+c^{(1)}}\Bigg{(}\int_{0}^{x}V_{ind}(z)-\int_{0}^{z}V_{ind}(z-y)dF_{Y^{1\parallel}+Y^{2\parallel}}(y)+F_{Y^{1\parallel}+Y^{2\parallel}}(z)-1dz-\\ &\quad\int_{0}^{x}V_{ind}(z)-\int_{0}^{z}V_{ind}(z-y)dF_{Y^{1\parallel}}(y)+F_{Y^{1\parallel}}(z)-1dz-\\ &\quad\int_{0}^{x}V_{ind}(z)-\int_{0}^{z}V_{ind}(z-y)dF_{Y^{2\parallel}}(y)+F_{Y^{2\parallel}}(z)-1dz-\Bigg{)}\\ &\quad+\frac{\tilde{\lambda}}{c^{(1)}+c^{(2)}}\int_{0}^{x}(V-V_{ind})(z)-\int_{0}^{z}(V-V_{ind})(z-y)dF_{\tilde{Y}}(y)dz\end{split} (4.9)

It can be checked that for every distribution function G:[0,+[[0,1]G:[0,+\infty[\mapsto[0,1],

x0xVind(z)0zVind(zy)𝑑G(y)+G(z)1dz0-x\leq\int_{0}^{x}V_{ind}(z)-\int_{0}^{z}V_{ind}(z-y)dG(y)+G(z)-1dz\leq 0

Therefore, (4.9) implies:

maxy[0,x]|V(x)Vind(x)|p(1,1)λc(1)+c(2)2x+λ~c(1)+c(2)0x2maxy[0,z]|V(x)Vind(y)|dz\max_{y\in[0,x]}|V(x)-V_{ind}(x)|\leq\frac{p^{(1,1)}\lambda^{\parallel}}{c^{(1)}+c^{(2)}}2x+\frac{\tilde{\lambda}}{c^{(1)}+c^{(2)}}\int_{0}^{x}2\max_{y\in[0,z]}|V(x)-V_{ind}(y)|dz

Thus, the result follows by Grönwall’s inequality [10]. ∎

4.3 The Impact of Dependencies on Small Companies

Now, we proceed with the argument above to explore how dependencies affect companies of different size. We measure the size of the company by it’s expected total value of claims, λ~𝔼[Y~]\tilde{\lambda}\operatorname{\mathbb{E}}[\tilde{Y}] and, to make comparisons meaningful, we consider that the total revenue is proportional to the company’s size, i.e.

c(1)+c(2)=(1+θ)λ~𝔼[Y~],with θ>0 constant.c^{(1)}+c^{(2)}=(1+\theta)\tilde{\lambda}\operatorname{\mathbb{E}}[\tilde{Y}],\quad\textrm{with }\theta>0\textrm{ constant.}

Similarly, we consider the initial reserve to be proportional to size, i.e.:

x=xoλ~𝔼[Y~],with x0>0 constant.x=x_{o}\tilde{\lambda}\operatorname{\mathbb{E}}[\tilde{Y}],\quad\textrm{with }x_{0}>0\textrm{ constant.}

Notice that, due to equations (4.6), (4.7) and (4.8), the effect of dependencies must be bounded in the sense that

|V(x0λ~𝔼[Y~])Vind(x0λ~𝔼[Y~])|K1x0λ~𝔼[Y~]K2(p1+p2),|V(x_{0}\tilde{\lambda}\operatorname{\mathbb{E}}[\tilde{Y}])-V_{ind}(x_{0}\tilde{\lambda}\operatorname{\mathbb{E}}[\tilde{Y}])|\leq K_{1}x_{0}\tilde{\lambda}\operatorname{\mathbb{E}}[\tilde{Y}]\leq K_{2}(p_{1}+p_{2}),

for some constants K1,K2<+K_{1},K_{2}<+\infty. However, we can use Proposition 4.1 to obtain a better estimate:

|V(x0λ~𝔼[Y~])Vind(x0λ~𝔼[Y~])|p(1,1)λe2λ~x01+θc(1)+c(2)1λ~.|V(x_{0}\tilde{\lambda}\operatorname{\mathbb{E}}[\tilde{Y}])-V_{ind}(x_{0}\tilde{\lambda}\operatorname{\mathbb{E}}[\tilde{Y}])|\leq p^{(1,1)}\lambda^{\parallel}\frac{e^{\frac{2\tilde{\lambda}\frac{x_{0}}{1+\theta}}{c^{(1)}+c^{(2)}}}-1}{\tilde{\lambda}}. (4.10)

Notice that the right-hand side of (4.10) has the same asymptotic behaviour as

p(1,1)λx01+θ,when p1+p20.p^{(1,1)}\lambda^{\parallel}\frac{x_{0}}{1+\theta},\quad\textrm{when }p_{1}+p_{2}\to 0.

Further, if the sales of policies for different risks to the same individual are independent, then p(1,1)=p1p2p^{(1,1)}=p_{1}p_{2} goes to zero faster than λ~𝔼[Y~]=p1λ1𝔼[Y(1)]+p2λ2𝔼[Y(2)]\tilde{\lambda}\operatorname{\mathbb{E}}[\tilde{Y}]=p_{1}\lambda_{1}\operatorname{\mathbb{E}}[Y^{(1)}]+p_{2}\lambda_{2}\operatorname{\mathbb{E}}[Y^{(2)}], when p1+p20p_{1}+p_{2}\to 0. Thus, a small company selling policies for different risks independently is relatively immune to the effects of dependencies between the risks, contrary to a large company (it is obvious that a monopolistic company is fully exposed to the dependencies between risks). This immunity to risk’s dependencies may persist even when sales of policies for different risks are not independent, provided the dependency in sales is sufficiently mild. For example, limp1+p20p(1,1)p1+p2=0\lim_{p_{1}+p_{2}\to 0}\frac{p^{(1,1)}}{p_{1}+p_{2}}=0 if the dependency between sales is modelled by a Clayton or a Frank copula in (4.1). However, small companies are not specially protected from risk dependencies if the dependency between sales is modelled by a Pareto or a Gumbel copula.

4.4 Optimal Loadings and Market Shares

Since the right-hand sides of equalities (4.6) and (4.7) depend on the loadings through both 𝔼[N~1c(1)+c(2)\frac{\operatorname{\mathbb{E}}[\tilde{N}_{1}}{c^{(1)}+c^{(2)}} and FY~F_{\tilde{Y}}, Proposition 3.1 can not be generalized to models with multiple risks. However, it is possible to provide optimality conditions for the loadings θ=(θ1,θ2)\theta=(\theta_{1},\theta_{2}) minimizing the ruin probability.

To do this, we extend the notation introduced in the proof of Proposition 3.1. For any distribution function G:[0,+[[0,1]G:[0,+\infty[\mapsto[0,1], we consider the compounding operator of type (3.4)

(ΨGh)(x)=0x(h(z)0zh(zy)𝑑θ(y))𝑑z,x0.(\Psi_{G}h)(x)=\int_{0}^{x}\Big{(}h(z)-\int_{0}^{z}h(z-y)d\theta(y)\Big{)}dz,\quad x\geq 0.

Thus, the 2-risk version of equation (3.2) can be written as

Vθ(x)=λ~θc(θ)(x1Fθ(z)dz+(ΨFθVθ)(x)),V_{\theta}(x)=\frac{\tilde{\lambda}_{\theta}}{c(\theta)}\bigg{(}\int_{x}^{\infty}1-F_{\theta}(z)dz+\Big{(}\Psi_{F_{\theta}}V_{\theta}\Big{)}(x)\bigg{)}, (4.11)

where

Fθ=λ1λ~θp1(θ)FY1+λ2λ~θp1(θ)FY2+λλ~θp(1,0)(θ)FY1+λλ~θp(0,1)(θ)FY2+λλ~θp(1,1)(θ)FY1+Y2.F_{\theta}=\frac{\lambda_{1}^{\perp}}{\tilde{\lambda}_{\theta}}p_{1}(\theta)F_{Y^{1\perp}}+\frac{\lambda_{2}^{\perp}}{\tilde{\lambda}_{\theta}}p_{1}(\theta)F_{Y^{2\perp}}+\frac{\lambda^{\parallel}}{\tilde{\lambda}_{\theta}}p^{(1,0)}(\theta)F_{Y^{1\parallel}}+\frac{\lambda^{\parallel}}{\tilde{\lambda}_{\theta}}p^{(0,1)}(\theta)F_{Y^{2\parallel}}+\frac{\lambda^{\parallel}}{\tilde{\lambda}_{\theta}}p^{(1,1)}(\theta)F_{Y^{1\parallel}+Y^{2\parallel}}.

Since λ1λ~θp1(θ)+λ2λ~θp2(θ)+λλ~θp(1,0)(θ)+λλ~θp(0,1)(θ)+λλ~θp(1,1)(θ)=1\frac{\lambda_{1}^{\perp}}{\tilde{\lambda}_{\theta}}p_{1}(\theta)+\frac{\lambda_{2}^{\perp}}{\tilde{\lambda}_{\theta}}p_{2}(\theta)+\frac{\lambda^{\parallel}}{\tilde{\lambda}_{\theta}}p^{(1,0)}(\theta)+\frac{\lambda^{\parallel}}{\tilde{\lambda}_{\theta}}p^{(0,1)}(\theta)+\frac{\lambda^{\parallel}}{\tilde{\lambda}_{\theta}}p^{(1,1)}(\theta)=1, (4.11) becomes

Vθ(x)=λ1p1(θ)c(θ)(x1FY1(z)dz+(ΨFY1Vθ)(x))+λ2p2(θ)c(θ)(x1FY2(z)dz+(ΨFY2Vθ)(x))+λp(1,0)(θ)c(θ)(x1FY1(z)dz+(ΨFY1Vθ)(x))+λp(0,1)(θ)c(θ)(x1FY2(z)dz+(ΨFY2Vθ)(x))+λp(1,1)(θ)c(θ)(x1FY1+Y2(z)dz+(ΨFY1+Y2Vθ)(x)).\begin{split}V_{\theta}(x)&=\frac{\lambda_{1}^{\perp}p_{1}(\theta)}{c(\theta)}\bigg{(}\int_{x}^{\infty}1-F_{Y^{1\perp}}(z)dz+(\Psi_{F_{Y^{1\perp}}}V_{\theta})(x)\bigg{)}\\ &\quad+\frac{\lambda_{2}^{\perp}p_{2}(\theta)}{c(\theta)}\bigg{(}\int_{x}^{\infty}1-F_{Y^{2\perp}}(z)dz+(\Psi_{F_{Y^{2\perp}}}V_{\theta})(x)\bigg{)}\\ &\quad+\frac{\lambda^{\parallel}p^{(1,0)}(\theta)}{c(\theta)}\bigg{(}\int_{x}^{\infty}1-F_{Y^{1\parallel}}(z)dz+(\Psi_{F_{Y^{1\parallel}}}V_{\theta})(x)\bigg{)}\\ &\quad+\frac{\lambda^{\parallel}p^{(0,1)}(\theta)}{c(\theta)}\bigg{(}\int_{x}^{\infty}1-F_{Y^{2\parallel}}(z)dz+(\Psi_{F_{Y^{2\parallel}}}V_{\theta})(x)\bigg{)}\\ &\quad+\frac{\lambda^{\parallel}p^{(1,1)}(\theta)}{c(\theta)}\bigg{(}\int_{x}^{\infty}1-F_{Y^{1\parallel}+Y^{2\parallel}}(z)dz+(\Psi_{F_{Y^{1\parallel}+Y^{2\parallel}}}V_{\theta})(x)\bigg{)}.\end{split}

We write this in abbreviated form:

Vθ(x)=<α(θ),Γ(x)>+(<α(θ),Ψ>Vθ)(x),V_{\theta}(x)=<\alpha(\theta),\Gamma(x)>+(<\alpha(\theta),\Psi>V_{\theta})(x),

where α(θ)\alpha(\theta) is the vector

α(θ)=1c(θ)(λ1p1(θ),λ2p2(θ),λp(1,0),λp(0,1),λp(1,1)),\alpha(\theta)=\frac{1}{c(\theta)}\Big{(}\lambda_{1}^{\perp}p_{1}(\theta),\lambda_{2}^{\perp}p_{2}(\theta),\lambda^{\parallel}p^{(1,0)},\lambda^{\parallel}p^{(0,1)},\lambda^{\parallel}p^{(1,1)}\Big{)},

Γ(x)\Gamma(x) is the vector function

Γ(x)=(x1FY1(z)dz,x1FY2(z)dz,x1FY1(z)dz,x1FY2(z)dz,x1FY1+Y2(z)dz),\Gamma(x)=\Big{(}\int_{x}^{\infty}1-F_{Y^{1\perp}}(z)dz,\int_{x}^{\infty}1-F_{Y^{2\perp}}(z)dz,\int_{x}^{\infty}1-F_{Y^{1\parallel}}(z)dz,\int_{x}^{\infty}1-F_{Y^{2\parallel}}(z)dz,\int_{x}^{\infty}1-F_{Y^{1\parallel}+Y^{2\parallel}}(z)dz\Big{)},

Ψ\Psi is the vector of operators

Ψ=(ΨFY1,ΨFY2,ΨFY1,ΨFY2,ΨFY1+Y2),\Psi=\Big{(}\Psi_{F_{Y^{1\perp}}},\Psi_{F_{Y^{2\perp}}},\Psi_{F_{Y^{1\parallel}}},\Psi_{F_{Y^{2\parallel}}},\Psi_{F_{Y^{1\parallel}}+Y^{2\parallel}}\Big{)},

and <,><\cdot,\cdot> is the usual inner product in 5\mathbbm{R}^{5}.

Using the argument in the proof of Proposition 3.1, we see that VθV_{\theta} admits the series representation

Vθ(x)=n=0(<α(θ),Ψ>n<α(θ),Γ>)(x).V_{\theta}(x)=\sum_{n=0}^{\infty}\Big{(}<\alpha(\theta),\Psi>^{n}<\alpha(\theta),\Gamma>\Big{)}(x).

Similarly, any vector γ5\gamma\in\mathbbm{R}^{5} and any bounded measurable function g:[0,+[g:[0,+\infty[\mapsto\mathbbm{R} define one unique function

uγ,g(x)=n=0(<γ,Ψ>ng)(x).u_{\gamma,g}(x)=\sum_{n=0}^{\infty}\Big{(}<\gamma,\Psi>^{n}g\Big{)}(x).

This function is analytic with respect to γ\gamma, with partial derivatives

uγ,gγi=n=0<γ,Ψ>n(Ψiuγ,g)=uγ,Ψiuγ,g,i=1,,5.\frac{\partial{u_{\gamma,g}}}{\partial{\gamma_{i}}}=\sum_{n=0}^{\infty}<\gamma,\Psi>^{n}\big{(}\Psi_{i}u_{\gamma,g}\big{)}=u_{\gamma,\Psi_{i}u_{\gamma,g}},\quad i=1,\dots,5.

Taking into account the chain rule for derivatives, this proves the following proposition.

Proposition 4.2.

If θα(θ)\theta\mapsto\alpha(\theta) is differentiable, then θVθ(x)\theta\mapsto V_{\theta}(x) is differentiable for every x0x\geq 0 and

θiVθ(x)=j=15uα(θ),(Γj+ΨjVθ)αj(θ)θi,i=1,2.\frac{\partial{}}{\partial{\theta_{i}}}V_{\theta}(x)=\sum_{j=1}^{5}u_{\alpha(\theta),(\Gamma_{j}+\Psi_{j}V_{\theta})}\frac{\partial{\alpha_{j}(\theta)}}{\partial{\theta_{i}}},\quad i=1,2.

By Proposition 4.2, the optimal loadings satisfy the equation

j=15uα(θ),(Γj+ΨjVθ)αj(θ)θi=0,i=1,2\sum_{j=1}^{5}u_{\alpha(\theta),(\Gamma_{j}+\Psi_{j}V_{\theta})}\frac{\partial{\alpha_{j}(\theta)}}{\partial{\theta_{i}}}=0,\quad i=1,2 (4.12)

Contrary to the single-risk case, the odds of finding explicit solutions for this equation seem very low, even in simple cases. However, (4.12) can be numerically solved by Newton’s algorithm, the second-order partial derivatives being

2θiθjVθ(x)=k=15uα(θ),(Γk+ΨkVθ)2αk(θ)θiθj+k=15l=15uα(θ),Ψkuα(θ),(Γl+ΨlVθ)αk(θ)θiαl(θ)θj.\frac{\partial^{2}}{\partial{\theta_{i}}\partial{\theta_{j}}}V_{\theta}(x)=\sum_{k=1}^{5}u_{\alpha(\theta),(\Gamma_{k}+\Psi_{k}V_{\theta})}\frac{\partial^{2}{\alpha_{k}(\theta)}}{\partial{\theta_{i}}\partial{\theta_{j}}}+\sum_{k=1}^{5}\sum_{l=1}^{5}u_{\alpha(\theta),\Psi_{k}}u_{\alpha(\theta),(\Gamma_{l}+\Psi_{l}V_{\theta})}\frac{\partial{\alpha_{k}(\theta)}}{\partial{\theta_{i}}}\frac{\partial{\alpha_{l}(\theta)}}{\partial{\theta_{j}}}.

Notice that the expected profit is

c(1)(θ)+c(2)(θ)λ~𝔼[Y~]=θ1p1(θ1)λ1𝔼[Y(1)]+θ2p2(θ2)λ2𝔼[Y(2)].c^{(1)}(\theta)+c^{(2)}(\theta)-\tilde{\lambda}\operatorname{\mathbb{E}}[\tilde{Y}]=\theta_{1}p_{1}(\theta_{1})\lambda_{1}\operatorname{\mathbb{E}}[Y^{(1)}]+\theta_{2}p_{2}(\theta_{2})\lambda_{2}\operatorname{\mathbb{E}}[Y^{(2)}].

Thus, it depends only on the marginal distribution of the claim processes S(1)S^{(1)}, S(2)S^{(2)}, being independent of the dependency structure. It follows that the loadings minimizing the joint profit coincide with the loadings minimizing the profit on each risk, separately. That is, a pricing strategy that completely focus on expected profit completely fails to take both dependencies between risks and dependencies between sales of policies into account.

5 Numerical Results

Throughout this section, Yi(i)Y_{i}^{(i)} are assumed to be i.i.d gamma distributed random variables with shape parameter, a(i)a^{(i)}, and scale parameters, k(i)k^{(i)}, which means that the mean is, 𝔼[Y(i)]=a(i)k(i)\operatorname{\mathbb{E}}[Y^{(i)}]=a^{(i)}k^{(i)}, for i=1,2i=1,2. In the following numerical analysis let a(1)=a(2)=2a^{(1)}=a^{(2)}=2, k(1)=k(2)=500k^{(1)}=k^{(2)}=500, λ(1)=λ(2)=800\lambda^{(1)}=\lambda^{(2)}=800, β0(1)=β0(2)=0.5\beta_{0}^{(1)}=\beta_{0}^{(2)}=-0.5, β1(2)=4\beta_{1}^{(2)}=4 and β1(1)=4.5\beta_{1}^{(1)}=4.5. That is, the difference stems from surplus process 2 being more sensitive to the loading via the parameter β1(2)\beta_{1}^{(2)}. r(i)r^{(i)} is taken to be 20%20\% of the pure premium if the exposure was 40%40\%, that is r(i)=0.40.2k(i)a(i)N(i)r^{(i)}=0.4*0.2k^{(i)}a^{(i)}N^{(i)}. The operational cost is therefore 8%8\% of the expected total amount of claims in the market. The Clayton Lévy copula is considered for positive dependence and the parameter is set to ω=1\omega=1. Finally, let θruin\theta_{ruin}^{*} and θprofit\theta_{profit}^{*} denote the optimal loading when the ruin probability and expected profit criterion is used, respectively. The programming language R was used for every calculation.

5.1 Single Surplus Process

The surplus processes are first considered separately. The ruin probability and the expected profit is plotted as a function of θ\theta for the two processes in Figures 1 and 2. θruin\theta_{ruin}^{*} was found by minimizing α\alpha.

Refer to caption
Figure 1: Surplus process 1. The blue lines show the ruin probability as a function of θ\theta for a given surplus x. The black line shows the expected profit per time unit as a function of θ\theta. The blue dots show the minimum ruin probability for each surplus. θprofit\theta_{profit}^{*} and θruin\theta_{ruin}^{*} denote the optimal security loading parameter for the expected profit and for the probability of ruin, respectively.

From Figure 1 it can be seen that the optimal security loading parameter for the ruin probability is, θruin=0.435\theta_{ruin}^{*}=0.435, while the θ\theta that maximizes the expected profit is lower, θprofit=0.359\theta_{profit}^{*}=0.359. Moreover, in this example, the maximum expected profit is 22.843 units and is given at θprofit\theta_{profit}^{*}. The expected profit taken at the point θruin\theta_{ruin}^{*} is lower, close to 20.000 units.

Refer to caption
Figure 2: Surplus process 2. The blue lines show the ruin probability as a function of θ\theta for a given surplus x. The black line shows the expected profit per time unit as a function of θ\theta. The blue dots show the minimum ruin probability for each surplus. θprofit\theta_{profit}^{*} and θruin\theta_{ruin}^{*} denote the optimal security loading parameter for the expected profit and for the probability of ruin, respectively.

From Figure 2 it can be seen that the optimal security loading parameter for the ruin probability is θruin=0.358\theta_{ruin}^{*}=0.358, while the θ\theta that maximizes the expected profit is again lower or θprofit=0.319\theta_{profit}^{*}=0.319.

Obviously, for both processes, the ruin probability decreases with increasing surplus. Moreover, it can be seen that surplus process X2X_{2} has higher probability of ruin than surplus process X1X_{1} for the same amount of surplus. The sensitivity of the demand curve affects the ruin probability and θruin\theta_{ruin}^{*} greatly. The more sensitive to the exposure the demand curve is, the closer the θprofit\theta_{profit}^{*} and θruin\theta_{ruin}^{*} are. This more sensitive curve also has higher probability of ruin for a given surplus, which indicates that more competitive insurance products are riskier. These effects can be seen if the two Figures (1 and 2) are compared. Conversely, if the demand curve is not sensitive to the price, then the gap between θprofit\theta_{profit}^{*} and θruin\theta_{ruin}^{*} can become quite large. Additionally, it can be seen from the curve at surplus = 100 that the ruin probability for θprofit\theta_{profit}^{*} and θruin\theta_{ruin}^{*} are similar but as the surplus grows the values start to differ and once the surplus is great enough the two values θprofit\theta_{profit}^{*} and θruin\theta_{ruin}^{*} result in similar ruin probabilities again. This means that if the insurance firm has high enough surplus then they can choose arbitrary θ\theta without risking the chance of ruin. If the surplus is great enough then the value of θ\theta does not matter as much. However, having too much reserves can be bad for insurance companies as it can be seen as a negative leverage. The bowl shape of the blue curves in the two Figures (1 and 2) is because of the interplay between the fixed cost and the demand curve.

θruin\theta_{ruin}^{*} should give the minimum ruin probability at all surplus values. This can be tested by graphing multiple ruin probability curves and compare it with the one obtained by θruin\theta_{ruin}^{*}. Figure 3 shows that θruin\theta_{ruin}^{*} gives the minimum ruin probability indeed.

Refer to caption
Figure 3: The figure compares the optimal value function of surplus process, X1X_{1} (blue line) to other cost functions (grey lines). The blue line is achieved by setting θ=θruin\theta=\theta_{ruin}^{*}. All the cost functions lie above the optimal value function, as is expected.

5.2 Two Aggregated Surplus Processes with Common Loading

Next, the two surplus processes, X1X_{1} and X2X_{2} are aggregated, both when the claims are independent and dependent. The acquisition is independent in this subsection.

Refer to caption
Figure 4: Ruin probability when X1X_{1} and X2X_{2} are aggregated as a function of the security loading parameter, θ\theta, both when they are independent and dependent via Clayton Lévy copula with ω=0.5\omega=0.5. The blue curves show the ruin probability when the two processes are independent for different values of the surplus and the red curves show the same for the dependent case. The curves have similar shapes, but the ruin probability is higher in the case of dependence, for the same surplus. The values in the legend show the minimum ruin probability for a given surplus (surplus \rightarrow probability).

Figure 4 shows the ruin probability of the aggregated surplus process as a function of the security loading parameter, θ\theta, both when they are independent and dependent via Clayton Lévy copula. The red curves represent dependence while the blue curves represent independence.

Firstly, it can be seen that the expected profit is the same for dependence and independence and from the figure, θprofit0.34\theta^{*}_{profit}\approx 0.34. The reason is that the claim mean and the claim frequency is almost the same (numerically) for dependence and independence.

Secondly, the dependent case has a higher probability of ruin than the independent case for the same amount of surplus. However, the ruin probability is almost the same for small surplus values as can be seen from the figure. Interestingly, the optimal loading for dependence and independence seem to be the same and numerically the values are θruin,dep=0.4=θruin,indp\theta^{*}_{ruin,dep}=0.4=\theta^{*}_{ruin,indp}. The surplus value does not change the optimal loading θ\theta^{*}, as expected. The reason why the ruin probability difference between the dependent and independent cases is relatively small is because of the probability p(0,1)(θ)p^{(0,1)}(\theta). The fact that the insurance company does not always have the both claims Y1Y^{1\parallel} and Y1Y^{1\parallel} when a common jump occurs reduces the risk.

Finally, the difference of the two ruin probability curves (red and blue) for a given surplus seems to be increasing with increasing surplus, meaning that the ruin probability in the independent case decreases more rapidly with increasing surplus then for the dependent case. Therefore, it is clear that the positive dependent case is riskier.

Note that θruin0.4\theta_{ruin}^{*}\approx 0.4, which is very close to the weighted average of the optimal loading parameter of the isolated surplus processes where the weight is the exposure ratio of each surplus process, that is

θweighted=0.43511+exp(0.6+40.4)+0.35811+exp(0.6+4.50.4)11+exp(0.6+40.4)+11+exp(0.6+4.50.4)0.4,\theta_{weighted}=\frac{0.435\frac{1}{1+\exp(-0.6+4*0.4)}+0.358\frac{1}{1+\exp(-0.6+4.5*0.4)}}{\frac{1}{1+\exp(-0.6+4*0.4)}+\frac{1}{1+\exp(-0.6+4.5*0.4)}}\approx 0.4,

which strongly indicates that the optimal value, θruin\theta_{ruin}^{*}, is simply the weighted average.

5.2.1 Two Aggregated Surplus Processes with Separate Loadings

It is more realistic to consider θ\theta as a vector so that the loading parameter can be different for each surplus process separately, to spread the total premium over the policies in an optimal way. The two surplus processes, X1X_{1} and X2X_{2}, are aggregated as before and the constants are the same, but let θ=(θ(1),θ(2))\theta=(\theta^{(1)},\theta^{(2)}).

Refer to caption
Refer to caption
Figure 5: Expected profit (left) and the ruin probability (right) when X1X_{1} and X2X_{2} are aggregated, as a function of the security loading parameters, θ(1)\theta^{(1)} and θ(2)\theta^{(2)}. The processes are assumed to be independent and the surplus is fixed at x=5000x=5000. The parenthesis in the right figure shows the optimal values of θ(1)\theta^{(1)} and θ(2)\theta^{(2)} with the ruin probability as a criterion. The arrow indicates which values θ(1)\theta^{(1)} and θ(2)\theta^{(2)} are mapped into, thus showing the minimum ruin probability. The parenthesis in the left figure shows the same for the expected profit. The shape of the contour plot is due to the fact that the θ\theta grid considered is sparser for values that give high ruin probability.

Figure 5 shows the expected profit (left) and the ruin probability (right), when X1X_{1} and X2X_{2} are assumed to be independent and aggregated, as a function of the security loading parameters, θ(1)\theta^{(1)} and θ(2)\theta^{(2)}. The surplus is fixed at x=5000x=5000 and the optimal values are shown. It should be noted that many surplus values were tested and they all gave the same value for θruin(1)\theta_{ruin}^{*(1)}, θruin(2)\theta_{ruin}^{*(2)}, θprofit(2)\theta_{profit}^{*(2)}, and θprofit(2)\theta_{profit}^{*(2)} as shown, only the ruin probability level changed. Note that the optimal loading parameters for the expected profit are the same as those for the individual surplus processes. However, the optimal loading parameters for the ruin probability change when compared to the individual one (compare it with Figures 1 and 2). When compared to the optimal loading parameter for the individual surplus process, θ(1)\theta^{(1)} decreases from 0.4350.435 to 0.420.42 and θ(2)\theta^{(2)} increases from 0.358 to 0.38. Therefore, the optimal security loading parameter decision is to decrease the loading parameter of the less sensitive surplus process while increasing the loading parameter of the more sensitive surplus process. Additionally, when compared to Figure 4, the minimum ruin probability for one shared loading is 0.570.57 while the ruin for two loadings is 0.560.56, showing only a marginal difference. When the same is done for other surplus values a similar difference is found. The expected profit is marginally higher.

Lastly, consider the case when the surplus processes are assumed to be dependent via Lévy Clayton copula and the loadings can be different for each surplus process separately. Figure 6 shows the ruin probability when X1X_{1} and X2X_{2} are aggregated as a function of the security loading parameters, θ(1)\theta^{(1)} and θ(2)\theta^{(2)}. The shape of the contour plots is due to the fact that the θ\theta grid considered is sparser for values that give high ruin probability. The surplus is fixed at x=5000x=5000.

Refer to caption
Refer to caption
Figure 6: Expected profit (left) and the ruin probability (right) when X1X_{1} and X2X_{2} are aggregated, as a function of the security loading parameters, θ(1)\theta^{(1)} and θ(2)\theta^{(2)}. The processes are assumed to be dependent via Clayton Lévy copula and the surplus is fixed at x=5000x=5000. The parenthesis in the right figure shows the optimal values of θ(1)\theta^{(1)} and θ(2)\theta^{(2)} with the ruin probability as a criterion. The arrow shows which values θ(1)\theta^{(1)} and θ(2)\theta^{(2)} are mapped into, thus showing the minimum ruin probability. The parenthesis in the left figure shows the same except for the expected profit. The shape of the contour plot is due to the fact that the θ\theta grid considered is sparser for values that give high ruin probability.

It can be seen that the optimal loadings θ(1)\theta^{(1)} and θ(2)\theta^{(2)} are the same as the ones in the case of independence and the minimum ruin probability is higher (compared to the case in Figure 5). Both the values and the optimal loadings of the expected profit are the same as the independent case. Again, the optimal security loading parameter decision is to decrease the loading parameter of the less sensitive surplus process while increasing the loading parameter of the more sensitive surplus process. The difference between the ruin probability in Figure 6 vs Figure 5 is only 0.030.03 but in this case the surplus is low compared to the expected profit. If the surplus would be increased to 20.000\approx 20.000 the difference would become greater. The difference would then decrease again if the surplus were increased to 40.000\approx 40.000.

Additionally, when compared to Figure 4, the minimum ruin probability for one common loading is 0.590.59, which is the same as the ruin probability for separate loading selections, therefore the difference is only marginal.

5.3 Dependent Claims and Dependent Acquisition

It is time to look at the case when we have dependent claims and dependent acquisition. Note that the case when we have independent claims and dependent acquisition is the same as the total independence case. We will look both at the case when the acquisition is modelled with a Gumbel and Clayton dependency structure. To compare these two structures we use Kendell’s tau. The following equations relate the copula parameters, ωclayton\omega_{clayton} and ωgumbel\omega_{gumbel} to kendell’s tau, τ\tau.

ωclayton=2τ1τ,ωgumbel=11τ.\omega_{clayton}=\frac{2\tau}{1-\tau},\quad\omega_{gumbel}=\frac{1}{1-\tau}.

We know that the expected profit is the same as before for all values of τ\tau. Therefore, we analyze the ruin probability.

Refer to caption
Refer to caption
Figure 7: The ruin probability when the acquisition is modelled with a Clayton (left) and a Gumbel (right) dependency structure. The surplus is constant. In both cases, the ruin probability is higher for higher kendell’s tau. The Gumbel case is more riskier. The surplus is fixed at 5000 units.

In Figure 7 we can see the ruin probability for different dependency values when the surplus is fixed at 5000 units. We can see that the ruin probability is higher for more dependent acquisition, as we expected. Also, we can see that the Gumbel acquisition model gives higher ruin probabilities than the Clayton model for the same Kendell’s tau value. They give the same optimal loading parameter. It can also be seen that when kendell’s tau is 0.050.05 (close to 0) the ruin probability is close to the case of independent acquisition, as expected. The optimal loading parameter is the same for all dependency levels.

Appendix A Numerical scheme for equation 2.2, using linear approximation

Consider the process

Xt=u+cti=0NtYiX_{t}=u+ct-\sum_{i=0}^{N_{t}}Y_{i}

where YiY_{i} are iid continuous random variables and NtN_{t} is a Poisson(λt)Poisson(\lambda t). To approximate equation 2.2, take a grid of points ϵ=x0<x1<xn\epsilon=x_{0}<x_{1}<...x_{n}, xi,i,ϵ>0x_{i}\in\mathbbm{R},\forall i\in\mathbbm{N},\epsilon>0, with equal interval lengths, h=xixi1h=x_{i}-x_{i-1}. A linear approximation is used to approximate V¯(x)\bar{V}(x)

V¯(z)=V¯(xj1)+V¯(xj)V¯(xj1)h(zxj1),z[xj1,xj],ji\bar{V}(z)=\bar{V}(x_{j-1})+\frac{\bar{V}(x_{j})-\bar{V}(x_{j-1})}{h}(z-x_{j-1}),\quad z\in[x_{j-1},x_{j}],j\leq i

where V¯(xj)V¯(xj1)h\frac{\bar{V}(x_{j})-\bar{V}(x_{j-1})}{h} is an approximation of the derivative, V¯(xj1)\bar{V}^{\prime}(x_{j-1}), using the so-called forward difference

Let V¯i\bar{V}_{i} denote the approximation of V¯(xi)\bar{V}(x_{i}). Let S¯(x)=0xF¯(y)𝑑y\bar{S}(x)=\int_{0}^{x}\bar{F}(y)dy and S¯¯(x)=0xS¯(y)𝑑y\bar{\bar{S}}(x)=\int_{0}^{x}\bar{S}(y)dy. For each xi,i>0x_{i},i>0 solve the following equation

V¯(xi)V¯(0+)=λc0xV¯(xiy)F¯(y)𝑑y.\bar{V}(x_{i})-\bar{V}(0^{+})=\frac{\lambda}{c}\int_{0}^{x}\bar{V}(x_{i}-y)\bar{F}(y)dy.

The goal is to develop a recursive method from x0x_{0} as the value of V¯0\bar{V}_{0} is known.

𝒊𝒇𝒊=𝟎\bm{if\quad i=0}

Set V¯0=1λc𝔼[Y]\bar{V}_{0}=1-\frac{\lambda}{c}\operatorname{\mathbb{E}}[Y]

𝒊𝒇𝒊=𝟏\bm{if\quad i=1}

Calculate

V¯1=V¯0+λcoxi((V¯0+V¯1V¯0h)(x1yx0))F¯(y)𝑑y=V¯0+λc(V¯0(S¯(x1)S¯(x0))+V¯1V¯0h([x1yx0]x0x1+S¯¯(x1)S¯¯(x0)))=V¯0+λc(a1,1+V¯1V¯0ha2,1)\begin{split}\bar{V}_{1}&=\bar{V}_{0}+\frac{\lambda}{c}\int_{o}^{x_{i}}((\bar{V}_{0}+\frac{\bar{V}_{1}-\bar{V}_{0}}{h})(x_{1}-y-x_{0}))\bar{F}(y)dy\\ &=\bar{V}_{0}+\frac{\lambda}{c}\Big{(}\bar{V}_{0}(\bar{S}(x_{1})-\bar{S}(x_{0}))+\\ &\qquad\frac{\bar{V}_{1}-\bar{V}_{0}}{h}\big{(}[x_{1}-y-x_{0}]_{x_{0}}^{x_{1}}+\bar{\bar{S}}(x_{1})-\bar{\bar{S}}(x_{0})\big{)}\Big{)}\\ &=\bar{V}_{0}+\frac{\lambda}{c}(a_{1,1}+\frac{\bar{V}_{1}-\bar{V}_{0}}{h}a_{2,1})\end{split}
\Leftrightarrow
(1λa2,1ch)V¯1=V¯0+λcV¯0(a1,1a2,1h)(1-\frac{\lambda a_{2,1}}{ch})\bar{V}_{1}=\bar{V}_{0}+\frac{\lambda}{c}\bar{V}_{0}(a_{1,1}-\frac{a_{2,1}}{h})

𝒊𝒇𝒊>𝟏\bm{if\quad i>1}

Calculate

V¯(xi)V¯(0)=λc(j=2ixj1xjV¯(xiy)S(y)𝑑y+xi1xiV¯(xiy)S(y)𝑑y)=λc(j=2i(V¯ij(S¯(xj)S¯(xj1))+V¯ij+1V¯ijh([xiyxij]xj1xj+S¯¯(xj)S¯¯(xj1)))+V¯ij(S¯(xi)S¯(xi1))+V¯iV¯ijh([xiyxij]xi1xi+S¯¯(xi)S¯¯(xi1)))=λc(j=2i(V¯ij(S¯(xj)S¯(xj1))+V¯ij+1V¯ijh([xiyxij]xj1xj+S¯¯(xj)S¯¯(xj1)))+V¯ija1,i+V¯iV¯ijha2,i)\begin{split}\bar{V}(x_{i})-\bar{V}(0)&=\frac{\lambda}{c}\Big{(}\sum_{j=2}^{i}\int_{x_{j-1}}^{x_{j}}\bar{V}(x_{i}-y)S(y)dy+\int_{x_{i-1}}^{x_{i}}\bar{V}(x_{i}-y)S(y)dy\Big{)}\\ &=\frac{\lambda}{c}\bigg{(}\sum_{j=2}^{i}\Big{(}\bar{V}_{i-j}(\bar{S}(x_{j})-\bar{S}(x_{j-1}))+\frac{\bar{V}_{i-j+1}-\bar{V}_{i-j}}{h}\big{(}[x_{i}-y-x_{i-j}]_{x_{j-1}}^{x_{j}}+\bar{\bar{S}}(x_{j})-\bar{\bar{S}}(x_{j-1})\big{)}\Big{)}+\\ &\qquad\bar{V}_{i-j}(\bar{S}(x_{i})-\bar{S}(x_{i-1}))+\frac{\bar{V}_{i}-\bar{V}_{i-j}}{h}\big{(}[x_{i}-y-x_{i-j}]_{x_{i-1}}^{x_{i}}+\bar{\bar{S}}(x_{i})-\bar{\bar{S}}(x_{i-1})\big{)}\bigg{)}\\ &=\frac{\lambda}{c}\bigg{(}\sum_{j=2}^{i}\Big{(}\bar{V}_{i-j}(\bar{S}(x_{j})-\bar{S}(x_{j-1}))+\frac{\bar{V}_{i-j+1}-\bar{V}_{i-j}}{h}\big{(}[x_{i}-y-x_{i-j}]_{x_{j-1}}^{x_{j}}+\bar{\bar{S}}(x_{j})-\bar{\bar{S}}(x_{j-1})\big{)}\Big{)}+\\ &\qquad\bar{V}_{i-j}a_{1,i}+\frac{\bar{V}_{i}-\bar{V}_{i-j}}{h}a_{2,i}\bigg{)}\end{split}
\Leftrightarrow
(1λa2,ich)V¯i=V¯0+λc(j=2i(V¯ij(S¯(xj)S¯(xj1))+V¯ij+1V¯ijh([xiyxij]xj1xj+S¯¯(xj)S¯¯(xj1)))+V¯ija1,iV¯ijha2,i)\begin{split}(1-\frac{\lambda a_{2,i}}{ch})\bar{V}_{i}&=\bar{V}_{0}+\\ &\qquad\frac{\lambda}{c}\bigg{(}\sum_{j=2}^{i}\Big{(}\bar{V}_{i-j}(\bar{S}(x_{j})-\bar{S}(x_{j-1}))+\frac{\bar{V}_{i-j+1}-\bar{V}_{i-j}}{h}\big{(}[x_{i}-y-x_{i-j}]_{x_{j-1}}^{x_{j}}+\bar{\bar{S}}(x_{j})-\bar{\bar{S}}(x_{j-1})\big{)}\Big{)}+\\ &\qquad\bar{V}_{i-j}a_{1,i}-\frac{\bar{V}_{i-j}}{h}a_{2,i}\bigg{)}\end{split}
Let the symbol \vec{} denote a vector.
Initialize
x\vec{x} for some x0,,xnx_{0},...,x_{n}
V¯\vec{\bar{V}} with length equal to the length of x\vec{x}
NxN_{x}\leftarrow length of x\vec{x}
# loop to estimate each value in V¯\vec{\bar{V}}
for ii in 0,,(Nx1)0,...,(N_{x}-1) do
 if i = 0 then
    V¯01λc𝔼[Y]\bar{V}_{0}\leftarrow 1-\frac{\lambda}{c}\operatorname{\mathbb{E}}[Y]
 else if i = 1 then
    V¯1\bar{V}_{1}\leftarrow take case i=1i=1 from above and isolate V¯1\bar{V}_{1}
    
 else
    V¯i\bar{V}_{i}\leftarrow take case i>1i>1 from above and isolate V¯i\bar{V}_{i}
    
   end if
 
end for
Return V¯\vec{\bar{V}}, x\vec{x}
 1 Estimation of V¯(x)\bar{V}(x)

Acknowledgements

The second and third authors acknowledge financial support from FCT – Fundação para a Ciência e Tecnologia (Portugal), national funding, through research grant UIDB/05069/2020.

References

  • [1] Benjamin Avanzi, Luke C. Cassar, and Bernard Wong. Modelling dependence in insurance claims processes with lévy copulas. ASTIN Bulletin, 41(2):575–609, 2011.
  • [2] Frank Barning. Counting processes and copulas: Applications in insurance. Master’s thesis, Université du Québec à Montréal, 2018.
  • [3] Fred Espen Benth, Valery A Kholodnyi, and Peter Laurence. Quantitative energy finance. In Modelling, pricing, and hedging in energy and commodity markets. Springer, 2014.
  • [4] Stephen Britt and Albert Napoli. Linear correlation as a measure of dependency. In XVth General Insurance Seminar, Institute of Actuaries of Australia, 2005.
  • [5] Nicole Bäuerle and Anja Blatter. Optimal control and dependence modeling of insurance portfolios with lévy dynamics. Insurance: Mathematics and Economics, 48(3):398–405, 2011.
  • [6] Hans Bühlmann. Premium calculation from top down. Astin Bulletin - ASTIN BULL, 15:89–101, 11 1985.
  • [7] Antonella Campana, Paola Ferretti, et al. Distortion risk measures and discrete risks. Technical report, 2005.
  • [8] R Cont and P Tankov. Financial Modelling with Jump Processes. Chapman & Hall/CRC, 2004.
  • [9] Michel Denuit, Xavier Maréchal, Sandra Pitrebois, and Jean-François Walhin. Actuarial modelling of claim counts: Risk classification, credibility and bonus-malus systems. John Wiley & Sons, 2007.
  • [10] Sever S Dragomir. Some Gronwall type inequalities and applications. Nova Science, 2003.
  • [11] Jan Grandell. Aspects of risk theory. Springer Science & Business Media, 2012.
  • [12] Patrik Hardin and Sam Tabari. Modelling non-life insurance policyholder price sensitivity: A statistical analysis performed with logistic regression, 2017.
  • [13] Christian Hipp. Stochastic control with application in insurance. In Stochastic methods in finance, pages 127–164. Springer, 2004.
  • [14] Christian Hipp and Michael Plum. Optimal investment for insurers. Insurance: Mathematics and Economics, 27(2):215–228, 2000.
  • [15] Rob Kaas, Marc Goovaerts, Jan Dhaene, and Michel Denuit. Modern actuarial risk theory: using R, volume 128. Springer Science & Business Media, 2008.
  • [16] Yehuda Kahane. The theory of insurance risk premiums—a re-examination in the light of recent developments in capital market theory. ASTIN Bulletin: The Journal of the IAA, 10(2):223–239, 1979.
  • [17] Hilbert J Kappen. An introduction to stochastic control theory, path integrals and reinforcement learning. In AIP conference proceedings, volume 887, pages 149–181. American Institute of Physics, 2007.
  • [18] Christian Kasumo, Juma Kasozi, and Dmitry Kuznetsov. On minimizing the ultimate ruin probability of an insurer by reinsurance. Journal of Applied Mathematics, 2018, 2018.
  • [19] Paul C Kettler. Lévy-copula-driven financial processes. Preprint series. Pure mathematics http://urn. nb. no/URN: NBN: no-8076, 2006.
  • [20] Doron Kliger and Benny Levikson. Pricing insurance contracts—an economic viewpoint. Insurance: Mathematics and Economics, 22(3):243–249, 1998.
  • [21] Stuart A Klugman, Harry H Panjer, and Gordon E Willmot. Loss models: from data to decisions, volume 715. John Wiley & Sons, 2012.
  • [22] Elena Krasheninnikova, Javier García, Roberto Maestre, and Fernando Fernández. Reinforcement learning for pricing strategy optimization in the insurance industry. Engineering Applications of Artificial Intelligence, 80:8–19, 2019.
  • [23] Hanson Li and Ruodu Wang. Pelve: Probability equivalent level of var and es. Available at SSRN, 2019.
  • [24] Ronnie Loeffen, Irmina Czarna, Zbigniew Palmowski, et al. Parisian ruin probability for spectrally negative lévy processes. Bernoulli, 19(2):599–609, 2013.
  • [25] Roger B Nelsen. An introduction to copulas. Springer Science & Business Media, 2007.
  • [26] Esbjörn Ohlsson and Björn Johansson. Non-life insurance pricing with generalized linear models, volume 2. Springer, 2010.
  • [27] Antonis Papapantoleon. An introduction to lévy processes with applications in finance, 2008.
  • [28] Ken-Iti Sato. Lévy processes and infinitely divisible distributions. Cambridge university press, 1999.
  • [29] Timothy Sauer. Numerical Analysis. Pearson, 2013.
  • [30] Daniel Yannik Straumann. Correlation and dependency in risk management: properties and pitfalls. In Risk Management: Value at Risk and Beyond. Cambridge University Press, 2001.
  • [31] Richard S Sutton and Andrew G Barto. Reinforcement learning: An introduction. MIT press, 2018.
  • [32] Peter Tankov. Lévy copulas: review of recent results. In The fascination of probability, statistics and their applications, pages 127–151. Springer, 2016.
  • [33] Julien Trufin, Hansjoerg Albrecher, and Michel M Denuit. Properties of a risk measure derived from ruin theory. The Geneva Risk and Insurance Review, 36(2):174–188, 2011.
  • [34] J. L. van Velsen. Parameter estimation of a lévy copula of a discretely observed bivariate compound poisson process with an application to operational risk modelling, 2012.
  • [35] Mario V Wuthrich. Non-life insurance: mathematics & statistics. Available at SSRN 2319328, 2019.
  • [36] Ji Yao. Generalized linear models for non-life pricing-overlooked facts and implications. Institute and Faculty of Actuaries, 2013.
  • [37] Yuqing Zhang and Neil Walton. Adaptive pricing in insurance: Generalized linear models and gaussian process regression approaches. arXiv preprint arXiv:1907.05381, 2019.