The Influence of Seed Selection on the Solvency II Ratio
Abstract
This article contains the first published example of a real economic balance sheet where the Solvency II ratio substantially depends on the seed selected for the random number generator (RNG) used. The theoretical background and the main quality criteria for RNGs are explained in detail. To serve as a gauge for RNGs, a definition of true randomness is given. Quality tests that RNGs should pass in order to generate stable results when used in risk management under Solvency II are described.
Introduction.
Most German insurance companies use the standard formula for risk capital calculation, implying in almost all cases the use of the simulation model Branchensimulationsmodell (BSM), provided by the German Insurance Association (GDV) in order to evaluate the Best Estimate Liabilities (BEL), Own Funds (OF) and the Solvency Capital Requirements (SCR) within the Solvency II framework. Some insurance companies, especially the European-wide capital stock companies, use an internal model, In this paper we consider exlusively the BSM. According to legislation (see [EIO15]), the methodology for calculating the BEL has to be accurate and robust. In particular for life insurance a very important component of the calculation of BEL is the Economic Scenario Generator (ESG). Here, Guidelines 55-59, [EIO15], explicitly define requirements for its quality.The reliability of a Monte Carlo simulation of the BSM (or any other internal model) does not only depend on the quality of the model from a classical actuarial point of view but also on how well the underlying ESG performs. In this article, we put our focus on a mathematically challenging aspect within the ESG, namely the random numbers used therein serving as driver of stochasticity. Guideline 59, [EIO15], explicitly demands the proper testing of random number generators used in the ESG. The ESG provided by the GDV is implemented in Excel and is based on a combined linear congruential generator (LCG) of Wichmann and Hill (see [Wie82]) in order to generate the underlying random numbers.
Example.
To demonstrate the impact of the random number generator, let us consider an example of a typical German life insurance company using the BSM with a strong focus on endowment business in their portfolio. From a regulatory point of view, it is required to have stable results depending only minimal on the random number generator of the ESG. However, our data shows a significant impact of one of these settings within the ESG, namely the seed selection, on relevant financial statement data in the Economic Balance Sheet and ultimately the Solvency II ratio. The endowment business is typical for the German market and the BSM was mainly developed for its valuation. Based on our data the Solvency II ratio for YE 2016 was in the range of typical life insurance companies in the German market. Since focus on the stability of the results, we look at the relative changes and differences in percentage points and not absolute figures. In Table 1 a short comparison of the relevant financial data of two seeds is given.These two seeds are two of in total 30 seeds analysed, and show typical differences and not the maximal as observed in the set.
Relevant financial information | Delta Own Funds | -8.2 |
Delta SCR | 7.2 | |
Delta Risk Margin | 9.9 | |
Final Solveny II ratios | Delta SII Ratio | -14.2 |
Delta SII Ratio [pp] | -35.0 |
Table 1. Comparison of relevant financial data among two seeds.
As we can assume that the BSM model works properly, there are essentially two explanations why the results in the example differ by such a large amount: Either the rates of convergence of the Monte Carlo simulation distinguish or one of the seeds produces a more random sequence than the other.
What does randomness mean?
The term “random” describes the absence of patterns and predictability. For example, it would be correct to say that if is chosen at random according to the uniform distribution from the unit interval , then (by the strong law of large numbers), it satisfies the following property:
-
P1.
Each of the digits - shows up in ’s decimal expansion with frequency ; e.g.
P1 is one example of a randomness property but its satisfaction is certainly not sufficient to guarantee randomness; there are numbers, like , that are intuitively nonrandom, but satisfy P1.
So instead of satisfying only P1, one might require it also satisfy some other (almost-sure) property, P2. But then an example of something intuitively nonrandom satisfying P1 and P2 can be exhibited.
It is then natural to attempt to define randomness by considering only ’s that satisfy all randomness properties. The problem with this approach is that there are too many randomness properties, since the property “not equal to ” holds almost-surely and hence is a randomness property. If a random number were required to satisfy that property for each , there would then be nothing left to call random.
The theory of computation gives a natural way to restrict the kind of randomness property; only those properties that can be effectively checked by a computer. (For sufficiently complicated ’s, a computer cannot check “not equal to ”.)
Definition 1 abstracts the notion of a randomness property by exploiting outer regularity; any probability-zero event can be covered by open set of arbitrarily small probability. Any almost-sure property that one would want a truly random to satisfy is representable as a so-called Martin-Löf test.
Definition 1 ([Nie09]).
A (Martin-Löf) test for randomness is a sequence of open subsets of , with the following properties
-
•
for rational numbers ,
-
•
,
-
•
there is an (abstract) computer program that, given inputs and , outputs and .
A number passes the test if and is called (Martin-Löf) random if it passes every such test.
Furthermore, Definition 1 gives a definition that is inherently not practical; finitely many digits of a given do not determine its randomness. However, from a practioner’s perspective only finite samples are of relevance. As John Von Neumann said, “Any one who considers arithmetical methods of producing random digits is, of course, in a state of sin.” (See [VNWiki])
Random Number Generators.
Before we can explain how to test practically, if a sequence at hand is random or not we have to better understand how a computer produces random numbers and what is meant by seed selection. At the core of every Monte Carlo simulation there is a sequence of random variables which is
-
(i)
uniformly distributed in the interval and
-
(ii)
satisfies the property that the are mutually independent.111Property (i) is not a restriction since there are many ways to transform random variables uniformly distributed on to arbitrary distributed random variables, compare [Gla03], Chapter 2.2. These methods are applied by the economic scenario generators which are used by insurance companies.
A random number generator is a mechanism producing such a
sequence. The economic scenario generator is fed by these numbers. One
of the most ubiquitous random number generators is Mersenne-Twister. Like most other commonly used random number generators it is based on congruence calculations.222An example of a random number generator not relying on congruence is John von Neumann’s middle-square method which is for several reasons not a good method in practice. Since we only intend to familiarize the reader with the underlying ideas but not to treat the topic in its full generality (see e.g. [Gla03], Chapter 2, [Nie92], Chapter 7 and most importantly [PTVF07], Chapter 7 for more details), we explain here only the simple linear congruential generator introduced by Lehmer in [Leh51] which suffices for an overall understanding.
At first we choose a large integer , called the modulus,
an integer , called the multiplier, with , a
number , called the increment, with and a
starting value with . A sequence of
numbers is obtained by the recurrence
From that a sequence in can be derived by taking . In fact, a sophisticated choice of the numbers is essential for getting a good random sequence. This process is called seed selection.333For certain seeds, like , , the resulting sequence is obviously not random, see [Knu98]. Note that the sequence will repeat after applying the recurrence a certain number of times. The repeating cycle is called period. Of course, it is a desirable property that the period is as great as possible. There is a well-known theorem stating that the period is at most and it can also be described precisely when this is the case.
Theorem 1.
(see [Knu98], Chapter 1, Theorem A) The linear congruential sequence defined by and has period length if and only if
-
(i)
is relatively prime to ,
-
(ii)
is a multiple of for every prime dividing ,
-
(iii)
is a multiple of if is a multiple of .
Quality Criteria for Random Number Generators.
Since the
random number generators commonly used in industry are based on
deterministic algorithms444Indeed, it is necessary that the
algortihms be deterministic because of demands by the Solvency II delegated acts [EC15]. which are similar to the linear congruential method they inherit periodicity and do not produce independent random variables in the mathematical sense. Therefore, it is more appropriate to speak of pseudorandom numbers in this context. The main questions of the remainder of our article are now if and how we can distinguish pseudorandom numbers and random numbers. In other words: given one sequence of pseudorandom numbers and one sequence of random number can we decide statistically which one was made up by a computer? The less possible it is to detect that a sequence of numbers has been created by a computer the better the random number generator (or the random seed) is deemed.
In the following we assume that we have a finite sequence of numbers
in at hand. The present article does not aim
to give an exhaustive list of all known tests but should rather be
regarded as a selection of meaningful and functional tests the authors consider to be
particularly intuitive or easy-to-implement in practice. For a more
comprehensive overview we refer to [ES07] and [Knu98]. While
reading this paragraph the reader should always keep in mind
Definition 1 which makes clear that no single test or even
(finite) battery of tests can guarantee that a random number generator
is indeed a good mimic of randomness. Still these tests can improve
our confidence on the random number generator under consideration.
Global Uniformity Test.
A very simple starting point is to calculate the empirical mean and variance of the finite sequence and compare it to the values and expected theoretically. This can be done in a statistical precise way by applying Student’s t-test and Levene’s test. Moreover, the empirical distribution may in a next step be compared graphically to uniform distribution. Finally a goodness-of-fit test like Kolmogorov-Smirnov test, Chi-squared test or Anderson-Darling test may be used to judge statistically if the empirically observed and theoretically desired distribution coincide globally. Note that only a very bad random number generator would fail these basic tests.
Permutation Test.
We choose an arbitrary integral number and consider the tuples for . All the relative orderings555For instance, the relative ordering of the -tuple ist . among the entries of a generic -tuple should be equiprobable, i.e. have probability . By counting the empirical frequency of the orderings in the sample and applying a goodness-of-fit test we may judge if the null-hypothesis that the relative orderings are equidistributed has to be rejected or not.
Serial Test.
A uniformly distributed sequence on is equally dense everywhere. Even -tuples of successive numbers should be uniformly and independently distributed on . The serial test analyses if this property is satisfied. For that purpose we at first partition each copy of into equally long pieces for some integer . By this construction we obtain subcubes of volume and each subcube is therefore expected to contain elements of the finite sequence. The serial test measures the discrepancy between the empiric counts of the subcubes and . The corresponding test statistics is given by
and it is approximately Chi-square distributed with degrees of freedom. Hence, we may at the very end apply a Chi-square test to see if equidistribution holds statistically. The crucial point about the serial test is the choice of . In order to have a (good) approximate Chi-square distribution it is often recommended to have . This gives an upper bound on because otherwise the number of categories would be too large and the test would not be exact enough (see [Knu98], 3.3.2).
Birthday Spacings Test.
The birthday spacings test is a refinement of the serial test. Its name is inspired by the birthday “paradox” which states that the likelihood that some pair of 23 randomly chosen people has the same birthday is .666In the language of the birthday paradox the length of the sub-sequence corresponds to the number of birthdays and the number of cells corresponds to the number of days in a year. As for the serial test we decompose into cells and consider points of the sequence. Let be the cell numbers where these points fall and let be the number of collissions. It is well-known that is approximately Poisson distributed with mean (see [ES07]). A Chi-squared test can then be applied to the sample values of .
![[Uncaptioned image]](https://cdn.awesomepapers.org/papers/9a9477ed-af1e-403f-9761-98c8849f90bd/2d-plot.jpg)
Figure 1. Pairs of pseudorandom numbers.
Spectral Test.
While all tests so far were driven by statistics we complete our list by a purely geometrical test. Figure 1 and Figure 2 were generated using a pseudorandom number sequence similar to the one implemented in Excel.777In fact, we used the linear congruential method with and . This example has been chosen because the described effects become visible already in lower dimensions. They show the sets consisting of all pairs respectively triples . While Figure 1 does not seem to be particularly remarkable at first sight, Figure 2 is astonishing since all points lie on a few planes.
![[Uncaptioned image]](https://cdn.awesomepapers.org/papers/9a9477ed-af1e-403f-9761-98c8849f90bd/3d-plot.jpg)
Figure 2. Triples of pseudorandom numbers.
On the other hand random number generators are neither expected nor desired to produce any symmetric geometric structures. This is a drawback of all random number generators which rely on linear congruence as we will explain now. Let be the maximum distance between lines, taken over all families of parallel straight lines that cover all points . The greater is, the fewer lines suffice to cover all points. In other words, the greater is the less the random number generator produces symmetric structures. Hence, is the two-dimensional accuracy of the random number generator. The concept of accuracy can easily be carried over to higher dimensions , e.g. in dimension we replace lines by planes. Another interpretation of the numbers is that they measure how many digits of -tuples may be regarded as being independent. The difference between pseudorandom numbers and random numbers truncated to multiples of some is that random number sequences have approximately the same accuracy in all dimensions. It has been suggested in the literature to accept a random number generator for application whenever for . Practical implementation of spectral test demands some lines of programing code but can still be achieved with reasonable effort (see [Knu98] for details). The number theory behind the spectral test is explained wonderfully in [Kon03].
Conclusion.
This paper has revealed a significant impact of seed selection on the Solvency II ratio. Moreover, we explained how randomness can be defined in a mathematical rigorous sense and showed how statistical and geometrical tests can be used to distinguish good seeds of an random number generator from bad ones. We strongly suggest that these quality criteria for random number generators will be considered by insurance undertakings, regulators and external auditors as well in the future. Despite proving appropriateness of the interest rate model888The Hull-White model is used in the ESG of the BSM., the software used for calculating the provision for future policy benefits, the management rules and so forth, the random number generator should also be added in the framework for reasons of adequacy consideration.
Acknowledgments.
The authors wish to thank two anonymous actuarial employees of a German insurance company for their calculations and providing the data. Furthermore, we are grateful to referee for his useful comments.
References
- [EC15] European Commission, “Commission Delegated Regulation (EU) 2015/35” (2015).
- [EIO15] EIOPA, “Guidelines on the valuation of technical provisions” (2015).
- [ES07] L’Ecuyer, P., Simard, R.: “TestU01: A C Library for Empirical Testing of Random Number Generators”, ACM Transactions on Mathematical Software, 33 (4), Article 22 (2007).
- [Gla03] Glasserman, P.: “Monte Carlo Methods in Financial Engineering”, Springer (2003).
- [Knu98] Knuth, D.E.: “The Art of Computer Programming”, Vol 2: Seminumerical Algorithms, Addison-Wesley (1998).
- [Kon03] Kontorovich, A.: “From Apollonius to Zaremba: Local-Global Phenomena in Thin Orbit”, Bull. AMS 50,, No. 2, 187–228 (2013).
- [Leh51] Lehmer D.H.: “Mathematical methods in large-scaling computing units”, Proc. 2nd Sympos. of Large-Scale Digital Calculating Machinery, Harvard University Press, 141–146 (1951).
- [Nie92] Niederreiter, H.: “Random Number Generation and Quasi-Monte Carlo Methods”, Number 63 in CBMS-NSF Series in Applied Mathematics, SIAM, Philadelphia (1992).
- [Nie09] Nies, André: “Computability and randomness”, Oxford University Press, 2009.
- [PTVF07] Press, W, Teukolsky, S., Vetterling, W., Flannery, B.: “Numerical Recipes: The Art of Scientific Computing”, Cambridge University Press (2007).
- [Wie82] Wichmann, B.A., Hill, I.D..: “An efficient and portable pseudo-random number generator”, App. Statis. 31, 188-190 (1982).
- [VNWiki] Wikiquote: “John von Neumann — Wikiquote,”, 25 April 2017, Retrieved December 20, 2017 from https://en.wikiquote.org/w/index.php?title=John_von_Neumann&oldid=2246778.
Quinn Culver, Fordham University, 407 John Mulcahy Hall Bronx, NY 0458-5165
E-mail address: qculver@fordham.edu
Dennis Heitmann, BSP Business School Berlin - Campus Hamburg, Am Kaiserkai 1, D-20457 Hamburg
E-mail address: dennis.heitmann@bsp-campus-hamburg.de
Christian Weiß, Hochschule Ruhr West, Duisburger Str. 100, D-45479 Mülheim an der Ruhr
E-mail address: christian.weiss@hs-ruhrwest.de