Multimatricvariate and multimatrix variate distributions based on elliptically contoured laws under real normed division algebras
José A. Díaz-García Universidad Autónoma de Chihuahua
Facultad de Zootecnia y Ecología
Periférico Francisco R. Almada Km 1, Zootecnia
33820 Chihuahua, Chihuahua, México
E-mail: jadiaz@uach.mx
Francisco J. Caro-Lopera
University of Medellin
Faculty of Basic Sciences
Carrera 87 No.30-65
Medellín, Colombia
E-mail: fjcaro@udemedellin.edu.co
Corresponding author
Key words. Multimatrix variate, real normed division algebras, matrix variate, multimatricvariate, random matrices, matrix variate elliptical distributions.
2000 Mathematical Subject Classification. 60E05; 62E15; 15A23; 15B52
Abstract
This paper proposes famillies of multimatricvariate and multimatrix variate distributions based on elliptically contoured laws in the context of real normed division algebras. The work allows to answer the following inference problems about random matrix variate distributions: 1) Modeling of two or more probabilistically dependent random variables in all possible combinations whether univariate, vector and matrix simultaneously. 2) Expected marginal distributions under independence and joint estimation of models under likelihood functions of dependent samples. 3) Definition of a likelihood function for dependent samples in the mentioned random dimensions and under real normed division algebras. The corresponding real distributions are alternative approaches to the existing univariate and vector variate copulas, with the additional advantages previously listed. An application for quaternionic algebra is illustrated by a computable dependent sample joint distribution for landmark data emerged from shape theory.
1 Introduction
A postmodern world of increasingly heuristic and interdisciplinary problems requires integrative solutions that do not always have the answer in the current frameworks of mathematics and statistics. Chaos theory as a paradigmatic element of phenomenal interrelation links more and more random variables. They are not only univariate and vector variables, but matrix variates, due to their great versatility in the description of variability and intrinsic multiple correlation. Likewise, the domain of the variables began to move from the real field to the remaining real normed division algebras, such as complex, quaternionic and octonionic. There the emerging models of multiple natural, exact and engineering sciences have their own scope and they are not restricted by the real field. Given multiple statistical models for marginal distributions in univariate and vector cases, the additional problem of parametric estimation arises from a set of samples provided by the scientific expert user of statistics. Then comes another challenge for statistics, which has historically been resolved by optimising likelihood functions as a joint law for sample distribution. The theory of copulas emerged as a possible solution to the problem of explaining a joint phenomenon of known interrelation. Countless link functions then appear for two variables, however, the intention of a dependent relationship is diluted when using the typical likelihood functions for estimating copula parameters from independent samples. The role of likelihood in the history of statistics is of such transcendence that it is a popular tool for estimating parameters; however, its profound simplicity based on sample independence is never debated. While copulas and similar theories represent the solution to the problem of dependence, the concept of likelihood function, defined as the product of the marginal densities, seems immutable and universal. Recently, the authors have begun the discussion of the role of classical likelihood functions in statistics, showing the differences in application in databases declared dependently probabilistic. Time series in particular are the source of the greatest discrepancy, given the historical effort in proposing models for variance and volatility. But maintaining the postulate of an estimate via likelihood functions on independent data is an obvious contradiction in the face of a temporal process that intrinsically is founded on dependency. A work that launched this new way of reframing likelihood functions appeared recently in Díaz-García et al. [7]. In a real data series, the discrepancy of likelihood function over independent time samples versus realistic likelihood function over dependent samples was observed. The study indicated that decisions diffused about the mean estimated from the database using independent likelihood were in the tails of the dependent expected likelihood distribution. The finding was possible thanks to the definition of the so-termed multivector variate distribution, a natural way of defining a likelihood function of dependent vector samples, further parameterised by a large class of elliptically contoured distributions that are isolated from the choices of popular models. Until then, the mutivector could be an approach to vector copulas, but the implemented theory is designed to address the matrix case, which is still an open problem for copula theory. Then, the matrix version appeared in the form of the so-termed classes of multimatricvariate distributions, which provided the likelihood function or joint matrix distributions with a family of elliptically contoured distributions with a variety of kurtosis and symmetries, see Díaz-García and Caro-Lopera [5]. The choice of distributions that are invariant under the class of elliptically contoured laws within the family of multimatricvariate and multimatrix variate distributions gives robustness to the analysis since it avoids the difficult paradigm of adjusting the link function, if we speak in parallel to the procedure. in vector copulas.
Multimatricvariate distributions specialise in distributions based on the determinant. Still to be defined a class of multiple distributions of random matrices that depended on the trace, the other function that has historically governed the theory of matrix distributions. Recently, the so-termeded multimatrix distributions appear in Díaz-García and Caro-Lopera [6]. Like the multimatricvariate distributions, they maintain the philosophy of probabilistic dependence, computability and a wide class of underlying distributions.
When we surpass the level of random vectors, where the most popular statistical techniques remain, such as copulas, we find immense difficulty for the calculation. We must observe the domain of the matrices, their transformations and Jacobians, and therefore their integration. It is generally difficult because it involves averaging over orthogonal groups, cones and hypercubes. The central, isotropic, non-central and non-isotropic cases also emerge. These integrations comprises the geometric filtering that comes for their definitions and factorisations.
At this point we find that the new multimatricvariate and multimatrix variate distributions on the real field satisfy the first 2 of the 3 conditions that we establish as a paradigm for a robust point of view of joint distributions, namely: 1) modeling of several dependent matrices capable of being combined with all univariate and/or vector variables governed by elliptically contoured models; 2) with strictly verifiable marginals under the trivial case of independence and testable from the dependent sample likelihood; and 3) founded on the same algebraic principles that allow them to be applied indistinctly from the real normed division algebras.
We now address the third desirable characteristic: its versatility in the four unique real normed division algebras.
Until a couple of decades ago it would be unthinkable to unify the theory of real random matrices with complex ones. History shows us that the theory of distributions of the central cases overflowed into an immense effort to characterize the indispensable elements of random matrices, namely: After its establishment, works on the complex case began to appear very slowly and completely conceptually distant from their real analogues. The so-termed statistical theory under real normed division algebras is to statistics what the expected theory of unification of forces is to physics. Its simplification is surprising in that all its results are reduced to modifying a beta parameter that goes from 1 (real) to 2 (complex) to 4 (quaternionic) to 8 (octonionic), see Díaz-García [4] were the transitions to complex, quaternionic and octonion are reached by changing the support group from orthogonal to unitary, compact symplectic or exceptional type.
If the real multimatricvariate distributions [5] and multimatrix variate distributions [6] articles are parallel compared with the results of the present work, an apparent repetition shall appear by its simplicity; Its notation makes the beta parameter open one of the four universes by simply changing the value, although understanding the profound difference that underpins them requires an extensive and difficult literature that began in other areas far from statistics. The non commutativity for quaternions and non associative for octonions promotes a deep research for some applications in those algebras.
We place the above discussion into the setting of two main problems that we shall address in this article.
The interest in multimatricvariate and multimatrix variate distributions has been motivated by the following two situations:
1.
In different areas of knowledge (such as Finance and Hydrology, among others), people are interested in simultaneously modeling two random variables, say and , which are suspected of not being probabilistically independent. On the one hand, the marginal distributions of each variable are known, whether they are and . Typically this problem has been approached assuming that the random variables and are independent and, as a function of joint density of the two-dimensional vector , the product of the marginals, , has been considered. Thus, for example, in this case the likelihood function, given the two-dimensional sample , denoted as is defined as
For some parameter vector which is part of the density .
Alternatively, based on variable changes on a set of independent random variables, the random vector was generated, where now the variables and are not independent and their density function is known joint , such that
Under this approach we have that
See Libby and Novick [24], Chen and Novick [3], Olkin and Liu [28], Nadarajah [26, 27] and Sarabia et al. [30] among many others.
This situation also occurs in other multivariate problems. Then, parallel solutions were proposed in the vector and matrix cases, giving foothold to the study of bi-matrix variate distributions in the real and complex cases. In the last case joint distributions of random matrices, say and dependent with joint density function, termed bimatrix variate distribution, are proposed such that the marginal densities of and are the usual assumptions, see Olkin and Rubin [29], Díaz-García, and Gutiérrez-Jáimez [8, 9, 11], Bekker et al. [1], and [16] and references therein.
2.
In another case, we are interested in defining the likelihood function by a joint function of the sample, but which is not defined as the product of the marginals, that is, the sample is not independent. In the univariate problem, one answer considers the elliptically contoured distribution of the vector as a likelihood function, noting that in reality the elliptically contoured distribution actually defines a distribution family, see Fang et al. [18], Fang and Zhang [17], Gupta and Varga [21] and the reference therein.
Based on the family of matrix variate elliptically contoured distributions, the multimatrix variate and multimatricvariate distributions were proposed as a generalisation of the bi-matrix variate distributions, which are defined as the joint distribution of the dependent random matrices , see Díaz-García et al. [7], and Díaz-García and Caro-Lopera [5, 6]. Thus, the multimatrix variate and multimatricvariate distributions can be used as likelihood functions for a sample of dependent random matrices with certain (usual) marginal distributions. Thus, the likelihood function of the sample is defined as
Under the theory of multimatrix matrix or multimatricvariate distributions, each matrix , into the density can follow a different marginal distribution. This answers the following problem under an independent or dependent samples: Suppose that we have a matrix random sample as follows:
And assume that the matrix contains the parameters of interest. Then the likelihood function can be defined as:
In the bi-matrix variate case, these problems have been studied in the real and complex cases, each giving rise to a series of non correlated publications. Fortunately, in terms of the theory of the real normed division algebras, a unification of the real and complex cases is possible. And an extension to the quaternionic and octonionic algebras is also feasible. It is worth mentioning that the octornionic case is still under research. At this time, they are valid for octornionic matrices and in general it can only be conjectured that they may be valid.
In the present work, the multimatrix variate and multimatricvariate distributions are studied for matrix arguments which elements belong to the real normed division algebras. A brief description of the notation and some Jacobians for real normed division algebras is presented in Section 2. In addition, two more Jacobians are obtained and the definition of the matrix variate elliptically contoured distribution for real normed division algebras is presented. The main results on multimatrix variate and multimatricvariate distributions for real normed division algebras are obtained in Section 3. Some properties and extensions of multimatrix variate and multimatricvariate distribution with more than two different types of distributions in their arguments are studied in Section 4. An example in the quaternionic case is full derived in Section 5.
2 Notation and preliminary results
A detailed discussion of real normed division algebras may be found in Baez [2]. For
convenience, we shall introduce some notations, although in general we adhere to standard
notations.
A vector space is always a finite-dimensional module over the field
of real numbers. An algebra is a vector space that is equipped with a
bilinear map termed
multiplication and a nonzero element termed the unit such
that . As usual, we abbreviate as . We do not assume
associative. Given an algebra, we freely think of real numbers as elements of
this algebra via the map .
An algebra is a division algebra if given with
, then either or . Equivalently, is a division algebra if the
operation of left and right multiplications by any nonzero element is invertible. A
normed division algebra is an algebra that is also a normed vector
space with . This implies that is a division algebra and
that .
There are exactly four normed division algebras: real numbers (), complex numbers
(), quaternions () and octonions (), see
Baez [2]. Taking into account that , , and
are the only normed division algebras; moreover, they are the only alternative
division algebras, and all division algebras have a real dimension of or , which
is denoted by , see Baez [2, Theorems 1, 2 and 3]. In other branches of mathematics,
the parameter is used, see Edelman and Rao [15].
Let be the linear space of all matrices of rank over with distinct positive singular values, where
denotes a real finite-dimensional normed division algebra. Let be the set of all matrices over . The dimension of
over is . Let , then denotes the usual conjugate
transpose.
The set of matrices such that
is a manifold denoted , is termed the Stiefel manifold ( is also known as
semi-orthogonal (), semi-unitary (), semi-symplectic
() and semi-exceptional type () matrices, see Dray and Manogue [13]). The
dimension of over is . In
particular, with dimension over , , is
the maximal compact subgroup of and
consists of all matrices such that
. Therefore, is the
real orthogonal group (), the unitary group
(), compact symplectic group ()
or exceptional type matrices (), for ,
, or , respectively.
We denote by the real vector space of all such that . Let
be the cone of positive definite matrices ; then is an open subset of . Over , consist of symmetric matrices;
over , Hermitian matrices; over , quaternionic
Hermitian matrices (also termed self-dual matrices) and over ,
octonionic Hermitian matrices. Generically, the elements of
are termed
Hermitian matrices, irrespective of the nature of . The
dimension of over is .
Let be the diagonal subgroup of
consisting of all , .
For any matrix , denotes the
matrix of differentials . Finally, we define the measure or volume element
when ,
or , see Dimitriu [12].
If then (the Lebesgue measure in
) denotes the exterior product of the functionally
independent variables
If (or )
then (the Lebesgue measure in or in
) denotes the exterior product of the functionally
independent variables (or denotes the exterior product of the functionally
independent variables, if for all )
The context generally establishes the conditions on the elements of , that is, if
, , or . It is
considered that
Observe, too, that for the Lebesgue measure defined thus, it is required that
, that is, must be a non singular
Hermitian matrix (Hermitian positive definite matrix).
If then (the Legesgue
measure in ) denotes the exterior product of the
functionally independent variables
If then
where . It can be
proved that this differential form does not depend on the choice of the
matrix. When ; defines the unit sphere in
. This is, of course, an - dimensional surface in
. When and denoting by ,
is termed the Haar measure on .
The surface area or volume of the Stiefel manifold is
(1)
where denotes the multivariate Gamma function for the space
, and is defined by
where , denotes the determinant and , see Gross and Richards [20]. If then by we mean the vector formed by stacking the columns of
under each other; that is, if , where for
Below are summarised some Jacobians in terms of the parameter. For a detailed
discussion of this and related issues see Dimitriu [12], Edelman and Rao [15], Forrester [19] and
Kabe [22].
Proposition 2.1.
Let ,
and be matrices of constants, and
a matrices of functionally independent variables
such that . Then
(2)
Proposition 2.2.
Let . If ,
(3)
Proposition 2.3(Singular value decomposition, ).
Assume that , such that with ,
and , . Then
(4)
where
As a consequence of this result, we have the following statement.
Proposition 2.4.
Let , and Then
(5)
with .
Proposition 2.5.
Let Then ignoring the sign, if
(6)
Theorem 2.1.
Assume that and are matrices of functionally independent variables.
i)
Define . Then
(7)
ii)
If , we have
(8)
where .
Proof.
i) Define
and . And observe that
Then by (5), for , and writing these for and we obtain
Then, by the uniqueness of the nonnormalised measure on Stiefel manifold,
. Thus,
And using and , the required result is obtained.
ii). The proof is similar to the preceding exposition given in i).
∎
Corollary 2.1.
Assume that and are matrices of functionally independent variables.
i)
Let . Then
(12)
ii)
If , we have
(13)
Proof.
i). Observing that can be write as , where and
. Its proof is obtained as a particular case to that given
for the theorem. ii). Its proof is analogous to one given to i).
∎
Definition 2.1.
It is said that the random matrix has a matrix
variate elliptical distribution, denoted as , if its density is
(14)
where
(15)
and where , and
are constant matrices.
Finally, note that for constant, making the change of variable and in Fang et al. [18, Equation 2.21, p. 26] we have
(16)
Finally, from Díaz-García, and Gutiérrez-Jáimez [10]
(17)
where and .
3 Multimatrix variate and multimatricvariate distributions
First, note that any new particular distribution indexed by the kernel form part of its density function, then it defines a family of distributions itself in terms of each possible choice of .
Assume that ,
such that , and . Then (14) can be written as
(18)
where , .
Take in account that, only under a matrix variate normal distribution the random matrices are
independent, see Fang and Zhang [17], Gupta and Varga [21] and Fang et al. [18]. In general, the random matrices are
probabilistically dependent.
Theorem 3.1.
Suppose that , with , (remember that ), .
i)
Define . Then, the joint density is given by
(19)
where . This distribution shall be termed multimatrix variate generalised Gamma - Elliptical distribution.
ii)
Let . Hence, the joint density is given by
(20)
where . This distribution shall be called multimatricvariate generalised Whishart - Elliptical distribution.
where . The desired result is obtained make de change of variable in (21) and integrating over , using (1).
∎
Proceeding as Díaz-García et al. [7, Equation 4.2], defining , , and as in Díaz-García and Caro-Lopera [5, Equation (1), p. 216] the following result in general case, is obtained.
Theorem 3.2.
Suppose that has a matrix variate spherical distribution, with , . Then:
i)
If we define , , it is obtained
(22)
where , , , which distribution shall be termed multivariate generalised Gamma distribution.
ii)
The joint density when , . is given by
(23)
where , . This distribution shall be termed multimatricvariate generalised Whishart distribution.
Particular cases of these two distributions have been studied in the literature under a normal distribution in the real, complex and quaternionic cases, see Nadarajah [26], Fang and Zhang [17], Libby and Novick [24], Díaz-García and Caro-Lopera [5] and Li and Xue [23], among many other authors.
Theorem 3.3.
Assume that , with , .
i)
Define and , . The joint density
is given by
(24)
where , and , . This distribution shall be termed multimatrix variate generalised Gamma - Pearson type VII
distribution.
ii)
Define and , . Then, the joint density is given by
(25)
where , and , . This distribution shall be called multimatricvariate generalised Wishart-T distribution.
Proof.
i)
The density (24) is follow from (19) defining , . Hence by Proposition 2.1, we have
Then the required result follows.
ii)
Now, from (20), making the change of variable , and considering that
which shall be termed multimatrix variate Pearson type II distribution.
ii)
The multimaticvariate marginal density is
(31)
which shall be termed multimatricvariate Pearson type II distribution.
Where , and , .
Proof.
i)
First, (30) is archived integrating (28) over using (16).
ii)
Similarly, (31) is follows integrating (29) over using (17).
∎
Theorem 3.5.
Assuming the hypotheses of Theorem 3.3 and defining , .
i)
Then the joint density is
(32)
This distribution shall be termed multimatrix variate generalised Gamma-beta type II distribution.
ii)
Then the joint density is
(33)
This distribution can be termed multimatricvariate generalised Wishart-beta type II distribution.
Proof.
Multimatrix variate and multimatricvariate density functions (32) and (33) are obtained considering the change of variable , in expressions (24) and (25), respectively. Observing that by (2.4)
Whose distribution can be termed multimatrix variate beta type II distribution.
ii)
And associated marginal density function of (33) is given by
(35)
This distribution shall be termed multimatricvariate beta type II distribution.
Proof.
The proof of both result are obtained integrating (32) and (33) with respect to and using (16) and (17), respectively. Or proceeding exactly as in the proof of Theorem 3.5
∎
The density function (34) contain as particular case to the distribution in real case, proposed in Muirhead [25, Problem 3.18, p.118].
This distribution shall be termed multimatrix variate generalised Gamma-beta type I distribution.
ii)
Then the joint density , where , is
(37)
This distribution can be termed multimatricvariate generalised Wishart-beta type I distribution.
Proof.
Making the change of variable , in the densities (28) and (29), respectively, and noting that
where , . The proof is archived, integrating over using (1). In which case
∎
Integrating (36) and (37) with respect to and using (16) and (17), respectively; we obtain the marginal densities of multimatrix variate and multimatricvariate beta type I distribution. Summarising:
Corollary 3.4.
i)
The density function can written as
(38)
Whose marginal distribution shall be termed multimatrix variate beta type I distribution.
ii)
In this case, the density function
is
(39)
This marginal distribution shall be named multimatricvariate beta type I distribution.
Finally
Theorem 3.7.
Assume that has a matrix variate spherical distribution, with , , . Define and
, .
Then, the joint density
is given by
(40)
where , , . This distribution shall be named multimatrix variate generalised Gamma - generalised Wishart distribution.
Defining with and proceeding as in the proof of Theorem 3.6, the result is immediate.
∎
4 Some properties and extensions
Now, multimatrix and multimatric variate distributions for two and three matrix arguments can be achived. Then we can obtain two or more different classes of marginal distributions. The metodology can be extended to more than three different marginal distributions. In addition, the inverse distributions of some multimatrix and multimatric variate distributions are also obtained.
Theorem 4.1.
Let be has a matrix variate spherical distribution, with .
i)
Define , , and , where . The joint density is given by
(41)
where , , , such that . This distribution shall be termed threematrix variate generalised Gamma - Pearson type VII - Pearson type II distribution.
ii)
Let be , , , and . Then the joint density is given by
(42)
where , , , and . The distribution of shall be termed trimatricvariate generalised Wishart-Pearson VII-Pearson type II distribution.
The density function of the termed trimatrix variate generalised Gamma - Pearson type VII - Pearson type II
distribution, is given by
(45)
ii)
Similarly, the density is
(46)
This density shall be termed trimatricvariate generalised Wishart-beta type II-beta type I distribution.
Where ,, , , , and .
Proof.
The proofs of (45) and (46) are follows from (41) and (42), respectively; making the change of variables , , and by Proposition 2.4,
The desired results are archived, integrating over using (1). Moreover
∎
Additionally, we are interested in the distributions of the inverse of one or more of the arguments in the multimatrix variate or multimatricvariate distributions, which shall be termed inverse multimatrix variate or inverse multimatricvariate distributions.
Theorem 4.2.
Define , .
i)
Assume that have a multimatrix variate beta type I distribution. The join density function
is
(47)
where , , , .
ii)
Consider that have a multimatricvariate beta type I distribution. Then, the join density function
is given by
(48)
where , , ,
Proof.
The density functions (47) and (48) are archived from (38) and (39), respectively, defining , , and using the Proposition 2.5.
∎
The parameter domain of the real normed division algebra in the multimatrix and multimatric variate distributions can be extended as in the real and complex cases. However, the statistical and/or geometrical interpretation, perhaps can be lost. In any case, these distributions are valid if we replace by , by and by . Where the are complex number with positive real part. From practical point of view, this parameter domain extension its allow to use nonlinear optimisation rather integer nonlinear optimisation in the procedure of estimation, among other possibilities.
Each distribution can be reparametrised in order to obtain a general expression for its density function. As in the normal case, the expressions obtained in this article appear in their standard form.
If follows a multimatricvariate or multimatrix variate generalised Whishart distribution, with density function (23). Define , , , then by Proposition 2.2, we have that
Then the density is
Hence
where , .
5 Example
In this Section we provide an example in quaternions. Finding a random suitable data base for this algebra is difficult, then we try first to explain a way of generating a number of applications by using data bases of the literature of shape theory.
We start with a known representation of a quaternion number in terms of complex matrices. Let be a quaternion, then can be written in terms of a the following matrix of complex entries:
(49)
Thus can be seen as an array of 4 complex points, with a double symmetry: and are symmetric respect the axis; meanwhile and are symmetric about the axis.
Now, shape theory deals, for example, with sets of planar figures summarised by corresponding landmarks between populations in order to obtain means, variability and discrimination via statistics on certain quotients spaces. Instead of the noisy Euclidean space, the statistics is performed with equivalent classes after filtering out some non meaning geometrical information, such as scaling, translation, rotation, reflection, etc.. Thus, a landmark data with the referred symmetries can be set into a vector variate quaternion sample for estimation of the extended real parameters of the distributions here derived. We focus on the mouse vertebra landmark data given for example in Dryden and Mardia [14]. The sample consists of 23 small, 23 large and 30 control second thoracic vertebrae with 60 landmarks. After transforming the data for a suitable application of the complex matrix representation, we have in figure 1 an example of an small vertebra.
Figure 1: A 60 landmark small vertebra with high symmetry, divided in four parts for suitable application of a quaternionic complex matrix representation.
The required sample most follows the symmetric suggestions of . For getting this end, just cut the bones on landmarks and and two sectors are obtained: , from landmarks to ; and, , from landmarks to . Now the free part can be placed symmetrically to sector (landmarks to ) as a reflection on the imaginary axis . Finally, for each bone we have 14 pairs of landmarks , each one representing the quaternion , where . Namely, for each , the first landmark belongs to the sector (symmetric to sector , respect axis) and it is paired with the second landmark (symmetric to the translated sector ). Summarising, the sample for the three classes of bones consists of upper landmarks 2 to 29, distributed by the pairs . With each pair providing a quaternion, we have the following three dependent samples: 23 quaternion vectors of size 14 for the small group (Figure 2(a)), 23 quaternion vectors of size 14 for the large class (Figure 2(b)), and 30 quaternion vectors of size 14 for the control set (Figure 2(c)).
(a)
(b)
(c)
Figure 2: (a) Dependent sample of 23 quaternion vectors of size 14 for the small group. (b) Dependent sample of 23 quaternion vectors of size 14 for the large class. (c) Dependent sample of 30 quaternion vectors of size 14 for the control set. The sector (green) with complex constitutes the first two components of the quaternion , and the sector (red) with complex indexes the last two components of .
The mouse vertebra landmark data has been studied in several works based on the classical assumptions of normality and independent probabilistic sample, see for example Dryden and Mardia [14] and the references therein. Gaussian restriction can not deal properly with the outlier shapes, meanwhile assuming a sample of independent small and large bones just facilitates the estimations via likelihood function, but it seems to be out of underlying sample extraction and population description given in the original source of the experiments back to the earlies 70s.
This work provides two solutions of the previous problems. First, within the multimatricvariate and multimatrix variate distributions, the choice of distributions that are invariant under the family of elliptically contoured distributions. A notorious advantage facing the lack of knowledge about the random matrix law for the landmark data. And second, the distributions here derived give join density functions of dependent matrices, a fact eliminates the historical controversy between the probabilistic independence of the sample data. Another important fact, hidden for the classical studies based on Gaussianity (independently equivalent) resides on the real normed division algebra supporting the landmark data. In particular, the use of quaternions via the complex matrix representation integrates in the study the high symmetry of the bones. Finally, for this data, the distributions under quaternions are extremely simple, they are reduced to vectors instead of the real matrix setting given in the referred studies.
For the sake of illustration and simple computation, we consider the multimatrix variate beta type II distribution (34) as the likelihood function dependent sample invariant under the quaternionic elliptically contoured distribution.
For the small and large samples, , and , meanwhile in the control group . In the three samples we use (34) for the maximum likelihood estimates of and . Here, is a quaternion vector, then is a real value for all . Thus the likelihood in (34) takes the form:
The computations are performed in the Optimx package of R under several methods of optimisations and a wide range of seeds for a consistent estimation. The results are given in the following table
Sample
Small
0.040714
45.194923
Large
0.03294941
12.82131179
Control
0.03765324
32.99296063
Finally, we can use again the symmetry of the modified mouse vertebra data in order to define quaternion matrices . In this case the first column is the same formed by sectors and , and the second column corresponds to sector (the reflection of sector ) and the translated and reflected sector , which is symmetric respect sector . Then we obtain the quaternion matrix , , and the likelihood function (34) can be computed in terms of the latent roots of the quaternionic Hermitian matrices .
The use of high symmetric planar landmark data for characterising quatenion applications opens an interesting perspective in shape theory, in particular the computation of probabilities are tractable expressions which can be implemented easily. Finally, under these symmetries, (49) shows a way of avoiding the quaternions by performing a study based on block complex matrices, a real normed division algebra more easily understood and handled because its commutative property. These aspects are taking part of a future work.
6 Conclusions
This work has set the multimatrix and multimatric variate distributions in a unified approach for the real normed division algebras. The distributions are also indexed by the class of elliptical contoured models. The main advantages of the proposed theory are in agreement with the current paradigms of the distribution theory: 1) The distributions are computable in a simple PC. 2) After integrations, the results can be seen as joint distributions of several combinations of scalars, vectors and matrix variates, some of them invariant under the family of matrix variate elliptically contoured distributions. An ideal property for situations where the marginals and joint distributions are completely unknown. 3) The multimatrix variate and multimatricvariate distributions share the philosophy of copula theory, but without the restriction to reals, vectors and likelihood copula parameter estimation based on independent distributions. 4) The join distributions can be seen as likelihood functions of probabilistically dependent matrices, as a more real alternative a likelihood function of independent sample variate. 5) The multimatrix variate and multimatricvariate distributions emerged into as unified point of view for all the real normed division algebras, just modulated by a parameter . 6) The properties presented here are valid for all real normed division algebras, then several applications can be switched according to the sample dependent origin. Finally, a application of symmetric landmark data popularised in real shape theory is translated into the quaternion setting. Current a research about multiple computation of probabilities on symmetric cones is considered.
References
Bekker et al. [2011]
Bekker, A., Roux, J. J. J., Ehlers, E., Arashi, M. (2001).
Bimatrix variate beta type IV distribution: relation to Wilks’s statistics
and bimatrix variate Kummer-beta type IV distribution.
Communications in Statistics - Theory and Methods, 40, 4165-4178.
Baez [2002]
J. C. Baez, J. C. 2002.
The octonions,
Bulletin of the American Mathematical Society, 39, 145–205.
Chen and Novick [1984]
Chen, J. J., Novick, M. R. (1984).
Bayesian analysis for binomial models with generalized beta prior distributions.
Journal of Educational Statistics, 9, 163–175.
Díaz-García [2014]
Díaz-García, J. A., (2014).
Integral properties of zonal spherical functions, hypergeometric functions and invariant polynomials.
Journal of the Iranian Statistical Society, 13 (1), 83-124.
Díaz-García and Caro-Lopera [2022]
Díaz-García, J. A., Caro-Lopera, F. J. (2022).
Multimatricvariate distribution under elliptical models.
Journal of Statistical Planning and Inference, 2016, 109-117.
Díaz-García and Caro-Lopera [2024]
Díaz-García, J. A., Caro-Lopera, F. J. (2024).
Multimatrix variate distribution.
https://arxiv.org/abs/2405.02498.
Díaz-García et al. [2022]
Díaz-García, J. A., Caro-Lopera, F. J., Pérez Ramírez, F. O. (2022).
Multivector variate distributions: An application in Finance.
Sankhyā, 84-A,Part 2, 534-555.
Díaz-García, and Gutiérrez-Jáimez [2010a]
Díaz-García, J. A., Gutiérrez-Jáimez, R. (2010a).
Bimatrix variate generalised beta distributions.
South African Statistical Journal, 44, 193-208.
Díaz-García, and Gutiérrez-Jáimez [2010b]
Díaz-García, J. A., Gutiérrez-Jáimez, R. (2010b).
Complex bimatrix variate generalised beta distributions.
Linear Algebra and its Applications, 432 (2-3), 571-582.
Díaz-García, and Gutiérrez-Jáimez [2011a]
Díaz-García, J. A., Gutiérrez-Jáimez, R. (2011a).
On Wishart distribution: Some extensions.
Linear Algebra and its Applications, 435, 1296-1310.
Díaz-García, and Gutiérrez-Jáimez [2011b]
Díaz-García, J. A., Gutiérrez-Jáimez, R. (2011b).
Noncentral bimatrix variate generalised beta distributions.
Metrika, 73(3), 317-333.
Dimitriu [2002]
Dimitriu, I. (2002).
Eigenvalue statistics for beta-ensembles.
PhD thesis, Department of Mathematics.
Massachusetts Institute of Technology, Cambridge, MA.
Dray and Manogue [1999]
Dray, T., Manogue, C. A. (1999).
The exceptional Jordan eigenvalue problem,
International Journal of Theoretical Physics, 38(11), 2901–2916.
Dryden and Mardia [1998]
Dryden, I. L., Mardia, K. V. (1998).
Statistical Shape Analysis,
Wiley, Chichester.
Edelman and Rao [2005]
Edelman, A. Rao, R. R. (2005).
Random matrix theory.
Acta Numerica, 14, 233–297.
Ehlers [2011]
Ehlers, R. (2011).
Bimatrix variate distributions of Wishart ratios with application.
Doctoral dissertation, Faculty of Natural & Agricultural Sciences University of Pretoria,
Pretoria. http://hdl.handle.net/2263/31284.
Fang and Zhang [1990]
Fang, K. T., Zhang, Y. T. (1990).
Generalized Multivariate Analysis,
Science Press, Springer-Verlag, Beijing.
Fang et al. [1990]
Fang, K. T., Zhang, Y. T., Ng, K. W. (1990).
Symmetric Multivariate and realted distributions.
Springer-Science+Business Media, B. V., New Delhi.
Forrester [2009]
Forrester, P. J. (2009).
Log-gases and random matrices.
To appear. Available in: http://www.ms.unimelb.edu.au/~matpjf/matpjf.html.
Gross and Richards [1987]
Gross, K. I., Richards, D. ST, P. (1987).
Special functions of matrix argument I: Algebraic induction zonal polynomials
and hypergeometric functions.
Transactions of the American Mathematical Society, 301(2), 475–501.
Gupta and Varga [1993]
Gupta, A. K., Varga, T. (1993).
Elliptically Contoured Models in Statistics.
Kluwer Academic Publishers, Dordrecht.
Kabe [1984]
Kabe, D. G. (1984).
Classical statistical analysis based on a certain hypercomplex multivariate
normal distribution.
Metrika, 31, 63–76.
Li and Xue [2009]
Li, F., Xue, Y. (2009).
Zonal polynomials and hypergeometric functions of quaternion matrix argument,
Communications in Statistics - Theory and Methods, 38(8), 1184-1206.
Libby and Novick [1982]
Libby, D. L., Novick, M. R. (1982).
Multivariate Generalized beta distributions with applications to utility assessment.
Journal of Educational Statistics, 7, 271–294.
Muirhead [2005]
Muirhead, R. J. (2005).
Aspects of Multivariate Statistical Theory.
John Wiley & Sons, New York.
Nadarajah [2007]
Nadarajah, S. (2007).
A bivariate gamma model for drought.
Water Resources Research, 43, W08501, doi:10.1029/2006WR005641.
Nadarajah [2013]
Nadarajah, S. (2013).
A bivariate distribution with gamma and beta marginals with application to drought data.
Journal of Applied Statistics, 36(3), 277-301.
Olkin and Liu [2003]
Olkin, I., Liu, R. (2003).
A bivariate beta distribution.
Statistics and Probability Letters, 62, 407–412.
Olkin and Rubin [1964]
Olkin, I., Rubin, H. (1964).
Multivariate beta distributions and independence properties of Wishart distribution.
Annals of Mathematical Statistics, 35, 261–269. Correction 1966, 37(1), 297.
Sarabia et al. [2014]
Sarabia, J. M., Prieto, F., Jordá, V. (2014).
Bivariate beta-generated distributions with application to well-being data.
Journal of Statistical Distributions and Applications, 1:15. http://www.jsdajournal.com/content/1/1/15.