Inverse Clustering of Gibbs Partitions via independent fragmentation and dual dependent coagulation operators
Abstract
Gibbs partitions of the integers generated by stable subordinators of index form remarkable classes of random partitions where in principle much is known about their properties, including practically effortless obtainment of otherwise complex asymptotic results potentially relevant to applications in general combinatorial stochastic processes, random tree/graph growth models and Bayesian statistics. This class includes the well-known models based on the two-parameter Poisson-Dirichlet distribution which forms the bulk of explicit applications. This work continues efforts to provide interpretations for a larger classes of Gibbs partitions by embedding important operations within this framework. Here we address the formidable problem of extending the dual, infinite-block, coagulation/fragmentation results of Pitman [41], where in terms of coagulation they are based on independent two-parameter Poisson-Dirichlet distributions, to all such Gibbs (stable Poisson-Kingman) models. Our results create nested families of Gibbs partitions, and corresponding mass partitions, over any We primarily focus on the fragmentation operations, which remain independent in this setting, and corresponding remarkable calculations for Gibbs partitions derived from that operation. We also present definitive results for the dual coagulation operations, now based on our construction of dependent processes, and demonstrate its relatively simple application in terms of Mittag-Leffler and generalized gamma models. The latter demonstrates another approach to recover the duality results in [41].
keywords:
[class=AMS]keywords:
, and
1 Introduction
Gibbs (random) partitions, as developed in [42, Theorem 8] and subsequently [17, 43], say, of the integers where form remarkable classes of infinitely exchangeable random partitions [43, Section 2.2], whose distribution is defined consistently for each as the restriction of an exchangeable partition probability function (EPPF) on that has the distinguishing feature of having Gibbs (product) form for each In principle, the Gibbs form is attractive in terms of practical implementation, and much is known about the properties of this class, including practically effortless obtainment of otherwise complex asymptotic results potentially relevant to applications in combinatorial stochastic processes, random tree/graph growth models and Bayesian statistics. See, for example, [4, 6, 10, 13, 15, 19, 22, 23, 25, 33, 34, 38, 44, 47, 48].
The class is defined for each here we shall focus on the case where which may be generated by an -stable subordinator in relation to the work [39, 46, 42, 43] on excursion lengths of Bessel processes. In particular, let denote a strong Markov process on whose normalized ranked lengths of excursions, follow a Poisson Dirichlet law with parameters , for as discussed in Pitman and Yor [46]. Denote this law as on the space of mass partitions summing to one, Let denote its local time starting at and let , denote its inverse local time. In this case, is an -stable subordinator. For each where is the inverse local time at with density and Laplace transform Due to the scaling identity (see [45]),
(1.1) |
the local time up to time follows a Mittag-Leffler distribution with density satisfying
(1.2) |
The process for independent of , is called a exchangeable bridge, see [6] for this terminology, with the equivalence
In turn, sampling that is, , produces unique values and a -partition of where with size for with an EPPF denoted by the ,
(1.3) |
where, for any non-negative integer , denotes the Pochhammer symbol. From this, the probability of the number of blocks can be expressed as , where denotes the generalized Stirling number of the second kind.
1.1 -stable Poisson-Kingman distributions and Gibbs partitions
Now conditioning on (or ) leads to the distribution of , and for a non-negative function with one may, as in [42], define the -stable Poisson-Kingman distribution with mixing distribution and write defined as
Setting , for , leads to corresponding to the important two-parameter Poisson-Dirichlet distribution as described in [39, 40, 42, 43, 46], whose size biased re-arrangement, say, where is the two-parameter Griffiths-Engen-McCloskey distribution, and is widely used in applications [6, 26, 39, 40, 43]. The inverse local time at of a process with lengths say, has density , and the corresponding local time at or its -diversity, denoting that it has a generalized Mittag-Leffler distribution with density . When , is its corresponding bridge defined similarly as , and sampling variables from , as in the case, leads to the general class of Gibbs partitions of , with an EPPF , denoted by as in [17, 42],
(1.4) |
where, using the interpretation of the expressions derived by [17, 42] in [25],
In the first expectation, is evaluated for the case with The second equality follows from the fact that such a conditional random variable equates in distribution to a variable with density such that (pointwise), as in [25, eq. (2.13), p. 323],
(1.5) |
where variables in each ratio are independent, and throughout, denotes a random variable. It is noteworthy that (1.5) indicates that Mittag-Leffler variables play a role in the general class of Gibbs partitions.
1.2 Interpreting Gibbs partitions via infinite block fragmentation and coagulation operations
While the general class of distributions and the corresponding Gibbs partitions exhibit many desirable properties, most choices of do not have any particular interpretation. The most notable exceptions are the important distributions which dominate the broad literature. There are some additional examples, such as the generalized gamma and Mittag-Leffler classes. Of particular interest to us are classes that correspond to nested families of mass partitions, whose marginal distributions follow some explicit collection of distributions, and whose Markovian dynamics (dependence structure) may be described by some operations on and hence equivalently on spaces of integer partitions. The two most notable examples of such families are the nested families of the form represented by the Poisson-Dirichlet laws and the laws The first such collection is related to (dual) operations of size-biased deletion and insertion, as described in [46, Propositions 34 and 35], and the latter may be constructed by a single block fragmentation operation, fragmenting successively the size-biased pick of the indicated families, leading to, in the initial case of an increase of to and inversely by a dual coagulation operation as described in [14]. It is notable that the operations of size-biased deletion, as described in [39, 42, 43], and single-block fragmentation can be applied in principle to general families taking values on whereas the latter (dual) operations involve the usage of independent beta distributed variables, which are particular to the distribution. Common descriptions, for extensions of size-biased deletion involving general can be deduced from [39, 42], whereas fragmentation by has been treated in [25, 31]. Results for specific examples of requiring more detailed analysis, are also discussed in those and related works.
This work continues efforts to provide interpretations for larger classes of Gibbs partitions by embedding important operations within this framework. Here we address the formidable problem of extending the dual, infinite block, coagulation/fragmentation results of Pitman [41], that is, for a dual relationship between and with appropriate coagulation and fragmentation operations indicated, as in [43, Section 5.5], by the following diagram
(1.6) |
This creates nested families of mass/integer partitions having laws Setting and the (homogeneous) continuous-time Markov operator coincides with the semi-group of the Bolthausen-Sznitman coalescent [9], as discussed in [6, 41, 43]. Briefly, following [6, 43], we describe the fragmentation operator, hereafter denoted as and corresponding coagulation operator. For the operator is defined as, for denoting the ranked re-arrangement of masses,
(1.7) |
where, for each the collection are iid mass partitions, taken independent of the input There is also the property for all In terms of partitions , one is shattering each block of the partition, say, with size by an independent partition of elements. We now proceed to first describe the coagulation for more general independent laws on by using the equivalence in terms of compositions of exchangeable bridges as in [43, Lemma 5.18] or [6]. Suppose that and are independent exchangeable bridges defined for and in with respective distributions and Then, for the input is equivalent to the ranked re-arrangement of masses formed by the composition for each or simply where has distribution Letting denote a bridge, the result in [41], as indicated from left to right in (1.6), corresponds to
Remark 1.1.
Note and the same is true for other variables with
In this work, for and general where we shall apply the same independent operator leading to explicit identification of laws, and calculations for nested families of mass partitions and corresponding Gibbs partitions over This represents the primary focus of our work, however we also describe in detail how to construct a natural dual coagulation operation via dependent compositions of bridges and corresponding mass partitions, and demonstrate how easy it can be applied. We note that, in the literature, coagulation operations of this sort are generally defined for independent processes as in [43, Lemma 5.18]. While, in general, fragmentation and coagulation operations are clearly defined and hence straightforward to apply, the formidable challenge is to identify the resulting relevant distributions, and find ones with tractability. The duality result of [41], as described in (1.6), is achieved by working with the EPPF’s of the corresponding random partitions of In principle one may try such an approach to identify the various laws associated with or attempt via the explicit constructions of exchangeable bridges. However, neither approach seems feasible. Here working on the space of mass partitions, we show, in Sections 2 and 3, that the classes of (marginal) distributions and with
(1.8) |
may be interpreted as being equivalent in distribution to those arising from and the marginal distribution of the corresponding coagulator which leads to respectively. In Sections 4.1-4.3, we obtain remarkable calculations for Gibbs partitions, and related identities, derived from the operation. Sections 4.4 and 4.5, provide interesting identities, fixed point equations, and asymptotic results, which has interpretations in relation to and intervals as discussed in [3, 43]. Sections 4.6 and 5 provide developments in relation to the Mittag-Leffler class [25]. Section 6 demonstrates how to use our dual coagulation and fragmentation operations to easily identify all the relevant laws, and constructs duality results for generalized gamma models, and size biased extensions. This presents another approach to recover the duality results in [41] for all
For nested models primarily related to the fragmentation operator see [5, 12]. In addition, [48], see also [16], applies the coag/frag duality on the space of partitions of to -gram natural language models. This represents an application in Bayesian statistical machine learning involving the usage of inverse clustering (via fragmentation) and merging (via coagulation) on the space of partitions of Related to this, [32] constructs (nested) hierarchical network/graph models using the coagulation fragmentation operations in [41] and also [14]. For some other references on Gibbs-partitions and -stable Poisson-Kingman models, see [4, 10, 13, 21, 24, 25, 36, 44]. See [22, 23] for other occurrences of the coag/frag operators in the setting.
2 fragmentation of mass partitions
In order to achieve our results, we work with independent stable subordinators and and representations of their relevant quantities under the independent and distributions. The corresponding independent local time processes are and , satisfying (1.1) and (1.2), with local times at denoted respectively as and , playing the role of , as we have described, and otherwise following the more detailed description in [8], as it relates to the special case of the duality [41, Theorem 14 and Corollary 15]. In particular, is formed by the independent coagulation, , equivalent to, as in [6, 7, 8, 43],
(2.1) |
and has local time at and inverse local time at Conversely,
Remark 2.1.
Note there is the well known distributional equivalence However, in the case of interpretation of the coagulation, as in (2.1), the order matters and thus we will only use .
Define
(2.2) |
such that the conditional distribution of may be expressed in terms of that of the transformed variable as,
(2.3) |
which is equivalent to the conditional density of given .
Theorem 2.1.
Let with local time at say having density For any choice of let denote an fragmentation operator independent of Then,
-
(i)
where
That is, it has a local time at time say, , with density .
- (ii)
-
(iii)
The conditional distribution of is equivalent to the distribution of which is
(2.4) for defined in (2.2).
Proof.
Let denote the expectation with respect to the joint law of where with local time at , with density , and independent of this, are iid mass partitions. Consider The distribution of is characterized, for a measurable function by
(2.5) |
But, from [8, 41], as described in (1.6), this is equivalent to, for
(2.6) |
It follows that has distribution not depending on Using this, the scaling property and elementary arguments to describe the joint density, it follows that the expectation (2.6) can be expressed as
which can also be expressed as,
for yielding the results. ∎
Recall that for any
Corollary 2.1.
Suppose that with Then, where Hence, where
3 Duality via dependent coagulation
We now describe how to construct dependent coagulations to complete the dual process of recovering from the coagulation of Our results show how specification of leads to a prescription to identify the laws of without guess-work.
Recall that for the independent mass partitions described in (2.1), the process of coagulation yields an inverse local time at for to be For as described above, we consider the dependent pair with joint law, say, characterized by
(3.1) |
with , and the notation referring to an expectation evaluated under the joint law of the independent and distributions. We use this for clarity, but will suppress it when it is clear we are referring to such variables. Equivalently, by conditioning and scaling properties, the joint law of is given by
(3.2) |
Remark 3.1.
For further clarity, we may use the notation
In addition, for collections of iid variables independent of define random distribution functions (exchangeable bridges), for ,
(3.3) |
Remark 3.2.
It follows that when for is independent of Hence, and .
Proposition 3.1.
For let have a joint distribution, specified by (3.1) or equivalently (3.2), such that and are bridges defined in (3.3). Let be the ranked masses of the bridge defined by the composition Then, is equivalent to the coagulation of by and there are the following properties.
-
(i)
-
(ii)
The marginal distribution of where
(3.4) and the corresponding inverse local time has density .
-
(iii)
The distribution of is where
Proof.
We first recall from (2.1) that under independent and laws, the bridge follows the law of a bridge with inverse local time at Hence, under the joint law of specified by (3.1), it follows that, for
showing that is a bridge and thus in statement (i). Statements (ii) and (iii) follow from straightforward usage of (3.2). ∎
The next result shows that is equivalent to
Proposition 3.2.
For let have a joint distribution, specified by (3.1) or equivalently (3.2), such that and are bridges defined in (3.3). Let be the ranked masses of the bridge defined by the composition , with inverse local time at denoted as with density Then, by a change of variable, the joint distribution can be expressed as
(3.5) |
That is, the joint distribution of is equivalent to , for all , and it is given by
The next Corollary provides an answer to when or under which situation the pair specified by the coagulation are in the same family of distributions, specifically, distributions of the form where as in (3.4).
Corollary 3.1.
For consider the settings in Proposition 3.1 where now and is obtained by the coagulation equivalent to ranked masses of Then, the variables have the following marginal (or conditional) distributions
-
(i)
, for .
-
(ii)
where has density for
(3.6) -
(iii)
The distribution of is where
Proof.
Remark 3.3.
In Corollary 3.1, has distribution where .
4 Gibbs partitions of derived from
Recall from [42, 43] that when , is equivalent in distribution to and has the associated Gibbs partition of described by the
(4.1) |
(4.2) |
and being the conditional density of corresponding to a random variable denoted as as otherwise described in (1.5) with in place of Note, furthermore, as in [25], this means for We use these facts to obtain interesting expressions for -Gibbs partitions equivalent to those arising from the operator.
4.1 Gibbs partitions of of equivalently of
Recall from Theorem 2.1 that the distribution of is equivalent to that of , with distribution denoted as in (2.4), where is a ratio of stable densities and hence does not have an explicit form for general We now present results for the EPPF of the Gibbs partition of We first note that since is equivalent in distribution to with density the can be expressed as
where the first integral term is the density of divided by and does not have an obvious recognizable form. However, we can use the approach in [24] to express in terms of Fox- functions [37], leading to an expression for the EPPF in terms of Fox- functions in the Appendix.
The next result provides a more revealing expression which is not obvious.
Theorem 4.1.
The of the Gibbs partition of can be expressed as
(4.3) |
where is the distribution of the number of blocks in a partition of , and is the conditional density of , for the number of blocks in a partition of with being equivalent to the inverse local time at of .
Proof.
The expression for the is the conditional distribution of a partition of given The joint distribution may be expressed as in (4.3) in terms of the marginal EPPF and the conditional density of It remains to show that agrees with the expression in (4.3) as indicated. Recall that and hence the corresponding inverse local time at is corresponding to the coagulation operation dictated by as expressed in (2.1). Sampling from that is, according to variables it follows that this procedure produces a partition of with blocks, where the two components are independent. Furthermore, the order matters, giving the interpretation as the number of blocks to be merged, according to a partition of for . Now from [25], Hence is equivalent to which, using (4.1), leads to the description of the density of appearing in (4.3). ∎
Remark 4.1.
The result above is equivalent to showing that which can be deduced directly using the subordinator representation [25, Theorem 2.1 and Proposition 2.1] and decompositions of beta variables.
We now describe the distribution of the number of blocks and its limiting behavior.
Corollary 4.1.
Consider the of a partition of as in (4.3), for each Let denote the corresponding random number of unique blocks. Then, for
and, as where is equivalent in distribution to that of for
Proof.
The distribution follows as a special case of known properties of the distribution of the number of blocks of Gibbs partitions, and is otherwise easy to verify directly from the EPPF. The limiting distribution follows as a special case of [42, Proposition 13]. ∎
4.2 EPPF of
Recall from [17, 42], see also [25], that if with , then the of its associated Gibbs partition of is described as
(4.4) |
where and, for clarity, is the number of blocks of a partition of
Theorem 4.1 leads to the EPPF corresponding to or any variable in having the same distribution.
Proposition 4.1.
Suppose that for where Then, the of the associated Gibbs partition of can be expressed as
(4.5) |
and there is the identity, for
Remark 4.2.
The expression in (4.5) provides a description of any mass partition with distribution where regardless of whether or not it actually arises from a fragmentation operation.
4.3 Generating partitions via fragmentation of partitions
While the EPPF’s (4.3) and (4.5) are quite interesting from various perspectives, it is not entirely necessary to employ them directly to obtain random partitions from and A two-stage sampling scheme may be employed utilizing the dual partition-based interpretation of the operator. The following scheme can be deduced from Bertoin [6], see also [41, 43].
-
1.
Generate iid partitions of say, where, for each with blocks.
-
2.
Independent of this, for each fixed generate a partition of , say, , where denotes the number of blocks.
-
3.
For , Consider the pairs and fragment by according to
-
4.
The collection (arranged according to the least element) constitutes a partition of with
(4.8) where and given is equivalent to the number of blocks in a partition of conditionally independent for
-
5.
Replace Step 2 with a partition of to obtain a partition of
Remark 4.3.
The scheme above requires sampling of a partition of The relevant results of [24] show that this is the easiest when is a rational number. In that case, has a tractable representation in terms of Meijer functions. So, this applies to, in particular, for every where are co-prime positive integers. We look at perhaps the most remarkable case, in the forthcoming section 4.5.
4.4 Representations of the -diversity, and interval partitions, and fixed point equations
The sampling scheme above shows that the number of blocks of a , say, of partition of satisfies
(4.9) |
where is the number of blocks in a partition of . Hence, by standard results for exchangeable partitions, see for instance [43], as , for the size-biased re-arrangement of and from [42], Expressed in other terms, as
(4.10) |
as varies. The result (4.10), and its ranked version involving can be interpreted as extensions of descriptions in [3, Theorems 6,7, Propositions 10,11], see also [43, Chapter 9], to the present setting with general in place of for the sequence of paired lengths and local times of and intervals. Hence the next results may be thought of in those terms.
Proposition 4.2.
Recall the (pointwise) identity from size-biased sampling for with, for each
(4.11) |
where is independent of the independent collection with each , and is independent of the independent collection with each See [14, 25, 46].
Applying the operator, in the case of the setting established in [41], to and for yields mass partitions and Proposition 4.2 and special cases of (4.11), with leads to the following fixed point equations in the sense of [1, 18, 27], for certain generalized Mittag-Leffler variables, which we believe are new,
Proposition 4.3.
For consider the identity in (4.11) for the case where let and denote iid collections of variables having distribution equivalent to and respectively, and, independent of other variables, let and denote iid collections of variables with each component having distribution and , respectively. Then, for each , there are the following fixed point equations,
-
(i)
for and equivalent in distribution to the -diversity of
and, hence,
-
(ii)
For and equivalent in distribution to the -diversity of
and, hence,
4.5 Fragmentation of a Brownian excursion partition conditioned on its local time
Following Pitman [42, Section 8] and [43, Section 4.5, p.90], let denote the ranked excursion lengths of a standard Brownian motion , with corresponding local time at up till time given by Then, it follows that has a distribution. Furthermore, with respect to we describe the special explicit case of the Gibbs partitions (EPPF) of in terms of Hermite functions as derived in [42], see also [43, Section 4.5], as
(4.12) |
where, for a confluent hypergeometric function of the secondkind (see [35, p.263]),
is a Hermite function of index That is, to say
(4.13) |
Proposition 4.4.
Suppose that Then, for
with corresponding expressed in terms of a mixture of Hermite functions,
(4.14) |
Proof.
Remark 4.4.
Now recall from [42, Proposition 14], equivalently [2, Corollary 5], that the size biased re-arrangement of has a version with the explicit representation for each jointly and pointwise,
(4.15) |
for , where are independent and identically distributed variables like for a standard Gaussian variable. Now, in view of the results and discussions in Sections 4.3 and 4.4, we easily obtain the following interesting result.
Proposition 4.5.
Let denote the number of blocks in a partition of as can otherwise be generated according to the scheme in Section 4.3, by using a fragmentation of a partition of with blocks is as otherwise described in Corollary 4.1 with corresponding -diversity Then, for as in (4.15),
Furthermore, as
jointly for where is equivalent in distribution to that of , and
4.6 for the Mittag-Leffler class
We now present results for an application of the operator to the most basic case of the Mittag-Leffler class as described in [25], see also [28]. Recall that for the Laplace transform of equates with the Mittag-Leffler function, see for instance [20], expressed as,
Let denote a standard Poisson process, where and consider the mixed Poisson process Then, as shown in [25], for corresponds in distribution to
(4.16) |
that is, to say a stable Poisson-Kingman distribution with index and
5 Duals in the generalized Mittag-Leffler class
In Section 4.6, we examined the fragmentation of the Mittag-Leffler variable, having distribution equivalent to The extension to more generalized Mittag-Leffler classes could proceed as in [25], by directly conditioning on as for fixed Here we show that the corresponding joint distribution can arise directly from which, has not been previously studied. In order to describe this, we first recall the Laplace transform of the local time at variable, or -diversity, say with density as given in [28, Section 3], and [25, Section 4.1],
(5.1) |
where is the Lamperti variable studied in [28], and
(5.2) |
and from [25, Proposition 4.4] there is the density
It follows that for such that,
(5.3) |
and, hence, in this case,
Setting it follows for the variables and defined according to coag/frag duality in [8, 41], that the distribution of which can be expressed
equivalently,
It follows that
(5.4) |
Hence a direct application of Proposition 3.1 leads to identification of all relevant laws as described in the next result.
6 Coagulation and fragmentation of generalized gamma models
For any let denote a generalized gamma subordinator specified by its Laplace transform The generalized gamma subordinator and corresponding mass partitions, and bridges defined by normalization, as described in [42], arises in numerous contexts. However, for purpose of this exposition, the reader may refer to its role in the construction of distributions as described in [46, Proposition 21]. More generally, similar to the Mittag-Leffler class, let denote a mixed Poisson process. Then, for and for the distribution of corresponds to the laws , where
(6.1) |
for , as described in [25, 30, 31]. Here, we show how to use Proposition 3.1 to easily identify laws and explicit constructions of (dependent) random measures leading to a coag/frag duality in the case of and also show how one may recover the Poisson-Dirichlet coag/frag duality results of [41], based on independent and distributions, in the case of using , and the general case of using Results for general using Proposition 3.1, are also manageable but require too many additional details for the present exposition.
Now, for a fixed value define the scaled subordinator , for such that
has density and one may form a normalized general gamma bridge, for as
(6.2) |
where with and equates to its inverse local time at Using this and Proposition 3.1, for we obtain versions of in this case as follows; for
that is, with inverse local time at , with density Hence, and has a marginal distribution, corresponding to
Hence, jointly and component-wise, and is determined by the coagulation
where, for clarity,
Conversely, is equivalent in distribution to Now, following [46, Proposition 21], for , it follows that for , and
where also recovering the coag/frag duality in [41] for
Remark 6.1.
See [29] for an earlier, less refined, treatment of these results which requires considerably more effort.
6.1 Results for size biased generalized gamma
In order to recover the duality for the entire range of , we now work with the size biased law of a generalized gamma density. Suppose that , where
(6.3) |
and, now
with density , is the corresponding inverse local time at Since this case and a derivation for is not well known, we apply Proposition 3.1 to identify all the relevant distributions in the next result.
Proposition 6.1.
Consider the variables and forming the coagulation and fragmentation operations as described in Proposition 3.1, where , and thus Then,
-
(i)
with
-
(ii)
-
(iii)
-
(iv)
jointly and component-wise.
-
(v)
-
(vi)
for
-
(vii)
independent of
Proof.
The results follow from a straightforward application of Proposition 3.1 using the distributional representation of , and the appropriate Gamma randomization to obtain independent laws. The generalized gamma subordinator representation of and Poisson Dirichlet distributional identities can be found in [25, 30, 31]. The independence of the Poisson-Dirichlet laws is due to [46, Proposition 21, see p. 877] and beta-gamma algebra, see also [25, Proposition 2.1]. ∎
has distribution
with inverse local time at equivalent in distribution to
Remark 6.2.
If independent of , it is evident that
The of the Gibbs partition of in Theorem 4.1 may be alternatively expressed in terms of Fox- functions [37] as
The above expression follows by noting the Fox- representations for and followed by applying [11, Theorem 4.1]. Otherwise details are similar to the arguments in [24].
L.F. James was supported in part by grants RGC-GRF 16301521, 16300217 and 601712 of the Research Grants Council (RGC) of the Hong Kong SAR.
References
- [1] Aldous, D. J. and Bandyopadhyay, A. (2005). A survey of max-type recursive distributional equations. Ann. Appl. Probab. 15 1047–1110.
- [2] Aldous, D. and Pitman, J. (1998). The standard additive coalescent Ann. Probab. 26 1703–1726.
- [3] Aldous, D. and Pitman, J. (2006). Two recursive decompositions of Brownian bridge related to the asymptotics of random mappings. In memoriam Paul-André Meyer: Séminaire de Probabilités XXXIX, 269–303, Lecture Notes in Math., 1874, Springer, Berlin.
- [4] Bacallado, S., Battiston, M., Favaro, S. and Trippa, L. (2017). Sufficientness postulates for Gibbs-type priors and hierarchical generalizations. Statist. Sci. 32 487–500.
- [5] Basdevant, A. L. (2006). Ruelleś probability cascades seen as a fragmentation process. Markov Process and Related Fields 12 447–474.
- [6] Bertoin, J. (2006). Random Fragmentation and Coagulation Processes, Cambridge University Press.
- [7] Bertoin, J. and Le Gall, J.-F. (2000). The Bolthausen-Sznitman coalescent and the genealogy of continuous-state branching processes. Probab. Theory Related Fields 117 249–266.
- [8] Bertoin, J. and Pitman, J. (2000). Two coalescents derived from the ranges of stable subordinators. Electron. J. of Probab. 5 1–17.
- [9] Bolthausen, E. and Sznitman, A. S. (1998). On Ruelleś probability cascades and an abstract cavity method. Comm. Math. Phys. 197(2):247(276).
- [10] Caron, F. and Fox, E. B. (2017). Sparse graphs using exchangeable random measures. J. R. Stat. Soc. Ser. B. Stat. Methodol. 79 1–44.
- [11] Carter, B. D. and Springer, M. D. (1977). The distribution of products, quotients and powers of independent -function variates. SIAM J. Appl. Math. 33 542–558.
- [12] Curien, N. and Haas, B. (2013). The stable trees are nested. Probab. Theory Related Fields 157 847–883.
- [13] De Blasi, P., Favaro, S., Lijoi, A., Mena, R., Prünster, I. and Ruggiero, M. (2015). Are Gibbs-type priors the most natural generalization of the Dirichlet Process? IEEE Trans. Pattern Anal. Mach. Intell. 37 212–229.
- [14] Dong, R., Goldschmidt, C. and Martin, J. (2006). Coagulation-fragmentation duality, Poisson-Dirichlet distributions and random recursive trees. Ann. Appl. Probab. 16 1733–1750.
- [15] Feng, S. (2010). The Poisson-Dirichlet Distribution and Related Topics: Models and Asymptotic Behaviors. Springer, Berlin, Heidelberg.
- [16] Gasthaus, J. and Teh, Y.W.. (2010). Improvements to the sequence memoizer. Advances in Neural Information Processing Systems 23.
- [17] Gnedin, A. and Pitman, J. (2006). Exchangeable Gibbs partitions and Stirling triangles. J. Math. Sci. 138 5674–5685.
- [18] Gnedin, A. and Yakubovich, Y. (2011). Recursive partition structures. Ann. Probab. 34 2203–2218.
- [19] Goldschmidt, C. and Haas, B. (2015). A line-breaking construction of the stable trees. Electron. J. Probab. 20 1–24.
- [20] Gorenflo, R., Kilbas, A. A., Mainardi, F. and Rogosin, S. V. (2014). Mittag-Leffler Functions, Related Topics and Applications, Springer, Berlin.
- [21] Griffiths, R. C. and Spanò, D. (2007). Record indices and age-ordered frequencies in exchangeable Gibbs partitions. Electron. J. Probab. 12 1101–1130.
- [22] Haas, B., Miermont, G., Pitman, J. and Winkel, M. (2008). Continuum tree asymptotics of discrete fragmentations and applications to phylogenetic models. Ann. Probab. 36 1790–1837.
- [23] Haas, B., Pitman, J. and Winkel, M. (2009). Spinal partitions and invariance under re-rooting of continuum random trees. Ann. Probab. 37 1381–1411.
- [24] Ho, M.-W., James, L. F. and Lau, J. W. (2007). Gibbs partitions (EPPF’s) derived from a stable subordinator are Fox and Meijer transforms. arXiv:0708.0619 [math.PR].
- [25] Ho, M.-W., James, L. F. and Lau, J. W. (2021). Gibbs partitions, Riemann-Liouville fractional operators, Mittag-Leffler functions, and fragmentations derived from stable subordinators. J. Appl. Probab. 58 314–334.
- [26] Ishwaran, H. and James. L. F. (2001). Gibbs sampling methods for stick-breaking priors. J. Amer. Statist. Assoc. 96 161–173.
- [27] Iksanov, A. M. (2004). Elementary fixed points of the BRW smoothing transforms with infinite number of summands. Stochastic Processes and their Applications 114 27–50.
- [28] James, L. F. (2010). Lamperti type laws. Ann. Appl. Probab. 20 1303–1340.
- [29] James, L. F. (2010). Coag-Frag duality for a class of stable Poisson-Kingman mixtures. Unpublished manuscript. arXiv:1008.2420 [math.PR].
- [30] James, L. F. (2019). Stick-breaking Pitman-Yor processes given the species sampling size. arXiv:1908.07186 [math.ST].
- [31] James,L. F. (2022). Single-Block Recursive Poisson–Dirichlet Fragmentations of Normalized Generalized Gamma Processes. Mathematics 10(4):561.
- [32] James, L. F, Lee, J., and Ross, N. (2022). Hierarchical network models for data with granularity via fragmentation and coagulation. Manuscript in preparation.
- [33] Janson, S. (2006). Limit theorems for triangular urn schemes. Probab. Theory Related Fields 134 417–452.
- [34] Janson, S., Kuba, M. and Panholzer, A. (2011). Generalized Stirling permutations, families of increasing trees and urn models. J. Combin. Theory, Ser. A 118 94–114.
- [35] Lebedev, N. N. (1972). Special Functions and Their Applications, Dover Publications, Incorporated, New York.
- [36] Lomeli, M., Favaro, S. and Teh, Y. W. (2017). A marginal sampler for -stable Poisson-Kingman mixture models. J. Comput. Graph. Statist. 26 44–53.
- [37] Mathai, A. M., Saxena, R. K. and Haubold, H. J. (2010). The -Function. Theory and Applications, Springer, New York.
- [38] Peköz, E., Röllin, A. and Ross, N. (2017). Joint degree distributions of preferential attachment random graphs. Adv. Appl. Probab. 49 368–387.
- [39] Perman, M., Pitman, J. and Yor, M. (1992). Size-biased sampling of Poisson point processes and excursions. Probab. Theory Related Fields 92 21–39.
- [40] Pitman, J. (1996). Some developments of the Blackwell-MacQueen urn scheme. In Statistics, Probability and Game Theory. IMS Lecture Notes Monogr. Ser. 30 245–267, Inst. Math. Statist., Hayward, CA.
- [41] Pitman, J. (1999). Coalescents with multiple collisions. Ann. Probab. 27 1870–1902.
- [42] Pitman, J. (2003). Poisson-Kingman partitions. In Statistics and Science: a Festschrift for Terry Speed. IMS Lecture Notes Monogr. Ser. 40 1–34, Inst. Math. Statist., Beachwood, OH.
- [43] Pitman, J. (2006). Combinatorial Stochastic Processes. Lectures from the 32nd Summer School on Probability Theory held in Saint-Flour, July 7–24, 2002. With a foreword by Jean Picard. Lecture Notes in Mathematics 1875, Springer-Verlag, Berlin.
- [44] Pitman, J. and Yakubovich, Y. (2018). Ordered and size-biased frequencies in GEM and Gibbs models for species sampling. Ann. Appl. Probab. 28 1793–1820.
- [45] Pitman, J. and Yor, M. (1992). Arcsine laws and interval partitions derived from a stable subordinator. Proc. London Math. Soc. 65 326–356.
- [46] Pitman, J. and Yor, M. (1997). The two-parameter Poisson-Dirichlet distribution derived from a stable subordinator. Ann. Probab. 25 855–900.
- [47] Rembart, F. and Winkel, M. (2018). Recursive construction of continuum random trees. Ann. Probab. 46 2715–2748.
- [48] Wood, F., Gasthaus, J., Archambeau, C., James, L. F. and Teh, Y. W. (2011). The Sequence Memoizer. Communications of the ACM (Research Highlights) 54 91–98.