Categorification of characteristic structures
Abstract.
We develop a representation theory of categories as a means to explore characteristic structures in algebra. Characteristic structures play a critical role in isomorphism testing of groups and algebras, and their construction and description often rely on specific knowledge of the parent object and its automorphisms. In many cases, questions of reproducibility and comparison arise. Here we present a categorical framework that addresses these questions. We prove that every characteristic structure is the image of a functor equipped with a natural transformation. This shifts the local description in the parent object to a global one in the ambient category. Through constructions in representation theory, such as tensor products, we can combine characteristic structure across multiple categories. Our results are constructive, stated in the language of a constructive type theory, which facilitates implementations in theorem checkers.
1. Introduction
The problem of deciding when two algebraic structures are isomorphic is fundamental to algebra and computer science. It encompasses issues of decidability and complexity, and it tests the limits of our theories and algorithms. An initial tactic in deciding isomorphism is to identify substructures that are invariant under isomorphisms because doing so reduces the search space. We first discuss groups, where the literature is most developed (see, for example, \citelist[ELGO2002][BOW][Maglione2021][Wilson:filters]), but our results apply to monoids, loops, rings, and non-associative algebras.
A subgroup of a group is characteristic if for every automorphism ; it is fully invariant if for every homomorphism . We use the language of categories, following [Riehl], and a type of natural transformation to describe our main results (details are given in Section 5.1).
Definition 1.1.
Let be a category, and let be a subcategory with inclusion functor . A counital is a natural transformation for some functor . The class of all such counitals is denoted . For an object of , the -component of is the morphism in .
A special case of our results, for the category of groups, can be stated as follows.
Theorem 1.
For the category of groups and subcategory of groups and their isomorphisms, the following equalities of sets hold:
. |
Theorem 1 contrasts a “recognizable” description of characteristic (fully invariant) subgroups with a “constructive” one. For a fixed group , the sets on the left are of the form , where is the appropriate logical predicate that allows us to recognize when a subgroup belongs to the set; those on the right are of the form , where allows us to construct members of the subset by applying a function. Also, the descriptions on the left are “local” since they reference just a single parent group, whereas those on the right are “global” since they apply to the ambient categories.
The characterization of characteristic subgroups by natural transformations allows one to recast the lattice theory of characteristic subgroups into the globular compositions of natural transformations as explored in [Baez, Power]. We now explore other implications of Theorem 1.
1.1. Constraining isomorphism by characteristic subgroups
Characteristic subgroups constrain isomorphisms in the following sense:
Fact 1.2.
If is a characteristic subgroup of , and are isomorphisms, then .
It is therefore useful for an isomorphism test to locate characteristic subgroups of a group : every hypothetical isomorphism from to must then assign such a subgroup to a unique corresponding subgroup of . This raises at least two issues. First, if the task is to construct isomorphisms, then we should assume that is not yet known. How then do we verify that is characteristic? Is there an alternative definition of the characteristic property that does not directly reference ? A second issue is how to determine the possible when we know only that is characteristic in . For familiar characteristic subgroups such as the center this is possible because the definition is already global to all groups. Hence, a hypothetical isomorphism must satisfy , and typically and can be constructed without explicit knowledge of or . However, the following family of examples, first explored by Rottlaender [Rottlander28], exhibits groups whose characteristic subgroups have no known global definition, so it is difficult to utilize Fact 1.2.
Example 1.3.
Let be a prime and a positive integer. Let be a prime and denote by the field with elements. Let , with , be diagonalizable with eigenvalues , each different from , satisfying the following property. If there exists with for all , then for . For , this requires .
The cyclic group of order acts on the vector space via . The condition on means that each eigenspace in is a characteristic subgroup of the semidirect product determined by , and exactly of the order subgroups of are characteristic. Two such groups and may be isomorphic even if the eigenvalues of and are different. For example, this occurs when for some coprime to . Thus, the correspondence between characteristic subgroups of and is not a priori clear.
One of the goals of this work is to reinterpret the definition of a characteristic subgroup in a way that is independent of automorphisms and which is unambiguously defined for all groups. We do this by formulating the characteristic condition on the entire category of groups, thereby providing a categorification of the property of being characteristic. Moreover, our formulation pairs well with—and indeed is motivated by—the necessities of computation (see Section 1.3). To address this, we employ methods from theorem-checking, specifically type-theoretic techniques [Hindley-Seldin, Pierce:types, HoTT]; these have recently become accessible through systems such as Agda [Agda], Coq [Coq], and Lean [lean].
1.2. A local-to-global problem
Our approach is to transform the local characteristic property of subgroups into an equivalent global property of the category of all groups and their isomorphisms. Calculations now take place within the category instead of within individual groups, which opens up new ways to search for characteristic subgroups. Our approach also facilitates an a priori verification of the global characteristic property, rather than the usual a posteriori check that requires knowledge of automorphisms. The process is analogous to proving that is characteristic without employing specific properties of . Our methods extend to every characteristic subgroup, even those discovered via bespoke calculations.
The traditional model of a category involves both objects and morphisms. By sometimes focusing only on morphisms, we work with categories as an algebraic structure with a partial binary associative product on —given by composition of its morphisms—and an object in . It is partial because not every pair of morphisms is composable, in which case the product is undefined. This perspective yields a general algebraic framework for our computations.
The morphisms of a category can act on the morphisms of another category either on the left or the right. Although several interpretations of “category action” appear in the literature \citelist [Bergner-Hackney]*§2 [nlab:action] [FS]*1.271–274, there is no single established meaning. Let , , and be categories. A left -action on is a partial-function, where is defined for some morphisms of and of , that satisfies two conditions inspired by group actions. The first is that , whenever defined, for all morphisms of and of . The second is that . To simplify notation, we write . As in the theory of bimodules of rings, an -biaction on is a left -action and a right -action on such that for every morphism in , in , and in ,
whenever both sides of the equation are defined. Suppose there are -biactions on categories and . An -morphism is a partial-function, which we denote by , such that
whenever is defined for morphisms in , in , and in .
We write to indicate that is a subcategory of , and denote the identity functor of by . A counit is a counital of the form . The following specialization of one of our principal results to groups describes how characteristic subgroups relate to counits and morphisms of category biactions.
Theorem 2.
Let be a group and with inclusion . There exist categories and , where , such that the following are equivalent.
-
(1)
is characteristic in .
-
(2)
There is a functor and a counit such that .
-
(3)
There is an -morphism such that .
We emphasize that the category in Theorem 2 need not be a subcategory of ; see Section 8 for an example. Moreover, our results apply to characteristic substructures of eastern algebras, which include monoids, loops, rings, and non-associative algebras. This generalization (Theorem 2-cat) and its dual version (2-dual) are proved in Section 6. We conclude this section with an example that illustrates how natural transformations arise from characteristic substructures.
Example 1.4.
The derived subgroup of a group determines the inclusion homomorphism and a functor mapping groups to their derived subgroup and mapping homomorphisms to their restriction onto the derived subgroups. For every group homomorphism , observe that , so is a natural transformation.
The center of determines the inclusion homomorphism . To define a functor with object map , we must restrict the type of homomorphisms between groups since homomorphisms need not map centers to centers. (Consider, for example, an embedding .) Since every isomorphism maps center to center, we restrict to , defining a functor mapping and mapping each homomorphism to its restriction. If is the inclusion functor, then is a natural transformation.
1.3. Applications to computation
Part of the motivation for our work comes from computational challenges that arise in contemporary isomorphism tests in algebra. One of these is to develop new ways to discover characteristic subgroups. Standard constructions—such as the commutator subgroup, the center, and the Fitting subgroup—can be applied to any group. However, these subgroups often contribute little to resolving isomorphism. Many ideas have been introduced to search for new structures; see, for example, \citelist[BOW][ELGO2002][Maglione2021]. Often these involve very detailed computations with individual groups, and their application is ad hoc. Indeed, a primary motivation for this study is to systematize the disparate techniques currently used to search for characteristic subgroups.
Theorem 2 provides the framework for a systematic search for characteristic subgroups. Indeed, an -morphism generalizes the familiar and much studied category theory notion of adjoint functor pairs. We show in Section 4.6 that category actions offer a flexible way to implement the behavior of natural transformations in a computer algebra system. To exploit the full power of the categorical interpretation of characteristic subgroups, we work in a suitably general algebraic framework that allows a seamless transfer of information from one category to another. The familiar examples from Sections 7 and 8 demonstrate how to identify characteristic structure in a category and transfer it back to groups.
A second challenge concerns reproducibility and comparison of characteristic subgroups. Algorithms to decide isomorphism often, as a first step, generate a list of characteristic subgroups in a given group. For example, we could extract such a list for the family described in Example 1.3. An immediate question is: if we rerun this step for a different group, do we obtain the same list of corresponding characteristic subgroups (Fact 1.2)? It is not always clear that we do. For instance, some characteristic subgroup constructions employ randomization or make labelling choices that could change from one run to the next. Such shortcomings can compromise the utility of characteristic subgroup lists in deciding isomorphism.
Our proposed solution is to develop algorithms that return the natural transformation (or a morphism of biactions) from Theorem 2 instead of the characteristic subgroup itself. This will allow us, in principle, to extend the reach of a specific characteristic subgroup of a given group to an entire category, in much the same way that the commutator subgroup and center behave. The natural transformation can then be applied to a group to produce a characteristic subgroup that corresponds to in the sense of Fact 1.2: every isomorphism necessarily maps to , so allowing a meaningful comparison of characteristic subgroups. The precise circumstances under which such extensions are possible are specified in Theorem 5.4.
A third challenge is verifiability: in a computer algebra system, subgroups are often given by monomorphisms which are defined on a given set of group generators. The construction of such a monomorphism usually invokes computations that prove the claimed properties (such as homomorphism, monic, characteristic image, and so on). We present our work in a framework that combines these computations, data, and proofs, by employing an intuitionistic Martin-Löf type theory; such a model also allows machine verification of proofs. In this setting, if a computer algebra system returns a counital , then this counital comes with a “type” that certifies that each morphism of yields a characteristic substructure.
1.4. Structure of this paper
In Section 2, we discuss the required background for our foundations (type theory). In Section 3, we introduce eastern algebras (essentially algebraic structures) and show how they can be viewed as abstract categories.
Section 4 studies category actions. In particular, we define capsules (category modules) and describe a computational model for natural transformations as category bimorphisms (Proposition 4.10). This also allows us to describe counitals (Theorem 4.11) and adjoint functor pairs (Theorem 4.13) in the language of bicapsules and bimorphisms.
In Section 5, we explain how characteristic structures can be described by counitals. The functors involved in this construction are defined on categories with one object, but Theorem 5.4—which we call the Extension Theorem—allows us to extend these functors to larger categories. This theorem is the essential ingredient for proving our main results. We also generalize Theorem 1 to eastern algebras (Theorem 1-cat).
In Section 6, we generalize Theorem 2 to eastern algebras (Theorem 2-cat). We show that characteristic substructures can be described as certain counits, and as bimorphism actions on capsules. We also prove the dual version of this result for characteristic quotients (Theorem 2-dual).
In Section 7, we use our framework to provide categorical descriptions of common characteristic subgroups, including verbal and marginal subgroups.
In Section 8, we describe a cross-category translation of counitals and explain, in categorical terms, how a counital for a category of groups can be constructed from a counital for a category of algebras.
Table 1 summarizes notation used throughout the paper.
Symbol | Description |
---|---|
Eastern variety | |
Abstract categories or categories that act | |
Capsules | |
Bicapsules | |
Identity morphism of type | |
Identity morphisms of | |
Morphisms between categories | |
Inclusion functors | |
Capsule morphisms | |
Functions | |
Functions | |
Signature | |
Type of -eastern algebras | |
The void type | |
The type | |
If is defined, then | |
Computable equality of functions | |
Source and target of a morphism | |
Guards for a category action | |
, , | The iso-, epi-, and mono-morphisms of (resp.) |
2. Type theory and certifying characteristic structure
To certify that a subgroup of a group , with inclusion , is characteristic, we must verify that
(2.1) |
At face value, this a posteriori check requires knowledge of . To provide a certificate of being characteristic, we instead develop a constructive version of our main results using type theory language. Specifically, we use an intuitionistic Martin-Löf type theory (MLTT), a model of computation capable of expressing aspects of proofs that can be machine verified. In an MLTT, (2.1) can be expressed as
this notation we explain below. An advantage of this approach is that certificate data can be verified by practical type-checkers. An MLTT employs the “propositions as types” paradigm (Curry–Howard Correspondence), where types correspond to propositions and terms are programs that correspond to proofs. The remainder of this section is a concise treatment of type theory from \citelist [Hindley-Seldin]*Chapters 10–13 [HoTT]*Chapter 3.
2.1. Types
Informally, types annotate data by signalling which syntax rules apply to the data. We write and say “ is a term of type ” or “ inhabits ”. For example, signals that can only be used as a natural number. A type is inhabited if there exists at least one term and uninhabited if no term of type exists. The void type has no inhabitants by definition. Deciding whether a type is inhabited or not is computationally undecidable [Hindley-Seldin]*pp. 66–67. Therefore in computational settings types are permitted to be neither inhabited nor uninhabited. Type annotations enable us to use symbols according to their logical purpose; for example, is analogous to , but type theories do not have the axioms of set theory.
Types are introduced from two sources. First, there is a context that defines a priori the types that we need: for example, . Next, there are type-builders that construct new types from existing ones. We use both and to denote the type of functions, and set and . If is a natural number, then an inhabitant of type can be interpreted as an -tuple with each , or alternatively as a function . There is a unique function (akin to the uniqueness of a function ), so is a type with a single inhabitant—it is not void.
The notation together with projection maps is used for Cartesian products, and together with inclusion maps is used for disjoint unions. (The tradition in type theory is to use instead of , but this conflicts with algebraic uses of .)
2.2. Propositions as types
In set theory, propositions are part of the existing foundations. In type theory, propositions co-evolve with the theory as special types. A proposition in logic is associated to a type . (Only in this section do we distinguish propositions in logic from propositions as types with the notation .) If the type is inhabited by data , then the term is regarded as a proof that is true. For example, an implication (here means “implies”) can be proved by means of a function , where and are the respective types associated with and , because it suffices to assume and derive a proof of . Likewise, if we assume that there is a term and apply the function , then it produces a term .
In classical logic, it is only the existence of a proof for a proposition that is relevant. Analogously, in type theory, is a mere proposition, written , if it has at most one inhabitant.
Consider the function . Now is expressed by terms of type ; and is expressed by terms of type (technically, the truncation of that type [HoTT]*§3.7). The negation of a proposition is , which accords with functions of type . For additional details, see [Hindley-Seldin]*Chapters 12–13 or [HoTT]*Chapter 3.
Martin-Löf developed a notion of equality that imitates the Leibniz Law [Feldman]:
where runs over all predicates of a single variable . Thus, for every type and terms , we define an auxiliary type (where terms are proofs that ) with the rule that, given a function , there is a function
(2.2) |
2.3. Subtypes and inclusion functions
Sets are a special case of types: we write for a type if the type is a mere proposition for all . Let be a type. If , then
is the subtype of defined by . We also write this as . Terms of type have the form for and , where is a proof that is inhabited. We sometimes use set theory notation to improve readability when describing a subtype. For more details, see [HoTT]*§3.5. For a typed function , the image is shorthand for .
Subtypes have an associated inclusion function where . A subtlety is that if with inclusion map , then the composition is injective but does not show directly that . A term of type , with , has the form , which differs from those of type . A small modification addresses the fact that the relation is not strictly transitive. Define a subtype , where , and inclusion . Now construct a map given by
where and . Thus, , and the composition is equivalent to . Hence, is transitive up to this equivalence.
2.4. Partial-functions
For a type , we define
A partial-function is a function is a term, , of type . It is defined at if there is with , where is the inclusion.
Given partial-functions , the notion of equality as “” for every is too strict. We impose that condition only for those for which both and are defined. This motivates a notion of “directional equality”, where having one side defined implies that the other is also defined; only now do we decide whether the results are equal. Freyd and Scedrov [FS]*1.12 introduced the following venturi-tube relation on partial-functions.
Definition 2.1.
Given , and , we write
For we write if for all , and we write if and .
Note that is a pre-order (reflexive and transitive). If we compare with (extensional) function equality
then implies that . In classical logic (with the law of the excluded middle) the converse also holds because one can, by fiat, declare that is defined or undefined at . In some computational models such a separation is non-constructive, so computing may not halt. Thus, we retain the notation.
2.5. Certifying that the trivial group is characteristic
As an illustration, we present a type verifying the characteristic property of the trivial subgroup. Let be a group with identity . Let be the subtype of representing the trivial subgroup. Recall that terms of have the form , where and , and there is a map , . If , then , and, by (2.2), for every there is an invertible function of type
(2.3) |
The latter function depends on and , but we suppress this dependency to simplify the exposition. Let be a proof that fixes . Using (2.3), we define the term
that takes as input and produces . Therefore we obtain the term
which certifies that is characteristic in ; compare to (2.1). Recall that in MLTT, types correspond to propositions, and terms are programs that correspond to proofs. Thus, the term is not an exhaustive tuple listing , but a program (function) that takes as input and , and produces and .
3. Essentially algebraic structures
To interpret characteristic structure as computable categorical information, we treat categories as algebraic structures. (Computational categories should not be confused with categorical semantics of computation.) For our purpose, it suffices to use operations that may only be partially defined, so categories are important examples, as are monoids, groups, groupoids, rings, and non-associative algebras. We give an abridged account and refer to \citelist[Cohn]*§II.2 [AR1994:categories]*Chapter 3 for details.
3.1. Operators, grammars, and signatures
Informally, a grammar is a description of rules for formulas.
Definition 3.1.
An operator is a symbol with a grammar, which we describe using the Backus–Naur Form (BNF) [Pierce:types]*p. 24. The valence of an operator , written , is the number of parameters in its grammar. A set of operators is a signature.
Example 3.2.
A signature for additive formulas specifies three operators:
<Add> ::= (<Add> + <Add>) | 0 | (-<Add>) |
The bivalent addition depends on terms to the left and right; zero () depends on nothing; and univalent negation () is followed by a term.
It is easy to reject since it is not meaningful. However, we might write intending ; the BNF grammar <Add> accepts only the latter.
The purpose of the signature is to formulate important algebraic concepts such as homomorphisms. To declare that a function is a homomorphism between additive groups, we use the signature of Example 3.2 as follows:
3.2. Algebraic structures
An algebra is a single type with a signature [Cohn]*§II.2.
Definition 3.3.
An algebraic structure with signature is a type and a function , where and . A homomorphism of algebraic structures and , each having signature , is a function such that, for every and ,
As in Section 2.2, we extend these propositions to types as follows:
Terms of type are -algebras.
For example, consider the additive group signature from Example 3.2. The underlying structure of an additive group can be described by a type (set) together with assignments of the operators in Add such as (<Add> + <Add>) to .
3.3. Free algebras and formulas
We now extend signatures to include variables that allow us to work with formulas.
Definition 3.4.
Let be a signature and let be a type whose terms are variables. The free -algebra in variables , denoted by , is the type of every formula in constructed using the operators in .
Example 3.5.
Fact 3.6.
Let be an -algebra and , where is a type whose terms are variables. There is a unique homomorphism that satisfies .
Consequently, we write for formulas and .
Remark 3.7.
The construction in Fact 3.6 is categorical in nature, and we use it in Section 7 to construct characteristic subgroups. The category of -algebras has objects of type together with homomorphisms. The pair of functors (given only by their object maps)
forms an adjoint functor pair between the categories of types and -algebras; see Section 4.5 for related discussion.
3.4. Laws and varieties
Let be an -algebra and let be a type for variables. We now describe the variety of algebraic structures whose operators satisfy a list of laws such as the axioms of a group. A law is a term of type . We index laws by a type , so they are terms of type and are written . We say that is in the variety for the laws if
Example 3.8.
The signature for groups is the following:
The variety of groups uses three laws, indexed by with variables . If , then is
so and . Hence, associativity is imposed on the -algebra by requiring a term (“proof”) of type
Encoding and as additional laws gives a complete description of the variety of groups. Laws need not be algebraically independent: for example, and are often also encoded.
For clarity, henceforth we write laws as propositions. For example, we write rather than terms of a mere proposition type.
3.5. Eastern algebras
We cannot always compose a pair of morphisms in a category: composition may be a partial-function. Hence, the morphisms need not form an algebraic structure under composition. We address this limitation by identifying precisely when the operators yield partial-functions.
Example 3.9.
The type of every function is given as
Technically, to quantify over all types, we shift to a larger universe ; see Remark 3.21. For , define
The condition guards against composing non-composable functions. (A helpful mnemonic for is “What enters must match what exits ”.) This yields the composition signature:
Note that , and similarly .
The composition signature defined in Example 3.9 is used throughout. Motivated by it, we make the following general definition.
Definition 3.10.
For a signature , operator , variables , type , and formulas , the partial-function is -guarded if
(3.1) |
The formulas are the rails of . If , then the rails are trivial, so is everywhere defined and is total.
We define a type whose terms are pairs , where is a proof of (3.1). Fix a tuple of variables . Define the type whose terms are the rails for the operators in . Observe that the rails are everywhere defined.
We now define essentially algebraic structures, which we call eastern algebras; see [AR1994:categories]*§3.D.
Definition 3.11.
For a signature , an -eastern algebra is a type and an assignment of operators to -guarded partial-functions. Formally, -eastern algebras are terms of the type
Every algebraic structure defines an eastern algebra by using trivial rails for each operator. In the next example, we observe that categories are eastern algebras. Recall that a category has objects of type , morphisms of type , and a composition operation .
Example 3.12 (Categories as eastern algebras).
Let be a category with object type . Form the type of all morphisms of :
(3.2) |
For objects , there is an inclusion map (see Section 2.1)
Thus, for each , there exist unique and such that . The type is an eastern algebra with the composition signature from Example 3.9, which is realized as follows. For , with and ,
As before, guards against composing incompatible morphisms.
3.6. Abstract categories
We consider categories as described in [FS]*1.11; their type differs from those of Example 3.12.
Definition 3.13.
Let be the composition signature of Example 3.9. An abstract category is an -eastern algebra satisfying the following laws in variables :
We sometimes refer to the operators and as guards.
A useful subtype of an abstract category is the type of identities:
Lemma 3.14.
The following hold in every abstract category.
-
(a)
The guards are idempotent, namely
-
(b)
Terms and satisfy
Proof.
For a term in an abstract category,
A similar argument shows , so (a) holds.
For (b), suppose is another term and . Now is an equality: . Since ,
Hence, , and the other formula follows similarly. ∎
Proposition 3.15.
Proof.
If in , then Similarly, . Since the operators and have trivial rails, both of the equations and are everywhere defined.
Observe that is defined and equals ; also is defined and equals . For , the expression is defined whenever , and is defined whenever . Since is idempotent by Lemma 3.14(a), both and are defined when . Thus, implies
so . A similar argument holds for .
Lastly, composition is associative everywhere it is defined, so . ∎
Example 3.16.
Let be an abstract category with and additional morphisms , where and . Using the signature of Example 3.9, is an eastern algebra with multiplication defined in Table 2 where every instance of is omitted. It is not easy to discern structure from this table, so two additional visualizations of are given in Figure 1. The first is the Cayley graph of the multiplication with undefined products omitted. The second is the Peirce decomposition, which we now discuss.
3.7. Peirce decomposition of abstract categories
Treating categories as algebraic structures allows us to frame aspects of category theory in algebraic terms. Our goal is an elementary representation theory of categories. In particular, we seek matrix-like structures—known as Peirce decompositions in ring theory—for abstract categories.
One can recover from an abstract category notions of objects and morphisms by considering the identities . Using the laws in Definition 3.13,
In algebraic terms, the subtype is a type of pairwise orthogonal idempotents. For subtypes and of , define
Fact 3.17.
If , then ; we write simply .
Given , we define three subtypes:
These subtypes appear in Figure 2 in the left, middle, and right images, respectively. If for , then .
If , then , and so on, from which we deduce the following.
Proposition 3.18.
If is an abstract category, then , , and induce invertible functions denoted by “” of the following types:
Proposition 3.18, which we use to prove Theorem 5.4, allows us to draw upon intuition from matrix algebras. The morphisms of a category appear in its multiplication table, as in Table 2. Products of morphisms and slices are defined, as with matrix products, only when the inner indices agree. In this model, can be visualized as the identity matrix, where the entries on the diagonal are the individual identities . In Figure 1(b), that product is represented in a matrix-like form respecting the conditions of the Peirce decomposition.
Remark 3.19.
While there are differences between the types for categories and abstract categories, every theorem stated in one setting translates to a corresponding theorem in the other. More precisely, the translation is a model-theoretic definable interpretation [Marker:models]*§1.4: there is a prescribed formula that translates every theorem and its proof between the two theories. Example 3.16 shows how the model of categories with both objects and morphisms may be interpreted as definable types in the theory of categories with only morphisms (abstract categories). Conversely, if is an abstract category, then we obtain a category with object type as follows. For objects , we define
where the identity morphisms of are . To compose morphisms with for objects , we define
Hence, we no longer distinguish between categories and abstract categories.
3.8. Eastern algebras as categories
Example 3.12 shows that a category is an eastern algebra. We now show that a variety of eastern algebras forms a category.
Definition 3.20.
Fix a signature . Let , where and . A morphism from to is a partial-function such that for every and every ,
The type of morphisms from to is
The object type and the morphism type form the category of -eastern algebras. In particular, the -morphisms of , namely the type
can be viewed as an abstract category (see Proposition 3.15). Therefore, forms an eastern algebra with the composition signature.
We call a subcategory of eastern algebras an eastern variety (with respect to signature and laws ) if it is full and its objects are those eastern -algebras satisfying the laws . We reserve to denote an eastern variety.
Remark 3.21.
Regarding categories as eastern algebras could lead to a paradox of Russell type. The paradox is avoided either by limiting -types to forbid some quantifications [Tucker] or by creating an increasing tower of universe types and pushing the larger categories into the next universe [HoTT]*§9.9. Both resolutions allow us to define categories and eastern algebras computationally.
Under the correspondence of Remark 3.19, morphisms between abstract categories are precisely functors between categories. This translation serves two of our goals. The first is an elementary representation theory for categories: by regarding categories as “monoids with partial-operators”, we mimic monoid actions. The second is to treat a category as a single data type with operations defined on it. This is considerably easier to implement as a computer program. Indeed, both GAP [GAP4] and Magma [magma] are designed for such algebras. There are benefits to the usual description of categories, but the translation to abstract categories is essential for our approach to computing with and within categories.
Next, we prove a generalization of Noether’s Isomorphism Theorem.
Theorem 3.22.
Let be a morphism between eastern algebras. There exist eastern algebras and , an epimorphism , a monomorphism , and an isomorphism such that the following diagram commutes.
Proof.
Let be a signature, and let be an -eastern algebra. If every operation in is total, then we apply Noether’s Isomorphism Theorem [Cohn]*Theorem II.3.7. It is clear that the existence of the stated algebras and morphisms is constructive.
Otherwise, at least one operation is a partial-function. We define a new eastern algebra where all operations are total. Let be a formal symbol, disjoint from . Define a new type with inclusion function . By abuse of notation, we also apply to tuples over . Define a new signature , obtained from by including as a constant. We use trivial rails for every operator, and define via
Now every operation in is total, so the Isomorphism Theorem applies. Since every homomorphism of -eastern algebras fixes constants, the statement follows. ∎
3.9. Subobjects and images
We close with a list of facts about eastern varieties, which we use heavily in Section 5. We first define a pre-order that enables abbreviation of compositions of multiple morphisms. To motivate this, assume is a morphism of eastern algebras. Theorem 3.22 states there exists such that . We denote this by and make the following more general definition. For morphisms ,
(3.3) | ||||
(3.4) |
Two monomorphisms are equivalent if and . Similarly, epimorphisms are equivalent if and .
Lemma 3.23.
Let be an eastern variety. For morphisms , if , then .
Proof.
By Theorem 3.22, there exist isomorphisms such that
By the universal property of coimages, there exists a unique morphism such that . Therefore,
Since is an epimorphism, . ∎
Lemma 3.24.
Let be an eastern variety. For morphisms , if , then . If is also monic, then .
Proof.
The first claim follows from the universal property of images, so we assume is monic. By Theorem 3.22, there exists an isomorphism such that
Since is monic and , by the universal property of images, there exists a morphism such that . ∎
Eastern varieties have a coproduct [Riehl]*p. 81 given by the free product [Riehl]*p. 183. An example concerning groups is given in [Riehl]*Corollary 4.5.7. We list some facts concerning coproducts in eastern varieties.
Fact 3.25.
Let be a type. In an eastern variety , the following hold for all and .
-
(a)
There exists a coproduct morphism and morphisms satisfying for each .
-
(b)
If is uninhabited, then is the identity on the free algebra on the empty set. In particular, is the unique morphism inhabiting .
-
(c)
If such that for all , then .
-
(d)
If with for all , then .
-
(e)
If , then .
Finally, if is monomorphism satisfying , for some identity , then can be regarded as a subobject of the object associated to . Given a collection of such monomorphisms, consider the smallest subobject containing all set-wise images of the . The coproduct allows us to effectively “glue” together all of the monomorphisms, but the result is not a monomorphism. To obtain a monomorphism, we take the image of the coproduct, namely
(3.5) |
4. Category actions, capsules, and counits
Theorem 2 asserts that characteristic subgroups arise from categories acting on other categories.
4.1. Category actions
Our formulation of category actions generalizes the familiar notion for groups and also actions of monoids and groupoids [MonoidsAC]*§I.4. The technical aspects of the definition concern the additional guards, denoted , needed to express where products are defined. Their use is similar to the guards used for abstract categories (see Definition 3.13).
Definition 4.1.
Let be an abstract category with guards denoted by and
. Let be a type. A left category action of
on consists of a type , functions
and , and a
partial-function that satisfies the following
rules:
(1)
()
()
;
(2)
(
()
; and
(3)
()
()
.
Given a left action of on a type , a partial-function is an -morphism if whenever and with .
Right category actions are similarly defined. We unpack the symbolic expressions in Definition 4.1. Condition (1) states that the functions and serve as guards for the partial-function : namely, (1) characterizes precisely when is defined. The first part of Condition (2) asserts that respects the identity of ; the second part states that identity morphisms of act as identities. Condition (3) is the familiar group action axiom in the setting of partial-functions.
For subtypes and , we write
From Definition 4.1, an -morphism is always defined on ; hence, we do not need guards for -morphisms.
Definition 4.2.
The category action of on is full if defines a bijection from to .
Note that the category action of on is full if and only if for every there exists an such that .
Since we identify categories and abstract categories (Remark 3.19), we say that a category acts on a type if its morphisms act on .
Example 4.3.
Let be a category with object type and morphism type . Set . Define via and define via . Let be the partial-function defined by
This defines a full left action of on . A full right action is defined similarly.
Remark 4.4.
Let be a category and let . The definition of category action in [FS]*1.271–1.274 is similar to ours, but it requires and and for every and . Thus, for and , both and are defined only when ; this is too restrictive for our purposes.
4.2. Capsules
As identified in Section 1.2, we focus on the action of one category on another category ; we call these “category modules” capsules. Note the change in notation from to to emphasize this setting. In this case, already has a candidate type for , namely . Furthermore, because a category has its own operation of composition, the action by respects composition. For example, given a group homomorphism , we get an action that satisfies .
Definition 4.5.
A category is a left -capsule if there is a full left
-action on with such
that the following hold:
(a)
;
(b)
.
A right -capsule is similarly defined. We present our results below for left -capsules, but they can be formulated for both.
Much of our intuition on actions draws on familiar themes in representation theory. A reader may be assisted by translating “-capsule” to “-module” and considering the matching statement for modules. We write to indicate the presence of a left -capsule action on .
From now on, if a category acts on itself, then we assume it is by the (left) regular action, where is given by composition in . Moreover, a category action on another category is implicitly understood to be on the morphisms. We now show that capsules arise from morphisms between categories.
Proposition 4.6.
A category is a left -capsule of a category if, and only if, there is a morphism such that . Furthermore, the morphism is unique.
The following lemma proves one direction of Proposition 4.6.
Lemma 4.7.
Every morphism of categories makes a left -capsule, where for each and , the guard is defined by and the action is defined by .
Proof.
Condition (1) of Definition 4.1 is satisfied by the defined action.
For the first part of Condition (2), let . Since is a morphism and is everywhere defined, . Hence, by Lemma 3.14(a),
For the second part of Condition (2), let and with , so by definition. Thus,
so for every and .
For Condition (3), let and with and , so is defined and . We need to show that . Since is a morphism,
Hence, implies . Thus, is defined. Also, implies , so
It follows that is defined. Since is a morphism,
and therefore for every and .
To see that the action is full, consider and define . By the laws of an abstract category, . Finally, , so is a left -capsule. ∎
Our proof of the reverse direction of Proposition 4.6 uses the following result.
Lemma 4.8.
Let be a left -capsule. For every , there is a unique such that is the unique term of type .
Proof.
Since the action is full, for each there exists such that , so is defined. Since is a left -capsule, , so
Hence, is defined and has type . Suppose and , so that . Furthermore,
Thus, , and there is exactly one term with type . ∎
Under the assumptions of Lemma 4.8, we simplify notation and identify with its unique term.
Proof of Proposition 4.6.
By Lemma 4.7, it remains to prove the forward direction and uniqueness. Suppose that is a left -capsule. By Lemma 4.8, for each there is a unique such that is defined. Since is a left -capsule and is an identity,
Thus, is also defined. Put . If , then is defined whenever
Hence, is also defined in . Because is an identity, . Since is a left -capsule, . Hence, it remains to prove that is a morphism of categories.
For , by the action laws
Thus, . But is a left -capsule, so Fact 3.17 implies that
(4.1) |
Hence, . By Lemma 3.14(b) and (4.1) for all ,
Similarly, . Hence, is a morphism.
Lastly, we prove uniqueness of . Suppose there exists such that for every and whenever . Since , it follows that , so . ∎
If is a subcategory of with inclusion , then the (left) regular action of on is defined to be the action given by . In other words, the regular action of on is given by for and . By Lemma 4.7, each regular action defines a capsule. With regular actions we sometimes omit the “”.
4.3. Category biactions and cyclic bicapsules
We now define the concepts appearing in Theorem 2(3).
Definition 4.9.
Let and be categories and let and be types.
-
(a)
An -biaction on is a left -action on and a right -action on such that for every , , and . Hence, writing is unambiguous. If, in addition, is a left -capsule and right -capsule, then is an -bicapsule.
-
(b)
Suppose there are -biactions on and . An -morphism is a partial-function such that , whenever , , with and .
We sometimes write for an -biaction on for clarity. Notice that an -morphism must be defined on . As with capsule morphisms, we do not need to establish guards. We abbreviate -bicapsule to -bicapsule, -morphism to -bimorphism, and -biaction to -biaction. Note that -bimorphisms are defined everywhere. Just as ring homomorphisms are not always linear maps, morphisms of capsules need not be morphisms of categories—morphisms of capsules do not in general send identities to identities.
Motivated by Proposition 4.6, we show that bicapsules provide a computationally useful perspective to record natural transformations of functors. If are functors and is a natural transformation, then, using Remark 3.19, the natural transformation property written with guards is
for every morphism in .
Proposition 4.10.
In the following statements, the category is also regarded as an -bicapsule via its regular action.
-
(a)
For every natural transformation between functors , the assignment
makes into an -bicapsule, and the assignment defines an -bimorphism .
-
(b)
Conversely, for every -bimorphism , the assignments
define functors , and the assignment
defines a natural transformation .
Proof.
We summarize the conclusion in Proposition 4.10(b), namely for every , by writing . While consists of many terms, and produce unique values, so plays a role similar to multiplying by . Since is an -bimorphism, for , which shows that is determined by . We write
(4.2) |
The bicapsule in (4.2) is the cyclic -bicapsule determined by .
4.4. Units and counits
Given a category and functor , a unit is a natural transformation , and a counit is a natural transformation . We will prove that units and counits are responsible for all characteristic structure. It therefore makes sense to translate these into capsule actions. We show that a unit is characterized as an -bimorphism and a counit by an -morphism . As the relationship is dual, and we emphasize substructures instead of quotients, we state and prove this relationship only for counits.
Theorem 4.11.
Let and be categories.
-
(a)
If both and are -bicapsules and is an -morphism, then and define functors and , and is a counit .
-
(b)
If and are functors and is a counit, then and are -bicapsules, where and for and . Also, is an -morphism such that for all .
Proof.
-
(a)
By Proposition 4.6, the maps and define functors where and , for and . Put . For ,
and
Hence, for all , so is a natural transformation.
We show that yields an -morphism . First, if and with , then
Next, if such that , then
Finally, consider . Since functors map identities to identities, we deduce that
4.5. Adjoint functor pairs
Adjoint functor pairs are an important special case of natural transformations. We give one of many equivalent definitions [Riehl]*§4.1.
Definition 4.12.
Let and be categories. An adjoint functor pair is a pair of functors and with the following property. For every object in and in , there is an invertible function
that is natural in the following sense: if and for objects in and in then, for every ,
(4.3) |
We say that is left-adjoint to and is right-adjoint to and write this as .
We now characterize adjoint functor pairs in terms of bicapsules. A reader may find it useful to review the translation between categories and abstract categories in Remark 3.19. The invertibility of in Definition 4.12 is equivalent to a pseudo-inverse property of morphisms of bicapsules.
For types and , partial-functions and are pseudo-inverses if, for and , and .
Theorem 4.13.
Let and be categories.
-
(a)
If and are -bicapsules and and are -morphisms that are pseudo-inverses, then where
and for and the bijections and are given by
-
(b)
If is an adjoint functor pair, then and are -bicapsules with actions defined by
for and and yields a pair of -morphisms and that are pseudo-inverses where
Proof.
First we prove (a). Since and are -bicapsules, by Proposition 4.6 there are functors and defining the right -capsule and the left -capsule respectively. Since and are pseudo-inverses and capsule actions are full, inverts on and inverts on . For objects of and of , let and . For (see Remark 3.19), we define . Therefore, for , the map inverts , so the result follows.
Now we prove (b). By Proposition 4.6, we can exchange functors for capsules, so affords a right -capsule . We enrich this action by adding the left regular action by to produce an -bicapsule . We do likewise with producing a second -capsule .
To encode , we define an -bimorphism by for . This defines on
For all other values, is undefined. Now (4.3) shows that on with , , and ,
so is an -bimorphism. We define analogously: if , then (for suitable subscripts of ), and otherwise is undefined. Therefore, for and ,
4.6. A computational model for natural transformations
We use the algebraic perspective of Section 3.6 to discuss briefly a model for computing with natural transformations. The next definition formalizes how to treat morphisms of a category as functors between two other categories.
Definition 4.14.
Let , and be abstract categories. A
natural map of from to consists of
functions and
that satisfy the
following properties:
(1)
;
(2)
and ;
(3)
;
(4)
.
The use of in (1) and (4) depends only on and , respectively, being defined. For the composition signature from Example 3.9, the first two conditions of Definition 4.14 imply that there is a function given by where is the type of morphisms between abstract categories; see also Definition 3.20. This function enables us to treat the objects of as functors from to . As illustrated in Example 4.15, conditions (3) and (4) are equivalent to the commutative diagrams in Figure 3 in the shaded and entries, respectively.

Example 4.15.
We illustrate how the four conditions of
Definition 4.14 translate to categories with objects and
morphisms. Let and be two such categories, and let
be the composition signature from Example 3.9. Then
is the type functors from to
. Let be the category whose objects are the functors in
and whose morphisms are natural
transformations. Let be a natural
transformation between . Treating as an abstract category,
the guards are defined as follows: and . Define by ,
and by
. Now the conditions of
Definition 4.14 become:
(1)
;
(2)
and ;
(3)
;
(4)
.
The theory of functors and natural transformations is equivalent to that of natural maps on abstract categories, but the latter allows us to use multiple encodings of functors and natural transformations such as those available in computer algebra systems. If, for example, we compute the derived subgroup of a group in Magma, then the system may use an encoding for that differs from that supplied for . In such cases, Magma also returns an inclusion homomorphism .
5. The Extension Theorem
One of our goals is a categorification of characteristic subgroups and their analogues in eastern algebras. We start by translating the characteristic condition into the language of natural transformations.
5.1. Natural transformations express characteristic subgroups
Suppose that is a characteristic subgroup of a group . Hence, every automorphism restricts to an automorphism of . In categorical terms, we now treat as the subcategory of consisting of a single object and all isomorphisms . Likewise, we treat as a subcategory of . The restriction defines a functor . Of course, and are also groups and is a group homomorphism, but the discussion below justifies the functor language.
Now we use the fact that is a subgroup of (by using the inclusion map ). That is a subgroup of can be expressed as Recognizing the different categories, we use the inclusion functors and to deduce the following:
Thus, a characteristic subgroup determines a natural transformation
The next definition generalizes Definition 1.1.
Definition 5.1.
Fix an eastern variety with subcategories and and inclusion functors and . A counital is a natural transformation for some functor . The counital is monic if is a monomorphism for all objects in .
A common way to illustrate categories, functors, and natural transformations uses a 2-dimensional diagram where categories are vertices, functors are directed edges, and natural transformations are oriented 2-cells. The next diagram illustrates the counital discussed above.
We now generalize the notion of a characteristic subgroup to an arbitrary eastern algebra. Let be a subcategory of an eastern variety . For , let be the category with a single object and morphisms , so . Observe that is a full subcategory of .
Definition 5.2.
Let , and let and be inclusions. A monomorphism is -invariant if there is a functor and a monic counital such that is equivalent to (see Section 3.9 for the definition of equivalence).
Using the language of Definition 5.2, a characteristic subgroup of a group determines and is determined by a -invariant monomorphism . For fully invariant subgroups, the corresponding monomorphism is -invariant.
5.2. The extension problem and representation theory
In Section 5.1, we observed that a characteristic subgroup of determines a functor and a natural transformation , where and are categories with one object, namely and respectively. If a group is isomorphic to , then by Fact 1.2 has a characteristic subgroup corresponding to . It seems plausible that we may be able to extend the functor to more groups and, hence, to larger categories. We now make this notion of extension precise and generalize it to the setting of eastern algebras.
Fix an eastern variety . Let , , and be subcategories of where . We have inclusion functors , , and , where . Suppose that is a monic counital as depicted in Figure 4(a). The extension problem asks whether there is a functor and a natural transformation such that
(5.1) |
for some invertible morphism for all objects of . This is depicted in Figure 4(b).
For now, we are concerned only with the existence and construction of such extensions. For use within an isomorphism test, it will be necessary to develop tools to compute efficiently with categories; the data types of Section 4.6 are designed for that purpose.
In light of Proposition 4.10, we can explore the natural transformations from Figure 4 through the lens of actions. Recall that concatenation always denotes regular actions. The natural transformation defined above is encoded as an -bimorphism , where , and this bimorphism defines a cyclic -bicapsule via (4.2), which we fix throughout.
Our goal in part is to extend the cyclic -bicapsule to a cyclic -bicapsule . Specifically, we will define , where is depicted in Figure 4(b). This is the content of Theorem 5.4, but given in the general setting of eastern algebras. By construction (Proposition 4.10), the left actions on and are regular; hence, we focus on right actions.
Example 5.3.
For the purposes of illustration, we consider a familiar construction that is similar to our context, namely Frobenius reciprocity and Morita condensation [Rowen, Theorem 25A.19]. Here is a ring, and and are subrings. Considering a Peirce decomposition of , let and be idempotents in such that . Then is a (non-unital) subring of , and is a subring of both and . Furthermore, is an -bimodule and is a -bimodule. Suppose is a right -module and a right -module. The theory of induction and restriction provides us, respectively, with a right -module and a right -module: namely,
Thus, yields a map . If, for example, , then .
Guided by the Peirce decomposition from Example 5.3, we seek similar constructions for categories and capsules. Recall that contains a subcategory that contains a subcategory . This containment implies that is contained in (rather embeds under the inclusion functors into) . The bicapsule action of on induces a -bicapsule, denoted . By mimicking modules, we can consider a formal extension process. We form the type whose terms are pairs, denoted for and , subject to the equivalence relation . Then we equip this type with the right -action . Defining , we write
We return to this construction in Section 8. Finally, since and are both subtypes of , the product in defines a map that factors through . The image of the map is a cyclic -bicapsule embedded in , with corresponding -bimorphism . The following theorem states that this is always possible if is full in . Table 3 summarizes some of the notation fixed throughout this section.
eastern variety | |
subcategories of | |
full subcategory of |
bimorphism | monic counital | cyclic bicapsule |
---|---|---|
Theorem 5.4 (Extension).
Let be as in Table 3. If is full in , then there is a cyclic -bicapsule on and unique cyclic -bicapsules , on such that
We briefly describe the idea of the proof. We start with a cyclic -bicapsule with associated -bimorphism . We seek an extension of to a -bimorphism that satisfies , where is the inclusion functor. If this holds, then, for every and every with ,
We now derive some necessary conditions for a putative of this type. Recall from (3.3) that the notation for morphisms and implies that there is a morphism such that . Applying Lemma 3.24 to yields . For define
The left -actions on and are regular, as is the left -action on . Hence, for ,
Therefore, , so is defined. Since holds for every , we can make a single inclusion (see (3.5)):
(5.2) |
Observe that (5.2) also holds if, instead of , we assume that there exists such that for all ; here denotes the restriction of to . This motivates us to choose such that is defined as the left hand side of (5.2), and then solve for a suitable .
5.3. Building blocks
Lemma 5.5.
Let and be as in Table 3. For , the following are equivalent.
-
(1)
There is a -bicapsule on such that the function given by is a -bimorphism.
-
(2)
For all , .
Proof.
We assume (1) holds and prove (2). By Proposition 4.10(b), there exists a unique functor that induces the action of on the right of . Since is a function and a -bimorphism by assumption, for all , and
Thus, (2) holds.
We now assume (2) holds and prove (1). First, we show that an satisfying is unique. Suppose satisfies , so . Since is a monomorphism, . We denote this unique morphism by . Since is defined for all ,
Next, we define a right -capsule structure on as follows. Let be given by , and let be given by . For all and , let , which is defined if, and only if, . Condition (2) of Definition 4.1 follows from and for all since is monic. Lastly, let with . Then and, since we have a regular left action,
Since is a monomorphism, . Hence, this defines a right -capsule on since for all . Since acts regularly on on the left, there exists a -bicapsule on by Proposition 4.6, with the regular left and right actions just defined. Finally, we prove that is a -bimorphism. For all ,
provided . If , then
Lemma 5.6.
Proof.
Proposition 5.7.
Let be as in Table 3. If is full in , then defined by
is a -bimorphism, and there exists a unique such that for all ,
Proof.
Take , and recall that acts regularly on both the left and right of . Since is full in , these actions are full, so
Since the left actions of on and on and the left action of on are regular, for each and with ,
(5.5) |
Fix . Set , so . Thus, since the -action on is full,
(5.6) |
where acts on in the final expression. Therefore,
For the application of Lemma 3.24 in the last step, recall that is monic for by our assumption (Table 3).
We establish the other direction as follows:
Acting with from the left, we obtain .
From both computations, there exist such that
It remains to show that for all and is unique. For all , is monic by the assumptions in Theorem 5.4, and is also monic by the definition of . Since is monic,
implies . Similarly, because is monic and
The uniqueness of follows since is monic. ∎
5.4. Proof of Theorem 5.4
Let be the -bimorphism in Proposition 5.7. This proposition shows that there exists a unique such that for all ,
(5.7) |
Since is an -bimorphism,
(5.8) |
Let be the -bicapsule on defined by the -bimorphism . Both and are bicapsules, so applying (5.7) to (5.8) yields
(5.9) |
Since the left -action on and the left -action on are regular, . Thus, . Applying this to (5.9) and using the monic property of , we deduce that
Since the actions are capsules, defines a natural transformation. By Proposition 4.10(a), the function defines an -bimorphism . Thus, , and therefore
The uniqueness of follows from Proposition 4.10(b) and the uniqueness of . ∎
5.5. Proof of Theorem 1 for eastern varieties
Recall that denotes the type of all counitals , where and are categories, is a functor, and and are inclusion functors. Let be the type of all unitals ; these are the duals of counitals. Recall from Theorem 3.22 that and produce categorical morphisms, and from Section 3.9 the equivalence relations on monomorphisms and epimorphisms. Our use of set theory notation in the following generalization of Theorem 1 is justified because we compare subsets of a fixed algebra.
Theorem 1-cat.
Let be an eastern variety. For every , the following equalities of sets hold up to equivalence:
(1) | ||||
(2) | ||||
(3) | ||||
(4) |
Proof.
We prove (1) in detail; the proof of (3) is analogous but requires replacing with . The proofs of (2) and (4) are dual to the proofs of (1) and (3), respectively.
Let , which is a morphism in . Recall that the single-object category consists of and all its automorphisms, and likewise for . Both are subcategories of , and full subcategories of . We denote the relevant inclusion functors by , , , and . As in Section 5.1, we obtain a natural transformation with (restriction) functor , so is a monic counital.
We now use Proposition 4.10 to pass to the associated cyclic -bicapsule . Recall that the left action is defined by , hence regular, and the right action is defined by . By construction, satisfies the conditions of Theorem 5.4 since is full in . We extend to a cyclic -bicapsule where is a monic counital extending , namely, there exists an isomorphism such that ; see (5.1). Since is the inclusion functor, there exists an isomorphism such that , so and are equivalent. Hence, and are equivalent. Since , this proves the “” part of (1).
For the converse, consider , say for some functor , subcategory , and inclusion . If , then , and so . Since , it follows that the morphism is characteristic, and therefore so is its monic image . This proves the “” part of (1). ∎
6. Categorification of characteristic substructure
The final step in our work is to describe the source of all characteristic subgroups, and more generally characteristic substructures in eastern algebras. In Section 5, we showed that characteristic structure arises naturally from counitals. Now we demonstrate that all counitals are derived from counits. In particular, in Section 6.3, we prove the following generalization of Theorem 2 to eastern algebras.
Theorem 2-cat.
Fix an eastern variety . Let be an object in with subobject and inclusion . There exist categories and , where , such that the following are equivalent.
-
(1)
is characteristic in .
-
(2)
There is a functor and a counit such that .
-
(3)
There is an -morphism such that .
Our proof relies on the Extension Theorem 5.4 and further consideration of counitals.
Definition 6.1.
Fix a category with subcategories and and inclusion functors and . A counital is isosceles if and , and flat if, in addition, and . Otherwise, it is scalene.
Example 6.2.
We mention three examples in and illustrate their natural transformations in Figure 5. The first two are the derived subgroup and the centre of a group considered in Example 1.4. For the third example, we consider an arbitrary characteristic subgroup of a group . As discussed in Section 5.2, define to be the category with one object and its morphisms are the automorphisms of . Hence, and are subcategories of with inclusion functors and , respectively. We define a functor by mapping to and automorphisms of to their restriction to , and so obtain a natural transformation .
In our study of characteristic structure we consider the three types of counitals. First, we use induced actions from Theorem 6.5 to pass from a scalene counital to one that is isosceles. Next, we work with isosceles counitals to determine an intermediate class of isosceles counitals known as internal counitals. Finally, we show that internal counitals are completely determined by a morphism of bicapsules.
Counits are common in many categorical contexts; for example, they occur for every adjoint functor pair. The case of flat counitals coincides precisely with the stricter class of fully-invariant substructures.
6.1. Composing counitals
In this section, we describe two ways to construct new counitals from given counitals by composing natural transformations and functors in different ways. These are two instances of a much larger theory; see [Baez, Power]. Figure 4 illustrates the usual composition of natural transformations. We now describe how to compose a functor with a natural transformation. Consider functors , , and for categories , , , and , and a natural transformation . Define by setting for each object in . Similarly, define by setting for each in . The effects of and are displayed in Figure 6.
The composition we describe next is specific to natural transformations of a particular form, which include counitals. It composes two natural transformations that share a functor and reflects our expectation that the characteristic relation is transitive. In , for example, given a counital describing a characteristic subgroup of , and a counital describing a characteristic subgroup of , we expect to have a counital that prescribes how is characteristic in .
To that end, suppose is a category of eastern algebras with subcategories , , and , and respective inclusions , , and . Suppose and are natural transformations. Define by
for all objects in , see Figure 7. This construction reflects the fact that being a characteristic substructure is a transitive property.
6.2. Categorifying isosceles counitals
All extensions used in our proof of Theorem 1-cat lead to isosceles counitals. Counits—namely, counitals in which is the identity functor—are one source of isosceles counitals. This hints at a way to characterize characteristic subgroups.
We now prove that all counitals arising from characteristic subgroups extend to isosceles counitals, thereby proving Theorem 2-cat. The most direct proof might utilize Kan lifts, the dual of the better known Kan extensions [Riehl]*Chapter 6, but we give a self-contained proof.
Let be an inclusion functor of categories and let be a functor. If is a natural transformation, then, for every object in , the morphism is a morphism in . We consider the special case when this morphism is also in .
Definition 6.3.
A natural transformation is internal if, for every object in , the morphism is a morphism in .
The property of being internal is strong. Take, for example, , so the morphisms are exclusively isomorphisms. If is internal, then is an isomorphism. Such an does not identify a new substructure. In other words, has too few morphisms for our purposes. By extending the types of morphisms, we prove in Proposition 6.4 that every monic isosceles counital lifts to an internal one, see Figure 8 for an illustration.
Proposition 6.4.
Let be a category with subcategory and inclusion . Suppose every object in is also an object in . Let be a monic isosceles counital with . There exists a category with inclusions and , a functor , and an internal monic isosceles counital such that , , and .
Proof.
We define a subcategory of as follows: its objects are the objects of ; its morphisms are given as finite compositions of morphisms , where is a morphism in , and morphisms , where an object in . Hence, we have inclusions and such that . Since both and have the same objects as , it follows that , , and are the identities on objects. Moreover, is the identity on morphisms.
We now construct a functor such that . It suffices to define on morphisms and then verify that is a functor. Set
If is well defined, then is a morphism in , and by construction. To verify that is well defined, it suffices to consider the case where (with an object in ) is also a morphism in : specifically, there is a morphism such that . Since is the identity on objects, . We will show that . To see this, we apply to the morphism and obtain the following diagram (see shaded entry of Figure 3).
Since , the diagram implies that . Since is monic by assumption, . This proves that is well defined.
We claim that there exists a natural transformation such that . Since the objects of are those of , we define to be and show that this yields the required counital. First, we consider the case that is a morphism in . Then , so
Now we assume for some object in . Since is the identity on objects and is the identity on morphisms,
Lastly, we consider the case of an arbitrary finite composition where each is either for some morphism in or a morphism for some object in . It suffices to consider only the case where , say with and . Now
Thus, . Since is monic, so is . Also, is a morphism in for every object , so it is internal, as claimed. ∎
We now prove that every characteristic substructure of an eastern algebra is induced by a morphism of category biactions.
Theorem 6.5.
Let be an object in a category of eastern algebras. Let be characteristic in with inclusion . There exist subcategories and with , and an -morphism such that .
Proof.
Let be the inclusion functor. The proof of Theorem 1-cat shows that there exists a functor and a monic counital such that . We use Proposition 6.4 (with ) to create a category generated from and , an inclusion functor , a functor , and an internal monic counital with for all objects in . Lastly, we apply Proposition 4.10(a) to to obtain an -bimorphism such that . Since is internal, there exists an -bimorphism such that . Hence, . With , it follows that , as claimed. ∎
6.3. Proofs of main theorems
Having developed the required theory, we can now complete the proofs of our main results. Theorem 1 is a special case of Theorem 1-cat, which we proved in the previous section.
6.4. Duality
Recall from Section 5.5 that a natural transformation is a unital if is an inclusion functor. If , then is a unit. A unital is epic if is epic for all objects . Units and unitals are the duals of counits and counitals.
We state a dual analogue of Theorem 2-cat for characteristic quotients in eastern algebras; its proof follows mutatis mutandis from that of Theorem 2-cat.
Theorem 2-dual.
Let be an eastern variety, and let be an object of with quotient and projection . There exist categories and , where , such that the following are equivalent.
-
(1)
is a characteristic quotient of .
-
(2)
There is a functor and a unit such that .
-
(3)
There is an -morphism such that .
Although a characteristic subgroup of a group is associated with a characteristic quotient of , and vice-versa, there are subtle differences in other categories of eastern algebras.
Example 6.6.
Let be the ring of rational numbers and its subring of integers. If is a homomorphism of unital rings, then . This forces , so is fully invariant in . Since is a field, its only quotients are itself and the trivial ring. Hence, has many fully-invariant substructures, but only two fully-invariant quotients.
In general, kernels of group homomorphisms are representable as subgroups (unlike ideals, which are not necessarily unital subrings). Conversely, every characteristic subgroup is normal and has an associated quotient. Formalizing these observations, we say that invariant structures of groups are self-dual up to equivalence of natural transformations in , see [HoTT]*pp. 59–61. The next proposition provides a categorical description of this observation for ; we use it in Section 7.
Proposition 6.7.
The following hold for categories and with inclusion functors and .
-
(a)
Given a unital , there is a category , with inclusion , and a functor such that is a counital where and is the inclusion for every group .
-
(b)
Given a counital , there is a category , with inclusion , and a functor such that is a unital where and for every group .
-
(c)
With the notation of (a) and (b), there are unique invertible such that and .
Proof.
-
(a)
For every morphism in , there is an induced morphism such that , so
Therefore . In particular, the restriction
is well defined. Let be the category whose objects are for all groups and whose morphisms are for all morphisms in . Let be the inclusion functor. Moreover, there is a functor given by and . If we define to be the associated inclusion map for the kernel, then is the required counital.
-
(b)
The proof is dual to that of (a).
-
(c)
Consider the unital . By Theorem 3.22, for each group there is an isomorphism
Thus, ; likewise, for and . ∎
7. Categorification of standard characteristic subgroups
Theorem 2 states that every characteristic subgroup can be studied in three ways: as a group, as a natural transformation, and as a morphism of category biactions. In this section, we describe common characteristic subgroups using all three forms. In so doing, we reveal insights gained from the categorical perspective.
Throughout, we use the following notation for restriction and induction. Let be a homomorphism of groups, and let and be subgroups of and , respectively. If the restriction of to maps into , then we denote it by
(7.1) |
Similarly, if maps a normal subgroup of into a normal subgroup of , then the induction of via is
(7.2) |
7.1. Abelianization and derived subgroups
Figure 9 gives the three perspectives on the derived subgroup. We develop this example, so that we may also treat the lower central series and all verbal subgroups in Section 7.2.
The counital of Example 6.2 associated with the derived subgroup of a group can be constructed also as the kernel of the unital associated with abelianization. We explore the category biaction interpretation. Let be the category of abelian groups, a subcategory of with inclusion . We define a morphism given by . The functors and turn the categories and into -bicapsules.
We show that is a -morphism. Let and be group homomorphisms, and let be a homomorphism of abelian groups. Now
To obtain the counital associated with the derived subgroup, we apply Proposition 6.7 and take the kernel of . Since the unital-counital pair obtained through this process is a unit-counit pair, we obtain the well-known observation that the derived subgroup is fully invariant.

7.2. Verbal subgroups
We generalize the approach taken in Section 7.1. Let be the group signature from Example 3.8. To each set of words from the free group we associate a category as follows (see Section 3.3). For each word , group , and -tuple , define by . Define to be the full subcategory of with objects
with inclusion functor . The category is the group variety with laws . Let be the minimal normal subgroup of a group such that is in . Let be the functor such that is the largest quotient of contained in , where the functor carries to , and morphisms are sent to .
Proposition 7.1.
The functors and form an adjoint functor pair: .
Proof.
The adjoint functor pair in Proposition 7.1 categorifies verbal subgroups. The dual version of Theorem 4.11 describes how to obtain the unit from . Applying Proposition 6.7, the kernel of yields a counit for some functor . If is a group, then is the -verbal subgroup. We conclude that all verbal subgroups are fully invariant. Thus, from Proposition 7.1, we get an exact sequence of natural transformations
The corresponding diagram appears in Figure 10.

7.3. Marginal subgroups
Now we consider characteristic subgroups such as the center of a group . As seen in Example 1.4, there are group homomorphisms for which , so, unlike verbal subgroups, the center is not fully invariant. This fact is revealed by the categorification of the center—it does not yield a counit between functors , but rather a proper counital between functors of the form , where is the category of groups whose morphisms are epimorphisms. We establish this fact more generally for the class of marginal subgroups introduced by P. Hall [Hall40].
Example 7.2 (Hall’s Isoclinism).
For an integer we write for the -fold direct product of a group . The commutator map is given by . We define a congruence relation on and write if and only if for all . Factoring through this congruence relation and restricting the outputs to the verbal subgroups, we obtain a map such that the following diagram commutes.
Two groups are isoclinic if their commutator maps are equivalent.
For each group and each word , there is a unique minimal normal subgroup such that the map given by
is non-degenerate: namely, fixing any entries of the -tuple argument of yields an injective map . Here is as defined in Section 7.2.
For a set of words, the associated marginal subgroup of a group is defined as . Clearly, is characteristic in . The image of , and thus also , is the verbal subgroup associated with , written .

Hall [Hall40] introduced the general notion of isologism for word-map equivalence. We extend this language to categories. Each word determines a category with maps as objects, where the morphisms are pairs of group epimorphisms such that the following diagram commutes.
We define two functors. The first is given by and . The second is given by and . For a group , let be the usual projection homomorphism. Now is a unit. Let be the inclusion functor. Then the unital is a categorification of marginal quotients.
To categorify the marginal subgroup, we take the kernel of via Proposition 6.7 and compose with : namely, for some functor . Figure 11 displays the various morphisms and their relationships. This construction demonstrates that marginal subgroups are not just characteristic, but invariant under all epimorphisms.
The construction applies to other algebraic structures by simply involving formulas in the appropriate signature. However, the notion of congruence does not always yield a substructure, so the structures are more naturally expressed as characteristic quotients.
8. Composite characteristic structures
We now address one remaining powerful feature of our categorical description of characteristic structure. It relates to a comment we made after Theorem 2: a characteristic subgroup may arise from -morphisms where is not a category of groups. We give one illustration of how this “transferability” explains techniques currently used in isomorphism tests.
In [Wilson:filters]*§4, it is shown that a -group of class at most with exponent has a characteristic subgroup induced by the Jacobson radical of an algebra associated to the bilinear commutator map of . Here we construct that characteristic subgroup using a tensor product of capsules, as described in Section 5.2.
8.1. From groups to bimaps
Fix an odd prime , and let be the category whose objects are -groups of class at most with exponent , and whose morphisms are isomorphisms. The objects of are groups with exponent and central derived subgroup, so .
Let be the field with elements, and let be the category of alternating -bilinear maps. The objects of are bilinear maps , where and are -spaces, such that for all vectors . For objects and in , a morphism is a pair of invertible linear maps such that, for all ,
Define a functor that takes a group to
and a homomorphism to the pair , as defined in (7.1) and (7.2). Since has exponent and by assumption, is an alternating -bilinear map.
Next, define a functor that takes an -bilinear map to the group on with binary operation
A morphism from to in induces a group isomorphism, denoted , mapping to by
Lemma 8.1.
The functor is a -morphism.
Proof.
The functor induces a left -action on (the morphisms of) , and induces a right -action on , so and are -bicapsules. Let be morphisms of and let be a morphism of such that is defined. Now
so is a -morphism. ∎
8.2. From bimaps to algebras
Let be the category of -matrix algebras with algebra isomorphisms. Using [Wilson:filters]*§4, define a functor by
Invertible morphisms in from to are sent to
Fact 8.2.
The functor is a -morphism.
8.3. From matrix algebras to semisimple algebras
Every matrix algebra over a field is Artinian, so the quotient of by its Jacobson radical is a semisimple algebra. The map is a functor from to the category of semisimple -algebras. It is also an -morphism.
8.4. Combining capsules
Recall that
Denote by the bicapsule associated to the -morphism in Lemma 8.1. Denote by and , respectively, the bicapsules associated to the - and -morphisms in Fact 8.2 and Section 8.3. These three capsules can now be combined to produce the -capsule
The resulting generator of this cyclic bicapsule is a unital. By Theorem 2-dual this provides the characteristic subgroup used in [Maglione:adjoints] and [Wilson:filters]*§4.
Acknowledgements
We thank Chris Liu for fruitful discussions and some proof-of-concept implementations. We thank John Power and Mima Stanojkovski for comments on a draft. Brooksbank was supported by NSF grant DMS-2319371. Maglione was supported by DFG grant VO 1248/4-1 (project number 373111162) and DFG-GRK 2297. O’Brien was supported by the Marsden Fund of New Zealand Grant 23-UOA-080 and by a Research Award of the Alexander von Humboldt Foundation. Wilson was supported by a Simons Foundation Grant identifier #636189 and by NSF grant DMS-2319370.
References
- \ProcessBibTeXEntry \ProcessBibTeXEntry \ProcessBibTeXEntry \ProcessBibTeXEntry \ProcessBibTeXEntry \ProcessBibTeXEntry \ProcessBibTeXEntry \ProcessBibTeXEntry \ProcessBibTeXEntry \ProcessBibTeXEntry \ProcessBibTeXEntry \ProcessBibTeXEntry \ProcessBibTeXEntry \ProcessBibTeXEntry \ProcessBibTeXEntry \ProcessBibTeXEntry \ProcessBibTeXEntry \ProcessBibTeXEntry \ProcessBibTeXEntry \ProcessBibTeXEntry \ProcessBibTeXEntry \ProcessBibTeXEntry \ProcessBibTeXEntry \ProcessBibTeXEntry \ProcessBibTeXEntry \ProcessBibTeXEntry \ProcessBibTeXEntry \ProcessBibTeXEntry \ProcessBibTeXEntry