This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

The view of TK-SVM on the phase hierarchy in the classical kagome Heisenberg antiferromagnet

Jonas Greitemann1,2, Ke Liu1,2,∗, Lode Pollet1,2,3 1 Arnold Sommerfeld Center for Theoretical Physics, University of Munich, Theresienstr. 37, 80333 München, Germany 2 Munich Center for Quantum Science and Technology (MCQST), Schellingstr. 4, 80799 München, Germany 3 Wilczek Quantum Center, School of Physics and Astronomy, Shanghai Jiao Tong University, Shanghai 200240, China ke.liu@lmu.de
Abstract

We illustrate how the tensorial kernel support vector machine (TK-SVM) can probe the hidden multipolar orders and emergent local constraint in the classical kagome Heisenberg antiferromagnet. We show that TK-SVM learns the finite-temperature phase diagram in an unsupervised way. Moreover, in virtue of its strong interpretability, it identifies the tensorial quadrupolar and octupolar orders, which define a biaxial D3hD_{3h} spin nematic, and the local constraint that underlies the selection of coplanar states. We then discuss the disorder hierarchy of the phases, which can be inferred from both the analytical order parameters and a SVM bias parameter. For completeness we mention that the machine also picks up the leading 3×3\sqrt{3}\times\sqrt{3} correlations in the dipolar channel at very low temperature, which are however weak compared to the quadrupolar and octupolar orders. Our work shows how TK-SVM can facilitate and speed up the analysis of classical frustrated magnets.

  • August 11, 2025

Keywords: machine learning, TK-SVM, multipolar order, classical spin liquid \ioptwocol

1 Introduction

The kagome Heisenberg antiferromagnet (KHAFM) is one of the most characteristic many-body systems where a simple Hamiltonian hosts strikingly rich physics. A kagome lattice is built from corner-shared triangular plaquettes, as illustrated in Figure 1. By placing Heisenberg spins on each lattice site and coupling the nearest-neighboring spins, one defines a kagome Heisenberg model, H=Jij𝐒i𝐒jH=J\sum_{\langle ij\rangle}\mathbf{S}_{i}\mathbf{S}_{j}. When the interaction is antiferromagnetic, J>0J>0, the system is highly geometrically frustrated and fails to find a unique ground state [1]. Such frustration has played a crucial role in the search for spin liquids and other exotic states of matter [2].

In the most general formulation of the problem one takes the interplay between geometric frustration, quantum fluctuations, and thermal fluctuations into account. However, it turns out that the thermodynamics of the classical KHAFM is already quite rewarding. In Ref. [3], Chalker et al. realized that there exists a hidden quadrupolar order characterized by a rank-22 tensorial order parameter. The emergence of this order is driven by an order-by-disorder phenomenon [4, 5], where coplanar states are entropically selected at low temperatures owing to the presence of soft modes [3]. Shortly afterwards, Reimers and Berlinsky [6] carried out a thorough investigation of excitations of the soft modes and the correlations of degenerate magnetic states. In spite of limited computational resources the authors showed, by classical Monte Carlo simulations, convincing signatures that the quadrupolar order indeed appears to be (quasi-)long ranged. At around the same time, Huse and Rutenberg [7] and Ritchey et al. [8] proposed that the physics in the selected plane might be effectively described by three-state Potts-like degrees of freedom, and the latter paper also discussed the associated topological defects which can support a generalized Berezinskii-Kosterlitz-Thouless (BKT) transition [9, 10, 11]. This is highly non-trivial, because the BKT transition is not realized in a typical two dimensional Heisenberg magnet as the fundamental group π1(SO(3)/SO(2))=0\pi_{1}\left(SO(3)/SO(2)\right)=0 is trivial [12, 13]. It, however, becomes possible with the effective Potts-like degrees of freedom, where the fundamental group is π1(O(3)/D3h)=D¯3\pi_{1}\left(O(3)/D_{3h}\right)=\bar{D}_{3} which gives rise to gapped topological defects [13], where D3hD_{3h} (D¯3\bar{D}_{3}) denotes the (binary) dihedral group. Although these results firmly evidenced that there is further structure in the selected coplanar states, it was only much later that Zhitomirsky pointed out that the structure is described by another hidden order, a rank-33 octupolar order [14, 15]. This author also showed that such order coexists with pinch points, which are usually seen in classical spin liquids such as spin ice [16, 17]. While the quadrupolar and octupolar orders are firmly established, they do not exhaust the rich hierarchy of phases in the classical KHAFM; specifically, the possibility of dipolar order is debated in the literature. Already in Ref. [7], the authors argued that fluctuations around the extensively degenerate KHAFM ground states interact in a non-linear way and will lead to unequally weighted Potts states. As a consequence, there may exist a critical point for a long-range antiferromagnetic 3×3\sqrt{3}\times\sqrt{3} order in the T0T\rightarrow 0 limit. This scenario was further developed by Henley with a self-consistent effective Hamiltonian approach [18]. A recent Monte Carlo simulation, equipped with an ingenious cluster update, by Chern and Moessner also suggests that the 3×3\sqrt{3}\times\sqrt{3} magnetization retains a finite, though remarkably small, value in the thermodynamical limit as T0T\rightarrow 0 [19].

Whereas establishing the T0T\rightarrow 0 magnetic order is very difficult because of the lack of a controlled theory [18] and efficient algorithms to simulate large system sizes from first principles at extremely low temperature [19, 20], a major challenge in understanding the hidden multipolar orders in the classical KHAFM is rooted in the complexity of those high-rank tensorial order parameters. It is therefore interesting to ask whether machine learning (ML), which is designed to analyze complex patterns, can assist in the analysis of such problems. This also concerns ML techniques as practical tools for physics. Although these techniques have proven useful for classifying phases and detecting phase transitions [21, 22, 23, 24, 25, 26, 27], representing quantum wave functions [28, 29, 30, 31, 32, 33], designing algorithms [34, 35, 36, 37], and predicting properties of materials [38, 39, 40, 41] (see Refs. [42, 43, 44] for recent reviews), the number of applications to intricate issues remains limited.

Refer to caption
Figure 1: A kagome lattice of Heisenberg spins coupled by antiferromagnetic interaction. At low temperature, spins in each trianglar plaquette obey the ground-state constraint 𝐒(1)+𝐒(2)+𝐒(3)=𝟎\mathbf{S}^{\rm(1)}+\mathbf{S}^{\rm(2)}+\mathbf{S}^{\rm(3)}=\mathbf{0}. By the mechanism known as order-by-disorder a hidden biaxial spin-nematic order characterized by rank-22 and rank-33 tensors can develop at even lower temperatures.

In the present paper, we apply our recently developed tensorial kernel support vector machine (TK-SVM) [45, 46, 47], which is an unsupervised and interpretable approach, to the classical KHAFM. We show that our machine readily picks up the hidden quadrupolar and octupolar orders and gives their tensorial order parameters in analytical form while its only inputs are real-space spin configurations. It identifies the ground-state constraint (GSC) and different temperature scales in the KHAFM. In addition, the machine also recognizes that the magnetic correlation of classical KHAFM at low-temperature is dominated by a 3×3\sqrt{3}\times\sqrt{3} structure.

The manuscript is organized as follows. In Section 2, we review the method of TK-SVM. In Section 3, the finite-temperature phase diagram of KHAFM is discussed. Section 4 is devoted to the multipolar order parameters and GSC. Section 5 discusses magnetic correlations. We conclude in Section 6.

Refer to caption
(a) single-spin cluster, rank-2
Refer to caption
(b) single-spin cluster, rank-3
Refer to caption
(c) three-spin triangular cluster, rank-2
Refer to caption
(d) three-spin triangular cluster, rank-3
Figure 2: Graphs corresponding to the multiclassification along four different TK-SVM setups (two cluster choices (a kagome unit cell and a single spin) combined with the two ranks, n=2n=2 and n=3n=3). Each vertex represents a different temperature T/JT/J. Edges connecting the vertices are weighted by the bias parameter ρ\rho in the decision function Eq. (1); see Section 2.2 for details. In order to facilitate the graphical representation of the one-dimensional parameter space, a random second coordinate is introduced, which offsets the temperatures vertically and which hence avoids overlaps of edges. The clustering of the vertices is described by the respective Fiedler vector in Figure 3.

2 Tensorial-kernel support vector machine

The tensorial kernel support vector machine (TK-SVM) is a numerical method to detect general symmetry-breaking orders [45, 46] and emergent local constraints [47, 48] in the problem of phase classifications; see also Ref. [49]. It inherits from support vector machines [50, 51], a well-known and successful classifying technique in machine learning, the property that it is interpretable: The decision function, which is the optimal classifier between two sets of data with a distinct property, can be shown to learn the square of order parameters when a quadratic kernel is used [21, 45, 46]. In TK-SVM the order parameter (or local constraint) can be any local tensor of a possibly high rank [45, 46].

Furthermore, the decision function also contains an offset known as the bias. Specifically for phase classification, the bias is sensitive to the presence of phase transitions or crossovers [46, 47]. It can be analyzed prior to and independent of the determination of the order parameters and allows one to perform an unsupervised graph partitioning of the phase diagram.

The main advantages of our approach are that the user does not need to devise suitable order parameters, which are typically very hard to construct for exotic states of matter, and that one gets near certainty about all phases with learned order parameters without supervision, resulting in enormous speedups for the analysis of an (unknown) phase diagram. Below we explain the most important concepts of TK-SVM.

2.1 TK-SVM decision function

The TK-SVM finds the optimal decision function

d(𝐱)=μνCμνϕμ(𝐱)ϕν(𝐱)ρ,d(\mathbf{x})=\sum_{\mu\nu}C_{\mu\nu}\phi_{\mu}(\mathbf{x})\phi_{\nu}(\mathbf{x})-\rho, (1)

in classifying two sets of data with a distinct property (e.g. a different order parameter or local constraint) in the sense of determining a maximal margin between the sets.

Here, 𝐱\mathbf{x} denotes a real-space configuration of NN spins, which serves as the training data,

𝐱={Si,a|a=x,y,z;i=1,2,,N}.\mathbf{x}=\{S_{i,a}|a=x,y,z;i=1,2,\dots,N\}. (2)

O(3)O(3) spins are of interest to us, but applications to XYXY and Ising spins are straightforward. It plays no role what algorithm has generated the samples 𝐱\mathbf{x}.

The power of SVMs lies in the usage of appropriate kernel methods. Samples are mapped by some transformation ϕ\phi onto a feature space where only the inner product of the transformed data needs to be known [51]. The conditions on ϕ\phi are very mild. Often, a linear separation in feature space is possible, which can correspond to a highly non-linear separation in physical space. In the TK-SVM, ϕμ(𝐱)\phi_{\mu}(\mathbf{x}) maps 𝐱\mathbf{x} to degree-nn monomial configurations,

ϕ(𝐱)={ϕμ}={Sa1α1Sanαncl},\boldsymbol{\phi}(\mathbf{x})=\{\phi_{\mu}\}=\{\langle S^{\alpha_{1}}_{a_{1}}\dots S^{\alpha_{n}}_{a_{n}}\rangle_{\rm cl}\}, (3)

where a1,,an=x,y,za_{1},\dots,a_{n}=x,y,z; α1,,αn=1,2,,r\alpha_{1},\dots,\alpha_{n}=1,2,\dots,r; μ=(α1,a1;;αn,an)\mu=(\alpha_{1},a_{1};\dots;\alpha_{n},a_{n}) is a collective index; cl\langle\dots\rangle_{\rm cl} denotes a lattice average up to clusters of rr spins. This mapping is established by the fact that local orientational orders can be generally represented by finite-rank tensors built from a finite number of vector fields [52, 53, 54]. For example, magnetic orders are defined as rank-11 tensors, and quadrupolar orders correspond to rank-22 tensors. Emergent local constraints, such as ground-state constraints for spin liquids, show up as relations between local tensors [47, 48]. The dimension of the ϕ\phi-space is (3r)n(3r)^{n}. However, as it contains a massive amount of redundant information, the actual complexity in the SVM optimization problem is linearly determined by the number of independent components, given by (3r+n1)!n!(3r1)!\frac{(3r+n-1)!}{n!(3r-1)!} [45]. While this can still be a big number, a bottleneck is only encountered in extreme situations when dealing with very large clusters at high ranks. In real applications, a TK-SVM can handle, for example, a cluster of several hundreds spins at rank-22 without a problem, which is only necessary when a quadrupolar order or a spin-liquid constraint has such a vast “unit cell”. In the case of rank n=1n=1, which detects magnetic orders, the size of a feasible cluster can be even greater.

Coefficient matrices, CμνC_{\mu\nu}, will be constructed by support vectors,

Cμν=kλkϕμ(𝐱(k))ϕν(𝐱(k)),C_{\mu\nu}=\sum_{k}\lambda_{k}\phi_{\mu}(\mathbf{x}^{(k)})\phi_{\nu}(\mathbf{x}^{(k)}), (4)

where λk\lambda_{k} is the Lagrangian multiplier related to the kk-th sample x(k)x^{(k)} and is solved in the underlying SVM optimization problem. The coefficient matrices can be viewed as encoders of order parameters, from which analytical expressions of the detected orders and constraints are extracted. As we investigated in Ref. [45] by comparing results using Ns=102N_{s}=10^{2} to 10510^{5} samples, a few hundred samples can already give coefficient matrices of decent quality; more samples reduce statistical errors. The complexity of the underlying SVM optimization problem generally scales as 𝒪(Ns2)\mathcal{O}(N_{s}^{2}) to 𝒪(Ns3)\mathcal{O}(N_{s}^{3}) [55]. However, the computational cost for carrying out the machine learning is in general small compared to that of generating the training data.

The parameter ρ\rho is called the bias. In a binary classification over two sample sets AA and BB, its behavior can be summarized as follows [47]

ρ(A|B){11}AB in the same phase,1A in the disordered phase,1B in the disordered phase,(1,1)not directly comparable.\displaystyle\rho(A\,|\,B)\begin{cases}\!\begin{rcases}\gg 1\\ \ll-1\end{rcases}\quad&\textup{$A$, $B$ in the same phase},\\ \approx 1&\textup{$A$ in the disordered phase},\\ \approx-1&\textup{$B$ in the disordered phase},\\ \in(-1,1)&\textup{not directly comparable}.\end{cases} (5)

Therefore, the ρ\rho parameter can act as an indicator of phase transitions and crossovers. Its magnitude indicates the absence or presence of a phase transition or crossover,

|ρAB|{1A,B in the same phase,1A,B in different phases.\displaystyle|\rho_{\rm AB}|\begin{cases}\gg 1&\textup{${\rm A,B}$ in the same phase},\\ \lesssim 1&\textup{${\rm A,B}$ in different phases}.\end{cases} (6)

The sign of ρ\rho further reveals which sample set originates from the (dis-)ordered side, namely, the orientation of the transition or crossover. The last case in Eq. (5) corresponds to situations where AA and BB have characteristics that can not straightforwardly be compared; hence, a relative disorderedness may not be well defined.

2.2 Graph partitioning

The use of graph partitioning in TK-SVM maximally exploits the reduced criterion Eq. (6) for ρ\rho. Consider a spin model involving a set of physical parameters such as interactions and temperature, whose phase diagram we seek to learn. We collect spin configurations from distinct parameter points in the physical parameter space. The distribution of the points can be uniform or not in case one wants to have a higher density of points in the regions of special interests. We then perform SVM multi-classification over the collected data. For MM sets of samples, one multi-classification produces M(M1)/2M(M-1)/2 decision functions; each is responsible for a binary classification between two sample sets [56].

The graph is constructed by viewing those parameter points as vertices and assigning an edge to each pair of them, while the edge weights are determined by the value of ρ\rho in the corresponding decision function, leading to a graph of MM vertices and M(M1)/2M(M-1)/2 edges.

The graph may be partitioned by different means. A simple yet efficient approach is Fiedler’s theory of spectral clustering [57, 58]. According to Fiedler’s theory, the graph can be described by a M×MM\times M Laplacian matrix L^\hat{L}. The off-diagonal elements of L^\hat{L} record the edge weights, and the diagonal elements represent the total weights of every vertex. The second smallest eigenvalue, λ2\lambda_{2}, of L^\hat{L} measures the algebraic connectivity of the graph. The eigenvector, 𝐟2\mathbf{f}_{2}, associated with λ2\lambda_{2} is known as the Fiedler vector and reflects the clustering of the graph. As we shall discuss in Section 3, the Fiedler vector plays the role of a phase diagram.

Refer to caption
Figure 3: Fiedler vectors obtained from the partition of the four graphs. The entries of each Fiedler vector are plotted against temperature. The curve corresponding to the triangle cluster at rank-22 captures the ground-state constraint Eq. (13). The two rank-33 curves realize the hidden octupolar order, formulated in Eqs. (18) and  (4.2). The single-spin cluster at rank-22 realizes the hidden quadrupolar Eq. (16), which occurs simultaneously with the octupolar one at TK=0.004JT_{K}=0.004J. The dashed lines indicate the boundaries of the three regimes: I, hidden D3hD_{3h} nematic; II, cooperative paramagnet; III, trivial paramagnet. The interval between T/J=0.4T/J=0.4 and 1010 is a crossover region where the ground-state constraint develops.

3 The phase diagram learned by TK-SVM

In order to learn the phase diagram of the classical KHAFM in an unsupervised way, we start the analysis with the graph partitioning. For this purpose, we collect spin configurations ranging from high to extremely low temperatures: we choose 8585 logarithmically equidistant temperatures between T/J=103T/J=10^{3} and 10510^{-5}, and store 10001000 independent configurations at each temperature. The samples are obtained by classical Monte Carlo simulations utilizing parallel tempering and heat-bath updates for a lattice consisting of 30723072 spins (32×3232\times 32 kagome unit cells) [49].

Next, we apply TK-SVM between any two temperature points and repeat this for different ranks and clusters, which both need to be sufficiently large in order to accommodate the correct orders. In practice, we typically examine clusters comprising a number of lattice unit cells and scrutinize the ranks that are compatible with a crystallographic system (n=1,2,3,4,6n=1,2,3,4,6). For the kagome Heisenberg model, it turns out that the optimal choice for learning the quadrupolar and octupolar order is the kagome unit cell (i.e., a three-spin triangle) with ranks n=2n=2 and 33. The results will be compared with those using a single-spin cluster. In Section 5, we will also discuss the rank n=1n=1 results of magnetic correlations.

Here are hence four different TK-SVM classification setups (the two types of clusters (a single unit cell and a single spin) combined with the two ranks (n=2n=2 and n=3n=3). Each leads to a graph of 8585 vertices and 35703570 edges, visualized by Figure 2. The edge weights are defined by a Lorentzian function

w(ρ)\displaystyle w(\rho) =1ρc2(|ρ|1)2+ρc2[0,1),\displaystyle=1-\frac{\rho_{c}^{2}}{(|\rho|-1)^{2}+\rho_{c}^{2}}\in[0,1), (7)

where ρ\rho is the bias parameter of the binary classification, and ρc\rho_{c} determines a characteristic value for “1\gg 1” in the reduced ρ\rho criterion Eq. (6). The choice of ρc\rho_{c} is nevertheless not crucial because vertices in the same phase typically have stronger connections than those from different phases. The robustness of the graph partitioning follows from the fact that ρc\rho_{c} can be varied over several orders of magnitude without significant changes to the topology of the phase diagram [47].

Partitioning these graphs leads to four different Fiedler vectors, depicted in Figure 3. Clearly, the temperature axis is divided into three regions: A rank-22 quantity constructed from the triangle cluster (see also panel (c) in Figure 2) discriminates around a temperature T/J1T/J\approx 1 a high-temperature region (region III) from the rest. Both the single-spin and the triangle cluster (cf. panels (b) and (d) in Figure 2) define a rank-33 quantity which distinguishes the very low temperatures (region I, T/J0.004T/J\leq 0.004) from the intermediate temperature region II. The result of the single-spin cluster at rank-22 (cf. panel (a) in Figure 2) is in line with the results the rank-33 ones. The discrimination of these regions reproduces in fact the finite-temperature phase diagram of the kagome Heisenberg model [15, 3] – even before we visit the nature of those phases.

4 Analytical order parameters

Having learned the topology of the KHAFM phase diagram, we now interpret the nature of each phase by their order parameters and possible local constraints. This will be done by extracting the analytical order parameters from the CμνC_{\mu\nu} matrices. In order to reduce the statistical errors on CμνC_{\mu\nu}, we pool the data according to the phase diagram of Figure 3. In addition, we introduce 25,00025,\!000 fictitious configurations generated at T=+T=+\infty which represent completely disordered states. This sets up a reduced multi-classification problem among four classes: regime I (28,00028,\!000 samples), regime II (21,00021,\!000 samples), regime III (21,00021,\!000 samples), and regime TT_{\infty}. Consequently, we obtain six Cμν(A|B)C_{\mu\nu}(A\,|\,B) for each rank and cluster, with A,B{I,II,III,T}A,B\in\{{\rm I,II,III,T_{\infty}}\} [49].

Not surprisingly, the high-temperature regime III is not distinct from the TT_{\infty} regime; the associated coefficient matrices are noise-like. Hence, only the two low-temperature regimes need further interpretation.

p[Qx2+y2+z2]p[Q_{x^{2}+y^{2}+z^{2}}] p[Qz2]p[Q_{z^{2}}] p[Qx2+y2]p[Q_{x^{2}+y^{2}}] p[Qx2y2]p[Q_{x^{2}-y^{2}}] p[Qxy+yx]p[Q_{xy+yx}] p[Qyz+zy]p[Q_{yz+zy}] p[Qzx+xz]p[Q_{zx+xz}]
on-site -0.487 1.460 0.731 0.732 0.733 0.727 0.735
I || II cross 0.081 -0.730 -0.366 -0.366 -0.367 -0.363 -0.367
bond 0.038 0.365 0.183 0.183 0.183 0.182 0.184
on-site 0.059 -0.176 -0.088 -0.088 -0.088 -0.088 -0.088
I || TT_{\infty} cross 0.910 0.089 0.044 0.044 0.044 0.044 0.044
bond -0.462 -0.044 -0.022 -0.022 -0.022 -0.022 -0.022
on-site 0.000 0.000 0.000 0.000 0.000 0.000 0.000
II || TT_{\infty} cross -0.998 -0.001 0.000 0.000 0.000 0.000 0.000
bond 0.445 -0.001 0.000 0.000 0.000 0.000 0.000
Table 1: The weights of quadrupolar ordering components, p[Q]p[Q_{\bullet}], extracted from the corresponding coefficient matrices. These weights are calculated through a least-squares fit based on all blocks in the full coefficient matrix of each of the site-site (“on-site”), site-bond (“cross”), and bond-bond (“bond”) types. The ratios between these weights for the three block types are given in Table 2.
γx2+y2+z2\gamma_{x^{2}+y^{2}+z^{2}} γz2\gamma_{z^{2}} γx2+y2\gamma_{x^{2}+y^{2}} γx2y2\gamma_{x^{2}-y^{2}} γxy+yx\gamma_{xy+yx} γyz+zy\gamma_{yz+zy} γzx+xz\gamma_{zx+xz}
I || II c ÷\div s -0.166 -0.500 -0.500 -0.500 -0.500 -0.500 -0.500
b ÷\div c 0.471 -0.500 -0.500 -0.500 -0.500 -0.500 -0.500
I || TT_{\infty} c ÷\div s -0.504 -0.499 -0.502 -0.502 -0.502 -0.502
b ÷\div c -0.508 -0.492 -0.495 -0.493 -0.493 -0.493 -0.493
II || TT_{\infty} b ÷\div c -0.446
Table 2: Ratios of the quadrupolar moments in Table 2, γQαβ/Qαα\gamma_{\bullet}\coloneqq\langle Q^{\alpha\beta}_{\bullet}\rangle/\langle Q^{\alpha\alpha}_{\bullet}\rangle. These can be calculated by taking the ratio between the weights of the relevant component with respect to “cross” and “on-site” blocks (c ÷\div s), or “bond” and “cross” blocks (b ÷\div c).
Refer to caption
Figure 4: The coefficient matrix Cμν(II|T)C_{\mu\nu}(\textup{II}\,|\,T_{\infty}) learned from a rank-2 TK-SVM on a three-spin triangular cluster. The dotted lines demarcate 9×99\times 9 blocks corresponding to identical spin indices (α,β),(α,β)(\alpha,\beta),(\alpha^{\prime},\beta^{\prime}) while the contents of each 9×99\times 9 block is enumerated in a similar way by component indices (a,b),(a,b)(a,b),(a^{\prime},b^{\prime}). On-site blocks (α=β,α=β\alpha=\beta,\alpha^{\prime}=\beta^{\prime}) are seen to be empty, while “cross” and “bond” blocks exhibit a non-vanishing δabδab\delta_{ab}\delta_{a^{\prime}b^{\prime}} pattern with relative weights of 1-1 and 0.446=γ0.446=-\gamma, respectively. See also Tables 1 and 2.

4.1 Ground-state constraint

We first discuss the emergent local constraint in the two low-temperature phases, which drives the system to coplanar order and is manifest using the triangular cluster at rank-22. The term “emergent” in the current case is intended to distinguish from the trivial constraint of spin normalization |𝐒|=1|\mathbf{S}|=1. Examples of learning more non-trivial constraints, including previously unknown ones, can be found in Ref. [48] in cases of Kitaev and Γ\Gamma spin liquids.

The coefficient matrix Cμν(II|T)C_{\mu\nu}({\rm II\,|\,T_{\infty}}) is visualized in Figure 4. It is composed of 9×99\times 9 blocks enumerated by the spin indices [α,β;αβ][\alpha,\beta;\alpha^{\prime}\beta^{\prime}], with α,β=1,2,3\alpha,\beta=1,2,3. One easily distinguishes three types of blocks: “on-site” (α=β,α=β\alpha=\beta,\,\alpha^{\prime}=\beta^{\prime}), “bond” (αβ,αβ\alpha\neq\beta,\,\alpha^{\prime}\neq\beta^{\prime}), and the mixed case “cross”. These blocks can be expressed as

Cab;abαβ;αβ=[(γ+γ¯δαβ)(γ+γ¯δαβ)δαβδαβ]δabδab,\displaystyle C^{\alpha\beta;\alpha^{\prime}\beta^{\prime}}_{ab;a^{\prime}b^{\prime}}=\left[(\gamma+\bar{\gamma}\delta_{\alpha\beta})(\gamma+\bar{\gamma}\delta_{\alpha^{\prime}\beta^{\prime}})-\delta_{\alpha\beta}\delta_{\alpha^{\prime}\beta^{\prime}}\right]\delta_{ab}\delta_{a^{\prime}b^{\prime}}, (8)

where γ\gamma denotes the ratio between the weight of the on-site and bond quadratic correlations in a coefficient matrix (cf. Table 1 and Table 2), and γ¯1γ\bar{\gamma}\coloneqq 1-\gamma.

Substituting Cab;abαβ;αβC^{\alpha\beta;\alpha^{\prime}\beta^{\prime}}_{ab;a^{\prime}b^{\prime}} into the decision function Eq. (1), one obtains

d({𝐒i})\displaystyle d(\{\mathbf{S}_{i}\}) α,βα,βa,ba,bCab;abαβ;αβSaαSbβSaαSbβ\displaystyle\sim\sum_{\begin{subarray}{c}\alpha,\beta\\ \alpha^{\prime},\beta^{\prime}\end{subarray}}\sum_{\begin{subarray}{c}a,b\\ a^{\prime},b^{\prime}\end{subarray}}C^{\alpha\beta;\alpha^{\prime}\beta^{\prime}}_{ab;a^{\prime}b^{\prime}}\langle S^{\alpha}_{a}S^{\beta}_{b}\rangle\langle S^{\alpha^{\prime}}_{a^{\prime}}S^{\beta^{\prime}}_{b^{\prime}}\rangle
=(γa(αSaα)2+γ¯α𝐒α2)2\displaystyle=\bigg{(}\gamma\langle\sum_{a}\big{(}\sum_{\alpha}S^{\alpha}_{a}\big{)}^{2}\rangle+\bar{\gamma}\sum_{\alpha}\langle\left\|\mathbf{S}^{\alpha}\right\|^{2}\rangle\bigg{)}^{2}
(α𝐒α2)2\displaystyle\qquad-\bigg{(}\sum_{\alpha}\langle\left\|\mathbf{S}^{\alpha}\right\|^{2}\rangle\bigg{)}^{2}
=9[γ2(Γ1γ)21],\displaystyle=9\bigg{[}\gamma^{2}\big{(}\Gamma-\frac{1}{\gamma}\big{)}^{2}-1\bigg{]}, (9)

where Γ\Gamma is a normalized constraint order parameter whose meaning will become transparent in the forthcoming discussion,

Γ=113α𝐒α2cl.\displaystyle\Gamma=1-\frac{1}{3}\langle\big{\|}\sum_{\alpha}\mathbf{S}^{\alpha}\big{\|}^{2}\rangle_{\textup{cl}}. (10)

As seen in Figure 4 and Table 1, the only non-vanishing quadratic correlation in Cμν(II|T)C_{\mu\nu}(\textup{II}\,|\,T_{\infty}) is

Qx2+y2+z2αβ=SxαSxβ+SyαSyβ+SzαSzβ.\displaystyle Q^{\alpha\beta}_{x^{2}+y^{2}+z^{2}}=S^{\alpha}_{x}S^{\beta}_{x}+S^{\alpha}_{y}S^{\beta}_{y}+S^{\alpha}_{z}S^{\beta}_{z}. (11)

γ\gamma is then determined by Qx2+y2+z2αβ/Qx2+y2+z2α=β0.446Q^{\alpha\neq\beta}_{x^{2}+y^{2}+z^{2}}/Q^{\alpha=\beta}_{x^{2}+y^{2}+z^{2}}\sim-0.446 for regime II.

The value of γ\gamma in fact appears to be temperature-dependent and converges to 12-\frac{1}{2} in regime I. This can be understood from the constraint order parameter, Eq. (10). As the squared sum in Γ\Gamma comprises three on-site (α=β\alpha=\beta) and six bond (αβ\alpha\neq\beta) correlations, and that their ratio is γ\gamma, the fulfillment of γ=12\gamma=-\frac{1}{2} is equivalent to the relation

𝐒(1)+𝐒(2)+𝐒(3)2cl=𝟎.\displaystyle\langle\big{\|}\mathbf{S}^{(1)}+\mathbf{S}^{(2)}+\mathbf{S}^{(3)}\big{\|}^{2}\rangle_{\textup{cl}}=\mathbf{0}. (12)

Since 2\big{\|}\dots\big{\|}^{2} is semi-positive definite, this in turn means a local constraint at each triangular plaquette,

𝐒(1)+𝐒(2)+𝐒(3)=𝟎.\displaystyle\mathbf{S}^{(1)}+\mathbf{S}^{(2)}+\mathbf{S}^{(3)}=\mathbf{0}. (13)

Namely, γ=12\gamma=-\frac{1}{2}, or equivalently Γ=1\Gamma=1, expresses the ground-state constraint of the KHAFM (the spins of every triangle lie in a plane), while the deviation of γ0.446\gamma\sim-0.446 in the higher-temperature regime II reflects thermal fluctuations of the constraint.

Refer to caption
(a) Cμν(I|T)C_{\mu\nu}({\rm I\,|\,T_{\infty}}), single spin, rank-22
Refer to caption
(b) Cμν(I|T)C_{\mu\nu}({\rm I\,|\,T_{\infty}}), single spin, rank-33
Figure 5: Coefficient matrices for regime I learned from a single-spin cluster at rank-22 and 33. They represent the quadrupolar (Eq. (16)) and octupolar (Eq. (18)) tensor order parameters, respectively. They also appear to be the motifs for the blocks shown in Figs. 6 and 7 obtained with a triangular cluster.

4.2 Hidden nematic order

We proceed by examining the order parameter of regime I. Given the Fiedler vectors in Figure 3, it is evident that the CμνC_{\mu\nu} matrices learned with the single-spin cluster are able to distinguish regime I from the high-temperature phases. We now look at the corresponding coefficient matrices, and will afterwards revisit this issue with the three-spin triangular cluster.

The patterns of Cμν(I|T)C_{\mu\nu}({\rm I\,|\,T_{\infty}}) using the single-spin cluster are shown in Figure 5, where it is a 9×99\times 9 matrix for rank-22 and 27×2727\times 27 matrix for rank-33.

The rank-22 pattern, shown in Figure 5a, has the following analytic expression,

34δaaδbb+34δabδba12δabδab.\displaystyle\frac{3}{4}\delta_{aa^{\prime}}\delta_{bb^{\prime}}+\frac{3}{4}\delta_{ab^{\prime}}\delta_{ba^{\prime}}-\frac{1}{2}\delta_{ab}\delta_{a^{\prime}b^{\prime}}. (14)

Substituting this into the decision function Eq. (1), we obtain

d(𝐱)\displaystyle d(\mathbf{x}) 32ab(SaSb13δab)2\displaystyle\sim\frac{3}{2}\sum_{ab}\left(\left<S_{a}S_{b}\right>-\frac{1}{3}\delta_{ab}\right)^{2}
=32Tr[QabQba],\displaystyle=\frac{3}{2}\Tr\left[Q_{ab}Q_{ba}\right], (15)

where

Qab=SaSb13δab\displaystyle Q_{ab}=\left<S_{a}S_{b}\right>-\frac{1}{3}\delta_{ab} (16)

is the famous uniaxial nematic tensor [59]. This shows that a rank-22 Cμν(I|T)C_{\mu\nu}({\rm I\,|\,T_{\infty}}) using a single-spin cluster probes the hidden quadrupolar order in the KHAFM [3, 15].

The rank-33 pattern Figure 5b is interpreted in the same way. It can be expressed as

δaaδbbδcc15(δaaδbcδbc+δacδbbδac+δabδabδcc),\displaystyle\delta_{aa^{\prime}}\delta_{bb^{\prime}}\delta_{cc^{\prime}}-\frac{1}{5}(\delta_{aa^{\prime}}\delta_{bc}\delta_{b^{\prime}c^{\prime}}+\delta_{ac}\delta_{bb^{\prime}}\delta_{a^{\prime}c^{\prime}}+\delta_{ab}\delta_{a^{\prime}b^{\prime}}\delta_{cc^{\prime}}), (17)

leading to a rank-33 tensor

Tabc=SaSbSc15Saδbc15Sbδca15Scδab,\displaystyle T_{abc}=S_{a}S_{b}S_{c}-\frac{1}{5}S_{a}\delta_{bc}-\frac{1}{5}S_{b}\delta_{ca}-\frac{1}{5}S_{c}\delta_{ab}, (18)

which is precisely the octupolar order parameter [60].

We are left with the examination of the three-spin cluster. Although it is not quite visible from the Fiedler vector in Figure 3, TK-SVM with a triangular cluster at rank-22 does in fact discriminate between regime I and II, but the distinction is blurred by the constraint, Eq. (10), which appears as the primary order parameter: The triangular rank-22 curve in Figure 3 varies in the low-temperature regions, but the gradient is much more modest than the profound change seen when the constraint emerges.

Figure 6 shows the Cμν(I|II)C_{\mu\nu}({\rm I\,|\,II}) obtained from a three-spin cluster at rank-22, which displays a richer structure than the Cμν(II|T)C_{\mu\nu}({\rm II\,|\,T_{\infty}}) shown in Figure 4. Following a similar analysis, it gives rise to another quadrupolar tensor,

𝕆quad=13α𝐒α𝐒α(13α𝐒α)2cl.\displaystyle\mathbb{O}_{\textup{quad}}=\bigg{\langle}\frac{1}{3}\sum_{\alpha}\mathbf{S}^{\alpha}\otimes\mathbf{S}^{\alpha}-\big{(}\frac{1}{3}\sum_{\alpha}\mathbf{S}^{\alpha}\big{)}^{\otimes 2}\bigg{\rangle}_{\textup{cl}}. (19)

The first term is equivalent to the single-spin nematic tensor Eq. (16), while the second term expresses the constraint, Eq. (13). In other words, the triangular cluster at rank-22 simultaneously detects the hidden quadrupolar order and the ground-state constraint.

In Figure 7, we show the block structure of the rank-33 Cμν(I|T)C_{\mu\nu}({\rm I\,|\,T_{\infty}}) using the triangular cluster. The full matrix has a dimensionality of 729×729729\times 729, which is composed of 27×2727\times 27 blocks. Each block exhibits the same pattern as that in Figure 5b for the single-spin cluster, but is further multiplied with a weight. Its analytic expression displays a new octupolar tensor,

𝕆oct\displaystyle\mathbb{O}_{\textup{oct}} =19[α(𝐒α) 3+αβγ𝐒α𝐒β𝐒γ]\displaystyle=\bigg{\langle}\frac{1}{9}\bigg{[}\sum_{\alpha}\left(\mathbf{S}^{\alpha}\right)^{\otimes\,3}+\sum_{\alpha\neq\beta\neq\gamma}\mathbf{S}^{\alpha}\otimes\mathbf{S}^{\beta}\otimes\mathbf{S}^{\gamma}\bigg{]}
(13α𝐒α) 3cl.\displaystyle\qquad-\big{(}\frac{1}{3}\sum_{\alpha}\mathbf{S}^{\alpha}\big{)}^{\otimes\,3}\bigg{\rangle}_{\textup{cl}}. (20)

The first term of 𝕆oct\mathbb{O}_{\textup{oct}} reproduces the single-spin order parameter Eq. (18). The second term represents the ground-state constraint Eq. (13) and will become a constant tensor, 𝟎3\mathbf{0}^{\otimes 3}, when Γ\Gamma saturates to Γ=1\Gamma=1, which is the case in regime I. The second term is a rank-33 tensor defined by the three spins in a triangular plaquette. It serves as an alternative characterization of the octupolar order and is equivalent to the single-spin form. This also explains the agreement of the two rank-33 Fiedler vectors in Figure 3.

Refer to caption
Figure 6: The coefficient matrix Cμν(I|II)C_{\mu\nu}(\textup{I}\,|\,\textup{II}) learned from TK-SVM set up at rank-2 on a three-spin triangular cluster. The dotted lines demarcate 9×99\times 9 blocks corresponding to identical spin indices. The same motif repeats in each block, but multiplied by a factor of γ=1/2\gamma=-1/2 on cross-type blocks and γ2\gamma^{2} on bond-type blocks, as compared to on-site blocks (cf. Table 2). This modulation encodes the constraint, in addition to the quadrupolar order parameter that is seen using a single-spin cluster.
Refer to caption
Figure 7: The block structure of the coefficient matrix obtained from rank-3 TK-SVM on a triangular cluster. Each block follows the form of Figure 5b, multiplied by the block weights indicated here. Basis tensors 𝐒α𝐒β𝐒γ\mathbf{S}^{\alpha}\otimes\mathbf{S}^{\beta}\otimes\mathbf{S}^{\gamma} with α=β=γ\alpha=\beta=\gamma and αβγ\alpha\neq\beta\neq\gamma are seen to contribute equally, whereas all others are diminished by by a factor of γ=1/2\gamma=-1/2.
Cluster single spin triangle
Rank 2 3 2 3
I || II -1.0131 -1.0131 -3.286 -1.0145
I || III -1.0121 -1.0041 -1.0035 -1.0026
I || TT_{\infty} -1.0129 -1.0044 -0.9928 -1.0028
II || III 10.63 -0.9592 -1.0260 -1.0691
II || TT_{\infty} 4.218 -1.0598 -1.0103 -1.0805
III || TT_{\infty} 2.158 -1285 -20.54 -1.9802
Table 3: TK-SVM biases obtained from the reduced multiclassification problems among the temperature regimes revealed by Figure 3 as well as the fictitious control group, TT_{\infty}. The meaning of these values is understood by the ρ\rho criterion Eq. (5).

4.3 Phase Hierarchy

Let us put the learned order parameters and constraints together and infer a single coherent physical picture. Regime III is a trivial paramagnet (PM) which is equivalent to the infinite temperature state. Regime II is an instance of a cooperative paramagnet (CPM), which may also be referred to as a classical spin liquid and is characterized by an emergent local constraint. Regime I meets the standard of a biaxial spin nematic (BSN) where the uniaxial quadrupolar and the biaxial octupolar order together define the D3hD_{3h} symmetry [53].

Accordingly, the temperature scales in the phase diagram Figure 3 can be understood from a hierarchy of disorderedness,

O(3)PM (III)O(3)CPM (II)D3hBSN (I).\underset{\text{PM (III)}}{\mathrm{O}(3)}\longrightarrow\underset{\text{CPM (II)}}{\mathrm{O}(3)}\longrightarrow\underset{\text{BSN (I)}}{D_{3h}}. (21)

In the high-temperature PM phase, spins can freely fluctuate. In the intermediate CPM phase, although the system still preserves the O(3)O(3) symmetry of the KHAFM Hamiltonian, fluctuations of spins become correlated. Then the BSN order emerges from a constrained subset of the phase space.

This phase hierarchy is also reflected by the bias parameters in the four different multi-classification setups, which are shown in Table 3. Following the bias criterion of Eq. (5), regime I (BSN) has the least disorder since ρ(I|II,III)1\rho({\rm I\,|\,II,\,III})\lesssim-1 in all instances. The ρ(I|II)\rho(\textup{I}\,|\,\textup{II}) learned with the triangular cluster at rank-22 acquired a value noticeably smaller than 1-1, since the constraint Eq. 13 is satisfied in both phases (Γ1\Gamma\sim 1). However, its sign remains revealing that regime II is more disordered. Regime III (PM) is the most disordered as ρ(I,II|III)1\rho({\rm I,\,II\,|\,III})\lesssim-1 in all cases, with one exception as the single-spin cluster cannot represent the three-spin constraint Eq. (13).

Refer to caption
Figure 8: Illustration of a cluster consisting of m×mm\times m kagome unit cells. Numbers on the triangle plaquettes enumerate the unit cells from 11 to m2m^{2}. A lattice with linear size LL is then divided into (Lm)2\big{(}\frac{L}{m}\big{)}^{2} clusters. The three-spin cluster used to represent the quadrupolar and octupolar orders in Eqs. 19 and 4.2 corresponds to the special case m=1m=1. A perfect 3×3\sqrt{3}\times\sqrt{3} state is shown as an illustration, where, for example, the spin pattern in plaquette-11 is repeated identically in plaquette-(m+2)(m+2).
Refer to caption
Figure 9: The coefficient matrix Cμν(Tlow|T)C_{\mu\nu}(T_{\rm low}\,|\,T_{\infty}) learned from a rank-1 TK-SVM with a cluster of 3×33\times 3 kagome unit cells, where Tlow=105JT_{\rm low}=10^{-5}J. A,B,CA,B,C distinguish the three spins (sub-lattices) in a triangular plaquette, enumerating anti-clockwise from the left corner. The numbers label unit cells in the cluster, as illustrated in Figure 8; here m=3m=3. The CμνC_{\mu\nu} matrix can be divided into 3×33\times 3 blocks; each detects the form of correlations between two spins. The pattern here reflects a dominant 3×3\sqrt{3}\times\sqrt{3} structure in magnetic correlations. For example, in the bottom row, the A1A_{1}-A1A_{1} block is positive and has the same structure as the A1A_{1}-B2B_{2} block, indicating that A1A_{1} and B2B_{2} spin are also positively correlated, as in a 3×3\sqrt{3}\times\sqrt{3} state; see Figure 8. Consistently, the A5,B5,C5A_{5},B_{5},C_{5} blocks (plaquette-(m+2)(m+2)) repeat the structure of A1,B1,C1A_{1},B_{1},C_{1} blocks (plaquette-11).
Refer to caption
Figure 10: Block structure of the coefficient matrix Cμν(Tlow|T)C_{\mu\nu}(T_{\rm low}\,|\,T_{\infty}) learned using a rank-1 TK-SVM and cluster of 12×1212\times 12 kagome unit cells. Each matrix element is related to the correlation between two spins in the cluster. Spins are enumerated from 11 to 3m23m^{2} from left (bottom) to right (top), as in Figure 9; here m=12m=12. The color (value) of a pixel is defined as the Frobenius norm of the corresponding 3×33\times 3 small block in the full Cμν(Tlow|T)C_{\mu\nu}(T_{\rm low}\,|\,T_{\infty}).

5 Magnetic correlations

We now examine the magnetic correlations in the low-temperature regime, T=105JT=10^{-5}J, of the classical KHAFM, learned by rank-11 TK-SVM.

We first consider a cluster containing 3×33\times 3 kagome unit cells, i.e. 2727 spins (see Figure 8), for a system defined on a lattice of linear size L=36L=36. The resulting CμνC_{\mu\nu} matrix is shown in Figure 9. It has dimension 81×8181\times 81 and can be divided into small 3×33\times 3 blocks. The structure of those blocks evinces in which way the correlation between two spins is defined. Because the small 3×33\times 3 blocks have only entries on the diagonal, the correlation is simply captured by the usual inner product SaSbδabS_{a}S_{b}\delta_{ab}. (In general, the contraction can be more complicated even for magnetic orders). The change of color between the blocks indicates an antiferromagnetic arrangement of the spins: The blue blocks correspond to positive correlations, and their locations precisely reflect a 3×3\sqrt{3}\times\sqrt{3} structure.

Next, in Figure 10, we show the result using a cluster of 12×1212\times 12 kagome unit cells (432432 spins). Owing to the large dimensionality of the CμνC_{\mu\nu} matrix, dimCμν=12962\dim C_{\mu\nu}=1296^{2}, only the block structure is plotted, where each of its pixels corresponds to a 3×33\times 3 block in Figure 9. In the presence of long-range order for an order parameter that can be defined on a small cluster, CμνC_{\mu\nu} learnt for a large cluster should display the same structure as for that small cluster. However, the pattern of Figure 10 does not repeat that of Figure 9: It fades rather fast, indicating that the linear correlation between two spins is not robustly established at longer distances. (Note that CμνC_{\mu\nu} does not directly measure the strength of spin-spin correlations. Instead, it probes the form of the correlations and order parameters. See Eqs. 14 and 4.2 for a concrete example.)

Therefore, consistent with observations in the literature [7, 6, 15, 19, 20], our machine detects that the antiferromagnetic 3×3\sqrt{3}\times\sqrt{3} structure is the dominant type of correlation in the dipolar channel of the classical KHAFM at low temperature. However, to establish the T0T\rightarrow 0 critical point, a systematic analysis of temperature and finite-size effect is needed [19], which is beyond the scope of the present paper and current algorithms.

6 Conclusion

In summary, we have shown how TK-SVM can infer the phase diagram of the classical Heisenberg model defined on the kagome lattice in an unsupervised way. It has successfully learned that the spins are constrained to coplanar states owing to an emergent GSC and identified the hidden octupolar and quadrupolar tensor order parameters, which define a D3hD_{3h} biaxial-nematic state. Moreover, the machine also recognized the dominant 3×3\sqrt{3}\times\sqrt{3} correlations in the low temperature regime.

As a data-driven approach, it does not require any prior knowledge or particular insight in the system. Instead, by revealing the phase diagram and analytical order parameters in an unsupervised setting, the underlying physics becomes immediately evident. We expect TK-SVM to become an indispensable tool in analyzing many-body spin systems, as it provides an enormous speedup when the order parameters and phase diagrams are complicated.

We wish to thank Roderich Moessner and Ludovic Jaubert for helpful discussions. JG, KL, and LP acknowledge support from FP7/ERC Consolidator Grant QSIMCORR, No. 771891, and the Deutsche Forschungsgemeinschaft (DFG, German Research Foundation) under Germany’s Excellence Strategy – EXC-2111 – 390814868. Our simulations make use of the ν\nu-SVM formulation [61], the LIBSVM library [62, 63], and the ALPSCore library [64].

Open source

The TK-SVM library has been made openly available with documentation and examples [65].

Reference

References