On the stress dependence of the elastic tensor

Matthew Maitra & David Al-Attar
Department of Earth Sciences Bullard Laboratories University of Cambridge
Madingley Road Cambridge CB3 OEZ United Kingdom.
Email: mam221@cam.ac.uk

keywords:

Theoretical seismology; elasticity and anelasticity; seismic anisotropy.

{summary}

The dependence of the elastic tensor on the equilibrium stress is investigated theoretically. Using ideas from finite elasticity, it is first shown that both the equilibrium stress and elastic tensor are given uniquely in terms of the equilibrium deformation gradient relative to a fixed choice of reference body. Inversion of the relation between the deformation gradient and stress might, therefore, be expected to lead neatly to the desired expression for the elastic tensor. Unfortunately, the deformation gradient can only be recovered from the stress up to a choice of rotation matrix. Hence it is not possible in general to express the elastic tensor as a unique function of the equilibrium stress. By considering material symmetries, though, it is shown that the degree of non-uniqueness can sometimes be reduced, and in some cases even removed entirely. These results are illustrated through a range numerical calculations, and we also obtain linearised relations applicable to small perturbations in equilibrium stress. Finally, we make a comparison with previous studies before considering implications for geophysical forward- and inverse-modelling.

1 Introduction

Approaches to seismic wave propagation within a pre-stressed Earth have a long and complicated history; see Dahlen & Tromp (1998, Chapter 1). Early work on theoretical seismology (e.g. Thomson, 1863; Lamb, 1881) was built on the theory of classical linear elasticity. But classical linear elasticity is founded upon an assumption of small deformations away from a stress-free equilibrium. Its applicability to seismology is therefore unclear, given the presence of large equilibrium stresses within the Earth. In fact, it was not until the work of Dahlen (1972a) that a correct treatment was given.

Dahlen (1972a) derived the equations of motion relevant to global seismology by direct linearisation of the equations of finite elasticity. It is a result of this approach that the elastic tensor can be written as the sum of two pieces: one without explicit stress dependence, and a second piece that depends on stress linearly. There is no question that this decomposition is valid and that Dahlen’s equations of motion are correct, but the decomposition is not unique in that the elastic tensor’s stress dependence is left (partially) implicit.

Later that year Dahlen (1972b) used his earlier results to study plane-wave propagation in the presence of an arbitrary equilibrium pre-stress. To do so, he theorised that the elastic tensor’s stress dependence should take a particular functional form. He assumed specifically that the only stress dependence was what his earlier formulae had made explicit. That theory has two important implications. Firstly, the elastic wave speeds display no explicit dependence on equilibrium pressure. Secondly, deviatoric stresses induced within an isotropic medium have no first-order effect on P-wave speeds, whilst S-waves are split to the same accuracy. This is illustrated in the left panel of Fig.1; as with all the figures in this paper, the values of the physical quantities have been chosen for the sake of illustration and are not necessarily geophysically realistic.

Refer to caption — Figure 1: The effect of an induced deviatoric stress on plane-wave propagation within an initially isotropic body. Each panel shows a slice through the $x$ – $y$ plane of the slowness surface of an isotropic material in which a stress has been induced. The zero-stress result is shown dashed in the background for reference. The material on the left follows the theory of Dahlen (1972b). Whilst the S-waves are noticeably split the P-wave surface has not perceptibly moved from its isotropic position, illustrating Dahlen’s result that P-wave speeds are not changed to first-order by a small stress. By contrast, the material on the right obeys Tromp & Trampert (2018), and there we see that P-wave speeds are indeed perturbed to first-order in small stress. There is also a marked difference in the S-wave splitting pattern. In this figure we have taken $\mu^{\prime}=\kappa^{\prime}=1$ , consistent with Stacey (1992).

The problem of the elastic tensor’s stress dependence has since been revisited by Tromp & Trampert (2018) who were motivated by the possibility of using seismic data to image stresses within the Groningen gas field. Importantly, they arrived at a theory that predicts physical behaviour both quantitatively and qualitatively distinct from that derived by Dahlen. In direct contravention of Dahlen, Tromp & Trampert suggest that both P- and S-wave speeds change to first-order if a small deviatoric stress is induced in an isotropic medium (Fig.1, right panel).

The most recent work on the problem is that of Tromp et al. (2019). These authors not only generalised Tromp & Trampert’s results beyond the framework provided by Dahlen & Tromp (1998) (see below) but also included comparisons between their new theoretical results and ab initio calculations. Their results reduce to those of Tromp & Trampert (2018) for an isotropic body, but make different predictions otherwise. Fig. 2 contrasts the two theories as applied to a transversely-isotropic body.

The seismological community is left with three distinct theories for the effect of equilibrium stress on seismic wave propagation. This has led us to undertake the present work, which revisits the elastic tensor’s stress dependence and seeks to clarify it from the perspective of finite elasticity. It should be emphasised that the work of Dahlen (1972a) – which has underlain most of global seismology and related fields since its publication – is fully correct and general. The problem that we wish to address concerns only the elastic tensor’s dependence on equilibrium stress. To provide more specific context we will begin by presenting a heuristic approach to the problem that slightly generalises the previous discussions and points towards their limitations.

1.1 A heuristic linearised theory of the elastic tensor’s stress dependence

1.1.1 Preliminaries

In the notation of Dahlen & Tromp (1998) the equations of motion governing global seismology are (Dahlen & Tromp, 1998, p.84)

\displaystyle\partial_{t}^{2}\mathbf{s}-\frac{1}{\rho^{0}}\nabla\cdot\!\left(\bm{\Lambda}:\!\nabla\mathbf{s}\right)+2\mathbf{\Omega}\times\partial_{t}\mathbf{s}+\nabla\phi^{E1}+\mathbf{s}\cdot\nabla\nabla\!\left(\phi^{0}+\psi\right)=0,

(1.1)

for an Earth model initially at equilibrium with density $\rho^{0}$ and gravitational potential $\phi^{0}$ , and which rotates at constant angular velocity $\mathbf{\Omega}$ giving rise to the centrifugal potential $\psi$ . The displacement from equilibrium is $\mathbf{s}$ , with $\phi^{E1}$ the corresponding perturbation to the gravitational potential. Most importantly for our purposes, the Earth model is assumed to be pre-stressed, supporting a nonzero equilibrium Cauchy stress $\mathbf{T}^{0}$ , while $\bm{\Lambda}$ is the elastic tensor. (Henceforward we will always refer to equilibrium Cauchy stress simply as “stress” unless we state otherwise or the context offers no possibility of confusion; see Section 2.1.2 for a discussion of different stress tensors.) As discussed at length in Dahlen & Tromp’s Section 3.6, $\bm{\Lambda}$ is one of a number of elastic tensors that can be used depending on how the equations of motion are formulated. It is particularly relevant for us that there exists another elastic tensor $\bm{\Xi}$ related to $\bm{\Lambda}$ by (Dahlen & Tromp, 1998, eq.3.122)

\displaystyle\Lambda_{ijkl}=\Xi_{ijkl}+T^{0}_{ik}\delta_{jl}.

(1.2)

$\bm{\Xi}$ is obtained as the second derivative of a strain-energy function. It therefore satisfies the hyperelastic symmetry

\displaystyle\Xi_{ijkl}=\Xi_{klij}

(1.3)

and the minor symmetries

\displaystyle\Xi_{ijkl}=\Xi_{jikl}=\Xi_{ijlk}

(1.4)

(collectively referred to as the classical elastic symmetries) and thus possesses 21 independent components at most. We see that $\bm{\Lambda}$ is decomposed into two parts, one with explicit, linear stress dependence, and another whose dependence on stress is a priori unclear. The purpose of the present work is to establish how $\bm{\Xi}$ – and hence $\bm{\Lambda}$ – depends on $\mathbf{T}^{0}$ .

For geophysical applications it seems reasonable to restrict attention to linearised stress dependence about a hydrostatic background; finding the linearised stress dependence of $\bm{\Xi}$ is necessary and sufficient to specify that of $\bm{\Lambda}$ . We regard $\mathbf{T}^{0}$ as being composed of a background hydrostatic stress $\mathbf{T}^{B}$ described by a large pressure $p^{0}$ , with a small incremental stress $\Delta\mathbf{T}^{0}$ superimposed. The total equilibrium stress is thus written as

	$\displaystyle\mathbf{T}^{0}$	$\displaystyle=\mathbf{T}^{B}+\Delta\mathbf{T}^{0}$
		$\displaystyle=-p^{0}\mathbf{1}-\Delta p^{0}\mathbf{1}+\Delta\bm{\tau}^{0},$		(1.5)

where $\Delta p^{0}$ and $\Delta\bm{\tau}^{0}$ are the hydrostatic and deviatoric components of $\Delta\mathbf{T}^{0}$ . We will use this notation consistently for the rest of this section, which means that some of the expressions we quote will look slightly different from how they appear elsewhere. Now we ask what the most general linear stress dependence of $\bm{\Xi}$ could be. To that end we write $\bm{\Xi}$ as a Taylor series,

\displaystyle\Xi_{ijkl}

\displaystyle=\Gamma_{ijkl}+\Pi_{ijklmn}\Delta T^{0}_{mn}+\mathcal{O}\!\left(\left\|\Delta\mathbf{T}^{0}\right\|^{2}\right),

(1.6)

and neglect all terms of quadratic and higher order in $\left\|\Delta\mathbf{T}^{0}\right\|$ . $\bm{\Gamma}$ is then the elastic tensor of the background hydrostatic state while $\bm{\Pi}$ represents the elastic tensor’s stress-derivatives evaluated in that background state. To first-order accuracy a body’s response to changes in incremental stress is thus determined entirely by $\bm{\Pi}$ . Decomposing the stress into its hydrostatic and deviatoric parts,

\displaystyle\Delta T^{0}_{ij}=-\Delta p^{0}\delta_{ij}+\Delta\tau^{0}_{ij},

(1.7)

we can write

\displaystyle\Xi_{ijkl}=\Gamma_{ijkl}-\Pi_{ijklaa}\Delta p^{0}+\Pi_{ijklmn}\Delta\tau^{0}_{mn},

(1.8)

based on which we define the pressure-derivatives of the elastic tensor as

\displaystyle\frac{\partial\Xi_{ijkl}}{\partial p}\equiv\Xi^{\prime}_{ijkl}=-\Pi_{ijklaa}

(1.9)

and its derivatives with respect to deviatoric stress as

\displaystyle\frac{\partial\Xi_{ijkl}}{\partial\tau_{mn}}

\displaystyle=\Pi_{ijklmn}-\frac{1}{3}\Pi_{ijklaa}\delta_{mn}.

(1.10)

Both $\bm{\Xi}$ and $\bm{\Gamma}$ must possess the full set of classical elastic symmetries, and the symmetry of the Cauchy stress means that $\bm{\Pi}$ may be taken as symmetric on its last two indices without loss of generality. Therefore $\bm{\Pi}$ is required at minimum to obey the relations

\displaystyle\hbox to0.0pt{$\overbrace{\phantom{\Pi_{ijklnm}=\Pi_{ijklmn}}}^{\text{Symmetry of }\mathbf{T}^{0}}$\hss}\Pi_{ijklnm}=\underbrace{\Pi_{ijklmn}=\Pi_{jiklmn}=\Pi_{ijlkmn}=\Pi_{klijmn}}_{\text{Classical elastic symmetries}},

(1.11)

from which one can show that it possesses at most $21\times 6=126$ independent components.

1.1.2 Isotropic materials

In an isotropic, hydrostatically pre-stressed material neither $\bm{\Gamma}$ nor $\bm{\Pi}$ can exhibit any preferred direction. This is true if and only if the material is isotropic. No matter the value of the background hydrostatic pressure, $\bm{\Gamma}$ therefore has just two independent components, the bulk modulus $\kappa$ and shear modulus $\mu$ . According to convenience one can also use the relation

\displaystyle\kappa=\lambda+\frac{2}{3}\mu

(1.12)

to write $\bm{\Gamma}$ in terms of the Lamé parameters $\lambda$ and $\mu$ . In terms of the elastic moduli $\bm{\Gamma}$ takes the familiar form

\displaystyle\Gamma_{ijkl}=\!\left(\kappa-\frac{2}{3}\mu\right)\delta_{ij}\delta_{kl}+\mu\!\left(\delta_{ik}\delta_{jl}+\delta_{il}\delta_{jk}\right).

(1.13)

The tensor $\bm{\Pi}$ also has considerably fewer than 126 components. It too will only be composed of Kronecker deltas, and the most general such tensor that also satisfies the required symmetries (eq.1.11) has four independent components. Thus

$\displaystyle\Pi_{ijklmn}$	$\displaystyle=a\!\left[\delta_{ij}\delta_{k(m}\delta_{n)l}+\delta_{kl}\delta_{i(m}\delta_{n)j}-\frac{2}{3}\delta_{ij}\delta_{kl}\delta_{mn}\right]$
	$\displaystyle\qquad+b\!\left[\delta_{ik}\delta_{j(m}\delta_{n)l}+\delta_{il}\delta_{j(m}\delta_{n)k}+\delta_{jk}\delta_{i(m}\delta_{n)l}+\delta_{jl}\delta_{i(m}\delta_{n)k}-\frac{2}{3}\!\left(\delta_{ik}\delta_{jl}+\delta_{il}\delta_{jk}\right)\delta_{mn}\right]$
	$\displaystyle\qquad-\!\left[\frac{c}{3}\,\delta_{ij}\delta_{kl}+\frac{d}{3}\,\!\left(\delta_{ik}\delta_{jl}+\delta_{il}\delta_{jk}\right)\right]\delta_{mn},$	(1.14)

where we have explicitly symmetrised on $m$ and $n$ by writing parentheses around the indices, e.g.

\displaystyle\delta_{k(m}\delta_{n)l}=\frac{1}{2}\!\left(\delta_{km}\delta_{nl}+\delta_{kn}\delta_{ml}\right).

(1.15)

One can derive eq.(1.1.2) by splitting the incremental stress into its hydrostatic and deviatoric components,

\displaystyle\Delta T^{0}_{ij}=-\Delta p^{0}\delta_{ij}+\Delta\tau^{0}_{ij},

(1.16)

and considering the pressure and deviatoric stress separately. By symmetry, the only possible effect of an induced incremental pressure is to alter the elastic moduli, which gives the terms in $c$ and $d$ above. Turning to the deviatoric stress, we must construct all possible tensors with the classical elastic symmetries that are permutations of $\delta_{ij}\Delta\tau^{0}_{kl}$ (see Dahlen & Tromp, 1998, p.79). The first possibility is

\displaystyle\delta_{ij}\Delta\tau^{0}_{kl}+\delta_{kl}\Delta\tau^{0}_{ij}.

(1.17)

Each term possesses the minor symmetries trivially, but it is only when the two terms are added together that we gain the hyperelastic symmetry. The second such tensor is a little more complicated, taking the form

\displaystyle\delta_{ik}\Delta\tau^{0}_{jl}+\delta_{il}\Delta\tau^{0}_{jk}+\delta_{jk}\Delta\tau^{0}_{il}+\delta_{jl}\Delta\tau^{0}_{ik}.

(1.18)

These are the only two tensors that fulfil our requirements, and they lead respectively to the terms in $a$ and $b$ above. Note that we have defined $a$ , $b$ , $c$ and $d$ so that the terms in $a$ and $b$ only interact with deviatoric stress (they are traceless on $(mn)$ ), while those in $c$ and $d$ interact with pressure alone. The components of $\bm{\Xi}$ are finally

	$\displaystyle\Xi_{ijkl}$	$\displaystyle=\!\left(\kappa-\frac{2}{3}\mu+\Delta p^{0}c\right)\delta_{ij}\delta_{kl}+\!\left(\mu+\Delta p^{0}d\right)\!\left(\delta_{ik}\delta_{jl}+\delta_{il}\delta_{jk}\right)$
		$\displaystyle\qquad+a\!\left(\delta_{ij}\Delta\tau^{0}_{kl}+\delta_{kl}\Delta\tau^{0}_{ij}\right)+b\!\left(\delta_{ik}\Delta\tau^{0}_{jl}+\delta_{il}\Delta\tau^{0}_{jk}+\delta_{jk}\Delta\tau^{0}_{il}+\delta_{jl}\Delta\tau^{0}_{ik}\right).$		(1.19)

We have arrived at the most general linearised theory consistent with an isotropic, hydrostatically stressed background. It should be noted that this heuristic theory is based on symmetry arguments alone and provides no further information about the four scalars it identifies. One could therefore just regard them as free parameters of the theory, to be fitted experimentally. However, it is not clear that they actually represent four degrees of freedom. They could well be found to depend on some smaller (or larger) set of material-dependent parameters if constitutive behaviour were considered.

The theories of Dahlen (1972b), Tromp & Trampert (2018) and indeed Dahlen & Tromp (1998, eq.3.135) can all be seen as special cases of eq.(1.1.2). Each theory corresponds to choosing

\displaystyle\begin{aligned} c&=-2a\\ d&=-2b,\end{aligned}

(1.20)

and they are then distinguished by their values of $a$ and $b$ . In Dahlen & Tromp (1998, eq.3.135) $a$ and $b$ are regarded as free parameters, yielding the expression

\displaystyle\Xi_{ijkl}=\!\left(\kappa-\frac{2}{3}\mu\right)\delta_{ij}\delta_{kl}+\mu\!\left(\delta_{ik}\delta_{jl}+\delta_{il}\delta_{jk}\right)+a\!\left(\delta_{ij}\Delta T^{0}_{kl}+\delta_{kl}\Delta T^{0}_{ij}\right)+b\!\left(\delta_{ik}\Delta T^{0}_{jl}+\delta_{il}\Delta T^{0}_{jk}+\delta_{jk}\Delta T^{0}_{il}+\delta_{jl}\Delta T^{0}_{ik}\right).

(1.21)

The authors subsequently arrive at Dahlen’s original expression (e.g. Dahlen & Tromp, 1998, eq.3.139)

\displaystyle\Xi_{ijkl}=\!\left(\kappa-\frac{2}{3}\mu\right)\delta_{ij}\delta_{kl}+\mu\!\left(\delta_{ik}\delta_{jl}+\delta_{il}\delta_{jk}\right)+\frac{1}{2}\!\left(\delta_{ij}\Delta T^{0}_{kl}+\delta_{kl}\Delta T^{0}_{ij}\right)-\frac{1}{2}\!\left(\delta_{ik}\Delta T^{0}_{jl}+\delta_{il}\Delta T^{0}_{jk}+\delta_{jk}\Delta T^{0}_{il}+\delta_{jl}\Delta T^{0}_{ik}\right)

(1.22)

by making the “convenient” choice

\displaystyle a

\displaystyle=-b=\frac{1}{2}.

(1.23)

Given eq.(1.20) this is the unique choice of $a$ and $b$ that ensures the aforementioned characteristic features of Dahlen’s theory: that seismic wave speeds have no explicit pressure-dependence, and that P-wave speeds are unaffected by deviatoric stress to first order in perturbation theory. Tromp & Trampert, on the other hand, set

\displaystyle\begin{aligned} a&=\frac{1}{2}+\frac{1}{3}\mu^{\prime}-\frac{1}{2}\kappa^{\prime}\\ b&=-\frac{1}{2}-\frac{1}{2}\mu^{\prime},\end{aligned}

(1.24)

with $\kappa^{\prime}$ and $\mu^{\prime}$ denoting the pressure-derivatives of the elastic moduli. Their elastic tensor is thus (Tromp & Trampert, 2018, eq.44)

	$\displaystyle\Xi_{ijkl}$	$\displaystyle=\!\left(\kappa-\frac{2}{3}\mu\right)\delta_{ij}\delta_{kl}+\mu\!\left(\delta_{ik}\delta_{jl}+\delta_{il}\delta_{jk}\right)$
		$\displaystyle\qquad+\frac{1}{2}\!\left(1-\kappa^{\prime}+\frac{2}{3}\mu^{\prime}\right)\!\left(\delta_{ij}\Delta T^{0}_{kl}+\delta_{kl}\Delta T^{0}_{ij}\right)-\frac{1}{2}\!\left(1+\mu^{\prime}\right)\!\left(\delta_{ik}\Delta T^{0}_{jl}+\delta_{il}\Delta T^{0}_{jk}+\delta_{jk}\Delta T^{0}_{il}+\delta_{jl}\Delta T^{0}_{ik}\right).$		(1.25)

The motivation behind this particular combination of $a$ and $b$ is that it leads to the wave speeds

\displaystyle\begin{aligned} \rho c_{p}^{2}&=\!\left(\kappa+\kappa^{\prime}\Delta p^{0}\right)+\frac{4}{3}\!\left(\mu+\mu^{\prime}\Delta p^{0}\right)\\ \rho c_{s}^{2}&=\!\left(\mu+\mu^{\prime}\Delta p^{0}\right)\end{aligned}

(1.26)

when an incremental hydrostatic stress $\Delta p^{0}$ is induced in an isotropic medium of density $\rho$ . The authors argue that this is desirable on the grounds that these are the classical expressions for isotropic wave speeds, but with elastic constants that are explicitly corrected for an incremental pressure.

1.1.3 Anisotropic materials

The theory just discussed depended on the assumption that the body was isotropic. It would be rendered inconsistent if $\bm{\Gamma}$ were taken to be anything other than an isotropic tensor, as we did throughout Section 1.1.2. This is a consequence of the fact that the theory is based on eq.(1.1.2); a more general form of $\bm{\Pi}$ must be used if materials of lower symmetry are to be considered.

The work of Tromp et al. (2019), mentioned earlier, has partially resolved this issue. They generalise the results of Tromp & Trampert (2018) to obtain an elastic tensor that depends on the stress linearly, but does not implicitly assume that the material under study is isotropic. They take the tensor $\bm{\Gamma}$ to have components (Tromp et al., 2019, eq.8)

\displaystyle\Gamma_{ijkl}=\!\left(\kappa-\frac{2}{3}\mu\right)\delta_{ij}\delta_{kl}+\mu\!\left(\delta_{ik}\delta_{jl}+\delta_{il}\delta_{jk}\right)+\gamma_{ijkl},

(1.27)

where $\bm{\gamma}$ satisfies the classical elastic symmetries but is not an isotropic tensor. Their elastic tensor $\bm{\Xi}$ can be derived from our eq.(1.6) by demanding that $\bm{\Pi}$ possess the symmetries

\displaystyle\Pi_{ijklmn}=\Pi_{ijmnkl}=\Pi_{klmnij}=\Pi_{mnijkl}=\Pi_{mnklij}

(1.28)

in addition to those stated in eq.(1.11). By doing this, eq.(1.9) can be solved uniquely to give all of $\bm{\Pi}$ ’s components in terms of those of $\bm{\Xi}^{\prime}$ :

\displaystyle\Pi_{ijklmn}=-\frac{1}{8}\!\left(\delta_{in}\Xi^{\prime}_{mjkl}+\delta_{im}\Xi^{\prime}_{njkl}+\delta_{jn}\Xi^{\prime}_{imkl}+\delta_{jm}\Xi^{\prime}_{inkl}+\delta_{kn}\Xi^{\prime}_{ijml}+\delta_{km}\Xi^{\prime}_{ijnl}+\delta_{ln}\Xi^{\prime}_{ijkm}+\delta_{lm}\Xi^{\prime}_{ijkn}\right).

(1.29)

The elastic tensor’s dependence on incremental stress is thus parametrised by pressure-derivatives alone, analogously to the isotropic case (cf. eq.1.20). Under this theory $\bm{\Pi}$ possesses the symmetries that would be expected of the third strain-derivative of a strain-energy function, and is described by 21 independent components at most.

1.2 Aims of this paper

This completes our initial survey of the elastic tensor’s linearised stress dependence. Working on the basis of symmetry arguments alone we have established that the previous approaches to the problem might not be sufficiently general. In particular, equations (1.20) and (1.29) seem to imply that the stress dependence of the elastic tensor should be parametrisable solely in terms of the pressure dependence of the elastic moduli, a physical result that is not obvious to the present authors. Nevertheless, recall that Tromp et al. (2019) tested the validity of their theory by carrying out ab initio calculations. They found good, but not perfect, agreement between theory and experiment. Given the nature of such calculations it is difficult to make precise statements about the significance of the discrepancies, but the general agreement does provide clear support for their parametrisation. It is also worth noting that the approach of Section 1.1 brings one to a usable theory rather quickly. The issue, however, is that it gives no clear sense of how $\bm{\Pi}$ is determined from the underlying constitutive relation. As the theory stands there appears to be no way to obtain $\bm{\Pi}$ ’s components besides by performing experiments. Such experiments are already challenging for the cubic materials considered by Tromp et al. (2019), wherein three parameters were to be found, and they might become very difficult for more anisotropic materials. One might wonder if $\bm{\Pi}$ emerges from a more “fundamental” source than its definition in eq.(1.6).

The present work thus has two main aims. The first is to better understand the previous theoretical work on the elastic tensor’s linearised stress dependence. Having established the general characteristics of that theory in Section 1.1 we now wish to ask whether the components of $\bm{\Pi}$ can be obtained more readily in some other way. A secondary aim is to construct a nonlinear theory of the elastic tensor’s stress dependence. This is not purely academic, despite the fact that deviatoric stresses within the present-day Earth are presumably rather small. Methodologically speaking, we feel that it is clearer to derive as much as possible without approximating any physical behaviour because it provides a firm foundation for subsequent, physically-motivated linearisation. In formulating a nonlinear theory we hope to gain greater insight into the linear theory.

In order to make progress we take a new approach rooted firmly in the theories of finite elasticity and constitutive behaviour. We present that argument in Section 3, having first reviewed the necessary ideas in Section 2. After deriving our main result we give some examples, both numerical and analytical, in Section 4, and discuss the implications of our theory in Section 5. In this introduction we have used as far as possible the notation of Dahlen & Tromp (1998) in order to make close contact with previous work. Our theoretical developments owe a lot to Marsden & Hughes (1994), so from Section 2 onwards we switch to (approximately) their notation, which will allow the interested reader of Sections 2 and 3 to “cross-reference” easily. The content of Sections 2–4 is necessarily quite technical, so the reader who is primarily interested in our results might wish to proceed straight to Section 5. We have therefore restated some of the paper’s important results therein using the notation of Dahlen & Tromp, as well as including a complete “translation table” between the two notation systems in Appendix A.4.

2 A review of elasticity

In this section we summarise the aspects of elasticity pertinent to this paper. For more details see, for example, Marsden & Hughes (1994), Dahlen & Tromp (1998) or Truesdell & Noll (2004). In order to make reference to formulae within the solid mechanics literature we will now follow the notation of Marsden & Hughes (henceforward MH) fairly closely. A modest innovation on our part is to use sans-serif bold font for fourth-rank tensors so as to contrast them with second-rank tensors. This allows us to distinguish between the second elastic tensor $\bm{\mathsf{C}}$ and the right Cauchy-Green deformation tensor $\mathbf{C}$ without having to resort to index notation. We also refer the reader to Appendix A: Section A.1 lists some standard results from group theory that we will refer to; A.2 defines some non-standard notations and operators that we have found helpful; and A.3 finally illustrates the usage of these operators while presenting a calculation that is salient to the present work.

2.1 Basic definitions and results

2.1.1 Equations of motion

The deformation of an elastic body is described relative to a fixed reference configuration, with each particle labelled by its position within the associated reference body $\mathcal{B}\subseteq\mathbb{R}^{3}$ , which is assumed to be connected, bounded, and have an open interior. At a time $t$ , the position in physical space of the particle at $\mathbf{x}\in\mathcal{B}$ is written $\bm{\varphi}(\mathbf{x},t)$ . In this manner, we define a mapping

\displaystyle\bm{\varphi}:\mathcal{B}\times\mathbb{R}\rightarrow\mathbb{R}^{3},

(2.1)

which is called the motion of the body relative to the reference configuration. For a fixed time, $t$ , the image of the mapping $\bm{\varphi}\!\left(\cdot,t\right)$ is written $\mathcal{B}_{t}$ and represents the region of physical space the body instantaneously occupies. It will be assumed that for each fixed time the mapping $\bm{\varphi}\!\left(\cdot,t\right):\mathcal{B}\rightarrow\mathcal{B}_{t}$ is smooth with a smooth inverse. A fundamental object derived from the motion is its deformation gradient,

\displaystyle\mathbf{F}=D_{\mathbf{x}}\bm{\varphi},

(2.2)

where $D_{\mathbf{x}}$ denotes partial differentiation with respect to position as defined through

\displaystyle\bm{\varphi}\!\left(\mathbf{x}+\delta\mathbf{x},t\right)=\bm{\varphi}\!\left(\mathbf{x},t\right)+\!\left(D_{\mathbf{x}}\bm{\varphi}\right)\!\left(\mathbf{x},t\right)\cdot\delta\mathbf{x}+\mathcal{O}\!\left(\|\delta\mathbf{x}\|^{2}\right).

(2.3)

(We will generally neglect the subscript on $D$ where it is unambiguous which variable we are differentiating with respect to.) Equivalently, the Cartesian components of the deformation gradient are

\displaystyle F_{ij}=\frac{\partial\varphi_{i}}{\partial x_{j}}.

(2.4)

The Jacobian is then defined as

\displaystyle J=\det\mathbf{F}.

(2.5)

Due to our assumption that $\bm{\varphi}\!\left(\cdot,t\right):\mathcal{B}\rightarrow\mathcal{B}_{t}$ has a smooth inverse, it follows from the inverse function theorem (MH, p.31) that the deformation gradient takes values in the general linear group $\mathbf{GL}(3)$ (Appendix A.1). We assume without loss of generality that $J$ is everywhere positive, meaning that the motion is orientation preserving.

The density at time $t$ at the point $\mathbf{y}\in\mathcal{B}_{t}$ in physical space is written $\varrho\!\left(\mathbf{y},t\right)$ . From conservation of mass, we are led to define the referential density,

\displaystyle\rho(\mathbf{x})=J(\mathbf{x},t)\varrho\!\left[\bm{\varphi}(\mathbf{x},t),t\right],

(2.6)

a time-independent function within the reference body (MH, p.87, Theorem 5.7). Cauchy’s theorem implies that the traction $\mathbf{T}$ acting on a surface-element within $\mathcal{B}_{t}$ is related linearly to the unit-normal $\hat{\mathbf{N}}$ of the corresponding reference surface element within $\mathcal{B}$ (MH, p.127). We can therefore define the first Piola-Kirchhoff stress tensor $\mathbf{P}$ through (MH, p.7)

\displaystyle\mathbf{T}=\mathbf{P}\cdot\hat{\mathbf{N}}.

(2.7)

This expression is equivalent to eq.(2.41) of Dahlen & Tromp (1998) but, as discussed in Appendix A.4, we place our indices according to a different convention. From Newton’s second law we obtain the momentum equation

\displaystyle\rho\ddot{\bm{\varphi}}-\mathrm{Div}\mathbf{P}=\mathbf{f},

(2.8)

where dots are used to represent time differentiation, the divergence of a tensor field is given by

\displaystyle\!\left(\mathrm{Div}\mathbf{P}\right)_{i}=\partial_{j}P_{ij},

(2.9)

and $\mathbf{f}$ denotes the body forces acting on $\mathcal{B}$ .

2.1.2 Constitutive relations

To complete the equations of motion we need to relate $\mathbf{P}$ and $\bm{\varphi}$ through a suitable constitutive relation. We follow Dahlen (1972a) by restricting attention to hyperelastic materials, in which case the first Piola-Kirchhoff stress depends on the motion through the expression

\displaystyle\mathbf{P}(\mathbf{x},t)=\!\left(D_{\mathbf{F}}W\right)\!\left[\mathbf{x},\mathbf{F}(\mathbf{x},t)\right],

(2.10)

where $W:\mathcal{B}\times\mathbf{GL}(3)\rightarrow\mathbb{R}$ is the strain-energy function and $D_{\mathbf{F}}W$ denotes its partial derivative with respect to $\mathbf{F}$ (MH, p.190, Theorem 2.4).

The form of the strain-energy function is constrained by the principle of material-frame indifference. Discussed at length by Marsden & Hughes (1994) and Truesdell & Noll (2004), it requires that

\displaystyle W(\mathbf{x},\mathbf{R}\mathbf{F})

\displaystyle=W(\mathbf{x},\mathbf{F})

(2.11)

for all rotation matrices $\mathbf{R}\in\mathbf{SO}(3)$ and $\mathbf{F}\in\mathbf{GL}(3)$ (see Appendix A.1). It can be shown using the polar decomposition theorem (MH, p.8) that this condition holds if and only if for some auxiliary strain-energy function $V$ we can write

\displaystyle W(\mathbf{x},\mathbf{F})=V\!\left(\mathbf{x},\mathbf{C}\right),

(2.12)

with the symmetric right Cauchy–Green deformation tensor defined to be

\displaystyle\mathbf{C}=\mathbf{F}^{T}\mathbf{F}.

(2.13)

Applying the chain rule to differentiate eq.(2.12), we arrive at an alternative expression for the first Piola-Kirchhoff stress in terms of $V$ . It is readily established (see Appendix A.3) that

\displaystyle D_{\mathbf{F}}W\!\left(\mathbf{x},\mathbf{F}\right)=2\mathbf{F}D_{\mathbf{C}}V\!\left(\mathbf{x},\mathbf{C}\right).

(2.14)

From this expression we are led to define the symmetric second Piola-Kirchhoff stress tensor (MH, p.196, Theorem 2.11)

\displaystyle\mathbf{S}\!\left(\mathbf{x},t\right)=2D_{\mathbf{C}}V\!\left[\mathbf{x},\mathbf{C}\!\left(\mathbf{x},t\right)\right].

(2.15)

Hence we obtain the identity

\displaystyle\mathbf{P}=\mathbf{F}\mathbf{S}.

(2.16)

A third useful stress tensor is the Cauchy stress, $\bm{\sigma}$ . It relates the traction on a surface element at $\mathbf{y}\in\mathcal{B}_{t}$ to the surface’s instantaneous unit-normal $\hat{\mathbf{n}}$ . From this definition it can be shown (MH, p.135) that

\displaystyle\mathbf{P}=J\!\left(\bm{\sigma}\circ\bm{\varphi}\right)\mathbf{F}^{-T}.

(2.17)

Although it is perhaps not obvious from this expression, the Cauchy stress is symmetric. Using eq.(2.16) we obtain (MH, p.136, Definition 2.8)

\displaystyle\bm{\sigma}\circ\bm{\varphi}=\frac{1}{J}\mathbf{F}\mathbf{S}\mathbf{F}^{T},

(2.18)

and the symmetry of $\bm{\sigma}$ follows from that of $\mathbf{S}$ .

2.1.3 Linearisation of the equations of motion

For seismological purposes it is generally sufficient to study linearised elastodynamics. We define an equilibrium configuration $\bm{\varphi}_{e}:\mathcal{B}\rightarrow\mathbb{R}^{3}$ to be a time-independent solution of the equations of motion subject to a time-independent body force $\mathbf{f}_{e}$ and surface traction $\mathbf{T}_{e}$ . The resulting equilibrium first Piola-Kirchhoff stress is given by

\displaystyle\mathbf{P}_{e}(\mathbf{x})=D_{\mathbf{F}}W\!\left[\mathbf{x},\mathbf{F}_{e}(\mathbf{x})\right],

(2.19)

where $\mathbf{F}_{e}=D_{\mathbf{x}}\bm{\varphi}_{e}$ . Using eq.(2.16) and (2.18), the three equilibrium stress tensors are then related by

\displaystyle\mathbf{P}_{e}=\mathbf{F}_{e}\mathbf{S}_{e}=J_{e}\!\left(\bm{\sigma}_{e}\circ\bm{\varphi}_{e}\right)\mathbf{F}_{e}^{-T}.

(2.20)

If such a body is subject to a small disturbance from equilibrium, we can look for solutions of the form

\displaystyle\bm{\varphi}(\mathbf{x},t)=\bm{\varphi}_{e}(\mathbf{x})+\epsilon\,\mathbf{u}(\mathbf{x},t)+\mathcal{O}\!\left(\epsilon^{2}\right),

(2.21)

where $\mathbf{u}$ is the displacement vector and $\epsilon$ a dimensionless perturbation parameter. Under this ansatz the deformation gradient becomes

\displaystyle\mathbf{F}(\mathbf{x},t)=\mathbf{F}_{e}(\mathbf{x})+\epsilon\,D_{\mathbf{x}}\mathbf{u}(\mathbf{x},t)+\mathcal{O}\!\left(\epsilon^{2}\right),

(2.22)

while the first Piola-Kirchhoff stress expands to

\displaystyle\mathbf{P}(\mathbf{x},t)

\displaystyle=\mathbf{P}_{e}(\mathbf{x})+\epsilon\,\bm{\mathsf{A}}\!\left(\mathbf{x}\right)\cdot D_{\mathbf{x}}\mathbf{u}(\mathbf{x},t)+\mathcal{O}(\epsilon^{2}),

(2.23)

where we have defined the first elastic tensor (MH, p.209, Proposition 4.4b)

\displaystyle\bm{\mathsf{A}}(\mathbf{x})=D^{2}_{\mathbf{F}}W\!\left[\mathbf{x},\mathbf{F}_{e}(\mathbf{x})\right].

(2.24)

Note that the first elastic tensor possesses the so-called hyperelastic symmetry,

\bm{\mathsf{A}}^{T}=\bm{\mathsf{A}},

(2.25)

due to the equality of mixed partial derivatives. In index notation this relationship takes the familiar form $\mathsf{A}_{ijkl}=\mathsf{A}_{klij}$ . We will henceforward follow standard seismological terminology and refer to $\bm{\mathsf{A}}$ simply as “the elastic tensor” unless that is likely to cause confusion.

As shown by eq.(2.23), the elastic tensor completely describes the linearised constitutive behaviour of the body. In particular, at a point $\mathbf{x}\in\mathcal{B}$ there will be three possible elastic wave speeds in the direction of the unit vector $\hat{\mathbf{p}}$ . These wave speeds, $c$ , are determined through the eigenvalue problem (e.g. Dahlen & Tromp, 1998, Section 3.6.3)

\displaystyle\mathbf{B}(\mathbf{x},\hat{\mathbf{p}})\cdot\mathbf{a}=c^{2}\mathbf{a},

(2.26)

where $\mathbf{a}$ is the corresponding polarisation vector, and the Christoffel matrix has components

\displaystyle B_{ik}(\mathbf{x},\hat{\mathbf{p}})=\frac{1}{\rho(\mathbf{x})}\mathsf{A}_{ijkl}(\mathbf{x})\hat{p}_{j}\hat{p}_{l}.

(2.27)

This matrix is symmetric due to eq.(2.25), hence the $c^{2}$ are real. Within an elastic solid it is conventionally assumed that these squared wave speeds are positive, a necessary condition for well-posedness of the linearised equations of motion (e.g. Marsden & Hughes, 1994). As the propagation direction $\hat{\mathbf{p}}$ varies over the unit two-sphere, the three positive wave-speeds define the so-called slowness surface at $\mathbf{x}$ . In general, this surface will be comprised of three distinct sheets, though they can sometimes touch due to degenerate eigenvalues within eq.(2.26).

Finally, it is useful to express the elastic tensor in terms of the auxiliary strain-energy function $V$ . We define the equilibrium second Piola-Kirchhoff stress as

\displaystyle\mathbf{S}_{e}\!\left(\mathbf{x}\right)=2D_{\mathbf{C}}V\!\left[\mathbf{x},\mathbf{C}_{e}\!\left(\mathbf{x}\right)\right],

(2.28)

and introduce the second elastic tensor (MH, p.209, Proposition 4.4a)

\displaystyle\bm{\mathsf{C}}\!\left(\mathbf{x}\right)=4D^{2}_{\mathbf{C}}V\!\left[\mathbf{x},\mathbf{C}_{e}\!\left(\mathbf{x}\right)\right].

(2.29)

Suppressing all spatial arguments to avoid clutter, it then follows from the results of Appendix A.3 (see also MH, p.209, Proposition 4.5) that

\displaystyle\bm{\mathsf{A}}=\bm{\mathsf{L}}_{\mathbf{F}_{e}}\bm{\mathsf{C}}\,\bm{\mathsf{L}}_{\mathbf{F}_{e}}^{T}+\bm{\mathsf{R}}_{\mathbf{S}_{e}}.

(2.30)

It is worth emphasising that the tensor $\bm{\mathsf{C}}$ has the full set of classical elastic symmetries,

\displaystyle\mathsf{C}_{ijkl}=\mathsf{C}_{jikl}=\mathsf{C}_{ijlk}=\mathsf{C}_{klij},

(2.31)

due to the symmetry of $\mathbf{C}_{e}$ , and so possesses at most 21 independent components.

2.2 Transformation of the reference configuration

The motion of an elastic body has been described relative to a fixed reference configuration involving material parameters $\rho$ and $W$ . The same body can, of course, be described using a different choice of reference configuration, and it is natural to ask how the two points of view are related. This question was discussed by Al-Attar & Crawford (2016) and Al-Attar et al. (2018); here we simply recall the results relevant to this work.

2.2.1 Particle-relabelling

Let $\bm{\varphi}:\mathcal{B}\times\mathbb{R}\rightarrow\mathbb{R}$ denote the motion of an elastic body relative to a given reference configuration. The same motion relative to a different reference configuration will be written $\tilde{\bm{\varphi}}:\tilde{\mathcal{B}}\times\mathbb{R}\rightarrow\mathbb{R}$ , where $\tilde{\mathcal{B}}$ is the associated reference body that will not, in general, be equal to $\mathcal{B}$ . At a time $t$ , the particle labelled by $\mathbf{x}\in\mathcal{B}$ lies at the point $\bm{\varphi}(\mathbf{x},t)$ in physical space. Relative to the second description of the motion, there must be a unique point $\tilde{\mathbf{x}}\in\tilde{\mathcal{B}}$ such that

\displaystyle\bm{\varphi}(\mathbf{x},t)=\tilde{\bm{\varphi}}(\tilde{\mathbf{x}},t).

(2.32)

This correspondence between $\mathbf{x}$ and $\tilde{\mathbf{x}}$ holds for all times, defining a mapping $\bm{\xi}:\tilde{\mathcal{B}}\rightarrow\mathcal{B}$ that relates the two motions through

\displaystyle\bm{\varphi}(\mathbf{x},t)=\tilde{\bm{\varphi}}[\bm{\xi}(\tilde{\mathbf{x}}),t].

(2.33)

It is assumed for simplicity that $\bm{\xi}$ is smooth and has a smooth inverse. Under such a particle relabelling transformation the form of the equations of motion is clearly left unchanged, while it was shown by Al-Attar & Crawford (2016) that the material parameters $\tilde{\rho}$ and $\tilde{W}$ relative to the second reference configuration can be obtained from those in the first by

	$\displaystyle\tilde{\rho}(\tilde{\mathbf{x}})$	$\displaystyle=J_{\bm{\xi}}(\tilde{\mathbf{x}})\,\rho[\bm{\xi}(\tilde{\mathbf{x}})],$		(2.34)
	$\displaystyle\tilde{W}(\tilde{\mathbf{x}},\tilde{\mathbf{F}})$	$\displaystyle=J_{\bm{\xi}}(\tilde{\mathbf{x}})\,W[\bm{\xi}\!\left(\tilde{\mathbf{x}}\right),\tilde{\mathbf{F}}\,\mathbf{F}_{\bm{\xi}}(\tilde{\mathbf{x}})^{-1}],$		(2.35)

where $\mathbf{F}_{\bm{\xi}}=D\bm{\xi}$ and $J_{\bm{\xi}}=\det\mathbf{F}_{\bm{\xi}}$ .

2.2.2 Natural reference configurations

When considering linearised motions of an elastic body it is conventional to select the reference configuration so that the equilibrium configuration takes the simple form

\displaystyle\bm{\varphi}_{e}(\mathbf{x})=\mathbf{x}.

(2.36)

In this manner, the label for each particle is simply its position in physical space at equilibrium. In the terminology of Al-Attar & Crawford (2016), such a reference configuration is said to be natural. The equilibrium deformation gradient then satisfies

\displaystyle\mathbf{F}_{e}(\mathbf{x})=\mathbf{1},

(2.37)

while its Jacobian is everywhere equal to one. Given this choice, the equilibrium first Piola-Kirchhoff stress is obtained by evaluating the first derivative of the strain-energy at the identity:

\displaystyle\mathbf{P}_{e}=D_{\mathbf{F}}W\!\left(\cdot,\mathbf{1}\right).

(2.38)

An attractive feature of natural reference configurations is that the distinction between the different equilibrium stress-tensors vanishes. It is trivial to verify that equation (2.20) now reads

\displaystyle\mathbf{P}_{e}=\mathbf{S}_{e}=\bm{\sigma}_{e}.

(2.39)

In particular, it follows that

\displaystyle\bm{\sigma}_{e}=D_{\mathbf{F}}W\!\left(\cdot,\mathbf{1}\right)=2D_{\mathbf{C}}V\!\left(\cdot,\mathbf{1}\right),

(2.40)

an expression we will dissect in Section 3.

In the same manner, the elastic tensor takes the simpler form

\displaystyle\bm{\mathsf{a}}=D^{2}_{\mathbf{F}}W\!\left(\cdot,\mathbf{1}\right),

(2.41)

which, from eq.(A.50), can be written equivalently as

\displaystyle\bm{\mathsf{a}}=\bm{\mathsf{c}}+\bm{\mathsf{R}}_{\bm{\sigma}_{e}}.

(2.42)

Note that we have denoted the first and second elastic tensors by lower-case $\bm{\mathsf{a}}$ and $\bm{\mathsf{c}}$ . This is a notational convention used by Marsden & Hughes, the aim of which is to emphasise that these elastic tensors are defined with respect to (what is here termed) a natural reference configuration. As noted above, the tensor

\displaystyle\bm{\mathsf{c}}=4D^{2}_{\mathbf{C}}V\!\left(\cdot,\mathbf{1}\right)

(2.43)

possesses all the classical elastic symmetries. In contrast, the second term in eq.(2.42) has components

\displaystyle[\bm{\mathsf{R}}_{\bm{\sigma}_{e}}]_{ijkl}=\delta_{ik}[\bm{\sigma}_{e}]_{jl}

(2.44)

which, for general $\bm{\sigma}_{e}\neq\mathbf{0}$ , are not invariant under the interchange of either $i\leftrightarrow j$ or $k\leftrightarrow l$ (although the symmetry of $\bm{\sigma}_{e}$ ensures that $\bm{\mathsf{R}}_{\bm{\sigma}_{e}}$ possesses the hyperelastic symmetry). We therefore see that $\bm{\mathsf{a}}$ inherits the full complement of classical symmetries only with respect to a stress-free natural reference configuration. Moreover, it is only with respect to a natural reference configuration that the propagation directions $\hat{\mathbf{p}}$ within eq.(2.26) can be equated with directions in physical space, allowing the slowness surface to be interpreted in a straightforward manner. Eq.(2.42) is precisely equivalent to eq.(1.2), though it is important to note that this equivalence only holds with respect to a natural reference configuration.

2.3 Material symmetries

We end our review of finite elasticity theory by discussing material symmetries. These results can be found in MH (Chapter 3, Section 3.5) and Gurtin et al. (2010, Chapter 50), though we place additional emphasis on certain points.

Relative to a fixed reference configuration, the material symmetry group of a strain-energy function $W$ at a point $\mathbf{x}\in\mathcal{B}$ is

\displaystyle\mathrm{Sym}(W,\mathbf{x})=\left\{\mathbf{Q}\in\mathbf{SL}(3)\,|\,W(\mathbf{x},\mathbf{F}\mathbf{Q})=W(\mathbf{x},\mathbf{F}),\,\forall\mathbf{F}\in\mathbf{GL}(3)\right\},

(2.45)

where $\mathbf{SL}(3)$ denotes the special linear group on $\mathbb{R}^{3}$ whose definition is recalled in Appendix A.1. Physically, the symmetry group reflects the invariance of the strain-energy with respect to orientation of stretching. Following Noll (1974), the body is said to be fluid at a point if the symmetry group is equal to $\mathbf{SL}(3)$ , and solid if it is a proper subgroup thereof.

Under a change of reference configuration, the symmetry group is not generally invariant. To see this, let $\bm{\xi}:\tilde{\mathcal{B}}\rightarrow\mathcal{B}$ be a particle relabelling transformation, and suppose that $\bm{\xi}(\tilde{\mathbf{x}})=\mathbf{x}$ . If $\tilde{\mathbf{Q}}\in\mathrm{Sym}(\tilde{W},\tilde{\mathbf{x}})$ then from eq.(2.35) we obtain

\displaystyle W[\mathbf{x},\tilde{\mathbf{F}}\tilde{\mathbf{Q}}\mathbf{F}_{\bm{\xi}}(\tilde{\mathbf{x}})^{-1}]=W[\mathbf{x},\tilde{\mathbf{F}}\mathbf{F}_{\bm{\xi}}(\tilde{\mathbf{x}})^{-1}]

(2.46)

for all $\tilde{\mathbf{F}}\in\mathbf{GL}(3)$ . Hence for a unique ${\mathbf{Q}}\in\mathrm{Sym}({W},\mathbf{x})$ we have

\displaystyle\tilde{\mathbf{Q}}\mathbf{F}_{\bm{\xi}}(\tilde{\mathbf{x}})^{-1}=\mathbf{F}_{\bm{\xi}}(\tilde{\mathbf{x}})^{-1}\mathbf{Q}.

(2.47)

This establishes a group isomorphism between $\mathrm{Sym}({W},\mathbf{x})$ and $\mathrm{Sym}(\tilde{W},\tilde{\mathbf{x}})$ , which is given concretely through matrix conjugation:

\displaystyle\mathrm{Sym}({W},\mathbf{x})\ni\mathbf{Q}\mapsto\mathbf{F}_{\bm{\xi}}(\tilde{\mathbf{x}})^{-1}\mathbf{Q}\mathbf{F}_{\bm{\xi}}(\tilde{\mathbf{x}})\in\mathrm{Sym}(\tilde{W},\tilde{\mathbf{x}}).

(2.48)

This mapping leaves $\mathbf{SL}(3)$ invariant, hence our definitions of the material symmetry group itself and of an elastic fluid are sound. Noll (1965) showed that, up to this isomorphism, the largest proper subgroup of $\mathbf{SL}(3)$ is equal to the special orthogonal group $\mathbf{SO}(3)$ . We can, therefore, define an elastic solid to be isotropic at a point if its symmetry group relative to an arbitrary reference configuration is isomorphic under matrix conjugation to $\mathbf{SO}(3)$ . Equivalently, it is isotropic if for some reference configuration its symmetry group is equal to $\mathbf{SO}(3)$ . If the symmetry group is isomorphic to a proper subgroup of $\mathbf{SO}(3)$ we say the solid is anisotropic, with the extreme case being when this group consists of the identity matrix alone. In between these two end-members can be found, for example, transversely-isotropic materials, whose symmetry group is isomorphic under matrix conjugation to $\mathbf{SO}(2)$ . This corresponds physically to the strain-energy being invariant under rotations about a certain fixed axis.

Importantly, the preceding discussion is independent of any particular choice of reference configuration. It corrects a mistake of Al-Attar & Crawford (2016), who implied that material symmetries can be lost or gained through particle relabelling transformations. Such transformations simply represent a change in our description of the body’s deformation. They cannot entail any physical consequences.

As a final concept that we will need later, consider a natural reference configuration for a stress-free elastic body in equilibrium. Such a configuration is defined by $DV\!\left(\mathbf{C}^{*}\right)=\mathbf{0}$ with $\mathbf{C}^{*}=\mathbf{1}$ . From eq.(2.45), the material symmetry group acts on $\mathbf{C}$ according to $\mathbf{C}\mapsto\mathbf{Q}^{T}\mathbf{C}\mathbf{Q}$ , so by definition we have

\displaystyle V\!\left(\mathbf{Q}^{T}\mathbf{C}^{*}\mathbf{Q}\right)=V\!\left(\mathbf{C}^{*}\right),

(2.49)

from which

\displaystyle V\!\left(\mathbf{Q}^{T}\mathbf{Q}\right)=V\!\left(\mathbf{1}\right).

(2.50)

For the equilibrium to be stable, $\mathbf{C}^{*}=\mathbf{1}$ must lie at a strict local minimum of the strain-energy function. This allows us to equate the arguments of the left and right hand sides of eq.(2.50). The elements of the symmetry group then satisfy

\displaystyle\mathbf{Q}^{T}\mathbf{Q}=\mathbf{1},

(2.51)

from which it is clear that $\mathbf{Q}\in\mathbf{SO}(3)$ . The symmetry group of a stress-free elastic body, described with respect to a natural reference configuration, is therefore equal to a subgroup of $\mathbf{SO}(3)$ rather than just isomorphic thereto. For example, in the stress-free case an isotropic body in a natural reference configuration has a material symmetry group equal to the whole of $\mathbf{SO}(3)$ , while that of a transversely-isotropic material is equal to $\mathbf{SO}(2)$ , having fixed the orientation of the symmetry-axis.

3 Functional dependence of the elastic tensor on equilibrium stress

Having recalled the necessary results and notations from the theory of elasticity, we now turn to our main question. Namely, we seek to determine the functional dependence of the elastic tensor on the equilibrium Cauchy stress.

3.1 Parametrised dependence on the equilibrium configuration

For an equilibrium body $\mathcal{B}$ , the equilibrium Cauchy stress and elastic tensor take the form


$\displaystyle\bm{\sigma}\!\left(\mathbf{x}\right)$	$\displaystyle=DW\!\left(\mathbf{x},\mathbf{1}\right)$	(3.1a)
$\displaystyle\bm{\mathsf{a}}\!\left(\mathbf{x}\right)$	$\displaystyle=D^{2}W\!\left(\mathbf{x},\mathbf{1}\right),$	(3.1b)

where $W$ is the strain-energy function with respect to a natural reference configuration, and for notational simplicity we have dropped the subscript from $\bm{\sigma}_{e}$ . Our hope is to express the elastic tensor as a function of the equilibrium stress. Variations in $\bm{\sigma}$ arise, of course, though changes to the equilibrium configuration, but the dependence of eqs.(3.1) thereon is masked by the use of a natural reference configuration. As a first step, we must reformulate eqs.(3.1) in a way that makes fully explicit the dependence of the two equations on the equilibrium configuration.

We consider an arbitrary fixed reference configuration with the associated reference body denoted by $\tilde{\mathcal{B}}$ , and with strain-energy function $\tilde{W}$ . The correspondence between this reference configuration and the natural reference configuration $\mathcal{B}$ is given by a mapping

\displaystyle\bm{\Phi}:\tilde{\mathcal{B}}\rightarrow\mathcal{B}.

(3.2)

This is just the equilibrium configuration of the body relative to our newly introduced reference configuration (see Fig. 3). Regarding the inverse mapping $\bm{\Phi}^{-1}$ as a particle relabelling transformation, we can use eq.(2.35) to relate $W$ to $\tilde{W}$ :

\displaystyle W\!\left[\bm{\Phi}(\tilde{\mathbf{x}}),\mathbf{F}^{\prime}\right]=J_{\mathbf{F}}(\tilde{\mathbf{x}})^{-1}\tilde{W}\!\left[\tilde{\mathbf{x}},\mathbf{F}^{\prime}\mathbf{F}\!\left(\tilde{\mathbf{x}}\right)\right].

(3.3)

To avoid cluttered notations here and in what follows, we write $\mathbf{F}$ for the equilibrium deformation gradient $D\bm{\Phi}$ and $J_{\mathbf{F}}=\det\mathbf{F}$ , while $\mathbf{F}^{\prime}$ represents an arbitrary element of $\mathbf{GL}(3)$ . From eq.(3.3) we see that $W[\bm{\Phi}(\tilde{\mathbf{x}}),\cdot]$ depends only on $\tilde{W}$ at the fixed point $\tilde{\mathbf{x}}\in\tilde{\mathcal{B}}$ . Furthermore, the relationship is parametrised by $\mathbf{F}$ evaluated at $\tilde{\mathbf{x}}$ . All in all, the two functions are related in a local manner; no generality is lost by focusing on a single, arbitrary point in $\mathcal{B}$ and its pre-image in $\tilde{\mathcal{B}}$ . We do this from now on, dropping all spatial arguments to arrive at the simpler relations


$\displaystyle\bm{\sigma}$	$\displaystyle=DW\!\left(\mathbf{1}\right)$	(3.4a)
$\displaystyle\bm{\mathsf{a}}$	$\displaystyle=D^{2}W\!\left(\mathbf{1}\right)$	(3.4b)

and

\displaystyle W\!\left(\mathbf{F}^{\prime}\right)

\displaystyle=J_{\mathbf{F}}^{-1}\tilde{W}\!\left(\mathbf{F}^{\prime}\mathbf{F}\right).

(3.5)

To complete the reformulation of eqs.(3.4) we apply the chain rule to eq.(3.5) so as to write $\bm{\sigma}$ and $\bm{\mathsf{a}}$ explicitly in terms of $\tilde{W}$ and $\mathbf{F}$ . It is in fact preferable to work not with $W$ , but rather with the auxiliary strain-energy function $V$ that encodes material-frame indifference automatically. Defining $\mathbf{C}^{\prime}=(\mathbf{F}^{\prime})^{T}\mathbf{F}^{\prime}$ , we recall that $V$ will satisfy

\displaystyle W\!\left(\mathbf{F}^{\prime}\right)=V\!\left(\mathbf{C}^{\prime}\right),

(3.6)

and from eqs. (3.5) and (3.6) we see that the relationship between $V$ in $\mathcal{B}$ and its counterpart $\tilde{V}$ in $\tilde{\mathcal{B}}$ is given by

\displaystyle V(\mathbf{C}^{\prime})=J_{\mathbf{F}}^{-1}\tilde{V}(\bm{\mathsf{T}}_{\mathbf{F}}^{T}\cdot\mathbf{C}^{\prime}),

(3.7)

where the operator $\bm{\mathsf{T}}_{\mathbf{F}}$ is defined in eq.(A.20). From Appendix A.3 we have

	$\displaystyle DW(\mathbf{F}^{\prime})$	$\displaystyle=2\bm{\mathsf{L}}_{\mathbf{F}^{\prime}}\cdot DV(\mathbf{C}^{\prime})$		(3.8)
	$\displaystyle D^{2}W\!\left(\mathbf{F}^{\prime}\right)$	$\displaystyle=2\bm{\mathsf{R}}_{DV\!\left(\mathbf{C}^{\prime}\right)}+4\bm{\mathsf{L}}_{\mathbf{F}^{\prime}}D^{2}V\!\left(\mathbf{C}^{\prime}\right)\,\bm{\mathsf{L}}_{\mathbf{F}^{\prime}}^{T},$		(3.9)

and it is readily shown from eq.(3.7) that

	$\displaystyle DV\!\left(\mathbf{C}^{\prime}\right)$	$\displaystyle=\bm{\mathsf{T}}_{\mathbf{F}}\cdot D\tilde{V}(\bm{\mathsf{T}}_{\mathbf{F}}^{T}\cdot\mathbf{C}^{\prime})$		(3.10)
	$\displaystyle D^{2}V\!\left(\mathbf{C}^{\prime}\right)$	$\displaystyle=\bm{\mathsf{T}}_{\mathbf{F}}D^{2}\tilde{V}(\bm{\mathsf{T}}_{\mathbf{F}}^{T}\cdot\mathbf{C}^{\prime})\bm{\mathsf{T}}_{\mathbf{F}}^{T}.$		(3.11)

Evaluating these results at $\mathbf{C}^{\prime}=\mathbf{1}$ – and noting that $\bm{\mathsf{T}}_{\mathbf{F}}^{T}\cdot\mathbf{1}=\mathbf{C}$ – we obtain the first of our desired expressions,

\displaystyle\bm{\sigma}=2J_{\mathbf{F}}^{-1}\bm{\mathsf{T}}_{\mathbf{F}}\cdot D\tilde{V}(\mathbf{C}).

(3.12)

Note that it is equivalent to eq.(2.18) but has been restated in a different notation. By considering the second derivative we then obtain the expression

\displaystyle\bm{\mathsf{a}}

\displaystyle=4J_{\mathbf{F}}^{-1}\bm{\mathsf{T}}_{\mathbf{F}}D^{2}\tilde{V}(\mathbf{C})\,\bm{\mathsf{T}}_{\mathbf{F}}^{T}+\bm{\mathsf{R}}_{\bm{\sigma}},

(3.13)

which is equivalent to the expression of MH (p.217, Box 4.1). We have thus expressed both the equilibrium Cauchy stress and the elastic tensor as explicit functions of $\mathbf{F}$ , the equilibrium deformation gradient relative to the fixed reference configuration $\tilde{\mathcal{B}}$ . To emphasise this point we introduce the notation


$\displaystyle\bm{\sigma}$	$\displaystyle=\hat{\bm{\sigma}}\!\left(\mathbf{F}\right)$	$\displaystyle\equiv 2J_{\mathbf{F}}^{-1}\bm{\mathsf{T}}_{\mathbf{F}}\cdot D\tilde{V}\!\left(\mathbf{C}\right)$	(3.14a)
$\displaystyle\bm{\mathsf{a}}$	$\displaystyle=\hat{\bm{\mathsf{a}}}\!\left(\mathbf{F}\right)$	$\displaystyle\equiv 4J_{\mathbf{F}}^{-1}\bm{\mathsf{T}}_{\mathbf{F}}D^{2}\tilde{V}\!\left(\mathbf{C}\right)\bm{\mathsf{T}}_{\mathbf{F}}^{T}+\bm{\mathsf{R}}_{\hat{\bm{\sigma}}\!\left(\mathbf{F}\right)},$	(3.14b)

with $\hat{\bm{\sigma}}$ (resp. $\hat{\bm{\mathsf{a}}}$ ) the function that takes an equilibrium deformation gradient to the corresponding equilibrium Cauchy stress (resp. elastic tensor). It is worth observing that both of these relations are nonlinear for any non-trivial choice of strain-energy function $\tilde{V}$ .

Through eqs.(3.14) one can consider a wide variety of problems where a body’s elastic properties change as a result of changing its equilibrium configuration. Tromp et al. (2019, Section 4) studied just such a problem when they performed their ab initio calculations: the (unstrained) sample was subjected to a known strain, and this induced both an incremental Cauchy stress and a change in the elastic tensor. On a more seismological level (e.g. Tromp & Trampert, 2018) one might wish to understand how a given deformation of one of the Earth’s regions affects seismic wave propagation and stresses therein. Such a deformation could result from small-scale phenomena such as a cave-in within a gas field, or from large-scale effects like tidal loading. The important feature common to all these problems is that there is a known initial state $\tilde{\mathcal{B}}$ that is deformed in a prescribed way. This produces a new state $\mathcal{B}$ whose elastic properties are fully determined in terms of those of $\tilde{\mathcal{B}}$ by eqs.(3.14). As long as $\tilde{V}$ and its derivatives are well-behaved the computations necessitated by these problems can in principle be performed readily. Note that to solve these problems there is in fact no need to find the stress dependence of the elastic tensor.

3.2 Parametrised dependence on equilibrium stress

Seismic inverse theory leads one to a subtly different problem. Say that we wish to study the equilibrium elastic properties of a certain region within the deep Earth as the equilibrium configuration evolves, and that we would particularly like to know about the equilibrium stress. We measure that region’s properties using seismic data: we make surface observations of (small-amplitude, high-frequency) seismic waves that have passed through the region’s neighbourhood, construct seismograms, and then use those seismograms to invert for the elastic tensor $\bm{\mathsf{a}}$ of the region. Importantly, the equilibrium stress $\bm{\sigma}$ cannot be measured “directly” in this way. However, if we knew the stress dependence of the elastic tensor, we would be able to invert changes in $\bm{\mathsf{a}}$ for changes in $\bm{\sigma}$ as the equilibrium configuration evolves. Our task is therefore to find $\bm{\mathsf{a}}$ as an explicit function of $\bm{\sigma}$ . Eqs.(3.14) provide the necessary link between $\bm{\mathsf{a}}$ and $\bm{\sigma}$ , but it is not yet in a useful form, parametrised as it is by $\mathbf{F}$ . Unlike in the previous problem, $\mathbf{F}$ cannot be regarded as a known quantity because it could only ever be “measured” by inverting seismic data for $\bm{\mathsf{a}}$ and then using eq.(3.14b) to invert for $\mathbf{F}$ . Given this, we will eliminate $\mathbf{F}$ from eqs.(3.14) and ask only how $\bm{\mathsf{a}}$ varies upon varying $\bm{\sigma}$ . The mathematical problem we are trying to solve can be couched most succinctly (with reference to Figure 3) as follows: if we observe a given $\bm{\sigma}$ in $\mathcal{B}$ , assume that $\bm{\sigma}$ was induced by elastically deforming $\tilde{\mathcal{B}}$ , and parametrise the properties of $\tilde{\mathcal{B}}$ by choosing a specific form of $\tilde{V}$ , what elastic tensor $\bm{\mathsf{a}}$ will be observed in $\mathcal{B}$ ?

The ideal approach to this second problem appears to be to find $\mathbf{F}$ as a function of $\bm{\sigma}$ from eq.(3.14a), and then substitute the result into eq.(3.14b) to give $\bm{\mathsf{a}}$ as a function of $\bm{\sigma}$ . However, a given equilibrium Cauchy stress can be produced by many different deformation gradients, so there is rather a conspicuous mathematical problem. The function $\hat{\bm{\sigma}}$ maps elements in the nine-dimensional group $\mathbf{GL}(3)$ into the six-dimensional vector space of symmetric matrices on $\mathbb{R}^{3}$ , which means that the equation $\hat{\bm{\sigma}}(\mathbf{F})=\bm{\sigma}$ for $\mathbf{F}$ is underdetermined. How might we “invert” $\hat{\bm{\sigma}}$ for $\mathbf{F}$ when $\bm{\sigma}$ only provides us with six numbers? What does this underdetermination imply for the elastic tensor? A way into the problem is to recall the polar decomposition theorem (MH, p.3; see also Fig. 4), which shows that any $\mathbf{F}\in\mathbf{GL}(3)$ can be written as

\displaystyle\mathbf{F}=\mathbf{R}\mathbf{U}=\mathbf{V}\mathbf{R},

(3.15)

for unique $\mathbf{R}\in\mathbf{SO}(3)$ and where $\mathbf{U}$ and $\mathbf{V}$ are unique, symmetric, positive-definite matrices.

The theorem is a rigorous demonstration that any deformation gradient can be considered as “stretch followed by rotation” (the first equality) or “rotation followed by stretch” (the second). It thus provides a natural way of factoring the deformation gradient into the product of a rotation matrix and a symmetric matrix. Crucially, $\mathbf{U}$ and $\bm{\sigma}$ have the same dimensions. Our goal now becomes to investigate how far $\mathbf{R}$ and $\mathbf{U}$ “interact” and whether or not $\hat{\bm{\sigma}}$ can in some sense be inverted for $\mathbf{U}$ . To fully resolve the issue we must examine carefully how the principle of material-frame indifference and material symmetries manifest within eqs.(3.14).

Material-frame indifference concerns the behaviour of the strain-energy function and related quantities under transformations of the deformation gradient of the form $\mathbf{F}\mapsto\mathbf{R}\mathbf{F}$ with $\mathbf{R}\in\mathbf{SO}(3)$ . Recalling that the value of the right Cauchy–Green deformation tensor is invariant under such a transformation, and using eq.(A.23), we see immediately from eqs.(3.14) that


$\displaystyle\hat{\bm{\sigma}}\!\left(\mathbf{R}\mathbf{F}\right)$	$\displaystyle=\bm{\mathsf{T}}_{\mathbf{R}}\cdot\hat{\bm{\sigma}}\!\left(\mathbf{F}\right)$	(3.16a)
$\displaystyle\hat{\bm{\mathsf{a}}}\!\left(\mathbf{R}\mathbf{F}\right)$	$\displaystyle=\bm{\mathsf{T}}_{\mathbf{R}}\hat{\bm{\mathsf{a}}}\!\left(\mathbf{F}\right)\bm{\mathsf{T}}_{\mathbf{R}}^{T}$	(3.16b)

for all $\mathbf{R}\in\mathbf{SO}(3)$ . Substituting the polar decomposition $\mathbf{F}=\mathbf{R}\mathbf{U}$ into eqs.(3.14) and making use of eqs.(3.16) we then obtain


$\displaystyle\hat{\bm{\sigma}}\!\left(\mathbf{F}\right)$	$\displaystyle=\bm{\mathsf{T}}_{\mathbf{R}}\cdot\hat{\bm{\Sigma}}\!\left(\mathbf{U}\right)$	(3.17a)
$\displaystyle\hat{\bm{\mathsf{a}}}\!\left(\mathbf{F}\right)$	$\displaystyle=\bm{\mathsf{T}}_{\mathbf{R}}\hat{\bm{\mathsf{A}}}\!\left(\mathbf{U}\right)\bm{\mathsf{T}}_{\mathbf{R}}^{T},$	(3.17b)

where we have defined the functions


$\displaystyle\hat{\bm{\Sigma}}\!\left(\mathbf{U}\right)$	$\displaystyle=2J_{\mathbf{U}}^{-1}\bm{\mathsf{T}}_{\mathbf{U}}\cdot D\tilde{V}\!\left(\mathbf{U}^{2}\right)$	(3.18a)
$\displaystyle\hat{\bm{\mathsf{A}}}\!\left(\mathbf{U}\right)$	$\displaystyle=4J_{\mathbf{U}}^{-1}\bm{\mathsf{T}}_{\mathbf{U}}D^{2}\tilde{V}\!\left(\mathbf{U}^{2}\right)\bm{\mathsf{T}}_{\mathbf{U}}+\bm{\mathsf{R}}_{\hat{\bm{\Sigma}}\!\left(\mathbf{U}\right)},$	(3.18b)

whose arguments are required to be symmetric, positive-definite matrices. Importantly, the function $\hat{\bm{\Sigma}}$ maps symmetric matrices into symmetric matrices. As a result, there is no dimensional obstruction to its being invertible. We will assume that a unique inverse $\hat{\bm{\Sigma}}^{-1}$ exists wherever required, an assumption that is natural as long as the stress is not too large (see Appendix B).

We can now examine solutions of the equation $\hat{\bm{\sigma}}(\mathbf{F})=\bm{\sigma}$ for given $\bm{\sigma}$ . We write $\mathbf{F}=\mathbf{R}\mathbf{U}$ as above, but now suppose that the value of $\mathbf{R}\in\mathbf{SO}(3)$ has been fixed arbitrarily. Using eq.(3.17a) we then trivially obtain

\displaystyle\mathbf{U}

\displaystyle=\hat{\bm{\Sigma}}^{-1}\!\left(\bm{\mathsf{T}}_{\mathbf{R}}^{T}\cdot\bm{\sigma}\right),

(3.19)

whence it follows that the equilibrium deformation gradient is given by

\displaystyle\mathbf{F}=\hat{\mathbf{F}}\!\left(\bm{\sigma},\mathbf{R}\right)\equiv\bm{\mathsf{L}}_{\mathbf{R}}\cdot\hat{\bm{\Sigma}}^{-1}\!\left(\bm{\mathsf{T}}_{\mathbf{R}}^{T}\cdot\bm{\sigma}\right).

(3.20)

Here we see concretely where the missing three degrees of freedom enter into the inversion of $\hat{\bm{\sigma}}$ . While eq.(3.20) constitutes one solution of the given equation, it is not unique. Letting $\mathbf{R}$ vary over $\mathbf{SO}(3)$ generates a three-parameter family of solutions, and the uniqueness of the polar decomposition means that every $\mathbf{R}$ must correspond to a different $\mathbf{F}$ . Moreover, any $\mathbf{F}$ that solves $\hat{\bm{\sigma}}(\mathbf{F})=\bm{\sigma}$ can be polar-decomposed and written in terms of $\bm{\sigma}$ using eq.(3.20), so we have clearly obtained all solutions of the equation. Crucially, the non-uniqueness carries over into the elastic tensor. Substitution of eq.(3.20) into eq.(3.14b) yields

\displaystyle\bm{\mathsf{a}}=\hat{\bm{\mathsf{a}}}\!\!\left[\hat{\mathbf{F}}\!\left(\bm{\sigma},\mathbf{R}\right)\right]\equiv\bar{\bm{\mathsf{a}}}\!\left(\bm{\sigma},\mathbf{R}\right),

(3.21)

where the function $\bar{\bm{\mathsf{a}}}$ can be written more expansively as

\displaystyle\bar{\bm{\mathsf{a}}}\!\left(\bm{\sigma},\mathbf{R}\right)=\bm{\mathsf{T}}_{\mathbf{R}}\!\left[\!\left(\hat{\bm{\mathsf{A}}}\circ\hat{\bm{\Sigma}}^{-1}\right)\!\left(\bm{\mathsf{T}}_{\mathbf{R}}^{T}\cdot\bm{\sigma}\right)\right]\bm{\mathsf{T}}_{\mathbf{R}}^{T}.

(3.22)

It follows that we cannot expect to write the elastic tensor as a function of the equilibrium Cauchy stress alone. A definite value for $\bm{\mathsf{a}}$ depends on the arbitrary choice of an element of $\mathbf{SO}(3)$ .

We can add some nuance to this result by considering material symmetries. Let $\mathrm{Sym}(\tilde{W})$ denote the material symmetry group (which could be trivial) at the point of interest, whose elements $\mathbf{Q}$ act on the deformation gradient on the right through $\mathbf{F}\mapsto\mathbf{F}\mathbf{Q}$ . In terms of the right Cauchy–Green deformation tensor such a transformation takes the form $\mathbf{C}\mapsto\mathbf{Q}^{T}\mathbf{C}\mathbf{Q}=\bm{\mathsf{T}}_{\mathbf{Q}}^{T}\cdot\mathbf{C}$ , and by definition we have

\displaystyle\tilde{V}(\bm{\mathsf{T}}_{\mathbf{Q}}^{T}\cdot\mathbf{C})=\tilde{V}(\mathbf{C})

(3.23)

for all $\mathbf{Q}\in\mathrm{Sym}(\tilde{W})$ . Differentiating this relation in the now standard manner yields

	$\displaystyle D\tilde{V}\!\left(\mathbf{C}\right)$	$\displaystyle=\bm{\mathsf{T}}_{\mathbf{Q}}\cdot D\tilde{V}\!\left(\bm{\mathsf{T}}_{\mathbf{Q}}^{T}\cdot\mathbf{C}\right)$		(3.24)
	$\displaystyle D^{2}\tilde{V}\!\left(\mathbf{C}\right)$	$\displaystyle=\bm{\mathsf{T}}_{\mathbf{Q}}D^{2}\tilde{V}\!\left(\bm{\mathsf{T}}_{\mathbf{Q}}^{T}\cdot\mathbf{C}\right)\bm{\mathsf{T}}_{\mathbf{Q}}^{T}.$		(3.25)

Using the properties of $\bm{\mathsf{T}}_{\mathbf{F}}$ we then find from eqs.(3.14) that


$\displaystyle\hat{\bm{\sigma}}\!\left(\mathbf{F}\mathbf{Q}\right)$	$\displaystyle=2J_{\mathbf{F}}^{-1}\bm{\mathsf{T}}_{\mathbf{F}}\cdot\!\left[\bm{\mathsf{T}}_{\mathbf{Q}}\cdot D\tilde{V}\!\left(\bm{\mathsf{T}}_{\mathbf{Q}}^{T}\cdot\mathbf{C}\right)\right]$	(3.26a)
$\displaystyle\hat{\bm{\mathsf{a}}}\!\left(\mathbf{F}\mathbf{Q}\right)$	$\displaystyle=4J_{\mathbf{F}}^{-1}\bm{\mathsf{T}}_{\mathbf{F}}\!\left[\bm{\mathsf{T}}_{\mathbf{Q}}D^{2}\tilde{V}\!\left(\bm{\mathsf{T}}_{\mathbf{Q}}^{T}\cdot\mathbf{C}\right)\bm{\mathsf{T}}_{\mathbf{Q}}^{T}\right]\bm{\mathsf{T}}_{\mathbf{F}}^{T}+\bm{\mathsf{R}}_{\hat{\bm{\sigma}}\!\left(\mathbf{F}\mathbf{Q}\right)},$	(3.26b)

from which it readily follows that


$\displaystyle\hat{\bm{\sigma}}\!\left(\mathbf{F}\mathbf{Q}\right)$	$\displaystyle=\hat{\bm{\sigma}}\!\left(\mathbf{F}\right)$	(3.27a)
$\displaystyle\hat{\bm{\mathsf{a}}}\!\left(\mathbf{F}\mathbf{Q}\right)$	$\displaystyle=\hat{\bm{\mathsf{a}}}\!\left(\mathbf{F}\right)$	(3.27b)

for any $\mathbf{Q}\in\mathrm{Sym}(\tilde{W})$ .

We can now understand $\bar{\bm{\mathsf{a}}}$ ’s non-unique stress dependence by studying how the structure implied by eqs.(3.27) manifests within $\hat{\bm{\Sigma}}^{-1}$ . To that end, it is simplest if we consider $\tilde{\mathcal{B}}$ to constitute a natural reference configuration for an equilibrium body that is stress-free. What follows is therefore based on the assumption that there exists some stress-free reference-configuration with respect to which we can describe $\mathcal{B}$ . This way we can take $\mathrm{Sym}(\tilde{W})$ to be equal, rather than just isomorphic, to a subgroup of $\mathbf{SO}(3)$ . Because all $\mathbf{Q}\in\mathrm{Sym}(\tilde{W})$ are then rotations, the material symmetry group acts through $(\mathbf{R},\mathbf{U})\mapsto(\mathbf{R}\mathbf{Q},\mathbf{Q}^{T}\mathbf{U}\mathbf{Q})$ on the level of the polar decomposition. Therefore, using eq.(3.17a) we find that eq.(3.27a) requires

\displaystyle\hat{\bm{\Sigma}}\!\left(\bm{\mathsf{T}}_{\mathbf{Q}}\cdot\mathbf{U}\right)=\bm{\mathsf{T}}_{\mathbf{Q}}\cdot\hat{\bm{\Sigma}}\!\left(\mathbf{U}\right)

(3.28)

for arbitrary $\mathbf{Q}\in\mathrm{Sym}(\tilde{W})$ . This equation expresses the intuitive notion that a stretch $\mathbf{U}$ and rotation $\mathbf{Q}$ will together induce the same stress no matter the order in which they are imposed – as long as $\mathbf{Q}$ is in the material symmetry group. In any case, by acting the inverse function $\hat{\bm{\Sigma}}^{-1}$ on this expression we obtain the analogous result

\displaystyle\hat{\bm{\Sigma}}^{-1}\!\left(\bm{\mathsf{T}}_{\mathbf{Q}}\cdot\bm{\Sigma}\right)=\bm{\mathsf{T}}_{\mathbf{Q}}\cdot\hat{\bm{\Sigma}}^{-1}\!\left(\bm{\Sigma}\right)

(3.29)

for arbitrary symmetric $\bm{\Sigma}$ , which we may substitute into eq.(3.20) to conclude that

$\displaystyle\hat{\mathbf{F}}\!\left(\bm{\sigma},\mathbf{R}\mathbf{Q}\right)$	$\displaystyle=\bm{\mathsf{L}}_{\mathbf{R}\mathbf{Q}}\cdot\hat{\bm{\Sigma}}^{-1}(\bm{\mathsf{T}}_{\mathbf{R}\mathbf{Q}}^{T}\cdot\bm{\sigma})$
	$\displaystyle=(\bm{\mathsf{L}}_{\mathbf{R}}\bm{\mathsf{L}}_{\mathbf{Q}})\cdot\hat{\bm{\Sigma}}^{-1}[\bm{\mathsf{T}}_{\mathbf{Q}}^{T}\cdot(\bm{\mathsf{T}}_{\mathbf{R}}^{T}\cdot\bm{\sigma})]$
	$\displaystyle=(\bm{\mathsf{L}}_{\mathbf{R}}\bm{\mathsf{L}}_{\mathbf{Q}}\bm{\mathsf{T}}_{\mathbf{Q}}^{T})\cdot\hat{\bm{\Sigma}}^{-1}(\bm{\mathsf{T}}_{\mathbf{R}}^{T}\cdot\bm{\sigma})$
	$\displaystyle=\hat{\mathbf{F}}\!\left(\bm{\sigma},\mathbf{R}\right)\mathbf{Q}.$	(3.30)

This relationship does not allow us to determine $\mathbf{F}$ any more precisely – it will always be known only up to an element of $\mathbf{SO}(3)$ – but we may trivially write

\displaystyle\hat{\bm{\mathsf{a}}}\!\left[\hat{\mathbf{F}}\!\left(\bm{\sigma},\mathbf{R}\mathbf{Q}\right)\right]=\hat{\bm{\mathsf{a}}}\!\left[\hat{\mathbf{F}}\!\left(\bm{\sigma},\mathbf{R}\right)\mathbf{Q}\right].

(3.31)

It follows immediately from eq.(3.27b) that

\displaystyle\hat{\bm{\mathsf{a}}}\!\left[\hat{\mathbf{F}}\!\left(\bm{\sigma},\mathbf{R}\mathbf{Q}\right)\right]=\hat{\bm{\mathsf{a}}}\!\left[\hat{\mathbf{F}}\!\left(\bm{\sigma},\mathbf{R}\right)\right].

(3.32)

Hence, using the notation of eq.(3.21), we have obtained the key identity

\displaystyle\bar{\bm{\mathsf{a}}}\!\left(\bm{\sigma},\mathbf{R}\mathbf{Q}\right)=\bar{\bm{\mathsf{a}}}\!\left(\bm{\sigma},\mathbf{R}\right)\quad\forall\mathbf{Q}\in\mathrm{Sym}(\tilde{W}).

(3.33)

Eq.(3.33) implies that two distinct rotation matrices $\mathbf{R},\mathbf{R}^{\prime}\in\mathbf{SO}(3)$ will lead to the same elastic tensor via eq.(3.21) if $\mathbf{R}^{T}\mathbf{R}^{\prime}\in\mathrm{Sym}(\tilde{W})$ . It is readily verified that this defines an equivalence relation, meaning that $\mathbf{SO}(3)$ can be partitioned into distinct equivalence classes, with the resulting quotient space denoted by $\mathbf{SO}(3)/\mathrm{Sym}(\tilde{W})$ . The function $\bar{\bm{\mathsf{a}}}\!\left(\bm{\sigma},\cdot\right)$ therefore depends not on the rotation matrix $\mathbf{R}$ directly, but only on the equivalence class in $\mathbf{SO}(3)/\mathrm{Sym}(\tilde{W})$ to which it belongs. In this manner the number of additional parameters required to fix a definite value of the elastic tensor can be reduced.

In summary, we have shown that it is possible to express the elastic tensor as a function of equilibrium stress, but only at the cost of introducing extra arbitrary parameters. Given our initial comments about the form of $\hat{\bm{\sigma}}$ , the presence of these parameters is not surprising. After all, we were essentially trying to fix nine numbers knowing only six. What is pleasing is that we have been able to exploit material-frame indifference to ‘package’ these extra degrees of freedom into a rotation matrix and write down a solution that is still well-defined. In addition, although our argument appeared at first to suggest that the rotation matrix was wholly arbitrary, we have shown that the presence of material symmetries can reduce the number of arbitrary parameters to be specified. On a physical level, we have shown formally that if one observes a given Cauchy stress in an arbitrary (hyperelastic) material, and assumes the material to have reached its present state by some elastic deformation from a prior state with known properties, it is generally impossible to “disentangle” the effects of the rotation- and stretch-components of the elastic deformation. As a consequence, one cannot in general construct the elastic tensor unambiguously.

3.3 Linearisation

In a geophysical context it will often be convenient to regard the total equilibrium Cauchy stress in the body $\mathcal{B}$ as some small perturbation to the equilibrium Cauchy stress of the reference-body $\tilde{\mathcal{B}}$ . It is therefore useful to linearise expression (3.21), the calculations for which are laid out in Appendix C. We assume that there exists a zeroth-order equilibrium configuration $\tilde{\mathcal{B}}$ possessing Cauchy stress $\bm{\sigma}^{0}$ and elastic tensor $\bm{\mathsf{a}}^{0}=\bm{\mathsf{c}}^{0}+\bm{\mathsf{R}}_{\bm{\sigma}^{0}}$ . We can set $\mathbf{R}^{0}$ to the identity without loss of generality because this zeroth-order state is taken to be known.

We linearise the system about a small perturbing stress. With $\epsilon$ a small parameter, the stress is set to

\displaystyle\bm{\sigma}=\bm{\sigma}^{0}+\epsilon\,\bm{\sigma}^{1}.

(3.34)

We must also linearise the rotation matrix of eq.(3.21). It is set to the identity at zeroth-order, so its perturbation satisfies

\displaystyle\mathbf{R}=\mathbf{1}+\epsilon\,\bm{\omega}^{1}+\mathcal{O}\!\left(\epsilon^{2}\right),

(3.35)

with $\bm{\omega}^{1}$ an antisymmetric matrix. Substituting these into eq.(3.21) and Taylor expanding, we may write

\displaystyle\bm{\mathsf{a}}=\bm{\mathsf{a}}^{0}+\epsilon\,\bm{\mathsf{a}}^{1}+\mathcal{O}\!\left(\epsilon^{2}\right).

(3.36)

The perturbed elastic tensor $\bm{\mathsf{a}}^{1}$ is decomposed as

\displaystyle\bm{\mathsf{a}}^{1}=\bm{\mathsf{c}}^{1}+\bm{\mathsf{R}}_{\bm{\sigma}^{1}},

(3.37)

consistent with eq.(2.42). Under these definitions, we show in Appendix C.1 that $\bm{\mathsf{c}}^{1}$ is given by


	$\displaystyle\bm{\mathsf{c}}^{1}=\bm{\mathsf{X}}_{\mathbf{u}^{1}+\bm{\omega}^{1}}\bm{\mathsf{c}}^{0}+\bm{\mathsf{c}}^{0}\bm{\mathsf{X}}_{\mathbf{u}^{1}-\bm{\omega}^{1}}-\bm{\mathsf{c}}^{0}\mathrm{tr}\!\left(\mathbf{u}^{1}\right)+8D^{3}\tilde{V}\!\left(\mathbf{1}\right)\cdot\mathbf{u}^{1},$		(3.38a)
	where $\mathbf{u}^{1}$ is a symmetric matrix which satisfies a generalised linear stress-strain relationship
	$\displaystyle\bm{\sigma}^{1}-\bm{\mathsf{X}}_{\bm{\omega}^{1}}\cdot\bm{\sigma}^{0}=\!\left(\bm{\mathsf{c}}^{0}+\overline{\bm{\mathsf{X}}}_{\bm{\sigma}^{0}}-\bm{\sigma}^{0}\otimes\mathbf{1}\right)\cdot\mathbf{u}^{1}.$		(3.38b)

In these expressions we have introduced the tensor product on matrices (eq.A.19) and the notations $\bm{\mathsf{X}}$ (eq.A.25) and $\overline{\bm{\mathsf{X}}}$ (eq.A.32). Thus, in order to fully specify the perturbation to the elastic tensor we must provide:

(1).

the perturbation to the stress, $\bm{\sigma}^{1}$ ;
(2).

$\bm{\sigma}^{0}$ and $\bm{\mathsf{c}}^{0}$ , which encode information about the zeroth-order equilibrium body;
(3).

further information about the zeroth-order equilibrium body, via the third derivatives of its strain-energy function at equilibrium;
(4).

an arbitrary antisymmetric matrix $\bm{\omega}^{1}$ .

Eqs.(3.38) is a general result whose derivation makes no particular demands on the form of the zeroth-order equilibrium configuration, but we can already make several observations. Firstly, no matter its functional form, $\bm{\mathsf{c}}^{1}$ can be shown to possess all the classical elastic symmetries, as required by eq.(2.42). Secondly, given that it is explicitly linear in the stress-perturbation, this theory is superficially rather close to those of Dahlen (1972a), Dahlen & Tromp (1998), Tromp & Trampert (2018) and Tromp et al. (2019), discussed in Section 1. There are some important differences, though. For one, our linearised theory makes explicit reference to third derivatives of the strain-energy function at equilibrium. In fact, those third derivatives can be seen as parametrising the theory. Moreover, eqs.(3.38) is parametrised further by the arbitrary degrees of freedom associated with $\bm{\omega}^{1}$ . One might imagine that terms in $\bm{\omega}^{1}$ would drop out when we compute, say, the Christoffel operator, so that it would have no effect on observable phenomena, but we show later, for the particular case of a transversely-isotropic material, that this does not happen. The relationship between the existing theories and this new linearised theory will become clearer as we consider some more concrete examples.

4 Examples

We now illustrate how the preceding results apply to a few different physical situations. We will consider both large and small stresses, making use of eqs.(3.38) for the latter. The examples discussed here are intended simply to illustrate the general behaviour implied by our theoretical results. For that reason we have used standard, simple strain-energy functions, and all physical quantities are presented suitably nondimensionalised. Henceforward, we will refer to the body $\mathcal{B}$ of the previous section simply as ‘the equilibrium body’. We will describe the fixed reference body $\tilde{\mathcal{B}}$ as the background body, and similarly for all its associated quantities such as strain-energy function and material symmetry group. All calculations for the following examples were carried out in Mathematica 12 (with the relevant notebook included in the supplementary material). The scenarios involving large stress required numerical inversion of the nonlinear function $\hat{\bm{\Sigma}}$ , for which we used Mathematica’s inbuilt ‘FindRoot’ function.

4.1 Isotropic materials

We begin by considering isotropic, stress-free background bodies. In the isotropic special case $\bm{\sigma}$ alone provides a unique specification of $\bm{\mathsf{a}}$ . To see this, it is sufficient to return to the identity

\displaystyle\bar{\bm{\mathsf{a}}}\!\left(\bm{\sigma},\mathbf{R}\right)=\bar{\bm{\mathsf{a}}}\!\left(\bm{\sigma},\mathbf{R}\mathbf{Q}\right),

(4.1)

for some $\mathbf{R}$ and $\mathbf{Q}$ both in $\mathbf{SO}(3)$ . By the standard group axioms we may write

\displaystyle\mathbf{Q}=\mathbf{R}^{T}\mathbf{R}^{\prime},

(4.2)

where $\mathbf{R}^{\prime}$ is another arbitrary element of $\mathbf{SO}(3)$ , whence

\displaystyle\bar{\bm{\mathsf{a}}}\!\left(\bm{\sigma},\mathbf{R}\right)=\bar{\bm{\mathsf{a}}}\!\left(\bm{\sigma},\mathbf{R}^{\prime}\right).

(4.3)

The matrices $\mathbf{R}$ and $\mathbf{R}^{\prime}$ are both arbitrary, so $\bar{\bm{\mathsf{a}}}\!\left(\bm{\sigma},\cdot\right)$ is independent of the choice of rotation matrix. When evaluating $\bar{\bm{\mathsf{a}}}$ we may therefore set $\mathbf{R}=\mathbf{1}$ without loss of generality. The elastic tensor is then given conveniently by

\displaystyle\bm{\mathsf{a}}=\bar{\bm{\mathsf{a}}}\!\left(\bm{\sigma},\mathbf{1}\right).

(4.4)

This argument shows that all rotations are equivalent up to right multiplication by $\mathbf{SO}(3)$ ; in other words, the quotient space

\displaystyle\mathbf{SO}(3)/\mathrm{Sym}(\tilde{W})=\mathbf{SO}(3)/\mathbf{SO}(3)

(4.5)

is a set which contains just one element. With these preliminaries we are in position to investigate the behaviour of isotropic materials under induced stress.

4.1.1 Exact response to deviatoric stress

The nonlinearity of expression (3.21) implies that a general material’s response to induced stress is influenced by derivatives of the strain-energy function higher than second-order.

We demonstrate this in Fig. 5, contrasting the slowness surfaces of two different isotropic materials under the same induced stress. The strain-energy functions describing the background bodies are (e.g. Holzapfel, 2000) modified Saint-Venant Kirchhoff,

\displaystyle\tilde{W}_{\mathrm{MSVK}}\!\left(\mathbf{F}\right)=\frac{\lambda}{2}\log^{2}{J}+\frac{\mu}{4}\mathrm{tr}\!\left(\,\!\left[\mathbf{C}-\mathbf{1}\right]^{2}\right),

(4.6)

and neo-Hookean,

\displaystyle\tilde{W}_{\mathrm{NH}}\!\left(\mathbf{F}\right)=\frac{\mu}{2}\!\left[\mathrm{tr}\!\left(\mathbf{C}\right)-3+\frac{2\mu}{\lambda}(J^{-\frac{\lambda}{\mu}}-1)\right],

(4.7)

where $\lambda$ and $\mu$ are constants. The limit of vanishing induced stress is obtained in both cases by evaluating the background strain-energy functions and their derivatives at $\mathbf{F}=\mathbf{1}$ . In that case the materials are indistinguishable, each possessing the classical isotropic elastic tensor

\displaystyle\mathsf{a}_{ijkl}=\mathsf{c}_{ijkl}=\lambda\delta_{ij}\delta_{kl}+\mu\!\left(\delta_{ik}\delta_{jl}+\delta_{il}\delta_{jk}\right).

(4.8)

Indeed, any strain-energy function that purports to describe an isotropic solid must give this result in the relevant limit. It is also apparent that $\lambda$ and $\mu$ should be interpreted as the standard Lamé parameters. Under large induced stress, though, the strain-energy functions are evaluated away from the identity, meaning that third- and higher-order derivatives become relevant. As shown in Fig.5, where we have induced a deviatoric stress of magnitude $\|\bm{\sigma}\|\sim\mu$ , the materials then display distinct behaviour.

4.1.2 Exact response to hydrostatic stress

When a hydrostatic pressure is induced in a stress-free, isotropic solid, its only effect on the elastic tensor is to change the elastic moduli. Here we derive exact expressions for $\lambda$ and $\mu$ as functions of pressure. We return to eqs.(3.18) and write

\displaystyle\mathbf{U}=\phi\mathbf{1},

(4.9)

for some positive scalar $\phi$ . Given that the system is isotropic and the induced stress hydrostatic, it follows from symmetry considerations and eq.(3.28) that $\mathbf{U}$ cannot take any other form. The deformation gradient itself can only ever be known up to an arbitrary rotation matrix, so it is given by

\displaystyle\mathbf{F}=\mathbf{R}\mathbf{U}=\phi\mathbf{R},

(4.10)

with $\mathbf{R}\in\mathbf{SO}(3)$ , while the right Cauchy–Green deformation tensor is

\displaystyle\mathbf{C}=\phi^{2}\mathbf{1}.

(4.11)

This deformation gradient corresponds physically to a local compression or dilation of the background-body with a rotation superimposed; $\phi<1$ effects a compression and vice versa. If the resulting equilibrium pressure in $\mathcal{B}$ is $p$ , the Cauchy stress is $\bm{\sigma}=-p\mathbf{1}$ , so from eqs.(3.14,3.17,3.18)


$\displaystyle-p\mathbf{1}$	$\displaystyle=\frac{2}{\phi}D\tilde{V}\!\left(\phi^{2}\mathbf{1}\right)$	(4.12a)
$\displaystyle\bm{\mathsf{a}}$	$\displaystyle=4\phi D^{2}\tilde{V}\!\left(\phi^{2}\mathbf{1}\right)+\bm{\mathsf{R}}_{\bm{\sigma}}.$	(4.12b)

We can set $\mathbf{R}$ to the identity in these expressions because the background material is isotropic (see eq.4.4).

Now, for the stress-free isotropic medium represented by $\tilde{\mathcal{B}}$ , the strain-energy function $\tilde{V}$ is a function of the three scalar invariants of $\mathbf{C}$ (Holzapfel, 2000). Defining the scalar invariants as

\displaystyle I_{i}\!\left(\mathbf{C}\right)\equiv\mathrm{tr}\!\left(\mathbf{C}^{i}\right),\quad i=1,2,3,

(4.13)

we can write the strain-energy function as

\displaystyle\tilde{V}\!\left(\mathbf{C}\right)\equiv\widehat{V}\!\left[I_{1}\!\left(\mathbf{C}\right),I_{2}\!\left(\mathbf{C}\right),I_{3}\!\left(\mathbf{C}\right)\right].

(4.14)

From the chain rule, its first and second derivatives are

	$\displaystyle D\tilde{V}$	$\displaystyle=\sum_{i}\widehat{V}_{i}\,DI_{i}$		(4.15)
	$\displaystyle D^{2}\tilde{V}$	$\displaystyle=\sum_{ij}\widehat{V}_{ij}\,DI_{i}\otimes DI_{j}+\sum_{i}\widehat{V}_{i}\,D^{2}I_{i},$		(4.16)

where we have written the derivatives of $\widehat{V}$ with respect to its arguments in an obvious way. When the derivatives are evaluated at $\mathbf{C}=\phi^{2}\mathbf{1}$ , we will write e.g.

\displaystyle v_{13}=\widehat{V}_{13}\!\left[I_{1}\!\left(\phi^{2}\mathbf{1}\right)\!,\,I_{2}\!\left(\phi^{2}\mathbf{1}\right)\!,\,I_{3}\!\left(\phi^{2}\mathbf{1}\right)\right],

(4.17)

and similarly for the other derivatives. With this, one finds after a little algebra that


$\displaystyle-p\mathbf{1}$	$\displaystyle=\frac{2}{\phi}\!\left(v_{1}+2v_{2}\phi^{2}+3v_{3}\phi^{4}\right)\mathbf{1}$	(4.18a)
$\displaystyle\bm{\mathsf{a}}$	$\displaystyle=4\phi\!\left[\!\left(v_{11}+4v_{12}\phi^{2}+\!\left(4v_{22}+6v_{13}\right)\phi^{4}+12v_{23}\phi^{6}+9v_{33}\phi^{8}\right)\mathbf{1}\otimes\mathbf{1}\right.$
	$\displaystyle\qquad\quad\left.+\!\left(2v_{2}+6v_{3}\phi^{2}\right)\overline{\bm{\mathsf{id}}}\right]+\bm{\mathsf{R}}_{\bm{\sigma}}.$	(4.18b)

where the operator $\overline{\bm{\mathsf{id}}}$ has been defined in eq.(A.33) and has components

\displaystyle(\overline{\bm{\mathsf{id}}})_{ijkl}=\frac{1}{2}\!\left(\delta_{ik}\delta_{jl}+\delta_{il}\delta_{jk}\right).

(4.19)

We identify the coefficients of the first two operators in the expression for $\bm{\mathsf{a}}$ as the Lamé parameters, given as functions of $\phi$ , so we reach the equations


	$\displaystyle\lambda=4\phi\!\left[v_{11}+4v_{12}\phi^{2}+\!\left(4v_{22}+6v_{13}\right)\phi^{4}+12v_{23}\phi^{6}+9v_{33}\phi^{8}\right]$		(4.20a)
	$\displaystyle\mu=4\phi\!\left(v_{2}+3v_{3}\phi^{2}\right).$		(4.20b)
	$\phi$ is obtained as a function of $p$ by inverting the relation
	$\displaystyle-p=\frac{2}{\phi}\!\left(v_{1}+2v_{2}\phi^{2}+3v_{3}\phi^{4}\right),$		(4.20c)

which would be performed numerically for a general strain-energy function. The Lamé parameters are then given as explicit functions of the equilibrium pressure, parametrised by the derivatives of $\widehat{V}$ .

It is also useful to note that the form of $\mathbf{U}$ considered here, despite producing a stressed configuration from an unstressed one, does not alter the material symmetry group, $\mathrm{Sym}(\tilde{W})=\mathbf{SO}(3)$ . Material symmetry groups transform under a particle-relabelling transformation according to eq.(2.48), and with $\mathbf{F}=\phi\mathbf{R}$

$\displaystyle\mathrm{Sym}(W)$	$\displaystyle=\!\left\{\mathbf{F}^{-1}\mathbf{Q}\mathbf{F},\mathbf{Q}\in\mathrm{Sym}(\tilde{W})\right\}$
	$\displaystyle=\!\left\{\mathbf{R}^{T}\mathbf{Q}\mathbf{R},\mathbf{Q}\in\mathrm{Sym}(\tilde{W})\right\}$
	$\displaystyle=\mathrm{Sym}(\tilde{W}).$	(4.21)

As expected on physical grounds, inducing a pressure in an isotropic body does not break the isotropy. All our conclusions from Section 3 therefore apply to an isotropic body even under hydrostatic stress. In particular, we can linearise about a hydrostatically stressed equilibrium given by $\mathbf{U}=\mathbf{1}$ (Appendix C.1) without having to refer the system back to some unstressed state.

4.1.3 Linearised response to small stress

Eqs.(3.38) simplify dramatically when we consider a small stress induced in a hydrostatically pre-stressed equilibrium. The total stress is written as

\displaystyle\bm{\sigma}=-p^{0}\mathbf{1}-p^{1}\mathbf{1}+\bm{\tau}^{1},

(4.22)

with

$\displaystyle p^{1}$	$\displaystyle\ll p^{0}$
$\displaystyle\\|\bm{\tau}^{1}\\|$	$\displaystyle\ll p^{0}$
$\displaystyle\mathrm{tr}\!\left(\bm{\tau}^{1}\right)$	$\displaystyle=0,$	(4.23)

and the complete elastic tensor is (Appendix C.2)

	$\displaystyle\mathsf{a}_{ijkl}$	$\displaystyle=\!\left(\kappa-\frac{2}{3}\mu+p^{1}c\right)\delta_{ij}\delta_{kl}+\!\left(\mu+p^{1}d\right)\!\left(\delta_{ik}\delta_{jl}+\delta_{il}\delta_{jk}\right)-\!\left(p^{0}+p^{1}\right)\delta_{ik}\delta_{jl}$
		$\displaystyle\qquad+a\!\left(\delta_{ij}\tau^{1}_{kl}+\delta_{kl}\tau^{1}_{ij}\right)+b\!\left(\delta_{ik}\tau^{1}_{jl}+\delta_{jl}\tau^{1}_{ik}+\delta_{il}\tau^{1}_{jk}+\delta_{jk}\tau^{1}_{il}\right)+\delta_{ik}\tau^{1}_{jl}.$		(4.24)

The constants $a$ , $b$ , $c$ and $d$ are defined to be


$\displaystyle a$	$\displaystyle=\frac{\kappa-\frac{2}{3}\mu+\frac{1}{2}\zeta_{2}}{\mu-p^{0}}$	(4.25a)
$\displaystyle b$	$\displaystyle=\frac{\mu+\frac{1}{4}\zeta_{3}}{\mu-p^{0}}$	(4.25b)
$\displaystyle c$	$\displaystyle=-\frac{\kappa-\frac{2}{3}\mu+3\zeta_{1}+2\zeta_{2}}{3\kappa+p^{0}}$	(4.25c)
$\displaystyle d$	$\displaystyle=-\frac{\mu+\frac{3}{2}\zeta_{2}+\zeta_{3}}{3\kappa+p^{0}},$	(4.25d)

with $\zeta_{1}$ , $\zeta_{2}$ and $\zeta_{3}$ the Murnaghan constants (Murnaghan, 1937) which offer a complete characterisation of the third derivatives of an isotropic strain-energy function about equilibrium. Up to third-order accuracy in the deformation gradient, specification of $p^{0}$ , $\kappa$ , $\mu$ , $\zeta_{1}$ , $\zeta_{2}$ and $\zeta_{3}$ is sufficient to fix all the elastic properties of the background body. In light of Section 4.1.2, it should be emphasised that $\kappa$ , $\mu$ , $\zeta_{1}$ , $\zeta_{2}$ and $\zeta_{3}$ are constants defined relative to the hydrostatically stressed background. It should be noted that these results first appeared (up to notation) in the nearly forgotten work of Walton (1974, Section 5), which was in part a response to Dahlen’s (1972a; 1972b) papers.

Eq.(4.1.3) is precisely equivalent to eq.(1.1.2). In particular, the four independent components of $\bm{\Pi}$ that we identified by symmetry arguments in Section 1.1.2 have simply dropped out of the linearisation procedure. Moreover, our consideration of constitutive theory has led to a more precise definition of those constants. All four are explicitly determined by the constitutive relation used to describe the background body, emerging from the theory as dimensionless combinations of the equilibrium pressure, shear and bulk moduli, and the three Murnaghan constants. This shows explicitly that they represent just three degrees of freedom. These results reduce to those of Dahlen (1972b) and Tromp & Trampert (2018) for certain values of the elastic moduli and Murnaghan constants; we present a detailed comparison between this theory and previous work in Appendix D.

4.2 Transversely-isotropic materials

Whilst an isotropic material’s response to a given induced stress is determined solely by its background strain-energy function, we also need to account for eq.(3.21)’s non-unique stress-dependence when considering materials with smaller symmetry groups. For example, we stated above that the symmetry group of a stress-free, transversely-isotropic material is $\mathbf{SO}(2)$ . A definite value of $\bm{\mathsf{a}}$ therefore depends upon the choice of an arbitrary element of the quotient space $\mathbf{SO}(3)/\mathbf{SO}(2)$ . This reflects the fact that $\bar{\bm{\mathsf{a}}}\!\left(\bm{\sigma},\cdot\right)$ cannot distinguish between matrices that only differ in how much rotation they cause about the symmetry axis. Therefore to evaluate $\bar{\bm{\mathsf{a}}}\!\left(\bm{\sigma},\mathbf{R}\right)$ we should only choose arbitrarily between rotation matrices whose own axes of rotation lie in the plane perpendicular to the symmetry axis. In order to pick such a matrix we must choose a direction for its rotation-axis – a direction in $\mathbb{R}^{2}$ described by an angle $\phi$ – and then specify an angle of rotation $\theta$ about that axis. Careful consideration of the possibility of double-counting shows that

\displaystyle\begin{aligned} \theta&\in[0,\pi)\\ \phi&\in[0,2\pi)\end{aligned}

(4.26)

(or vice versa). It is clear that we have effectively specified a point on $\mathbb{S}^{2}$ , the unit 2-sphere, or equivalently a direction in $\mathbb{R}^{3}$ . Indeed, it may be established by rigorous methods that $\mathbf{SO}(3)/\mathbf{SO}(2)\cong\mathbb{S}^{2}$ .

4.2.1 Exact response to deviatoric stress

The effect of $\bm{\mathsf{a}}$ ’s non-unique stress dependence is illustrated by Fig. 6, which shows slowness surfaces of a material described by the transversely-isotropic strain-energy function

\displaystyle\tilde{W}_{\mathrm{TI}}\!\left(\mathbf{F}\right)=\tilde{W}_{\mathrm{MSVK}}\!\left(\mathbf{F}\right)+\!\left[\alpha+2\beta\log{J}+\gamma\!\left(I_{4}-1\right)\right]\!\left(I_{4}-1\right)-\frac{\alpha}{2}\!\left(I_{5}-1\right).

(4.27)

In this equation $\alpha$ , $\beta$ and $\gamma$ are extra material constants required to describe a transversely-isotropic material, while $I_{4}$ and $I_{5}$ are the two further scalar invariants in terms of which a transversely-isotropic strain energy function is parametrised (Holzapfel, 2000). They are defined as

	$\displaystyle I_{4}$	$\displaystyle=\left\langle\bm{\nu},\mathbf{C}\cdot\bm{\nu}\right\rangle$		(4.28)
	$\displaystyle I_{5}$	$\displaystyle=\left\langle\bm{\nu},\mathbf{C}^{2}\cdot\bm{\nu}\right\rangle,$		(4.29)

with the unit-vector $\bm{\nu}$ pointing along the material’s symmetry-axis. This strain-energy function is adapted from Bonet & Burton (1998), although our definition of $\beta$ differs from theirs by a factor of two and we have defined the ‘isotropic part’ of the function differently.

The equilibrium configuration is unstressed, with elastic tensor

\displaystyle\mathsf{a}_{ijkl}=\mathsf{c}_{ijkl}

\displaystyle=\lambda\delta_{ij}\delta_{kl}+\mu\!\left(\delta_{ik}\delta_{jl}+\delta_{il}\delta_{jk}\right)+8\gamma\nu_{i}\nu_{j}\nu_{k}\nu_{l}+4\beta\!\left(\nu_{i}\nu_{j}\delta_{kl}+\delta_{ij}\nu_{k}\nu_{l}\right)-\alpha\!\left(\nu_{i}\nu_{k}\delta_{jl}+\nu_{j}\nu_{k}\delta_{il}+\nu_{j}\nu_{l}\delta_{ik}+\nu_{i}\nu_{l}\delta_{jk}\right),

(4.30)

which can be written alternatively as

\displaystyle\bm{\mathsf{a}}=\bm{\mathsf{c}}

\displaystyle=\lambda\mathbf{1}\otimes\mathbf{1}+2\mu\overline{\bm{\mathsf{id}}}+8\gamma\mathbf{N}\otimes\mathbf{N}+4\beta\!\left(\mathbf{1}\otimes\mathbf{N}+\mathbf{N}\otimes\mathbf{1}\right)-2\alpha\overline{\bm{\mathsf{X}}}_{\mathbf{N}}

(4.31)

if we define $\mathbf{N}$ as

\displaystyle\mathbf{N}\equiv\bm{\nu}\otimes\bm{\nu}.

(4.32)

We have induced the same stress in the material in both panels of the figure, but selected different $\!\left(\theta,\phi\right)$ pairs, producing slowness surfaces of different shapes. It should be emphasised that the material is described by the same strain-energy function in both panels; that the two slowness surfaces demonstrate distinct physical behaviour is due solely to $\bm{\mathsf{a}}$ ’s non-unique dependence on $\bm{\sigma}$ (eq.3.21).

4.2.2 Linearised response to deviatoric stress

Finally, we consider how a transversely-isotropic material responds to a small induced stress. This is the simplest nontrivial example in which one can show analytically how the arbitrary rotation matrix of eq.(3.21) manifests in the linearised elastic tensor.

In an isotropic material we took the background (zeroth-order) state to be hydrostatic, but in the transversely-isotropic case we should consider a more general zeroth-order stress of the form

\displaystyle\bm{\sigma}^{0}

\displaystyle=-\!\left(p^{0}+\frac{q^{0}}{3}\right)\mathbf{1}+q^{0}\bm{\nu}\otimes\bm{\nu}.

(4.33)

The pressure is $p^{0}$ as before, but the stress now possesses in addition a background deviatoric component consistent with the $\mathbf{SO}(2)$ symmetry. We then induce a small stress $\bm{\sigma}^{1}$ , and a tedious calculation laid out in Appendix C.3 leads to the linearised elastic tensor

$\displaystyle\bm{\mathsf{c}}^{1}$	$\displaystyle=\eta_{1}\mathbf{1}\otimes\mathbf{1}+\eta_{2}\overline{\bm{\mathsf{id}}}+\eta_{3}\mathbf{N}\otimes\mathbf{N}+\eta_{4}\!\left(\mathbf{1}\otimes\mathbf{N}+\mathbf{N}\otimes\mathbf{1}\right)+\eta_{5}\overline{\bm{\mathsf{X}}}_{\mathbf{N}}$
	$\displaystyle\qquad+\eta_{6}\overline{\bm{\mathsf{X}}}_{\bm{\sigma}^{1}}+\eta_{7}\!\left(\mathbf{1}\otimes\bm{\sigma}^{1}+\bm{\sigma}^{1}\otimes\mathbf{1}\right)+\eta_{8}\!\left(\mathbf{N}\otimes\bm{\sigma}^{1}+\bm{\sigma}^{1}\otimes\mathbf{N}\right)+\eta_{9}\!\left[\mathbf{1}\otimes\!\left(\bm{\mathsf{X}}_{\mathbf{N}}\cdot\bm{\sigma}^{1}\right)+\!\left(\bm{\mathsf{X}}_{\mathbf{N}}\cdot\bm{\sigma}^{1}\right)\otimes\mathbf{1}\right]$
	$\displaystyle\qquad+\eta_{10}\!\left[\mathbf{N}\otimes\!\left(\bm{\mathsf{X}}_{\mathbf{N}}\cdot\bm{\sigma}^{1}\right)+\!\left(\bm{\mathsf{X}}_{\mathbf{N}}\cdot\bm{\sigma}^{1}\right)\otimes\mathbf{N}\right]+\eta_{11}\overline{\bm{\mathsf{X}}}_{\!\left(\bm{\mathsf{X}}_{\mathbf{N}}\cdot\bm{\sigma}^{1}\right)}+\eta_{12}\!\left(\overline{\bm{\mathsf{X}}}_{\bm{\sigma}^{1}}\overline{\bm{\mathsf{X}}}_{\mathbf{N}}+\overline{\bm{\mathsf{X}}}_{\mathbf{N}}\overline{\bm{\mathsf{X}}}_{\bm{\sigma}^{1}}\right)$
	$\displaystyle\qquad+\eta_{13}\overline{\bm{\mathsf{X}}}_{\bm{\omega}}+\eta_{14}\!\left[\mathbf{1}\otimes\!\left(\bm{\mathsf{X}}_{\bm{\omega}}\cdot\mathbf{N}\right)+\!\left(\bm{\mathsf{X}}_{\bm{\omega}}\cdot\mathbf{N}\right)\otimes\mathbf{1}\right]+\eta_{15}\!\left[\mathbf{N}\otimes\!\left(\bm{\mathsf{X}}_{\bm{\omega}}\cdot\mathbf{N}\right)+\!\left(\bm{\mathsf{X}}_{\bm{\omega}}\cdot\mathbf{N}\right)\otimes\mathbf{N}\right]$
	$\displaystyle\qquad+\eta_{16}\!\left(\overline{\bm{\mathsf{X}}}_{\bm{\omega}}\overline{\bm{\mathsf{X}}}_{\mathbf{N}}-\overline{\bm{\mathsf{X}}}_{\mathbf{N}}\overline{\bm{\mathsf{X}}}_{\bm{\omega}}\right),$	(4.34)

with the constants $\!\left\{\eta_{i}\right\}$ defined in eqs.(C.90). The antisymmetric matrix $\bm{\omega}$ defines a vector pointing in an arbitrary direction in the plane perpendicular to the unperturbed material symmetry axis. Its components are

\displaystyle\omega_{i}=-\epsilon_{ijk}\omega_{jk},

(4.35)

where $\epsilon_{ijk}$ are the components of the Levi-Civita tensor, and its direction and magnitude are precisely the two arbitrary constants that must be fixed. On the other hand the $\!\left\{\eta_{i}\right\}$ are unique, scalar-valued functions of:

(1).

the constants $p^{0}$ and $q^{0}$ which parametrise the zeroth-order equilibrium stress;
(2).

the transversely-isotropic constants $\lambda$ , $\mu$ , $\alpha$ , $\beta$ and $\gamma$ (which are implicitly functions of $p^{0}$ , $q^{0}$ and their respective stress-free values);
(3).

$\mathrm{tr}\!\left(\bm{\sigma}^{1}\right)$ and $\left\langle\bm{\nu},\bm{\sigma}^{1}\cdot\bm{\nu}\right\rangle$ ;
(4).

the seven constants $\!\left\{\zeta_{i}\right\}$ defined in eq.(C.3) which, analogously to the Murnaghan constants, parametrise the third derivatives of a transversely-isotropic strain-energy function.

The precise functional forms of the $\!\left\{\eta_{i}\right\}$ are not nearly as important as the fact that they depend on the seven further degrees of freedom represented by the $\!\left\{\zeta_{i}\right\}$ , not to mention the two arbitrary parameters contained within $\bm{\omega}^{1}$ . In contrast, when Tromp et al.’s (2019) result is specialised to the transversely-isotropic case it has just five free parameters, namely the pressure derivatives of the elastic moduli (the independent components of their tensor $\bm{\Gamma}^{\prime}$ ). We also believe that the term in $\eta_{11}$ within eq.(4.2.2) is not present in Tromp et al.’s expression.

As a last point, let us consider the perturbation to the Christoffel operator associated with terms in $\bm{\omega}^{1}$ . Continuing to ignore spatial arguments, one can show that definition (2.27) is equivalent to

\displaystyle\left\langle\mathbf{a},\rho\mathbf{B}\!\left(\mathbf{p}\right)\cdot\mathbf{a}\right\rangle=\left\langle\mathbf{a}\otimes\mathbf{p},\bm{\mathsf{a}}\cdot\!\left(\mathbf{a}\otimes\mathbf{p}\right)\right\rangle

(4.36)

for arbitrary $\mathbf{a}$ . From this it is a matter of algebra to show that the contribution to $\rho\mathbf{B}$ of the terms in $\bm{\omega}^{1}$ is

	$\displaystyle\rho\mathbf{B}$	$\displaystyle=\!\left(\eta_{14}+\eta_{16}\right)\left\langle\bm{\nu},\mathbf{p}\right\rangle\!\left[\mathbf{p}\otimes\!\left(\bm{\omega}^{1}\times\bm{\nu}\right)+\!\left(\bm{\omega}^{1}\times\bm{\nu}\right)\otimes\mathbf{p}\right]+\!\left(\eta_{14}+\frac{1}{2}\eta_{16}\right)\left\langle\bm{\omega}^{1}\times\bm{\nu},\mathbf{p}\right\rangle\!\left(\mathbf{p}\otimes\bm{\nu}+\bm{\nu}\otimes\mathbf{p}\right)$
		$\displaystyle\qquad+\!\left(\eta_{15}\left\langle\bm{\nu},\mathbf{p}\right\rangle^{2}+\eta_{16}\\|\mathbf{p}\\|^{2}\right)\!\left[\bm{\nu}\otimes\!\left(\bm{\omega}^{1}\times\bm{\nu}\right)+\!\left(\bm{\omega}^{1}\times\bm{\nu}\right)\otimes\bm{\nu}\right]+\left\langle\bm{\nu},\mathbf{p}\right\rangle\left\langle\bm{\omega}^{1}\times\bm{\nu},\mathbf{p}\right\rangle\!\left(\eta_{15}\bm{\nu}\otimes\bm{\nu}+\eta_{16}\mathbf{1}\right).$		(4.37)

Each of the coefficients $\eta_{14}$ , $\eta_{15}$ and $\eta_{16}$ depends on a different combination of the $\!\left\{\zeta_{i}\right\}$ . But the $\!\left\{\zeta_{i}\right\}$ parametrise the third-derivatives of the strain energy and can be specified independently both of the other elastic constants and of each other. It is therefore clear that $\bm{\omega}^{1}$ will generally contribute nonzero terms to the Christoffel operator.

5 Discussion

Working under the theory of finite elasticity, we have investigated how changes of equilibrium, changes in stress and changes in the elastic tensor are interrelated within hyperelastic bodies. Central to the discussion are eqs.(3.14), which we can restate in the notation of Dahlen & Tromp (1998, Sections 2.10 & 3.6) as


$\displaystyle T^{0}_{ij}$	$\displaystyle=\frac{\rho^{0}}{\det\mathbf{F}}F_{ip}F_{jq}\frac{\partial U^{L}}{\partial E^{L}_{pq}}$	(5.1a)
$\displaystyle\Xi_{ijkl}$	$\displaystyle=\frac{\rho^{0}}{\det\mathbf{F}}F_{ip}F_{jq}F_{kr}F_{ls}\frac{\partial^{2}U^{L}}{\partial E^{L}_{pq}\partial E^{L}_{rs}}.$	(5.1b)

These equations tell us how a hyperelastic body’s equilibrium Cauchy stress and elastic tensor change when the body is deformed by a motion with deformation gradient $\mathbf{F}$ , assuming that the body is described by a given constitutive function $U^{L}$ . They can be used to approach both forward- and inverse-problems within geophysics, which we now illustrate by considering briefly the problem of monsoon loading, an example that also allows us to contextualise our main results.

In regions that experience heavy monsoons, the rainwater accumulating on the Earth’s surface causes the crust to be loaded nontrivially by different amounts at different times of the year (e.g. Fu et al., 2013). This load causes the crust to deform, and induces associated stresses within it. Given that the deformation occurs over a timescale of months – a timescale significantly greater than that associated with seismic wave propagation – we model the deformations to be quasi-static. That is, we take seismic waves propagating through the Earth at a given time to ‘see’ a static equilibrium Earth that does not interact with them dynamically. Nevertheless, the deformation of the crust does affect the propagation of the seismic waves because the equilibrium configuration has changed, and through eq.(5.1b) there is a consequent change in the elastic tensor. The results of this paper allow us to think about this problem in a number of ways.

A first approach makes direct use of eqs.(5.1) from the perspective of forward-modelling. For concreteness, say that we know the constitutive relation governing the Earth. We then model the deformation induced by the rainfall by solving a quasi-static mapping problem, from which we obtain the change in equilibrium configuration. The resultant mapping has a deformation gradient $\mathbf{F}$ , and we can use eqs.(5.1) to compute the resulting change in both the equilibrium Cauchy stress and the elastic tensor. Hence, we can directly compute the expected seismic anisotropy.

The procedure just outlined is appealing, but it rests on the assumption that we know the rainfall loading sufficiently well to predict the change in configuration. Although this might be realistic in some cases, it is unlikely to be true in general. If anything, it is perhaps more pragmatic to pose the inverse-problem instead: given that an equilibrium deformation leads to seismic anisotropy, and given that we can observe seismic waves readily, what can seismic data tell us about $\mathbf{F}$ and, hence, about the change in the equilibrium configuration? For this problem we do not even need eq.(5.1a); we need only collect seismic data and use eq.(5.1b) to invert for the nine components of $\mathbf{F}$ (taking account of the condition that $\mathbf{F}$ be the gradient of a mapping). The equilibrium mapping could then be (partially) reconstructed.

So far in this example, the problem has been couched in terms of the equilibrium mapping: not once have we needed to consider how the elastic tensor depends on equilibrium stress. But in addition to simply modelling the seismological effect of large deformations, we could also carry out studies that seek to invert seismological observations for changes in the equilibrium stress. In that case the mapping itself would not (necessarily) be a relevant quantity; we would be more interested in the direct stress dependence of the elastic tensor. Now, we could invert seismic data for $\mathbf{F}$ (as just described) and then use eq.(5.1a) to compute $\mathbf{T}^{0}$ , but it would be desirable from the perspective of inversion to find an explicit expression for $\bm{\Xi}$ as a function of $\mathbf{T}^{0}$ , not least because the Cauchy stress has fewer components than $\mathbf{F}$ . In searching for such an expression we are effectively asking if it is possible to parametrise an equilibrium configuration, not in terms of the deformation gradient that gave rise to it, but rather in terms of its present Cauchy stress. We are thus led to address the following problem: if we observe a given Cauchy stress and assume that it arose due to some elastic deformation of a state with given constitutive properties, what elastic tensor would we measure?

As discussed at length in Section 3, in order to find $\bm{\Xi}$ as a function of $\mathbf{T}^{0}$ we must eliminate $\mathbf{F}$ from eqs.(5.1). We solve (5.1a) in order to find $\mathbf{F}$ as a function of $\mathbf{T}^{0}$ , and substitute the result into (5.1b). Physically, (5.1a) describes how a deformation alters an elastic body’s equilibrium Cauchy stress. But because many different values of $\mathbf{F}$ can lead to the same change in $\mathbf{T}^{0}$ , the mathematical problem of inverting (5.1a) to find $\mathbf{F}$ in terms of $\mathbf{T}^{0}$ is naturally underdetermined. It turns out that $\mathbf{F}$ depends not only on $\mathbf{T}^{0}$ , but also on an arbitrary rotation matrix. One can then show that the elastic tensor itself is given as a function of both the Cauchy stress and an arbitrary rotation. As emphasised in Section 3, the presence of these arbitrary parameters shows that measurement of the Cauchy stress alone is not sufficient to constrain the elastic tensor fully. However, we also found that the more symmetric the initial, reference state, the fewer the arbitrary parameters. In fact, for an isotropic reference state one can ignore the rotation matrix entirely; in that case the Cauchy stress does provide enough information to fix the elastic tensor.

Having derived an expression for the nonlinear dependence of $\bm{\Xi}$ on $\mathbf{T}^{0}$ , we proceeded to linearise it. From the point of view of performing inversions this step is not strictly necessary, but it is useful for two reasons. Firstly, the changes in the Earth’s equilibrium stress due to quasi-static deformation are likely to be small because the deformations we consider will almost always be small. This will certainly be the case with monsoon loading, and we also refer the reader to a discussion of the effects of equilibrium stress on seismic waveforms in the Groningen gas field by Tromp & Trampert (2018, Section 8.2). The change in the elastic tensor will therefore be well described by a linearised theory, and such a theory should be marginally quicker to implement computationally. Secondly, the theories discussed in Section 1 are all linear in the stress, so we must linearise our theory in order to carry out a comparison.

Linearising within a general anisotropic material, the elastic tensor depends not only on $\Delta\mathbf{T}^{0}$ but also on a small, arbitrary, antisymmetric matrix $\bm{\omega}^{1}$ that results from linearising the rotation matrix of eq.(3.21). Our general linearised expression (3.38) can be expanded and rewritten in the notation of Section 1 to give

\displaystyle\Delta\Xi_{ijkl}=\Pi_{ijklmn}\Delta T^{0}_{mn}+\Pi^{\prime}_{ijklmn}\omega^{1}_{mn},

(5.2)

where $\bm{\Pi}$ is the tensor relating changes in $\bm{\Xi}$ to $\Delta\mathbf{T}^{0}$ , just as before, while we have defined another tensor $\bm{\Pi}^{\prime}$ that relates changes in $\bm{\Xi}$ to $\bm{\omega}^{1}$ . The components of $\bm{\Pi}$ and $\bm{\Pi}^{\prime}$ are expressed in terms of the third derivative of the strain-energy function evaluated in the background state, and the components of the background elastic tensor $\bm{\Gamma}$ ; it is trivial to show that they therefore possess up to 56 independent components besides the elastic moduli. These rather complicated expressions are given in Appendix A.4, while here we make two general points. Firstly, $\bm{\Pi}^{\prime}$ will not usually vanish, which indicates that even linearised stress dependence will generally involve some level of arbitrariness. Inspection of Section C.1 will show that the neglect of $\bm{\Pi}^{\prime}$ is equivalent to assuming that $\Delta\mathbf{T}^{0}$ was induced by a symmetric deformation gradient $\mathbf{F}$ , i.e. a pure stretch. Secondly, we have shown that $\bm{\Pi}$ need not possess the further symmetries,

\displaystyle\Pi_{ijklmn}=\Pi_{ijmnkl}=\Pi_{klmnij}=\Pi_{mnijkl}=\Pi_{mnklij},

(5.3)

imposed in eq.(1.28). In short, our theory is parametrised differently from that of Tromp et al. (2019); we require up to 59 ( $=$ 56 $+$ 3) components, while Tromp et al. need a maximum of 21. The respective theories are therefore unlikely to give the same results when used to perform inversions.

This statement can be substantiated a little further by applying our general linearised expression to a transversely-isotropic material. We found that seven material-dependent parameters and two arbitrary constants (besides the five elastic moduli) were needed to specify the elastic tensor’s linearised stress dependence. Our results cannot be equivalent to those of Tromp et al. because their expression requires just five further constants, parametrised as it is by pressure-derivatives of the elastic moduli. (We have not presented here the complete expression for transversely-isotropic $\bm{\Pi}$ ; we trust that the reader who has worked through Appendix C.3 will forgive us.)

We have performed the most complete comparison with Tromp et al.’s work in the linearised isotropic case – which is presumably the case of most practical importance. Our expression for the elastic tensor takes precisely the form derived in Section 1.1 featuring the four constants $a$ , $b$ , $c$ and $d$ (eq.1.1.2). Furthermore, we have shown by considering constitutive behaviour that those constants are functions of the Murnaghan constants $\zeta_{1,2,3}$ (Murnaghan, 1937):


$\displaystyle a$	$\displaystyle=\frac{\kappa-\frac{2}{3}\mu+\frac{1}{2}\zeta_{2}}{\mu-p^{0}}$	(5.4a)
$\displaystyle b$	$\displaystyle=\frac{\mu+\frac{1}{4}\zeta_{3}}{\mu-p^{0}}$	(5.4b)
$\displaystyle c$	$\displaystyle=-\frac{\kappa-\frac{2}{3}\mu+3\zeta_{1}+2\zeta_{2}}{3\kappa+p^{0}}$	(5.4c)
$\displaystyle d$	$\displaystyle=-\frac{\mu+\frac{3}{2}\zeta_{2}+\zeta_{3}}{3\kappa+p^{0}}.$	(5.4d)

As a result they represent three degrees of freedom. Up to notation, these four expressions first appeared in the work of Walton (1974, eqs.31–34). Taking into account different definitions of the elastic moduli, we have also established in Appendix D that the expressions of Dahlen (1972b) and Tromp & Trampert (2018) are consistent with this theory for certain parameter choices. Those theories thus apply to a subset of isotropic materials. If one wished to invert seismic data for equilibrium stress using Tromp & Trampert’s theory, one would need to specify (or invert for) two parameters, the pressure-derivatives of $\kappa$ and $\mu$ . Our theory would require three such parameters. Finally, it is interesting that we have found expressions for the pressure-derivatives of the elastic moduli ( $c$ and $d$ ; see Appendix D) in terms of the Murnaghan constants. To our knowledge, this result has not appeared in the literature since Walton’s work.

Our linearised results would probably be more taxing to apply to observational seismology than those of Tromp et al. because we require the measurement and fitting of more parameters. Moreover, the strain-energy function’s third-derivatives are at present measured less readily than the moduli’s pressure-derivatives. It is also evident from Tromp et al.’s ab initio calculations (carried out for a material with cubic symmetry) that the extra effects we have derived are not particularly large. We would be keen to see if further such calculations can clarify this apparent “unreasonable effectiveness” of pressure-derivatives. Nevertheless, our theory should be practical for isotropic reference states. In that case it just requires one more parameter than Tromp et al.’s, and the Murnaghan constants of various materials have been measured in the past (e.g. Hughes & Kelly, 1953; Egle & Bray, 1976; Payan et al., 2009).

There are a number of potential geophysical applications of this work, but we must first mention two caveats. Firstly, the Earth is not elastic over geological time-scales, hence it is not reasonable to regard its equilibrium as having arisen through a finite deformation of an elastic material away from some hypothetical stress-free state. Within future work it would therefore be interesting to extend our methods to account for viscoelastic effects. Nevertheless, our framework should give valid descriptions of phenomena that occur over time-scales sufficiently short for the Earth to respond in an elastic – or only slightly anelastic – manner. For example, one might consider the effect on elastic wave speeds of processes that are fast relative to viscoelastic relaxation times but slow compared to those of seismic wave propagation, such as body tides, seasonal loading in the hydrosphere, or anthropogenic activity. A second caveat is that inverting seismic data for equilibrium stress is already rendered challenging by the fact that the equilibrium stress is not an entirely free parameter, but is constrained to satisfy the equilibrium equations (Backus, 1967; Al-Attar & Woodhouse, 2010). Our results make clear that it is also necessary to either provide or simultaneously invert for additional parameters related to third derivatives of the strain-energy function and, in some cases, infinitesimal rotations. Finally, given the importance of symmetry groups to this work, we would also be interested to see how our results mesh with the theory of homogenisation (e.g. Cupillard & Capdeville, 2018; Capdeville & Métivier, 2018) which describes how small-scale structures are ‘smeared out’ to produce effective media with different symmetry properties (recall for instance Backus’s (1962) point that long-wavelength seismic waves passing through a layered isotropic medium ‘see’ a transversely-isotropic effective medium).

6 Conclusions

We have derived an expression for the elastic tensor as an explicit function of equilibrium Cauchy stress. Our results differ from previous treatments in two main ways: they show that the elastic tensor’s dependence on equilibrium stress is generally both nonlinear and non-unique. On account of the nonlinearity alone, knowledge of a material’s background elastic-tensor is not sufficient to determine the material’s response to an induced stress; we require the information contained within higher-order derivatives of the background strain-energy function. Furthermore, the elastic tensor is a function not only of the equilibrium stress, but also of an arbitrary rotation matrix. As such, even with a definite strain-energy function in hand, the change in the elastic tensor due to an induced stress depends on the non-unique choice of this matrix. The non-uniqueness arises from the fact that the stress is considered to have been induced by an unspecified elastic deformation. However, we have also shown that the degree of non-uniqueness is reduced if the material under study has a nontrivial background material symmetry group.

In the linearised case, our approach shows that the characterisation of a material’s response to a small induced stress depends on more parameters than have been made explicit in previous studies (Tromp & Trampert, 2018; Tromp et al., 2019). We have shown moreover that the previous expressions for the elastic tensor’s stress dependence can be obtained as special cases of our linearised results. Our expressions should be seen to extend the work of the previous authors by including some extra terms that were not captured by their derivation. The approach of first deriving a nonlinear expression, and only then linearising, has also allowed us to suggest a different interpretation of the parametrisation of the elastic tensor’s linearised stress dependence. Whilst Tromp et al. suggest that the relevant parameters are the pressure-derivatives of the elastic moduli, we propose the use of a larger set of parameters: the third-derivatives of the strain-energy function, and, for anisotropic materials, infinitesimal rotations. That said, we would like to emphasise once again that the expressions of Tromp et al. are able to fit experimental data rather well for the case of a cubic material. So whilst it seems theoretically necessary to account for more parameters, on a practical level it might not be required. This is a curious result that is not obvious to the present authors, and we would be keen to see it investigated further.

Acknowledgements.

MM is supported by an EPSRC studentship and a CASE award from BP. We are grateful to Jeroen Tromp, Brent Delbridge and editor Juan Carlos Afonso for their helpful comments and suggestions. We would particularly like to thank Prof. Tromp for taking the time to discuss our initial results with us at length. Those conversations contributed greatly to the shaping of Section 1, Section 3.1 and Appendix D, and they significantly improved our own understanding of the work.

References

Al-Attar & Crawford (2016) Al-Attar, D. & Crawford, O., 2016. Particle relabelling transformations in elastodynamics, Geophysical Journal International, 205(1), 575–593.
Al-Attar & Woodhouse (2010) Al-Attar, D. & Woodhouse, J. H., 2010. On the parametrization of equilibrium stress fields in the earth, Geophysical Journal International, 181(1), 567–576.
Al-Attar et al. (2018) Al-Attar, D., Crawford, O., Valentine, A. P., & Trampert, J., 2018. Hamilton’s principle and normal mode coupling in an aspherical planet with a fluid core, Geophysical Journal International.
Backus (1962) Backus, G. E., 1962. Long-wave elastic anisotropy produced by horizontal layering, Journal of Geophysical Research, 67(11), 4427–4440.
Backus (1967) Backus, G. E., 1967. Converting vector and tensor equations to scalar equations in spherical coordinates, Geophysical Journal International, 13(1-3), 71–101.
Bonet & Burton (1998) Bonet, J. & Burton, A., 1998. A simple orthotropic, transversely isotropic hyperelastic constitutive equation for large strain computations, Computer Methods in Applied Mechanics and Engineering, 162(1-4), 151–164.
Capdeville & Métivier (2018) Capdeville, Y. & Métivier, L., 2018. Elastic full waveform inversion based on the homogenization method: theoretical framework and 2-d numerical illustrations, Geophysical Journal International, 213(2), 1093–1112.
Cupillard & Capdeville (2018) Cupillard, P. & Capdeville, Y., 2018. Non-periodic homogenization of 3-d elastic media for the seismic wave equation, Geophysical Journal International, 213(2), 983–1001.
Dahlen & Tromp (1998) Dahlen, F. & Tromp, J., 1998. Theoretical global seismology, Princeton university press.
Dahlen (1972a) Dahlen, F. A., 1972a. Elastic dislocation theory for a self-gravitating elastic configuration with an initial static stress field, Geophysical Journal International, 28(4), 357–383.
Dahlen (1972b) Dahlen, F. A., 1972b. Elastic velocity anisotropy in the presence of an anisotropic initial stress, Bulletin of the Seismological Society of America, 62(5), 1183–1193.
Egle & Bray (1976) Egle, D. M. & Bray, D. E., 1976. Measurement of acoustoelastic and third-order elastic constants for rail steel, The Journal of the Acoustical Society of America, 60(3), 741–744.
Fu et al. (2013) Fu, Y., Argus, D. F., Freymueller, J. T., & Heflin, M. B., 2013. Horizontal motion in elastic response to seasonal loading of rain water in the amazon basin and monsoon water in southeast asia observed by GPS and inferred from GRACE, Geophysical Research Letters, 40(23), 6048–6053.
Gurtin et al. (2010) Gurtin, M. E., Fried, E., & Anand, L., 2010. The mechanics and thermodynamics of continua, Cambridge University Press.
Holzapfel (2000) Holzapfel, G., 2000. Nonlinear Solid Mechanics: A Continuum Approach for Engineering, John Wiley and Sons, Chichester, UK.
Hughes & Kelly (1953) Hughes, D. S. & Kelly, J. L., 1953. Second-order elastic deformation of solids, Physical Review, 92(5), 1145–1149.
Lamb (1881) Lamb, H., 1881. On the vibrations of an elastic sphere, Proceedings of the London Mathematical Society, s1-13(1), 189–212.
Marsden & Hughes (1994) Marsden, J. E. & Hughes, T. J., 1994. Mathematical foundations of elasticity, Courier Corporation.
Murnaghan (1937) Murnaghan, F. D., 1937. Finite deformations of an elastic solid, American Journal of Mathematics, 59(2), 235.
Noll (1965) Noll, W., 1965. Proof of the maximality of the orthogonal group in the unimodular group, Archive for Rational Mechanics and Analysis, 18(2), 100–102.
Noll (1974) Noll, W., 1974. A new mathematical theory of simple materials, Springer.
Payan et al. (2009) Payan, C., Garnier, V., Moysan, J., & Johnson, P. A., 2009. Determination of third order elastic constants in a complex solid applying coda wave interferometry, Applied Physics Letters, 94(1), 011904.
Stacey (1992) Stacey, F. D., 1992. Physics of the Earth, Brookfield Press.
Thomson (1863) Thomson, W., 1863. XXVII. on the rigidity of the earth, Philosophical Transactions of the Royal Society of London, 153, 573–582.
Tromp & Trampert (2018) Tromp, J. & Trampert, J., 2018. Effects of induced stress on seismic forward modelling and inversion, Geophysical Journal International, 213(2), 851–867.
Tromp et al. (2019) Tromp, J., Marcondes, M. L., Wentzcovitch, R. M. M., & Trampert, J., 2019. Effects of induced stress on seismic waves: Validation based on ab initio calculations, Journal of Geophysical Research: Solid Earth, 124(1), 729–741.
Truesdell & Noll (2004) Truesdell, C. & Noll, W., 2004. The Non-Linear Field Theories of Mechanics, Springer Berlin Heidelberg, Berlin, Heidelberg.
Walton (1974) Walton, K., 1974. The seismological effects of elastic pre-straining within the earth, Geophysical Journal International, 36(3), 651–671.

Appendix A Notations and definitions

A.1 Groups

We define $\mathbf{GL}(n)$ , the general linear group of dimension n, to be the set of invertible $n\times n$ matrices under the operation of matrix multiplication. For a general group $\mathbf{G}$ , a subgroup $\mathbf{H}$ of $\mathbf{G}$ is a subset of the elements of $\mathbf{G}$ that is itself a group; $\mathbf{H}$ is described as a proper subgroup of $\mathbf{G}$ if $\mathbf{H}\neq\mathbf{G}$ . With this, we can define $\mathbf{SL}(n)$ , the special linear group of $n\times n$ matrices with unit determinant, which is a proper subgroup of $\mathbf{GL}(n)$ . A particularly important proper subgroup of $\mathbf{SL}(n)$ is $\mathbf{SO}(n)$ , the n-dimensional special orthogonal group whose elements are rotation matrices in $n$ dimensions. For any $\mathbf{R}\in\mathbf{SO}(n)$ ,

	$\displaystyle\det{\mathbf{R}}$	$\displaystyle=1$		(A.1)
	$\displaystyle\mathbf{R}^{-1}$	$\displaystyle=\mathbf{R}^{T}.$		(A.2)

A.2 Some nonstandard linear operators

We have found it useful to introduce the left- and right-multiplication operators $\bm{\mathsf{L}}_{\mathbf{A}}$ and $\bm{\mathsf{R}}_{\mathbf{A}}$ which act according to

	$\displaystyle\bm{\mathsf{L}}_{\mathbf{A}}\cdot\mathbf{B}$	$\displaystyle=\mathbf{A}\mathbf{B}$		(A.3)
	$\displaystyle\bm{\mathsf{R}}_{\mathbf{A}}\cdot\mathbf{B}$	$\displaystyle=\mathbf{B}\mathbf{A}$		(A.4)

for arbitrary matrices $\mathbf{A},\mathbf{B}\in\mathbf{GL}(3)$ . These operators may be represented by fourth-rank tensors. An operator $\bm{\mathsf{O}}$ acts on a matrix $\mathbf{A}$ according to

\displaystyle\!\left(\bm{\mathsf{O}}\cdot\mathbf{A}\right)_{ij}=\mathsf{O}_{ijkl}A_{kl},

(A.5)

where we use a dot to represent the action of a linear operator on its operand (as is the case throughout this paper). Two such operators are composed by writing

\displaystyle\!\left(\bm{\mathsf{O}}^{1}\bm{\mathsf{O}}^{2}\right)_{ijkl}=\mathsf{O}^{1}_{ijpq}\mathsf{O}^{2}_{pqkl}.

(A.6)

The left- and right-multiplication operators are expressed in index-notation as

	$\displaystyle\!\left(\bm{\mathsf{L}}_{\mathbf{A}}\right)_{ijkl}$	$\displaystyle=A_{ik}\delta_{lj}$		(A.7)
	$\displaystyle\!\left(\bm{\mathsf{R}}_{\mathbf{A}}\right)_{ijkl}$	$\displaystyle=\delta_{ik}A_{lj},$		(A.8)

and, as an example,

$\displaystyle\!\left(\bm{\mathsf{L}}_{\mathbf{A}}\cdot\mathbf{B}\right)_{ij}$	$\displaystyle=\!\left(\bm{\mathsf{L}}_{\mathbf{A}}\right)_{ijkl}B_{kl}$
	$\displaystyle=A_{ik}\delta_{lj}B_{kl}$
	$\displaystyle=A_{ik}B_{kj}$
	$\displaystyle=\!\left(\mathbf{A}\mathbf{B}\right)_{ij}.$	(A.9)

It is clear that $\bm{\mathsf{L}}_{\mathbf{A}}$ and $\bm{\mathsf{R}}_{\mathbf{B}}$ commute for any choice of $\mathbf{A}$ and $\mathbf{B}$ , and that they satisfy

	$\displaystyle\bm{\mathsf{R}}_{\mathbf{A}\mathbf{B}}$	$\displaystyle=\bm{\mathsf{R}}_{\mathbf{B}}\bm{\mathsf{R}}_{\mathbf{A}}$		(A.10)
	$\displaystyle\bm{\mathsf{L}}_{\mathbf{A}\mathbf{B}}$	$\displaystyle=\bm{\mathsf{L}}_{\mathbf{A}}\bm{\mathsf{L}}_{\mathbf{B}},$		(A.11)

while their inverses have the property that

	$\displaystyle\bm{\mathsf{L}}_{\mathbf{A}}^{-1}$	$\displaystyle=\bm{\mathsf{L}}_{\mathbf{A}^{-1}}$		(A.12)
	$\displaystyle\bm{\mathsf{R}}_{\mathbf{A}}^{-1}$	$\displaystyle=\bm{\mathsf{R}}_{\mathbf{A}^{-1}}.$		(A.13)

We also define the operators

	$\displaystyle\bm{\mathsf{L}}_{\mathbf{A}}^{T}$	$\displaystyle=\bm{\mathsf{L}}_{\mathbf{A}^{T}}$		(A.14)
	$\displaystyle\bm{\mathsf{R}}_{\mathbf{A}}^{T}$	$\displaystyle=\bm{\mathsf{R}}_{\mathbf{A}^{T}}.$		(A.15)

Then, defining the inner-product on matrices by

\displaystyle\left\langle\mathbf{A},\mathbf{B}\right\rangle

\displaystyle=\mathrm{tr}\!\left(\mathbf{A}\mathbf{B}^{T}\right)

(A.16)

and introducing $\mathbf{C}\in\mathbf{GL}(3)$ , it follows quickly that

\displaystyle\left\langle\mathbf{A},\bm{\mathsf{L}}_{\mathbf{C}}\cdot\mathbf{B}\right\rangle=\left\langle\bm{\mathsf{L}}_{\mathbf{C}}^{T}\cdot\mathbf{A},\mathbf{B}\right\rangle,

(A.17)

and similarly for $\bm{\mathsf{R}}_{\mathbf{C}}$ . This is the origin of our suggestive notation for $\bm{\mathsf{L}}_{\mathbf{A}}^{T}$ and $\bm{\mathsf{R}}_{\mathbf{A}}^{T}$ : with the inner product as defined, they behave superficially as though they were the respective transpose operators of $\bm{\mathsf{L}}_{\mathbf{A}}$ and $\bm{\mathsf{R}}_{\mathbf{A}}$ . We also define a norm on matrices,

\displaystyle\|\mathbf{A}\|

\displaystyle=\sqrt{\left\langle\mathbf{A},\mathbf{A}\right\rangle},

(A.18)

and tensor-product,

\displaystyle\!\left(\mathbf{A}\otimes\mathbf{B}\right)\cdot\mathbf{C}=\left\langle\mathbf{B},\mathbf{C}\right\rangle\mathbf{A}.

(A.19)

In order to avoid clutter we have – just as for the inner product – simply used the standard notation for a tensor-product and norm on $\mathbb{R}^{3}$ , trusting that context will make our meaning unambiguous.

A particularly useful operator is

\displaystyle\bm{\mathsf{T}}_{\mathbf{A}}

\displaystyle=\bm{\mathsf{L}}_{\mathbf{A}}\bm{\mathsf{R}}_{\mathbf{A}}^{T}.

(A.20)

From the properties of $\bm{\mathsf{L}}_{\mathbf{A}}$ and $\bm{\mathsf{R}}_{\mathbf{A}}$ discussed above, we clearly have

	$\displaystyle\bm{\mathsf{T}}_{\mathbf{A}}^{-1}$	$\displaystyle=\bm{\mathsf{T}}_{\mathbf{A}^{-1}}$		(A.21)
	$\displaystyle\bm{\mathsf{T}}_{\mathbf{A}}^{T}$	$\displaystyle=\bm{\mathsf{T}}_{\mathbf{A}^{T}},$		(A.22)

as well as the useful functoral property

\displaystyle\bm{\mathsf{T}}_{\mathbf{A}\mathbf{B}}

\displaystyle=\bm{\mathsf{T}}_{\mathbf{A}}\bm{\mathsf{T}}_{\mathbf{B}}.

(A.23)

For $\mathbf{R}\in\mathbf{SO}(3)$ ,

\displaystyle\bm{\mathsf{T}}_{\mathbf{R}}\cdot\mathbf{A}=\mathbf{R}\mathbf{A}\mathbf{R}^{T}

(A.24)

evidently represents a rotation of $\mathbf{A}$ by $\mathbf{R}$ . The related operator

\displaystyle\bm{\mathsf{X}}_{\mathbf{A}}

\displaystyle=\bm{\mathsf{L}}_{\mathbf{A}}+\bm{\mathsf{R}}_{\mathbf{A}}^{T}

(A.25)

is the term in $\bm{\mathsf{T}}_{\mathbf{1}+\mathbf{A}}$ linear in $\mathbf{A}$ , and is particularly useful when we consider linearisation. Furthermore, if $\mathbf{A}$ and $\mathbf{B}$ are both symmetric the operator satisfies

\displaystyle\bm{\mathsf{X}}_{\mathbf{A}}\cdot\mathbf{B}=\bm{\mathsf{X}}_{\mathbf{B}}\cdot\mathbf{A}.

(A.26)

Finally, we must sometimes be careful when writing these operators in component form. Consider the expression for the quadratic part of an isotropic strain-energy function (e.g. Holzapfel, 2000):

\displaystyle V_{2}\!\left(\mathbf{C}\right)=\frac{\lambda}{8}\mathrm{tr}\!\left(\mathbf{C}\right)^{2}+\frac{\mu}{4}\mathrm{tr}\!\left(\mathbf{C}^{2}\right).

(A.27)

It is easy to show that the stress vanishes at $\mathbf{C}=\mathbf{1}$ . In such a stress-free equilibrium we have (eq.2.42)

\displaystyle\bm{\mathsf{a}}=4D^{2}V_{2}\!\left(\mathbf{1}\right).

(A.28)

We can evaluate the derivative by using the definitions above to rewrite $V_{2}$ as

	$\displaystyle V_{2}\!\left(\mathbf{C}\right)$	$\displaystyle=\frac{\lambda}{8}\left\langle\mathbf{1},\mathbf{C}\right\rangle^{2}+\frac{\mu}{4}\left\langle\mathbf{C},\bm{\mathsf{id}}\cdot\mathbf{C}\right\rangle$
		$\displaystyle=\left\langle\mathbf{C},\!\left(\frac{\lambda}{8}\mathbf{1}\otimes\mathbf{1}+\frac{\mu}{4}\bm{\mathsf{id}}\right)\cdot\mathbf{C}\right\rangle,$		(A.29)

where we further define the identity operator $\bm{\mathsf{id}}$ , which has components

\displaystyle\!\left(\bm{\mathsf{id}}\right)_{ijkl}=\delta_{ik}\delta_{jl}.

(A.30)

However, in the present context this is not the correct identity operator to use. When differentiating strain-energy functions that depend on the symmetric matrix $\mathbf{C}$ we find ourselves dealing with operators that naturally act on symmetric matrices. Therefore we must remind ourselves that the component-form expressions for these operators will be symmetric on each pair of indices. We denote this by an overline. In general, the process of symmetrisation is defined as follows. The tensor $\mathbf{M}$ is symmetrised on its $p$ ’th pair of indices by

\displaystyle M_{i_{1}j_{1}i_{2}j_{2}\dots i_{p}j_{p}\dots i_{n}j_{n}}\rightarrow M_{i_{1}j_{1}i_{2}j_{2}\dots\!\left(i_{p}j_{p}\right)\dots i_{n}j_{n}}\equiv\frac{1}{2}\!\left(M_{i_{1}j_{1}i_{2}j_{2}\dots i_{p}j_{p}\dots i_{n}j_{n}}+M_{i_{1}j_{1}i_{2}j_{2}\dots j_{p}i_{p}\dots i_{n}j_{n}}\right).

(A.31)

A fourth-rank tensor $\bm{\mathsf{M}}$ , for example, is symmetrised on both pairs of indices by writing

\displaystyle(\overline{\bm{\mathsf{M}}})_{ijkl}=\mathsf{M}_{(ij)(kl)}=\frac{1}{4}\!\left(\mathsf{M}_{ijkl}+\mathsf{M}_{ijlk}+\mathsf{M}_{jikl}+\mathsf{M}_{jikl}\right).

(A.32)

The symmetrised identity operator is thus written as

\displaystyle(\overline{\bm{\mathsf{id}}})_{ijkl}=\frac{1}{2}\!\left(\delta_{ik}\delta_{jl}+\delta_{il}\delta_{jk}\right).

(A.33)

Substituting this into eq.(A.2) will yield the correct component-form expression for the elastic tensor. Note that this process confers the minor elastic symmetries on a fourth-rank tensor, but not necessarily the hyperelastic symmetry.

A.3 Example: Differentiation of the strain-energy functions $W$ and $V$

All the calculations discussed in this paper could in principle be performed using index notation. However, we found it to be too cumbersome to carry out the calculations of Appendix C. The purpose of this appendix is to explain how the operator-based notation detailed above can be used to carry out the differentiations of strain-energy functions in Sections 2 and 3. It avoids clutter in the main text and moreover illustrates the manipulation of these operators.

For an arbitrary $\mathbf{F}\in\mathbf{GL}(3)$ and $\mathbf{C}=\mathbf{F}^{T}\mathbf{F}$ we have

\displaystyle W\!\left(\mathbf{F}\right)=V\!\left(\mathbf{C}\right),

(A.34)

neglecting the spatial argument to avoid clutter. We define small perturbations to $\mathbf{F}$ and $\mathbf{C}$ , respectively $\delta\mathbf{F}$ and $\delta\mathbf{C}$ , such that

	$\displaystyle W(\mathbf{F}+\delta\mathbf{F})-W(\mathbf{F})$	$\displaystyle=\left\langle DW(\mathbf{F}),\delta\mathbf{F}\right\rangle+\mathcal{O}(\\|\delta\mathbf{F}\\|^{2})$		(A.35)
	$\displaystyle V(\mathbf{C}+\delta\mathbf{C})-V(\mathbf{C})$	$\displaystyle=\left\langle DV(\mathbf{C}),\delta\mathbf{C}\right\rangle+\mathcal{O}(\\|\delta\mathbf{C}\\|^{2})$		(A.36)

where $\left\langle\cdot,\cdot\right\rangle$ is the inner product for matrices (eq.A.16) and $\|\cdot\|$ is the corresponding norm (eq.A.18). It follows from the definition of $\mathbf{C}$ that

\delta\mathbf{C}=\mathbf{F}^{T}\delta\mathbf{F}+\delta\mathbf{F}^{T}\mathbf{F}+\mathcal{O}(\|\delta\mathbf{F}\|^{2}),

(A.37)

which allows us to write

\displaystyle\left\langle DW(\mathbf{F}),\delta\mathbf{F}\right\rangle=\left\langle DV(\mathbf{C}),\mathbf{F}^{T}\delta\mathbf{F}+\delta\mathbf{F}^{T}\mathbf{F}\right\rangle.

(A.38)

Now, for any matrices $\mathbf{A}$ and $\mathbf{B}$ our inner product satisfies $\left\langle\mathbf{A},\mathbf{B}\right\rangle=\left\langle\mathbf{A}^{T},\mathbf{B}^{T}\right\rangle$ . This identity, along with the fact that $DV(\mathbf{C})$ is necessarily symmetric, implies that

\displaystyle\left\langle DV(\mathbf{C}),\delta\mathbf{F}^{T}\mathbf{F}\right\rangle=\left\langle DV(\mathbf{C}),\mathbf{F}^{T}\delta\mathbf{F}\right\rangle,

(A.39)

and therefore

\displaystyle\left\langle DW(\mathbf{F}),\delta\mathbf{F}\right\rangle=2\left\langle DV(\mathbf{C}),\mathbf{F}^{T}\delta\mathbf{F}\right\rangle.

(A.40)

Looking at the right hand side, the left multiplication operator defined in eq.(A.3) lets us write

\displaystyle\mathbf{F}^{T}\delta\mathbf{F}=\bm{\mathsf{L}}_{\mathbf{F}}^{T}\cdot\delta\mathbf{F},

(A.41)

where we have made use of eq.(A.14). Using this notation we find that

\displaystyle\left\langle DV(\mathbf{C}),\mathbf{F}^{T}\delta\mathbf{F}\right\rangle=\left\langle DV(\mathbf{C}),\bm{\mathsf{L}}_{\mathbf{F}}^{T}\cdot\delta\mathbf{F}\right\rangle=\left\langle\bm{\mathsf{L}}_{\mathbf{F}}\cdot DV(\mathbf{C}),\delta\mathbf{F}\right\rangle.

(A.42)

Therefore we conclude that

\displaystyle\left\langle DW(\mathbf{F}),\delta\mathbf{F}\right\rangle=\left\langle 2\bm{\mathsf{L}}_{\mathbf{F}}\cdot DV(\mathbf{C}),\delta\mathbf{F}\right\rangle.

(A.43)

Because the perturbation $\delta\mathbf{F}$ was arbitrary, we have established the identity

\displaystyle DW(\mathbf{F})=2\bm{\mathsf{L}}_{\mathbf{F}}\cdot DV(\mathbf{C}).

(A.44)

The derivation of the second-derivative follows a similar pattern. We take as a starting point the definitions

	$\displaystyle DW(\mathbf{F}+\delta\mathbf{F})-DW(\mathbf{F})$	$\displaystyle=D^{2}W\!\left(\mathbf{F}\right)\cdot\delta\mathbf{F}+\mathcal{O}(\\|\delta\mathbf{F}\\|^{2})$		(A.45)
	$\displaystyle DV(\mathbf{C}+\delta\mathbf{C})-DV(\mathbf{C})$	$\displaystyle=D^{2}V\!\left(\mathbf{C}\right)\cdot\delta\mathbf{C}+\mathcal{O}(\\|\delta\mathbf{C}\\|^{2}),$		(A.46)

for the previously defined $\delta\mathbf{F}$ and $\delta\mathbf{C}$ . Using eq.(A.44) the left hand side of eq.(A.45) can be written

\displaystyle 2\bm{\mathsf{L}}_{\mathbf{F}+\delta\mathbf{F}}\cdot DV(\mathbf{C}+\delta\mathbf{C})-2\bm{\mathsf{L}}_{\mathbf{F}}\cdot DV(\mathbf{C}).

(A.47)

Expanding this out to first-order it follows that

\displaystyle D^{2}W\!\left(\mathbf{F}\right)\cdot\delta\mathbf{F}=2\bm{\mathsf{L}}_{\delta\mathbf{F}}\cdot DV\!\left(\mathbf{C}\right)+2\bm{\mathsf{L}}_{\mathbf{F}}\cdot\!\left[D^{2}V\!\left(\mathbf{C}\right)\cdot\!\left(\mathbf{F}^{T}\delta\mathbf{F}+\delta\mathbf{F}^{T}\mathbf{F}\right)\right],

(A.48)

where we have used eqs. (A.37) and (A.46). To proceed, we first note that $\bm{\mathsf{L}}_{\delta\mathbf{F}}\cdot DV\!\left(\mathbf{C}\right)=\bm{\mathsf{R}}_{DV\!\left(\mathbf{C}\right)}\cdot\delta\mathbf{F}$ by definition of the left and right multiplication operators. Secondly,

\displaystyle D^{2}V\!\left(\mathbf{C}\right)\cdot(\mathbf{F}^{T}\delta\mathbf{F}+\delta\mathbf{F}^{T}\mathbf{F})=2D^{2}V\!\left(\mathbf{C}\right)\cdot(\mathbf{F}^{T}\delta\mathbf{F})

(A.49)

because $V$ is a function on symmetric matrices. Given that $\delta\mathbf{F}$ is arbitrary, this establishes our desired expression for the second-derivative of $W$ :

\displaystyle D^{2}W\!\left(\mathbf{F}\right)=2\bm{\mathsf{R}}_{DV\!\left(\mathbf{C}\right)}+4\bm{\mathsf{L}}_{\mathbf{F}}D^{2}V\!\left(\mathbf{C}\right)\,\bm{\mathsf{L}}_{\mathbf{F}}^{T}.

(A.50)

A.4 Mapping between different notation conventions

As stated above, Sections 1 and 5 are written in a notation close to that of Dahlen & Tromp (1998) whilst for the middle sections we use a notation heavily inspired by Marsden & Hughes (1994). Table 1 summarises the relationship between the two systems. Besides differing choices of letter for various quantities, the principal difference between the two systems concerns index placement. All second-rank tensors must have their indices switched to go from one convention to the other, and for tensors of higher (even) rank the same applies to each pair of indices individually; this is illustrated schematically in Table 1’s last two lines. Note also that we do not follow MH’s convention of using upper- and lower-case indices to distinguish between referential and spatial coordinate-systems because we consider both systems to be implicitly Cartesian. Finally, where a quantity of interest does not appear in MH we have notated it with the same letter as DT, although in Sections 2–4 we consistently place our indices according to MH’s conventions; such quantities do not appear in the table.

To conclude this section we “translate” eqs.(3.38) (reproduced in Table 1) into the notation of Section 1, giving expressions for the components of eq.(5.2)’s tensors $\bm{\Pi}$ and $\bm{\Pi}^{\prime}$ . Let us first recall the elastic tensor $\bm{\Upsilon}$ that is defined by Dahlen & Tromp (1998, eq.3.123) to have components

\displaystyle\Upsilon_{ijkl}=\Xi_{ijkl}+T^{0}_{ik}\delta_{jl}+T^{0}_{jk}\delta_{il}-T^{0}_{ij}\delta_{kl}.

(A.51)

We now define the very closely related tensor $\mathbf{Y}$ , whose components are

\displaystyle Y_{ijkl}=\Gamma_{ijkl}+T^{B}_{ik}\delta_{jl}+T^{B}_{jk}\delta_{il}-T^{B}_{ij}\delta_{kl},

(A.52)

and we then symmetrise it to form a new tensor $\overline{\mathbf{Y}}$ , which has components

\displaystyle\overline{Y}_{ijkl}=Y_{ij(kl)}

(A.53)

and satisfies the symmetries

\displaystyle\overline{Y}_{jikl}=\overline{Y}_{ijkl}=\overline{Y}_{ijlk}.

(A.54)

The inverse tensor $\overline{\mathbf{Y}}^{-1}$ is then defined by the relation

\displaystyle\overline{Y}_{ijab}\overline{Y}^{-1}_{abkl}=\frac{1}{2}\!\left(\delta_{ik}\delta_{jl}+\delta_{jk}\delta_{il}\right).

(A.55)

We can now rewrite the generalised stress-strain relationship (eq.3.38b) as

\displaystyle\mathbf{u}^{1}=\overline{\mathbf{Y}}^{-1}\!:\!\left(\Delta\mathbf{T}^{0}-\bm{\omega}^{1}\cdot\mathbf{T}^{B}-\mathbf{T}^{B}\cdot\bm{\omega}^{1}\right).

(A.56)

Eq.(3.38a) can be dealt with by defining two final tensors:

	$\displaystyle\Theta_{ijklmn}$	$\displaystyle=\delta_{im}\Gamma_{njkl}+\delta_{jm}\Gamma_{inkl}+\delta_{km}\Gamma_{ijnl}+\delta_{lm}\Gamma_{ijkn}-\Gamma_{ijkl}\delta_{mn}+\rho^{0}\frac{\partial^{3}U^{L}}{\partial E^{L}_{ij}\partial E^{L}_{kl}\partial E^{L}_{mn}},$		(A.57)
	$\displaystyle\Psi_{ijklmn}$	$\displaystyle=\delta_{im}\Gamma_{njkl}+\delta_{jm}\Gamma_{inkl}+\delta_{km}\Gamma_{ijnl}+\delta_{lm}\Gamma_{ijkn}.$		(A.58)

The components of $\bm{\Pi}$ and $\bm{\Pi}^{\prime}$ are then given by

	$\displaystyle\Pi_{ijklmn}$	$\displaystyle=\Theta_{ijklpq}\overline{Y}^{-1}_{pqmn},$		(A.59)
	$\displaystyle\Pi^{\prime}_{ijklmn}$	$\displaystyle=\Theta_{ijklpq}\!\left(\overline{Y}^{-1}_{pqrn}T^{B}_{rm}-\overline{Y}^{-1}_{pqrm}T^{B}_{rn}\right)+\Psi_{ijklmn}.$		(A.60)

Table 1: The mapping between the two major notation conventions used in this paper. See the main text for a discussion of the components of

\bm{\Pi}

and

\bm{\Pi}^{\prime}

. Note also that the first and second elastic tensors are sometimes represented by

\bm{\mathsf{A}}

and

\bm{\mathsf{C}}

; we have listed

\bm{\mathsf{a}}

and

\bm{\mathsf{c}}

in the table because, like the elastic tensors of Dahlen & Tromp, they are expressed relative to a natural reference configuration.

Quantity	Dahlen & Tromp	Marsden & Hughes
First elastic tensor	$\bm{\Lambda}$	$\bm{\mathsf{a}}$
Second elastic tensor	$\bm{\Xi}$	$\bm{\mathsf{c}}$
Background (second) elastic tensor	$\bm{\Gamma}$	$\bm{\mathsf{c}}^{0}$
Incremental second elastic tensor	$\Delta\bm{\Xi}$	$\bm{\mathsf{c}}^{1}$
First Piola-Kirchhoff stress	$\mathbf{T}^{PK}$	$\mathbf{P}$
Second Piola-Kirchhoff stress	$\mathbf{T}^{SK}$	$\mathbf{S}$
Eulerian Cauchy stress	$\mathbf{T}^{E}$	$\bm{\sigma}$
Equilibrium Cauchy stress	$\mathbf{T}^{0}$	$\bm{\sigma}_{e}$ (shortened to $\bm{\sigma}$ in Sections 3 and 4)
Background Cauchy stress	$\mathbf{T}^{B}$	$\bm{\sigma}^{0}$
Incremental Cauchy stress	$\Delta\mathbf{T}^{0}$	$\bm{\sigma}^{1}$
Incremental pressure	$\Delta p^{0}$	$p^{1}$
Incremental deviatoric stress	$\Delta\bm{\tau}^{0}$	$\bm{\tau}^{1}$
Polar decomposition theorem	$\mathbf{F}=\mathbf{Q}\mathbf{R}=\mathbf{L}\mathbf{Q}$	$\mathbf{F}=\mathbf{R}\mathbf{U}=\mathbf{V}\mathbf{R}$
Central equations (3.14)	$\!\begin{aligned} T^{0}_{ij}&=\frac{\rho^{0}}{\det\mathbf{F}}F_{ip}F_{jq}\frac{\partial U^{L}}{\partial E^{L}_{pq}}\\ \Xi_{ijkl}&=\frac{\rho^{0}}{\det\mathbf{F}}F_{ip}F_{jq}F_{kr}F_{ls}\frac{\partial^{2}U^{L}}{\partial E^{L}_{pq}\partial E^{L}_{rs}}\end{aligned}$	$\!\begin{aligned} \bm{\sigma}&=2J_{\mathbf{F}}^{-1}\bm{\mathsf{T}}_{\mathbf{F}}\cdot D\tilde{V}\!\left(\mathbf{C}\right)\\ \bm{\mathsf{a}}&=4J_{\mathbf{F}}^{-1}\bm{\mathsf{T}}_{\mathbf{F}}D^{2}\tilde{V}\!\left(\mathbf{C}\right)\bm{\mathsf{T}}_{\mathbf{F}}^{T}+\bm{\mathsf{R}}_{\bm{\sigma}}\end{aligned}$
Linearised expressions (3.38)	$\!\begin{aligned} \Delta\Xi_{ijkl}&=\Pi_{ijklmn}\Delta T^{0}_{mn}+\Pi^{\prime}_{ijklmn}\omega^{1}_{mn}\end{aligned}$	$\!\begin{aligned} \bm{\mathsf{c}}^{1}&=\bm{\mathsf{X}}_{\mathbf{u}^{1}+\bm{\omega}^{1}}\bm{\mathsf{c}}^{0}+\bm{\mathsf{c}}^{0}\bm{\mathsf{X}}_{\mathbf{u}^{1}-\bm{\omega}^{1}}-\bm{\mathsf{c}}^{0}\mathrm{tr}\!\left(\mathbf{u}^{1}\right)+8D^{3}\tilde{V}\!\left(\mathbf{1}\right)\cdot\mathbf{u}^{1}\\ \mathbf{u}^{1}&=\!\left(\bm{\mathsf{c}}^{0}+\overline{\bm{\mathsf{X}}}_{\bm{\sigma}^{0}}-\bm{\sigma}^{0}\otimes\mathbf{1}\right)^{-1}\cdot\!\left(\bm{\sigma}^{1}-\bm{\mathsf{X}}_{\bm{\omega}^{1}}\cdot\bm{\sigma}^{0}\right)\end{aligned}$
Any second-rank tensor	$\!\left[\mathbf{D\&T}\right]_{ij}$	$\!\left[\mathbf{M\&H}\right]_{ji}$
Any fourth-rank tensor	$\!\left[\mathbf{D\&T}\right]_{ijkl}$	$\!\left[\mathbf{M\&H}\right]_{jilk}$

Appendix B The invertibility of $\hat{\bm{\Sigma}}$

The existence of the inverse function $\hat{\bm{\Sigma}}^{-1}$ depends on the specific choice of constitutive relation; without fixing $\tilde{V}$ concretely it is difficult to obtain definite results. However, we can show that the inverse mapping exists at least locally in cases of some physical importance. To proceed, we appeal to the inverse function theorem (e.g. Marsden & Hughes, 1994). The theorem tells us that the nonlinear mapping $\hat{\bm{\Sigma}}$ has a well-defined inverse function in some open neighbourhood of $\hat{\bm{\Sigma}}\!\left(\mathbf{U}\right)$ if $D\hat{\bm{\Sigma}}(\mathbf{U})$ , a linear mapping from the vector-space of symmetric matrices into itself, is invertible.

Let us work in the vicinity of $\mathbf{U}=\mathbf{1}$ . We will set $\mathbf{R}=\mathbf{1}$ for clarity, but that does not affect the validity of our argument. In that case we have

\displaystyle\hat{\bm{\Sigma}}\!\left(\mathbf{1}\right)

\displaystyle=2DV\!\left(\mathbf{1}\right).

(B.1)

Making use of eq.(A.26) of Appendix A, we can expand $\hat{\bm{\Sigma}}$ about the identity as

$\displaystyle\hat{\bm{\Sigma}}\!\left(\mathbf{1}+\mathbf{u}\right)$	$\displaystyle=2\!\left[1-\mathrm{tr}\!\left(\mathbf{u}\right)\right]\!\left[\bm{\mathsf{id}}+\bm{\mathsf{X}}_{\mathbf{u}}\right]\cdot\!\left[DV\!\left(\mathbf{1}\right)+2D^{2}V\!\left(\mathbf{1}\right)\cdot\mathbf{u}\right]$
	$\displaystyle=\hat{\bm{\Sigma}}\!\left(\mathbf{1}\right)-\mathrm{tr}\!\left(\mathbf{u}\right)\hat{\bm{\Sigma}}\!\left(\mathbf{1}\right)+\bm{\mathsf{X}}_{\mathbf{u}}\cdot\hat{\bm{\Sigma}}\!\left(\mathbf{1}\right)+4D^{2}V\!\left(\mathbf{1}\right)\cdot\mathbf{u}$
	$\displaystyle=\hat{\bm{\Sigma}}\!\left(\mathbf{1}\right)+\!\left(4D^{2}V\!\left(\mathbf{1}\right)+\bm{\mathsf{X}}_{\hat{\bm{\Sigma}}\!\left(\mathbf{1}\right)}-\hat{\bm{\Sigma}}\!\left(\mathbf{1}\right)\otimes\mathbf{1}\right)\cdot\mathbf{u},$	(B.2)

with $\mathbf{u}$ a small symmetric matrix. We have shown that

\displaystyle D\hat{\bm{\Sigma}}\!\left(\mathbf{1}\right)=4D^{2}V\!\left(\mathbf{1}\right)+\overline{\bm{\mathsf{X}}}_{\hat{\bm{\Sigma}}\!\left(\mathbf{1}\right)}-\hat{\bm{\Sigma}}\!\left(\mathbf{1}\right)\otimes\mathbf{1},

(B.3)

where we have written $\bm{\mathsf{X}}_{\hat{\bm{\Sigma}}\!\left(\mathbf{1}\right)}$ in its explicitly symmetrised form $\overline{\bm{\mathsf{X}}}_{\hat{\bm{\Sigma}}\!\left(\mathbf{1}\right)}$ (eq.A.32). Inspection of expression (B.3) indicates that the invertibility of $D^{2}V\!\left(\mathbf{1}\right)$ is a sufficient condition for $D\hat{\bm{\Sigma}}\!\left(\mathbf{1}\right)$ to possess a unique inverse locally, in the absence of equilibrium stress. As long as $\hat{\bm{\Sigma}}\!\left(\mathbf{1}\right)$ is not too large, invertibility of $D^{2}V\!\left(\mathbf{1}\right)$ should remain sufficient for the local existence of a unique $\hat{\bm{\Sigma}}^{-1}$ . One can show that $D^{2}V\!\left(\mathbf{1}\right)$ being invertible is equivalent to the condition of linearised stability of the equilibrium (e.g. Marsden & Hughes, 1994). It is essential to enforce linearised stability, for otherwise unphysical motions are permitted in which the strain-energy decreases upon deformation.

In summary, we have argued that linearised stability of the equilibrium is a sufficient condition for $\hat{\bm{\Sigma}}$ to possess a unique inverse in the neighbourhood of $\hat{\bm{\Sigma}}\!\left(\mathbf{1}\right)$ , as long as the equilibrium stress is not too large. By no means is this a proof of the global invertibility of $\hat{\bm{\Sigma}}$ , but we believe that it makes it plausible that we should be able to write down a well-defined inverse function $\hat{\bm{\Sigma}}^{-1}$ when required.

Appendix C Calculation of the linearised elastic tensor

C.1 General expressions

We assume that the body is initially in a state described by

	$\displaystyle\bm{\sigma}$	$\displaystyle=\bm{\sigma}^{0}$		(C.1)
	$\displaystyle\bm{\mathsf{a}}$	$\displaystyle=\bm{\mathsf{a}}^{0}.$		(C.2)

Both quantities are considered to be the derivatives of some background strain-energy evaluated at the identity, so that $\bm{\mathsf{a}}^{0}$ is given uniquely by

\displaystyle\bm{\mathsf{a}}^{0}=\bar{\bm{\mathsf{a}}}\!\left(\bm{\sigma}^{0},\mathbf{1}\right).

(C.3)

Once again, the function $\bar{\bm{\mathsf{a}}}$ has the form

\displaystyle\bar{\bm{\mathsf{a}}}\!\left(\bm{\sigma},\mathbf{R}\right)=\bm{\mathsf{T}}_{\mathbf{R}}\!\left[\!\left(\hat{\bm{\mathsf{A}}}\circ\hat{\bm{\Sigma}}^{-1}\right)\!\left(\bm{\mathsf{T}}_{\mathbf{R}}^{T}\cdot\bm{\sigma}\right)\right]\bm{\mathsf{T}}_{\mathbf{R}}^{T}.

(C.4)

We then introduce a small change in the stress:

\displaystyle\bm{\sigma}=\bm{\sigma}^{0}+\bm{\sigma}^{1}

(C.5)

for small $\bm{\sigma}^{1}$ . The system is assumed to be perturbative, so that we may write

\displaystyle\bm{\mathsf{a}}=\bm{\mathsf{a}}^{0}+\bm{\mathsf{a}}^{1}

(C.6)

for small $\bm{\mathsf{a}}^{1}$ . For the rest of this section we will neglect all terms higher than first order.

Under the framework of Section 3, the change in stress is considered to be induced by some deformation gradient $\mathbf{F}$ , so it follows that $\mathbf{F}$ should take the form

\displaystyle\mathbf{F}=\mathbf{1}+\mathbf{f}.

(C.7)

with $\mathbf{f}$ small. Now, the arbitrary rotation matrix $\mathbf{R}$ in eq.(C.4) is a relic of $\mathbf{F}$ , having been introduced by the identity

\displaystyle\mathbf{F}=\mathbf{R}\mathbf{U}.

(C.8)

Therefore it too must be considered to be a small perturbation to its background value:

\displaystyle\mathbf{R}=\mathbf{1}+\bm{\omega},

(C.9)

with $\bm{\omega}$ some small antisymmetric matrix. The same is true for $\mathbf{U}$ , that is

\displaystyle\mathbf{U}=\mathbf{1}+\mathbf{u},

(C.10)

but with the small matrix $\mathbf{u}$ of course symmetric. It is clear that the elastic tensor satisfies

\displaystyle\bm{\mathsf{a}}^{0}+\bm{\mathsf{a}}^{1}=\bar{\bm{\mathsf{a}}}\!\left(\bm{\sigma}^{0}+\bm{\sigma}^{1},\mathbf{1}+\bm{\omega}\right),

(C.11)

which we may expand in a Taylor series to find

\displaystyle\bm{\mathsf{a}}^{1}=\!\left(D_{\bm{\sigma}}\bar{\bm{\mathsf{a}}}\right)\!\left(\bm{\sigma}^{0},\mathbf{1}\right)\cdot\bm{\sigma}^{1}+\!\left(D_{\mathbf{R}}\bar{\bm{\mathsf{a}}}\right)\!\left(\bm{\sigma}^{0},\mathbf{1}\right)\cdot\bm{\omega}.

(C.12)

The $\mathbf{R}$ -derivative can conveniently be written in terms of the $\bm{\sigma}$ -derivative. Using the full form of $\bar{\bm{\mathsf{a}}}$ from eq.(C.4), as well as the operator $\bm{\mathsf{id}}$ defined in eq.(A.30),

$\displaystyle\bar{\bm{\mathsf{a}}}\!\left(\bm{\sigma},\mathbf{1}+\bm{\omega}\right)$	$\displaystyle=\!\left(\bm{\mathsf{id}}+\bm{\mathsf{X}}_{\bm{\omega}}\right)\!\left[\!\left(\hat{\bm{\mathsf{A}}}\circ\hat{\bm{\Sigma}}^{-1}\right)\!\left(\bm{\sigma}-\bm{\mathsf{X}}_{\bm{\omega}}\cdot\bm{\sigma}\right)\right]\!\left(\bm{\mathsf{id}}-\bm{\mathsf{X}}_{\bm{\omega}}\right)$
	$\displaystyle=\!\left(\hat{\bm{\mathsf{A}}}\circ\hat{\bm{\Sigma}}^{-1}\right)\!\left(\bm{\sigma}-\bm{\mathsf{X}}_{\bm{\omega}}\cdot\bm{\sigma}\right)+\bm{\mathsf{X}}_{\bm{\omega}}\!\left[\!\left(\hat{\bm{\mathsf{A}}}\circ\hat{\bm{\Sigma}}^{-1}\right)\!\left(\bm{\sigma}\right)\right]-\!\left[\!\left(\hat{\bm{\mathsf{A}}}\circ\hat{\bm{\Sigma}}^{-1}\right)\!\left(\bm{\sigma}\right)\right]\bm{\mathsf{X}}_{\bm{\omega}}$
	$\displaystyle=\bar{\bm{\mathsf{a}}}\!\left(\bm{\sigma}-\bm{\mathsf{X}}_{\bm{\omega}}\cdot\bm{\sigma},\mathbf{1}\right)+\bm{\mathsf{X}}_{\bm{\omega}}\bar{\bm{\mathsf{a}}}\!\left(\bm{\sigma},\mathbf{1}\right)-\bar{\bm{\mathsf{a}}}\!\left(\bm{\sigma},\mathbf{1}\right)\bm{\mathsf{X}}_{\bm{\omega}}$
	$\displaystyle=\bm{\mathsf{a}}^{0}-\!\left(D_{\bm{\sigma}}\bar{\bm{\mathsf{a}}}\right)\!\left(\bm{\sigma},\mathbf{1}\right)\cdot\!\left(\bm{\mathsf{X}}_{\bm{\omega}}\cdot\bm{\sigma}\right)+\bm{\mathsf{X}}_{\bm{\omega}}\bm{\mathsf{a}}^{0}-\bm{\mathsf{a}}^{0}\bm{\mathsf{X}}_{\bm{\omega}},$	(C.13)

from which it is apparent that

\displaystyle\!\left(D_{\mathbf{R}}\bar{\bm{\mathsf{a}}}\right)\!\left(\bm{\sigma}^{0},\mathbf{1}\right)\cdot\bm{\omega}=\bm{\mathsf{X}}_{\bm{\omega}}\bm{\mathsf{a}}^{0}-\bm{\mathsf{a}}^{0}\bm{\mathsf{X}}_{\bm{\omega}}-\!\left(D_{\bm{\sigma}}\bar{\bm{\mathsf{a}}}\right)\!\left(\bm{\sigma}^{0},\mathbf{1}\right)\cdot\!\left(\bm{\mathsf{X}}_{\bm{\omega}}\cdot\bm{\sigma}^{0}\right).

(C.14)

Substituting this into eq.(C.12), the perturbation to the elastic tensor is

\displaystyle\bm{\mathsf{a}}^{1}=\bm{\mathsf{X}}_{\bm{\omega}}\bm{\mathsf{a}}^{0}-\bm{\mathsf{a}}^{0}\bm{\mathsf{X}}_{\bm{\omega}}+\!\left(D_{\bm{\sigma}}\bar{\bm{\mathsf{a}}}\right)\!\left(\bm{\sigma}^{0},\mathbf{1}\right)\cdot\!\left(\bm{\sigma}^{1}-\bm{\mathsf{X}}_{\bm{\omega}}\cdot\bm{\sigma}^{0}\right).

(C.15)

Calculation of $D_{\bm{\sigma}}\bar{\bm{\mathsf{a}}}\!\left(\bm{\sigma}^{0},\mathbf{1}\right)$ requires us to find $D\hat{\bm{\mathsf{A}}}\!\left(\mathbf{1}\right)$ and $D\hat{\bm{\Sigma}}\!\left(\mathbf{1}\right)$ . They are evaluated most conveniently by forming explicit binomial expansions of $\hat{\bm{\Sigma}}$ and $\hat{\bm{\mathsf{A}}}$ about $\mathbf{U}=\mathbf{1}+\mathbf{u}$ . At zeroth order

	$\displaystyle\hat{\bm{\Sigma}}\!\left(\mathbf{1}\right)$	$\displaystyle=\bm{\sigma}^{0}$	$\displaystyle=2DV\!\left(\mathbf{1}\right)$		(C.16)
	$\displaystyle\hat{\bm{\mathsf{A}}}\!\left(\mathbf{1}\right)$	$\displaystyle=\bm{\mathsf{a}}^{0}$	$\displaystyle=4D^{2}V\!\left(\mathbf{1}\right)+\bm{\mathsf{R}}_{\hat{\bm{\Sigma}}\!\left(\mathbf{1}\right)},$		(C.17)

and it is useful to define

\displaystyle\bm{\mathsf{c}}^{0}=4D^{2}V\!\left(\mathbf{1}\right)=\hat{\bm{\mathsf{A}}}\!\left(\mathbf{1}\right)-\bm{\mathsf{R}}_{\hat{\bm{\Sigma}}\!\left(\mathbf{1}\right)}.

(C.18)

We showed in Appendix B that

\displaystyle D\hat{\bm{\Sigma}}\!\left(\mathbf{1}\right)=\bm{\mathsf{c}}^{0}+\overline{\bm{\mathsf{X}}}_{\bm{\sigma}^{0}}-\bm{\sigma}^{0}\otimes\mathbf{1}.

(C.19)

In the same manner, $\hat{\bm{\mathsf{A}}}$ expands as

	$\displaystyle\hat{\bm{\mathsf{A}}}\!\left(\mathbf{1}+\mathbf{u}\right)$	$\displaystyle=4\!\left(1-\mathrm{tr}\!\left(\mathbf{u}\right)\right)\!\left(\bm{\mathsf{id}}+\bm{\mathsf{X}}_{\mathbf{u}}\right)\!\left[D^{2}V\!\left(\mathbf{1}\right)+2D^{3}V\!\left(\mathbf{1}\right)\cdot\mathbf{u}\right]\!\left(\bm{\mathsf{id}}+\bm{\mathsf{X}}_{\mathbf{u}}\right)+\bm{\mathsf{R}}_{\hat{\bm{\Sigma}}\!\left(\mathbf{1}+\mathbf{u}\right)}$
		$\displaystyle=\bm{\mathsf{c}}^{0}+8D^{3}V\!\left(\mathbf{1}\right)\cdot\mathbf{u}+\bm{\mathsf{X}}_{\mathbf{u}}\bm{\mathsf{c}}^{0}+\bm{\mathsf{c}}^{0}\bm{\mathsf{X}}_{\mathbf{u}}-\bm{\mathsf{c}}^{0}\mathrm{tr}\!\left(\mathbf{u}\right)+\bm{\mathsf{R}}_{\bm{\sigma}^{0}}+\bm{\mathsf{R}}_{D\hat{\bm{\Sigma}}\!\left(\mathbf{1}\right)\cdot\mathbf{u}},$		(C.20)

so its derivative at the identity is given by

\displaystyle D\hat{\bm{\mathsf{A}}}\!\left(\mathbf{1}\right)\cdot\mathbf{u}=\bm{\mathsf{R}}_{D\hat{\bm{\Sigma}}\!\left(\mathbf{1}\right)\cdot\mathbf{u}}+8D^{3}V\!\left(\mathbf{1}\right)\cdot\mathbf{u}+\bm{\mathsf{X}}_{\mathbf{u}}\bm{\mathsf{c}}^{0}+\bm{\mathsf{c}}^{0}\bm{\mathsf{X}}_{\mathbf{u}}-\bm{\mathsf{c}}^{0}\mathrm{tr}\!\left(\mathbf{u}\right)

(C.21)

for any $\mathbf{u}$ .

Now we can compute $D_{\bm{\sigma}}\bar{\bm{\mathsf{a}}}\!\left(\bm{\sigma}^{0},\mathbf{1}\right)$ by writing

$\displaystyle\bar{\bm{\mathsf{a}}}\!\left(\bm{\sigma}^{0}+\bm{\Sigma},\mathbf{1}\right)$	$\displaystyle=\hat{\bm{\mathsf{A}}}\!\left[\hat{\bm{\Sigma}}^{-1}\!\left(\bm{\sigma}^{0}+\bm{\Sigma}\right)\right]$
	$\displaystyle=\hat{\bm{\mathsf{A}}}\!\left[\hat{\bm{\Sigma}}^{-1}\!\left(\bm{\sigma}^{0}\right)+\!\left[D\!\left(\hat{\bm{\Sigma}}^{-1}\right)\right]\!\left(\bm{\sigma}^{0}\right)\cdot\bm{\Sigma}\right]$
	$\displaystyle=\hat{\bm{\mathsf{A}}}\!\left\{\mathbf{1}+\!\left[D\hat{\bm{\Sigma}}\!\left(\mathbf{1}\right)\right]^{-1}\cdot\bm{\Sigma}\right\}$
	$\displaystyle=\hat{\bm{\mathsf{A}}}\!\left(\mathbf{1}\right)+D\hat{\bm{\mathsf{A}}}\!\left(\mathbf{1}\right)\cdot\!\left\{\!\left[D\hat{\bm{\Sigma}}\!\left(\mathbf{1}\right)\right]^{-1}\cdot\bm{\Sigma}\right\},$	(C.22)

valid for small symmetric $\bm{\Sigma}$ , where we have used the identity

\displaystyle D\!\left(\hat{\bm{\Sigma}}^{-1}\right)\!\left(\bm{\sigma}^{0}\right)=\!\left\{D\hat{\bm{\Sigma}}\!\left[\hat{\bm{\Sigma}}^{-1}\!\left(\bm{\sigma}^{0}\right)\right]\right\}^{-1}=\!\left[D\hat{\bm{\Sigma}}\!\left(\mathbf{1}\right)\right]^{-1}.

(C.23)

Thus,

\displaystyle\!\left(D_{\bm{\sigma}}\bar{\bm{\mathsf{a}}}\right)\!\left(\bm{\sigma}^{0},\mathbf{1}\right)\cdot\!\left(\bm{\sigma}^{1}-\bm{\mathsf{X}}_{\bm{\omega}}\cdot\bm{\sigma}^{0}\right)=D\hat{\bm{\mathsf{A}}}\!\left(\mathbf{1}\right)\cdot\!\left\{\!\left[D\hat{\bm{\Sigma}}\!\left(\mathbf{1}\right)\right]^{-1}\cdot\!\left(\bm{\sigma}^{1}-\bm{\mathsf{X}}_{\bm{\omega}}\cdot\bm{\sigma}^{0}\right)\right\},

(C.24)

and we can combine this with eqs. (C.21), (C.19) and (C.15) to write the perturbation to the elastic tensor as

\displaystyle\bm{\mathsf{a}}^{1}=\bm{\mathsf{X}}_{\bm{\omega}}\bm{\mathsf{a}}^{0}-\bm{\mathsf{a}}^{0}\bm{\mathsf{X}}_{\bm{\omega}}+\bm{\mathsf{X}}_{\mathbf{u}}\bm{\mathsf{c}}^{0}+\bm{\mathsf{c}}^{0}\bm{\mathsf{X}}_{\mathbf{u}}-\bm{\mathsf{c}}^{0}\mathrm{tr}\!\left(\mathbf{u}\right)+8D^{3}\tilde{V}\!\left(\mathbf{1}\right)\cdot\mathbf{u}+\bm{\mathsf{R}}_{\!\left(\bm{\sigma}^{1}-\bm{\mathsf{X}}_{\bm{\omega}}\cdot\bm{\sigma}^{0}\right)},

(C.25)

with

\displaystyle\mathbf{u}=\!\left(\bm{\mathsf{c}}^{0}+\overline{\bm{\mathsf{X}}}_{\bm{\sigma}^{0}}-\bm{\sigma}^{0}\otimes\mathbf{1}\right)^{-1}\cdot\!\left(\bm{\sigma}^{1}-\bm{\mathsf{X}}_{\bm{\omega}}\cdot\bm{\sigma}^{0}\right).

(C.26)

A final simplification is obtained by noting that

\displaystyle\bm{\mathsf{X}}_{\bm{\omega}}\bm{\mathsf{a}}^{0}-\bm{\mathsf{a}}^{0}\bm{\mathsf{X}}_{\bm{\omega}}=\bm{\mathsf{X}}_{\bm{\omega}}\bm{\mathsf{c}}^{0}-\bm{\mathsf{c}}^{0}\bm{\mathsf{X}}_{\bm{\omega}}+\bm{\mathsf{X}}_{\bm{\omega}}\bm{\mathsf{R}}_{\bm{\sigma}^{0}}-\bm{\mathsf{R}}_{\bm{\sigma}^{0}}\bm{\mathsf{X}}_{\bm{\omega}}.

(C.27)

It is then readily shown that

\displaystyle\bm{\mathsf{X}}_{\bm{\omega}}\bm{\mathsf{R}}_{\bm{\sigma}^{0}}-\bm{\mathsf{R}}_{\bm{\sigma}^{0}}\bm{\mathsf{X}}_{\bm{\omega}}=\bm{\mathsf{R}}_{\bm{\mathsf{X}}_{\bm{\omega}}\cdot\bm{\sigma}^{0}},

(C.28)

which partially cancels with the final term in eq.(C.25). Thus, our final expression for the perturbation to the elastic tensor is

\displaystyle\begin{aligned} \bm{\mathsf{a}}^{1}&=\bm{\mathsf{c}}^{1}+\bm{\mathsf{R}}_{\bm{\sigma}^{1}}\\ \bm{\mathsf{c}}^{1}&=\bm{\mathsf{X}}_{\mathbf{u}+\bm{\omega}}\bm{\mathsf{c}}^{0}+\bm{\mathsf{c}}^{0}\bm{\mathsf{X}}_{\mathbf{u}-\bm{\omega}}-\bm{\mathsf{c}}^{0}\mathrm{tr}\!\left(\mathbf{u}\right)+8D^{3}\tilde{V}\!\left(\mathbf{1}\right)\cdot\mathbf{u}\\ \mathbf{u}&=\!\left(\bm{\mathsf{c}}^{0}+\overline{\bm{\mathsf{X}}}_{\bm{\sigma}^{0}}-\bm{\sigma}^{0}\otimes\mathbf{1}\right)^{-1}\cdot\!\left(\bm{\sigma}^{1}-\bm{\mathsf{X}}_{\bm{\omega}}\cdot\bm{\sigma}^{0}\right).\end{aligned}

(C.29)

Note that the arbitrary antisymmetric matrix $\bm{\omega}$ insinuates itself implicitly via the definition of $\mathbf{u}$ , as well as appearing explicitly in the expression for $\bm{\mathsf{c}}^{1}$ . When the background material is isotropic $\bm{\omega}$ can be set to zero without loss of generality.

C.2 Isotropic, hydrostatically pre-stressed background

For a zeroth-order hydrostatic stress and elastic tensor given by

	$\displaystyle\bm{\sigma}^{0}$	$\displaystyle=-p^{0}\mathbf{1}$
	$\displaystyle\bm{\mathsf{c}}^{0}$	$\displaystyle=\lambda\mathbf{1}\otimes\mathbf{1}+2\mu\overline{\bm{\mathsf{id}}},$		(C.30)

the perturbation to the stress satisfies

$\displaystyle\bm{\sigma}^{1}$	$\displaystyle=\!\left(\bm{\mathsf{c}}^{0}+\overline{\bm{\mathsf{X}}}_{\bm{\sigma}^{0}}-\bm{\sigma}^{0}\otimes\mathbf{1}\right)\cdot\mathbf{u}$
	$\displaystyle=\!\left(\lambda\mathbf{1}\otimes\mathbf{1}+2\mu\overline{\bm{\mathsf{id}}}-2p^{0}\overline{\bm{\mathsf{id}}}+p^{0}\mathbf{1}\otimes\mathbf{1}\right)\cdot\mathbf{u}$
	$\displaystyle=\tilde{\lambda}\mathrm{tr}\!\left(\mathbf{u}\right)\mathbf{1}+2\tilde{\mu}\mathbf{u},$	(C.31)

with


$\displaystyle\tilde{\lambda}$	$\displaystyle=\lambda+p^{0}$	(C.32a)
$\displaystyle\tilde{\mu}$	$\displaystyle=\mu-p^{0}.$	(C.32b)

It follows easily that

\displaystyle\mathrm{tr}\!\left(\bm{\sigma}^{1}\right)

\displaystyle=\!\left(3\tilde{\lambda}+2\tilde{\mu}\right)\mathrm{tr}\!\left(\mathbf{u}\right),

(C.33)

from which

\displaystyle\mathbf{u}

\displaystyle=\frac{1}{2\tilde{\mu}}\!\left[\bm{\sigma}^{1}-\frac{\tilde{\lambda}}{3\tilde{\lambda}+2\tilde{\mu}}\mathrm{tr}\!\left(\bm{\sigma}^{1}\right)\mathbf{1}\right].

(C.34)

Next, we split the stress into its hydrostatic and deviatoric components. Writing

\displaystyle\bm{\sigma}^{1}=-p^{1}\mathbf{1}+\bm{\tau}^{1},

(C.35)

with

\displaystyle\mathrm{tr}\!\left(\bm{\tau}^{1}\right)=0,

(C.36)

we find that

\displaystyle\mathbf{u}=-x\mathbf{1}+\frac{1}{2\tilde{\mu}}\bm{\tau}^{1},

(C.37)

where we define the shorthand

\displaystyle x\equiv\frac{p^{1}}{3\tilde{\lambda}+2\tilde{\mu}}.

(C.38)

Let us now turn to the expression for $\bm{\mathsf{c}}^{1}$ , which is given by

\displaystyle\bm{\mathsf{c}}^{1}=\bm{\mathsf{X}}_{\mathbf{u}}\bm{\mathsf{c}}^{0}+\bm{\mathsf{c}}^{0}\bm{\mathsf{X}}_{\mathbf{u}}-\mathrm{tr}\!\left(\mathbf{u}\right)\bm{\mathsf{c}}^{0}+8D^{3}\tilde{V}\!\left(\mathbf{1}\right)\cdot\mathbf{u}

(C.39)

in the isotropic case. For the third-derivative term, we follow the same philosophy as Murnaghan (1937), considering a Taylor-expansion of the strain-energy function $\tilde{V}$ up to third-order in the scalar-invariants of $\mathbf{C}$ . The most general such expression is, in index-notation,

\displaystyle[8D^{3}\tilde{V}\!\left(\mathbf{1}\right)]_{ijklmn}C_{ij}C_{kl}C_{mn}

\displaystyle=\zeta_{1}\mathrm{tr}\!\left(\mathbf{C}\right)^{3}+3\zeta_{2}\mathrm{tr}\!\left(\mathbf{C}\right)\mathrm{tr}\!\left(\mathbf{C}^{2}\right)+2\zeta_{3}\mathrm{tr}\!\left(\mathbf{C}^{3}\right),

(C.40)

where we have defined the constants $\zeta_{1}$ , $\zeta_{2}$ and $\zeta_{3}$ . This expression should be compared with Murnaghan (1937, p.250, 2nd equation from bottom), wherein the Murnaghan constants $l$ , $m$ and $n$ are introduced, as well as the bottom equation of p.240 of that paper, where the scalar-invariants of a tensor are defined. Our constants $\zeta_{1}$ , $\zeta_{2}$ and $\zeta_{3}$ , which we have found convenient to use when performing calculations, are equivalent to Murnaghan’s, and we shall simply refer to them as ‘the Murnaghan constants’. Using our operator notation, the expression for the third-derivative term in $\bm{\mathsf{c}}^{1}$ is

\displaystyle 8D^{3}\tilde{V}\!\left(\mathbf{1}\right)\cdot\mathbf{u}=\zeta_{1}\mathrm{tr}\!\left(\mathbf{u}\right)\mathbf{1}\otimes\mathbf{1}+\zeta_{2}\mathrm{tr}\!\left(\mathbf{u}\right)\overline{\bm{\mathsf{id}}}+\zeta_{2}\!\left(\mathbf{1}\otimes\mathbf{u}+\mathbf{u}\otimes\mathbf{1}\right)+\zeta_{3}\overline{\bm{\mathsf{X}}}_{\mathbf{u}},

(C.41)

where we have used hats to mark symmetrisation of certain operators (see eq.A.32). With $\bm{\mathsf{c}}^{0}$ given above, and bearing in mind that

	$\displaystyle\bm{\mathsf{X}}_{\mathbf{u}}\!\left(\mathbf{1}\otimes\mathbf{1}\right)$	$\displaystyle=2\mathbf{u}\otimes\mathbf{1},$		(C.42)
	$\displaystyle\bm{\mathsf{X}}_{\mathbf{u}}\overline{\bm{\mathsf{id}}}$	$\displaystyle=\overline{\bm{\mathsf{X}}}_{\mathbf{u}},$		(C.43)

we obtain

\displaystyle\bm{\mathsf{c}}^{1}=\!\left(\zeta_{1}-\lambda\right)\mathrm{tr}\!\left(\mathbf{u}\right)\mathbf{1}\otimes\mathbf{1}+\!\left(\zeta_{2}-2\mu\right)\mathrm{tr}\!\left(\mathbf{u}\right)\overline{\bm{\mathsf{id}}}+\!\left(2\lambda+\zeta_{2}\right)\!\left(\mathbf{1}\otimes\mathbf{u}+\mathbf{u}\otimes\mathbf{1}\right)+\!\left(4\mu+\zeta_{3}\right)\overline{\bm{\mathsf{X}}}_{\mathbf{u}}.

(C.44)

We can now substitute in our expression for $\mathbf{u}$ in terms of $p^{1}$ and $\bm{\tau}^{1}$ , yielding

\displaystyle\bm{\mathsf{c}}^{1}

\displaystyle=-\frac{\lambda+3\zeta_{1}+2\zeta_{2}}{3\lambda+2\mu+p^{0}}p^{1}\mathbf{1}\otimes\mathbf{1}-\frac{2\mu+3\zeta_{2}+2\zeta_{3}}{3\lambda+2\mu+p^{0}}p^{1}\overline{\bm{\mathsf{id}}}+\frac{2\lambda+\zeta_{2}}{2\!\left(\mu-p^{0}\right)}\!\left(\mathbf{1}\otimes\bm{\tau}^{1}+\bm{\tau}^{1}\otimes\mathbf{1}\right)+\frac{4\mu+\zeta_{3}}{2\!\left(\mu-p^{0}\right)}\overline{\bm{\mathsf{X}}}_{\bm{\tau}^{1}}.

(C.45)

With this, we may write the perturbation to first elastic tensor as

\displaystyle\bm{\mathsf{a}}^{1}=cp^{1}\mathbf{1}\otimes\mathbf{1}+2dp^{1}\overline{\bm{\mathsf{id}}}+a\!\left(\mathbf{1}\otimes\bm{\tau}^{1}+\bm{\tau}^{1}\otimes\mathbf{1}\right)+2b\overline{\bm{\mathsf{X}}}_{\bm{\tau}^{1}}-p^{1}\bm{\mathsf{id}}+\bm{\mathsf{R}}_{\bm{\tau}^{1}},

(C.46)

where $a$ , $b$ , $c$ and $d$ are given in terms of the Murnaghan constants as

$\displaystyle a$	$\displaystyle\equiv\frac{\lambda+\frac{1}{2}\zeta_{2}}{\mu-p^{0}}$	(C.47)
$\displaystyle b$	$\displaystyle\equiv\frac{\mu+\frac{1}{4}\zeta_{3}}{\mu-p^{0}}$	(C.48)
$\displaystyle c$	$\displaystyle\equiv-\frac{\lambda+3\zeta_{1}+2\zeta_{2}}{3\kappa+p^{0}}$	(C.49)
$\displaystyle d$	$\displaystyle\equiv-\frac{\mu+\frac{3}{2}\zeta_{2}+\zeta_{3}}{3\kappa+p^{0}}.$	(C.50)

The first elastic tensor is written in index notation as

	$\displaystyle\mathsf{a}_{ijkl}$	$\displaystyle=\!\left(\kappa-\frac{2}{3}\mu+p^{1}c\right)\delta_{ij}\delta_{kl}+\!\left(\mu+p^{1}d\right)\!\left(\delta_{ik}\delta_{jl}+\delta_{il}\delta_{jk}\right)-\!\left(p^{0}+p^{1}\right)\delta_{ik}\delta_{jl}$
		$\displaystyle\qquad+a\!\left(\delta_{ij}\tau^{1}_{kl}+\delta_{kl}\tau^{1}_{ij}\right)+b\!\left(\delta_{ik}\tau^{1}_{jl}+\delta_{jl}\tau^{1}_{ik}+\delta_{il}\tau^{1}_{jk}+\delta_{jk}\tau^{1}_{il}\right)+\delta_{ik}\tau^{1}_{jl}.$		(C.51)

C.3 Transversely-isotropic, ‘quasi-hydrostatically’ pre-stressed background

The transversely-isotropic calculation is more intricate than the isotropic one for a number of reasons. Firstly, and perhaps most simply remedied, we can no longer ignore the arbitrary antisymmetric matrix $\bm{\omega}$ . Secondly, the stress-strain relationship

\displaystyle\mathbf{u}

\displaystyle=\!\left(\bm{\mathsf{c}}^{0}+\overline{\bm{\mathsf{X}}}_{\bm{\sigma}^{0}}-\bm{\sigma}^{0}\otimes\mathbf{1}\right)^{-1}\cdot\!\left(\bm{\sigma}^{1}-\bm{\mathsf{X}}_{\bm{\omega}}\cdot\bm{\sigma}^{0}\right)

(C.52)

is not as easily inverted for a transversely-isotropic elastic tensor. Thirdly, there are many more terms to keep track of.

Let us begin by recalling the stress-free transversely-isotropic elastic-tensor given in eq.(4.30):

\displaystyle\mathsf{a}_{ijkl}=\mathsf{c}_{ijkl}

\displaystyle=\lambda\delta_{ij}\delta_{kl}+\mu\!\left(\delta_{ik}\delta_{jl}+\delta_{il}\delta_{jk}\right)+8\gamma\nu_{i}\nu_{j}\nu_{k}\nu_{l}+4\beta\!\left(\nu_{i}\nu_{j}\delta_{kl}+\delta_{ij}\nu_{k}\nu_{l}\right)-\alpha\!\left(\nu_{i}\nu_{k}\delta_{jl}+\nu_{j}\nu_{k}\delta_{il}+\nu_{j}\nu_{l}\delta_{ik}+\nu_{i}\nu_{l}\delta_{jk}\right).

(C.53)

For the remainder of the calculation we will switch to our ‘operator-based’ notation. We also introduce a shorthand that is standard in many areas of physics, whereby a symmetric expression is written as

\displaystyle\!\left(\text{symmetric object}\right)=\!\left(\text{not-necessarily-symmetric object}\right)+h.c.,

(C.54)

an example being

\displaystyle\!\left(\mathbf{1}\otimes\bm{\tau}^{1}+\bm{\tau}^{1}\otimes\mathbf{1}\right)=\mathbf{1}\otimes\bm{\tau}^{1}+h.c..

(C.55)

It will allow us to avoid significant clutter later on. (Here, $h.c.$ technically stands for ‘hermitian conjugate’.) We can now rewrite the zeroth-order elastic tensor as

\displaystyle\bm{\mathsf{c}}^{0}=\lambda\mathbf{1}\otimes\mathbf{1}+2\mu\overline{\bm{\mathsf{id}}}+8\gamma\mathbf{N}\otimes\mathbf{N}+4\beta\!\left(\mathbf{1}\otimes\mathbf{N}+h.c.\right)-2\alpha\overline{\bm{\mathsf{X}}}_{\mathbf{N}},

(C.56)

and the pre-stress (eq.4.33) as

\displaystyle\bm{\sigma}^{0}=-\!\left(p^{0}+\frac{q^{0}}{3}\right)\mathbf{1}+q^{0}\mathbf{N}.

(C.57)

$\mathbf{N}$ satisfies the relations

\displaystyle\mathrm{tr}\!\left(\mathbf{N}\right)=1,

(C.58)

and

\displaystyle\mathbf{N}^{n}=\mathbf{N}

(C.59)

for positive integer $n$ , from which it follows by induction that

\displaystyle\bm{\mathsf{X}}_{\mathbf{N}}^{n}=\bm{\mathsf{X}}_{\mathbf{N}}+\!\left(2^{n}-2\right)\mathbf{N}\otimes\mathbf{N}.

(C.60)

It is useful to note that

\displaystyle\overline{\bm{\mathsf{X}}}_{\bm{\sigma}^{0}}-\bm{\sigma}^{0}\otimes\mathbf{1}

\displaystyle=-2\!\left(p^{0}+\frac{q^{0}}{3}\right)\overline{\bm{\mathsf{id}}}+q^{0}\overline{\bm{\mathsf{X}}}_{\mathbf{N}}+\!\left(p^{0}+\frac{q^{0}}{3}\right)\mathbf{1}\otimes\mathbf{1}-q^{0}\mathbf{N}\otimes\mathbf{1},

(C.61)

for now we can rewrite the stress-strain relationship (eq.C.52) as

	$\displaystyle\bm{\sigma}^{1}-\bm{\mathsf{X}}_{\bm{\omega}}\cdot\bm{\sigma}^{0}$	$\displaystyle=\!\left[\bm{\mathsf{c}}^{0}-2\!\left(p^{0}+\frac{q^{0}}{3}\right)\overline{\bm{\mathsf{id}}}+q^{0}\overline{\bm{\mathsf{X}}}_{\mathbf{N}}+\!\left(p^{0}+\frac{q^{0}}{3}\right)\mathbf{1}\otimes\mathbf{1}-q^{0}\mathbf{N}\otimes\mathbf{1}\right]\cdot\mathbf{u}$
		$\displaystyle=\tilde{\lambda}\mathrm{tr}\!\left(\mathbf{u}\right)\mathbf{1}+2\tilde{\mu}\mathbf{u}+8\tilde{\gamma}\left\langle\mathbf{N},\mathbf{u}\right\rangle\mathbf{N}-2\tilde{\alpha}\overline{\bm{\mathsf{X}}}_{\mathbf{N}}\cdot\mathbf{u}+4\tilde{\beta}\left\langle\mathbf{N},\mathbf{u}\right\rangle\mathbf{1}+4\tilde{\beta}^{\prime}\mathrm{tr}\!\left(\mathbf{u}\right)\mathbf{N},$		(C.62)

with


$\displaystyle\tilde{\lambda}$	$\displaystyle=\lambda+p^{0}+\frac{q^{0}}{3}$	(C.63a)
$\displaystyle\tilde{\mu}$	$\displaystyle=\mu-p^{0}-\frac{q^{0}}{3}$	(C.63b)
$\displaystyle\tilde{\gamma}$	$\displaystyle=\gamma$	(C.63c)
$\displaystyle\tilde{\alpha}$	$\displaystyle=\alpha-\frac{q^{0}}{2}$	(C.63d)
$\displaystyle\tilde{\beta}$	$\displaystyle=\beta$	(C.63e)
$\displaystyle\tilde{\beta}^{\prime}$	$\displaystyle=\beta-\frac{q^{0}}{4}.$	(C.63f)

We are now in position to solve for $\mathbf{u}$ in terms of $\bm{\sigma}^{1}$ and $\bm{\omega}$ . In the isotropic case we could invert for $\mathbf{u}$ simply by writing $\mathrm{tr}\!\left(\mathbf{u}\right)$ in terms of $\mathrm{tr}\!\left(\bm{\sigma}^{1}\right)$ . Here, by analogy, we take not only the trace of both sides, but also the inner product with $\mathbf{N}$ , to find that

\displaystyle\begin{pmatrix}\mathrm{tr}\!\left(\bm{\sigma}^{1}\right)\\ \left\langle\mathbf{N},\bm{\sigma}^{1}\right\rangle\end{pmatrix}=\begin{pmatrix}3\tilde{\lambda}+2\tilde{\mu}+4\tilde{\beta}^{\prime}&8\tilde{\gamma}-4\tilde{\alpha}+12\tilde{\beta}\\ \tilde{\lambda}+4\tilde{\beta}^{\prime}&2\tilde{\mu}+8\tilde{\gamma}-4\tilde{\alpha}+4\tilde{\beta}\end{pmatrix}\begin{pmatrix}\mathrm{tr}\!\left(\mathbf{u}\right)\\ \left\langle\mathbf{N},\mathbf{u}\right\rangle\end{pmatrix}.

(C.64)

Note that $\bm{\omega}$ is not present here because

\displaystyle\mathrm{tr}\!\left(\bm{\mathsf{X}}_{\bm{\omega}}\cdot\bm{\sigma}^{0}\right)

\displaystyle=\left\langle\mathbf{1},\bm{\mathsf{X}}_{\bm{\omega}}\cdot\bm{\sigma}^{0}\right\rangle=\left\langle\bm{\mathsf{X}}_{\bm{\omega}}^{T}\cdot\mathbf{1},\bm{\sigma}^{0}\right\rangle,

(C.65)

which vanishes due to the antisymmetry of $\bm{\omega}$ . By the same token,

\displaystyle\left\langle\mathbf{N},\bm{\mathsf{X}}_{\bm{\omega}}\cdot\bm{\sigma}^{0}\right\rangle

\displaystyle=-\!\left(p^{0}+\frac{q^{0}}{3}\right)\left\langle\mathbf{N},\bm{\mathsf{X}}_{\bm{\omega}}\cdot\mathbf{1}\right\rangle+q^{0}\left\langle\mathbf{N},\bm{\mathsf{X}}_{\bm{\omega}}\cdot\mathbf{N}\right\rangle=0,

(C.66)

with the second term vanishing due to the antisymmetry of the operator $\bm{\mathsf{X}}_{\bm{\omega}}$ (which obviously follows from that of $\bm{\omega}$ ). Thus, both $\mathrm{tr}\!\left(\mathbf{u}\right)$ and $\left\langle\mathbf{N},\mathbf{u}\right\rangle$ can be written in terms of known quantities wherever they appear.

Now, in a further step reminiscent of the isotropic case, we move most of the terms in eq.(C.3) over to the left hand side, giving

\displaystyle 2\tilde{\mu}\mathbf{u}-2\tilde{\alpha}\bm{\mathsf{X}}_{\mathbf{N}}\cdot\mathbf{u}=2\tilde{\mu}\bm{\Sigma}^{\prime},

(C.67)

where $\bm{\Sigma}^{\prime}$ is a shorthand defined as

\displaystyle\bm{\Sigma}^{\prime}=\frac{1}{2\tilde{\mu}}\!\left[\bm{\sigma}^{1}-\bm{\mathsf{X}}_{\bm{\omega}}\cdot\bm{\sigma}^{0}-\tilde{\eta}\mathbf{1}-\tilde{\theta}\mathbf{N}\right],

(C.68)

with

\displaystyle\begin{pmatrix}\tilde{\eta}\\ \tilde{\theta}\end{pmatrix}=\begin{pmatrix}\tilde{\lambda}&4\tilde{\beta}\\ 4\tilde{\beta}^{\prime}&8\tilde{\gamma}\end{pmatrix}\begin{pmatrix}\mathrm{tr}\!\left(\mathbf{u}\right)\\ \left\langle\mathbf{N},\mathbf{u}\right\rangle\end{pmatrix}.

(C.69)

We are left to solve the equation

\displaystyle\!\left(\bm{\mathsf{id}}-\frac{\tilde{\alpha}}{\tilde{\mu}}\bm{\mathsf{X}}_{\mathbf{N}}\right)\cdot\mathbf{u}=\bm{\Sigma}^{\prime}.

(C.70)

It is readily verified that the necessary inverse operator is

\displaystyle\!\left(\bm{\mathsf{id}}-\frac{\tilde{\alpha}}{\tilde{\mu}}\bm{\mathsf{X}}_{\mathbf{N}}\right)^{-1}

\displaystyle=\bm{\mathsf{id}}+\tilde{P}\bm{\mathsf{X}}_{\mathbf{N}}+\tilde{Q}\mathbf{N}\otimes\mathbf{N},

(C.71)

with


$\displaystyle\tilde{P}$	$\displaystyle=\frac{\frac{\tilde{\alpha}}{\tilde{\mu}}}{1-\frac{\tilde{\alpha}}{\tilde{\mu}}}$	(C.72a)
$\displaystyle\tilde{Q}$	$\displaystyle=\frac{2\frac{\tilde{\alpha}}{\tilde{\mu}}}{1-2\frac{\tilde{\alpha}}{\tilde{\mu}}}\tilde{P},$	(C.72b)

from which it follows that

\displaystyle\mathbf{u}=\bm{\Sigma}^{\prime}+\tilde{P}\bm{\mathsf{X}}_{\mathbf{N}}\cdot\bm{\Sigma}^{\prime}+\tilde{Q}\left\langle\mathbf{N},\bm{\Sigma}^{\prime}\right\rangle\mathbf{N}.

(C.73)

This procedure seems to break down when $\tilde{\mu}$ equals $\tilde{\alpha}$ or $2\tilde{\alpha}$ , but the physical significance of these cases is not yet clear to the present authors. From here, with the help of the identity

\displaystyle\bm{\mathsf{X}}_{\mathbf{N}}\bm{\mathsf{X}}_{\bm{\omega}}\cdot\mathbf{N}=\bm{\mathsf{X}}_{\bm{\omega}}\cdot\mathbf{N},

(C.74)

it is a matter of relatively straightforward algebra to find that

\displaystyle 2\tilde{\mu}\mathbf{u}=\bm{\sigma}^{1}-\tilde{\eta}\mathbf{1}+\tilde{P}\bm{\mathsf{X}}_{\mathbf{N}}\cdot\bm{\sigma}^{1}+\tilde{R}\mathbf{N}-q^{0}(1+\tilde{P})\bm{\mathsf{X}}_{\bm{\omega}}\cdot\mathbf{N},

(C.75)

with

\displaystyle\tilde{R}=\tilde{Q}\!\left(\left\langle\mathbf{N},\bm{\sigma}^{1}\right\rangle-\tilde{\eta}-\tilde{\theta}\right)-\tilde{\theta}-2\tilde{P}\!\left(\tilde{\eta}+\tilde{\theta}\right).

(C.76)

We now have an expression for $\mathbf{u}$ in terms of:

(1).

the transversely-isotropic constants $\lambda$ , $\mu$ , $\alpha$ , $\beta$ and $\gamma$ , as well as $\bm{\nu}$ ;
(2).

the constants $p^{0}$ and $q^{0}$ which define the equilibrium stress-state;
(3).

the perturbation to the stress $\bm{\sigma}^{1}$ ;
(4).

an antisymmetric matrix $\bm{\omega}$ ‘pointing’ in an arbitrary direction in the plane perpendicular to the unperturbed symmetry-axis.

This is to be substituted into the expression

\displaystyle\bm{\mathsf{c}}^{1}

\displaystyle=\bm{\mathsf{X}}_{\mathbf{u}+\bm{\omega}}\bm{\mathsf{c}}^{0}+\bm{\mathsf{c}}^{0}\bm{\mathsf{X}}_{\mathbf{u}-\bm{\omega}}-\bm{\mathsf{c}}^{0}\mathrm{tr}\!\left(\mathbf{u}\right)+8D^{3}\tilde{V}\!\left(\mathbf{1}\right)\cdot\mathbf{u}

(C.77)

for the perturbed elastic tensor.

In order to make progress we must parametrise the third derivatives of the transversely-isotropic strain-energy function. As stated in the main text, such a strain-energy function will generally depend on $\mathbf{C}$ not only through the invariants $I_{1}$ , $I_{2}$ and $I_{3}$ defined in eq.(4.13), but also through the terms (e.g. Holzapfel, 2000)

	$\displaystyle\left\langle\bm{\nu},\mathbf{C}\cdot\bm{\nu}\right\rangle$	$\displaystyle=\left\langle\mathbf{N},\mathbf{C}\right\rangle$		(C.78)
	$\displaystyle\left\langle\bm{\nu},\mathbf{C}^{2}\cdot\bm{\nu}\right\rangle$	$\displaystyle=\left\langle\mathbf{N},\mathbf{C}^{2}\right\rangle=\frac{1}{2}\left\langle\mathbf{C},\bm{\mathsf{X}}_{\mathbf{N}}\cdot\mathbf{C}\right\rangle.$		(C.79)

The third order terms of $\tilde{V}$ ’s Taylor-series about the identity therefore satisfy

$\displaystyle[8D^{3}\tilde{V}\!\left(\mathbf{1}\right)]_{ijklmn}C_{ij}C_{kl}C_{mn}$	$\displaystyle=\zeta_{1}\mathrm{tr}\!\left(\mathbf{C}\right)^{3}+3\zeta_{2}\mathrm{tr}\!\left(\mathbf{C}\right)\mathrm{tr}\!\left(\mathbf{C}^{2}\right)+2\zeta_{3}\mathrm{tr}\!\left(\mathbf{C}^{3}\right)$
	$\displaystyle\qquad+3\zeta_{4}\mathrm{tr}\!\left(\mathbf{C}\right)\left\langle\mathbf{C},\mathbf{N}\right\rangle^{2}+3\zeta_{5}\mathrm{tr}\!\left(\mathbf{C}\right)\left\langle\mathbf{C},\bm{\mathsf{X}}_{\mathbf{N}}\cdot\mathbf{C}\right\rangle$
	$\displaystyle\qquad+3\zeta_{6}\left\langle\mathbf{C},\bm{\mathsf{id}}\cdot\mathbf{C}\right\rangle\left\langle\mathbf{C},\mathbf{N}\right\rangle+3\zeta_{7}\left\langle\mathbf{C},\mathbf{N}\right\rangle\left\langle\mathbf{C},\bm{\mathsf{X}}_{\mathbf{N}}\cdot\mathbf{C}\right\rangle,$	(C.80)

for some material-dependent constants $\!\left\{\zeta_{i}\right\}$ , so that the corresponding term in the elastic tensor is

$\displaystyle 8D^{3}\tilde{V}\!\left(\mathbf{1}\right)\cdot\mathbf{u}$	$\displaystyle=\zeta_{1}\mathrm{tr}\!\left(\mathbf{u}\right)\mathbf{1}\otimes\mathbf{1}$
	$\displaystyle\qquad+\zeta_{2}\!\left[\mathrm{tr}\!\left(\mathbf{u}\right)\overline{\bm{\mathsf{id}}}+\!\left(\mathbf{1}\otimes\mathbf{u}+h.c.\right)\right]$
	$\displaystyle\qquad+\zeta_{3}\overline{\bm{\mathsf{X}}}_{\mathbf{u}}$
	$\displaystyle\qquad+\zeta_{4}\!\left[\left\langle\mathbf{N},\mathbf{u}\right\rangle\!\left(\mathbf{1}\otimes\mathbf{N}+h.c.\right)+\mathrm{tr}\!\left(\mathbf{u}\right)\mathbf{N}\otimes\mathbf{N}\right]$
	$\displaystyle\qquad+\zeta_{5}\!\left[\!\left(\mathbf{1}\otimes\!\left(\bm{\mathsf{X}}_{\mathbf{N}}\cdot\mathbf{u}\right)+h.c.\right)+\mathrm{tr}\!\left(\mathbf{u}\right)\bm{\mathsf{X}}_{\mathbf{N}}\right]$
	$\displaystyle\qquad+\zeta_{6}\!\left[\!\left(\mathbf{N}\otimes\mathbf{u}+h.c.\right)+\left\langle\mathbf{N},\mathbf{u}\right\rangle\bm{\mathsf{id}}\right]$
	$\displaystyle\qquad+\zeta_{7}\!\left[\!\left(\mathbf{N}\otimes\!\left(\bm{\mathsf{X}}_{\mathbf{N}}\cdot\mathbf{u}\right)+h.c.\right)+\left\langle\mathbf{N},\mathbf{u}\right\rangle\bm{\mathsf{X}}_{\mathbf{N}}\right]$
	$\displaystyle=\zeta_{1}\mathrm{tr}\!\left(\mathbf{u}\right)\mathbf{1}\otimes\mathbf{1}+\!\left(\zeta_{2}\mathrm{tr}\!\left(\mathbf{u}\right)+\zeta_{6}\left\langle\mathbf{N},\mathbf{u}\right\rangle\right)\overline{\bm{\mathsf{id}}}+\zeta_{2}\!\left(\mathbf{1}\otimes\mathbf{u}+h.c.\right)+\zeta_{3}\overline{\bm{\mathsf{X}}}_{\mathbf{u}}$
	$\displaystyle\qquad+\zeta_{4}\!\left[\left\langle\mathbf{N},\mathbf{u}\right\rangle\!\left(\mathbf{1}\otimes\mathbf{N}+h.c.\right)+\mathrm{tr}\!\left(\mathbf{u}\right)\mathbf{N}\otimes\mathbf{N}\right]+\zeta_{5}\!\left[\mathbf{1}\otimes\!\left(\bm{\mathsf{X}}_{\mathbf{N}}\cdot\mathbf{u}\right)+h.c.\right]$
	$\displaystyle\qquad+\zeta_{6}\!\left(\mathbf{N}\otimes\mathbf{u}+h.c.\right)+\zeta_{7}\!\left[\mathbf{N}\otimes\!\left(\bm{\mathsf{X}}_{\mathbf{N}}\cdot\mathbf{u}\right)+h.c.\right]+\!\left(\zeta_{5}\mathrm{tr}\!\left(\mathbf{u}\right)+\zeta_{7}\left\langle\mathbf{N},\mathbf{u}\right\rangle\right)\overline{\bm{\mathsf{X}}}_{\mathbf{N}}.$	(C.81)

Whilst for an isotropic solid we needed three extra material-dependent constants to parametrise the third derivatives, here we need seven.

With this, we are ready to write down an expression for the elastic tensor’s perturbation. Observe that

	$\displaystyle\bm{\mathsf{X}}_{\mathbf{u}}\bm{\mathsf{c}}^{0}$	$\displaystyle=\!\left(\bm{\mathsf{c}}^{0}\bm{\mathsf{X}}_{\mathbf{u}}\right)^{T}$		(C.82)
	$\displaystyle\bm{\mathsf{X}}_{\bm{\omega}}\bm{\mathsf{c}}^{0}$	$\displaystyle=\!\left(\bm{\mathsf{c}}^{0}\bm{\mathsf{X}}_{\!\left(-\bm{\omega}\right)}\right)^{T}.$		(C.83)

Thus,

$\displaystyle\bm{\mathsf{c}}^{1}$	$\displaystyle=\mathrm{tr}\!\left(\mathbf{u}\right)\!\left[\!\left(\zeta_{1}-\lambda\right)\mathbf{1}\otimes\mathbf{1}+\!\left(\zeta_{2}-2\mu\right)\overline{\bm{\mathsf{id}}}+\!\left(\zeta_{4}-8\gamma\right)\mathbf{N}\otimes\mathbf{N}+\!\left(\zeta_{5}+2\alpha\right)\overline{\bm{\mathsf{X}}}_{\mathbf{N}}-4\beta\!\left(\mathbf{1}\otimes\mathbf{N}+h.c.\right)\right]$
	$\displaystyle\qquad+\left\langle\mathbf{N},\mathbf{u}\right\rangle\!\left[\zeta_{6}\overline{\bm{\mathsf{id}}}+\zeta_{7}\overline{\bm{\mathsf{X}}}_{\mathbf{N}}+\zeta_{4}\!\left(\mathbf{1}\otimes\mathbf{N}+h.c.\right)\right]$
	$\displaystyle\qquad+\!\left(\zeta_{3}+4\mu\right)\overline{\bm{\mathsf{X}}}_{\mathbf{u}}+\!\left(\zeta_{2}+2\lambda\right)\!\left(\mathbf{1}\otimes\mathbf{u}+h.c.\right)+\!\left(\zeta_{6}+8\beta\right)\!\left(\mathbf{N}\otimes\mathbf{u}+h.c.\right)$
	$\displaystyle\qquad+\!\left(\zeta_{5}+4\beta\right)\!\left[\mathbf{1}\otimes\!\left(\bm{\mathsf{X}}_{\mathbf{N}}\cdot\mathbf{u}\right)+h.c.\right]+\!\left(\zeta_{7}+8\gamma\right)\!\left[\mathbf{N}\otimes\!\left(\bm{\mathsf{X}}_{\mathbf{N}}\cdot\mathbf{u}\right)+h.c.\right]-2\alpha\!\left(\overline{\bm{\mathsf{X}}}_{\mathbf{u}}\overline{\bm{\mathsf{X}}}_{\mathbf{N}}+h.c.\right)$
	$\displaystyle\qquad+4\mu\overline{\bm{\mathsf{X}}}_{\bm{\omega}}+8\gamma\!\left[\mathbf{N}\otimes\!\left(\bm{\mathsf{X}}_{\bm{\omega}}\cdot\mathbf{N}\right)+h.c.\right]+4\beta\!\left[\mathbf{1}\otimes\!\left(\bm{\mathsf{X}}_{\bm{\omega}}\cdot\mathbf{N}\right)+h.c.\right]-2\alpha\!\left(\overline{\bm{\mathsf{X}}}_{\bm{\omega}}\overline{\bm{\mathsf{X}}}_{\mathbf{N}}+h.c.\right).$	(C.84)

It is now a matter of tedious algebra to complete the calculation. The nontrivial identities that we need are:

$\displaystyle\bm{\mathsf{X}}_{\mathbf{N}}^{2}$	$\displaystyle=\bm{\mathsf{X}}_{\mathbf{N}}+2\mathbf{N}\otimes\mathbf{N}$	(C.85)
$\displaystyle\bm{\mathsf{X}}_{\mathbf{N}}\bm{\mathsf{X}}_{\bm{\omega}}\cdot\mathbf{N}$	$\displaystyle=\bm{\mathsf{X}}_{\bm{\omega}}\cdot\mathbf{N}$	(C.86)
$\displaystyle\bm{\mathsf{X}}_{\!\left(\bm{\mathsf{X}}_{\bm{\omega}}\cdot\mathbf{N}\right)}$	$\displaystyle=\bm{\mathsf{X}}_{\bm{\omega}}\bm{\mathsf{X}}_{\mathbf{N}}+h.c.$	(C.87)
$\displaystyle\overline{\bm{\mathsf{X}}}_{\bm{\mathsf{X}}_{\mathbf{N}}\cdot\bm{\sigma}^{1}}\overline{\bm{\mathsf{X}}}_{\mathbf{N}}+h.c.$	$\displaystyle=\overline{\bm{\mathsf{X}}}_{\bm{\mathsf{X}}_{\mathbf{N}}\cdot\bm{\sigma}^{1}}+2\!\left[\mathbf{N}\otimes\!\left(\bm{\mathsf{X}}_{\mathbf{N}}\cdot\bm{\sigma}^{1}\right)+h.c.\right]+2\left\langle\mathbf{N},\bm{\sigma}^{1}\right\rangle\overline{\bm{\mathsf{X}}}_{\mathbf{N}}.$	(C.88)

Finally, the perturbation to the elastic tensor is given by

$\displaystyle\bm{\mathsf{c}}^{1}$	$\displaystyle=\eta_{1}\mathbf{1}\otimes\mathbf{1}+\eta_{2}\overline{\bm{\mathsf{id}}}+\eta_{3}\mathbf{N}\otimes\mathbf{N}+\eta_{4}\!\left(\mathbf{1}\otimes\mathbf{N}+h.c.\right)+\eta_{5}\overline{\bm{\mathsf{X}}}_{\mathbf{N}}$
	$\displaystyle\qquad+\eta_{6}\overline{\bm{\mathsf{X}}}_{\bm{\sigma}^{1}}+\eta_{7}\!\left(\mathbf{1}\otimes\bm{\sigma}^{1}+h.c.\right)+\eta_{8}\!\left(\mathbf{N}\otimes\bm{\sigma}^{1}+h.c.\right)+\eta_{9}\!\left[\mathbf{1}\otimes\!\left(\bm{\mathsf{X}}_{\mathbf{N}}\cdot\bm{\sigma}^{1}\right)+h.c.\right]$
	$\displaystyle\qquad+\eta_{10}\!\left[\mathbf{N}\otimes\!\left(\bm{\mathsf{X}}_{\mathbf{N}}\cdot\bm{\sigma}^{1}\right)+h.c.\right]+\eta_{11}\overline{\bm{\mathsf{X}}}_{\bm{\mathsf{X}}_{\mathbf{N}}\cdot\bm{\sigma}^{1}}+\eta_{12}\!\left(\overline{\bm{\mathsf{X}}}_{\bm{\sigma}^{1}}\overline{\bm{\mathsf{X}}}_{\mathbf{N}}+h.c.\right)$
	$\displaystyle\qquad+\eta_{13}\overline{\bm{\mathsf{X}}}_{\bm{\omega}}+\eta_{14}\!\left[\mathbf{1}\otimes\!\left(\bm{\mathsf{X}}_{\bm{\omega}}\cdot\mathbf{N}\right)+h.c.\right]+\eta_{15}\!\left[\mathbf{N}\otimes\!\left(\bm{\mathsf{X}}_{\bm{\omega}}\cdot\mathbf{N}\right)+h.c.\right]+\eta_{16}\!\left(\overline{\bm{\mathsf{X}}}_{\bm{\omega}}\overline{\bm{\mathsf{X}}}_{\mathbf{N}}+h.c.\right).$	(C.89)

The $\!\left\{\eta_{i}\right\}$ are defined as


$\displaystyle 2\tilde{\mu}\eta_{1}$	$\displaystyle=2\tilde{\mu}\mathrm{tr}\!\left(\mathbf{u}\right)\!\left(\zeta_{1}-\lambda\right)-\!\left[2\tilde{\eta}\!\left(\zeta_{2}+2\lambda\right)\right]$	(C.90a)
$\displaystyle 2\tilde{\mu}\eta_{2}$	$\displaystyle=2\tilde{\mu}\!\left[\!\left(\zeta_{2}-2\mu\right)\mathrm{tr}\!\left(\mathbf{u}\right)+\zeta_{6}\left\langle\mathbf{N},\mathbf{u}\right\rangle\right]-\!\left[2\tilde{\eta}\!\left(\zeta_{3}+4\mu\right)\right]$	(C.90b)
$\displaystyle 2\tilde{\mu}\eta_{3}$	$\displaystyle=2\tilde{\mu}\mathrm{tr}\!\left(\mathbf{u}\right)\!\left(\zeta_{4}-8\gamma\right)+\!\left[2\tilde{R}\!\left(\zeta_{6}+8\beta\right)-8\alpha\tilde{R}+\!\left(4\tilde{R}-4\tilde{\eta}+4\tilde{P}\left\langle\mathbf{N},\bm{\sigma}^{1}\right\rangle\right)\!\left(\zeta_{7}+8\gamma\right)\right]$	(C.90c)
$\displaystyle 2\tilde{\mu}\eta_{4}$	$\displaystyle=2\tilde{\mu}\!\left[\zeta_{4}\left\langle\mathbf{N},\mathbf{u}\right\rangle-4\beta\mathrm{tr}\!\left(\mathbf{u}\right)\right]+\!\left[\tilde{R}\!\left(\zeta_{2}+2\lambda\right)-\tilde{\eta}\!\left(\zeta_{6}+8\beta\right)+\!\left(2\tilde{P}\left\langle\mathbf{N},\bm{\sigma}^{1}\right\rangle+2\tilde{R}-2\tilde{\eta}\right)\!\left(\zeta_{5}+4\beta\right)\right]$	(C.90d)
$\displaystyle 2\tilde{\mu}\eta_{5}$	$\displaystyle=2\tilde{\mu}\!\left[\!\left(\zeta_{5}+2\alpha\right)\mathrm{tr}\!\left(\mathbf{u}\right)+\zeta_{7}\left\langle\mathbf{N},\mathbf{u}\right\rangle\right]+\!\left[\tilde{R}\!\left(\zeta_{3}+4\mu\right)-2\alpha\!\left(2\tilde{R}-4\tilde{\eta}+2\tilde{P}\left\langle\mathbf{N},\bm{\sigma}^{1}\right\rangle\right)\right]$	(C.90e)
$\displaystyle 2\tilde{\mu}\eta_{6}$	$\displaystyle=\zeta_{3}+4\mu$	(C.90f)
$\displaystyle 2\tilde{\mu}\eta_{7}$	$\displaystyle=\zeta_{2}+2\lambda$	(C.90g)
$\displaystyle 2\tilde{\mu}\eta_{8}$	$\displaystyle=\zeta_{6}+8\beta$	(C.90h)
$\displaystyle 2\tilde{\mu}\eta_{9}$	$\displaystyle=\tilde{P}\!\left(\zeta_{2}+2\lambda\right)+\!\left(1+\tilde{P}\right)\!\left(\zeta_{5}+4\beta\right)$	(C.90i)
$\displaystyle 2\tilde{\mu}\eta_{10}$	$\displaystyle=\tilde{P}\!\left(\zeta_{6}+8\beta-4\alpha\right)+\!\left(1+\tilde{P}\right)\!\left(\zeta_{7}+8\gamma\right)$	(C.90j)
$\displaystyle 2\tilde{\mu}\eta_{11}$	$\displaystyle=\tilde{P}\!\left(\zeta_{3}+4\mu-2\alpha\right)$	(C.90k)
$\displaystyle 2\tilde{\mu}\eta_{12}$	$\displaystyle=-2\alpha$	(C.90l)
$\displaystyle 2\tilde{\mu}\eta_{13}$	$\displaystyle=8\mu\tilde{\mu}$	(C.90m)
$\displaystyle 2\tilde{\mu}\eta_{14}$	$\displaystyle=8\beta\tilde{\mu}-q^{0}\!\left(1+\tilde{P}\right)\!\left(\zeta_{2}+\zeta_{5}+2\lambda+4\beta\right)$	(C.90n)
$\displaystyle 2\tilde{\mu}\eta_{15}$	$\displaystyle=16\gamma\tilde{\mu}-q^{0}\!\left(1+\tilde{P}\right)\!\left(\zeta_{6}+\zeta_{7}-4\alpha+8\beta+8\gamma\right)$	(C.90o)
$\displaystyle 2\tilde{\mu}\eta_{16}$	$\displaystyle=-\!\left[4\alpha\tilde{\mu}+q^{0}\!\left(1+\tilde{P}\right)\!\left(\zeta_{3}+4\mu-2\alpha\right)\right],$	(C.90p)

definitions which should in turn be combined with the earlier definitions eqs.(C.63,C.64,C.69,C.72,C.76,C.3). The perturbation to the elastic tensor is given in terms of

(1).

the transversely-isotropic constants $\lambda$ , $\mu$ , $\alpha$ , $\beta$ and $\gamma$ , as well as $\bm{\nu}$ ;
(2).

the constants $p^{0}$ and $q^{0}$ which parametrise the zeroth-order equilibrium stress;
(3).

the seven constants $\!\left\{\zeta_{i}\right\}$ defined in eq.(C.3), which parametrise the third derivatives of a transversely-isotropic strain-energy function about equilibrium;
(4).

the small induced stress $\bm{\sigma}^{1}$ ;
(5).

the arbitrary 2-parameter antisymmetric matrix $\bm{\omega}$ .

Appendix D The isotropic elastic moduli, their pressure-derivatives, and wave speeds

Working under the assumption of isotropic background states, this section compares our linearised theory with the theories of Dahlen (1972b) and Tromp & Trampert (2018). We demonstrate that the theories are compatible for certain choices of parameter and, moreover, that our parameters $c$ and $d$ are not readily comparable with Tromp & Trampert’s $\kappa^{\prime}$ and $\mu^{\prime}$ . For this section we will work in the notation of Section 1, using the Lamé parameter $\lambda$ interchangeably with $\kappa$ .

Recall from Section 1 that the most general isotropic elastic tensor $\bm{\Xi}$ is written, upon inducing an incremental stress, as

	$\displaystyle\Xi_{ijkl}$	$\displaystyle=\!\left(\kappa-\frac{2}{3}\mu+\Delta p^{0}c\right)\delta_{ij}\delta_{kl}+\!\left(\mu+\Delta p^{0}d\right)\!\left(\delta_{ik}\delta_{jl}+\delta_{il}\delta_{jk}\right)$
		$\displaystyle\qquad+a\!\left(\delta_{ij}\Delta\tau^{0}_{kl}+\delta_{kl}\Delta\tau^{0}_{ij}\right)+b\!\left(\delta_{ik}\Delta\tau^{0}_{jl}+\delta_{il}\Delta\tau^{0}_{jk}+\delta_{jk}\Delta\tau^{0}_{il}+\delta_{jl}\Delta\tau^{0}_{ik}\right)$		(D.1)

and that we then obtained Tromp & Trampert’s $\bm{\Xi}$ by setting

	$\displaystyle c$	$\displaystyle=-2a$		(D.2)
	$\displaystyle d$	$\displaystyle=-2b$		(D.3)

and

	$\displaystyle a$	$\displaystyle=\frac{1}{2}+\frac{1}{3}\mu^{\prime}-\frac{1}{2}\kappa^{\prime}$		(D.4)
	$\displaystyle b$	$\displaystyle=-\frac{1}{2}-\frac{1}{2}\mu^{\prime}$		(D.5)

with the constants $\kappa^{\prime}$ and $\mu^{\prime}$ interpreted as adiabatic pressure-derivatives of the elastic moduli. These equations can be considered as a four-dimensional system


$\displaystyle c$	$\displaystyle=\lambda^{\prime}-1$	(D.6a)
$\displaystyle d$	$\displaystyle=\mu^{\prime}+1$	(D.6b)
$\displaystyle c$	$\displaystyle=-2a$	(D.6c)
$\displaystyle d$	$\displaystyle=-2b$	(D.6d)

parametrised by $\lambda^{\prime}$ and $\mu^{\prime}$ , where

\displaystyle\lambda^{\prime}\equiv\kappa^{\prime}-\frac{2}{3}\mu^{\prime}.

(D.7)

But in eqs.(4.25) we showed that $a$ , $b$ , $c$ and $d$ are given in terms of the three Murnaghan constants. Under what conditions are the two theories compatible?

Eqs.(D.6c,D.6d) just constrain eqs.(4.25), effectively fixing two of the Murnaghan constants. Let us define two shorthands

	$\displaystyle y$	$\displaystyle=\frac{\lambda+p^{0}}{\mu-p^{0}}$		(D.8)
	$\displaystyle z$	$\displaystyle=\frac{2\lambda+\mu+p^{0}}{\mu-p^{0}}.$		(D.9)

The explicit forms of eqs.(4.25) can then be used to write eqs.(D.6c,D.6d) as

\displaystyle\begin{pmatrix}1&-y&0\\ 0&1&-y\end{pmatrix}\begin{pmatrix}\zeta_{1}\\ \zeta_{2}\\ \zeta_{3}\end{pmatrix}=z\begin{pmatrix}\lambda\\ 2\mu\end{pmatrix},

(D.10)

after which we may solve for $\zeta_{2,3}$ in terms of $\zeta_{1}$ , finding that

\displaystyle\begin{pmatrix}\zeta_{2}\\ \zeta_{3}\end{pmatrix}=\frac{1}{y^{2}}\begin{pmatrix}-y&0\\ 1&-y\end{pmatrix}\begin{pmatrix}z&0&-1\\ 0&2z&0\end{pmatrix}\begin{pmatrix}\lambda\\ \mu\\ \zeta_{1}\end{pmatrix}.

(D.11)

This expression is written in a rather strange way, but we have found it highly amenable to algebraic manipulation. One free parameter, $\zeta_{1}$ , remains with which to satisfy eqs.(D.6a,D.6b). Using eqs.(4.25) once again, after some algebra we find from eqs.(D.6a,D.6b) that the constants $\lambda^{\prime}$ and $\mu^{\prime}$ are given by

\displaystyle\begin{pmatrix}\lambda^{\prime}\\ \mu^{\prime}\end{pmatrix}=\begin{pmatrix}1\\ -1\end{pmatrix}-\frac{1}{3\kappa+p^{0}}\begin{pmatrix}1-2\frac{z}{y}&0&3+\frac{2}{y}\\ -\!\left(\frac{3}{2}y+1\right)\frac{z}{y^{2}}&1-2\frac{z}{y}&\!\left(\frac{3}{2}y+1\right)\frac{1}{y^{2}}\end{pmatrix}\begin{pmatrix}\lambda\\ \mu\\ \zeta_{1}\end{pmatrix}.

(D.12)

As $\zeta_{1}$ is arbitrary, this shows that there exists a one-parameter family of values of $\lambda^{\prime}$ and $\mu^{\prime}$ whereby both eqs.(D.6) and eqs.(4.25) are satisfied and Tromp & Trampert’s theory thus agrees with ours. Moreover, this will hold for any physically allowed $\lambda$ , $\mu$ and $p^{0}$ . It formalises our earlier assertion that their theory is valid for a certain class of isotropic materials. The theory of Dahlen (1972b) is formally obtained from that of Tromp & Trampert (2018) by setting $\kappa^{\prime}=\mu^{\prime}=0$ . However, now that two free parameters have been removed, eq.(D.12) reduces to the expression

\displaystyle f\!\left(\lambda,\mu,p^{0}\right)=0

(D.13)

for some real-valued function $f$ , and the elastic moduli themselves are constrained.

One small puzzle remains. From looking at eq.(D), our two constants $c$ and $d$ might be thought to represent the pressure-derivatives of the elastic moduli, but eqs.(D.6a,D.6b) make clear that they cannot be equivalent to Tromp & Trampert’s $\kappa^{\prime}$ and $\mu^{\prime}$ . In order to better understand this discrepancy we should focus on the pressure dependence of P- and S-wave speeds. We can do that by neglecting deviatoric stress increments and considering the Christoffel operator (defined in eq.2.27 and written here with $\hat{\mathbf{k}}$ substituted for $\hat{\mathbf{p}}$ )

\displaystyle\rho B_{jl}

\displaystyle=\!\left[\!\left(\kappa+\frac{1}{3}\mu\right)+\Delta p^{0}\!\left(c+d\right)\right]\hat{k}_{j}\hat{k}_{l}+\!\left[\mu-\Delta p^{0}\!\left(1-d\right)\right]\delta_{jl}+\!\left(\text{deviatoric}\right).

(D.14)

This gives wave speeds of

	$\displaystyle\rho c_{P}^{2}$	$\displaystyle=\!\left[\!\left(\lambda+\Delta p^{0}c\right)+2\!\left(\mu+\Delta p^{0}d\right)\right]-\Delta p^{0}$		(D.15)
	$\displaystyle\rho c_{S}^{2}$	$\displaystyle=\!\left[\mu+\Delta p^{0}d\right]-\Delta p^{0}.$		(D.16)

The wave speeds’ dependence on $\Delta p^{0}$ comes not only from terms in $c$ and $d$ that belong to $\bm{\Xi}$ , but also from the term $\Delta T^{0}_{ik}\delta_{jl}$ that is added to $\bm{\Xi}$ in order to form $\bm{\Lambda}$ (and hence the Christoffel operator). The construction of the theories of Dahlen, Tromp and Trampert is such that the hydrostatic contribution of $\Delta T^{0}_{ik}\delta_{jl}$ is always cancelled out. For example, the components of Tromp & Trampert’s $\bm{\Lambda}$ are found to be

\displaystyle\Lambda_{ijkl}=\!\left(\lambda+\Delta p^{0}\lambda^{\prime}\right)\delta_{ij}\delta_{kl}+\!\left(\mu+\Delta p^{0}\mu^{\prime}\right)\!\left(\delta_{ik}\delta_{jl}+\delta_{il}\delta_{jk}\right)+\Delta p^{0}\!\left(\delta_{il}\delta_{jk}-\delta_{ij}\delta_{kl}\right)+\!\left(\text{deviatoric}\right).

(D.17)

The third term vanishes upon contraction with $\hat{k}_{i}\hat{k}_{k}$ and can therefore be neglected when forming the Christoffel operator. This leaves us with the neatly defined elastic moduli and derivatives within the first two terms, and we ultimately arrive at Tromp & Trampert’s “quasi-classical” expressions for the wave speeds

	$\displaystyle\rho c_{P}^{2}$	$\displaystyle=\!\left(\lambda+\lambda^{\prime}\Delta p^{0}\right)+2\!\left(\mu+\mu^{\prime}\Delta p^{0}\right)$		(D.18)
	$\displaystyle\rho c_{S}^{2}$	$\displaystyle=\!\left(\mu+\mu^{\prime}\Delta p^{0}\right).$		(D.19)

In effect, by using this definition of the elastic moduli’s pressure derivatives one is “redefining” the elastic moduli in order to subsume the pressure-dependence introduced by $\Delta T^{0}_{ik}\delta_{jl}$ .

On the stress dependence of the elastic tensor

keywords:

1 Introduction

1.1 A heuristic linearised theory of the elastic tensor’s stress dependence

1.1.1 Preliminaries

1.1.2 Isotropic materials

1.1.3 Anisotropic materials

1.2 Aims of this paper

2 A review of elasticity

2.1 Basic definitions and results

2.1.1 Equations of motion

2.1.2 Constitutive relations

2.1.3 Linearisation of the equations of motion

2.2 Transformation of the reference configuration

2.2.1 Particle-relabelling

2.2.2 Natural reference configurations

2.3 Material symmetries

3 Functional dependence of the elastic tensor on equilibrium stress

3.1 Parametrised dependence on the equilibrium configuration

3.2 Parametrised dependence on equilibrium stress

3.3 Linearisation

4 Examples

4.1 Isotropic materials

4.1.1 Exact response to deviatoric stress

4.1.2 Exact response to hydrostatic stress

4.1.3 Linearised response to small stress

4.2 Transversely-isotropic materials

4.2.1 Exact response to deviatoric stress

4.2.2 Linearised response to deviatoric stress

5 Discussion

6 Conclusions

Acknowledgements.

References

Appendix A Notations and definitions

A.1 Groups

A.2 Some nonstandard linear operators

A.3 Example: Differentiation of the strain-energy functions WW and VV

A.4 Mapping between different notation conventions

Appendix B The invertibility of 𝚺^\hat{\bm{\Sigma}}

Appendix C Calculation of the linearised elastic tensor

C.1 General expressions

C.2 Isotropic, hydrostatically pre-stressed background

C.3 Transversely-isotropic, ‘quasi-hydrostatically’ pre-stressed background

Appendix D The isotropic elastic moduli, their pressure-derivatives, and wave speeds

A.3 Example: Differentiation of the strain-energy functions $W$ and $V$

Appendix B The invertibility of $\hat{\bm{\Sigma}}$