This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

[subfigure]position=top

Symmetry analysis of strain, electric and magnetic fields in the Bi2Se3\text{Bi}_{2}\text{Se}_{3}-class of topological insulators

Mathias Rosdahl Jensen Technical University of Denmark, Department of Photonics Engineering, Kgs-Lyngby, 2800, Denmark    Jens Paaske Center for Quantum Devices, Niels Bohr Institute, University of Copenhagen, Universitetsparken 5, DK-2100 Copenhagen, Denmark    Anders Mathias Lunde Center for Quantum Devices, Niels Bohr Institute, University of Copenhagen, Universitetsparken 5, DK-2100 Copenhagen, Denmark    Morten Willatzen Beijing Institute of Nanoenergy and Nanosystems, Chinese Academy of Sciences, Beijing, China
and Technical University of Denmark, Department of Photonics Engineering, Kgs-Lyngby, 2800, Denmark
Abstract

Based on group theoretical arguments we derive the most general Hamiltonian for the Bi2Se3\text{Bi}_{2}\text{Se}_{3}-class of materials including terms to third order in the wave vector, first order in electric and magnetic fields, first order in strain and first order in both strain and wave vector. We determine analytically the effects of strain on the electronic structure of Bi2Se3\text{Bi}_{2}\text{Se}_{3}. For the most experimentally relevant surface termination we analytically derive the surface state spectrum, revealing an anisotropic Dirac cone with elliptical constant energy counturs giving rise to different velocities in different in-plane directions. The spin-momentum locking of strained Bi2Se3\text{Bi}_{2}\text{Se}_{3} is shown to be modified and for some strain configurations we see a non-zero spin component perpendicular to the surface. Hence, strain control can be used to manipulate the spin degree of freedom via the spin-orbit coupling. We show that for a thin film of Bi2Se3\text{Bi}_{2}\text{Se}_{3} the surface state band gap induced by coupling between the opposite surfaces changes opposite to the bulk band gap under strain. Tuning the surface state band gap by strain, gives new possibilities for the experimental investigation of the thickness dependent gap and optimization of optical properties relevant for, e.g., photodetector and energy harvesting applications. We finally derive analytical expressions for the effective mass tensor of the Bi2Se3 class of materials as a function of strain and electric field.

I Introduction

Topological insulators have an inverted band gap which engenders topologically protected surface states (SS). Exhibiting linear dispersion, the electrons at the surface resemble massless helical Dirac fermions, with spin locked to the momentum. The prime examples of three-dimensional topological insulators are among Bi2Se3\text{Bi}_{2}\text{Se}_{3}-class of materials Zhang et al. (2009). This class, also known as the tetradymite group, contains compounds M2X3\text{M}_{2}\text{X}_{3} where M is either Bi or Sb and X is a combination of Se, S and Te. The crystal structure consists of unit layers of five atomic layers, so-called quintuple layers. For the simplest case of a (111)(111) surface termination, where the surface is parallel to the quintuple layer, the Dirac cone is fully isotropic for small in-plane momentum, perturbed only by a hexagonal warping to third order in momentumFu (2009). For other surface terminations the Dirac cone of the surface states becomes anisotropic with elliptical curves of constant energy, as reported for the (221)(221) surface in Ref. Moon et al., 2011. Anisotropic Dirac fermions have interesting transport properties due to the different group velocity in different directions Trescher et al. (2015), and have attracted attention in other Dirac materials Feng et al. (2014); Lu et al. (2016).

For both fundamental research and applications it is interesting to be able to tune the physical properties of these materials. One way to do this is by the application of strain. It was reported in Refs. Luo et al., 2012; Young et al., 2011 that strain strongly affects the band gap at the Γ\Gamma point of bulk Bi2Se3\text{Bi}_{2}\text{Se}_{3}, and could even close the gap at large strain values, i.e. induce a topological phase transition. Whether this is possible to achieve in real materials is not known, but a recent study points to the feasibility of strain induced effects in Bi2Se3\text{Bi}_{2}\text{Se}_{3} where strain up to 3%3\% was induced by lattice mismatch Li et al. (2017).

In this paper, we use the method of invariants Lew Yan Voon and Willatzen (2009) to derive the most general Hamiltonian at the Γ\Gamma point to third order in wave vector, first order in strain, first order in electric and magnetic fields as well as terms to first order in both wave vector and strain. The allowed third order terms in the wave vector include three terms that were neglected in a previous analysis Liu et al. (2010a). Since this model is based solely on the symmetry of the crystal, it is valid for all materials in the Bi2Se3\text{Bi}_{2}\text{Se}_{3} class.

Refer to caption
Figure 1: For unstrained Bi2Se3\text{Bi}_{2}\text{Se}_{3} the linear dispersion of the helical surface states is isotropic in the plane to linear order in the wave vector. In the case of strain, the in plane rotational symmetry is broken by terms to first order in both strain and wave vector leading to a Dirac cone with elliptical contours of constant energy, here shown for ϵxy=5%\epsilon_{xy}=5\%. Hence, the group velocity of the surface electrons becomes dependent on the direction. The broken rotational symmetry also allows for a spin component perpendicular to the surface, as shown by the red arrows.

Using this model, we determine analytical expressions for the modified bulk band structure and the effective mass tensor near the Γ\Gamma-point. Changes in the effective mass tensor with strain and electric field are important for transport and optical properties Jensen et al. . Comparing the strain dependence of the band gap to recent density functional theory (DFT) calculations, allows us to determine some of the strain related parameters in the band structure. From this bulk spectrum, we go on to investigate the effects of strain on the surface states of a semi-infinite topological insulator. We analytically derive the spectrum of the surface states, applying hard-wall boundary conditions at the surface for a semi-infinite topological insulator with a (111) surface termination. In contrast to the unstrained case, the full in-plane rotational symmetry to linear order in wave vector is broken by terms linear in both strain and wave vector, leading to an anisotropic Dirac spectrum. The spin expectation values of the SS are calculated, and it is shown that the spin-momentum locking is affected by strain, revealing a non-zero spin component perpendicular to the surface. The modified Dirac cone and spin structure of the SS is shown in Fig. 1. Finally, we show that strain affects the localization of the SS wave functions and thereby the band gap arising in thin films due to the coupling between SS on opposite surfaces. We show that the SS band gap changes oppositely to the bulk band gap under strain: increasing the bulk band gap, localizes the SS and decreases the SS band gap, and vice versa. This shows that not only the bulk, but also the SS band gap can be tuned via strain.

II Model Hamiltonian

In line with Ref. Liu et al., 2010a, we wish to derive a 4-band model Hamiltonian describing the Bi2Se3\text{Bi}_{2}\text{Se}_{3} class, which now includes all allowed terms to orders cubic in the wave vector 𝐤\mathbf{k}, linear in the strain tensor ϵij\epsilon_{ij}, linear in magnetic field 𝐁\mathbf{B} or electric field 𝐄\mathbf{E}, and linear in both 𝐤\mathbf{k} and ϵij\epsilon_{ij}.

In general any 4×44\times 4 Hamiltonian can be written in terms of Dirac Γ\Gamma matrices defined in terms of the Pauli matrices by:

Γ1=σ1τ1,Γ2=σ2τ1,Γ3=σ3τ1,\displaystyle\Gamma_{1}=\sigma_{1}\otimes\tau_{1},\quad\Gamma_{2}=\sigma_{2}\otimes\tau_{1},\quad\Gamma_{3}=\sigma_{3}\otimes\tau_{1},
Γ4=σ0τ2,Γ5=σ0τ3,\displaystyle\Gamma_{4}=\sigma_{0}\otimes\tau_{2},\quad\Gamma_{5}=\sigma_{0}\otimes\tau_{3}, (1)

and their commutators Γij=12i[Γi,Γj]\Gamma_{ij}=\tfrac{1}{2i}[\Gamma_{i},\Gamma_{j}]. A general 4×44\times 4 matrix can then be written:

Hm=ε𝐈+i=15diΓi+i,jdijΓij,\displaystyle H_{m}=\varepsilon\mathbf{I}+\sum_{i=1}^{5}d_{i}\Gamma_{i}+\sum_{i,j}d_{ij}\Gamma_{ij}, (2)

where the coefficients ε,di,dij\varepsilon,d_{i},d_{ij} must be real to ensure hermiticity. In the present case the coefficients are functions of 𝐤\mathbf{k}, ϵij\epsilon_{ij}, 𝐁\mathbf{B} and 𝐄\mathbf{E}.

Here we use the basis |P1+,12|P1_{-}^{+},\tfrac{1}{2}\rangle, i|P2+,12-i|P2_{+}^{-},\tfrac{1}{2}\rangle, |P1+,12|P1_{-}^{+},-\tfrac{1}{2}\rangle, i|P2+,12i|P2_{+}^{-},-\tfrac{1}{2}\rangle, derived from the pp orbitals of the Bi and Se atoms. The upper sign denotes the inversion eigenvalue and the ±12\pm\tfrac{1}{2} the total angular momentum. For a more complete discussion see Ref. Liu et al., 2010a. In this basis the σ\sigma matrices do not represent spin, but are related to spin by:

sx=σ1τ32sy=σ2τ32sz=σ3τ02,\displaystyle s_{x}=\frac{\sigma_{1}\otimes\tau_{3}}{2}\quad s_{y}=\frac{\sigma_{2}\otimes\tau_{3}}{2}\quad s_{z}=\frac{\sigma_{3}\otimes\tau_{0}}{2}, (3)

as noted in Ref. Silvestrov et al., 2012.

In this basis, the transformation operators corresponding to the symmetries of the crystal are inversion I=σ0τ3I=\sigma_{0}\otimes\tau_{3}, three-fold rotation around the zz-axis R3=eiπσ3/3τ0R_{3}=e^{i\pi\sigma_{3}/3}\otimes\tau_{0}, two fold rotation around the xx-axis R2=iσ1τ3R_{2}=i\sigma_{1}\otimes\tau_{3} and time-reversal T=i(σ2τ0)KT=i(\sigma_{2}\otimes\tau_{0})K, where KK is the complex conjugation operator. Now the Γ\Gamma-matrices can be characterized by their irreducible representation under the group D3D_{3} as well as their eigenvalues under II and TT.

II.1 Group theory

Next, we construct polynomials in wave vector, strain, electric and magnetic fields transforming according to the irreducible representations of the group D3dD_{3d}. This group is a direct product group of D3D_{3} with the group of the inversion operator. Here we will consider D3D_{3} separately and simply add inversion eigenvalues as well as time-reversal eigenvalues. The group D3D_{3} has three irreducible representations: Γ(1)\Gamma^{(1)}, which is the trivial representation, Γ(2)\Gamma^{(2)}, which is one-dimensional as well and the two-dimensional representation Γ(3)\Gamma^{(3)}.

\otimes Γ(1)\Gamma^{(1)} Γ(2)\Gamma^{(2)} Γ(3)\Gamma^{(3)}
Γ(1)\Gamma^{(1)} Γ(1)\Gamma^{(1)} Γ(2)\Gamma^{(2)} Γ(3)\Gamma^{(3)}
Γ(2)\Gamma^{(2)} Γ(2)\Gamma^{(2)} Γ(1)\Gamma^{(1)} Γ(3)\Gamma^{(3)}
Γ(3)\Gamma^{(3)} Γ(3)\Gamma^{(3)} Γ(3)\Gamma^{(3)} Γ(1)Γ(2)Γ(3)\Gamma^{(1)}\oplus\Gamma^{(2)}\oplus\Gamma^{(3)}
(a)
u(2)v1(3)u^{(2)}v^{(3)}_{1} u(2)v2(3)u^{(2)}v^{(3)}_{2}
ψ1(3)\psi^{(3)}_{1} i 0
ψ2(3)\psi^{(3)}_{2} 0 -i
(b)
u1(3)v1(3)u^{(3)}_{1}v^{(3)}_{1} u1(3)v2(3)u^{(3)}_{1}v^{(3)}_{2} u2(3)v1(3)u^{(3)}_{2}v^{(3)}_{1} u2(3)v2(3)u^{(3)}_{2}v^{(3)}_{2}
ψ(1)\psi^{(1)} 0 12\frac{1}{\sqrt{2}} 12\frac{1}{\sqrt{2}} 0
ψ(2)\psi^{(2)} 0 i2\frac{i}{\sqrt{2}} i2\frac{-i}{\sqrt{2}} 0
ψ1(3)\psi^{(3)}_{1} 0 0 0 1
ψ2(3)\psi^{(3)}_{2} 1 0 0 0
(c)
Table 1: Multiplication table for the irreducible representations of the group D3dD_{3d}, (a), and the coupling constants for the basis functions of the relevant products Γ(2)Γ(3)\Gamma^{(2)}\otimes\Gamma^{(3)}, (b) and Γ(3)Γ(3)\Gamma^{(3)}\otimes\Gamma^{(3)}, (c). Note that the multiplication table is completely general since it is in principle concerning equivalence classes of representations, while the coupling constants are only valid for a specific representation, here the representation with basis functions {k+,k}\{k_{+},k_{-}\}. Adopted from Ref. Koster et al., 1963.

The wave vector component kzk_{z} transforms according to the one-dimensional representation Γ(2)\Gamma^{(2)}, i.e. it only changes by multiplication of a constant under the transformations of D3D_{3}. The in-plane components, kxk_{x} and kyk_{y}, transform according to the two-dimensional irreducible representation Γ(3)\Gamma^{(3)}. Here we will use the basis {k+,k}\{k_{+},k_{-}\}, where k±=kx±ikyk_{\pm}=k_{x}\pm ik_{y}. As an example we will show how to construct the second order terms transforming according to irreducible representations including only in-plane momentum. First we note that

Γ(3)Γ(3)=Γ(1)Γ(2)Γ(3),\displaystyle\Gamma^{(3)}\otimes\Gamma^{(3)}=\Gamma^{(1)}\oplus\Gamma^{(2)}\oplus\Gamma^{(3)}, (4)

i.e. the second order terms of in-plane momenta transform according to a representation equivalent to the sum of these three irreducible representations. The basis functions for the irreducible representations can be formed using Table LABEL:sub@table:coupling33, with {u1,u2}={v1,v2}={k+,k}\{u_{1},u_{2}\}=\{v_{1},v_{2}\}=\{k_{+},k_{-}\}, and we get:

Γ(1)\displaystyle\Gamma^{(1)} :ψ(1)=12k+k+12kk+k||2,\displaystyle\mathrel{\mathop{\ordinarycolon}}\quad\psi^{(1)}=\frac{1}{\sqrt{2}}k_{+}k_{-}+\frac{1}{\sqrt{2}}k_{-}k_{+}\propto k_{||}^{2}, (5)
Γ(2)\displaystyle\Gamma^{(2)} :ψ(2)=i2k+k+i2kk+=0,\displaystyle\mathrel{\mathop{\ordinarycolon}}\quad\psi^{(2)}=\frac{i}{\sqrt{2}}k_{+}k_{-}+\frac{-i}{\sqrt{2}}k_{-}k_{+}=0, (6)
Γ(3)\displaystyle\Gamma^{(3)} :{ψ1(3),ψ2(3)}={k2,k+2},\displaystyle\mathrel{\mathop{\ordinarycolon}}\quad\{\psi^{(3)}_{1},\psi^{(3)}_{2}\}=\{k_{-}^{2},k_{+}^{2}\}, (7)

where we have defined k||2=kx2+ky2k_{||}^{2}=k_{x}^{2}+k_{y}^{2}. Since {u1,u2}={v1,v2}\{u_{1},u_{2}\}=\{v_{1},v_{2}\} the Γ(2)\Gamma^{(2)} term is simply 0, and we have 3 independent second order terms in the in-plane momenta: k||2k_{||}^{2}, which is invariant, and the pair {k2,k+2}\{k_{-}^{2},k_{+}^{2}\} transforming according to the two-dimensional irreducible representation Γ(3)\Gamma^{(3)}. The inversion and time-reversal eigenvalues follow the simple rule ±±=+\pm\otimes\pm=+ and ±=\pm\otimes\mp=-. Proceeding like this we can go to any desired order, and include external fields as well. We note that the strain tensor ϵij\epsilon_{ij} transforms as kikjk_{i}k_{j}Bir and Pikus (1974). This result follows from the definition of strain

ϵij\displaystyle\epsilon_{ij} =12(uixj+ujxi),\displaystyle=\frac{1}{2}\left(\frac{\partial u_{i}}{\partial x_{j}}+\frac{\partial u_{j}}{\partial x_{i}}\right), (8)

where 𝐮{\bf u} is the displacement vector. The resulting irreducible basis functions are summarized in Table 2.

TT II Γ(1)\Gamma^{(1)} Γ(2)\Gamma^{(2)} Γ(3)\Gamma^{(3)}
dim=1 dim=1 dim=2
- - Re(k+3)Im(k+ϵz)Re(k+ϵ+)Γ4\begin{matrix}\operatorname{Re}(k_{+}^{3})\\ \operatorname{Im}(k_{+}\epsilon_{z-})\\ \operatorname{Re}(k_{+}\epsilon_{+})\\ \Gamma_{4}\end{matrix} kzkz3k||2kzIm(k+3)ϵzzkzϵ||kzRe(k+ϵz)Im(k+ϵ+)Γ3\begin{matrix}k_{z}\\ k_{z}^{3}\\ k_{||}^{2}k_{z}\\ \operatorname{Im}(k_{+}^{3})\\ \epsilon_{zz}k_{z}\\ \epsilon_{||}k_{z}\\ \operatorname{Re}(k_{+}\epsilon_{z-})\\ \operatorname{Im}(k_{+}\epsilon_{+})\\ \Gamma_{3}\end{matrix} {k+,k}k||2{k+,k}kz2{k+,k}{ikzk2,ikzk+2}ϵzz{k+,k}ϵ||{k+,k}kz{ϵz+,ϵz}kz{iϵ,iϵ+}{ikϵz,ik+ϵz+}{kϵ+,k+ϵ}{Γ+1,2,Γ1,2}\begin{matrix}\{k_{+},k_{-}\}\\ k_{||}^{2}\{k_{+},k_{-}\}\\ k_{z}^{2}\{k_{+},k_{-}\}\\ \{ik_{z}k_{-}^{2},-ik_{z}k_{+}^{2}\}\\ \epsilon_{zz}\{k_{+},k_{-}\}\\ \epsilon_{||}\{k_{+},k_{-}\}\\ k_{z}\{\epsilon_{z+},\epsilon_{z-}\}\\ k_{z}\{i\epsilon_{-},-i\epsilon_{+}\}\\ \{ik_{-}\epsilon_{z_{-}},-ik_{+}\epsilon_{z+}\}\\ \{k_{-}\epsilon_{+},k_{+}\epsilon_{-}\}\\ \{\Gamma_{+}^{1,2},\Gamma_{-}^{1,2}\}\end{matrix}
+ + kz2k||2ϵzzϵ||Γ5\begin{matrix}k_{z}^{2}\\ k_{||}^{2}\\ \epsilon_{zz}\\ \epsilon_{||}\\ \Gamma_{5}\end{matrix} {ikzk+,ikzk}{k2,k+2}{iϵz+,iϵz}{ϵ,ϵ+}\begin{matrix}\{ik_{z}k_{+},-ik_{z}k_{-}\}\\ \{k_{-}^{2},k_{+}^{2}\}\\ \{i\epsilon_{z+},-i\epsilon_{z-}\}\\ \{\epsilon_{-},\epsilon_{+}\}\end{matrix}
+ - Γ45\begin{matrix}\Gamma_{45}\end{matrix} EzΓ35\begin{matrix}E_{z}\\ \Gamma_{35}\end{matrix} {E+,E}{iΓ+25,51,iΓ25,51}\begin{matrix}\{E_{+},E_{-}\}\\ \{i\Gamma_{+}^{25,51},-i\Gamma_{-}^{25,51}\}\end{matrix}
- + BzΓ12Γ34\begin{matrix}B_{z}\\ \Gamma_{12}\\ \Gamma_{34}\end{matrix} {B+,B}{Γ+13,23,Γ23,31}{Γ+14,24,Γ14,24}\begin{matrix}\{B_{+},B_{-}\}\\ \{\Gamma_{+}^{13,23},\Gamma_{-}^{23,31}\}\\ \{\Gamma_{+}^{14,24},\Gamma_{-}^{14,24}\}\end{matrix}
Table 2: Polynomials of 𝐤\mathbf{k}, strain and magnetic fields and the Γ\Gamma matrices under the transformations of the group D3D_{3}, inversion and time reversal. To simplify the notation we have introduced ϵ||=ϵxx+ϵyy\epsilon_{||}=\epsilon_{xx}+\epsilon_{yy}, ϵ±=ϵxxϵyy±2iϵxy\epsilon_{\pm}=\epsilon_{xx}-\epsilon_{yy}\pm 2i\epsilon_{xy}, ϵz±=ϵzx±iϵzy\epsilon_{z\pm}=\epsilon_{zx}\pm i\epsilon_{zy}. In the 2-dimensional representations we have changed basis to Γ±i,j=Γi±iΓj\Gamma^{i,j}_{\pm}=\Gamma_{i}\pm i\Gamma_{j}. These have been constructed such that they are real, and each pair is hermitian conjugates of each other and the pairs transform like {k+,k}\{k_{+},k_{-}\} under the group D3D_{3}.

II.2 Model Hamiltonian

For the Hamiltonian to be invariant under the group D3D_{3} it can only contain terms from the invariant representation, Γ(1)\Gamma^{(1)}. As seen in the multiplication Table LABEL:sub@table:multiplicationtable, Γ(1)\Gamma^{(1)} terms can only be formed by combining terms from the same irreducible representation. For invariance under inversion and time-reversal, we must combine terms and matrices with the same eigenvalues, i.e. from the same cell in Table 2. For the Γ(3)\Gamma^{(3)} representation we can use Table LABEL:sub@table:coupling33 to construct an invariant term, Γ(1)\Gamma^{(1)} and Γ(2)\Gamma^{(2)} are 1 dimensional so we simply take the product.

For example, the terms {ikzk2,ikzk+2}\{ik_{z}k_{-}^{2},-ik_{z}k_{+}^{2}\} belong to the irreducible representation Γ(3)\Gamma^{(3)} and are odd under both TR and I. The pair of matrices {Γ+1,2,Γ1,2}\{\Gamma^{1,2}_{+},\Gamma^{1,2}_{-}\}, defined as Γ±i,j=Γi±iΓj\Gamma^{i,j}_{\pm}=\Gamma_{i}\pm i\Gamma_{j}, transform exactly the same way, and using Table LABEL:sub@table:coupling33 we can therefore combine them as follows:

Γ(1):ψ(1)\displaystyle\Gamma^{(1)}\mathrel{\mathop{\ordinarycolon}}\psi^{(1)} =12ikzk2Γ1,212ikzk+2Γ+1,2\displaystyle=\frac{1}{\sqrt{2}}ik_{z}k_{-}^{2}\Gamma^{1,2}_{-}-\frac{1}{\sqrt{2}}ik_{z}k_{+}^{2}\Gamma^{1,2}_{+}
kz(ik2Γ1,2ik+2Γ+1,2).\displaystyle\propto k_{z}(ik_{-}^{2}\Gamma^{1,2}_{-}-ik_{+}^{2}\Gamma^{1,2}_{+}). (9)

This forms an invariant, and thus it is an allowed term in the Hamiltonian. Any multiple of this term is of course also an invariant, and we get a free parameter in front, which we denote Z3Z_{3}. Proceeding this way we achieve the following Hamiltonian, for now excluding electric and magnetic fields:

Hm=0\displaystyle H_{m}=\mathcal{E}_{0} +(βiki0αikiβikiαiki00αikiβikiαiki0βiki)\displaystyle+\begin{pmatrix}\mathcal{M}&\beta^{*}_{i}k_{i}&0&\alpha^{*}_{i}k_{i}\\ \beta_{i}k_{i}&-\mathcal{M}&\alpha^{*}_{i}k_{i}&0\\ 0&\alpha_{i}k_{i}&\mathcal{M}&-\beta_{i}k_{i}\\ \alpha_{i}k_{i}&0&-\beta^{*}_{i}k_{i}&-\mathcal{M}\end{pmatrix} (10)
Z1Re(k+3)Γ4+Z2Im(k+3)Γ3\displaystyle-Z_{1}\operatorname{Re}(k_{+}^{3})\Gamma_{4}+Z_{2}\operatorname{Im}(k_{+}^{3})\Gamma_{3}
+Z3kz(ik2Γ1,2ik+2Γ+1,2),\displaystyle+Z_{3}k_{z}(ik_{-}^{2}\Gamma^{1,2}_{-}-ik_{+}^{2}\Gamma^{1,2}_{+}),

where repeated indices are summed over and where:

0\displaystyle\mathcal{E}_{0} =C+C1ϵzz+C2ϵ||+D1kz2+D2k||2,\displaystyle=C+C_{1}\epsilon_{zz}+C_{2}\epsilon_{||}+D_{1}k_{z}^{2}+D_{2}k_{||}^{2}, (11a)
\displaystyle\mathcal{M} =M+M1ϵzz+M2ϵ||B1kz2B2k||2,\displaystyle=M+M_{1}\epsilon_{zz}+M_{2}\epsilon_{||}-B_{1}k_{z}^{2}-B_{2}k_{||}^{2}, (11b)
α1\displaystyle\alpha_{1} =A2+A21ϵzz+A22ϵ||+A23kz2+A24k||2\displaystyle=A_{2}+A_{21}\epsilon_{zz}+A_{22}\epsilon_{||}+A_{23}k_{z}^{2}+A_{24}k_{||}^{2}
+iY3ϵz+Y4ϵ+,\displaystyle+iY_{3}\epsilon_{z-}+Y_{4}\epsilon_{+}, (11c)
α2\displaystyle\alpha_{2} =iA2+iA21ϵzz+iA22ϵ||+iA23kz2+iA24k||2\displaystyle=iA_{2}+iA_{21}\epsilon_{zz}+iA_{22}\epsilon_{||}+iA_{23}k_{z}^{2}+iA_{24}k_{||}^{2}
+Y3ϵziY4ϵ+,\displaystyle+Y_{3}\epsilon_{z-}-iY_{4}\epsilon_{+}, (11d)
α3\displaystyle\alpha_{3} =Y1ϵz++iY2ϵ,\displaystyle=Y_{1}\epsilon_{z+}+iY_{2}\epsilon_{-}, (11e)
β1\displaystyle\beta_{1} =X1ϵzx+2X2ϵxyiX3ϵzy+iX4(ϵxxϵyy),\displaystyle=X_{1}\epsilon_{zx}+2X_{2}\epsilon_{xy}-iX_{3}\epsilon_{zy}+iX_{4}(\epsilon_{xx}-\epsilon_{yy}), (11f)
β2\displaystyle\beta_{2} =X1ϵzy+X2(ϵxxϵyy)+iX3ϵzx2iX4ϵxy,\displaystyle=X_{1}\epsilon_{zy}+X_{2}(\epsilon_{xx}-\epsilon_{yy})+iX_{3}\epsilon_{zx}-2iX_{4}\epsilon_{xy}, (11g)
β3\displaystyle\beta_{3} =A1+A11ϵzz+A12ϵ||+A13kz2+A14k||2,\displaystyle=A_{1}+A_{11}\epsilon_{zz}+A_{12}\epsilon_{||}+A_{13}k_{z}^{2}+A_{14}k_{||}^{2}, (11h)

with ϵ||=ϵxx+ϵyy\epsilon_{||}=\epsilon_{xx}+\epsilon_{yy}, ϵ±=ϵxxϵyy±2iϵxy\epsilon_{\pm}=\epsilon_{xx}-\epsilon_{yy}\pm 2i\epsilon_{xy}, ϵz±=ϵzx±iϵzy\epsilon_{z\pm}=\epsilon_{zx}\pm i\epsilon_{zy}. From the method of invariants it follows that all model parameters designated by capital roman letters must be real parameters, which are not determined by the symmetries of the crystal. The Hamiltonian above contains all allowed terms to third order in 𝐤\mathbf{k}, first order in strain and terms first order in both. Neglecting strain, this Hamiltonian reduces to the one derived in Ref. Liu et al., 2010a, except for three terms combining the in-plane and out-of-plane components of the wave vector, kz2(k+Γ1,2+kΓ+1,2)k_{z}^{2}(k_{+}\Gamma^{1,2}_{-}+k_{-}\Gamma^{1,2}_{+}), k||2kzΓ3k_{||}^{2}k_{z}\Gamma_{3} and kz(ik2Γ1,2ik+2Γ+1,2)k_{z}(ik_{-}^{2}\Gamma^{1,2}_{-}-ik_{+}^{2}\Gamma^{1,2}_{+}), which were not included in Ref. Liu et al., 2010a. The 𝐤\mathbf{k} independent strain terms depend only on ϵ||\epsilon_{||} and ϵzz\epsilon_{zz}, and are either proportional to the identity matrix or Γ5\Gamma_{5}. Hence, the effects of these terms can be described within the unstrained model by making the parameters strain dependent MM~(ϵ)M\rightarrow\tilde{M}(\mathbf{\epsilon}) and CC~(ϵ)C\rightarrow\tilde{C}(\mathbf{\epsilon}), where:

M~(ϵ)\displaystyle\tilde{M}(\mathbf{\epsilon}) =M+M1ϵzz+M2ϵ||,\displaystyle=M+M_{1}\epsilon_{zz}+M_{2}\epsilon_{||}, (12a)
C~(ϵ)\displaystyle\tilde{C}(\mathbf{\epsilon}) =C+C1ϵzz+C2ϵ||.\displaystyle=C+C_{1}\epsilon_{zz}+C_{2}\epsilon_{||}. (12b)

Similarly the terms linear in 𝐤\mathbf{k} and ϵ||\epsilon_{||} or ϵzz\epsilon_{zz}, can be described by strain-dependent model parameters in the unstrained model. The shear strain terms and the terms with ϵxxϵyy\epsilon_{xx}-\epsilon_{yy} give new terms linear in 𝐤\mathbf{k}, which cannot be described by making the parameters of the unstrained model strain-dependent. The effects of ϵzx\epsilon_{zx} are similar to the effects of ϵxy\epsilon_{xy}, since they contribute to the real parts of β1\beta_{1}, α2\alpha_{2} and α3\alpha_{3} and the imaginary parts of α1\alpha_{1} and β2\beta_{2}. For the same reason ϵxxϵyy\epsilon_{xx}-\epsilon_{yy} and ϵzy\epsilon_{zy} have similar effects. As we demonstrate below, the new terms linear in the wave vector give rise to new physics at the (111)(111) surface of a semi-infinite topological insulator.

Higher order terms in strain will always be both inversion and time-reversal symmetric, and in Table 2 we see that the only matrices symmetric under both I and TR are the identity matrix and Γ5\Gamma_{5}, and kk-independent strain terms may therefore be lumped into the strain dependent parameters M~(ϵ)\tilde{M}(\mathbf{\epsilon}) and C~(ϵ)\tilde{C}(\mathbf{\epsilon}).

II.3 Magnetic field

The Hamiltonian in Eq. (10) is invariant under both TR and inversion, thus all bands are doubly degenerate. The TR symmetry can be broken by a magnetic field, which must be included in the following form:

HB\displaystyle H_{B} =μB2(g1zBz0g1pB00g2zBz0g2pBg1pB+0g1zBz00g2pB+0g2zBz),\displaystyle=\frac{\mu_{B}}{2}\begin{pmatrix}g_{1z}B_{z}&0&g_{1p}B_{-}&0\\ 0&g_{2z}B_{z}&0&g_{2p}B_{-}\\ g_{1p}B_{+}&0&-g_{1z}B_{z}&0\\ 0&g_{2p}B_{+}&0&-g_{2z}B_{z}\end{pmatrix}, (13)

where g1p,2p,1z,2zg_{1p,2p,1z,2z} are real parameters and B±=Bx±iByB_{\pm}=B_{x}\pm iB_{y}. This contribution was discussed in Ref. Liu et al., 2010a.

II.4 Electric field

Inversion symmetry can be broken by an electric field, giving rise to a Hamiltonian of the form:

HE\displaystyle H_{E} =W||(iEΓ+25,51iE+Γ25,51)+WzEzΓ35\displaystyle=W_{||}(iE_{-}\Gamma^{25,51}_{+}-iE_{+}\Gamma^{25,51}_{-})+W_{z}E_{z}\Gamma_{35}
=(0iWzEz0iW||EiWzEz0iW||E00iW||E+0iWzEziW||E+0iWzEz0),\displaystyle=\begin{pmatrix}0&iW_{z}E_{z}&0&iW_{||}E_{-}\\ -iW_{z}E_{z}&0&-iW_{||}E_{-}&0\\ 0&iW_{||}E_{+}&0&-iW_{z}E_{z}\\ -iW_{||}E_{+}&0&iW_{z}E_{z}&0\end{pmatrix}, (14)

where W||,zW_{||,z} are real parameters and E±=Ex±iEyE_{\pm}=E_{x}\pm iE_{y}.

III Bulk spectrum and band-gap

In this section we will analyze the effects of strain and electric field on the bulk band structure close to the Γ\Gamma point. The Hamiltonian H=Hm+HEH=H_{m}+H_{E} of Eq. (10) and (14) can be diagonalized analytically giving:

E\displaystyle E =0±(2+|αiki+iZ3kzk2|2\displaystyle=\mathcal{E}_{0}\pm\Bigg{(}\mathcal{M}^{2}+\mathinner{\!\left\lvert\alpha_{i}k_{i}+iZ_{3}k_{z}k_{-}^{2}\right\rvert}^{2}
+|βikiiZ1Re(k+3)+Z2Im(k+3)|2+Wz2Ez2+W||2E||2\displaystyle+\left|\beta_{i}k_{i}-iZ_{1}\operatorname{Re}(k_{+}^{3})+Z_{2}\operatorname{Im}(k_{+}^{3})\right|^{2}+W_{z}^{2}E_{z}^{2}+W_{||}^{2}E_{||}^{2}
±(2|(αiki+iZ3kzk2)W||E+\displaystyle\pm\bigg{(}2\Big{|}\left(\alpha_{i}k_{i}+iZ_{3}k_{z}k_{-}^{2}\right)W_{||}E_{+}
+(βikiiZ1Re(k+3)+Z2Im(k+3))WzEz|2\displaystyle+\left(\beta_{i}k_{i}-iZ_{1}\operatorname{Re}(k_{+}^{3})+Z_{2}\operatorname{Im}(k_{+}^{3})\right)W_{z}E_{z}\Big{|}^{2}
+4W||2E||2|βikiiZ1Re(k+3)+Z2Im(k+3)|2\displaystyle+4W_{||}^{2}E_{||}^{2}\mathinner{\!\left\lvert\beta_{i}k_{i}-iZ_{1}\operatorname{Re}(k_{+}^{3})+Z_{2}\operatorname{Im}(k_{+}^{3})\right\rvert}^{2}
+4Wz2Ez2|αiki+iZ3kzk2|2\displaystyle+4W_{z}^{2}E_{z}^{2}\mathinner{\!\left\lvert\alpha_{i}k_{i}+iZ_{3}k_{z}k_{-}^{2}\right\rvert}^{2}
2Re((αiki+iZ3kzk2)W||E+\displaystyle-2\operatorname{Re}\Big{(}\left(\alpha_{i}k_{i}+iZ_{3}k_{z}k_{-}^{2}\right)W_{||}E_{+}
(βikiiZ1Re(k+3)+Z2Im(k+3))WzEz))12)12.\displaystyle-\left(\beta_{i}k_{i}-iZ_{1}\operatorname{Re}(k_{+}^{3})+Z_{2}\operatorname{Im}(k_{+}^{3})\right)W_{z}E_{z}\Big{)}\bigg{)}^{\frac{1}{2}}\Bigg{)}^{\frac{1}{2}}. (15)

In Fig. 2 an example of the band structure with and without strain is plotted. The terms in the Hamiltonian linear in both strain and wave vector split the bands in the xx and yy directions closer to the Γ\Gamma point compared to the case without strain, where the bands in the xx and yy directions are only split by the 𝐤3\mathbf{k}^{3} terms. The electric field breaks the inversion symmetry, and the degeneracies of the bulk conduction and valence bands are split. At the Γ\Gamma point the degeneracies are protected by TR symmetry which is not broken. Another important quantity of the electronic structure is the gap between the valence and conduction band at the Γ\Gamma point, given by:

ΔΓ=2(M+M1ϵzz+M2ϵ||)2+Wz2Ez2+W||2E||2.\displaystyle\Delta_{\Gamma}=2\sqrt{(M+M_{1}\epsilon_{zz}+M_{2}\epsilon_{||})^{2}+W_{z}^{2}E_{z}^{2}+W_{||}^{2}E_{||}^{2}}. (16)

Note that the gap can only be increased by applying an electric field, whereas strain can increase or decrease the gap depending on the sign of the strain. The effects of strain have been investigated earlier by DFT calculations. In Ref. Luo et al., 2012 both Bi2Se3\text{Bi}_{2}\text{Se}_{3} and Bi2Te3\text{Bi}_{2}\text{Te}_{3} were investigated. Here the lattice constant in the xyxy-plane or the zz-direction was fixed relative to the known unstrained lattice constant, and the lattice constant in the other direction was relaxed before the calculations were performed. An approximate linear relationship between the in-plane, and out-of-plane lattice constants was reported Luo et al. (2012). In Ref. Young et al., 2011 the band gap was calculated for many different strain configurations and then fitted to a second order polynomial of the strain components. In figure 3 the results of these calculations are shown together with second and third order fits to the results of Ref. Luo et al., 2012. The results show good agreement and both predict a topological phase transition at approximately 6%6\% uni-axial strain. According to our model the topological phase transition will occur when:

ϵzz=MM1M2M1ϵ||.\displaystyle\epsilon_{zz}=-\frac{M}{M_{1}}-\frac{M_{2}}{M_{1}}\epsilon_{||}. (17)

It should be emphasized that strain can only change the gap by a term proportional to the matrix Γ5\Gamma_{5} in the Hamiltonian, which is equivalent to simply changing the band gap parameter MM. In the following sections we will then simply use a strain-dependent band gap parameter M~(ϵ)\tilde{M}(\mathbf{\epsilon}), and use the parameters of Ref. Young et al., 2011 to get a quantitative strain dependence.

Refer to caption
Figure 2: Band structure close to the gamma point, with ϵzy=10%\epsilon_{zy}=10\% (solid) and without strain (dashed) in the absence of electric and magnetic fields. Without strain the in-plane rotational symmetry is broken only by terms to third order in the wave vector, and only a small splitting between different in-plane direction is seen. Including strain allows terms to first order in the wave vector that breaks the in-plane rotational symmetry, and we see that a splitting of the in-plane directions occurs at lower kk values. Here we have used the parameters of Ref. Liu et al., 2010a for the unstrained model, and Xi=Yi=10 eV ÅX_{i}=Y_{i}=$10\text{\,}\mathrm{eV}\text{\,}\mathrm{\SIUnitSymbolAngstrom}$. We note that ϵxxϵyy=10%\epsilon_{xx}-\epsilon_{yy}=10\% gives the same dispersion for the parameters used here.

Band gap at Γ\Gamma point (eV\mathrm{eV}) Refer to caption

Figure 3: The band gap at Γ\Gamma point as a function of strain according to the DFT calculations of Refs. Young et al., 2011; Luo et al., 2012. The curves from Ref. Young et al., 2011 are calculated using the approximate linear relationships between the in-plane and out-of-plane lattice constant reported in Ref. Luo et al., 2012. The fits to the data from Ref. Luo et al., 2012 show good agreement, especially for Bi2Te3\text{Bi}_{2}\text{Te}_{3}. Going to third order makes only a slight improvement compared to second order.

In general, the parameters of this symmetry-adapted model can be determined by using DFT calculations. The values of the parameters are highly dependent on the computational details as can be seen by comparing Refs. Zhang et al., 2009; Liu et al., 2010a; Nechaev and Krasovskii, 2016. In this paper,we use the values from Ref. Liu et al., 2010a for the parameters not related to strain.

III.1 Effective masses

An important quantity for transport and optical properties of a solid is the effective mass tensor. Here we calculate the diagonal components of the inverse effective mass tensor including both strain and electric field. The inverse effective mass tensor is given by:

[1m]ij=122Ekikj.\displaystyle\left[\frac{1}{m^{*}}\right]_{ij}=\frac{1}{\hbar^{2}}\frac{\partial^{2}E}{\partial k_{i}\partial k_{j}}. (18)

With 𝐄=0\mathbf{E}=0 we get the effective masses along the xx and yy axes:

[2m]ii\displaystyle\left[\frac{\hbar^{2}}{m}\right]_{ii} =2D2±(|αi|2+|βi|22(M+M1ϵzz+M2ϵ||)B2ΔΓ2),\displaystyle=2D_{2}\pm\bigg{(}\frac{|\alpha_{i}|^{2}+|\beta_{i}|^{2}-2(M+M_{1}\epsilon_{zz}+M_{2}\epsilon_{||})B_{2}}{\frac{\Delta_{\Gamma}}{2}}\bigg{)}, (19)

for i=x,yi=x,y and along the zz axis

[2m]zz\displaystyle\left[\frac{\hbar^{2}}{m}\right]_{zz} =2D1±(|α3|2+|β3|22(M+M1ϵzz+M2ϵ||)B1ΔΓ2).\displaystyle=2D_{1}\pm\bigg{(}\frac{|\alpha_{3}|^{2}+|\beta_{3}|^{2}-2(M+M_{1}\epsilon_{zz}+M_{2}\epsilon_{||})B_{1}}{\frac{\Delta_{\Gamma}}{2}}\bigg{)}. (20)

The plus (minus) sign corresponds to the conduction (valence) band. For a non-zero electric field each of the two bands is split into two subbands having the same effective mass, but differing by linear and cubic terms in the wave vector. The effective masses including both strain and electric field are given in appendix A.

Refer to caption
Figure 4: The bulk conduction and valence band effective masses of Bi2Se3 plotted as a function of the strain component ϵzz\epsilon_{zz} and in the absence of electric and magnetic fields. In the calculations, we have used the DFT computed strain parameters from Ref. Luo et al., 2012 and Ref. Liu et al., 2010a for the parameters not related to strain. The solid (dashed) lines refer to the conduction (valence) band effective masses.

In Fig. 4 the bulk effective masses of Bi2Se3 are plotted as a function of the strain component ϵzz\epsilon_{zz} and in the absence of electric and magnetic fields. Conduction (valence) band masses are shown as solid (dashed) lines and the effective mass superscript cc (vv) refers to the conduction (valence) band. In this case where only ϵzz0\epsilon_{zz}\neq 0 and the electric field is zero the effective masses along the xx and yy directions are the same. Note that since the maximum of the valence band shifts away from the Γ\Gamma point at a sufficiently large strain value, the valence band effective masses change as a function of strain from positive to negative and reach infinite values in-between.

IV Surface state spectrum and spin structure

We shall now proceed to calculate the surface state (SS) spectrum and spin structure for the model Hamiltonian (10). In this and the following section we exclude 𝐤3\mathbf{k}^{3} terms, i.e. the last three terms in Eq. (10) and the 𝐤\mathbf{k} dependent terms in αi\alpha_{i} and βi\beta_{i}. First, we consider a semi-infinite topological insulator filling the z<0z<0 half space, implemented via the boundary conditions Ψ(0)=0\Psi(0)=0 and Ψ(z)0\Psi(z)\rightarrow 0 for zz\rightarrow-\infty. The translational invariance is broken in the zz direction and we make the substitution kzizk_{z}\rightarrow-i\partial_{z}. Using the ansatz ψλezλ\psi_{\lambda}e^{z\lambda} the stationary Schrödinger equation implies the following secular equation:

0\displaystyle\Leftrightarrow 0 =D+Dλ4+(2(EC~(ϵ)D2k||2)D1\displaystyle=D_{+}D_{-}\lambda^{4}+\bigg{(}2(E-\tilde{C}(\mathbf{\epsilon})-D_{2}k_{||}^{2})D_{1}
2(M~(ϵ)B2k||2)B1+|α3|2+|β3|2)λ2\displaystyle-2(\tilde{M}(\mathbf{\epsilon})-B_{2}k_{||}^{2})B_{1}+|\alpha_{3}|^{2}+|\beta_{3}|^{2}\bigg{)}\lambda^{2}
+i(2Re(α1α3+β1β3)kx+2Re(α2α3+β2β3)ky)λ\displaystyle+i(2\operatorname{Re}(\alpha_{1}\alpha_{3}^{*}+\beta_{1}\beta_{3}^{*})k_{x}+2\operatorname{Re}(\alpha_{2}\alpha_{3}^{*}+\beta_{2}\beta_{3}^{*})k_{y})\lambda
+(EC~(ϵ)D2k||2)2(M~(ϵ)B2k||2)2\displaystyle+(E-\tilde{C}(\mathbf{\epsilon})-D_{2}k_{||}^{2})^{2}-(\tilde{M}(\mathbf{\epsilon})-B_{2}k_{||}^{2})^{2}
|α1kx+α2ky|2|β1kx+β2ky|2,\displaystyle-|\alpha_{1}k_{x}+\alpha_{2}k_{y}|^{2}-|\beta_{1}k_{x}+\beta_{2}k_{y}|^{2}, (21)

where D±=D1±B1D_{\pm}=D_{1}\pm B_{1}. This equation has 4 complex solutions λi\lambda_{i}, each with two corresponding eigenspinors ψj(λi)\psi_{j}(\lambda_{i}) given by:

ψ1(λi)\displaystyle\psi_{1}(\lambda_{i}) =(E0+β1kx+β2kyiβ3λi0α1kx+α2kyiα3λi),\displaystyle=\begin{pmatrix}E-\mathcal{E}_{0}+\mathcal{M}\\ \beta_{1}k_{x}+\beta_{2}k_{y}-i\beta_{3}\lambda_{i}\\ 0\\ \alpha_{1}k_{x}+\alpha_{2}k_{y}-i\alpha_{3}\lambda_{i}\end{pmatrix}, (22)
ψ2(λi)\displaystyle\psi_{2}(\lambda_{i}) =(β1kx+β2kyiβ3λiE0α1kx+α2kyiα3λi0).\displaystyle=\begin{pmatrix}\beta_{1}^{*}k_{x}+\beta_{2}^{*}k_{y}-i\beta_{3}^{*}\lambda_{i}\\ E-\mathcal{E}_{0}-\mathcal{M}\\ \alpha_{1}k_{x}+\alpha_{2}k_{y}-i\alpha_{3}\lambda_{i}\\ 0\end{pmatrix}. (23)

The general solution to the Schrödinger equation can then be written on the form:

Ψ(z)=j=12i=14Cijψj(λi)eλiz,\displaystyle\Psi(z)=\sum_{j=1}^{2}\sum_{i=1}^{4}C_{ij}\psi_{j}(\lambda_{i})e^{\lambda_{i}z}, (24)

where the coefficients CijC_{ij} are to be determined from the boundary conditions. For surface states we require that Ψ(z)0\Psi(z)\rightarrow 0 for zz\rightarrow-\infty, implying that Re(λi)>0\operatorname{Re}(\lambda_{i})>0. To satisfy the boundary condition at the surface we need two different solutions with positive real part. By complex conjugation of Eq. (21) we see that for a solution λi\lambda_{i}, λi-\lambda_{i}^{*} will also be a solution. Hence, the solutions are either imaginary or occur in pairs related by λi=λj\lambda_{i}=-\lambda_{j}^{*}. This allows for three distinct cases: (i) All solutions are imaginary. (ii) Two complex solutions related by λ1=λ2\lambda_{1}=-\lambda_{2}^{*} and two imaginary solutions. (iii) Two pairs of complex solutions λ1=λ3\lambda_{1}=-\lambda_{3}^{*} and λ2=λ4\lambda_{2}=-\lambda_{4}^{*}. Existence of SSs is only possible in case (iii) and we therefore assume that λ1\lambda_{1} and λ2\lambda_{2} have positive real parts. Hence, we limit the sum in Eq. (24) to i{1,2}i\in\{1,2\}. Applying the boundary condition at z=0z=0 we obtain the following secular equation for nontrivial solution to the coefficients CijC_{ij}:

(λ1+λ2)2=|α3|2+|β3|2D+D,\displaystyle(\lambda_{1}+\lambda_{2})^{2}=\frac{|\alpha_{3}|^{2}+|\beta_{3}|^{2}}{-D_{+}D_{-}}, (25)

and combining this with Eq. (21) we find the SS spectrum:

E\displaystyle E =C~(ϵ)+D1B1M~(ϵ)\displaystyle=\tilde{C}(\mathbf{\epsilon})+\frac{D_{1}}{B_{1}}\tilde{M}(\mathbf{\epsilon})
±1D12B12(|α1kx+α2ky|2+|β1kx+β2ky|2\displaystyle\pm\sqrt{1-\frac{D_{1}^{2}}{B_{1}^{2}}}\Big{(}|\alpha_{1}k_{x}+\alpha_{2}k_{y}|^{2}+|\beta_{1}k_{x}+\beta_{2}k_{y}|^{2}
Re((α1kx+α2ky)α3+(β1kx+β2ky)β3)2|α3|2+|β3|2)12\displaystyle-\frac{\operatorname{Re}((\alpha_{1}k_{x}+\alpha_{2}k_{y})\alpha_{3}^{*}+(\beta_{1}k_{x}+\beta_{2}k_{y})\beta_{3}^{*})^{2}}{|\alpha_{3}|^{2}+|\beta_{3}|^{2}}\Big{)}^{\frac{1}{2}}
+(D2D1B1B2)k||2.\displaystyle+\left(D_{2}-\frac{D_{1}}{B_{1}}B_{2}\right)k_{||}^{2}. (26)

For small k||k_{||} we see that the spectrum is linear but due to the strain, the group velocity has now become anisotropic and the contours of constant energy have become elliptical. Note that the position of the Dirac-point can be shifted by changing M~(ϵ)\tilde{M}(\mathbf{\epsilon}), however the relative position within the bulk band gap is not changed. Notice also that the second order term is not changed by strain, since we have systematically retained terms of order ϵ1k1\epsilon^{1}k^{1} and discarded terms of order ϵ1k2\epsilon^{1}k^{2}. Both the directional dependence of the group velocity and the ellipticity of the contours of constant energy are shown in Fig. 5.

For k||=0k_{||}=0, the linear term in Eq. (21) vanishes and λ1\lambda_{1} and λ2\lambda_{2} can be calculated analytically:

λα=F+(1)αR2D+D,\displaystyle\lambda_{\alpha}=\sqrt{\frac{-F+(-1)^{\alpha}\sqrt{R}}{2D_{+}D_{-}}}, (27)

where:

F\displaystyle F =|β3|2+|α3|22M~(ϵ)B1+2ED12C~(ϵ)D1,\displaystyle=|\beta_{3}|^{2}+|\alpha_{3}|^{2}-2\tilde{M}(\mathbf{\epsilon})B_{1}+2ED_{1}-2\tilde{C}(\mathbf{\epsilon})D_{1}, (28a)
R\displaystyle R =F24D+D(EC~(ϵ)+M~(ϵ))(EC~(ϵ)M~(ϵ)).\displaystyle=F^{2}-4D_{+}D_{-}(E-\tilde{C}(\mathbf{\epsilon})+\tilde{M}(\mathbf{\epsilon}))(E-\tilde{C}(\mathbf{\epsilon})-\tilde{M}(\mathbf{\epsilon})). (28b)

Imposing the boundary condition Ψ(0)=0\Psi(0)=0 with Eq. (24) we find the exact wave functions of the two degenerate eigenstates at k||=0k_{||}=0, which are related by TR:

Ψ1\displaystyle\Psi_{1} =N(isgn(D+)|D+||2B1|β3|α3|2+|β3|2|D||2B1|0α3|α3|2+|β3|2|D||2B1|)(eλ1zeλ2z),\displaystyle=N\begin{pmatrix}i\operatorname{sgn}(D_{+})\sqrt{\frac{|D_{+}|}{|2B_{1}|}}\\ \frac{\beta_{3}}{\sqrt{|\alpha_{3}|^{2}+|\beta_{3}|^{2}}}\sqrt{\frac{|D_{-}|}{|2B_{1}|}}\\ 0\\ \frac{\alpha_{3}}{\sqrt{|\alpha_{3}|^{2}+|\beta_{3}|^{2}}}\sqrt{\frac{|D_{-}|}{|2B_{1}|}}\end{pmatrix}(e^{\lambda_{1}z}-e^{\lambda_{2}z}), (29a)
Ψ2\displaystyle\Psi_{2} =N(0α3|α3|2+|β3|2|D||2B1|isgn(D+)|D+||2B1|β3|α3|2+|β3|2|D||2B1|)(eλ1zeλ2z).\displaystyle=N\begin{pmatrix}0\\ \frac{\alpha_{3}^{*}}{\sqrt{|\alpha_{3}|^{2}+|\beta_{3}|^{2}}}\sqrt{\frac{|D_{-}|}{|2B_{1}|}}\\ i\operatorname{sgn}(D_{+})\sqrt{\frac{|D_{+}|}{|2B_{1}|}}\\ \frac{-\beta_{3}^{*}}{\sqrt{|\alpha_{3}|^{2}+|\beta_{3}|^{2}}}\sqrt{\frac{|D_{-}|}{|2B_{1}|}}\end{pmatrix}(e^{\lambda_{1}z}-e^{\lambda_{2}z}). (29b)

Note that the α3\alpha_{3} term couples the spin blocks and in contrast to the case without strain the eigenstates at the k||=0k_{||}=0 point are mixed up/down spinstates. The conditions for existence of surface states at k||=0k_{||}=0 are:

D+D<0andM~(ϵ)B1>0.\displaystyle D_{+}D_{-}<0\quad\text{and}\quad\tilde{M}(\mathbf{\epsilon})B_{1}>0. (30)

As expected, the surface states can only be destroyed by changing the sign of M~(ϵ)\tilde{M}(\mathbf{\epsilon}), i.e. closing and reopening the bulk band gap.

Refer to captionvF/vF0v_{F}/v_{F}^{0}10π2\frac{\pi}{2}0π\piθ\theta
(a) Group velocity
Refer to captionkx (Å1)k_{x}\text{ (${\mathrm{\SIUnitSymbolAngstrom}}^{-1}$)}ky (Å1)k_{y}\text{ (${\mathrm{\SIUnitSymbolAngstrom}}^{-1}$)}
(b) Unstrained Bi2Se3\text{Bi}_{2}\text{Se}_{3}
Refer to captionkx (Å1)k_{x}\text{ (${\mathrm{\SIUnitSymbolAngstrom}}^{-1}$)}ky (Å1)k_{y}\text{ (${\mathrm{\SIUnitSymbolAngstrom}}^{-1}$)}
(c) ϵzx=10%\epsilon_{zx}=10\%
Refer to captionkx (Å1)k_{x}\text{ (${\mathrm{\SIUnitSymbolAngstrom}}^{-1}$)}ky (Å1)k_{y}\text{ (${\mathrm{\SIUnitSymbolAngstrom}}^{-1}$)}
(d) ϵzy=5%\epsilon_{zy}=5\%
Refer to captionkx (Å1)k_{x}\text{ (${\mathrm{\SIUnitSymbolAngstrom}}^{-1}$)}ky (Å1)k_{y}\text{ (${\mathrm{\SIUnitSymbolAngstrom}}^{-1}$)}
(e) ϵxy=5%\epsilon_{xy}=5\%
Refer to captionkx (Å1)k_{x}\text{ (${\mathrm{\SIUnitSymbolAngstrom}}^{-1}$)}ky (Å1)k_{y}\text{ (${\mathrm{\SIUnitSymbolAngstrom}}^{-1}$)}
(f) ϵxxϵyy=5%\epsilon_{xx}-\epsilon_{yy}=5\%
Figure 5: LABEL:sub@fig:fermivelocity Group velocity relative to the velocity in unstrained Bi2Se3\text{Bi}_{2}\text{Se}_{3} given by vF0=A21D12B12v_{F}^{0}=A_{2}\sqrt{1-\frac{D_{1}^{2}}{B_{1}^{2}}} as a function of the angle θ=arctan(kykx)\theta=\arctan(\frac{k_{y}}{k_{x}}) in the kx,kyk_{x},k_{y} plane. (b)-(f) Contour plots showing the elliptical curves of constant energy of the upper Dirac cone of the surface states in the (kx,ky)(k_{x},k_{y}) plane. The black arrows indicate the direction in-plane part of the expectation value of the spin, while the red arrows show the expectation value of the spin in the zz direction. Spin-momentum locking is still present, but spin and momentum are no longer perpendicular. Here we have used the parameters of Ref. Liu et al., 2010a for the parameters of the unstrained model, and Xi=Yi=10 eV ÅX_{i}=Y_{i}=$10\text{\,}\mathrm{eV}\text{\,}\mathrm{\SIUnitSymbolAngstrom}$.

Due to TR-symmetry the spin of the SS at (kx,ky)(k_{x},k_{y}) must be opposite of the spin of the SS at (kx,ky)(-k_{x},-k_{y}). For a (111)(111) surface without strain, the model Hamiltonian is invariant under rotations in the xyxy-plane of any angle. Hence Sz=0\langle S_{z}\rangle=0 since (kx,ky)(k_{x},k_{y}) and (kx,ky)(-k_{x},-k_{y}) are related by a rotation around the zz-axis which does not change SzS_{z}. The terms linear in both wave vector and strain break the full rotation symmetry, and non-zero Sz\langle S_{z}\rangle values are allowed. At finite k||k_{||}, Eq. (21) was solved numerically, and using Ψ(0)=0\Psi(0)=0 a numerical expression for the SS spinor was determined. Finally, the expectation values of the spin operators were calculated and are shown in figure 5. As expected, non-zero values of Sz\langle S_{z}\rangle occur. Such out-of-plane spin components also show up to third order in kk in the case of hexagonal warpingFu (2009). The actual values of SzS_{z} shown in Fig. LABEL:sub@fig:spin_strainxy are of course strongly dependent on the chosen parameters, but the non-zero SzS_{z}-component is generally present for any values of XiX{i} and YiY_{i}.

IV.1 Effective 2D model

Using the surface states at k||=0k_{||}=0 as basis we can derive an effective 2D model for the surface states. We use the wave functions in Eq. (29), but with ϵij=0\epsilon_{ij}=0 to have a basis independent of strain. We note that in the absence of strain, ψ1\psi_{1} and ψ2\psi_{2} represent spin up/down states respectively. The matrix elements of the Γ\Gamma-matrices in this basis read:

Ψi|Γ1|Ψj\displaystyle\langle\Psi_{i}|\Gamma_{1}|\Psi_{j}\rangle =sgn(B1A1)1D12B12[σy]ij,\displaystyle=-\operatorname{sgn}(B_{1}A_{1})\sqrt{1-\frac{D_{1}^{2}}{B_{1}^{2}}}[\sigma_{y}]_{ij}, (31a)
Ψi|Γ2|Ψj\displaystyle\langle\Psi_{i}|\Gamma_{2}|\Psi_{j}\rangle =sgn(B1A1)1D12B12[σx]ij,\displaystyle=\operatorname{sgn}(B_{1}A_{1})\sqrt{1-\frac{D_{1}^{2}}{B_{1}^{2}}}[\sigma_{x}]_{ij}, (31b)
Ψi|Γ3|Ψj\displaystyle\langle\Psi_{i}|\Gamma_{3}|\Psi_{j}\rangle =0,\displaystyle=0, (31c)
Ψi|Γ4|Ψj\displaystyle\langle\Psi_{i}|\Gamma_{4}|\Psi_{j}\rangle =sgn(B1A1)1D12B12[σz]ij,\displaystyle=-\operatorname{sgn}(B_{1}A_{1})\sqrt{1-\frac{D_{1}^{2}}{B_{1}^{2}}}[\sigma_{z}]_{ij}, (31d)
Ψi|Γ5|Ψj\displaystyle\langle\Psi_{i}|\Gamma_{5}|\Psi_{j}\rangle =D1B1[σ0]ij.\displaystyle=\frac{D_{1}}{B_{1}}[\sigma_{0}]_{ij}. (31e)

The effective 2D model including 𝐤\mathbf{k} up to second order becomes:

H2D=C~(ϵ)+D1B1M~(ϵ)+𝐝(𝐤)σ+(D2D1B1B2)k||2,\displaystyle H^{2D}=\tilde{C}(\mathbf{\epsilon})+\frac{D_{1}}{B_{1}}\tilde{M}(\mathbf{\epsilon})+\mathbf{d}(\mathbf{k})\cdot\mathbf{\sigma}+\left(D_{2}-\frac{D_{1}}{B_{1}}B_{2}\right)k_{||}^{2}, (32)

where 𝐝(𝐤)=(d1(𝐤),d2(𝐤),d3(𝐤))\mathbf{d}(\mathbf{k})=(d_{1}(\mathbf{k}),d_{2}(\mathbf{k}),d_{3}(\mathbf{k})) is given by:

d1(𝐤)\displaystyle d_{1}(\mathbf{k}) =1D12B12Im(α1kx+α2ky)\displaystyle=\sqrt{1-\frac{D_{1}^{2}}{B_{1}^{2}}}\operatorname{Im}(\alpha_{1}k_{x}+\alpha_{2}k_{y}) (33a)
=1D12B12((Y3ϵzx+2Y4ϵxy)kx\displaystyle=\sqrt{1-\frac{D_{1}^{2}}{B_{1}^{2}}}\bigg{(}\big{(}Y_{3}\epsilon_{zx}+2Y_{4}\epsilon_{xy}\big{)}k_{x}
+(A2+A21ϵzz+A22ϵ||Y3ϵzyY4(ϵxxϵyy))ky),\displaystyle+\big{(}A_{2}+A_{21}\epsilon_{zz}+A_{22}\epsilon_{||}-Y_{3}\epsilon_{zy}-Y_{4}(\epsilon_{xx}-\epsilon_{yy})\big{)}k_{y}\bigg{)},
d2(𝐤)\displaystyle d_{2}(\mathbf{k}) =1D12B12Re(α1kx+α2ky)\displaystyle=-\sqrt{1-\frac{D_{1}^{2}}{B_{1}^{2}}}\operatorname{Re}(\alpha_{1}k_{x}+\alpha_{2}k_{y}) (33b)
=1D12B12((Y3ϵzy+2Y4ϵxy)ky\displaystyle=-\sqrt{1-\frac{D_{1}^{2}}{B_{1}^{2}}}\bigg{(}(Y_{3}\epsilon_{zy}+2Y_{4}\epsilon_{xy})k_{y}
+(A2+A21ϵzz+A22ϵ||+Y3ϵzy+Y4(ϵxxϵyy))kx),\displaystyle+\big{(}A_{2}+A_{21}\epsilon_{zz}+A_{22}\epsilon_{||}+Y_{3}\epsilon_{zy}+Y_{4}(\epsilon_{xx}-\epsilon_{yy})\big{)}k_{x}\bigg{)},
d3(𝐤)\displaystyle d_{3}(\mathbf{k}) =1D12B12Im(β1kx+β2ky)\displaystyle=-\sqrt{1-\frac{D_{1}^{2}}{B_{1}^{2}}}\operatorname{Im}(\beta_{1}k_{x}+\beta_{2}k_{y}) (33c)
=1D12B12((X3ϵzy(ϵxxϵyy))kx\displaystyle=\sqrt{1-\frac{D_{1}^{2}}{B_{1}^{2}}}\bigg{(}\big{(}X_{3}\epsilon_{zy}(\epsilon_{xx}-\epsilon_{yy})\big{)}k_{x}
+(X3ϵzx+2X4ϵxy)ky).\displaystyle+\big{(}-X_{3}\epsilon_{zx}+2X_{4}\epsilon_{xy}\big{)}k_{y}\bigg{)}.

Diagonalizing the effective Hamiltonian gives the spectrum:

E\displaystyle E =C~(ϵ)+D1B1M~(ϵ)+(D2D1B1B2)k||2\displaystyle=\tilde{C}(\mathbf{\epsilon})+\frac{D_{1}}{B_{1}}\tilde{M}(\mathbf{\epsilon})+\left(D_{2}-\frac{D_{1}}{B_{1}}B_{2}\right)k_{||}^{2} (34)
±1D12B12|α1kx+α2ky|2+Im(β1kx+β2ky)2.\displaystyle\pm\sqrt{1-\frac{D_{1}^{2}}{B_{1}^{2}}}\sqrt{|\alpha_{1}k_{x}+\alpha_{2}k_{y}|^{2}+\operatorname{Im}(\beta_{1}k_{x}+\beta_{2}k_{y})^{2}}.

This spectrum is different from Eq. (26), obtained from the 3D model. The two spectra become equal, however, if we take α3=0\alpha_{3}=0. We note that the Berry curvature of the SS band vanishes identically, i.e. Ω(𝐤)=d^[(kxd^)×(kyd^)]=0\Omega(\mathbf{k})=\hat{d}\cdot[(\partial_{k_{x}}\hat{d})\times(\partial_{k_{y}}\hat{d})]=0.

V Localization of surface states and finite size effect

In the case of a finite slab, a gap will open in the SS spectrum at k||=0k_{||}=0. This is due to the overlap of SS on opposite surfaces. For a given thickness the overlap will be highly dependent on the penetration depth of the SS. From the wave functions the expectation value of the distance from the surface d=zd=-\langle z\rangle can be calculated to:

d=B12M~(ϵ)+D+D|α3|2+|β3|2D+D|α3|2+|β3|2,\displaystyle d=\frac{\frac{B_{1}}{2\tilde{M}(\mathbf{\epsilon})}+\frac{-D_{+}D_{-}}{|\alpha_{3}|^{2}+|\beta_{3}|^{2}}}{\sqrt{\frac{-D_{+}D_{-}}{|\alpha_{3}|^{2}+|\beta_{3}|^{2}}}}, (35)

and we see that dd depends on strain through the band gap parameter, and the quantity |α3|2+|β3|2\sqrt{|\alpha_{3}|^{2}+|\beta_{3}|^{2}} involving the coefficients of the terms first order in kzk_{z}.

The eigenenergies in the finite system can be calculated by imposing the boundary condition Ψ(±L/2)=0\Psi(\pm L/2)=0, where LL is the slab thickness. Now the wave function can be written as Eq. (24), but using all 4 solutions to Eq. (21). At the Γ\Gamma point, the solutions are ±λ1,2\pm\lambda_{1,2} with λ1,2\lambda_{1,2} given by Eq. (27). The secular equation of the nontrivial solution for the coefficients gives:

(EC~(ϵ)+M~(ϵ)+D+λ12)λ2(EC~(ϵ)+M~(ϵ)+D+λ22)λ1=(tanh(λ1L/2)tanh(λ2L/2))±1,\displaystyle\frac{(E-\tilde{C}(\mathbf{\epsilon})+\tilde{M}(\mathbf{\epsilon})+D_{+}\lambda_{1}^{2})\lambda_{2}}{(E-\tilde{C}(\mathbf{\epsilon})+\tilde{M}(\mathbf{\epsilon})+D_{+}\lambda_{2}^{2})\lambda_{1}}=\left(\frac{\tanh(\lambda_{1}L/2)}{\tanh(\lambda_{2}L/2)}\right)^{\pm 1}, (36)

for even/odd states respectively. This is the same form as found using the model without strain Shan et al. (2010) , but here the parameters M~(ϵ)\tilde{M}(\mathbf{\epsilon}) and C~(ϵ)\tilde{C}(\mathbf{\epsilon}) as well as λ1,2\lambda_{1,2} from Eq. (27) depend on strain. The parameter C~(ϵ)\tilde{C}(\mathbf{\epsilon}) is not important, since it simply shifts all energies. Again the important quantities are M~(ϵ)\tilde{M}(\mathbf{\epsilon}) and |α3|2+|β3|2\sqrt{|\alpha_{3}|^{2}+|\beta_{3}|^{2}}.

To find the gap of the SSs, Δ\Delta, Eq. (36) has been solved numerically for L=n9.547 ÅL=n\cdot$9.547\text{\,}\mathrm{\SIUnitSymbolAngstrom}$ for integers nn between 2 and 6 where L=9.547 ÅL=$9.547\text{\,}\mathrm{\SIUnitSymbolAngstrom}$ is the thickness of 1 QL. First we have analyzed the effect of the bulk band gap changed by strain, and taking |α3|2+|β3|2=A1\sqrt{|\alpha_{3}|^{2}+|\beta_{3}|^{2}}=A_{1} the value without strain. We see that increasing the bulk band gap, decreases the SS band gap. Close to the phase transition this behavior is evident, since the SS must approach bulk states extending further into the material as the bulk gap approaches zero, giving a large coupling between opposite surfaces. As we saw in section III, the bulk gap is most sensitive to ϵzz\epsilon_{zz}. Using the relation

ΔEg=(5.27 eV)ϵzz+12(60.1 eV)ϵzz2,\Delta E_{g}=($-5.27\text{\,}\mathrm{eV}$)\epsilon_{zz}+\tfrac{1}{2}($-60.1\text{\,}\mathrm{eV}$)\epsilon_{zz}^{2},

from Ref. Young et al., 2011 with 2|M~(ϵ)|=Eg2|\tilde{M}(\mathbf{\epsilon})|=E_{g} we have plotted the SS gap as function of ϵzz\epsilon_{zz} in Fig. 6a, using the value M=0.28 eVM=$0.28\text{\,}\mathrm{eV}$ from Ref. Liu et al., 2010a for the band gap in unstrained Bi2Se3\text{Bi}_{2}\text{Se}_{3} and setting A11=0A_{11}=0.

In Fig. 6b, we have plotted the dependence of the SS gap on |α3|2+|β3|2\sqrt{|\alpha_{3}|^{2}+|\beta_{3}|^{2}} assuming a constant bulk band gap parameter of M=0.28 eVM=$0.28\text{\,}\mathrm{eV}$ from Ref. Liu et al., 2010a and setting M1=M2=0M_{1}=M_{2}=0. Without strain |α3|2+|β3|2=A1=2.26 eV Å\sqrt{|\alpha_{3}|^{2}+|\beta_{3}|^{2}}=A_{1}=$2.26\text{\,}\mathrm{eV}\text{\,}\mathrm{\SIUnitSymbolAngstrom}$ according to Ref. Liu et al., 2010a, but the quantitative dependence on strain has not been determined. We see that the SS gap closes at low values, which is related to the fact that for |α3|2+|β3|2<4M~|D+D|B1|\alpha_{3}|^{2}+|\beta_{3}|^{2}<\frac{4\tilde{M}|D_{+}D_{-}|}{B_{1}} the solutions to Eq. (21), λi\lambda_{i}, are complex giving an oscillatory spatial dependence of the SS. Oscillatory surface states give oscillations in the SS gap as a function of LL, which has been theorized to signify topological phase transitions between a trivial 2D insulator and a quantum spin hall phaseLu et al. (2010); Liu et al. (2010b); Linder et al. (2009). We note here that these oscillations are present only using the parameters of Ref. Zhang et al., 2009, not the parameters of Ref. Liu et al., 2010a as shown in Ref. Jensen, 2014. Experimental evidence of oscillations in the gap remains inconclusive, since both a simple decay of the gap for 2-5 QL has been reportedZhang et al. (2010) as well as an increase in the gap from 2 to 3 QLSakamoto et al. (2010) signifying a topological phase transition between 2 and 3 QL. The observed gaps are summarized in Table 3.

We have demonstrated that the gap of the SS can be tuned by strain. However, quantitative predictions require access to the parameters relating the strain tensor to |α3|2+|β3|2\sqrt{|\alpha_{3}|^{2}+|\beta_{3}|^{2}}. Tuning the band gap of the SS is important for both applications and fundamental research. One particular experimental challenge is that the layered structure of Bi2Se3\text{Bi}_{2}\text{Se}_{3} makes it easy to fabricate integer numbers of quintuple layers. Thus the gap dependence on the layer thickness has only been experimentally studied with few data points, since the gap becomes too small to be measured already at 6 QLZhang et al. (2010). With strain, it is be possible to increase the SS gap and investigate the thickness dependence of the gap further. Another intriguing prospect would be tuning the gap via strain, so as to pass through the topological transition in a controlled manner.

Refer to captionϵzz (%)\epsilon_{zz}\text{ (\%)}Δ (eV)\Delta\text{ (eV)}z(Å)-\langle z\rangle\text{($\mathrm{\SIUnitSymbolAngstrom}$)}
(a)
Refer to caption|α3|2+|β3|2 (eV Å)\sqrt{|\alpha_{3}|^{2}+|\beta_{3}|^{2}}\text{ ($\mathrm{eV}\text{\,}\mathrm{\SIUnitSymbolAngstrom}$)}Δ (eV)\Delta\text{ (eV)}z(Å)-\langle z\rangle\text{($\mathrm{\SIUnitSymbolAngstrom}$)}
(b)
Figure 6: Band gap for thicknesses of 2 to 6 quintuple layers (QL). In LABEL:sub@fig:gap_epsilonzz the effect of changing the bulk band gap by uni-axial strain on the SS gap Δ\Delta is shown. We have used a relation between the bulk band gap and ϵzz\epsilon_{zz} from Ref. Young et al., 2011. In this plot we have neglected the strain dependence of |α3|2+|β3|2\sqrt{|\alpha_{3}|^{2}+|\beta_{3}|^{2}} by setting A11=0A_{11}=0. In LABEL:sub@fig:gap_alphabeta we show the SS gap as a function of |α3|2+|β3|2\sqrt{|\alpha_{3}|^{2}+|\beta_{3}|^{2}}, assuming a constant bulk band gap, i.e., setting M1=M2=0M_{1}=M_{2}=0. The insets shows the expectation value of the distance to the surface for the surface state in the semi-infinite case, with the same horizontal axis. A qualitative agreement between the localization of the SS on single surface and the gap induced by coupling between opposite surfaces is seen. We have used the parameters of the model without strain from Ref. Liu et al., 2010a.
LL 2 QL 3 QL 4 QL 5 QL
Δ(eV)\Delta($\mathrm{eV}$) 0.250.25 0.1380.138 0.070.07 0.0410.041
Δ(eV)\Delta($\mathrm{eV}$) 0.280.28 0.340.34
Table 3: Experimental measurements of the surface state gap, induced by coupling between opposite surfaces. First row is from Ref. Zhang et al., 2010, second row is from Ref. Sakamoto et al., 2010. In the second row the gap increases from 2 to 3 QL, signifying an oscillatory dependence of the gap on the thickness.

VI Conclusion

We have derived the most general Hamiltonian for the Bi2Se3\text{Bi}_{2}\text{Se}_{3}-class of materials including terms to third order in the wave vector, first order in electric and magnetic fields, first order in strain and first order in both strain and wave vector. We show that this model provides a description of a range of different effects of strain on the electronic structure of these materials. Specifically we have analytically derived the spectrum of the surface states for a semi-infinite topological insulator, showing qualitatively the effects of strain on both the spectrum and the spin structure. The terms first order in both wave vector and strain break the full rotational symmetry close to k||=0k_{||}=0, leading to an anisotropy in the Dirac cone otherwise absent for the simple (111) surface termination. The spin structure is altered as well, and for some strain configurations a spin component perpendicular to the surface arises. In an analysis of the finite size effect we show that increasing the bulk band gap by virtue of strain, decreases the induced gap in the surface states. The possibility of tuning the SS gap gives new possibilities for experimental investigation of the thickness dependence of the SS gap and control of transport and optical properties.

Acknowledgements.
MRJ and MW gratefully acknowledge financial support from the Danish Council of Independent Research (Natural Sciences) grant no.: DFF-4181-00182. AML gratefully acknowledges financial support from the Carlsberg Foundation. The Center for Quantum Devices is funded by the Danish National Research Foundation.

Appendix A Effective masses

Here we give the expressions for the bulk effective masses including both contributions from strain and electric fields. Upper (lower) signs refer to the conduction (valence) band.

2mx\displaystyle\frac{\hbar^{2}}{m_{x}} =2D2±2ΔΓ(2(M+M1ϵzz+M2ϵ||)B2+(A2+A21ϵzz+A22ϵ||+Y3ϵzy+Y4(ϵxxϵyy))2\displaystyle=2D_{2}\pm\frac{2}{\Delta_{\Gamma}}\bigg{(}-2(M+M_{1}\epsilon_{zz}+M_{2}\epsilon_{||})B_{2}+(A_{2}+A_{21}\epsilon_{zz}+A_{22}\epsilon_{||}+Y_{3}\epsilon_{zy}+Y_{4}(\epsilon_{xx}-\epsilon_{yy}))^{2}
+(Y3ϵzx+2Y4ϵxy)2+(X1ϵzx+2X2ϵxy)2+(X3ϵzy+X4(ϵxxϵyy))2\displaystyle+(Y_{3}\epsilon_{zx}+2Y_{4}\epsilon_{xy})^{2}+(X_{1}\epsilon_{zx}+2X_{2}\epsilon_{xy})^{2}+(-X_{3}\epsilon_{zy}+X_{4}(\epsilon_{xx}-\epsilon_{yy}))^{2}
4ΔΓ2(4Ez2Wz2((A2+A21ϵzz+A22ϵ||+Y3ϵzy+Y4(ϵxxϵyy))2+(Y3ϵzx+2Y4ϵxy)2)\displaystyle-\frac{4}{\Delta_{\Gamma}^{2}}\Big{(}4E_{z}^{2}W_{z}^{2}\big{(}(A_{2}+A_{21}\epsilon_{zz}+A_{22}\epsilon_{||}+Y_{3}\epsilon_{zy}+Y_{4}(\epsilon_{xx}-\epsilon_{yy}))^{2}+(Y_{3}\epsilon_{zx}+2Y_{4}\epsilon_{xy})^{2}\big{)}
+4W||2E||2((X1ϵzx+2X2ϵxy)2+(X3ϵzy+X4(ϵxxϵyy))2)\displaystyle+4W_{||}^{2}E_{||}^{2}\big{(}(X_{1}\epsilon_{zx}+2X_{2}\epsilon_{xy})^{2}+(-X_{3}\epsilon_{zy}+X_{4}(\epsilon_{xx}-\epsilon_{yy}))^{2}\big{)}
+(W||Ex(A2+A21ϵzz+A22ϵ||+Y3ϵzy+Y4(ϵxxϵyy))W||Ey(Y3ϵzx+2Y4ϵxy)\displaystyle+\big{(}W_{||}E_{x}(A_{2}+A_{21}\epsilon_{zz}+A_{22}\epsilon_{||}+Y_{3}\epsilon_{zy}+Y_{4}(\epsilon_{xx}-\epsilon_{yy}))-W_{||}E_{y}(Y_{3}\epsilon_{zx}+2Y_{4}\epsilon_{xy})
+WzEz(X1ϵzx+2X2ϵxy))2\displaystyle+W_{z}E_{z}(X_{1}\epsilon_{zx}+2X_{2}\epsilon_{xy})\big{)}^{2}
+(W||Ex(Y3ϵzx+2Y4ϵxy)+W||Ey(A2+A21ϵzz+A22ϵ||+Y3ϵzy+Y4(ϵxxϵyy))\displaystyle+\big{(}W_{||}E_{x}(Y_{3}\epsilon_{zx}+2Y_{4}\epsilon_{xy})+W_{||}E_{y}(A_{2}+A_{21}\epsilon_{zz}+A_{22}\epsilon_{||}+Y_{3}\epsilon_{zy}+Y_{4}(\epsilon_{xx}-\epsilon_{yy}))
+WzEz(X3ϵzy+X4(ϵxxϵyy)))2\displaystyle+W_{z}E_{z}(-X_{3}\epsilon_{zy}+X_{4}(\epsilon_{xx}-\epsilon_{yy}))\big{)}^{2}
2(W||Ex(A2+A21ϵzz+A22ϵ||+Y3ϵzy+Y4(ϵxxϵyy))W||Ey(Y3ϵzx+2Y4ϵxy)\displaystyle-2\big{(}W_{||}E_{x}(A_{2}+A_{21}\epsilon_{zz}+A_{22}\epsilon_{||}+Y_{3}\epsilon_{zy}+Y_{4}(\epsilon_{xx}-\epsilon_{yy}))-W_{||}E_{y}(Y_{3}\epsilon_{zx}+2Y_{4}\epsilon_{xy})
WzEz(X1ϵzx+2X2ϵxy))2\displaystyle-W_{z}E_{z}(X_{1}\epsilon_{zx}+2X_{2}\epsilon_{xy})\big{)}^{2}
+2(W||Ex(Y3ϵzx+2Y4ϵxy)+W||Ey(A2+A21ϵzz+A22ϵ||+Y3ϵzy+Y4(ϵxxϵyy))\displaystyle+2\big{(}W_{||}E_{x}(Y_{3}\epsilon_{zx}+2Y_{4}\epsilon_{xy})+W_{||}E_{y}(A_{2}+A_{21}\epsilon_{zz}+A_{22}\epsilon_{||}+Y_{3}\epsilon_{zy}+Y_{4}(\epsilon_{xx}-\epsilon_{yy}))
WzEz(X3ϵzy+X4(ϵxxϵyy)))2)),\displaystyle-W_{z}E_{z}(-X_{3}\epsilon_{zy}+X_{4}(\epsilon_{xx}-\epsilon_{yy}))\big{)}^{2}\Big{)}\bigg{)}, (37)
2my\displaystyle\frac{\hbar^{2}}{m_{y}} =2D2±2ΔΓ(2(M+M1ϵzz+M2ϵ||)B2+(Y3ϵzx+2Y4ϵxy)2\displaystyle=2D_{2}\pm\frac{2}{\Delta_{\Gamma}}\bigg{(}-2(M+M_{1}\epsilon_{zz}+M_{2}\epsilon_{||})B_{2}+(Y_{3}\epsilon_{zx}+2Y_{4}\epsilon_{xy})^{2}
+(A2+A21ϵzz+A22ϵ||Y3ϵzyY4(ϵxxϵyy))2+(X1ϵzy+X2(ϵxxϵyy))2+(X3ϵzx2X4ϵxy)2\displaystyle+(A_{2}+A_{21}\epsilon_{zz}+A_{22}\epsilon_{||}-Y_{3}\epsilon_{zy}-Y_{4}(\epsilon_{xx}-\epsilon_{yy}))^{2}+(X_{1}\epsilon_{zy}+X_{2}(\epsilon_{xx}-\epsilon_{yy}))^{2}+(X_{3}\epsilon_{zx}-2X_{4}\epsilon_{xy})^{2}
4ΔΓ2(4Ez2Wz2((Y3ϵzx+2Y4ϵxy)2+(A2+A21ϵzz+A22ϵ||Y3ϵzyY4(ϵxxϵyy))2)\displaystyle-\frac{4}{\Delta_{\Gamma}^{2}}\Big{(}4E_{z}^{2}W_{z}^{2}\big{(}(Y_{3}\epsilon_{zx}+2Y_{4}\epsilon_{xy})^{2}+(A_{2}+A_{21}\epsilon_{zz}+A_{22}\epsilon_{||}-Y_{3}\epsilon_{zy}-Y_{4}(\epsilon_{xx}-\epsilon_{yy}))^{2}\big{)}
+4W||2E||2((X1ϵzy+X2(ϵxxϵyy))2+(X3ϵzx2X4ϵxy)2)\displaystyle+4W_{||}^{2}E_{||}^{2}\big{(}(X_{1}\epsilon_{zy}+X_{2}(\epsilon_{xx}-\epsilon_{yy}))^{2}+(X_{3}\epsilon_{zx}-2X_{4}\epsilon_{xy})^{2}\big{)}
+(W||Ex(Y3ϵzx+2Y4ϵxy)W||Ey(A2+A21ϵzz+A22ϵ||Y3ϵzyY4(ϵxxϵyy))\displaystyle+\big{(}W_{||}E_{x}(Y_{3}\epsilon_{zx}+2Y_{4}\epsilon_{xy})-W_{||}E_{y}(A_{2}+A_{21}\epsilon_{zz}+A_{22}\epsilon_{||}-Y_{3}\epsilon_{zy}-Y_{4}(\epsilon_{xx}-\epsilon_{yy}))
+WzEz(X1ϵzy+X2(ϵxxϵyy)))2\displaystyle+W_{z}E_{z}(X_{1}\epsilon_{zy}+X_{2}(\epsilon_{xx}-\epsilon_{yy}))\big{)}^{2}
+(W||Ex(A2+A21ϵzz+A22ϵ||Y3ϵzyY4(ϵxxϵyy))+W||Ey(Y3ϵzx+2Y4ϵxy)\displaystyle+\big{(}W_{||}E_{x}(A_{2}+A_{21}\epsilon_{zz}+A_{22}\epsilon_{||}-Y_{3}\epsilon_{zy}-Y_{4}(\epsilon_{xx}-\epsilon_{yy}))+W_{||}E_{y}(Y_{3}\epsilon_{zx}+2Y_{4}\epsilon_{xy})
+WzEz(X3ϵzx2X4ϵxy))2\displaystyle+W_{z}E_{z}(X_{3}\epsilon_{zx}-2X_{4}\epsilon_{xy})\big{)}^{2}
2(W||Ex(Y3ϵzx+2Y4ϵxy)W||Ey(A2+A21ϵzz+A22ϵ||Y3ϵzyY4(ϵxxϵyy))\displaystyle-2\big{(}W_{||}E_{x}(Y_{3}\epsilon_{zx}+2Y_{4}\epsilon_{xy})-W_{||}E_{y}(A_{2}+A_{21}\epsilon_{zz}+A_{22}\epsilon_{||}-Y_{3}\epsilon_{zy}-Y_{4}(\epsilon_{xx}-\epsilon_{yy}))
WzEz(X1ϵzy+X2(ϵxxϵyy)))2\displaystyle-W_{z}E_{z}(X_{1}\epsilon_{zy}+X_{2}(\epsilon_{xx}-\epsilon_{yy}))\big{)}^{2}
+2(W||Ex(A2+A21ϵzz+A22ϵ||Y3ϵzyY4(ϵxxϵyy))+W||Ey(Y3ϵzx+2Y4ϵxy)\displaystyle+2\big{(}W_{||}E_{x}(A_{2}+A_{21}\epsilon_{zz}+A_{22}\epsilon_{||}-Y_{3}\epsilon_{zy}-Y_{4}(\epsilon_{xx}-\epsilon_{yy}))+W_{||}E_{y}(Y_{3}\epsilon_{zx}+2Y_{4}\epsilon_{xy})
WzEz(X3ϵzx2X4ϵxy))2)),\displaystyle-W_{z}E_{z}(X_{3}\epsilon_{zx}-2X_{4}\epsilon_{xy})\big{)}^{2}\Big{)}\bigg{)}, (38)
2mz\displaystyle\frac{\hbar^{2}}{m_{z}} =2D1±2ΔΓ(2(M+M1ϵzz+M2ϵ||)B1+(Y1ϵzx+2Y2ϵxy)2+(Y1ϵzy+Y2(ϵxxϵyy))2\displaystyle=2D_{1}\pm\frac{2}{\Delta_{\Gamma}}\bigg{(}-2(M+M_{1}\epsilon_{zz}+M_{2}\epsilon_{||})B_{1}+(Y_{1}\epsilon_{zx}+2Y_{2}\epsilon_{xy})^{2}+(Y_{1}\epsilon_{zy}+Y_{2}(\epsilon_{xx}-\epsilon_{yy}))^{2}
+(A1+A11ϵzz+A12ϵ||)2\displaystyle+(A_{1}+A_{11}\epsilon_{zz}+A_{12}\epsilon_{||})^{2}
4ΔΓ2(4Ez2Wz2((Y1ϵzx+2Y2ϵxy)2+(Y1ϵzy+Y2(ϵxxϵyy))2)+4W||2E||2(A1+A11ϵzz+A12ϵ||)\displaystyle-\frac{4}{\Delta_{\Gamma}^{2}}\Big{(}4E_{z}^{2}W_{z}^{2}\big{(}(Y_{1}\epsilon_{zx}+2Y_{2}\epsilon_{xy})^{2}+(Y_{1}\epsilon_{zy}+Y_{2}(\epsilon_{xx}-\epsilon_{yy}))^{2}\big{)}+4W_{||}^{2}E_{||}^{2}(A_{1}+A_{11}\epsilon_{zz}+A_{12}\epsilon_{||})
+(W||Ex(Y1ϵzx+2Y2ϵxy)W||Ey(Y1ϵzy+Y2(ϵxxϵyy))+WzEz(A1+A11ϵzz+A12ϵ||))2\displaystyle+\big{(}W_{||}E_{x}(Y_{1}\epsilon_{zx}+2Y_{2}\epsilon_{xy})-W_{||}E_{y}(Y_{1}\epsilon_{zy}+Y_{2}(\epsilon_{xx}-\epsilon_{yy}))+W_{z}E_{z}(A_{1}+A_{11}\epsilon_{zz}+A_{12}\epsilon_{||})\big{)}^{2}
+(W||Ex(Y1ϵzy+Y2(ϵxxϵyy))+W||Ey(Y1ϵzx+2Y2ϵxy))2\displaystyle+\big{(}W_{||}E_{x}(Y_{1}\epsilon_{zy}+Y_{2}(\epsilon_{xx}-\epsilon_{yy}))+W_{||}E_{y}(Y_{1}\epsilon_{zx}+2Y_{2}\epsilon_{xy})\big{)}^{2}
2(W||Ex(Y1ϵzx+2Y2ϵxy)W||Ey(Y1ϵzy+Y2(ϵxxϵyy))WzEz(A1+A11ϵzz+A12ϵ||))2\displaystyle-2\big{(}W_{||}E_{x}(Y_{1}\epsilon_{zx}+2Y_{2}\epsilon_{xy})-W_{||}E_{y}(Y_{1}\epsilon_{zy}+Y_{2}(\epsilon_{xx}-\epsilon_{yy}))-W_{z}E_{z}(A_{1}+A_{11}\epsilon_{zz}+A_{12}\epsilon_{||})\big{)}^{2}
+2(W||Ex(Y1ϵzy+Y2(ϵxxϵyy))+W||Ey(Y1ϵzx+2Y2ϵxy))2)).\displaystyle+2\big{(}W_{||}E_{x}(Y_{1}\epsilon_{zy}+Y_{2}(\epsilon_{xx}-\epsilon_{yy}))+W_{||}E_{y}(Y_{1}\epsilon_{zx}+2Y_{2}\epsilon_{xy})\big{)}^{2}\Big{)}\bigg{)}. (39)

References