This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

A fresh view on Frenkel excitons
Electron-hole pair exchange and many-body formalism

Shiue-Yuan Shiau shiau.sean@gmail.com Monique Combescot monique.combescot@insp.upmc.fr Physics Division, National Center for Theoretical Sciences, 10617 Taipei, Taiwan Institut des NanoSciences de Paris, Sorbonne Université, CNRS, 75005 Paris, France
(March 16, 2025)
Abstract

Excitons are coherent excitations that travel over the semiconductor sample. Two types are commonly distinguished: Wannier excitons that are formed in inorganic materials, and Frenkel excitons that are formed in organic materials and rare-gas crystals.

(i) Wannier excitons are made from delocalized conduction electrons and delocalized valence holes. The Coulomb interaction acts in two different ways. The intraband processes bind a free electron and a free hole into a Wannier exciton wave, while the interband processes separate the optically bright excitons made of spin-singlet electron-hole pairs from dark excitons made of spin-triplet pairs, with an additional transverse-longitudinal splitting that results from a nonanalytical Coulomb scattering which depends on the exciton wave vector direction with respect to the crystal axes.

(ii) Frenkel excitons are made from highly localized excitations. The Coulomb interaction then acts in one way only, the intralevel Coulomb processes between sites, analogous to intraband processes, being negligible for on-site excitations in the tight-binding limit. The interlevel Coulomb processes, analogous to interband processes, produce both, the Frenkel exciton wave and its singular splitting, through electron-hole pair exchange between sites: the excitonic wave is produced by delocalizing the on-site excitations through the recombination of an electron-hole pair on one site and its creation on another site. These interlevel processes also split the exciton level in exactly the same singular way as for Wannier excitons. Importantly, since the interlevel Coulomb interaction only acts on spin-singlet pairs, just like the electron-photon interaction, optically dark pairs do not form excitonic waves; as a result, Frenkel excitons are optically bright in the tight-binding limit.

We here present a fresh approach to Frenkel excitons in cubic semiconductor crystals, with a special focus on the spin and spatial degeneracies of the electronic states. This approach uses a second quantization formulation of the problem in terms of creation operators for electronic states on all lattice sites — their creation operators being true fermion operators in the tight-binding limit valid for semiconductors hosting Frenkel excitons. This operator formalism avoids using cumbersome (6Ns×6Ns6N_{s}\times 6N_{s}) Slater determinants — 2 for spin, 3 for spatial degeneracy and NsN_{s} for the number of lattice sites — to represent state wave functions out of which the Frenkel exciton eigenstates are derived. A deep understanding of the tricky Coulomb physics that takes place in the Frenkel exciton problem, is a prerequisite for possibly diagonalizing this very large matrix analytically. This is done in three steps:

(i) the first diagonalization, with respect to lattice sites, follows from transforming excitations on the NsN_{s} lattice sites R\textbf{R}_{\ell} into NsN_{s} exciton waves Kn\textbf{K}_{n}, by using appropriate phase prefactors;

(ii) the second diagonalization, with respect to spin, follows from the introduction of spin-singlet and spin-triplet electron-hole pair states, through the commonly missed sign change when transforming electron-absence operators into hole operators;

(iii) the third diagonalization, with respect to threefold spatial degeneracy, leads to the splitting of the exciton level into one longitudinal and two transverse modes, that result from the singular interlevel Coulomb scattering in the small  Kn\textbf{ K}_{n} limit.

To highlight the advantage of the second quantization approach to Frenkel exciton we here propose, over the standard first quantization procedure used for example in the seminal book by R. Knox, we also present detailed calculations of some key results on Frenkel excitons when formulated in terms of Slater determinants.

Finally, as a way forward, we show how many-body effects between Frenkel excitons can be handled through a composite boson formalism appropriate to excitons made from electron-hole pairs with zero spatial extension. Interestingly, this Frenkel exciton study led us to reformulate the dimensionless parameter that controls exciton many-body effects — first understood in terms of the Wannier exciton Bohr radius driven by Coulomb interaction — as a parameter entirely driven by the Pauli exclusion principle between the exciton fermionic components. This formulation is not only valid for Wannier excitons but also for composite bosons like Cooper pairs.

1 Introduction

Excitons have been very early identified as “quantum of electronic excitations traveling in a periodic structure, whose motion is characterized by a wave vector”[1]. These electronic excitations, commonly produced by photon absorption in a semiconductor, exist in two configurations, known as Wannier excitons[2] and Frenkel excitons[3, 4]. Although they both result from electron-hole pairs correlated by the Coulomb interaction, this interaction acts in opposite ways because the pairs on which Wannier or Frenkel excitons are made have a totally different spatial extension[5, 6]. The electronic picture for semiconductors hosting Wannier excitons starts with itinerant conduction and valence electrons, with energies separated by a gap[7, 8]. Through intraband Coulomb processes, a delocalized conduction electron and a delocalized valence-electron absence, i.e, a valence hole, end by forming a bound state having a delocalized center of mass. By contrast, the electronic picture for semiconductors hosting Frenkel excitons starts with excitations on the lattice sites of a periodic crystal[9, 10, 11]. These highly localized excitations are delocalized through interlevel Coulomb processes, to also end as electron-hole pairs with a center of mass delocalized over the whole sample.

In short, the Coulomb interaction localizes a delocalized electron and a delocalized hole to form a Wannier exciton bound state, through intraband processes, while it delocalizes an electron and a hole localized on the same lattice site, to form a Frenkel exciton wave, through interlevel processes.

Intralevel Coulomb processes, analogous to intraband processes, a priori exist for Frenkel excitons. However, as they require finite overlaps between the electronic wave functions of different lattice sites, these processes are negligible in the tight-binding limit, i.e., no wave function overlap between sites, an approximation valid for semiconductors hosting Frenkel excitons. In addition, keeping these overlaps prevents using a second quantization procedure based on a clean set of fermionic operators, as we here propose, whereas the introduction of finite wave function overlaps does not bring any significant effect.

Similarly, interband Coulomb processes, analogous to interlevel processes, also exist for Wannier excitons. They physically correspond to one conduction electron returning to the valence band, while a valence electron is excited to the conduction band. These processes are small compared to intraband processes; but they cannot be neglected because they bring two significant effects:

(i) the interband Coulomb interaction only acts on electron-hole pairs that are in a spin-singlet state, just like the electron-photon interaction. As a direct consequence, the interband Coulomb interaction participates in the splitting between bright and dark excitons[12], that ultimately drives the exciton Bose-Einstein condensation to occur in a dark state[13].

(ii) the scattering associated with interband processes is singular in the limit of small exciton center-of-mass wave vector. For cubic crystals, this singularity induces a splitting of the degenerate Wannier exciton level, which has a direct link to the transverse-longitudinal splitting in the exciton-polariton problem[14, 15, 16].

In short, the interband and interlevel Coulomb processes are physically similar: they both correspond to the recombination of an electron-hole pair along with the excitation of another pair. In the case of Frenkel excitons, these processes are essential: they are the ones responsible for the exciton formation, that is, the delocalization of on-site excitations into a wave quantum over the whole sample. By contrast, the interband processes for Wannier excitons are secondary: they just split the otherwise degenerate exciton level.

The purpose of this manuscript is to provide a microscopic understanding, starting from scratch, of the Coulomb processes that are responsible not only for the Frenkel exciton formation, but also for the splitting of its degenerate levels when the electronic levels are not only spin but also spatially degenerate. This understanding brings a fresh view to its Wannier exciton analog, known as “electron-hole exchange” — an improper name because different fermions do not quantum exchange: this effect just comes from interband Coulomb processes. Our goal is to catch the interplay between the spatial part of the problem that enters the Coulomb scatterings through the electronic wave functions, and the spin part conserved in a Coulomb process. Discussing these two parts separately enlightens the “electron-hole exchange” splitting discussed in papers dealing with Wannier excitons, and its so-called “short-range” and “nonanalytical long-range” contributions.

Frenkel excitons follow from the diagonalization of the system Hamiltonian in a (2×3×Ns2\times 3\times N_{s})-fold excitation subspace, 22 for the spin degeneracy of the excited electron, 33 for the spatial degeneracy of the unexcited electron level, and NsN_{s} for the number of lattice sites on which the excitation can take place. The analytical diagonalization of the resulting (6Ns×6Ns6N_{s}\times 6N_{s}) matrix from which the Frenkel exciton eigenstates are obtained, is a formidable mathematical task that requires a deep physical understanding of the problem, to possibly solve it analytically. The best way to reach this understanding is to separately study its three parts — lattice site degeneracy, spin degeneracy and spatial degeneracy — before putting them together.

Numerous Frenkel exciton-based applications have been proposed over the past several decades. In this manuscript, we have chosen not to enter into these applications. Still, for interested readers, we rather list some references devoted to quantum computing[18, 19], photosynthesis[20, 21], light harvesting devices[22, 23], light emitting devices[24, 25, 26], and solar cells[27, 28, 29], in all of which the Frenkel exciton plays a key role.

The paper is organized as follows:

In Sec. 2, we analyze the whole Frenkel exciton problem step by step.

In Sec. 3, we forget spin and spatial degeneracies. This allows us to identify the phase prefactor that transforms electronic excitations on any lattice sites into coherent wave excitations. This also highlights the necessity for the electronic ground and excited levels to have a different parity in order to possibly form a Frenkel exciton. As a direct consequence, taking into account the state spatial degeneracy is mandatory.

In Sec. 4, we introduce the spin but not yet the spatial degeneracy. This part highlights the importance of formulating the problem in terms of electrons and holes: indeed, this formulation leads us, in a natural way, to distinguish spin-triplet from spin-singlet pairs and to readily catch that spin-triplet electron-hole pairs do not suffer interlevel Coulomb processes; so, these pairs do not participate in the Frenkel exciton formation.

In Sec. 5, we consider the spatial degeneracy of the electronic level but we forget the spin degeneracy. This allows us to pin down the effect of this degeneracy on the interlevel Coulomb processes and the splitting of the exciton level that comes from the singular behavior of the Coulomb scattering in the limit of small exciton center-of-mass wave vector.

In Sec. 6, we consider Frenkel excitons made of electronic states having both a threefold spatial degeneracy and a twofold spin degeneracy. The previous sections provide the necessary help to comprehend the formation of Frenkel excitons in its full complexity.

In Sec. 7, we discuss the presentation of Frenkel excitons given in the representative exciton “Bible” written by R. Knox[5]. It relies on a first quantization formulation of the problem that makes use of Slater determinants for many-body state wave functions. It is well known that calculations involving Slater determinants are very cumbersome. Many results in this book are qualified as “easy to find”, somewhat dismissive to our opinion. This is why we find it useful here to provide some detailed derivations. These derivations once more demonstrate the great superiority of the second quantization formalism when dealing with a many-body problem, which fundamentally is what the Frenkel exciton problem is, in spite of the fact that we ultimately end with one electron-hole pair only.

In Sec. 8, we briefly show how to handle many-body effects between Frenkel excitons through a composite boson formalism for excitons having a “size” equal to zero. Wannier excitons, characterized by two quantum indices, namely their center-of-mass wave vector and their relative-motion index, have a finite size, their Bohr radius. By contrast, Frenkel excitons have one quantum index only, their wave vector, the size of these excitons being vanishingly small because they are made of on-site excitations. The “sizelessness” in the case of Frenkel excitons forced us to reconsider the physics of the dimensionless parameter that controls exciton many-body effects, first understood in terms of the Coulomb-driven exciton overlap through the exciton Bohr radius. We ultimately understood that this parameter is entirely controlled by the Pauli exclusion principle between the fermionic components of the excitons — an understanding that also extends to Cooper pairs which are composite bosons made of opposite-spin electrons.

We then conclude.

2 The Frenkel exciton problem, step by step

The Frenkel exciton problem at its root is a math problem: the diagonalization of a 6Ns×6Ns6N_{s}\times 6N_{s} matrix for electrons on the NsN_{s} lattice sites of a semiconductor crystal, the electronic states having a twofold spin degeneracy and a ground level with a threefold spatial degeneracy. No doubt, a good guess of the form of the eigenstates, based on wise physical considerations, is necessary to possibly solve this formidable math problem analytically. The purpose of this section is to guide the reader to the result, through a convoluted journey that we hope shall ultimately appear as an “easy ride”, once the physics of each step is revealed. This journey experiences four different physical landscapes:

(i) the electronic levels for atoms or molecules located on a periodic lattice, in the absence of spin and spatial degeneracies;

(ii) these electronic levels with spin, but no spatial degeneracy;

(iii) these electronic levels with no spin but a threefold spatial degeneracy either for the excited level or for the ground level;

(iv) these electronic levels with both, spin and spatial degeneracies.

2.1 In the absence of spin and spatial degeneracies

Let us begin with the simplest problem, to establish the procedure[17]: NsN_{s} electrons with charge |e|-|e| and NsN_{s} ions with charge |e||e| located at the nodes of a periodic lattice.

The first step is to find the physically relevant basis for one-electron states that will be used to define the one-electron operators for a quantum formulation of the problem. The whole spectrum made of the ν\nu eigenstates for one electron in the presence of one ion located at the lattice site R\textbf{R}_{\ell}, comes across as a possible basis. Indeed, being Hamiltonian eigenstates, the (ν;R\nu;\textbf{R}_{\ell}) states for a particular R\textbf{R}_{\ell} but different ν\nu’s form a complete basis that can in principle be used to describe electrons located on any other lattice site, provided that enough ν\nu states are included into the description. It is however clear that a better idea is to use a basis in which enter all R\textbf{R}_{\ell} sites. This can be done by restricting the ν\nu states to the ground and lowest-excited levels, (g,e)(g,e), provided that the (g,e)(g,e) states from different lattice sites have a very small wave function overlap, as for materials hosting Frenkel excitons: in the tight-binding limit, that is, no wave function overlap between different sites, the (ν;R)(\nu;\textbf{R}_{\ell}) electronic states for ν=(g,e)\nu=(g,e) and =(1,,Ns)\ell=(1,\cdots,N_{s}), can indeed be used to cleanly define the one-body fermionic operators necessary for a second quantization formulation of the problem.

The second step is to write the system in second quantization using the creation operators a^ν,\hat{a}^{\dagger}_{\nu,\ell} for electrons in these (ν;R)(\nu;\textbf{R}_{\ell}) states. The system ground state essentially corresponds to each (g;R)(g;\textbf{R}_{\ell}) ground state of all NsN_{s} lattice sites occupied by one electron, while for the lowest set of excited states, one (g;R)(g;\textbf{R}_{\ell}) ground state is replaced by the (e;R)(e;\textbf{R}_{\ell}) excited state of the same lattice site: indeed, a jump to the excited level of another site would lead to a higher-energy excitation due to the electrostatic cost resulting from charge separation; this cost, large in the tight-biding limit, forces the electronic cloud to stay close to the R\textbf{R}_{\ell} site. With respect to the system ground state, this excited state corresponds to one excited-level electron and one ground-level electron absence, on the same lattice site.

The third step is to turn from excited-level electron and ground-level electron absence to electron and hole. Although this procedure is not mandatory in the absence of spin, it actually corresponds to the proper physical description of the Frenkel exciton problem in terms of on-site electron-hole pair excitations. Turning to electron-hole pairs becomes crucial when the spin is introduced because this formulation naturally goes along with a splitting between spin-singlet and spin-triplet subspaces, neatly defined when speaking in terms of electrons and holes.

We are then left with the diagonalization of a Ns×NsN_{s}\times N_{s} matrix for one electron-hole pair excitation on each of the =(1,2,,Ns)\ell=(1,2,\cdots,N_{s}) lattice sites, the coupling between lattice sites being mediated by interlevel Coulomb processes in which the excited electron of one lattice site returns to the ground level of the same site, while another site is excited. This diagonalization is easy to perform by turning from NsN_{s} pairs localized on the 𝐑{\bf R}_{\ell} sites, to NsN_{s} correlated pairs, characterized by a wave vector 𝐊n{\bf K}_{n} with n=(1,,Ns)n=(1,\cdots,N_{s}), that are linear combinations of the 𝐑{\bf R}_{\ell} pairs, with a prefactor which is just a phase

ei𝐊n𝐑e^{i{\bf K}_{n}\cdot{\bf R}_{\ell}} (1)

This linear combination, known as Frenkel exciton, fundamentally corresponds to delocalizing the on-site excitations over the whole sample.

Importantly, the interlevel Coulomb scatterings responsible for the pair delocalization, differ from zero provided that the ground and excited electronic levels have a different parity. For this reason, it is mandatory to bring the spatial degeneracy of the electronic states into the problem, in order to possibly explain the Frenkel exciton formation.

2.2 With spin but no spatial degeneracy

Before introducing this spatial degeneracy, let us tackle a generic difficulty associated with the electronic state degeneracy that comes from the electron spin. Each ground level then is occupied by two electrons having opposite spins. This goes along with the fact that the ion on each lattice site must carry a 2|e|2|e| charge, as required by the system neutrality. After some thoughts that will be detailed later on, we have reached the conclusion that the relevant electronic states for second quantization are not the ones of an electron in the presence of a 2|e|2|e| ion, but still the ones of an electron in the presence of a |e||e| charge, this electron possibly having an up or down spin. The lowest set of excited states then corresponds to one of the two ground-level electrons jumping with its (±1/2\pm 1/2) spin, to the excited level of the same lattice site: indeed, the Coulomb interaction or the electron-photon interaction that can produce such electronic excitation, conserves the spin. The system then has two possible excited states on each of the NsN_{s} lattice sites of the crystal. The Frenkel excitons are constructed by diagonalizing the resulting (2Ns×2Ns2N_{s}\times 2N_{s}) matrix that represents the system Hamiltonian in this lowest excited subspace.

The appropriate way to catch the physics of the Coulomb coupling between these excited states is to turn to electron-hole pairs and to write these pairs in their spin-triplet and spin-singlet configurations. The phase factor that appears between electron destruction operator and hole creation operator[30, 31], differentiates spin-singlet from spin-triplet pairs. We then easily find that the pairs in the spin-singlet configuration are the only ones that suffer the interlevel Coulomb processes responsible for the delocalization of on-site excitations over the whole sample. So, by simply writing the problem in terms of spin-triplet and spin-singlet states, the (2Ns×2Ns2N_{s}\times 2N_{s}) matrix splits into a diagonal (Ns×NsN_{s}\times N_{s}) matrix for the spin-triplet subspace, and a nondiagonal (Ns×NsN_{s}\times N_{s}) matrix for the spin-singlet subspace. The diagonalization of the spin-singlet matrix is then performed, as in the absence of spin, by turning from pairs localized on the 𝐑{\bf R}_{\ell} lattice sites to delocalized 𝐊n{\bf K}_{n} pairs, through the same phase factor as the one given in Eq. (1).

2.3 With spatial degeneracy but no spin

\bullet Let us first consider that the ground (g)(g) level is nondegenerate and the excited (e)(e) level is threefold degenerate because this degeneracy configuration is simpler. When excited, one electron in the ground-level state (g;𝐑)(g;{\bf R}_{\ell}) of the 𝐑{\bf R}_{\ell} lattice site, jumps into one of the three excited states (μ,e;𝐑)(\mu,e;{\bf R}_{\ell}) of the same site, with μ=(x,y,z)\mu=(x,y,z) along the cubic crystal axes. The system then has 3Ns3N_{s} possible excited states, each of which corresponds to an electron in one of the three excited states μ\mu on one of the NsN_{s} lattice sites and an empty ground level on the same site.

A first diagonalization of the resulting (3Ns×3Ns3N_{s}\times 3N_{s}) matrix is performed with respect to lattice sites, by using the ei𝐊n𝐑e^{i{\bf K}_{n}\cdot{\bf R}_{\ell}} phase factor of Eq. (1). This delocalizes the excitations on the 𝐑{\bf R}_{\ell} lattice sites as 𝐊n{\bf K}_{n} wave over the whole sample.

We are left with NsN_{s} submatrices (3×33\times 3) that are associated with the NsN_{s} different 𝐊n{\bf K}_{n} wave vectors. It turns out that the interlevel Coulomb scatterings that enters their nondiagonal matrix elements, are singular in the small 𝐊n{\bf K}_{n} limit. This explains why their diagonalization leads to a splitting of the 𝐊n{\bf K}_{n} Frenkel exciton wave, into one “longitudinal” and two “transverse” levels with respect to the 𝐊n{\bf K}_{n} direction, the energy splitting depending on the direction of the 𝐊n/Kn{\bf K}_{n}/K_{n} vector with respect to the cubic axes.

\bullet The situation seems at first more complicated when the ground level is threefold and the excited level is nondegenerate because the ion on the 𝐑{\bf R}_{\ell} lattice site now hosts three ground-level electrons; so, its charge is 3|e|3|e|, due to crystal neutrality. When excited, one of the three ground electrons (μ,g;𝐑)(\mu,g;{\bf R}_{\ell}) jumps to the unique excited level (e;𝐑)(e;{\bf R}_{\ell}).

The smart way to understand these excited states is not to see them as a 3|e|3|e| ion with two ground electrons and one excited electron, but in terms of electronic excitations: the 𝐑{\bf R}_{\ell} site then hosts three types of excited states that can be labeled by the μ\mu index of the ground electron that has jumped to the unique excited level. Within this formulation, the calculation follows straightforwardly the one for a nondegenerate ground level and a threefold excited level: we first perform the diagonalization of the resulting (3Ns×3Ns3N_{s}\times 3N_{s}) matrix with respect to the 𝐑{\bf R}_{\ell} lattice sites, to get NsN_{s} excitons 𝐊n{\bf K}_{n}. Next, we perform the diagonalization of the resulting (3×33\times 3) submatrix associated with a particular 𝐊n{\bf K}_{n}, from which we obtain the transverse-longitudinal splitting of the exciton level that arises from the singularity of the interlevel Coulomb scattering in the small 𝐊n{\bf K}_{n} limit.

2.4 With spin and spatial degeneracies

Having understood the consequences of spin and spatial degeneracies separately, the strategy to handle them all together becomes easier to construct.

We first introduce the one-electron eigenstates in the presence of a |e||e| ion located on the 𝐑{\bf R}_{\ell} lattice site. As a basis, we use the ground and lowest excited levels for all lattice sites 𝐑{\bf R}_{\ell}, provided that these states are highly localized, i.e., no overlap between wave functions of different lattice sites. We moreover consider that the ground level is spatially threefold and the excited level is nondegenerate.

These (ν;𝐑\nu;{\bf R}_{\ell}) states with ν=(μ,g)\nu=(\mu,g) or ν=e\nu=e, are used to define the one-electron operators, in terms of which we formulate the system in second quantization.

Next, we turn from excited electron and ground electron absence to electron and hole. The phase factor that appears in this change differentiates spin-singlet from spin-triplet electron-hole pair subspaces in a straightforward way.

The Frenkel exciton problem begins with electron-hole pairs having a (2×32\times 3) degeneracy, due to spin and spatial degrees of freedom, that are each localized on one of the NsN_{s} lattice sites 𝐑{\bf R}_{\ell}. So, the matrix we have to diagonalize is (6Ns×6Ns6N_{s}\times 6N_{s}). We first turn to electrons and holes and then to the spin-singlet and spin-triplet combinations. The part in the spin-triplet subspace readily appears diagonal because interlevel Coulomb processes do not exist for spin-triplet pairs. So, we are left with diagonalizing a (3Ns×3Ns3N_{s}\times 3N_{s}) matrix in the spin-singlet subspace. A first diagonalization is performed by switching from NsN_{s} lattice sites 𝐑{\bf R}_{\ell} to NsN_{s} wave vectors 𝐊n{\bf K}_{n} through the phase factor given in Eq. (1). We remain with a set of (3×33\times 3) submatrices associated with different 𝐊n{\bf K}_{n} wave vectors. Due to the singular behavior of the interlevel Coulomb processes in the small 𝐊n{\bf K}_{n} limit, their diagonalization leads to the same transverse-longitudinal splitting along the 𝐊n{\bf K}_{n} direction, as the one found in the absence of spin.

Let us now study these four steps in details, to confirm the above understanding.

3 Frenkel exciton without spin and spatial degeneracies

3.1 Appropriate basis for quantum formulation

3.1.1 System Hamiltonian

We consider a neutral system made of NsN_{s} free electrons with mass m0m_{0}, charge |e|-|e|, spatial coordinate rj\textbf{r}_{j} for j=(1,,Nsj=(1,\cdots,N_{s}), and Ns{N_{s}} ions with infinite mass, charge |e||e|, located at the 𝐑{\bf R}_{\ell} nodes of a periodic lattice for =(1,,Ns\ell=(1,\cdots,N_{s}). The system Hamiltonian reads in first quantization as

HNs\displaystyle H_{N_{s}} =\displaystyle= j=1Ns𝐩j22m0+j=1Ns=1Nse2|𝐫j𝐑|+12j=1NsjjNse2|𝐫j𝐫j|\displaystyle\sum_{j=1}^{N_{s}}\frac{{\bf p}_{j}^{2}}{2m_{0}}+\sum_{j=1}^{N_{s}}\sum_{\ell=1}^{N_{s}}\frac{-e^{2}}{|{\bf r}_{j}-{\bf R}_{\ell}|}+\frac{1}{2}\sum_{j=1}^{N_{s}}\sum_{j^{\prime}\not=j}^{N_{s}}\frac{e^{2}}{|{\bf r}_{j}-{\bf r}_{j^{\prime}}|} (2)
+12=1NsNse2|𝐑𝐑|\displaystyle+\frac{1}{2}\sum_{\ell=1}^{N_{s}}\sum_{\ell^{\prime}\not=\ell}^{N_{s}}\frac{e^{2}}{|{\bf R}_{\ell}-{\bf R}_{\ell^{\prime}}|}

The first term corresponds to the electron kinetic energy, the second term to the electron-ion attraction, the third term to the electron-electron repulsion. The last term, which is a constant with respect to the electron motion, ensures the elimination of volume divergent terms coming from the long-range character of the Coulomb interaction when the sample volume L3L^{3} goes to infinity. Note that, by considering Ns{N_{s}} electron-ion pairs, we de facto consider that Ns/L3N_{s}/L^{3} has a finite value in the large LL thermodynamic limit.

We wish to note that the above Hamiltonian corresponds to electrons in the presence of point-charge ions. In reality, the crystal cells are occupied by a finite-size atom or molecule that has to be visualized as a “core” plus one electron either in the ground or excited level. The core includes the nucleus plus the remaining electron cloud, the core total charge being equal to |e||e| since the atom or molecule is neutral. Taking into account the cloud spatial extension would mean to replace the point-charge potential e2/|𝐫s,j𝐑|-e^{2}/|{\bf r}_{s,j}-{\bf R}_{\ell}| appearing in Eq. (2), by a v|e|(𝐫s,j𝐑)v_{|e|}({\bf r}_{s,j}-{\bf R}_{\ell}) potential corresponding to the same |e||e| charge but somewhat broadened over the 𝐑{\bf R}_{\ell} cell. The shape of this potential does not affect the Coulomb physics we here study because, as shown below, the formation of Frenkel excitons is entirely driven by the electron-electron Coulomb interaction. The precise shape of this potential only enters the wave functions of the one-electron states that are used in the quantum formulation of the problem, that is, the numerical values of the electron-electron Coulomb scatterings that appear in the formalism; the precise v|e|(𝐫s,j𝐑)v_{|e|}({\bf r}_{s,j}-{\bf R}_{\ell}) shape has no effect on the physics of the formalism we present.

For systems represented by the above Hamiltonian, the physically relevant electrons either have wave functions that are highly localized on the ion site at the lattice cell scale, or wave functions that are delocalized over the sample. In the former case, the excitons that are formed are called Frenkel excitons, while in the latter case, they are called Wannier excitons. In this work, we concentrate on Frenkel excitons.

The appropriate way to handle many-body states like the ones we here study, is through the second quantization formalism. The very first step is to choose a one-electron basis. Although any basis can be used, choosing a “good” basis facilitates the calculations and enlighten their physics. The “good basis” is made of one-body states that contain as much physics as possible. This prompts us to first analyze the problem, with this goal in mind.

3.1.2 On choosing the good one-electron basis

When the relevant electron wave functions are highly localized on ions, the NsN_{s}-electron states fundamentally correspond to one electron on each ion site. So, the good one-electron basis has to be related to the eigenstates for one electron in the presence of one ion. The Hamiltonian for one electron and a |e||e| ion located at 𝐑{\bf R}_{\ell} reads

h𝐑=𝐩22m0e2|𝐫𝐑|h_{\scriptscriptstyle{{\bf R}_{\ell}}}=\frac{{\bf p}^{2}}{2m_{0}}-\frac{e^{2}}{|{\bf r}-{\bf R}_{\ell}|} (3)

Its ν\nu eigenstates, with energy εν\varepsilon_{\nu} and wave function

𝐫|ν,𝐑=𝐫𝐑|ν\langle{\bf r}|\nu,{\bf R}_{\ell}\rangle=\langle{\bf r}-{\bf R}_{\ell}|\nu\rangle (4)

reduce to the hydrogen atom states when the core is replaced by a point charge. As with any Hamiltonian eigenstates, these states, which are orthogonal

ν,𝐑|ν,𝐑=δν,ν\langle\nu^{\prime},{\bf R}_{\ell}|\nu,{\bf R}_{\ell}\rangle=\delta_{\nu^{\prime},\nu} (5)

form a complete basis. So, the |ν,𝐑|\nu,{\bf R}_{\ell}\rangle states can a priori be used to describe an electron located on any other ion site: indeed, the ν,𝐑|ν,𝐑\langle\nu^{\prime},{\bf R}_{\ell^{\prime}}|\nu,{\bf R}_{\ell}\rangle overlaps for 𝐑𝐑{\bf R}_{\ell^{\prime}}\neq{\bf R}_{\ell} differ from zero for high-energy extended states. However, this requires using a very large number of ν\nu’s in the state description. Due to this, the |ν,𝐑|\nu,{\bf R}_{\ell}\rangle states with 𝐑{\bf R}_{\ell} fixed, do not constitute the good basis we are looking for, to describe NsN_{s}-body states with one electron on each 𝐑{\bf R}_{\ell} ion.

To construct this basis, we note that for highly localized states at the lattice cell scale, the wave function overlaps between different ions, although not exactly zero, are very small, even for nearby ions. So, a physically reasonable idea is to take the tight-binding approximation as strict, that is, no wave function overlap between different ions

ν|𝐫𝐑𝐫𝐑|ν=0for()and(ν,ν)=(e,g)\langle\nu^{\prime}|{\bf r}-{\bf R}_{\ell^{\prime}}\rangle\langle{\bf r}-{\bf R}_{\ell}|\nu\rangle=0\quad\textrm{for}\quad(\ell^{\prime}\not=\ell)\quad\textrm{and}\quad(\nu,\nu^{\prime})=(e,g) (6)

Indeed, the above mathematical limit requires highly localized electronic states, which is physically acceptable for the two lowest-energy states ν=(e,g)\nu=(e,g) that drive the Frenkel exciton physics.

We wish to stress that this tight-binding limit underlies the entire Frenkel exciton story. In particular, it energetically supports taking the semiconductor lowest-excited states as made of one electron in the ground level of the 𝐑{\bf R}_{\ell} lattice site, jumping to the lowest excited level of the same site, due to the electrostatic energy cost for separating the electron from its ion at a distance large compared to electronic state extension. It also is of importance to note that the tight-binding limit eliminates all intralevel Coulomb processes between different sites (see Fig. 1(a)), because the associated scatterings contain wave function overlaps e,𝐑|𝐫𝐫|e,𝐑\langle e,{\bf R}_{\ell^{\prime}}|{\bf r}\rangle\langle{\bf r}|e,{\bf R}_{\ell}\rangle that in this limit, reduce to zero for \ell^{\prime}\not=\ell. These intralevel processes are unimportant because the Frenkel exciton physics is driven by the interlevel Coulomb interaction (see Fig. 1(b)), with scatterings that read in terms of on-site overlaps g,𝐑|𝐫𝐫|e,𝐑\langle g,{\bf R}_{\ell}|{\bf r}\rangle\langle{\bf r}|e,{\bf R}_{\ell}\rangle. This is in stark contrast to Wannier excitons, for which the intraband Coulomb processes, i.e., the counterparts of the intralevel processes, are entirely responsible for the formation of the excitonic wave. Still, by accepting the above relation (6), we by construction drop all deviations from the tight-binding approximation. Although these deviations produce electron hopping from ion to ion, they do not affect the formation of the Frenkel exciton wave we here study.

Refer to caption
Figure 1: (a) In an intralevel Coulomb process, the electron keeps its ν\nu level while changing lattice site. The associated scattering then contains the wave function overlap between different sites — which reduces to zero in the tight-binding limit. (b) In an interlevel Coulomb process, the electron changes level while staying in the same lattice site.

When added to Eq. (5), the equation (6) leads to

ν,𝐑|ν,𝐑=δν,νδ,for(ν,ν)=(e,g)\langle\nu^{\prime},{\bf R}_{\ell^{\prime}}|\nu,{\bf R}_{\ell}\rangle=\delta_{\nu^{\prime},\nu}\delta_{\ell^{\prime},\ell}\quad\textrm{for}\quad(\nu,\nu^{\prime})=(e,g) (7)

This orthogonality allows us to use the |ν,𝐑|\nu,{\bf R}_{\ell}\rangle states as a one-electron basis to construct the set of creation operators necessary for a quantum description of the Frenkel exciton problem, through

|ν,𝐑=a^ν,|v|\nu,{\bf R}_{\ell}\rangle=\hat{a}^{\dagger}_{\nu,\ell}|v\rangle (8)

with |v|v\rangle denoting the vacuum state. Indeed, due to Eq. (7), these operators fulfill the anticommutation relations for fermion operators (see A),

[a^ν,,a^ν,]+\displaystyle\left[\hat{a}^{\dagger}_{\nu^{\prime},\ell^{\prime}},\hat{a}^{\dagger}_{\nu,\ell}\right]_{+} =\displaystyle= 0\displaystyle 0 (9)
[a^ν,,a^ν,]+\displaystyle\left[\hat{a}_{\nu^{\prime},\ell^{\prime}},\hat{a}^{\dagger}_{\nu,\ell}\right]_{+} =\displaystyle= δν,νδ,\displaystyle\delta_{\nu^{\prime},\nu}\,\,\delta_{\ell^{\prime},\ell} (10)

3.2 Hamiltonian in terms of (ν,\nu,\ell) electron operators

Once the good basis of the problem is determined, we can proceed along the second quantization formalism to rewrite the HNsH_{N_{s}} Hamiltonian for NsN_{s} electrons as a H^\hat{H} operator that reads in terms of the electron creation operators a^ν,\hat{a}^{\dagger}_{\nu,\ell} associated with the chosen basis. The H^\hat{H} operator is a priori valid whatever NsN_{s}, which is one of the advantages of the second quantization formalism. However, we must keep in mind that the H^\hat{H} operator we are going to write is only valid for a physics driven by highly localized states, like (g,e)(g,e), due to the tight-binding limit (6)(\ref{5}) that we have accepted to possibly construct the a^ν,\hat{a}^{\dagger}_{\nu,\ell} fermionic operators: this is necessary to avoid handling Slater determinants for many-body states.

The second quantization procedure to transform HNsH_{N_{s}} into H^\hat{H} depends on the nature of the operator at hand, one-body or two-body. Let us consider the various terms of HNsH_{N_{s}} successively.

3.2.1 One-body part

The one-body part of the HNsH_{N_{s}} Hamiltonian given in Eq. (2), which includes the electron kinetic energy and the electron-ion attraction, can be written as a sum of one-body Hamiltonians

H0,Ns=j=1Ns(𝐩j22m0+=1Nse2|𝐫j𝐑|)j=1NshjH_{0,{N_{s}}}=\sum_{j=1}^{N_{s}}\left(\frac{{\bf p}^{2}_{j}}{2m_{0}}+\sum_{\ell=1}^{N_{s}}\frac{-e^{2}}{|{\bf r}_{j}-{\bf R}_{\ell}|}\right)\equiv\sum_{j=1}^{N_{s}}h_{j} (11)

Note that hjh_{j} differs from the Hamiltonian h𝐑h_{\scriptscriptstyle{{\bf R}_{\ell}}} given in Eq. (3) because the jj electron interacts with all the ions. As a direct consequence, H0,NsH_{0,{N_{s}}} will not appear diagonal in the (ν,)(\nu,\ell) basis of Eq. (8).

According to the second quantization procedure, the H^0\hat{H}_{0} operator associated with H0,NsH_{0,{N_{s}}} reads as

H^0=ν,ν,hν,;ν,a^ν,a^ν,\hat{H}_{0}=\sum_{\nu^{\prime},\ell^{\prime}}\sum_{\nu,\ell}h_{\nu^{\prime},\ell^{\prime};\nu,\ell}\,\,\hat{a}^{\dagger}_{\nu^{\prime},\ell^{\prime}}\hat{a}_{\nu,\ell} (12)

The prefactor, given by

hν,;ν,=L3d3rν,𝐑|𝐫(𝐩^22m0+′′=1Nse2|𝐫𝐑′′|)𝐫|ν,𝐑\displaystyle h_{\nu^{\prime},\ell^{\prime};\nu,\ell}=\int_{L^{3}}d^{3}r\,\langle\nu^{\prime},{\bf R}_{\ell^{\prime}}|{\bf r}\rangle\left(\frac{\hat{{\bf p}}^{2}}{2m_{0}}{+}\sum_{\ell^{\prime\prime}=1}^{N_{s}}\frac{-e^{2}}{|{\bf r}{-}{\bf R}_{\ell^{\prime\prime}}|}\right)\langle{\bf r}|\nu,{\bf R}_{\ell}\rangle (13)

is calculated by first noting that the wave-function product is equal to zero for \ell^{\prime}\not=\ell, due to the tight-binding limit (6). Next, we note that |ν,𝐑|\nu,{\bf R}_{\ell}\rangle is h𝐑h_{\scriptscriptstyle{{\bf R}_{\ell}}} eigenstate. So, by isolating the ′′=\ell^{\prime\prime}=\ell term from the ′′\ell^{\prime\prime} sum, we can split this prefactor as

hν,;ν,=δ,(ενδν,ν+tν,ν)h_{\nu^{\prime},\ell^{\prime};\nu,\ell}=\delta_{\ell^{\prime},\ell}\left(\varepsilon_{\nu}\delta_{\nu^{\prime},\nu}+t_{\nu^{\prime},\nu}\right) (14)

The tν,νt_{\nu^{\prime},\nu} part comes from the electron interaction with all the other ions, ′′\ell^{\prime\prime}\not=\ell. Using Eq. (4) for 𝐫𝐑=𝐫{\bf r}-{\bf R}_{\ell}={\bf r}_{\ell}, it precisely reads

tν,ν=tν,ν\displaystyle t_{\nu^{\prime},\nu}=t^{\ast}_{\nu,\nu^{\prime}} =\displaystyle= L3d3rν|𝐫𝐫|ν′′Ne2|𝐫+𝐑𝐑′′|\displaystyle\int_{L^{3}}d^{3}r_{\ell}\,\langle\nu^{\prime}|{\bf r}_{\ell}\rangle\langle{\bf r}_{\ell}|\nu\rangle\sum_{\ell^{\prime\prime}\not=\ell}^{N}\frac{-e^{2}}{|{\bf r}_{\ell}+{\bf R}_{\ell}-{\bf R}_{\ell^{\prime\prime}}|} (15)
=\displaystyle= L3d3rν|𝐫𝐫|ν𝐑𝟎e2|𝐫𝐑|\displaystyle\int_{L^{3}}d^{3}r\,\langle\nu^{\prime}|{\bf r}\rangle\langle{\bf r}|\nu\rangle\sum_{{\bf R}\not=\bf 0}\frac{-e^{2}}{|{\bf r}-{\bf R}|}

the ′′\ell^{\prime\prime}\not=\ell sum becoming \ell-independent when the Born-von Karman boundary condition f(𝐫)=f(𝐫+𝐑)f({\bf r})=f({\bf r}+{\bf R}_{\ell}) is used to extend the lattice periodicity to a finite crystal.

We could think to drop the tν,νt_{\nu^{\prime},\nu} term that comes from the Coulomb interaction of the electron distribution ν|𝐫𝐫|ν\langle\nu^{\prime}|{\bf r}\rangle\langle{\bf r}|\nu\rangle, highly localized on the 𝐑=𝟎{\bf R}=\bf 0 site, with ions located on different lattice sites, where this distribution is very small. Yet, the long-range character of the Coulomb potential renders this dropping questionable in the large sample limit: we will see that the tν,νt_{\nu^{\prime},\nu} term is necessary to properly eliminate spurious large-r singularities that originate from the electron-electron repulsion.

The above equations give the one-body Hamiltonian H^0\hat{H}_{0} as

H^0=ν,ενa^ν,a^ν,+ν,ν,tν,νa^ν,a^ν,\hat{H}_{0}=\sum_{\nu,\ell}\varepsilon_{\nu}\,\hat{a}^{\dagger}_{\nu,\ell}\hat{a}_{\nu,\ell}+\sum_{\nu^{\prime},\nu,\ell}t_{\nu^{\prime},\nu}\,\hat{a}^{\dagger}_{\nu^{\prime},\ell}\hat{a}_{\nu,\ell} (16)

its second term allowing transitions between electronic levels on the same ion site.

3.2.2 Two-body electron-electron interaction

The HNsH_{N_{s}} Hamiltonian given in Eq. (2) also contains a two-body part that corresponds to the electron-electron repulsion. In second quantization, this interaction appears in terms of the electron operators a^ν,\hat{a}^{\dagger}_{\nu,\ell} as

V^ee=12{ν,}V(ν2,2ν2,2ν1,1ν1,1)a^ν1,1a^ν2,2a^ν2,2a^ν1,1\hat{V}_{e-e}=\frac{1}{2}\sum_{\{\nu,\ell\}}V\left(\begin{smallmatrix}\nu^{\prime}_{2},\ell^{\prime}_{2}&\hskip 2.84544pt\nu_{2},\ell_{2}\\ \nu^{\prime}_{1},\ell^{\prime}_{1}&\hskip 2.84544pt\nu_{1},\ell_{1}\end{smallmatrix}\right)\hat{a}^{\dagger}_{\nu^{\prime}_{1},\ell^{\prime}_{1}}\hat{a}^{\dagger}_{\nu^{\prime}_{2},\ell^{\prime}_{2}}\hat{a}_{\nu_{2},\ell_{2}}\hat{a}_{\nu_{1},\ell_{1}} (17)

The prefactor given by

V(ν2,2ν2,2ν1,1ν1,1)=\displaystyle V\left(\begin{smallmatrix}\nu^{\prime}_{2},\ell^{\prime}_{2}&\hskip 2.84544pt\nu_{2},\ell_{2}\\ \nu^{\prime}_{1},\ell^{\prime}_{1}&\hskip 2.84544pt\nu_{1},\ell_{1}\end{smallmatrix}\right)=
L3d3r1d3r2ν1,𝐑1|𝐫1ν2,𝐑2|𝐫2e2|𝐫1𝐫2|𝐫2|ν2,𝐑2𝐫1|ν1,𝐑1\displaystyle\iint_{L^{3}}d^{3}r_{1}d^{3}r_{2}\,\langle\nu^{\prime}_{1},{\bf R}_{\ell^{\prime}_{1}}|{\bf r}_{1}\rangle\langle\nu^{\prime}_{2},{\bf R}_{\ell^{\prime}_{2}}|{\bf r}_{2}\rangle\frac{e^{2}}{|{\bf r}_{1}-{\bf r}_{2}|}\langle{\bf r}_{2}|\nu_{2},{\bf R}_{\ell_{2}}\rangle\langle{\bf r}_{1}|\nu_{1},{\bf R}_{\ell_{1}}\rangle

differs from zero for 1=1\ell^{\prime}_{1}=\ell_{1} and 2=2\ell^{\prime}_{2}=\ell_{2} due to the tight-binding limit (6); so, it also reads as

V(ν2,2ν2,2ν1,1ν1,1)=δ1,1δ2,2𝒱𝐑1𝐑2(ν2ν2ν1ν1)V\left(\begin{smallmatrix}\nu^{\prime}_{2},\ell^{\prime}_{2}&\nu_{2},\ell_{2}\\ \hskip 2.84544pt\nu^{\prime}_{1},\ell^{\prime}_{1}&\hskip 2.84544pt\nu_{1},\ell_{1}\end{smallmatrix}\right)=\delta_{\ell^{\prime}_{1},\ell_{1}}\delta_{\ell^{\prime}_{2},\ell_{2}}\mathcal{V}_{{\bf R}_{\ell_{1}}-{\bf R}_{\ell_{2}}}\left(\begin{smallmatrix}\nu^{\prime}_{2}&\nu_{2}\\ \hskip 2.84544pt\nu^{\prime}_{1}&\hskip 2.84544pt\nu_{1}\end{smallmatrix}\right)\hskip 99.58464pt (19)

with the Coulomb part given by

𝒱𝐑(ν2ν2ν1ν1)=L3d3r1d3r2ν1|𝐫1ν2|𝐫2e2|𝐑+𝐫1𝐫2|𝐫2|ν2𝐫1|ν1\mathcal{V}_{{\bf R}}\left(\begin{smallmatrix}\nu^{\prime}_{2}&\hskip 2.84544pt\nu_{2}\\ \nu^{\prime}_{1}&\hskip 2.84544pt\nu_{1}\end{smallmatrix}\right)=\iint_{L^{3}}d^{3}r_{\ell_{1}}d^{3}r_{\ell_{2}}\,\langle\nu^{\prime}_{1}|{\bf r}_{\ell_{1}}\rangle\langle\nu^{\prime}_{2}|{\bf r}_{\ell_{2}}\rangle\frac{e^{2}}{|{\bf R}+{\bf r}_{\ell_{1}}-{\bf r}_{\ell_{2}}|}\langle{\bf r}_{\ell_{2}}|\nu_{2}\rangle\langle{\bf r}_{\ell_{1}}|\nu_{1}\rangle (20)

As a result, the electron-electron interaction (17) appears as

V^ee=12{ν}1,2𝒱𝐑1𝐑2(ν2ν2ν1ν1)a^ν1,1a^ν2,2a^ν2,2a^ν1,1\displaystyle\hat{V}_{e-e}=\frac{1}{2}\sum_{\{\nu\}}\sum_{\ell_{1},\ell_{2}}\mathcal{V}_{{\bf R}_{\ell_{1}}-{\bf R}_{\ell_{2}}}\left(\begin{smallmatrix}\nu^{\prime}_{2}&\hskip 2.84544pt\nu_{2}\\ \hskip 2.84544pt\nu^{\prime}_{1}&\nu_{1}\end{smallmatrix}\right)\hat{a}^{\dagger}_{\nu^{\prime}_{1},\ell_{1}}\hat{a}^{\dagger}_{\nu^{\prime}_{2},\ell_{2}}\hat{a}_{\nu_{2},\ell_{2}}\hat{a}_{\nu_{1},\ell_{1}} (21)

This interaction destroys two electrons located on the 1\ell_{1} and 2\ell_{2} sites and recreates them on the same sites, either in the same level ν=ν\nu^{\prime}=\nu, or in a different level νν\nu^{\prime}\not=\nu. The fact that the Coulomb interaction is restricted to on-site processes, follows from enforcing the tight-binding limit to the relevant electronic states.

3.2.3 Ion-ion interaction

The ion-ion interaction, i.e., the last term of the HNsH_{N_{s}} Hamiltonian (2), provides a constant contribution equal to

Vii=12=1NsNse2|𝐑𝐑|=Ns2𝐑𝟎e2|𝐑|V_{i-i}=\frac{1}{2}\sum_{\ell=1}^{N_{s}}\sum_{\ell^{\prime}\not=\ell}^{N_{s}}\frac{e^{2}}{|{\bf R}_{\ell}-{\bf R}_{\ell^{\prime}}|}=\frac{{N_{s}}}{2}\sum_{{\bf R}\not=\bf 0}\frac{e^{2}}{|{\bf R}|} (22)

due to the lattice periodicity extended through the Born-von Karman boundary condition.

3.3 Semiconductor states for Ns{N_{s}} electrons

3.3.1 Ground state |Φg|\Phi_{g}\rangle

\bullet Let |g;𝐑=a^g,|v|g;{\bf R}_{\ell}\rangle=\hat{a}^{\dagger}_{g,\ell}|v\rangle be the electronic ground state of the 𝐑{\bf R}_{\ell} ion, with energy εg\varepsilon_{g}. The Ns{N_{s}}-electron ground state has one electron in the ground level of each 𝐑{\bf R}_{\ell} ion. In second quantization, this state reads

|Φg=a^g,1a^g,2a^g,Ns|v|\Phi_{g}\rangle=\hat{a}^{\dagger}_{g,1}\hat{a}^{\dagger}_{g,2}\cdots\hat{a}^{\dagger}_{g,{N_{s}}}|v\rangle (23)

In first quantization, this state would be written through its wave function represented by the Slater determinant

1Ns!|𝐫1𝐑1|g𝐫1𝐑Ns|g𝐫Ns𝐑1|g𝐫Ns𝐑Ns|g|\frac{1}{\sqrt{{N_{s}}!}}\begin{vmatrix}\langle{\bf r}_{1}-{\bf R}_{1}|g\rangle&\cdots&\langle{\bf r}_{1}-{\bf R}_{N_{s}}|g\rangle\\ \vdots&\ddots&\vdots\\ \langle{\bf r}_{\scriptscriptstyle{N_{s}}}-{\bf R}_{1}|g\rangle&\cdots&\langle{\bf r}_{\scriptscriptstyle{N_{s}}}-{\bf R}_{N_{s}}|g\rangle\end{vmatrix} (24)

which definitely is far heavier to handle than the operator form given above, let alone all the tricky minus signs that would have to be followed up carefully when calculating scalar products involving such Slater determinants. We will come back to these Slater determinants in Sec. 7.

\bullet In the absence of Coulomb interactions between the electron of a given ion and the other (Ns1)({N_{s}}-1) ions, and between the electrons themselves, the |Φg|\Phi_{g}\rangle energy reduces to

Eg=NsεgE_{g}={N_{s}}\,\varepsilon_{g} (25)

With these Coulomb interactions treated at first order, the ground-state energy reads as

Eg=Φg|H^|ΦgE^{\prime}_{g}=\langle\Phi_{g}|\hat{H}|\Phi_{g}\rangle (26)

3.3.2 Lowest set of excited states |Φ𝐑|\Phi_{{\bf R}_{\ell}}\rangle

\bullet The lowest excited states for Ns{N_{s}} electrons have one electron excited from the ground level gg of the 𝐑{\bf R}_{\ell} ion, to the lowest excited level ee of the same ion. Indeed, as previously said, exciting the electron onto another ion site would induce a charge separation with an electrostatic energy cost that would lead to a higher excited-state subspace. So, the lowest excited states follow from |Φg|\Phi_{g}\rangle with a^g,\hat{a}^{\dagger}_{g,\ell} replaced by a^e,\hat{a}^{\dagger}_{e,\ell}, namely

|Φ𝐑\displaystyle|\Phi_{{\bf R}_{\ell}}\rangle =\displaystyle= a^g,1a^g,2a^g,1(a^e,)a^g,+1a^g,N|v\displaystyle\hat{a}^{\dagger}_{g,1}\hat{a}^{\dagger}_{g,2}\cdots\hat{a}^{\dagger}_{g,\ell-1}\left(\hat{a}^{\dagger}_{e,\ell}\right)\hat{a}^{\dagger}_{g,\ell+1}\cdots\hat{a}^{\dagger}_{g,N}|v\rangle (27)
=\displaystyle= a^e,ag,|Φg\displaystyle\hat{a}^{\dagger}_{e,\ell}a_{g,\ell}|\Phi_{g}\rangle

since a pair of electron operators commutes with different electron operators. By using the anticommutation relations (9,10), and the fact that in |Φg|\Phi_{g}\rangle, the ground level of all ion sites is occupied by one electron, we can check that these excited states form an orthogonal set, namely

Φ𝐑|Φ𝐑=δ,\langle\Phi_{{\bf R}_{\ell^{\prime}}}|\Phi_{{\bf R}_{\ell}}\rangle=\delta_{\ell^{\prime},\ell} (28)

\bullet Since such a jump can occur on any 𝐑{\bf R}_{\ell} ion, the energy of the Ns{N_{s}} states |Φ𝐑|\Phi_{{\bf R}_{\ell}}\rangle in the absence of Coulomb interactions between the electron of an ion and the other ions and between the electrons themselves, reads

Ee=(Ns1)εg+εe=Eg+(εeεg)E_{e}=({N_{s}}-1)\varepsilon_{g}+\varepsilon_{e}=E_{g}+(\varepsilon_{e}-\varepsilon_{g}) (29)

In the presence of these Coulomb interactions, treated at first order, the excited-state energies follow from the diagonalization of the H^\hat{H} Hamiltonian within the |Φ𝐑|\Phi_{{\bf R}_{\ell}}\rangle degenerate subspace, that is, the diagonalization of the (Ns×Ns{N_{s}}\times{N_{s}}) matrix

Φ𝐑|H^|Φ𝐑\langle\Phi_{{\bf R}_{\ell^{\prime}}}|\hat{H}|\Phi_{{\bf R}_{\ell}}\rangle (30)

3.4 Relevant parts of the system Hamiltonian

The relevant parts of H^\hat{H} in Φ𝐑|H^|Φ𝐑\langle\Phi_{{\bf R}_{\ell^{\prime}}}|\hat{H}|\Phi_{{\bf R}_{\ell}}\rangle are the ones that conserve the number of ground-level electrons and the number of excited-level electrons. Let us write them explicitly to better catch their physics.

3.4.1 One-body parts

The part of the one-body Hamiltonian H^0\hat{H}_{0} given in Eq. (16) that conserves the number of ground-level electrons and the number of excited-level electrons, reduces to

H^0H^0,g+H^0,e\hat{H}_{0}\longrightarrow\hat{H}_{0,g}+\hat{H}_{0,e} (31)

with the ground and excited parts given by

H^0,g\displaystyle\hat{H}_{0,g} =\displaystyle= (εg+tg,g)=1Nsa^g,a^g,\displaystyle(\varepsilon_{g}+t_{g,g})\sum_{\ell=1}^{N_{s}}\hat{a}^{\dagger}_{g,\ell}\hat{a}_{g,\ell} (32)
H^0,e\displaystyle\hat{H}_{0,e} =\displaystyle= (εe+te,e)=1Nsa^e,a^e,\displaystyle(\varepsilon_{e}+t_{e,e})\sum_{\ell=1}^{N_{s}}\hat{a}^{\dagger}_{e,\ell}\hat{a}_{e,\ell} (33)

3.4.2 Two-body electron-electron interactions

The part of the V^ee\hat{V}_{e-e} interaction given in Eq. (21), that conserves the number of ground-level electrons and the number of excited-level electrons, reduces to

V^eeV^gg+V^eg(intra)+V^eg(inter)\hat{V}_{e-e}\longrightarrow\hat{V}_{gg}+\hat{V}^{(intra)}_{eg}+\hat{V}^{(inter)}_{eg} (34)

\bullet In V^gg\hat{V}_{gg}, given by

V^gg=121=1Ns21Ns𝒱𝐑1𝐑2(gggg)a^g,1a^g,2a^g,2a^g,1\hat{V}_{gg}=\frac{1}{2}\sum_{\ell_{1}=1}^{N_{s}}\,\,\sum_{\ell_{2}\not=\ell_{1}}^{N_{s}}\mathcal{V}_{{\bf R}_{\ell_{1}}-{\bf R}_{\ell_{2}}}\left(\begin{smallmatrix}g&g\\ g&g\end{smallmatrix}\right)\hat{a}^{\dagger}_{g,\ell_{1}}\hat{a}^{\dagger}_{g,\ell_{2}}\hat{a}_{g,\ell_{2}}\hat{a}_{g,\ell_{1}} (35)

the Coulomb interaction acts on the ground level only (see Fig. 2(a)), with 2\ell_{2} different from 1\ell_{1} due to the Pauli exclusion principle.

Refer to caption
Figure 2: (a) In V^gg\hat{V}_{gg} given in Eq. (35), the Coulomb interaction acts between two ground-level electrons. (b) In V^eg(intra)\hat{V}^{(intra)}_{eg} given in Eq. (36), the Coulomb interaction acts between one ground-level electron and one excited-level electron, each electron staying in its level. (c) In V^eg(inter)\hat{V}^{(inter)}_{eg} given in Eq. (37), these electrons change level.

\bullet In the intralevel interaction on the same ion, V^eg(intra)\hat{V}^{(intra)}_{eg}, given by

V^eg(intra)=1=1Ns2=1Ns𝒱𝐑1𝐑2(eegg)a^g,1a^e,2a^e,2a^g,1\hat{V}^{(intra)}_{eg}=\sum_{\ell_{1}=1}^{N_{s}}\,\,\sum_{\ell_{2}=1}^{N_{s}}\mathcal{V}_{{\bf R}_{\ell_{1}}-{\bf R}_{\ell_{2}}}\left(\begin{smallmatrix}e&e\\ g&g\end{smallmatrix}\right)\hat{a}^{\dagger}_{g,\ell_{1}}\hat{a}^{\dagger}_{e,\ell_{2}}\hat{a}_{e,\ell_{2}}\hat{a}_{g,\ell_{1}} (36)

each electron stays in its level (see Fig. 2(b)).

\bullet The electron-electron Coulomb interaction also has interlevel processes

V^eg(inter)=1=1Ns2=1Ns𝒱𝐑1𝐑2(geeg)a^e,1a^g,2a^e,2a^g,1\hat{V}^{(inter)}_{eg}=\sum_{\ell_{1}=1}^{N_{s}}\,\,\sum_{\ell_{2}=1}^{N_{s}}\mathcal{V}_{{\bf R}_{\ell_{1}}-{\bf R}_{\ell_{2}}}\left(\begin{smallmatrix}g&e\\ e&g\end{smallmatrix}\right)\hat{a}^{\dagger}_{e,\ell_{1}}\hat{a}^{\dagger}_{g,\ell_{2}}\hat{a}_{e,\ell_{2}}\hat{a}_{g,\ell_{1}} (37)

in which each electron changes level (see Fig. 2(c)). These processes are the crucial ones to produce the Frenkel exciton wave because when 21\ell_{2}\not=\ell_{1}, the excitation moves from the 𝐑2{\bf R}_{\ell_{2}} site to the 𝐑1{\bf R}_{\ell_{1}} site.

3.4.3 System Hamiltonian in the ground and excited subspaces

We now have all the tools to calculate the energy of the Ns{N_{s}}-electron ground state |Φg|\Phi_{g}\rangle through Φg|H^|Φg\langle\Phi_{g}|\hat{H}|\Phi_{g}\rangle, and the energy of the lowest excited states through the diagonalization of the (Ns×Ns{N_{s}}\times{N_{s}}) matrix Φ𝐑|H^|Φ𝐑\langle\Phi_{{\bf R}_{\ell^{\prime}}}|\hat{H}|\Phi_{{\bf R}_{\ell}}\rangle. These calculations are given in B and C.

Actually, the precise value of the ground-state energy Φg|H^|Φg\langle\Phi_{g}|\hat{H}|\Phi_{g}\rangle does not matter when it comes to determining the Frenkel exciton waves because they just follow from the diagonalization of the Φ𝐑|H^|Φ𝐑\langle\Phi_{{\bf R}_{\ell^{\prime}}}|\hat{H}|\Phi_{{\bf R}_{\ell}}\rangle matrix. Moreover, the proper way to calculate the Φ𝐑|H^|Φ𝐑\langle\Phi_{{\bf R}_{\ell^{\prime}}}|\hat{H}|\Phi_{{\bf R}_{\ell}}\rangle matrix elements is not in terms of ground-level and excited-level electrons as done in C, but in terms of excitations, that is, electrons and holes. Let us turn to this language.

3.5 From ground-level and excited-level electrons to electrons and holes

3.5.1 Electron and hole operators

The smart way to perform calculations involving semiconductor excitations is to introduce the concept of hole[32, 33]: the destruction of a ground-level electron on the 𝐑{\bf R}_{\ell} lattice site corresponds to the creation of a hole on the same site. In terms of operators, this reads

a^g,=b^\hat{a}_{g,\ell}=\hat{b}^{\dagger}_{\ell} (38)

without any phase factor in the absence of spin and spatial degeneracy. In this language, an electron in the lowest excited level is just called “electron”, with creation operator

a^e,=a^\hat{a}^{\dagger}_{e,\ell}=\hat{a}^{\dagger}_{\ell} (39)

The lowest set of excited states |Φ𝐑|\Phi_{{\bf R}_{\ell}}\rangle given in Eq. (27) then appears as

|Φ𝐑=a^b^|Φga^b^|0|\Phi_{{\bf R}_{\ell}}\rangle=\hat{a}^{\dagger}_{\ell}\hat{b}^{\dagger}_{\ell}|\Phi_{g}\rangle\equiv\hat{a}^{\dagger}_{\ell}\hat{b}^{\dagger}_{\ell}|0\rangle (40)

as the |Φg|\Phi_{g}\rangle ground state contains zero hole and zero “electron” in the sense of Eq. (39).

To fully exploit the advantage of describing the system in terms of electrons and holes, we also have to write the relevant parts of the H^\hat{H} Hamiltonian in terms of these operators.

3.5.2 Hamiltonian in terms of electrons and holes

\bullet The one-body ground part of the H^\hat{H} Hamiltonian given in Eq. (32) becomes, when using hole operators,

H^0,g=N(εg+tg,g)(εg+tg,g)=1Nb^b^\displaystyle\hat{H}_{0,g}=N(\varepsilon_{g}+t_{g,g})-(\varepsilon_{g}+t_{g,g})\sum_{\ell=1}^{N}\,\hat{b}^{\dagger}_{\ell}\hat{b}_{\ell} (41)

since a^g,a^g,=1a^g,a^g,\hat{a}^{\dagger}_{g,\ell}\hat{a}_{g,\ell}=1-\hat{a}_{g,\ell}\hat{a}^{\dagger}_{g,\ell}. The first term is just equal to Φg|H^0|Φg\langle\Phi_{g}|\hat{H}_{0}|\Phi_{g}\rangle because the |Φg|\Phi_{g}\rangle ground state does not have hole nor electron in the sense of “excited electron”.

The one-body excited part of the H^\hat{H} Hamiltonian given in Eq. (33) simply leads to

H^0,e=(εe+te,e)=1Na^a^\displaystyle\hat{H}_{0,e}=(\varepsilon_{e}+t_{e,e})\sum_{\ell=1}^{N}\hat{a}^{\dagger}_{\ell}\hat{a}_{\ell} (42)

\bullet Next, we consider the V^gg\hat{V}_{gg} part of the V^ee\hat{V}_{e-e} Coulomb interaction between ground-level electrons only, given in Eq. (35). From a^g,1a^g,2a^g,2a^g,1=1a^g,1a^g,1a^g,2a^g,2+a^g,1a^g,2a^g,2a^g,1\hat{a}^{\dagger}_{g,\ell_{1}}\hat{a}^{\dagger}_{g,\ell_{2}}\hat{a}_{g,\ell_{2}}\hat{a}_{g,\ell_{1}}=1-\hat{a}_{g,\ell_{1}}\hat{a}^{\dagger}_{g,\ell_{1}}-\hat{a}_{g,\ell_{2}}\hat{a}^{\dagger}_{g,\ell_{2}}+\hat{a}_{g,\ell_{1}}\hat{a}_{g,\ell_{2}}\hat{a}^{\dagger}_{g,\ell_{2}}\hat{a}^{\dagger}_{g,\ell_{1}} for 21\ell_{2}\not=\ell_{1}, the V^gg\hat{V}_{gg} interaction splits into three terms when written with electron and hole operators, namely

V^gg=12121𝒱𝐑1𝐑2(gggg)2121b^1b^1(21𝒱𝐑1𝐑2(gggg))+V^hh(intra)\displaystyle\hat{V}_{gg}=\frac{1}{2}\sum_{\ell_{1}}\!\!\sum_{\ell_{2}\not=\ell_{1}}\!\!\mathcal{V}_{{\bf R}_{\ell_{1}}-{\bf R}_{\ell_{2}}}\!\left(\begin{smallmatrix}g&g\\ g&g\end{smallmatrix}\right)-2\,\frac{1}{2}\sum_{\ell_{1}}\hat{b}^{\dagger}_{\ell_{1}}\hat{b}_{\ell_{1}}\!\!\left(\sum_{\ell_{2}\not=\ell_{1}}\mathcal{V}_{{\bf R}_{\ell_{1}}-{\bf R}_{\ell_{2}}}\!\left(\begin{smallmatrix}g&g\\ g&g\end{smallmatrix}\right)\right)+\hat{V}^{(intra)}_{hh}

The first V^gg\hat{V}_{gg} term is nothing but Φg|V^gg|Φg\langle\Phi_{g}|\hat{V}_{gg}|\Phi_{g}\rangle because the other terms of V^gg\hat{V}_{gg} require states having one hole at least to produce a nonzero contribution while |Φg|\Phi_{g}\rangle has no hole. The second term brings a constant shift to the hole energy since for a periodic crystal, the prefactor of b^1b^1\hat{b}^{\dagger}_{\ell_{1}}\hat{b}_{\ell_{1}} does not depend on 1\ell_{1}. The last V^gg\hat{V}_{gg} term corresponds to the Coulomb repulsion between two holes

V^hh(intra)=12121𝒱𝐑1𝐑2(gggg)b^1b^2b^2b^1\hat{V}^{(intra)}_{hh}=\frac{1}{2}\sum_{\ell_{1}}\sum_{\ell_{2}\not=\ell_{1}}\mathcal{V}_{{\bf R}_{\ell_{1}}-{\bf R}_{\ell_{2}}}\!\left(\begin{smallmatrix}g&g\\ g&g\end{smallmatrix}\right)\hat{b}^{\dagger}_{\ell_{1}}\hat{b}^{\dagger}_{\ell_{2}}\hat{b}_{\ell_{2}}\hat{b}_{\ell_{1}} (43)
Refer to caption
Figure 3: (a) In V^eh(intra)\hat{V}^{(intra)}_{eh} given in Eq. (45), the Coulomb interaction acts between one excited-level electron and one ground-level electron absence, that is, one hole, each carrier staying in their level. (b) In V^eh(inter)\hat{V}^{(inter)}_{eh} given in Eq. (47), the electron and the hole of the 𝐑2{\bf R}_{\ell_{2}} site recombine, while an electron-hole pair is created on the 𝐑1{\bf R}_{\ell_{1}} site, possibly different from 𝐑2{\bf R}_{\ell_{2}}; this then leads to an excitation transfer from the 2\ell_{2} site to the 1\ell_{1} site.

\bullet The intralevel part (36) of the Coulomb interaction, V^eg(intra)\hat{V}^{(intra)}_{eg} between ground-level and excited-level electrons, leads to two terms since b^b^=1b^b^\hat{b}_{\ell}\hat{b}^{\dagger}_{\ell}=1-\hat{b}^{\dagger}_{\ell}\hat{b}_{\ell}, namely

V^eg(intra)\displaystyle\hat{V}^{(intra)}_{eg} =\displaystyle= 12𝒱𝐑1𝐑2(eegg)b^1a^2a^2b^1\displaystyle\sum_{\ell_{1}}\sum_{\ell_{2}}\mathcal{V}_{{\bf R}_{\ell_{1}}-{\bf R}_{\ell_{2}}}\!\left(\begin{smallmatrix}e&e\\ g&g\end{smallmatrix}\right)\hat{b}_{\ell_{1}}\hat{a}^{\dagger}_{\ell_{2}}\hat{a}_{\ell_{2}}\hat{b}^{\dagger}_{\ell_{1}} (44)
=\displaystyle= 𝐑𝒱𝐑(eegg)a^a^+V^eh(intra)\displaystyle\sum_{\bf R}\mathcal{V}_{{\bf R}}\left(\begin{smallmatrix}e&e\\ g&g\end{smallmatrix}\right)\sum_{\ell}\hat{a}^{\dagger}_{\ell}\hat{a}_{\ell}+\hat{V}^{(intra)}_{eh}

The first V^eg(intra)\hat{V}^{(intra)}_{eg} term brings a constant shift to the electron energy, while V^eh(intra)\hat{V}^{(intra)}_{eh} corresponds to a Coulomb attraction in which the electron stays electron and the hole stays hole (see Fig. 3(a)), namely

V^eh(intra)=12𝒱𝐑1𝐑2(eegg)a^2b^1b^1a^2\hat{V}^{(intra)}_{eh}=-\sum_{\ell_{1}}\sum_{\ell_{2}}\mathcal{V}_{{\bf R}_{\ell_{1}}-{\bf R}_{\ell_{2}}}\!\left(\begin{smallmatrix}e&e\\ g&g\end{smallmatrix}\right)\hat{a}^{\dagger}_{\ell_{2}}\hat{b}^{\dagger}_{\ell_{1}}\hat{b}_{\ell_{1}}\hat{a}_{\ell_{2}} (45)

\bullet The Coulomb interaction also has an interlevel part given in Eq. (37), which becomes, since b^2b^1=δ1,2b^1b^2\hat{b}_{\ell_{2}}\hat{b}^{\dagger}_{\ell_{1}}=\delta_{\ell_{1},\ell_{2}}-\hat{b}^{\dagger}_{\ell_{1}}\hat{b}_{\ell_{2}},

V^eg(inter)\displaystyle\hat{V}^{(inter)}_{eg} =\displaystyle= 12𝒱𝐑1𝐑2(geeg)a^1b^2a^2b^1\displaystyle\sum_{\ell_{1}}\sum_{\ell_{2}}\mathcal{V}_{{\bf R}_{\ell_{1}}-{\bf R}_{\ell_{2}}}\!\left(\begin{smallmatrix}g&e\\ e&g\end{smallmatrix}\right)\hat{a}^{\dagger}_{\ell_{1}}\hat{b}_{\ell_{2}}\hat{a}_{\ell_{2}}\hat{b}^{\dagger}_{\ell_{1}} (46)
=\displaystyle= 𝒱𝐑=𝟎(geeg)a^a^+V^eh(inter)\displaystyle-\mathcal{V}_{{\bf R}=\bf 0}\left(\begin{smallmatrix}g&e\\ e&g\end{smallmatrix}\right)\sum_{\ell}\hat{a}^{\dagger}_{\ell}\hat{a}_{\ell}+\hat{V}^{(inter)}_{eh}

where V^eh(inter)\hat{V}^{(inter)}_{eh} corresponds to an interaction in which one electron-hole pair recombines on the 𝐑2{\bf R}_{\ell_{2}} site, while another pair is created on the 𝐑1{\bf R}_{\ell_{1}} site, possibly different from 𝐑2{\bf R}_{\ell_{2}} (see Fig. 3(b)), namely

V^eh(inter)=12𝒱𝐑1𝐑2(geeg)a^1b^1b^2a^2\hat{V}^{(inter)}_{eh}=\sum_{\ell_{1}}\sum_{\ell_{2}}\mathcal{V}_{{\bf R}_{\ell_{1}}-{\bf R}_{\ell_{2}}}\!\left(\begin{smallmatrix}g&e\\ e&g\end{smallmatrix}\right)\hat{a}^{\dagger}_{\ell_{1}}\hat{b}^{\dagger}_{\ell_{1}}\hat{b}_{\ell_{2}}\hat{a}_{\ell_{2}} (47)

Note the positive sign in front of this important term for the Frenkel exciton formation.

3.5.3 Relevant Hamiltonian for Frenkel excitons

Since V^hh(intra)\hat{V}^{(intra)}_{hh} acts between two holes, the relevant parts of the Coulomb interaction in the one-hole subspace reduce to V^eh(intra)\hat{V}^{(intra)}_{eh} given in Eq. (45) and V^eh(inter)\hat{V}^{(inter)}_{eh} given in Eq. (47). As a result, the relevant parts of the total Hamiltonian H^\hat{H} in the electron-hole subspace |Φ𝐑|\Phi_{{\bf R}_{\ell}}\rangle reduces to

H^eh=Φg|H^|Φg+H^e+H^h+V^eh(intra)+V^eh(inter)\hat{H}_{eh}=\langle\Phi_{g}|\hat{H}|\Phi_{g}\rangle+\hat{H}_{e}+\hat{H}_{h}+\hat{V}^{(intra)}_{eh}+\hat{V}^{(inter)}_{eh} (48)

With the help of Eq. (48), we find that the first term of H^eh\hat{H}_{eh} is just

Φg|H^eh|Φg=Φg|H^|Φg=Eg\langle\Phi_{g}|\hat{H}_{eh}|\Phi_{g}\rangle=\langle\Phi_{g}|\hat{H}|\Phi_{g}\rangle=E^{\prime}_{g} (49)

the other terms of H^\hat{H} requiring states with one electron or one hole to contribute.

Note that the one-body part of H^eh\hat{H}_{eh} also contains Coulomb contributions, as seen from its electron part

H^e\displaystyle\hat{H}_{e} =\displaystyle= ε~ea^a^\displaystyle\tilde{\varepsilon}_{e}\sum_{\ell}\hat{a}^{\dagger}_{\ell}\hat{a}_{\ell} (50)
ε~e\displaystyle\tilde{\varepsilon}_{e} =\displaystyle= εe+te,e+𝐑𝒱𝐑(eegg)𝒱𝐑=𝟎(geeg)\displaystyle\varepsilon_{e}+t_{e,e}+\sum_{\bf R}\mathcal{V}_{{\bf R}}\!\left(\begin{smallmatrix}e&e\\ g&g\end{smallmatrix}\right)-\mathcal{V}_{{\bf R}=\bf 0}\!\left(\begin{smallmatrix}g&e\\ e&g\end{smallmatrix}\right) (51)

and its hole part

H^h\displaystyle\hat{H}_{h} =\displaystyle= ε~hb^b^\displaystyle\tilde{\varepsilon}_{h}\sum_{\ell}\hat{b}^{\dagger}_{\ell}\hat{b}_{\ell} (52)
ε~h\displaystyle\tilde{\varepsilon}_{h} =\displaystyle= εgtg,g𝐑𝟎𝒱𝐑(gggg)\displaystyle-\varepsilon_{g}-t_{g,g}-\sum_{{\bf R}\not=\bf 0}\mathcal{V}_{{\bf R}}\!\left(\begin{smallmatrix}g&g\\ g&g\end{smallmatrix}\right) (53)

3.5.4 Electron-hole Hamiltonian in the |Φ𝐑|\Phi_{{\bf R}_{\ell}}\rangle excited subspace

\bullet By using Eqs. (40,50,52), we get the contribution from the one-body part of the Hamiltonian as

(H^e+H^h)|Φ𝐑=(H^e+H^h)a^b^|0=(ε~e+ε~h)|Φ𝐑\displaystyle\left(\hat{H}_{e}+\hat{H}_{h}\right)|\Phi_{{\bf R}_{\ell}}\rangle=\left(\hat{H}_{e}+\hat{H}_{h}\right)\,\hat{a}^{\dagger}_{\ell}\hat{b}^{\dagger}_{\ell}|0\rangle=\left(\tilde{\varepsilon}_{e}+\tilde{\varepsilon}_{h}\right)|\Phi_{{\bf R}_{\ell}}\rangle (54)

Turning to the Coulomb interaction, we find that the intralevel part of Eq. (45) simply gives

V^eh(intra)|Φ𝐑\displaystyle\hat{V}^{(intra)}_{eh}|\Phi_{{\bf R}_{\ell}}\rangle =\displaystyle= 1,2𝒱𝐑1𝐑2(eegg)(a^2b^1b^1a^2)a^b^|0\displaystyle-\sum_{\ell_{1},\ell_{2}}\mathcal{V}_{{\bf R}_{\ell_{1}}-{\bf R}_{\ell_{2}}}\!\left(\begin{smallmatrix}e&e\\ g&g\end{smallmatrix}\right)\left(\hat{a}^{\dagger}_{\ell_{2}}\hat{b}^{\dagger}_{\ell_{1}}\hat{b}_{\ell_{1}}\hat{a}_{\ell_{2}}\right)\hat{a}^{\dagger}_{\ell}\hat{b}^{\dagger}_{\ell}|0\rangle (55)
=\displaystyle= 𝒱𝐑=𝟎(eegg)|Φ𝐑\displaystyle-\mathcal{V}_{{\bf R}=\bf 0}\!\left(\begin{smallmatrix}e&e\\ g&g\end{smallmatrix}\right)|\Phi_{{\bf R}_{\ell}}\rangle\hskip 28.45274pt

for 𝐑=𝟎{\bf R}=\bf 0 because b^1a^2a^b^|Φg\hat{b}_{\ell_{1}}\hat{a}_{\ell_{2}}\hat{a}^{\dagger}_{\ell}\hat{b}^{\dagger}_{\ell}|\Phi_{g}\rangle differs from zero for 2==1\ell_{2}=\ell=\ell_{1} only.

In the same way, the interlevel part of Eq. (47) leads to

V^eh(intra)|Φ𝐑\displaystyle\hat{V}^{(intra)}_{eh}|\Phi_{{\bf R}_{\ell}}\rangle =\displaystyle= 1,2𝒱𝐑1𝐑2(geeg)(a^1b^1b^2a^2)a^b^|0\displaystyle\sum_{\ell_{1},\ell_{2}}\mathcal{V}_{{\bf R}_{\ell_{1}}-{\bf R}_{\ell_{2}}}\!\left(\begin{smallmatrix}g&e\\ e&g\end{smallmatrix}\right)\left(\hat{a}^{\dagger}_{\ell_{1}}\hat{b}^{\dagger}_{\ell_{1}}\hat{b}_{\ell_{2}}\hat{a}_{\ell_{2}}\right)\hat{a}^{\dagger}_{\ell}\hat{b}^{\dagger}_{\ell}|0\rangle (56)
=\displaystyle= 1𝒱𝐑1𝐑(geeg)|Φ𝐑1\displaystyle\sum_{\ell_{1}}\mathcal{V}_{{\bf R}_{\ell_{1}}-{\bf R}_{\ell}}\!\left(\begin{smallmatrix}g&e\\ e&g\end{smallmatrix}\right)|\Phi_{{\bf R}_{\ell_{1}}}\rangle

\bullet So, the diagonal term of the H^eh\hat{H}_{eh} operator in the |Φ𝐑|\Phi_{{\bf R}_{\ell}}\rangle subspace reduces to

Φ𝐑|H^eh|Φ𝐑=Eg+εeεg+vegEe\langle\Phi_{{\bf R}_{\ell}}|\hat{H}_{eh}|\Phi_{{\bf R}_{\ell}}\rangle=E^{\prime}_{g}+\varepsilon_{e}-\varepsilon_{g}+v_{eg}\equiv E^{\prime}_{e} (57)

with vegv_{eg} given by

veg=L3d3rd3r𝐑𝟎(e2|𝐑+𝐫𝐫|e2|𝐑𝐫|)(|𝐫|e|2|𝐫|g|2)|𝐫|g|2\displaystyle v_{eg}=\iint_{L^{3}}d^{3}rd^{3}r^{\prime}\sum_{{\bf R}\not=\bf 0}\left(\frac{e^{2}}{|{\bf R}+{\bf r}-{\bf r}^{\prime}|}-\frac{e^{2}}{|{\bf R}-{\bf r}^{\prime}|}\right)\bigg{(}|\langle{\bf r}^{\prime}|e\rangle|^{2}-|\langle{\bf r}^{\prime}|g\rangle|^{2}\bigg{)}|\langle{\bf r}|g\rangle|^{2} (58)

while the nondiagonal terms simply read

Φ𝐑|H^eh|Φ𝐑=𝒱𝐑𝐑(geeg)\langle\Phi_{{\bf R}_{\ell^{\prime}\not=\ell}}|\hat{H}_{eh}|\Phi_{{\bf R}_{\ell}}\rangle=\mathcal{V}_{{\bf R}_{\ell^{\prime}}-{\bf R}_{\ell}}\!\left(\begin{smallmatrix}g&e\\ e&g\end{smallmatrix}\right) (59)

These nondiagonal terms correspond to Coulomb processes associated with interlevel processes on different lattice sites (see Fig. 3(b)). These processes are the ones responsible for the Frenkel exciton formation.

3.5.5 Diagonalization of the corresponding matrix

\bullet The linear combinations of |Φ𝐑|\Phi_{{\bf R}_{\ell}}\rangle states that render diagonal the H^eh\hat{H}_{eh} operator in the one-electron-hole-pair subspace, read as

|Φ𝐊n=1Ns=1Nsei𝐊n𝐑|Φ𝐑|\Phi_{{\bf K}_{n}}\rangle=\frac{1}{\sqrt{N_{s}}}\sum_{\ell=1}^{N_{s}}e^{i{\bf K}_{n}\cdot{\bf R}_{\ell}}|\Phi_{{\bf R}_{\ell}}\rangle (60)

for the Ns{N_{s}} vectors 𝐊n{\bf K}_{n} of the first Brillouin zone, quantized in 2π/L2\pi/L. These linear combinations give rise to the so-called Frenkel excitons.

\bullet To show it, we start with

Φ𝐊n|H^eh|Φ𝐊n=1Nsei(𝐊n𝐑+𝐊n𝐑)Φ𝐑|H^eh|Φ𝐑\langle\Phi_{{\bf K}_{n^{\prime}}}|\hat{H}_{eh}|\Phi_{{\bf K}_{n}}\rangle=\frac{1}{{N_{s}}}\sum_{\ell^{\prime}}\sum_{\ell}e^{i(-{\bf K}_{n^{\prime}}\cdot{\bf R}_{\ell^{\prime}}+{\bf K}_{n}\cdot{\bf R}_{\ell})}\langle\Phi_{{\bf R}_{\ell^{\prime}}}|\hat{H}_{eh}|\Phi_{{\bf R}_{\ell}}\rangle (61)

For =\ell^{\prime}=\ell, the Φ𝐑|H^eh|Φ𝐑\langle\Phi_{{\bf R}_{\ell^{\prime}}}|\hat{H}_{eh}|\Phi_{{\bf R}_{\ell}}\rangle matrix element is equal to EeE^{\prime}_{e} given in Eq. (57), while for \ell^{\prime}\neq\ell, it is given by Eq. (59). So, the above equation leads to

Φ𝐊n|H^eh|Φ𝐊n\displaystyle\langle\Phi_{{\bf K}_{n^{\prime}}}|\hat{H}_{eh}|\Phi_{{\bf K}_{n}}\rangle =\displaystyle= EeNsei(𝐊n+𝐊n)𝐑\displaystyle\frac{E^{\prime}_{e}}{{N_{s}}}\sum_{\ell}e^{i(-{\bf K}_{n^{\prime}}+{\bf K}_{n})\cdot{\bf R}_{\ell}}
+1Ns𝒱𝐑𝐑(geeg)ei(𝐊n𝐑+𝐊n𝐑)\displaystyle+\frac{1}{{N_{s}}}\sum_{\ell^{\prime}}\sum_{\ell\not=\ell^{\prime}}\mathcal{V}_{{\bf R}_{\ell^{\prime}}-{\bf R}_{\ell}}\!\left(\begin{smallmatrix}g&e\\ e&g\end{smallmatrix}\right)e^{i(-{\bf K}_{n^{\prime}}\cdot{\bf R}_{\ell^{\prime}}+{\bf K}_{n}\cdot{\bf R}_{\ell})}

For 𝐊n{\bf K}_{n} quantized in 2π/L2\pi/L, the first sum is equal to Ns{N_{s}} when n=nn^{\prime}=n and to zero otherwise. To derive the second sum, we write 𝐑{\bf R}_{\ell} as 𝐑+𝐑1{\bf R}_{\ell^{\prime}}+{\bf R}_{\ell_{1}}. This second term then gives

1Ns𝐑1𝟎𝒱𝐑1(geeg)ei𝐊n𝐑1ei(𝐊n𝐊n)𝐑\displaystyle\frac{1}{{N_{s}}}\sum_{{\bf R}_{\ell_{1}}\not=\bf 0}\mathcal{V}_{{\bf R}_{\ell_{1}}}\!\left(\begin{smallmatrix}g&e\\ e&g\end{smallmatrix}\right)e^{i{\bf K}_{n}\cdot{\bf R}_{\ell_{1}}}\sum_{\ell^{\prime}}e^{i({\bf K}_{n}-{\bf K}_{n^{\prime}})\cdot{\bf R}_{\ell^{\prime}}}\hskip 56.9055pt
=δn,n𝐑1𝟎𝒱𝐑1(geeg)ei𝐊n𝐑1δn,nv𝐊n\displaystyle=\delta_{n^{\prime},n}\sum_{{\bf R}_{\ell_{1}}\not=\bf 0}\mathcal{V}_{{\bf R}_{\ell_{1}}}\!\left(\begin{smallmatrix}g&e\\ e&g\end{smallmatrix}\right)e^{i{\bf K}_{n}\cdot{\bf R}_{\ell_{1}}}\equiv\delta_{n^{\prime},n}\,\,v_{{}_{{\bf K}_{n}}} (63)

since the \ell^{\prime} sum is equal to Ns{N_{s}} when n=nn^{\prime}=n and to zero otherwise.

So, we end with

Φ𝐊n|H^eh|Φ𝐊n=δn,n(Ee+v𝐊n)δn,nE𝐊n\langle\Phi_{{\bf K}_{n^{\prime}}}|\hat{H}_{eh}|\Phi_{{\bf K}_{n}}\rangle=\delta_{n^{\prime},n}(E^{\prime}_{e}+v_{{}_{{\bf K}_{n}}})\equiv\delta_{n^{\prime},n}\,\,E_{{\bf K}_{n}} (64)

which proves that the |Φ𝐊n|\Phi_{{\bf K}_{n}}\rangle states render the H^eh\hat{H}_{eh} Hamiltonian diagonal.

3.5.6 Frenkel exciton energy in the small 𝐊n{\bf K}_{n} limit

Each 𝐊n{\bf K}_{n} vector of the first Brillouin zone is associated with a Frenkel exciton |Φ𝐊n|\Phi_{{\bf K}_{n}}\rangle that diagonalizes the H^eh\hat{H}_{eh} Hamiltonian. Equation (3.5.5) gives the 𝐊n{\bf K}_{n} dependence of its energy through

v𝐊n=𝐑𝟎𝒱𝐑(geeg)ei𝐊n𝐑v_{{}_{{\bf K}_{n}}}=\sum_{{\bf R}\not=\bf 0}\mathcal{V}_{{\bf R}}\!\left(\begin{smallmatrix}g&e\\ e&g\end{smallmatrix}\right)e^{i{\bf K}_{n}\cdot{\bf R}} (65)

It comes from interlevel Coulomb transitions between gg and ee, as given in Eq. (47). The v𝐊nv_{\scriptscriptstyle{{\bf K}_{n}}} energy comes from all possible Coulomb processes in which a ground-level electron jumps to the excited level of the same ion, while the reverse occurs on ions 𝐑{\bf R} apart (see Fig. 4)[34, 35].

Refer to caption
Figure 4: Coulomb processes in which one electron is excited from the ground level gg to the excited level ee, while another electron located on another lattice site at a finite distance RR is de-excited from ee to gg. These processes are the ones responsible for the energy splitting of the |Φ𝐑|\Phi_{{\bf R}_{\ell}}\rangle subspace into NsN_{s} Frenkel excitons |Φ𝐊n|\Phi_{{\bf K}_{n}}\rangle (see Eqs. (64) and (65)).

By writing v𝐊nv_{{}_{{\bf K}_{n}}} as

v𝐊n=v𝟎+𝐑𝟎𝒱𝐑(geeg)(ei𝐊n𝐑1)v_{{}_{{\bf K}_{n}}}=v_{{}_{\bf 0}}+\sum_{{\bf R}\not=\bf 0}\mathcal{V}_{{\bf R}}\!\left(\begin{smallmatrix}g&e\\ e&g\end{smallmatrix}\right)\left(e^{i{\bf K}_{n}\cdot{\bf R}}-1\right) (66)

we see that the small 𝐊n{\bf K}_{n} behavior of v𝐊nv_{{}_{{\bf K}_{n}}} is controlled by the large-𝐑{\bf R} behavior of the Coulomb scattering 𝒱𝐑(geeg)\mathcal{V}_{{\bf R}}\!\left(\begin{smallmatrix}g&e\\ e&g\end{smallmatrix}\right), which, according to Eq. (20), is given by

𝒱𝐑(geeg)=L3d3rd3re|𝐫𝐫|ge2|𝐑+𝐫𝐫|g|𝐫𝐫|e\displaystyle\mathcal{V}_{{\bf R}}\left(\begin{smallmatrix}g&e\\ e&g\end{smallmatrix}\right)=\iint_{L^{3}}d^{3}rd^{3}r^{\prime}\,\,\langle e|{\bf r}\rangle\langle{\bf r}|g\rangle\frac{e^{2}}{|{\bf R}+{\bf r}-{\bf r}^{\prime}|}\langle g|{\bf r}^{\prime}\rangle\langle{\bf r}^{\prime}|e\rangle (67)

Since the gg and ee electron wave functions confine the (𝐫,𝐫)({\bf r},{\bf r}^{\prime}) variables at a distance small compared to the lattice cell size, the large-𝐑{\bf R} behavior of 𝒱𝐑(geeg)\mathcal{V}_{{\bf R}}\left(\begin{smallmatrix}g&e\\ e&g\end{smallmatrix}\right) follows from the 1/R1/R expansion of 1/|𝐑+𝐫𝐫|1/|{\bf R}+{\bf r}-{\bf r}^{\prime}|. By using

1R2+𝐫2+2𝐑𝐫=1R(1+2𝐑R𝐫R+𝐫2R2)1/2\displaystyle\frac{1}{\sqrt{R^{2}+{\bf r}^{2}+2{\bf R}\cdot{\bf r}}}=\frac{1}{R}\left(1+\frac{2{\bf R}}{R}\cdot\frac{{\bf r}}{R}+\frac{{\bf r}^{2}}{R^{2}}\right)^{-1/2}\hskip 56.9055pt
1R(112(2𝐑R𝐫R+𝐫2R2)+38(2𝐑R𝐫R)2+)\displaystyle\simeq\frac{1}{R}\left(1-\frac{1}{2}\left(\frac{2{\bf R}}{R}\cdot\frac{{\bf r}}{R}+\frac{{\bf r}^{2}}{R^{2}}\right)+\frac{3}{8}\left(\frac{2{\bf R}}{R}\cdot\frac{{\bf r}}{R}\right)^{2}+\cdots\right) (68)

into Eq. (67), we see that the prefactor of the 1/R1/R term reduces to zero because the ee and gg wave functions, eigenstates of a single ion charge, are orthogonal, e|g=0\langle e|g\rangle=0. The terms in 𝐫{\bf r}, 𝐫2{\bf r}^{2}, 𝐫{\bf r}^{\prime}, 𝐫2{\bf r}^{\prime 2} reduce to zero for the same reason. As a result, the dominant contribution to 𝒱𝐑(geeg)\mathcal{V}_{{\bf R}}\left(\begin{smallmatrix}g&e\\ e&g\end{smallmatrix}\right) in the large RR limit, comes from the parts of the above equation in (𝐫,𝐫{\bf r},{\bf r}^{\prime}); they precisely read[36, 37]

1R3[𝐫𝐫3(𝐑R𝐫)(𝐑R𝐫)]\frac{1}{R^{3}}\left[{\bf r}\cdot{\bf r}^{\prime}-3\left(\frac{{\bf R}}{R}\cdot{\bf r}\right)\,\,\left(\frac{{\bf R}}{R}\cdot{\bf r}^{\prime}\right)\right] (69)

When inserted into Eq. (67), 𝐫{\bf r} appears along with wave function overlaps which physically corresponds to the dipole moment of the excited-electron distribution, namely

𝐝eg=eL3d3r𝐫(e|𝐫𝐫|g){\bf d}_{eg}=e\int_{L^{3}}d^{3}r\,{\bf r}\Big{(}\langle e|{\bf r}\rangle\,\langle{\bf r}|g\rangle\Big{)} (70)

We then note that, in order for the above integral to differ from zero, the ground and excited states must have different parities. So, to go further and derive the Frenkel energy dispersion relation which requires 𝐝eg{\bf d}_{eg} to differs from zero, it is necessary to reconsider the assumption that the electronic states |g,𝐑|g,{\bf R}\rangle and |e,𝐑|e,{\bf R}\rangle have no spatial degeneracy. Rare-gas crystals[38] like solid neon[39, 40] and solid argon[41, 42], which host Frenkel excitons, do have a threefold atomic ground level and a nondegenerate excited level.

To properly control this spatial degeneracy, it is necessary to redo the calculation all over again, with the state degeneracy introduced from the very first line.

Yet, before doing it, we are going to introduce the electron degeneracy associated with spin because the spin degree of freedom erects a similar but simpler difficulty that ensues from the ground-level degeneracy: when the electronic ground level is degenerate, either due to spin or to spatial degeneracy, the charge neutrality of the semiconductor crystal imposes the ion charge to differ from |e||e|. One then has to question using the eigenstates of Eq. (8) that are the ones for a |e||e| ion, as a good one-body basis for the second quantization formulation of the Frenkel exciton problem.

4 Frenkel exciton with spin but no spatial degeneracy

4.1 Quantum formulation

4.1.1 System Hamiltonian

We now consider electrons with spin s=±1/2s=\pm 1/2, in electronic levels with no spatial degeneracy. Each lattice site can be occupied by two opposite-spin electrons (see Fig. 5(a)); so, the ion charge must be equal to 2|e|2|e| in order to ensure the crystal neutrality. The system we then have to consider is made of 2Ns2N_{s} electrons with mass m0m_{0}, charge |e|-|e| and spin s=±1/2s=\pm 1/2, located at rs,j\textbf{r}_{s,j} for j=(1,,Ns)j=(1,\cdots,N_{s}), and NsN_{s} ions with infinite mass, charge 2|e|2|e|, located at the 𝐑{\bf R}_{\ell} nodes of a periodic lattice, for =(1,,Ns\ell=(1,\cdots,N_{s}). In first quantization, the Hamiltonian of this system reads as

H2Ns\displaystyle H_{2N_{s}} =\displaystyle= s=±1/2j=1Ns𝐩s,j22m0+s=±1/2j=1Ns=1Ns2e2|𝐫s,j𝐑|\displaystyle\sum_{s=\pm 1/2}\,\,\sum_{j=1}^{N_{s}}\frac{{\bf p}_{s,j}^{2}}{2m_{0}}+\sum_{s=\pm 1/2}\,\,\sum_{j=1}^{N_{s}}\sum_{\ell=1}^{N_{s}}\frac{-2e^{2}}{|{\bf r}_{s,j}-{\bf R}_{\ell}|} (71)
+12s=±1/2j=1Ns(jjNse2|𝐫s,j𝐫s,j|+j=1Nse2|𝐫s,j𝐫s,j|)\displaystyle+\frac{1}{2}\sum_{s=\pm 1/2}\,\,\sum_{j=1}^{N_{s}}\left(\sum_{j^{\prime}\not=j}^{N_{s}}\frac{e^{2}}{|{\bf r}_{s,j}-{\bf r}_{s,j^{\prime}}|}+\sum_{j^{\prime}=1}^{N_{s}}\frac{e^{2}}{|{\bf r}_{s,j}-{\bf r}_{-s,j^{\prime}}|}\right)
+12=1NsNs(2e)2|𝐑𝐑|\displaystyle+\frac{1}{2}\sum_{\ell=1}^{N_{s}}\sum_{\ell^{\prime}\not=\ell}^{N_{s}}\frac{(2e)^{2}}{|{\bf R}_{\ell}-{\bf R}_{\ell^{\prime}}|}

Here again, this Hamiltonian corresponds to point-charge ions located at each 𝐑{\bf R}_{\ell} lattice site, for simplicity. When spin is included, the atom or molecule located in the 𝐑{\bf R}_{\ell} cell should be visualized as a core plus two electrons. The “core” then includes the nucleus plus the cloud of the remaining electrons, the total core charge being 2|e|2|e| due to neutrality. This would lead us to replace the point-charge potential 2e2/|𝐫s,j𝐑|-2e^{2}/|{\bf r}_{s,j}-{\bf R}_{\ell}| in Eq. (71) by a v2|e|(𝐫s,j𝐑)v_{2|e|}({\bf r}_{s,j}-{\bf R}_{\ell}) potential with the same 2|e|2|e| charge somewhat broadened over the 𝐑{\bf R}_{\ell} cell. However, here again, the precise shape of this potential is not going to affect the Coulomb physics we study for it entirely comes from the electron-electron Coulomb interaction.

Refer to caption
Figure 5: (a) In the presence of spin, each ion site can be occupied by two ground-level electrons, with up and down spins. (b) In the lowest set of excited states, one site has a ground-level electron with spin σ-\sigma, and an excited-level electron with spin σ=σ\sigma^{\prime}=\sigma if we include the fact that for excitation induced by Coulomb interaction or electron-photon interaction, the electron spin is conserved.

4.1.2 Good one-electron basis

To determine the good one-electron basis for second quantization, we first have to perform the physical analysis of the problem. In spite of the fact that the ion charge now is 2|e|2|e| instead of |e||e|, the electronic states for one electron in the presence of a |e||e| charge located at 𝐑{\bf R}_{\ell},

|s,ν,𝐑=a^s,ν,|vν=(g,e)|s,\nu,{\bf R}_{\ell}\rangle=\hat{a}^{\dagger}_{s,\nu,\ell}|v\rangle\qquad\qquad\nu=(g,e) (72)

with energies εν\varepsilon_{\nu}, still form the relevant one-electron basis to describe this 2Ns2{N_{s}}-electron system. The reason is that when one electron jumps from the ground level to the excited level of the same ion, the electron with opposite spin, stays on this ion. So, the excited electron sees the 2|e|2|e| ion shielded by the |e|-|e| cloud of the remaining electron, which looks more as a |e||e| ion than as a 2|e|2|e| ion. By extending this understanding to the ground level, we are led to conclude that the relevant states to describe the initial and final states for the excited electron are the electronic ground state gg of a 2|e|2|e| ion surrounded by a |e|-|e| electron cloud, and the excited state ee of this |e||e| effective ion. A way to represent the system ground state then is

|Φg=(a^+,g,1a^,g,1)(a^+,g,2a^,g,2)(a^+,g,Nsa^,g,Ns)|v|\Phi_{g}\rangle=(\hat{a}^{\dagger}_{+,g,1}\hat{a}^{\dagger}_{-,g,1})(\hat{a}^{\dagger}_{+,g,2}\hat{a}^{\dagger}_{-,g,2})\cdots\cdots(\hat{a}^{\dagger}_{+,g,{N_{s}}}\hat{a}^{\dagger}_{-,g,{N_{s}}})|v\rangle (73)

from which the set of system excited states simply follows from replacing one of the ground levels gg by the excited level ee, the energy difference between the excited and ground states of 2Ns2N_{s} electrons then being (εeεg)(\varepsilon_{e}-\varepsilon_{g}), if we neglect Coulomb processes between the electron and the other ions and between the electrons themselves.

Indeed, the lowest set of excited states we are going to consider has one electron with spin σ\sigma in the excited level, and the second electron of the same ion, with spin σ-\sigma, in the ground level (see Fig. 5(b)). The reason is that the physical interactions responsible for this jump, like Coulomb interaction or electron-photon interaction, conserve the spin; so, the electron keeps its spin when jumping to the excited level. It is possible to formally include a spin-flip along the electron excitation. We would then have to consider all spin-triplet and spin-singlet electron-hole states, not just the two Sz=0S_{z}=0 combinations, that is, four states instead of two, without any more physical insight on the understanding of the energy splitting between triplet and singlet states because both singlet and triplet states exist for Sz=0S_{z}=0. This is why, to render the restriction of the interlevel Coulomb coupling to spin-singlet states more transparent, we have decided to restrict the possible excited states to the physically relevant configuration, σ=σ\sigma^{\prime}=\sigma. The resulting states, shown in Fig. 5(b), then read

|Φσ;𝐑=a^σ,e,a^σ,g,|Φgσ=±12|\Phi_{\sigma;{\bf R}_{\ell}}\rangle=\hat{a}^{\dagger}_{\sigma,e,\ell}\hat{a}_{\sigma,g,\ell}|\Phi_{g}\rangle\quad\quad\quad\quad\sigma=\pm\frac{1}{2} (74)

with energy close to Eg+εeεgE_{g}+\varepsilon_{e}-\varepsilon_{g} if we neglect additional Coulomb interactions. These states form a (2×Ns)(2\times N_{s}) degenerate subspace, with

Φσ;𝐑|Φσ;𝐑=δσ,σδ,\langle\Phi_{\sigma^{\prime};{\bf R}_{\ell^{\prime}}}|\Phi_{\sigma;{\bf R}_{\ell}}\rangle=\delta_{\sigma^{\prime},\sigma}\,\,\delta_{\ell^{\prime},\ell} (75)

due to the orthogonality of the |s,ν,𝐑|s,\nu,{\bf R}_{\ell}\rangle states in the tight-binding limit (6).

The Frenkel excitons follow from the diagonalization of the system Hamiltonian H2NsH_{2N_{s}} in this subspace, namely, the diagonalization of the (2Ns×2Ns{2N_{s}}\times{2N_{s}}) matrix

Φσ;𝐑|H2Ns|Φσ;𝐑\langle\Phi_{\sigma^{\prime};{\bf R}_{\ell^{\prime}}}|H_{2N_{s}}|\Phi_{\sigma;{\bf R}_{\ell}}\rangle (76)

To derive these matrix elements, the first step is to derive the operator H^\hat{H} that represents the H2NsH_{2N_{s}} Hamiltonian in the basis |s,ν,𝐑|s,\nu,{\bf R}_{\ell}\rangle with ν=(g,e)\nu=(g,e) and =(1,,Ns)\ell=(1,\cdots,{N_{s}}), taken in the tight-binding limit (6).

4.1.3 Hamiltonian in terms of (ν,)(\nu,\ell) electron operators

\bullet The one-body part of the H2NsH_{2N_{s}} Hamiltonian,

H0,2Ns=s=±1/2j=1Ns(𝐩s,j22m0+=1Ns2e2|𝐫s,j𝐑|)H_{0,{2N_{s}}}=\sum_{s=\pm 1/2}\,\,\sum_{j=1}^{N_{s}}\left(\frac{{\bf p}^{2}_{s,j}}{2m_{0}}+\sum_{\ell=1}^{N_{s}}\frac{-2e^{2}}{|{\bf r}_{s,j}-{\bf R}_{\ell}|}\right) (77)

appears in terms of electron operators a^s,ν,\hat{a}^{\dagger}_{s,\nu,\ell} as

H^0=s=±1/2ν,ν,hν,;ν,a^s,ν,a^s,ν,\hat{H}_{0}=\sum_{s=\pm 1/2}\,\,\sum_{\nu^{\prime},\ell^{\prime}}\sum_{\nu,\ell}h_{\nu^{\prime},\ell^{\prime};\nu,\ell}\,\,\hat{a}^{\dagger}_{s,\nu^{\prime},\ell^{\prime}}\hat{a}_{s,\nu,\ell} (78)

with H^0\hat{H}_{0} diagonal with respect to spin because H0,2NsH_{0,{2N_{s}}} does not act on spin. The second quantization procedure gives the prefactor as

hν,;ν,=L3d3rν,𝐑|𝐫(𝐩^22m0+′′=1Ns2e2|𝐫𝐑′′|)𝐫|ν,𝐑\displaystyle h_{\nu^{\prime},\ell^{\prime};\nu,\ell}=\int_{L^{3}}d^{3}r\,\langle\nu^{\prime},{\bf R}_{\ell^{\prime}}|{\bf r}\rangle\left(\frac{\hat{{\bf p}}^{2}}{2m_{0}}{+}\sum_{\ell^{\prime\prime}=1}^{N_{s}}\frac{-2e^{2}}{|{\bf r}{-}{\bf R}_{\ell^{\prime\prime}}|}\right)\langle{\bf r}|\nu,{\bf R}_{\ell}\rangle (79)

To calculate it, we first extract the e2/|𝐫𝐑|-e^{2}/|{\bf r}{-}{\bf R}_{\ell}| potential from the ′′\ell^{\prime\prime} sum to make appear the Hamiltonian of one electron in the presence of a |e||e| charge located at 𝐑{\bf R}_{\ell}, that brings the energy εν\varepsilon_{\nu}. The remaining part of the ′′\ell^{\prime\prime} sum corresponds to the electron interaction with all the other 2|e|2|e| ions and the electron interaction with the |e||e| effective ion located at 𝐑{\bf R}_{\ell}. Within the tight-binding limit which enforces =\ell^{\prime}=\ell in Eq. (79), this leads us to split hν,;ν,h_{\nu^{\prime},\ell^{\prime};\nu,\ell} as

hν,;ν,=δ,(ενδν,ν+tν,ν+uν,ν)δ,(ενδν,ν+tν,ν)h_{\nu^{\prime},\ell^{\prime};\nu,\ell}=\delta_{\ell^{\prime},\ell}\Big{(}\varepsilon_{\nu}\delta_{\nu^{\prime},\nu}+t_{\nu^{\prime},\nu}+u_{\nu^{\prime},\nu}\Big{)}\equiv\delta_{\ell^{\prime},\ell}\Big{(}\varepsilon_{\nu}\delta_{\nu^{\prime},\nu}+t^{\prime}_{\nu^{\prime},\nu}\Big{)} (80)

The tν,νt_{\nu^{\prime},\nu} contribution, that comes from the electron interaction with all the other 2|e|2|e| ions, is given, for 𝐫|ν,𝐑\langle{\bf r}|\nu,{\bf R}_{\ell}\rangle in Eq. (4) and 𝐫𝐑=𝐫{\bf r}-{\bf R}_{\ell}={\bf r}_{\ell}, by

tν,ν=tν,ν\displaystyle t_{\nu^{\prime},\nu}=t^{\ast}_{\nu,\nu^{\prime}} =\displaystyle= L3d3rν|𝐫𝐫|ν′′Ns2e2|𝐫+𝐑𝐑′′|\displaystyle\int_{L^{3}}d^{3}r_{\ell}\,\langle\nu^{\prime}|{\bf r}_{\ell}\rangle\langle{\bf r}_{\ell}|\nu\rangle\sum_{\ell^{\prime\prime}\not=\ell}^{N_{s}}\frac{-2e^{2}}{|{\bf r}_{\ell}+{\bf R}_{\ell}-{\bf R}_{\ell^{\prime\prime}}|} (81)
=\displaystyle= L3d3rν|𝐫𝐫|ν𝐑𝟎2e2|𝐫𝐑|\displaystyle\int_{L^{3}}d^{3}r\,\langle\nu^{\prime}|{\bf r}\rangle\langle{\bf r}|\nu\rangle\sum_{{\bf R}\not=\bf 0}\frac{-2e^{2}}{|{\bf r}-{\bf R}|}

the ′′\ell^{\prime\prime} sum being \ell-independent due to the Born-von Karman boundary condition used to extend the lattice periodicity to a size LL crystal, that is, f(𝐫)=f(𝐫+𝐋)f({\bf r})=f({\bf r}+{\bf L}) for a sample volume L3L^{3}.

The electron interaction with the |e||e| effective ion located at 𝐑{\bf R}_{\ell} brings another contribution that reads

uν,ν=L3d3rν,𝐑|𝐫e2|𝐫𝐑|𝐫|ν,𝐑=L3d3rν|𝐫e2|𝐫|𝐫|νu_{\nu^{\prime},\nu}=\int_{L^{3}}d^{3}r\,\langle\nu^{\prime},{\bf R}_{\ell}|{\bf r}\rangle\frac{-e^{2}}{|{\bf r}{-}{\bf R}_{\ell}|}\langle{\bf r}|\nu,{\bf R}_{\ell}\rangle=\int_{L^{3}}d^{3}r\,\langle\nu^{\prime}|{\bf r}\rangle\frac{-e^{2}}{|{\bf r}|}\langle{\bf r}|\nu\rangle (82)

This additional term comes from the fact that the ion charge of this system is 2|e|2|e|, but the electronic states that are used for the second quantization procedure is defined for a |e||e| ion. This uν,νu_{\nu^{\prime},\nu} term brings additional transitions between the electronic levels of the same ion, just like the tν,νt_{\nu^{\prime},\nu} terms.

The above results lead us to split the one-body Hamiltonian H^0\hat{H}_{0} in a way quite similar to its expression (16) in the absence of spin, namely

H^0=s,ν,ενa^s,ν,a^s,ν,+s,ν,ν,tν,νa^s,ν,a^s,ν,\hat{H}_{0}=\sum_{s,\nu,\ell}\varepsilon_{\nu}\hat{a}^{\dagger}_{s,\nu,\ell}\hat{a}_{s,\nu,\ell}+\sum_{s,\nu^{\prime},\nu,\ell}t^{\prime}_{\nu^{\prime},\nu}\hat{a}^{\dagger}_{s,\nu^{\prime},\ell}\hat{a}_{s,\nu,\ell} (83)

\bullet The electron-electron interaction has the same form as in the absence of spin, because this interaction is not affected by the ion-charge change from 2|e|2|e| to |e||e|. It simply follows from Eq. (21) as

V^ee=12{s,ν}1,2𝒱𝐑1𝐑2(ν2ν2ν1ν1)a^s1,ν1,1a^s2,ν2,2a^s2,ν2,2a^s1,ν1,1\displaystyle\hat{V}_{e-e}=\frac{1}{2}\sum_{\{s,\nu\}}\sum_{\ell_{1},\ell_{2}}\mathcal{V}_{{\bf R}_{\ell_{1}}-{\bf R}_{\ell_{2}}}\left(\begin{smallmatrix}\nu^{\prime}_{2}&\hskip 2.84544pt\nu_{2}\\ \hskip 2.84544pt\nu^{\prime}_{1}&\nu_{1}\end{smallmatrix}\right)\hat{a}^{\dagger}_{s_{1},\nu^{\prime}_{1},\ell_{1}}\hat{a}^{\dagger}_{s_{2},\nu^{\prime}_{2},\ell_{2}}\hat{a}_{s_{2},\nu_{2},\ell_{2}}\hat{a}_{s_{1},\nu_{1},\ell_{1}} (84)

with the same scattering as the one given in Eq. (20), since the Coulomb interaction does not act on spin. Note that the Pauli exclusion principle imposes (s,j)(s,j)(s,j)\not=(s^{\prime},j^{\prime}) in the first quantization expression (71) of the electron-electron interaction, while the commutation relations for fermionic operators take care of this exclusion automatically, in the above expression.

4.1.4 Relevant parts of the Hamiltonian

The parts of the Hamiltonian that play a role in the derivation of Frenkel excitons, are the ones that keep constant the number of ground-level electrons and the number of excited-level electrons, as the other states have a higher energy.

\bullet The relevant parts of H^0\hat{H}_{0} given in Eq. (83) reduce to H^0,g+H^0,e\hat{H}_{0,g}+\hat{H}_{0,e}, which formally read as the ones given in Eqs. (32,33), namely

H^0,g\displaystyle\hat{H}_{0,g} =\displaystyle= (εg+tg,g)s,a^s,g,a^s,g,\displaystyle(\varepsilon_{g}+t^{\prime}_{g,g})\sum_{s,\ell}\hat{a}^{\dagger}_{s,g,\ell}\hat{a}_{s,g,\ell} (85)
H^0,e\displaystyle\hat{H}_{0,e} =\displaystyle= (εe+te,e)s,a^s,e,a^s,e,\displaystyle(\varepsilon_{e}+t^{\prime}_{e,e})\sum_{s,\ell}\hat{a}^{\dagger}_{s,e,\ell}\hat{a}_{s,e,\ell} (86)

\bullet Relevant parts of the Hamiltonian also come from the Coulomb interaction (84). The part between ground levels reads

V^gg=121,s12,s2𝒱𝐑1𝐑2(gggg)a^s1,g,1a^s2,g,2a^s2,g,2a^s1,g,1\displaystyle\hat{V}_{gg}=\frac{1}{2}\sum_{\ell_{1},s_{1}}\sum_{\ell_{2},s_{2}}\mathcal{V}_{{\bf R}_{\ell_{1}}-{\bf R}_{\ell_{2}}}\left(\begin{smallmatrix}g&g\\ g&g\end{smallmatrix}\right)\hat{a}^{\dagger}_{s_{1},g,\ell_{1}}\hat{a}^{\dagger}_{s_{2},g,\ell_{2}}\hat{a}_{s_{2},g,\ell_{2}}\hat{a}_{s_{1},g,\ell_{1}} (87)

Note that (s1,1)(s_{1},\ell_{1}) must differ from (s2,2)(s_{2},\ell_{2}) in order for a^s2,g,2a^s1,g,1\hat{a}_{s_{2},g,\ell_{2}}\hat{a}_{s_{1},g,\ell_{1}} to differ from zero, due to the Pauli exclusion principle. This allows intersite processes, 12\ell_{1}\not=\ell_{2}, for arbitrary spin (s1,s2)(s_{1},s_{2}), as in Fig. 6(b), but also on-site processes, 2=1\ell_{2}=\ell_{1}, provided that s1s2s_{1}\not=s_{2}, as in Fig. 6(a).

Refer to caption
Figure 6: The Coulomb interaction among ground-level electrons can take place on the same ion site, provided that the two electrons have different spins as in (a), or on different ion sites, whatever the electron spins, as in (b).

\bullet Relevant parts of the Coulomb interaction also involve the ground and excited levels. The intralevel Coulomb interaction between ground and excited levels given in Eq. (36) reads, when spin is included, as (see Fig. 7(a))

V^eg(intra)=1,s12,s2𝒱𝐑1𝐑2(eegg)a^s1,g,1a^s2,e,2a^s2,e,2a^s1,g,1\displaystyle\hat{V}^{(intra)}_{eg}=\sum_{\ell_{1},s_{1}}\sum_{\ell_{2},s_{2}}\mathcal{V}_{{\bf R}_{\ell_{1}}-{\bf R}_{\ell_{2}}}\left(\begin{smallmatrix}e&e\\ g&g\end{smallmatrix}\right)\hat{a}^{\dagger}_{s_{1},g,\ell_{1}}\hat{a}^{\dagger}_{s_{2},e,\ell_{2}}\hat{a}_{s_{2},e,\ell_{2}}\hat{a}_{s_{1},g,\ell_{1}} (88)

In the same way, the interlevel Coulomb interaction, given in Eq. (37), becomes (see Fig. 7(b))

V^eg(inter)=1,s12,s2𝒱𝐑1𝐑2(geeg)a^s1,e,1a^s2,g,2a^s2,e,2a^s1,g,1\displaystyle\hat{V}^{(inter)}_{eg}=\sum_{\ell_{1},s_{1}}\sum_{\ell_{2},s_{2}}\mathcal{V}_{{\bf R}_{\ell_{1}}-{\bf R}_{\ell_{2}}}\left(\begin{smallmatrix}g&e\\ e&g\end{smallmatrix}\right)\hat{a}^{\dagger}_{s_{1},e,\ell_{1}}\hat{a}^{\dagger}_{s_{2},g,\ell_{2}}\hat{a}_{s_{2},e,\ell_{2}}\hat{a}_{s_{1},g,\ell_{1}} (89)

From the above results, it is possible to derive the ground-state energy Φg|H^|Φg\langle\Phi_{g}|\hat{H}|\Phi_{g}\rangle, and the Hamiltonian matrix Φσ;𝐑|H^|Φσ;𝐑\langle\Phi_{\sigma^{\prime};{\bf R}_{\ell^{\prime}}}|\hat{H}|\Phi_{\sigma;{\bf R}_{\ell}}\rangle in the excited subspace, (see D). However, as mentioned earlier, the appropriate formulation of the exciton problem is not in terms of electron states but in terms of electron excitations, that is, electrons and holes. When dealing with spin, this formulation makes even more sense because the differentiation between spin-singlet and spin-triplet configurations is better seen with electron-hole pairs.

Refer to caption
Figure 7: Coulomb interaction between electrons in different levels, as given in Eqs. (88,89). The electrons can stay in their level as in (a) or change level as in (b). In the latter case, they still keep their spin because the Coulomb interaction does not act on spin.

4.1.5 Formulation in terms of electrons and holes

\bullet The sign difference between spin-singlet an spin-triplet configurations follows from the phase factor that appears when turning from electron-absence to hole operator[30, 31], namely

a^s,g,=(1)1/2sb^s,\hat{a}_{s,g,\ell}=(-1)^{1/2-s}\hat{b}^{\dagger}_{-s,\ell} (90)

while the electron operator simply follows from the excited state operator as

a^s,e,=a^s,\hat{a}^{\dagger}_{s,e,\ell}=\hat{a}^{\dagger}_{s,\ell} (91)

\bullet The above relations can be readily used to rewrite the various parts of the Hamiltonian given above. The one-body parts (85,86) appear in terms of electrons and holes as

H^0,g\displaystyle\hat{H}_{0,g} =\displaystyle= 2N(εg+tg,g)(εg+tg,g)s,b^s,b^s,\displaystyle 2N(\varepsilon_{g}+t^{\prime}_{g,g})-\left(\varepsilon_{g}+t^{\prime}_{g,g}\right)\sum_{s,\ell}\hat{b}^{\dagger}_{s,\ell}\hat{b}_{s,\ell} (92)
H^0,e\displaystyle\hat{H}_{0,e} =\displaystyle= (εe+te,e)s,a^s,a^s,\displaystyle\left(\varepsilon_{e}+t^{\prime}_{e,e}\right)\sum_{s,\ell}\hat{a}^{\dagger}_{s,\ell}\hat{a}_{s,\ell} (93)

Turning to the Coulomb parts, the V^gg\hat{V}_{gg} interaction given in Eq. (87) becomes

V^gg=Φg|V^gg|Φg(𝒱𝐑=𝟎(gggg)+2𝐑𝟎𝒱𝐑(gggg))s,b^s,b^s,+\hat{V}_{gg}=\langle\Phi_{g}|\hat{V}_{gg}|\Phi_{g}\rangle-\bigg{(}\mathcal{V}_{{\bf R}=\bf 0}\left(\begin{smallmatrix}g&g\\ g&g\end{smallmatrix}\right)+2\sum_{{\bf R}\not=\bf 0}\mathcal{V}_{{\bf R}}\left(\begin{smallmatrix}g&g\\ g&g\end{smallmatrix}\right)\bigg{)}\sum_{s,\ell}\hat{b}^{\dagger}_{s,\ell}\hat{b}_{s,\ell}+\cdots (94)

within an additional term that acts on two holes and thus plays no role in the excited subspace that contains one hole only. So, we forget it.

With regard to the Coulomb interaction between ground and excited levels, the intralevel part (88) becomes

V^eg(intra)\displaystyle\hat{V}^{(intra)}_{eg} =\displaystyle= s1,1s2,2𝒱𝐑1𝐑2(eegg)b^s1,1a^s2,2a^s2,2b^s1,g,1\displaystyle\sum_{s_{1},\ell_{1}}\sum_{s_{2},\ell_{2}}\mathcal{V}_{{\bf R}_{\ell_{1}}-{\bf R}_{\ell_{2}}}\left(\begin{smallmatrix}e&e\\ g&g\end{smallmatrix}\right)\hat{b}_{-s_{1},\ell_{1}}\hat{a}^{\dagger}_{s_{2},\ell_{2}}\hat{a}_{s_{2},\ell_{2}}\hat{b}^{\dagger}_{-s_{1},g,\ell_{1}} (95)
=\displaystyle= 2𝐑𝒱𝐑(eegg)s,a^s,a^s,+V^eh(intra)\displaystyle 2\sum_{{\bf R}}\mathcal{V}_{{\bf R}}\left(\begin{smallmatrix}e&e\\ g&g\end{smallmatrix}\right)\sum_{s,\ell}\hat{a}^{\dagger}_{s,\ell}\hat{a}_{s,\ell}+\hat{V}^{(intra)}_{eh}

while the interlevel part (89) appears as

V^eg(inter)\displaystyle\hat{V}^{(inter)}_{eg} =\displaystyle= s1,1s2,2(1)1s1s2𝒱𝐑1𝐑2(geeg)a^s1,1b^s2,2a^s2,2b^s1,1\displaystyle\sum_{s_{1},\ell_{1}}\sum_{s_{2},\ell_{2}}(-1)^{1-s_{1}-s_{2}}\,\mathcal{V}_{{\bf R}_{\ell_{1}}-{\bf R}_{\ell_{2}}}\left(\begin{smallmatrix}g&e\\ e&g\end{smallmatrix}\right)\,\hat{a}^{\dagger}_{s_{1},\ell_{1}}\hat{b}_{-s_{2},\ell_{2}}\hat{a}_{s_{2},\ell_{2}}\hat{b}^{\dagger}_{-s_{1},\ell_{1}} (96)
=\displaystyle= 𝒱𝐑=𝟎(geeg)s,a^s,a^s,+V^eh(inter)\displaystyle-\mathcal{V}_{{\bf R}=\bf 0}\left(\begin{smallmatrix}g&e\\ e&g\end{smallmatrix}\right)\sum_{s,\ell}\hat{a}^{\dagger}_{s,\ell}\hat{a}_{s,\ell}+\hat{V}^{(inter)}_{eh}

The Coulomb interaction between electrons and holes that corresponds to the Coulomb processes shown in Fig. 8(a), with each carrier staying in its level, is given by

V^eh(intra)=s1,1s2,2𝒱𝐑1𝐑2(eegg)a^s2,2b^s1,1b^s1,1a^s2,2\displaystyle\hat{V}^{(intra)}_{eh}=-\sum_{s_{1},\ell_{1}}\sum_{s_{2},\ell_{2}}\mathcal{V}_{{\bf R}_{\ell_{1}}-{\bf R}_{\ell_{2}}}\left(\begin{smallmatrix}e&e\\ g&g\end{smallmatrix}\right)\,\hat{a}^{\dagger}_{s_{2},\ell_{2}}\hat{b}^{\dagger}_{-s_{1},\ell_{1}}\hat{b}_{-s_{1},\ell_{1}}\hat{a}_{s_{2},\ell_{2}} (97)
Refer to caption
Figure 8: Same as Fig. 7, but in terms of electron and hole: (a) intralevel processes given in Eq. (97); (b) interlevel processes given in Eq. (98). The fact that the interlevel Coulomb processes take place between electron-hole pairs in spin-singlet states and not in (Sz=0S_{z}=0) triplet states, follows from the phase factor between electron-absence and hole operators (see Eq. (90)).

The Coulomb interaction in which each electron changes electronic level leads, in terms of electrons and holes, to an interaction in which one electron-hole pair recombines while another pair is excited, within a phase factor that comes from Eq. (90). It reads (see Fig. 8(b))

V^eh(inter)=s1,1s2,2𝒱𝐑1𝐑2(geeg)(1)1s1s2a^s1,1b^s1,1b^s2,2a^s2,2\displaystyle\hat{V}^{(inter)}_{eh}=\sum_{s_{1},\ell_{1}}\sum_{s_{2},\ell_{2}}\mathcal{V}_{{\bf R}_{\ell_{1}}-{\bf R}_{\ell_{2}}}\left(\begin{smallmatrix}g&e\\ e&g\end{smallmatrix}\right)(-1)^{1-s_{1}-s_{2}}\,\hat{a}^{\dagger}_{s_{1},\ell_{1}}\hat{b}^{\dagger}_{-s_{1},\ell_{1}}\hat{b}_{-s_{2},\ell_{2}}\hat{a}_{s_{2},\ell_{2}} (98)

or, by writing the spin part explicitly,

V^eh(inter)\displaystyle\hat{V}^{(inter)}_{eh} =\displaystyle= 1,2𝒱𝐑1𝐑2(geeg)\displaystyle\sum_{\ell_{1},\ell_{2}}\mathcal{V}_{{\bf R}_{\ell_{1}}-{\bf R}_{\ell_{2}}}\left(\begin{smallmatrix}g&e\\ e&g\end{smallmatrix}\right)
(a^12,1b^12,1a^12,1b^12,1)(b^12,2a^12,2b^12,2a^12,2)\displaystyle\Big{(}\hat{a}^{\dagger}_{\frac{1}{2},\ell_{1}}\hat{b}^{\dagger}_{-\frac{1}{2},\ell_{1}}-\hat{a}^{\dagger}_{-\frac{1}{2},\ell_{1}}\hat{b}^{\dagger}_{\frac{1}{2},\ell_{1}}\Big{)}\Big{(}\hat{b}_{-\frac{1}{2},\ell_{2}}\hat{a}_{\frac{1}{2},\ell_{2}}-\hat{b}_{\frac{1}{2},\ell_{2}}\hat{a}_{-\frac{1}{2},\ell_{2}}\Big{)}

This evidences that the pairs involved in the interlevel Coulomb processes are in a spin-singlet state (S=0,Sz=0)(S=0,S_{z}=0) with creation operator

B^0,=a^12,b^12,a^12,b^12,2\hat{B}^{\dagger}_{0,\ell}=\frac{\hat{a}^{\dagger}_{\frac{1}{2},\ell}\hat{b}^{\dagger}_{-\frac{1}{2},\ell}-\hat{a}^{\dagger}_{-\frac{1}{2},\ell}\hat{b}^{\dagger}_{\frac{1}{2},\ell}}{\sqrt{2}} (100)

in contrast to pairs that also have a total spin equal to zero, but that are in a spin-triplet state (S=1,Sz=0)(S=1,S_{z}=0), their creation operator reading as

B^1,=a^12,b^12,+a^12,b^12,2\hat{B}^{\dagger}_{1,\ell}=\frac{\hat{a}^{\dagger}_{\frac{1}{2},\ell}\hat{b}^{\dagger}_{-\frac{1}{2},\ell}+\hat{a}^{\dagger}_{-\frac{1}{2},\ell}\hat{b}^{\dagger}_{\frac{1}{2},\ell}}{\sqrt{2}} (101)

\bullet As a result, the relevant parts of the H^\hat{H} Hamiltonian in the |Φσ;𝐑|\Phi_{\sigma;{\bf R}_{\ell}}\rangle subspace reduce, in terms of electron and hole operators, to

H^eh=Φg|H^|Φg+H^e+H^h+V^eh(intra)+V^eh(inter)\displaystyle\hat{H}_{eh}=\langle\Phi_{g}|\hat{H}|\Phi_{g}\rangle+\hat{H}_{e}+\hat{H}_{h}+\hat{V}^{(intra)}_{eh}+\hat{V}^{(inter)}_{eh} (102)

The two-body Coulomb parts are given in Eqs. (97) and (4.1.5), while the one-body parts now read

H^e\displaystyle\hat{H}_{e} =\displaystyle= ε~es,a^s,a^s,\displaystyle\tilde{\varepsilon}_{e}\sum_{s,\ell}\hat{a}^{\dagger}_{s,\ell}\hat{a}_{s,\ell} (103)
H^h\displaystyle\hat{H}_{h} =\displaystyle= ε~hs,b^s,b^s,\displaystyle\tilde{\varepsilon}_{h}\sum_{s,\ell}\hat{b}^{\dagger}_{s,\ell}\hat{b}_{s,\ell} (104)

with the electron and hole energies given by

ε~e\displaystyle\tilde{\varepsilon}_{e} =\displaystyle= εe+te,e+2𝐑𝒱𝐑(eegg)𝒱𝐑=𝟎(geeg)\displaystyle\varepsilon_{e}+t^{\prime}_{e,e}+2\sum_{{\bf R}}\mathcal{V}_{{\bf R}}\left(\begin{smallmatrix}e&e\\ g&g\end{smallmatrix}\right)-\mathcal{V}_{{\bf R}=\bf 0}\left(\begin{smallmatrix}g&e\\ e&g\end{smallmatrix}\right) (105)
ε~h\displaystyle\tilde{\varepsilon}_{h} =\displaystyle= εgtg,g2𝐑𝟎𝒱𝐑(gggg)𝒱𝐑=𝟎(gggg)\displaystyle-\varepsilon_{g}-t^{\prime}_{g,g}-2\sum_{{\bf R}\not=\bf 0}\mathcal{V}_{{\bf R}}\left(\begin{smallmatrix}g&g\\ g&g\end{smallmatrix}\right)-\mathcal{V}_{{\bf R}=\bf 0}\left(\begin{smallmatrix}g&g\\ g&g\end{smallmatrix}\right) (106)

4.2 Spin-singlet and spin-triplet subspaces |ΦS;𝐑|\Phi_{S;{\bf R}_{\ell}}\rangle

\bullet The lowest set of excited states |Φσ;𝐑|\Phi_{\sigma;{\bf R}_{\ell}}\rangle, given in Eq. (74), appears in terms of electron and hole operators as

|Φσ;𝐑=a^σ,e,a^σ,g,|Φg=(1)1/2σa^σ,b^σ,|0|\Phi_{\sigma;{\bf R}_{\ell}}\rangle=\hat{a}^{\dagger}_{\sigma,e,\ell}\hat{a}_{\sigma,g,\ell}|\Phi_{g}\rangle=(-1)^{1/2-\sigma}\,\hat{a}^{\dagger}_{\sigma,\ell}\hat{b}^{\dagger}_{-\sigma,\ell}|0\rangle (107)

They are made of electron-hole pairs located on the ion site R\textbf{R}_{\ell} with total spin Sz=0S_{z}=0. Yet, in view of Eq. (4.1.5), it is clear that the physically relevant electron-hole pair states are not these |Φσ;𝐑|\Phi_{\sigma;{\bf R}_{\ell}}\rangle states with σ=±1/2\sigma=\pm 1/2, but their symmetric and antisymmetric combinations, namely the spin-triplet and spin-singlet states, which read in terms of the operators defined in Eqs. (100) and (101), as

|ΦS;𝐑=B^S,|0forS=(1,0)|\Phi_{S;{\bf R}_{\ell}}\rangle=\hat{B}^{\dagger}_{S,\ell}|0\rangle\quad\quad\quad\textrm{for}\quad\quad\quad S=(1,0) (108)

These states also form an orthogonal set, as possible to check from Eq. (75),

ΦS;𝐑|ΦS;𝐑=δS,Sδ,\langle\Phi_{S^{\prime};{\bf R}_{\ell^{\prime}}}|\Phi_{S;{\bf R}_{\ell}}\rangle=\delta_{S^{\prime},S}\,\,\delta_{\ell^{\prime},\ell} (109)

\bullet When acting on these singlet and triplet states, the one-body parts of the system Hamiltonian give

(H^e+H^h)|ΦS;𝐑=(ε~e+ε~h)|ΦS;𝐑\Big{(}\hat{H}_{e}+\hat{H}_{h}\Big{)}|\Phi_{S;{\bf R}_{\ell}}\rangle=(\tilde{\varepsilon}_{e}+\tilde{\varepsilon}_{h})|\Phi_{S;{\bf R}_{\ell}}\rangle (110)

while the intralevel part of the Coulomb interaction leads to

V^eh(intra)|ΦS;𝐑=𝒱𝐑=𝟎(eegg)|ΦS;𝐑\displaystyle\hat{V}^{(intra)}_{eh}|\Phi_{S;{\bf R}_{\ell}}\rangle=-\mathcal{V}_{{\bf R}={\bf 0}}\left(\begin{smallmatrix}e&e\\ g&g\end{smallmatrix}\right)|\Phi_{S;{\bf R}_{\ell}}\rangle (111)

If we now consider the interlevel part of the Coulomb interaction given in Eq. (4.1.5) which only acts on spin-singlet states, we readily find

V^eh(inter)|ΦS;𝐑=2δS,01𝒱𝐑𝐑1(geeg)|ΦS;𝐑1\displaystyle\hat{V}^{(inter)}_{eh}|\Phi_{S;{\bf R}_{\ell}}\rangle=2\delta_{S,0}\sum_{\ell_{1}}\mathcal{V}_{{\bf R}_{\ell}-{\bf R}_{\ell_{1}}}\left(\begin{smallmatrix}g&e\\ e&g\end{smallmatrix}\right)|\Phi_{S;{\bf R}_{\ell_{1}}}\rangle (112)

By combining the above equations, we end with

H^eh|ΦS;𝐑=ES(pair)|ΦS;𝐑+2δS,01𝒱𝐑𝐑1(geeg)|ΦS;𝐑1\displaystyle\hat{H}_{eh}|\Phi_{S;{\bf R}_{\ell}}\rangle=E^{(pair)}_{S}|\Phi_{S;{\bf R}_{\ell}}\rangle+2\delta_{S,0}\sum_{\ell_{1}\neq\ell}\mathcal{V}_{{\bf R}_{\ell}-{\bf R}_{\ell_{1}}}\left(\begin{smallmatrix}g&e\\ e&g\end{smallmatrix}\right)|\Phi_{S;{\bf R}_{\ell_{1}}}\rangle (113)

with the pair energy given by

ES(pair)\displaystyle E^{(pair)}_{S}\!\!\! =\displaystyle= Φg|H^|Φg+(εeεg)+(te,etg,g)+2𝐑𝟎(𝒱𝐑(eegg)𝒱𝐑(gggg))\displaystyle\!\!\!\langle\Phi_{g}|\hat{H}|\Phi_{g}\rangle+(\varepsilon_{e}-\varepsilon_{g})+(t^{\prime}_{e,e}-t^{\prime}_{g,g})+2\sum_{{\bf R}\not=\bf 0}\Big{(}\mathcal{V}_{{\bf R}}\left(\begin{smallmatrix}e&e\\ g&g\end{smallmatrix}\right)-\mathcal{V}_{{\bf R}}\left(\begin{smallmatrix}g&g\\ g&g\end{smallmatrix}\right)\Big{)} (114)
+(𝒱𝐑=𝟎(eegg)𝒱𝐑=𝟎(gggg))+(1)S𝒱𝐑=𝟎(geeg)\displaystyle\!\!\!+\Big{(}\mathcal{V}_{{\bf R}=\bf 0}\left(\begin{smallmatrix}e&e\\ g&g\end{smallmatrix}\right)-\mathcal{V}_{{\bf R}=\bf 0}\left(\begin{smallmatrix}g&g\\ g&g\end{smallmatrix}\right)\Big{)}+(-1)^{S}\,\mathcal{V}_{{\bf R}=\bf 0}\left(\begin{smallmatrix}g&e\\ e&g\end{smallmatrix}\right)

This energy depends on the spin state, triplet or singlet, through its last term: the on-site (1=)(\ell_{1}=\ell) interlevel Coulomb interaction in Eq. (112) differentiates the spin-triplet state (S=1,Sz=0)(S=1,S_{z}=0) from the spin-singlet state (S=0,Sz=0)(S=0,S_{z}=0).

4.3 Frenkel excitons with spin

\bullet To go further, we first note that

B^S,B^S,|0=δ,δS,S|0\displaystyle\hat{B}_{S^{\prime},\ell^{\prime}}\hat{B}^{\dagger}_{S,\ell}|0\rangle=\delta_{\ell^{\prime},\ell}\delta_{S^{\prime},S}|0\rangle (115)

It then becomes trivial to show that the (2Ns×2Ns2N_{s}\times 2N_{s}) matrix for H^eh\hat{H}_{eh} in the |ΦS;𝐑|\Phi_{S;{\bf R}_{\ell}}\rangle excited subspace, splits into a (Ns×NsN_{s}\times N_{s}) diagonal submatrix in the spin-triplet subspace (S=1,Sz=0)(S=1,S_{z}=0), with diagonal energy ES=1(pair)E^{(pair)}_{S=1}, and a non-diagonal (Ns×NsN_{s}\times N_{s}) submatrix in the spin-singlet subspace (S=0,Sz=0)(S=0,S_{z}=0), that remains to be diagonalized,

(ES=1(pair)000000ES=1(pair)0000ES=0(pair)𝒱,(e,g)00(𝒱,(e,g))ES=0(pair))\left(\begin{matrix}E^{(pair)}_{S=1}&\cdots&0&0&\cdots&0\\ \vdots&\ddots&0&\vdots&\ddots&\vdots\\ 0&0&E^{(pair)}_{S=1}&0&\cdots&0\\ 0&\cdots&0&E^{(pair)}_{S=0}&\cdots&\mathcal{V}^{(e,g)}_{\ell^{\prime},\ell}\\ \vdots&\ddots&\vdots&\vdots&\ddots&\vdots\\ 0&\cdots&0&\left(\mathcal{V}^{(e,g)}_{\ell^{\prime},\ell}\right)^{\ast}&\cdots&E^{(pair)}_{S=0}\end{matrix}\right) (116)

with 𝒱,(e,g)=2𝒱𝐑𝐑(geeg)\mathcal{V}^{(e,g)}_{\ell^{\prime},\ell}=2\mathcal{V}_{{\bf R}_{\ell^{\prime}}-{\bf R}_{\ell}}\left(\begin{smallmatrix}g&e\\ e&g\end{smallmatrix}\right) for 𝒱𝐑(geeg)\mathcal{V}_{{\bf R}}\left(\begin{smallmatrix}g&e\\ e&g\end{smallmatrix}\right) given in Eq. (67).

\bullet To diagonalize this (Ns×NsN_{s}\times N_{s}) submatrix, we use the same phase factor as the one given in Eq. (1), to transform one-site excitations into wave excitations, namely

|Φ0;𝐊n=1Ns=1Nsei𝐊n𝐑|Φ0;𝐑|\Phi_{0;{\bf K}_{n}}\rangle=\frac{1}{\sqrt{{N_{s}}}}\sum_{\ell=1}^{N_{s}}e^{i{\bf K}_{n}\cdot{\bf R}_{\ell}}|\Phi_{0;{\bf R}_{\ell}}\rangle (117)

Indeed, for the same reason as the one leading to Eq. (64), we find that the H^eh\hat{H}_{eh} Hamiltonian is diagonal in the |Φ0;𝐊n|\Phi_{0;{\bf K}_{n}}\rangle subspace

Φ0;𝐊n|H^eh|Φ0;𝐊n=δn,n(ES=0(pair)+v𝐊n)\langle\Phi_{0;{\bf K}_{n^{\prime}}}|\hat{H}_{eh}|\Phi_{0;{\bf K}_{n}}\rangle=\delta_{n^{\prime},n}\left(E^{(pair)}_{S=0}+v_{{}_{{\bf K}_{n}}}\right) (118)

with v𝐊nv_{{}_{{\bf K}_{n}}} now given by

v𝐊n=2𝐑𝟎𝒱𝐑(geeg)ei𝐊n𝐑v_{{}_{{\bf K}_{n}}}=2\sum_{{\bf R}\not=\bf 0}\mathcal{V}_{{\bf R}}\!\left(\begin{smallmatrix}g&e\\ e&g\end{smallmatrix}\right)e^{i{\bf K}_{n}\cdot{\bf R}} (119)

the extra factor 22, compared to Eq. (65), coming from spin. The degeneracy of the spin-singlet subspace is lifted by these linear combinations, thanks to the interlevel Coulomb processes between different ion sites, 𝒱𝐑𝟎(geeg)\mathcal{V}_{{\bf R}\not=\bf 0}\!\left(\begin{smallmatrix}g&e\\ e&g\end{smallmatrix}\right). By contrast, the spin-triplet subspace, already diagonal, remains degenerate.

The major result of this section is the fact that spin-triplet pairs (S=1,Sz=0)(S=1,S_{z}=0) do not form excitonic waves. The Frenkel excitons that result from interlevel Coulomb processes through the delocalization of on-site electronic excitations, are only made of spin-singlet pairs. Since these spin-singlet pairs are coupled to photons, the Frenkel excitons that are formed are bright. We wish to stress that this strong conclusion is derived within the strict tight-binding limit that we have used to possibly formulate the problem in second quantization: once accepted, the delocalization of on-site excitations to waves, can only come from interlevel Coulomb processes, i.e., processes which are fundamentally similar to the interlevel processes that occur in photon absorption or emission: an electron-hole pair recombination on one site and a pair excitation on another site.

5 Frenkel exciton with spatial degeneracy but no spin

We now introduce the spatial degeneracy of the electronic states associated with one ion and we forget the spin, i.e., we perform the same calculations as the ones in Sec. 3 but we take into account this degeneracy from the very first line.

5.1 Threefold excited level

5.1.1 Lowest excited subspace |Φμ;𝐑|\Phi_{\mu;{\bf R}_{\ell}}\rangle

We start with the simplest case: NsN_{s} electrons with charge |e|-|e| and NsN_{s} ions with charge |e||e| located at the 𝐑{\bf R}_{\ell} lattice sites. The system Hamiltonian is the same as the one given in Eq. (2), so is the good one-electron basis for second quantization: this basis is made of the eigenstates of one electron |e|-|e| in the presence of one ion |e||e| located at R\textbf{R}_{\ell}. We consider that the ground level, with energy εg\varepsilon_{g}, is nondegenerate, but that the lowest excited level, with energy εe\varepsilon_{e}, has a threefold degeneracy. We will label these three excited states as μ=(x,y,z)\mu=(x,y,z) along axes that can be chosen at will: indeed, the basis we here consider follows from the eigenstates of a single electron in the presence of a single ion; the other ions play no role, along with the axes of the crystal to which they belong. We will come back to the importance of properly choosing these (x,y,z)(x,y,z) axes.

The relevant electronic states |ν,𝐑|\nu,{\bf R}_{\ell}\rangle now read

|g,𝐑&=a^g,|v|μ,e,𝐑=a^μ,e,|v|g,{\bf R}_{\ell}\rangle&=\hat{a}^{\dagger}_{g,\ell}|v\rangle\\ |\mu,e,{\bf R}_{\ell}\rangle=\hat{a}^{\dagger}_{\mu,e,\ell}|v\rangle (120)

The system ground state is still given by

|Φg=a^g,1a^g,2a^g,N|v|\Phi_{g}\rangle=\hat{a}^{\dagger}_{g,1}\hat{a}^{\dagger}_{g,2}\cdots\hat{a}^{\dagger}_{g,N}|v\rangle (121)

with energy Eg=NεgE_{g}=N\varepsilon_{g} in the absence of additional Coulomb processes. The lowest excited subspace is made of states in which the ground-state electron of a 𝐑{\bf R}_{\ell} ion is replaced by one of the three excited states of the same ion

|Φμ;𝐑=a^μ,e,a^g,|Φg|\Phi_{\mu;{\bf R}_{\ell}}\rangle=\hat{a}^{\dagger}_{\mu,e,\ell}\hat{a}_{g,\ell}|\Phi_{g}\rangle (122)

This subspace now has a 3Ns3N_{s} degeneracy, its energy being Ee=Eg+εeεgE_{e}=E_{g}+\varepsilon_{e}-\varepsilon_{g} in the absence of Coulomb processes other than the ones of the electron with “its” ion.

5.1.2 Hamiltonian in terms of electron operators

\bullet The one-body part of the system Hamiltonian HNsH_{N_{s}} given in Eq. (2) has a ground-level term still given by Eq. (32) and an excited-level term slightly different from the one given in Eq. (33), due to the spatial degeneracy of the excited level

H^0,e=(μ,μ)=(x,y,z)=1Ns(εeδμ,μ+tμ,μ)a^μ,e,a^μ,e,\hat{H}_{0,e}=\sum_{(\mu^{\prime},\mu)=(x,y,z)}\,\,\sum_{\ell=1}^{N_{s}}\left(\varepsilon_{e}\delta_{\mu^{\prime},\mu}+t_{\mu^{\prime},\mu}\right)\hat{a}^{\dagger}_{\mu^{\prime},e,\ell}\hat{a}_{\mu,e,\ell} (123)

the Coulomb term being now given by

tμ,μ=L3d3rμ,e|𝐫𝐫|μ,e𝐑𝟎e2|𝐫𝐑|t_{\mu^{\prime},\mu}=\int_{L^{3}}d^{3}r\langle\mu^{\prime},e|{\bf r}\rangle\langle{\bf r}|\mu,e\rangle\sum_{{\bf R}\not={\bf 0}}\frac{-e^{2}}{|{\bf r}-{\bf R}|} (124)

instead of Eq. (15). This term comes from interactions between one excited electron and all the other ions. Through it, a change of excited-level index from μ\mu to μμ\mu^{\prime}\not=\mu can take place.

\bullet The relevant parts of the Coulomb interaction V^ee\hat{V}_{e-e} given in Eq. (17) also reduce to V^gg+V^eg(intra)+V^eg(inter)\hat{V}_{gg}+\hat{V}^{(intra)}_{eg}+\hat{V}^{(inter)}_{eg}. The V^gg\hat{V}_{gg} interaction still reads as Eq. (35), while the other two interactions now contain additional excited-level indices μ\mu. The intralevel part, given in Eq. (36), now appears as

V^eg(intra)=μ,μ1,2𝒱𝐑1𝐑2(μ,eμ,egg)a^g,1a^μ,e,2a^μ,e,2a^g,1\displaystyle\hat{V}^{(intra)}_{eg}=\sum_{\mu^{\prime},\mu}\,\,\sum_{\ell_{1},\ell_{2}}\mathcal{V}_{{\bf R}_{\ell_{1}}-{\bf R}_{\ell_{2}}}\left(\begin{smallmatrix}\mu^{\prime},e&\,\,\mu,e\\ g&\,\,g\end{smallmatrix}\right)\hat{a}^{\dagger}_{g,\ell_{1}}\hat{a}^{\dagger}_{\mu^{\prime},e,\ell_{2}}\hat{a}_{\mu,e,\ell_{2}}\hat{a}_{g,\ell_{1}} (125)

and the interlevel part, given in Eq. (37), now appears as

V^eg(inter)=μ,μ1,2𝒱𝐑1𝐑2(gμ,eμ,eg)a^μ,e,1a^g,2a^μ,e,2a^g,1\displaystyle\hat{V}^{(inter)}_{eg}=\sum_{\mu^{\prime},\mu}\,\,\sum_{\ell_{1},\ell_{2}}\mathcal{V}_{{\bf R}_{\ell_{1}}-{\bf R}_{\ell_{2}}}\left(\begin{smallmatrix}g&\,\mu,e\\ \mu^{\prime},e&\,g\end{smallmatrix}\right)\hat{a}^{\dagger}_{\mu^{\prime},e,\ell_{1}}\hat{a}^{\dagger}_{g,\ell_{2}}\hat{a}_{\mu,e,\ell_{2}}\hat{a}_{g,\ell_{1}} (126)

5.1.3 Hamiltonian in terms of electrons and holes

\bullet Next, we turn from electron absence to hole and from excited electron to “electron” in the same way as in Eq. (38,39), namely

a^g,=b^a^μ,e,=a^μ,\hat{a}_{g,\ell}=\hat{b}^{\dagger}_{\ell}\qquad\qquad\qquad\hat{a}^{\dagger}_{\mu,e,\ell}=\hat{a}^{\dagger}_{\mu,\ell} (127)

This allows us to rewrite the lowest set of excited states, defined in Eq. (122), as

|Φμ;𝐑=a^μ,b^|0|\Phi_{\mu;{\bf R}_{\ell}}\rangle=\hat{a}^{\dagger}_{\mu,\ell}\hat{b}^{\dagger}_{\ell}|0\rangle (128)

\bullet The parts of the Hamiltonian that are relevant in this subspace reduce to

H^eh=Eg+H^e+H^h+V^eh(intra)+V^eh(inter)\hat{H}_{eh}=E^{\prime}_{g}+\hat{H}_{e}+\hat{H}_{h}+\hat{V}^{(intra)}_{eh}+\hat{V}^{(inter)}_{eh} (129)

The constant term EgE^{\prime}_{g} is equal to Φg|H^|Φg\langle\Phi_{g}|\hat{H}|\Phi_{g}\rangle, while the H^h\hat{H}_{h} part is just the one given in Eqs. (52,53). The other terms can be readily obtained from their expression in the absence of excited-level degeneracy, by simply adding some μ\mu indices, namely

H^e=μ,με~e(μ,μ)a^μ,a^μ,\hat{H}_{e}=\sum_{\mu^{\prime},\mu}\,\,\sum_{\ell}\tilde{\varepsilon}_{e}(\mu^{\prime},\mu)\,\,\hat{a}^{\dagger}_{\mu^{\prime},\ell}\hat{a}_{\mu,\ell} (130)

for the one-body part, with

ε~e(μ,μ)=εeδμ,μ+tμ,μ+𝐑𝒱𝐑(μ,eμ,egg)𝒱𝐑=𝟎(gμ,eμ,eg)\tilde{\varepsilon}_{e}(\mu^{\prime},\mu)=\varepsilon_{e}\delta_{\mu^{\prime},\mu}+t_{\mu^{\prime},\mu}+\sum_{\bf R}\mathcal{V}_{\bf R}\left(\begin{smallmatrix}\mu^{\prime},e&\,\mu,e\\ g&\,g\end{smallmatrix}\right)-\mathcal{V}_{{\bf R}=\bf 0}\left(\begin{smallmatrix}g&\mu,e\\ \mu^{\prime},e&g\end{smallmatrix}\right) (131)

The third term in the above equation comes from the intralevel interaction between the excited-level electron and all the ground-level electrons, while the last term comes from the interlevel interaction on a single ion site.

Refer to caption
Figure 9: Same as Fig. 3, when the excited level has a spatial degeneracy (see Eqs. (132,133)). (a) The electron can change its spatial index μ\mu. (b) The excited pair can have a spatial index μ\mu^{\prime} different from the μ\mu pair that recombines.

Finally, the intralevel part of the electron-hole interaction, that now reads

V^eh(intra)=μ,μ1,2𝒱𝐑1𝐑2(μ,eμ,egg)a^μ,2b^1b^1a^μ,2\displaystyle\hat{V}^{(intra)}_{eh}=-\sum_{\mu^{\prime},\mu}\,\,\sum_{\ell_{1},\ell_{2}}\mathcal{V}_{{\bf R}_{\ell_{1}}-{\bf R}_{\ell_{2}}}\left(\begin{smallmatrix}\mu^{\prime},e&\mu,e\\ g&g\end{smallmatrix}\right)\hat{a}^{\dagger}_{\mu^{\prime},\ell_{2}}\hat{b}^{\dagger}_{\ell_{1}}\hat{b}_{\ell_{1}}\hat{a}_{\mu,\ell_{2}} (132)

allows transition between different excited levels of the same lattice site (see Fig. 9(a)), while the interlevel part

V^eh(inter)=μ,μ1,2𝒱𝐑1𝐑2(gμ,eμ,eg)a^μ,1b^1b^2a^μ,2\displaystyle\hat{V}^{(inter)}_{eh}=\sum_{\mu^{\prime},\mu}\,\,\sum_{\ell_{1},\ell_{2}}\mathcal{V}_{{\bf R}_{\ell_{1}}-{\bf R}_{\ell_{2}}}\left(\begin{smallmatrix}g&\mu,e\\ \mu^{\prime},e&g\end{smallmatrix}\right)\hat{a}^{\dagger}_{\mu^{\prime},\ell_{1}}\hat{b}^{\dagger}_{\ell_{1}}\hat{b}_{\ell_{2}}\hat{a}_{\mu,\ell_{2}} (133)

allows for excitations of electron-hole pairs in a different μ\mu^{\prime} state of a different lattice site (see Fig. 9(b)).

5.1.4 Electron-hole Hamiltonian in the |Φμ;𝐑|\Phi_{\mu;{\bf R}_{\ell}}\rangle excited subspace

\bullet We start with the |Φμ;𝐑=a^μ,b^|0|\Phi_{\mu;{\bf R}_{\ell}}\rangle=\hat{a}^{\dagger}_{\mu,\ell}\hat{b}^{\dagger}_{\ell}|0\rangle state of the 3Ns3N_{s}-degenerate excited subspace and we calculate the relevant parts of the system Hamiltonian acting on this state. From the one-body parts

H^h|Φμ;𝐑&=ε~h|Φμ;𝐑H^e|Φμ;𝐑=με~e(μ,μ)|Φμ;𝐑\hat{H}_{h}|\Phi_{\mu;{\bf R}_{\ell}}\rangle&=\tilde{\varepsilon}_{h}|\Phi_{\mu;{\bf R}_{\ell}}\rangle\\ \hat{H}_{e}|\Phi_{\mu;{\bf R}_{\ell}}\rangle=\sum_{\mu^{\prime}}\tilde{\varepsilon}_{e}(\mu^{\prime},\mu)|\Phi_{\mu^{\prime};{\bf R}_{\ell}}\rangle (134)

we already see that the existence of a degenerate excited level keeps the H^e\hat{H}_{e} operator diagonal with respect to lattice sites, but not diagonal with respect to excited-level indices μ\mu.

In the same way, instead of Eq. (55), the intralevel Coulomb interaction now gives

V^eh(intra)|Φμ;𝐑=μ𝒱𝐑=𝟎(μ,eμ,egg)|Φμ;𝐑\hat{V}^{(intra)}_{eh}|\Phi_{\mu;{\bf R}_{\ell}}\rangle=-\sum_{\mu^{\prime}}\mathcal{V}_{{\bf R}=\bf 0}\left(\begin{smallmatrix}\mu^{\prime},e&\mu,e\\ g&g\end{smallmatrix}\right)|\Phi_{\mu^{\prime};{\bf R}_{\ell}}\rangle (135)

while instead of Eq. (56), the interlevel Coulomb interaction now gives (see Fig. 10)

V^eh(inter)|Φμ;𝐑=μ1𝒱𝐑1𝐑(gμ,eμ,eg)|Φμ;𝐑1\displaystyle\hat{V}^{(inter)}_{eh}|\Phi_{\mu;{\bf R}_{\ell}}\rangle=\sum_{\mu^{\prime}}\sum_{\ell_{1}}\mathcal{V}_{{\bf R}_{\ell_{1}}-{\bf R}_{\ell}}\left(\begin{smallmatrix}g&\mu,e\\ \mu^{\prime},e&g\end{smallmatrix}\right)|\Phi_{\mu^{\prime};{\bf R}_{\ell_{1}}}\rangle (136)
Refer to caption
Figure 10: Effect of V^eh(inter)\hat{V}^{(inter)}_{eh} on the NsN_{s}-electron excited state |Φμ;𝐑|\Phi_{\mu;{\bf R}_{\ell}}\rangle, as given in Eq. (136). Under Coulomb interaction, the excited level can change its spatial index from μ\mu to μ\mu^{\prime}, with a possible ion site change from \ell to 1\ell_{1}\not=\ell.

The resulting (3Ns×3Ns3N_{s}\times 3N_{s}) matrix Φμ;𝐑|H^eh|Φμ;𝐑\langle\Phi_{\mu^{\prime};{\bf R}_{\ell^{\prime}}}|\hat{H}_{eh}|\Phi_{\mu;{\bf R}_{\ell}}\rangle is made of a set of identical 3×33\times 3 submatrices hh on the main diagonal, that reads

h(=)=(Eg+εeεg)I(3)+(vx,xvx,yvx,zvy,xvy,yvy,zvz,xvz,yvz,z)h^{(\ell^{\prime}=\ell)}=\big{(}E^{\prime}_{g}+\varepsilon_{e}-\varepsilon_{g}\big{)}\rm{I}^{(3)}+\left(\begin{matrix}v_{x,x}&v_{x,y}&v_{x,z}\\ v_{y,x}&v_{y,y}&v_{y,z}\\ v_{z,x}&v_{z,y}&v_{z,z}\end{matrix}\right) (137)

where I(3)\rm{I}^{(3)} is the 3×33\times 3 identity matrix, while vμ,μv_{\mu^{\prime},\mu} is given by

vμ,μ\displaystyle v_{\mu^{\prime},\mu} =\displaystyle= L3d3rd3r|𝐫|g|2(μ,e|𝐫𝐫|μ,eδμ,μ|𝐫|g|2)\displaystyle\iint_{L^{3}}d^{3}rd^{3}r^{\prime}\,|\langle{\bf r}|g\rangle|^{2}\Big{(}\langle\mu^{\prime},e|{\bf r}^{\prime}\rangle\langle{\bf r}^{\prime}|\mu,e\rangle-\delta_{\mu^{\prime},\mu}|\langle{\bf r}^{\prime}|g\rangle|^{2}\Big{)} (138)
×𝐑𝟎[e2|𝐑+𝐫𝐫|e2|𝐑𝐫|]\displaystyle\times\sum_{{\bf R}\not=\bf 0}\left[\frac{e^{2}}{|{\bf R}+{\bf r}-{\bf r}^{\prime}|}-\frac{e^{2}}{|{\bf R}-{\bf r}^{\prime}|}\right]

The (3Ns×3Ns3N_{s}\times 3N_{s}) matrix Φμ;𝐑|H^eh|Φμ;𝐑\langle\Phi_{\mu^{\prime};{\bf R}_{\ell^{\prime}}}|\hat{H}_{eh}|\Phi_{\mu;{\bf R}_{\ell}}\rangle also has nondiagonal ()(\ell^{\prime}\not=\ell) terms that come from interlevel excitations that read

hμ,μ()=𝒱𝐑𝐑(gμ,eμ,eg)h_{\mu^{\prime},\mu}^{(\ell^{\prime}\not=\ell)}=\mathcal{V}_{{\bf R}_{\ell^{\prime}}-{\bf R}_{\ell}}\left(\begin{smallmatrix}g&\,\mu,e\\ \mu^{\prime},e&\,g\end{smallmatrix}\right) (139)

and that allow coupling between lattice sites.

5.1.5 Diagonalization of the corresponding matrix

The diagonalization of this matrix is performed in two steps:

\bullet In a first step, we diagonalize the (3Ns×3Ns3N_{s}\times 3N_{s}) matrix with respect to the lattice sites. This amounts to delocalizing the excitation on the 𝐑{\bf R}_{\ell} site into a 𝐊n{\bf K}_{n} wave, as done through the same phase factor as the one given in Eq. (1). By introducing the linear combination

|Φμ;𝐊n=1Ns=1Nsei𝐊n𝐑|Φμ;𝐑|\Phi_{\mu;{\bf K}_{n}}\rangle=\frac{1}{\sqrt{N_{s}}}\sum_{\ell=1}^{N_{s}}e^{i{\bf K}_{n}\cdot{\bf R}_{\ell}}|\Phi_{\mu;{\bf R}_{\ell}}\rangle (140)

we get

Φμ;𝐊n|H^eh|Φμ;𝐊n=δn,n(hμ,μ+v𝐊n(μ,μ))\langle\Phi_{\mu^{\prime};{\bf K}_{n^{\prime}}}|\hat{H}_{eh}|\Phi_{\mu;{\bf K}_{n}}\rangle=\delta_{n^{\prime},n}\Big{(}h_{\mu^{\prime},\mu}+v_{{}_{{\bf K}_{n}}}(\mu^{\prime},\mu)\Big{)} (141)

the 𝐊n{\bf K}_{n} dependence of this matrix element coming from the interlevel Coulomb interaction between lattice sites

v𝐊n(μ,μ)=𝐑𝟎ei𝐊n𝐑𝒱𝐑(gμ,eμ,eg)v_{{}_{{\bf K}_{n}}}(\mu^{\prime},\mu)=\sum_{{\bf R}\not=\bf 0}e^{i{\bf K}_{n}\cdot{\bf R}}\,\mathcal{V}_{{\bf R}}\left(\begin{smallmatrix}g&\mu,e\\ \mu^{\prime},e&g\end{smallmatrix}\right) (142)

This renders the (3Ns×3Ns3N_{s}\times 3N_{s}) matrix block-diagonal with respect to the NsN_{s} wave vectors 𝐊n{\bf K}_{n}. The matrix elements of the remaining (3×33\times 3) submatrices, associated with a given 𝐊n{\bf K}_{n}, are equal to hμ,μ+v𝐊n(μ,μ)h_{\mu^{\prime},\mu}+v_{{}_{{\bf K}_{n}}}(\mu^{\prime},\mu).

\bullet In a second step, we diagonalize these (3×33\times 3) submatrices. A close look at v𝐊n(μ,μ)v_{{}_{{\bf K}_{n}}}(\mu^{\prime},\mu) defined above, which also reads, with the help of Eq. (67), as

v𝐊n(μ,μ)=L3d3rd3r(g|𝐫𝐫|μ,e)(μ,e|𝐫𝐫|g)𝐑𝟎ei𝐊n𝐑e2|𝐑+𝐫𝐫|v_{{}_{{\bf K}_{n}}}(\mu^{\prime},\mu)=\iint_{L^{3}}d^{3}r^{\prime}d^{3}r\,\Big{(}\langle g|{\bf r}^{\prime}\rangle\langle{\bf r}^{\prime}|\mu,e\rangle\Big{)}\Big{(}\langle\mu^{\prime},e|{\bf r}\rangle\langle{\bf r}|g\rangle\Big{)}\sum_{{\bf R}\not=\bf 0}e^{i{\bf K}_{n}\cdot{\bf R}}\frac{e^{2}}{|{\bf R}+{\bf r}-{\bf r}^{\prime}|} (143)

shows that, in the small 𝐊n{\bf K}_{n} limit, this coupling is controlled by large 𝐑{\bf R}’s, for the same reason as the one given in Eq. (66). By noting that the electronic wave functions keep (𝐫,𝐫)({\bf r},{\bf r}^{\prime}) small compared to 𝐑𝟎{\bf R}\not=\bf 0, we are led to perform a large-𝐑{\bf R} expansion of 1/|𝐑+𝐫𝐫|1/|{\bf R}+{\bf r}-{\bf r}^{\prime}|. Since the ground and excited levels now have a different parity, the dominant nonzero terms in the sum are just the ones in Eq. (69). When inserted into Eq. (143), this gives

v𝐊n(μ,μ)=|dge|2S𝐊n(μ,μ)v_{{}_{{\bf K}_{n}}}(\mu^{\prime},\mu)=|d_{ge}|^{2}S_{{\bf K}_{n}}(\mu^{\prime},\mu) (144)

the excited-electron dipole moment

dge=eL3d3rg|𝐫𝐫|μ,erμd_{ge}=e\int_{L^{3}}d^{3}r\,\langle g|{\bf r}\rangle\langle{\bf r}|\mu,e\rangle\,r_{\mu} (145)

being μ\mu independent due to cyclic invariance. The S𝐊(μ,μ)S_{{\bf K}}(\mu^{\prime},\mu) sum, defined as

S𝐊(μ,μ)=𝐑𝟎ei𝐊𝐑R3(δμ,μ3RμRμR2)S_{{\bf K}}(\mu^{\prime},\mu)=\sum_{{\bf R}\not=\bf 0}\frac{e^{i{\bf K}\cdot{\bf R}}}{R^{3}}\left(\delta_{\mu^{\prime},\mu}-3\frac{R_{\mu^{\prime}}R_{\mu}}{R^{2}}\right) (146)

is singular in the small 𝐊{\bf K} limit: as explicitly shown in E, it reads in this limit as[36, 37]

lim𝐊𝟎S𝐊(μ,μ)=4π3ac3(δμ,μ3KμKμK2)\lim_{{\bf K}\rightarrow\bf 0}S_{\bf K}(\mu^{\prime},\mu)=-\frac{4\pi}{3a_{c}^{3}}\left(\delta_{\mu^{\prime},\mu}-3\frac{K_{\mu^{\prime}}K_{\mu}}{K^{2}}\right) (147)

where aca_{c} is the distance between two adjacent lattice sites.

By taking the zz axis of the arbitrary (x,y,z)(x,y,z) set along 𝐊{\bf K}, the above equation gives lim𝐊𝟎S𝐊(μ,μ)=0\lim_{{\bf K}\rightarrow\bf 0}S_{\bf K}(\mu^{\prime},\mu)=0 for μμ\mu^{\prime}\not=\mu, this limit being equal to 4π/3ac3-4\pi/3a_{c}^{3} for μ=μ=(x,y)\mu^{\prime}=\mu=(x,y), and to 8π/3ac38\pi/3a_{c}^{3} for μ=μ=z\mu^{\prime}=\mu=z (see Fig. 11). This induces to the Frenkel exciton energy a singular splitting in the small wave-vector limit that depends on the 𝐊n{\bf K}_{n} direction, the barycenter of this splitting reducing to zero, as usual.

Refer to caption
Figure 11: Frenkel exciton energy in the small 𝐊{\bf K} limit, as given in Eq. (148). The splitting results from the singular behavior of S𝐊(μ,μ)S_{\bf K}(\mu^{\prime},\mu) in the small 𝐊{\bf K} limit (see Eqs. (146,147)).

As a last step, we note (see F) that the Coulomb terms in the 𝐑{\bf R} sum of vμ,μv_{\mu^{\prime},\mu} defined in Eq. (138), scales as 𝒪(R4)\mathcal{O}(R^{-4}) in the large 𝐑{\bf R} limit, while the ones in the v𝐊(μ,μ)v_{{}_{\bf K}}(\mu^{\prime},\mu) sum scale as 𝒪(R3)\mathcal{O}(R^{-3}). Consequently, we can neglect the vμ,μv_{\mu^{\prime},\mu} nondiagonal part of the (3×33\times 3) submatrix hh in the large sample limit. As a result, by choosing zz along 𝐊n{\bf K}_{n}, we render diagonal the (3×33\times 3) submatrix hh given in Eq. (137), in the small Frenkel exciton wave vector limit: this submatrix then appears as[43, 44]

(Eg+εeεg)I(3)+4π3ac3|dge|2(100010002)\big{(}E^{\prime}_{g}+\varepsilon_{e}-\varepsilon_{g}\big{)}{\rm{I}}^{(3)}+\frac{4\pi}{3a_{c}^{3}}|d_{ge}|^{2}\left(\begin{matrix}-1&0&0\\ 0&-1&0\\ 0&0&2\end{matrix}\right) (148)

We wish to stress that, by choosing the zz axis for the degenerate electronic basis along 𝐊n{\bf K}_{n}, we render diagonal the hh submatrix for exciton with wave vector 𝐊n{\bf K}_{n}. However, since each 3×33\times 3 submatrix is associated with a different 𝐊n{\bf K}_{n}, this does not simultaneously diagonalize all v𝐊v_{{}_{\bf K}} submatrices. Consequently, it is not possible to diagonalize the whole (3Ns×3Ns)3N_{s}\times 3N_{s}) matrix Φμ;𝐑|H^|Φμ;𝐑\langle\Phi_{\mu^{\prime};{\bf R}_{\ell^{\prime}}}|\hat{H}|\Phi_{\mu;{\bf R}_{\ell}}\rangle by properly choosing the (x,y,z)(x,y,z) axes once for all.

Refer to caption
Figure 12: (a) In the presence of a threefold spatial degeneracy in the electronic ground level, the system ground state has three ground-level electrons with spatial symmetry μ=(x,y,z)\mu=(x,y,z) and a 3|e|3|e| ion. (b) The lowest set of excited states has a 3|e|3|e| ion and two electrons in the ground level, let’s say (x,y)(x,y), the third electron being in the nondegenerate excited level. (c) This fundamentally corresponds to the 3|e|3|e| ion with its three ground-level electrons (x,y,z)(x,y,z), plus one excited electron and one ground-level hole with spatial symmetry μ=z\mu=z.

5.2 Threefold electronic ground level

The situation seems at first more complicated when the threefold degenerate level is not the excited level but the ground level: due to charge neutrality, the ion charge then has to be 3|e|3|e| (see Fig. 12(a)) because the ground level has its three (x,y,z)(x,y,z) states occupied by an electron. In the lowest set of excited states, one of the three ground electrons of a particular lattice site 𝐑{\bf R}_{\ell} is replaced by an excited electron, while the other two electrons stay in the ground level (see Fig. 12(b)). As a result, the excited electron does not feel the bare 3|e|3|e| ion, but a 3|e|3|e| ion surrounded by a cloud made of the remaining two electrons; so, the charge felt by the excited electron looks very much like a |e||e| ion. This difficulty is fundamentally the same as the one we faced in Sec. 4 for electrons with up and down spins: the ion charge was 2|e|2|e| instead of |e||e|, but the relevant electronic basis still is the one for one electron in the presence of a |e||e| ion.

To handle this problem, we first introduce the appropriate basis for one electron and one |e||e| ion located at R\textbf{R}_{\ell}; the electronic eigenstates we are going to use are threefold for the ground level and nondegenerate for the lowest excited level. In terms of the corresponding electron operators a^μ,g,\hat{a}^{\dagger}_{\mu,g,\ell} with μ=(x,y,z)\mu=(x,y,z) for the ground level and a^e,\hat{a}^{\dagger}_{e,\ell} for the excited level, the system ground state reads

|Φg=(a^x,g,1a^y,g,1a^z,g,1)(a^x,g,2a^y,g,2a^z,g,2)(a^x,g,Nsa^y,g,Nsa^z,g,Ns)|v\displaystyle|\Phi_{g}\rangle=\Big{(}\hat{a}^{\dagger}_{x,g,1}\hat{a}^{\dagger}_{y,g,1}\hat{a}^{\dagger}_{z,g,1}\Big{)}\Big{(}\hat{a}^{\dagger}_{x,g,2}\hat{a}^{\dagger}_{y,g,2}\hat{a}^{\dagger}_{z,g,2}\Big{)}\cdots\cdots\Big{(}\hat{a}^{\dagger}_{x,g,N_{s}}\hat{a}^{\dagger}_{y,g,N_{s}}\hat{a}^{\dagger}_{z,g,N_{s}}\Big{)}|v\rangle (149)

with energy Eg=3NsεgE_{g}=3N_{s}\varepsilon_{g}, and the lowest set of excited states corresponds to (see Fig. 12(c))

|Φμ;𝐑=a^e,a^μ,g,|Φg|\Phi_{\mu;{\bf R}_{\ell}}\rangle=\hat{a}^{\dagger}_{e,\ell}\hat{a}_{\mu,g,\ell}|\Phi_{g}\rangle (150)

with energy Ee=Eg+εeεgE_{e}=E_{g}+\varepsilon_{e}-\varepsilon_{g} in the absence of additional Coulomb processes.

The change from ground and excited electron operators to electron and hole operators follows from

a^μ,g,=b^μ,a^e,=a^\hat{a}_{\mu,g,\ell}=\hat{b}^{\dagger}_{\mu,\ell}\qquad\qquad\hat{a}^{\dagger}_{e,\ell}=\hat{a}^{\dagger}_{\ell} (151)

without phase factor, due to cyclic invariance. So, the lowest set of excited states reads in terms of electron and hole as

|Φμ;𝐑=a^b^μ,|0|\Phi_{\mu;{\bf R}_{\ell}}\rangle=\hat{a}^{\dagger}_{\ell}\hat{b}^{\dagger}_{\mu,\ell}|0\rangle (152)

This state has the same spatial symmetry μ\mu as the one for a nondegenerate ground level and an electron in the μ\mu state of a threefold excited level. So, we end with the same Frenkel exciton matrix as for nondegenerate ground level and threefold excited level.

A first diagonalization is performed with respect to the lattice sites through the ei𝐊n𝐑e^{i{\bf K}_{n}\cdot{\bf R}_{\ell}} phase factor. The diagonalization of the resulting 3×33\times 3 submatrix associated with a given 𝐊n{\bf K}_{n} is then performed by choosing the zz axis of the electronic states along 𝐊n{\bf K}_{n}; this renders the 3×33\times 3 submatrix diagonal in the small 𝐊n{\bf K}_{n} limit.

6 Frenkel excitons with spin and spatial degeneracies

6.1 The whole problem

\bullet The results obtained in the previous sections provide the keys for analytically solving the whole Frenkel exciton problem, that is, a system made of 6Ns6N_{s} electrons with spin 1/21/2 or 1/2-1/2, and NsN_{s} ions located on lattice sites, the electronic states for a single ion being nondegenerate for the lowest excited level and threefold degenerate for the ground level with states labeled as μ=(x,y,z)\mu=(x,y,z) along axes that can be chosen at will, since these states correspond to a single ion, the other ions of the lattice playing no role in the basis. (The simpler case, with a nondegenerate ground level and a threefold excited level, is easy to solve along the same line[5].) The lowest set of system excited states is made of states in which all the electronic ground states of the NsN_{s} lattice sites are occupied, except one site for which one of its three ground-level electrons is replaced by an excited-level electron. The corresponding excited subspace then has a (2×3×Ns2\times 3\times N_{s})-fold degeneracy, 2 for the spin of the excited electron, 3 for its spatial degeneracy in the ground level, and NsN_{s} for the lattice site on which the electron is excited. As a result, the mathematical problem we face is the diagonalization of a (6Ns×6Ns)6N_{s}\times 6N_{s}) matrix.

To overcome the formidable task of analytically diagonalizing such a large matrix, we use the physics we have previously learned, namely

(i) we introduce a one-electron basis made of the eigenstates of one electron in the presence of a single |e||e| charge located at a 𝐑{\bf R}_{\ell} lattice site, and we only consider the two lowest electronic levels on all 𝐑{\bf R}_{\ell} sites, for which the strict tight-binding limit is valid, in order to possibly perform a second quantization formulation of the problem;

(ii) next, we turn from ground-level and excited-level electron operators to electron and hole operators;

(iii) we further introduce electron-hole pairs in spin-triplet and spin-singlet states. This readily splits the (6Ns×6Ns)6N_{s}\times 6N_{s}) matrix into a diagonal (3Ns×3Ns3N_{s}\times 3N_{s}) submatrix in the spin-triplet subspace and a nondiagonal (3Ns×3Ns)3N_{s}\times 3N_{s}) submatrix in the spin-singlet subspace;

(iv) we diagonalize this nondiagonal matrix with respect to the NsN_{s} lattice sites, by constructing exciton waves that are linear combination of spin-singlet pair excitations on 𝐑{\bf R}_{\ell} sites, with a prefactor equal to ei𝐊n𝐑e^{i{\bf K}_{n}\cdot{\bf R}_{\ell}}. We are left with NsN_{s} submatrices (3×33\times 3) in the μ=(x,y,z)\mu=(x,y,z) subspace, each submatrix being characterized by a distinct wave vector 𝐊n{\bf K}_{n};

(v) this (3×33\times 3) submatrix can be made diagonal in the small 𝐊n{\bf K}_{n} limit, by choosing the electronic-state axis zz along 𝐊n{\bf K}_{n}; its eigenstates, quantized along zz and (x,y)(x,y), then have a positive and a negative energy shift, the splitting barycenter being equal to zero.

This set of smart transformations relies on a deep understanding of how Frenkel exciton waves come to be formed out of on-site electronic excitations, and the fact that the interlevel excitations responsible for the excitonic waves occur for spin-singlet electron-hole pairs only. The last step, associated with the singularity of the interlevel scattering in the small center-of-mass wave vector limit, requires a careful mathematical study of this scattering.

\bullet The fact that the system ground state has all the threefold ground states of each lattice site occupied by an up-spin and a down-spin electron, imposes

(i) the ion to have a 6|e|6|e| charge in order for the system to be neutral, and

(ii) the electron number to be equal to 3Ns3N_{s} for each up and down spin for a system having NsN_{s} ions.

So, the system Hamiltonian considered in Eq. (2) now reads, for these 6Ns6N_{s} electrons, as

H6Ns\displaystyle H_{6N_{s}} =\displaystyle= s=±1/2j=13Ns𝐩s,j22m0+s=±1/2j=13Ns=1Ns6e2|𝐫s,j𝐑|\displaystyle\sum_{s=\pm 1/2}\sum_{j=1}^{{3N_{s}}}\frac{{\bf p}_{s,j}^{2}}{2m_{0}}+\sum_{s=\pm 1/2}\sum_{j=1}^{{3N_{s}}}\sum_{\ell=1}^{N_{s}}\frac{-6e^{2}}{|{\bf r}_{s,j}-{\bf R}_{\ell}|} (153)
+12s=±1/2j=13Ns(jj3Nse2|𝐫s,j𝐫s,j|+j=13Nse2|𝐫s,j𝐫s,j|)\displaystyle+\frac{1}{2}\sum_{s=\pm 1/2}\sum_{j=1}^{{3N_{s}}}\left(\sum_{j^{\prime}\not=j}^{{3N_{s}}}\frac{e^{2}}{|{\bf r}_{s,j}-{\bf r}_{s,j^{\prime}}|}+\sum_{j^{\prime}=1}^{3N_{s}}\frac{e^{2}}{|{\bf r}_{s,j}-{\bf r}_{-s,j^{\prime}}|}\right)
+12=1NsNs(6e)2|𝐑𝐑|\displaystyle+\frac{1}{2}\sum_{\ell=1}^{N_{s}}\sum_{\ell^{\prime}\not=\ell}^{N_{s}}\frac{(6e)^{2}}{|{\bf R}_{\ell}-{\bf R}_{\ell^{\prime}}|}

6.2 Good one-electron basis

\bullet As already mentioned in Secs. 4 and 5 for degenerate ground level, the physically relevant electronic states to describe these 6Ns6{N_{s}} electrons are not the ones of an electron in the presence of a 6|e|6|e| ion, as the first two terms of Eq. (153) could naïvely suggest, but rather the ones of an electron in the presence of a 6|e|6|e| ion surrounded by a cloud made of the five negatively-charged electrons that remain in the ground level. As this ensemble is closer to a |e||e| charge than to a 6|e|6|e| charge, the one-body Hamiltonian that provides the physically relevant one-electron states for second quantization, cannot be isolated from the H6NsH_{6{N_{s}}} Hamiltonian due to the two-body repulsive interaction between one electron and the five-electrons cloud on the same lattice site. One of the beauties of the second quantization formalism is that the one-body operators can a priori be defined from any basis, not necessarily a basis constructed from a part of the system Hamiltonian. Choosing a “good” basis requires to first perform a physical analysis of the problem, in order for this basis to include as much physics as possible.

\bullet This good basis is made of the eigenstates of one electron in the presence of a single |e||e| ion located at 𝐑{\bf R}_{\ell}. The associated Hamiltonian reads

h𝐑=𝐩22m0e2|𝐫𝐑|h_{\scriptscriptstyle{{\bf R}_{\ell}}}=\frac{{\bf p}^{2}}{2m_{0}}-\frac{e^{2}}{|{\bf r}-{{\bf R}_{\ell}}|} (154)

The h𝐑h_{\scriptscriptstyle{{\bf R}_{\ell}}} eigenstate wave function 𝐫|ν,𝐑=𝐫𝐑|ν\langle{\bf r}|\nu,{\bf R}_{\ell}\rangle=\langle{\bf r}-{\bf R}_{\ell}|\nu\rangle only depends on the distance of the electron to the 𝐑{\bf R}_{\ell} site.

It turns out that the electronic states physically relevant for the Frenkel exciton problem are highly localized on lattice sites at the lattice size scale. For this reason, the good one-body basis to describe an electron located on any lattice site, can hardly be the |ν,𝐑|\nu,{\bf R}_{\ell}\rangle states for a specific 𝐑{\bf R}_{\ell} and all ν\nu’s, although these states do form a complete set. The good basis should include |ν,𝐑|\nu,{\bf R}_{\ell}\rangle states for all 𝐑{\bf R}_{\ell}’s. This can be done by restricting ν\nu to its two lowest states, that are the relevant states to describe 6Ns6{N_{s}} electrons in the case of Frenkel excitons, and by enforcing the tight-binding limit (6) on these lowest states, namely

ν|𝐫𝐑𝐫𝐑|ν=0for\langle\nu^{\prime}|{\bf r}-{\bf R}_{\ell^{\prime}}\rangle\langle{\bf r}-{\bf R}_{\ell}|\nu\rangle=0\quad\textrm{for}\quad\ell^{\prime}\not=\ell (155)

\bullet The |ν,𝐑|\nu,{\bf R}_{\ell}\rangle states for the two lowest ν\nu’s, which are the relevant states of the problem, are such that

ν,𝐑|ν,𝐑=δν,νδ,\langle\nu^{\prime},{\bf R}_{\ell^{\prime}}|\nu,{\bf R}_{\ell}\rangle=\delta_{\nu^{\prime},\nu}\delta_{\ell^{\prime},\ell} (156)

They can thus be used to define fermionic creation operators for spin s=±1/2s=\pm 1/2 electron, through

|s,ν,𝐑=a^s,ν,|v|s,\nu,{\bf R}_{\ell}\rangle=\hat{a}^{\dagger}_{s,\nu,\ell}|v\rangle (157)

with ν=e\nu=e for the nondegenerate excited level, and ν=(μ,g)\nu=(\mu,g) for the threefold ground level, the index μ\mu being taken along (x,y,z)(x,y,z) axes that can be chosen at will. These axes do not have to coincide with the (X,Y,Z)(X,Y,Z) cubic axes of the crystal because h𝐑h_{\scriptscriptstyle{{\bf R}_{\ell}}} corresponds to the Hamiltonian of a single ion, the position of the other ions in the lattice playing no role in this basis.

Refer to caption
Figure 13: (a) System ground state in which all up-spin and down-spin electron states, with spatial symmetry μ=(x,y,z)\mu=(x,y,z) of each 𝐑{\bf R}_{\ell} site, are occupied (see Eq. (6.2)). (b) In the lowest set of excited states, one ground-level electron jumps to the excited level with its σ\sigma spin, the opposite-spin (σ-\sigma) electron staying in the ground level (see Eq. (160)).

\bullet The (6Ns)(6N_{s})-electron ground state (see Fig. 13(a)) reads within this electronic basis as

|Φg\displaystyle|\Phi_{g}\rangle\!\!\! =\displaystyle= (a^+,x,g,1a^,x,g,1a^+,y,g,1a^,y,g,1a^+,z,g,1a^,z,g,1)\displaystyle\!\!\!\Big{(}\hat{a}^{\dagger}_{+,x,g,1}\hat{a}^{\dagger}_{-,x,g,1}\hat{a}^{\dagger}_{+,y,g,1}\hat{a}^{\dagger}_{-,y,g,1}\hat{a}^{\dagger}_{+,z,g,1}\hat{a}^{\dagger}_{-,z,g,1}\Big{)}\cdots
(a^+,x,g,Nsa^,x,g,Nsa^+,y,g,Nsa^,y,g,Nsa^+,z,g,Nsa^,z,g,Ns)|v\displaystyle\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\Big{(}\hat{a}^{\dagger}_{+,x,g,{N_{s}}}\hat{a}^{\dagger}_{-,x,g,{N_{s}}}\hat{a}^{\dagger}_{+,y,g,{N_{s}}}\hat{a}^{\dagger}_{-,y,g,{N_{s}}}\hat{a}^{\dagger}_{+,z,g,{N_{s}}}\hat{a}^{\dagger}_{-,z,g,{N_{s}}}\Big{)}|v\rangle

The energy of the H6NsH_{6{N_{s}}} Hamiltonian (153) for this state follows from

Eg=Φg|H6Ns|Φg\displaystyle E^{\prime}_{g}=\langle\Phi_{g}|H_{6{N_{s}}}|\Phi_{g}\rangle (159)

It is close to Eg=6NsεgE_{g}=6{N_{s}}\varepsilon_{g} if we only include Coulomb interaction between electrons and their effective |e||e| ion.

\bullet In the lowest set of (6Ns)(6N_{s})-electron excited states, one spin-σ\sigma electron goes from the spatial ground level (μ,g)(\mu,g) of the 𝐑{\bf R}_{\ell} site to the excited level ee of the same site. The lowest set of system excited states thus reads (see Fig. 13(b))

|Φσ,μ;𝐑=a^σ,e,a^σ,μ,g,|Φg|\Phi_{\sigma,\mu;{\bf R}_{\ell}}\rangle=\hat{a}^{\dagger}_{\sigma,e,\ell}\hat{a}_{\sigma,\mu,g,\ell}|\Phi_{g}\rangle (160)

This excited subspace, that has an excitation energy close to εeεg\varepsilon_{e}-\varepsilon_{g}, is (2×3×Ns2\times 3\times{N_{s}})-degenerate. The various Coulomb interactions that appear in the H6NsH_{6{N_{s}}} Hamiltonian (153) split this degeneracy, as obtained from the diagonalization of the (6Ns×6Ns6{N_{s}}\times 6{N_{s}}) matrix

Φσ,μ;𝐑|H6Ns|Φσ,μ;𝐑\displaystyle\langle\Phi_{\sigma^{\prime},\mu^{\prime};{\bf R}_{\ell^{\prime}}}|H_{6{N_{s}}}|\Phi_{\sigma,\mu;{\bf R}_{\ell}}\rangle (161)

6.3 Hamiltonian in terms of ground and excited electron operators

In order to calculate the above (6Ns×6Ns6{N_{s}}\times 6{N_{s}}) matrix for the |Φσ,μ;𝐑|\Phi_{\sigma,\mu;{\bf R}_{\ell}}\rangle subspace defined in Eq. (160), we first have to rewrite the H6NsH_{6{N_{s}}} Hamiltonian (153) in terms of electronic-state operators a^s,ν,\hat{a}^{\dagger}_{s,\nu,\ell}. The H6NsH_{6{N_{s}}} relevant parts for calculating this matrix are the ones that keep the number of ground-level electrons and the number of excited-level electrons because the other states are higher in energy.

6.3.1 One-body part

\bullet The first two terms of the H6NsH_{6{N_{s}}} Hamiltonian (153), which correspond to the electron kinetic energy and the Coulomb interaction of the electrons with all 6|e|6|e| ions, are a sum of one-body terms. As in Eq. (11), we rewrite them as

H0,6Ns=s=±1/2j=13Ns(𝐩s,j22m0+=1Ns6e2|𝐫s,j𝐑|)s,jNshs,jH_{0,6{N_{s}}}=\sum_{s=\pm 1/2}\sum_{j=1}^{3{N_{s}}}\left(\frac{{\bf p}^{2}_{s,j}}{2m_{0}}+\sum_{\ell=1}^{N_{s}}\frac{-6e^{2}}{|{\bf r}_{s,j}-{\bf R}_{\ell}|}\right)\equiv\sum_{s,j}^{N_{s}}h_{s,j} (162)

The hs,jh_{s,j} Hamiltonian differs from the h𝐑h_{\scriptscriptstyle{{\bf R}_{\ell}}} Hamiltonian used to define the one-body basis for second quantization, because, in hs,jh_{s,j}, the electron interacts with all 6|e|6|e| ions. The second quantization procedure gives the H^0\hat{H}_{0} operator associated with H0,6NsH_{0,6{N_{s}}} as

H^0=sν,ν,hν,;ν,a^s,ν,a^s,ν,\hat{H}_{0}=\sum_{s}\sum_{\nu^{\prime},\ell^{\prime}}\sum_{\nu,\ell}h_{\nu^{\prime},\ell^{\prime};\nu,\ell}\,\hat{a}^{\dagger}_{s,\nu^{\prime},\ell^{\prime}}\hat{a}_{s,\nu,\ell} (163)

since hs,jh_{s,j} does not act on spin.

The prefactor, that reads

hν,;ν,=L3d3rν,𝐑|𝐫(𝐩^22m0+′′=1Ns6e2|𝐫𝐑′′|)𝐫|ν,𝐑\displaystyle h_{\nu^{\prime},\ell^{\prime};\nu,\ell}=\int_{L^{3}}d^{3}r\,\langle\nu^{\prime},{\bf R}_{\ell^{\prime}}|{\bf r}\rangle\left(\frac{\hat{{\bf p}}^{2}}{2m_{0}}+\sum_{\ell^{\prime\prime}=1}^{N_{s}}\frac{-6e^{2}}{|{\bf r}-{\bf R}_{\ell^{\prime\prime}}|}\right)\langle{\bf r}|\nu,{\bf R}_{\ell}\rangle (164)

is calculated by first noting that the wave-function product is equal to zero for \ell^{\prime}\not=\ell, due to the tight-binding limit (6). Next, we note that |ν,𝐑|\nu,{\bf R}_{\ell}\rangle is h𝐑h_{\scriptscriptstyle{{\bf R}_{\ell}}} eigenstate. This leads us to add and subtract e2/|𝐫𝐑|-e^{2}/|{\bf r}-{\bf R}_{\ell}| from the above ′′\ell^{\prime\prime} sum, in order to bring the energy εν\varepsilon_{\nu} into this prefactor. We end with

hν,;ν,=δ,(ενδν,ν+tν,ν+uν,ν)=δ,(ενδν,ν+tν,ν)h_{\nu^{\prime},\ell^{\prime};\nu,\ell}=\delta_{\ell^{\prime},\ell}\left(\varepsilon_{\nu}\delta_{\nu^{\prime},\nu}+t_{\nu^{\prime},\nu}+u_{\nu^{\prime},\nu}\right)=\delta_{\ell^{\prime},\ell}\left(\varepsilon_{\nu}\delta_{\nu^{\prime},\nu}+t^{\prime}_{\nu^{\prime},\nu}\right) (165)

The tν,νt_{\nu^{\prime},\nu} term, that comes from the electron interaction with all the 6|e|6|e| ions located on other lattice sites, precisely reads, for 𝐫|ν,𝐑\langle{\bf r}|\nu,{\bf R}_{\ell}\rangle given in Eq. (4) and 𝐫𝐑=𝐫{\bf r}-{\bf R}_{\ell}={\bf r}_{\ell}, as

tν,ν\displaystyle t_{\nu^{\prime},\nu} =\displaystyle= L3d3rν|𝐫𝐫|ν′′Ns6e2|𝐫+𝐑𝐑′′|\displaystyle\int_{L^{3}}d^{3}r_{\ell}\,\langle\nu^{\prime}|{\bf r}_{\ell}\rangle\langle{\bf r}_{\ell}|\nu\rangle\sum_{\ell^{\prime\prime}\not=\ell}^{N_{s}}\frac{-6e^{2}}{|{\bf r}_{\ell}+{\bf R}_{\ell}-{\bf R}_{\ell^{\prime\prime}}|} (166)
=\displaystyle= L3d3rν|𝐫𝐫|ν𝐑𝟎6e2|𝐫𝐑|\displaystyle\int_{L^{3}}d^{3}r\,\langle\nu^{\prime}|{\bf r}\rangle\langle{\bf r}|\nu\rangle\sum_{{\bf R}\not=\bf 0}\frac{-6e^{2}}{|{\bf r}-{\bf R}|}

the sum being \ell-independent due to the Born-von Karman boundary condition. The electron interaction with the 6|e|6|e| charge on the 𝐑{\bf R}_{\ell} site, brings another contribution

uν,ν=L3d3rν,𝐑|𝐫5e2|𝐫𝐑|𝐫|ν,𝐑=L3d3rν|𝐫5e2|𝐫|𝐫|νu_{\nu^{\prime},\nu}=\int_{L^{3}}d^{3}r\,\langle\nu^{\prime},{\bf R}_{\ell}|{\bf r}\rangle\frac{-5e^{2}}{|{\bf r}-{\bf R}_{\ell}|}\langle{\bf r}|\nu,{\bf R}_{\ell}\rangle=\int_{L^{3}}d^{3}r\,\langle\nu^{\prime}|{\bf r}\rangle\frac{-5e^{2}}{|{\bf r}|}\langle{\bf r}|\nu\rangle (167)

as the |e||e| part of the 6|e|6|e| charge is already used in the h𝐑h_{\scriptscriptstyle{{\bf R}_{\ell}}} Hamiltonian.

The above results lead us to split the one-body Hamiltonian H^0\hat{H}_{0} as

H^0=s,ν,ενa^s,ν,a^s,ν,+s,ν,ν,tν,νa^s,ν,a^s,ν,\hat{H}_{0}=\sum_{s,\nu,\ell}\varepsilon_{\nu}\,\hat{a}^{\dagger}_{s,\nu,\ell}\hat{a}_{s,\nu,\ell}+\sum_{s,\nu^{\prime},\nu,\ell}t^{\prime}_{\nu^{\prime},\nu}\,\hat{a}^{\dagger}_{s,\nu^{\prime},\ell}\hat{a}_{s,\nu,\ell} (168)

This Hamiltonian allows transitions between different electronic levels of the same lattice site.

\bullet The parts of the H^0\hat{H}_{0} Hamiltonian that are relevant in the calculation of the (6Ns×6Ns6{N_{s}}\times 6{N_{s}}) matrix defined in Eq. (161) are the ones that keep the number of ground-level electrons and the number of excited-level electrons. They reduce to H^0,g+H^0,e\hat{H}_{0,g}+\hat{H}_{0,e}. The one-body Hamiltonian for threefold ground-level electrons appears as

H^0,g=s=±1/2(μ,μ)=(x,y,z)(εgδμ,μ+tμ,μ)=1Nsa^s,μ,g,a^s,μ,g,\hat{H}_{0,g}=\sum_{s=\pm 1/2}\,\sum_{(\mu^{\prime},\mu)=(x,y,z)}\big{(}\varepsilon_{g}\delta_{\mu^{\prime},\mu}+t^{\prime}_{\mu^{\prime},\mu}\big{)}\,\sum_{\ell=1}^{N_{s}}\,\hat{a}^{\dagger}_{s,\mu^{\prime},g,\ell}\hat{a}_{s,\mu,g,\ell} (169)

This Hamiltonian allows transitions between the different states μ\mu of the ground level. The part of H^0\hat{H}_{0} for nondegenerate excited electrons simply reads

H^0,e=(εe+te,e)σ=±1/2=1Nsa^s,e,a^s,e,\hat{H}_{0,e}=(\varepsilon_{e}+t^{\prime}_{e,e})\sum_{\sigma=\pm 1/2}\,\sum_{\ell=1}^{N_{s}}\hat{a}^{\dagger}_{s,e,\ell}\hat{a}_{s,e,\ell} (170)

6.3.2 Two-body electron-electron interactions

\bullet The electron-electron interaction in the H6NsH_{6{N_{s}}} Hamiltonian reads in the |s,ν,𝐑|s,\nu,{\bf R}_{\ell}\rangle basis just as the one of HNsH_{N_{s}} in this basis, except for the spin index that is conserved in Coulomb processes. By using Eq. (21), we simply get

V^ee=12{ν}s1,1s2,2𝒱𝐑1𝐑2(ν2ν2ν1ν1)a^s1,ν1,1a^s2,ν2,2a^s2,ν2,2a^s1,ν1,1\displaystyle\hat{V}_{e-e}=\frac{1}{2}\sum_{\{\nu\}}\sum_{s_{1},\ell_{1}}\sum_{s_{2},\ell_{2}}\mathcal{V}_{{\bf R}_{\ell_{1}}-{\bf R}_{\ell_{2}}}\left(\begin{smallmatrix}\nu^{\prime}_{2}&\hskip 2.84544pt\nu_{2}\\ \hskip 2.84544pt\nu^{\prime}_{1}&\nu_{1}\end{smallmatrix}\right)\hat{a}^{\dagger}_{s_{1},\nu^{\prime}_{1},\ell_{1}}\hat{a}^{\dagger}_{s_{2},\nu^{\prime}_{2},\ell_{2}}\hat{a}_{s_{2},\nu_{2},\ell_{2}}\hat{a}_{s_{1},\nu_{1},\ell_{1}} (171)

where 𝒱𝐑(ν2ν2ν1ν1)\mathcal{V}_{{\bf R}}\left(\begin{smallmatrix}\nu^{\prime}_{2}&\hskip 2.84544pt\nu_{2}\\ \hskip 2.84544pt\nu^{\prime}_{1}&\nu_{1}\end{smallmatrix}\right) is the scattering given in Eq. (20).

\bullet The parts of this electron-electron interaction that are relevant for calculating the Φσ,μ;𝐑|H6Ns|Φσ,μ;𝐑\langle\Phi_{\sigma^{\prime},\mu^{\prime};{\bf R}_{\ell^{\prime}}}|H_{6{N_{s}}}|\Phi_{\sigma,\mu;{\bf R}_{\ell}}\rangle matrix, are the ones that conserve the number of ground-level electrons and the number of excited-level electrons. They reduce to V^gg+V^eg(intra)+V^eg(inter)\hat{V}_{gg}+\hat{V}^{(intra)}_{eg}+\hat{V}^{(inter)}_{eg}. The interaction involving ground-level electrons only, now contains transitions inside the degenerate μ\mu subspace. It reads

V^gg=12𝒱𝐑1𝐑2(μ2,gμ2,gμ1,gμ1,g)a^s1,μ1,g,1a^s2,μ2,g,2a^s2,μ2,g,2a^s1,μ1,g,1\hat{V}_{gg}=\frac{1}{2}\sum\mathcal{V}_{{\bf R}_{\ell_{1}}-{\bf R}_{\ell_{2}}}\left(\begin{smallmatrix}\mu_{2}^{\prime},g&\,\mu_{2},g\\ \mu_{1}^{\prime},g&\,\mu_{1},g\end{smallmatrix}\right)\,\hat{a}^{\dagger}_{s_{1},\mu^{\prime}_{1},g,\ell_{1}}\hat{a}^{\dagger}_{s_{2},\mu^{\prime}_{2},g,\ell_{2}}\hat{a}_{s_{2},\mu_{2},g,\ell_{2}}\hat{a}_{s_{1},\mu_{1},g,\ell_{1}} (172)

the (s1,μ1,1)=(s2,μ2,2)(s_{1},\mu_{1},\ell_{1})=(s_{2},\mu_{2},\ell_{2}) process being equal to zero due to the Pauli exclusion principle.

Refer to caption
Figure 14: Same as Fig. 7, the electronic ground states now having a spatial index μ\mu.

With regard to the interaction between ground and excited levels, we again have intralevel and interlevel processes. Since the Coulomb interaction conserves the spin, the interaction in which the two scattered electrons stay in their level, reads (see Fig. 14(a))

V^eg(intra)=𝒱𝐑1𝐑2(eeμ,gμ,g)a^s1,μ,g,1a^s2,e,2a^s2,e,2a^s1,μ,g,1\displaystyle\hat{V}^{(intra)}_{eg}=\sum\mathcal{V}_{{\bf R}_{\ell_{1}}-{\bf R}_{\ell_{2}}}\left(\begin{smallmatrix}e&\,e\\ \mu^{\prime},g&\,\mu,g\end{smallmatrix}\right)\hat{a}^{\dagger}_{s_{1},\mu^{\prime},g,\ell_{1}}\hat{a}^{\dagger}_{s_{2},e,\ell_{2}}\hat{a}_{s_{2},e,\ell_{2}}\hat{a}_{s_{1},\mu,g,\ell_{1}} (173)

while the interaction in which they change level is given by (see Fig. 14(b))

V^eg(inter)=𝒱𝐑1𝐑2(μ2,geeμ1,g)a^s1,e,1a^s2,μ2,g,2a^s2,e,2a^s1,μ1,g,1\displaystyle\hat{V}^{(inter)}_{eg}=\sum\mathcal{V}_{{\bf R}_{\ell_{1}}-{\bf R}_{\ell_{2}}}\left(\begin{smallmatrix}\mu^{\prime}_{2},g&\,e\\ e&\,\mu_{1},g\end{smallmatrix}\right)\hat{a}^{\dagger}_{s_{1},e,\ell_{1}}\hat{a}^{\dagger}_{s_{2},\mu^{\prime}_{2},g,\ell_{2}}\hat{a}_{s_{2},e,\ell_{2}}\hat{a}_{s_{1},\mu_{1},g,\ell_{1}} (174)

\bullet The ion-ion interaction, i.e., the last term of H6NsH_{6{N_{s}}} in Eq. (153), appears as in Eq. (22), but for ion now having a 6|e|6|e| charge. It reads

Vii=Ns2𝐑𝟎36e2|𝐑|V_{i-i}=\frac{{N_{s}}}{2}\sum_{{\bf R}\not=\bf 0}\frac{36\,e^{2}}{|{\bf R}|} (175)

6.4 From ground and excited electrons to electrons and holes

\bullet The change from ground-level electron absence to hole goes along with a phase factor induced by its spin part, the spatial part bringing no phase factor when the states of the threefold level are labeled as μ=(x,y,z)\mu=(x,y,z), due to cyclic invariance [30, 31]. This change reads

as,μ,g,=(1)1/2sb^s,μ,a_{s,\mu,g,\ell}=(-1)^{1/2-s}\,\hat{b}^{\dagger}_{-s,\mu,\ell} (176)

The excited-level electron operator simply leads to electron operator as

as,e,=as,a^{\dagger}_{s,e,\ell}=a^{\dagger}_{s,\ell} (177)

\bullet So, the lowest set of excited states given in Eq. (160) becomes in terms of electron and hole

|Φσ,μ;𝐑=(1)1/2σa^σ,b^σ,μ,|Φg|\Phi_{\sigma,\mu;{\bf R}_{\ell}}\rangle=(-1)^{1/2-\sigma}\,\hat{a}^{\dagger}_{\sigma,\ell}\hat{b}^{\dagger}_{-\sigma,\mu,\ell}|\Phi_{g}\rangle (178)

Note that these electron-hole pair states have a total spin SzS_{z} equal to zero.

\bullet When written in terms of holes, the one-body ground-level Hamiltonian (169) becomes, since a^s,μ,g,a^s,μ,g,=δμ,μb^s,μ,b^s,μ,\hat{a}^{\dagger}_{s,\mu^{\prime},g,\ell}\hat{a}_{s,\mu,g,\ell}=\delta_{\mu^{\prime},\mu}-\hat{b}^{\dagger}_{-s,\mu,\ell}\hat{b}_{-s,\mu^{\prime},\ell},

H^0,g=2Ns(3εg+μtμ,μ)μ,μs,(εgδμ,μ+tμ,μ)b^s,μ,b^s,μ,\hat{H}_{0,g}=2{N_{s}}\Big{(}3\varepsilon_{g}+\sum_{\mu}t^{\prime}_{\mu,\mu}\Big{)}-\sum_{\mu^{\prime},\mu}\sum_{s,\ell}(\varepsilon_{g}\delta_{\mu^{\prime},\mu}+t^{\prime}_{\mu^{\prime},\mu})\,\hat{b}^{\dagger}_{s,\mu,\ell}\hat{b}_{s,\mu^{\prime},\ell} (179)

\bullet In the same way, the Coulomb interaction V^gg\hat{V}_{gg} given in Eq. (172) between ground-level electrons, reads in terms of hole operators as

V^gg=Vg𝒱g(μ,μ)b^s,μ,b^s,μ,+\hat{V}_{gg}=V_{g}-\sum\mathcal{V}_{g}(\mu^{\prime},\mu)\,\hat{b}^{\dagger}_{s,\mu,\ell}\hat{b}_{s,\mu^{\prime},\ell}+\cdots (180)

within a hole-hole interaction that does not act in the ground and lowest-excited subspaces. The precise values of the constant term VgV_{g} and the prefactor of the above one-body operator are given in G.

\bullet Following the same procedure, we find that the interactions involving ground and excited levels, given in Eqs. (173,174), split into a one-body part and two two-body parts

V^eg(intra)+V^eg(inter)=𝒱es,a^s,a^s,+V^eh(intra)+V^eh(inter)\hat{V}^{(intra)}_{eg}+\hat{V}^{(inter)}_{eg}=\mathcal{V}_{e}\sum_{s,\ell}\hat{a}^{\dagger}_{s,\ell}\hat{a}_{s,\ell}+\hat{V}^{(intra)}_{eh}+\hat{V}^{(inter)}_{eh} (181)

with 𝒱e\mathcal{V}_{e} given by

𝒱e=2μ𝐑𝒱𝐑(eeμ,gμ,g)μ𝒱𝐑=𝟎(μ,geeμ,g)\mathcal{V}_{e}=2\sum_{\mu}\sum_{\bf R}\mathcal{V}_{{\bf R}}\left(\begin{smallmatrix}e&e\\ \mu,g&\mu,g\end{smallmatrix}\right)-\sum_{\mu}\mathcal{V}_{{\bf R}=\bf 0}\left(\begin{smallmatrix}\mu,g&e\\ e&\mu,g\end{smallmatrix}\right) (182)

Equation (181) contains two two-body parts. The one that corresponds to Coulomb processes in which the electron stays electron reads

V^eh(intra)=s1,s2μ1,μ11,2𝒱𝐑1𝐑2(eeμ1,gμ1,g)a^s2,2b^s1,μ1,1b^s1,μ1,1a^s2,2\displaystyle\hat{V}^{(intra)}_{eh}=-\sum_{s_{1},s_{2}}\sum_{\mu^{\prime}_{1},\mu_{1}}\sum_{\ell_{1},\ell_{2}}\mathcal{V}_{{\bf R}_{\ell_{1}}-{\bf R}_{\ell_{2}}}\left(\begin{smallmatrix}e&e\\ \mu^{\prime}_{1},g&\mu_{1},g\end{smallmatrix}\right)\hat{a}^{\dagger}_{s_{2},\ell_{2}}\hat{b}^{\dagger}_{s_{1},\mu_{1},\ell_{1}}\hat{b}_{s_{1},\mu^{\prime}_{1},\ell_{1}}\hat{a}_{s_{2},\ell_{2}} (183)

The V^eh(inter)\hat{V}^{(inter)}_{eh} part of Eq. (181) corresponds to Coulomb processes in which the electron and the hole of a lattice site recombines, while another electron-hole pair is created on either the same site or a different site. By writing the spin part of these terms explicitly, we find that this interaction appears just as in Eq. (4.1.5), with additional μ\mu indices to label the involved ground states, namely

V^eh(inter)\displaystyle\hat{V}^{(inter)}_{eh} =\displaystyle= μ1,μ21,2𝒱𝐑1𝐑2(μ2,geeμ1,g)\displaystyle\sum_{\mu_{1},\mu^{\prime}_{2}}\sum_{\ell_{1},\ell_{2}}\mathcal{V}_{{\bf R}_{\ell_{1}}-{\bf R}_{\ell_{2}}}\left(\begin{smallmatrix}\mu^{\prime}_{2},g&e\\ e&\mu_{1},g\end{smallmatrix}\right)
(a^12,1b^12,μ1,1a^12,1b^12,μ1,1)(b^12,μ2,2a^12,2b^12,μ2,2a^12,2)\displaystyle\Big{(}\hat{a}^{\dagger}_{\frac{1}{2},\ell_{1}}\hat{b}^{\dagger}_{-\frac{1}{2},\mu_{1},\ell_{1}}-\hat{a}^{\dagger}_{-\frac{1}{2},\ell_{1}}\hat{b}^{\dagger}_{\frac{1}{2},\mu_{1},\ell_{1}}\Big{)}\Big{(}\hat{b}_{-\frac{1}{2},\mu^{\prime}_{2},\ell_{2}}\hat{a}_{\frac{1}{2},\ell_{2}}-\hat{b}_{\frac{1}{2},\mu^{\prime}_{2},\ell_{2}}\hat{a}_{-\frac{1}{2},\ell_{2}}\Big{)}

\bullet By collecting all these terms, we end with the relevant parts of the Hamiltonian H^eh\hat{H}_{eh} for the Frenkel exciton problem, as

H^eh=Eg+H^e+H^h+V^eh(intra)+V^eh(inter)\hat{H}_{eh}=E^{\prime}_{g}+\hat{H}_{e}+\hat{H}_{h}+\hat{V}^{(intra)}_{eh}+\hat{V}^{(inter)}_{eh} (185)

The constant term which precisely reads

Eg=Φg|H^|Φg=6Nsεg+2Nsμtμ,μ+Vg+Ns2𝐑𝟎36e2|𝐑|\displaystyle E^{\prime}_{g}=\langle\Phi_{g}|\hat{H}|\Phi_{g}\rangle=6{N_{s}}\varepsilon_{g}+2{N_{s}}\sum_{\mu}t^{\prime}_{\mu,\mu}+V_{g}+\frac{{N_{s}}}{2}\sum_{{\bf R}\not=\bf 0}\frac{36\,e^{2}}{|{\bf R}|} (186)

corresponds to the ground-state energy. It is possible to show that the overextensive terms that come from the long-range character of the Coulomb potential cancel out exactly, thanks to the ion-ion interaction that appears in the last term of the above equation.

The electron and hole kinetic parts read

H~e\displaystyle\widetilde{H}_{e} =\displaystyle= ε~es,a^s,a^s,\displaystyle\tilde{\varepsilon}_{e}\sum_{s,\ell}\hat{a}^{\dagger}_{s,\ell}\hat{a}_{s,\ell} (187)
H~h\displaystyle\widetilde{H}_{h} =\displaystyle= s,μ,με~h(μ,μ)b^s,μ,b^s,μ,\displaystyle\sum_{s,\ell}\sum_{\mu^{\prime},\mu}\tilde{\varepsilon}_{h}(\mu,\mu^{\prime})\,\hat{b}^{\dagger}_{s,\mu^{\prime},\ell}\hat{b}_{s,\mu,\ell} (188)

They contain contributions from the V^ee\hat{V}_{e-e} Coulomb interaction, as seen from the electron and hole energies given by

ε~e\displaystyle\tilde{\varepsilon}_{e} =\displaystyle= εe+te,e+𝒱e\displaystyle\varepsilon_{e}+t^{\prime}_{e,e}+\mathcal{V}_{e} (189)
ε~h(μ,μ)\displaystyle\tilde{\varepsilon}_{h}(\mu,\mu^{\prime}) =\displaystyle= εgδμ,μtμ,μ𝒱g(μ,μ)\displaystyle-\varepsilon_{g}\delta_{\mu,\mu^{\prime}}-t^{\prime}_{\mu,\mu^{\prime}}-\mathcal{V}_{g}(\mu,\mu^{\prime}) (190)

6.5 Spin-singlet and spin-triplet subspaces

\bullet The last step is to note that as in Sec. 4, the relevant operators for the interlevel Coulomb interaction are not electron operators and hole operators, but electron-hole pair operators in spin-singlet states (see Eq. (6.4)). This leads us to introduce the symmetric and antisymmetric combinations of electron-hole pair operators that correspond to spin-triplet (S=1,Sz=0)(S=1,S_{z}=0) and spin-singlet (S=0,Sz=0)(S=0,S_{z}=0) states, defined as

B^S,μ,=a^12,b^12,μ,(1)Sa^12,b^12,μ,2forS=(1,0)\displaystyle\hat{B}^{\dagger}_{S,\mu,\ell}=\frac{\hat{a}^{\dagger}_{\frac{1}{2},\ell}\hat{b}^{\dagger}_{-\frac{1}{2},\mu,\ell}-(-1)^{S}\hat{a}^{\dagger}_{-\frac{1}{2},\ell}\hat{b}^{\dagger}_{\frac{1}{2},\mu,\ell}}{\sqrt{2}}\quad\quad\textrm{for}\quad\quad S=(1,0) (191)

and to replace the two excited states |Φσ,μ;𝐑|\Phi_{\sigma,\mu;{\bf R}_{\ell}}\rangle, for σ=±1/2\sigma=\pm 1/2, by the two states

|ΦS,μ;𝐑=B^S,μ,|0|\Phi_{S,\mu;{\bf R}_{\ell}}\rangle=\hat{B}^{\dagger}_{S,\mu,\ell}|0\rangle (192)

for S=(1,0)S=(1,0), where |0|0\rangle has zero hole and zero electron in the sense of Eq. (177); so, this state corresponds to the full valence band |Φg|\Phi_{g}\rangle. We can check that the above states also form an orthogonal set

ΦS,μ;𝐑|ΦS,μ;𝐑=δS,Sδμ,μδ,\langle\Phi_{S^{\prime},\mu^{\prime};{\bf R}_{\ell^{\prime}}}|\Phi_{S,\mu;{\bf R}_{\ell}}\rangle=\delta_{S^{\prime},S}\,\,\delta_{\mu^{\prime},\mu}\,\,\delta_{\ell^{\prime},\ell} (193)

that readily follows from Eq. (191), so that

[B^S,μ,,B^S,μ,]=δS,Sδμ,μδ,\displaystyle\left[\hat{B}_{S^{\prime},\mu^{\prime},\ell^{\prime}},\hat{B}^{\dagger}_{S,\mu,\ell}\right]_{-}=\delta_{S^{\prime},S}\,\delta_{\mu^{\prime},\mu}\,\delta_{\ell^{\prime},\ell} (194)

\bullet The interlevel interaction given in Eq. (6.4) then takes a compact form in terms of spin-singlet operators

V^eh(inter)=2μ1,μ21,2𝒱𝐑1𝐑2(μ2,geeμ1,g)B^0,μ1,1B^0,μ2,2\displaystyle\hat{V}^{(inter)}_{eh}=2\sum_{\mu_{1},\mu^{\prime}_{2}}\sum_{\ell_{1},\ell_{2}}\mathcal{V}_{{\bf R}_{\ell_{1}}-{\bf R}_{\ell_{2}}}\left(\begin{smallmatrix}\mu^{\prime}_{2},g&e\\ e&\mu_{1},g\end{smallmatrix}\right)\hat{B}^{\dagger}_{0,\mu_{1},\ell_{1}}\hat{B}_{0,\mu^{\prime}_{2},\ell_{2}} (195)

from which we get, with the help of Eq. (194),

V^eh(inter)|ΦS,μ;𝐑=2δS,0μ1,1𝒱𝐑1𝐑(μ,geeμ1,g)|Φ0,μ1;𝐑1\displaystyle\hat{V}^{(inter)}_{eh}|\Phi_{S,\mu;{\bf R}_{\ell}}\rangle=2\delta_{S,0}\sum_{\mu_{1},\ell_{1}}\mathcal{V}_{{\bf R}_{\ell_{1}}-{\bf R}_{\ell}}\left(\begin{smallmatrix}\mu,g&e\\ e&\mu_{1},g\end{smallmatrix}\right)|\Phi_{0,\mu_{1};{\bf R}_{\ell_{1}}}\rangle (196)

Through this interlevel Coulomb interaction, the excitation of a spin-singlet pair on a 𝐑{\bf R}_{\ell} site can be transferred to a different 𝐑1{\bf R}_{\ell_{1}} site.

By contrast, we note from

b^s1,μ1,1a^s2,2B^S,μ,|0=δ1,δ2,δμ1,μδs1,s2δs2,12(1)Sδs2,122|0\hat{b}_{s_{1},\mu^{\prime}_{1},\ell_{1}}\hat{a}_{s_{2},\ell_{2}}\hat{B}^{\dagger}_{S,\mu,\ell}|0\rangle=\delta_{\ell_{1},\ell}\delta_{\ell_{2},\ell}\delta_{\mu^{\prime}_{1},\mu}\delta_{s_{1},-s_{2}}\frac{\delta_{s_{2},\frac{1}{2}}-(-1)^{S}\delta_{s_{2},-\frac{1}{2}}}{\sqrt{2}}|0\rangle (197)

that the intralevel interaction (183) acting on any state of the excited subspace

V^eh(intra)|ΦS,μ;𝐑=μ1𝒱𝐑=𝟎(eeμ,gμ1,g)|ΦS,μ1;𝐑\hat{V}^{(intra)}_{eh}|\Phi_{S,\mu;{\bf R}_{\ell}}\rangle=-\sum_{\mu_{1}}\mathcal{V}_{{\bf R}=\bf 0}\left(\begin{smallmatrix}e&e\\ \mu,g&\mu_{1},g\end{smallmatrix}\right)|\Phi_{S,\mu_{1};{\bf R}_{\ell}}\rangle (198)

does not produce an excitation transfer from site to site.

\bullet In the same way, the one-body parts of the H^eh\hat{H}_{eh} Hamiltonian given in Eq. (185) acting on the excited subspace

(H^e+H^h)|ΦS,μ;𝐑=μ(ε~eδμ,μ+ε~h(μ,μ))|ΦS,μ;𝐑\displaystyle\Big{(}\hat{H}_{e}+\hat{H}_{h}\Big{)}|\Phi_{S,\mu;{\bf R}_{\ell}}\rangle=\sum_{\mu^{\prime}}\Big{(}\tilde{\varepsilon}_{e}\delta_{\mu,\mu^{\prime}}+\tilde{\varepsilon}_{h}(\mu,\mu^{\prime})\Big{)}|\Phi_{S,\mu^{\prime};{\bf R}_{\ell}}\rangle (199)

does not delocalize the on-site excitation.

6.6 Electron-hole Hamiltonian in the |ΦS,μ;𝐑|\Phi_{S,\mu;{\bf R}_{\ell}}\rangle excited subspace

\bullet By collecting all the above results, we find that the (6Ns×6Ns6N_{s}\times 6N_{s}) matrix that represents the H^eh\hat{H}_{eh} Hamiltonian in the |ΦS,μ;𝐑|\Phi_{S,\mu;{\bf R}_{\ell}}\rangle excited subspace, appears block-diagonal in the spin-triplet subspace and in the spin-singlet subspace. The (3Ns×3Ns3N_{s}\times 3N_{s}) submatrix in the spin-triplet subspace (S=1,Sz=0)(S=1,S_{z}=0) is diagonal with respect to the NsN_{s} lattice sites because the coupling between lattice sites, induced by the electron-electron Coulomb interaction, differs from zero for spin-singlet pairs only. The main diagonal is made of (3×33\times 3) submatrices that have \ell-independent matrix elements given by

ES=1(pair)(μ,μ)=ε~eδμ,μ+ε~h(μ,μ)𝒱𝐑=𝟎(eeμ,gμ,g)E_{S=1}^{(pair)}(\mu,\mu^{\prime})=\tilde{\varepsilon}_{e}\delta_{\mu,\mu^{\prime}}+\tilde{\varepsilon}_{h}(\mu,\mu^{\prime})-\mathcal{V}_{{\bf R}=\bf 0}\left(\begin{smallmatrix}e&e\\ \mu,g&\mu^{\prime},g\end{smallmatrix}\right) (200)

By contrast, the H^eh\hat{H}_{eh} Hamiltonian in the spin-singlet subspace (S=0,Sz=0)(S=0,S_{z}=0), is not diagonal with respect to the lattice sites. The off-diagonal terms, that read

𝒱(e,g)(μ,μ)=2𝒱𝐑𝐑(μ,geeμ,g)\mathcal{V}^{(e,g)}_{\ell^{\prime}\not=\ell}(\mu,\mu^{\prime})=2\mathcal{V}_{{\bf R}_{\ell^{\prime}}-{\bf R}_{\ell}}\left(\begin{smallmatrix}\mu,g&e\\ e&\mu^{\prime},g\end{smallmatrix}\right) (201)

couple different lattice sites through interlevel Coulomb processes. The diagonal part of the (3Ns×3Ns3N_{s}\times 3N_{s}) submatrix in the spin-singlet subspace is also made of (3×33\times 3) submatrices, like for the spin-triplet subspace, with \ell-independent matrix elements that read

ES=0(pair)(μ,μ)=ES=1(pair)(μ,μ)+2𝒱𝐑=𝟎(μ,geeμ,g)E_{S=0}^{(pair)}(\mu,\mu^{\prime})=E_{S=1}^{(pair)}(\mu,\mu^{\prime})+2\mathcal{V}_{{\bf R}=\bf 0}\left(\begin{smallmatrix}\mu,g&e\\ e&\mu^{\prime},g\end{smallmatrix}\right) (202)

They differ from the ones in the spin-triplet subspace due to the existence of on-site interlevel Coulomb processes, that corresponds to the last term of the above equation.

\bullet So, just like when the spin enters into play (see Eq. (116)), the spin-triplet and spin-singlet states |ΦS,μ;𝐑|\Phi_{S,\mu;{\bf R}_{\ell}}\rangle for S=(1,0)S=(1,0) taken as a basis, render block-diagonal the (6Ns×6Ns6N_{s}\times 6N_{s}) matrix that represents the H^eh\hat{H}_{eh} Hamiltonian in the lowest-excited subspace: it appears as

(E1(pair)(μ,μ)0000E1(pair)(μ,μ)0000E0(pair)(μ,μ)𝒱,(e,g)(μ,μ)00(𝒱,(e,g)(μ,μ))E0(pair)(μ,μ))\left(\begin{matrix}E^{(pair)}_{1}(\mu,\mu^{\prime})&\cdots&0&0&\cdots&0\\ \vdots&\ddots&\vdots&\vdots&\ddots&\vdots\\ 0&\cdots&E^{(pair)}_{1}(\mu,\mu^{\prime})&0&\cdots&0\\ 0&\cdots&0&E^{(pair)}_{0}(\mu,\mu^{\prime})&\cdots&\mathcal{V}^{(e,g)}_{\ell^{\prime},\ell}(\mu,\mu^{\prime})\\ \vdots&\ddots&\vdots&\vdots&\ddots&\vdots\\ 0&\cdots&0&\left(\mathcal{V}^{(e,g)}_{\ell^{\prime},\ell}(\mu,\mu^{\prime})\right)^{\ast}&\cdots&E^{(pair)}_{0}(\mu,\mu^{\prime})\end{matrix}\right) (203)

The (3Ns×3Ns3N_{s}\times 3N_{s}) submatrix in the spin-triplet subspace is diagonal with respect to the lattice sites: the Coulomb interaction does not delocalize the on-site excitation because the interlevel interaction only acts on the spin-singlet subspace. We are thus left with diagonalizing the (3Ns×3Ns3N_{s}\times 3N_{s}) submatrix in the spin-singlet subspace. The Coulomb coupling between lattice sites that exists in this subspace is going to generate the Frenkel exciton waves.

6.7 Frenkel excitons

\bullet The Frenkel excitons, that correspond to wave excitations over the whole sample, are only formed in the spin-singlet subspace. To derive them, we first perform a diagonalization with respect to the lattice sites, through the same phase factor as the one given in Eq. (1), that is, we introduce the following linear combinations of on-site excitations

|Φ0,μ;𝐊n=1Ns=1Nsei𝐊n𝐑|Φ0,μ;𝐑|\Phi_{0,\mu;{\bf K}_{n}}\rangle=\frac{1}{\sqrt{{N_{s}}}}\sum_{\ell=1}^{N_{s}}e^{i{\bf K}_{n}\cdot{\bf R}_{\ell}}|\Phi_{0,\mu;{\bf R}_{\ell}}\rangle (204)

A calculation very similar to the one done in the previous sections, leads to

Φ0,μ;𝐊n|H^eh|Φ0,μ;𝐊n=δn,n(E0(pair)(μ,μ)+v𝐊n(μ,μ))\langle\Phi_{0,\mu^{\prime};{\bf K}_{n^{\prime}}}|\hat{H}_{eh}|\Phi_{0,\mu;{\bf K}_{n}}\rangle=\delta_{n^{\prime},n}\left(E_{0}^{(pair)}(\mu,\mu^{\prime})+v_{{}_{{\bf K}_{n}}}(\mu,\mu^{\prime})\right) (205)

where the interlevel Coulomb interaction appears, as in Eq. (142), through

v𝐊n(μ,μ)=2𝐑𝟎ei𝐊n𝐑𝒱𝐑(μ,geeμ,g)v_{{}_{{\bf K}_{n}}}(\mu,\mu^{\prime})=2\sum_{{\bf R}\not=\bf 0}e^{i{\bf K}_{n}\cdot{\bf R}}\mathcal{V}_{{\bf R}}\left(\begin{smallmatrix}\mu,g&e\\ e&\mu^{\prime},g\end{smallmatrix}\right) (206)

The last diagonalization with respect to the μ\mu spatial degeneracy, is similar to the one we have performed in Sec. 5. The interlevel Coulomb scattering is singular in the small 𝐊{\bf K} limit[36, 37]

lim𝐊𝟎v𝐊(μ,μ)=8π3ac3|dge|2(δμ,μ3KμKμK2)\lim_{{\bf K}\rightarrow\bf 0}v_{{}_{{\bf K}}}(\mu,\mu^{\prime})=-\frac{8\pi}{3a_{c}^{3}}|d_{ge}|^{2}\left(\delta_{\mu,\mu^{\prime}}-3\frac{K_{\mu}K_{\mu^{\prime}}}{K^{2}}\right) (207)

where dged_{ge} is the dipole moment between ground and excited distributions defined as in Eq. (145), namely

dge=eL3d3rμ,g|𝐫𝐫|erμd_{ge}=e\int_{L^{3}}d^{3}r\,\langle\mu,g|{\bf r}\rangle\langle{\bf r}|e\rangle\,r_{\mu} (208)

which does not depend on μ\mu due to cyclic invariance.

To end the diagonalization with respect to the spatial degeneracy, we choose the zz axis of the arbitrary set μ=(x,y,z)\mu=(x,y,z) along 𝐊n{\bf K}_{n}. Just as in Sec. 5, this leads to a “transverse-longitudinal splitting” of the Frenkel exciton level driven by the direction of its 𝐊n{\bf K}_{n} wave vector, with two transverse modes having a red shift and one longitudinal mode having a blue shift.

7 The Frenkel exciton problem formulated in first quantization

The very first book that dealt with excitons was written by Knox[5] in 1963. The exciton problem is presented within the first quantization framework, common at that time. This approach has been followed by numerous authors afterward, in spite of the fact that the first quantization formalism is surely not a convenient way to approach a many-body problem. In Knox’s book, that covers Frenkel and Wannier excitons, the many-electron states are written through wave function Slater determinants. These determinants render all calculations not only cumbersome but also tricky when the signs play a key role, as for problems in which spin-singlet and spin-triplet states have different properties. Most results in Knox’s book are presented without detailed derivations: this is why we find it useful to also present pedestrian derivations of some key results on Frenkel excitons when written in terms of Slater determinants, to be compared to the more transparent operator formulation given in the present manuscript.

The very first step to approach the exciton problem, that fundamentally deals with semiconductor excitations, is to properly approximate the relevant ground and lowest-excited semiconductor states. As for all problems on solid state physics, the major difficulty comes from the two-body electron-electron Coulomb interaction which is known to generate many-body effects that in most cases, cannot be handled exactly. This forces us to start with approximate semiconductor states written in terms of the most suitable one-body states and to derive the consequences of the electron-electron Coulomb interaction on these approximate states. As the spin plays a key role in the Frenkel exciton physics through differentiating spin-singlet from spin-triplet states, its introduction is far more tricky than just adding a few 2 prefactors to some quantities, we will, here also, start by considering the exciton problem in the absence of spin.

7.0.1 Slater determinants for semiconductor states without spin

\bullet Since semiconductors are nonmetallic crystals, the natural idea is to approximate their ground state by an antisymmetrized product of one-electron ground-state wave functions localized on all the NsN_{s} lattice sites of the crystal: indeed, in the absence of empty sites, the system cannot change under a small electric field, as required for a semiconductor. A simple idea for these one-electron wave functions is to take them as the eigenfunctions

(𝐩22m0+Ve(𝐫𝐑))𝐫|ν,𝐑=εν𝐫|ν,𝐑\left(\frac{{\bf p}^{2}}{2m_{0}}+V_{e}({\bf r}-{\bf R}_{\ell})\right)\langle{\bf r}|\nu,{\bf R}_{\ell}\rangle=\varepsilon_{\nu}\langle{\bf r}|\nu,{\bf R}_{\ell}\rangle (209)

where VeV_{e} stands for the Coulomb attraction to an effective charge |e||e| made of the atomic or molecular nucleus with charge Z|e|Z|e| and a cloud of (Z1)(Z-1) electrons. In the present manuscript, this potential is taken as the one of a point charge, for simplicity. The approximate NsN_{s}-electron ground state then reads as the Slater determinant

𝐫Ns,,𝐫1|Φg=1Ns!|𝐫1|g,𝐑1𝐫1|g,𝐑𝐫1|g,𝐑Ns𝐫Ns|g,𝐑1𝐫Ns|g,𝐑𝐫Ns|g,𝐑Ns|\langle{\bf r}_{\scriptscriptstyle{N_{s}}},\cdots,{\bf r}_{1}|\Phi_{g}\rangle=\frac{1}{\sqrt{{N_{s}}!}}\begin{vmatrix}\langle{\bf r}_{1}|g,{\bf R}_{1}\rangle&\cdots&\langle{\bf r}_{1}|g,{\bf R}_{\ell}\rangle&\cdots&\langle{\bf r}_{1}|g,{\bf R}_{N_{s}}\rangle\\ \vdots&\ddots&\vdots&\ddots&\vdots\\ \langle{\bf r}_{\scriptscriptstyle{N_{s}}}|g,{\bf R}_{1}\rangle&\cdots&\langle{\bf r}_{\scriptscriptstyle{N_{s}}}|g,{\bf R}_{\ell}\rangle&\cdots&\langle{\bf r}_{\scriptscriptstyle{N_{s}}}|g,{\bf R}_{N_{s}}\rangle\end{vmatrix} (210)

where ν=g\nu=g stands for the ground level.

\bullet In the case of semiconductors hosting Frenkel excitons, the 𝐫|g,𝐑\langle{\bf r}|g,{\bf R}_{\ell}\rangle function is highly localized on the 𝐑{\bf R}_{\ell} lattice site, at the cell size scale; so, the |Φg|\Phi_{g}\rangle state is made of states highly localized on the 𝐑{\bf R}_{\ell} sites. This state can look at first very different from what should be taken for the ground state of semiconductors hosting Wannier excitons because their physically relevant one-electron states, known as Bloch states, correspond to itinerant electrons characterized by a wave vector. As a result, the natural way to write the corresponding semiconductor ground state is as Eq. (210), with 𝐫|g,𝐑\langle{\bf r}|g,{\bf R}_{\ell}\rangle for =(1,,Ns)\ell=(1,\cdots,{N_{s}}) replaced by the Bloch wave functions 𝐫|g,𝐤n\langle{\bf r}|g,{\bf k}_{n}\rangle for n=(1,,Ns)n=(1,\cdots,{N_{s}}).

Wannier noted[2] that these two ground states actually are identical by introducing the linear combinations of |g,𝐤n|g,{\bf k}_{n}\rangle states, known as Wannier states,

|g,𝐑=1Nsn=1Nsei𝐤n𝐑|g,𝐤n|g,{\bf R}_{\ell}\rangle=\frac{1}{\sqrt{{N_{s}}}}\sum_{n=1}^{N_{s}}e^{-i{\bf k}_{n}\cdot{\bf R}_{\ell}}|g,{\bf k}_{n}\rangle (211)

When inserted into Eq. (210), the resulting Slater determinant can only contain columns made of different (g,𝐤n)(g,{\bf k}_{n})’s, in order to differ from zero. So, Eq. (210) must also read

𝐫Ns,,𝐫1|Φg=1Ns!|𝐫1|g,𝐤1𝐫1|g,𝐤n𝐫1|g,𝐤Ns𝐫Ns|g,𝐤1𝐫Ns|g,𝐤n𝐫Ns|g,𝐤Ns|\langle{\bf r}_{\scriptscriptstyle{N_{s}}},\cdots,{\bf r}_{1}|\Phi_{g}\rangle=\frac{1}{\sqrt{{N_{s}}!}}\begin{vmatrix}\langle{\bf r}_{1}|g,{\bf k}_{1}\rangle&\cdots&\langle{\bf r}_{1}|g,{\bf k}_{n}\rangle&\cdots&\langle{\bf r}_{1}|g,{\bf k}_{N_{s}}\rangle\\ \vdots&\ddots&\vdots&\ddots&\vdots\\ \langle{\bf r}_{\scriptscriptstyle{N_{s}}}|g,{\bf k}_{1}\rangle&\cdots&\langle{\bf r}_{\scriptscriptstyle{N_{s}}}|g,{\bf k}_{n}\rangle&\cdots&\langle{\bf r}_{\scriptscriptstyle{N_{s}}}|g,{\bf k}_{N_{s}}\rangle\end{vmatrix} (212)

within a global phase factor, for the two formulations are normalized.

The equivalence between these two expressions holds when all one-electron states labeled either by 𝐑{\bf R}_{\ell} or by 𝐤n{\bf k}_{n} are occupied. A pedestrian way to be convinced of this equivalence is to consider Ns=2{N_{s}}=2. Equation (210) which then reads

𝐫2,𝐫1|Φg=12!(𝐫1|g,𝐑1𝐫2|g,𝐑2𝐫1|g,𝐑2𝐫2|g,𝐑1)\langle{\bf r}_{2},{\bf r}_{1}|\Phi_{g}\rangle=\frac{1}{\sqrt{2!}}\Big{(}\langle{\bf r}_{1}|g,{\bf R}_{1}\rangle\langle{\bf r}_{2}|g,{\bf R}_{2}\rangle-\langle{\bf r}_{1}|g,{\bf R}_{2}\rangle\langle{\bf r}_{2}|g,{\bf R}_{1}\rangle\Big{)} (213)

reduces, with the help of Eq. (211), to

𝐫2,𝐫1|Φg\displaystyle\langle{\bf r}_{2},{\bf r}_{1}|\Phi_{g}\rangle =\displaystyle= 12!(𝐫1|g,𝐤1𝐫2|g,𝐤2𝐫1|g,𝐤2𝐫2|g,𝐤1)\displaystyle\frac{1}{\sqrt{2!}}\Big{(}\langle{\bf r}_{1}|g,{\bf k}_{1}\rangle\langle{\bf r}_{2}|g,{\bf k}_{2}\rangle-\langle{\bf r}_{1}|g,{\bf k}_{2}\rangle\langle{\bf r}_{2}|g,{\bf k}_{1}\rangle\Big{)} (214)
ei𝐤1𝐑1ei𝐤2𝐑2ei𝐤2𝐑1ei𝐤1𝐑22\displaystyle\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\frac{e^{-i{\bf k}_{1}\cdot{\bf R}_{1}}e^{-i{\bf k}_{2}\cdot{\bf R}_{2}}-e^{-i{\bf k}_{2}\cdot{\bf R}_{1}}e^{-i{\bf k}_{1}\cdot{\bf R}_{2}}}{2}

The fraction is nothing but an overall phase factor which depends on the chosen origins for 𝐑{\bf R}_{\ell} and 𝐤n{\bf k}_{n}: in the case of a one-dimensional sample with length LL, this fraction is equal to 1 for the (𝐑,𝐤)({\bf R},{\bf k})’s taken as (𝐑1=0,𝐑2=L/2)({\bf R}_{1}=0,{\bf R}_{2}=L/2) and (𝐤1=2π/L,𝐤2=4π/L)({\bf k}_{1}=2\pi/L,{\bf k}_{2}=4\pi/L).

A general proof of the equivalence between Eq. (210) and Eq. (212) can be obtained by noting that the matrix

(𝐫1|g,𝐑1𝐫1|g,𝐑𝐫1|g,𝐑Ns𝐫Ns|g,𝐑1𝐫Ns|g,𝐑𝐫Ns|g,𝐑Ns)\displaystyle\begin{pmatrix}\langle{\bf r}_{1}|g,{\bf R}_{1}\rangle&\cdots&\langle{\bf r}_{1}|g,{\bf R}_{\ell}\rangle&\cdots&\langle{\bf r}_{1}|g,{\bf R}_{N_{s}}\rangle\\ \vdots&\ddots&\vdots&\ddots&\vdots\\ \langle{\bf r}_{\scriptscriptstyle{N_{s}}}|g,{\bf R}_{1}\rangle&\cdots&\langle{\bf r}_{\scriptscriptstyle{N_{s}}}|g,{\bf R}_{\ell}\rangle&\cdots&\langle{\bf r}_{\scriptscriptstyle{N_{s}}}|g,{\bf R}_{N_{s}}\rangle\end{pmatrix} (215)

can be rewritten as the product

(𝐫1|g,𝐤1𝐫1|g,𝐤n𝐫1|g,𝐤Ns𝐫Ns|g,𝐤1𝐫Ns|g,𝐤n𝐫Ns|g,𝐤Ns)𝒰\displaystyle\begin{pmatrix}\langle{\bf r}_{1}|g,{\bf k}_{1}\rangle&\cdots&\langle{\bf r}_{1}|g,{\bf k}_{n}\rangle&\cdots&\langle{\bf r}_{1}|g,{\bf k}_{N_{s}}\rangle\\ \vdots&\ddots&\vdots&\ddots&\vdots\\ \langle{\bf r}_{\scriptscriptstyle{N_{s}}}|g,{\bf k}_{1}\rangle&\cdots&\langle{\bf r}_{\scriptscriptstyle{N_{s}}}|g,{\bf k}_{n}\rangle&\cdots&\langle{\bf r}_{\scriptscriptstyle{N_{s}}}|g,{\bf k}_{N_{s}}\rangle\end{pmatrix}\,\,\mathcal{U} (216)

with 𝒰\mathcal{U} given by

𝒰=(ei𝐤1𝐑1Nsei𝐤1𝐑Nsei𝐤1𝐑NsNsei𝐤Ns𝐑1Nsei𝐤Ns𝐑Nsei𝐤Ns𝐑NsNs)\mathcal{U}=\begin{pmatrix}\displaystyle\frac{e^{-i{\bf k}_{1}\cdot{\bf R}_{1}}}{\sqrt{{N_{s}}}}&\cdots&\displaystyle\frac{e^{-i{\bf k}_{1}\cdot{\bf R}_{\ell}}}{\sqrt{{N_{s}}}}&\cdots&\displaystyle\frac{e^{-i{\bf k}_{1}\cdot{\bf R}_{N_{s}}}}{\sqrt{{N_{s}}}}\\ \vdots&\ddots&\vdots&\ddots&\vdots\\ \displaystyle\frac{e^{-i{\bf k}_{\scriptscriptstyle{N_{s}}}\cdot{\bf R}_{1}}}{\sqrt{{N_{s}}}}&\cdots&\displaystyle\frac{e^{-i{\bf k}_{\scriptscriptstyle{N_{s}}}\cdot{\bf R}_{\ell}}}{\sqrt{{N_{s}}}}&\cdots&\displaystyle\frac{e^{-i{\bf k}_{\scriptscriptstyle{N_{s}}}\cdot{\bf R}_{N_{s}}}}{\sqrt{{N_{s}}}}\end{pmatrix} (217)

We then note that det(AB)=det(A)det(B)\det(AB)=\det(A)\det(B), while (𝒰𝒰)=I\mathcal{U}^{\dagger}\mathcal{U})=\textrm{I}, so that |det𝒰|=1|\det\mathcal{U}|=1. As a result, we readily conclude that Eq. (210) and Eq. (212) are equal, within a phase.

These Wannier states are commonly used to rewrite the ground state for materials hosting Wannier excitons, originally written in terms of delocalized Bloch functions labeled by a kn\textbf{k}_{n} wave vector, in terms of functions labeled by a lattice site 𝐑{\bf R}_{\ell}. These Wannier functions are definitively more localized on lattice sites than the fully delocalized Bloch functions; however, they are far from being localized enough for the tight-binding approximation to be valid — as evidenced from the fact that excitations on different lattice sites are accepted when forming the lowest Wannier exciton subspace (see below). Actually, besides nicely relating the two forms of the crystal ground state, these Wannier states are not really useful for semiconductors: indeed, calculations related to Wannier excitons are far easier to perform with Bloch states than with Wannier states, even for the so-called long-range and short-range parts of the “electron-hole exchange”[45].

\bullet Starting from the |Φg|\Phi_{g}\rangle ground state written as in Eq. (210), it appears natural to write the lowest-excited subspace in the same way, with the |g,𝐑|g,{\bf R}_{\ell}\rangle column replaced by the |e,𝐑|e,{\bf R}_{\ell}\rangle column, where ν=e\nu=e stands for the lowest-excited electronic level, namely

𝐫Ns,,𝐫1|Φ𝐑=1Ns!|𝐫1|g,𝐑1𝐫1|e,𝐑𝐫1|g,𝐑Ns𝐫Ns|g,𝐑1𝐫Ns|e,𝐑𝐫Ns|g,𝐑Ns|\langle{\bf r}_{\scriptscriptstyle{N_{s}}},\cdots,{\bf r}_{1}|\Phi_{{\bf R}_{\ell}}\rangle=\frac{1}{\sqrt{{N_{s}}!}}\begin{vmatrix}\langle{\bf r}_{1}|g,{\bf R}_{1}\rangle&\cdots&\langle{\bf r}_{1}|e,{\bf R}_{\ell}\rangle&\cdots&\langle{\bf r}_{1}|g,{\bf R}_{N_{s}}\rangle\\ \vdots&\ddots&\vdots&\ddots&\vdots\\ \langle{\bf r}_{\scriptscriptstyle{N_{s}}}|g,{\bf R}_{1}\rangle&\cdots&\langle{\bf r}_{\scriptscriptstyle{N_{s}}}|e,{\bf R}_{\ell}\rangle&\cdots&\langle{\bf r}_{\scriptscriptstyle{N_{s}}}|g,{\bf R}_{N_{s}}\rangle\end{vmatrix} (218)

In doing so, we accept that one of the Ns{N_{s}} electrons of the |Φg|\Phi_{g}\rangle state jumps to the lowest excited state of the same lattice site. This restriction physically imposes the |e,𝐑|e,{\bf R}_{\ell}\rangle states to be highly localized on their lattice site. Indeed, a jump to 𝐑𝐑{\bf R}_{\ell^{\prime}}\not={\bf R}_{\ell} would cost an electrostatic energy because the |e||e| charge that produces the Ve(𝐫𝐑)V_{e}({\bf r}-{\bf R}_{\ell}) potential, would then be well apart from the |e|-|e| charge of the excited electron.

It is of importance to stress the major difference between semiconductors hosting Wannier excitons and semiconductors hosting Frenkel excitons, on that respect. Indeed, even if we can formally write the crystal ground state in the same way, this is not so for the lowest excited states. In the case of Wannier excitons, the 𝐫|g,𝐑\langle{\bf r}|g,{\bf R}_{\ell}\rangle column is replaced by a 𝐫|e,𝐑\langle{\bf r}|e,{\bf R}_{\ell^{\prime}}\rangle column with 𝐑{\bf R}_{\ell^{\prime}} possibly different from 𝐑{\bf R}_{\ell}. So, the lowest set of excited states, then written as |Φ𝐑,𝐑¯|\Phi_{{\bf R}_{\ell^{\prime}},\overline{{\bf R}}_{\ell}}\rangle, depends on two lattice sites, 𝐑{\bf R}_{\ell^{\prime}} that denotes where the electron is and 𝐑{\bf R}_{\ell} that denotes from where it is removed. The physical reason for not enforcing 𝐑=𝐑{\bf R}_{\ell}={\bf R}_{\ell^{\prime}} is that the Wannier states formed on Bloch states through Eq. (211), are not that much localized on the 𝐑{\bf R}_{\ell} sites; so, the subspaces in which the electron jumps to the excited level of a different site, are not so far above in energy. This imposes to include all the |Φ𝐑,𝐑¯|\Phi_{{\bf R}_{\ell^{\prime}},\overline{{\bf R}}_{\ell}}\rangle states into the excited subspace. By contrast, for semiconductors hosting Frenkel excitons, the |Φ𝐑𝐑,𝐑¯|\Phi_{{\bf R}_{\ell^{\prime}}\not={\bf R}_{\ell},\overline{{\bf R}}_{\ell}}\rangle subspaces are different enough in energy to allow handling them separately.

The strong restriction 𝐑=𝐑{\bf R}_{\ell}={\bf R}_{\ell^{\prime}} that is accepted when separating the |Φ𝐑,𝐑¯|\Phi_{{\bf R}_{\ell},\overline{{\bf R}}_{\ell}}\rangle subspace from the |Φ𝐑𝐑,𝐑¯|\Phi_{{\bf R}_{\ell^{\prime}}\not={\bf R}_{\ell},\overline{{\bf R}}_{\ell}}\rangle subspaces, is physically associated with the fact that the electronic wave functions on the 𝐑{\bf R}_{\ell} and 𝐑{\bf R}_{\ell^{\prime}\not=\ell} sites have a very small wave function overlap. This supports using the tight-binding limit

ν,𝐑|𝐫𝐫|ν,𝐑=0for\langle\nu^{\prime},{\bf R}_{\ell^{\prime}}|{\bf r}\rangle\langle{\bf r}|\nu,{\bf R}_{\ell}\rangle=0\quad\quad\textrm{for}\quad\quad\ell^{\prime}\not=\ell (219)

for the lowest electronic states ν=(g,e)\nu=(g,e) that are the relevant states for Frenkel excitons. Note that this condition is stronger than

ν,𝐑|ν,𝐑=0\langle\nu^{\prime},{\bf R}_{\ell^{\prime}\not=\ell}|\nu,{\bf R}_{\ell}\rangle=0 (220)

We wish to mention that the absence of overlap between the wave functions of different lattice sites, eliminates intralevel Coulomb processes between different sites because these overlaps enter their associated scatterings. Through it, this absence also prohibits the formation of Frenkel exciton waves from electronic excitations in spin-triplet states. We will come back to this important point.

7.0.2 Slater determinants for semiconductor states with spin

(a) By following what can be done for valence Bloch states |v,𝐤n|v,{\bf k}_{n}\rangle, that can be occupied by an up-spin or down-spin electron, we are led to take the possible one-electron ground states as |±,g,𝐑|\pm,g,{\bf R}_{\ell}\rangle, where |g,𝐑|g,{\bf R}_{\ell}\rangle is the electronic ground state introduced in Eq. (209). So, the approximate ground-state wave function 𝐫2Ns,,𝐫1|Φg\langle{\bf r}_{\scriptscriptstyle{2N_{s}}},\cdots,{\bf r}_{1}|\Phi_{g}\rangle for a Ns{N_{s}}-site semiconductor crystal occupied by 2Ns2{N_{s}} up-spin and down-spin electrons reads, following Eq. (210), as

1(2Ns)!|𝐫1|+,g,𝐑1𝐫1|,g,𝐑1𝐫1|+,g,𝐑Ns𝐫1|,g,𝐑Ns𝐫2Ns|+,g,𝐑1𝐫2Ns|,g,𝐑1𝐫2Ns|+,g,𝐑Ns𝐫2Ns|,g,𝐑Ns|\frac{1}{\sqrt{(2{N_{s}})!}}\begin{vmatrix}\langle{\bf r}_{1}|+,g,{\bf R}_{1}\rangle&\langle{\bf r}_{1}|-,g,{\bf R}_{1}\rangle&\cdots&\langle{\bf r}_{1}|+,g,{\bf R}_{N_{s}}\rangle&\langle{\bf r}_{1}|-,g,{\bf R}_{N_{s}}\rangle\\ \vdots&\vdots&\ddots&\vdots&\vdots\\ \langle{\bf r}_{\scriptscriptstyle{2N_{s}}}|+,g,{\bf R}_{1}\rangle&\langle{\bf r}_{\scriptscriptstyle{2N_{s}}}|-,g,{\bf R}_{1}\rangle&\cdots&\langle{\bf r}_{2{N_{s}}}|+,g,{\bf R}_{N_{s}}\rangle&\langle{\bf r}_{\scriptscriptstyle{2N_{s}}}|-,g,{\bf R}_{N_{s}}\rangle\end{vmatrix} (221)

The above ground state is made of spin-singlet pairs on each lattice site. A simple way to catch it, is to note that for Ns=1{N_{s}}=1, this ground state reduces to

𝐫2,𝐫1|Φg=12!|𝐫1|+,g,𝐑1𝐫1|,g,𝐑1𝐫2|+,g,𝐑1𝐫2|,g,𝐑1|\displaystyle\langle{\bf r}_{2},{\bf r}_{1}|\Phi_{g}\rangle=\frac{1}{\sqrt{2!}}\begin{vmatrix}\langle{\bf r}_{1}|+,g,{\bf R}_{1}\rangle&\langle{\bf r}_{1}|-,g,{\bf R}_{1}\rangle\\ \langle{\bf r}_{2}|+,g,{\bf R}_{1}\rangle&\langle{\bf r}_{2}|-,g,{\bf R}_{1}\rangle\end{vmatrix}\hskip 113.81102pt (222)
=12(𝐫1|g,𝐑1|1+𝐫2|g,𝐑1|2𝐫1|g,𝐑1|1𝐫2|g,𝐑1|2+)\displaystyle=\frac{1}{\sqrt{2}}\Big{(}\langle{\bf r}_{1}|g,{\bf R}_{1}\rangle|1_{+}\rangle\otimes\langle{\bf r}_{2}|g,{\bf R}_{1}\rangle|2_{-}\rangle-\langle{\bf r}_{1}|g,{\bf R}_{1}\rangle|1_{-}\rangle\otimes\langle{\bf r}_{2}|g,{\bf R}_{1}\rangle|2_{+}\rangle\Big{)}

which corresponds to each electron in the ground level of the 𝐑1{\bf R}_{1} site, with the electron pair in the spin-singlet state, namely

𝐫2,𝐫1|Φg\displaystyle\langle{\bf r}_{2},{\bf r}_{1}|\Phi_{g}\rangle =\displaystyle= 𝐫1|g,𝐑1𝐫2|g,𝐑1|1+,2|1,2+2\displaystyle\langle{\bf r}_{1}|g,{\bf R}_{1}\rangle\langle{\bf r}_{2}|g,{\bf R}_{1}\rangle\frac{|1_{+},2_{-}\rangle-|1_{-},2_{+}\rangle}{\sqrt{2}} (223)
\displaystyle\equiv S𝐑1(𝐫1,𝐫2)=S𝐑1(𝐫2,𝐫1)\displaystyle S_{{\bf R}_{1}}({\bf r}_{1},{\bf r}_{2})=-S_{{\bf R}_{1}}({\bf r}_{2},{\bf r}_{1})

The above result is physically obvious because, as the |Φg|\Phi_{g}\rangle wave function must be antisymmetric with respect to the (𝐫1,𝐫2)({\bf r}_{1},{\bf r}_{2}) electron exchange, 𝐫2,𝐫1|Φg=𝐫1,𝐫2|Φg\langle{\bf r}_{2},{\bf r}_{1}|\Phi_{g}\rangle=-\langle{\bf r}_{1},{\bf r}_{2}|\Phi_{g}\rangle, two electrons with identical spatial wave functions must be in the antisymmetric spin-singlet state.

A pedestrian way to catch that this remains true for Ns>1{N_{s}}>1, is to consider Ns=2{N_{s}}=2. We then have

𝐫4,,𝐫1|Φg=14!|𝐫1|+,g,𝐑1𝐫1|,g,𝐑1𝐫2|+,g,𝐑1𝐫2|,g,𝐑1𝐫3|+,g,𝐑2𝐫3|,g,𝐑2𝐫4|+,g,𝐑2𝐫4|,g,𝐑2|\langle{\bf r}_{4},\cdots,{\bf r}_{1}|\Phi_{g}\rangle=\frac{1}{\sqrt{4!}}\begin{vmatrix}\langle{\bf r}_{1}|+,g,{\bf R}_{1}\rangle&\langle{\bf r}_{1}|-,g,{\bf R}_{1}\rangle&\cdot&\cdot\\ \langle{\bf r}_{2}|+,g,{\bf R}_{1}\rangle&\langle{\bf r}_{2}|-,g,{\bf R}_{1}\rangle&\cdot&\cdot\\ \cdot&\cdot&\langle{\bf r}_{3}|+,g,{\bf R}_{2}\rangle&\langle{\bf r}_{3}|-,g,{\bf R}_{2}\rangle\\ \cdot&\cdot&\langle{\bf r}_{4}|+,g,{\bf R}_{2}\rangle&\langle{\bf r}_{4}|-,g,{\bf R}_{2}\rangle\end{vmatrix} (224)

that we calculate by first isolating the two diagonal submatrices, namely

𝐫4,,𝐫1|Φg=(2)24!(S𝐑1(𝐫1,𝐫2)S𝐑2(𝐫3,𝐫4)+other terms)\langle{\bf r}_{4},\cdots,{\bf r}_{1}|\Phi_{g}\rangle=\frac{(\sqrt{2})^{2}}{\sqrt{4!}}\Big{(}S_{{\bf R}_{1}}({\bf r}_{1},{\bf r}_{2})S_{{\bf R}_{2}}({\bf r}_{3},{\bf r}_{4})+\textrm{other terms}\Big{)} (225)

These other terms are easy to obtain by noting that the Slater determinant (224) is antisymmetric under (𝐫i,𝐫j)({\bf r}_{i},{\bf r}_{j}) exchanges. So, the six terms that compose 𝐫4,,𝐫1|Φg\langle{\bf r}_{4},\cdots,{\bf r}_{1}|\Phi_{g}\rangle must read

𝐫4,,𝐫1|Φg\displaystyle\langle{\bf r}_{4},\cdots,{\bf r}_{1}|\Phi_{g}\rangle\!\!\! =\displaystyle= 13!(S𝐑1(𝐫1,𝐫2)S𝐑2(𝐫3,𝐫4)S𝐑1(𝐫1,𝐫3)S𝐑2(𝐫2,𝐫4)\displaystyle\!\!\!\frac{1}{\sqrt{3!}}\Big{(}S_{{\bf R}_{1}}({\bf r}_{1},{\bf r}_{2})S_{{\bf R}_{2}}({\bf r}_{3},{\bf r}_{4})-S_{{\bf R}_{1}}({\bf r}_{1},{\bf r}_{3})S_{{\bf R}_{2}}({\bf r}_{2},{\bf r}_{4}) (226)
S𝐑1(𝐫1,𝐫4)S𝐑2(𝐫3,𝐫2)S𝐑1(𝐫3,𝐫2)S𝐑2(𝐫1,𝐫4)\displaystyle\!\!\!-S_{{\bf R}_{1}}({\bf r}_{1},{\bf r}_{4})S_{{\bf R}_{2}}({\bf r}_{3},{\bf r}_{2})-S_{{\bf R}_{1}}({\bf r}_{3},{\bf r}_{2})S_{{\bf R}_{2}}({\bf r}_{1},{\bf r}_{4})
S𝐑1(𝐫4,𝐫2)S𝐑2(𝐫3,𝐫1)+S𝐑1(𝐫3,𝐫4)S𝐑2(𝐫1,𝐫2))\displaystyle\!\!\!-S_{{\bf R}_{1}}({\bf r}_{4},{\bf r}_{2})S_{{\bf R}_{2}}({\bf r}_{3},{\bf r}_{1})+S_{{\bf R}_{1}}({\bf r}_{3},{\bf r}_{4})S_{{\bf R}_{2}}({\bf r}_{1},{\bf r}_{2})\Big{)}

which proves that the |Φg|\Phi_{g}\rangle ground state is indeed made of spin-singlet pairs on each lattice site.

(b) In the same way that a valence Bloch state is replaced by a conduction Bloch state with same spin to get the excited states leading to Wannier excitons, the lowest set of excited states for semiconductors hosting Frenkel excitons in the presence of spin, follows by replacing a |σ,g,𝐑|\sigma,g,{\bf R}_{\ell}\rangle column with σ=±\sigma=\pm, by a |σ,e,𝐑|\sigma,e,{\bf R}_{\ell}\rangle column, the other columns staying unchanged. This leads us to write the wave functions for the lowest set of excited states as

𝐫2Ns,,𝐫1|Φσ;𝐑=\displaystyle\langle{\bf r}_{\scriptscriptstyle{2N_{s}}},\cdots,{\bf r}_{1}|\Phi_{\sigma;{\bf R}_{\ell}}\rangle= (227)
1(2Ns)!|𝐫1|+,g,𝐑1𝐫1|σ,e,𝐑𝐫1|,g,𝐑Ns𝐫2Ns|+,g,𝐑1𝐫2Ns|σ,e,𝐑𝐫2Ns|,g,𝐑Ns|\displaystyle\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\,\frac{1}{\sqrt{(2N_{s})!}}\begin{vmatrix}\langle{\bf r}_{1}|+,g,{\bf R}_{1}\rangle&\cdots&\langle{\bf r}_{1}|\sigma,e,{\bf R}_{\ell}\rangle&\cdots&\langle{\bf r}_{1}|-,g,{\bf R}_{N_{s}}\rangle\\ \vdots&\ddots&\vdots&\ddots&\vdots\\ \langle{\bf r}_{\scriptscriptstyle{2N_{s}}}|+,g,{\bf R}_{1}\rangle&\cdots&\langle{\bf r}_{\scriptscriptstyle{2N_{s}}}|\sigma,e,{\bf R}_{\ell}\rangle&\cdots&\langle{\bf r}_{\scriptscriptstyle{2N_{s}}}|-,g,{\bf R}_{N_{s}}\rangle\end{vmatrix}

It can be of interest to note that the 𝐑{\bf R}_{\ell} site of the |Φσ;𝐑|\Phi_{\sigma;{\bf R}_{\ell}}\rangle state now contains a linear combination of spin-singlet and spin-triplet pairs. Indeed, |Φ+;𝐑|\Phi_{+;{\bf R}_{\ell}}\rangle for Ns=1{N_{s}}=1 reads

𝐫2,𝐫1|Φ+;𝐑=12!|𝐫1|+,e,𝐑1𝐫1|,g,𝐑1𝐫2|+,e,𝐑1𝐫2|,g,𝐑1|\displaystyle\langle{\bf r}_{2},{\bf r}_{1}|\Phi_{+;{\bf R}_{\ell}}\rangle=\frac{1}{\sqrt{2!}}\begin{vmatrix}\langle{\bf r}_{1}|+,e,{\bf R}_{1}\rangle&\langle{\bf r}_{1}|-,g,{\bf R}_{1}\rangle\\ \langle{\bf r}_{2}|+,e,{\bf R}_{1}\rangle&\langle{\bf r}_{2}|-,g,{\bf R}_{1}\rangle\end{vmatrix}\hskip 113.81102pt (228)
=12(𝐫1|e,𝐑1𝐫2|g,𝐑1|1+|2𝐫1|g,𝐑1𝐫2|e,𝐑1|1|2+)\displaystyle=\frac{1}{\sqrt{2}}\Big{(}\langle{\bf r}_{1}|e,{\bf R}_{1}\rangle\langle{\bf r}_{2}|g,{\bf R}_{1}\rangle\otimes|1_{+}\rangle|2_{-}\rangle-\langle{\bf r}_{1}|g,{\bf R}_{1}\rangle\langle{\bf r}_{2}|e,{\bf R}_{1}\rangle\otimes|1_{-}\rangle|2_{+}\rangle\Big{)}

that we can rewrite in terms of spin-singlet and spin-triplet pairs as

𝐫2,𝐫1|Φ+;𝐑=\displaystyle\langle{\bf r}_{2},{\bf r}_{1}|\Phi_{+;{\bf R}_{\ell}}\rangle=\hskip 256.0748pt (229)
12(𝐫1|e,𝐑1𝐫2|g,𝐑1+𝐫1|g,𝐑1𝐫2|e,𝐑12|1+|2|1|2+2\displaystyle\frac{1}{\sqrt{2}}\bigg{(}\frac{\langle{\bf r}_{1}|e,{\bf R}_{1}\rangle\langle{\bf r}_{2}|g,{\bf R}_{1}\rangle+\langle{\bf r}_{1}|g,{\bf R}_{1}\rangle\langle{\bf r}_{2}|e,{\bf R}_{1}\rangle}{\sqrt{2}}\otimes\frac{|1_{+}\rangle|2_{-}\rangle-|1_{-}\rangle|2_{+}\rangle}{\sqrt{2}}
+𝐫1|e,𝐑1𝐫2|g,𝐑1𝐫1|g,𝐑1𝐫2|e,𝐑12|1+|2+|1|2+2)\displaystyle+\frac{\langle{\bf r}_{1}|e,{\bf R}_{1}\rangle\langle{\bf r}_{2}|g,{\bf R}_{1}\rangle-\langle{\bf r}_{1}|g,{\bf R}_{1}\rangle\langle{\bf r}_{2}|e,{\bf R}_{1}\rangle}{\sqrt{2}}\otimes\frac{|1_{+}\rangle|2_{-}\rangle+|1_{-}\rangle|2_{+}\rangle}{\sqrt{2}}\bigg{)}

each part being antisymmetric with respect to electron exchange. This triplet part can a priori exist because the two electronic wave functions on the 𝐑1{\bf R}_{1} site now are different.

(c) We wish to mention that the above |Φg|\Phi_{g}\rangle and |Φσ;𝐑|\Phi_{\sigma;{\bf R}_{\ell}}\rangle states do not exactly correspond to Knox’s formulation in two ways.

(i) One (minor) difference is that we only consider semiconductor excited states with a total spin SzS_{z} equal to zero, while Knox also considers the possibility for the excited electron to have a spin σ\sigma^{\prime} different from the spin σ\sigma of the removed ground-state electron. The resulting semiconductor excited states can then have a total spin SzS_{z} different from zero. We do not see the interest of considering such states because the two physically relevant interactions that can lead to semiconductor excited states, namely the electron-photon interaction and the Coulomb interaction, conserve the spin; so, |Φg|\Phi_{g}\rangle can hardly be coupled to σσ\sigma^{\prime}\neq\sigma excited states. Allowing them brings pairs in a spin-triplet state (S=1,Sz=±1)(S=1,S_{z}=\pm 1), while by only considering σ=σ\sigma^{\prime}=\sigma, the pairs have a total spin Sz=0S_{z}=0. Yet, from them, we can still have spin-singlet and spin-triplet states (S=(0,1),Sz=0)(S=(0,1),S_{z}=0) and disentangle their difference in order to properly answer the important question which is why interlevel Coulomb processes only exist between spin-singlet states.

(ii) The major difference with Knox’s book lies in the one-electron states that are used to describe the system. Knox considers that in the presence of spin, each atom or molecule “contributes to a core and two valence electrons”. The core includes the nucleus and a charge cloud representing the remaining electrons ([5], p.9). As a result, the core of a neutral atom or molecule has a charge 2|e|2|e|. Using such a 2|e|2|e| core to define one-electron states does not correspond to the physical situation, as explained in the main part of the present manuscript, because the excited electron also feels the repulsion of the opposite-spin electron that remains in the ground level. So, the effective opposite charge felt by the excited electron to stay on the 𝐑{\bf R}_{\ell} site, is more |e||e| than 2|e|2|e|. For this reason, we find it reasonable to keep the same one-electron states to describe the system with or without spin, the excited-state energy having the same εe\varepsilon_{e} value in both cases. As a strong support of this idea, we may remember that this exactly is what it is done in the case of Bloch states when the spin is introduced.

Actually, replacing the 2|e|2|e| core by an |e||e| effective charge in the definition of the one-electron basis, does not change the structure of the calculations, except for the term associated with electron repulsion on the same lattice site. The existence of this on-site repulsion, which physically comes from the two-body electron-electron Coulomb interaction, was briefly mentioned by Knox: he then acknowledged the intrinsic problem associated with writing a many-body state as a product of one-body states. Such a problem does not exist when using the second quantization formalism because this formulation is valid whatever the chosen basis to define the electron creation operators: a physically bad basis would eventually lead to correct results, provided that we do not neglect terms improperly.

7.0.3 Coulomb scatterings within the excited subspace

The formation of Frenkel exciton waves relies on the capability of the electronic excitation to transfer to a different lattice site. This cannot be done by the H0H_{0} one-body part of the Hamiltonian, due to the tight-binding limit (219) that is required to have the lowest excited subspace with excitations on the same lattice site only, as previously discussed. Indeed, Φσ,𝐑|H0|Φσ;𝐑\langle\Phi_{\sigma^{\prime},{\bf R}_{\ell^{\prime}}}|H_{0}|\Phi_{\sigma;{\bf R}_{\ell}}\rangle contains integrals like

d3re,𝐑|𝐫H0𝐫|e,𝐑\int d^{3}r\langle e,{\bf R}_{\ell^{\prime}}|{\bf r}\rangle H_{0}\langle{\bf r}|e,{\bf R}_{\ell}\rangle (230)

which reduce to zero in the absence of wave function overlap when \ell^{\prime}\not=\ell.

The interesting part of the Hamiltonian is concentrated into the two-body electron-electron interaction which reads in first quantization as

Vee=12ijie2|𝐫i𝐫j|V_{e-e}=\frac{1}{2}\sum_{i}\sum_{j\not=i}\frac{e^{2}}{|{\bf r}_{i}-{\bf r}_{j}|} (231)
Refer to caption
Figure 15: Coulomb processes between Φ𝐑2|\langle\Phi_{{\bf R}_{2}}| and |Φ𝐑1|\Phi_{{\bf R}_{1}}\rangle in the case of just two lattice sites, 𝐑1{\bf R}_{1} and 𝐑2{\bf R}_{2}. (a) Interlevel processes on the same lattice site. (b) Intralevel processes between the two lattice sites. These intralevel processes must be neglected in the tight-binding limit, in consistency with only considering excited states on the same lattice site.

\bullet To better catch that the tight-binding limit reduces the Coulomb interaction to interlevel processes, let us first forget the electron spin. The possible coupling between excitations on different lattice sites, relies on the nonzero value of Φ𝐑|Vee|Φ𝐑\langle\Phi_{{\bf R}_{\ell^{\prime}}}|V_{e-e}|\Phi_{{\bf R}_{\ell}}\rangle for \ell^{\prime}\not=\ell. The simplest way to catch the form of this scattering again is to consider Ns=2N_{s}=2 that is, 𝐑=𝐑1{\bf R}_{\ell}={\bf R}_{1} and 𝐑=𝐑2{\bf R}_{\ell^{\prime}}={\bf R}_{2}. According to Eq. (218), we find

Φ𝐑2|Vee|Φ𝐑1=\displaystyle\langle\Phi_{{\bf R}_{2}}|V_{e-e}|\Phi_{{\bf R}_{1}}\rangle=\hskip 241.84842pt (232)
12!d3r1d3r2|𝐫1|g,𝐑1𝐫1|e,𝐑2𝐫2|g,𝐑1𝐫2|e,𝐑2|e2|𝐫1𝐫2||𝐫1|e,𝐑1𝐫1|g,𝐑2𝐫2|e,𝐑1𝐫2|g,𝐑2|\displaystyle\frac{1}{2!}\iint d^{3}r_{1}d^{3}r_{2}\begin{vmatrix}\langle{\bf r}_{1}|g,{\bf R}_{1}\rangle&\langle{\bf r}_{1}|e,{\bf R}_{2}\rangle\\ \langle{\bf r}_{2}|g,{\bf R}_{1}\rangle&\langle{\bf r}_{2}|e,{\bf R}_{2}\rangle\end{vmatrix}^{\ast}\frac{e^{2}}{|{\bf r}_{1}-{\bf r}_{2}|}\begin{vmatrix}\langle{\bf r}_{1}|e,{\bf R}_{1}\rangle&\langle{\bf r}_{1}|g,{\bf R}_{2}\rangle\\ \langle{\bf r}_{2}|e,{\bf R}_{1}\rangle&\langle{\bf r}_{2}|g,{\bf R}_{2}\rangle\end{vmatrix}

which is equal to

12!d3r1d3r2e2|𝐫1𝐫2|(g,𝐑1|𝐫1e,𝐑2|𝐫2e,𝐑2|𝐫1g,𝐑1|𝐫2)\displaystyle\frac{1}{2!}\iint d^{3}r_{1}d^{3}r_{2}\frac{e^{2}}{|{\bf r}_{1}-{\bf r}_{2}|}\Big{(}\langle g,{\bf R}_{1}|{\bf r}_{1}\rangle\langle e,{\bf R}_{2}|{\bf r}_{2}\rangle-\langle e,{\bf R}_{2}|{\bf r}_{1}\rangle\langle g,{\bf R}_{1}|{\bf r}_{2}\rangle\Big{)}
(𝐫1|e,𝐑1𝐫2|g,𝐑2𝐫1|g,𝐑2𝐫2|e,𝐑1)\displaystyle\Big{(}\langle{\bf r}_{1}|e,{\bf R}_{1}\rangle\langle{\bf r}_{2}|g,{\bf R}_{2}\rangle-\langle{\bf r}_{1}|g,{\bf R}_{2}\rangle\langle{\bf r}_{2}|e,{\bf R}_{1}\rangle\Big{)} (233)

The above quantity contains two conceptually different terms. The term

d3r1d3r2e2|𝐫1𝐫2|(g,𝐑1|𝐫1𝐫1|e,𝐑1)(e,𝐑2|𝐫2𝐫2|g,𝐑2)\iint d^{3}r_{1}d^{3}r_{2}\frac{e^{2}}{|{\bf r}_{1}-{\bf r}_{2}|}\Big{(}\langle g,{\bf R}_{1}|{\bf r}_{1}\rangle\langle{\bf r}_{1}|e,{\bf R}_{1}\rangle\Big{)}\Big{(}\langle e,{\bf R}_{2}|{\bf r}_{2}\rangle\langle{\bf r}_{2}|g,{\bf R}_{2}\rangle\Big{)} (234)

corresponds to interlevel Coulomb processes on the same lattice site[34, 35]: the 𝐫1{\bf r}_{1} electron goes from the ee level to the gg level on the same 𝐑1{\bf R}_{1} site (see Fig. 15(a)). The other term

d3r1d3r2e2|𝐫1𝐫2|(e,𝐑2|𝐫1𝐫1|e,𝐑1)(g,𝐑1|𝐫2𝐫2|g,𝐑2)-\iint d^{3}r_{1}d^{3}r_{2}\frac{e^{2}}{|{\bf r}_{1}-{\bf r}_{2}|}\Big{(}\langle e,{\bf R}_{2}|{\bf r}_{1}\rangle\langle{\bf r}_{1}|e,{\bf R}_{1}\rangle\Big{)}\Big{(}\langle g,{\bf R}_{1}|{\bf r}_{2}\rangle\langle{\bf r}_{2}|g,{\bf R}_{2}\rangle\Big{)} (235)

corresponds to intralevel Coulomb processes between two lattice sites[46]: the 𝐫1{\bf r}_{1} electron stays in the same ee level, but moves from the 𝐑1{\bf R}_{1} site to the 𝐑2{\bf R}_{2} site (see Fig. 15(b)). This second term has to be neglected in the tight-binding limit because it contains wave function overlaps between different sites. Knox mentions that this intralevel term, which appears in his Eq. (3.7), decreases exponentially with the lattice site distance (see comment below his Eq. (3.9)), due to the poor wave function overlap. We think that this term should be simply dropped from the VeeV_{e-e} matrix elements, in order to be consistent with the very first line on Frenkel excitons, i.e., when it is accepted that the excited subspace only contains excitations on the same lattice site.

By noting that there are two terms similar to Eq. (234) that come from exchanging (𝐫1,𝐫2)({\bf r}_{1},{\bf r}_{2}) in Eq. (233), we ultimately find for Ns=2N_{s}=2,

Φ𝐑2|Vee|Φ𝐑1=\displaystyle\langle\Phi_{{\bf R}_{2}}|V_{e-e}|\Phi_{{\bf R}_{1}}\rangle=\hskip 227.62204pt
d3r1d3r2e2|𝐫1𝐫2|(g,𝐑1|𝐫1𝐫1|e,𝐑1)(e,𝐑2|𝐫2𝐫2|g,𝐑2)\displaystyle\iint d^{3}r_{1}d^{3}r_{2}\frac{e^{2}}{|{\bf r}_{1}-{\bf r}_{2}|}\Big{(}\langle g,{\bf R}_{1}|{\bf r}_{1}\rangle\langle{\bf r}_{1}|e,{\bf R}_{1}\rangle\Big{)}\Big{(}\langle e,{\bf R}_{2}|{\bf r}_{2}\rangle\langle{\bf r}_{2}|g,{\bf R}_{2}\rangle\Big{)} (236)

This evidences that the Coulomb scatterings do reduce to interlevel processes on the same lattice site.

The above calculation performed for Ns=2N_{s}=2 can be extended to larger NsN_{s}, by noting that the 𝐫1|e,𝐑1\langle{\bf r}_{1}|e,{\bf R}_{1}\rangle wave function will appear not only with g,𝐑1|𝐫1\langle g,{\bf R}_{1}|{\bf r}_{1}\rangle or g,𝐑2|𝐫1\langle g,{\bf R}_{2}|{\bf r}_{1}\rangle as for Ns=2N_{s}=2, but also with g,𝐑3|𝐫1\langle g,{\bf R}_{3}|{\bf r}_{1}\rangle and so on… Due to the absence of wave function overlap between different lattice sites, the unique term that contributes to the VeeV_{e-e} matrix element, still is g,𝐑1|𝐫1𝐫1|e,𝐑1\langle g,{\bf R}_{1}|{\bf r}_{1}\rangle\langle{\bf r}_{1}|e,{\bf R}_{1}\rangle which corresponds to the 𝐫1{\bf r}_{1} electron going from the gg to the ee level of the 𝐑1{\bf R}_{1} site.

Refer to caption
Figure 16: Interlevel Coulomb processes between Φ𝐑|\langle\Phi_{{\bf R}_{\ell^{\prime}}}| and |Φ𝐑|\Phi_{{\bf R}_{\ell}}\rangle.

A careful counting of the possible 𝐫i{\bf r}_{i} permutations ultimately leads, in the case of NsN_{s} lattice sites, to

Φ𝐑|Vee|Φ𝐑=\displaystyle\langle\Phi_{{\bf R}_{\ell^{\prime}}}|V_{e-e}|\Phi_{{\bf R}_{\ell}}\rangle=\hskip 227.62204pt
d3r1d3r2e2|𝐫1𝐫2|(g,𝐑|𝐫1𝐫1|e,𝐑)(e,𝐑|𝐫2𝐫2|g,𝐑)\displaystyle\iint d^{3}r_{1}d^{3}r_{2}\frac{e^{2}}{|{\bf r}_{1}-{\bf r}_{2}|}\Big{(}\langle g,{\bf R}_{\ell}|{\bf r}_{1}\rangle\langle{\bf r}_{1}|e,{\bf R}_{\ell}\rangle\Big{)}\Big{(}\langle e,{\bf R}_{\ell^{\prime}}|{\bf r}_{2}\rangle\langle{\bf r}_{2}|g,{\bf R}_{\ell^{\prime}}\rangle\Big{)} (237)

which corresponds to the interlevel processes shown in Fig. 16.

It should be noted that the above matrix element only depends on 𝐑𝐑{\bf R}_{\ell^{\prime}}-{\bf R}_{\ell}. This follows from the fact that the electronic wave functions are such that 𝐫1|ν,𝐑=𝐫1𝐑|ν\langle{\bf r}_{1}|\nu,{\bf R}_{\ell}\rangle=\langle{\bf r}_{1}-{\bf R}_{\ell}|\nu\rangle. So, for 𝐫1=𝐫1𝐑{\bf r}^{\prime}_{1}={\bf r}_{1}-{\bf R}_{\ell} and 𝐫2=𝐫2𝐑{\bf r}^{\prime}_{2}={\bf r}_{2}-{\bf R}_{\ell^{\prime}}, the above equation also reads as

d3r1d3r2e2|𝐫1𝐫2+𝐑𝐑|(g|𝐫1𝐫1|e)(e|𝐫2𝐫2|g)\iint d^{3}r^{\prime}_{1}d^{3}r^{\prime}_{2}\frac{e^{2}}{|{\bf r}^{\prime}_{1}-{\bf r}^{\prime}_{2}+{\bf R}_{\ell}-{\bf R}_{\ell^{\prime}}|}\Big{(}\langle g|{\bf r}^{\prime}_{1}\rangle\langle{\bf r}^{\prime}_{1}|e\rangle\Big{)}\Big{(}\langle e|{\bf r}^{\prime}_{2}\rangle\langle{\bf r}^{\prime}_{2}|g\rangle\Big{)} (238)

\bullet Although calculations including the spin degree of freedom are fundamentally performed in the same way as in the absence of spin, they definitely are more tricky because in the very end, we have to show that the interlevel Coulomb processes that control the VeeV_{e-e} matrix element, only exist for spin-singlet pairs. Since the difference between spin-singlet and spin-triplet is just a sign, we have to be extremely careful when calculating quantities involving Slater determinants. Although the two electrons on the excited site can be written through their spin-singlet and spin-triplet combinations, as previously shown, we find it more transparent to handle the two electrons independently, instead of as spin-singlet and spin-triplet electron pairs, as done in Knox’s book.

Here again, let us consider Ns=2N_{s}=2. The excited subspace is then made of four excited states, |Φ+;𝐑1|\Phi_{+;{\bf R}_{1}}\rangle, |Φ;𝐑1|\Phi_{-;{\bf R}_{1}}\rangle, |Φ+;𝐑2|\Phi_{+;{\bf R}_{2}}\rangle, and |Φ;𝐑2|\Phi_{-;{\bf R}_{2}}\rangle. According to Eq. (227), a possible Coulomb scattering between different lattice sites, reads

Φ+;𝐑2|Vee|Φ+;𝐑1=14!d3r1d3r2d3r3d3r4\displaystyle\langle\Phi_{+;{\bf R}_{2}}|V_{e-e}|\Phi_{+;{\bf R}_{1}}\rangle=\frac{1}{4!}\iiiint d^{3}r_{1}d^{3}r_{2}d^{3}r_{3}d^{3}r_{4}\hskip 85.35826pt
|𝐫1|+,g,𝐑1𝐫1|,g,𝐑1𝐫1|+,e,𝐑2𝐫1|,g,𝐑2|\displaystyle\begin{vmatrix}\langle{\bf r}_{1}|+,g,{\bf R}_{1}\rangle&\langle{\bf r}_{1}|-,g,{\bf R}_{1}\rangle&\langle{\bf r}_{1}|+,e,{\bf R}_{2}\rangle&\langle{\bf r}_{1}|-,g,{\bf R}_{2}\rangle\\ \vdots&\vdots&\vdots&\vdots\end{vmatrix}^{\ast}
Vee|𝐫1|+,e,𝐑1𝐫1|,g,𝐑1𝐫1|+,g,𝐑2𝐫1|,g,𝐑2|\displaystyle V_{e-e}\begin{vmatrix}\langle{\bf r}_{1}|+,e,{\bf R}_{1}\rangle&\langle{\bf r}_{1}|-,g,{\bf R}_{1}\rangle&\langle{\bf r}_{1}|+,g,{\bf R}_{2}\rangle&\langle{\bf r}_{1}|-,g,{\bf R}_{2}\rangle\\ \vdots&\vdots&\vdots&\vdots\end{vmatrix} (239)

Since the |,g,𝐑1|-,g,{\bf R}_{1}\rangle columns and the |,g,𝐑2|-,g,{\bf R}_{2}\rangle column of the above Slater determinants are the same, the nonzero contributions to the above integral reduce to terms made from the other columns, namely

d3r1d3r2|𝐫1|+,g,𝐑1𝐫1|+,e,𝐑2𝐫2|+,g,𝐑1𝐫2|+,e,𝐑2|e2|𝐫1𝐫2||𝐫1|+,e,𝐑1𝐫1|+,g,𝐑2𝐫2|+,e,𝐑1𝐫2|+,g,𝐑2|\iint d^{3}r_{1}d^{3}r_{2}\begin{vmatrix}\langle{\bf r}_{1}|+,g,{\bf R}_{1}\rangle&\langle{\bf r}_{1}|+,e,{\bf R}_{2}\rangle\\ \langle{\bf r}_{2}|+,g,{\bf R}_{1}\rangle&\langle{\bf r}_{2}|+,e,{\bf R}_{2}\rangle\end{vmatrix}^{\ast}\frac{e^{2}}{|{\bf r}_{1}-{\bf r}_{2}|}\begin{vmatrix}\langle{\bf r}_{1}|+,e,{\bf R}_{1}\rangle&\langle{\bf r}_{1}|+,g,{\bf R}_{2}\rangle\\ \langle{\bf r}_{2}|+,e,{\bf R}_{1}\rangle&\langle{\bf r}_{2}|+,g,{\bf R}_{2}\rangle\end{vmatrix} (240)

which also reads, due to the (𝐫1𝐫2)({\bf r}_{1}\longleftrightarrow{\bf r}_{2}) permutation,

2d3r1d3r2e2|𝐫1𝐫2|[(+,g,𝐑1|𝐫1𝐫1|+,e,𝐑1)(+,e,𝐑2|𝐫2𝐫2|+,g,𝐑2)\displaystyle 2\iint d^{3}r_{1}d^{3}r_{2}\frac{e^{2}}{|{\bf r}_{1}-{\bf r}_{2}|}\bigg{[}\Big{(}\langle+,g,{\bf R}_{1}|{\bf r}_{1}\rangle\langle{\bf r}_{1}|+,e,{\bf R}_{1}\rangle\Big{)}\Big{(}\langle+,e,{\bf R}_{2}|{\bf r}_{2}\rangle\langle{\bf r}_{2}|+,g,{\bf R}_{2}\rangle\Big{)}
(+,e,𝐑2|𝐫1𝐫1|+,e,𝐑1)(+,g,𝐑1|𝐫2𝐫2|+,g,𝐑2)]\displaystyle-\Big{(}\langle+,e,{\bf R}_{2}|{\bf r}_{1}\rangle\langle{\bf r}_{1}|+,e,{\bf R}_{1}\rangle\Big{)}\Big{(}\langle+,g,{\bf R}_{1}|{\bf r}_{2}\rangle\langle{\bf r}_{2}|+,g,{\bf R}_{2}\rangle\Big{)}\bigg{]} (241)

the second term being equal to zero in the tight-binding limit for it contains wave function overlap between different sites, 𝐑1{\bf R}_{1} and 𝐑2{\bf R}_{2}.

A careful counting of the 𝐫i{\bf r}_{i} permutations in Eq. (239) ultimately gives

Φ+;𝐑2|Vee|Φ+;𝐑1=\displaystyle\langle\Phi_{+;{\bf R}_{2}}|V_{e-e}|\Phi_{+;{\bf R}_{1}}\rangle=\hskip 213.39566pt (242)
d3r1d3r2e2|𝐫1𝐫2|(+,g,𝐑1|𝐫1𝐫1|+,e,𝐑1)(+,e,𝐑2|𝐫2𝐫2|+,g,𝐑2)\displaystyle\iint d^{3}r_{1}d^{3}r_{2}\frac{e^{2}}{|{\bf r}_{1}-{\bf r}_{2}|}\Big{(}\langle+,g,{\bf R}_{1}|{\bf r}_{1}\rangle\langle{\bf r}_{1}|+,e,{\bf R}_{1}\rangle\Big{)}\Big{(}\langle+,e,{\bf R}_{2}|{\bf r}_{2}\rangle\langle{\bf r}_{2}|+,g,{\bf R}_{2}\rangle\Big{)}

In the same way, the nonzero contributions to

Φ+;𝐑2|Vee|Φ;𝐑1=14!d3r1d3r2d3r3d3r4\displaystyle\langle\Phi_{+;{\bf R}_{2}}|V_{e-e}|\Phi_{-;{\bf R}_{1}}\rangle=\frac{1}{4!}\iiiint d^{3}r_{1}d^{3}r_{2}d^{3}r_{3}d^{3}r_{4}\hskip 85.35826pt
×|𝐫1|+,g,𝐑1𝐫1|,g,𝐑1𝐫1|+,e,𝐑2𝐫1|,g,𝐑2|\displaystyle\times\begin{vmatrix}\langle{\bf r}_{1}|+,g,{\bf R}_{1}\rangle&\langle{\bf r}_{1}|-,g,{\bf R}_{1}\rangle&\langle{\bf r}_{1}|+,e,{\bf R}_{2}\rangle&\langle{\bf r}_{1}|-,g,{\bf R}_{2}\rangle\\ \vdots&\vdots&\vdots&\vdots\end{vmatrix}^{\ast}
×Vee|𝐫1|+,g,𝐑1𝐫1|,e,𝐑1𝐫1|+,g,𝐑2𝐫1|,g,𝐑2|\displaystyle\times V_{e-e}\begin{vmatrix}\langle{\bf r}_{1}|+,g,{\bf R}_{1}\rangle&\langle{\bf r}_{1}|-,e,{\bf R}_{1}\rangle&\langle{\bf r}_{1}|+,g,{\bf R}_{2}\rangle&\langle{\bf r}_{1}|-,g,{\bf R}_{2}\rangle\\ \vdots&\vdots&\vdots&\vdots\end{vmatrix} (243)

in which the |+,g,𝐑1|+,g,{\bf R}_{1}\rangle column and the |,g,𝐑2|-,g,{\bf R}_{2}\rangle column now are the same, are made from the other two columns, namely

d3r1d3r2|𝐫1|,g,𝐑1𝐫1|+,e,𝐑2𝐫2|,g,𝐑1𝐫2|+,e,𝐑2|e2|𝐫1𝐫2||𝐫1|,e,𝐑1𝐫1|+,g,𝐑2𝐫2|,e,𝐑1𝐫2|+,g,𝐑2|\iint d^{3}r_{1}d^{3}r_{2}\begin{vmatrix}\langle{\bf r}_{1}|-,g,{\bf R}_{1}\rangle&\langle{\bf r}_{1}|+,e,{\bf R}_{2}\rangle\\ \langle{\bf r}_{2}|-,g,{\bf R}_{1}\rangle&\langle{\bf r}_{2}|+,e,{\bf R}_{2}\rangle\end{vmatrix}^{\ast}\frac{e^{2}}{|{\bf r}_{1}-{\bf r}_{2}|}\begin{vmatrix}\langle{\bf r}_{1}|-,e,{\bf R}_{1}\rangle&\langle{\bf r}_{1}|+,g,{\bf R}_{2}\rangle\\ \langle{\bf r}_{2}|-,e,{\bf R}_{1}\rangle&\langle{\bf r}_{2}|+,g,{\bf R}_{2}\rangle\end{vmatrix} (244)

The above quantity, equal to

2d3r1d3r2e2|𝐫1𝐫2|[(,g,𝐑1|𝐫1𝐫1|,e,𝐑1)(+,e,𝐑2|𝐫2𝐫2|+,g,𝐑2)\displaystyle 2\iint d^{3}r_{1}d^{3}r_{2}\frac{e^{2}}{|{\bf r}_{1}-{\bf r}_{2}|}\bigg{[}\Big{(}\langle-,g,{\bf R}_{1}|{\bf r}_{1}\rangle\langle{\bf r}_{1}|-,e,{\bf R}_{1}\rangle\Big{)}\Big{(}\langle+,e,{\bf R}_{2}|{\bf r}_{2}\rangle\langle{\bf r}_{2}|+,g,{\bf R}_{2}\rangle\Big{)}
(+,e,𝐑2|𝐫1𝐫1|,e,𝐑1)(,g,𝐑1|𝐫2𝐫2|+,g,𝐑2)]\displaystyle-\Big{(}\langle+,e,{\bf R}_{2}|{\bf r}_{1}\rangle\langle{\bf r}_{1}|-,e,{\bf R}_{1}\rangle\Big{)}\Big{(}\langle-,g,{\bf R}_{1}|{\bf r}_{2}\rangle\langle{\bf r}_{2}|+,g,{\bf R}_{2}\rangle\Big{)}\bigg{]} (245)

also has a second term equal to zero, but for a deeper reason: its spin part +|\langle+|-\rangle is equal to zero, independently from the tight-binding limit, as in the case of Eq. (241). So, we recover the previous result (242), which extends to any |Φσ;𝐑|\Phi_{\sigma;{\bf R}}\rangle state.

We thus end with

Φ+;𝐑2|Vee|Φ+;𝐑1=Φσ;𝐑2|Vee|Φσ;𝐑1\langle\Phi_{+;{\bf R}_{2}}|V_{e-e}|\Phi_{+;{\bf R}_{1}}\rangle=\langle\Phi_{\sigma^{\prime};{\bf R}_{2}}|V_{e-e}|\Phi_{\sigma;{\bf R}_{1}}\rangle (246)

whatever (σ,σ)(\sigma^{\prime},\sigma). The VeeV_{e-e} matrix elements that differ from zero for 𝐑1𝐑2{\bf R}_{1}\not={\bf R}_{2} in the tight-binding limit, physically come from interlevel Coulomb processes on the same lattice site, as given in Eq. (242). The physical reason for these matrix elements to be independent of (σ,σ)(\sigma^{\prime},\sigma) is that they correspond to on-site processes (see Fig. 17(a)). Note that the second term of Eq. (245) associated with the intralevel Coulomb interaction, that corresponds to process between sites, would impose a link between the spins of the (𝐑1,𝐑2)({\bf R}_{1},{\bf R}_{2}) excited electrons, in order to differ from zero (see Fig. 17(b)).

Refer to caption
Figure 17: The interlevel Coulomb processes between lattice sites (𝐑2,𝐑1)({\bf R}_{2},{\bf R}_{1}) shown in (a) exist whatever the electron spins (σ,σ)(\sigma^{\prime},\sigma), while the intralevel Coulomb processes shown in (b), would require the two spins to be equal, σ=σ\sigma^{\prime}=\sigma, since electrons keep their spin under a Coulomb scattering. These intralevel processes have to be neglected, as a consequence of the tight-binding limit.

The last step is to show that such interlevel Coulomb processes only exist for lattice sites hosting electron pairs in a spin-singlet state. This feature follows from the fact that the two electrons on the 𝐑1{\bf R}_{1} site of |Φ+;𝐑1|\Phi_{+;{\bf R}_{1}}\rangle are in the electronic states |+,e,𝐑1|+,e,{\bf R}_{1}\rangle and |,g,𝐑1|-,g,{\bf R}_{1}\rangle, these states being occupied by electrons with opposite spins. Their associated wave function must read, due to its antisymmetry,

|Φ+;𝐑1\displaystyle|\Phi_{+;{\bf R}_{1}}\rangle\!\!\! \displaystyle\Longrightarrow 𝐫1|+,e,𝐑1𝐫2|,g,𝐑1𝐫1|,g,𝐑1𝐫2|+,e,𝐑1\displaystyle\!\!\!\langle{\bf r}_{1}|+,e,{\bf R}_{1}\rangle\langle{\bf r}_{2}|-,g,{\bf R}_{1}\rangle-\langle{\bf r}_{1}|-,g,{\bf R}_{1}\rangle\langle{\bf r}_{2}|+,e,{\bf R}_{1}\rangle
=𝐫1|e,𝐑1𝐫2|g,𝐑1|1+,2𝐫1|g,𝐑1𝐫2|e,𝐑1|1,2+\displaystyle\!\!\!=\langle{\bf r}_{1}|e,{\bf R}_{1}\rangle\langle{\bf r}_{2}|g,{\bf R}_{1}\rangle\otimes|1_{+},2_{-}\rangle-\langle{\bf r}_{1}|g,{\bf R}_{1}\rangle\langle{\bf r}_{2}|e,{\bf R}_{1}\rangle\otimes|1_{-},2_{+}\rangle

In the same way, the wave function associated with the two electrons on the 𝐑1{\bf R}_{1} site of |Φ;𝐑1|\Phi_{-;{\bf R}_{1}}\rangle reads

|Φ;𝐑1\displaystyle|\Phi_{-;{\bf R}_{1}}\rangle\!\!\! \displaystyle\Longrightarrow 𝐫1|+,g,𝐑1𝐫2|,e,𝐑1𝐫1|,e,𝐑1𝐫2|+,g,𝐑1\displaystyle\!\!\!\langle{\bf r}_{1}|+,g,{\bf R}_{1}\rangle\langle{\bf r}_{2}|-,e,{\bf R}_{1}\rangle-\langle{\bf r}_{1}|-,e,{\bf R}_{1}\rangle\langle{\bf r}_{2}|+,g,{\bf R}_{1}\rangle
=𝐫1|g,𝐑1𝐫2|e,𝐑1|1+,2𝐫1|e,𝐑1𝐫2|g,𝐑1|1,2+\displaystyle\!\!\!=\langle{\bf r}_{1}|g,{\bf R}_{1}\rangle\langle{\bf r}_{2}|e,{\bf R}_{1}\rangle\otimes|1_{+},2_{-}\rangle-\langle{\bf r}_{1}|e,{\bf R}_{1}\rangle\langle{\bf r}_{2}|g,{\bf R}_{1}\rangle\otimes|1_{-},2_{+}\rangle

As a result, the wave functions obtained from their sum and difference correspond to a spin-singlet and a spin-triplet electron pair on the 𝐑1{\bf R}_{1} site: indeed, for the sum, we find

|Φ+;𝐑1+|Φ;𝐑1(𝐫1|e,𝐑1𝐫2|g,𝐑1+𝐫1|g,𝐑1𝐫2|e,𝐑1)\displaystyle|\Phi_{+;{\bf R}_{1}}\rangle+|\Phi_{-;{\bf R}_{1}}\rangle\Longrightarrow\Big{(}\langle{\bf r}_{1}|e,{\bf R}_{1}\rangle\langle{\bf r}_{2}|g,{\bf R}_{1}\rangle+\langle{\bf r}_{1}|g,{\bf R}_{1}\rangle\langle{\bf r}_{2}|e,{\bf R}_{1}\rangle\Big{)}
(|1+,2|1,2+)\displaystyle\Big{(}|1_{+},2_{-}\rangle-|1_{-},2_{+}\rangle\Big{)} (249)

while for the difference, we get

|Φ+;𝐑1|Φ;𝐑1(𝐫1|e,𝐑1𝐫2|g,𝐑1𝐫1|g,𝐑1𝐫2|e,𝐑1)\displaystyle|\Phi_{+;{\bf R}_{1}}\rangle-|\Phi_{-;{\bf R}_{1}}\rangle\Longrightarrow\Big{(}\langle{\bf r}_{1}|e,{\bf R}_{1}\rangle\langle{\bf r}_{2}|g,{\bf R}_{1}\rangle-\langle{\bf r}_{1}|g,{\bf R}_{1}\rangle\langle{\bf r}_{2}|e,{\bf R}_{1}\rangle\Big{)}
(|1+,2+|1,2+)\displaystyle\Big{(}|1_{+},2_{-}\rangle+|1_{-},2_{+}\rangle\Big{)} (250)

We can check that, as expected, these two combinations are antisymmetric with respect to the (𝐫1𝐫2)({\bf r}_{1}\longleftrightarrow{\bf r}_{2}) exchange.

This leads us to introduce the spin-singlet (S=0,Sz=0)(S=0,S_{z}=0) and spin-triplet (S=1,Sz=0)(S=1,S_{z}=0) combinations of excited states with electronic excitation on the 𝐑1{\bf R}_{1} site,

|ΦS;𝐑1=|Φ+;𝐑1+(1)S|Φ;𝐑12|\Phi_{S;{\bf R}_{1}}\rangle=\frac{|\Phi_{+;{\bf R}_{1}}\rangle+(-1)^{S}|\Phi_{-;{\bf R}_{1}}\rangle}{\sqrt{2}} (251)

for S=(0,1)S=(0,1), these combinations being such that

ΦS2;𝐑2|ΦS1;𝐑1=δS2,S1δ𝐑2,𝐑1\langle\Phi_{S_{2};{\bf R}_{2}}|\Phi_{S_{1};{\bf R}_{1}}\rangle=\delta_{S_{2},S_{1}}\delta_{{\bf R}_{2},{\bf R}_{1}} (252)

in the tight-binding limit. It is then possible to show, with the help of Eqs. (246,249,250), that

ΦS2;𝐑2|Vee|ΦS1;𝐑1\displaystyle\langle\Phi_{S_{2};{\bf R}_{2}}|V_{e-e}|\Phi_{S_{1};{\bf R}_{1}}\rangle =\displaystyle= (1+(1)S1)(1+(1)S2)4Φ+;𝐑2|Vee|Φ+;𝐑1\displaystyle\frac{\Big{(}1+(-1)^{S_{1}}\Big{)}\Big{(}1+(-1)^{S_{2}}\Big{)}}{4}\langle\Phi_{+;{\bf R}_{2}}|V_{e-e}|\Phi_{+;{\bf R}_{1}}\rangle (253)
=\displaystyle= δS2,S1δS1,0Φ+;𝐑2|Vee|Φ+;𝐑1\displaystyle\delta_{S_{2},S_{1}}\delta_{S_{1},0}\langle\Phi_{+;{\bf R}_{2}}|V_{e-e}|\Phi_{+;{\bf R}_{1}}\rangle

with Φ+;𝐑2|Vee|Φ+;𝐑1\langle\Phi_{+;{\bf R}_{2}}|V_{e-e}|\Phi_{+;{\bf R}_{1}}\rangle given in Eq. (242). The above result proves that the interlevel Coulomb processes that control the electron-electron scattering only exist between lattice sites hosting spin-singlet electron pairs, an important result that is not at all obvious, and a priori far more tricky to derive when using Slater determinants than operators as in the second quantization formalism.

\bullet Let us now relate the above calculations to the results found in Knox’s book[5]. The intrinsic difficulty when using the first quantization formalism, is to speak in terms of holes correctly, because the Slater determinants are written in terms of electron states. Of course, one can always say that the excitation of an electron leaves a hole in the ground state and that the spin and wave vector of this hole are opposite to the ones of the electron that has been removed. However, there are some tricky sign changes when turning from electron to hole. These sign changes can be handled in a secure way within the second quantization formalism, but their handling is far more problematic within the first quantization description of the problem. Let us discuss two particular aspects of this fundamental problem.

(1) A major confusing point when using Slater determinants concerns spin-singlet and spin-triplet pairs: is Knox speaking in terms of electron pairs, as we have done above to get Eq. (251) because the natural language when using Slater determinants is in terms of electrons in the ground or excited level? Or is Knox speaking in terms of electron-hole pairs? If the latter is true, the question then is: how the change from electron-electron pair to electron-hole pair has been performed? The spin-singlet versus spin-triplet question is of importance because spin-singlet states are the only ones that suffer the interlevel Coulomb processes responsible for the Frenkel exciton formation. The δM=(1,0)\delta_{M}=(1,0) prefactor that appears in various excited-state energies (Knox[5], Eqs. (2.18, 2.29, 2.35, 3.7)), correctly tells that interlevel Coulomb processes exist for spin-singlet states only. The trouble is that these spin-singlet and spin-triplet states, associated with the correct δM\delta_{M} prefactor, were not precisely defined in the book. This is a pity because there is a sign difference between the spin of the removed electron and the spin of the corresponding hole!

To clarify this important point, it is necessary to go back to the variables that are used when writing Slater determinants: these must be electron variables, just as the ones we have used to define the spin-singlet combination in Eq. (249) and the spin-triplet combination in Eq. (250). As better seen from Eq. (7.0.3), the notation |1+,2|1_{+},2_{-}\rangle refers to 𝐫1|+,e,𝐑1𝐫2|,g,𝐑1\langle{\bf r}_{1}|+,e,{\bf R}_{1}\rangle\langle{\bf r}_{2}|-,g,{\bf R}_{1}\rangle, that is, two electrons (𝐫1,𝐫2)({\bf r}_{1},{\bf r}_{2}), with spins ++ and -, that respectively occupy the excited and ground levels of the 𝐑1{\bf R}_{1} lattice site. These spins refer to electrons; so, Eqs. (249, 250) refer to spin-singlet and spin-triplet combinations of the two electrons that occupy the 𝐑1{\bf R}_{1} site. The tricky point is that, when turning from these two electrons to one electron and one hole, we de facto drop one of the two electrons, namely, the one that remains in the ground state. The proper way to reformulate one electron-electron pair into one electron-hole pair is to say that if the 𝐑1{\bf R}_{1} lattice site is occupied by a (+,e)(+,e) electron along with a (,g)(-,g) electron, this means that the (+,g)(+,g) electron has been removed from the |Φg|\Phi_{g}\rangle ground state. Removing the (+,g)(+,g) electron from the 𝐑1{\bf R}_{1} site corresponds to creating a down-spin hole (-) on this site. So, the 𝐑1{\bf R}_{1} site has a ()(-) hole and a (+)(+) electron. Consequently, the formulation in terms of one electron-hole pair on the 𝐑1{\bf R}_{1} lattice site, instead of the two electrons on this site, follows from

|1+,2=|e+,h|1_{+},2_{-}\rangle=|e_{+},h_{-}\rangle (254)

The above equation readily proves that the spin-singlet and spin-triplet states that we have defined in terms of electron-electron pair on the 𝐑1{\bf R}_{1} lattice site, are identical to the ones defined in terms of electron-hole pair on this site (see Fig. 18). While this identity might be obvious to Knox, we find that it warrants a precise derivation.

Refer to caption
Figure 18: Occupation of the 𝐑1{\bf R}_{1} lattice site: (a) in the case of the |Φg|\Phi_{g}\rangle ground state, two up-spin and down-spin electrons are in the ground level gg of all lattice sites; (b) in the case of the excited state |Φ+;𝐑1|\Phi_{+;{\bf R}_{1}}\rangle, one up-spin electron is in the excited level and one down-spin electron is in the ground level gg; (c) the same |Φ+;𝐑1|\Phi_{+;{\bf R}_{1}}\rangle excited state, when described in terms of electron and hole, has an up-spin electron and a down-spin hole.

(2) Another confusion associated with the difficulty to speak in terms of holes when using Slater determinants, shows up in the brute-force way that the hole appears in the definition of the excited subspace (see Knox[5], Eq. (2.17)). In this part of Knox’s book, the one-particle states are taken as Bloch states. The excited state, considered in this equation, is said to correspond to “a valence electron, with spin σ-\sigma and wave vector 𝐤h{\bf k}_{h}, that is excited to a conduction state with wave vector 𝐤e{\bf k}_{e} ”, the spin possibly changing to σ\sigma^{\prime}. The Slater determinant of the corresponding excited state is said to contain the following two-electron wave function

ψv,𝐤h,σψc,𝐤e,σ\psi_{v,{\bf k}_{h},\sigma}\psi_{c,{\bf k}_{e},\sigma^{\prime}} (255)

Beside the fact that the physically relevant excited subspace is made of states having a total spin equal to zero, so that σ\sigma^{\prime} should be equal to σ-\sigma, the trouble with the above equation is the physical meaning of its wave vectors; they do not correspond to the ones associated with the aforementioned excited state: indeed, starting from the valence state (v,𝐤v)(v,{\bf k}_{v}) occupied by up-spin and down-spin electrons in the |Φg|\Phi_{g}\rangle ground state, the corresponding Slater determinant should contain the following wave function product

ψv,𝐤v,σψv,𝐤v,σ\psi_{v,{\bf k}_{v},\sigma}\psi_{v,{\bf k}_{v},-\sigma} (256)

A possible excited state is obtained by replacing one of the two valence Bloch states (v,𝐤v)(v,{\bf k}_{v}) by a conduction state (c,𝐤c)(c,{\bf k}_{c}). The Slater determinant of the corresponding excited state then contains the wave function product

ψv,𝐤v,σψc,𝐤c,σ\psi_{v,{\bf k}_{v},\sigma}\psi_{c,{\bf k}_{c},-\sigma} (257)

Since the hole that appears in the valence band when the (v,𝐤v,σ)(v,{\bf k}_{v},-\sigma) electron is removed, has a wave vector 𝐤h=𝐤v{\bf k}_{h}=-{\bf k}_{v}, the above wave function product should read in terms of electron and hole wave vectors as

ψv,𝐤h,σψc,𝐤e,σ\psi_{v,-{\bf k}_{h},\sigma}\psi_{c,{\bf k}_{e},-\sigma} (258)

for 𝐤e=𝐤c{\bf k}_{e}={\bf k}_{c}: this differs from Eq. (255) through a sign difference in the hole wave vector.

The above points evidence the intrinsic difficulty to properly speak in terms of electron-hole pairs when using the first quantization formalism because, in this formalism, the many-body wave functions appear through Slater determinants that are written in terms of electron wave functions.

7.0.4 Frenkel excitons

The preceding pages were dealing with the derivation of the Coulomb interaction in the excited subspace by using the first quantization formalism, that is, states written through wave functions in terms of Slater determinants, instead of through operators as done in the main part of the present manuscript. It is clear that handling these Slater determinants is definitely quite heavy. Moreover, turning to the physically relevant formulation in terms of system excitations, that is, electron-hole pairs, is delicate when signs matter.

In the preceding pages, we have shown that the matrix elements

ΦS2;𝐑2|Vee|ΦS1;𝐑1\langle\Phi_{S_{2};{\bf R}_{2}}|V_{e-e}|\Phi_{S_{1};{\bf R}_{1}}\rangle (259)

of the electron-electron Coulomb interaction VeeV_{e-e} between the excited states, in which the two electrons of the lattice sites 𝐑2{\bf R}_{2} or 𝐑1{\bf R}_{1} are in the spin combination S2S_{2} or S1S_{1}, differ from zero for singlet states only, (S2=S1=0)(S_{2}=S_{1}=0); the nonzero contribution to this matrix element comes from interlevel Coulomb processes on the same lattice site. This is just what has been found in the main part of the manuscript. In this main part, we also have taken into account the possibility for the spatial wave functions to be degenerate. This is easy to include when using Slater determinants: in the case of degenerate excited level, one just has to replace the excited state |e,𝐑|e,{\bf R}_{\ell}\rangle by |μ,e,𝐑|\mu,e,{\bf R}_{\ell}\rangle. The VeeV_{e-e} matrix, calculated through Slater determinants, ends with the same expression as the one derived from using the operator formalism of the second quantization. Its diagonalization then follows in exactly the same way.

To conclude this section, we can say that the unique but major problem with Knox’s approach to Frenkel excitons, is to derive the VeeV_{e-e} matrix for many-body states written as Slater determinants in terms of electrons when electron-hole pairs in spin-singlet states matter.

8 Frenkel exciton many-body effects

8.1 Composite boson nature

Frenkel excitons are linear combinations of two fermions: one electron in an electronic excited state and one electron absence in the electronic ground state of the same lattice site, or one hole in the semiconductor ground state. We have shown (see Eqs. (40, 60)) that, in the absence of spin and spatial degeneracy, the linear combination of on-site excitations that corresponds to a Frenkel exciton with wave vector 𝐊n{\bf K}_{n}, reads as

|Φ𝐊n=1Ns=1Nsei𝐊n𝐑a^e,a^g,|Φg=1Ns=1Nsei𝐊n𝐑a^b^|0|\Phi_{{\bf K}_{n}}\rangle=\frac{1}{\sqrt{N_{s}}}\sum_{\ell=1}^{N_{s}}e^{i{\bf K}_{n}\cdot{\bf R}_{\ell}}\hat{a}^{\dagger}_{e,\ell}\hat{a}_{g,\ell}|\Phi_{g}\rangle=\frac{1}{\sqrt{N_{s}}}\sum_{\ell=1}^{N_{s}}e^{i{\bf K}_{n}\cdot{\bf R}_{\ell}}\hat{a}^{\dagger}_{\ell}\hat{b}^{\dagger}_{\ell}|0\rangle (260)

This leads us to assign to the Frenkel exciton, a creation operator that appears in terms of electron and hole as

B^𝐊n=1Ns=1Nsei𝐊n𝐑a^b^\hat{B}^{\dagger}_{{\bf K}_{n}}=\frac{1}{\sqrt{N_{s}}}\sum_{\ell=1}^{N_{s}}e^{i{\bf K}_{n}\cdot{\bf R}_{\ell}}\hat{a}^{\dagger}_{\ell}\hat{b}^{\dagger}_{\ell} (261)

Being made of fermion pairs, the Frenkel excitons have a bosonic nature. This is seen from the fact that their creation operators commute

[B^𝐊n,B^𝐊n]=0\left[\hat{B}^{\dagger}_{{\bf K}_{n^{\prime}}},\hat{B}^{\dagger}_{{\bf K}_{n}}\right]_{-}=0 (262)

which readily follows from the commutation relations between different electronic states [a^g,,a^e,]+=0\Big{[}\hat{a}_{g,\ell^{\prime}},\hat{a}^{\dagger}_{e,\ell}\Big{]}_{+}=0 for a^g,=b^\hat{a}_{g,\ell^{\prime}}=\hat{b}^{\dagger}_{\ell^{\prime}}.

As for any fermion pair, their composite nature shows up through the fact that the commutator of their destruction and creation operators differs from the one of elementary bosons, as seen by calculating

[B^𝐊n,B^𝐊n]=δn,nD^𝐊n,𝐊n\left[\hat{B}_{{\bf K}_{n^{\prime}}},\hat{B}^{\dagger}_{{\bf K}_{n}}\right]_{-}=\delta_{n^{\prime},n}-\hat{D}_{{\bf K}_{n^{\prime}},{\bf K}_{n}} (263)

The difference, that we called “deviation-from-boson operator”, is precisely given by[6, 10]

D^𝐊n,𝐊n=1Ns=1Nsei(𝐊n𝐊n)𝐑(a^a^+b^b^)\hat{D}_{{\bf K}_{n^{\prime}},{\bf K}_{n}}=\frac{1}{\sqrt{N_{s}}}\sum_{\ell=1}^{N_{s}}e^{i({\bf K}_{n}-{\bf K}_{n^{\prime}})\cdot{\bf R}_{\ell}}\big{(}\hat{a}^{\dagger}_{\ell}\hat{a}_{\ell}+\hat{b}^{\dagger}_{\ell}\hat{b}_{\ell}\big{)} (264)

While this operator gives zero when acting on the electron-hole-pair vacuum |0|0\rangle, as consistent with the orthogonality of Frenkel excitons that are semiconductor eigenstates,

0|B^𝐊nB^𝐊n|0=δn,n\langle 0|\hat{B}_{{\bf K}_{n^{\prime}}}\hat{B}^{\dagger}_{{\bf K}_{n}}|0\rangle=\delta_{n^{\prime},n} (265)

the D^𝐊n,𝐊n\hat{D}_{{\bf K}_{n^{\prime}},{\bf K}_{n}} operator renders the norm of the NN-Frenkel exciton state smaller and smaller when NN increases[47]. Indeed, the iteration

[D^𝐊n1,𝐊n1,B^𝐊n]=2NsB^𝐊n+𝐊n1𝐊n1\left[\hat{D}_{{\bf K}_{n^{\prime}_{1}},{\bf K}_{n_{1}}},\hat{B}^{\dagger}_{{\bf K}_{n}}\right]_{-}=\frac{2}{N_{s}}\hat{B}^{\dagger}_{{\bf K}_{n}+{\bf K}_{n_{1}}-{\bf K}_{n^{\prime}_{1}}} (266)

that follows from Eqs. (261,264), leads to

[D^𝐊n1,𝐊n1,(B^𝐊n)N]=2NNs(B^𝐊n)N1B^𝐊n+𝐊n1𝐊n1\left[\hat{D}_{{\bf K}_{n^{\prime}_{1}},{\bf K}_{n_{1}}},\big{(}\hat{B}^{\dagger}_{{\bf K}_{n}}\big{)}^{N}\right]_{-}=\frac{2N}{N_{s}}\big{(}\hat{B}^{\dagger}_{{\bf K}_{n}}\big{)}^{N-1}\hat{B}^{\dagger}_{{\bf K}_{n}+{\bf K}_{n_{1}}-{\bf K}_{n^{\prime}_{1}}} (267)

from which we get

0|(B^𝐊n)N(B^𝐊n)N|0=N!FN\langle 0|\big{(}\hat{B}_{{\bf K}_{n}}\big{)}^{N}\big{(}\hat{B}^{\dagger}_{{\bf K}_{n}}\big{)}^{N}|0\rangle=N!F_{N} (268)

where FNF_{N} decreases with NN as

FN+1FN=1NNs\frac{F_{N+1}}{F_{N}}=1-\frac{N}{N_{s}} (269)

This norm decrease physically comes from the Pauli exclusion principle: because of this state blocking, the ground state for semiconductors hosting Frenkel excitons hosts one electron only in each electronic ground level of the NsN_{s} lattice sites (in the absence of spin and spatial degeneracies). As a result, the maximum number of electrons that can be excited is NsN_{s}; so is the maximum number of Frenkel excitons a semiconductor sample of NsN_{s} sites can handle, in agreement with Eq. (268): indeed, due to Eq. (269), the norm of the NN-Frenkel exciton state is equal to zero for N>NsN>N_{s}, which means that such state does not exist.

8.2 Dimensionless many-body parameter

The effect of the Pauli exclusion principle that appears in Eq. (268) corresponds to what we have called a “moth-eaten effect”: as more and more “little” electron-hole pairs are used to construct a NN-Frenkel exciton state, less pair states will be available for the formation of an additional exciton. For N=NsN=N_{s}, no pair state is left to add a new Frenkel exciton to the (B^𝐊n)N|0\big{(}\hat{B}^{\dagger}_{{\bf K}_{n}}\big{)}^{N}|0\rangle state: the “moths” have eaten all available electron-hole pair states.

In view of Eq. (269), the dimensionless parameter that controls many-body effects driven by the Pauli exclusion principle, corresponds in the case of Frenkel excitons to

η=NNs\eta=\frac{N}{N_{s}} (270)

This parameter appears quite different from the one that was first written for Wannier excitons, namely

η=N(aXL)3\eta=N\left(\frac{a_{{}_{X}}}{L}\right)^{3} (271)

where aXa_{{}_{X}} is the Bohr radius associated with the relative-motion extension of the electron-hole pair in a Wannier exciton. This parameter was physically understood as coming from the Coulomb interaction that occurs when two Wannier excitons, with wave vectors (𝐊,𝐊)({\bf K},{\bf K}^{\prime}) and spatial extension aXa_{{}_{X}}, overlap (see Fig. 19). Indeed, the Wannier exciton wave extends over the sample volume L3L^{3}, while the volume over which the electron and the hole of an exciton can interact is the exciton volume a3Xa^{3}_{{}_{X}}. As Frenkel excitons have a relative-motion extension that reduces to zero, since they are made of on-site excitations, the above expression could lead us to naïvely conclude that the many-body parameter for Frenkel excitons should reduce to zero.

Refer to caption
Figure 19: The dimensionless parameter (Eq. (271)) that controls Wannier exciton many-body effects can be physically understood as coming from Coulomb interaction when the two exciton waves (𝐊,𝐊)({\bf K},{\bf K}^{\prime}) overlap, due to their finite spatial extension, or Bohr radius.

Since this is hard to accept, we are forced to reconsider the physical understanding of the many-body parameter for Wannier excitons (271) in view of its expression for Frenkel exciton given in Eq. (270). To relate these two expressions, we note that NsN_{s} in Eq. (270) is the maximum number of Frenkel excitons that a semiconductor sample having NsN_{s} lattice sites can accommodate, as previously explained in view of the state norm given in Eq. (268). With this idea in mind, we then note that (L/aX)3(L/a_{{}_{X}})^{3} also is the maximum number of Wannier excitons a semiconductor sample having a L3L^{3} volume can accommodate. Indeed, for an exciton number such that Na3XL3Na^{3}_{{}_{X}}\sim L^{3}, the electron-hole pairs of the NN Wannier excitons would fill the whole sample, and dissociate into an electron-hole plasma, known as Mott dissociation. So, (L/aX)3(L/a_{{}_{X}})^{3} also is of the order of the maximum number of Wannier excitons that a L3L^{3} semiconductor sample can contain.

This leads us to conclude that both the many-body effects for Frenkel excitons and Wannier excitons are controlled by the Pauli exclusion principle through a dimensionless parameter that reads[6]

η=NNmax\eta=\frac{N}{N_{\rm max}} (272)

where NmaxN_{\rm max} is the maximum exciton number the sample can contain, that is, the number of lattice sites, NsN_{s}, in the case of Frenkel excitons, and (L/aX)3\sim(L/a_{{}_{X}})^{3} for a sample volume L3L^{3} in the case of Wannier excitons.

Interestingly, this understanding also extends to Cooper pairs that are composite bosons made of up-spin and down-spin electrons inside an energy layer Ω\Omega extending over two-phonon energy around the Fermi level, over which the BCS attractive potential acts: indeed, we find that the energy of NN Cooper pairs decreases with the pair number as η\eta given in Eq. (272), with Nmax=ρΩN_{\rm max}=\rho\Omega, where ρ\rho is the electron density of states in the Ω\Omega layer; so, ρΩ\rho\Omega is nothing but the maximum number of electron pairs that can be involved in the BCS coupling[48].

8.3 Frenkel exciton scatterings

The interaction scatterings fundamentally dictate how two particles change under an interaction. As pointed out in the case of Wannier excitons, two composite bosons made of charged fermions feel each other not only through the Coulomb interaction, but also through the Pauli exclusion principle between their fermionic components, the associated scatterings being somewhat unusual because they are dimensionless.

The Frenkel exciton many-body physics[49] has to be simpler than the one for Wannier excitons. The fundamental reason is that Frenkel excitons are made of electronic excitations on lattice sites; so, they depend on a single index, their wave vector 𝐊{\bf K}. By contrast, Wannier excitons are made of a delocalized electron 𝐤e{\bf k}_{e} and a delocalized hole 𝐤h{\bf k}_{h}; so, they depend on two indices, a wave vector and a relative-motion index. Nevertheless, the structure of the composite boson many-body formalism for Frenkel excitons closely follows the one that we have constructed for Wannier excitons[6, 50, 51].

The problem is to determine the physics that transforms two Frenkel excitons (𝐊1,𝐊2)({\bf K}_{1},{\bf K}_{2}) into two Frenkel excitons (𝐊1,𝐊2)({\bf K}^{\prime}_{1},{\bf K}^{\prime}_{2}) (see Fig. 20). This change can be induced either by a fermion exchange between two Frenkel excitons[52] or by Coulomb interactions between their charged components[53, 54].

Refer to caption
Figure 20: The scattering between Frenkel excitons (𝐊1,𝐊2)({\bf K}_{1},{\bf K}_{2}) that end as (𝐊1,𝐊2)({\bf K}^{\prime}_{1},{\bf K}^{\prime}_{2}).

8.3.1 Pauli scattering between two Frenkel excitons

The Pauli exclusion principle leads to scatterings between Frenkel excitons that follow from the deviation-from-boson operator in the commutator of Eq. (263). The fermion exchange scatterings λ(𝐊2𝐊2𝐊1𝐊1)\lambda\left(\begin{smallmatrix}{\bf K}^{\prime}_{2}&{\bf K}_{2}\\ {\bf K}^{\prime}_{1}&{\bf K}_{1}\end{smallmatrix}\right) induced by this operator, appear through a second commutation relation

[D^𝐊1,𝐊1,B^𝐊2]=𝐊2(λ(𝐊2𝐊2𝐊1𝐊1)+λ(𝐊1𝐊2𝐊2𝐊1))B^𝐊2\left[\hat{D}_{{\bf K}^{\prime}_{1},{\bf K}_{1}},\hat{B}^{\dagger}_{{\bf K}_{2}}\right]_{-}=\sum_{{\bf K}^{\prime}_{2}}\Big{(}\lambda\left(\begin{smallmatrix}{\bf K}^{\prime}_{2}&{\bf K}_{2}\\ {\bf K}^{\prime}_{1}&{\bf K}_{1}\end{smallmatrix}\right)+\lambda\left(\begin{smallmatrix}{\bf K}^{\prime}_{1}&{\bf K}_{2}\\ {\bf K}^{\prime}_{2}&{\bf K}_{1}\end{smallmatrix}\right)\Big{)}\hat{B}^{\dagger}_{{\bf K}^{\prime}_{2}} (273)

In the case of single-index Frenkel excitons, this scattering reduces, with the help of Eqs. (261,264), to

λ(𝐊2𝐊2𝐊1𝐊1)=1Nsδ𝐊1+𝐊2,𝐊1+𝐊2\lambda\left(\begin{smallmatrix}{\bf K}^{\prime}_{2}&{\bf K}_{2}\\ {\bf K}^{\prime}_{1}&{\bf K}_{1}\end{smallmatrix}\right)=\frac{1}{N_{s}}\,\delta_{{\bf K}^{\prime}_{1}+{\bf K}^{\prime}_{2},{\bf K}_{1}+{\bf K}_{2}} (274)

It just corresponds to wave vector conservation which is ubiquitous in any fermion exchange between composite bosons.

8.3.2 Coulomb scattering between two Frenkel excitons

Since the electrons and holes of two Frenkel excitons are charged particles, these excitons also feel each other through the Coulomb interaction. To derive the associated scattering, we follow the same procedure as the one for Wannier excitons: the commutation of the Frenkel exciton creation operator with the semiconductor Hamiltonian written in terms of electrons and holes, contains a naïve term that corresponds to the exciton energy, plus an operator V^𝐊\hat{V}_{\bf K}^{\dagger} that we called “Coulomb creation operator”

[H^eh,B^𝐊]=E𝐊B^𝐊+V^𝐊\left[\hat{H}_{eh},\hat{B}^{\dagger}_{{\bf K}}\right]_{-}=E_{\bf K}\hat{B}^{\dagger}_{{\bf K}}+\hat{V}_{\bf K}^{\dagger} (275)

This operator gives zero when acting on the electron-hole pair vacuum |0|0\rangle, as seen from the above equation projected onto |0|0\rangle, namely

[H^eh,B^𝐊]|0=H^ehB^𝐊|0=E𝐊B^𝐊|0+V^𝐊|0\left[\hat{H}_{eh},\hat{B}^{\dagger}_{{\bf K}}\right]_{-}|0\rangle=\hat{H}_{eh}\hat{B}^{\dagger}_{{\bf K}}|0\rangle=E_{\bf K}\hat{B}^{\dagger}_{{\bf K}}|0\rangle+\hat{V}_{\bf K}^{\dagger}|0\rangle (276)

By contrast, it leads to energy-like scatterings when acting on Frenkel excitons. The corresponding scatterings formally follow from a second commutation relation

[V^𝐊2,B^𝐊1]=𝐊1,𝐊2ξ(𝐊2𝐊2𝐊1𝐊1)B^𝐊1B^𝐊2\left[\hat{V}_{{\bf K}_{2}}^{\dagger},\hat{B}^{\dagger}_{{\bf K}_{1}}\right]_{-}=\sum_{{\bf K}^{\prime}_{1},{\bf K}^{\prime}_{2}}\xi\left(\begin{smallmatrix}{\bf K}^{\prime}_{2}&{\bf K}_{2}\\ {\bf K}^{\prime}_{1}&{\bf K}_{1}\end{smallmatrix}\right)\hat{B}^{\dagger}_{{\bf K}^{\prime}_{1}}\hat{B}^{\dagger}_{{\bf K}^{\prime}_{2}} (277)

Since the Coulomb physics between Frenkel excitons results from electron-hole pair exchange between lattice sites, it may not be surprising to find that the precise calculation of the above commutators, leads to a Coulomb scattering that simply reads in terms of the Pauli scattering for fermion exchange defined in Eq. (274), namely

ξ(𝐊2𝐊2𝐊1𝐊1)=(𝒱𝐊1+𝒱𝐊2)λ(𝐊2𝐊2𝐊1𝐊1)\xi\left(\begin{smallmatrix}{\bf K}^{\prime}_{2}&{\bf K}_{2}\\ {\bf K}^{\prime}_{1}&{\bf K}_{1}\end{smallmatrix}\right)=-\left(\mathcal{V}_{{\bf K}^{\prime}_{1}}+\mathcal{V}_{{\bf K}^{\prime}_{2}}\right)\lambda\left(\begin{smallmatrix}{\bf K}^{\prime}_{2}&{\bf K}_{2}\\ {\bf K}^{\prime}_{1}&{\bf K}_{1}\end{smallmatrix}\right) (278)

where 𝒱𝐊\mathcal{V}_{{\bf K}} is the 𝐊{\bf K}-dependent part of the Frenkel exciton energy, as necessary for this scattering to be an energy-like quantity.

8.3.3 Hamiltonian mean value

By using the above commutators, we can calculate the Hamiltonian mean value in the NN-Frenkel exciton ground state

H^ehN=0|(B^𝐊)NH^eh(B^𝐊)N|00|(B^𝐊)N(B^𝐊)N|0\langle\hat{H}_{eh}\rangle_{N}=\frac{\langle 0|\big{(}\hat{B}_{\bf{K}}\big{)}^{N}\hat{H}_{eh}\big{(}\hat{B}^{\dagger}_{\bf{K}}\big{)}^{N}|0\rangle}{\langle 0|\big{(}\hat{B}_{\bf{K}}\big{)}^{N}\big{(}\hat{B}^{\dagger}_{\bf{K}}\big{)}^{N}|0\rangle} (279)

To do it, we first push the Hamiltonian and the V^𝐊\hat{V}_{\bf K}^{\dagger} operators to the right with the help of the two commutators (276,277), in order to end with H^eh|0=0\hat{H}_{eh}|0\rangle=0 and V^𝐊|0=0\hat{V}_{\bf K}^{\dagger}|0\rangle=0. We are left with scalar products of Frenkel exciton states. To calculate them, we push the operator B^𝐊\hat{B}_{\bf K} and D^𝐊,𝐊\hat{D}_{{\bf K}^{\prime},{\bf K}} to the right with the help of the two commutators (263,273), in order to end with B^𝐊|0=0\hat{B}_{\bf K}|0\rangle=0 and D^𝐊,𝐊|0=0\hat{D}_{{\bf K}^{\prime},{\bf K}}|0\rangle=0.

For state in which all the Frenkel excitons have a zero wave vector, we find

H^ehNNE𝟎+N(N1)2ξ(𝟎𝟎𝟎𝟎)\langle\hat{H}_{eh}\rangle_{N}\simeq NE_{\bf{0}}+\frac{N(N-1)}{2}\,\xi\left(\begin{smallmatrix}\bf{0}&\bf{0}\\ \bf{0}&\bf{0}\end{smallmatrix}\right) (280)

More details on these many-body calculations can be found in the book “Excitons and Cooper pairs”[6], and also in Refs. [49, 55].

9 Conclusion

Besides their applied interest in the physics of inorganic semiconductors and in the physics of “Open Systems”, Frenkel excitons have an utmost fundamental interest: like Wannier excitons, they correspond to electronic excitation extending over the whole sample. However, being made of on-site electronic excitations, Frenkel excitons have no relative-motion extension; so, they depend on one parameter only, their wave vector Kn\textbf{K}_{n}. This renders their many-body physics far simpler, but mostly this gives rise to a totally different Coulomb physics.

In the case of Wannier exciton made of one delocalized electron in the conduction band and one delocalized electron-absence in the valence band, the Coulomb interaction acts in two different ways: its intraband processes, responsible for binding a conduction electron to a valence hole, control the exciton relative-motion extension, while the interband Coulomb processes that only exist between optically bright electron-hole pairs, split their degenerate state into one longitudinal and two transverse modes.

The situation is totally different in the case of Frenkel excitons because these excitons are made of on-site electronic excitations. The tight-binding approximation, valid for semiconductors hosting Frenkel excitons, renders negligible the intralevel processes between sites, that are the analog of intraband processes. So, we are left with interlevel processes in which one electron-hole pair recombines on one site while another pair is created on another site: this fundamentally corresponds to an electron-hole pair exchange between two sites — not an “electron-hole exchange”, which would be meaningless for different quantum particles. These interlevel processes do both: they delocalize the electronic excitations on the 𝐑{\bf R}_{\ell} lattice sites as Kn\textbf{K}_{n} excitonic waves extending over the whole sample; they also split the degenerate exciton level into one longitudinal and two transverse modes, just as for Wannier excitons. Note that, since the interlevel Coulomb processes only act between spin-singlet electron-hole pairs, just like the electron-photon interaction, the Frenkel excitons are by construction optically bright — in the tight-binding limit. Yet, the exciton longitudinal mode is not coupled to light because the photon field is transverse.

The presentation of Frenkel excitons we give in the present manuscript, makes use of the tight-binding approximation that we take in the limit of zero wave function overlap between different lattice sites. This allows us to introduce clean fermionic operators for the one-electron basis in terms of which we write the Frenkel exciton problem in second quantization. The second quantization formalism not only avoids using the cumbersome Slater determinants for many electron states when written in the first quantization, but it also allows properly introducing the concept of hole that goes along with many tricky minus signs, which are difficult to follow when using Slater determinants: indeed, these determinants have been introduced for electron states, to take care of the sign change that appears when exchanging two electrons, not one electron and one hole.

In a last part, we give an overview of how to handle many-body effects between Frenkel excitons, through a formalism appropriate to composite bosons having no spatial extension, as required because these excitons are made of on-site excitations, in contrast to Wannier excitons made of delocalized conduction electron and valence-electron absence but ending as bound electron-hole pairs extending over a Bohr radius. We show that the dimensionless parameter for many-body effects between composite bosons like Frenkel excitons, Wannier excitons and also Cooper pairs, is entirely controlled by the Pauli exclusion principle between their fermionic components, whatever the mechanism that binds their fermion pairs.

We wish to say that the present work does not address two important parts of the exciton physics.

(i) We do not consider the interaction between Frenkel excitons and photons. Like Wannier excitons, Frenkel excitons can form polaritons, that is, linear combinations of one photon and one exciton. This formation occurs in the so-called “strong coupling” regime in which the exciton couples to a photon over a characteristic time much smaller than the time over which its wave vector would change by collisions: in this regime, the exciton keeps re-emitting a photon having the same wave vector; so, a coupled photon-exciton state can develop.

(ii) The second important part is the spin-orbit interaction[56]. This interaction splits the (2×3)(2\times 3) spin and spatial degeneracies of the energy levels. The relevant electron states one then has to consider are these spin-orbit eigenstates. The procedure to derive Frenkel excitons will, in the same way, follow with writing the matrix for the one electron-hole pair Hamiltonian in the degenerate subspace and diagonalizing this matrix. However, due to the intrinsic mixture of spin and spatial degeneracies induced by the spin-orbit interaction, the physics associated with the singlet-triplet splitting and the one associated with the longitudinal-transverse splitting will be less transparent to pick out.

Appendix A Derivation of the commutation relations (9,10)

We here derive the anticommutation relations between electron operators a^ν,\hat{a}^{\dagger}_{\nu,\ell} defined in Eq. (8), namely

|ν,𝐑=a^ν,|v|\nu,{\bf R}_{\ell}\rangle=\hat{a}^{\dagger}_{\nu,\ell}|v\rangle (281)

where |v|v\rangle is the vacuum state.

\bullet To obtain the anticommutation relation [a^ν,,a^ν,]+\left[\hat{a}_{\nu^{\prime},\ell^{\prime}},\hat{a}^{\dagger}_{\nu,\ell}\right]_{+}, we start with the a^ν,a^ν,|v\hat{a}_{\nu^{\prime},\ell^{\prime}}\hat{a}^{\dagger}_{\nu,\ell}|v\rangle state that we calculate in two ways.

(1) Since this state contains no electron, it reduces to vacuum. So,

a^ν,a^ν,|v\displaystyle\hat{a}_{\nu^{\prime},\ell^{\prime}}\hat{a}^{\dagger}_{\nu,\ell}|v\rangle =\displaystyle= |vv|a^ν,a^ν,|v\displaystyle|v\rangle\langle v|\hat{a}_{\nu^{\prime},\ell^{\prime}}\hat{a}^{\dagger}_{\nu,\ell}|v\rangle (282)
=\displaystyle= |vν,𝐑|ν,𝐑=δν,νδ,|v\displaystyle|v\rangle\langle\nu^{\prime},{\bf R}_{\ell^{\prime}}|\nu,{\bf R}_{\ell}\rangle=\delta_{\nu^{\prime},\nu}\delta_{\ell^{\prime},\ell}|v\rangle

(2) In the other way, we rewrite the above state as

a^ν,a^ν,|v=([a^ν,,a^ν,]+a^ν,a^ν,)|v\hat{a}_{\nu^{\prime},\ell^{\prime}}\hat{a}^{\dagger}_{\nu,\ell}|v\rangle=\left(\left[\hat{a}_{\nu^{\prime},\ell^{\prime}},\hat{a}^{\dagger}_{\nu,\ell}\right]_{+}-\hat{a}^{\dagger}_{\nu,\ell}\hat{a}_{\nu^{\prime},\ell^{\prime}}\right)|v\rangle (283)

the second term being equal to zero, since a^ν,|v=0\hat{a}_{\nu^{\prime},\ell^{\prime}}|v\rangle=0.

From these two calculations, we readily get

[a^ν,,a^ν,]+=δν,νδ,\left[\hat{a}_{\nu^{\prime},\ell^{\prime}},\hat{a}^{\dagger}_{\nu,\ell}\right]_{+}=\delta_{\nu^{\prime},\nu}\delta_{\ell^{\prime},\ell} (284)

\bullet To obtain [a^ν,,a^ν,]+\Big{[}\hat{a}^{\dagger}_{\nu^{\prime},\ell^{\prime}},\hat{a}^{\dagger}_{\nu,\ell}\Big{]}_{+}, we first note that |ν,𝐑|\nu,{\bf R}_{\ell}\rangle can be written on the electron basis |ν,𝐑0|\nu,{\bf R}_{\ell_{0}}\rangle of the 𝐑0{\bf R}_{\ell_{0}} sites as

a^ν,|v\displaystyle\hat{a}^{\dagger}_{\nu,\ell}|v\rangle =\displaystyle= |ν,𝐑=(ν1|ν1,𝐑0ν1,𝐑0|)|ν,𝐑\displaystyle|\nu,{\bf R}_{\ell}\rangle=\left(\sum_{\nu_{1}}|\nu_{1},{\bf R}_{\ell_{0}}\rangle\langle\nu_{1},{\bf R}_{\ell_{0}}|\right)|\nu,{\bf R}_{\ell}\rangle (285)
=\displaystyle= (ν1a^ν1,0ν1,𝐑0|ν,𝐑)|v\displaystyle\left(\sum_{\nu_{1}}\hat{a}^{\dagger}_{\nu_{1},\ell_{0}}\langle\nu_{1},{\bf R}_{\ell_{0}}|\nu,{\bf R}_{\ell}\rangle\right)|v\rangle

from which we get

a^ν,a^ν,\displaystyle\hat{a}^{\dagger}_{\nu^{\prime},\ell^{\prime}}\hat{a}^{\dagger}_{\nu,\ell} =\displaystyle= ν1ν2a^ν2,0a^ν1,0ν2,𝐑0|ν,𝐑ν1,𝐑0|ν,𝐑\displaystyle\sum_{\nu_{1}\nu_{2}}\hat{a}^{\dagger}_{\nu_{2},\ell_{0}}\hat{a}^{\dagger}_{\nu_{1},\ell_{0}}\langle\nu_{2},{\bf R}_{\ell_{0}}|\nu^{\prime},{\bf R}_{\ell^{\prime}}\rangle\langle\nu_{1},{\bf R}_{\ell_{0}}|\nu,{\bf R}_{\ell}\rangle (286)
=\displaystyle= a^ν,a^ν,\displaystyle-\hat{a}^{\dagger}_{\nu,\ell}\hat{a}^{\dagger}_{\nu^{\prime},\ell^{\prime}}

since the electron operators a^ν,0\hat{a}^{\dagger}_{\nu,\ell_{0}} anticommute; so, a^ν2,0a^ν1,0=a^ν1,0a^ν2,0\hat{a}^{\dagger}_{\nu_{2},\ell_{0}}\hat{a}^{\dagger}_{\nu_{1},\ell_{0}}=-\hat{a}^{\dagger}_{\nu_{1},\ell_{0}}\hat{a}^{\dagger}_{\nu_{2},\ell_{0}}. As a result, we end with

[a^ν,,a^ν,]+=0\left[\hat{a}^{\dagger}_{\nu^{\prime},\ell^{\prime}},\hat{a}^{\dagger}_{\nu,\ell}\right]_{+}=0 (287)

Appendix B H^\hat{H} Hamiltonian in the |Φg|\Phi_{g}\rangle subspace

The parts of H^\hat{H} that act on the system ground state |Φg|\Phi_{g}\rangle, reduce to H^0,g+V^gg+Vii\hat{H}_{0,g}+\hat{V}_{gg}+V_{i-i} because |Φg|\Phi_{g}\rangle is made of ground-level electrons only.

\bullet Using H^0,g\hat{H}_{0,g} in Eq. (32), we get, since a^g,a^g,|Φg=|Φg\hat{a}^{\dagger}_{g,\ell}\hat{a}_{g,\ell}|\Phi_{g}\rangle=|\Phi_{g}\rangle as all NsN_{s} lattice sites are occupied by a ground-level electron,

H^0,g|Φg\displaystyle\hat{H}_{0,g}|\Phi_{g}\rangle =\displaystyle= (εg+tg,g)=1Nsa^g,a^g,|Φg\displaystyle(\varepsilon_{g}+t_{g,g})\sum_{\ell=1}^{N_{s}}\hat{a}^{\dagger}_{g,\ell}\hat{a}_{g,\ell}|\Phi_{g}\rangle (288)
=\displaystyle= Ns(εg+tg,g)|Φg\displaystyle{N_{s}}(\varepsilon_{g}+t_{g,g})|\Phi_{g}\rangle

\bullet In the same way, since a^g,1a^g,2a^g,2a^g,1|Φg=(1δ1,2)|Φg\hat{a}^{\dagger}_{g,\ell_{1}}\hat{a}^{\dagger}_{g,\ell_{2}}\hat{a}_{g,\ell_{2}}\hat{a}_{g,\ell_{1}}|\Phi_{g}\rangle=(1-\delta_{\ell_{1},\ell_{2}})|\Phi_{g}\rangle, the V^gg\hat{V}_{gg} Coulomb interaction (35) acting on |Φg|\Phi_{g}\rangle gives, due to lattice periodicity,

V^gg|Φg=(121=1Ns21Ns𝒱𝐑1𝐑2(gggg))|Φg=Ns2𝐑𝟎𝒱𝐑(gggg)|Φg\displaystyle\hat{V}_{gg}|\Phi_{g}\rangle=\left(\frac{1}{2}\sum_{\ell_{1}=1}^{N_{s}}\sum_{\ell_{2}\not=\ell_{1}}^{N_{s}}\mathcal{V}_{{\bf R}_{\ell_{1}}-{\bf R}_{\ell_{2}}}\left(\begin{smallmatrix}g&g\\ g&g\end{smallmatrix}\right)\right)|\Phi_{g}\rangle=\frac{{N_{s}}}{2}\sum_{{\bf R}\not=\bf 0}\mathcal{V}_{{\bf R}}\left(\begin{smallmatrix}g&g\\ g&g\end{smallmatrix}\right)|\Phi_{g}\rangle (289)

\bullet The above results readily give, for the one-body part of H^\hat{H},

Φg|H^0,g|Φg=Ns(εg+tg,g)\langle\Phi_{g}|\hat{H}_{0,g}|\Phi_{g}\rangle=N_{s}(\varepsilon_{g}+t_{g,g}) (290)

while for the two-body Coulomb part, we get

Φg|V^gg|Φg=Ns2𝐑𝟎𝒱𝐑(gggg)\langle\Phi_{g}|\hat{V}_{gg}|\Phi_{g}\rangle=\frac{{N_{s}}}{2}\sum_{{\bf R}\not=\bf 0}\mathcal{V}_{{\bf R}}\left(\begin{smallmatrix}g&g\\ g&g\end{smallmatrix}\right) (291)

\bullet By adding the constant term that corresponds to the ion-ion interaction given in Eq. (22), we end with the Ns{N_{s}}-electron ground-state energy, defined in Eq. (26), as

Eg=Ns(εg+vgg)E^{\prime}_{g}={N_{s}}(\varepsilon_{g}+v_{gg}) (292)

where the energy contribution from the Coulomb interactions is given, according to Eqs. (290,291) and Eqs. (15,22), by

vgg\displaystyle v_{gg} =\displaystyle= tg,g+12𝐑𝟎𝒱𝐑(gggg)+12𝐑𝟎e2|𝐑|\displaystyle t_{g,g}+\frac{1}{2}\sum_{{\bf R}\not=\bf 0}\mathcal{V}_{{\bf R}}\left(\begin{smallmatrix}g&g\\ g&g\end{smallmatrix}\right)+\frac{1}{2}\sum_{{\bf R}\not=\bf 0}\frac{e^{2}}{|{\bf R}|}
=\displaystyle= 𝐑𝟎(12L3d3rd3r|𝐫|g|2|𝐫|g|2e2|𝐑+𝐫𝐫|\displaystyle\sum_{{\bf R}\not=\bf 0}\bigg{(}\frac{1}{2}\iint_{L^{3}}d^{3}rd^{3}r^{\prime}|\langle{\bf r}|g\rangle|^{2}|\langle{\bf r}^{\prime}|g\rangle|^{2}\frac{e^{2}}{|{\bf R}+{\bf r}-{\bf r}^{\prime}|}
L3d3r|𝐫|g|2e2|𝐑𝐫|+12e2|𝐑|)\displaystyle-\int_{L^{3}}d^{3}r|\langle{\bf r}|g\rangle|^{2}\frac{e^{2}}{|{\bf R}-{\bf r}^{\prime}|}+\frac{1}{2}\frac{e^{2}}{|{\bf R}|}\bigg{)}

By noting that

1=L3d3r|𝐫|g|21=\int_{L^{3}}d^{3}r|\langle{\bf r}|g\rangle|^{2} (294)

for normalized ground-level wave function, we can rewrite vggv_{gg} in a compact form as

vgg=12L3d3rd3r|𝐫|g|2|𝐫|g|2𝐑𝟎(e2|𝐑|+e2|𝐑+𝐫𝐫|2e2|𝐑𝐫|)v_{gg}=\frac{1}{2}\iint_{L^{3}}d^{3}rd^{3}r^{\prime}\,|\langle{\bf r}|g\rangle|^{2}|\langle{\bf r}^{\prime}|g\rangle|^{2}\sum_{{\bf R}\not=\bf 0}\left(\frac{e^{2}}{|{\bf R}|}+\frac{e^{2}}{|{\bf R}+{\bf r}-{\bf r}^{\prime}|}-\frac{2e^{2}}{|{\bf R}-{\bf r}^{\prime}|}\right) (295)

The above equation evidences that it is necessary to keep the electron interaction with all the other ions through tg,gt_{g,g}, as well as the interaction ViiV_{i-i} between all the ions, to possibly cancel the overextensive contribution coming from V^ee\hat{V}_{e-e} alone. Indeed, for |𝐫||{\bf r}| kept small by the 𝐫|g\langle{\bf r}|g\rangle wave function, the leading term of the V^ee\hat{V}_{e-e} scattering scales as

𝐑𝟎1|𝐑+𝐫𝐫|𝐑𝟎1|𝐑|L3d3R1RL2\sum_{{\bf R}\not=\bf 0}\frac{1}{|{\bf R}+{\bf r}-{\bf r}^{\prime}|}\simeq\sum_{{\bf R}\not=\bf 0}\frac{1}{|{\bf R}|}\simeq\int_{L^{3}}d^{3}R\frac{1}{R}\propto L^{2} (296)

The contribution of the electron-electron interaction to the ground-state energy would then be volume infinite compared to εg\varepsilon_{g}. This once more proves that considering a neutral system, with the same amount of negative electron charges and positive ion charges, is crucial to avoid spurious singularities.

Appendix C H^\hat{H} Hamiltonian in the |Φ𝐑|\Phi_{{\bf R}_{\ell}}\rangle subspace

We now consider H^\hat{H} acting on |Φ𝐑|\Phi_{{\bf R}_{\ell}}\rangle given in Eq. (27), with the electron on the 𝐑{\bf R}_{\ell} ion being in the excited level, while all the other lattice sites are occupied by a ground-level electron.

\bullet Using the one-body part H^0,e\hat{H}_{0,e} of the Hamiltonian given in Eq. (33), we readily find

H^0,e|Φ𝐑\displaystyle\hat{H}_{0,e}|\Phi_{{\bf R}_{\ell}}\rangle =\displaystyle= (εe+te,e)1=1Nsa^e,1a^e,1|Φ𝐑\displaystyle(\varepsilon_{e}+t_{e,e})\sum_{\ell_{1}=1}^{N_{s}}\hat{a}^{\dagger}_{e,\ell_{1}}\hat{a}_{e,\ell_{1}}|\Phi_{{\bf R}_{\ell}}\rangle (297)
=\displaystyle= (εe+te,e)|Φ𝐑\displaystyle(\varepsilon_{e}+t_{e,e})|\Phi_{{\bf R}_{\ell}}\rangle

while from the H^0,g\hat{H}_{0,g} part given in Eq. (32), we get

H^0,g|Φ𝐑\displaystyle\hat{H}_{0,g}|\Phi_{{\bf R}_{\ell}}\rangle =\displaystyle= (εg+tg,g)1=1Nsa^g,1a^g,1|Φ𝐑\displaystyle(\varepsilon_{g}+t_{g,g})\sum_{\ell_{1}=1}^{N_{s}}\hat{a}^{\dagger}_{g,\ell_{1}}\hat{a}_{g,\ell_{1}}|\Phi_{{\bf R}_{\ell}}\rangle (298)
=\displaystyle= (Ns1)(εg+tg,g)|Φ𝐑\displaystyle({N_{s}}-1)(\varepsilon_{g}+t_{g,g})|\Phi_{{\bf R}_{\ell}}\rangle

since the |Φ𝐑|\Phi_{{\bf R}_{\ell}}\rangle state contains (Ns1)({N_{s}}-1) ground-level electrons.

\bullet From the Coulomb interaction V^(intra)eg\hat{V}^{(intra)}_{eg} given in Eq. (36), in which each electron stays in its level, we get

V^(intra)eg|Φ𝐑\displaystyle\hat{V}^{(intra)}_{eg}|\Phi_{{\bf R}_{\ell}}\rangle =\displaystyle= 1=1Ns𝒱𝐑1𝐑(eegg)a^g,1a^g,1|Φ𝐑\displaystyle\sum_{\ell_{1}=1}^{N_{s}}\mathcal{V}_{{\bf R}_{\ell_{1}}-{\bf R}_{\ell}}\left(\begin{smallmatrix}e&e\\ g&g\end{smallmatrix}\right)\hat{a}^{\dagger}_{g,\ell_{1}}\hat{a}_{g,\ell_{1}}|\Phi_{{\bf R}_{\ell}}\rangle (299)
=\displaystyle= 1Ns𝒱𝐑1𝐑(eegg)|Φ𝐑=|Φ𝐑𝐑𝟎𝒱𝐑(eegg)\displaystyle\sum_{\ell_{1}\not=\ell}^{N_{s}}\mathcal{V}_{{\bf R}_{\ell_{1}}-{\bf R}_{\ell}}\left(\begin{smallmatrix}e&e\\ g&g\end{smallmatrix}\right)|\Phi_{{\bf R}_{\ell}}\rangle=|\Phi_{{\bf R}_{\ell}}\rangle\sum_{{\bf R}\not=\bf 0}\mathcal{V}_{{\bf R}}\left(\begin{smallmatrix}e&e\\ g&g\end{smallmatrix}\right)

Indeed, the 𝐑{\bf R}_{\ell} ion is occupied by an excited-level electron, so that a^g,1|Φ𝐑=0\hat{a}_{g,\ell_{1}}|\Phi_{{\bf R}_{\ell}}\rangle=0 for 1=\ell_{1}=\ell.

\bullet Turning to the Coulomb interaction V^(inter)eg\hat{V}^{(inter)}_{eg} given in Eq. (37), in which each electron changes its level, we first note that

a^e,1a^g,2a^e,2a^g,1|Φ𝐑=δ2,(1δ1,)|Φ𝐑1\hat{a}^{\dagger}_{e,\ell_{1}}\hat{a}^{\dagger}_{g,\ell_{2}}\hat{a}_{e,\ell_{2}}\hat{a}_{g,\ell_{1}}|\Phi_{{\bf R}_{\ell}}\rangle=\delta_{\ell_{2},\ell}(1-\delta_{\ell_{1},\ell})|\Phi_{{\bf R}_{\ell_{1}}}\rangle (300)

since a^g,|Φ𝐑=0\hat{a}_{g,\ell}|\Phi_{{\bf R}_{\ell}}\rangle=0. This leads to

V^(inter)eg|Φ𝐑=1𝒱𝐑1𝐑(geeg)|Φ𝐑1\hat{V}^{(inter)}_{eg}|\Phi_{{\bf R}_{\ell}}\rangle=\sum_{\ell_{1}\not=\ell}\mathcal{V}_{{\bf R}_{\ell_{1}}-{\bf R}_{\ell}}\left(\begin{smallmatrix}g&e\\ e&g\end{smallmatrix}\right)|\Phi_{{\bf R}_{\ell_{1}}}\rangle (301)

\bullet The V^gg\hat{V}_{gg} interaction, given in Eq. (35), also acts on |Φ𝐑|\Phi_{{\bf R}_{\ell}}\rangle because this state contains (Ns1)({N_{s}}-1) ground-level electrons. Its contribution reads

V^gg|Φ𝐑\displaystyle\hat{V}_{gg}|\Phi_{{\bf R}_{\ell}}\rangle =\displaystyle= [1212(1,)𝒱𝐑1𝐑2(gggg)]|Φ𝐑\displaystyle\bigg{[}\frac{1}{2}\sum_{\ell_{1}\not=\ell}\,\sum_{\ell_{2}\not=(\ell_{1},\ell)}\!\!\mathcal{V}_{{\bf R}_{\ell_{1}}-{\bf R}_{\ell_{2}}}\left(\begin{smallmatrix}g&g\\ g&g\end{smallmatrix}\right)\bigg{]}\!|\Phi_{{\bf R}_{\ell}}\rangle (302)
=\displaystyle= (Ns21)𝐑𝟎𝒱𝐑(gggg)|Φ𝐑\displaystyle\left(\frac{{N_{s}}}{2}-1\right)\sum_{{\bf R}\not=\bf 0}\mathcal{V}_{{\bf R}}\left(\begin{smallmatrix}g&g\\ g&g\end{smallmatrix}\right)|\Phi_{{\bf R}_{\ell}}\rangle

\bullet The above results give the Coulomb parts of the H^\hat{H} matrix elements in the |Φ𝐑|\Phi_{{\bf R}_{\ell}}\rangle subspace as

Φ𝐑|V^gg|Φ𝐑=δ,(Ns21)𝐑𝟎𝒱𝐑(gggg)\displaystyle\langle\Phi_{{\bf R}_{\ell^{\prime}}}|\hat{V}_{gg}|\Phi_{{\bf R}_{\ell}}\rangle=\delta_{\ell^{\prime},\ell}\left(\frac{{N_{s}}}{2}-1\right)\sum_{{\bf R}\not=\bf 0}\mathcal{V}_{{\bf R}}\left(\begin{smallmatrix}g&g\\ g&g\end{smallmatrix}\right) (303)
Φ𝐑|V^(intra)eg|Φ𝐑=δ,𝐑𝟎𝒱𝐑(eegg)\displaystyle\langle\Phi_{{\bf R}_{\ell^{\prime}}}|\hat{V}^{(intra)}_{eg}|\Phi_{{\bf R}_{\ell}}\rangle=\delta_{\ell^{\prime},\ell}\sum_{{\bf R}\not=\bf 0}\mathcal{V}_{{\bf R}}\left(\begin{smallmatrix}e&e\\ g&g\end{smallmatrix}\right)\hskip 28.45274pt (304)
Φ𝐑|V^(inter)eg|Φ𝐑=(1δ,)𝒱𝐑𝐑(geeg)\displaystyle\langle\Phi_{{\bf R}_{\ell^{\prime}}}|\hat{V}^{(inter)}_{eg}|\Phi_{{\bf R}_{\ell}}\rangle=(1-\delta_{\ell^{\prime},\ell})\,\mathcal{V}_{{\bf R}_{\ell^{\prime}}-{\bf R}_{\ell}}\left(\begin{smallmatrix}g&e\\ e&g\end{smallmatrix}\right) (305)

\bullet Using them, we obtain the diagonal part of the H^\hat{H} Hamiltonian in the |Φ𝐑|\Phi_{{\bf R}_{\ell}}\rangle subspace as

Φ𝐑|H^|Φ𝐑=Ee=Eg+εeεg+veg\langle\Phi_{{\bf R}_{\ell}}|\hat{H}|\Phi_{{\bf R}_{\ell}}\rangle=E^{\prime}_{e}=E^{\prime}_{g}+\varepsilon_{e}-\varepsilon_{g}+v_{eg} (306)

whatever \ell, the Coulomb part

veg=te,etg,g+𝐑𝟎(𝒱𝐑(eegg)𝒱𝐑(gggg))v_{eg}=t_{e,e}-t_{g,g}+\sum_{{\bf R}\not=\bf 0}\Big{(}\mathcal{V}_{{\bf R}}\!\left(\begin{smallmatrix}e&e\\ g&g\end{smallmatrix}\right)-\mathcal{V}_{{\bf R}}\!\left(\begin{smallmatrix}g&g\\ g&g\end{smallmatrix}\right)\Big{)} (307)

being precisely given by

veg=L3d3rd3r(|𝐫|e|2|𝐫|g|2)|𝐫|g|2𝐑𝟎(e2|𝐑+𝐫𝐫|e2|𝐑𝐫|)v_{eg}=\iint_{L^{3}}d^{3}rd^{3}r^{\prime}\Big{(}|\langle{\bf r}^{\prime}|e\rangle|^{2}-|\langle{\bf r}^{\prime}|g\rangle|^{2}\Big{)}|\langle{\bf r}|g\rangle|^{2}\sum_{{\bf R}\not=\bf 0}\left(\frac{e^{2}}{|{\bf R}+{\bf r}-{\bf r}^{\prime}|}-\frac{e^{2}}{|{\bf R}-{\bf r}^{\prime}|}\right) (308)

The above equation again shows that the electron interaction with all the other ions, that produces the e2/|𝐑𝐫|e^{2}/|{\bf R}-{\bf r}^{\prime}| term, goes to cancel the dominant 1/|𝐑|1/|{\bf R}| contribution of the first term that comes from electron-electron interaction, so that no overextensive contribution remains in vegv_{eg}.

\bullet The nondiagonal parts of the H^\hat{H} Hamiltonian in the |Φ𝐑|\Phi_{{\bf R}_{\ell}}\rangle subspace only come from Coulomb processes. They reduce to

Φ𝐑|H^|Φ𝐑=Φ𝐑|V^(inter)eg|Φ𝐑=𝒱𝐑𝐑(geeg)𝒱(e,g),\langle\Phi_{{\bf R}_{\ell^{\prime}}}|\hat{H}|\Phi_{{\bf R}_{\ell}}\rangle=\langle\Phi_{{\bf R}_{\ell^{\prime}}}|\hat{V}^{(inter)}_{eg}|\Phi_{{\bf R}_{\ell}}\rangle=\mathcal{V}_{{\bf R}_{\ell^{\prime}}-{\bf R}_{\ell}}\!\left(\begin{smallmatrix}g&e\\ e&g\end{smallmatrix}\right)\equiv\mathcal{V}^{(e,g)}_{\ell^{\prime},\ell} (309)

\bullet As a result, the (Ns×NsN_{s}\times N_{s}) matrix for the H^\hat{H} Hamiltonian in this subspace appears as

(Ee𝒱(e,g),𝒱(e,g),Ee)\left(\begin{matrix}E^{\prime}_{e}&\cdots&\mathcal{V}^{(e,g)}_{\ell^{\prime},\ell}\\ \vdots&\ddots&\vdots\\ \mathcal{V}^{(e,g)}_{\ell^{\prime},\ell}&\cdots&E^{\prime}_{e}\end{matrix}\right) (310)

Appendix D H^\hat{H} Hamiltonian in the ground and excited subspaces with spin

\bullet The system ground-state energy in the presence of Coulomb interactions between electrons, between electrons and the other ions, and between ions, reads

Φg|H^|Φg=2Ns(εg+vgg)\langle\Phi_{g}|\hat{H}|\Phi_{g}\rangle=2N_{s}(\varepsilon_{g}+v_{gg}) (311)

with the Coulomb part given by

vgg=tg,g+12𝒱𝐑=𝟎(gggg)+𝐑𝟎𝒱𝐑(gggg)+Vii2Nsv_{gg}=t_{g,g}+\frac{1}{2}\mathcal{V}_{{\bf R}=\bf 0}\left(\begin{smallmatrix}g&g\\ g&g\end{smallmatrix}\right)+\sum_{{\bf R}\not=\bf 0}\mathcal{V}_{\bf R}\left(\begin{smallmatrix}g&g\\ g&g\end{smallmatrix}\right)+\frac{V_{i-i}}{2N_{s}} (312)

We can check that the overextensive terms of this Coulomb part cancel out exactly.

Refer to caption
Figure 21: The V^(inter)eg\hat{V}^{(inter)}_{eg} interaction only acts between |Φσ1,σ1;𝐑1|\Phi_{\sigma^{\prime}_{1},\sigma_{1};{\bf R}_{\ell_{1}}}\rangle states with σ1=σ1\sigma^{\prime}_{1}=\sigma_{1}, because Coulomb interaction conserves the spin (see Eq. (313)).

\bullet For the sake of generality, we here consider that the excited electron may not have the same spin as the absent ground-level electron. So, the excited subspace is made of states |Φσ,σ;𝐑=a^σ,e,a^σ,g,|Φg|\Phi_{\sigma^{\prime},\sigma;{\bf R}_{\ell}}\rangle=\hat{a}^{\dagger}_{\sigma^{\prime},e,\ell}\hat{a}_{\sigma,g,\ell}|\Phi_{g}\rangle. The terms of H^\hat{H} in this subspace, Φσ2,σ2;𝐑2|H^|Φσ1,σ1;𝐑1\langle\Phi_{\sigma^{\prime}_{2},\sigma_{2};{\bf R}_{\ell_{2}}}|\hat{H}|\Phi_{\sigma^{\prime}_{1},\sigma_{1};{\bf R}_{\ell_{1}}}\rangle, that are responsible for Frenkel exciton formation, come from the interlevel Coulomb interaction. By using Eq. (89), we find that the state V^(inter)eg|Φσ1,σ1;𝐑1\hat{V}^{(inter)}_{eg}|\Phi_{\sigma^{\prime}_{1},\sigma_{1};{\bf R}_{\ell_{1}}}\rangle contains two types of terms: one term, shown in Fig. 21,

δσ1,σ1s111𝒱𝐑1𝐑1(geeg)|Φs1,s1;𝐑1\delta_{\sigma^{\prime}_{1},\sigma_{1}}\sum_{s_{1}}\sum_{\ell^{\prime}_{1}\not=\ell_{1}}\mathcal{V}_{{\bf R}_{\ell^{\prime}_{1}}-{\bf R}_{\ell_{1}}}\left(\begin{smallmatrix}g&e\\ e&g\end{smallmatrix}\right)|\Phi_{s_{1},s_{1};{\bf R}_{\ell^{\prime}_{1}}}\rangle (313)

comes from processes on different lattice sites 12\ell^{\prime}_{1}\not=\ell^{\prime}_{2}, but for σ1=σ1\sigma_{1}=\sigma^{\prime}_{1} only.

Refer to caption
Figure 22: The V^(inter)eg\hat{V}^{(inter)}_{eg} interaction inside a lattice site, as given in Eq. (314).

We can also have 1=2\ell^{\prime}_{1}=\ell^{\prime}_{2}, which corresponds to Coulomb processes on the same site. These on-site processes can involve electrons having the same spin (see Fig. 22(a)) or different spins (see Fig. 22(b)). The resulting states read

𝒱𝐑=𝟎(geeg)|Φσ1,σ1;𝐑1forσ1=±σ1\mathcal{V}_{{\bf R}=\bf 0}\left(\begin{smallmatrix}g&e\\ e&g\end{smallmatrix}\right)|\Phi_{-\sigma_{1},-\sigma^{\prime}_{1};{\bf R}_{\ell_{1}}}\rangle\,\,\,\,\,\,\,\,\,\textrm{for}\,\,\,\,\,\sigma^{\prime}_{1}=\pm\sigma_{1} (314)

The H^\hat{H} Hamiltonian in the ((2×2)×Ns(2\times 2)\times N_{s}) degenerate subspace made of |Φσ,σ;𝐑|\Phi_{\sigma^{\prime},\sigma;{\bf R}_{\ell}}\rangle states, has the same 4×44\times 4 submatrix h(1=2)h(\ell_{1}=\ell_{2}) on the main diagonal, and a set of (4×44\times 4) submatrices h(12)h(\ell_{1}\not=\ell_{2}) out of the diagonal. The h(1=2)h(\ell_{1}=\ell_{2}) submatrix, which does not depend on 1\ell_{1}, appears in the (++,+,+,)(++,+-,-+,--) basis as

h(1=2)=(Ee00𝒱(e,g)0Ee+𝒱(e,g)0000Ee+𝒱(e,g)0𝒱(e,g)00Ee)h(\ell_{1}=\ell_{2})=\left(\begin{matrix}E^{\prime}_{e}&0&0&\mathcal{V}^{(e,g)}\\ 0&E^{\prime}_{e}+\mathcal{V}^{(e,g)}&0&0\\ 0&0&E^{\prime}_{e}+\mathcal{V}^{(e,g)}&0\\ \mathcal{V}^{(e,g)}&0&0&E^{\prime}_{e}\end{matrix}\right) (315)

with 𝒱(e,g)=𝒱𝐑=𝟎(geeg)\mathcal{V}^{(e,g)}=\mathcal{V}_{{\bf R}=\bf 0}\left(\begin{smallmatrix}g&e\\ e&g\end{smallmatrix}\right) and

Ee\displaystyle E^{\prime}_{e} =\displaystyle= Φg|H^|Φg+(εeεg)+(te,etg,g)+(𝒱𝐑=𝟎(eegg)𝒱𝐑=𝟎(gggg))\displaystyle\langle\Phi_{g}|\hat{H}|\Phi_{g}\rangle+(\varepsilon_{e}-\varepsilon_{g})+(t_{e,e}-t_{g,g})+\Big{(}\mathcal{V}_{{\bf R}=\bf 0}\left(\begin{smallmatrix}e&e\\ g&g\end{smallmatrix}\right)-\mathcal{V}_{{\bf R}=\bf 0}\left(\begin{smallmatrix}g&g\\ g&g\end{smallmatrix}\right)\Big{)} (316)
+2𝐑𝟎(𝒱𝐑(eegg)𝒱𝐑(gggg))\displaystyle+2\sum_{{\bf R}\not=\bf 0}\Big{(}\mathcal{V}_{{\bf R}}\left(\begin{smallmatrix}e&e\\ g&g\end{smallmatrix}\right)-\mathcal{V}_{{\bf R}}\left(\begin{smallmatrix}g&g\\ g&g\end{smallmatrix}\right)\Big{)}

The (4×44\times 4) off-diagonal submatrices appear in this basis as

h(12)=𝒱𝐑1𝐑2(geeg)(1001000000001001)h(\ell_{1}\not=\ell_{2})=\mathcal{V}_{{\bf R}_{\ell_{1}}-{\bf R}_{\ell_{2}}}\left(\begin{smallmatrix}g&e\\ e&g\end{smallmatrix}\right)\left(\begin{matrix}1&0&0&1\\ 0&0&0&0\\ 0&0&0&0\\ 1&0&0&1\end{matrix}\right) (317)

Appendix E Derivation of Eq. (147) from Eq. (146)

We consider an arbitrary set of orthonormal vectors (𝐱,𝐲,𝐳)({\bf x},{\bf y},{\bf z}) and we take 𝝁\bm{\mu} and 𝝁\bm{\mu}^{\prime} as any of these three vectors. For Rμ=𝐑𝝁R_{\mu}={\bf R}\cdot\bm{\mu}, the sum

S𝐊(μ,μ)\displaystyle S_{\bf K}(\mu^{\prime},\mu) =\displaystyle= 𝐑𝟎ei𝐊𝐑R3(δμ,μ3RμRμR2)\displaystyle\sum_{{\bf R}\not=\bf 0}\frac{e^{i{\bf K}\cdot{\bf R}}}{R^{3}}\left(\delta_{\mu^{\prime},\mu}-3\frac{R_{\mu^{\prime}}R_{\mu}}{R^{2}}\right) (318)
=\displaystyle= 𝐑𝟎ei𝐊𝐑R3(𝝁𝝁3(𝐑𝝁)(𝐑𝝁)R2)\displaystyle\sum_{{\bf R}\not=\bf 0}\frac{e^{i{\bf K}\cdot{\bf R}}}{R^{3}}\left(\bm{\mu}^{\prime}\cdot\bm{\mu}-3\frac{({\bf R}\cdot\bm{\mu}^{\prime})\,({\bf R}\cdot\bm{\mu})}{R^{2}}\right)

is singular in the 𝐊𝟎{\bf K}\rightarrow\bf 0 limit: its 𝐊=𝟎{\bf K}=\bf 0 value differs from the 𝐊𝟎{\bf K}\rightarrow\bf 0 limit, and this limit depends on the 𝐊{\bf K} direction.

E.1 Calculation of S𝐊(μ,μ)S_{\bf K}(\mu^{\prime},\mu) for 𝐊=𝟎{\bf K}=\bf 0

We first show that S𝐊(μ,μ)S_{\bf K}(\mu^{\prime},\mu) cancels for 𝐊=𝟎{\bf K}=\bf 0. To do it, we introduce the cubic crystal axes (𝐗,𝐘,𝐙)({\bf X},{\bf Y},{\bf Z}) and we expand the three vectors (𝐑,𝝁,𝝁)({\bf R},\bm{\mu}^{\prime},\bm{\mu}) on these axes

𝐑\displaystyle{\bf R} =\displaystyle= RX𝐗+RY𝐘+RZ𝐙\displaystyle R_{X}{\bf X}+R_{Y}{\bf Y}+R_{Z}{\bf Z} (319)
𝝁\displaystyle\bm{\mu} =\displaystyle= μX𝐗+μY𝐘+μZ𝐙\displaystyle\mu_{X}{\bf X}+\mu_{Y}{\bf Y}+\mu_{Z}{\bf Z} (320)

with a similar expansion for 𝝁\bm{\mu}^{\prime}. As 𝐑𝝁=RXμX+RYμY+RZμZ{\bf R}\cdot\bm{\mu}=R_{X}\mu_{X}+R_{Y}\mu_{Y}+R_{Z}\mu_{Z}, the product (𝐑𝝁)(𝐑𝝁)({\bf R}\cdot\bm{\mu}^{\prime})({\bf R}\cdot\bm{\mu}) in Eq. (318) reads

(𝐑𝝁)(𝐑𝝁)\displaystyle({\bf R}\cdot\bm{\mu}^{\prime})({\bf R}\cdot\bm{\mu}) =\displaystyle= (RX2μXμX+RY2μYμY+RZ2μZμZ)\displaystyle\Big{(}R_{X}^{2}\mu^{\prime}_{X}\mu_{X}+R_{Y}^{2}\mu^{\prime}_{Y}\mu_{Y}+R_{Z}^{2}\mu^{\prime}_{Z}\mu_{Z}\Big{)} (321)
+(RXRY(μXμY+μYμX)+)\displaystyle+\Big{(}R_{X}R_{Y}(\mu_{X}\mu^{\prime}_{Y}+\mu_{Y}\mu^{\prime}_{X})+\cdots\Big{)}

As a result, S𝐊(μ,μ)S_{\bf K}(\mu^{\prime},\mu) taken for 𝐊=𝟎{\bf K}=\bf 0 contains the following terms, which, for 𝐑{\bf R} written in terms of (RX,RY,RZ)(R_{X},R_{Y},R_{Z}), reduce through symmetry, either to

𝐑𝟎RX2R5\displaystyle\sum_{{\bf R}\not=\bf 0}\frac{R_{X}^{2}}{R^{5}}\!\!\!\! =\displaystyle= 𝐑𝟎RY2R5=𝐑𝟎RZ2R5=13𝐑𝟎RX2+RY2+RZ2R5=13𝐑𝟎1R3\displaystyle\!\!\!\!\sum_{{\bf R}\not=\bf 0}\frac{R_{Y}^{2}}{R^{5}}=\sum_{{\bf R}\not=\bf 0}\frac{R_{Z}^{2}}{R^{5}}=\frac{1}{3}\sum_{{\bf R}\not=\bf 0}\frac{R_{X}^{2}{+}R_{Y}^{2}{+}R_{Z}^{2}}{R^{5}}=\frac{1}{3}\sum_{{\bf R}\not=\bf 0}\frac{1}{R^{3}} (322)

or to

𝐑𝟎RXRYR5\displaystyle\sum_{{\bf R}\not=\bf 0}\frac{R_{X}R_{Y}}{R^{5}}\!\!\!\! =\displaystyle= 0\displaystyle\!\!\!0 (323)

When used into Eq. (318), this leads to

S𝐊=𝟎(μ,μ)\displaystyle S_{{\bf K}=\bf 0}(\mu^{\prime},\mu) =\displaystyle= 𝐑𝟎1R3(𝝁𝝁μXμXμYμYμZμZ)\displaystyle\sum_{{\bf R}\not=\bf 0}\frac{1}{R_{3}}\left(\bm{\mu}^{\prime}\cdot\bm{\mu}{-}\mu^{\prime}_{X}\mu_{X}{-}\mu^{\prime}_{Y}\mu_{Y}{-}\mu^{\prime}_{Z}\mu_{Z}\right)
=\displaystyle= 0\displaystyle 0

E.2 Calculation of S𝐊(μ,μ)S_{\bf K}(\mu^{\prime},\mu) in the 𝐊𝟎{\bf K}\rightarrow\bf 0 limit

Since S𝐊=𝟎(μ,μ)=0S_{{\bf K}=\bf 0}(\mu^{\prime},\mu)=0, we can rewrite S𝐊(μ,μ)S_{\bf K}(\mu^{\prime},\mu) as

S𝐊(μ,μ)=𝐑𝟎ei𝐊𝐑1R3(δμ,μ3RμRμR2)S_{\bf K}(\mu^{\prime},\mu)=\sum_{{\bf R}\not=\bf 0}\frac{e^{i{\bf K}\cdot{\bf R}}-1}{R^{3}}\left(\delta_{\mu^{\prime},\mu}-3\frac{R_{\mu^{\prime}}R_{\mu}}{R^{2}}\right) (325)

which readily shows that the S𝐊(μ,μ)S_{\bf K}(\mu^{\prime},\mu) limit for 𝐊𝟎{\bf K}\rightarrow\bf 0 is controlled by its large-𝐑{\bf R} terms. As a result, we can replace the S𝐊(μ,μ)S_{\bf K}(\mu^{\prime},\mu) discrete sum over the vectors 𝐑{\bf R} of a cubic lattice with cell size aca_{c}, by an integral according to

𝐑f(𝐑)1ac3d3Rf(𝐑)\sum_{\bf R}f({\bf R})\simeq\frac{1}{a_{c}^{3}}\int d^{3}R\,\,f({\bf R}) (326)

A convenient way to calculate S𝐊(μ,μ)S_{{\bf K}}(\mu^{\prime},\mu) in the small 𝐊{\bf K} limit is to introduce the orthonormal vectors (𝐱𝐊,𝐲𝐊,𝐳𝐊)({\bf x}_{\bf K},{\bf y}_{\bf K},{\bf z}_{\bf K}) with 𝐳𝐊=𝐊/K{\bf z}_{\bf K}={\bf K}/K and to expand the three vectors (𝐑,𝝁,𝝁)({\bf R},\bm{\mu}^{\prime},\bm{\mu}) on these vectors as

𝐑\displaystyle{\bf R} =\displaystyle= R(sinθcosφ𝐱𝐊+sinθsinφ𝐲𝐊+cosθ𝐳𝐊)\displaystyle R(\sin\theta\cos\varphi\,{\bf x}_{\bf K}+\sin\theta\sin\varphi\,{\bf y}_{\bf K}+\cos\theta\,{\bf z}_{\bf K}) (327)
𝝁\displaystyle\bm{\mu} =\displaystyle= α𝐱𝐊+β𝐲𝐊+γ𝐳𝐊\displaystyle\alpha\,{\bf x}_{\bf K}+\beta\,{\bf y}_{\bf K}+\gamma\,{\bf z}_{\bf K} (328)

with a similar expansion for 𝝁\bm{\mu}^{\prime}. This gives

lim𝐊𝟎S𝐊(μ,μ)1ac30R2dR0πsinθdθ02πdφeiKRcosθR3(αα+ββ+γγ\displaystyle\lim_{{\bf K}\rightarrow\bf 0}S_{{\bf K}}(\mu^{\prime},\mu)\simeq\frac{1}{a_{c}^{3}}\int_{0}^{\infty}R^{2}dR\int_{0}^{\pi}\sin\theta d\theta\int_{0}^{2\pi}d\varphi\frac{e^{iKR\cos\theta}}{R^{3}}\Big{(}\alpha^{\prime}\alpha+\beta^{\prime}\beta+\gamma^{\prime}\gamma\hskip 14.22636pt
3(αsinθcosφ+βsinθsinφ+γcosθ)(αsinθcosφ+βsinθsinφ+γcosθ))\displaystyle-3(\alpha^{\prime}\sin\theta\cos\varphi+\beta^{\prime}\sin\theta\sin\varphi+\gamma^{\prime}\cos\theta)(\alpha\sin\theta\cos\varphi+\beta\sin\theta\sin\varphi+\gamma\cos\theta)\Big{)}

The integration over φ\varphi reduces the above equation to

lim𝐊𝟎S𝐊(μ,μ)\displaystyle\lim_{{\bf K}\rightarrow\bf 0}S_{{\bf K}}(\mu^{\prime},\mu) \displaystyle\simeq 2πac30dRR0πsinθdθeiKRcosθ\displaystyle\frac{2\pi}{a_{c}^{3}}\int_{0}^{\infty}\frac{dR}{R}\int_{0}^{\pi}\sin\theta d\theta\,e^{iKR\cos\theta}
×((αα+ββ+γγ)32(αα+ββ)sin2θ3γγcos2θ)\displaystyle\times\Big{(}(\alpha^{\prime}\alpha+\beta^{\prime}\beta+\gamma^{\prime}\gamma)-\frac{3}{2}(\alpha^{\prime}\alpha+\beta^{\prime}\beta)\sin^{2}\theta-3\gamma^{\prime}\gamma\cos^{2}\theta\Big{)}

So, by setting x=KRx=KR, this also reads

lim𝐊𝟎S𝐊(μ,μ)\displaystyle\lim_{{\bf K}\rightarrow\bf 0}S_{{\bf K}}(\mu^{\prime},\mu)\!\!\! \displaystyle\simeq πac3(ααββ+2γγ)0dxx0πsinθdθeixcosθ(13cos2θ)\displaystyle\!\!\!\frac{\pi}{a_{c}^{3}}(-\alpha^{\prime}\alpha-\beta^{\prime}\beta+2\gamma^{\prime}\gamma)\int_{0}^{\infty}\frac{dx}{x}\int_{0}^{\pi}\sin\theta d\theta\,e^{ix\cos\theta}(1-3\cos^{2}\theta) (330)
=\displaystyle= 4π3ac3(ααββ+2γγ)\displaystyle\!\!\!\frac{4\pi}{3a_{c}^{3}}(-\alpha^{\prime}\alpha-\beta^{\prime}\beta+2\gamma^{\prime}\gamma)

as obtained from

0dxx11dt(13t2)eixt=43\int_{0}^{\infty}\frac{dx}{x}\int_{-1}^{1}dt(1-3t^{2})\,e^{ixt}=\frac{4}{3} (331)

Since γ=𝐊𝝁/K=Kμ/K\gamma={\bf K}\cdot\bm{\mu}/K=K_{\mu}/K, while (ααββ+2γγ)=3γγ(αα+ββ+γγ)(-\alpha^{\prime}\alpha-\beta^{\prime}\beta+2\gamma^{\prime}\gamma)=3\gamma^{\prime}\gamma-(\alpha^{\prime}\alpha+\beta^{\prime}\beta+\gamma^{\prime}\gamma), we ultimately find

lim𝐊𝟎S𝐊(μ,μ)=4π3ac3(δμ,μ3KμKμK2)\lim_{{\bf K}\rightarrow\bf 0}S_{\bf K}(\mu^{\prime},\mu)=-\frac{4\pi}{3a_{c}^{3}}\left(\delta_{\mu^{\prime},\mu}-3\frac{K_{\mu^{\prime}}K_{\mu}}{K^{2}}\right) (332)

Note the sign change between the two terms of S𝐊(μ,μ)S_{\bf K}(\mu^{\prime},\mu) in Eq. (318) and the two terms of its 𝐊𝟎{\bf K}\rightarrow\bf 0 limit.

Appendix F Large 𝐑{\bf R} expansion of vμ,μv_{\mu^{\prime},\mu}

The large-𝐑{\bf R} expansion to the Coulomb part of vμ,μv_{\mu^{\prime},\mu} defined in Eq. (138) as

vμ,μ\displaystyle v_{\mu^{\prime},\mu} =\displaystyle= L3d3rd3r|𝐫|g|2(μ,e|𝐫𝐫|μ,eδμ,μ|𝐫|g|2)\displaystyle\iint_{L^{3}}d^{3}rd^{3}r^{\prime}\,|\langle{\bf r}|g\rangle|^{2}\Big{(}\langle\mu^{\prime},e|{\bf r}^{\prime}\rangle\langle{\bf r}^{\prime}|\mu,e\rangle-\delta_{\mu^{\prime},\mu}|\langle{\bf r}^{\prime}|g\rangle|^{2}\Big{)} (333)
×𝐑𝟎[e2|𝐑+𝐫𝐫|e2|𝐑𝐫|]\displaystyle\times\sum_{{\bf R}\not=\bf 0}\left[\frac{e^{2}}{|{\bf R}+{\bf r}-{\bf r}^{\prime}|}-\frac{e^{2}}{|{\bf R}-{\bf r}^{\prime}|}\right]

reads as

1|𝐑+𝐫𝐫|1|𝐑𝐫|12R(2𝐑R𝐫R+2𝐫𝐫+𝐫2R2)\displaystyle\frac{1}{|{\bf R}+{\bf r}-{\bf r}^{\prime}|}-\frac{1}{|{\bf R}-{\bf r}^{\prime}|}\simeq-\frac{1}{2R}\left(\frac{2{\bf R}}{R}\cdot\frac{{\bf r}}{R}+\frac{-2{\bf r}\cdot{\bf r}^{\prime}+{\bf r}^{2}}{R^{2}}\right)\hskip 56.9055pt
+38R[2(2𝐑R𝐫R)(2𝐑R𝐫R)+(2𝐑R𝐫R)2]\displaystyle+\frac{3}{8R}\left[-2\left(\frac{2{\bf R}}{R}\cdot\frac{{\bf r}^{\prime}}{R}\right)\left(\frac{2{\bf R}}{R}\cdot\frac{{\bf r}}{R}\right)+\left(\frac{2{\bf R}}{R}\cdot\frac{{\bf r}}{R}\right)^{2}\right] (334)

When used into Eq. (333), we find that, as d3r|𝐫|g|2𝐫=0\int d^{3}r|\langle{\bf r}|g\rangle|^{2}{\bf r}=0, the nonzero terms of this large-R limit reduce to

vμ,μ\displaystyle v_{\mu^{\prime},\mu} \displaystyle\simeq L3d3rd3r|𝐫|g|2(μ,e|𝐫𝐫|μ,eδμ,μ|𝐫|g|2)\displaystyle\iint_{L^{3}}d^{3}rd^{3}r^{\prime}|\langle{\bf r}|g\rangle|^{2}\Big{(}\langle\mu^{\prime},e|{\bf r}^{\prime}\rangle\langle{\bf r}^{\prime}|\mu,e\rangle-\delta_{\mu^{\prime},\mu}|\langle{\bf r}^{\prime}|g\rangle|^{2}\Big{)} (335)
𝐑𝟎12R3[𝐫2+3R2(RX2rX2+RY2rY2+RZ2rZ2)]\displaystyle\sum_{{\bf R}\not=\bf 0}\frac{1}{2R^{3}}\left[-{\bf r}^{2}{+}\frac{3}{R^{2}}\left(R_{X}^{2}{r}_{X}^{2}{+}R_{Y}^{2}{r}_{Y}^{2}{+}R_{Z}^{2}{r}_{Z}^{2}\right)\right]

for 𝐑{\bf R} written as RX𝐗+RY𝐘+RZ𝐙R_{X}{\bf X}+R_{Y}{\bf Y}+R_{Z}{\bf Z} in terms of the cubic crystal axes (𝐗,𝐘,𝐙)({\bf X},{\bf Y},{\bf Z}). Since for cubic symmetry,

𝐑𝟎RX2R5=𝐑𝟎RY2R5=𝐑𝟎RZ2R5=13𝐑𝟎1R3\sum_{{\bf R}\not=\bf 0}\frac{R_{X}^{2}}{R^{5}}=\sum_{{\bf R}\not=\bf 0}\frac{R_{Y}^{2}}{R^{5}}=\sum_{{\bf R}\not=\bf 0}\frac{R_{Z}^{2}}{R^{5}}=\frac{1}{3}\sum_{{\bf R}\not=\bf 0}\frac{1}{R^{3}} (336)

we readily find that Eq. (335) reduces to zero. So, vμ,μv_{\mu^{\prime},\mu} scales as 𝒪(R4)\mathcal{O}(R^{-4}) for 𝐑{\bf R} large.

Appendix G Coulomb interaction V^gg\hat{V}_{gg} in terms of holes

\bullet Equation (172) gives the Coulomb interaction V^gg\hat{V}_{gg} between ground-level electrons, in terms of electron operators. To write it in terms of holes, we first note that

a^s1,μ1,g,1a^s2,μ2,g,2a^s2,μ2,g,2a^s1,μ1,g,1\displaystyle\hat{a}^{\dagger}_{s_{1},\mu^{\prime}_{1},g,\ell_{1}}\hat{a}^{\dagger}_{s_{2},\mu^{\prime}_{2},g,\ell_{2}}\hat{a}_{s_{2},\mu_{2},g,\ell_{2}}\hat{a}_{s_{1},\mu_{1},g,\ell_{1}}\hskip 85.35826pt
=a^s1,μ1,g,1(δμ2,μ2a^s2,μ2,2a^s2,μ2,2)a^s1,μ1,g,1\displaystyle=\hat{a}^{\dagger}_{s_{1},\mu^{\prime}_{1},g,\ell_{1}}\Big{(}\delta_{\mu^{\prime}_{2},\mu_{2}}-\hat{a}_{s_{2},\mu_{2},\ell_{2}}\hat{a}^{\dagger}_{s_{2},\mu^{\prime}_{2},\ell_{2}}\Big{)}\hat{a}_{s_{1},\mu_{1},g,\ell_{1}} (337)

which becomes in terms of hole operators

(δμ1,μ1δμ2,μ2δs1,s2δμ1,μ2δμ2,μ1δ1,2)\displaystyle\Big{(}\delta_{\mu^{\prime}_{1},\mu_{1}}\delta_{\mu^{\prime}_{2},\mu_{2}}-\delta_{s_{1},s_{2}}\delta_{\mu^{\prime}_{1},\mu_{2}}\delta_{\mu^{\prime}_{2},\mu_{1}}\delta_{\ell_{1},\ell_{2}}\Big{)}\hskip 99.58464pt
+[δs1,s2δ1,2(δμ1,μ2b^s1,μ1,1b^s2,μ2,2+(12))\displaystyle+\bigg{[}\delta_{s_{1},s_{2}}\delta_{\ell_{1},\ell_{2}}\Big{(}\delta_{\mu^{\prime}_{1},\mu_{2}}\hat{b}^{\dagger}_{-s_{1},\mu_{1},\ell_{1}}\hat{b}_{-s_{2},\mu^{\prime}_{2},\ell_{2}}+(1\longleftrightarrow 2)\Big{)}
(δμ1,μ1b^s2,μ2,2b^s2,μ2,2+(12))]\displaystyle-\Big{(}\delta_{\mu^{\prime}_{1},\mu_{1}}\hat{b}^{\dagger}_{-s_{2},\mu_{2},\ell_{2}}\hat{b}_{-s_{2},\mu^{\prime}_{2},\ell_{2}}+(1\longleftrightarrow 2)\Big{)}\bigg{]}
+b^s1,μ1,1b^s2,μ2,2b^s2,μ2,2b^s1,μ1,1\displaystyle+\hat{b}^{\dagger}_{-s_{1},\mu_{1},\ell_{1}}\hat{b}^{\dagger}_{-s_{2},\mu_{2},\ell_{2}}\hat{b}_{-s_{2},\mu^{\prime}_{2},\ell_{2}}\hat{b}_{-s_{1},\mu^{\prime}_{1},\ell_{1}} (338)

When used into V^gg\hat{V}_{gg} given in Eq. (172), we get this operator as

V^gg=Vg𝒱g(μ,μ)b^s,μ,b^s,μ,+\hat{V}_{gg}=V_{g}-\sum\mathcal{V}_{g}(\mu^{\prime},\mu)\,\hat{b}^{\dagger}_{s,\mu,\ell}\hat{b}_{s,\mu^{\prime},\ell}+\cdots (339)

The constant part VgV_{g} is given by

Vg\displaystyle V_{g} =\displaystyle= (δμ1,μ1δμ2,μ2δs1,s2δμ1,μ2δμ2,μ1δ1,2)𝒱𝐑1𝐑2(μ2,gμ2,gμ1,gμ1,g)\displaystyle\sum\big{(}\delta_{\mu^{\prime}_{1},\mu_{1}}\delta_{\mu^{\prime}_{2},\mu_{2}}-\delta_{s_{1},s_{2}}\delta_{\mu^{\prime}_{1},\mu_{2}}\delta_{\mu^{\prime}_{2},\mu_{1}}\delta_{\ell_{1},\ell_{2}}\big{)}\mathcal{V}_{{\bf R}_{\ell_{1}}-{\bf R}_{\ell_{2}}}\left(\begin{smallmatrix}\mu^{\prime}_{2},g&\mu_{2},g\\ \mu^{\prime}_{1},g&\mu_{1},g\end{smallmatrix}\right) (340)
=\displaystyle= Nsμ1,μ2(𝒱𝐑=𝟎(μ2,gμ2,gμ1,gμ1,g)+2𝐑𝟎𝒱𝐑(μ2,gμ2,gμ1,gμ1,g))\displaystyle N_{s}\sum_{\mu_{1},\mu_{2}}\Big{(}\mathcal{V}_{{\bf R}=\bf 0}\left(\begin{smallmatrix}\mu_{2},g&\mu_{2},g\\ \mu_{1},g&\mu_{1},g\end{smallmatrix}\right)+2\sum_{{\bf R}\not=\bf 0}\mathcal{V}_{{\bf R}}\left(\begin{smallmatrix}\mu_{2},g&\mu_{2},g\\ \mu_{1},g&\mu_{1},g\end{smallmatrix}\right)\Big{)}
+Nsμ1μ2(𝒱𝐑=𝟎(μ2,gμ2,gμ1,gμ1,g)𝒱𝐑=𝟎(μ1,gμ2,gμ2,gμ1,g))\displaystyle+N_{s}\sum_{\mu_{1}\not=\mu_{2}}\Big{(}\mathcal{V}_{{\bf R}=\bf 0}\left(\begin{smallmatrix}\mu_{2},g&\mu_{2},g\\ \mu_{1},g&\mu_{1},g\end{smallmatrix}\right)-\mathcal{V}_{{\bf R}=\bf 0}\left(\begin{smallmatrix}\mu_{1},g&\mu_{2},g\\ \mu_{2},g&\mu_{1},g\end{smallmatrix}\right)\Big{)}

while the prefactors 𝒱g(μ,μ)\mathcal{V}_{g}(\mu^{\prime},\mu) in the one-body part of V^gg\hat{V}_{gg} are given by

𝒱g(μ,μ)\displaystyle\mathcal{V}_{g}(\mu^{\prime},\mu) =\displaystyle= μ1(𝒱𝐑=𝟎(μ,gμ,gμ1,gμ1,g)+2𝐑𝟎𝒱𝐑(μ,gμ,gμ1,gμ1,g))\displaystyle\sum_{\mu_{1}}\Big{(}\mathcal{V}_{{\bf R}=\bf 0}\left(\begin{smallmatrix}\mu^{\prime},g&\mu,g\\ \mu_{1},g&\mu_{1},g\end{smallmatrix}\right)+2\sum_{{\bf R}\not=\bf 0}\mathcal{V}_{{\bf R}}\left(\begin{smallmatrix}\mu^{\prime},g&\mu,g\\ \mu_{1},g&\mu_{1},g\end{smallmatrix}\right)\Big{)} (341)
+μ1(μ,μ)(𝒱𝐑=𝟎(μ,gμ,gμ1,gμ1,g)𝒱𝐑=𝟎(μ,gμ1,gμ1,gμ,g))\displaystyle+\sum_{\mu_{1}\not=(\mu^{\prime},\mu)}\Big{(}\mathcal{V}_{{\bf R}=\bf 0}\left(\begin{smallmatrix}\mu^{\prime},g&\mu,g\\ \mu_{1},g&\mu_{1},g\end{smallmatrix}\right)-\mathcal{V}_{{\bf R}=\bf 0}\left(\begin{smallmatrix}\mu^{\prime},g&\mu_{1},g\\ \mu_{1},g&\mu,g\end{smallmatrix}\right)\Big{)}

References

  • [1] D. I. Dexter, and R. S. Knox, Excitons, Wiley (New York, 1981).
  • [2] G. H. Wannier, Phys. Rev. 52 (1937) 191.
  • [3] J. Frenkel, Phys. Rev. 37 (1931) 17.
  • [4] J. Frenkel, Phys. Rev. 37 (1931) 1276.
  • [5] R. S. Knox, Theory of Excitons, Academic Press (New York, 1963).
  • [6] M. Combescot, and S.-Y. Shiau, Excitons and Cooper Pairs, Oxford University Press (Oxford, 2015).
  • [7] G. Bastard, Wave Mechanics Applied to Semiconductor Heterostructures, Wiley (New York, 1988).
  • [8] P. Y. Yu, and M. Cardona, Fundamentals of Semiconductors, 3rd ed., Springer (Berlin, 2005).
  • [9] V. M. Agranovich, Sov. Phys.-Solid State 3 (1961) 592.
  • [10] V. M. Agranovich, Excitations in Organic Solids, Clarendon Press (Oxford, 2008).
  • [11] A. S. Davydov, Theory of Molecular Excitons, Plenum Press (New York, 1971).
  • [12] M. Combescot, F. Dubin, and S.-Y. Shiau, Europhys. Lett. 138 (2022) 36002.
  • [13] M. Alloing, M. Beian, M. Lewenstein, D. Fuster, Y. Gonzalez, L. Gonzalez, R. Combescot, M. Combescot, and F. Dubin, Europhys. Lett. 107 (2014) 10012.
  • [14] Y. Onodera, J. Phys. Soc. Jpn. 49 (1980) 1845.
  • [15] L. C. Andreani, F. Bassani, and A. Quattropani, Il Nuovo Cimento D 10 (1988) 1473.
  • [16] L. C. Andreani, Confined Electrons and Photons, Edited by E. Burstein and C. Weisbuch, Plenum Press (New York, 1995).
  • [17] M. Combescot, and W. Pogosov, Phys. Rev. B 77 (2008) 085206.
  • [18] M. A. Castellanos, A. Dodin, and A. P. Willard, Phys. Chem. Chem. Phys. 22 (2020) 3048.
  • [19] B. Yurke, R. Elliott, and A. Sup, Phys. Rev. A 107 (2023) 012603.
  • [20] F. Fassioli, R. Dinshaw, P. C. Arpin, and G. D. Scholes, J. R. Soc. Interface 11 (2014) 20130901.
  • [21] R. E. Blankenship, Molecular Mechanisms of Photosynthesis, 3rd ed., John Wiley and Sons Ltd, 2021.
  • [22] D.-L. Jiang, and T. Aida, Nature 388 (1997) 454.
  • [23] T. Minami, S. Tretiak, V. Chernyak, and S. Mukamel, J. Lumin. 87-89 (2000) 115.
  • [24] F. C. Spano, Acc. Chem. Res. 43 (2010) 429.
  • [25] H. Zhao, Y. Zhao, Y. Song et al, Nat. Commun. 10 (2019) 5589.
  • [26] J. Singh (editor), Organic Light Emitting Devices, Intech (Croatia, 2012).
  • [27] S. Günes, H. Neugebauer, and N. S. Sariciftci, Chem. Rev. 107 (2007) 1324.
  • [28] G. Li, R. Zhu, and Y. Yang, Nature Photonics 6 (2012) 153.
  • [29] J. Singh (editor), An introduction to Charge Carriers, Nova Sci. Publishers (New York, 2022)
  • [30] S.-Y. Shiau, and M. Combescot, Semiconductors 55 (2021) 790.
  • [31] M. Combescot, and S.-Y. Shiau, J. Phys.: Condens. Matter 34 (2022) 205502.
  • [32] V. Chernyak, and S. Mukamel, J. Opt. Am. B 13 (1996) 1302.
  • [33] S. Mukamel, Principles of Nonlinear Optics and Spectroscopy, Oxford University Press (New York, 1995).
  • [34] Th. Förster, Discuss. Faraday Soc. 27 (1959) 7.
  • [35] G. A. Jones, and D. S. Bradshaw, Front. Phys. 7 (2019) 1.
  • [36] M. Born, and M. Bradburn, Proc. Cambridge Phil. 39 (1943) 104.
  • [37] M. H. Cohen, and F. Keffer, Phys. Rev. 99 (1955) 1128.
  • [38] R. S. Knox, J. Phys. Chem. Solids 9 (1959) 238.
  • [39] V. Salie, and E. E. Koch, Phys. Rev. B 20 (1979) 784.
  • [40] W. Andreoni, F. Perrot, and F. Bassani, Phys. Rev. B 14 (1976) 3589.
  • [41] R. S. Knox, J. Phys. Chem. Solids 9 (1959) 265.
  • [42] W. Andreoni, M. Altarelli, and F. Bassani, Phys. Rev. B 11 (1975) 2352.
  • [43] W. R. Heller, and A. Marcus, Phys. Rev. 84 (1951) 809.
  • [44] U. Fano, Phys. Rev. 118 (1960) 451.
  • [45] M. Combescot, T. Amand, and S.-Y. Shiau, unpublished.
  • [46] D. L. Dexter, J. Chem. Phys. 21 (1953) 836.
  • [47] M. Combescot, X. Leyronas, and C. Tanguy, Euro. Phys. J. B 31 (2003) 17.
  • [48] W. Pogosov, and M. Combescot, JETP Letters 92 (2010) 484.
  • [49] M. Combescot, and W. Pogosov, Eur. Phys. J. B 68 (2009) 161.
  • [50] M. Combescot, O. Betbeder-Matibet, and F. Dubin, Phys. Rep. 463 (2008) 215.
  • [51] M. Combescot, and O. Betbeder-Matibet, Phys. Rev. Lett. 104 (2010) 206404; arXiv:0902.4588v1.
  • [52] S.-Y. Shiau, and M. Combescot, Phys. Rev. A 95 (2017) 013838.
  • [53] V. M. Agranovich, and B. S. Toshich, Sov. Phys. JETP 26 (1968) 104.
  • [54] V. Chernyak, S. Yokojima, T. Meier, and S. Mukamel, Phys. Rev. B 58 (1998) 4496.
  • [55] W. Pogosov, and M. Combescot, Eur. Phys. J. B 68 (2009) 183.
  • [56] D. E. Cooper, and M. D. Fayer, J. Chem. Phys. 68 (1978) 229.