This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Dispersion interactions between semiconducting wires

Alston J. Misquitta Cavendish Laboratory, 19, J J Thomson Avenue Cambridge CB3 0HE, U.K.    James Spencer University Chemical Laboratory, Lensfield Road, Cambridge CB2 1EW, U.K. Department of Physics and Thomas Young Centre, Imperial College London, Exhibition Road, London SW7 2AZ, U.K    Anthony J. Stone University Chemical Laboratory, Lensfield Road, Cambridge CB2 1EW, U.K.    Ali Alavi University Chemical Laboratory, Lensfield Road, Cambridge CB2 1EW, U.K.
(August 11, 2025)
Abstract

The dispersion energy between extended molecular chains (or equivalently infinite wires) with non-zero band gaps is generally assumed to be expressible as a pair-wise sum of atom-atom terms which decay as R6R^{-6}. Using a model system of two parallel wires with a variable band gap, we show that this is not the case. The dispersion interaction scales as z5z^{-5} for large interwire separations zz, as expected for an insulator, but as the band gap decreases the interaction is greatly enhanced; while at shorter (but non-overlapping) separations it approaches a power-law scaling given by z2z^{-2}, i.e. the dispersion interaction expected between metallic wires. We demonstrate that these effects can be understood from the increasing length scale of the plasmon modes (charge fluctuations), and their increasing contribution to the molecular dipole polarizability and the dispersion interaction, as the band gaps are reduced. This result calls into question methods which invoke locality assumptions in deriving dispersion interactions between extended small-gap systems.

pacs:
34.20.Gj 73.22.-f 31.15.ap

I Introduction

Conventionally, the dispersion interaction between two systems is formulated in terms of correlated fluctuations described by local (frequency-dependent) polarizabilities which gives rise to the familiar atom–atom C6abRab6-C_{6}^{ab}R_{ab}^{-6} interaction at leading order Stone (1996). This form of the interaction has been used with a good measure of success in studies of many systems, from gases to solids, including complexes of biological molecules and organic molecules. These are typically insulators; that is, they have large HOMO–LUMO gaps.

It has been known for a very long time that the additive atom–atom form of the dispersion does not hold very well for metallic systems. The classic case of an atom interacting with a thin metallic surface shows deviations from the additive law: the correct treatment of the metal results in a R3R^{-3} power law Casimir and Polder (1948), but summing over atom–atom terms leads to a R4R^{-4} interaction. More recently, strong deviations from additivity have been demonstrated in the interactions of extended systems of metals or zero band-gap materials with at least one nano-scale dimension Dobson (2007); Dobson et al. (2006, 2001, 2004); Drummond and Needs (2007).

The semi-classical picture is useful for understanding the source of the differences in the metallic and insulating cases. In an insulator, electronic perturbations decay exponentially with distance. We can therefore treat electron correlations as being local. At lowest order, these local fluctuations give rise to instantaneous local dipoles, and the correlation between these local dipoles gives rise to the atom–atom R6R^{-6} interaction. However, in a zero band-gap material, electronic fluctuations are long-ranged, particularly if one or two of the dimensions are nano-scale Dobson (2007). It is these long-ranged fluctuations that give rise to the non-additivity of the dispersion and deviations from the atom–atom R6R^{-6} interaction.

While the insulating and metallic cases are now well understood, little is known of the intermediate, semi-conducting case. The nature of the dispersion interaction between extended molecules with finite but small HOMO–LUMO gaps less than about 0.2 a.u. (5 eV) remains an open question. A large number of important nano-molecules fall into this category, such as carbon nano-tubes and the ‘lander’-type molecules that are used as organic conductors. The electronic structure of materials made of these molecules depends strongly on structures they assume in the bulk, often via self-assembly. Consequently it is very important to understand exactly how these molecules interact. The dispersion interaction is the dominant source of attraction between these π\pi-conjugated systems. For want of a clearer understanding, many studies assume the usual insulating case for the atom–atom R6R^{-6} form of this interaction. As we shall show in this article, this is both qualitatively and quantitatively incorrect for semiconducting molecules.

A word of clarification: we use the term ‘non-additive’ here to describe deviations from the additive atom-atom picture of the dispersion interaction between two molecules. It is also commonly used to refer to the deviation from pair additivity seen in the interactions of three or more distinct molecules, but we are not concerned with such effects here, except for a brief comment in the Discussion.

II Interacting wires using Hückel (Tight-binding) theory

The dispersion energy appears at second-order in intermolecular perturbation theory and is formally expressed in terms of the exact eigenstates and eigenenergies of the non-interacting systems Stone (1996). In a mean-field theory the dispersion energy between two subsystems (AA and BB), can be expressed as a sum over the single-electron wavefunctions localised to each subsystem:

Edisp(2)=iA,jBaA,bB|ij|r121|ab|2εi+εjεaεb,E^{(2)}_{\rm disp}=\sum_{i\in A,j\in B}\sum_{a\in A,b\in B}\frac{|\langle ij|r_{12}^{-1}|ab\rangle|^{2}}{\varepsilon_{i}+\varepsilon_{j}-\varepsilon_{a}-\varepsilon_{b}}, (1)

where i,ji,j (a,ba,b) are occupied (virtual) single-particle wavefunctions in either subsystem AA or BB, and εi\varepsilon_{i} is the eigenvalue of the ii-th wavefunction. Assuming two parallel wires, aligned parallel to the xx axis and separated by a distance zz, the integral which appears above has the form:

ij|r121|ab=ψi(x1)ψj(x2)ψa(x1)ψb(x2)|x12𝐱^+z𝐳^|𝑑x1𝑑x2\langle ij|r_{12}^{-1}|ab\rangle=\int\frac{\psi^{*}_{i}(x_{1})\psi^{*}_{j}(x_{2})\psi_{a}(x_{1})\psi_{b}(x_{2})}{|x_{12}\hat{\mathbf{x}}+z\hat{\mathbf{z}}|}dx_{1}dx_{2} (2)

and is the Coulomb interaction between the charge density ψiψa\psi_{i}^{*}\psi_{a} due to excitation iai\rightarrow a in subsystem AA with ψjψb\psi_{j}^{*}\psi_{b} in BB.

In periodic systems, the wavefunction can be expressed in Bloch form, i.e. ψj(x)=eikjxuj(x)\psi_{j}(x)=e^{ik_{j}x}u_{j}(x), and so the co-density ψi(x)ψa(x)=ei(kaki)xui(x)ua(x)\psi_{i}^{*}(x)\psi_{a}(x)=e^{i(k_{a}-k_{i})x}u_{i}^{*}(x)u_{a}(x) can have a long-wavelength modulation, whose electrostatic field will not be well described by a multipole expansion at separations comparable to this length scale. Furthermore, in small-gap systems, these excitations have plasmon character and contribute significantly to Edisp(2)E^{(2)}_{\rm disp}, as we discuss below.

Hückel (tight-binding) theory gives us a convenient formalism for evaluating the above expression for interactions between two one-dimensional wires. We considered a two-band model Hamiltonian of the form

H=in(βa2ia2i1+βa2i+1a2i+h.c.),H=\sum_{i}^{n}(\beta{a^{\dagger}}_{2i}a_{2i-1}+\beta^{\prime}{a^{\dagger}}_{2i+1}a_{2i}+h.c.), (3)

where β,β\beta,\beta^{\prime} are alternating bond-strengths between adjacent sites, as would be encountered in a chain of (H2)n or a π\pi-conjugated polyene. We computed the interactions between two such parallel wires, each consisting of 2n2n identical atoms, equally spaced at intervals of dd, such that the unit cell is of length 2d2d and contains two atoms. Assuming periodic boundary conditions over a crystal cell of length 2dn2dn, then there are two bands per wavevector. The single-particle wavefunctions and energies of sych a system can be found analytically. The band structure is given by

ϵ(k)=±|βeikd+βeikd|\epsilon(k)=\pm\left|\,\beta e^{ikd}+\beta^{\prime}e^{-ikd}\right| (4)

and the set of wavevectors by

k=πjnd;j=n2+1,n2+2,,n2.k=\frac{\pi j}{nd};\quad j=-\frac{n}{2}+1,-\frac{n}{2}+2,\cdots,\frac{n}{2}. (5)

β=β\beta=\beta^{\prime} corresponds to a uniform wire with energy eigenvalues given by ϵ(k)=±2β|cos(kd)|\epsilon(k)=\pm 2\beta|\cos(kd)|, which in the limit of large nn has a vanishing band-gap at half-filling (i.e. is a metal). The opposite limit (β=0\beta^{\prime}=0) corresponds to a chain of isolated dimers with energy eigenvalues ϵ(k)=±β\epsilon(k)=\pm\beta, independent of wavevector, and corresponds to the perfect insulator. By varying the ratio β/β\beta^{\prime}/\beta between 0 and 1, we can very conveniently probe the dispersion interaction in the intermediate (semiconducting) regime, with the band-gap given by ΔEg=2(ββ)\Delta E_{g}=2(\beta-\beta^{\prime}). The required integrals can be evaluated Spencer (2009) as:

ij|r121|ab=1ndGK0(Gz)Yia(G)Yjb(ki+kjkakbG)\langle ij|r_{12}^{-1}|ab\rangle=\frac{1}{nd}\sum_{G}K_{0}(Gz)Y_{ia}(G)Y_{jb}(k_{i}+k_{j}-k_{a}-k_{b}-G) (6)

where {G}\{G\} are reciprocal lattice vectors of the primitive unit cell and Yia(G)Y_{ia}(G) is the (analytic) Fourier transform of uiuau_{i}^{*}u_{a} obtained by assuming highly-localised basis functions on each atom. K0K_{0} is the zero-th order modified Bessel function of the second kind and is the Fourier transform of the potential generated by a one-dimensional lattice at a field point at zz from the latticeEpstein (1903, 1906); Tosi (1964). K0K_{0} decays exponentially with increasingly large arguments and so typically only the 10 smallest reciprocal lattice vectors need to be included in the summation in order to obtain converged results.

The calculations presented here use 8401 Monkhorst-Pack kk-points to sample the Brillouin zone, which corresponds in real-space to wires consisting of 16802 sites per crystal cell. Such fine k-point sampling was required to obtained converged results with respect to system size.

Refer to caption
Figure 1: The second-order dispersion energy of parallel wires modelled using Hückel theory obtained from eq. (1). The bonding interaction is of alternating strength and is controlled by the ratio β/β\beta^{\prime}/\beta. The lines are numerical power-law fits to the small-zz and large-zz data points. In the metallic (β=β\beta=\beta^{\prime}) and perfect insulating (β=0\beta^{\prime}=0) cases, a single power-law fits the whole range of zz, with the expected exponents 2-2 and 5-5 respectively. The intermediate cases show a cross-over between these two limiting regimes as zz is varied.

As shown in fig. 1, the dispersion interaction between two uniform wires (β/β=1\beta^{\prime}/\beta=1) varies as z2\sim z^{-2} across the whole range of zz, in good agreement with previously obtained results via RPADobson (2007) and QMCDrummond and Needs (2007) for metallic wires. The interaction between chains of isolated dimers (β=0\beta^{\prime}=0) varies as z5z^{-5}, as would be expected from the conventional picture of dispersion interactions. In the semiconducting regime, there is no unique exponent which characterises the dispersion interaction over all zz. However, at small separations zz, it tends to the metallic behaviour (z2z^{-2}), whereas large zz it tends to insulating behaviour z5z^{-5}.

III Interacting H2 chains using SAPT(DFT)

The principal drawback of the Hückel model is the lack of electron correlation, and hence screening, within each subsystem. For more realistic calculations we can use the symmetry-adapted perturbation theory based on density-functional theory Misquitta et al. (2003); Hesselmann et al. (2005); Misquitta et al. (2005) (SAPT(DFT)), where the dispersion energy is evaluated not via a sum-over-states but through a coupled Kohn–Sham formulation based on the density response functions of the interacting moleculesLonguet-Higgins (1965):

Edisp(2)\displaystyle E^{(2)}_{\rm disp} =12π0𝑑w𝑑𝐫1𝑑𝐫1𝑑𝐫2𝑑𝐫2\displaystyle=-\frac{1}{2\pi}\int_{0}^{\infty}dw\int d{\mathbf{r}}_{1}d{\mathbf{r}}_{1}^{\prime}d{\mathbf{r}}_{2}d{\mathbf{r}}_{2}^{\prime}
αA(𝐫1,𝐫1;iw)αB(𝐫2,𝐫2;iw)|𝐫1𝐫2||𝐫1𝐫2|\displaystyle\qquad\frac{\alpha_{A}({\mathbf{r}}_{1},{\mathbf{r}}^{\prime}_{1};{\rm i}w)\alpha_{B}({\mathbf{r}}_{2},{\mathbf{r}}^{\prime}_{2};{\rm i}w)}{|{\mathbf{r}}_{1}-{\mathbf{r}}_{2}||{\mathbf{r}}_{1}^{\prime}-{\mathbf{r}}_{2}^{\prime}|} (7)

where αA/B(𝐫,𝐫;w)\alpha_{A/B}({\mathbf{r}},{\mathbf{r}}^{\prime};w) are the frequency-dependent density response functions of the molecules which describe the propagation of a frequency-dependent perturbation in a molecule, from one point to another, within linear-response theory. The second-order dispersion energy is exact in this formulation if exchange effects are neglected.

Refer to caption
Figure 2: Distorted (H2)n-chains. The distortion parameter is defined as η=y/x\eta=y/x. In all our chains we have chosen x=1.4487x=1.4487 a.u.

We have studied the interactions between two parallel finite (H2)n (fig. 2) chains with n=16n=16 and 3232 and distortion parameters η=2.0,1.5,1.25\eta=2.0,1.5,1.25 and 1.01.0, where η\eta is the ratio of the alternate bond lengths. The HOMO-LUMO gap of a hydrogen chain can be modified by simply introducing a difference in alternate bond lengths. The distortion parameter η\eta gives us a convenient way to control the electronic structure of the chain and allows us to study the interactions between insulating, semiconducting and (near) metallic chains in one framework.

The frequency-dependent density response functions that appear in eq. (7) were evaluated with coupled Kohn–Sham perturbation theory, using the PBE0 functional with a hybrid kernel consisting of 75% adiabatic LDA and 25% coupled Hartree-Fock. The accuracy of this approach for the dispersion energy of small molecules surpasses that of Møller-Plesset perturbation theory and rivals that of coupled-cluster methods Misquitta et al. (2005); Hesselmann and Jansen (2003); Hesselmann et al. (2005); Podeszwa and Szalewicz (2005). Furthermore, there is no qualitative change in our results when the fraction of HF exchange is increased, so the results presented here are unlikely to be artifacts of the shortcomings of coupled Kohn–Sham perturbation theory Champagne et al. (1998); Sekino et al. (2007). All calculations used the cc-pVDZ basis which will result in a significant underestimation of the strength of the contribution to the dispersion energy that arises from transverse polarization, but should better describe the contribution arising from the longitudinal polarization. Since it is the longitudinal polarization that is most important in these systems, we expect our results to be qualitatively correct.

Dispersion energies are displayed in fig. 3 and the effective power laws for the physically important separations between 6 and 20 a.u. are shown in table 1 together with HOMO–LUMO gaps. For insulating chains with an additive dispersion interaction we would expect two regimes determined by chain length LL: for zLz\ll L we should expect the infinite chain result, i.e., the effective dispersion interaction should decay as z5z^{-5}, and for zLz\gg L we should recover the usual z6z^{-6} power law. This is what we see with the chains with the largest distortion parameter, η=2.0\eta=2.0, which exhibit large HOMO–LUMO gaps. As the distortion parameter η\eta approaches unity and the HOMO–LUMO gap consequently decreases, the deviation from the additive insulating case becomes increasingly apparent, and for the longest of the chains considered here the dispersion interaction is significantly enhanced. For example at 4040 a.u. (roughly half the chain length) an η=1\eta=1 chain has a dispersion interaction two orders of magnitude larger than an η=2\eta=2 chain. Additionally, we see increasing finite-size effects: the power laws for the n=16n=16 and 3232 chains, which were very similar for η=2.0\eta=2.0, are considerably different for η=1.0\eta=1.0.

Refer to caption
Figure 3: The second-order dispersion energy of pairs of parallel (H2)n chains. Energies have been normalized by the number of H2 units nn. The solid lines are dispersion energies for chains with n=32n=32 and the dashed lines for chains with n=16n=16 (only η\eta =1.0 and 2.0).
Table 1: Power-law behaviour of Edisp(2)E^{(2)}_{\rm disp} fitted to the form zxz^{-x} in the region from 6 to 20 a.u. Beyond 600 a.u. all chains exhibit dispersion energies with the z6z^{-6} power law because of their finite length. The HOMO–LUMO gaps, ΔEg\Delta E_{g}, have been calculated from the Kohn–Sham eigenvalues. Energies are in atomic units.
η\eta (H2)16 (H2)32
ΔEg\Delta E_{g} xx ΔEg\Delta E_{g} xx
2.0 0.370 4.89 0.366 4.84
1.5 0.280 4.58 0.270 4.50
1.25 0.202 4.20 0.183 4.09
1.0 0.099 3.52 0.057 3.17

IV Multipole expansion

We can understand these effects in terms of the multipole expansion. The multipole form of the dispersion energy is usually formulated in terms of (local) atomic polarizabilities. This leads to the usual R6R^{-6} atom–atom interaction (at leading order) and would not account for the anomalous dispersion power laws that we have observed in the chains. However the complete distributed-polarizability description is non-local: that is, it describes the change in multipole moments at one atom in response to a change in electrostatic fields at another.

We obtain the multipole form of eq. (7) by expanding the Coulomb terms using the distributed form of the multipole expansion |𝐫𝐫|1=Q^taTtuabQ^ub|{\mathbf{r}}-{\mathbf{r}^{\prime}}|^{-1}=\hat{Q}^{a}_{t}T^{ab}_{tu}\hat{Q}^{b}_{u} where aa and bb denote (atomic) sites and tt and uu are multipole indices — 00 for the charge, 10, 11c11c and 11s11s for the components of the dipole, and so on. Q^ta\hat{Q}^{a}_{t} is the multipole moment operator for moment tt of site aa and TtuabT^{ab}_{tu} are the interaction tensors that contain the distance and angular dependence (see Ref.Stone (1996) for details). Inserting this expansion in eq. (7) we obtain the multipole form for the dispersion energy:

Edisp(2)=12πTtuabTtuab0αttaa(iw)αuubb(iw)𝑑w.E^{(2)}_{\rm disp}=-\frac{1}{2\pi}T^{ab}_{tu}T^{a^{\prime}b^{\prime}}_{t^{\prime}u^{\prime}}\int_{0}^{\infty}\alpha^{aa^{\prime}}_{tt^{\prime}}({\rm i}w)\alpha^{bb^{\prime}}_{uu^{\prime}}({\rm i}w)dw. (8)

Here TtuabT^{ab}_{tu} is the interaction function between multipole tt on site aa in subsystem AA and multipole uu on site bb in BB, while αttaa\alpha^{aa^{\prime}}_{tt^{\prime}} are the frequency-dependent non-local polarizabilities for sites aa and aa^{\prime}, and may be expressed in terms of the frequency-dependent density susceptibility asMisquitta and Stone (2006)

αttaa(ω)=aaQ^ta(𝐫)α(𝐫,𝐫|ω)Q^ta(𝐫)d3𝐫d3𝐫.\alpha^{aa^{\prime}}_{tt^{\prime}}(\omega)=\int_{a}\int_{a^{\prime}}\hat{Q}^{a}_{t}({\mathbf{r}})\alpha({\mathbf{r}},{\mathbf{r}^{\prime}}|\omega)\hat{Q}^{a^{\prime}}_{t^{\prime}}({\mathbf{r}^{\prime}}){\mathrm{d}}^{3}{\mathbf{r}}{\mathrm{d}}^{3}{\mathbf{r}^{\prime}}. (9)

There are no assumptions made in deriving eq. (8), other than that spheres enclosing the atomic charge densities on different molecules do not overlap. We have calculated distributed polarizabilities using the constrained density-fitting algorithm Misquitta and Stone (2006).

Eq. (8) involves a quadruple sum over sites and is therefore computationally demanding. To reduce this cost, we normally make a simplification by localizing the non-local polarizabilities. That is, the non-local polarizabilities — the αttaa(ω)\alpha^{aa^{\prime}}_{tt^{\prime}}(\omega) with aaa\neq a^{\prime} — are transformed onto one or other site using the multipole expansion Le Sueur and Stone (1994); Lillestolen and Wheatley (2007). This transformation is possible only if the non-local terms decay fast enough with inter-site distance. If not, the multipole expansion used in the transformation diverges and the localization is no longer possible. As we shall see, this is precisely what happens in the (H2)n chains as the distortion parameter η\eta decreases.

Refer to caption
Figure 4: Matrix representation of the charge-flow polarizability matrix α00,00aa\alpha^{aa^{\prime}}_{00,00} for (H2)32 chains with η=2,1.5,1.25\eta=2,1.5,1.25 and 11 (clockwise, from top left). Sites aa and aa^{\prime} are represented along the xx and yy axes. Because these terms span a number of orders of magnitude of both signs we have plotted ln(|α00,00aa|)\ln(|\alpha^{aa^{\prime}}_{00,00}|). Colour scheme: Black rectangles correspond to terms of order 1 a.u. and white rectangles to terms of order 10310^{-3} a.u.

An important feature of the non-local polarizability description is the presence of charge-flow polarizabilities—terms with tt or u=00u=00 that describe the flow of charge in a molecule—which are usually small and die off quickly with distance. The lowest rank charge-flow polarizability is α00,00aa\alpha^{aa^{\prime}}_{00,00}: if VaV^{a} is the potential at site aa, the change in charge at site aa is given by ΔQ^00a=aα00,00aa(VaVa)\Delta\hat{Q}^{a}_{00}=-\sum_{a^{\prime}}\alpha^{aa^{\prime}}_{00,00}(V^{a^{\prime}}-V^{a}). For a system with a large HOMO–LUMO gap the charge-flow terms are expected to be short-range. The charge-flow polarizabilities of the (H2)32 chain with η=2.0\eta=2.0 can be empirically modelled reasonably well with an exponential, that is,

α00,00aaeγ|raa|,\alpha^{aa^{\prime}}_{00,00}\sim e^{-\gamma|r_{aa^{\prime}}|}, (10)

where raar_{aa^{\prime}} is the intersite distance and γ0.5\gamma\approx 0.5 a.u. In this case, the charge-flow polarizabilities drop by more than an order of magnitude within a few bonds. This is illustrated in fig. 4. However, as we reduce the distortion parameter η\eta the charge-flow polarizabilities decay more and more slowly with site-site distance, until, at η=1.0\eta=1.0, they span the entire length of the chain. Even for the η=1.5\eta=1.5 chain, these non-local charge-flow polarizabilities can no longer be localized without incurring a significant error, but for the chains with η=1.25\eta=1.25 and 1.01.0 localization results in a qualitatively incorrect physical picture.

Moreover the charge-flow polarizabilities contribute to the dipole–dipole polarizability in the direction of the chain, and this contribution increases dramatically as the band-gap decreases. For a 64-atom H22 chain with η=2\eta=2, the static dipole polarizability αxx=α11c,11c\alpha_{xx}=\alpha_{11c,11c} parallel to the chain, calculated as described above, is about 410 a.u., of which 180 a.u. is contributed by charge-flows. When η=1\eta=1, so that the H atoms are equally spaced, αxx\alpha_{xx} is about 11350 a.u., larger by about two orders of magnitude, and all of this increase is attributable to the charge-flow effects.

Refer to caption
Figure 5: The second-order dispersion energy calculated using the polarizabilities of (H2)32 chains with distortion parameters η=1\eta=1 and 22. The contribution from the charge-flow polarizabilities, α00,00aa\alpha^{aa^{\prime}}_{00,00}, is displayed separately from the sum of contributions up to the dipole-dipole polarizabilities.

We have calculated the dispersion energy using eq. (8) with distributed polarizabilities, including terms to rank 1. Higher ranking terms can be included, but these are not needed for the hydrogen chains. These results are presented in fig. 5 for the (H2)32 chains with η=2.0\eta=2.0 and 1.01.0. First of all, consider the insulating chain (η=2.0\eta=2.0): the charge-flow polarizabilities alone severely underestimate the dispersion energy, but when terms up to rank 1 are included the agreement of eq. (8) with the non-expanded SAPT(DFT) value for Edisp(2)E^{(2)}_{\rm disp} is excellent for all inter-chain separations shown in the figure. However, at η=1\eta=1, the dispersion interaction is entirely dominated by the pure charge-flow (i.e. α00,00aa\alpha^{aa^{\prime}}_{00,00}) terms, the higher order terms involving the dipole polarisabilities making a negligible contribution. Since the overall dispersion interaction is greatly enhanced as η\eta is reduced, this implies that these pure charge-flow terms enhance the dispersion interaction in this limit.

Why do the charge-flow terms result in the anomalous power laws? The lowest rank charge-flow terms, α00,00aa\alpha^{aa^{\prime}}_{00,00}, that appear in eq. (8) are associated with TT-functions for the charge-charge interaction: T00,00ab=Rab1T^{ab}_{00,00}=R_{ab}^{-1}, where RabR_{ab} is the distance between site aa on one wire and site bb on the other. If the wires are separated by distance vector 𝐑=(0,0,z){\mathbf{R}}=(0,0,z) and 𝐱a{\mathbf{x}}_{a} and 𝐱b{\mathbf{x}}_{b} are distance vectors for sites aa and bb along the chains then 𝐑ab=𝐑(𝐱a𝐱b){\mathbf{R}}_{ab}={\mathbf{R}}-({\mathbf{x}}_{a}-{\mathbf{x}}_{b}). The dispersion energy arising from just the charge-flow terms is

Edisp(2)(00,00)\displaystyle E^{(2)}_{\rm disp}(00,00) =12πaabb1Rab1Rab\displaystyle=-\frac{1}{2\pi}\sum_{aa^{\prime}}\sum_{bb^{\prime}}\frac{1}{R_{ab}}\frac{1}{R_{a^{\prime}b^{\prime}}}
×0α00,00aa(iw)α00,00bb(iw)dw.\displaystyle\times\int_{0}^{\infty}\alpha^{aa^{\prime}}_{00,00}({\rm i}w)\alpha^{bb^{\prime}}_{00,00}({\rm i}w)dw. (11)

In contrast to the dipole-dipole polarizabilities, the charge-flow polarizabilities satisfy the sum-rule aαt,00aa(w)=0\sum_{a^{\prime}}\alpha^{aa^{\prime}}_{t,00}(w)=0, which is a direct consequence of the charge conservation requirement: αA(𝐫,𝐫;w)d3𝐫=0\int\alpha_{A}({\mathbf{r}},{\mathbf{r}}^{\prime};w){\mathrm{d}}^{3}{\mathbf{r}^{\prime}}=0. This leads to cancellation between the charge-flow contributions to the total dispersion energy, and to an R6R^{-6} distance dependence at long range, but the cancellation is incomplete at short range, and terms in Rn,2n5R^{-n},2\leq n\leq 5 also occur. To see how this arises, consider the following length scales: (1) Lc=1/γL_{c}=1/\gamma, which is a measure of the extent of the charge fluctuations determined by the exponential decay of the charge-flow polarizabilities assumed in eq. (10), and (2) LL, the chain length.

  • zLcz\leq L_{c}: Here the charge-flow terms dominate and contribute R2R^{-2} terms to the dispersion energy. From fig. 4 we see that LcL_{c} is largest for the near-metallic wire for which we see the effects of these terms (fig. 5) to z60z\sim 60 a.u.

  • Lcz<LL_{c}\ll z\mathchar 12604\relax L: In this region the extent of the charge fluctuations is small compared with RabR_{ab} and only those RabR_{a^{\prime}b^{\prime}} close to RabR_{ab} are important. We can therefore expand RabR_{a^{\prime}b^{\prime}} about RabR_{ab} using the multipole expansion. The leading term in this expansion is

    Edisp(2)(00,00)12πab1Rab1Rab\displaystyle E^{(2)}_{\rm disp}(00,00)\approx-\frac{1}{2\pi}\sum_{ab}\frac{1}{R_{ab}}\frac{1}{R_{ab}}
    ×0(aα00,00aa(iw))(bα00,00bb(iw))dw\displaystyle\qquad\times\int_{0}^{\infty}\left(\sum_{a^{\prime}}\alpha^{aa^{\prime}}_{00,00}({\rm i}w)\right)\left(\sum_{b^{\prime}}\alpha^{bb^{\prime}}_{00,00}({\rm i}w)\right)dw
    =0\displaystyle=0 (12)

    where we have used the charge-flow sum rule. So we see that the R2R^{-2} contributions sum to zero in this region. However, the higher-order contributions are non-zero.

  • LzL\ll z: In this limit both RabR_{ab} and RabR_{a^{\prime}b^{\prime}} can be expanded in a multipole expansion about 𝐑=(0,0,z){\mathbf{R}}=(0,0,z): Rab1=|𝐑(𝐱a𝐱b)|1=|𝐑𝐱ab|1z112xab2z3R_{ab}^{-1}=|{\mathbf{R}}-({\mathbf{x}}_{a}-{\mathbf{x}}_{b})|^{-1}=|{\mathbf{R}}-{\mathbf{x}}_{ab}|^{-1}\approx z^{-1}-\frac{1}{2}x_{ab}^{2}z^{-3}, and likewise for Rab1R_{a^{\prime}b^{\prime}}^{-1}. Once again, the leading terms vanish because of the sum rule, leaving only the effective dipole-dipole contribution:

    Edisp(2)(00,00)\displaystyle E^{(2)}_{\rm disp}(00,00) 12π14z6aabbxab2xab2\displaystyle\approx-\frac{1}{2\pi}\frac{1}{4z^{6}}\sum_{aa^{\prime}}\sum_{bb^{\prime}}x_{ab}^{2}x_{a^{\prime}b^{\prime}}^{2}
    ×0α00,00aa(iw)α00,00bb(iw)dw\displaystyle\times\int_{0}^{\infty}\alpha^{aa^{\prime}}_{00,00}({\rm i}w)\alpha^{bb^{\prime}}_{00,00}({\rm i}w)dw
    C6(00,00)z6.\displaystyle\equiv-\frac{C_{6}(00,00)}{z^{6}}. (13)

    This explains the large-zz charge-flow contribution to the dispersion energy shown in fig. 5. Notice that for the near-metallic wires, this contribution to the total molecular C6C_{6} coefficient dominates that from the dipole-dipole polarizabilities, but for the large-gap wires the opposite is true.

In short, the charge-flow polarizabilities give rise to the changes in power-law of the dispersion energy and also contribute to an enhancement of the effective C6C_{6} coefficient, applicable at long range.

V Discussion

The physical picture which emerges from both the Hückel approach and the ab initio calculations is the following.

  • In systems with a finite but small gap, spontaneous charge-fluctuations (plasmon modes in the infinite case) introduce a secondary length-scale intermediate in size to the interatomic distance and the system size. This length scale grows as the gap gets smaller.

  • For separations zz small compared with this length scale, the dispersion energy arising from the correlated fluctuations has metallic character.

  • At zz that are large compared with the length scale of the fluctuations, the dispersion can be described using London’s dipole approximation, giving z5z^{-5} behaviour, but the magnitude of the fluctuations now depends strongly on the band gap, leading to orders of magnitude enhancement over the insulator case.

  • In small-gap systems these fluctuations give rise to a strong non-additivity in the polarizability. For example, the ratio of the longitudinal static polarizabilities for the near-metallic and insulating (H2)32 chains is 28. The dispersion energy is proportional to the square of the polarizability, that is, 784. This is roughly the ratio of the dispersion energies for these two cases but only at separations zz much greater than the chain length. Any attempt to extrapolate this result to shorter distances results in a severe overestimation of the dispersion energy.

  • This effect is not a consequence of retardation (we use the non-relativistic Hamiltonian) or damping (charge-density overlap is negligible). It originates from the complex behaviour of the non-local charge-flow polarizabilities. These are terms of rank zero that describe charge fluctuations in the system and are associated with a delocalized exchange-correlation hole Angyan (2007, 2009). This delocalization can be quantified using the localization tensor Resta and Sorella (1999); Angyan (2009) and may give us a quantitative method for defining the charge fluctuation length-scale, LcL_{c}. We are currently investigating this possibility.

  • We have demonstrated that both the change in power-law with distance and the enhancement of the dispersion energy can be understood using non-local polarizability models containing charge-flow polarizabilities.

  • It should come as no surprise that these effects are also strongly dependent on system size (see fig.3).

These results call into question theoretical methods that impose locality so as to scale linearly with system size (LCCSD(T), LMP2), or that approximate the dispersion energy using a pair-wise C6R6-C_{6}R^{-6} interaction, as is done with dispersion-corrected DFT methods and empirical potentials. In the former, these effects can be included by extending the region of locality, though at the cost of losing linearity in scaling, but the latter methods should not be applied to systems such as these. Even DFT functionals with a non-local dispersion correction, such as the van der Waals functional of Dion et al.Dion et al. (2004) and the more recent functional of Vydrov and Van Voorhis Vydrov and Voorhis (2009) are unlikely to contain the correct physics because of an implicit assumption of locality in the polarizability. These functionals will include many-body non-additive effects between non-overlapping systems, but not the non-additive effects within each system such as those described here. On the other hand, methods based on the random phase approximation and quantum Monte Carlo should be able to describe the non-additive effects described in this paper if finite-size effects are kept under control.

In systems containing carbon atoms, such as π\pi-conjugated chains, the contributions from the core electrons will at least partially mask those from the more mobile π\pi electrons, so it is possible that the changes in power-law of the dispersion interaction will not be as dramatic as those we see in the (H2)n chains. But we should nevertheless expect a high degree of non-additivity arising from the charge-flow terms. One indicator of this non-additivity is the non-linear dependence of the molecular polarizability on system size. This has been already demonstrated above and is further supported by the experimental work of Compagnon et al. Compagnon et al. (2001) on the polarizabilities of the fullerenes C70 and C60 which are found to be in the ratio 1.33 rather than the ratio 70/60=1.1770/60=1.17 that would be expected if the systems were additive. Therefore, if reliable C6C_{6}-models are to be constructed for extended π\pi-conjugated systems, these models will need to absorb the effects of the non-additivity of the charge-flow polarizabilities. That is, they should be tailored to suit the electronic structure of the system rather than be transferred from calculations on smaller systems. The Williams–Stone–Misquitta (WSM) procedure Misquitta and Stone (2008a); Misquitta et al. (2008); Misquitta and Stone (2008b) is one such method that is capable of constructing effective local polarizability and dispersion models that account for the electronic structure of the system. In fact, the non-local models presented in this paper are derived in the first step of the WSM procedure. We are currently investigating the behaviour of WSM dispersion models for a variety of carbon systems.

Finally, strongly delocalized systems will also have important contributions to the dispersion energy from terms of third order in the interaction operator, that is, from the hyperpolarizabilities. We have not considered such terms in this paper. Furthermore, in ensembles of such systems there will be strong non-additive effects between molecules. We are currently investigating the nature and importance of this type of non-additivity.

VI Acknowledgements

We thank University College London for computational resources. AJM and JS thank EPSRC for funding.

References

  • Stone (1996) A. J. Stone, The Theory of Intermolecular Forces (Clarendon Press, Oxford, 1996).
  • Casimir and Polder (1948) H. B. G. Casimir and D. Polder, Phys. Rev.  73, 360 (1948).
  • Dobson (2007) J. F. Dobson, Surface Science 601, 5667 (2007).
  • Dobson et al. (2006) J. F. Dobson, A. White, and A. Rubio, Phys. Rev. Lett.  96, 073201(4) (2006).
  • Dobson et al. (2001) J. F. Dobson, K. McLennan, A. Rubio, J. Wang, T. Gould, H. M. Le, and B. P. Dinte, Aust. J. Chem. 54, 513 (2001).
  • Dobson et al. (2004) J. F. Dobson, J. Wang, B. P. Dinte, K. McLennan, and H. M. Le, Int. J. Quantum Chem.  101, 579 (2004).
  • Drummond and Needs (2007) N. D. Drummond and R. J. Needs, Phys. Rev. Lett.  99, 166401(4) (2007).
  • Spencer (2009) J. Spencer, Ph.D. thesis, University of Cambridge (2009).
  • Epstein (1903) P. Epstein, Mathematische Annalen 56, 615 (1903).
  • Epstein (1906) P. Epstein, Mathematische Annalen 63, 205 (1906).
  • Tosi (1964) M. P. Tosi, Cohesion of ionic solids in the Born model (Academic Press, 1964), vol. XVI of Solid State Physics.
  • Misquitta et al. (2003) A. J. Misquitta, B. Jeziorski, and K. Szalewicz, Phys. Rev. Lett.  91, 33201 (2003).
  • Hesselmann et al. (2005) A. Hesselmann, G. Jansen, and M. Schutz, J. Chem. Phys.  122, 014103 (2005).
  • Misquitta et al. (2005) A. J. Misquitta, R. Podeszwa, B. Jeziorski, and K. Szalewicz, J. Chem. Phys.  123, 214103 (2005).
  • Longuet-Higgins (1965) H. C. Longuet-Higgins, Disc. Farad. Soc. 40, 7 (1965).
  • Hesselmann and Jansen (2003) A. Hesselmann and G. Jansen, Phys. Chem. Chem. Phys.  5, 5010 (2003).
  • Podeszwa and Szalewicz (2005) R. Podeszwa and K. Szalewicz, Chem. Phys. Lett.  412, 488 (2005).
  • Champagne et al. (1998) B. Champagne, E. A. Perpete, S. J. A. van Gisbergen, E.-J. Baerends, J. G. Snijders, C. Soubra-Ghaoui, K. A. Robins, and B. Kirtman, J. Chem. Phys.  109, 10489 (1998).
  • Sekino et al. (2007) H. Sekino, Y. Maeda, M. Kamiya, and K. Hirao, J. Chem. Phys.  126, 014107 (2007).
  • Misquitta and Stone (2006) A. J. Misquitta and A. J. Stone, J. Chem. Phys.  124, 024111 (2006).
  • Le Sueur and Stone (1994) C. R. Le Sueur and A. J. Stone, Mol. Phys.  83, 293 (1994).
  • Lillestolen and Wheatley (2007) T. C. Lillestolen and R. J. Wheatley, J. Phys. Chem. A  111, 11141 (2007).
  • Angyan (2007) J. G. Angyan, J. Chem. Phys.  127, 024108 (2007).
  • Angyan (2009) J. G. Angyan, Int. J. Quantum Chem.  109, 2340 (2009).
  • Resta and Sorella (1999) R. Resta and S. Sorella, Phys. Rev. Lett.  82, 370 (1999).
  • Dion et al. (2004) M. Dion, H. Rydberg, E. Schroder, D. C. Langreth, , and B. I. Lundqvist, Phys. Rev. Lett.  92, 246401 (2004).
  • Vydrov and Voorhis (2009) O. A. Vydrov and T. V. Voorhis, Phys. Rev. Lett.  103, 063004 (2009).
  • Compagnon et al. (2001) I. Compagnon, R. Antoine, M. Broyer, P. Dugourd, J. Lerme, and D. Rayane, Phys. Rev. A  64, 025201 (2001).
  • Misquitta and Stone (2008a) A. J. Misquitta and A. J. Stone, J. Chem. Theory Comput.  4, 7 (2008a).
  • Misquitta et al. (2008) A. J. Misquitta, A. J. Stone, and S. L. Price, J. Chem. Theory Comput.  4, 19 (2008).
  • Misquitta and Stone (2008b) A. J. Misquitta and A. J. Stone, Mol. Phys.  106, 1631 (2008b).