This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.



Efficient calculation of the Green’s function in scattering region for electron-transport simulations

Yoshiyuki Egami y_egami@eng.hokudai.ac.jp Division of Applied Physics, Faculty of Engineering, Hokkaido University, Sapporo, Hokkaido 060-8628, Japan    Shigeru Tsukamoto Peter Grünberg Institut & Institute for Advanced Simulation, Forschungszentrum Jülich and JARA, D-52425 Jülich, Germany    Tomoya Ono Department of Electrical and Electronic Engineering, Kobe University, Kobe, Hyogo 657-8501, Japan
Abstract

We propose a first-principles method of efficiently evaluating electron-transport properties of very long systems. Implementing the recursive Green’s function method and the shifted conjugate gradient method in the transport simulator based on real-space finite-difference formalism, we can suppress the increase in the computational cost, which is generally proportional to the cube of the system length to a linear order. This enables us to perform the transport calculations of double-walled carbon nanotubes (DWCNTs) with 196,608 atoms. We find that the conductance spectra exhibit different properties depending on the periodicity of doped impurities in DWCNTs and they differ from the properties for systems with less than 1,000 atoms.

One-dimensional materials such as nanowires and nanotubes, which have unique electronic properties due to the quantum confinement effect, are expected to be applied to electronic and spintronic devices, optoelectronic circuits, and biosensors.Kong et al. (2000); Duan et al. (2001); Kind et al. (2002); Li et al. (2006); Patolsky et al. (2006); Liao et al. (2006); Ivanovskaya et al. (2007) In recent years, large-scale electron-transport calculations have been indispensable for designing the functionality of electronic devices. Although first-principles calculations based on the density functional theory (DFT)Kohn (1999); *PhysRev136-00B864; *PhysRev140-0A1133 allow us to accurately evaluate electron-transport properties of atomic systems, their target is typically limited to small systems because of the heavy computational cost arising from calculating the Green’s functions, which is intrinsic cubic scaling with system size. Limited-scale system is sufficient to simulate simple systems, such as locally perturbed bulk regions or interfaces. However, it is not suitable for simulating complex systems, such as bulk with a realistic defect density, amorphous structure, and interfaces with lattice mismatches between different crystalline materials. Thus far, to circumvent this restriction, transport calculations for large systems containing several thousands of atoms have been performed within atomic-basis formalism and tight-binding (TB) formalism based on DFT.Gunst et al. (2017); Ducry et al. (2019); Calogero et al. (2019); Jippo et al. (2016) Since the Hamiltonian matrix is dense in these formalisms, the computational cost of the inverse matrix for obtaining the Green’s function matrix by the direct method is proportional to the cube of the matrix dimension. Moreover, there are unavoidable problems such as the incompleteness of basis sets and inefficiency in parallel computing. However, real-space finite-difference (RSFD) formalism is also recognized as suitable for large-scale calculations requiring high computational accuracy because it has a high affinity to massively parallel architectures and can avoid problems arising from the basis sets.Fujimoto and Hirose (2003); Hirose et al. (2005); Ono et al. (2012); Iwase et al. (2015); Egami et al. (2015); Ono and Tsukamoto (2016); Tsukamoto et al. (2017); *PRB97-115450; *PRB98-195422; Egami et al. (2019)

Fujimoto and Hirose developed the overbridging boundary matching (OBM) method based on the RSFD formalismFujimoto and Hirose (2003) by exploiting the advantages for electron-transport calculations, where a whole system is divided along the zz direction (see Fig. 1) into three regions: the left electrode, the transition region and the right electrode. The electron-transport calculation refers only to subsets of the Green’s function matrix of the transition region. In the OBM method, the subsets are calculated efficiently using a shifted conjugate-gradient (SCG) solverIwase et al. (2015) since the Hamiltonian matrix in the RSFD formalism is sparse. In the SCG solver, the computational cost for calculating the subsets is 𝒪(NENxy3Nz){\cal{O}}(N_{E}N_{xy}^{3}N_{z}) when the algorithm proposed by Takayama et al.Takayama et al. (2006) is adopted, where NxyN_{xy} and NzN_{z} are the numbers of grid points in the xyxy plane and the zz direction, respectively, and NEN_{E} denotes the number of energy values EE to be treated at once by the SCG method. For systems with a large NzN_{z}, the computational cost for matrix-vector operations, which is 𝒪(Nxy3Nz2){\cal O}(N_{xy}^{3}N_{z}^{2}), becomes dominant.111Although the computational cost of the subsets of the Green’s function matrix is generally dominant, the cost of coefficient matrix-vector operations dominates in the case that EE becomes close to the eigenvalue of the truncated Hamiltonian matrix. The matrix-vector operation cost also becomes dominant when NzN_{z} is large because the density of the eigenvalues in energy increases. Although the maximum order of the computational cost is reduced from 𝒪(Nxy3Nz3){\cal O}(N_{xy}^{3}N_{z}^{3}) to 𝒪(Nxy3Nz2){\cal O}(N_{xy}^{3}N_{z}^{2}) by the OBM method, the transport calculation has room for further reducing the computational cost to handle more realistic and longer one-dimensional systems.

Refer to caption
Figure 1: Schematic representation of a junction system. The system is partitioned into one transition region and left and right semi-infinite electrodes, whose Hamiltonians are denoted by 𝐇T\mathbf{H}_{\mathrm{T}}, 𝐇L\mathbf{H}_{\mathrm{L}}, and 𝐇R\mathbf{H}_{\mathrm{R}}, respectively. 𝐁L(R)\mathbf{B}_{\mathrm{L(R)}} denotes the interaction between adjoining small parts. The transition region consists of PP parts. The Hamiltonians are denoted by 𝐇i\mathbf{H}_{i} for i=1,,Pi=1,\dots,P, and the interactions between the adjoining parts by 𝐁i\mathbf{B}_{i} for i=1,,P1i=1,\dots,P-1.

In this letter, we propose an efficient computational procedure based on RSFD formalism to evaluate the electron-transport properties of long systems containing more than 100,000 atoms without accuracy deterioration. We verify the computational accuracy and performance of the proposed method. Additionally, to exemplify the efficiency of the proposed procedure we demonstrate large-scale electron-transport calculations for double-walled carbon nanotubes (DWCNTs) composed of 196,608 atoms, which make, to the best of our knowledge, the largest system in the first-principles electron-transport calculation.

Thouless and Kirkpatrick proposed an efficient calculation of subsets of a Green’s function matrix for a transition region within TB formalism as the recursive Green’s function (RGF) method.Thouless and Kirkpatrick (1981) The RGF method constructs a transition region by arranging multiple parts along the zz direction as illustrated in Fig. 1, and calculates only necessary subsets by recursively combining the Green’s function matrices of the adjoining small parts one by one. Since the calculation of the Green’s function matrices for the small parts is less costly than that for the extended transition region, the RGF method is capable of handling large systems and is widely used for calculating the transport properties of atomic-basis models and TB models.Lewenkopf and Mucciolo (2013); *JComputPhys215-000741; Ferry and Goodnick (1997); Sols (1995); Metalidis (2007); Souza et al. (2012); Shin et al. (2016); Rojas et al. (2019)

Now, we discuss the advantage of the RGF method in transport calculations within RSFD formalism. The transport properties are evaluated using the Green’s function of the transition region suspended between semi-infinite electrodes, 𝐆T\mathbf{G}_{\mathrm{T}}, which is constructed by the Green’s function associated with the truncated transition region, 𝒢T\mathbf{\cal G}_{\mathrm{T}}, and the self-energy terms of the electrodes.Ono et al. (2012) The Green’s function matrix 𝒢T=(E𝐈T𝐇T)1\mathbf{\cal G}_{\mathrm{T}}=(E\mathbf{I}_{\mathrm{T}}-\mathbf{H}_{\mathrm{T}})^{-1}, in which 𝐇T\mathbf{H}_{\mathrm{T}} is the truncated Hamiltonian matrix of the extended transition region consisting of PP parts (see Fig. 1), is expressed within the framework of RSFD formalism using the norm-conserving pseudopotentials (NCPPs) as

𝒢T=(E𝐈T[𝐇1𝐁1𝟎𝐁1𝐇2𝐁2

𝐁2𝟎𝐇P
]
)
1
,
\mathbf{\cal G}_{\mathrm{T}}=\left(E\mathbf{I}_{\mathrm{T}}-\left[\begin{array}[]{SSSSSSS}\cline{1-2}\cr\vrule\lx@intercol\vtop{\hbox to0.0pt{\centering\rule[0.0pt]{0.0pt}{0.0pt}\@add@centering}}&\lx@intercol\vtop{\hbox to0.0pt{\centering\rule[0.0pt]{0.0pt}{0.0pt}\@add@centering}}\vrule&\rule[0.0pt]{0.0pt}{0.0pt}\@add@centering&\rule[0.0pt]{0.0pt}{0.0pt}\@add@centering&\rule[0.0pt]{0.0pt}{0.0pt}\@add@centering&\rule[0.0pt]{0.0pt}{0.0pt}\@add@centering&\rule[0.0pt]{0.0pt}{0.0pt}\@add@centering\\ \cline{3-3}\cr\vrule\lx@intercol\hfil\raisebox{0.0pt}{\smash{$\mathbf{H}_{1}$}}\hfil\lx@intercol\vrule\lx@intercol&\lx@intercol\vtop{\hbox to0.0pt{\centering$\mathbf{B}_{1}$\rule[0.0pt]{0.0pt}{0.0pt}\@add@centering}}\vrule&\rule[0.0pt]{0.0pt}{0.0pt}\@add@centering&\rule[0.0pt]{0.0pt}{0.0pt}\@add@centering&\lx@intercol\hfil\raisebox{0.0pt}{\smash{$\mathbf{0}$}}\hfil\lx@intercol\\ \cline{1-4}\cr\lx@intercol\vtop{\hbox to0.0pt{\centering\rule[0.0pt]{0.0pt}{0.0pt}\@add@centering}}\vrule&\lx@intercol\hfil\mathbf{B}_{1}^{{\dagger}}\hfil\lx@intercol\vrule\lx@intercol&\rule[0.0pt]{0.0pt}{0.0pt}\@add@centering&\lx@intercol\vtop{\hbox to0.0pt{\centering\rule[0.0pt]{0.0pt}{0.0pt}\@add@centering}}\vrule&\rule[0.0pt]{0.0pt}{0.0pt}\@add@centering\\ \cline{2-2}\cr\cline{5-5}\cr\rule[0.0pt]{0.0pt}{0.0pt}\@add@centering&\lx@intercol\hfil\hfil\lx@intercol\vrule\lx@intercol&\lx@intercol\hfil\raisebox{0.0pt}{\smash{$\mathbf{H}_{2}$}}\hfil\lx@intercol\vrule\lx@intercol&\lx@intercol\vtop{\hbox to0.0pt{\centering$\mathbf{B}_{2}$\rule[0.0pt]{0.0pt}{0.0pt}\@add@centering}}\vrule&\rule[0.0pt]{0.0pt}{0.0pt}\@add@centering\\ \cline{3-5}\cr\rule[0.0pt]{0.0pt}{0.0pt}\@add@centering&\rule[0.0pt]{0.0pt}{0.0pt}\@add@centering&\lx@intercol\vtop{\hbox to0.0pt{\centering\rule[0.0pt]{0.0pt}{0.0pt}\@add@centering}}\vrule&\lx@intercol\vtop{\hbox to0.0pt{\centering$\mathbf{B}_{2}^{{\dagger}}$\rule[0.0pt]{0.0pt}{0.0pt}\@add@centering}}\vrule&$\ddots$\rule[0.0pt]{0.0pt}{0.0pt}\@add@centering&$\ddots$\rule[0.0pt]{0.0pt}{0.0pt}\@add@centering&\rule[0.0pt]{0.0pt}{0.0pt}\@add@centering\\ \cline{4-4}\cr\cline{6-7}\cr\rule[0.0pt]{0.0pt}{0.0pt}\@add@centering&\rule[0.0pt]{0.0pt}{0.0pt}\@add@centering&\rule[0.0pt]{0.0pt}{0.0pt}\@add@centering&\rule[0.0pt]{0.0pt}{0.0pt}\@add@centering&\lx@intercol\vtop{\hbox to0.0pt{\centering\smash{$\ddots$}\rule[0.0pt]{0.0pt}{0.0pt}\@add@centering}}\vrule&\rule[0.0pt]{0.0pt}{0.0pt}\@add@centering&\lx@intercol\vtop{\hbox to0.0pt{\centering\rule[0.0pt]{0.0pt}{0.0pt}\@add@centering}}\vrule\\ \lx@intercol\hfil\raisebox{0.0pt}{\smash{$\mathbf{0}$}}\hfil\lx@intercol&\rule[0.0pt]{0.0pt}{0.0pt}\@add@centering&\rule[0.0pt]{0.0pt}{0.0pt}\@add@centering&\lx@intercol\vtop{\hbox to0.0pt{\centering\rule[0.0pt]{0.0pt}{0.0pt}\@add@centering}}\vrule&\lx@intercol\hfil\raisebox{0.0pt}{\smash{$\mathbf{H}_{P}$}}\hfil\lx@intercol\vrule\lx@intercol\\ \cline{6-7}\cr\end{array}\right]\right)^{-1},
(1)

where 𝐇i\mathbf{H}_{i} is the truncated Hamiltonian matrix for the transition region of the iith part. For simplicity, we assume that all 𝐇i\mathbf{H}_{i} (i=1,,P)i=1,\ldots,P) have the same dimension, the interactions between the second- and further-neighboring parts are zero, and the non-zero matrix, 𝐁i\mathbf{B}_{i}, represents the interaction between the adjoining parts. We define the dimensions of 𝐁i\mathbf{B}_{i} and 𝐇i\mathbf{H}_{i} as Mi×NiM_{i}\times N_{i} and NxyNz(part)N_{xy}N_{z}^{(\text{part})}, respectively, with Nz(part)=Nz/PN_{z}^{(\text{part})}=N_{z}/P; 𝐈T\mathbf{I}_{\mathrm{T}} denotes an NxyNzN_{xy}N_{z}-dimensional identity matrix.

Since the Hamiltonian matrix of RSFD formalism is highly sparse, it is more advantageous to calculate the subsets of Green’s function matrix of the transition region by iterative solvers than it is to compute the entire Green’s function by direct inversions of the Hamiltonian matrix. According to Ref. Ono et al., 2012, the electron-transport properties can be estimated using submatrices located at the four corners of the Green’s function matrix, 𝒢T\mathbf{\cal G}_{\mathrm{T}}. Now we will combine the two adjoining parts to derive the Green’s function matrix associated with a truncated Hamiltonian composed of the iith and (i+1i+1)th parts, which is denoted by the 2NxyNz(part)2N_{xy}N_{z}^{(\text{part})}-dimensional matrix 𝒢i:i+1\mathbf{\cal G}_{i:i+1}. As described in Supplemental Material, the submatrices located at the upper-left, upper-right, lower-left, and lower-right corners of 𝒢i:i+1\mathbf{\cal G}_{i:i+1} are given as

𝒢i:i+1UL=𝒢i:iUL+𝒢i:iUR𝐁i𝒢i+1:i+1UL𝐁i×[𝐈Mi𝒢i:iLR𝐁i𝒢i+1:i+1UL𝐁i]1𝒢i:iLL,\displaystyle\begin{split}\mathbf{\cal G}_{i:i+1}^{\mathrm{UL}}&\!=\!\mathbf{\cal G}_{i:i}^{\mathrm{UL}}+\mathbf{\cal G}_{i:i}^{\mathrm{UR}}\mathbf{B}_{i}\mathbf{\cal G}_{i+1:i+1}^{\mathrm{UL}}\mathbf{B}_{i}^{{\dagger}}\\ &\quad\times[\mathbf{I}_{M_{i}}-\mathbf{\cal G}_{i:i}^{\mathrm{LR}}\mathbf{B}_{i}\mathbf{\cal G}_{i+1:i+1}^{\mathrm{UL}}\mathbf{B}_{i}^{{\dagger}}]^{-1}\mathbf{\cal G}_{i:i}^{\mathrm{LL}},\end{split} (2)
𝒢i:i+1UR=𝒢i:iUR[𝐈Mi𝐁i𝒢i+1:i+1UL𝐁i𝒢i:iLR]1𝐁i𝒢i+1:i+1UR,\displaystyle\begin{split}\mathbf{\cal G}_{i:i+1}^{\mathrm{UR}}&\!=\!-\mathbf{\cal G}_{i:i}^{\text{UR}}[\mathbf{I}_{M_{i}}\!-\!\mathbf{B}_{i}\mathbf{\cal G}_{i+1:i+1}^{\mathrm{UL}}\mathbf{B}_{i}^{{\dagger}}\mathbf{\cal G}_{i:i}^{\mathrm{LR}}]^{-1}\mathbf{B}_{i}\mathbf{\cal G}_{i+1:i+1}^{\mathrm{UR}},\end{split} (3)
𝒢i:i+1LL=𝒢i+1:i+1LL[𝐈Ni𝐁i𝒢i:iLR𝐁i𝒢i+1:i+1UL]1𝐁i𝒢i:iLL,\displaystyle\begin{split}\mathbf{\cal G}_{i:i+1}^{\mathrm{LL}}&\!=\!-\mathbf{\cal G}_{i+1:i+1}^{\text{LL}}[\mathbf{I}_{N_{i}}\!-\!\mathbf{B}_{i}^{{\dagger}}\mathbf{\cal G}_{i:i}^{\mathrm{LR}}\mathbf{B}_{i}\mathbf{\cal G}_{i+1:i+1}^{\mathrm{UL}}]^{-1}\mathbf{B}_{i}^{{\dagger}}\mathbf{\cal G}_{i:i}^{\mathrm{LL}},\end{split} (4)
𝒢i:i+1LR=𝒢i+1:i+1LR+𝒢i+1:i+1LL𝐁i𝒢i:iLR𝐁i×[𝐈Ni𝒢i+1:i+1UL𝐁i𝒢i:iLR𝐁i]1𝒢i+1:i+1UR,\displaystyle\begin{split}\mathbf{\cal G}_{i:i+1}^{\mathrm{LR}}&\!=\!\mathbf{\cal G}_{i+1:i+1}^{\mathrm{LR}}+\mathbf{\cal G}_{i+1:i+1}^{\mathrm{LL}}\mathbf{B}_{i}^{{\dagger}}\mathbf{\cal G}_{i:i}^{\mathrm{LR}}\mathbf{B}_{i}\\ &\quad\times[\mathbf{I}_{N_{i}}-\mathbf{\cal G}_{i+1:i+1}^{\mathrm{UL}}\mathbf{B}_{i}^{{\dagger}}\mathbf{\cal G}_{i:i}^{\mathrm{LR}}\mathbf{B}_{i}]^{-1}\mathbf{\cal G}_{i+1:i+1}^{\mathrm{UR}},\end{split} (5)

respectively. 𝐈Mi(Ni)\mathbf{I}_{M_{i}(N_{i})} denotes an Mi(Ni)M_{i}(N_{i})-dimensional identity matrix, and 𝒢i:i\mathbf{\cal G}_{i:i} represents the NxyNz(part)N_{xy}N_{z}^{(\text{part})}-dimensional Green’s function matrix associated with 𝐇i\mathbf{H}_{i}, that is, 𝒢i:i=(E𝐈i𝐇i)1\mathbf{\cal G}_{i:i}=(E\mathbf{I}_{i}-\mathbf{H}_{i})^{-1}, with 𝐈i\mathbf{I}_{i} being an NxyNz(part)N_{xy}N_{z}^{(\text{part})}-dimensional identity matrix. As shown in Eqs. (2)–(5), we only need the four corners of 𝒢i:i\mathbf{\cal G}_{i:i}, which are efficiently obtained by using the SCG method. In most cases, Mi=Ni=NfNxyM_{i}=N_{i}=N_{\text{f}}N_{xy}, with NfN_{\text{f}} being the order of the finite-difference approximation.Chelikowsky et al. (1994a); *PRL72-001240 Repeating this procedure to connect the parts numbered from ii though to jj, we can construct the four submatrices of a (ji+1)NxyNz(part)(j-i+1)N_{xy}N_{z}^{(\text{part})}-dimensional matrix 𝒢i:j\mathbf{\cal G}_{i:j}.

Now, we estimate the computational cost of the proposed method. We assume that 𝐁L(R)\mathbf{B}_{\mathrm{L(R)}} (see Fig. 1) and 𝐁i\mathbf{B}_{i} for i=1,,P1i=1,\dots,P-1 are MM-dimensional square matrices with M=NfNxyM=N_{\text{f}}N_{xy}. In the case of the RGF method based on the atomic-basis formalism, 𝐇T\mathbf{H}_{\mathrm{T}} is dense; that is, MM is not sufficiently smaller than NxyNz(part)N_{xy}N_{z}^{(\text{part})}. Therefore, the iterative solvers are not more efficient than direct inversion, and there is not much benefit in using Eqs. (2)–(5) to reduce the computational cost. Furthermore, the overlap matrix is generally not any scalar matrix; hence, the SCG method cannot be used to solve the linear systems. However, the RGF algorithm must be compatible with RSFD formalism since 𝐇i\mathbf{H}_{i} is a block-diagonal matrix and a sparse matrix.Hirose et al. (2005); Chelikowsky et al. (1994a, b) The proposed method can be implemented without negatively affecting the benefit of the SCG method, in which the matrix-vector operations for shifted energy points are omitted, and only the matrix elements of subsets are updated for shifted energy points during SCG iteration. The computational cost of the matrix-vector products with a sparse matrix increases linearly with NzN_{z}, and the number of SCG iterations is proportional to NzN_{z}. Therefore, the computational cost for the extended transition region is reduced from 𝒪(Nxy3Nz2)=𝒪(Nxy3(Nz(part)P)2){\cal O}(N_{xy}^{3}N_{z}^{2}\bigr{)}={\cal O}(N_{xy}^{3}(N_{z}^{\text{(part)}}P)^{2}\bigr{)} to 𝒪(Nxy3(Nz(part))2P){\cal O}(N_{xy}^{3}(N_{z}^{\text{(part)}})^{2}P\bigr{)} using the RGF and the SCG methods since the cost is 𝒪(Nxy3(Nz(part))2){\cal O}(N_{xy}^{3}(N_{z}^{\text{(part)}})^{2}\bigr{)} for each of the PP parts; that is, the increase in computational cost can be suppressed linearly for PP. Although the computational cost for calculating Eqs. (2)–(5) is 𝒪(PM3){\cal O}(PM^{3}), where M=NfNxyM=N_{\text{f}}N_{xy}, with NfN_{\text{f}} being sufficiently smaller than Nz(part)N_{z}^{(\text{part})}. Consequently, further computational cost reduction for NzN_{z} is achieved by making use of the advantages of the RGF method and SCG method within the RSFD formalism in comparison with the conventional procedure.Ono et al. (2012)

We evaluated the computational error between the conductance spectra using the Green’s function matrices obtained by the proposed method and the conventional methodOno et al. (2012) for the BN-doped (4,4)@(8,8) DWCNT to verify that the proposed method does not deteriorate the computational accuracy. The unit cell contains 192 atoms, and boron and nitrogen atoms are substitutionally co-doped into the outer tube along the circumferential direction. The dimensions are Lx(y)=19.5ÅL_{x(y)}=19.5~{}{\text{\AA}} and Lz(part)=9.84ÅL_{z}^{\text{(part)}}=9.84~{}{\text{\AA}}. For this verification, a supercell has twice the dimension in the zz direction that of the unit cell, that is, Lz(ext)=2Lz(part)L_{z}^{\text{(ext)}}=2L_{z}^{\text{(part)}}. Using the electronic structure calculation code rspaceHirose et al. (2005); Ono and Hirose (2005) based on RSFD formalism within the DFT framework, the effective potential of the unit cell is calculated with 1×1×101\times 1\times 10 kk-points, Nxy=80×80N_{xy}=80\times 80, Nz(part)=40N_{z}^{(\text{part})}=40, Nf=4N_{\text{f}}=4, the NCPPs,Kobayashi (1999); Troullier and Martins (1991); *PRL48-001425 and the local-density approximation (LDA).Vosko et al. (1980) The Green’s function calculation is collectively performed using the SCG method with NE=41N_{E}=41. Figure 2 shows the conductance spectra of the supercell evaluated using the proposed method with P=2P=2 and the conventional method. There are no notable errors more than 2.5×107G02.5\times 10^{-7}~{}{\text{G}_{0}}.

Refer to caption
Figure 2: (Color online) conductance spectra of DWCNT (solid lines) and computational error of the proposed method (dashed line).

Finally, we compared the computational times in calculating the Green’s function submatrices for several systems using the proposed method with those using the conventional method. Here the computational times are measured for systems in which (001) Si bulk is doped with B atoms. We consider a primitive unit cell of (001) Si bulk, which contains four Si atoms and has the dimensions of Lx(y)(part)=3.84ÅL_{x(y)}^{\text{(part)}}=3.84~{}{\text{\AA}} and Lz(part)=5.43ÅL_{z}^{\text{(part)}}=5.43~{}{\text{\AA}}. We measured the CPU time for calculating the submatrices of the Green’s function matrix in a supercell with Lz(ext)=PLz(part)L_{z}^{\text{(ext)}}=PL_{z}^{\text{(part)}}, in which the PP unit cells are arranged along the zz direction, (P=1,2,4P=1,2,4 and 8), and any of the four Si atoms in each unit cell is randomly replaced with a B atom. The effective potential of the unit cell is calculated using the rspace code with 8×8×128\times 8\times 12 kk-points, Nxy=12×12N_{xy}=12\times 12, Nz(part)=16N_{z}^{(\text{part})}=16, Nf=4N_{\text{f}}=4, the NCPPs, and the LDA. We prepared small parts with Lx(y)=nx(y)×Lx(y)(part)L_{x(y)}=n_{x(y)}\times L_{x(y)}^{\text{(part)}} by duplicating the unit cell nx(y)n_{x(y)} times in the x(y)x(y) direction with nx=ny=1,2n_{x}=n_{y}=1,2, and 4 to verify the influence of the cross-sectional size on the CPU time. As summarized in Table 1, the CPU time for calculating the four submatrices of 𝒢1:P\mathcal{G}_{1:P} with NE=16N_{E}=16 using the proposed method increases in proportion to PP. We confirmed that the proposed method enabled us to notably reduce the computational cost and avoid computational difficulties in handling long systems. Furthermore, one can see that the time ratio is not attenuated even in large cross-sectional systems. Since the proposed method has high weak-scaling efficiency for increasing PP, the benefit of this method becomes more remarkable as the system size increases.

Table 1: CPU time to obtain four submatrices of the Green’s function matrix for B-doped (001) Si bulk models. The time ratio is evaluated as [calculating time + combining time]/[calculating time for P=1P=1]. The calculations were performed on Intel® Xeon® CPU E5-2680.
  calculating   combining   time
(nx,ny)(n_{x},n_{y})  PP   time (sec.)   time (sec.)   ratio
(1,1) 1 175 1.00
2 352 19 2.12
4 669 56 4.15
8 1,347 130 8.45
(2,2) 1 13,381 1.00
2 26,773 1,457 2.11
4 49,171 4,372 4.00
8 98,240 10,206 8.10
(4,4) 1 592,191 1.00
2 1,183,461 65,451 2.11
4 2,159,101 196,408 3.98
8 4,317,207 458,096 8.06
Refer to caption
Figure 3: (Color online) Schematic view of the DWCNT unit-cell models. The grey and green spheres represent C atoms of the outer and inner carbon nanotubes, respectively, and the red and orange spheres represent N and B atoms, respectively. Dashed lines represent the unit cell boundaries.

With the above verifications, it is confirmed that electron-transport properties can be estimated efficiently without the deterioration of computational accuracy using the proposed method. As a practical demonstration of the method, we investigated the electron-transport properties of DWCNTs depending on the tube length. As illustrated in Fig. 3, eight DWCNT unit-cell models with different relative positions of the two BN dimers are prepared. Here, the transport properties are evaluated for a model in which the same unit cells (Model 4) are periodically combined (periodic model) and in which different unit cells (Model 1–8) are randomly arranged (random model). For the periodic model, the transport calculations are performed for the DWCNTs with P=1P=11,0241,024 to investigate the changes in the conductance spectrum according to the number of unit cells PP in the transition region. The numerical conditions and procedures to obtain the effective potential of each unit cell are the same as those mentioned above. Here, we ignore the electron-phonon couplings and evaluate the transport properties described in the ballistic regime.Ishii et al. (2010)

As shown in Fig. 4, for systems with small PP, the conductance quantization is observed at the low incident energy, where there are four conduction channels with a transmission probability of almost unity. In the low energy region, the incident electrons flow through the intra-states of the inner and outer tubes, as depicted in Fig. 5(a). Therefore, there are high-transmission channels due to the inner-tube states that are insensitive to the scattering potential created by the BN dimers in the outer tube. However, the channel transmissions are notably reduced at the high incident energy because of the inter-states between the inner and outer tubes, as illustrated in Fig. 5(b). As PP increases, dips appear in the conductance spectra even at the low incident energy, and they gradually become prominent. This reflects the energy gap in the energy band structure that appears in an infinite DWCNT with a periodic scattering potential, as in the Kronig-Penney model. On the contrary, for the random model with P=1,024P=1,024 (196,608 atoms, 1.0μm\sim{1.0}~{}{\text{$\mu${m}}}), overall conductance is suppressed as plotted in Fig. 4. Although the transmission in most conduction channels is notably reduced due to the random scattering potential component widely distributed in DWCNT, there are still channels with high transmission to which the intra-states of the inner tube contribute. Here, the random model with P=1,024P=1,024 is prepared by randomly combining four different random models with P=64P=64 and 96. We also performed calculations using random models with different configurations, but the results are essentially the same. Performing the large-scale electron-transport calculation, we can discover this difference in the conductance spectra between the randomly and periodically doped models.

Refer to caption
Figure 4: (Color online) Conductance spectra for BN-doped DWCNTs. The filled-circle and open-circle plots represent the spectra of the periodic and random models, respectively. The zero point of conductance is shifted by 1G01~{}{\text{G}_{0}} between the adjacent spectra, where the vertical scale represents the value for the 196,608-atom models (P=1,024P=1,024). The incident energy is measured from the Fermi level.
Refer to caption
Figure 5: (Color online) Distributions of the scattering wave functions in the conduction channels of the 6,144-atom model (P=32P=32). Panels (a) and (b) illustrate the charge density of scattering wave functions with the highest transmission at the Fermi energy and 0.70.7 eV above the Fermi energy, respectively. Insets surrounded by dashed lines depict cross-sectional and enlarged views. The key to the symbols is the same as that in Fig. 3. The value of the yellow isosurface is 6.7×1046.7\times 10^{-4} electron/Å3{\text{\AA}}^{3}.

In summary, we developed an efficient first-principles algorithm for evaluating the electron-transport properties of long systems consisting of a vast number of atoms. The Green’s function of the whole transition region extended towards the transport direction was obtained by recursively combining the Green’s functions of the adjoining parts one by one. The computational cost for calculating the submatrices of the Green’s functions required to estimate the transport properties is proportional to the number of the combined parts. As a practical application, we demonstrated the electron-transport calculations of BN-doped DWCNTs containing up to 196,608 atoms. The proposed method develops our understanding of experimental studies on the research and development of devices using low-dimensional materials, as reported by Refs. Franklin and Chen, 2010; Franklin et al., 2010; Qiu et al., 2017; Sun et al., 2019; Schmidt et al., 2019. Moreover, even in a submicron scale structure containing millions of atoms, it is possible to efficiently evaluate the electron-transport properties without any deterioration of computational accuracy.

Acknowledgements.
This work was partially supported by the financial support from MEXT as a social and scientific priority issue (creation of new functional devices and highperformance materials to support next-generation industries) to be tackled by using post-K computer and JSPS KAKENHI Grant Numbers JP16H03865, JP18K04873. S.T. acknowledges the financial support from Deutsche Forschungsgemeinschaft through the project numbers 389895192 and 277146847. The numerical calculations were carried out using the system B and the system C of the Institute for Solid State Physics at the University of Tokyo, the Oakforest-PACS of the Center for Computational Sciences at University of Tsukuba, and the K computer provided by the RIKEN Advanced Institute for Computational Science through the HPCI System Research project (Project ID: hp190172).

References