This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Hybrid Spherical- and Planar-Wave Channel Modeling and Estimation for Terahertz Integrated UM-MIMO and IRS Systems

Yuhang Chen, Renwang Li,  Chong Han,  Shu Sun,  and Meixia Tao This paper was presented in part at IEEE ICC, May 2022 [1]. Y. Chen and C. Han are with Terahertz Wireless Communications (TWC) Laboratory, Shanghai Jiao Tong University, Shanghai 200240, China (e-mail: {yuhang.chen, chong.han}@sjtu.edu.cn). R, Li, S. Sun and M. Tao are with Department of Electronic Engineering, Shanghai Jiao Tong University, Shanghai 200240, China (e-mail: {renwanglee, shusun, mxtao}@sjtu.edu.cn).
Abstract

Integrated ultra-massive multiple-input multiple-output (UM-MIMO) and intelligent reflecting surface (IRS) systems are promising for 6G and beyond Terahertz (0.1-10 THz) communications, to effectively bypass the barriers of limited coverage and line-of-sight blockage. However, excessive dimensions of UM-MIMO and IRS enlarge the near-field region, while strong THz channel sparsity in far-field is detrimental to spatial multiplexing. Moreover, channel estimation (CE) requires recovering the large-scale channel from severely compressed observations due to limited RF-chains. To tackle these challenges, a hybrid spherical- and planar-wave channel model (HSPM) is developed for the cascaded channel of the integrated system. The spatial multiplexing gains under near-field and far-field regions are analyzed, which are found to be limited by the segmented channel with a lower rank. Furthermore, a compressive sensing-based CE framework is developed, including a sparse channel representation method, a separate-side estimation (SSE) and a dictionary-shrinkage estimation (DSE) algorithms. Numerical results verify the effectiveness of the HSPM, the capacity of which is only 5×1045\times 10^{-4} bits/s/Hz deviated from that obtained by the ground-truth spherical-wave-model, with 256 elements. While the SSE achieves improved accuracy for CE than benchmark algorithms, the DSE is more attractive in noisy environments, with 0.8 dB lower normalized-mean-square-error than SSE.

Index Terms:
Terahertz integrated ultra-massive multiple-input-multiple-output (UM-MIMO) and intelligent reflecting surface (IRS) systems, Channel modeling, Spatial multiplexing gain, Channel estimation.

I Introduction

Owning abundant bandwidth of multi-GHz up to even Terahertz (THz), the THz spectrum ranging from 0.1 to 10 THz has attracted upsurging attention from academia and industry in recent years. The THz wireless communications have the capability to support Terabit-per-second high data rates, which are envisioned as a pillar candidate for 6G wireless networks [2, 3, 4]. However, the THz wave suffers from large free-space attenuation, strong molecular absorption, and high non-line-of-sight (NLoS) propagation losses incurred from reflection, scattering, and diffraction. Therefore, it is challenging to achieve robust wireless transmission in complex occlusion environments, especially when line-of-sight (LoS) is blocked [5]. Moreover, power amplifiers with low efficiency at THz frequencies have constrained output power, which results in the low reception signal-to-noise ratio (SNR) thus constraining the communication distance [6].

To overcome the distance limitation, the ultra-massive multiple-input multiple-output (UM-MIMO) systems are exploited in the THz band [7]. Thanks to the sub-millimeter wavelength, hundreds and even thousands of antennas can be deployed in the UM-MIMO, which provides high array gain to compensate for the propagation losses. Furthermore, as a key technology to enable intelligent propagation environments in 6G systems, the intelligent reflecting surface (IRS) has been advocated in the literature [8, 9, 10]. The IRS is equipped with a metamaterial surface of the integrated circuit, which can be programmed to enable passive beamforming with high energy efficiency [8]. At lower frequencies, the IRS is majorly used to increase the achievable data rates. By contrast, in the THz band, the IRS can effectively bypass the barrier of the LoS blockage problem, by precisely controlling the reflection of incident THz signals [2, 11]. To combine, an integrated UM-MIMO and IRS systems can simultaneously solve the distance limitation and LoS blockage problems for THz wireless communications.

Channel modeling, analysis, and channel estimation (CE) arise as three inter-related open challenges of the THz integrated UM-MIMO and IRS systems. First, while most existing work on channel modeling in IRS assisted systems only considers the far-field propagation [12], the near-field region is expanded with an enlarged dimension of antenna arrays in UM-MIMO and IRS, relative to the sub-millimeter wavelength of the THz wave. The consideration of near-field spherical-wave propagation is imperatively needed [13, 14]. Second, each segmented channel in the integrated IRS and UM-MIMO systems can be in near-field and far-field, whose multiplexing capability concerning the cascaded channel remains unclear. Moreover, due to the large reflection, scattering, and diffraction losses, the THz channel is generally sparse and dominated by a LoS and only a few NLoS paths [15]. As a result, the THz multi-antenna channels suffer from limited multiplexing capability imposed by the number of multi-paths instead of the number of antennas as in the microwave band. Therefore, the spatial multiplexing capability needs to be assessed and possibly enhanced in the THz integrated UM-MIMO and IRS systems.

Third, the hybrid UM-MIMO structures with low hardware cost are commonly deployed in the THz systems, which exploit a much smaller number of RF-chains than antennas [16]. This hybrid architecture is helpful to reduce power and hardware costs, which however causes a research problem for CE. That is, with the enormous amount of antennas in the UM-MIMO and passive reflecting elements lacking the signal processing ability of the IRS, CE has to recover a high-dimensional channel relating to the antennas and passive elements, from severely compressed low dimensional signal on the RF-chains. Moreover, the consideration of spherical-wave propagation alters the structure of channel models, leading that traditional solutions based on planar-wave propagation become ineffective. New CE methods to address these problems are thus needed.

I-A Related Works

I-A1 Channel Modeling and Analysis

In the literature, mainly two categories of MIMO channel models are considered, namely, the spherical-wave model (SWM) and the planar-wave model (PWM), which are effective in addressing the near-field and far-field effects, respectively [17, 18]. As an improvement to PWM and SWM, we proposed a hybrid spherical- and planar-wave channel model (HSPM) for THz UM-MIMO systems in [13] , which accounted for PWM within the subarray and SWM among subarrays. Compared to the PWM and SWM, the HSPM is more effective by deploying a few channel parameters to achieve high accuracy in the near-field. In the IRS assisted communication systems, an alternative physically feasible Rayleigh fading model was proposed in [12] under the far-field assumption. By taking both near-field effect and IRS into consideration, the authors in [19] considered the SWM for THz integrated IRS and UM-MIMO systems. However, the SWM suffers from high complexity with the massive number of elements in the UM-MIMO and IRS [13]. To date, an effective model addressing the near-field effect in UM-MIMO and IRS systems is still required.

In the IRS systems, the channel analysis mainly focuses on sum rate, power gain, spectral efficiency (SE), and energy efficiency (EE). In microwave systems, the authors in [20] characterized the capacity limit by jointly optimizing the IRS reflection coefficients and the MIMO transmit covariance matrix. The distribution and the outage probability of the sum rate were derived in [21], by considering the SWM of the LoS and PWM of the NLoS. A closed-form expression of the power gain was derived in [22], and the near-field and far-field behaviors were analyzed. At higher frequencies, the ergodic capacity under the Saleh-Valenzuela model was derived and optimized in [23], while the SE and EE are analyzed in [19]. As a critical metric to assess the spatial-multiplexing capability of the channel, the channel rank analysis has been conducted in the THz UM-MIMO systems. To enhance the limited spatial multiplexing in the THz UM-MIMO systems, a widely-spaced multi-subarray (WSMS) structure with enlarged subarray spacing was proposed in [24], where the channel rank can be improved by a factor equal to the number of subarrays. However, the rank analysis in the THz integrated UM-MIMO and IRS systems are still lacking in the literature.

I-A2 Channel Estimation

CE for IRS assisted MIMO systems has been explored in the literature [25, 26, 27, 28, 29, 30, 31, 32, 33, 34], which can be categorized into two main categories, namely, estimation of the segmented channels from user equipment (UE) to IRS and IRS to base station (BS), and estimation of the UE-IRS-BS cascaded channel. On one hand, since the passive IRS lacks signal processing capability, it is hard to directly separate each channel segment. Thus, the segmented CE schemes often require special hardware design, e.g., inserting active IRS elements or using full-duplex equipment, both of which however increase the hardware cost [25, 26, 27, 28]. In [25] and [26], a few IRS elements were activated during the pilot reception. The deep-learning tool was then assisted for CE with considerable estimation accuracy. By deploying a full-duplex operated BS, a two timescale CE method was proposed in [27]. The segmented CE problem was formulated as a matrix factorization problem and solved in [28], which can be operated with purely passive IRS. However, this scheme does not address the near-field effect.

On the other hand, since most precoding designs are based on the knowledge of the cascaded channel, the estimation of which has been explored in most existing schemes [29, 30, 31, 32, 33, 34]. In [29], a two-stage atomic norm minimization problem was formulated, by which the super-resolution channel parameter estimation was conducted to efficiently obtain the channel-state-information. Theoretical analysis of the required pilot overhead and a universal CE framework were proposed in [30], which are effective in guiding the design of training and CE. However, all of them are limited to be applicable with fully digital MIMO structures. By exploiting the channel sparsity in the mmWave and THz bands, compressive sensing (CS) based CE methods in hybrid MIMO systems were explored in [31, 32, 33, 34]. These schemes deploy the spatial discrete Fourier transform (DFT) based on-grid codebook to sparsely represent the channel, which is beneficial in achieving reduced training overhead. On the downside, the near-field effect was not incorporated in the DFT codebook, which results in limited estimation accuracy of these schemes in the near-field region of the THz multi-antenna systems [13].

I-B Contributions

To fill the aforementioned research gap, in this work, we first model the cascaded channel and study the spatial multiplexing in THz integrated UM-MIMO and IRS systems, by considering both near-field and far-field effects. Based on that, we propose a CS-based CE framework. In particular, we develop a subarray-based on-grid codebook to sparsely represent the channel. Then, a separate side estimation (SSE) and a spatial correlation inspired dictionary shrinkage estimation (DSE) algorithms are proposed to realize low-complexity CE. In our prior and shorter version [1], we proposed the cascaded channel model and analyzed the spatial multiplexing of the integrated systems. In this work, we further derive the on-grid codebook and propose two low-complexity CE algorithms. Furthermore, we perform substantially more extensive performance evaluation. The major contributions of this work are summarized as follows.

  • We propose an HSPM for the cascaded channel in the THz integrated UM-MIMO and IRS systems, and analyze the spatial multiplexing gain of the cascaded channel. By addressing both near-field and far-field effects. The proposed channel model accounts for the PWM within the subarray and SWM among subarrays, which achieves better accuracy than the PWM and lower complexity than the SWM. Moreover, the spatial multiplexing gain of the cascaded channel is analyzed when the segmented channels satisfy the near-field and far-field conditions, respectively. We prove that the rank of the cascaded channel is constrained by the individual channel with a lower rank. Furthermore, we present that spatial multiplexing can be improved based on the widely-spaced architecture design.

  • We develop a CS-based CE framework including the sparse channel representation and sparse recovery algorithms. First, we propose a subarray-based codebook to sparsely represent the HSPM. Since the HSPM takes the subarray as a unit, by which each block is the sub-channel for a specific subarray pair, the proposed codebook possesses much higher accuracy than the traditional DFT codebook. Based on this, we propose low complexity DSE and SSE sparse recovery algorithms for the CE of the integrated system. The SSE algorithm separately estimates the positions of non-zero grids on each side of the channel. By contrast, the DSE algorithm further reduces the complexity of SSE by exploring the fact that the angles for different subarray pairs are close in the spatial domain.

  • We carry out extensive simulations to demonstrate the effectiveness of the proposed HSPM and CE methods. The HSPM can accurately capture the propagation features of the THz integrated UM-MIMO and IRS system. Numerically, we demonstrate that the capacity based on the HSPM is fairly close to that obtained by the ground-truth SWM. Moreover, both SSE and DSE achieve higher accuracy compared to existing algorithms. While SSE in general owns the highest accuracy, the DSE is more attractive at lower SNR, e.g., below 0 dB.

The remainder of this paper is organized as follows. The system and channel models are introduced in Sec. II. Spatial multiplexing analysis is presented in Sec. III. The subarray-based codebook and the SSE, DSE CE algorithms are proposed in IV. Extensive performance evaluation and numerical analysis are conducted in Sec. V. Finally, the paper is concluded in Sec. VI.

Notation: aa is a scalar. 𝐚\mathbf{a} denotes a vector. 𝐀\mathbf{A} represents a matrix. 𝐀(m,n)\mathbf{A}(m,n) stands for the element at the mthm^{\rm th} row and nthn^{\rm th} column in 𝐀\mathbf{A}. 𝐀(i,:)\mathbf{A}(i,:) depicts the ithi^{\rm th} row of 𝐀\mathbf{A}. 𝐀(:,j)\mathbf{A}(:,j) refers to the jthj^{\rm th} column of 𝐀\mathbf{A}. 𝐩(m:n)\mathbf{p}(m:n) denotes the mthm^{\rm th} to nthn^{\rm th} elements of 𝐩\mathbf{p}. 𝐩(m)\mathbf{p}(m) refers to the mthm^{\rm th} element of 𝐩\mathbf{p}. ()T(\cdot)^{\mathrm{T}} defines the transpose. ()H(\cdot)^{\mathrm{H}} refers to the conjugate transpose. ()(\cdot)^{{\dagger}} denotes the pseudo inverse. |||\cdot| depicts the absolute value. 0\|\cdot\|_{0} defines the l0l_{0}-norm. 2\|\cdot\|_{2} stands for the l2l_{2}-norm. M×N\mathbb{C}^{M\times N} depicts the set of M×NM\times N-dimensional complex-valued matrices. \otimes refers to Kronecker product. \circ denotes Khatri-Rao product. \propto depicts proportional sign.

II System Overview

II-A System Model

Refer to caption
Figure 1: Integrated THz UM-MIMO and IRS system.

As illustrated in Fig. 1, we consider a THz integrated UM-MIMO and IRS communication system. The WSMS THz UM-MIMO with planar-shaped antenna arrays is equipped at both BS and UE. The direct channel between the BS and UE is considered to be blocked and inaccessible due to the occlusion propagation environment [32, 29]. The communication link is assisted by a planar-shaped IRS with MM passive reflecting elements, which is connected to the BS via an IRS controller. Moreover, we consider that the IRS can be divided into KmK_{m} planar-shaped subarrays, M=KmNamM=K_{m}N_{am}, where NamN_{am} denotes the number of passive reflecting elements on each subarray.

In the WSMS design at the BS, KbK_{b} subarrays are deployed, each of which contains NabN_{ab} antennas. The total number of antennas is obtained as Nb=KbNabN_{b}=K_{b}N_{ab}. On one hand, within the subarray, the antenna spacing d=λ/2d=\lambda/2, where λ\lambda denotes the carrier wavelength. On the other hand, the subarray spacing is multiple times half-wavelengths [24]. Moreover, each subarray is connected to one RF-chain. In THz UM-MIMO systems, a much smaller number of RF-chains than the number of antennas is often adopted, for lower hardware cost and higher EE [16]. Therefore, we have KbNbK_{b}\ll N_{b}. Similarly, the UE is composed of NuN_{u} antennas, which can be divided into KuK_{u} subarrays, each of which is connected to one RF-chain. Each subarray contains NauN_{au} antennas, satisfying Nu=KuNauN_{u}=K_{u}N_{au} and KuNuK_{u}\ll N_{u}.

By considering an uplink transmission, the received signal 𝐲Nsb\mathbf{y}\in\mathbb{C}^{N_{sb}} at the BS is denoted as

𝐲=𝐖¯H𝐇cas𝐅¯𝐬+𝐖¯H𝐧,\begin{split}\mathbf{y}&=\overline{\mathbf{W}}^{\mathrm{H}}\mathbf{H}^{\rm cas}\overline{\mathbf{F}}\mathbf{s}+\overline{\mathbf{W}}^{\mathrm{H}}\mathbf{n},\end{split} (1)

where NsbN_{sb} denotes the number of signalstreams st BS, 𝐖¯=𝐖RF𝐖BBNb×Nsb\overline{\mathbf{W}}=\mathbf{W}_{\rm RF}\mathbf{W}_{\rm BB}\in\mathbb{C}^{N_{b}\times N_{sb}} represents the combining matrix, with 𝐖RFNb×Kb\mathbf{W}_{\rm RF}\in\mathbb{C}^{N_{b}\times K_{b}} and 𝐖BBKb×Nsb\mathbf{W}_{\rm BB}\in\mathbb{C}^{K_{b}\times N_{sb}} denoting the analog and digital combining matrices, respectively. The cascaded channel matrix from the UE to BS is depicted as 𝐇casNb×Nu\mathbf{H}^{\rm cas}\in\mathbb{C}^{N_{b}\times N_{u}}. The beamforming matrix at the UE is represented as 𝐅¯=𝐅RF𝐅BBNu×Nsu\overline{\mathbf{F}}=\mathbf{F}_{\rm RF}\mathbf{F}_{\rm BB}\in\mathbb{C}^{N_{u}\times N_{su}}, where NsuN_{su} depicts the transmitted number of signal streams at UE, 𝐅RFNu×Ku\mathbf{F}_{\rm RF}\in\mathbb{C}^{N_{u}\times K_{u}} and 𝐅BBKu×Nsu\mathbf{F}_{\rm BB}\in\mathbb{C}^{K_{u}\times N_{su}} refer to the analog and digital beamforming matrices, respectively. Moreover, 𝐬Nsu\mathbf{s}\in\mathbb{C}^{N_{su}} describes the transmitted signal, while 𝐧Nu\mathbf{n}\in\mathbb{C}^{N_{u}} represents additive white Gaussian noise (AWGN). The analog beamforming and combining are completed by phase shifters. Therefore, each element of 𝐖RF\mathbf{W}_{\rm RF} and 𝐅RF\mathbf{F}_{\rm RF} satisfies constant module constraint, which can be expressed as 𝐖RF(i,j)=1Nbejwi,j\mathbf{W}_{\rm RF}(i,j)=\frac{1}{\sqrt{N_{b}}}e^{jw_{i,j}}, 𝐅RF(i,j)=1Nuejfi,j\mathbf{F}_{\rm RF}(i,j)=\frac{1}{\sqrt{N_{u}}}e^{jf_{i,j}}, where wi,j,fi,j[0,2π]w_{i,j},f_{i,j}\in[0,2\pi] denote the phase shift value. In addition, 𝐖BB\mathbf{W}_{\rm BB} and 𝐅BB\mathbf{F}_{\rm BB} are usually set as identity matrices during the training process for CE. In this case, there is Nsu=KuN_{su}=K_{u} and Nsb=KbN_{sb}=K_{b}.

II-B Channel Model

The cascaded channel matrix 𝐇cas\mathbf{H}^{\rm cas} in (1) can be represented as

𝐇cas=𝐇IRSBS𝐏¯𝐇UEIRS,\mathbf{H}^{\rm cas}=\mathbf{H}_{\rm IRS-BS}\overline{\mathbf{P}}\mathbf{H}_{\rm UE-IRS}, (2)

where 𝐇IRSBSNb×M\mathbf{H}_{\rm IRS-BS}\in\mathbb{C}^{N_{b}\times M} stands for the segmented channel from the IRS to BS, 𝐏¯=diag{𝐩}M×M\overline{\mathbf{P}}={\rm{diag}}\{\mathbf{p}\}\in\mathbb{C}^{M\times M} denotes the passive beamforming matrix at the IRS, where 𝐩=[ejp~1,,ejp~M]T\mathbf{p}=[e^{j\tilde{p}_{1}},\dots,e^{j\tilde{p}_{M}}]^{\rm{T}}, p~m[0,2π]\tilde{p}_{m}\in[0,2\pi] refers to the phase shift of the mthm^{\rm th} element of the IRS, m=1,,Mm=1,\dots,M. In addition, 𝐇UEIRSM×Nu\mathbf{H}_{\rm UE-IRS}\in\mathbb{C}^{M\times N_{u}} depicts the segmented channel from the UE to IRS. The segmented channels can be characterized based on different modeling assumptions. The PWM and SWM are explored by addressing the far-field and near-field effects, respectively [13]. Particularly, the receiver (Rx) is in the far-field of the antenna array at the transmitter (Tx) when the communication distance DD is larger than the Rayleigh distance 2S2λ\frac{2S^{2}}{\lambda}, where SS denotes the array aperture. In this case, the wave is approximated to propagate in a plane and the PWM can be adopted. By contrast, the SWM has to be considered when the communication distance is smaller than the Rayleigh distance, where the Rx is located in the near-field and the propagation travels in a sphere.

As an improvement to the PWM and SWM, we proposed the idea of HSPM in [13] in the THz UM-MIMO systems, which possesses less complexity than the SWM and achieves better modeling accuracy than the PWM in the near-field condition. In the following, we first introduce the PWM, SWM and HSPM for the segmented channels 𝐇IRSBS\mathbf{H}_{\rm IRS-BS} and 𝐇UEIRS\mathbf{H}_{\rm UE-IRS} in (2), respectively. Then, we propose the HSPM for the cascaded channel 𝐇cas\mathbf{H}^{\rm cas}. To facilitate the description, during the introduction of different channel models, we use Tx to represent the IRS in 𝐇IRSBS\mathbf{H}_{\rm IRS-BS} and the UE in 𝐇UEIRS\mathbf{H}_{\rm UE-IRS}, and use Rx to denote the BS in 𝐇IRSBS\mathbf{H}_{\rm IRS-BS} and the IRS in 𝐇UEIRS\mathbf{H}_{\rm UE-IRS}, respectively. Moreover, we consider that Tx is composed of NtN_{t} elements and KtK_{t} subarrays, while Rx employs NrN_{r} antennas and KrK_{r} subarrays, respectively.

II-B1 PWM

The PWM suitable for the far-field propagation region can be denoted as [18]

𝐇P=Σp=1Npαp𝐚rp𝐚tpH,\mathbf{H}_{\rm P}=\Sigma_{p=1}^{N_{p}}\alpha_{p}\mathbf{a}_{rp}\mathbf{a}_{tp}^{{\rm H}}, (3)

where αp\alpha_{p} represents the complex gain of the pthp^{\rm th} propagation path, p=1,,Npp=1,...,N_{p}, with NpN_{p} denoting the total number of paths. The array steering vectors at Rx and Tx are denoted as 𝐚rp=𝐚Nr(ψrpx,ψrpz)Nr\mathbf{a}_{rp}=\mathbf{a}_{N_{r}}(\psi_{rpx},\psi_{rpz})\in\mathbb{C}^{N_{r}} and 𝐚tp=𝐚Nt(ψtpx,ψtpz)Nt\mathbf{a}_{tp}=\mathbf{a}_{N_{t}}(\psi_{tpx},\psi_{tpz})\in\mathbb{C}^{N_{t}}, respectively. Without loss of generality, by considering an NN element planar-shaped array on the x-z plane with physical angle pair (θ,ϕ)(\theta,\phi), the array steering vector 𝐚N(ψx,ψz)N\mathbf{a}_{N}(\psi_{x},\psi_{z})\in\mathbb{C}^{N} can be expressed as

𝐚N(ψx,ψz)=[1ej2πλψnej2πλψN]T,\mathbf{a}_{N}(\psi_{x},\psi_{z})=\left[1\dots\mathrm{e}^{j\frac{2\pi}{\lambda}\psi_{n}}\dots\mathrm{e}^{j\frac{2\pi}{\lambda}\psi_{N}}\right]^{\mathrm{T}}, (4)

where ψn=dnxλψx+dnzλψz\psi_{n}=\frac{d_{n_{x}}}{\lambda}\psi_{x}+\frac{d_{n_{z}}}{\lambda}\psi_{z}, ψx=sinθcosϕ\psi_{x}={\rm sin}\theta{\rm cos}\phi, ψz=sinϕ\psi_{z}={\rm sin}\phi denotes the virtual angles, dnxd_{n_{x}} and dnzd_{n_{z}} stand for the distances between the nthn^{\rm th} antenna to the first antenna on x- and z-axis, respectively.

II-B2 SWM

The SWM is universally applicable to different communication distances, which individually calculates the channel responses of all antenna pairs between Tx and Rx to obtain the ground-truth channel. Due to the high complexity, the SWM is usually deployed in the near-field region, where the PWM becomes inaccurate. By denoting the communication distance of the pthp^{\rm th} propagation path from the ntthn_{t}^{\rm th} transmitted antenna to the nrthn_{r}^{\rm th} received antenna as DpntnrD^{n_{t}n_{r}}_{p}, the channel response of each antenna pair can be depicted as [18]

𝐇S(nr,nt)=Σp=1Np|αpnrnt|ej2πλDpnrnt,\mathbf{H}_{\rm S}(n_{r},n_{t})=\Sigma_{p=1}^{N_{p}}\lvert\alpha^{n_{r}n_{t}}_{p}\rvert e^{-j\frac{2\pi}{\lambda}D^{n_{r}n_{t}}_{p}}, (5)

where 𝐇SNr×Nt\mathbf{H}_{\rm S}\in\mathbb{C}^{N_{r}\times N_{t}} denotes the SWM channel matrix, αpnrnt\alpha^{n_{r}n_{t}}_{p} represents the complex path gain.

II-B3 HSPM

The HSPM accounts for the PWM within one subarray and the SWM among subarrays, which can be denoted as [13]

𝐇HSPM=p=1Np|αp|[ej2πλDp11𝐚rp11(𝐚tp11)Hej2πλDp1Kt𝐚rp1Kt(𝐚tp1Kt)Hej2πλDpKr1𝐚rpKr1(𝐚tpKr1)Hej2πλDpKrKt𝐚rpKrKt(𝐚tpKrKt)H],\begin{split}\mathbf{H}_{\rm HSPM}=\sum_{p=1}^{N_{p}}|\alpha_{p}|\left[\begin{array}[]{ccc}e^{-j\frac{2\pi}{\lambda}D^{11}_{p}}\mathbf{a}_{rp}^{11}(\mathbf{a}_{tp}^{11})^{\rm H}&\dots&e^{-j\frac{2\pi}{\lambda}D^{1K_{t}}_{p}}\mathbf{a}_{rp}^{1K_{t}}(\mathbf{a}_{tp}^{1K_{t}})^{\rm H}\\ \vdots&\ddots&\vdots\\ e^{-j\frac{2\pi}{\lambda}D^{K_{r}1}_{p}}\mathbf{a}_{rp}^{K_{r}1}(\mathbf{a}_{tp}^{K_{r}1})^{\rm H}&\cdots&e^{-j\frac{2\pi}{\lambda}D^{K_{r}K_{t}}_{p}}\mathbf{a}_{rp}^{K_{r}K_{t}}(\mathbf{a}_{tp}^{K_{r}K_{t}})^{\rm H}\\ \end{array}\right],\end{split} (6)

where DpkrktD^{k_{r}k_{t}}_{p} stands for the distance between the krthk_{r}^{\rm th} received and ktthk_{t}^{\rm th} transmitted subarray. The array steering vectors of the pthp^{\rm th} path for the corresponding subarray pairs are denoted as 𝐚rpkrkt=𝐚Nar(ψrpxkrkt,ψrpzkrkt)\mathbf{a}^{k_{r}k_{t}}_{rp}=\mathbf{a}_{N_{ar}}(\psi^{k_{r}k_{t}}_{rpx},\psi^{k_{r}k_{t}}_{rpz}), and 𝐚tpkrkt=𝐚Nat(ψtpxkrkt,ψtpzkrkt)\mathbf{a}_{tp}^{k_{r}k_{t}}=\mathbf{a}_{N_{at}}(\psi^{k_{r}k_{t}}_{tpx},\psi^{k_{r}k_{t}}_{tpz}), respectively, which have similar forms as (4). The virtual angles ψrpxkrkt=sinθrpkrktcosϕrpkrkt\psi^{k_{r}k_{t}}_{rpx}={\rm sin}\theta^{k_{r}k_{t}}_{rp}{\rm cos}\phi^{k_{r}k_{t}}_{rp}, ψrpzkrkt=sinϕrpkrkt\psi^{k_{r}k_{t}}_{rpz}={\rm sin}\phi^{k_{r}k_{t}}_{rp}, ψtpxkrkt=sinθtpkrktcosϕtpkrkt\psi^{k_{r}k_{t}}_{tpx}={\rm sin}\theta^{k_{r}k_{t}}_{tp}{\rm cos}\phi^{k_{r}k_{t}}_{tp}, ψtpzkrkt=sinϕtpkrkt\psi^{k_{r}k_{t}}_{tpz}={\rm sin}\phi^{k_{r}k_{t}}_{tp}, where (θrpkrkt,ϕrpkrkt)(\theta^{k_{r}k_{t}}_{rp},\phi^{k_{r}k_{t}}_{rp}) and (θtpkrkt,ϕtpkrkt)(\theta^{k_{r}k_{t}}_{tp},\phi^{k_{r}k_{t}}_{tp}) stand for the azimuth and elevation angle pairs at Rx and Tx, respectively. Moreover, NarN_{ar} and NatN_{at} depict the number of antennas on the subarrays at Rx and Tx, respectively. We point out that the PWM and SWM are two special cases of HSPM when Kt=Kr=1K_{t}=K_{r}=1 and Kt=NtK_{t}=N_{t}, Kr=NrK_{r}=N_{r}. In addition, the HSPM is accurate and can be adopted when the communication distance is smaller than the Rayleigh distance.

II-C HSPM for THz Integrated UM-MIMO and IRS Systems

By replacing the segmented channels 𝐇IRSBS\mathbf{H}_{\rm IRS-BS} and 𝐇UEIRS\mathbf{H}_{\rm UE-IRS} of 𝐇cas\mathbf{H}^{\rm cas} in (2) by the expression in (6), the HSPM for the cascaded channel 𝐇cas\mathbf{H}^{\rm cas} can be represented as

𝐇HSPMcas=pi,b=1NpIB|αpi,b|[km=1Km𝐆pi,b1km𝐄km1km=1Km𝐆pi,b1km𝐄kmKukm=1Km𝐆pi,bKbkm𝐄km1km=1Km𝐆pi,bKbkm𝐄kmKu],\begin{split}\mathbf{H}_{\rm HSPM}^{\rm cas}=\sum_{p_{i,b}=1}^{N_{p}^{\rm IB}}|\alpha_{p_{i,b}}|&\left[\begin{array}[]{ccc}\sum_{k_{m}=1}^{K_{m}}\mathbf{G}^{1k_{m}}_{p_{i,b}}\mathbf{E}^{k_{m}1}&\dots&\sum_{k_{m}=1}^{K_{m}}\mathbf{G}^{1k_{m}}_{p_{i,b}}\mathbf{E}^{k_{m}K_{u}}\\ \vdots&\dots&\vdots\\ \sum_{k_{m}=1}^{K_{m}}\mathbf{G}^{K_{b}k_{m}}_{p_{i,b}}\mathbf{E}^{k_{m}1}&\dots&\sum_{k_{m}=1}^{K_{m}}\mathbf{G}^{K_{b}k_{m}}_{p_{i,b}}\mathbf{E}^{k_{m}K_{u}}\\ \end{array}\right],\end{split} (7)

where αpi,b\alpha_{p_{i,b}} denotes the path gain for the pi,bthp_{i,b}^{\rm th} path of 𝐇IRSBS\mathbf{H}^{\rm IRS-BS}, pi,b=1,,NpIBp_{i,b}=1,\dots,N_{p}^{\rm IB}, NpIBN_{p}^{\rm IB} refers to the number of propagation paths in 𝐇IRSBS\mathbf{H}^{\rm IRS-BS}. The matrix 𝐆pi,bkbkmNab×Nam\mathbf{G}_{p_{i,b}}^{k_{b}k_{m}}\in\mathbb{C}^{N_{ab}\times N_{am}} is represented as

𝐆pi,bkbkm=ej2πλDpi,bkbkm𝐚rpi,bkbkm(𝐚tpi,bkbkm)H𝐏~km,\mathbf{G}_{p_{i,b}}^{k_{b}k_{m}}=e^{-j\frac{2\pi}{\lambda}D^{k_{b}k_{m}}_{p_{i,b}}}\mathbf{a}_{rp_{i,b}}^{k_{b}k_{m}}(\mathbf{a}_{tp_{i,b}}^{k_{b}k_{m}})^{\rm H}\tilde{\mathbf{P}}^{k_{m}}, (8)

where Dpi,bkbkmD^{k_{b}k_{m}}_{p_{i,b}} stands for the communication distance between the kbthk_{b}^{\rm th} subarray at the BS and kmthk_{m}^{\rm th} subarray at the IRS for the pi,bthp_{i,b}^{\rm th} path. Moreover, the received and transmitted array steering vectors are denoted as 𝐚rpi,bkbkm=𝐚Nab(ψrpi,bxkbkm,ψrpi,bzkbkm)\mathbf{a}_{rp_{i,b}}^{k_{b}k_{m}}=\mathbf{a}_{N_{ab}}(\psi_{rp_{i,b}x}^{k_{b}k_{m}},\psi_{rp_{i,b}z}^{k_{b}k_{m}}) and 𝐚tpi,bkbkm=𝐚Nam(ψtpi,bxkbkm,ψtpi,bzkbkm)\mathbf{a}_{tp_{i,b}}^{k_{b}k_{m}}=\mathbf{a}_{N_{am}}(\psi_{tp_{i,b}x}^{k_{b}k_{m}},\psi_{tp_{i,b}z}^{k_{b}k_{m}}) as (4). The virtual angles ψrpi,bxkbkm=sinθrpi,bkbkmcosϕrpi,bkbkm,\psi_{rp_{i,b}x}^{k_{b}k_{m}}\!=\!{\rm sin}\theta_{rp_{i,b}}^{k_{b}k_{m}}{\rm cos}\phi_{rp_{i,b}}^{k_{b}k_{m}}, ψrpi,bzkbkm=sinϕrpi,bkbkm,\psi_{rp_{i,b}z}^{k_{b}k_{m}}\!=\!{\rm sin}\phi_{rp_{i,b}}^{k_{b}k_{m}}, ψtpi,bxkbkm=sinθtpi,bkbkmcosϕtpi,bkbkm,\psi_{tp_{i,b}x}^{k_{b}k_{m}}\!=\!{\rm sin}\theta_{tp_{i,b}}^{k_{b}k_{m}}{\rm cos}\phi_{tp_{i,b}}^{k_{b}k_{m}}, ψtpi,bzkbkm=sinϕtpi,bkbkm,\psi_{tp_{i,b}z}^{k_{b}k_{m}}\!=\!{\rm sin}\phi_{tp_{i,b}}^{k_{b}k_{m}}, (θrpi,bkbkm,ϕrpi,bkbkm)(\theta_{rp_{i,b}}^{k_{b}k_{m}},\phi_{rp_{i,b}}^{k_{b}k_{m}}) and (θtpi,bkbkm,ϕtpi,bkbkm)(\theta_{tp_{i,b}}^{k_{b}k_{m}},\phi_{tp_{i,b}}^{k_{b}k_{m}}) represent the physical angles pairs. The passive beamforming matrix of the kmthk_{m}^{\rm th} subarray at IRS is denoted as 𝐏~km=diag{𝐩(kmNam+1:(km+1)Nam)}\tilde{\mathbf{P}}^{k_{m}}={\rm diag}\{\mathbf{p}(k_{m}N_{am}+1:(k_{m}+1)N_{am})\}.

In (7), the matrix 𝐄kmkuNam×Nau\mathbf{E}^{k_{m}k_{u}}\in\mathbb{C}^{N_{am}\times N_{au}} can be expressed as

𝐄kmku=pu,iNpUI|αpu,i|ej2πλDpu,ikmku𝐚rpu,ikmku(𝐚tpu,ikmku)H,\mathbf{E}^{k_{m}k_{u}}=\sum_{p_{u,i}}^{N_{p}^{\rm UI}}|\alpha_{p_{u,i}}|e^{-j\frac{2\pi}{\lambda}D^{k_{m}k_{u}}_{p_{u,i}}}\mathbf{a}_{rp_{u,i}}^{k_{m}k_{u}}(\mathbf{a}_{tp_{u,i}}^{k_{m}k_{u}})^{\rm H}, (9)

where NpUIN_{p}^{\rm UI} denotes the number of propagation paths in 𝐇UEIRS\mathbf{H}_{\rm UE-IRS}, pu,i=1,,NpUIp_{u,i}=1,\dots,N_{p}^{\rm UI}, αpu,i\alpha_{p_{u,i}} represents the path gain for the pu,ithp_{u,i}^{\rm th} path. Moreover, Dpu,ikmkuD^{k_{m}k_{u}}_{p_{u,i}} stands for the communication distance between the kmthk_{m}^{\rm th} subarray at IRS and kuthk_{u}^{\rm th} subarray at UE. The array steering vectors owning similar forms as (4) are denoted as 𝐚rpu,ikmku=𝐚Nam(ψrpu,ixkmku,ψrpu,izkmku)\mathbf{a}_{rp_{u,i}}^{k_{m}k_{u}}=\mathbf{a}_{N_{am}}(\psi_{rp_{u,i}x}^{k_{m}k_{u}},\psi_{rp_{u,i}z}^{k_{m}k_{u}}) and 𝐚tpu,ikmku=𝐚Nau(ψtpu,ixkmku,ψtpu,izkmku)\mathbf{a}_{tp_{u,i}}^{k_{m}k_{u}}=\mathbf{a}_{N_{au}}(\psi_{tp_{u,i}x}^{k_{m}k_{u}},\psi_{tp_{u,i}z}^{k_{m}k_{u}}), where ψrpu,ix=sinθrpu,ikmkucosϕrpu,ikmku\psi_{rp_{u,i}x}={\rm sin}\theta_{rp_{u,i}}^{k_{m}k_{u}}{\rm cos}\phi_{rp_{u,i}}^{k_{m}k_{u}}, ψrpu,iz=sinϕrpu,ikmku\psi_{rp_{u,i}z}={\rm sin}\phi_{rp_{u,i}}^{k_{m}k_{u}}, ψtpu,ix=sinθtpu,ikmkucosϕtpu,ikmku\psi_{tp_{u,i}x}={\rm sin}\theta_{tp_{u,i}}^{k_{m}k_{u}}{\rm cos}\phi_{tp_{u,i}}^{k_{m}k_{u}}, ψtpu,iz=sinϕtpu,ikmku\psi_{tp_{u,i}z}={\rm sin}\phi_{tp_{u,i}}^{k_{m}k_{u}}, (θrpu,ikmku,ϕrpu,ikmku)(\theta_{rp_{u,i}}^{k_{m}k_{u}},\phi_{rp_{u,i}}^{k_{m}k_{u}}) and (θtpu,ikmku,ϕtpu,ikmku)(\theta_{tp_{u,i}}^{k_{m}k_{u}},\phi_{tp_{u,i}}^{k_{m}k_{u}}) stand for the angle pairs at IRS and BS, respectively.

Based on (8) and (9), the (nab,nau)th(n_{ab},n_{au})^{\rm th} element for the production of 𝐆pi,bkbkm𝐄kmkuNab×Nau\mathbf{G}^{k_{b}k_{m}}_{p_{i,b}}\mathbf{E}^{k_{m}k_{u}}\in\mathbb{C}^{N_{ab}\times N_{au}} in (7) can be represented as

(𝐆pi,bkbkm𝐄kmku)(nab,nau)=pu,iNpUI|αpu,i|ej2πλ(Dpi,bkbkm+Dpu,ikmku)×nam=kmNam+1(km+1)Namexp[jπ(ζrpi,bnabkbkmζtpi,bnamkbkm+ζtpu,inaukmkuζrpu,inamkmku+ejp~nam)],\begin{split}&(\mathbf{G}^{k_{b}k_{m}}_{p_{i,b}}\mathbf{E}^{k_{m}k_{u}})(n_{ab},n_{au})=\sum_{p_{u,i}}^{N_{p}^{\rm UI}}|\alpha_{p_{u,i}}|e^{-j\frac{2\pi}{\lambda}(D^{k_{b}k_{m}}_{p_{i,b}}+D^{k_{m}k_{u}}_{p_{u,i}})}\times\\ &\sum_{n_{am}=k_{m}N_{am}+1}^{(k_{m}+1)N_{am}}\exp\left[-j\pi\left(\zeta_{rp_{i,b}n_{ab}}^{k_{b}k_{m}}-\zeta_{tp_{i,b}n_{am}}^{k_{b}k_{m}}+\zeta_{tp_{u,i}n_{au}}^{k_{m}k_{u}}-\zeta_{rp_{u,i}n_{am}}^{k_{m}k_{u}}+e^{j\tilde{p}_{n_{am}}}\right)\right],\end{split} (10)

where the aggregated phase ζrpi,bnabkbkm\zeta_{rp_{i,b}n_{ab}}^{k_{b}k_{m}} can be denoted as

ζrpi,bnabkbkm=(nabx1)ψrpi,bxkbkm+(nabz1)ψrpi,bzkbkm,\zeta_{rp_{i,b}n_{ab}}^{k_{b}k_{m}}=(n_{abx}-1)\psi_{rp_{i,b}x}^{k_{b}k_{m}}+(n_{abz}-1)\psi_{rp_{i,b}z}^{k_{b}k_{m}}, (11)

nab=nabxnabz=1,,Nabn_{ab}=n_{abx}n_{abz}=1,\dots,N_{ab}, with nabxn_{abx} and nabzn_{abz} index the positions of the element at the subarray of UE on x- and z-axis, respectively. Similarly, the aggregated phases ζtpi,bnamkbkm,\zeta_{tp_{i,b}n_{am}}^{k_{b}k_{m}}, ζtpu,inabkmku\zeta_{tp_{u,i}n_{ab}}^{k_{m}k_{u}} and ζrpu,inamkmku\zeta_{rp_{u,i}n_{am}}^{k_{m}k_{u}} in (7) can be expressed as

ζtpi,bnamkbkm\displaystyle\zeta_{tp_{i,b}n_{am}}^{k_{b}k_{m}} =(namx1)ψtpi,bzkbkm+(namz1)sinψrpi,bzkbkm,\displaystyle=(n_{amx}-1)\psi_{tp_{i,b}z}^{k_{b}k_{m}}+(n_{amz}-1){\rm sin}\psi_{rp_{i,b}z}^{k_{b}k_{m}}, (12a)
ζtpu,inabkmku\displaystyle\zeta_{tp_{u,i}n_{ab}}^{k_{m}k_{u}} =(naux1)ψtpu,ixkmku+(nauz1)ψtpu,izkmku,\displaystyle=(n_{aux}-1)\psi_{tp_{u,i}x}^{k_{m}k_{u}}+(n_{auz}-1)\psi_{tp_{u,i}z}^{k_{m}k_{u}}, (12b)
ζrpu,inamkmku\displaystyle\zeta_{rp_{u,i}n_{am}}^{k_{m}k_{u}} =(namx1)ψrpu,ixkmku+(namz1)ψrpu,izkmku,\displaystyle=(n_{amx}-1)\psi_{rp_{u,i}x}^{k_{m}k_{u}}+(n_{amz}-1)\psi_{rp_{u,i}z}^{k_{m}k_{u}}, (12c)

where nam=namxnamz=1,,Namn_{am}=n_{amx}n_{amz}=1,\dots,N_{am}, nau=nauxnauz=1,,Nabn_{au}=n_{aux}n_{auz}=1,\dots,N_{ab}.

III Spatial Multiplexing Gains Analysis

The cascaded channel 𝐇cas\mathbf{H}^{\rm cas} in (2) is composed of two channel segments, 𝐇IRSBS\mathbf{H}_{\rm IRS-BS} and 𝐇UEIRS\mathbf{H}_{\rm UE-IRS}. Under near-field and far-field conditions, the segmented channels can adopt different channel models. In this section, we analyze the spatial multiplexing capability of 𝐇cas\mathbf{H}^{\rm cas} in terms of channel rank, under different cases of the segmented channels.

III-A Ranks of PWM, SWM, and HSPM

To facilitate the analysis, we consider an end-to-end channel during the process of analyzing the ranks of PWM, SWM and HSPM and use Tx and Rx to represent each end. The number of propagation paths between Tx and Rx is denoted as NpN_{p}. As studied in [18], ranks of the PWM in (3) and SWM in (5) equal to NpN_{p} and N=min{Nt,Nr}N=\min\{N_{t},N_{r}\}, respectively. To illustrate the rank of the HSPM in (6), we present Lemma 1.

Lemma 1

The rank of 𝐇HSPM\mathbf{H}_{\rm HSPM} in (6) satisfies min{KrNp,KtNp,Nr,Nt}Rank(𝐇HSPM)min{KrKtNp,Nr,Nt}\min\{K_{r}N_{p},K_{t}N_{p},N_{r},N_{t}\}\leq{\rm Rank}(\mathbf{H}_{\rm HSPM})\leq\min\{K_{r}K_{t}N_{p},N_{r},N_{t}\}.

Proof: The dimension of the channel matrix 𝐇HSPM\mathbf{H}_{\rm HSPM} in (6) is Nr×NtN_{r}\times N_{t}, the maximum rank of the channel is min{Nr,Nt}\min\{N_{r},N_{t}\}. To prove the right-hand side inequality, we first consider for a fixed transmit subarray ktk_{t} and propagation path pp, elements in the set of array steering vectors {𝐚tp1kt,,𝐚tpKrkt}\left\{\mathbf{a}_{tp}^{1k_{t}},\dots,\mathbf{a}_{tp}^{K_{r}k_{t}}\right\} are linearly independent. Due to the distinguishability of propagation paths, the angles of different paths are different. Therefore, by fixing transmit and receive subarrays as krk_{r} and ktk_{t}, respectively, 𝐚tpkrkt\mathbf{a}_{tp}^{k_{r}k_{t}} for different propagation paths p=1,,Npp=1,\dots,N_{p} are linearly independent.

We consider that the nrthn_{r}^{\rm th} received antenna is the narthn_{ar}^{\rm th} element on the krthk_{r}^{\rm th} received subarray. Therefore, the nrthn_{r}^{\rm th} row of the HSPM channel in (6) 𝐇HSPM(nr,:)\mathbf{H}_{\rm HSPM}(n_{r},:) can be expressed as

𝐇HSPM(nr,:)=[p=1Npβpkr1[𝐚rpkr1(nar)(𝐚tpkr1)H],,p=1NpβpkrKt[𝐚rpkrKt(nar)(𝐚tpkrKt)H]].\mathbf{H}_{\rm HSPM}(n_{r},:)=\bigg{[}\sum_{p=1}^{N_{p}}\beta_{p}^{k_{r}1}\left[\mathbf{a}_{rp}^{k_{r}1}(n_{ar})(\mathbf{a}_{tp}^{k_{r}1})^{\rm H}\right],\dots,\sum_{p=1}^{N_{p}}\beta_{p}^{k_{r}K_{t}}\left[\mathbf{a}_{rp}^{k_{r}K_{t}}(n_{ar})(\mathbf{a}_{tp}^{k_{r}K_{t}})^{\rm H}\right]\bigg{]}. (13)

Each row of 𝐇HSPM\mathbf{H}_{\rm HSPM} is a linear combination of KrKtNpK_{r}K_{t}N_{p} linearly independent vectors as

[(𝐚tpkr1)H,𝟎,,𝟎],\displaystyle\Big{[}(\mathbf{a}_{tp}^{k_{r}1})^{\rm H},\mathbf{0},\dots,\mathbf{0}\Big{]}, (14a)
[𝟎,(𝐚tpkr2)H,𝟎,,𝟎],\displaystyle\Big{[}\mathbf{0},(\mathbf{a}_{tp}^{k_{r}2})^{\rm H},\mathbf{0},\dots,\mathbf{0}\Big{]}, (14b)
\displaystyle~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}\dots (14c)
[𝟎,,𝟎,(𝐚tpkrKt)H],\displaystyle\Big{[}\mathbf{0},\dots,\mathbf{0},(\mathbf{a}_{tp}^{k_{r}K_{t}})^{\rm H}\Big{]}, (14d)

where kr=1,,Krk_{r}=1,\dots,K_{r} and 𝟎\mathbf{0} is an all-zero vector of dimension 1×Nat1\times N_{at}.

However, the angles of different paths to different received subarrays might be the same, leading that vectors in (14) can be linearly dependent, which reduces the rank of the HSPM channel. Thus, there is Rank(𝐇HSPM)min{KrKtNp,Nr,Nt}{\rm Rank}(\mathbf{H}_{\rm HSPM})\leq\min\{K_{r}K_{t}N_{p},N_{r},N_{t}\}. To prove the left-hand side inequality, we consider an extreme case. For a fixed propagation path, the angles among different subarray pairs between Tx and Rx are same. In this case, the HSPM equals to the channel model in [24], whose rank has been proved to be equal to min{KrNp,KtNp,Nr,Nt}\min\{K_{r}N_{p},K_{t}N_{p},N_{r},N_{t}\}, which lower bounds the rank of the HSPM. Till here, we have completed the proof for Lemma 1. \hfill\blacksquare

III-B Cascaded Channel Rank Analysis

To analyze the rank of the cascaded channel, we first introduce the following lemma.

Lemma 2

For matrices 𝐀M×N\mathbf{A}\in\mathbb{C}^{M\times N}, 𝐁N×N\mathbf{B}\in\mathbb{C}^{N\times N} and 𝐂N×K\mathbf{C}\in\mathbb{C}^{N\times K}, where 𝐁\mathbf{B} is a diagonal matrix, and rank(𝐀)=Ra{\rm rank}(\mathbf{A})=R_{a}, rank(𝐁)=N{\rm rank}(\mathbf{B})=N, rank(𝐂)=Rc{\rm rank}(\mathbf{C})=R_{c}, we have

rank(𝐀𝐁𝐂)min{Ra,Rc},{\rm rank}(\mathbf{ABC})\leq\min\{R_{a},R_{c}\}, (15)

where the equality holds when 𝐀\mathbf{A} and 𝐂\mathbf{C} are full-rank matrices.

Proof: Since 𝐁\mathbf{B} is full row rank, we have rank(𝐀𝐁)=rank(𝐀)=Ra{\rm rank}(\mathbf{AB})={\rm rank}(\mathbf{A})=R_{a}. Then, rank(𝐀𝐁𝐂)min{rank(𝐀𝐁),rank(𝐂)}=min{Ra,Rc}{\rm rank}(\mathbf{ABC})\leq\min\{{\rm rank}(\mathbf{AB}),{\rm rank}(\mathbf{C})\}=\min\{R_{a},R_{c}\}. When 𝐀\mathbf{A} is a full-rank matrix, we have rank(𝐀𝐁)=rank(𝐀)=N{\rm rank}(\mathbf{AB})={\rm rank}(\mathbf{A})=N. When 𝐂\mathbf{C} is a full-rank matrix, we have rank(𝐀𝐁𝐂)=rank(𝐀𝐁)=N{\rm rank}(\mathbf{ABC})={\rm rank}(\mathbf{AB})=N. \hfill\blacksquare

Next, we analyze the rank of the cascaded channel. We adopt the PWM in the far-field region, while the SWM and HSPM are deployed for the near-field region, respectively.

III-B1 Both segmented channels satisfy the far-field condition

In this case, both 𝐇IRSBS\mathbf{H}_{\rm IRS-BS} and 𝐇UEIRS\mathbf{H}_{\rm UE-IRS} in (2) adopt PWM, whose ranks equal to NpIBN_{p}^{\rm IB} and NpUIN_{p}^{\rm UI}, respectively. By denoting 𝐇cas\mathbf{H}^{\rm cas} in (2) as 𝐇PWMcas\mathbf{H}^{\rm cas}_{\rm PWM}, from Lemma 2, we can state that rank(𝐇PWMcas)min{NpIU,NpBI}{\rm rank}(\mathbf{H}^{\rm cas}_{\rm PWM})\leq\min\{N_{p}^{\rm IU},N_{p}^{\rm BI}\}. Moreover, when NpIU=NpBI=NpN_{p}^{\rm IU}=N_{p}^{\rm BI}=N_{p}, rank(𝐇PWMcas)Np{\rm rank}(\mathbf{H}^{\rm cas}_{\rm PWM})\leq N_{p}.

III-B2 One of the segmented channels satisfies the far-field condition, while the other satisfies the near-field condition

We first consider 𝐇IRSBS\mathbf{H}_{\rm IRS-BS} satisfies the far-field condition and adopts the PWM, while 𝐇UEIRS\mathbf{H}_{\rm UE-IRS} meets the near-field condition, which deploys SWM or HSPM. Since the number of propagation paths in the THz channel is much smaller than the number of elements in the UM-MIMO and IRS, rank(𝐇IRSBS)=NpIB<rank(𝐇UEIRS){\rm rank}(\mathbf{H}_{\rm IRS-BS})=N^{\rm IB}_{p}<{\rm rank}(\mathbf{H}_{\rm UE-IRS}). From Lemma 2, we can obtain that rank(𝐇cas)NpIB{\rm rank}(\mathbf{H}^{\rm cas})\leq N_{p}^{\rm IB}. A similar deduction can be drawn when 𝐇IRSBS\mathbf{H}_{\rm IRS-BS} meets the near-field condition while 𝐇UEIRS\mathbf{H}_{\rm UE-IRS} satisfies the far-field condition. Thus, when NpIU=NpBI=NpN_{p}^{\rm IU}=N_{p}^{\rm BI}=N_{p}, there is rank(𝐇cas)Np{\rm rank}(\mathbf{H}^{\rm cas})\leq N_{p}.

III-B3 Both segmented channels satisfy the near-field condition

We denote 𝐇cas\mathbf{H}^{\rm cas} as 𝐇SWMcas\mathbf{H}^{\rm cas}_{\rm SWM} when both 𝐇IRSBS\mathbf{H}_{\rm IRS-BS} and 𝐇UEIRS\mathbf{H}_{\rm UE-IRS} meet the near-field condition and adopt SWM. In this case, 𝐇IRSBS\mathbf{H}_{\rm IRS-BS}, 𝐏¯\overline{\mathbf{P}} and 𝐇UEIRS\mathbf{H}_{\rm UE-IRS} are full-rank matrices. Based on Lemma 2, we know that rank(𝐇SWMcas)=min{M,Nu,Nb}{\rm rank}(\mathbf{H}^{\rm cas}_{\rm SWM})=\min\{M,N_{u},N_{b}\}. When Nu=Nb=M=NN_{u}=N_{b}=M=N, rank(𝐇SWMcas)=N{\rm rank}(\mathbf{H}^{\rm cas}_{\rm SWM})=N. Similarly, we denote 𝐇cas\mathbf{H}^{\rm cas} as 𝐇HSPMcas\mathbf{H}^{\rm cas}_{\rm HSPM} when both 𝐇IRSBS\mathbf{H}_{\rm IRS-BS} and 𝐇UEIRS\mathbf{H}_{\rm UE-IRS} adopt HSPM. By considering that Km=Ku=Kb=KK_{m}=K_{u}=K_{b}=K and NpIU=NpBI=NpN_{p}^{\rm IU}=N_{p}^{\rm BI}=N_{p}, we have rank(𝐇HSPMcas)K2Np{\rm rank}(\mathbf{H}^{\rm cas}_{\rm HSPM})\leq K^{2}N_{p}.

From the above analysis, we can state that in the THz integrated UM-MIMO and IRS systems, the total rank of the cascaded channel is limited by the segmented channel with a smaller rank. This suggests that the rank of the cascaded channel is increased only when both segmented channels meet the near-field condition, which inspires us to enlarge the array size and obtain a larger near-field region. It is worth noticing that the above discussions are not dependent on the IRS beamforming matrix 𝐏¯\overline{\mathbf{P}}. Therefore, we further claim that given fixed segmented channels, the channel rank can not be improved by the IRS.

We will show in Sec. V that the capacity of the THz integrated UM-MIMO and IRS system based on HSPM is close to that based on the ground truth SWM, which reveals the accuracy of the HSPM. In addition, the HSPM possesses lower complexity compared to the SWM [13]. Therefore, we directly adopt the HSPM for both segmented channels during the CE process.

IV Channel Estimation

In this section, we present the CS-based CE framework for the THz integrated UM-MIMO and IRS communication systems, which is composed of three steps, namely, on-grid sparse channel representation, signal observation and sparse recovery algorithm. Specifically, the sparse channel representation is based on an on-grid codebook, by which the channel matrix is expressed as the production of the codebook and a sparse matrix. We first introduce the traditional DFT codebook, which is shown to be ineffective in the considered integrated systems. Inspired by this, we propose a subarray-based codebook by considering the characteristic of the HSPM channel, which possesses higher sparsity and accuracy than the DFT codebook. Second, we introduce the training procedure to obtain the channel observation and formulate the CE problem as a sparse recovery problem. Third, to obtain the CE result, we develop the low-complexity SSE algorithm with high accuracy. The spatial correlation inspired DSE algorithm is further developed, which possesses lower complexity compared to the SSE, at the cost of slightly degraded accuracy.

IV-A On-grid Sparse Channel Representation

IV-A1 Traditional DFT-based Sparse Channel Representation

In the literature, the spatial DFT-based on-grid codebook is widely deployed [31, 32, 33, 34]. This codebook treats the entire antenna array as a unit, and considers that the virtual spatial angles ψx=sinθcosϕ\psi_{x}={\rm sin}\theta{\rm cos}\phi and ψz=sinϕ\psi_{z}={\rm sin}\phi are taken form a uniform grid composed of NxN_{x} and NzN_{z} points, respectively. θ\theta and ϕ\phi denote the azimuth and elevation angles, while NxN_{x} and NzN_{z} refer to the number of antennas on x- and z-axis, respectively. In this way, the channel is sparsely represented as

𝐇=𝐀Dr𝚲D𝐀DtH\mathbf{H}=\mathbf{A}_{\rm Dr}\boldsymbol{\Lambda}_{\rm D}\mathbf{A}_{\rm Dt}^{\rm H} (16)

where 𝐀DrNr×Nr\mathbf{A}_{\rm Dr}\in\mathbb{C}^{N_{r}\times N_{r}} and 𝐀DzNt×Nt\mathbf{A}_{\rm Dz}\in\mathbb{C}^{N_{t}\times N_{t}} refer to the two-dimensional DFT on-grid codebooks at Rx and Tx, respectively, which hold a similar form, and can be represented as 𝐀D=[𝐚N(1,1)𝐚N(2(nx1)Nx1,2(nz1)Nz1)\mathbf{A}_{\rm D}=\Big{[}\mathbf{a}_{N}(-1,-1)\dots\mathbf{a}_{N}\left(\frac{2(n_{x}-1)}{N_{x}}-1,\frac{2(n_{z}-1)}{N_{z}}-1\right) 𝐚Nx(2(Nx1)Nx1,2(Nz1)Nz1)]\dots\mathbf{a}_{N_{x}}\left(\frac{2(N_{x}-1)}{N_{x}}-1,\frac{2(N_{z}-1)}{N_{z}}-1\right)\Big{]}. The sparse on-grid channel with complex gains on the quantized spatial angles is depicted by 𝚲DNr×Nt\boldsymbol{\Lambda}_{\rm D}\in\mathbb{C}^{N_{r}\times N_{t}}.

To assess the performance of the DFT codebook, we first evaluate the sparsity of on-grid channel 𝚲D\boldsymbol{\Lambda}_{\rm D} in (16) in different cases, by considering the HSPM. Moreover, since in practice, there does not exist a grid whose amplitude is strictly equal to 0, we consider that the sparsity of the on-grid channel equals the number of grids whose amplitude is greater than a small value, e.g., 0.01. First, as illustrated in Fig. 2(a), the amplitude of 𝚲D\boldsymbol{\Lambda}_{\rm D} is shown by considering a compact array without enlarging the subarray spacing. The on-grid channel is sparse, the number of grids with an amplitude larger than 0.01 is only 397, which is much smaller than the preset total number of grids, i.e., 262144. By contrast, in Fig. 2(b), the amplitude of the on-grid channel in the WSMS is plotted. The on-grid channel contains 2755 grids with amplitude larger than 0.01. Therefore, the DFT codebook lacks sparsity in representing the HSPM.

Refer to caption
(a) Amplitude of the on-grid channel in compact array using the DFT codebook.
Refer to caption
(b) Amplitude of the on-grid channel in the WSMS using the DFT codebook, the subarray spacing is 64λ64\lambda.
Refer to caption
(c) Amplitude of the on-grid channel in the WSMS using the subarray-based codebook, the subarray spacing is 64λ64\lambda.
Figure 2: Amplitude of on-grid channels in the compact array and WSMS with different codebooks by considering HSPM, Nt=256N_{t}=256, Nr=1024N_{r}=1024, Kt=Kr=4K_{t}=K_{r}=4. The number of propagation path is 1.

IV-A2 Proposed Subarray-based Sparse Channel Representation

We observe that the HSPM in (6) views each subarray as a unit, each block of which is the production of the array steering vectors for the subarrays at Rx and Tx, respectively. Inspired by this, we consider a subarray-based on-grid codebook. At Rx, the virtual spatial angles for each subarray are considered to be taken from fixed Nar=NarxNarzN_{ar}=N_{arx}N_{arz} grids, where NarxN_{arx} and NarzN_{arz} refer to the number of elements on x- and z-axis of the subarray at Rx, respectively. The corresponding DFT codebook is expressed as 𝐔Dr=[𝐚Nar(1,1),\mathbf{U}_{\rm Dr}=\Big{[}\mathbf{a}_{N_{ar}}(-1,-1), ,𝐚Nar(2(narx1)Narx1,2(narz1)Narz1),\dots\!,\mathbf{a}_{N_{ar}}\left(\frac{2(n_{arx}-1)}{N_{arx}}-1,\frac{2(n_{arz}\!-\!1)}{N_{arz}}-1\!\right)\!, ,𝐚Nar(2(Narx1)Narx1,2(Narz1)Narz1)],\dots,\mathbf{a}_{N_{ar}}\left(\frac{2(N_{arx}-1)}{N_{arx}}-1,\frac{2(N_{arz}-1)}{N_{arz}}-1\right)\Big{]}, narx=1,,Narxn_{arx}=1,\dots,N_{arx}, narz=1,,Narzn_{arz}=1,\dots,N_{arz}. We define 𝐀¯rNr×Nr\overline{\mathbf{A}}_{\rm r}\in\mathbb{C}^{N_{r}\times N_{r}} as the subarray-based codebook at Rx, which deploys KrK_{r} 𝐔Dr\mathbf{U}_{\rm Dr} on its diagonal as

𝐀¯r=blkdiag[𝐔Dr,,𝐔Dr].\overline{\mathbf{A}}_{\rm r}={\rm blkdiag}\left[\mathbf{U}_{\rm Dr},\dots,\mathbf{U}_{\rm Dr}\right]. (17)

The on-grid codebook matrix at Tx 𝐀¯tNt×Nt\overline{\mathbf{A}}_{\rm t}\in\mathbb{C}^{N_{t}\times N_{t}} is constructed similarly. Therefore, the on-grid representation of the HSPM in (6) based on the subarray-based codebook can be denoted as

𝐇HSPM𝐀¯r𝚲¯𝐀¯tH,\mathbf{H}_{\rm HSPM}\approx\overline{\mathbf{A}}_{\rm r}\overline{\boldsymbol{\Lambda}}\overline{\mathbf{A}}_{\rm t}^{\rm H}, (18)

where 𝚲¯Nr×Nt\overline{\boldsymbol{\Lambda}}\in\mathbb{C}^{N_{r}\times N_{t}} is a sparse matrix. If all spatial angles were taken from the grids and not equal to each other, 𝚲¯\overline{\boldsymbol{\Lambda}} would contain KrKtNpK_{r}K_{t}N_{p} non-zero elements.

The amplitude of the on-grid channel 𝚲¯\overline{\boldsymbol{\Lambda}} in (18) using the proposed codebook is plotted in Fig. 2(c), by considering the same channel as in Fig. 2(b). The number of grids with an amplitude larger than 0.01 is 1609, which is 1164 smaller than that by using the DFT codebook in Fig. 2(b). In addition, to reveal the accuracy of the on-grid channel, we calculate the difference between the real channel 𝐇HSPM{\mathbf{H}}_{\rm HSPM} and the reconstructed channels approximated by the on-grid channel and the codebooks in (16) and (18) as 𝐀Dr𝚲D𝐀DtH𝐇HSPM22𝐇HSPM22{\frac{\left\|\mathbf{A}_{\rm Dr}\boldsymbol{\Lambda}_{\rm D}\mathbf{A}_{\rm Dt}^{\rm H}-{\mathbf{H}}_{\rm HSPM}\right\|_{2}^{2}}{{\left\|{\mathbf{H}}_{\rm HSPM}\right\|_{2}^{2}}}} and 𝐀¯r𝚲¯𝐀¯tH𝐇HSPM22𝐇HSPM22{\frac{\left\|\overline{\mathbf{A}}_{\rm r}\overline{\boldsymbol{\Lambda}}\overline{\mathbf{A}}_{\rm t}^{\rm H}-{\mathbf{H}}_{\rm HSPM}\right\|_{2}^{2}}{{\left\|{\mathbf{H}}_{\rm HSPM}\right\|_{2}^{2}}}}, respectively. The approximation error based on the proposed codebook is around 4 dB lower than that based on the DFT codebook. To this end, we state that the proposed codebook is more efficient than the traditional DFT codebook, which possesses higher sparsity and lower approximation error.

IV-B Training Process and Problem Formulation

Refer to caption
Figure 3: Illustration of the training process.

IV-B1 Training Process for Channel Observation

We consider an uplink pilot training procedure, as illustrated in Fig. 3, which is conducted in three levels, namely, the BS training, UE training and IRS training, respectively. During the training process, the UE transmits the known pilot signals to the BS via the IRS in T=TbTuTiT=T_{b}T_{u}T_{i} training slots for uplink CE, where TbT_{b}, TiT_{i} and TuT_{u} denote the number of training slots for the BS, IRS and UE, respectively. At the (b,u,i)th(b,u,i)^{\rm th} slot, the UE deploys the training beamformer 𝐅~uNu×Nsu\tilde{\mathbf{F}}_{u}\in\mathbb{C}^{N_{u}\times N_{su}} and transmits the pilot signal 𝐬b,u,iNsu\mathbf{s}_{b,u,i}\in\mathbb{C}^{N_{su}}, b=1,,Tbb=1,\dots,T_{b}, u=1,,Tuu=1,\dots,T_{u}, i=1,,Tii=1,\dots,T_{i}. In the meantime, the training phase shift vector 𝐩iM\mathbf{p}_{i}\in\mathbb{C}^{M} and combiner 𝐖bNb×Nsb\mathbf{W}_{b}\in\mathbb{C}^{N_{b}\times N_{sb}} are deployed at the IRS and BS, respectively, to obtain the received signal 𝐲b,u,iNsb{\mathbf{y}}_{b,u,i}\in\mathbb{C}^{N_{sb}} at the BS, which can be represented as

𝐲b,u,i=𝐖bH𝐇IRSBSdiag{𝐩i}𝐇UEIRS𝐅~u𝐬b,u,i+𝐧b,u,i,{\mathbf{y}}_{b,u,i}=\mathbf{W}_{b}^{\rm H}\mathbf{H}_{\rm IRS-BS}{\rm diag}\{\mathbf{p}_{i}\}\mathbf{H}_{\rm UE-IRS}\tilde{\mathbf{F}}_{u}\mathbf{s}_{b,u,i}+{\mathbf{n}}_{b,u,i}, (19)

where 𝐧b,u,i=𝐖bH𝐧~b,u,iNsb{\mathbf{n}}_{b,u,i}=\mathbf{W}_{b}^{\rm H}\tilde{\mathbf{n}}_{b,u,i}\in\mathbb{C}^{N_{sb}}, and 𝐧~b,u,iNb\tilde{\mathbf{n}}_{b,u,i}\in\mathbb{C}^{N_{b}} refers to the received AWGN.

The BS training is first conducted, in which totally TbT_{b} different training combiners are used to obtain the received signal as (19). By collecting 𝐲b,u,i,b=1,,Tb\mathbf{y}_{b,u,i},b=1,\dots,T_{b} as 𝐲u,i=[𝐲1,u,iT,,𝐲Tb,u,iT]TNsbTb\mathbf{y}_{u,i}=[\mathbf{y}_{1,u,i}^{\rm T},\dots,\mathbf{y}_{T_{b},u,i}^{\rm T}]^{\rm T}\in\mathbb{C}^{N_{sb}T_{b}}, the received signal after BS training can be expressed as

𝐲u,i=(𝐟uT𝐖H)𝐇mul𝐩i+𝐧u,i,\mathbf{y}_{u,i}=\left(\mathbf{f}_{u}^{\rm T}\otimes\mathbf{W}^{\rm H}\right)\mathbf{H}^{\rm mul}\mathbf{p}_{i}+\mathbf{n}_{u,i}, (20)

where 𝐟u=𝐅~u𝐬b,u,iNu\mathbf{f}_{u}=\tilde{\mathbf{F}}_{u}\mathbf{s}_{b,u,i}\in\mathbb{C}^{N_{u}} stands for the equivalent training beamformer, 𝐖=[𝐖1,,𝐖Tb]Nb×NsbTb\mathbf{W}=[\mathbf{W}_{1},\dots,\mathbf{W}_{T_{b}}]\in\mathbb{C}^{N_{b}\times N_{sb}T_{b}} denotes the training combiner. The multiplied channel matrix 𝐇mul=𝐇UEIRST𝐇IRSBSNuNb×M\mathbf{H}^{\rm mul}=\mathbf{H}_{\rm UE-IRS}^{\rm T}\circ\mathbf{H}_{\rm IRS-BS}\in\mathbb{C}^{N_{u}N_{b}\times M}, and 𝐧u,i=[𝐧1,u,iT,,𝐧Tb,u,iT]TNsbTb\mathbf{n}_{u,i}=[\mathbf{n}_{1,u,i}^{\rm T},\dots,\mathbf{n}_{T_{b},u,i}^{\rm T}]^{\rm T}\in\mathbb{C}^{N_{sb}T_{b}} represents the collected noise.

After one round of BS training, the UE changes its beamformer 𝐅~u\tilde{\mathbf{F}}_{u} to complete the UE training. Particularly, totally TuT_{u} beamformers are used to obtain the received signal as (20). By collecting 𝐲u,i\mathbf{y}_{u,i} for u=1,,Tuu=1,\dots,T_{u} as 𝐲i=[𝐲1,iT,,𝐲Tu,iT]TNsbTbTu\mathbf{y}_{i}=[\mathbf{y}_{1,i}^{\rm T},\dots,\mathbf{y}_{T_{u},i}^{\rm T}]^{\rm T}\in\mathbb{C}^{N_{sb}T_{b}T_{u}}, we can obtain

𝐲i=(𝐅T𝐖H)𝐇mul𝐩i+𝐧i,\mathbf{y}_{i}=\left(\mathbf{F}^{\rm T}\otimes\mathbf{W}^{\rm H}\right)\mathbf{H}^{\rm mul}\mathbf{p}_{i}+\mathbf{n}_{i}, (21)

where 𝐅=[𝐟1,,𝐟Tu]Nu×Tu\mathbf{F}=[\mathbf{f}_{1},\dots,\mathbf{f}_{T_{u}}]\in\mathbb{C}^{N_{u}\times T_{u}} denotes the UE training beamforming matrix. Moreover, 𝐧i=[𝐧1,iT,,𝐧Tu,iT]TNsbTbTu\mathbf{n}_{i}=[\mathbf{n}_{1,i}^{\rm T},\dots,\mathbf{n}_{T_{u},i}^{\rm T}]^{\rm T}\in\mathbb{C}^{N_{sb}T_{b}T_{u}} represents the noise. Finally, the phase shift vector of the IRS 𝐩i\mathbf{p}_{i} is changed to conduct the IRS training. After obtaining each 𝐲i\mathbf{y}_{i} as (21), i=1,,Tii=1,\dots,T_{i}, we stack 𝐲i\mathbf{y}_{i} as 𝐘=[𝐲1,,𝐲Ti]NsbTbTu×Ti\mathbf{Y}=[\mathbf{y}_{1},\dots,\mathbf{y}_{T_{i}}]\in\mathbb{C}^{N_{sb}T_{b}T_{u}\times T_{i}}, which can be represented as

𝐘=(𝐅T𝐖H)𝐇mul𝐏+𝐍,\mathbf{Y}=\left(\mathbf{F}^{\rm T}\otimes\mathbf{W}^{\rm H}\right)\mathbf{H}^{\rm mul}\mathbf{P}+\mathbf{N}, (22)

where 𝐏=[𝐩1,,𝐩Ti]M×Ti\mathbf{P}=[\mathbf{p}_{1},\dots,\mathbf{p}_{T_{i}}]\in\mathbb{C}^{M\times T_{i}} refers to the training phase shift matrix. In addition, 𝐍=[𝐧1,,𝐧Ti]NsbTbTu×Ti\mathbf{N}=[\mathbf{n}_{1},\dots,\mathbf{n}_{T_{i}}]\in\mathbb{C}^{N_{sb}T_{b}T_{u}\times T_{i}} represents the stacked noise.

In this work, CE refers to estimating the multiplied channel matrix 𝐇mul\mathbf{H}^{\rm mul} in (22). Based on the proposed codebook in (18), 𝐇mul\mathbf{H}^{\rm mul} in (22) can be represented as

𝐇mul\displaystyle\mathbf{H}^{\rm mul} (𝐀¯tUEIRS𝚲¯UEIRST𝐀¯rUEIRST)(𝐀¯rIRSBS𝚲¯IRSBS𝐀¯tIRSBSH),\displaystyle\approx\left(\overline{\mathbf{A}}_{\rm tUE-IRS}^{*}\overline{\boldsymbol{\Lambda}}_{\rm UE-IRS}^{\rm T}\overline{\mathbf{A}}_{\rm rUE-IRS}^{\rm T}\right)\circ\left(\overline{\mathbf{A}}_{\rm rIRS-BS}\overline{\boldsymbol{\Lambda}}_{\rm IRS-BS}\overline{\mathbf{A}}_{\rm tIRS-BS}^{\rm H}\right), (23a)
=𝐀r𝚲~𝐀~t,\displaystyle=\mathbf{A}_{\rm r}\tilde{\boldsymbol{\Lambda}}\tilde{\mathbf{A}}_{\rm t}, (23b)

where 𝐀¯tUEIRSNu×Nu\overline{\mathbf{A}}_{\rm tUE-IRS}\in\mathbb{C}^{N_{u}\times N_{u}} and 𝐀¯rUEIRSM×M\overline{\mathbf{A}}_{\rm rUE-IRS}\in\mathbb{C}^{M\times M} denote the codebook matrices for the UE-IRS channel at UE and IRS, respectively, 𝚲¯UEIRSM×Nu\overline{\boldsymbol{\Lambda}}_{\rm UE-IRS}\in\mathbb{C}^{M\times N_{u}} denotes the sparse on-grid channel. The codebook matrices for the IRS-BS channel at the BS and IRS are denoted as 𝐀¯rIRSBSNb×Nb\overline{\mathbf{A}}_{\rm rIRS-BS}\in\mathbb{C}^{N_{b}\times N_{b}} and 𝐀¯tIRSBSM×M\overline{\mathbf{A}}_{\rm tIRS-BS}\in\mathbb{C}^{M\times M}, respectively. 𝚲¯IRSBSNb×M\overline{\boldsymbol{\Lambda}}_{\rm IRS-BS}\in\mathbb{C}^{N_{b}\times M} stands for the corresponding sparse matrix. Moreover, 𝐀r=(𝐀¯tUEIRS𝐀¯rIRSBS)NuNb×NuNb\mathbf{A}_{\rm r}=\left(\overline{\mathbf{A}}_{\rm tUE-IRS}^{*}\otimes\overline{\mathbf{A}}_{\rm rIRS-BS}\right)\in\mathbb{C}^{N_{u}N_{b}\times N_{u}N_{b}} stands for the combined codebook matrix at the left-hand side, 𝚲~=(𝚲¯UEIRST𝚲¯IRSBS)NuNb×M2\tilde{\boldsymbol{\Lambda}}=\left(\overline{\boldsymbol{\Lambda}}_{\rm UE-IRS}^{\rm T}\otimes\overline{\boldsymbol{\Lambda}}_{\rm IRS-BS}\right)\in\mathbb{C}^{N_{u}N_{b}\times M^{2}} depicts the multiplied sparse matrix, 𝐀~t=(𝐀¯rUEIRST𝐀¯tIRSBSH)M2×M\tilde{\mathbf{A}}_{\rm t}=\left(\overline{\mathbf{A}}_{\rm rUE-IRS}^{\rm T}\circ\overline{\mathbf{A}}_{\rm tIRS-BS}^{\rm H}\right)\in\mathbb{C}^{M^{2}\times M} represents the combined transmit codebook matrix.

It is worth noticing that 𝐀¯rUEIRS=𝐀¯tIRSBSH=𝐀¯IRS\overline{\mathbf{A}}_{\rm rUE-IRS}=\overline{\mathbf{A}}_{\rm tIRS-BS}^{\rm H}=\overline{\mathbf{A}}_{\rm IRS}, where 𝐀¯IRS\overline{\mathbf{A}}_{\rm IRS} denotes the on-grid codebook matrix at the IRS. Therefore, the multiplied channel matrix can be transformed as

𝐇mul𝐀r𝚲𝐀t,\mathbf{H}^{\rm mul}\approx\mathbf{A}_{\rm r}{\boldsymbol{\Lambda}}{\mathbf{A}}_{\rm t}, (24)

where 𝚲NuNb×M{\boldsymbol{\Lambda}}\in\mathbb{C}^{N_{u}N_{b}\times M} denotes the sparse on-grid channel mstrix, which is a function of 𝐀r\mathbf{A}_{\rm r}, 𝐀~t\tilde{\mathbf{A}}_{\rm t} and 𝚲~\tilde{\boldsymbol{\Lambda}}, 𝐀t=𝐀IRSM×M{\mathbf{A}}_{\rm t}={\mathbf{A}}_{\rm IRS}\in\mathbb{C}^{M\times M} denotes the codebook matrix on the right-hand side. In addition, we point out that the rows of the non-zero elements in 𝚲{\boldsymbol{\Lambda}} corresponds to the grid points in 𝐀r{\mathbf{A}}_{\rm r}, while the columns of non-zero elements in 𝚲{\boldsymbol{\Lambda}} indicate the grid points in 𝐀t{\mathbf{A}}_{\rm t}.

IV-B2 Problem Formulation

By combining the on-grid channel representation in (24) with the channel observation in (22), we can obtain

𝐘=(𝐅T𝐖H)𝐀r𝚲𝐀t𝐏+𝐍.\mathbf{Y}=\left(\mathbf{F}^{\rm T}\otimes\mathbf{W}^{\rm H}\right)\mathbf{A}_{\rm r}{\boldsymbol{\Lambda}}{\mathbf{A}}_{\rm t}\mathbf{P}+\mathbf{N}. (25)

The CE problem can be formulated as a sparse signal recovery problem as

min𝚲0,\displaystyle{\min}~{}\left\|\boldsymbol{\Lambda}\right\|_{0}, (26a)
s.t.𝐘(𝐅T𝐖H)𝐀r𝚲𝐀t𝐏0ϵ,\displaystyle{\rm s.t.}~{}\left\|\mathbf{Y}-\left(\mathbf{F}^{\rm T}\otimes\mathbf{W}^{\rm H}\right)\mathbf{A}_{\rm r}{\boldsymbol{\Lambda}}{\mathbf{A}}_{\rm t}\mathbf{P}\right\|_{0}\leq\epsilon, (26b)

where ϵ\epsilon is a constant to measure the estimation error. In addition, the l0l_{0} norm in problem (26) is usually transformed into the l1l_{1} norm, due to its non-convexity [35].

To solve the problem in (26), the received signal 𝐘\mathbf{Y} can be vectorized as 𝐲vec=vec{𝐘}NsbT\mathbf{y}_{\rm vec}={\rm vec}\{\mathbf{Y}\}\in\mathbb{C}^{N_{sb}T} to obtain 𝐲vec=𝚽~𝚿~𝐡+𝐧vec\mathbf{y}_{\rm vec}=\tilde{\boldsymbol{\Phi}}\tilde{\boldsymbol{\Psi}}\mathbf{h}+\mathbf{n}_{\rm vec}, where 𝚽~=(𝐏T𝐅T𝐖H)NsbT×NuNbM\tilde{\boldsymbol{\Phi}}=\left(\mathbf{P}^{\rm T}\otimes\mathbf{F}^{\rm T}\otimes\mathbf{W}^{\rm H}\right)\in\mathbb{C}^{N_{sb}T\times N_{u}N_{b}M} defines the measurement matrix, the overall codebook matrix is 𝚿~=(𝐀tT𝐀r)NuNbM×NuNbM\tilde{\boldsymbol{\Psi}}=\left(\mathbf{A}_{\rm t}^{\rm T}\otimes\mathbf{A}_{\rm r}\right)\in\mathbb{C}^{N_{u}N_{b}M\times N_{u}N_{b}M}. Moreover, 𝐡=vec{𝚲}NuNbM\mathbf{h}={\rm vec}\{\boldsymbol{\Lambda}\}\in\mathbb{C}^{N_{u}N_{b}M} is a sparse vector containing the complex gains on the grids of the codebook, 𝐧vec=vec{𝐍}NsbT\mathbf{n}_{\rm vec}={\rm vec}\{\mathbf{N}\}\in\mathbb{C}^{N_{sb}T} represents the vectorized noise. Various of greedy algorithms such as orthogonal matching pursuit (OMP) [33] and compressive sampling matching pursuit (CoSaMP) [36] can be used to recover 𝐡\mathbf{h} from 𝐲vec\mathbf{y}_{\rm vec}. However, the dimension of 𝚿~\tilde{\boldsymbol{\Psi}} is proportional to the number of antennas at BS NbN_{b}, UE NuN_{u} and the number of passive reflecting elements at IRS MM. In our considered UM-MIMO and IRS systems, the dimension becomes unacceptably large and the computational complexity of the existing greedy algorithms upsurges.

IV-C Sparse Recovery Algorithms

Algorithm 1: SSE Algorithm
Input: Received signal 𝐘\mathbf{Y} in (25), combined training matrices at UE, IRS and BS, 𝐅\mathbf{F}, 𝐏\mathbf{P} and BS 𝐖\mathbf{W},
the codebook matrices 𝐀r\mathbf{A}_{\rm r} and 𝐀t\mathbf{A}_{\rm t}
Initialization: Πr={\Pi}_{r}=\varnothing, Πt={\Pi}_{t}=\varnothing, 𝐁r=(𝐅T𝐖H)𝐀r\mathbf{B}_{\rm r}=\left(\mathbf{F}^{\rm T}\otimes\mathbf{W}^{\rm H}\right)\mathbf{A}_{\rm r}, 𝐁t=𝐀t𝐏\mathbf{B}_{\rm t}=\mathbf{A}_{\rm t}\mathbf{P}
1. Stage 1: Estimate non-zero grid points in 𝐀r\mathbf{A}_{\rm r}
2.     𝐲sumr=i=1M(𝐘𝐁tH)(:,i)\mathbf{y}_{\rm sumr}=\sum_{i=1}^{M}\left(\mathbf{Y}\mathbf{B}_{\rm t}^{\rm H}\right)(:,i)
3.     𝐲=𝐲sumr\mathbf{y}=\mathbf{y}_{\rm sumr}, 𝐁=𝐁r\mathbf{B}=\mathbf{B}_{\rm r}, IKuKbNpUINpIBI\propto K_{u}K_{b}N_{p}^{\rm UI}N_{p}^{\rm IB}
4.     Use Algorithm 2 to obtain the estimated grid point Πr{\Pi}_{r}
5. Stage 2: Estimate non-zero grid points in 𝐀t\mathbf{A}_{\rm t}
6.     𝐲sumt=(i=1KNpUINpIB(𝐁rH𝐘)(Πr(i),:))T\mathbf{y}_{\rm sumt}=\left(\sum_{i=1}^{KN_{p}^{\rm UI}N_{p}^{\rm IB}}\left(\mathbf{B}_{\rm r}^{\rm H}\mathbf{Y}\right)({\Pi}_{r}(i),:)\right)^{\rm T}
7.     𝐲=𝐲sumt\mathbf{y}=\mathbf{y}_{\rm sumt}, 𝐁=𝐁t\mathbf{B}=\mathbf{B}_{\rm t}, IKmNpUINpIBI\propto K_{m}N_{p}^{\rm UI}N_{p}^{\rm IB}
8.     Use Algorithm 2 to obtain the estimated grid point Πt{\Pi}_{t}
9. Stage 3: Recover the channel matrix
10.     𝐀^r=𝐁r(:,Πr)\hat{\mathbf{A}}_{\rm r}=\mathbf{B}_{\rm r}(:,{\Pi}_{r}), 𝐀^t=𝐁t(:,Πt)\hat{\mathbf{A}}_{\rm t}=\mathbf{B}_{\rm t}(:,{\Pi}_{t})
11.     𝚲^(Πr,Πt)=𝐀^r𝐘(𝐀^t)H\hat{\boldsymbol{\Lambda}}({\Pi}_{r},{\Pi}_{t})=\hat{\mathbf{A}}_{\rm r}^{\dagger}\mathbf{Y}\left(\hat{\mathbf{A}}_{\rm t}^{\dagger}\right)^{\rm H}
Output: Estimated channel 𝐇^=𝐀r𝚲^(𝐀t)H\hat{\mathbf{H}}={\mathbf{A}}_{\rm r}\hat{\boldsymbol{\Lambda}}({\mathbf{A}}_{\rm t})^{\rm H}

IV-C1 SSE Algorithm

The SSE algorithm separately estimates the positions of the non-zero grids on each side of the multiplied channel 𝐇mul\mathbf{H}^{\rm mul} in (24). Specifically, since the non-zero grids on the left- and right-hand side codebook matrices 𝐀r\mathbf{A}_{\rm r} and 𝐀t\mathbf{A}_{\rm t} relate to the non-zero rows and columns of 𝚲\boldsymbol{\Lambda}, respectively, we consider to separately estimate them. The procedures of the SSE algorithm are summarized in Algorithm 1 and explained as follows.

At Stage 1, the non-zero grid points Πr\Pi_{\rm r} in 𝐀r\mathbf{A}_{\rm r} is estimated. Specifically, by adding the columns of 𝐘\mathbf{Y} in Step 2, 𝐲sumrNsbTuTb\mathbf{y}_{\rm sumr}\in\mathbb{C}^{N_{sb}T_{u}T_{b}} can be expressed as 𝐲sumr=|(𝐅T𝐖H)𝐀r𝐬sumr+𝐧sumr|NsbTuTb\mathbf{y}_{\rm sumr}=\left|\left(\mathbf{F}^{\rm T}\otimes\mathbf{W}^{\rm H}\right)\mathbf{A}_{\rm r}\mathbf{s}_{\rm sumr}+\mathbf{n}_{\rm sumr}\right|\in\mathbb{C}^{N_{sb}T_{u}T_{b}}, where 𝐬sumr=(i=1Ti(𝚲𝐀t𝐏𝐏H𝐀tH)(:,i))NuNb\mathbf{s}_{\rm sumr}=\left(\sum_{i=1}^{T_{i}}({\boldsymbol{\Lambda}}{\mathbf{A}}_{\rm t}\mathbf{P}\mathbf{P}^{\rm H}\mathbf{A}_{\rm t}^{\rm H})(:,i)\right)\in\mathbb{C}^{N_{u}N_{b}} denotes the equivalent transmit signal, and 𝐧sumr=(i=1Ti(𝐍𝐏H𝐀tH)(:,i))NsbTuTb\mathbf{n}_{\rm sumr}=\left(\sum_{i=1}^{T_{i}}\left(\mathbf{N}\mathbf{P}^{\rm H}\mathbf{A}_{\rm t}^{\rm H}\right)(:,i)\right)\in\mathbb{C}^{N_{sb}T_{u}T_{b}} refers to the equivalent noise. Due to the sparsity of 𝚲\boldsymbol{\Lambda}, 𝐬sumr\mathbf{s}_{\rm sumr} is a sparse vector, the non-zero positions in 𝐬sumr\mathbf{s}_{\rm sumr} relates to the non-zero rows of 𝚲\boldsymbol{\Lambda}. Therefore, the positions of non-zero rows of 𝚲\boldsymbol{\Lambda} can be determined by estimating the non-zero positions of 𝐬sumr\mathbf{s}_{\rm sumr}, which is completed in Step 4.

Similarly, at Stage 2, the non-zero grid points Πt\Pi_{\rm t} in 𝐀t\mathbf{A}_{\rm t} is estimated. Since the positions of the non-zero rows of 𝚲\boldsymbol{\Lambda} have been determined in the previous stage, using these rows of 𝐘\mathbf{Y} to compose 𝐲sumt\mathbf{y}_{\rm sumt} is enough in determining the non-zero columns of 𝚲\boldsymbol{\Lambda}, which is shown in Step 6. Moreover, Πt\Pi_{\rm t} is also determined by Algorithm 2 in Step 8. Followed by that, at Stage 3 of Algorithm 1, the estimated 𝐀r{\mathbf{A}}_{\rm r} and 𝐀t{\mathbf{A}}_{\rm t} is first obtained in Step 10. The sparse on-grid channel matrix is then estimated in Step 11. Based on these estimated matrices, the channel matrix is finally recovered as illustrated in Step 11, which completes Algorithm 1.

To estimate the positions of non-zero grids with received signal 𝐲\mathbf{y} and measurement matrix 𝐁\mathbf{B}, Algorithm 2 first calculates the correlation between 𝐁\mathbf{B} and the residual vector 𝐫\mathbf{r} in Step 2. The most correlative column index is expressed as nn, which is regarded as the newly founded grid index and added to the grid set Π\Pi. The estimated signal 𝐬^\hat{\mathbf{s}} on the grids specified by Π\Pi is calculated in Step 4 by using the LS algorithm. Then, the residual vector is updated in Step 5, by removing the effect of the non-zero grid points that have been estimated in the previous step. By repeating these procedures, TT indexes are selected as the estimated non-zero grid points.

Algorithm 2: Grid Position Estimation
Input: Received signal 𝐲\mathbf{y}, measurement matrix 𝐁\mathbf{B}, number of iterations II
Initialization: Π=\Pi=\varnothing, 𝐫=𝐲\mathbf{r}=\mathbf{y}, 𝐬^=𝟎size(𝐁,2)\hat{\mathbf{s}}=\mathbf{0}_{{\text{size}}(\mathbf{B},2)}
1. for i=1,,Ii=1,\dots,I
2.     n=argmax𝐁H𝐫22n=\mathrm{argmax}~{}\|\mathbf{B}^{\rm H}\mathbf{r}\|_{2}^{2}
3.     Π=Πnr{\Pi}={\Pi}\cup n_{r}
4.     𝐬^(Π)=𝐁(:,Π)𝐲\hat{\mathbf{s}}({\Pi})=\mathbf{B}^{\dagger}(:,{\Pi})\mathbf{y}
5.     𝐫=𝐲𝐁𝐬^\mathbf{r}=\mathbf{y}-\mathbf{B}\hat{\mathbf{s}}
6. end for
Output: Estimated grid position Π{\Pi}

IV-C2 DSE Algorithm

The computational complexity of the DSE algorithm majorly comes from the production in Step 2 of Algorithm 2 in Step 4 and Step 8 of Algorithm 1, which are around 𝒪(NsbTuTbNuNb))\mathcal{O}\left(N_{sb}T_{u}T_{b}N_{u}N_{b})\right) and 𝒪(TiM)\mathcal{O}\left(T_{i}M\right) in each iteration, respectively. These values become large with the increased number of antennas in the UM-MIMO and elements in the IRS. The DSE algorithm addresses this problem by exploiting the spatial correlation among subarrays. Specifically, in the HSPM channel (6), for the entire array on the left-hand side, the spatial angles from subarrays on the right-hand side are close. Therefore, if we separately consider the codebooks between each subarray on the right-hand side and the entire array on the left-hand side, the positions of the non-zero grids would be close. Inspired by this, the DSE algorithm first calculates the positions of the non-zero grids for the codebook between the first subarray on the right-hand side and the entire array on the left-hand side, which are saved as the benchmark grids. For the remaining subarrays at the right-hand side, the grid searching space is shrunk by limiting the potential grids in the neighbor of the benchmark grids for reduced complexity.

The grid shrinkage of the DSE algorithm operates at Stage 1 and Stage 2 of Algorithm 1, which are detailed in Algorithm 3. The input to the DSE algorithm is the summarized channel observation 𝐲sum\mathbf{y}_{\rm sum}, the sensing matrix 𝚽\boldsymbol{\Phi}, the codebook relating to the subarray at Tx, and the entire subarray at Rx 𝐀sub\mathbf{A}_{\rm sub}, number of iterations II and number of subarrays at right-hand side KK. At Stage 1 of Algorithm 1, these parameters are obtained as 𝐲sum=𝐲sumr\mathbf{y}_{\rm sum}=\mathbf{y}_{\rm sumr}, 𝚽=𝐅T𝐖H\boldsymbol{\Phi}=\mathbf{F}^{\rm T}\otimes\mathbf{W}^{\rm H}, 𝐀sub=𝐔u𝐀¯rIRSBS\mathbf{A}_{\rm sub}={\mathbf{U}}_{\rm u}^{*}\otimes\overline{\mathbf{A}}_{\rm rIRS-BS}, IKbNpUINpIBI\propto K_{b}N_{p}^{\rm UI}N_{p}^{\rm IB} and K=KuK=K_{u}, where 𝐔u\mathbf{U}_{\rm u} denotes the spatial DFT matrix for the subarray at UE. At Stage 2 of Algorithm 1, these parameters are calculated as 𝐲sum=𝐲sumt\mathbf{y}_{\rm sum}=\mathbf{y}_{\rm sumt}, 𝚽=𝐏T\boldsymbol{\Phi}=\mathbf{P}^{\rm T}, 𝐀sub=𝐔m𝐀¯rUEIRS\mathbf{A}_{\rm sub}={\mathbf{U}}_{\rm m}^{*}\otimes\overline{\mathbf{A}}_{\rm rUE-IRS}, IKmNpUINpIBI\propto K_{m}N_{p}^{\rm UI}N_{p}^{\rm IB} and K=KmK=K_{m}, where 𝐔m{\mathbf{U}}_{\rm m} refers to the spatial DFT matrix for the subarray at IRS.

Algorithm 3: DSE Algorithm for Grid Position Estimation
Input: Received signal 𝐲sum\mathbf{y}_{\rm sum}, sensing matrix 𝚽\boldsymbol{\Phi}, codebook matrix for the subarray 𝐀sub\mathbf{A}_{\rm sub}
number of iterations II, number of subarrays KK
Initialization: Π={\Pi}=\varnothing, Πr,k={\Pi}_{r,k}=\varnothing, Na=size(𝐀sub,1)N_{a}=\textrm{size}(\mathbf{A}_{\rm sub},1)
1. for k=1:Kk=1:K
2.     𝐐=𝚽(:,(k1)Na+1:kNa)\mathbf{Q}=\boldsymbol{\Phi}(:,(k-1)*N_{a}+1:kN_{a})
3.     𝐁=𝐐𝐀sub\mathbf{B}=\mathbf{Q}\mathbf{A}_{\rm sub}
4.     if k>1k>1
5.         𝐁=𝐐𝐀sub(:,Π~)\mathbf{B}=\mathbf{Q}\mathbf{A}_{\rm sub}(:,\tilde{\Pi})
6.     end if
7.     Use Algorithm 2 to obtain the estimated grid point Πk{\Pi}_{k}
8.     if k=1k=1
9.        Construct Π~\tilde{\Pi} by selecting the neighboring qq grids for each point in Π1\Pi_{1}
10.    end if
11.    Transform positions in Πk{\Pi}_{k} to positions in Π{\Pi}, Π=ΠΠk{\Pi}={\Pi}\cup{\Pi}_{k}
12. end for
Output: Estimated grid position Π{\Pi}

For the kthk^{\rm th} subarray on right-hand side, the DSE algorithm first obtains the sensing matrix 𝐐\mathbf{Q} and the corresponding measurement matrix 𝐁\mathbf{B}, which are illustrated in Step 2 and Step 3 of Algorithm 3, respectively. For the first subarray, the non-zero grids relating to 𝐀sub\mathbf{A}_{\rm sub} are directly estimated and recorded in Π1\Pi_{1} as the benchmark grids. The neighboring qq elements for each grid in Π1\Pi_{1} are then selected as the potential searching grids for the remaining subarrays, which are saved as Π~\tilde{\Pi}, as shown in Step 7 to 10 in Algorithm 3. For the remaining subarray pairs, only the grids in Π~\tilde{\Pi} will be searched, as illustrated in Step 4 to 6. Finally, in Step 11, the determined grid positions for subarrays are transformed to positions for the entire array and saved in Π\Pi.

IV-C3 Computational Complexity

For the SSE algorithm, the total computational complexity of the SSE algorithm can be approximated as 𝒪(I(NsbTuTbNuNb+TiM))\mathcal{O}\left(I(N_{sb}T_{u}T_{b}N_{u}N_{b}+T_{i}M)\right). The computational complexity of the DSE algorithm also mainly comes from Step 4 and Step 8 of Algorithm 1, the total computational complexity of the SSE algorithm can be approximated as 𝒪(I(NsbTuTbNuNbKu+TiMKm))\mathcal{O}\left(I\left(\frac{N_{sb}T_{u}T_{b}N_{u}N_{b}}{K_{u}}+\frac{T_{i}M}{K_{m}}\right)\right).

V Performance Evaluation

In this section, we first numerically assess the system capacities by deploying different channel models for the THz integrated UM-MIMO and IRS systems. Then, the performance of the proposed SSE and DSE CE algorithms is extensively evaluated.

V-A Simulation Setup

The simulation parameters and important notations used in this paper are summarized in TABLE I. We employ the system in Fig. 1, where the complex gain of the THz channel is generated based on the channel model in [15]. To evaluate the capacity, the IRS beamforming matrix 𝐏¯\overline{\mathbf{P}} in (1) is randomly generated, while the phase of each element of 𝐏¯\overline{\mathbf{P}} follows a uniform distribution over [0,2π][0,2\pi]. In the CE process, we adopt the HSPM channel model in (6) for both segmented channels 𝐇UEIRS\mathbf{H}_{\rm UE-IRS} and 𝐇IRSBS\mathbf{H}_{\rm IRS-BS}. The spatial angles of both azimuth and elevation in the HSPM are randomly generated, following uniform distributions over [0,π][0,\pi]. The training process from (19) to (22) are deployed. Specifically, the phase of each element of 𝐖b\mathbf{W}_{b}, 𝐩i\mathbf{p}_{i} and 𝐅~u\tilde{\mathbf{F}}_{u} are randomly generated, following uniform distribution over [0,2π][0,2\pi]. The training time slot for BS, UE and IRS TbT_{b}, TuT_{u} and TiT_{i} satisfy TbNbKbT_{b}\leq\frac{N_{b}}{K_{b}}, TuNuKuT_{u}\leq\frac{N_{u}}{K_{u}} and TiMT_{i}\leq M during our evaluation, to guarantee reduced training overhead. The estimation accuracy is evaluated by the normalized-mean-square-error (NMSE), which is defined as

NMSE=𝔼{𝐇^𝐇mul22}𝔼{𝐇mul22},{\rm NMSE}={\frac{\mathbb{E}\left\{\left\|\hat{\mathbf{H}}-\mathbf{H}^{\rm mul}\right\|_{2}^{2}\right\}}{\mathbb{E}\left\{{\left\|\mathbf{H}^{\rm mul}\right\|_{2}^{2}}\right\}}}, (27)

where 𝐇^\hat{\mathbf{H}} denotes the estimated channel. All the results are obtained by averaging 5000 trials of Monte Carlo simulations.

TABLE I: Simulation parameters and notations.
Notation Meaning Value in simulation
ff Carrier frequency 0.3 THz
BB Bandwidth 5 GHz
λ\lambda Carrier wavelength
Kb,Km,KuK_{b},K_{m},K_{u} Number of subarrays at the BS, IRS, and UE Selected in 1,4
Nab,NamN_{ab},N_{am} Number of antennas on a subarray at the BS and IRS Selected in 64, 256
NauN_{au} Number of antennas on a subarray at the UE 16
Nb,M,NuN_{b},M,N_{u} Number of antennas at the BS, IRS and UE
NpUI,NpIBN_{p}^{\rm UI},N_{p}^{\rm IB} Number of paths in 𝐇UEIRS\mathbf{H}_{\rm UE-IRS} and 𝐇IRSBS\mathbf{H}_{\rm IRS-BS} NpN_{p}
qq Number of neighboring grids in the DSE algorithm 5
Tu,Tb,TiT_{u},T_{b},T_{i} Training time slots of UE, BS and IRS
Nr,NtN_{r},N_{t} Number of antennas at Rx (IRS or BS) and Tx (UE or IRS)
Kr,KtK_{r},K_{t} Number of subarrays at Rx (IRS or BS) and Tx (UE or IRS)
𝐀r,𝐀t\mathbf{A}_{\rm r},{\mathbf{A}}_{\rm t} Codebook matrices at left and right side of 𝐇mul\mathbf{H}^{\rm mul}, respectively
𝐅~u,𝐖b,𝐩i\tilde{\mathbf{F}}_{u},\mathbf{W}_{b},\mathbf{p}_{i} Training beamforming, combining and IRS reflection matrices
𝐅,𝐖,𝐏{\mathbf{F}},\mathbf{W},\mathbf{P} Combined training beamforming, combining, IRS reflection matrices
𝐅¯,𝐖¯,𝐏¯\overline{\mathbf{F}},\overline{\mathbf{W}},\overline{\mathbf{P}} UE beamforming, IRS beamforming and BS combining matrices
𝐇UEIRS,𝐇IRSBS\mathbf{H}_{\rm UE-IRS},\mathbf{H}_{\rm IRS-BS} Segmented channels form UE to IRS and IRS to BS, respectively
𝐇P,𝐇S,𝐇HSPM\mathbf{H}_{\rm P},\mathbf{H}_{\rm S},\mathbf{H}_{\rm HSPM} PWM, SWM and HSPM channel matrices
𝐇PWMcas,𝐇SWMcas,𝐇HSPMcas\mathbf{H}^{\rm cas}_{\rm PWM},\mathbf{H}^{\rm cas}_{\rm SWM},\mathbf{H}^{\rm cas}_{\rm HSPM} Cascaded channels based on the PWM, SWM and HSPM
𝐇mul\mathbf{H}^{\rm mul} The multiplied channel matrix to be estimated in (22)
𝐘\mathbf{Y} Observation matrix used for CE after training in (22)
𝚲{\boldsymbol{\Lambda}} Sparse on-grid channel of 𝐇mul\mathbf{H}^{\rm mul} based on 𝐀r\mathbf{A}_{\rm r} and 𝐀t{\mathbf{A}}_{\rm t}

V-B System Capacity based on Different Channel Models

We begin by evaluating the system capacities by using PWM, SWM, and HSPM for the segmented channels in different communication distances and subarray spacing. To facilitate evaluation, we consider that both cascaded channels 𝐇UEIRS\mathbf{H}_{\rm UE-IRS} and 𝐇IRSBS\mathbf{H}_{\rm IRS-BS} have only one LoS path, i.e. Np=1N_{p}=1, which is simplified yet practical for the THz communication systems due to the LoS domination property [37]. In this case, the 𝐇PWMcas\mathbf{H}^{\rm{cas}}_{\rm PWM} has rank 1 with no spatial multiplexing capability. Moreover, the transmit power at the BS is fixed as 20 dBm.

The capacity results over different communication distances from BS to IRS and IRS to UE are illustrated in Fig. 4. It is observed that the capacity of 𝐇HSPMcas\mathbf{H}^{\rm cas}_{\rm HSPM} is very close to 𝐇SWMcas\mathbf{H}^{\rm cas}_{\rm SWM}, which is much higher than that based on 𝐇PWMcas\mathbf{H}^{\rm cas}_{\rm PWM}. In particular, as shown in Fig. 4(a), when the communication distance is 40 m, the capacity of 𝐇HSPMcas\mathbf{H}^{\rm cas}_{\rm HSPM} is only 5×1045\times 10^{-4} bits/s/Hz lower than that of 𝐇SWMcas\mathbf{H}^{\rm cas}_{\rm SWM}. The capacities of 𝐇HSPMcas\mathbf{H}^{\rm cas}_{\rm HSPM} and 𝐇SWMcas\mathbf{H}^{\rm cas}_{\rm SWM} are 37.037.0 bits/s/Hz higher than the capacity of 𝐇PWMcas\mathbf{H}^{\rm cas}_{\rm PWM}. This is explained that in near-field transmission, the PWM loses its effectiveness to characterize the channel. By contrast, the capacities based on PWM, HSPM and SWM converge when the communication distance far exceeds the Rayleigh distance, which equals to 46.3 m in this case. Particularly, when the communication distance is 160 m that is over three times of the Rayleigh distance, the capacities based on different channel models finally approach to be close, where the difference between the PWM and the other channels reduces to 2.2 bits/s/Hz. Therefore, the Rayleigh distance overestimates the accuracy of the PWM approximation from the SWM. Equivalently, the misuse of PWM could cause severe deterioration of capacity even when the communication distance is equal to or larger than the Rayleigh distance, i.e., the so called far-field region. As a take-away lesson from our analysis, the HSPM is effective and generally applicable when the communication distance is smaller, comparable or even larger than the Rayleigh distance, i.e., ranging from near-field to far-field.

Refer to caption
(a) Nu=64,M=Nb=256,Ku=Km=Kb=4N_{u}=64,M=N_{b}=256,K_{u}=K_{m}=K_{b}=4.
Refer to caption
(b) Nu=64,M=Nb=1024,Ku=Km=Kb=4N_{u}=64,M=N_{b}=1024,K_{u}=K_{m}=K_{b}=4.
Figure 4: Channel capacity with various communication distance, the subarray spacing is fixed as 64λ64\lambda.
Refer to caption
(a) Nu=64,M=Nb=256,Ku=Km=Kb=4N_{u}=64,M=N_{b}=256,K_{u}=K_{m}=K_{b}=4
Refer to caption
(b) Nu=64,M=Nb=1024,Ku=Km=Kb=4N_{u}=64,M=N_{b}=1024,K_{u}=K_{m}=K_{b}=4.
Figure 5: Channel capacity with various subarray spacing, the communication distance is fixed as 4040 m.

As illustrated in Fig. 5, the effect of subarray spacing on channel capacity is evaluated with varying numbers of elements in the UM-MIMO and IRS. The trends of the curves in Fig. 5(a) and Fig. 5(b) are identical, due to the similar array size, which is mainly dependent on the subarray spacing. Specifically, in the considered system, the channel capacity is majorly influenced by the condition number, i.e., the difference between the minimax eigenvalues for the channel. As studied in [38], with fixed communication distance, the eigenvalue is a function of the array size. Moreover, when the subarray spacing is smaller than a threshold, e.g., 144λ144\lambda in both figures, the channel capacity rises monotonically with larger subarray spacing. In particular, as illustrated in Fig. 5(a), the capacity increases from 42.0 bits/s/Hz to 103.1 bits/s/Hz for the HSPM and SWM, as the subarray spacing increases from 16λ16\lambda to 144λ144\lambda. This is explained that the enlarged subarray spacing expands the near-field region and provides a better condition number to the channel, which contributes to the spatial multiplexing gain [13]. By contrast, the capacity based on the PWM remains around 39.9 bits/s/Hz. In addition, as the subarray distance further increases beyond 144λ144\lambda, the capacity begins fluctuating, due to the variation of the eigenvalues of the channel matrix [38]. In this study, we consider the reasonable widely-spaced subarrays, e.g., the subarray spacing is smaller than 144λ=0.144144\lambda=0.144 m. Therefore, the spatial multiplexing of the THz integrated UM-MIMO and IRS systems can be improved based on the widely-spaced architecture design.

V-C Performance of SSE and DSE Channel Estimation

To demonstrate the effectiveness of the proposed subarray-based codebook, we first compare the NMSE performance of the proposed SSE and DSE algorithms with two classical on-grid CS-based algorithms in different systems by deploying different channel models, including the OMP method as in [33] and the CoSaMP [36], both of which deploy the traditional DFT codebook. In addition, we fix the number of paths as Np=2N_{p}=2 for each channel segment. As illustrated in Fig. 6(a), the estimation NMSE against the SNR under the HSPM channel is evaluated. The proposed SSE and DSE methods based on the proposed codebook perform better than the traditional methods based on the DFT codebook. This observation validates the accuracy and effectiveness of the proposed subarray-based codebook in the considered system. Moreover, at higher SNR values, i.e., SNR>>0 dB, the SSE algorithm performs the best and obtains the highest estimation accuracy. Specifically, at SNR = 6 dB, the estimation NMSE of the SSE is 1 dB, 0.6 dB and 0.4 dB lower than the OMP, CoSaMP and DSE counterparts, respectively.

By contrast, at low SNR values, we can observe that the performance of the low-complexity DSE algorithm exceeds that of the SSE algorithm. For instance, the estimation NMSE of the DSE is around 0.8 dB lower than that of the SSE at -10 dB SNR. This gap decreases with the increment of SNR. The NMSE of SSE becomes lower than that of DSE as the SNR exceeds 0 dB. This is explained that, the potential grids error in the DSE algorithm can be avoided by the determination of potential searching grids based on the benchmark grids, especially in noisy conditions. However, since the best grids for the entire array cannot be completely mapped to the first subarray, the performance of the DSE becomes worse than the SSE as the SNR increases. To this end, we can state that the DSE algorithm is more attractive in the low SNR region, i.e., SNR<<0 dB. Furthermore, by considering the same system configuration as in Fig. 6(a), the estimation NMSE of different algorithms by deploying the ground-truth SWM is evaluated in Fig. 6(b). The result is consistent with that in Fig. 6(a), which further reinforces the effectiveness of the proposed HSPM. Specifically, the estimation accuracy of the SSE outperforms the other algorithms when at higher SNR larger than -5 dB, while the DSE algorithm achieves the lowest NMSE among the evaluated algorithms when SNR<<-5 dB.

Refer to caption
(a) WSMS system using HSPM. Nu=16,M=Nb=1024,Ku=1,Km=Kb=4,Tu=12,Ti=768,Tb=192N_{u}=16,M=N_{b}=1024,K_{u}=1,K_{m}=K_{b}=4,T_{u}=12,T_{i}=768,T_{b}=192
Refer to caption
(b) WSMS system using SWM, Nu=16,M=Nb=1024,Ku=1,Km=Kb=4,Tu=12,Ti=192,Tb=192N_{u}=16,M=N_{b}=1024,K_{u}=1,K_{m}=K_{b}=4,T_{u}=12,T_{i}=192,T_{b}=192
Refer to caption
(c) Compact array systems using PWM, Nu=16,M=Nb=256,Ku=Km=Kb=1,Tu=12,Ti=192,Tb=192N_{u}=16,M=N_{b}=256,K_{u}=K_{m}=K_{b}=1,T_{u}=12,T_{i}=192,T_{b}=192
Figure 6: NMSE comparison of different CE algorithms in different systems using different channel models.

To study the performance of the proposed SSE and DSE algorithms even in the traditional compact array systems without enlarging the subarray spacing, we evaluate their performances in Fig. 6(c) in contrast to the OMP and CoSaMP algorithms. We observe that the estimation NMSE of the OMP, SSE, and DSE algorithms are close. This is explained that in the traditional compact array systems, the number of subarrays at the BS, UE, and IRS is equal to 1. Therefore, the subarray-based codebook degenerates into the DFT codebook, and the operations in the DSE and SSE algorithms become the same. This result further demonstrates the effectiveness of the subarray-based codebook in the WSMS systems. Instead of being restricted by the performance of the sparse recovery algorithms, the performances of the OMP and CoSaMP methods are limited by the accuracy of the DFT codebook. Although very close, the NMSE of the CoSaMP slightly outperforms the remaining algorithms as the SNR exceeds 0 dB, with 0.02 dB higher NMSE at SNR=10 dB. This is owing to the benefits of the grid selection mechanism in the CoSaMP [36].

Refer to caption
(a) NMSE against TiT_{i}, Nu=16,M=Nb=1024,Ku=1,Km=Kb=4,Tu=16,Tb=192N_{u}=16,M=N_{b}=1024,K_{u}=1,K_{m}=K_{b}=4,T_{u}=16,T_{b}=192
Refer to caption
(b) NMSE against TbT_{b}, Nu=16,M=Nb=1024,Ku=1,Kb=Km=4,Tu=16,Tb=192N_{u}=16,M=N_{b}=1024,K_{u}=1,K_{b}=K_{m}=4,T_{u}=16,T_{b}=192.
Figure 7: Estimation NMSE against the number of training slots.

In Fig. 7, the NMSE performances of the DSE and SSE schemes versus the length of training slot in different SNRs are analyzed. The NMSE decreases with the increased length of training slots allocated to both the IRS and BS. In particular, as illustrated in Fig. 7(a), pertaining the SSE algorithm, when SNR is 10 dB, the NMSE drops sharply first, in which the decrement is around 4.5 dB as TiT_{i} increases from 256 to 640. However, as TiT_{i} further increases from 640 to 1024, the decline tends smooth, i.e., NMSE decreases by 0.65 dB. This is due to the fact that the NMSE results are determined by the dimensions of the channel and channel observation. The latter one is enlarged with the increment of the pilot length. Thus, the channel is more accurately estimated with a longer training slots, especially when the dimensions of the observation and the channel are comparable. In addition, as shown in Fig. 7(a) and Fig. 7(b), to obtain a good CE performance, time slots of length 600 and 100 are enough for the IRS and BS training in the considered configuration, respectively, after which the NMSE degradation is very limited by adding the length of the training slots. These values take around only half of the required training time slots for the traditional least-square (LS) and minimum-mean-square (MMSE) CE methods [32], in which 10241024 and 256256 slots are required for the IRS and BS training, respectively. These numbers are obtained by MM and NbKb\frac{N_{b}}{K_{b}}, respectively. Therefore, the proposed CE framework can estimate the channel with reduced training overhead.

VI Conclusion

As a promising technology for THz communications, the integrated UM-MIMO and IRS systems can effectively solve the LoS blockage problem in complex occlusion environments. Three challenges arise. First, the huge dimensional antenna array in UM-MIMO and IRS in contrast with the sub-millimeter wavelength enlarges the near-field region of propagation. Second, the spatial multiplexing and capacity are strongly limited by the sparsity of the THz channel. Third, the adoption of hybrid beamforming systems in UM-MIMO results in limited RF-chains, with the lack of signal processing capability of the IRS, CE has to recover the high-dimensional channel from severely compressed observations.

In this work, we have proposed the HSPM to accurately model the cascaded channel of the THz integrated UM-MIMO and IRS system. We have analyzed the spatial multiplexing under near-field and far-field cases and proved that the spatial multiplexing of the cascaded channel is limited by the segmented channel with a lower rank. Additionally, we have developed a subarray-based on-grid codebook and the SSE and DSE algorithms with low complexity to address the CE problem. Extensive simulations are conducted, and results demonstrate the accuracy of the HSPM channel. The capacity based on HSPM is only 5×1045\times 10^{-4} bits/s/Hz lower than that based on the ground-truth SWM with an array size of 256256. Moreover, the spatial multiplexing gain is improved based on the widely-spaced architecture design. Based on the proposed codebook, the SSE and DSE algorithms achieve better estimation accuracy than traditional algorithms. While the SSE possesses the highest accuracy at higher SNR over 0 dB, the DSE is more attractive at low SNR, whose estimation NMSE is 0.8 dB lower than the SSE when SNR=-10 dB.

References

  • [1] Y. Chen, R. Li, C. Han, and M. Tao, “Hybrid Spherical- and Planar-Wave Channel Modeling and Spatial Multiplexing Analysis for Terahertz Integrated UM-MIMO and IRS Systems,” in Proc. of IEEE Intern. Conf. Commun., Seoul, South Korea, May 2022, pp. 1–6.
  • [2] I. F. Akyildiz, C. Han, Z. Hu, S. Nie, and J. M. Jornet, “Terahertz Band Communication: An Old Problem Revisited and Research Directions for the Next Decade (Invited Paper),” IEEE Trans. Commun., to appear 2022.
  • [3] T. S. Rappaport, Y. Xing, O. Kanhere, S. Ju, A. Madanayake, S. Mandal, A. Alkhateeb, and G. C. Trichopoulos, “Wireless Communications and Applications Above 100 GHz: Opportunities and Challenges for 6G and Beyond,” IEEE Access, vol. 7, pp. 78 729–78 757, June 2019.
  • [4] Z. Chen, C. Han, Y. Wu, L. Li, C. Huang, Z. Zhang, G. Wang, and W. Tong, “Terahertz Wireless Communications for 2030 and Beyond: A Cutting-Edge Frontier,” IEEE Commun. Mag,, vol. 59, no. 11, pp. 66–72, Nov. 2021.
  • [5] Z. Chen, B. Ning, C. Han, and S. Li, “Intelligent Reflecting Surfaces Assisted Terahertz Communications toward 6G,” IEEE Wireless Commun., vol. 28, no. 6, pp. 110–117, Dec. 2021.
  • [6] I. F. Akyildiz, C. Han, and S. Nie, “Combating the Distance Problem in the Millimeter Wave and Terahertz Frequency Bands,” IEEE Commun. Mag., vol. 56, no. 6, pp. 102–108, Jun. 2018.
  • [7] I. F. Akyildiz and J. M. Jornet, “Realizing Ultra-Massive MIMO (1024×\times1024) Communication in the (0.06-10) Terahertz Band,” Nano. Commun. Netw., vol. 8, pp. 46–54, Jun. 2016.
  • [8] Q. Wu, S. Zhang, B. Zheng, C. You, and R. Zhang, “Intelligent Reflecting Surface-Aided Wireless Communications: A Tutorial,” IEEE Trans. Commun., vol. 69, no. 5, pp. 3313–3351, May 2021.
  • [9] M. Di Renzo, A. Zappone, M. Debbah, M.-S. Alouini, C. Yuen, J. de Rosny, and S. Tretyakov, “Smart Radio Environments Empowered by Reconfigurable Intelligent Surfaces: How It Works, State of Research, and The Road Ahead,” IEEE J. Sel. Areas Commun., vol. 38, no. 11, pp. 2450–2525, Nov. 2020.
  • [10] R. Li, B. Guo, M. Tao, Y.-F. Liu, and W. Yu, “Joint Design of Hybrid Beamforming and Reflection Coefficients in RIS-Aided mmWave MIMO Systems,” IEEE Trans. Commun., vol. 70, no. 4, pp. 2404–2416, April 2022.
  • [11] E. Björnson, Ö. Özdogan, and E. G. Larsson, “Reconfigurable Intelligent Surfaces: Three Myths and Two Critical Questions,” IEEE Commun. Mag., vol. 58, no. 12, pp. 90–96, Dec. 2020.
  • [12] E. Bjo¨\ddot{\text{o}}rnson and L. Sanguinetti, “Rayleigh Fading Modeling and Channel Hardening for Reconfigurable Intelligent Surfaces,” IEEE Wireless Commun. Lett., vol. 10, no. 4, pp. 830–834, April 2021.
  • [13] Y. Chen, L. Yan, and C. Han, “Hybrid Spherical-and Planar-Wave Modeling and DCNN-powered Estimation of Terahertz Ultra-massive MIMO Channels,” IEEE Trans. Commun., vol. 69, no. 10, pp. 7063–7076, Oct. 2021.
  • [14] M. Cui, Z. Wu, Y. Lu, X. Wei, and L. Dai, “Near-Field Communications for 6G: Fundamentals, Challenges, Potentials, and Future Directions,” arXiv preprint: 2203.16318, 2022.
  • [15] C. Han, A. O. Bicen, and I. F. Akyildiz, “Multi-Ray Channel Modeling and Wideband Characterization for Wireless Communications in the Terahertz Band,” IEEE Trans. Wireless Commun., vol. 14, no. 5, pp. 2402–2412, May 2015.
  • [16] C. Han, L. Yan, and J. Yuan, “Hybrid Beamforming for Terahertz Wireless Communications: Challenges, Architectures, and Open Problems,” IEEE Wireless Commun., vol. 28, no. 4, pp. 198–204, Aug. 2021.
  • [17] P. Zhang, J. Chen, X. Yang, N. Ma, and Z. Zhang, “Recent Research on Massive MIMO Propagation Channels: A Survey,” IEEE Commun. Mag., vol. 56, no. 12, pp. 22–29, Dec. 2018.
  • [18] F. Bohagen, P. Orten, and G. E. Oien, “On Spherical vs. Plane Wave Modeling of Line-of-sight MIMO Channels,” IEEE Trans. Commun., vol. 57, no. 3, pp. 841–849, Mar. 2009.
  • [19] K. Dovelos, S. D. Assimonis, H. Quoc Ngo, B. Bellalta, and M. Matthaiou, “Intelligent Reflecting Surfaces at Terahertz Bands: Channel Modeling and Analysis,” in Proc. of Intern. Conf. Commun. Workshops (ICC Workshops), Montreal, Canada, 2021, pp. 1–6.
  • [20] S. Zhang and R. Zhang, “Capacity Characterization for Intelligent Reflecting Surface Aided MIMO Communication,” IEEE J. Sel. Areas Commun., vol. 38, no. 8, pp. 1823–1838, Aug. 2020.
  • [21] M. Jung, W. Saad, Y. Jang, G. Kong, and S. Choi, “Reliability Analysis of Large Intelligent Surfaces (LISs): Rate Distribution and Outage Probability,” IEEE Wireless Commun. Lett., vol. 8, no. 6, pp. 1662–1666, Dec. 2019.
  • [22] E. Bjo¨\ddot{\text{o}}rnson and L. Sanguinetti, “Power Scaling Laws and Near-Field Behaviors of Massive MIMO and Intelligent Reflecting Surfaces,” IEEE Open J. Commun. Society, vol. 1, pp. 1306–1324, Sept. 2020.
  • [23] R. Li, S. Sun, Y. Chen, C. Han, and M. Tao, “RIS-assisted Millimeter-Wave MIMO Communication Systems: Ergodic Capacity Analysis and Optimization,” arXiv preprint:2202.06564, 2022.
  • [24] L. Yan, Y. Chen, C. Han, and J. Yuan, “Joint Inter-path and Intra-path Multiplexing for Terahertz Widely-spaced Multi-subarray Hybrid Beamforming Systems,” IEEE Trans. Commun., vol. 70, no. 2, pp. 1407–1422, Feb. 2022.
  • [25] S. Liu, Z. Gao, J. Zhang, M. D. Renzo, and M.-S. Alouini, “Deep Denoising Neural Network Assisted Compressive Channel Estimation for mmWave Intelligent Reflecting Surfaces,” IEEE Trans. Veh. Tech., vol. 69, no. 8, pp. 9223–9228, Aug. 2020.
  • [26] A. Taha, M. Alrabeiah, and A. Alkhateeb, “Enabling Large Intelligent Surfaces With Compressive Sensing and Deep Learning,” IEEE Access, vol. 9, pp. 44 304–44 321, March 2021.
  • [27] C. Hu, L. Dai, S. Han, and X. Wang, “Two-Timescale Channel Estimation for Reconfigurable Intelligent Surface Aided Wireless Communications,” IEEE Trans. Commun., vol. 69, no. 11, pp. 7736–7747, Nov. 2021.
  • [28] H. Liu, X. Yuan, and Y.-J. A. Zhang, “Matrix-Calibration-Based Cascaded Channel Estimation for Reconfigurable Intelligent Surface Assisted Multiuser MIMO,” IEEE J. Sel. Areas Commun,, vol. 38, no. 11, pp. 2621–2636, Nov. 2020.
  • [29] J. He, H. Wymeersch, and M. Juntti, “Channel Estimation for RIS-Aided mmWave MIMO Systems via Atomic Norm Minimization,” IEEE Trans. Wireless Commun., vol. 20, no. 9, pp. 5786–5797, Sept. 2021.
  • [30] Z. Wang, L. Liu, and S. Cui, “Channel Estimation for Intelligent Reflecting Surface Assisted Multiuser Communications: Framework, Algorithms, and Analysis,” IEEE Trans. Wireless Commun., vol. 19, no. 10, pp. 6607–6620, Oct. 2020.
  • [31] Z. Wan, Z. Gao, F. Gao, M. D. Renzo, and M.-S. Alouini, “Terahertz Massive MIMO With Holographic Reconfigurable Intelligent Surfaces,” IEEE Trans. Commun., vol. 69, no. 7, pp. 4732–4750, Jul. 2021.
  • [32] K. Ardah, S. Gherekhloo, A. L. F. de Almeida, and M. Haardt, “TRICE: A Channel Estimation Framework for RIS-Aided Millimeter-Wave MIMO Systems,” IEEE Signal Process. Lett., vol. 28, pp. 513–517, Feb. 2021.
  • [33] P. Wang, J. Fang, H. Duan, and H. Li, “Compressed Channel Estimation for Intelligent Reflecting Surface-Assisted Millimeter Wave Systems,” IEEE Signal Process. Lett., vol. 27, pp. 905–909, May 2020.
  • [34] X. Ma, Z. Chen, W. Chen, Z. Li, Y. Chi, C. Han, and S. Li, “Joint Channel Estimation and Data Rate Maximization for Intelligent Reflecting Surface Assisted Terahertz MIMO Communication Systems,” IEEE Access, vol. 8, pp. 99 565–99 581, May 2020.
  • [35] A. Alkhateeb, O. El Ayach, G. Leus, and R. W. Heath, “Channel Estimation and Hybrid Precoding for Millimeter Wave Cellular Systems,” IEEE J. Sel. Topics Signal Process., vol. 8, no. 5, pp. 831–846, Oct. 2014.
  • [36] M. F. Duarte and Y. C. Eldar, “Structured Compressed Sensing: From Theory to Applications,” IEEE Trans. Signal Process., vol. 59, no. 9, pp. 4053–4085, Sept. 2011.
  • [37] H. Do, S. Cho, J. Park, H.-J. Song, N. Lee, and A. Lozano, “Terahertz Line-of-sight MIMO Communication: Theory and Practical Challenges,” IEEE Commun. Mag., vol. 59, no. 3, pp. 104–109, Mar. 2021.
  • [38] F. Bohagen, P. Orten, and G. E. Oien, “Design of Optimal High-Rank Line-of-Sight MIMO Channels,” IEEE Trans. Wireless Commun., vol. 6, no. 4, pp. 1420–1425, April 2007.