This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Joint Beam Training and Positioning for Intelligent Reflecting Surfaces Assisted Millimeter Wave Communications

Wei Wang, , and Wei Zhang W. Wang and W. Zhang are with the School of Electrical Engineering and Telecommunications, The University of New South Wales, Sydney, Australia (e-mail: wei.wang@unsw.edu.au; wzhang@ee.unsw.edu.au).
Abstract

Intelligent reflecting surface (IRS) offers a cost-effective solution to link blockage problem in mmWave communications, and the prerequisite of which is the accurate estimation of (1) the optimal beams for base station/access point (BS/AP) and mobile terminal (MT), (2) the optimal reflection patterns for IRSs, and (3) link blockage. In this paper, we carry out beam training designs for IRSs assisted mmWave communications to estimate the aforementioned parameters. To acquire the optimal beams and reflection patterns, we firstly perform random beamforming and maximum likelihood estimation to estimate angle of arrival (AoA) and angle of departure (AoD) of the line of sight (LoS) path between BS/AP (or IRSs) and MT. Then, with the estimated AoDs, we propose an iterative positioning algorithm that achieves centimeter-level positioning accuracy. The obtained location information is not only a fringe benefit but also enables us to cross verify and enhance the estimation of AoA and AoD, and it also facilitates the estimation of blockage indicator. Numerical results show the superiority of our proposed beam training scheme and verify the performance gain brought by location information.

I Introduction

Millimeter-wave (mmWave) band, ranging from 30GHz to 300GHz, has attracted great interests from both academia and industry for its abundant spectrum resources [1, 2]. The Wi-Fi standard IEEE 802.11ad runs on the 60GHz (V band) spectrum with data transfer rates of up to 7 Gbit/s [3, 4]. In 3GPP Release 15, 24.25-29.5GHz and 37-43.5GHz, as the most promising frequencies for the early deployment of 5G millimeter wave systems, are specified based on a time-division duplexing (TDD) access scheme [5]. The millimeter scale wavelength, on one hand, renders massive antennas integratable on an antenna array with portable size [6], and, on the other hand, results in severe free space path loss especially for non-line-of-sight (NLoS) paths. Directional transmission enabled by beamforming techniques is an energy efficient transmission solution to compensate for the path loss in mmWave communications [7]. By properly adjusting the phase shifts of each antenna elements, it concentrates the emitted energy in a narrow beam between transmitter and receiver. However, the directional link is easily blocked by obstacles like human bodies, walls, and furniture, attributed to the millimeter scale wavelength [8]. Once LoS path is blocked, it is highly possible that the blocked link cannot be restored no matter how the beam direction is adjusted, as the NLoS paths are not strong enough to serve as a qualified alternative link. Channel measurement campaigns reveal that power of the LoS component is about 13dB higher than the sum of power of NLoS components [9]. Therefore, blockage is the biggest hindrance to the large scale applications of mmWave band in mobile communication systems.

Recently, intelligent reflecting surface (IRS) [10, 11, 12, 13], a.k.a. reconfigurable intelligent surface (RIS) [14, 15], large intelligent surface (LIS) [16], passive (intelligent) reflectors/mirrors [17, 18, 19], or programmable metasurface [20, 21, 22], is proposed as an energy-effective and cost-effective hardware structure for future wireless communications. IRS is essentially a new type of electromagnetic surface structure which is typically designed by deliberately arranging a set of sophisticated passive scatterers or apertures in a regular array to achieve the desired ability for guiding and controlling the flow of electromagnetic waves [23]. Current applications of IRS to wireless communications can be categorized into two types, namely IRS modulator and IRS “relay”. In [20, 21, 22], amplitude/phase modulations over IRS are investigated. Through controlling the reflection coefficient of IRS, the incident carrier wave from a feed antenna can be digitally modulated without requiring high-performance radio frequency (RF) chains. A more extensive application of IRS is IRS “relay”, in which the radiated power from BS/AP towards IRS is reflected to MT via intelligently managing the phase shifters on IRS [14, 13, 15, 10, 11, 12, 16, 19, 17, 18]. It is noteworthy that the rationale behind IRS “relay” and conventional amplify-and-forward (AF) relay is significantly different. AF relay firstly receives signal and then re-generates and re-transmits signal. In contrast, IRS only reflects the ambient RF signals as a passive array and bypasses conventional RF modules such as power amplifier, filters, and ADC/DAC [11]. Hence, IRS “relay” incurs no additional power consumption and is free from thermal noise introduced by RF modules. In this sense, IRS can be regarded as a smart “mirror” that enables us to change the paradigm of wireless communications from adjusting to wireless channel to changing wireless channel [14, 24]. As an active way to make wireless channel better, IRS “relay” assisted wireless communications have attracted great interests from researchers. In [10], IRS is applied to mmWave communications to provide effective reflected paths and thus enhance signal coverage. In [12, 15, 17], joint optimization of the transmit beamforming by active antenna array at the BS/AP and reflect beamforming by passive phase shifters at the IRS is carried out. In [18], empirical studies are performed to analyze the capability of signal coverage enhancement for IRSs assisted mmWave MIMO at 28GHz. In [19], the reconfigurable 60GHz IRS is designed, implemented and deployed to strengthen mmWave connections for indoor networks threatened by blockage. The objective of the work is to validate IRS’s capability to address link blockage problem in mmWave communications, and beam training design is not investigated. Although extensive analytical and empirical studies have been done on IRSs assisted wireless communications in the aforementioned literature, these work either assume the availability of channel state information (CSI) or accurate measurement of BS/AP, MT and IRS’s position and direction.

In [11], a practical transmission protocol and channel estimation are firstly proposed for an IRS-assisted orthogonal frequency division multiplexing (OFDM) system under frequency-selective channels. In [13], by exploiting the channel correlation among different users, a channel estimation scheme with reduced training overhead is proposed. Specifically, with a typical user’s reflection channel vector, estimation of the other users’ reflection channel vector can be simplified as the estimation of a multiplicative coefficient. However, the aforementioned designs were performed in non-mmWave frequency band, and the direct application of them to mmWave communications will fail to utilize the sparse nature of mmWave channel. In [16], to facilitate channel estimation of IRSs assisted link over mmWave band or LoS dominated sub-6GHz band, an upgrade of IRS’s structure is proposed to add a small number of channel sensors to sense and process incident signal. Although [16] is intended to mmWave band, the proposed compressive sensing and deep learning algorithms are incompatible to current structure of IRS which is without channel sensors. In [25], cascade channel estimation of the BS/AP-IRS-MT link in mmWave band is firstly converted into a sparse signal recovery problem and then solved via conventional compressed sensing methods. However, [25] is based on a strong assumption that AoA and AoD parameters lie on the discretized grid. In [26], a two-step channel estimation protocol is proposed for the cascaded BS/AP-IRS-MT link in mmWave band, which includes hierarchical beamforming and high resolution sparse channel estimation. As the selection of fine beam set in hierarchical beamforming is fully dependent on the training results of the wide beams in the previous layer, hierarchical beamforming requires interactions between BS/AP and MT. Thus, the extension of the proposed scheme from single user scenario to multi-user scenario might be costly in training overhead. Besides, as IRS is primarily used in mmWave communications to combat blockage, estimation of blockage in both BS/AP-MT link and BS/AP-IRS-MT link is essential for IRSs assisted mmWave communications, while [16, 25, 26] all neglect blockage effects in their designs.

Due to the deployment of multiple IRSs, beam training of IRSs assisted mmWave communications requires much heavier training overhead than traditional mmWave communications. Also, as the purpose of IRSs is to combat blockage and expand coverage, an accurate estimation of blockage is essential to beam selection by BS/AP. In addition, the lack of RF chains results in the inability of IRSs to sense signal, which further complicates beam training for the paths assisted by IRSs. These three features jointly render traditional beam training methods [27, 28] incompetent in IRSs assisted mmWave communications. Despite the aforementioned new challenges of integrating IRSs to mmWave communications, a notable advantage is that the estimation of path parameters, e.g., AoA/AoD and blockage indicator, can be cross verified, thanks to the relatively large number of deployed IRSs. Specifically, three accurate estimates of AoA/AoD, associated with other essential information, e.g., direction of arrays, can yield the location of MT, and the location of MT will in turn reproduce the path parameters. In this way, the path parameters of IRSs assisted mmWave MIMO can be enhanced according to their geometric relationship. To estimate the channel parameters of IRSs assisted mmWave communications, we have made the following contributions in this paper:

  • We propose a flexible beam training method for IRSs assisted mmWave MIMO by breaking it down into several mathematically equivalent sub-problems, and we further perform random beamforming and maximum likelihood (ML) estimation to jointly estimate AoA and AoD of the dominant path in each sub-problem. The proposed scheme does not require feedback from MT at training stage, and thus can be performed in a broadcasting manner. Hence, the required training overhead will not increase over MT number.

  • We prove the uniqueness of the AoA and AoD estimated by beam training with random beamforming. We further study the impact of training length, and we prove that larger training length almost surely results in smaller pairwise error probability of AoA, AoD pair.

  • By sorting the reliability of the estimated AoA, AoD pairs, we propose an iterative positioning algorithm to estimate the location of MT, and, through numerical analysis, we show that the algorithm achieves centimeter-level positioning accuracy.

  • With the estimated position of MT, we propose to cross verify and enhance the estimation of path parameters, i.e., AoA and AoD, according to their geometric relationship. We further propose an accurate method of blockage estimation by comparing the ML estimate of pathloss and MT position based estimate of pathloss.

Numerical results show the superiority of our proposed beam training scheme and verify the performance gain brought by location information.

The rest of the paper is organized as follows. Section II introduces the system model. In Section III, we break down the beam training design of IRSs assisted mmWave communications. In Section IV, we propose beam training with random beamforming, and specifically we estimate path parameters and study the feasibility of the scheme. In Section V, we study the interplay between positioning and beam training. In Section VI, numerical results are presented. Finally, in Section VII, we draw the conclusion.

Notations:  Column vectors (matrices) are denoted by bold-face lower (upper) case letters, 𝐱(n)\mathbf{x}(n) denotes the nn-th element in the vector 𝐱\mathbf{x}, ()(\cdot)^{*}, ()T(\cdot)^{T} and ()H(\cdot)^{H} represent conjugate, transpose and conjugate transpose operation, respectively, ||||||\cdot|| denotes the Frobenius norm of a vector or a matrix, \odot is Hadamard product. Subtraction and addition of the cosine AoAs/AoDs are defined as θϕ(θϕ+1)mod21\theta\ominus\phi\triangleq(\theta-\phi+1)\mod 2-1 and θϕ(θ+ϕ+1)mod21\theta\oplus\phi\triangleq(\theta+\phi+1)\mod 2-1 to guarantee the result is within the range [1,1)[-1,1).

II System Model

Consider a communication link between the BS/AP and an MT operating in mmWave band, where both ends adopt uniform linear array (ULA) antenna structure. To reduce wireless link blockage rate and thus guarantee the reliable linkage between BS and MT, a number of IRSs are deployed in the cell as shown in Figure 1, and BS/AP is able to control IRSs via cable or lower frequency radio link.

Refer to caption
Figure 1: Illustration of IRSs assisted mmWave communications

The channel response between BS/AP and MT without the assistance of IRSs is represented as [28]

𝐇BM=\displaystyle\mathbf{H}_{BM}= ζLoSδ1𝐚M(θBM,1)𝐚BH(ϕBM,1)+\displaystyle\zeta_{LoS}\delta_{1}\mathbf{a}_{M}(\theta_{BM,1})\mathbf{a}^{H}_{B}(\phi_{BM,1})+
l=2Lδl𝐚M(θBM,l)𝐚BH(ϕBM,l)\displaystyle\sum_{l=2}^{L}\delta_{l}\mathbf{a}_{M}(\theta_{BM,l})\mathbf{a}^{H}_{B}(\phi_{BM,l}) (1)

where ζLoS{0,1}\zeta_{LoS}\in\{0,1\} is the indicator of blockage of the LoS path, and δl\delta_{l}, θBM,l\theta_{BM,l} and ϕBM,l\phi_{BM,l} are channel gain, cosine of AoA, and cosine of AoD of the ll-th path, respectively. The parameters (ζLoS,δ1,θBM,1,ϕBM,1)(\zeta_{LoS},\delta_{1},\theta_{BM,1},\phi_{BM,1}) characterize LoS path, which are of particular interest to us in mmWave communications. According to [14], the path gain of LoS is δ1=λej2πdBM4πdBM\delta_{1}=\frac{\lambda e^{-j2\pi d_{BM}}}{4\pi d_{BM}}, where λ\lambda is the wavelength, and dBMd_{BM} is the distance between BS and MT. Further, the steering vectors are given by

𝐚M(θBM,l)=[1,ejπ1θBM,l,,ejπ(NM1)θBM,l]T\displaystyle\mathbf{a}_{M}(\theta_{BM,l})=[1,\;e^{j\pi 1\theta_{BM,l}},\;\cdots,e^{j\pi(N_{M}-1)\theta_{BM,l}}]^{T}
𝐚B(ϕBM,l)=[1,ejπ1ϕBM,l,,ejπ(NB1)ϕBM,l]T\displaystyle\mathbf{a}_{B}(\phi_{BM,l})=[1,\;e^{j\pi 1\phi_{BM,l}},\;\cdots,e^{j\pi(N_{B}-1)\phi_{BM,l}}]^{T}

where NBN_{B} is the number of antennas of BS/AP, NMN_{M} is the number of antennas of MT.

We also assume that IRSs adopt ULA antenna structure. Thus, the channel response of the reflected path from BS to MT assisted (reflected) by the ii-th IRS is

𝐇BRiM=\displaystyle\mathbf{H}_{BR_{i}M}= ζVLoS,iδ¯BRiM𝐚M(θRiM)𝐚RiH(ϕRiM)\displaystyle\zeta_{VLoS,i}\bar{\delta}_{BR_{i}M}\mathbf{a}_{M}(\theta_{R_{i}M})\mathbf{a}_{R_{i}}^{H}(\phi_{R_{i}M})
diag{𝐠¯i}𝐚Ri(θBRi)𝐚BH(ϕBRi)\displaystyle\operatorname{diag}\{\bar{\mathbf{g}}_{i}\}\mathbf{a}_{R_{i}}(\theta_{BR_{i}})\mathbf{a}_{B}^{H}(\phi_{BR_{i}})
=\displaystyle= ζVLoS,iδBRiM(𝐠¯i)𝐚M(θRiM)𝐚BH(ϕBRi)\displaystyle\zeta_{VLoS,i}\delta_{BR_{i}M}(\bar{\mathbf{g}}_{i})\mathbf{a}_{M}(\theta_{R_{i}M})\mathbf{a}_{B}^{H}(\phi_{BR_{i}}) (2)

where 𝐠¯i\bar{\mathbf{g}}_{i} is the reflection vector that determines the reflection pattern of the ii-th IRS, ζVLoS,i{0,1}\zeta_{VLoS,i}\in\{0,1\} is the indicator of blockage of the path reflected by the ii-th IRS and δ¯BRiM=ξλej2π(dBRi+dRiM)4π(dBRi+dRiM)\bar{\delta}_{BR_{i}M}=\frac{\sqrt{\xi}\lambda e^{-j2\pi(d_{BR_{i}}+d_{R_{i}M})}}{4\pi(d_{BR_{i}}+d_{R_{i}M})} [14], in which ξ\xi is reflection loss, dBRid_{BR_{i}} is the distance between BS and the ii-th IRS, dRiMd_{R_{i}M} is the distance between the ii-th IRS and MT. The equivalent path gain of the IRS reflected path can be written as

δBRiM(𝐠¯i)\displaystyle\delta_{BR_{i}M}(\bar{\mathbf{g}}_{i}) δ¯BRiM𝐚RiH(ϕRiM)diag{𝐠¯i}𝐚Ri(θBRi)\displaystyle\triangleq\bar{\delta}_{BR_{i}M}\mathbf{a}_{R_{i}}^{H}(\phi_{R_{i}M})\operatorname{diag}\{\bar{\mathbf{g}}_{i}\}\mathbf{a}_{R_{i}}(\theta_{BR_{i}})
=δ¯BRiM𝐚RiH(ϕRiMθBRi)𝐠¯i\displaystyle=\bar{\delta}_{BR_{i}M}\mathbf{a}_{R_{i}}^{H}(\phi_{R_{i}M}\ominus\theta_{BR_{i}})\bar{\mathbf{g}}_{i} (3)

The steering vector 𝐚Ri(ϕRiM)\mathbf{a}_{R_{i}}(\phi_{R_{i}M}) is given by

𝐚Ri(ϕRiM)=[1,ejπ1ϕRiM,,ejπ(NRi1)ϕRiM]T\displaystyle\mathbf{a}_{R_{i}}(\phi_{R_{i}M})=[1,\;e^{j\pi 1\phi_{R_{i}M}},\;\cdots,e^{j\pi(N_{R_{i}}-1)\phi_{R_{i}M}}]^{T} (4)

where NRiN_{R_{i}} is the number of passive reflectors of the ii-th IRS. Based on (II), the optimal reflection coefficient vector that maximizes effective received power is 𝐠¯i=𝐚Ri(ϕRiMθBRi)\bar{\mathbf{g}}_{i}^{\star}=\mathbf{a}_{R_{i}}(\phi_{R_{i}M}\ominus\theta_{BR_{i}}).

Hence, the channel response between BS and MT with the assistance of NIRSN_{IRS} IRSs is represented as

𝐇=𝐇BM+i=1NIRSγi𝐇BRiM=\displaystyle\mathbf{H}=\mathbf{H}_{BM}+\sum_{i=1}^{N_{IRS}}\gamma_{i}\mathbf{H}_{BR_{i}M}=
ζLoSδ1𝐚M(θMB,1)𝐚BH(ϕMB,1)LoScomponent+l=2Lδl𝐚M(θMB,l)𝐚BH(ϕMB,l)NLoScomponent\displaystyle\underbrace{\zeta_{LoS}\delta_{1}\mathbf{a}_{M}(\theta_{MB,1})\mathbf{a}^{H}_{B}(\phi_{MB,1})}_{LoS\;component}+\underbrace{\sum_{l=2}^{L}\delta_{l}\mathbf{a}_{M}(\theta_{MB,l})\mathbf{a}^{H}_{B}(\phi_{MB,l})}_{NLoS\;component}
+i=1NIRSγiζVLoS,iδBRiM(𝐠¯i)𝐚M(θRiM)𝐚BH(ϕBRi)VLoScomponent\displaystyle\;\;+\underbrace{\sum_{i=1}^{N_{IRS}}\gamma_{i}\zeta_{VLoS,i}\delta_{BR_{i}M}(\bar{\mathbf{g}}_{i})\mathbf{a}_{M}(\theta_{R_{i}M})\mathbf{a}_{B}^{H}(\phi_{BR_{i}})}_{VLoS\;component} (5)

where

γi={1,whentheithIRSisactivated0,whentheithIRSisdeactivated\displaystyle\gamma_{i}=\left\{\begin{array}[]{cc}1,&{\rm when\;the\;}i{\rm th\;IRS\;is\;activated}\\ 0,&{\;\;\;\;\rm when\;the\;}i{\rm th\;IRS\;is\;deactivated}\end{array}\right. (8)

indicates the activation status of the ii-th IRS and γi\gamma_{i} can be configured by BS/AP.

When the reflection pattern of the vector 𝐠¯i\bar{\mathbf{g}}_{i} is omnidirectional, IRS works as a scatterer that diffuses the energy radiated from BS. When 𝐠¯i=𝐚R(ϕRiMθBRi)\bar{\mathbf{g}}_{i}^{\star}=\mathbf{a}_{R}(\phi_{R_{i}M}\ominus\theta_{BR_{i}}), IRS works as a “mirror” that builds a virtual LoS (VLoS) path between BS and MT, and thus the energy from BS will be concentrated on MT, and ϕRiMθBRi\phi_{R_{i}M}\ominus\theta_{BR_{i}} is termed as the optimal reflection angle of the ii-th VLoS path. We can categorize channel components of 𝐇{\mathbf{H}} into three types as in Eq. (5), namely LoS path component, VLoS path component, and NLoS path component. LoS path component is the direct path between BS and MT, VLoS path component consists of the paths between BS and MT reflected by IRSs, and NLoS path component consists of the paths between BS and MT reflected by scatters, e.g., walls, human bodies, and etc.

As NLoS path component usually varies fast and its weight to the channel is marginal especially in mmWave band, we are more interested in LoS path and VLoS paths. Hence, we intend to estimate (1) the optimal reflection angle ϕRiMθBRi\phi_{R_{i}M}\ominus\theta_{BR_{i}} of IRSs and (2) the path parameters (ζBM,1,δBM,1,θBM,1,ϕBM,1)(\zeta_{BM,1},\delta_{BM,1},\theta_{BM,1},\phi_{BM,1}) of the LoS path and (ζBRiM,δBRiM(𝐠¯i),θRiM,ϕBRi)(\zeta_{BR_{i}M},\delta_{BR_{i}M}({\bar{\mathbf{g}}}^{\star}_{i}),\theta_{R_{i}M},\phi_{BR_{i}}) of the VLoS paths through beam training and location information aided parameter enhancement.

III Framework of Joint Beam Training and Positioning

For conventional mmWave communications, training overhead can be significantly reduced by exploiting the sparse nature of mmWave channel [28, 29]. However, with the assistance of IRSs, the sparse channel of mmWave band is artificially converted into rich scattering channel. The increased scattering effect, together with the unknown optimal reflection angle, jointly complicates the process of beam training. To make the over-complicated problem tractable, we propose to break down beam training of IRSs assisted mmWave MIMO into two sub-problems, and we further show that the two sub-problems are mathematically equivalent. Then, we propose a protocol for joint beam training and positioning which well accommodates multi-user scenario.

III-A Breakdown of Beam Training for IRSs Assisted MmWave MIMO

Refer to caption
Figure 2: Two steps of beam training with random beamforming in IRSs assisted mmWave communications

At first, it is noteworthy that AoA/AoD of the LoS path between IRSs and BS/AP can be accurately pre-measured, since both IRSs and BS/AP are pre-configured. Thus, θBRi\theta_{BR_{i}} and ϕBRi\phi_{BR_{i}} are used as prior knowledge hereafter. Then, beam training of IRSs assisted mmWave MIMO is carried out in the following two steps as illustrated in Figure 2.

Step 1. De-activate all the IRSs, and estimate the parameters (δBM,1,θBM,1,ϕBM,1)(\delta_{BM,1},\theta_{BM,1},\phi_{BM,1}) of LoS path

To estimate the parameters, measures of channel are collected via Tx/Rx random beamforming in BS/AP side and MT side, i.e.,

y=\displaystyle y= PTx𝐦H𝐇BM𝐟s+𝐦H𝐰¯\displaystyle\sqrt{P_{Tx}}\mathbf{m}^{H}\mathbf{H}_{BM}\mathbf{f}s+\mathbf{m}^{H}\bar{\mathbf{w}}
=\displaystyle= PTxζLoSδBM,1𝐦H𝐚M(θBM,1)𝐚BH(ϕBM,1)𝐟+\displaystyle\sqrt{P_{Tx}}\zeta_{LoS}\delta_{BM,1}\mathbf{m}^{H}\mathbf{a}_{M}(\theta_{BM,1})\mathbf{a}_{B}^{H}(\phi_{BM,1})\mathbf{f}+
l=2LPTxδBM,l𝐦H𝐚M(θBM,l)𝐚BH(ϕBM,l)𝐟ν+𝐦H𝐰¯\displaystyle\underbrace{\sum_{l=2}^{L}\sqrt{P_{Tx}}\delta_{BM,l}\mathbf{m}^{H}\mathbf{a}_{M}(\theta_{BM,l})\mathbf{a}_{B}^{H}(\phi_{BM,l})\mathbf{f}}_{\nu}+\mathbf{m}^{H}\bar{\mathbf{w}} (9)

where PTxP_{Tx} is transmit power, 𝐰¯𝒞𝒩(𝟎,σ𝐰¯2𝐈NM)\bar{\mathbf{w}}\sim{\cal{CN}}(\mathbf{0},\sigma^{2}_{\bar{\mathbf{w}}}\mathbf{I}_{N_{M}}) is the zero-mean complex Gaussian additive noise, s=1s=1 is the pilot signal sent by the user, 𝐟\mathbf{f} and 𝐦\mathbf{m} are transmit random beamforming vector at BS/AP side and receive random beamforming vector at MT side111A good random beamforming codebook can be derived offline by high performance computers, and they will be pre-configured in BS/AP, IRS and MT side., respectively, and the entries of 𝐟\mathbf{f} and 𝐦\mathbf{m} are phase-only complex variables with invariable amplitude [30], i.e.,

𝐟=1NB(ejπϱ1,ejπϱ2,,ejπϱNB)T\displaystyle\mathbf{f}=\frac{1}{\sqrt{N_{B}}}\left(e^{j\pi\varrho_{1}},e^{j\pi\varrho_{2}},\cdots,e^{j\pi\varrho_{N_{B}}}\right)^{T}
𝐦=1NM(ejπσ1,ejπσ2,,ejπσNM)T\displaystyle\mathbf{m}=\frac{1}{\sqrt{N_{M}}}\left(e^{j\pi\sigma_{1}},e^{j\pi\sigma_{2}},\cdots,e^{j\pi\sigma_{N_{M}}\emph{}}\right)^{T}

ϱnB\varrho_{n_{B}} is the phase shift value of the nBn_{B}-th analog phase shifter in BS/AP side, σnM\sigma_{n_{M}} is the phase shift value of the nMn_{M}-th analog phase shifter in MT side.

As NLoS paths are much weaker than LoS path in mmWave band, i.e., δBM,l(l=2,,L)\delta_{BM,l}(l=2,\cdots,L) are small compared to δBM,1\delta_{BM,1}, we are very less likely to build an effective communication link via NLoS paths. Hence, the AoA, AoD pair that we are interested in is merely (ζLoS,δBM,1,θBM,1,ϕBM,1)(\zeta_{LoS},\delta_{BM,1},\theta_{BM,1},\phi_{BM,1}), and the term ν\nu will be treated as interference. Considering the small scale and randomness of δBM,l(l=2,,L)\delta_{BM,l}(l=2,\cdots,L), we assume that ν\nu follows complex Gaussian distribution for the simplicity of analysis222Although we assume that ν\nu follows Gaussian distribution in theoretical analysis, the channel model to be applied in numerical simulations still considers NLoS components as in Eq. (1). . Then, the beam training problem for IRSs assisted mmWave MIMO communications is formulated as the estimation of (ζLoS,δBM,1,θBM,1,ϕBM,1)(\zeta_{LoS},\delta_{BM,1},\theta_{BM,1},\phi_{BM,1}) from the following received signal

y=PTxζLoSδBM,1𝐦H𝐚M(θBM,1)𝐚BH(ϕBM,1)𝐟+ν+𝐦H𝐰¯\displaystyle y=\sqrt{P_{Tx}}{\zeta_{LoS}\delta_{BM,1}\mathbf{m}^{H}\mathbf{a}_{M}(\theta_{BM,1})\mathbf{a}^{H}_{B}(\phi_{BM,1})\mathbf{f}}+\nu+\mathbf{m}^{H}\bar{\mathbf{w}} (10)

Adding the subscript nn to yy to denote the received signal in the nn-th time slot, we have

yn\displaystyle y_{n} =PTxζLoSδBM,1𝐦nH𝐚M(θBM,1)𝐚BH(ϕBM,1)𝐟n\displaystyle=\sqrt{P_{Tx}}\zeta_{LoS}\delta_{BM,1}\mathbf{m}_{n}^{H}\mathbf{a}_{M}(\theta_{BM,1})\mathbf{a}^{H}_{B}(\phi_{BM,1})\mathbf{f}_{n}
+νn+𝐦nH𝐰¯n\displaystyle\qquad+\nu_{n}+\mathbf{m}_{n}^{H}\bar{\mathbf{w}}_{n}
=PTxζLoSδBM,1(𝐟nT𝐦nH)𝐛(θBM,1,ϕBM,1)\displaystyle=\sqrt{P_{Tx}}\zeta_{LoS}\delta_{BM,1}(\mathbf{f}_{n}^{T}\otimes\mathbf{m}_{n}^{H})\mathbf{b}(\theta_{BM,1},\phi_{BM,1})
+νn+𝐦nH𝐰¯n\displaystyle\qquad+\nu_{n}+\mathbf{m}_{n}^{H}\bar{\mathbf{w}}_{n}

where 𝐛(θBM,1,ϕBM,1)vec(𝐚M(θBM,1)𝐚BH(ϕBM,1))\mathbf{b}(\theta_{BM,1},\phi_{BM,1})\triangleq vec(\mathbf{a}_{M}(\theta_{BM,1})\mathbf{a}^{H}_{B}(\phi_{BM,1})).

To estimate AoA and AoD, NN channel measurements are to be collected and concatenated, and its vector form is derived as

𝐲=PTxζLoSδBM,1𝐃𝐛(θBM,1,ϕBM,1)+𝝂+𝐰𝐧\displaystyle\mathbf{y}=\sqrt{P_{Tx}}\zeta_{LoS}\delta_{BM,1}\mathbf{D}\mathbf{b}(\theta_{BM,1},\phi_{BM,1})+\underbrace{\bm{\nu}+\mathbf{w}}_{\mathbf{n}} (11)

where

𝐲\displaystyle\mathbf{y} =[y1,y2,,yN]T\displaystyle=\left[y_{1},\;y_{2},\;\cdots,y_{N}\right]^{T}
𝐃\displaystyle\mathbf{D} =[𝐟1𝐦1,𝐟2𝐦2,,𝐟N𝐦N]T\displaystyle=\left[\mathbf{f}_{1}\otimes\mathbf{m}_{1}^{*},\;\mathbf{f}_{2}\otimes\mathbf{m}_{2}^{*},\;\cdots,\;\mathbf{f}_{N}\otimes\mathbf{m}_{N}^{*}\right]^{T}
𝝂\displaystyle\bm{\nu} =[ν1,ν2,,νN]T𝒞𝒩(𝟎,σ𝝂2𝐈N)\displaystyle=\left[{\nu}_{1},\;{\nu}_{2},\;\cdots,\;{\nu}_{N}\right]^{T}\sim{\cal{CN}}(\mathbf{0},\sigma^{2}_{\bm{\nu}}\mathbf{I}_{N})
𝐰\displaystyle\mathbf{w} =[𝐦1H𝐰¯1,𝐦2H𝐰¯2,,𝐦NH𝐰¯N]T\displaystyle=\left[\mathbf{m}_{1}^{H}\bar{\mathbf{w}}_{1},\;\mathbf{m}_{2}^{H}\bar{\mathbf{w}}_{2},\;\cdots,\;\mathbf{m}_{N}^{H}\bar{\mathbf{w}}_{N}\right]^{T}

Since

𝔼(𝐰(ι)𝐰(ι))=𝔼(𝐦ιH𝐰¯ι𝐰¯ιH𝐦ι)=σ𝐰¯2,\displaystyle\mathbb{E}\left({\mathbf{w}}(\iota){\mathbf{w}}^{*}(\iota)\right)=\mathbb{E}\left(\mathbf{m}_{\iota}^{H}\bar{\mathbf{w}}_{\iota}\bar{\mathbf{w}}_{\iota}^{H}\mathbf{m}_{\iota}\right)=\sigma^{2}_{\bar{\mathbf{w}}},
𝔼(𝐰(ι)𝐰(κ))=𝔼(𝐦ιH𝐰¯ι𝐰¯κH𝐦κ)=0,ικ\displaystyle\mathbb{E}\left({\mathbf{w}}(\iota){\mathbf{w}}^{*}(\kappa)\right)=\mathbb{E}\left(\mathbf{m}_{\iota}^{H}\bar{\mathbf{w}}_{\iota}\bar{\mathbf{w}}_{\kappa}^{H}\mathbf{m}_{\kappa}\right)=0,\;\forall\iota\neq\kappa

the covariance of the equivalent noise 𝐰\mathbf{w} is thus 𝔼(𝐰𝐰H)=σ𝐰¯2𝐈N\mathbb{E}(\mathbf{w}\mathbf{w}^{H})=\sigma^{2}_{\bar{\mathbf{w}}}\mathbf{I}_{N}. Let 𝐧𝝂+𝐰\mathbf{n}\triangleq\bm{\nu}+\mathbf{w}, as 𝝂\bm{\nu} and 𝐰\mathbf{w} are independent of each other, we have 𝐧𝒞𝒩(𝟎,(σ𝐰¯2+σ𝝂2)𝐈N)\mathbf{n}\sim{\cal{CN}}\left(\mathbf{0},\left(\sigma^{2}_{\bar{\mathbf{w}}}+\sigma^{2}_{\bm{\nu}}\right)\mathbf{I}_{N}\right).

Based on the above analysis, beam training for the link between BS/AP and MT is summarized as follows.
Sub-problem 1: How to accurately estimate the parameter set (ζLoS,δBM,1,θBM,1,ϕBM,1)(\zeta_{LoS},\delta_{BM,1},\theta_{BM,1},\phi_{BM,1}) from 𝐲\mathbf{y}.

Step 2. Activate the ii-th IRS, de-activate the rest IRSs, and estimate the parameters (δBRiM,θRiM,ϕRiMθBRi)(\delta_{BR_{i}M},\theta_{R_{i}M},\phi_{R_{i}M}\ominus\theta_{BR_{i}}) of the ii-th VLoS path. Repeat the above process for the rest IRSs.

As ϕBRi\phi_{BR_{i}} is known, with the transmit beamforming vector 𝐟=𝐚B(ϕBRi)NB\mathbf{f}=\frac{\mathbf{a}_{B}(\phi_{BR_{i}})}{\sqrt{N_{B}}}, BS/AP is able to concentrate its power towards IRSs via transmit beamforming. Simultaneously, IRS performs passive random reflection and MT performs receive random beamforming, the received signal at MT side is written as

y\displaystyle\qquad y
=PTx𝐦H(𝐇BM+𝐇BRiM)𝐚B(ϕBRi)NB+𝐦H𝐰¯\displaystyle=\sqrt{P_{Tx}}\mathbf{m}^{H}\left(\mathbf{H}_{BM}+\mathbf{H}_{BR_{i}M}\right)\frac{\mathbf{a}_{B}(\phi_{BR_{i}})}{\sqrt{N_{B}}}+\mathbf{m}^{H}\bar{\mathbf{w}}
=NBPTxζVLoS,iδBRiM𝐦H𝐚M(θRiM)𝐚RiH(ϕRiMθBRi)𝐠¯i\displaystyle=\sqrt{N_{B}P_{Tx}}\zeta_{VLoS,i}\delta_{BR_{i}M}\mathbf{m}^{H}\mathbf{a}_{M}(\theta_{R_{i}M})\mathbf{a}_{R_{i}}^{H}(\phi_{R_{i}M}\ominus\theta_{BR_{i}})\bar{\mathbf{g}}_{i}
+PTxζLoSδ1𝐦H𝐚M(θMB,1)𝐚BH(ϕMB,1)𝐚B(ϕBRi)NBν1\displaystyle+\underbrace{\sqrt{P_{Tx}}\zeta_{LoS}\delta_{1}\mathbf{m}^{H}\mathbf{a}_{M}(\theta_{MB,1})\mathbf{a}^{H}_{B}(\phi_{MB,1})\frac{\mathbf{a}_{B}(\phi_{BR_{i}})}{\sqrt{N_{B}}}}_{\nu_{1}}
+l=2LPTxδl𝐦H𝐚M(θMB,l)𝐚BH(ϕMB,l)𝐚B(ϕBRi)NBν2+𝐦H𝐰¯w\displaystyle+\underbrace{\sum_{l=2}^{L}\sqrt{P_{Tx}}\delta_{l}\mathbf{m}^{H}\mathbf{a}_{M}(\theta_{MB,l})\mathbf{a}^{H}_{B}(\phi_{MB,l})\frac{\mathbf{a}_{B}(\phi_{BR_{i}})}{\sqrt{N_{B}}}}_{\nu_{2}}+\underbrace{\mathbf{m}^{H}\bar{\mathbf{w}}}_{w} (12)

The interference term ν1\nu_{1} and ν2\nu_{2} are insignificant due to 1) the small NLoS path coefficients δl(l=2,,L)\delta_{l}(l=2,\cdots,L) in mmWave band, 2) the spatial filtering impact, i.e., 𝐚BH(θMB,l)𝐚B(ϕBRi)0,(l=1,2,,L)\mathbf{a}^{H}_{B}(\theta_{MB,l})\mathbf{a}_{B}(\phi_{BR_{i}})\approx 0,(l=1,2,\cdots,L) for |ϕBRiθMB,l|>1NB|\phi_{BR_{i}}-\theta_{MB,l}|>\frac{1}{N_{B}}.

Similar to (11), by concatenating NN channel measurements, we have

𝐲=NBPTxζVLoS,iδBRiM𝐃𝐛(θRiM,ϕRiMθBRi)\displaystyle\mathbf{y}=\sqrt{N_{B}P_{Tx}}\zeta_{VLoS,i}\delta_{BR_{i}M}\mathbf{D}\mathbf{b}(\theta_{R_{i}M},\phi_{R_{i}M}\ominus\theta_{BR_{i}})
+𝝂1+𝝂2+𝐰𝐧\displaystyle\qquad+\underbrace{\bm{\nu}_{1}+\bm{\nu}_{2}+\mathbf{w}}_{\mathbf{n}} (13)

where

𝐃=[𝐠1𝐦1,𝐠2𝐦2,,𝐠N𝐦N]T\displaystyle{\mathbf{D}}=\left[{\mathbf{g}}_{1}\otimes\mathbf{m}_{1}^{*},\;{\mathbf{g}}_{2}\otimes\mathbf{m}_{2}^{*},\;\cdots,\;{\mathbf{g}}_{N}\otimes\mathbf{m}_{N}^{*}\right]^{T}

Based on the above analysis, beam training for the reflected path between BS/AP and MT assisted by the ii-th IRS is summarized as follows.
Sub-problem 2: How to accurately estimate the parameter set (ζVLoS,i,δBRiM,θRiM,ϕRiMθBRi)(\zeta_{VLoS,i},\delta_{BR_{i}M},\theta_{R_{i}M},\phi_{R_{i}M}\ominus\theta_{BR_{i}}) from 𝐲\mathbf{y}.

Remark 1.

From (11) and (13), we can easily find that Sub-problem 1 and Sub-problem 2 are mathematically equivalent. Therefore, through flexible control over IRS, we are capable to decompose the complicated non-sparse channel estimation problem of IRSs assisted mmWave MIMO into a set of simplified sub-problems.

III-B Protocol of Joint Beam Training and Positioning

Refer to caption
Figure 3: Sequence diagram of joint beam training and positioning

On the basis of beam training breakdown, we introduce the protocol for joint beam training and position in the IRSs assisted mmWave communication system.

The procedures of the proposed scheme are given in Figure 3. Specifically, the scheme is divided into three stages, i.e., Stage I. channel measurement, Stage II. parameter estimation and Stage III. positioning and location information aided parameter enhancement. In Stage I, the channel measurement vector 𝐲\mathbf{y} in Eq. (11) and Eq. (13) are collected; In Stage II, ML estimation is performed to estimate the parameters of LoS path and VLoS paths in MT side, which will be introduced in Section IV; In Stage III, positioning and position aided path parameter refinement are performed in MT side, which will be introduced in Section V.

In practice, when an MT enters a cell, the prior information, e.g., random beamforming vector sequence of BS/AP, position of BS/AP and IRSs, will be sent to the MT via the lower frequency link, such as sub-6GHz link of 5G New Radio. Upon the request of high-speed mmWave data transmission, the random beamforming process in Stage I will be triggered periodically prior to data transmission to help setup initial beam alignment for new users and maintain beam alignment for the existing users. Then, in Stage II and Stage III, each MT performs parameters estimation, positioning and position aided parameter refinement based on its own channel measurement vector 𝐲\mathbf{y}. Finally, the estimated path parameters are fed back to BS/AP by each MT individually to facilitate beamforming designs for the subsequent mmWave data transmission.

It is noteworthy that, as random beamforming is quasi-omnidirectional [31], the pilot sent by BS/AP can be received by MTs from all directions simultaneously. The broadcasting mechanism of random beamforming, which is similar to Global Positioning System (GPS), enables channel measurements to be collected and processed by each MT individually without causing interference. Therefore, training overhead of the proposed scheme will not increase with MT number, which renders the scheme particularly suitable for multi-user scenario.

IV Beam Training With Random Beamforming – Parameter Estimation and Feasibility Study

In this section, ML estimation method is applied to estimate the path parameters (δ,θ,ϕ)(\delta,\theta,\phi) of LoS/VLoS paths from channel measurements sampled by random Rx/Tx beamforming. Furthermore, the feasibility of random beamforming based beam training is verified.

IV-A Maximum Log-likelihood Estimation of (δ,θ,ϕ)(\delta,\theta,\phi)

For conciseness of expression, we write the unified model of sub-problem 1 and sub-problem 2 as

𝐲=ζδ𝐃𝐛(θ,ϕ)+𝐧\displaystyle\mathbf{y}=\zeta\delta\mathbf{D}\mathbf{b}(\theta,\phi)+{\mathbf{n}} (14)

where ζ\zeta is the indicator of blockage, δ\delta is equivalent path gain ( δ=PTxδBM,1\delta=\sqrt{P_{Tx}}\delta_{BM,1} or δ=PTxNBδBRiM\delta=\sqrt{P_{Tx}N_{B}}\delta_{BR_{i}M} ), θ\theta is cosine AoA, ϕ\phi is equivalent cosine AoD (ϕ=ϕBM,1\phi=\phi_{BM,1} or ϕ=ϕRiMθBRi\phi=\phi_{R_{i}M}\ominus\theta_{BR_{i}}), and 𝐛(θ,ϕ)vec(𝐚Rx(θ)𝐚TxH(ϕ))\mathbf{b}(\theta,\phi)\triangleq vec(\mathbf{a}_{Rx}(\theta)\mathbf{a}^{H}_{Tx}(\phi)).

It is noteworthy that estimation of (δ,θ,ϕ)(\delta,\theta,\phi) should be performed merely when ζ=1\zeta=1, as the measurement vector 𝐲\mathbf{y} given that ζ=0\zeta=0 contains no information about (δ,θ,ϕ)(\delta,\theta,\phi). Therefore, we estimate the parameters (δ,θ,ϕ)(\delta,\theta,\phi) through maximizing log-likelihood function under the assumption that ζ=1\zeta=1, i.e.,

(δ^,θ^,ϕ^)=argmaxδ,θ,ϕ(δ,θ,ϕ)\displaystyle(\hat{\delta},\hat{\theta},\hat{\phi})=\operatorname*{arg\,max}_{\delta,\theta,\phi}\mathcal{L}(\delta,\theta,\phi) (15)

where

(δ,θ,ϕ)=\displaystyle\mathcal{L}(\delta,\theta,\phi)= logP(𝐲|ζ=1,δ,θ,ϕ)\displaystyle\log P(\mathbf{y}|\zeta=1,\delta,\theta,\phi)
=\displaystyle= NlogπNlogσ2𝐲δ𝐃𝐛(θ,ϕ)22σ2\displaystyle-N\log{\pi}-N\log\sigma^{2}-\frac{\|\mathbf{y}-\delta\mathbf{D}\mathbf{b}(\theta,\phi)\|_{2}^{2}}{\sigma^{2}} (16)

and the conditional probability is

P(𝐲|ζ,δ,θ,ϕ)=1πNdet(σ2𝐈N)e(𝐲ζδ𝐃𝐛(θ,ϕ))H(𝐲ζδ𝐃𝐛(θ,ϕ))σ2\displaystyle P(\mathbf{y}|\zeta,\delta,\theta,\phi)=\frac{1}{\pi^{N}\det(\sigma^{2}\mathbf{I}_{N})}e^{-\frac{(\mathbf{y}-\zeta\delta\mathbf{D}\mathbf{b}(\theta,\phi))^{H}(\mathbf{y}-\zeta\delta\mathbf{D}\mathbf{b}(\theta,\phi))}{\sigma^{2}}} (17)

IV-A1 Estimation of δ\delta

Before the derivation of θ^,ϕ^\hat{\theta},\hat{\phi}, we should find the expression of δ^\hat{\delta}. To this end, we ignore terms independent thereof and set

(δ,θ,ϕ)δ=0\displaystyle\frac{\partial\mathcal{L}(\delta,\theta,\phi)}{\partial\delta}=0 (18)

Expanding Eq. (18), we have

2Re{(𝐃𝐛(θ,ϕ))H(𝐲δ𝐃𝐛(θ,ϕ))}=0\displaystyle 2Re\left\{\left(\mathbf{D}\mathbf{b}(\theta,\phi)\right)^{H}(\mathbf{y}-\delta\mathbf{D}\mathbf{b}(\theta,\phi))\right\}=0 (19)

From Eq. (19), the optimal δ^\hat{\delta} is derived as

δ^=𝐛H(θ,ϕ)𝐃H𝐲𝐃𝐛(θ,ϕ)22\displaystyle\hat{\delta}=\frac{\mathbf{b}^{H}(\theta,\phi)\mathbf{D}^{H}\mathbf{y}}{\|\mathbf{D}\mathbf{b}(\theta,\phi)\|_{2}^{2}} (20)

IV-A2 Estimation of θ\theta and ϕ\phi

Next, we will jointly estimate θ\theta and ϕ\phi. Substituting Eq. (20) into Eq. (IV-A), we have

(δ,θ,ϕ)\displaystyle\mathcal{L}(\delta,\theta,\phi)
=\displaystyle= NlogπNlogσ2𝐲𝐃𝐛(θ,ϕ)𝐛H(θ,ϕ)𝐃H𝐃𝐛(θ,ϕ)22𝐲22σ2\displaystyle-N\log{\pi}-N\log\sigma^{2}-\frac{\left\|\mathbf{y}-\frac{\mathbf{D}\mathbf{b}(\theta,\phi)\mathbf{b}^{H}(\theta,\phi)\mathbf{D}^{H}}{\|\mathbf{D}\mathbf{b}(\theta,\phi)\|_{2}^{2}}\mathbf{y}\right\|_{2}^{2}}{\sigma^{2}} (21)

Since

𝐲𝐃𝐛(θ,ϕ)𝐛H(θ,ϕ)𝐃H𝐃𝐛(θ,ϕ)22𝐲22\displaystyle\left\|\mathbf{y}-\frac{\mathbf{D}\mathbf{b}(\theta,\phi)\mathbf{b}^{H}(\theta,\phi)\mathbf{D}^{H}}{\|\mathbf{D}\mathbf{b}(\theta,\phi)\|_{2}^{2}}\mathbf{y}\right\|_{2}^{2}
=\displaystyle= 𝐲H(𝐈𝐃𝐛(θ,ϕ)𝐛H(θ,ϕ)𝐃H𝐃𝐛(θ,ϕ)22)𝐲,\displaystyle\mathbf{y}^{H}(\mathbf{I}-\frac{\mathbf{D}\mathbf{b}(\theta,\phi)\mathbf{b}^{H}(\theta,\phi)\mathbf{D}^{H}}{\|\mathbf{D}\mathbf{b}(\theta,\phi)\|_{2}^{2}})\mathbf{y}, (22)

the beam training problem is formulated as

P1:\displaystyle P1:\quad maxθ,ϕ𝐛H(θ,ϕ)𝐃H𝐃𝐛(θ,ϕ)2𝐲22\displaystyle\max_{\theta,\phi}\left\|\frac{\mathbf{b}^{H}(\theta,\phi)\mathbf{D}^{H}}{\|\mathbf{D}\mathbf{b}(\theta,\phi)\|_{2}}\mathbf{y}\right\|_{2}^{2}
s.t.1θ<1\displaystyle s.t.\;-1\leq\theta<1
1ϕ<1\displaystyle\;\;\;\;\;\;-1\leq\phi<1

P1 is a non-convex problem. However, as there are only two real-valued variables to be estimated, a simple yet efficient two-step algorithm can be readily applied to solve P1. For conciseness, let g(θ,ϕ)𝐛H(θ,ϕ)𝐃H𝐃𝐛(θ,ϕ)2𝐲22g({\theta,\phi})\triangleq\left\|\frac{\mathbf{b}^{H}(\theta,\phi)\mathbf{D}^{H}}{\|\mathbf{D}\mathbf{b}(\theta,\phi)\|_{2}}\mathbf{y}\right\|_{2}^{2}. The two-step algorithm is explained as follows.

Step 1. Joint AoA and AoD Coarse Search

Set quantization level ZθZ_{\theta} and ZϕZ_{\phi}, and then exhaustively search for the NpkN_{pk} largest maxima that satisfy

g(θι^,ϕκ^)>g(θι^1,ϕκ^)\displaystyle g(\theta_{\hat{\iota}},\phi_{\hat{\kappa}})>g(\theta_{\hat{\iota}-1},\phi_{\hat{\kappa}})
g(θι^,ϕκ^)>g(θι^+1,ϕκ^)\displaystyle g(\theta_{\hat{\iota}},\phi_{\hat{\kappa}})>g(\theta_{\hat{\iota}+1},\phi_{\hat{\kappa}})
g(θι^,ϕκ^)>g(θι^,ϕκ^1)\displaystyle g(\theta_{\hat{\iota}},\phi_{\hat{\kappa}})>g(\theta_{\hat{\iota}},\phi_{\hat{\kappa}-1})
g(θι^,ϕκ^)>g(θι^,ϕκ^+1)\displaystyle g(\theta_{\hat{\iota}},\phi_{\hat{\kappa}})>g(\theta_{\hat{\iota}},\phi_{\hat{\kappa}+1})

over the discrete grid

𝒟{(θι,ϕκ)|θι=1+2ι1Zθ,ι=1,2,,Zθ,\displaystyle\mathcal{D}\triangleq\left\{(\theta_{\iota},\phi_{\kappa})\Big{|}\;\theta_{\iota}=-1+\frac{2\iota-1}{Z_{\theta}},\iota=1,2,\cdots,Z_{\theta},\right.
ϕκ=1+2κ1Zϕ,κ=1,2,,Zϕ}\displaystyle\left.\qquad\qquad\quad\;\;\phi_{\kappa}=-1+\frac{2\kappa-1}{Z_{\phi}},\kappa=1,2,\cdots,Z_{\phi}\right\} (23)

Step 2. Joint AoA and AoD Fine Search

For a given discrete maximum (θι^,ϕκ^)T(\theta_{\hat{\iota}},\phi_{\hat{\kappa}})^{T}, run gradient descent search starting from (θ(1),ϕ(1))T=(θι^,ϕκ^)T(\theta^{(1)},\phi^{(1)})^{T}=(\theta_{\hat{\iota}},\phi_{\hat{\kappa}})^{T} as follows

(θ(i+1)ϕ(i+1))=(θ(i)ϕ(i))λ(g(θ,ϕ)θ|θ=θ(i)g(θ,ϕ)ϕ|ϕ=ϕ(i))\displaystyle\left(\begin{array}[]{c}\theta^{(i+1)}\\ \phi^{(i+1)}\\ \end{array}\right)=\left(\begin{array}[]{c}\theta^{(i)}\\ \phi^{(i)}\\ \end{array}\right)\oplus\lambda\left(\begin{array}[]{c}\frac{\partial g({\theta,\phi})}{\partial\theta}\big{|}_{\theta=\theta^{(i)}}\\ \frac{\partial g({\theta,\phi})}{\partial\phi}\big{|}_{\phi=\phi^{(i)}}\\ \end{array}\right) (30)

where λ\lambda is the preset step size and the expressions of g(θ,ϕ)θ\frac{\partial g({\theta,\phi})}{\partial\theta} and g(θ,ϕ)ϕ\frac{\partial g({\theta,\phi})}{\partial\phi} are given in Appendix A. The iteration stops when (θ(i+1)θ(i))2+(ϕ(i+1)ϕ(i))2ϵ(\theta^{(i+1)}\ominus\theta^{(i)})^{2}+(\phi^{(i+1)}\ominus\phi^{(i)})^{2}\leq\epsilon, where ϵ\epsilon is a preset parameter.

Repeat the above operations over the rest Npk1N_{pk}-1 maxima derived in Step 1, and select the best one as (θ^,ϕ^)(\hat{\theta},\hat{\phi}). Then, the exact value of the estimated path gain δ^\hat{\delta} can be subsequently obtained by substituting (θ^,ϕ^)(\hat{\theta},\hat{\phi}) into Eq. (20).

Remark 2.

The complexity of Step 1 is 𝒪(22ZϕZθ)\mathcal{O}(2^{2}Z_{\phi}Z_{\theta}). The complexity of Step 2 mainly arises from the computation of the gradients g(θ,ϕ)θ\frac{\partial g({\theta,\phi})}{\partial\theta} and g(θ,ϕ)ϕ\frac{\partial g({\theta,\phi})}{\partial\phi}, which, according to Eq. (A) and Eq. (A), is 𝒪(NBNMN)\mathcal{O}(N_{B}N_{M}N) (or 𝒪(NRiNMN)\mathcal{O}(N_{R_{i}}N_{M}N)). Hence, the complexity of Step 2 is 𝒪(niterNpkNBNMN)\mathcal{O}(n_{iter}N_{pk}N_{B}N_{M}N) or (𝒪(niterNpkNRiNMN)\mathcal{O}(n_{iter}N_{pk}N_{R_{i}}N_{M}N)), where the iteration number nitern_{iter} depends on step size and stopping criterion of the gradient method and is generally less than 2020. Thus, the overall complexity is 𝒪(22ZϕZθ+niterNpkNBNMN)\mathcal{O}(2^{2}Z_{\phi}Z_{\theta}+n_{iter}N_{pk}N_{B}N_{M}N) (or 𝒪(22ZϕZθ+niterNpkNRiNMN)\mathcal{O}(2^{2}Z_{\phi}Z_{\theta}+n_{iter}N_{pk}N_{R_{i}}N_{M}N)).

IV-B Uniqueness of The Estimated AoA and AoD Pair

To delve into the effectiveness of beam training with random beamforming, conditions under which (θ,ϕ)(\theta,\phi) can be accurately estimated from the measurement signal 𝐲\mathbf{y} are studied in the ideal scenario without noise or interference.

Firstly, two definitions of uniqueness are introduced as follows.
(1) Uniqueness of measurement signal representation, namely

𝐲\displaystyle\mathbf{y} =δ𝐃𝐛(θ,ϕ)\displaystyle=\delta\mathbf{D}\mathbf{b}(\theta,\phi)
δ~𝐃𝐛(θ~,ϕ~),δ~,(θ~,ϕ~)(θ,ϕ)\displaystyle\neq\widetilde{\delta}\mathbf{D}\mathbf{b}(\widetilde{\theta},\widetilde{\phi}),\;\;\;\;\forall\widetilde{\delta}\in\mathbb{C},\forall(\widetilde{\theta},\widetilde{\phi})\neq(\theta,\phi) (31)

(2) Uniqueness of estimated AoA and AoD pair, namely

𝐛H(θ,ϕ)𝐃H𝐃𝐛(θ,ϕ)2𝐲2>𝐛H(θ~,ϕ~)𝐃H𝐃𝐛(θ~,ϕ~)2𝐲2,(θ~,ϕ~)(θ,ϕ)\displaystyle\left\|\frac{\mathbf{b}^{H}({\theta},{\phi})\mathbf{D}^{H}}{\|\mathbf{D}\mathbf{b}({\theta},{\phi})\|_{2}}\mathbf{y}\right\|_{2}>\left\|\frac{\mathbf{b}^{H}(\widetilde{\theta},\widetilde{\phi})\mathbf{D}^{H}}{\|\mathbf{D}\mathbf{b}(\widetilde{\theta},\widetilde{\phi})\|_{2}}\mathbf{y}\right\|_{2},\;\forall(\widetilde{\theta},\widetilde{\phi})\neq(\theta,\phi) (32)

Uniqueness of measurement signal representation means that any AoA, AoD pair (θ~,ϕ~)(\widetilde{\theta},\widetilde{\phi}) that differs from (θ,ϕ)(\theta,\phi) cannot construct the measurement signal 𝐲\mathbf{y}. It is an inherent property of the sampling method, which is primarily determined by 𝐃\mathbf{D}. By contrast, uniqueness of the estimated AoA and AoD depends on both sampling method and estimation method. It indicates that AoA, AoD pair can be accurately estimated from the measurement signal 𝐲\mathbf{y} using a specific estimation method.

In the following Theorem, we will study the relationship between the above two types of uniqueness.

Theorem 1.

As long as uniqueness of measurement signal representation is satisfied, ML method is capable to accurately estimate the AoA, AoD pair.

Proof.

See Appendix B. ∎

According to Theorem 1, the uniqueness of AoA and AoD estimation is equivalent to the uniqueness of measurement signal representation, which means we just need to investigate the conditions on which uniqueness of measurement signal representation can be achieved.

Before studying the sensing matrix 𝐃\mathbf{D}, we will observe the signal space of channel response. The vectorized response of LoS path, namely 𝐡=δ𝐛(θ,ϕ)\mathbf{h}=\delta\mathbf{b}(\theta,\phi), is a high dimensional (NrNtN_{r}N_{t}-dimensional) variable that is characterized by (δ,θ,ϕ)(\delta,\theta,\phi), and we define the signal space of 𝐡\mathbf{h} as

𝒮{δ𝐛(θ,ϕ)|δ,1θ,ϕ<1}\displaystyle\mathcal{S}\triangleq\{\delta\mathbf{b}(\theta,\phi)|\delta\in\mathbb{C},-1\leq\theta,\phi<1\} (33)

𝒮\mathcal{S} is a nonlinear kk-dimensional (k=3k=3) submanifold of NrNt\mathbb{C}^{N_{r}N_{t}} with the parameters (δ,θ,ϕ)(\delta,\theta,\phi) [32, 33]. As 𝐛(θ,ϕ)\mathbf{b}(\theta,\phi) is the Kronecker product of two array steering vectors, 𝒮\mathcal{S} is indeed the so-called array manifold [34]. Thus, one channel realization 𝐡ˇ\check{\mathbf{h}} with the parameters (δˇ,θˇ,ϕˇ)(\check{\delta},\check{\theta},\check{\phi}) can be seen as a point in the array manifold. The dimensionality kk can be interpreted as an “information level” of the signal, analogous to the sparsity level in compressive sensing problems [32, 35, 36]. In [32], it is proved that signals obeying manifold models can also be recovered from only a few measurements, simply by replacing the traditional compressive sensing model of sparsity with a manifold model for 𝐡\mathbf{h}. The above statement is supported by Lemma 1.

Lemma 1.

For a random orthoprojector 𝚽M×N\bm{\Phi}\in\mathbb{C}^{M\times N}, the following statement

(1ϵ)MN𝚽𝐡1𝚽𝐡222𝐡1𝐡222(1+ϵ)MN,\displaystyle(1-\epsilon)\sqrt{\frac{M}{N}}\leq\frac{\left\|\bm{\Phi}\mathbf{h}_{1}-\bm{\Phi}\mathbf{h}_{2}\right\|_{2}^{2}}{\|\mathbf{h}_{1}-\mathbf{h}_{2}\|_{2}^{2}}\leq(1+\epsilon)\sqrt{\frac{M}{N}},
𝐡1,𝐡2𝒮,𝐡1𝐡2\displaystyle\forall\mathbf{h}_{1},\mathbf{h}_{2}\in\mathcal{S},\mathbf{h}_{1}\neq\mathbf{h}_{2} (34)

holds with high probability, when dimensionality MM of the projected low-dimensional space is sufficient 333The sufficient number of MM is related to ϵ\epsilon and several manifold-related factors, e.g., condition number, volume, and geodesic covering regularity. Detailed analysis can be referred to [32, 33]. In practice, the exact relationship between the sufficient number and its dependent factors is of limited significance due to the following two reasons, (1) the received measurement signal is corrupted by noise, (2)MM can be online adjusted according to channel conditions., where 𝐡1𝒮,𝐡2𝒮\mathbf{h}_{1}\in\mathcal{S},\mathbf{h}_{2}\in\mathcal{S}, 𝐡1𝐡2\mathbf{h}_{1}\neq\mathbf{h}_{2}, 0<ϵ<10<\epsilon<1 is the isometry constant [32].

Remark 3.

𝐡1𝐡222\|\mathbf{h}_{1}-\mathbf{h}_{2}\|_{2}^{2} is the Euclidean distance between two points 𝐡1\mathbf{h}_{1}, 𝐡2\mathbf{h}_{2} on the manifold, and 𝚽𝐡1𝚽𝐡222\left\|\bm{\Phi}\mathbf{h}_{1}-\bm{\Phi}\mathbf{h}_{2}\right\|_{2}^{2} is the Euclidean distance between the projected points 𝚽𝐡1,𝚽𝐡2\bm{\Phi}\mathbf{h}_{1},\bm{\Phi}\mathbf{h}_{2} on the image of 𝒮\mathcal{S} (namely 𝚽𝒮\bm{\Phi}\mathcal{S}). The isometry constant ϵ\epsilon measures the degree that the pairwise Euclidean distance between points on 𝒮\mathcal{S} is preserved under the mapping 𝚽\bm{\Phi}. Apparently, Lemma 1 indicates that 𝚽𝐡1𝚽𝐡222>0\left\|\bm{\Phi}\mathbf{h}_{1}-\bm{\Phi}\mathbf{h}_{2}\right\|_{2}^{2}>0 is satisfied with high probability, as it is a weaker condition than Lemma 1.

Although the sensing matrix 𝐃\mathbf{D} is not necessarily an orthoprojector, via singular value decomposition, it can be decomposed as 𝐃=𝚿~𝚲~𝚽~\mathbf{D}=\widetilde{\bm{\Psi}}\widetilde{\bm{\Lambda}}\widetilde{\bm{\Phi}}, where 𝚿~M×M,𝚲~M×M\widetilde{\bm{\Psi}}\in\mathbb{C}^{M\times M},\widetilde{\bm{\Lambda}}\in\mathbb{C}^{M\times M}, and 𝚽~M×N\widetilde{\bm{\Phi}}\in\mathbb{C}^{M\times N}. Then, we have 𝐃𝐡1𝐃𝐡222=𝚲~𝚽~𝐡1𝚲~𝚽~𝐡222\|\mathbf{D}\mathbf{h}_{1}-\mathbf{D}\mathbf{h}_{2}\|_{2}^{2}=\|\widetilde{\bm{\Lambda}}\widetilde{\bm{\Phi}}\mathbf{h}_{1}-\widetilde{\bm{\Lambda}}\widetilde{\bm{\Phi}}\mathbf{h}_{2}\|_{2}^{2}, where 𝚽~\widetilde{\bm{\Phi}} is indeed the orthoprojector, and 𝚲~\widetilde{\bm{\Lambda}} is a diagonal matrix with non-zero elements that scales the component in each dimension. 𝚽~𝐡1𝚽~𝐡222>0\|\widetilde{\bm{\Phi}}\mathbf{h}_{1}-\widetilde{\bm{\Phi}}\mathbf{h}_{2}\|_{2}^{2}>0 implicates 𝐃𝐡1𝐃𝐡222>0\|\mathbf{D}\mathbf{h}_{1}-\mathbf{D}\mathbf{h}_{2}\|_{2}^{2}>0, which is equivalent to 𝐃𝐡1𝐃𝐡2\mathbf{D}\mathbf{h}_{1}\neq\mathbf{D}\mathbf{h}_{2}, namely, δ1𝐃𝐛(θ1,ϕ1)δ2𝐃𝐛(θ2,ϕ2),(δ1,θ1,ϕ1)(δ2,θ2,ϕ2)\delta_{1}\mathbf{D}\mathbf{b}(\theta_{1},\phi_{1})\neq\delta_{2}\mathbf{D}\mathbf{b}(\theta_{2},\phi_{2}),\;\forall(\delta_{1},\theta_{1},\phi_{1})\neq(\delta_{2},\theta_{2},\phi_{2}). Thus, it is easy to find that 𝐃𝐛(θ1,ϕ1)μ𝐃𝐛(θ2,ϕ2),(θ1,ϕ1)(θ2,ϕ2),μ\mathbf{D}\mathbf{b}(\theta_{1},\phi_{1})\neq{\mu}\mathbf{D}\mathbf{b}(\theta_{2},\phi_{2}),\;\forall(\theta_{1},\phi_{1})\neq(\theta_{2},\phi_{2}),\forall{\mu}\in\mathbb{C}, where μδ2δ1{\mu}\triangleq\frac{\delta_{2}}{\delta_{1}}.

To conclude, the randomly generated sensing matrix 𝐃\mathbf{D} has a large probability to guarantee the uniqueness of ML based joint AoA and AoD estimation.

IV-C On The Impact of Training Length NN

Theorem 1 indicates that, with random beamforming, Eq. (32) holds with high probability. In other words, in noiseless scenario, the distance gap between the highest peak (global optimum) and other peaks (other local optimums) exist with high probability. However, in practice, corrupted by noise and interference, the highest peak may (1) shift to its adjacent points, or (2) be transcended and replaced by other peaks. Error Type 1 incurs mild AoA, AoD estimation error followed by power loss of an acceptable level; Error Type 2 incurs significant AoA, AoD estimation error followed by beam misalignment. Apparently, we would like to avoid Error Type 2.

To study the estimation error, the pairwise error probability (PEP) of any two parameter sets (θ,ϕ)(\theta,\phi) and (θ~,ϕ~)(\widetilde{\theta},\widetilde{\phi}) is derived in the following theorem.

Theorem 2.

The PEP Pe((θ,ϕ)(θ~,ϕ~))Pe\left((\theta,\phi)\rightarrow(\widetilde{\theta},\widetilde{\phi})\right) that (θ,ϕ)(\theta,\phi) is mistaken as (θ~,ϕ~)(\widetilde{\theta},\widetilde{\phi}) in relatively high SNR regime can be approximated as

Pe((θ,ϕ)(θ~,ϕ~))Q(|δ|22σ2d2(𝐃,θ,ϕ,θ~,ϕ~))\displaystyle\qquad Pe\left((\theta,\phi)\rightarrow(\widetilde{\theta},\widetilde{\phi})\right)\approx Q\left(\frac{|\delta|^{2}}{2\sigma^{2}}d^{2}(\mathbf{D},\theta,\phi,\widetilde{\theta},\widetilde{\phi})\right) (35)

where the Q-function is the tail distribution function of the standard normal distribution [37], and

d2(𝐃,θ,ϕ,θ~,ϕ~)𝐃𝐛(θ,ϕ)22|𝐛H(θ~,ϕ~)𝐃H𝐃𝐛(θ,ϕ)|2𝐃𝐛(θ~,ϕ~)2\displaystyle d^{2}(\mathbf{D},\theta,\phi,\widetilde{\theta},\widetilde{\phi})\triangleq\left\|\mathbf{D}\mathbf{b}(\theta,\phi)\right\|_{2}^{2}-\frac{|\mathbf{b}^{H}(\widetilde{\theta},\widetilde{\phi})\mathbf{D}^{H}\mathbf{D}\mathbf{b}(\theta,\phi)|^{2}}{\|\mathbf{D}\mathbf{b}(\widetilde{\theta},\widetilde{\phi})\|_{2}}
Proof.

See Appendix C. ∎

Refer to caption
(a) N=4N=4
Refer to caption
(b) N=8N=8
Refer to caption
(c) N=12N=12
Refer to caption
(d) N=16N=16
Figure 4: Contour plots of g(θ,ϕ)g(\theta,\phi) with different training lengths (Red cross represents the position of the first peak, purple asterisk represents the position of the second peak, and black square is the position of the actual AoA, AoD pair)
TABLE I: Peak values of g(θ,ϕ)g(\theta,\phi) over training length
Training Length Peak 1 Peak 2 Peak 1 - Peak 2
    N=4\;N=4 1.1156 1.1044 0.0112
    N=8\;N=8 8.7223 7.2658 1.4573
    N=12N=12 9.4986 5.8000 3.6986
    N=16N=16 12.3338 6.6508 5.6830

Theorem 2 indicates that PEP is inversely proportional to d2(𝐃,θ,ϕ,θ~,ϕ~)d^{2}(\mathbf{D},\theta,\phi,\widetilde{\theta},\widetilde{\phi}). To build the connection between PEP and training length NN, Proposition 1 is derived.

Proposition 1.

d2(𝐃N,θ,ϕ,θ~,ϕ~)d^{2}(\mathbf{D}_{N},\theta,\phi,\widetilde{\theta},\widetilde{\phi}) is monotonically increasing over training length NN, where 𝐃N=[𝐃N1H𝐝N]H\mathbf{D}_{N}=\left[\mathbf{D}_{N-1}^{H}\;\mathbf{d}_{N}\right]^{H}, i.e.,

d2(𝐃N,θ,ϕ,θ~,ϕ~)d2(𝐃N1,θ,ϕ,θ~,ϕ~)\displaystyle d^{2}(\mathbf{D}_{N},\theta,\phi,\widetilde{\theta},\widetilde{\phi})\geq d^{2}(\mathbf{D}_{N-1},\theta,\phi,\widetilde{\theta},\widetilde{\phi}) (36)

and the equality holds only if

𝐛H(θ~,ϕ~)𝐝N𝐝NH𝐛(θ~,ϕ~)𝐛H(θ~,ϕ~)𝐝N𝐝NH𝐛(θ,ϕ)=\displaystyle\frac{\mathbf{b}^{H}(\widetilde{\theta},\widetilde{\phi})\mathbf{d}_{N}\mathbf{d}_{N}^{H}\mathbf{b}(\widetilde{\theta},\widetilde{\phi})}{\mathbf{b}^{H}(\widetilde{\theta},\widetilde{\phi})\mathbf{d}_{N}\mathbf{d}_{N}^{H}\mathbf{b}({\theta},{\phi})}= 𝐛H(θ~,ϕ~)𝐃N1H𝐃N1𝐛(θ~,ϕ~)𝐛H(θ~,ϕ~)𝐃N1H𝐃N1𝐛(θ,ϕ)\displaystyle\frac{\mathbf{b}^{H}(\widetilde{\theta},\widetilde{\phi})\mathbf{D}_{N-1}^{H}\mathbf{D}_{N-1}\mathbf{b}(\widetilde{\theta},\widetilde{\phi})}{\mathbf{b}^{H}(\widetilde{\theta},\widetilde{\phi})\mathbf{D}_{N-1}^{H}\mathbf{D}_{N-1}\mathbf{b}(\theta,\phi)}
Proof.

See Appendix C. ∎

To verify Proposition 1, we plot the contour of g(θ,ϕ)g(\theta,\phi) with different training lengths in noiseless scenario in Figure 4. We set δ=1,θ=0,ϕ=0\delta=1,\theta=0,\phi=0. As can be seen that the gap between the first and the second peaks increases over training length, and the value of which is given in Table I. In addition, we can find that position of the first peak is invariant to training length and remains the same as the actual AoA, AoD pair, while position of the second peak varies. This verifies the uniqueness of ML based joint AoA, AoD estimation.

Remark 4.

According to Proposition 1, with random beamforming, the PEP probability of an erroneous estimate (θ~,ϕ~)(\widetilde{\theta},\widetilde{\phi}) being mistaken as the authentic parameters (θ,ϕ)(\theta,\phi) decreases almost surely over training length NN. Therefore, an appropriate NN can guarantee a satisfying accuracy of parameter estimation in scenarios with different SNR and interference levels.

V Interplay Between Positioning and Beam Training

In IRSs assisted mmWave MIMO system, BS/AP and IRSs, with their positions and array directions being known by all the MTs, can be seen as anchor nodes or beacons. The AoDs derived at beam training stage enable MT to estimate its own position. Hence, IRSs assisted mmWave MIMO system is endowed with the capability of high-accuracy localization. The acquired position information is not only a fringe benefit, but also in turn facilitates beam training. The interplay between beam training and indoor positioning is explained as follows. AoD estimate of the unblocked reliable links can yield the position of MT, and the position of MT, associated with anchor positions and anchor directions, can improve the precision of AoD/AoA estimation and assist in the decision of blockage indicator ζ\zeta.

V-A Reliability of The Estimated AoA, AoD Pair (θ^,ϕ^)(\hat{\theta},\hat{\phi})

To be concise, we treat BS/AP and IRSs as identical anchor nodes. The η=1\eta=1-st anchor is BS/AP and the rest NIRSN_{IRS} anchors (η=2,3,,NIRS+1\eta=2,3,\cdots,N_{IRS}+1) are IRSs. Although we have already obtained NIRS+1N_{IRS}+1 sets of path parameters (δ^η,θ^η,ϕ^η)(\hat{\delta}_{\eta},\hat{\theta}_{\eta},\hat{\phi}_{\eta}), we should be aware that the estimation is performed under the assumption that ζη=1\zeta_{\eta}=1. In practice, LoS and VLoS paths may suffer from blockage (namely ζη=0\zeta_{\eta}=0) by moving obstacles, which will jeopardize the estimation of (δ^η,θ^η,ϕ^η)(\hat{\delta}_{\eta},\hat{\theta}_{\eta},\hat{\phi}_{\eta}). Other than blockage, insufficient training length or low SNR may incur Error Type 2 of joint AoA and AoD estimation, which is defined in Section IV. C.

Therefore, it is essential to select the trustworthy parameters as the input of positioning algorithm. To this end, we introduce the metric – residual signal power ratio ϖη\varpi_{\eta}, to measure the reliability of (δ^η,θ^η,ϕ^η)(\hat{\delta}_{\eta},\hat{\theta}_{\eta},\hat{\phi}_{\eta}), i.e.,

ϖη=𝐲ηδ^η𝐃𝐛(θ^η,ϕ^η)22𝐲η22\displaystyle\varpi_{\eta}=\frac{\|\mathbf{y}_{\eta}-\hat{\delta}_{\eta}\mathbf{D}\mathbf{b}(\hat{\theta}_{\eta},\hat{\phi}_{\eta})\|_{2}^{2}}{\|\mathbf{y}_{\eta}\|_{2}^{2}} (38)

Recall that (δ^η,θ^η,ϕ^η)(\hat{\delta}_{\eta},\hat{\theta}_{\eta},\hat{\phi}_{\eta}) are obtained by minimizing 𝐲ηδη𝐃𝐛(θη,ϕη)22\|\mathbf{y}_{\eta}-\delta_{\eta}\mathbf{D}\mathbf{b}(\theta_{\eta},\phi_{\eta})\|_{2}^{2}, the yielded estimate (δ^η,θ^η,ϕ^η)(\hat{\delta}_{\eta},\hat{\theta}_{\eta},\hat{\phi}_{\eta}) will thus always result in 𝐲ηδη𝐃𝐛(θη,ϕη)22𝐲η22\|\mathbf{y}_{\eta}-\delta_{\eta}\mathbf{D}\mathbf{b}(\theta_{\eta},\phi_{\eta})\|_{2}^{2}\leq\|\mathbf{y}_{\eta}\|_{2}^{2}. Therefore, the range of ϖη\varpi_{\eta} is ϖη[0,1]\varpi_{\eta}\in[0,1].

Since the dominant component of mmWave channel is LoS path, the reconstructed signal δ^η𝐃𝐛(θ^η,ϕ^η)\hat{\delta}_{\eta}\mathbf{D}\mathbf{b}(\hat{\theta}_{\eta},\hat{\phi}_{\eta}) should account for the majority of the received signal 𝐲\mathbf{y} given that the parameters (δ^η,θ^η,ϕ^η)(\hat{\delta}_{\eta},\hat{\theta}_{\eta},\hat{\phi}_{\eta}) are accurate and residual signal power ratio ϖη\varpi_{\eta} should be smaller. Conversely, when blockage or Error Type 2 occurs, the parameters (δ^η,θ^η,ϕ^η)(\hat{\delta}_{\eta},\hat{\theta}_{\eta},\hat{\phi}_{\eta}) are heavily biased, and thus ϖη\varpi_{\eta} should be larger. Following the above heuristics, anchors’ reliability can be sorted.

V-B AoD Based Positioning

V-B1 Geometric Relationship Between AoDs and MT Position

We denote the index set of the reliable links as 𝒩\mathcal{N}, position coordinates of the η\eta-th anchor as 𝐩η\mathbf{p}_{\eta}, ULA direction of the η\eta-th anchor as 𝐞η\mathbf{e}_{\eta}. Note that 𝐩η,𝐞η\mathbf{p}_{\eta},\mathbf{e}_{\eta} are known by MTs. The direction vector of the LoS path between MT and the η\eta-th anchor is 𝐩𝐩η𝐩𝐩η2\frac{\mathbf{p}-\mathbf{p}_{\eta}}{\|\mathbf{p}-\mathbf{p}_{\eta}\|_{2}}, where 𝐩\mathbf{p} is the position of MT. Thus, the geometric relationship between AoDs and MT position is expressed as

ϕ^η=(𝐩𝐩η)T𝐞η𝐩𝐩η2ϕη(𝐩)+εη,η𝒩\displaystyle\hat{\phi}_{\eta}=\underbrace{\frac{(\mathbf{p}-\mathbf{p}_{\eta})^{T}\mathbf{e}_{\eta}}{\|\mathbf{p}-\mathbf{p}_{\eta}\|_{2}}}_{\phi_{\eta}(\mathbf{p})}+\varepsilon_{\eta},\;\;\eta\in\mathcal{N} (39)

where ϕ^η\hat{\phi}_{\eta} is the estimate of cosine AoD of the η\eta-th link derived in beam training stage, ϕη(𝐩)\phi_{\eta}(\mathbf{p}) is the actual cosine AoD that is dependent on position 𝐩\mathbf{p}, and εη\varepsilon_{\eta} is estimation error. For illustrative purposes, a typical scenario of IRSs assisted mmWave communications is shown in Figure 5.

V-B2 Taylor Series Method for AoD Based Positioning

In the ideal case, when εη=0\varepsilon_{\eta}=0, we have ϕ^η=ϕη(𝐩)\hat{\phi}_{\eta}=\phi_{\eta}(\mathbf{p}). The equation ϕη(𝐩)=(𝐩𝐩η)T𝐞η𝐩𝐩η2\phi_{\eta}(\mathbf{p})=\frac{(\mathbf{p}-\mathbf{p}_{\eta})^{T}\mathbf{e}_{\eta}}{\|\mathbf{p}-\mathbf{p}_{\eta}\|_{2}} corresponds to a right circular cone. There are 33 unknown variables of MT’s position coordinates, thus the minimum sufficient number of unblocked links to estimate the 3-D position of MT is |𝒩|=3|\mathcal{N}|=3, which is the intersection of the three right circular cones. As IRSs are cost-effective compared with conventional mmWave devices, they can be massively installed with minimal effort. We can expect that IRSs assisted mmWave with a large number of delicately placed IRSs is capable to guarantee |𝒩|3|\mathcal{N}|\geq 3 unblocked links with high probability.

In practice, estimation error εη\varepsilon_{\eta} cannot be zero. To estimate the 3-D position 𝐩=(x,y,z)T\mathbf{p}=(x,y,z)^{T}, least square criterion is adopted, i.e.,

min𝐩ξϕ(𝐩)η𝒩(ϕ^ηϕη(𝐩))2s.t.𝐩𝒮\displaystyle\begin{split}&\min_{\mathbf{p}}\;\;\xi_{\phi}(\mathbf{p})\triangleq\sum_{\eta\in\mathcal{N}}\left(\hat{\phi}_{\eta}-\phi_{\eta}(\mathbf{p})\right)^{2}\\ &s.t.\;\;\;\;\mathbf{p}\in\mathcal{S}\end{split} (40)

where 𝒮\mathcal{S} is the position range of indoor MT, e.g., the 3-D space of lecture hall. As the objective function ξϕ(𝐩)\xi_{\phi}(\mathbf{p}) is non-convex, it is non-trivial to derive the analytical solution to the problem. Fortunately, Taylor-series estimation method is capable to effectively solve a large class of position-location problems[38]. Starting with a rough initial guess, the Taylor-series estimation method iteratively improves its guess at each step by determining the local linear least-sum-squared-error correction[38]. In AoD based positioning, with an initial position guess 𝐩^\hat{\mathbf{p}}, the following approximation can be obtained through Taylor series expansion by neglecting mm-th order terms (m2m\geq 2), i.e.,

ϕη(𝐩)ϕη(𝐩^)+(𝐩𝐩^)Tϕη(𝐩)𝐩|𝐩=𝐩^\displaystyle\phi_{\eta}(\mathbf{p})\approx\phi_{\eta}(\hat{\mathbf{p}})+(\mathbf{p}-\hat{\mathbf{p}})^{T}\frac{\partial\phi_{\eta}({\mathbf{p}})}{\partial{\mathbf{p}}}\bigg{|}_{\mathbf{p}=\hat{\mathbf{p}}} (41)

where the first order derivative is denoted as

ϕη(𝐩)𝐩=𝐩𝐩η2𝐞η(𝐩𝐩η)T𝐞η𝐩𝐩η𝐩𝐩η2𝐩𝐩η22\displaystyle\frac{\partial\phi_{\eta}(\mathbf{p})}{\partial\mathbf{p}}=\frac{\|\mathbf{p}-\mathbf{p}_{\eta}\|_{2}\mathbf{e}_{\eta}-(\mathbf{p}-\mathbf{p}_{\eta})^{T}\mathbf{e}_{\eta}\frac{\mathbf{p}-\mathbf{p}_{\eta}}{\|\mathbf{p}-\mathbf{p}_{\eta}\|_{2}}}{\|\mathbf{p}-\mathbf{p}_{\eta}\|_{2}^{2}} (42)

Substituting (41) into (39), we have

ϕ^ηϕη(𝐩^)ϕη(𝐩)𝐩T|𝐩=𝐩^(𝐩𝐩^)+εη,η𝒩\displaystyle\hat{\phi}_{\eta}-\phi_{\eta}(\hat{\mathbf{p}})\approx\frac{\partial\phi_{\eta}({\mathbf{p}})}{\partial{\mathbf{p}}^{T}}\big{|}_{\mathbf{p}=\hat{\mathbf{p}}}(\mathbf{p}-\hat{\mathbf{p}})+\varepsilon_{\eta},\;\;\eta\in\mathcal{N} (43)

Its matrix form is written as

Δϕ𝐀TΔ𝐩+𝜺\displaystyle\Delta_{\bm{\phi}}\approx\mathbf{A}^{T}\Delta_{\mathbf{p}}+\bm{\varepsilon} (44)

where Δ𝐩=𝐩𝐩^\Delta_{\mathbf{p}}=\mathbf{p}-\hat{\mathbf{p}}, 𝜺=[ε1,,ε|𝒩|]T\bm{\varepsilon}=[\varepsilon_{1},\cdots,\varepsilon_{|\mathcal{N}|}]^{T}, and

Δϕ=[ϕ^1ϕ1(𝐩^),,ϕ^|𝒩|ϕ|𝒩|(𝐩^)]T\displaystyle\Delta_{\bm{\phi}}=[\hat{\phi}_{1}-\phi_{1}(\hat{\mathbf{p}}),\cdots,\hat{\phi}_{|\mathcal{N}|}-\phi_{|\mathcal{N}|}(\hat{\mathbf{p}})]^{T} (45a)
𝐀=[ϕ1(𝐩)𝐩|𝐩=𝐩^,,ϕ|𝒩|(𝐩)𝐩|𝐩=𝐩^]\displaystyle\mathbf{A}=\left[\frac{\partial\phi_{1}({\mathbf{p}})}{\partial{\mathbf{p}}}\big{|}_{\mathbf{p}=\hat{\mathbf{p}}},\cdots,\frac{\partial\phi_{|\mathcal{N}|}({\mathbf{p}})}{\partial{\mathbf{p}}}\big{|}_{\mathbf{p}=\hat{\mathbf{p}}}\right] (45b)

On the basis of (44), the Taylor series method for AoD based positioning is summarized in Algorithm 1.

Refer to caption
Figure 5: A typical scenario of IRS assisted mmWave communications
Initialization: Generate an initial guess of MT position 𝐩^\hat{\mathbf{p}}
Input: The estimate of cosine AoDs of a selected set of reliable links, i.e., ϕ^η,(η𝒩)\hat{\phi}_{\eta},(\eta\in\mathcal{N}), positions of anchors 𝐩η,(η𝒩)\mathbf{p}_{\eta},(\eta\in\mathcal{N}), directions of ULA on the anchors 𝐞η,(η𝒩)\mathbf{e}_{\eta},(\eta\in\mathcal{N}), and iteration stopping parameter ϵ\epsilon.
Repeat
1. With the given 𝐩^\hat{\mathbf{p}}, generate ϕη(𝐩^),(η𝒩)\phi_{\eta}(\hat{\mathbf{p}}),(\eta\in\mathcal{N}) according to Eq. (45a) and 𝐀\mathbf{A} according to Eq.(45b).
2. Find the least square estimate of Δ𝐩\Delta_{\mathbf{p}}, i.e.,
Δ^𝐩=(𝐀𝐀T)1𝐀Δϕ\displaystyle\hat{\Delta}_{\mathbf{p}}=(\mathbf{A}\mathbf{A}^{T})^{-1}\mathbf{A}\Delta_{\bm{\phi}}
3. Update 𝐩^\hat{\mathbf{p}}, i.e., 𝐩^𝐩^+Δ^𝐩\hat{\mathbf{p}}\leftarrow\hat{\mathbf{p}}+\hat{\Delta}_{\mathbf{p}}.
Until Δ^𝐩2<ϵ\|\hat{\Delta}_{\mathbf{p}}\|_{2}<\epsilon.
Algorithm 1 Taylor Series Method For AoD Based Positioning

V-B3 Reliable Link Set 𝒩\mathcal{N}

An intuitive method to construct the set of reliable links is to select |𝒩||\mathcal{N}| links with the |𝒩||\mathcal{N}| smallest ϖη\varpi_{\eta} to avoid unreliable AoDs resulted from blockage and Error Type 2 of joint AoA, AoD estimation. However, it is non-trivial to determine the exact value of |𝒩||\mathcal{N}|. Although |𝒩|=3|\mathcal{N}|=3 anchors are theoretically sufficient to yield the position of MT in the ideal noiseless case, more anchors are desirable in practice for positioning algorithm to enhance the accuracy of position estimation.

To utilize as many reliable anchors as possible, the following strategy is proposed to iteratively construct the reliable link set 𝒩\mathcal{N}. Firstly, we sort the anchors in ascending order according to residual signal power ratio ϖη\varpi_{\eta}. Then, starting from |𝒩|=3|\mathcal{N}|=3 anchors, we iteratively increase the number of anchors used for positioning in Algorithm 1, and by the end of each iteration, we calculate the cost ξϕ(𝐩)|𝒩|{\frac{\xi_{\phi}(\mathbf{p})}{|\mathcal{N}|}}, where ξϕ(𝐩)\xi_{\phi}(\mathbf{p}) is the squared error of least square method in Eq. (40) and |𝒩||\mathcal{N}| is the number of selected anchors. Finally, we select the output corresponding to the largest |𝒩||\mathcal{N}| that satisfies ξϕ(𝐩)|𝒩|ξth{\frac{\xi_{\phi}(\mathbf{p})}{|\mathcal{N}|}}\leq\xi_{th} as the estimated position of MT, where ξth\xi_{th} is a preset threshold444An appropriate ξth\xi_{th} can be obtained by carrying out a great number of Monte Carlo experiments offline. In our numerical experiment, we find that ξth=0.005\sqrt{\xi_{th}}=0.005 results in a good performance. .

V-C Parameter Estimation With The Aid of MT Position

With the estimated position 𝐩^\hat{\mathbf{p}}, channel parameters can be refined according to the geometric relationship.

V-C1 AoD Refinement

With 𝐩^\hat{\mathbf{p}}, AoD estimation is updated by

ϕη=(𝐩^𝐩η)T𝐞η𝐩^𝐩η2,η{1,2,,NIRS+1}\displaystyle{\phi}_{\eta}^{\star}=\frac{(\hat{\mathbf{p}}-\mathbf{p}_{\eta})^{T}\mathbf{e}_{\eta}}{\|\hat{\mathbf{p}}-\mathbf{p}_{\eta}\|_{2}},\;\;\eta\in\{1,2,\cdots,N_{IRS}+1\} (46)

V-C2 AoA Refinement

To estimate AoA, the direction of ULA in MT’s side is essential. Therefore, we firstly find the least square estimate of 𝐞MT\mathbf{e}_{MT} by solving the following optimization problem.

min𝐞MTξθ(𝐞MT)η𝒩((𝐩^𝐩η)T𝐞MT𝐩^𝐩η2θ^η)2s.t.𝐞MT2=1\displaystyle\begin{split}&\min_{\mathbf{e}_{MT}}\;\;\xi_{\theta}(\mathbf{e}_{MT})\triangleq\sum_{\eta\in\mathcal{N}}\left(\frac{(\hat{\mathbf{p}}-\mathbf{p}_{\eta})^{T}\mathbf{e}_{MT}}{\|\hat{\mathbf{p}}-\mathbf{p}_{\eta}\|_{2}}\ominus\hat{\theta}_{\eta}\right)^{2}\\ &s.t.\;\;\;\;\|\mathbf{e}_{MT}\|_{2}=1\end{split} (47)

Note that 𝒩\mathcal{N} can be derived in the iterative process according to Section V. A. 3.

The objective function of (47) can be rewritten in matrix form as

ξθ(𝐞MT)=𝐏T𝐞MT𝜽^22\displaystyle\xi_{\theta}(\mathbf{e}_{MT})=\|\mathbf{P}^{T}\mathbf{e}_{MT}\ominus\hat{\bm{\theta}}\|_{2}^{2} (48)

where 𝐏=[𝐩^𝐩η1𝐩^𝐩η12𝐩^𝐩η|𝒩|𝐩^𝐩η|𝒩|2]\mathbf{P}=\left[\frac{\hat{\mathbf{p}}-\mathbf{p}_{{\eta}_{1}}}{\|\hat{\mathbf{p}}-\mathbf{p}_{{\eta}_{1}}\|_{2}}\cdots\frac{\hat{\mathbf{p}}-\mathbf{p}_{{\eta}_{|\mathcal{N}|}}}{\|\hat{\mathbf{p}}-\mathbf{p}_{{\eta}_{|\mathcal{N}|}}\|_{2}}\right], 𝜽^=[θ^η1,,θ^η|𝒩|]T\hat{\bm{\theta}}=[\hat{\theta}_{{\eta}_{1}},\cdots,\hat{\theta}_{{\eta}_{|\mathcal{N}|}}]^{T} and 𝒩={η1,,η|𝒩|}\mathcal{N}=\{\eta_{1},\cdots,\eta_{|\mathcal{N}|}\}. The optimization problem can be solved via projected gradient descent method [39], in which we iteratively update 𝐞MT\mathbf{e}_{MT} as follows.

𝐝MT,i+1=𝐞MT,iλξθ(𝐞MT)𝐞MT|𝐞MT=𝐞MT,i𝐞MT,i+1=𝐝MT,i+1𝐝MT,i+12\displaystyle\begin{split}&\mathbf{d}_{MT,i+1}=\mathbf{e}_{MT,i}-\lambda\frac{\partial\xi_{\theta}(\mathbf{e}_{MT})}{\partial\mathbf{e}_{MT}}\Big{|}_{\mathbf{e}_{MT}=\mathbf{e}_{MT,i}}\\ &\mathbf{e}_{MT,i+1}=\frac{\mathbf{d}_{MT,i+1}}{\|\mathbf{d}_{MT,i+1}\|_{2}}\end{split} (49)

where λ\lambda is step size and ξθ(𝐞MT)𝐞MT=𝐏(𝐏T𝐞MT𝜽^)\frac{\partial\xi_{\theta}(\mathbf{e}_{MT})}{\partial\mathbf{e}_{MT}}=\mathbf{P}\left(\mathbf{P}^{T}\mathbf{e}_{MT}\ominus\hat{\bm{\theta}}\right).

Finally, with 𝐞^MT\hat{\mathbf{e}}_{MT} yielded by projected gradient descent method, AoA estimation is updated by

θη=(𝐩^𝐩η)T𝐞^MT𝐩^𝐩η2\displaystyle{\theta}_{\eta}^{\star}=\frac{(\hat{\mathbf{p}}-\mathbf{p}_{\eta})^{T}\hat{\mathbf{e}}_{MT}}{\|\hat{\mathbf{p}}-\mathbf{p}_{\eta}\|_{2}} (50)

V-C3 Estimation of Blockage

As a prerequisite of our proposed blockage estimation method, we firstly introduce the estimation of δη\delta_{\eta}, which is dependent on the values of (θη,ϕη)(\theta_{\eta},\phi_{\eta}). Note that the parameter estimate obtained in Section IV by ML estimation is under the assumption that ζη=1\zeta_{\eta}=1, while it is probable that ζη=0\zeta_{\eta}=0 in fact. It would be misleading in the estimation of δη\delta_{\eta} by directly substituting (θ^η,ϕ^η)(\hat{\theta}_{\eta},\hat{\phi}_{\eta}) into (20). Therefore, we will use the estimates of AoA and AoD refined by position to assist the estimation of δη\delta_{\eta} and ζη\zeta_{\eta}, as they are cross verified by multiple anchors and are thus more reliable.

Substituting (θη,ϕη)({\theta}_{\eta}^{\star},{\phi}_{\eta}^{\star}) into (20), we have

δη\displaystyle{\delta}_{\eta}^{\star} =𝐛H(θη,ϕη)𝐃H𝐲𝐃𝐛(θη,ϕη)22\displaystyle=\frac{\mathbf{b}^{H}(\theta_{\eta}^{\star},\phi_{\eta}^{\star})\mathbf{D}^{H}\mathbf{y}}{\|\mathbf{D}\mathbf{b}(\theta_{\eta}^{\star},\phi_{\eta}^{\star})\|_{2}^{2}}
=ζηδη𝐛H(θη,ϕη)𝐃H𝐃𝐛(θη,ϕη)+𝐛H(θη,ϕη)𝐃H𝐧𝐃𝐛(θ,ϕ)22\displaystyle=\frac{\zeta_{\eta}\delta_{\eta}\mathbf{b}^{H}(\theta_{\eta}^{\star},\phi_{\eta}^{\star})\mathbf{D}^{H}\mathbf{D}\mathbf{b}(\theta_{\eta},\phi_{\eta})+{\mathbf{b}^{H}(\theta_{\eta}^{\star},\phi_{\eta}^{\star})\mathbf{D}^{H}\mathbf{n}}}{\|\mathbf{D}\mathbf{b}(\theta^{\star},\phi^{\star})\|_{2}^{2}}
=ζηδηf(θη,ϕη)+n¯\displaystyle=\zeta_{\eta}\delta_{\eta}f({\theta}_{\eta}^{\star},{\phi}_{\eta}^{\star})+\bar{n} (51)

where f(θη,ϕη)𝐛H(θη,ϕη)𝐃H𝐃𝐛(θη,ϕη)𝐃𝐛(θη,ϕη)22f(\theta_{\eta}^{\star},\phi_{\eta}^{\star})\triangleq\frac{\mathbf{b}^{H}(\theta_{\eta}^{\star},\phi_{\eta}^{\star})\mathbf{D}^{H}\mathbf{D}\mathbf{b}(\theta_{\eta},\phi_{\eta})}{\|\mathbf{D}\mathbf{b}(\theta_{\eta}^{\star},\phi_{\eta}^{\star})\|_{2}^{2}}, n¯𝒞𝒩(0,σn¯2)\bar{n}\sim\mathcal{CN}(0,\sigma^{2}_{\bar{n}}), and σn¯2=σ𝐰¯2+σ𝝂2𝐃𝐛(θη,ϕη)22\sigma^{2}_{\bar{n}}=\frac{\sigma^{2}_{\bar{\mathbf{w}}}+\sigma^{2}_{\bm{\nu}}}{\|\mathbf{D}\mathbf{b}(\theta_{\eta}^{\star},\phi_{\eta}^{\star})\|_{2}^{2}} (or σn¯2=σ𝐰¯2+σ𝝂12+σ𝝂22𝐃𝐛(θη,ϕη)22\sigma^{2}_{\bar{n}}=\frac{\sigma^{2}_{\bar{\mathbf{w}}}+\sigma^{2}_{\bm{\nu}_{1}}+\sigma^{2}_{\bm{\nu}_{2}}}{\|\mathbf{D}\mathbf{b}(\theta_{\eta}^{\star},\phi_{\eta}^{\star})\|_{2}^{2}}). Thus, we have

δη={δηf(θη,ϕη)+n¯,ζη=1n¯,ζη=0\displaystyle{\delta}_{\eta}^{\star}=\left\{\begin{array}[]{cc}\delta_{\eta}f({\theta}_{\eta}^{\star},{\phi}_{\eta}^{\star})+\bar{n},&\;\;\zeta_{\eta}=1\\ \bar{n},&\;\;\zeta_{\eta}=0\end{array}\right. (54)

Theoretically, with the knowledge of δη\delta_{\eta}, f(θη,ϕη)f({\theta}_{\eta}^{\star},{\phi}_{\eta}^{\star}) and σn¯2\sigma^{2}_{\bar{n}}, the decision of ζη\zeta_{\eta} can be made by comparing the probabilities of δη{\delta}_{\eta}^{\star} conditioned on ζη=0\zeta_{\eta}=0 and ζη=1\zeta_{\eta}=1. However, accurate estimation of f(θη,ϕη)f({\theta}_{\eta}^{\star},{\phi}_{\eta}^{\star}) and σn¯2\sigma^{2}_{\bar{n}} is challenging in practice. With respect to δη\delta_{\eta}, its amplitude |δη||\delta_{\eta}| is estimable from the distance of MT, while its phase cannot be accurately estimated from the distance, as it is very sensitive to distance estimation error and may be affected by random initial phase of local oscillator in transmitter side.

Alternatively, a heuristic method is proposed to decide blockage indicator by comparing the pathloss estimated from (θη,ϕη)(\theta^{\star}_{\eta},\phi^{\star}_{\eta}) and pathloss estimated from 𝐩^\hat{\mathbf{p}}, i.e.,

|10log101|δη|210log101|δη(𝐩^)|2|ζη=1ζη=0PLth\displaystyle\left|10\log_{10}\frac{1}{|{{\delta}}_{\eta}^{\star}|^{2}}-10\log_{10}\frac{1}{|{\delta}_{\eta}(\hat{\mathbf{p}})|^{2}}\right|\begin{array}[]{c}\zeta_{\eta}^{\star}=1\\ \lesseqgtr\\ \zeta_{\eta}^{\star}=0\end{array}PL_{th} (58)

where

|δη(𝐩^)|={|PTxλej2πdBM4πdBM|,η=1|ξPTxNBλej2π(dBRη+dRηM)4π(dBRη+dRηM)|,η=2,,NIRS+1\displaystyle|{\delta}_{\eta}(\hat{\mathbf{p}})|=\left\{\begin{array}[]{c}\left|\frac{\sqrt{P_{Tx}}\lambda e^{-j2\pi d_{BM}}}{4\pi d_{BM}}\right|,\qquad\qquad\;\;\eta=1\\ \left|\frac{\sqrt{\xi P_{Tx}N_{B}}\lambda e^{-j2\pi(d_{BR_{\eta}}+d_{R_{\eta}M})}}{4\pi(d_{BR_{\eta}}+d_{R_{\eta}M})}\right|,\eta=2,\cdots,N_{IRS+1}\end{array}\right. (60)

BS/AP to MT distance dBMd_{BM} and IRS to MT distance dRηMd_{R_{\eta}M} are attainable from 𝐩^\hat{\mathbf{p}}, and PLthPL_{th} is the preset threshold of pathloss distance (In numerical simulations, we set PLth=6PL_{th}=6 dB ).

VI Numerical Results

In this section, we numerically study the performance of the proposed joint beam training and positioning scheme for IRSs assisted mmWave MIMO.

VI-A Settings of Numerical Experiment

TABLE II: Simulation Parameters
Parameter Value
Operating frequency 2828 GHz
Noise power 84-84 dBm
Position of IRSs
(5,10, 3.5)(5,\;-10,\;3.5), (5, 10, 3.5)(5,\;10,\;3.5),
(0,10, 3.5)(0,\;-10,\;3.5), (0, 10, 3.5)(0,\;10,\;3.5),
(5,10, 3.5)(-5,\;-10,\;3.5), (5, 10, 3.5)(-5,\;10,\;3.5),
(10, 5, 3.5)(-10,\;5,\;3.5), (10, 5, 3.5)(10,\;5,\;3.5),
(10, 0, 3.5)(-10,\;0,\;3.5), (10, 0, 3.5)(10,\;0,\;3.5),
(10,5, 3.5)(-10,\;-5,\;3.5), (10,5, 3.5)(10,\;-5,\;3.5)
Position of BS/AP (0, 0, 5)(0,\;0,\;5)
Direction of IRSs’ ULA
(0, 0, 1)(0,\;0,\;1), (1, 0, 0)(1,\;0,\;0), (0, 1, 0)(0,\;1,\;0),
(0, 1, 0)(0,\;1,\;0), (0, 0, 1)(0,\;0,\;1), (1, 0, 0)(1,\;0,\;0),
(0, 1, 0)(0,\;1,\;0), (0, 1, 0)(0,\;1,\;0), (0, 0, 1)(0,\;0,\;1),
(1, 0, 0)(1,\;0,\;0), (0, 1, 0)(0,\;1,\;0), (0, 1, 0)(0,\;1,\;0)
Direction of BS/AP’s ULA (22,22, 0)(\frac{\sqrt{2}}{2},\;\frac{\sqrt{2}}{2},\;0)
Reflection loss 10log10ξ-10\log_{10}\xi 1313 dB
Size of obstacles 0.6×0.4×1.70.6\times 0.4\times 1.7 meters
Altitude of MT [1.2, 1.4][1.2,\;1.4] meters
Number of users 20,50,10020,50,100
Number of NLoS paths 0 (only in Figure 7), 44
Number of antennas in BS/AP (NBN_{B}) 1616
Number of antennas in MT (NMN_{M}) 1616
Number of reflectors in IRS (NRN_{R}) 1616

We assume that IRSs-assisted mmWave MIMO system is deployed in an indoor scenario, e.g., lecture hall, and the length, width and height of which are 2020 meters, 2020 meters and 55 meters, respectively. The rest system parameters are listed in Table II. For simplicity, we assume that AoA, AoD of NLoS paths follow uniform distribution, i.e., θBM,l,ϕBM,lU(0,2π),l=2,,L\theta_{BM,l},\phi_{BM,l}\sim U(0,2\pi),l=2,...,L, and path coefficient follows complex Gaussian distribution, i.e., δl𝒞𝒩(0,σl2),l=2,,L\delta_{l}\sim\mathcal{CN}(0,\sigma_{l}^{2}),l=2,...,L and 10log10δ12σl2=2010\log_{10}\frac{\delta_{1}^{2}}{\sigma_{l}^{2}}=20 dB. We model user (MT holder) as a cube with its length, width and height being 0.60.6m, 0.40.4m and 1.71.7m, respectively. We denote position of the MT held by user as (x,y,z)(x,y,z), where x,y,zx,y,z follow uniform distribution, i.e., x,yU(10,10)x,y\sim U(-10,10) and zU(1.2,1.4)z\sim U(1.2,1.4). Users are uniformly distributed in the lecture hall under the non-overlapping constraint. For a typical MT, the other MT holders are its potential obstacles, and thus the blockage probability increases with user density.

VI-B Relationship Between User Density and Blockage Probability

To gain insights into the relationship between user density and blockage probability, Figure 6 is presented where there are 1212 IRSs deployed, which means a total of 1313 LoS/VLoS links are available. From the Figure 6, we can see that when the number of MTs is 2020, more than 50%50\% of channel realizations experience no link blockage, the largest number of blocked links is 44, and the percentage of which is less than 5%5\%; when the number of MTs is 5050, more than 80%80\% of channel realizations experience less than 33 blocked links, the largest number of blocked links is 77, and the percentage of which is less than 1%1\%; when the number of MTs is 100100, more than 80%80\% of channel realizations experience less than 55 blocked links, the largest number of blocked links is 99, and the percentage of which is almost negligible. Note that when there exists at least 11 unblocked link, uninterrupted communication over mmWave band can be guaranteed, and when there exist at least 33 unblocked links, positioning algorithm can be performed to locate MT and meanwhile enhance parameter estimation.

Refer to caption
Figure 6: Blockage rate with different user densities

VI-C Performance of Beam Training with Random Beamforming

As the performance of joint beam training and positioning is fundamentally determined by the decomposed Sub-problem 1 for BS/AP-MT link and Sub-problem 2 for BS/AP-IRS-MT links, whose unified signal model is Eq. (14), we start numerical evaluation from the sub-problems, i.e., the beam training scheme with random beamforming proposed in Section IV. The blockage indicator ζ\zeta of Eq. (14) is set as ζ=1\zeta=1, and the random variable 𝐧=𝐰+𝝂\mathbf{n}=\mathbf{w}+\bm{\nu}, where 𝐰\mathbf{w} is the noise term and 𝝂\bm{\nu} is the interference term. The noise term 𝐰𝒞𝒩(𝟎,σ𝐰2𝐈)\mathbf{w}\sim\mathcal{CN}(\mathbf{0},\sigma_{\mathbf{w}}^{2}\mathbf{I}) and σ𝐰2\sigma_{\mathbf{w}}^{2} is 86-86 dBm according to Table II. The interference term 𝝂\bm{\nu} is propagated via NLoS paths, and its entries are represented in Eq. (III-A) for BS/AP-MT link and in Eq. (13) for BS/AP-IRS-MT links. A notable difference between 𝝂\bm{\nu} and 𝐰\mathbf{w} is that the power of 𝝂\bm{\nu} is proportional to transmit power. Since Sub-problem 1 and Sub-problem 2 are mathematically equivalent, we carry out the numerical study of beam training with random beamforming in BS/AP-MT link in this subsection.

Refer to caption
Figure 7: MSE performance of AoA/AoD estimated by random beamforming based beam training in LoS channel model and LoS + NLoS channel model, where NLoS path number is 44, training length is N=8,16N=8,16.

In Figure 7, we use mean squared error (MSE) of the estimated AoA/AoD as the performance metric, which is defined as MSE(θ^)𝔼((θ^θ)2),MSE(ϕ^)𝔼((ϕ^ϕ)2)MSE(\hat{\theta})\triangleq\mathbb{E}\left((\hat{\theta}\ominus\theta)^{2}\right),MSE(\hat{\phi})\triangleq\mathbb{E}\left((\hat{\phi}\ominus\phi)^{2}\right), where (θ^,ϕ^)(\hat{\theta},\hat{\phi}) are the estimated AoA and AoD of the LoS path, and (θ,ϕ)({\theta},{\phi}) are the exact values of AoA and AoD of the LoS path. The proposed beam training scheme is characterized by two steps, namely random beamforming and ML estimation. Random beamforming is performed to measure mmWave channel, and ML estimation is performed to estimate AoA and AoD of the LoS path based on channel measurements. To study the accuracy of ML estimator, we use Cramér-Rao bound555Since the estimation of (θ,ϕ)(\theta,\phi) is part of the joint estimation of (δ,θ,ϕ)(\delta,\theta,\phi), CRBs of θ\theta and ϕ\phi are obtained as the last two diagonal elements of the inverse of Fisher information matrix w.r.t. (δ,θ,ϕ)(\delta,\theta,\phi). The detailed derivation of CRB is omitted, as it follows the standard procedure.(CRB) in the ideal LoS channel (where 𝝂=𝟎\bm{\nu}=\mathbf{0}) as the benchmark. It can be seen from Figure 7(a) that, when the training length is N=16N=16, from 0 dBm to 66 dBm the empirical MSE of both AoA and AoD in LoS mmWave channel is significantly higher than CRB, but the performance gap gradually turns to be marginal from 66 dBm to above. It indicates that, from 0 dBm to 66 dBm ML estimation of (θ,ϕ)(\theta,\phi) experiences Error Type 2 as mentioned in Section IV. C, in which the estimated AoA and AoD pair are far apart from their authentic values, and from 66 dBm to above only Error Type 1 happens, in which the estimation error is mild and tightly lower bounded by CR bound. It validates the effectiveness of ML estimator in relative high SNR regimes. In practice, NLoS path’s impacts on beam training cannot be overlooked. In the numerical simulation of beam training in LoS + NLoS mmWave channel, we set the number of NLoS paths as 44. As can be seen from Figure 7(a) that, from 0 dBm to 66 dBm the empirical MSE of AoA and AoD in LoS + NLoS channel is slightly worse than that in LoS channel, which indicates that noise is the main detrimental factor. From 99 dBm to above, the MSE curves turn to be flat, and this is because the impact of NLoS path, namely 𝝂\bm{\nu}, does not diminish over SNR. A notable point is that MSE from 99 dBm to above is around 10510^{-5}, which is satisfactorily accurate. To study the impact of training length, MSE performance comparison is also performed when N=8N=8 in Figure 7(b). A remarkable difference from N=16N=16 case is that the flat curves of empirical MSE start from 1818 dBm, and the values of which are around 10410^{-4}, which indicates that the impact of noise in N=8N=8 case is more significant than N=16N=16 case and thus verifies the benefits of increasing training length.

Refer to caption
(a) When transmit power is 20-20 dBm
Refer to caption
(b) When transmit power is 0 dBm
Figure 8: Performance (misalignment rate) comparison between the proposed beam training with random beamforming and the existing beam training schemes with directional beamforming

In Figure 8, we compare the performance of the proposed random beamforming based beam training scheme with the existing directional beamforming based beam training schemes [40, 28, 27]. Directional beamforming is used for beam training in a more straightforward way than random beamforming, in which the candidate beams pairs are explored through exhaustive/hierachical beam sweeping, and then the strongest beam pair is selected based on the received power of the candidates. Directional beams are selected from a pre-configured finite set corresponding to quantized angles, e.g., discrete Fourier Transform (DFT) codebook. To compare the performance of random beamforming and directional beamforming in beam training, we use beam misalignment rate as the performance metric, which measures the probability that beam training fails to find the strongest beam pair. For random beamforming based beam training, we select the best beam pair by quantizing the estimated AoA/AoD to its nearest codeword. Two types of directional beamforming techniques are used as the benchmarks, namely, exhaustive beam sweeping [40] and hierarchical beam sweeping [27, 28]. Exhaustive beam sweeping explores all the possible beam pairs and its training length is N=NBNM=256N=N_{B}N_{M}=256; Hierarchical beam sweeping iteratively narrows down the direction search region and results in logarithmic training length, i.e., N=4log2min(NB,NM)+2log2max(NB,NM)min(NB,NM)=16N=4\log_{2}\min(N_{B},N_{M})+2\log_{2}\frac{\max(N_{B},N_{M})}{\min(N_{B},N_{M})}=16. By contrast, random beamforming is flexible with training length. In the simulation, we set the training length of random beamforming as N=8,16,,248,256N={8,16,\cdots,248,256} to investigate the impact of training length. We compare the performance of random beamforming based beam training with directional beamforming based beam training at two SNR levels, i.e., PTx=20P_{Tx}=-20 dBm, 0 dBm, in LoS + NLoS channel model. From Figure 8(a), it can be seen that, when PTx=20P_{Tx}=-20 dBm, the misalignment rate of exhaustive beam sweeping is 0.6060.606, and the misalignment rate of hierarchical beam sweeping is 0.9660.966. The beam misalignment rate of random beamforming is 0.9910.991 when training length is N=8N=8, and it decreases over training length and turns to be 0.5590.559 when training length is N=256N=256. It verifies the conclusion of Theorem 2 and indicates that random beamforming with an appropriate training length could achieve better performance than directional beamforming. From Figure 8(b), it can be seen that, when PTx=0P_{Tx}=0 dBm, the misalignment rate of exhaustive beam sweeping is 0.0310.031, and the misalignment rate of hierarchical beam sweeping is 0.2450.245. As for random beamforming, the performance improvement over training length becomes more significant. Specifically, the misalignment rate is 0.4640.464 when training length is N=8N=8 and sharply decreases to 0.1440.144 when N=16N=16, and finally it converges to 0.0230.023 when N=256N=256. It is noteworthy that the performance enhancement brought by increasing training length is marginal from N=32N=32. Therefore, the training length of random beamforming can be set adaptively according to SNR condition to achieve a satisfactory performance with moderate training cost.

VI-D Performance of Joint Beam Training and Positioning for IRSs Assisted MmWave Communications

Refer to caption
Figure 9: Accuracy of indoor positioning

In this subsection, we study the performance of joint beam training and positioning for IRSs assisted mmWave communications. The configurations of IRSs, BS and MT, which determine the path gain, AoA and AoD of the LoS path, are given in Table. II. In addition, we set the number of users as 100100, which determines the blockage indicator, and we also set the number of NLoS as 44.

In Figure 9, the accuracy of indoor positioning of IRSs assisted mmWave MIMO is studied in terms of root mean squared error (RMSE). When the training length is N=16N=16 for each LoS/VLoS path, RMSE is 0.130.13 meter at 0 dBm, and converges to 0.020.02 meter from 1515 dBm to 3030 dBm, which indicates that, with the aid of IRSs, mmWave MIMO achieves centimeter accuracy in indoor scenario. When the training length is N=8N=8 for each LoS/VLoS path, RMSE is 0.450.45 meter at 0 dBm, and converges to 0.040.04 meter from 1515 dBm to 3030 dBm. Considering the reduced training length, the accuracy limit of 0.040.04 meter for N=8N=8 case in high SNR regimes is acceptable. However, the positioning accuracy of N=8N=8 case is not satisfying in low SNR regimes. Through case analysis, we find that the correlation between residual ratio ϖη\varpi_{\eta} and the accuracy of (θ^η,ϕ^η)(\hat{\theta}_{\eta},\hat{\phi}_{\eta}) is weakened by the increased level of noise and the reduced training length. In other words, a small ϖη\varpi_{\eta} may misleadingly correspond to an unreliable anchor node, and thus results in inaccurate estimate of position. To improve the accuracy, a more sophisticated positioning algorithm that iteratively sorts the reliability will be developed in the future.

Refer to caption
Figure 10: Error rate of blockage estimation

In Figure 10, the error rate of blockage estimation is studied. For the purpose of comparison, two methods are adopted as benchmarks, which are (1) received power based blockage estimation and (2) residual ratio based blockage estimation. For (1), it is straightforward that unblocked links have significantly higher received signal level than that of blocked links. However, as power level is an absolute quantity, without the prior knowledge such as the likely range of received power, it is possible to mistake the unblocked link between MT and faraway anchor as a blocked link. In contrast, residual ratio in (2) is a relative quantity, which is not dependant on the likely range of received power. However, the optimal threshold that is essential for blockage estimation is unavailable either. Therefore, we adopt the K-means clustering method to partition the 1313 observations into 22 clusters, i.e., blocked links and unblocked links. When the training length is N=16N=16, we can see from the figure that position aided blockage estimation is slightly erroneous merely at 0 dBm and becomes errorless when transmit power increases. With respect to the benchmark methods, although the estimation accuracy of residual ratio based K-means clustering method is worse than position aided blockage estimation, its error rate is below 0.10.1, which is acceptable. By contrast, the estimation error rate of received power based K-means clustering method is nearly 0.50.5, which indicates that the estimation is almost random. When the training length reduces to N=8N=8, the superiority of position aided blockage estimation is more remarkable, and this is owing to the cross-validation mechanism enabled by location information.

Refer to caption
Figure 11: MSE performance of AoA/AoD refined by location information

In Figure 11, MSE performance evaluation of AoA/AoD refined by location information is performed. To this end, we intentionally filter out the blocked links, and reserve AoA/AoD estimate of the unblocked links. As can be seen that AoA/AoD refined by location information is more accurate than AoA/AoD estimated by beam training with random beamforming. This is because location information is derived by multiple anchors, and AoA/AoD refinement according to geometric relationship means that the estimation is cross verified. It is noteworthy that the performance enhancement is more significant when the training length is N=8N=8 for each LoS/VLoS path, from which we find the potential to reduce training length of beam training with the aid of location information. Another notable point is that AoA refined by location information is always worse than AoD refined by location information. This is because the direction vector 𝐞^MT\hat{\mathbf{e}}_{MT} is derived from estimation in (47), while the direction vectors of anchors 𝐞η\mathbf{e}_{\eta} are well known.

VII Conclusion

In this paper, beam training for IRSs assisted mmWave communications is studied. By breaking down beam training for IRSs assisted mmWave MIMO into several mathematically equivalent sub-problems, we perform random beamforming and maximum likelihood estimation to derive the optimal beam of BS/AP and MT and the optimal reflection pattern of IRSs. Then, by sorting the reliability of the estimated AoA, AoD paris, we propose an iterative positioning algorithm to acquire the position of MT, and with which we are able to cross verify and enhance the estimation of AoA and AoD, and accurately predict link blockage. Numerical results show the superiority of our proposed beam training scheme and verify the performance gain brought by location information.

Appendix A Partial derivatives of g(θ,ϕ)g({\theta,\phi})

The derivative of g(θ,ϕ)g({\theta,\phi}) with respect to θ\theta is

g(θ,ϕ)θ\displaystyle\frac{\partial g({\theta,\phi})}{\partial\theta}
=\displaystyle= 𝐛H(θ,ϕ)𝐃H𝐲𝐲H𝐃𝐛(θ,ϕ)θ𝐛H(θ,ϕ)𝐃H𝐃𝐛(θ,ϕ)\displaystyle\frac{\frac{\partial\mathbf{b}^{H}(\theta,\phi)\mathbf{D}^{H}\mathbf{y}\mathbf{y}^{H}\mathbf{D}\mathbf{b}(\theta,\phi)}{\partial\theta}}{\mathbf{b}^{H}(\theta,\phi)\mathbf{D}^{H}\mathbf{D}\mathbf{b}(\theta,\phi)}-
𝐛H(θ,ϕ)𝐃H𝐲𝐲H𝐃𝐛(θ,ϕ)(𝐛H(θ,ϕ)𝐃H𝐃𝐛(θ,ϕ))2𝐛H(θ,ϕ)𝐃H𝐃𝐛(θ,ϕ)θ\displaystyle\frac{\mathbf{b}^{H}(\theta,\phi)\mathbf{D}^{H}\mathbf{y}\mathbf{y}^{H}\mathbf{D}\mathbf{b}(\theta,\phi)}{\left(\mathbf{b}^{H}(\theta,\phi)\mathbf{D}^{H}\mathbf{D}\mathbf{b}(\theta,\phi)\right)^{2}}\frac{\partial\mathbf{b}^{H}(\theta,\phi)\mathbf{D}^{H}\mathbf{D}\mathbf{b}(\theta,\phi)}{\partial\theta}
=\displaystyle= 2Re(𝐛H(θ,ϕ)𝐃H𝐲𝐲H𝐃𝐛(θ,ϕ)θ𝐛H(θ,ϕ)𝐃H𝐃𝐛(θ,ϕ)\displaystyle 2Re\left(\frac{\mathbf{b}^{H}(\theta,\phi)\mathbf{D}^{H}\mathbf{y}\mathbf{y}^{H}\mathbf{D}\frac{\partial\mathbf{b}(\theta,\phi)}{\partial\theta}}{\mathbf{b}^{H}(\theta,\phi)\mathbf{D}^{H}\mathbf{D}\mathbf{b}(\theta,\phi)}-\right.
𝐛H(θ,ϕ)𝐃H𝐲𝐲H𝐃𝐛(θ,ϕ)(𝐛H(θ,ϕ)𝐃H𝐃𝐛(θ,ϕ))2𝐛H(θ,ϕ)𝐃H𝐃𝐛(θ,ϕ)θ)\displaystyle\left.\frac{\mathbf{b}^{H}(\theta,\phi)\mathbf{D}^{H}\mathbf{y}\mathbf{y}^{H}\mathbf{D}\mathbf{b}(\theta,\phi)}{\left(\mathbf{b}^{H}(\theta,\phi)\mathbf{D}^{H}\mathbf{D}\mathbf{b}(\theta,\phi)\right)^{2}}{\mathbf{b}^{H}(\theta,\phi)\mathbf{D}^{H}\mathbf{D}\frac{\partial\mathbf{b}(\theta,\phi)}{\partial\theta}}\right) (61)

where 𝐛(θ,ϕ)θ=vec((𝐚Rx(θ)ϑRx)𝐚TxH(ϕ))\frac{\partial\mathbf{b}(\theta,\phi)}{\partial\theta}=vec\left((\mathbf{a}_{Rx}(\theta)\odot\bm{\vartheta}_{Rx})\mathbf{a}_{Tx}^{H}(\phi)\right) and ϑRx=[0,jπ,,jπ(Nr1)]T\bm{\vartheta}_{Rx}=\left[0,j\pi,\cdots,j\pi(N_{r}-1)\right]^{T}. Similarly, the derivative of g(θ,ϕ)g({\theta,\phi}) with respect to ϕ\phi is

g(θ,ϕ)ϕ=2Re(𝐛H(θ,ϕ)𝐃H𝐲𝐲H𝐃𝐛(θ,ϕ)ϕ𝐛H(θ,ϕ)𝐃H𝐃𝐛(θ,ϕ)\displaystyle\frac{\partial g({\theta,\phi})}{\partial\phi}=2Re\left(\frac{\mathbf{b}^{H}(\theta,\phi)\mathbf{D}^{H}\mathbf{y}\mathbf{y}^{H}\mathbf{D}\frac{\partial\mathbf{b}(\theta,\phi)}{\partial\phi}}{\mathbf{b}^{H}(\theta,\phi)\mathbf{D}^{H}\mathbf{D}\mathbf{b}(\theta,\phi)}-\right.
𝐛H(θ,ϕ)𝐃H𝐲𝐲H𝐃𝐛(θ,ϕ)(𝐛H(θ,ϕ)𝐃H𝐃𝐛(θ,ϕ))2𝐛H(θ,ϕ)𝐃H𝐃𝐛(θ,ϕ)ϕ)\displaystyle\left.\frac{\mathbf{b}^{H}(\theta,\phi)\mathbf{D}^{H}\mathbf{y}\mathbf{y}^{H}\mathbf{D}\mathbf{b}(\theta,\phi)}{\left(\mathbf{b}^{H}(\theta,\phi)\mathbf{D}^{H}\mathbf{D}\mathbf{b}(\theta,\phi)\right)^{2}}{\mathbf{b}^{H}(\theta,\phi)\mathbf{D}^{H}\mathbf{D}\frac{\partial\mathbf{b}(\theta,\phi)}{\partial\phi}}\right) (62)

where 𝐛(θ,ϕ)ϕ=vec(𝐚Rx(θ)(𝐚Tx(ϕ)ϑTx)H)\frac{\partial\mathbf{b}(\theta,\phi)}{\partial\phi}=vec\left(\mathbf{a}_{Rx}(\theta)(\mathbf{a}_{Tx}(\phi)\odot\bm{\vartheta}_{Tx})^{H}\right) and ϑTx=[0,jπ,,jπ(Nt1)]T\bm{\vartheta}_{Tx}=\left[0,j\pi,\cdots,j\pi(N_{t}-1)\right]^{T}.

Appendix B Proof of Theorem 1

In the noiseless scenario where 𝐲=𝐃𝐛(θ,ϕ)\mathbf{y}=\mathbf{D}\mathbf{b}(\theta,\phi), according to Cauchy-Schwarz inequality, we have

𝐛H(θ~,ϕ~)𝐃H𝐃𝐛(θ~,ϕ~)2𝐃𝐛(θ,ϕ)2𝐃𝐛(θ,ϕ)2\displaystyle\left\|\frac{\mathbf{b}^{H}(\widetilde{\theta},\widetilde{\phi})\mathbf{D}^{H}}{\|\mathbf{D}\mathbf{b}(\widetilde{\theta},\widetilde{\phi})\|_{2}}\mathbf{D}\mathbf{b}(\theta,\phi)\right\|_{2}\leq\|\mathbf{D}\mathbf{b}(\theta,\phi)\|_{2} (63)

Then, the proof of Eq. (32) is reduced to prove that

𝐛H(θ~,ϕ~)𝐃H𝐃𝐛(θ~,ϕ~)2𝐃𝐛(θ,ϕ)2𝐃𝐛(θ,ϕ)2\displaystyle\left\|\frac{\mathbf{b}^{H}(\widetilde{\theta},\widetilde{\phi})\mathbf{D}^{H}}{\|\mathbf{D}\mathbf{b}(\widetilde{\theta},\widetilde{\phi})\|_{2}}\mathbf{D}\mathbf{b}(\theta,\phi)\right\|_{2}\neq\|\mathbf{D}\mathbf{b}(\theta,\phi)\|_{2} (64)

namely 𝐃𝐛(θ,ϕ)μ𝐃𝐛(θ~,ϕ~),μ,(θ,ϕ)(θ~,ϕ~)\mathbf{D}\mathbf{b}(\theta,\phi)\neq\mu\mathbf{D}\mathbf{b}(\widetilde{\theta},\widetilde{\phi}),\;\;\forall\mu\in\mathbb{C},\forall(\theta,\phi)\neq(\widetilde{\theta},\widetilde{\phi}), which is mathematically equivalent to Eq. (IV-B).

Appendix C Proof of Theorem 2

The PEP is written as

Pe((θ,ϕ)(θ~,ϕ~))\displaystyle\qquad Pe\left((\theta,\phi)\rightarrow(\widetilde{\theta},\widetilde{\phi})\right)
=\displaystyle= Pr(𝐛H(θ,ϕ)𝐃H𝐃𝐛(θ,ϕ)2𝐲22<𝐛H(θ~,ϕ~)𝐃H𝐃𝐛(θ~,ϕ~)2𝐲22)\displaystyle Pr\left(\left\|\frac{\mathbf{b}^{H}(\theta,\phi)\mathbf{D}^{H}}{\|\mathbf{D}\mathbf{b}(\theta,\phi)\|_{2}}\mathbf{y}\right\|_{2}^{2}<\left\|\frac{\mathbf{b}^{H}(\widetilde{\theta},\widetilde{\phi})\mathbf{D}^{H}}{\|\mathbf{D}\mathbf{b}(\widetilde{\theta},\widetilde{\phi})\|_{2}}\mathbf{y}\right\|_{2}^{2}\right)
=\displaystyle= Pr(|𝐛H(θ,ϕ)𝐃H𝐧|2𝐃𝐛(θ,ϕ)2+|𝐛H(θ~,ϕ~)𝐃H𝐧|2𝐃𝐛(θ~,ϕ~)2\displaystyle Pr\Bigg{(}{-\frac{|\mathbf{b}^{H}({\theta},{\phi})\mathbf{D}^{H}\mathbf{n}|^{2}}{\|\mathbf{D}\mathbf{b}({\theta},{\phi})\|_{2}}+\frac{|\mathbf{b}^{H}(\widetilde{\theta},\widetilde{\phi})\mathbf{D}^{H}\mathbf{n}|^{2}}{\|\mathbf{D}\mathbf{b}(\widetilde{\theta},\widetilde{\phi})\|_{2}}}
2{δ𝐧H𝐃𝐛(θ,ϕ)}\displaystyle\qquad-2\Re\left\{{\delta\mathbf{n}^{H}\mathbf{D}\mathbf{b}(\theta,\phi)}\right\}
+2{δ𝐧H𝐃𝐛(θ~,ϕ~)𝐛H(θ~,ϕ~)𝐃H𝐃𝐛(θ,ϕ)𝐃𝐛(θ~,ϕ~)22}\displaystyle\qquad+2\Re\left\{\frac{\delta\mathbf{n}^{H}\mathbf{D}\mathbf{b}(\widetilde{\theta},\widetilde{\phi})\mathbf{b}^{H}(\widetilde{\theta},\widetilde{\phi})\mathbf{D}^{H}\mathbf{D}\mathbf{b}({\theta},{\phi})}{\|\mathbf{D}\mathbf{b}(\widetilde{\theta},\widetilde{\phi})\|_{2}^{2}}\right\}
>δ𝐃𝐛(θ,ϕ)22|δ𝐛H(θ~,ϕ~)𝐃H𝐃𝐛(θ,ϕ)|2𝐃𝐛(θ~,ϕ~)22)\displaystyle\qquad>\|\delta\mathbf{D}\mathbf{b}(\theta,\phi)\|_{2}^{2}-\frac{|\delta\mathbf{b}^{H}(\widetilde{\theta},\widetilde{\phi})\mathbf{D}^{H}\mathbf{D}\mathbf{b}(\theta,\phi)|^{2}}{\|\mathbf{D}\mathbf{b}(\widetilde{\theta},\widetilde{\phi})\|_{2}^{2}}\Bigg{)}
\displaystyle\approx Pr(N1>δ𝐃𝐛(θ,ϕ)22|δ𝐛H(θ~,ϕ~)𝐃H𝐃𝐛(θ,ϕ)|2𝐃𝐛(θ~,ϕ~)22)\displaystyle Pr\Bigg{(}N_{1}>\|\delta\mathbf{D}\mathbf{b}(\theta,\phi)\|_{2}^{2}-\frac{|\delta\mathbf{b}^{H}(\widetilde{\theta},\widetilde{\phi})\mathbf{D}^{H}\mathbf{D}\mathbf{b}(\theta,\phi)|^{2}}{\|\mathbf{D}\mathbf{b}(\widetilde{\theta},\widetilde{\phi})\|_{2}^{2}}\Bigg{)} (65)

where

N1=\displaystyle\qquad\qquad N_{1}=
2{δ𝐧H𝐃𝐛(θ,ϕ)+δ𝐧H𝐃𝐛(θ~,ϕ~)𝐛H(θ~,ϕ~)𝐃H𝐃𝐛(θ,ϕ)𝐃𝐛(θ~,ϕ~)22}\displaystyle 2\Re\left\{-{\delta\mathbf{n}^{H}\mathbf{D}\mathbf{b}(\theta,\phi)}+\frac{\delta\mathbf{n}^{H}\mathbf{D}\mathbf{b}(\widetilde{\theta},\widetilde{\phi})\mathbf{b}^{H}(\widetilde{\theta},\widetilde{\phi})\mathbf{D}^{H}\mathbf{D}\mathbf{b}({\theta},{\phi})}{\|\mathbf{D}\mathbf{b}(\widetilde{\theta},\widetilde{\phi})\|_{2}^{2}}\right\}

and {}\Re\{\cdot\} is the real part of a complex number. Eq. (65) is obtained by neglecting the component |𝐛H(θ,ϕ)𝐃H𝐧|2𝐃𝐛(θ,ϕ)2+|𝐛H(θ~,ϕ~)𝐃H𝐧|2𝐃𝐛(θ~,ϕ~)2-\frac{|\mathbf{b}^{H}({\theta},{\phi})\mathbf{D}^{H}\mathbf{n}|^{2}}{\|\mathbf{D}\mathbf{b}({\theta},{\phi})\|_{2}}+\frac{|\mathbf{b}^{H}(\widetilde{\theta},\widetilde{\phi})\mathbf{D}^{H}\mathbf{n}|^{2}}{\|\mathbf{D}\mathbf{b}(\widetilde{\theta},\widetilde{\phi})\|_{2}} in high SNR regime. Since N1N_{1} is a Gaussian random variable, we have

N1\displaystyle\qquad\qquad N_{1}\sim
𝒩(0,2σ2|δ|2(𝐃𝐛(θ,ϕ)22|𝐛H(θ~,ϕ~)𝐃H𝐃𝐛(θ,ϕ)|2𝐃𝐛(θ~,ϕ~)22))\displaystyle\mathcal{N}\left(0,2\sigma^{2}|\delta|^{2}\left(\|\mathbf{D}\mathbf{b}(\theta,\phi)\|_{2}^{2}-\frac{|\mathbf{b}^{H}(\widetilde{\theta},\widetilde{\phi})\mathbf{D}^{H}\mathbf{D}\mathbf{b}(\theta,\phi)|^{2}}{\|\mathbf{D}\mathbf{b}(\widetilde{\theta},\widetilde{\phi})\|_{2}^{2}}\right)\right)

According to the definition of Q function, (35) is obtained.

Appendix D Proof of Proposition 1

Firstly, we write the expression of d2(𝐃n,θ,ϕ,θ~,ϕ~)d^{2}(\mathbf{D}_{n},\theta,\phi,\widetilde{\theta},\widetilde{\phi}) as

d2(𝐃n,θ,ϕ,θ~,ϕ~)\displaystyle\qquad d^{2}(\mathbf{D}_{n},\theta,\phi,\widetilde{\theta},\widetilde{\phi})
=\displaystyle= 𝐃n𝐛(θ,ϕ)22|𝐛H(θ~,ϕ~)𝐃nH𝐃n𝐛(θ,ϕ)|2𝐃n𝐛(θ~,ϕ~)22\displaystyle\left\|\mathbf{D}_{n}\mathbf{b}(\theta,\phi)\right\|_{2}^{2}-\frac{|\mathbf{b}^{H}(\widetilde{\theta},\widetilde{\phi})\mathbf{D}_{n}^{H}\mathbf{D}_{n}\mathbf{b}(\theta,\phi)|^{2}}{\|\mathbf{D}_{n}\mathbf{b}(\widetilde{\theta},\widetilde{\phi})\|_{2}^{2}}
=\displaystyle= 𝐛H(θ,ϕ)𝐃n1H𝐃n1𝐛(θ,ϕ)+𝐛H(θ,ϕ)𝐝n𝐝nH𝐛(θ,ϕ)\displaystyle\mathbf{b}^{H}(\theta,\phi)\mathbf{D}_{n-1}^{H}\mathbf{D}_{n-1}\mathbf{b}(\theta,\phi)+\mathbf{b}^{H}(\theta,\phi)\mathbf{d}_{n}\mathbf{d}_{n}^{H}\mathbf{b}(\theta,\phi)-
|𝐛H(θ~,ϕ~)𝐃n1H𝐃n1𝐛(θ,ϕ)+𝐛H(θ~,ϕ~)𝐝n𝐝nH𝐛(θ,ϕ)|2𝐛H(θ~,ϕ~)𝐃n1H𝐃n1𝐛(θ~,ϕ~)+𝐛H(θ~,ϕ~)𝐝n𝐝nH𝐛(θ~,ϕ~)\displaystyle\frac{\left|\mathbf{b}^{H}(\widetilde{\theta},\widetilde{\phi})\mathbf{D}_{n-1}^{H}\mathbf{D}_{n-1}\mathbf{b}(\theta,\phi)+\mathbf{b}^{H}(\widetilde{\theta},\widetilde{\phi})\mathbf{d}_{n}\mathbf{d}_{n}^{H}\mathbf{b}(\theta,\phi)\right|^{2}}{\mathbf{b}^{H}(\widetilde{\theta},\widetilde{\phi})\mathbf{D}_{n-1}^{H}\mathbf{D}_{n-1}\mathbf{b}(\widetilde{\theta},\widetilde{\phi})+\mathbf{b}^{H}(\widetilde{\theta},\widetilde{\phi})\mathbf{d}_{n}\mathbf{d}_{n}^{H}\mathbf{b}(\widetilde{\theta},\widetilde{\phi})}

Thus

d2(𝐃n,θ,ϕ,θ~,ϕ~)d2(𝐃n1,θ,ϕ,θ~,ϕ~)\displaystyle d^{2}(\mathbf{D}_{n},\theta,\phi,\widetilde{\theta},\widetilde{\phi})-d^{2}(\mathbf{D}_{n-1},\theta,\phi,\widetilde{\theta},\widetilde{\phi})
=\displaystyle= 𝐛H(θ,ϕ)𝐝n𝐝nH𝐛(θ,ϕ)+|𝐛H(θ~,ϕ~)𝐃n1H𝐃n1𝐛(θ,ϕ)|2𝐛H(θ~,ϕ~)𝐃n1H𝐃n1𝐛(θ~,ϕ~)\displaystyle\mathbf{b}^{H}(\theta,\phi)\mathbf{d}_{n}\mathbf{d}_{n}^{H}\mathbf{b}(\theta,\phi)+\frac{|\mathbf{b}^{H}(\widetilde{\theta},\widetilde{\phi})\mathbf{D}_{n-1}^{H}\mathbf{D}_{n-1}\mathbf{b}(\theta,\phi)|^{2}}{\mathbf{b}^{H}(\widetilde{\theta},\widetilde{\phi})\mathbf{D}_{n-1}^{H}\mathbf{D}_{n-1}\mathbf{b}(\widetilde{\theta},\widetilde{\phi})}
|𝐛H(θ~,ϕ~)𝐃n1H𝐃n1𝐛(θ,ϕ)+𝐛H(θ~,ϕ~)𝐝n𝐝nH𝐛(θ,ϕ)|2𝐛H(θ~,ϕ~)𝐃n1H𝐃n1𝐛(θ~,ϕ~)+𝐛H(θ~,ϕ~)𝐝n𝐝nH𝐛(θ~,ϕ~)\displaystyle-\frac{|\mathbf{b}^{H}(\widetilde{\theta},\widetilde{\phi})\mathbf{D}_{n-1}^{H}\mathbf{D}_{n-1}\mathbf{b}(\theta,\phi)+\mathbf{b}^{H}(\widetilde{\theta},\widetilde{\phi})\mathbf{d}_{n}\mathbf{d}_{n}^{H}\mathbf{b}(\theta,\phi)|^{2}}{\mathbf{b}^{H}(\widetilde{\theta},\widetilde{\phi})\mathbf{D}_{n-1}^{H}\mathbf{D}_{n-1}\mathbf{b}(\widetilde{\theta},\widetilde{\phi})+\mathbf{b}^{H}(\widetilde{\theta},\widetilde{\phi})\mathbf{d}_{n}\mathbf{d}_{n}^{H}\mathbf{b}(\widetilde{\theta},\widetilde{\phi})}

For the purpose of conciseness, let

aˇ\displaystyle\check{a} =𝐛H(θ~,ϕ~)𝐝n𝐝nH𝐛(θ~,ϕ~);\displaystyle=\mathbf{b}^{H}(\widetilde{\theta},\widetilde{\phi})\mathbf{d}_{n}\mathbf{d}_{n}^{H}\mathbf{b}(\widetilde{\theta},\widetilde{\phi});
bˇ\displaystyle\check{b} =𝐛H(θ~,ϕ~)𝐝n𝐝nH𝐛(θ,ϕ);\displaystyle=\mathbf{b}^{H}(\widetilde{\theta},\widetilde{\phi})\mathbf{d}_{n}\mathbf{d}_{n}^{H}\mathbf{b}({\theta},{\phi});
cˇ\displaystyle\check{c} =𝐛H(θ~,ϕ~)𝐃n1H𝐃n1𝐛(θ~,ϕ~);\displaystyle=\mathbf{b}^{H}(\widetilde{\theta},\widetilde{\phi})\mathbf{D}_{n-1}^{H}\mathbf{D}_{n-1}\mathbf{b}(\widetilde{\theta},\widetilde{\phi});
dˇ\displaystyle\check{d} =𝐛H(θ~,ϕ~)𝐃n1H𝐃n1𝐛(θ,ϕ).\displaystyle=\mathbf{b}^{H}(\widetilde{\theta},\widetilde{\phi})\mathbf{D}_{n-1}^{H}\mathbf{D}_{n-1}\mathbf{b}(\theta,\phi).

As 𝐛H(θ~,ϕ~)𝐝n\mathbf{b}^{H}(\widetilde{\theta},\widetilde{\phi})\mathbf{d}_{n} and 𝐝nH𝐛(θ,ϕ)\mathbf{d}_{n}^{H}\mathbf{b}({\theta},{\phi}) are numbers, rather than vectors, we have

𝐛H(θ,ϕ)𝐝n𝐝nH𝐛(θ,ϕ)\displaystyle\mathbf{b}^{H}(\theta,\phi)\mathbf{d}_{n}\mathbf{d}_{n}^{H}\mathbf{b}(\theta,\phi) =|𝐛H(θ~,ϕ~)𝐝n𝐝nH𝐛(θ,ϕ)|2𝐛H(θ~,ϕ~)𝐝n𝐝nH𝐛(θ~,ϕ~)=|bˇ|2aˇ\displaystyle=\frac{|\mathbf{b}^{H}(\widetilde{\theta},\widetilde{\phi})\mathbf{d}_{n}\mathbf{d}_{n}^{H}\mathbf{b}({\theta},{\phi})|^{2}}{\mathbf{b}^{H}(\widetilde{\theta},\widetilde{\phi})\mathbf{d}_{n}\mathbf{d}_{n}^{H}\mathbf{b}(\widetilde{\theta},\widetilde{\phi})}=\frac{|\check{b}|^{2}}{\check{a}}

Then,

d2(𝐃n,θ,ϕ,θ~,ϕ~)d2(𝐃n1,θ,ϕ,θ~,ϕ~)\displaystyle d^{2}(\mathbf{D}_{n},\theta,\phi,\widetilde{\theta},\widetilde{\phi})-d^{2}(\mathbf{D}_{n-1},\theta,\phi,\widetilde{\theta},\widetilde{\phi})
=\displaystyle= |bˇ|2aˇ+|dˇ|2cˇ|bˇ+dˇ|2aˇ+cˇ\displaystyle\frac{|\check{b}|^{2}}{\check{a}}+\frac{|\check{d}|^{2}}{\check{c}}-\frac{|\check{b}+\check{d}|^{2}}{\check{a}+\check{c}}
=\displaystyle= |bˇ|2cˇ(aˇ+cˇ)+|dˇ|2aˇ(aˇ+cˇ)aˇcˇ|bˇ+dˇ|2aˇcˇ(aˇ+cˇ)\displaystyle\frac{|\check{b}|^{2}\check{c}(\check{a}+\check{c})+|\check{d}|^{2}\check{a}(\check{a}+\check{c})-\check{a}\check{c}|\check{b}+\check{d}|^{2}}{\check{a}\check{c}(\check{a}+\check{c})}
=\displaystyle= |bˇ|2cˇ(aˇ+cˇ)+|dˇ|2aˇ(aˇ+cˇ)aˇcˇ|bˇ|2aˇcˇ|dˇ|22aˇcˇRe{bˇdˇ}aˇcˇ(aˇ+cˇ)\displaystyle\frac{|\check{b}|^{2}\check{c}(\check{a}+\check{c})+|\check{d}|^{2}\check{a}(\check{a}+\check{c})-\check{a}\check{c}|\check{b}|^{2}-\check{a}\check{c}|\check{d}|^{2}-2\check{a}\check{c}Re\{\check{b}^{*}\check{d}\}}{\check{a}\check{c}(\check{a}+\check{c})}
=\displaystyle= |bˇ|2cˇ2+|dˇ|2aˇ22aˇcˇRe{bˇdˇ}aˇcˇ(aˇ+cˇ)\displaystyle\frac{|\check{b}|^{2}\check{c}^{2}+|\check{d}|^{2}\check{a}^{2}-2\check{a}\check{c}Re\{\check{b}^{*}\check{d}\}}{\check{a}\check{c}(\check{a}+\check{c})}
=\displaystyle= |aˇdˇbˇcˇ|2aˇcˇ(aˇ+cˇ)0\displaystyle\frac{|\check{a}\check{d}-\check{b}\check{c}|^{2}}{\check{a}\check{c}(\check{a}+\check{c})}\geq 0

and equality holds when aˇdˇbˇcˇ=0\check{a}\check{d}-\check{b}\check{c}=0, namely,

𝐛H(θ~,ϕ~)𝐝n𝐝nH𝐛(θ~,ϕ~)𝐛H(θ~,ϕ~)𝐝n𝐝nH𝐛(θ,ϕ)=\displaystyle\frac{\mathbf{b}^{H}(\widetilde{\theta},\widetilde{\phi})\mathbf{d}_{n}\mathbf{d}_{n}^{H}\mathbf{b}(\widetilde{\theta},\widetilde{\phi})}{\mathbf{b}^{H}(\widetilde{\theta},\widetilde{\phi})\mathbf{d}_{n}\mathbf{d}_{n}^{H}\mathbf{b}({\theta},{\phi})}= 𝐛H(θ~,ϕ~)𝐃n1H𝐃n1𝐛(θ~,ϕ~)𝐛H(θ~,ϕ~)𝐃n1H𝐃n1𝐛(θ,ϕ)\displaystyle\frac{\mathbf{b}^{H}(\widetilde{\theta},\widetilde{\phi})\mathbf{D}_{n-1}^{H}\mathbf{D}_{n-1}\mathbf{b}(\widetilde{\theta},\widetilde{\phi})}{\mathbf{b}^{H}(\widetilde{\theta},\widetilde{\phi})\mathbf{D}_{n-1}^{H}\mathbf{D}_{n-1}\mathbf{b}(\theta,\phi)} (66)

References

  • [1] Z. Pi and F. Khan, “An introduction to millimeter-wave mobile broadband systems,” IEEE Commun. Mag., vol. 49, no. 6, Jun. 2011.
  • [2] S. Rangan, T. S. Rappaport, and E. Erkip, “Millimeter-wave cellular wireless networks: Potentials and challenges,” Proc. IEEE, vol. 102, no. 3, pp. 366–385, Mar. 2014.
  • [3] T. Nitsche, C. Cordeiro, A. B. Flores, E. W. Knightly, E. Perahia, and J. C. Widmer, “IEEE 802.11 ad: directional 60 GHz communication for multi-Gigabit-per-second Wi-Fi,” IEEE Commun. Mag., vol. 52, no. 12, pp. 132–141, 2014.
  • [4] A. I. Sulyman, A. T. Nassar, M. K. Samimi, G. R. MacCartney, T. S. Rappaport, and A. Alsanie, “Radio propagation path loss models for 5G cellular networks in the 28 GHz and 38 GHz millimeter-wave bands,” IEEE Commun. Mag., vol. 52, no. 9, pp. 78–86, 2014.
  • [5] RP-172115, “Revised WID on New Radio Access Technology,” 2017.
  • [6] A. Natarajan, S. K. Reynolds, M. Tsai, S. T. Nicolson, J. C. Zhan, D. G. Kam, D. Liu, Y. O. Huang, A. Valdes-Garcia, and B. A. Floyd, “A fully-integrated 16-element phased-array receiver in SiGe BiCMOS for 60-GHz communications,” IEEE J. Solid-State Circuits, vol. 46, no. 5, pp. 1059–1075, May 2011.
  • [7] V. Raghavan, S. Subramanian, J. Cezanne, A. Sampath, O. Koymen, and J. Li, “Directional hybrid precoding in millimeter-wave MIMO systems,” in 2016 IEEE Global Communications Conference (GLOBECOM), 2016, pp. 1–7.
  • [8] T. Bai and R. W. Heath, “Coverage and rate analysis for millimeter-wave cellular networks,” IEEE Trans. Wireless Commun., vol. 14, no. 2, pp. 1100–1114, 2014.
  • [9] Z. Muhi-Eldeen, L. Ivrissimtzis, and M. Al-Nuaimi, “Modelling and measurements of millimetre wavelength propagation in urban environments,” IET Microw., Antennas Propag., vol. 4, no. 9, pp. 1300–1309, 2010.
  • [10] P. Wang, J. Fang, X. Yuan, Z. Chen, H. Duan, and H. Li, “Intelligent reflecting surface-assisted millimeter wave communications: Joint active and passive precoding design,” arXiv preprint arXiv:1908.10734, 2019.
  • [11] Y. Yang, B. Zheng, S. Zhang, and R. Zhang, “Intelligent reflecting surface meets OFDM: Protocol design and rate maximization,” arXiv preprint arXiv:1906.09956, 2019.
  • [12] Q. Wu and R. Zhang, “Intelligent reflecting surface enhanced wireless network via joint active and passive beamforming,” IEEE Trans. Wireless Commun., vol. 18, no. 11, pp. 5394–5409, Nov 2019.
  • [13] Z. Wang, L. Liu, and S. Cui, “Channel estimation for intelligent reflecting surface assisted multiuser communications,” arXiv preprint arXiv:1911.03084, 2019.
  • [14] E. Basar, M. Di Renzo, J. de Rosny, M. Debbah, M.-S. Alouini, and R. Zhang, “Wireless communications through reconfigurable intelligent surfaces,” arXiv preprint arXiv:1906.09490, 2019.
  • [15] C. Huang, A. Zappone, G. C. Alexandropoulos, M. Debbah, and C. Yuen, “Reconfigurable intelligent surfaces for energy efficiency in wireless communication,” IEEE Trans. Wireless Commun., vol. 18, no. 8, pp. 4157–4170, 2019.
  • [16] A. Taha, M. Alrabeiah, and A. Alkhateeb, “Enabling large intelligent surfaces with compressive sensing and deep learning,” arXiv preprint arXiv:1904.10136, 2019.
  • [17] C. Huang, A. Zappone, M. Debbah, and C. Yuen, “Achievable rate maximization by passive intelligent mirrors,” in IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2018, pp. 3714–3718.
  • [18] W. Khawaja, O. Ozdemir, Y. Yapici, F. Erden, M. Ezuma, and I. Guvenc, “Coverage enhancement for NLoS mmwave links using passive reflectors,” arXiv preprint arXiv:1905.04794, 2019.
  • [19] X. Tan, Z. Sun, D. Koutsonikolas, and J. M. Jornet, “Enabling indoor mobile millimeter-wave networks based on smart reflect-arrays,” in IEEE INFOCOM, 2018, pp. 270–278.
  • [20] W. Tang, X. Li, J. Y. Dai, S. Jin, Y. Zeng, Q. Cheng, and T. J. Cui, “Wireless communications with programmable metasurface: Transceiver design and experimental results,” China Commun., vol. 16, no. 5, pp. 46–61, 2019.
  • [21] J. Zhao, X. Yang, J. Y. Dai, Q. Cheng, X. Li, N. H. Qi, J. C. Ke, G. D. Bai, S. Liu, S. Jin et al., “Programmable time-domain digital-coding metasurface for non-linear harmonic manipulation and new wireless communication systems,” Natl. Sci. Rev., vol. 6, no. 2, pp. 231–238, 2018.
  • [22] W. Tang, J. Y. Dai, M. Chen, X. Li, Q. Cheng, S. Jin, K.-K. Wong, and T. J. Cui, “Programmable metasurface-based rf chain-free 8PSK wireless transmitter,” Electron. Lett., vol. 55, no. 7, pp. 417–420, 2019.
  • [23] C. L. Holloway, E. F. Kuester, J. A. Gordon, J. O’Hara, J. Booth, and D. R. Smith, “An overview of the theory and applications of metasurfaces: The two-dimensional equivalents of metamaterials,” IEEE Antennas Propag. Mag., vol. 54, no. 2, pp. 10–35, 2012.
  • [24] H. Yang, X. Cao, F. Yang, J. Gao, S. Xu, M. Li, X. Chen, Y. Zhao, Y. Zheng, and S. Li, “A programmable metasurface with dynamic polarization, scattering and focusing control,” Sci. Rep., vol. 6, p. 35692, 2016.
  • [25] P. Wang, J. Fang, H. Duan, and H. Li, “Compressed channel estimation for intelligent reflecting surface-assisted millimeter wave systems,” IEEE Signal Processing Letters, 2020.
  • [26] C. Jia, J. Cheng, H. Gao, and W. Xu, “High-resolution channel estimation for intelligent reflecting surface-assisted mmwave communications,” in 2020 IEEE 31st Annual International Symposium on Personal, Indoor and Mobile Radio Communications, 2020, pp. 1–6.
  • [27] Z. Xiao, T. He, P. Xia, and X.-G. Xia, “Hierarchical codebook design for beamforming training in millimeter-wave communication,” IEEE Trans. Wireless Commun., vol. 15, no. 5, pp. 3380–3392, 2016.
  • [28] A. Alkhateeb, O. El Ayach, G. Leus, and R. W. Heath, “Channel estimation and hybrid precoding for millimeter wave cellular systems,” IEEE J. Sel. Topics Signal Process., vol. 8, no. 5, pp. 831–846, 2014.
  • [29] W. Wang and W. Zhang, “Orthogonal projection-based channel estimation for multi-panel millimeter wave MIMO,” IEEE Trans. Commun., vol. 68, no. 4, pp. 2173–2187, 2020.
  • [30] A. Alkhateeb, G. Leus, and R. W. Heath, “Limited feedback hybrid precoding for multi-user millimeter wave systems,” IEEE Trans. Wireless Commun., vol. 14, no. 11, pp. 6481–6494, Nov. 2015.
  • [31] N. J. Myers, Y. Wang, N. González-Prelcic, and R. W. Heath, “Deep learning-based beam alignment in mmwave vehicular networks,” in IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2020, pp. 8569–8573.
  • [32] R. G. Baraniuk and M. B. Wakin, “Random projections of smooth manifolds,” Found. Comput. Math., vol. 9, no. 1, pp. 51–77, 2009.
  • [33] K. L. Clarkson, “Tighter bounds for random projections of manifolds,” in Proceedings of the twenty-fourth annual symposium on Computational geometry, 2008, pp. 39–48.
  • [34] G. Efstathopoulos and A. Manikas, “Extended array manifolds: Functions of array manifolds,” IEEE Trans. Signal Process., vol. 59, no. 7, pp. 3272–3287, July 2011.
  • [35] S. Foucart and H. Rauhut, A Mathematical Introduction to Compressive Sensing.   Birkhäuser Basel, 2013, vol. 1, no. 3.
  • [36] E. J. Candes, J. K. Romberg, and T. Tao, “Stable signal recovery from incomplete and inaccurate measurements,” Comm. Pure Appl. Math., vol. 59, no. 8, pp. 1207–1223, 2006.
  • [37] M. Chiani, D. Dardari, and M. K. Simon, “New exponential bounds and approximations for the computation of error probability in fading channels,” IEEE Transactions on Wireless Communications, vol. 2, no. 4, pp. 840–845, 2003.
  • [38] W. H. Foy, “Position-location solutions by Taylor-series estimation,” IEEE Trans. Aerosp. Electron. Syst., no. 2, pp. 187–194, 1976.
  • [39] J. Liu and J. Ye, “Efficient Euclidean projections in linear time,” in Proceedings of the 26th Annual International Conference on Machine Learning, 2009, pp. 657–664.
  • [40] M. Giordani, M. Polese, A. Roy, D. Castor, and M. Zorzi, “A tutorial on beam management for 3gpp nr at mmwave frequencies,” IEEE Communications Surveys & Tutorials, vol. 21, no. 1, pp. 173–196, 2018.