This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

A Geometry-aided Message Passing Method for AoA-Based Short Range MIMO Channel Estimation

Jarkko Kaleva, Nitin Jonathan Myers, Antti Tölli and Robert W. Heath Jr.
\dagger Centre for Wireless Communications, University of Oulu, Finland.
Department of Electrical and Computer Engineering, The University of Texas at Austin, USA.
Abstract

Short range channels commonly arise in millimeter wave (mmWave) wearable settings, where the length of the antenna arrays can be comparable to the distance between the radios. Conventional mmWave MIMO channel estimation techniques based on the far field assumption may perform poorly in short range settings due to the large angular spread and, hence, high available rank. We propose a geometry-aided message passing algorithm that exploits structure in short range line-of-sight (LoS) channels for spatial sub-Nyquist channel estimation. Our approach parametrizes the channel using angle-of-arrivals (AoAs) that are locally defined for subarrays of an antenna array. Furthermore, it leverages the dependencies between the local AoAs using factors based on the array geometry. We show that the LoS MIMO channel can be reconstructed using the derived local AoA estimates and the known transceiver geometry. The proposed approach achieves a reasonable rate with greatly reduced pilot transmissions when compared to exhaustive beam search-based local AoA estimation.

Index Terms:
AoA estimation, Mm-wave, message passing

I Introduction

Millimeter wave radio architectures can be different from the architectures that are commonly used at lower carrier frequencies [1]. The smaller wavelengths at millimeter wave (mmWave) allow the use of a large number of antennas. The number of available radio frequency (RF) chains, however, can be far less than the number of antenna elements, to minimize the power consumption and the cost of mmWave systems [1]. The hybrid beamforming architecture, used in the IEEE 802.11ay standard, is one such example. High spectral efficiency can be achieved in a beamforming system if its antenna arrays are configured properly. Such configuration can be achieved if the channel between the transmitter (TX) and the receiver (RX) is known.

Channel estimation can be challenging in hybrid beamforming systems due to a limited number of RF chains [1]. To reduce the overhead in learning the channel, prior work has exploited low rank and sparse nature of mmWave channels [2]. Most of these algorithms, however, make a far field assumption, i.e., the distance between the TX and the RX is larger than the length of the antenna arrays at the TX and the RX [3]. In typical mmWave wearable settings, the far field assumption may not be applicable. For instance, short range line-of-sight (LoS) channels can have rank that is larger than one; a study on the rank of LoS channels with the transceiver distance can be found in  [4]. In contrast, far field LoS channels have a rank one structure. Similarly, dictionaries used for compressive channel estimation in which far field channels have a sparse representation may not be appropriate in short range settings. Therefore, there is a need to develop algorithms that exploit the short range channel structure to minimize the training overhead for channel estimation.

In this paper, we propose a geometry-aided message passing algorithm for short range LoS channel estimation. We consider a point-to-point multiple-input multiple-output (MIMO) communication scenario, where the TX node is equipped with a fully digital architecture and a relatively small number of antennas, while the RX node comprises a large antenna array and a subarray-based hybrid beamforming architecture. Such a setup may be practical in an on-body mmWave sensor network. Assuming channel reciprocity, the estimated channel at the RX node can be used to design the corresponding hybrid beamforming structures for both data transmission and reception. We assume that the far field assumption holds for the subarray specific channels but not for the full channel. Under such an assumption, we define local angle-of-arrivals (AoAs) for each RX subarray. Our algorithm estimates the local AoAs using the pilots sent from the TX antennas, while exploiting the dependencies among the local AoAs that arise from the geometry of the RX antenna array. The full MIMO channel is then estimated from local AoA information corresponding to two outermost TX antennas of the TX antenna array. The proposed technique has low complexity and performs significantly better than the maximum likelihood approach that recovers local AoAs independently. Simulation results indicate that our algorithm can be used to greatly reduce the training required for LoS channel estimation when compared to conventional techniques.

II System and channel model

We consider a point-to-point111Extension to multiuser scenario is straightforward if each TX is assigned with orthogonal pilots. MIMO system with a subarray-based hybrid beamforming architecture at the RX illustrated in Fig 1. Let NRFN_{\mathrm{RF}} be the number of RF chains and corresponding analog subarrays at the RX. We use NrxN_{\mathrm{rx}} to denote the number of antennas at the RX. Each of the NRFN_{\mathrm{RF}} subarrays at the RX is considered to be λ/2\lambda/2 spaced uniform linear array with NN antennas, where λ\lambda denotes the wavelength corresponding to the mmWave carrier frequency. Furthermore, the Nrx=NNRFN_{\mathrm{rx}}=NN_{\mathrm{RF}} antennas at the RX are assumed to be collinear. Each subarray is equipped with an analog beamforming architecture with phase shifters. The TX is equipped with a fully digital architecture with NRFN_{\mathrm{RF}} RF chains and Ntx=NRFN_{\mathrm{tx}}=N_{\mathrm{RF}} antennas.

We consider a narrowband setting and define 𝐇Nrx×Ntx\mathbf{H}\in\mathbb{C}^{N_{\mathrm{rx}}\times N_{\mathrm{tx}}} as the channel matrix. The length of the antenna arrays at the TX and the RX are LtxL_{\mathrm{tx}} and LrxL_{\mathrm{rx}}, and the distance between the midpoint of the arrays at the TX and the RX is denoted by rr.

Refer to caption
Figure 1: Example of a short range system for NRF=4N_{\mathrm{RF}}=4, N=4N=4, and Ntx=4N_{\mathrm{tx}}=4. The angle-of-arrival varies across subarrays when rr is comparable to LrxL_{\mathrm{rx}}.

While the spacing between successive elements in each RX subarray is assumed to be λ/2\lambda/2, the spacing between the successive subarrays can be arbitrarily larger than λ/2\lambda/2. We assume that inter-subarray spacing is uniform across successive RX subarrays, and is determined by LrxL_{\mathrm{rx}} and NRFN_{\mathrm{RF}}. Similarly, the spacing between the TX antennas can be larger than λ/2\lambda/2. The assumptions like the use of uniform subarray spacing, and narrowband setting are made for simplicity of exposition. The ideas underlying our approach, however, can be extended to any TX/RX array geometry.

Now, we describe the channel and the system model in the hybrid beamforming setup. We use di,jd_{i,j} to denote the distance between the ithi^{\mathrm{th}} antenna at the RX and the jthj^{\mathrm{th}} antenna at the TX. The (i,j)th(i,j)^{\mathrm{th}} entry of the channel is then [4]

𝐇(i,j)=λ4πdi,je𝗃2πdi,j/λ.\mathbf{H}(i,j)=\frac{\lambda}{4\pi d_{i,j}}e^{-\mathsf{j}2\pi d_{i,j}/\lambda}. (1)

We define 𝐡k,N\mathbf{h}_{k,\ell}\in\mathbb{C}^{N}, a vector in 𝐇\mathbf{H}, as the channel between the kthk^{\mathrm{th}} subarray at the RX and the th{\ell}^{\mathrm{th}} RF chain at the TX. Let t[m]t_{\ell}[m]\in\mathbb{C} be the pilot transmitted by the TX in the mthm^{\mathrm{th}} training slot. In the same slot, the RX applies conjugate transpose of 𝐰k[m]N\mathbf{w}_{k}[m]\in\mathbb{C}^{N} to its kthk^{\mathrm{th}} subarray to acquire channel measurement yk[m]y_{k}[m]. Under perfect synchronization, the channel measurement is

yk[m]=𝐰k[m]=1NRF𝐡k,t[m]+vk[m],y_{k}[m]=\mathbf{w}^{\ast}_{k}[m]\sum_{\ell=1}^{N_{\mathrm{RF}}}\mathbf{h}_{k,\ell}t_{\ell}[m]+v_{k}[m], (2)

where vk[m]𝒩c(0,σ2)v_{k}[m]\sim\mathcal{N}_{\mathrm{c}}(0,\sigma^{2}) is circularly symmetric Gaussian noise with zero mean and variance σ2\sigma^{2}. As 𝐇\mathbf{H} has NNRF2NN_{\mathrm{RF}}^{2} entries and the RX can acquire NRFN_{\mathrm{RF}} channel measurements in parallel, standard channel estimation based on exhaustive search requires a training overhead of 𝒪(NNRF2/NRF)\mathcal{O}(NN_{\mathrm{RF}}^{2}/N_{\mathrm{RF}}). In this paper, we show that, by utilizing the known geometry of the RX antenna array, a reasonable approximation of 𝐇\mathbf{H} can be estimated with a small fraction of 𝒪(NNRF)\mathcal{O}(NN_{\mathrm{RF}}) pilot transmissions.

III Geometry-aided channel reconstruction

The idea underlying the proposed short range LoS channel estimation approach is best understood using local AoA estimation. For ease of exposition, consider a scenario in which the first antenna at the TX transmits known pilots and the rest of the NRF1N_{\mathrm{RF}}-1 antennas are inactive. For an indicator function 𝕀\mathbb{I}, setting t[m]=𝕀=1t_{\ell}[m]=\mathbb{I}_{\ell=1} in (2) results in the desired condition. Ignoring the TX antenna index \ell, we define θk\theta_{k} as the local AoA made by the ray between the midpoints of the kthk^{\mathrm{th}} RX subarray and the first TX antenna222In general, θk,\theta_{k,\ell} denotes the local AoA between TX antenna \ell and RX subarray kk. An identical AoA estimation process is carried out for each \ell., with the normal to the RX array. We make an assumption that the AoAs seen by the NN individual antennas within subarray kk are invariant, and can be approximated by the local AoA θk\theta_{k}. The angles {θk}k=1NRF\{\theta_{k}\}^{N_{\mathrm{RF}}}_{k=1}, however, can vary with the subarray index kk, as seen in Fig. 1. In mmWave wearable settings, the far field approximation may be valid for each subarray, while it may not hold true for the full array. For example, the length of a virtual reality headset can be comparable to the distance with a wearable. In this paper, we model θk\theta_{k} as a realization of a random variable Θk\Theta_{k}, and propose an algorithm to estimate the local AoAs from the channel measurements while exploiting the dependencies among {θk}k=1NRF\{\theta_{k}\}^{N_{\mathrm{RF}}}_{k=1}.

III-A Construction of likelihood functions for local AoAs

As each subchannel is assumed to satisfy the far field approximation, the local AoAs can be estimated using standard compressed sensing (CS) [5]. We use shifted Zadoff-Chu (ZC) sequences as analog RX beamformer weights 𝐰k[m]\mathbf{w}_{k}[m] for channel acquisition, as ZC sequences can be realized in beamforming systems with unimodulus constraints, and have good properties for AoA estimation [6]. For instance, CS-based AoA estimation with ZC sequences is efficient, if the AoAs come from a set of angles that are defined by the discrete Fourier transform. We define 𝐳kN\mathbf{z}_{k}\in\mathbb{C}^{N} as the ZC sequence used at the kthk^{\mathrm{th}} subarray of the RX. Spatially diverse channel measurements are acquired at the RX by applying different random circulant shifts of 𝐳k\mathbf{z}_{k} to its kthk^{\mathrm{th}} subarray.

For pilot transmission from the first antenna at the TX, the mthm^{\mathrm{th}} channel measurement at the kthk^{\mathrm{th}} RX subarray is

yk[m]=𝐰k[m]𝐡k,1+vk[m].y_{k}[m]=\mathbf{w}^{\ast}_{k}[m]\mathbf{h}_{k,1}+v_{k}[m]. (3)

We define a Vandermonde vector of length NN as 𝐚N(θ)=[1,e𝗃πsin(θ),e𝗃2πsin(θ),..,e𝗃(N1)πsin(θ)]T\mathbf{a}_{N}(\theta)=[1,e^{-\mathsf{j}\pi\mathrm{sin}(\theta)},e^{-\mathsf{j}2\pi\mathrm{sin}(\theta)},..,e^{-\mathsf{j}(N-1)\pi\mathrm{sin}(\theta)}]^{T}. Under the far field assumption for subchannels, 𝐡k,1\mathbf{h}_{k,1} can be approximated as αk𝐚N(θk)\alpha_{k}\mathbf{a}_{N}(\theta_{k}), where αk\alpha_{k} is an unknown complex gain. The channel measurement yk[m]y_{k}[m] is then

yk[m]=αk𝐰k[m]𝐚N(θk)+vk[m].y_{k}[m]=\alpha_{k}\mathbf{w}^{\ast}_{k}[m]\mathbf{a}_{N}(\theta_{k})+v_{k}[m]. (4)

A collection of MM projections of 𝐚N(θk)\mathbf{a}_{N}(\theta_{k}), obtained using RX beam training vectors {𝐰k[m]}m=1M\{\mathbf{w}_{k}[m]\}^{M}_{m=1}, is defined as 𝐲kM\mathbf{y}_{k}\in\mathbb{C}^{M}. In this paper, the first M1M-1 beam training vectors, i.e., {𝐰k[m]}m=1M1\{\mathbf{w}_{k}[m]\}^{M-1}_{m=1}, are chosen as M1M-1 distinct random circulant shifts of 𝐳k\mathbf{z}_{k}. The vector 𝐰k[M]\mathbf{w}_{k}[M] is defined as 𝐰k[M]=𝐰k[1][1,1,1,..1]T\mathbf{w}_{k}[M]=\mathbf{w}_{k}[1]\odot[1,-1,-1,..-1]^{T}, where \odot denotes the element-wise product. The MthM^{\mathrm{th}} measurement is defined differently so that the unknown gain αk\alpha_{k} can be estimated. With ηk\eta_{k} defined as the first entry of 𝐰k[1]\mathbf{w}_{k}[1], it can be observed from (4) that yk[1]+yk[M]y_{k}[1]+y_{k}[M] is a noisy version of 2αkηk2\alpha_{k}\eta_{k}. An estimate of αk\alpha_{k} is then α^k=(yk[1]+yk[M])/2ηk\hat{\alpha}_{k}=(y_{k}[1]+y_{k}[M])/2\eta_{k}.

The gain compensated channel measurements are defined as 𝐲~k=𝐲k/α^k\tilde{\mathbf{y}}_{k}=\mathbf{y}_{k}/\hat{\alpha}_{k}. The compression matrix associated with the kthk^{\mathrm{th}} subarray is defined as 𝐀kM×N\mathbf{A}_{k}\in\mathbb{C}^{M\times N}, where the mthm^{\mathrm{th}} row of 𝐀k\mathbf{A}_{k} is 𝐀k(m,:)=𝐰k[m]\mathbf{A}_{k}(m,:)=\mathbf{w}^{\ast}_{k}[m]. From (4), it can be observed that 𝐲k=αk𝐀k𝐚N(θk)+𝐯k\mathbf{y}_{k}=\alpha_{k}\mathbf{A}_{k}\mathbf{a}_{N}(\theta_{k})+\mathbf{v}_{k}. We ignore the errors in estimating αk\alpha_{k} to conclude that 𝐲~k\tilde{\mathbf{y}}_{k} is a realization of 𝒩c(𝐀k𝐚N(θk),σ2𝐈/|α^k|2)\mathcal{N}_{\mathrm{c}}(\mathbf{A}_{k}\mathbf{a}_{N}(\theta_{k}),\sigma^{2}\mathbf{I}/|\hat{\alpha}_{k}|^{2}). Thus, the scaled likelihood function p(θk)p(\theta_{k}) is defined as

p(θk)=exp(|α^k|2𝐲~k𝐀k𝐚N(θk)2/σ2).p(\theta_{k})=\mathrm{exp}\left(-{|\hat{\alpha}_{k}|^{2}\|\tilde{\mathbf{y}}_{k}-\mathbf{A}_{k}\mathbf{a}_{N}(\theta_{k})\|^{2}}/{\sigma^{2}}\right). (5)

A possible way to estimate θk\theta_{k} is by maximizing p(θk)p(\theta_{k}) in (5).

III-B Statistical dependency among local AoAs

In this section, we design geometry factors to model the strong dependencies among the local AoAs {θk}k=1NRF\{\theta_{k}\}^{N_{\mathrm{RF}}}_{k=1}. For a specific θ1\theta_{1}, it can be observed from Fig. 1 that θ2\theta_{2} is a function of the distance between the first antenna at the TX and the first subarray of the RX. As this distance is typically bounded, θ2\theta_{2} takes a range of values that depend on θ1\theta_{1}. The maximum likelihood (ML) estimator that independently maximizes (5) over {θk}k=1NRF\{\theta_{k}\}_{k=1}^{N_{\mathrm{RF}}} does not exploit such dependency, and may result in poor local AoA estimates.

We derive the geometry factor g(θ2|θ1)g(\theta_{2}|\theta_{1}) that represents the distribution of Θ2\Theta_{2} conditioned on Θ1=θ1\Theta_{1}=\theta_{1}. We define r1r_{1} as the distance between the first TX antenna and the midpoint of the RX array, and sks_{k} as the distance of the kthk^{\mathrm{th}} RX subarray from the midpoint of the RX. For a given r1r_{1}, the distance r11r_{11} in Fig. 1 can be solved from

(r11cosθ1)2+(r11sinθ1+s1)2=r12(r_{11}\mathrm{cos}\,\theta_{1})^{2}+(r_{11}\mathrm{sin}\,\theta_{1}+s_{1})^{2}=r_{1}^{2} (6)

and, subsequently, the local AoA θ2\theta_{2} can be expressed as

θ2=tan1(s1s2+r11sinθ1r11cosθ1).\theta_{2}=\mathrm{tan}^{-1}\left(\frac{s_{1}-s_{2}+r_{11}\mathrm{sin}\,\theta_{1}}{r_{11}\mathrm{cos}\,\theta_{1}}\right). (7)

From (6) and (7), it can be observed that there is a unique θ2\theta_{2} for a given θ1\theta_{1} and r1r_{1}. Let 𝒢\mathcal{G} be the mapping from θ1\theta_{1} and r1r_{1} to θ2\theta_{2}, i.e., θ2=𝒢(θ1,r1)\theta_{2}=\mathcal{G}(\theta_{1},r_{1}). We assume that r[rmin,rmax]r\in[r_{\mathrm{min}},r_{\mathrm{max}}]. For example, it is reasonable to assume rmin15cmr_{\mathrm{min}}\approx 15\,\mathrm{cm} and rmax80cmr_{\mathrm{max}}\approx 80\,\mathrm{cm} in on-body communication setups. The distance r1r_{1} is known to lie in [rminLtx/2,rmax+Ltx/2][r_{\mathrm{min}}-L_{\mathrm{tx}}/2,r_{\mathrm{max}}+L_{\mathrm{tx}}/2]. We assume that r1r_{1} is uniformly distributed within this interval to get

g(θ2|θ1)=1DrminLtx/2rmax+Ltx/2δ(θ2𝒢(θ1,r1))𝑑r1,g(\theta_{2}|\theta_{1})=\frac{1}{D}\int_{r_{\mathrm{min}}-L_{\mathrm{tx}}/2}^{r_{\mathrm{max}}+L_{\mathrm{tx}}/2}\delta(\theta_{2}-\mathcal{G}(\theta_{1},r_{1}))dr_{1}, (8)

where D=rmaxrmin+LtxD=r_{\mathrm{max}}-r_{\mathrm{min}}+L_{\mathrm{tx}} and δ()\delta(\cdot) denotes the Dirac-delta function indicating θ2=𝒢(θ1,r1)\theta_{2}=\mathcal{G}(\theta_{1},r_{1}). Similarly, other conditional distributions, i.e., {g(θk|θn)}k,n\{g(\theta_{k}|\theta_{n})\}_{k,n}, can be estimated with the arguments used to compute g(θ2|θ1)g(\theta_{2}|\theta_{1}). The geometry factors do not depend on the channel measurements, and can be computed offline based on the array geometry at the RX.

III-C Geometry-aided message passing

Refer to caption
Figure 2: Factor graphs in forward and backward passes of message passing for NRF=4N_{\mathrm{RF}}=4. The random variable Θk\Theta_{k} models a realization of θk\theta_{k}.

Now, we explain our algorithm that combines information about {Θk}k=1NRF\{\Theta_{k}\}_{k=1}^{N_{\mathrm{RF}}} from the likelihood functions (5) and the geometry factors (8). The factor graph [7] corresponding to our algorithm is shown in Fig. 2. The circular nodes in Fig. 2 denote random variables, and the rectangular nodes contain factors or likelihoods. A message flowing out of a node represents a distribution of a random variable, from the node’s perspective. Messages are sent between the circular nodes and the rectangular nodes to obtain better estimates of {θk}k=1NRF\{\theta_{k}\}_{k=1}^{N_{\mathrm{RF}}} when compared to the ML estimator. Our algorithm includes a forward pass, i.e., sequential flow of messages among {Θk}k=1NRF\{\Theta_{k}\}^{N_{\mathrm{RF}}}_{k=1}, and a backward pass in which message flows occur in the opposite direction. The messages in both directions are computed using the sum-product algorithm [7].

We explain how messages are constructed in the forward pass. In the first iteration of the forward pass, the likelihood function p(θ1)p(\theta_{1}) is sent to node Θ1\Theta_{1}, which forwards Poutfwd(θ1)=p(θ1)P^{\mathrm{fwd}}_{\mathrm{out}}(\theta_{1})=p(\theta_{1}) to the geometry factor that contains g(θ2|θ1)g(\theta_{2}|\theta_{1}). Using the two functions, the geometry factor forwards message Pinfwd(θ2)P^{\mathrm{fwd}}_{\mathrm{in}}(\theta_{2}) to node Θ2\Theta_{2}, defined by

Pinfwd(θ2)=π/2π/2Poutfwd(θ1)g(θ2|θ1)𝑑θ1.P^{\mathrm{fwd}}_{\mathrm{in}}(\theta_{2})=\int_{-\pi/2}^{\pi/2}P^{\mathrm{fwd}}_{\mathrm{out}}(\theta_{1})g(\theta_{2}|\theta_{1})d\theta_{1}. (9)

The distribution Pinfwd(θ2)P^{\mathrm{fwd}}_{\mathrm{in}}(\theta_{2}) is essentially Poutfwd(θ1)g(θ2|θ1)P^{\mathrm{fwd}}_{\mathrm{out}}(\theta_{1})g(\theta_{2}|\theta_{1}) marginalized over θ1\theta_{1}. In other words, Pinfwd(θ2)P^{\mathrm{fwd}}_{\mathrm{in}}(\theta_{2}) represents the belief about Θ2\Theta_{2}, i.e., a scaled probability distribution of Θ2\Theta_{2} believed by the geometry factor, using information about Θ1\Theta_{1}. Note that Pinfwd(θ2)P^{\mathrm{fwd}}_{\mathrm{in}}(\theta_{2}) provides side information about Θ2\Theta_{2} that is independent of the measurements acquired by the second subarray. This side information comes from the observations in the first subarray, and the statistical dependency between θ1\theta_{1} and θ2\theta_{2}. The node Θ2\Theta_{2} combines information from the channel measurements, i.e., p(θ2)p(\theta_{2}), with the one from Pinfwd(θ2)P^{\mathrm{fwd}}_{\mathrm{in}}(\theta_{2}), using

Poutfwd(θ2)=p(θ2)Pinfwd(θ2).P^{\mathrm{fwd}}_{\mathrm{out}}(\theta_{2})=p(\theta_{2})P^{\mathrm{fwd}}_{\mathrm{in}}(\theta_{2}). (10)

The message in (10) is sent to the factor containing g(θ3|θ2)g(\theta_{3}|\theta_{2}), which computes the belief about Θ3\Theta_{3} with g(θ3|θ2)g(\theta_{3}|\theta_{2}) and Poutfwd(θ2)P^{\mathrm{fwd}}_{\mathrm{out}}(\theta_{2}). The process of message flows continues until the last node with ΘNRF\Theta_{N_{\mathrm{RF}}} is reached. The messages in (9) and (10) can be generalized, by setting θ1\theta_{1} to θk1\theta_{k-1} and θ2\theta_{2} to θk\theta_{k}, to obtain recursive equations. The forward pass computes message inflows {Pinfwd(θk)}k=2NRF\{P^{\mathrm{fwd}}_{\mathrm{in}}(\theta_{k})\}_{k=2}^{N_{\mathrm{RF}}} using the recursive equations.

The forward pass does not exploit information about {θn}n=kNRF\{\theta_{n}\}^{N_{\mathrm{RF}}}_{n=k} to generate side information about θk1\theta_{k-1}. The backward pass overcomes this issue by performing message flows in the opposite direction of the forward pass. In the first iteration, message from the NRFthN_{\mathrm{RF}}^{\text{th}} node, i.e., Poutbwd(θNRF)=p(θNRF)P^{\mathrm{bwd}}_{\mathrm{out}}(\theta_{N_{\mathrm{RF}}})=p(\theta_{N_{\mathrm{RF}}}), flows into the geometry factor containing g(θNRF1|θNRF)g(\theta_{N_{\mathrm{RF}}-1}|\theta_{N_{\mathrm{RF}}}). The geometry factor then sends a belief about ΘNRF1\Theta_{N_{\mathrm{RF}}-1}, defined as Pinbwd(θNRF1)P^{\mathrm{bwd}}_{\mathrm{in}}(\theta_{N_{\mathrm{RF}}-1}), based on p(θNRF)p(\theta_{N_{\mathrm{RF}}}). Similarly, the backward pass is computed using

Pinbwd(θk1)\displaystyle P^{\mathrm{bwd}}_{\mathrm{in}}(\theta_{k-1}) =π/2π/2Poutbwd(θk)g(θk1|θk)𝑑θk,\displaystyle=\int_{-\pi/2}^{\pi/2}P^{\mathrm{bwd}}_{\mathrm{out}}(\theta_{k})g(\theta_{k-1}|\theta_{k})d\theta_{k}, (11)
Poutbwd(θk)\displaystyle P^{\mathrm{bwd}}_{\mathrm{out}}(\theta_{k}) =p(θk)Pinbwd(θk).\displaystyle=p(\theta_{k})P^{\mathrm{bwd}}_{\mathrm{in}}(\theta_{k}). (12)

It can be observed that nodes Θ1\Theta_{1} and ΘNRF\Theta_{N_{\mathrm{RF}}} do not receive side information in the forward and backward passes. Therefore, we set Pinfwd(θ1)=𝒰(θ1)P^{\mathrm{fwd}}_{\mathrm{in}}(\theta_{1})=\mathcal{U}(\theta_{1}) and Pinbwd(θNRF)=𝒰(θNRF)P^{\mathrm{bwd}}_{\mathrm{in}}(\theta_{N_{\mathrm{RF}}})=\mathcal{U}(\theta_{N_{\mathrm{RF}}}), where 𝒰(θ)\mathcal{U}(\theta) denotes a uniform distribution over θ\theta.

At the end of forward and backward passes, each node kk obtains side information in the form of Pinfwd(θk)P^{\mathrm{fwd}}_{\mathrm{in}}(\theta_{k}) and Pinbwd(θk)P^{\mathrm{bwd}}_{\mathrm{in}}(\theta_{k}), and also has access to the likelihood p(θk)p(\theta_{k}). The three sources of information about Θk\Theta_{k} can be combined by defining a new distribution pgmp(θk)p_{\mathrm{gmp}}(\theta_{k}), i.e.,

pgmp(θk)=p(θk)Pinfwd(θk)Pinbwd(θk).p_{\mathrm{gmp}}(\theta_{k})=p(\theta_{k})P^{\mathrm{fwd}}_{\mathrm{in}}(\theta_{k})P^{\mathrm{bwd}}_{\mathrm{in}}(\theta_{k}). (13)

Finally, the local AoA estimate with our geometry-aided message passing algorithm is defined as θ^k=argmaxpgmp(θk)\hat{\theta}_{k}=\mathrm{arg\,max}\,p_{\mathrm{gmp}}(\theta_{k}). For computational tractability, the integrals in (8), (9), and (11) are computed using a discrete sum, assuming angular resolution of κπ\kappa\pi. Thus, the complexity of the algorithm is 𝒪(NRF/κ2)\mathcal{O}(N_{\mathrm{RF}}/\kappa^{2}). It can be noticed that the factor graph in our algorithm models dependencies among adjacent local AoAs, and does not include factors like g(θk|θn)g(\theta_{k}|\theta_{n}) for |kn|>1|k-n|>1. Incorporating such factors can result in short cycles, which may not be desirable from a message passing perspective [8].

III-D MIMO channel reconstruction from local AoA estimates

A reasonable approximation of the LoS MIMO channel matrix can be estimated at the RX from the local AoA estimates. First, we note that the RX knows the geometry of its antenna array, and can compute the position vectors of all the RX antenna elements relative to a common reference. Let 𝐬1\mathbf{s}_{1} and 𝐬NRF\mathbf{s}_{N_{\mathrm{RF}}} denote the position vectors of the first and NRFthN_{\mathrm{RF}}^{\mathrm{th}} RX subarrays. We define 𝐬=(𝐬1𝐬NRF)/𝐬1𝐬NRF2\mathbf{s}=(\mathbf{s}_{1}-\mathbf{s}_{N_{\mathrm{RF}}})/\|\mathbf{s}_{1}-\mathbf{s}_{N_{\mathrm{RF}}}\|_{2} as a unit vector along the antenna array at the RX. It can be observed from Fig. 1 that the first TX antenna is at a distance of r11r_{11} from the first RX subarray. Furthermore, the direction of this TX antenna, relative to the RX array, can be found by rotating 𝐬\mathbf{s} in the anti-clockwise direction by π/2θ1\pi/2-\theta_{1}. For a rotation matrix 𝐑(Δ)\mathbf{R}(\Delta) given by

𝐑(Δ)=[cos(Δ)sin(Δ)sin(Δ)cos(Δ)],\mathbf{R}(\Delta)=\begin{bmatrix}\cos(\Delta)&-\sin(\Delta)\\ \sin(\Delta)&\cos(\Delta)\end{bmatrix}\text{,} (14)

the position vector of the first TX antenna is defined as 𝐪1=𝐬1+r11𝐑(π/2θ1)𝐬,\mathbf{q}_{1}=\mathbf{s}_{1}+r_{11}\mathbf{R}(\pi/2-\theta_{1})\mathbf{s}, where r11r_{11} can be computed using triangulation from multiple local AoA estimates. The position vector of the NRFthN_{\mathrm{RF}}^{\mathrm{th}} TX antennas can be derived by the algorithm in Section III-C to estimate the local AoAs corresponding to the NRFthN_{\mathrm{RF}}^{\mathrm{th}} TX. Finally, the coordinates of the remaining TX antennas are acquired using the estimated positions of the first and last TX antennas and the known TX geometry. The LoS channel is constructed from the estimated coordinates using (1). Note that the pilot transmissions can be performed from different antennas at the TX using spread sequences.

IV Simulation Results

We consider N=16N=16 antennas per subarray, and NRF=4N_{\mathrm{RF}}=4 subarrays for the hybrid beamforming architecture at the RX. The TX is equipped with a fully digital architecture with Ntx=4N_{\mathrm{tx}}=4 antennas and NRF=4N_{\mathrm{RF}}=4 RF chains. The carrier frequency in the system is set to 60GHz60\,\mathrm{GHz}, which corresponds to λ=5mm\lambda=5\,\mathrm{mm}. The length of the arrays at the TX and the RX are Ltx=4cmL_{\mathrm{tx}}=4\,\mathrm{cm} and Lrx=18cmL_{\mathrm{rx}}=18\,\mathrm{cm}. It may be possible to mount the RX array on a augmented reality headset, and the TX array on a wearable like smart watch. The spacing between the midpoint of successive RX subarrays is 4.75cm4.75\,\mathrm{cm}, and the spacing between neighbouring TX antennas is 1.33cm1.33\,\mathrm{cm}. The TX and RX arrays are placed on a horizontal plane at a height of 1.5m1.5\,\mathrm{m}, in a room of dimensions 5m×5m×3m5\,\mathrm{m}\times 5\,\mathrm{m}\times 3\,\mathrm{m}. The properties of the ceiling, side-walls, and the floor were set according to the parameters in [4]. The channel in our simulations has components corresponding to reflections from the walls in addition to the LoS component. Our algorithm, however, can only estimate the dominant signal path corresponding to the LoS component. The distance between the transceivers, i.e., rr, is chosen uniformly at random from [rmin,rmax][r_{\mathrm{min}},r_{\mathrm{max}}] for rmin=40cmr_{\mathrm{min}}=40\,\mathrm{cm} and rmax=80cmr_{\mathrm{max}}=80\,\mathrm{cm}. The orientations of the TX and RX arrays are chosen at random to cover a wide range of possible configurations. The RX SNR for pilot measurements, 5dB5\,\mathrm{dB}, is chosen to be fairly small to demonstrate the resilience of the algorithm against estimation inaccuracies.

Refer to caption
Figure 3: Likelihood functions {p(θk)}k=1NRF\{p(\theta_{k})\}_{k=1}^{N_{\mathrm{RF}}} for a particular channel realization.

We evaluate three different algorithms in a short range setting. The first method uses the ZC-based compression matrix defined in Sec. III-A, and independently maximizes the likelihood functions in (5). The second approach estimates local AoAs using the proposed geometry-aided message passing algorithm. Our algorithm uses information from the likelihoods, and the geometry factors that are based on the RX array configuration. For a benchmark, we define the third approach based on exhaustive beam search, i.e., the DFT dictionary is used for 𝐀kk\mathbf{A}_{k}\ \forall\ k, and M=16M=16 channel measurements are acquired in each subarray. The resolution in the angle (θ\theta) space is set to =0.125o=0.125^{\mathrm{o}}, using κ=1/1440\kappa=1/1440. An example of the likelihood functions {p(θk)}k=1NRF\{p(\theta_{k})\}_{k=1}^{N_{\mathrm{RF}}} is shown in Fig. 3. It can be observed that maximizing the likelihood functions independently for M=3M=3 can result in local AoAs that are significantly different from each other. The proposed method, however, is able to “repair” the mismatched AoAs using geometry factors and message passing. In Fig. 4, we plot the empirical cumulative distribution function (ECDF) of the errors in the local AoA estimates with the maximum likelihood approach and the proposed method. It can be observed from Fig. 4 that the local AoAs recovered by our algorithm are significantly closer to those obtained with the benchmark, when compared to the maximum likelihood approach.

Refer to caption
Figure 4: The proposed geometry-aided message passing approach results in small local AoA errors when compared to the maximum likelihood approach.
Refer to caption
Figure 5: Short range channels can be estimated with fewer pilot transmissions by exploiting local angle dependencies that arise from the array geometry.

The accuracy of the reconstructed MIMO channel is studied using the achievable rate obtained with the estimated channel as a figure of merit. The rate analysis with practical hybrid beamforming structures is left for future study. Let 𝐇~=𝐔~𝐃~𝐕~\tilde{\mathbf{H}}=\tilde{\mathbf{U}}\tilde{\mathbf{D}}\tilde{\mathbf{V}}^{\ast} denote the singular value decomposition (SVD) of the estimated channel. Singular matrices 𝐔~\tilde{\mathbf{U}} and 𝐕~\tilde{\mathbf{V}} are considered to configure the precoders and the combiners at the TX and the RX. The achievable rate of the system (assuming ideal all-digital RX) is determined from the capacity of MIMO channel 𝐔~𝐇𝐕~\tilde{\mathbf{U}}^{\ast}\mathbf{H}\tilde{\mathbf{V}}. The simulation results are then averaged over 25002500 random orientations. It can be observed from Fig. 5 that channel estimation using the proposed algorithm results in a reasonable achievable rate that is comparable to the one achieved with exhaustive scan. While the exhaustive scan requires 1616 channel measurements per subarray, our algorithm achieves good performance with just M=3M=3 measurements per subarray. The performance gap to the perfect CSI case is due to fairly low 55dB SNR available for estimation of both local AoA θk,\theta_{k,\ell} and αk\alpha_{k} in (4), as well as, unaccounted non-LoS components in the actual channel model.

V Conclusions

Short range channels exhibit different structure when compared to the commonly studied far field channels. For example, the angles-of-arrival in a short range setting can vary across multiple sections of the receive antenna array. In this paper, we have shown that the angles can be statistically dependent, with the dependencies determined by the receive array geometry. We have developed a message passing algorithm that exploits the statistical dependencies between the angles, for channel estimation with sub-Nyquist measurements in the angular domain. Our results indicate that geometry information can serve as strong regularizer for the channel estimation problem.

Acknowledgements

This research is supported by the Academy of Finland under grant numbers 311741 and 318927 (6Genesis Flagship), and by the U.S. National Science Foundation under grant numbers CNS-1702800 and ECCS-1711702.

References

  • [1] R. W. Heath, N. Gonzalez-Prelcic, S. Rangan, W. Roh, and A. M. Sayeed, “An overview of signal processing techniques for millimeter wave MIMO systems,” IEEE J. Sel. Topics Signal Process., vol. 10, no. 3, pp. 436–453, 2016.
  • [2] J. Rodríguez-Fernández, N. González-Prelcic, K. Venugopal, and R. W. Heath, “Frequency-domain compressive channel estimation for frequency-selective hybrid millimeter wave MIMO systems,” IEEE Trans. on Wireless Commun., vol. 17, no. 5, pp. 2946–2960, 2018.
  • [3] D. Tse and P. Viswanath, Fundamentals of wireless communication. Cambridge university press, 2005.
  • [4] E. Torkildson, U. Madhow, and M. Rodwell, “Indoor millimeter wave MIMO: Feasibility and performance,” IEEE Trans. on Wireless Commun., vol. 10, no. 12, pp. 4150–4160, 2011.
  • [5] E. J. Candès and M. B. Wakin, “An introduction to compressive sampling,” IEEE Signal Process. Mag., vol. 25, no. 2, pp. 21–30, 2008.
  • [6] Y. Ding, S.-E. Chiu, and B. D. Rao, “Bayesian channel estimation algorithms for massive MIMO systems with hybrid analog-digital processing and low-resolution ADCs,” IEEE J. of Sel. Topics in Signal Process., vol. 12, no. 3, pp. 499–513, 2018.
  • [7] F. R. Kschischang, B. J. Frey, H.-A. Loeliger et al., “Factor graphs and the sum-product algorithm,” IEEE Trans. on Inform. theory, vol. 47, no. 2, pp. 498–519, 2001.
  • [8] M. Karimi and A. H. Banihashemi, “Message-passing algorithms for counting short cycles in a graph,” IEEE Trans. on Commun., vol. 61, no. 2, pp. 485–495, 2013.