Channel Estimation in MIMO Systems with
One-bit Spatial Sigma-delta ADCs

R.S. Prasobh Sankar, , and Sundeep Prabhakar Chepuri The authors are with the Department of Electrical Communication Engineering, Indian Institute of Science, Bangalore, India. Email:{rsprasobh,spchepuri}@iisc.ac.in. This work was supported in part by Nokia Faculty Research Award (NSN Oy, Espoo, Finland) and MHRD, India. The conference precursor of this paper appeared in the 46th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), June 2021 [1].

Abstract

This paper focuses on channel estimation in single-user and multi-user MIMO systems with multi-antenna base stations equipped with 1-bit spatial sigma-delta analog-to-digital converters (ADCs). A careful selection of the quantization voltage level and phase shift used in the feedback loop of 1-bit sigma-delta ADCs is critical to improve its effective resolution. We first develop a quantization noise model for 1-bit spatial sigma-delta ADCs. Using the developed noise model, we then present a two-step channel estimation algorithm to estimate a multipath channel parameterized by the gains, angles of arrival (AoAs), and angles of departure (AoDs). Specifically, in the first step, the AoAs and path gains are estimated using uplink pilots, which excite all the angles uniformly. Next, in the second step, the AoDs are estimated by progressively refining uplink beams through a recursive bisection procedure. For this algorithm, we propose a technique to select the quantization voltage level and phase shift. Through numerical simulations, we demonstrate that with the proposed parametric channel estimation algorithm, MIMO systems with 1-bit spatial sigma-delta ADCs perform significantly better than those with regular 1-bit ADCs and are on par with MIMO systems with high-resolution ADCs.

Index Terms:

Angular channel model, channel estimation, mmWave MIMO, 1-bit quantization, quantization noise modeling, spatial sigma-delta ADC.

I Introduction

Millimeter wave (mmWave) multiple-input multiple-output (MIMO) systems have become very popular for sensing and wireless communications beyond 5G [2, 3, 4]. While the abundant spectrum available at the mmWave frequency bands enables higher cellular data rates and precise positioning, links at mmWave frequencies are very sensitive to blockages and have significantly higher path loss. These issues are alleviated by beamforming with very large antenna arrays, typically packed in small areas. MIMO systems operating at mmWave frequencies, commonly referred to as massive MIMO systems, are either single-user MIMO (SU-MIMO) systems with multi-antenna user equipment (UE) and a base station (BS) having a large antenna array or multi-user MIMO (MU-MIMO) systems with many single antenna UEs communicating with a BS having a large array.

High-resolution analog-to-digital converters (ADCs) and digital-to-analog converters (DACs) for every antenna in the array significantly increase the radio frequency (RF) complexity and power consumption of massive MIMO systems. Low-resolution quantizers (e.g., 1-bit) are thus preferred albeit their deteriorated performance [5, 6, 7]. Sigma-delta ( $\Sigma\Delta$ ) quantization is a popular technique frequently used to increase the effective resolution of low-resolution quantizers [8]. In a 1-bit $\Sigma\Delta$ quantizer, the time-domain signal is first oversampled at a rate significantly higher than the Nyquist rate. Then the difference between the input and the 1-bit quantized output, i.e., the quantization noise, is fed back in time by adding it to the input at the next time instance. This operation leads to noise shaping with the quantization noise pushed to higher temporal frequencies. This means that the effective quantization noise is negligible for a low-pass signal, and it would be as if the signal were quantized by a high-resolution quantizer. This classical architecture to increase the effective resolution of time-domain signals by using a simple 1-bit quantizer with feedback has been recently adapted to the spatial domain [9, 10] and is receiving steady attention for multi-antenna communications [11, 12, 13, 14, 15].

In a 1-bit spatial $\Sigma\Delta$ quantizer, oversampling and feedback are performed in the spatial domain, i.e., across antennas. To perform spatial oversampling, the antenna elements of an array are placed less than half wavelength apart. The quantization noise of each antenna is fed back along with the input of the next antenna. Analogous to its temporal counterpart, a spatial $\Sigma\Delta$ quantizer pushes and shapes the quantization noise to higher spatial frequencies away from the array broadside. In other words, the quantization noise at lower spatial frequencies in a spatial $\Sigma\Delta$ quantizer is reduced as if it was arising from a higher-resolution quantizer [9, 10]. By introducing phase shifts to the quantization noise before feedback allows angle steering so that the quantization noise (respectively, the effective resolution) will be lower (respectively, higher) for signals arriving around the steering angle [12, 14]. Therefore, with angle steering, it is possible to obtain a higher effective resolution for signals of interest in a spatial sector of certain width centered around any desired angle. In addition, a careful selection of the quantization voltage level assigned to 1-bit quantized signals significantly improves the inference performance when working with spatial $\Sigma\Delta$ quantizers.

In essence, 1-bit spatial $\Sigma\Delta$ quantization is an attractive architecture for massive MIMO systems, requiring only 1-bit quantizers per antenna element with feedback across the elements. However, feedback and 1-bit quantization make the channel estimation required for beamforming and symbol detection very challenging. This work focuses on channel estimation in MIMO systems with 1-bit spatial $\Sigma\Delta$ quantizers.

I-A Related prior works

For rich scattering environments with a large number of multipath components, the MIMO channel matrix does not have any apparent structure. Such unstructured channel models are useful for channels at sub-6 GHz frequency bands [14, 6]. In contrast, at mmWave frequencies, due to the extreme path loss, the mmWave channel matrix is sparse in the angular domain and can be parameterized with the angles of departure (AoD), angles of arrival (AoA), and the complex gain of each path. Such angular channel models are commonly used at mmWave frequencies, e.g., at 28 GHz [4, 3].

Channel estimation with unstructured models in MIMO systems with 1-bit or few-bit quantizers is typically performed by first linearizing the non-linear quantizer using the so-called Bussgang decomposition [16, 17, 6] followed by computing a linear minimum mean squared error (LMMSE) estimate of the MIMO channel matrix [6, 18]. Bussgang decomposition based techniques [6, 17] of linearizing low-resolution quantizers have also been extended to spatial $\Sigma\Delta$ quantizers for MU-MIMO channel estimation with unstructured models [13, 14].

For channel estimation with angular models, Bussgang decomposition based methods are not useful as the channel correlation matrix required for computing the Bussgang decomposition, LMMSE estimate, or setting the quantization voltage level in spatial $\Sigma\Delta$ quantizers is not available. This is because knowing the channel correlation matrix for angular channel models amounts to knowing the unknown parameters, namely, AoAs and AoDs that characterize the channel. Thus, for channel estimation in MIMO systems with 1-bit ADCs and angular models, techniques based on optimization to recover the missing amplitudes [19], sparse recovery [20], and deep learning [21] have been proposed.

To summarize, existing works on channel estimation in MIMO systems with 1-bit spatial $\Sigma\Delta$ quantizers focus on unstructured models [13, 14], and they cannot be directly extended to angular channel models. Therefore, in this work, we focus on channel estimation with angular channel models in MIMO systems having 1-bit spatial $\Sigma\Delta$ quantizers.

I-B Contributions and main results

This paper is an extension of the precursor [1], wherein we presented a parametric channel estimation technique for SU-MIMO systems with a single line-of-sight (LoS) path. In this work, we extend [1] in several aspects to estimate multipath channels in SU-MIMO and MU-MIMO systems by leveraging angle steering in spatial $\Sigma\Delta$ quantizers and describe methods to choose the quantization voltage level. The major contributions and results are summarized as follows.

•

Quantization noise model: For channel estimation with angular models, as discussed before, Bussgang decomposition based linearization techniques cannot be used. Therefore, we derive a model for the quantization noise in 1-bit spatial $\Sigma\Delta$ quantizers based on the deterministic input-output relation in 1-bit temporal $\Sigma\Delta$ quantizers. Specifically, we derive a closed-form expression for the correlation matrix of the approximation error due to linearization of the 1-bit spatial $\Sigma\Delta$ quantizer. To do so, we use one of the main results of the paper that for most of the antenna elements in a large array, the quantization noise is uncorrelated with the corresponding input.
•

Channel estimation and quantization voltage selection: Leveraging the proposed quantization noise model, we develop algorithms to estimate multipath channels admitting angular models for SU-MIMO and MU-MIMO systems. We use uplink pilots to perform channel estimation at the BS equipped with a 1-bit spatial $\Sigma\Delta$ quantizer. For SU-MIMO and MU-MIMO systems, we present techniques to choose the quantizer voltage level, which, when not chosen carefully, leads to significant performance degradation due to the extreme quantization.

For channel estimation in SU-MIMO systems, i.e., to estimate the AoAs at the BS, AoDs of the paths from the UE, and path gains, we propose a two-step channel estimation algorithm, which is computationally efficient and has low overhead. In Step 1, the multi-antenna UE omnidirectionally transmits pilot symbols to the BS, which estimates the AoAs using a Bartlett beamformer and the complex path gains using a weighted least squares estimator. Since the AoDs are not known in Step 1, the proposed omnidirectional transmission ensures that sufficient power reaches the BS via all the paths. In addition, it allows us to choose a suitable quantization voltage level, which is essential for path gain estimation. Next, in Step 2, to estimate the AoDs, the UE transmits precoded pilot symbols using a sequence of adaptively chosen beamformers from a codebook hierarchically. To reduce the overhead, we assume a 1-bit feedback link between the BS and UE. We also provide a method to choose a quantization voltage level in Step 2. We show that with angle steering and the proposed voltage level, the beampatterns (from the designed codebook) as seen at the output of the 1-bit spatial $\Sigma\Delta$ quantizer is comparable to that at the input (i.e., without quantization), and thereby resulting in channel estimates that are on par with that of unquantized MIMO systems.

We then specialize the SU-MIMO channel estimation algorithm for MU-MIMO systems to estimate the AoAs and path gains at the BS. Specifically, we reformulate the MU-MIMO channel estimation problem using orthogonal pilots and separately estimate the single-input multiple-output (SIMO) channels between each single antenna UE and multi-antenna BS with a 1-bit spatial $\Sigma\Delta$ quantizer.
•

Performance: Through numerical simulations, performance of the proposed channel estimation algorithms, in terms of normalized mean squared error (NMSE), are found to be significantly better than that of the regular (non- $\Sigma\Delta$ ) 1-bit channel estimation algorithms and are comparable with MIMO systems having infinite resolution quantizers for most of the SNRs and for paths with angles not far from the array broadside. Performance of algorithms with 1-bit spatial $\Sigma\Delta$ quantizers is limited for paths with angles away from the array broadside because of the quantization noise shaping. The proposed channel estimation algorithm also performs better than the state-of-the-art unstructured channel estimation algorithm for MIMO systems with 1-bit $\Sigma\Delta$ quantizers [14], where we use as input to the unstructured channel estimation algorithm a realistic approximation of the channel correlation matrix.

Although we focus on channel estimation with angular models, the developed quantization noise model is useful for estimating unstructured channels using classical estimation techniques (e.g., using least squares) whenever unquantized channel correlation information is not available a priori.

I-C Organization and notation

The remainder of the paper is organized as follows. In Sections II and III, we describe 1-bit spatial $\Sigma\Delta$ quantizers and model the quantization noise in 1-bit spatial $\Sigma\Delta$ quantizers, respectively. In Section IV, we present the SU-MIMO and MU-MIMO system models, which we use for channel estimation. In Sections V and VI, we propose channel estimation algorithms for SU-MIMO and MU-MIMO systems, respectively. In Section VII, we discuss results from numerical experiments and conclude the paper in Section VIII.

Throughout the paper, we use lowercase letters to denote scalars and boldface lowercase (respectively, uppercase) to denote vectors (respectively, matrices). We use $(\cdot)^{*}$ , $(\cdot)^{\raisebox{1.2pt}{$\rm\scriptstyle T$}}$ , and $(\cdot)^{\raisebox{1.0pt}{$\rm\scriptscriptstyle H$}}$ to denote complex conjugation, transpose, and Hermitian (i.e., complex conjugate transpose) operations, respectively. $[{{\mathbf{x}}}]_{n}$ or $x_{n}$ denotes the $n$ -th entry of the vector ${{\mathbf{x}}}$ . ${\mathbf{A}}\odot{\mathbf{B}}$ denotes the Khatri-Rao (or columnwise Kronecker) product of matrices ${\mathbf{A}}$ and ${\mathbf{B}}$ . Since we restrict ourselves to a 1-bit spatial $\Sigma\Delta$ quantizer only at the BS, henceforth, we simply refer to it as a 1-bit spatial $\Sigma\Delta$ ADC.

Software to reproduce the results in this paper is available at https://ece.iisc.ac.in/~spchepuri/sw/SpatialSigmaDelta.zip.

II One-bit spatial sigma-delta ADC

In this section, we describe the architecture of a multi-channel first-order 1-bit spatial $\Sigma\Delta$ ADC with angle steering. Let us denote the input and output of an $N_{\rm r}$ channel 1-bit spatial $\Sigma\Delta$ ADC, at time $t$ , as ${{\mathbf{x}}}(t)=[x_{1}(t),x_{2}(t),\dots,x_{N_{\rm r}}(t)]^{\raisebox{1.2pt}{$\rm\scriptstyle T$}}\in{\mathbb{C}}^{N_{\rm r}}$ and ${{\mathbf{y}}}(t)=[y_{1}(t),y_{2}(t),\dots,y_{N_{\rm r}}(t)]^{\raisebox{1.2pt}{$\rm\scriptstyle T$}}\in{\mathbb{C}}^{N_{\rm r}}$ , respectively.

Refer to caption — Figure 1: The 1-bit spatial $\Sigma\Delta$ ADC architecture. $\mathcal{Q}_{b}$ is the 1-bit quantizer with level $b$ and $\mathcal{L}_{c}$ is the amplitude limiter with level $c$ .

Quantization noise from each antenna is fed back along with the input of the next antenna to realize the $\Sigma\Delta$ architecture in space. Due to the presence of the feedback, the quantization noise in a spatial $\Sigma\Delta$ ADC may become unbounded, overloading the output [22, 12]. To prevent overloading, the input signal, $x\in{\mathbb{C}}$ , is clipped using ${\mathcal{L}}_{c}(\cdot)$ with clipping level $c>0$ as

{\mathcal{L}}_{c}[x]={\rm sign}(\Re(x))\left\{|\Re(x)|\right\}_{c}+\jmath\,{\rm sign}(\Im(x))\left\{|\Im(x)|\right\}_{c},

where the operator $\left\{x\right\}_{c}$ is defined as $\left\{x\right\}_{c}=\max\{x,c\}$ and $\jmath=\sqrt{-1}$ . The clipped signal is then quantized using a 1-bit quantizer ${\mathcal{Q}}_{b}[\cdot]$ with quantization voltage level $b$ as

{\mathcal{Q}}_{b}[{\mathcal{L}}_{c}[x]]=b\,{\rm sign}(\Re({\mathcal{L}}_{c}[x]))+\jmath\,b\,{\rm sign}(\Im({\mathcal{L}}_{c}[x])).

Let $r_{n}(t)$ denote the input to channel $n$ of the ADC at time $t$ . Then the corresponding output $y_{n}(t)$ is given by

y_{n}(t)={\mathcal{Q}}_{b}[r_{n}(t)]=r_{n}(t)+e_{n}(t),

(1)

where $e_{n}(t)=y_{n}(t)-r_{n}(t)$ is the quantization error. The quantization error at channel $n$ is phase shifted by $\varphi$ and added to the input of channel $n+1$ as

r_{n}(t)={\mathcal{L}}_{c}[x_{n}(t)]-e^{\jmath{\varphi}}e_{n-1}(t),

(2)

where $\varphi=2\pi d\sin{\psi}$ with $\psi$ being the steering angle and $d$ being the inter-element spacing in the array in wavelengths. Since there is no feedback to the first channel, $e_{0}(t)=\leavevmode\nobreak\ 0$ . From (1) and (2), we have the recursion

	$\displaystyle y_{n}(t)$	$\displaystyle={\mathcal{Q}}_{b}\left[\sum_{k=1}^{n}e^{\jmath(n-k){\varphi}}{\mathcal{L}}_{c}[x_{k}(t)]\right.$
		$\displaystyle\hskip 85.35826pt\left.-\sum_{k=1}^{n-1}e^{\jmath(n-k){\varphi}}y_{k}(t)\right].$		(3)

We are interested in estimating parameters underlying the input signal ${{\mathbf{x}}}(t)$ from the output of the 1-bit spatial $\Sigma\Delta$ ADC ${{\mathbf{y}}}(t)$ . This is a challenging problem because of the cascade of two non-linearities in (II). Moreover, the information loss introduced by the 1-bit spatial $\Sigma\Delta$ ADC is mainly determined by the clipping level $c$ and the quantization level $b$ . For a given quantization level $b$ , we can prevent the system from overloading by choosing the clipping voltage $c$ as [12]

c=b(2-|{\rm cos}(\varphi)|-|{\rm sin}(\varphi)|).

(4)

A small value of $c$ leads to severe loss in the input information due to clipping, which cannot be compensated later by any choice of the quantization level $b$ . Hence, it is necessary to choose a sufficiently large $c$ to avoid information loss due to clipping. In this work, we propose to choose $c$ such that ${\rm Pr}\left(\|{\mathcal{L}}_{c}[{{\mathbf{x}}}(t)]-{{\mathbf{x}}}(t)\|_{2}>\delta\right)\leq\epsilon$ for appropriately selected constants $\delta,\epsilon>0$ . See Section V-C for more details on the selection of $c$ . With such a choice of $c$ , the output of the clipper can be approximated as ${\mathcal{L}}_{c}[x_{n}(t)]\approx x_{n}(t),\>n=1,2,\ldots,N_{\rm r}$ . Then the 1-bit spatial $\Sigma\Delta$ recursion in (II) simplifies to

{{\mathbf{y}}}(t)={\mathcal{Q}}_{b}[{\mathbf{U}}{{\mathbf{x}}}(t)-{\mathbf{V}}{{\mathbf{y}}}(t)],

(5)

where the $N_{\rm r}\times N_{\rm r}$ lower triangular matrices ${\mathbf{U}}$ and ${\mathbf{V}}$ are defined as

{\mathbf{U}}=\begin{bmatrix}1&&&&\\ e^{\jmath{\varphi}}&1&&&\\ \vdots&\ddots&\ddots&&\\ e^{\jmath(N_{\rm r}-1){\varphi}}&\cdots&e^{\jmath{\varphi}}&1\end{bmatrix}\quad\text{and}\quad{\mathbf{V}}={\mathbf{U}}-{\mathbf{I}}

with ${\mathbf{I}}$ being the $N_{\rm r}\times N_{\rm r}$ identity matrix. The architecture of the first-order 1-bit spatial $\Sigma\Delta$ ADC is shown in Fig. 1.

III Quantization noise modeling

Quantization is a non-linear and irreversible operation that makes its statistical analysis complicated. To simplify the analysis of a quantizer, the usual approach is to linearize the quantizer and account for the error due to linearization through additive noise, which is often assumed to be uniformly distributed and uncorrelated with the input of the quantizer [23, 22, 10, 11, 12]. However, this assumption is reasonable only for a multi-level quantizer with many levels and when the input has a sufficiently large dynamic range. This classical approach of modeling the approximation error due to linearization as additive uniform noise is not suitable for 1-bit quantizers. Therefore, in what follows, we develop a noise model for 1-bit spatial $\Sigma\Delta$ ADCs.

To begin with, we first express the input-output relation of a spatial $\Sigma\Delta$ ADC in terms of the ${\rm floor}[\cdot]$ function by drawing inspiration from [22], where a similar expression for the deterministic error in a temporal 1-bit $\Sigma\Delta$ quantizer was developed. Next, we propose to linearize the ${\rm floor}[\cdot]$ function by interpreting it as a multi-level quantizer to compute the second-order statistics of the quantization noise that is useful for decorrelating observations when solving parametric estimation and detection problems involving 1-bit spatial $\Sigma\Delta$ ADCs.

The error $e_{n}(t)$ in (1) due to 1-bit spatial $\Sigma\Delta$ quantization admits a closed-form expression as given in the next Lemma.

Lemma 1.

For a 1-bit spatial $\Sigma\Delta$ ADC with $\psi=0$ , the quantization error as a function of the input is given by


	$\displaystyle\Re({e_{n}(t)})=b-2b\left\langle\frac{1}{2}(n-1)+\frac{1}{2b}\sum\limits_{k=1}^{n}\Re(x_{k}(t))\right\rangle$		(6a)
	$\displaystyle\Im({e_{n}(t)})=b-2b\left\langle\frac{1}{2}(n-1)+\frac{1}{2b}\sum\limits_{k=1}^{n}\Im(x_{k}(t))\right\rangle$		(6b)

for $n=1,\ldots,N_{\rm{r}}$ . Here, $\langle\cdot\rangle$ is the fractional part function.

The above expressions are derived in the appendix. Let us collect the output of all the channels of the 1-bit spatial $\Sigma\Delta$ ADC in a vector ${{\mathbf{y}}}(t)$ and rewrite (5) as

{{\mathbf{y}}}(t)={\mathbf{U}}{{\mathbf{x}}}(t)-{\mathbf{V}}{{\mathbf{y}}}(t)+{{\mathbf{e}}}(t).

(7)

From Lemma 1, we have

{{\mathbf{e}}}(t)=2b\mu\boldsymbol{1}-2b\left\langle{\boldsymbol{\nu}}-\frac{1}{2b}{\mathbf{U}}{{\mathbf{x}}}(t)\right\rangle,

(8)

where ${\boldsymbol{\nu}}=\mu{\mathbf{V}}\boldsymbol{1}\in{\mathbb{C}}^{N_{\rm r}}$ with $\mu=0.5+\jmath 0.5$ , $\boldsymbol{1}$ is the length- $N_{\rm r}$ column vector with all ones, and the fractional part function is applied elementwise. Using the fact that $\langle x\rangle=x-{\rm floor}[x],\>\forall\>x\in\mathbb{R}$ , and from (7), the deterministic relation between the input and output of a 1-bit spatial $\Sigma\Delta$ ADC is given by

0.5b^{-1}{\mathbf{U}}{{\mathbf{y}}}(t)+{\boldsymbol{\nu}}-\mu\boldsymbol{1}={\rm floor}\left[0.5b^{-1}{\mathbf{U}}{{\mathbf{x}}}(t)+{\boldsymbol{\nu}}\right],

(9)

where we have transformed the non-linearity due to the 1-bit quantization ${\mathcal{Q}}_{b}[\cdot]$ to the non-linear ${\rm floor[\cdot]}$ function.

Now we leverage the fact that the input-output relation of a ${\rm floor}[\cdot]$ function is similar to that of a multi-level quantizer with a step size of one. For the spatial $\Sigma\Delta$ ADC channels (corresponding to antenna elements) with larger indices that are away from the $1$ st channel, the dynamic range of the real and imaginary parts of $[0.5b^{-1}{\mathbf{U}}{{\mathbf{x}}}(t)+{\boldsymbol{\nu}}]_{n}$ is large when compared to $[0,1)$ . This means that for antenna elements with indices away from the $1$ st element, the ${\rm floor}[\cdot]$ function acts like a multi-level quantizer with an input having a large dynamic range. Therefore, it is now reasonable to model the approximation error due to the linearization of the ${\rm floor}[\cdot]$ function as additive noise, ${{\mathbf{w}}}(t)\in\leavevmode\nobreak\ \mathbb{C}^{N_{\rm r}}$ , which is uniformly distributed and uncorrelated with the input for channels having larger indices, i.e., we have

{\rm floor}\left[0.5b^{-1}{\mathbf{U}}{{\mathbf{x}}}(t)+{\boldsymbol{\nu}}\right]=0.5b^{-1}{\mathbf{U}}{{\mathbf{x}}}(t)+{\boldsymbol{\nu}}+{{\mathbf{w}}}(t)

with the real and imaginary parts of $[{{\mathbf{w}}}(t)]_{n}\in[0,1)$ . Substituting in (9) yields

\mathbf{y}(t)=\mathbf{x}(t)+{{\mathbf{q}}}(t),

(10)

where ${{\mathbf{q}}}(t)=2b{\mathbf{U}}^{-1}({{{\mathbf{w}}}}(t)+\mu\boldsymbol{1})$ , which is an affine transformation of ${{\mathbf{w}}}(t)$ , is also uniformly distributed and $[{{\mathbf{q}}}(t)]_{n}$ is uncorrelated with the input $[{{\mathbf{x}}}(t)]_{n}$ for larger channel (or antenna) indices $n$ . In massive MIMO systems with large number of antennas, there are many antennas for which $x_{n}(t)$ and $[{{\mathbf{q}}}(t)]_{n}$ are uncorrelated. The correlation of ${[{{\mathbf{q}}}(t)]}_{n}=y_{n}(t)-x_{n}(t)$ with $x_{n}(t)$ , i.e., $\mathbb{E}[x_{n}(y_{n}-x_{n})^{*}]$ for an antenna array having $N_{\rm r}=128$ elements with an inter-element spacing of one eighth the signal wavelength is illustrated in Fig. † ‣ 2(a).

The covariance matrix of the noise vector ${{\mathbf{q}}}(t)$ in (10) is given by ${\mathbf{R}}_{q}={\mathbb{E}}\left[{{\mathbf{q}}}(t){{\mathbf{q}}}^{\raisebox{1.0pt}{$\rm\scriptscriptstyle H$}}(t)\right]=\frac{2b^{2}}{3}{\mathbf{U}}^{-1}{\mathbf{U}}^{\raisebox{1.0pt}{-$\rm\scriptscriptstyle H$}}$ . When $\psi=0$ , the matrix ${\mathbf{U}}^{-1}$ with ones on the main diagonal, $-1$ on the first sub-diagonal, and zeros elsewhere, is a spatial high-pass filter, which shapes the quantization noise to higher spatial frequencies. We illustrate the angular power spectrum of the quantization noise, i.e., $\mathbb{E}[|{{\mathbf{a}}}_{\rm BS}^{\raisebox{1.0pt}{$\rm\scriptscriptstyle H$}}(\theta)({{\mathbf{y}}}(t)-{{\mathbf{x}}}(t))|^{2}]$ , for different values of the steering angle in Fig. † ‣ 2(b), where ${{\mathbf{a}}}_{\rm BS}(\theta)=[1,e^{-\jmath 2\pi d{\rm sin}(\theta)},\ldots,e^{-\jmath(N_{\rm r}-1)2\pi d{\rm sin}(\theta)}]^{\raisebox{1.2pt}{$\rm\scriptstyle T$}}$ is the steering vector of the uniform linear array (ULA) having $N_{\rm r}$ elements at the BS with an inter-element spacing $d$ wavelengths. We can see that the quantization noise is very small for the directions around the steering angle.

Before we end this section, we make the following two remarks. Although in the development of the noise model, the spatial $\Sigma\Delta$ ADC is steered to the array broadside with $\psi=0$ for ease of exposition, introducing a phase shift in the feedback does not alter the uncorrelatedness between the input and the quantization noise as can be seen in Fig. † ‣ 2(a). Next, for the ${\rm floor}[\cdot]$ function in (9) to behave as a multi-level quantizer, the quantization voltage level $b$ plays an important role. When $b$ is very large, the dynamic range of the real and imaginary parts of the argument of the ${\rm floor}[\cdot]$ function, i.e., $0.5b^{-1}\mathbf{U}\mathbf{x}(t)+{\boldsymbol{\nu}}$ , can be very small for which the uniform noise or uncorrelated noise assumption might fail. Hence, carefully selecting $b$ (as discussed in Section V-C) becomes crucial.

Next, we use the developed spatial $\Sigma\Delta$ signal and noise models to estimate MIMO channels with angular models.

IV Angular channel model

In this paper, we consider the two commonly encountered SU-MIMO and MU-MIMO settings in MIMO communications. In the SU-MIMO setting, a single UE with $N_{\rm t}$ antennas communicates with a multi-antenna BS, whereas in the MU-MIMO setting, $K$ single antenna UEs communicate with a multi-antenna BS. In both these settings, the BS has a ULA with $N_{\rm r}$ antennas and it receives and processes uplink training pilots.

Let us collect the uplink training pilots in ${\mathbf{S}}=[{{\mathbf{s}}}(1),{{\mathbf{s}}}(2),\ldots,{{\mathbf{s}}}(T)]$ , where $T$ is the pilot length. Without loss of generality, let us assume that the columns of the pilot matrix have unit norm. Let $P$ denote the total transmit power at the UE and ${\mathbf{H}}$ denote the MIMO channel matrix. The signal received at the BS prior to quantization, denoted as ${\mathbf{X}}=[{{\mathbf{x}}}(1),{{\mathbf{x}}}(2),\ldots,{{\mathbf{x}}}(T)]\in{\mathbb{C}}^{N_{\rm r}\times T}$ , can be expressed as

{\mathbf{X}}=\sqrt{P}{\mathbf{H}}{\mathbf{S}}+{\mathbf{Z}},

(11)

where $P$ is also the uplink SNR of the system and ${\mathbf{Z}}\in{\mathbb{C}}^{N_{\rm r}\times T}$ is the additive white Gaussian receiver noise matrix with entries $[{\mathbf{Z}}]_{ij}\sim{\mathcal{C}}{\mathcal{N}}(0,1)$ . The received signal is then quantized using an $N_{\rm r}$ -channel 1-bit spatial $\Sigma\Delta$ ADC to obtain ${\mathbf{Y}}=[{{\mathbf{y}}}(1),{{\mathbf{y}}}(2),\ldots,{{\mathbf{y}}}(T)]\in{\mathbb{C}}^{N_{\rm r}\times T}$ , which is given by

\displaystyle{\mathbf{Y}}={\mathcal{Q}}_{b}[{\mathbf{U}}{\mathbf{X}}-{\mathbf{V}}{\mathbf{Y}}]={\mathcal{Q}}_{b}[\sqrt{P}{\mathbf{U}}{\mathbf{H}}{\mathbf{S}}-{\mathbf{V}}{\mathbf{Y}}+{\mathbf{U}}{\mathbf{Z}}].

Next, we parameterize ${\mathbf{H}}$ by assuming a narrowband spatially sparse model for MU-MIMO and SU-MIMO systems.

IV-A SU-MIMO channel model

Suppose there are $L$ scatterers that result in $L$ paths between the UE and BS. Let us denote the AoDs of the $L$ paths from the UE by $\boldsymbol{\phi}=[\phi_{1},\ldots,\phi_{L}]^{\raisebox{1.2pt}{$\rm\scriptstyle T$}}$ , the AoAs of the $L$ paths at the BS by $\boldsymbol{\theta}=[\theta_{1},\ldots,\theta_{L}]^{\raisebox{1.2pt}{$\rm\scriptstyle T$}}$ , and the complex path gains by $\boldsymbol{\alpha}=[\alpha_{1},\ldots,\alpha_{L}]^{\raisebox{1.2pt}{$\rm\scriptstyle T$}}$ . The MIMO channel matrix is then expressed in terms of these parameters as

	$\displaystyle{\mathbf{H}}$	$\displaystyle=\frac{1}{\sqrt{L}}\sum_{k=1}^{L}\alpha_{k}{{\mathbf{a}}}_{\rm BS}(\theta_{k}){{\mathbf{a}}}_{\rm UE}^{\raisebox{1.0pt}{$\rm\scriptscriptstyle H$}}(\phi_{k})$
		$\displaystyle=\frac{1}{\sqrt{L}}{\mathbf{A}}_{\rm BS}(\boldsymbol{\theta}){\rm diag}(\boldsymbol{\alpha}){\mathbf{A}}_{\rm UE}^{\raisebox{1.0pt}{$\rm\scriptscriptstyle H$}}(\boldsymbol{\phi}),$		(12)

where ${\mathbf{A}}_{\rm BS}(\boldsymbol{\theta})=[{{\mathbf{a}}}_{\rm BS}(\theta_{1}),\ldots,{{\mathbf{a}}}_{\rm BS}(\theta_{L})]\in{\mathbb{C}}^{N_{\rm r}\times L}$ is the array manifold of the ULA at the BS and ${\mathbf{A}}_{\rm UE}(\boldsymbol{\phi})=[{{\mathbf{a}}}_{\rm UE}(\phi_{1}),\ldots,{{\mathbf{a}}}_{\rm UE}(\phi_{L})]\in{\mathbb{C}}^{N_{\rm t}\times L}$ is the array manifold at the UE. The columns of ${\mathbf{A}}_{\rm UE}(\boldsymbol{\phi})$ contain the array response vector of the critically spaced ULA at the UE with $N_{\rm t}$ elements, and is given by ${{\mathbf{a}}}_{\rm UE}(\phi)=[1,e^{-\jmath\pi{\rm sin}(\phi)},\ldots,e^{-\jmath(N_{\rm t}-1)\pi{\rm sin}(\phi)}]^{\raisebox{1.2pt}{$\rm\scriptstyle T$}}$ . Similarly, the columns of ${\mathbf{A}}_{\rm BS}(\boldsymbol{\theta})$ contain the array response vector of the oversampled ULA at the BS with an inter-element spacing $d$ wavelengths, and is given by ${{\mathbf{a}}}_{\rm BS}(\theta)=[1,e^{-\jmath 2\pi d{\rm sin}(\theta)},\ldots,e^{-\jmath(N_{\rm r}-1)2\pi d{\rm sin}(\theta)}]^{\raisebox{1.2pt}{$\rm\scriptstyle T$}}$ .

IV-B MU-MIMO channel model

Suppose that there are $K$ single antenna users. The SIMO channel ${{\mathbf{h}}}_{k}\in{\mathbb{C}}^{N_{\rm r}}$ between the $k$ th UE and the BS with $L_{k}$ paths is given by

{{\mathbf{h}}}_{k}=\frac{1}{\sqrt{L_{k}}}\sum_{j=1}^{L_{k}}\alpha_{k,j}{{\mathbf{a}}}_{\rm BS}(\theta_{k,j})=\frac{1}{\sqrt{L_{k}}}{\mathbf{A}}_{\rm BS}(\boldsymbol{\theta}_{k})\boldsymbol{\alpha}_{k},

(13)

where ${\mathbf{A}}_{\rm BS}(\boldsymbol{\theta}_{k})=[{{\mathbf{a}}}_{\rm BS}(\theta_{k,1}),\ldots,{{\mathbf{a}}}_{\rm BS}(\theta_{k,L_{k}})]\in{\mathbb{C}}^{N_{\rm r}\times L_{k}}$ denotes the array manifold at the BS, $\boldsymbol{\theta}_{k}=[\theta_{k,1},\ldots,\theta_{k,L_{k}}]^{\raisebox{1.2pt}{$\rm\scriptstyle T$}}$ collects AoAs of paths from $k$ th UE at the BS, and $\boldsymbol{\alpha}_{k}=[\alpha_{k,1},\ldots,\alpha_{k,L_{k}}]^{\raisebox{1.2pt}{$\rm\scriptstyle T$}}\in{\mathbb{C}}^{L_{k}}$ denotes the corresponding complex path gains. Then the overall channel matrix for the MU-MIMO system is given by

{\mathbf{H}}=\begin{bmatrix}{{\mathbf{h}}}_{1}&{{\mathbf{h}}}_{2}&\ldots&{{\mathbf{h}}}_{K}\end{bmatrix}\in{\mathbb{C}}^{N_{\rm r}\times K}.

Our aim is to estimate the MIMO channels (12) and (13) by estimating the underlying angles (namely, the AoAs and AODs) and path gains from the output of the 1-bit spatial $\Sigma\Delta$ ADC at the BS given the uplink pilots ${\mathbf{S}}$ , the SNR $P$ , and the number of paths.

V SU-MIMO channel estimation

In this section, we present the proposed algorithm for channel estimation with angular models in SU-MIMO systems having 1-bit spatial $\Sigma\Delta$ ADCs at the BS. Specifically, we propose a two-step algorithm to estimate the channel parameters $(\boldsymbol{\theta},\boldsymbol{\phi},\boldsymbol{\alpha})$ from uplink pilots. In the first step, we estimate the AoAs (i.e., $\boldsymbol{\theta}$ ) and the path gains (i.e., $\boldsymbol{\alpha}$ ) using precoded uplink pilots, which excite all the angles uniformly. Next, in the second step, to estimate the AoDs, i.e., $\boldsymbol{\phi}$ , we select precoders from a codebook using a recursive bisection procedure that leverages 1-bit feedback between the BS and UE to reduce the number of channel uses and thus the channel estimation overhead.

As we discuss later, it is crucial to carefully select the clipping voltage levels to benefit from the advantages of the 1-bit spatial $\Sigma\Delta$ ADCs. The proposed two-step channel estimation procedure is developed keeping in mind the dependence of the unknown parameters on voltage level selection.

V-A Step 1: AoA and path gain estimation

V-A1 AoA estimation

To estimate the AoAs at the BS, the UE transmits precoded pilot symbols ${{\mathbf{s}}}(t)={{\mathbf{p}}}_{1}(t)$ for $t=1,\ldots,T_{1}$ , where ${{\mathbf{p}}}_{1}(t)\in{\mathbb{C}}^{N_{\rm t}}$ is the precoder with $\|{{\mathbf{p}}}_{1}(t)\|_{2}=1$ so that the total transmit power is $P$ . As we do not yet know the AoDs, instead of selecting ${{\mathbf{p}}}_{1}(t)$ to focus energy in specific directions, we select it such that all the departure angles are excited uniformly as

{{\mathbf{p}}}_{1}^{\raisebox{1.0pt}{$\rm\scriptscriptstyle H$}}(t){{\mathbf{a}}}_{\rm UE}(\tilde{\phi}_{d})=1,\,d=1,2,\ldots,D,

(14)

where ${\mathcal{D}}=\{\tilde{\phi}_{1},\tilde{\phi}_{2},\ldots,\tilde{\phi}_{D}\}$ denotes the set of $D$ candidate AoDs. An obvious choice of ${{\mathbf{p}}}_{1}(t)$ that satisfies (14) is

{{\mathbf{p}}}_{1}(t)={{\mathbf{p}}}_{1}=\begin{bmatrix}1&0&\ldots&0\end{bmatrix}^{\raisebox{1.2pt}{$\rm\scriptstyle T$}},\,\,t=1,\ldots,T_{1}.

(15)

This means that we turn off all the antennas at the UE except the first one to perform an omnidirectional transmission.

From (10), the symbols received at the BS in Step 1 can be compactly expressed as

	$\displaystyle{\mathbf{Y}}_{1}$	$\displaystyle=\sqrt{\frac{P}{L}}{\mathbf{A}}_{\rm BS}(\boldsymbol{\theta}){\rm diag}(\boldsymbol{\alpha}){\mathbf{A}}_{\rm UE}^{\raisebox{1.0pt}{$\rm\scriptscriptstyle H$}}(\boldsymbol{\phi}){\mathbf{S}}+{\mathbf{N}}$
		$\displaystyle=\sqrt{\frac{P}{L}}{\mathbf{A}}_{\rm BS}(\boldsymbol{\theta}){\rm diag}(\boldsymbol{\alpha}){\mathbf{E}}+{\mathbf{N}},$		(16)

where ${\mathbf{S}}={{\mathbf{p}}}_{1}{\bf 1}^{\raisebox{1.2pt}{$\rm\scriptstyle T$}}\in{\mathbb{C}}^{N_{\rm t}\times T_{1}}$ is the transmitted pilot matrix and the effective noise matrix ${\mathbf{N}}=[{{\mathbf{n}}}(1),\cdots,{{\mathbf{n}}}(T_{1})]\in{\mathbb{C}}^{N_{\rm r}\times T_{1}}$ is defined as ${\mathbf{N}}={\mathbf{Z}}+{\mathbf{Q}}$ is the sum of the receiver noise ${\mathbf{Z}}$ and the quantization noise ${\mathbf{Q}}=[{{\mathbf{q}}}(1),\cdots,{{\mathbf{q}}}(T_{1})]$ . The $L\times T_{1}$ matrix ${\mathbf{E}}{=}{\mathbf{A}}_{\rm UE}^{\raisebox{1.0pt}{$\rm\scriptscriptstyle H$}}(\boldsymbol{\phi}){\mathbf{S}}={\mathbf{A}}_{\rm UE}^{\raisebox{1.0pt}{$\rm\scriptscriptstyle H$}}(\boldsymbol{\phi}){{\mathbf{p}}}_{1}{\bf 1}^{\raisebox{1.2pt}{$\rm\scriptstyle T$}}\in{\mathbb{C}}^{L\times T_{1}}$ has all one entries. Thus, the precoder in (15) makes ${\mathbf{Y}}_{1}$ independent of the AoDs.

The AoAs can now be estimated from ${\mathbf{Y}}_{1}$ using standard direction-finding techniques like Bartlett beamforming, minimum variance distortionless response (MVDR) beamforming [24], sparse recovery, or using a maximum likelihood based estimator. Since a massive MIMO BS typically has a large number (of the order of hundreds) of antennas, we may use a computationally less intensive method, such as the Bartlett beamforming for AoA estimation. Specifically, the AoA estimates, denoted by $\{\hat{\theta}_{k}\}_{k=1}^{L}$ , are the locations of the $L$ local maxima of the Bartlett spatial spectrum

\mathcal{J}(\theta)={{\mathbf{a}}}_{\rm BS}^{\raisebox{1.0pt}{$\rm\scriptscriptstyle H$}}(\theta){\mathbf{R}}_{y1}{{\mathbf{a}}}_{\rm BS}(\theta),

(17)

where ${\mathbf{R}}_{y1}=\frac{1}{T_{1}}{\mathbf{Y}}_{1}{\mathbf{Y}}_{1}^{\raisebox{1.0pt}{$\rm\scriptscriptstyle H$}}$ is the sample covariance matrix.

V-A2 Path gain estimation

Next, to estimate the path gains, we form the estimated array manifold at the BS as $\hat{{\mathbf{A}}}_{\rm BS}=[{{\mathbf{a}}}_{\rm BS}(\hat{\theta}_{1}),{{\mathbf{a}}}_{\rm BS}(\hat{\theta}_{2}),\cdots,{{\mathbf{a}}}_{\rm BS}(\hat{\theta}_{L})]$ and approximate (16) as

\hat{{\mathbf{Y}}}_{1}=\sqrt{\frac{P}{L}}\hat{{\mathbf{A}}}_{\rm BS}{\rm diag}(\boldsymbol{\alpha}){\mathbf{E}}+{\mathbf{N}}.

(18)

Recall from the quantization noise modeling in Section III that the covariance matrix of the quantization noise is given by ${\mathbf{R}}_{q}=\frac{2b^{2}}{3}{\mathbf{U}}^{-1}{\mathbf{U}}^{\raisebox{1.0pt}{-$\rm\scriptscriptstyle H$}}$ and that the quantization noise is uncorrelated with the input. Thus the covariance matrix of the effective noise term in (16) is

{\mathbf{R}}_{n}=\mathbb{E}[{{\mathbf{n}}}(t){{\mathbf{n}}}^{\raisebox{1.0pt}{$\rm\scriptscriptstyle H$}}(t)]=\mathbf{I}+\frac{2b^{2}}{3}\mathbf{U}^{-1}\mathbf{U}^{\raisebox{1.0pt}{-$\rm\scriptscriptstyle H$}}.

(19)

We now prewhiten the observations $\hat{{\mathbf{Y}}}_{1}$ to obtain

{{\mathbf{Y}}}_{1}^{\prime}={\mathbf{R}}_{n}^{-1/2}\hat{{\mathbf{Y}}}_{1}=\sqrt{\frac{P}{L}}{\mathbf{R}}_{n}^{-1/2}\hat{{\mathbf{A}}}_{\rm BS}{\rm diag}(\boldsymbol{\alpha}){\mathbf{E}}+{\mathbf{N}}^{\prime},

where ${\mathbf{R}}_{n}^{-1/2}$ is the prewhitening matrix, which can be obtained using an eigenvalue decomposition of ${\mathbf{R}}_{n}$ and ${\mathbf{N}}^{\prime}={\mathbf{R}}_{n}^{-1/2}{\mathbf{N}}$ is the whitened noise term. Using the property that ${\rm vec}({\mathbf{A}}{\rm{diag}({{\mathbf{b}}})}{\mathbf{C}})=({\mathbf{C}}^{\raisebox{1.2pt}{$\rm\scriptstyle T$}}\odot{\mathbf{A}}){{\mathbf{b}}}$ , we have ${\rm vec}({{\mathbf{Y}}}_{1}^{\prime})=\boldsymbol{\Psi}\boldsymbol{\alpha}+{\rm vec}({\mathbf{N}}^{\prime})$ with $\boldsymbol{\Psi}=({\mathbf{E}}^{\raisebox{1.2pt}{$\rm\scriptstyle T$}}\odot\sqrt{\frac{P}{L}}{\mathbf{R}}_{n}^{-1/2}\hat{{\mathbf{A}}}_{\rm BS})\in{\mathbb{C}}^{T_{1}N_{\rm r}\times L}$ and $T_{1}N_{\rm r}\gg L$ as the BS usually has large number of antennas and the mmWave MIMO channel is sparse in the angular domain. The path gains can then be estimated using least squares as

\hat{\boldsymbol{\alpha}}=(\boldsymbol{\Psi}^{\raisebox{1.0pt}{$\rm\scriptscriptstyle H$}}\boldsymbol{\Psi})^{-1}\boldsymbol{\Psi}^{\raisebox{1.0pt}{$\rm\scriptscriptstyle H$}}{\rm vec}({{\mathbf{Y}}}_{1}^{\prime}).

(20)

V-B Step 2: AoD estimation

Now what remains is to estimate the AoDs. To do so, we propose a recursive bisection procedure, which divides the spatial sector into two subsectors at each stage and measures the power of the signal received in the direction corresponding to the estimated AoA. The method selects the subsector with the largest received power as the new sector to be used in the next bisection stage. This procedure is continued till the desired subsector resolution is obtained and is repeated for each of the estimated AoAs corresponding to $L$ paths.

Let us denote the precoder that we use in Step 2 by ${{\mathbf{p}}}$ with $\|{{\mathbf{p}}}\|=1$ , as before. Let us also define the inner product between ${{\mathbf{a}}}_{\rm UE}(\phi_{l})$ and ${{\mathbf{p}}}$ as $\rho_{l}({{\mathbf{p}}})={{\mathbf{a}}}_{\rm UE}^{\raisebox{1.0pt}{$\rm\scriptscriptstyle H$}}(\phi_{l}){{\mathbf{p}}}$ , where we have $0\leq|\rho_{l}({{\mathbf{p}}})|\leq\sqrt{N_{\rm t}}$ and when ${{\mathbf{p}}}=\frac{1}{\sqrt{N_{\rm t}}}{{\mathbf{a}}}_{\rm UE}(\phi_{l})$ , the upper bound is achieved with equality. Therefore, by choosing a precoder ${{\mathbf{p}}}$ that yields the maximum $\rho_{l}({{\mathbf{p}}})$ , we can indirectly refine the AoD sector of the $l$ th path to compute the AoD.

Suppose the UE transmits $T_{2}$ precoded symbols ${\mathbf{S}}={{\mathbf{p}}}{\bf 1}^{\raisebox{1.2pt}{$\rm\scriptstyle T$}}\in\mathbb{C}^{N_{\rm t}\times T_{2}}$ , where the subscript “2” denotes Step 2 of the algorithm. Using the combiner ${{\mathbf{c}}}_{l}=\frac{1}{\sqrt{N_{\rm r}}}{{\mathbf{a}}}_{\rm BS}({\theta}_{l})$ , the component of the received uplink pilot arriving along the $l$ th path at the BS can be expressed as

$\displaystyle{{\mathbf{c}}}_{l}^{\raisebox{1.0pt}{$\rm\scriptscriptstyle H$}}{\mathbf{Y}}$	$\displaystyle=\sqrt{\frac{P}{L}}\sum_{k=1}^{L}\alpha_{k}{{\mathbf{c}}}_{l}^{\raisebox{1.0pt}{$\rm\scriptscriptstyle H$}}{{\mathbf{a}}}_{\rm BS}(\theta_{k}){{\mathbf{a}}}_{\rm UE}^{\raisebox{1.0pt}{$\rm\scriptscriptstyle H$}}(\phi_{k}){{\mathbf{p}}}\boldsymbol{1}^{\raisebox{1.2pt}{$\rm\scriptstyle T$}}+{{\mathbf{c}}}_{l}^{\raisebox{1.0pt}{$\rm\scriptscriptstyle H$}}{\mathbf{N}}$
	$\displaystyle=\sqrt{\frac{P}{L}}\alpha_{l}{{\mathbf{c}}}_{l}^{\raisebox{1.0pt}{$\rm\scriptscriptstyle H$}}{{\mathbf{a}}}_{\rm BS}(\theta_{l})\rho_{l}({{\mathbf{p}}})\boldsymbol{1}^{\raisebox{1.2pt}{$\rm\scriptstyle T$}}+{{\mathbf{c}}}_{l}^{\raisebox{1.0pt}{$\rm\scriptscriptstyle H$}}{\mathbf{N}}$
	$\displaystyle\quad\quad\quad+\sqrt{\frac{P}{L}}\sum_{k=1,k\neq l}^{L}\alpha_{k}{{\mathbf{c}}}_{l}^{\raisebox{1.0pt}{$\rm\scriptscriptstyle H$}}{{\mathbf{a}}}_{\rm BS}(\theta_{k})\rho_{k}({{\mathbf{p}}})\boldsymbol{1}^{\raisebox{1.2pt}{$\rm\scriptstyle T$}}.$	(21)

Since we have a large array at the BS and assuming that the AoAs are sufficiently separated, we can approximate ${{\mathbf{c}}}_{l}^{\raisebox{1.0pt}{$\rm\scriptscriptstyle H$}}{{\mathbf{a}}}_{\rm BS}(\theta_{k})=\frac{1}{\sqrt{N_{\rm r}}}{{\mathbf{a}}}_{\rm BS}^{\raisebox{1.0pt}{$\rm\scriptscriptstyle H$}}({\theta}_{l}){{\mathbf{a}}}_{\rm BS}(\theta_{k})\approx 0$ for $l\neq k$ . Using this approximation and multiplying both sides of (21) with ${\bf 1}$ , we have

{{\mathbf{c}}}_{l}^{\raisebox{1.0pt}{$\rm\scriptscriptstyle H$}}{\mathbf{Y}}{\bf 1}=\sqrt{\frac{PN_{\rm r}}{L}}T_{2}\alpha_{l}\rho_{l}({{\mathbf{p}}})+{{\mathbf{c}}}_{l}^{\raisebox{1.0pt}{$\rm\scriptscriptstyle H$}}{\mathbf{N}}{\bf 1},

(22)

which allows us to compute the energy of the $T_{2}$ received symbols arriving from the $l$ th path as

E_{l}({{\mathbf{p}}})=\left|\frac{1}{T_{2}}{{\mathbf{c}}}_{l}^{\raisebox{1.0pt}{$\rm\scriptscriptstyle H$}}{\mathbf{Y}}{\bf 1}\right|^{2}.

(23)

In practice, we compute $E_{l}({{{\mathbf{p}}}})$ using the estimate $\hat{\theta}_{l}$ from Step 1 to form the combiner as ${{{\mathbf{c}}}}_{l}=\frac{1}{\sqrt{N_{\rm r}}}{{\mathbf{a}}}_{\rm BS}(\hat{\theta}_{l})$ .

We can estimate the AoDs by finding precoders, ${{\mathbf{p}}}$ , that maximize $E_{l}({{{\mathbf{p}}}})$ for each path $l=1,\ldots,L$ , where recall that the structure of the precoder vector is constrained to ${{\mathbf{p}}}=\frac{1}{\sqrt{N_{\rm t}}}{{\mathbf{a}}}_{\rm UE}(\phi)$ . Here, we assume that the number of paths, $L$ , is known. Performing this maximization by exhaustively searching over all the departure angles in a predefined grid ${{\mathcal{D}}}$ results in excessive training overhead. Therefore, we present a recursive bisection procedure, where for each path $l$ , we start with a wide beam and progressively refine it to maximize $E_{l}({{{\mathbf{p}}}})$ .

V-B1 Design of the precoder codebook

Before we describe the proposed recursive biscection procedure, we first provide the design of a codebook that contains precoders required for each stage of the procedure. Let us discretize the departure angular domain into $D$ grid points ${\mathcal{D}}=\{\tilde{\phi}_{1},\tilde{\phi}_{2},\cdots,\tilde{\phi}_{D}\}$ by uniformly sampling the direction cosine space so that $\tilde{\phi}_{d}={\rm sin}^{-1}\left(-1+\frac{2}{D-1}(d-1)\right)$ for $d=1,2,\ldots,D$ . Let us define the index set ${\mathcal{I}}_{s,i}$ as

{\mathcal{I}}_{s,i}=\left\{\frac{D(i-1)}{2^{s}}+1,\frac{D(i-1)}{2^{s}}+2,\ldots,\frac{Di}{2^{s}}\right\}

for $i=1,2,\ldots,2^{s}$ with $|{\mathcal{I}}_{s,i}|=\frac{D}{2^{s}}$ . Next, we the partition the set ${\mathcal{D}}$ into $2^{s}$ partitions for the $s$ th stage of the recursive procedure with a total number of $N_{\rm s}={\rm log}_{2}(D)$ stages. In other words, at Stage $s$ , we have $2^{s}$ spatial sectors with the $i$ th sector formed from angles $\tilde{\phi}_{i}\in{\mathcal{I}}_{s,i}$ and in the last stage, i.e., at Stage $N_{\rm s}$ , we have sectors with $D$ angular grid points in $\mathcal{D}$ . For example, in the first stage, we have two spatial sectors $\{\tilde{\phi}_{1},\tilde{\phi}_{2},\cdots,\tilde{\phi}_{\frac{D}{2}}\}$ and $\{\tilde{\phi}_{\frac{D}{2}+1},\tilde{\phi}_{\frac{D}{2}+2},\cdots,\tilde{\phi}_{D}\}$ .

Let ${{\mathbf{p}}}_{s,i}$ denote the $i$ th precoder for the $s$ th stage. The precoders are designed to focus on a desired angular sector as

{{\mathbf{a}}}_{\rm UE}^{\raisebox{1.0pt}{$\rm\scriptscriptstyle H$}}(\tilde{\phi}_{j}){{\mathbf{p}}}_{s,i}=\begin{cases}1,&\quad j\in{\mathcal{I}}_{s,i},\\ 0,&\quad\text{otherwise}\end{cases}

(24)

for $s=1,2,\ldots,N_{\rm s}$ and $i=1,2,\ldots,2^{s}$ . Defining the dictionary matrix ${\mathbf{D}}=[{{\mathbf{a}}}_{\rm UE}(\tilde{\phi}_{1}),\ldots,{{\mathbf{a}}}_{\rm UE}(\tilde{\phi}_{D})]^{\raisebox{1.0pt}{$\rm\scriptscriptstyle H$}}\in{\mathbb{C}}^{D\times N_{\rm t}}$ with $D\gg N_{\rm t}$ , and the precoder matrix ${\mathbf{P}}_{s}=[{{\mathbf{p}}}_{s,1},{{\mathbf{p}}}_{s,2},\ldots,{{\mathbf{p}}}_{s,2^{s}}]\leavevmode\nobreak\ \in\leavevmode\nobreak\ \mathbb{C}^{N_{\rm t}\times 2^{s}}$ , we can compactly express (24) as the linear system

{\mathbf{D}}{\mathbf{P}}_{s}=\mbox{$\mbox{\boldmath$\Psi$}$}_{s},

(25)

where the desired beampattern matrix $\mbox{$\mbox{\boldmath$\Psi$}$}_{s}=[\mathbbm{1}_{s,1},\cdots,\mathbbm{1}_{s,2^{s}}]\in\{1,0\}^{D\times 2^{s}}$ with $\mathbbm{1}_{s,i}\in\{0,1\}^{D}$ being the indicator vector with entries equal to one at locations indexed by the set ${\mathcal{I}}_{s,i}$ . Then the precoders for the $s$ th stage can be computed using least squares as

{\mathbf{P}}_{s}=({\mathbf{D}}^{\raisebox{1.0pt}{$\rm\scriptscriptstyle H$}}{\mathbf{D}})^{-1}{\mathbf{D}}^{\raisebox{1.0pt}{$\rm\scriptscriptstyle H$}}\mbox{$\mbox{\boldmath$\Psi$}$}_{s}.

(26)

To obtain unit-norm precoding vectors, we normalize each column of ${\mathbf{P}}_{s}$ to unity. We repeat this procedure for $s=1,2,\ldots,N_{\rm s}$ to compute precoders for all the stages. This completes the design of the precoder codebook.

In Fig. 3(a), we show the beampattern $|{{\mathbf{a}}}_{\rm UE}^{\raisebox{1.0pt}{$\rm\scriptscriptstyle H$}}(\phi){{\mathbf{p}}}|^{2}$ for $\phi\in[0,2\pi]$ corresponding to Stage 1 with ${{\mathbf{p}}}\in\{{{\mathbf{p}}}_{1,1},{{\mathbf{p}}}_{1,2}\}$ and Stage 2 with ${{\mathbf{p}}}\in\{{{\mathbf{p}}}_{2,1},{{\mathbf{p}}}_{2,2},{{\mathbf{p}}}_{2,3},{{\mathbf{p}}}_{2,4}\}$ of the codebook. In Figs. 3(b) and 3(c), we show the beampatterns corresponding to the first two stages as seen by the BS without any quantizer, i.e., $\left|{{\mathbf{c}}}^{\raisebox{1.0pt}{$\rm\scriptscriptstyle H$}}{{\mathbf{x}}}\right|^{2}$ and by the BS with 1-bit $\Sigma\Delta$ ADC, i.e., $\left|{{\mathbf{c}}}^{\raisebox{1.0pt}{$\rm\scriptscriptstyle H$}}{{\mathbf{y}}}\right|^{2}$ , respectively, where ${{\mathbf{x}}}=\sqrt{P}\alpha{{\mathbf{a}}}_{\rm BS}(\theta){{\mathbf{a}}}_{\rm UE}^{\raisebox{1.0pt}{$\rm\scriptscriptstyle H$}}(\phi){{\mathbf{p}}}+{{\mathbf{z}}}$ as in (11), ${{\mathbf{y}}}={{\mathbf{x}}}+{{\mathbf{q}}}$ as in (10), and we vary $\phi$ and ${{\mathbf{p}}}$ as before. Here, we use $\theta=30^{\circ}$ , ${{\mathbf{c}}}=\frac{1}{\sqrt{N_{\rm r}}}{{\mathbf{a}}}_{\rm BS}(30^{\circ})$ , $|\alpha|=1$ , and SNR of 10 dB. We see that the received beampattern in Fig. 3(c) is significantly distorted as the 1-bit $\Sigma\Delta$ ADC is not steered to the $\theta$ , i.e., $\psi\neq\theta$ , and more importantly, because the quantization voltage level $b$ is set to an arbitrary level. While we can steer the 1-bit $\Sigma\Delta$ ADC based on the AoA estimate from Step 1 as $\psi=\hat{\theta}$ , we emphasize that a procedure to select an appropriate voltage level $b$ is crucial. Before describing the procedure to select $b$ , we next present the recursive bisection procedure to estimate the AoDs.

V-B2 Recursive bisection procedure

We estimate the AoDs of the $l$ th path by maximizing $E_{l}({{{\mathbf{p}}}})$ in (23) by recursively bisecting the spatial sector and selecting a subsector that yields the highest received energy. The BS informs via a 1-bit error-free feedback link the selected subsector to the UE, which further bisects the selected subsector to transmit pilots for the next stage. The procedure is continued for $N_{\rm s}$ stages, where at the last stage, we select one of the angular grid points in $\mathcal{D}$ as the estimated AoD of the $l$ th path. We repeat this procedure for all the $L$ paths.

To estimate the AoD of the $l$ th path, we proceed as follows. Recall from (24) that ${{\mathbf{p}}}_{s,i}$ has a unit response in the sector formed by departure angles $\tilde{\phi}_{i}\in\mathcal{I}_{s,i}$ . In the first stage, the UE transmits pilots using precoders ${{\mathbf{p}}}_{1,1}$ and ${{\mathbf{p}}}_{1,2}$ . The BS then computes the energy of the received symbols using (23) and selects the subsector (or the precoder) that yeilds $\max\,\{E_{l}({{\mathbf{p}}}_{1,1}),E_{l}({{\mathbf{p}}}_{1,2})\}$ . If $E_{l}({{\mathbf{p}}}_{1,1})>E_{l}({{\mathbf{p}}}_{1,2})$ , the BS sends a feedback of 0 to the UE through an error-free 1-bit feedback link indicating that the sector $\tilde{\phi}_{i}\in\mathcal{I}_{1,1}$ is selected. Similarly, it sends a 1 indicating that the sector $\tilde{\phi}_{i}\in\mathcal{I}_{1,2}$ is selected. In the second stage, the UE then bisects the selected subsector and transmits pilots using the precoders $\{{{\mathbf{p}}}_{2,1},{{\mathbf{p}}}_{2,2}\}$ (respectively, $\{{{\mathbf{p}}}_{2,3},{{\mathbf{p}}}_{2,4}\}$ ) if the received feedback from the BS is 0 (respectively, 1). More generally, at Stage s, suppose the UE transmits pilots using precoders $\{{{\mathbf{p}}}_{s,m},{{\mathbf{p}}}_{s,m+1}\}$ and $E_{l}({{\mathbf{p}}}_{s,m})>E_{l}({{\mathbf{p}}}_{s,m+1})$ . The BS transmits a 1 to the UE, which then selects the precoders $\{{{\mathbf{p}}}_{s+1,2m-1},{{\mathbf{p}}}_{s+1,2m}\}$ corresponding to a narrower subsector for the next stage. We continue this procedure for $N_{\rm s}$ stages, where the partition selected in the final stage corresponds to the index of the estimated AoD. The same procedure is repeated for all the $L$ paths.

V-C Selection of clipping and quantization voltage levels

In Fig. 3(c), we have seen that the choice of $b$ and $c$ play a crucial role in determining the channel estimation and beamforming performance of MIMO systems with 1-bit spatial $\Sigma\Delta$ ADCs. As discussed in Section II, we choose the clipping level $c$ based on the standard deviation of input that allows us to place a bound on the clipping error using the Chebyshev inequality ${\rm Pr}\left(\|{\mathcal{L}}_{c}[{{\mathbf{x}}}(t)]-{{\mathbf{x}}}(t)\|_{2}>\delta\right)\leq\epsilon$ for constants $\delta,\epsilon>0$ . Therefore, the choice of $c$ depends on the statistics of the unquantized signal ${{\mathbf{x}}}(t)$ received at the BS, leading to different choices of voltage levels for Step 1 and Step 2 of the proposed channel estimation algorithm.

V-C1 For estimating AoAs and path gains in Step 1

From (16) and (10), the unquantized signal received at the $i$ th antenna of the BS in Step 1 of the proposed channel estimation technique is

[{{\mathbf{x}}}_{1}(t)]_{i}=\sqrt{\frac{P}{L}}\sum_{k=1}^{L}\alpha_{k}e^{-\jmath 2\pi d(i-1){\rm sin}(\theta_{k})}+[{{\mathbf{z}}}_{1}(t)]_{i},

where $\alpha_{k}$ and $[{{\mathbf{z}}}_{1}(t)]_{i}$ follow a complex Gaussian distribution with zero mean and unit variance. Since the complex path gains and the additive noise are mutually independent, $[{{\mathbf{x}}}_{1}(t)]_{i}$ follows a complex Gaussian distribution with zero mean and variance $P+1$ . In other words, $\Re([{{\mathbf{x}}}_{1}(t)]_{i})\sim{\mathcal{N}}(0,\frac{P+1}{2})$ and $\Im([{{\mathbf{x}}}_{1}(t)]_{i})\sim{\mathcal{N}}(0,\frac{P+1}{2})$ . Let us recall that the probability that a Gaussian random variable takes values away from the mean by more than thrice the standard deviation is less than $1\%$ . Hence, to ensure

	$\displaystyle{\rm Pr}(\|\Re([{{\mathbf{x}}}_{1}(t)]_{i})\|>c)$
	$\displaystyle\quad={\rm Pr}(\|\Re([{{\mathbf{x}}}_{1}(t)]_{i})-{\mathcal{L}}_{c}[\Re([{{\mathbf{x}}}_{1}(t)]_{i}]\|>0)\leq 0.01,$

in Step 1, we choose the clipping voltage level

c=3\sqrt{\frac{P+1}{2}}.

(27)

The clipping voltage level for the imaginary part is computed similarly. The quantization voltage level $b$ is then selected using the overload condition (4).

V-C2 For estimating AoDs in Step 2

In the second step, from (21) and (10), the unquantized signal received at the $i$ th antenna of the BS related to the uplink pilot transmission with the precoder ${{\mathbf{p}}}$ is

[{{\mathbf{x}}}_{2}(t)]_{i}=\sqrt{\frac{P}{L}}\sum_{k=1}^{L}\alpha_{k}\rho_{k}({{\mathbf{p}}})e^{-\jmath 2\pi d(i-1){\rm sin}(\theta_{k})}+[{{\mathbf{z}}}_{2}(t)]_{i},

where $[{{\mathbf{x}}}_{2}(t)]_{i}$ follows a complex Gaussian distribution with zero mean and variance $\frac{P}{L}\sum_{k=1}^{L}|\rho_{k}({{\mathbf{p}}})|^{2}+1$ . Since $\rho_{k}({{\mathbf{p}}})$ is bounded from above by $\sqrt{N_{\rm t}}$ , the worst-case variance of $[{{\mathbf{x}}}_{2}(t)]_{i}$ is $PN_{\rm t}+1$ . Therefore, we choose the clipping level as

c=3\sqrt{\frac{PN_{\rm t}+1}{2}},

(28)

in Step 2, to ensure that the worst-case clipping probability of $[{{\mathbf{x}}}_{2}(t)]_{i}$ , $i=1,\ldots,N_{\rm r}$ is less than 1 $\%$ .

Since the clipping voltage level that we select in Step 1 is different from Step 2, a joint estimator (e.g., sparse recovery based method as in [3]) to jointly estimate the channel parameters is not straightforward when dealing with observations from 1-bit spatial $\Sigma\Delta$ ADCs.

V-D Computations and number of pilot transmissions

In Step 1, we transmit $T_{1}$ pilots. In Step 2, we transmit each beam $T_{2}$ times with 2 beams in each stage. Since there are $N_{\rm s}=\log_{2}D$ stages, and $L$ paths, the total pilot transmission overhead in the second stage is $2LT_{2}N_{\rm s}$ .

For a search grid of size $A$ , computing the Bartlett beamforming spectrum in (17) costs about order $AN_{\rm r}T_{1}$ flops. The least squares estimator to compute the path gains incurs about order $L^{2}N_{\rm r}T_{1}$ flops. In Step 2, for each beam and path, we compute the received power, which costs about order $LT_{2}N_{\rm s}N_{\rm r}$ flops. In contrast, a scheme that exhaustively searches over all possible AoA and AoD combinations with $DT_{2}$ pilot transmissions incurs a computational complexity of about order $ADN_{\rm r}T_{2}$ flops, which is higher than $N_{\rm r}(AT_{1}+LT_{2}N_{\rm s})$ flops incurred by the proposed method as typically $D\gg LN_{\rm s}$ . The computational complexity of the proposed channel estimation algorithm scales linearly with the number of receive antennas and log-linearly with the search grid size, thus making it well-suited for massive MIMO systems.

VI MU-MIMO channel estimation

In this section, we specialize the proposed channel estimation algorithm to the MU-MIMO setting with the angular channel model described in Section IV-B. We estimate the channel by estimating the AoAs and the path gains using Step 1 of the algorithm developed in the previous section. Since we assume that the UEs have a single antenna in the MU-MIMO setup, there is no AoD estimation step.

Let ${\mathbf{S}}(t)\in{\mathbb{C}}^{K\times K}$ denote the orthogonal pilot matrix transmitted at time instance $t$ from the $K$ UEs such that ${\mathbf{S}}(t){\mathbf{S}}^{\raisebox{1.0pt}{$\rm\scriptscriptstyle H$}}(t)={\mathbf{S}}^{\raisebox{1.0pt}{$\rm\scriptscriptstyle H$}}(t){\mathbf{S}}(t)={\mathbf{I}}$ . From (11) and (10), the received signal at the BS with 1-bit spatial $\Sigma\Delta$ ADC is

{\mathbf{Y}}(t)=\sqrt{P}{\mathbf{H}}{\mathbf{S}}(t)+{\mathbf{N}}(t),\,t=1,2,\cdots,T,

where $T$ is the total number of channel uses, ${\mathbf{N}}(t)$ is the sum of additive white Gaussian noise and quantization noise, as defined before. Using the known pilots, we preprocess the received signal to separate the signal components from the UEs to obtain ${\mathbf{Y}}{\mathbf{S}}^{\raisebox{1.0pt}{$\rm\scriptscriptstyle H$}}(t)=\sqrt{P}{\mathbf{H}}+{\mathbf{N}}{\mathbf{S}}^{\raisebox{1.0pt}{$\rm\scriptscriptstyle H$}}(t),$ where the $k$ th column of ${\mathbf{Y}}{\mathbf{S}}^{\raisebox{1.0pt}{$\rm\scriptscriptstyle H$}}(t)$ , denoted by ${{{\mathbf{y}}}}_{k,t}\in{\mathbb{C}}^{N_{\rm r}}$ , is related to the SIMO channel between the $k$ th UE and the BS at time instance $t$ , and is given by [cf. (13)]

	$\displaystyle{{{\mathbf{y}}}}_{k,t}$	$\displaystyle=\sqrt{\frac{P}{L_{k}}}{\mathbf{A}}_{\rm BS}(\boldsymbol{\theta}_{k})\boldsymbol{\alpha}_{k}+{{{\mathbf{n}}}}_{k,t}$
		$\displaystyle=\sqrt{\frac{P}{L_{k}}}{\mathbf{A}}_{\rm BS}(\boldsymbol{\theta}_{k}){\rm diag}(\boldsymbol{\alpha}_{k}){\bf 1}+{{{\mathbf{n}}}}_{k,t}.$

Here, ${{{\mathbf{n}}}}_{k,t}$ denotes the $k$ th column of ${\mathbf{N}}{\mathbf{S}}^{\raisebox{1.0pt}{$\rm\scriptscriptstyle H$}}(t)$ . Suppose ${\mathbf{Y}}_{k}=[{{{\mathbf{y}}}}_{k,1},\cdots,{{{\mathbf{y}}}}_{k,T}]$ and ${\mathbf{N}}_{k}=[{{{\mathbf{n}}}}_{k,1},\cdots,{{{\mathbf{n}}}}_{k,T}]$ . Then we have ${\mathbf{Y}}_{k}=\sqrt{\frac{P}{L_{k}}}{\mathbf{A}}_{\rm BS}(\boldsymbol{\theta}_{k}){\rm diag}(\boldsymbol{\alpha}_{k}){{\mathbf{E}}}+{{\mathbf{N}}}_{k},$ which readily resembles the signal model in (16). Therefore, we can use Step 1 of the SU-MIMO channel estimation algorithm developed in the previous section to estimate the channel parameters $(\boldsymbol{\theta}_{k},\boldsymbol{\alpha}_{k})$ for each user. Also, we choose the same voltage level $c=3\sqrt{\frac{P+1}{2}}$ as the one used in Step 1 of SU-MIMO channel estimation.

VII Numerical simulations

In this section, we present results from a number of numerical simulations to demonstrate the efficacy of the developed quantization noise model, voltage level selection, and channel estimation algorithms in mmWave MIMO systems with 1-bit spatial $\Sigma\Delta$ ADCs. We compare different algorithms in terms of normalized mean square error (NMSE) and angle estimation error. We define NMSE of a path gain estimate $\hat{\boldsymbol{\alpha}}$ and a channel estimate $\hat{{\mathbf{H}}}$ as

{\rm NMSE}(\hat{\boldsymbol{\alpha}})=\frac{{\mathbb{E}}\left[\|\hat{\boldsymbol{\alpha}}-\boldsymbol{\alpha}\|_{2}^{2}\right]}{{\mathbb{E}}\left[\|\boldsymbol{\alpha}\|_{2}^{2}\right]}

and

{\rm NMSE}(\hat{{\mathbf{H}}})=\frac{{\mathbb{E}}\left[\|\hat{{\mathbf{H}}}-{\mathbf{H}}\|_{F}^{2}\right]}{{\mathbb{E}}\left[\|{\mathbf{H}}\|_{F}^{2}\right]},

respectively. Let ${\mathcal{A}}=\{\tilde{\theta}_{1},\ldots,\tilde{\theta}_{A}\}$ denote the AoA search grid of size $A$ used in (17). Let us denote the index set of AoAs corresponding to the angles in ${\mathcal{A}}$ as

\mathbb{A}({\boldsymbol{\theta}})=\{j\,:\,[{\boldsymbol{\theta}}]_{l}=\tilde{\theta}_{j},1\leq l\leq L,1\leq j\leq A\}.

Let us also denote the index set of AoDs corresponding to the angles in AoD grid ${\mathcal{D}}=\{\tilde{\phi}_{1},\tilde{\phi}_{2},\cdots,\tilde{\phi}_{D}\}$ of size $D$ as

\mathbb{D}({\boldsymbol{\phi}})=\{j\,:\,[{\boldsymbol{\phi}}]_{l}=\tilde{\phi}_{j},1\leq l\leq L,1\leq j\leq D\}.

We then define the AoA and AoD estimation errors as

E_{\theta}={\rm Pr}\left(\mathbb{A}(\hat{\boldsymbol{\theta}})\neq\mathbb{A}({\boldsymbol{\theta}})\right)\quad\text{and}\quad E_{\phi}={\rm Pr}\left(\mathbb{D}(\hat{\boldsymbol{\phi}})\neq\mathbb{D}({\boldsymbol{\phi}})\right),

where $\hat{\boldsymbol{\theta}}$ and $\hat{\boldsymbol{\phi}}$ are the estimated AoAs and AoDs, respectively.

VII-A The SU-MIMO setting

We consider a SU-MIMO setup with the UE having $N_{\rm t}\leavevmode\nobreak\ =\leavevmode\nobreak\ 32$ antennas, the BS having $N_{\rm r}=128$ antennas that are spaced $d=1/8$ wavelengths apart. For Bartlett beamforming, we use the search grid ${{\mathcal{A}}}=\{-90^{\circ},-89^{\circ},\cdots,89^{\circ},90^{\circ}\}$ with $A=181$ points. The AoD grid ${\mathcal{D}}$ is obtained by uniformly sampling the direction cosine space in the interval $-1$ to $1$ with $D=128$ points as described in Section V-B1.

VII-A1 Angle steering and voltage level selection

We first begin with a discussion on the estimation performance of the channel parameters $(\boldsymbol{\theta},\boldsymbol{\alpha})$ in Step 1 and $\boldsymbol{\phi}$ in Step 2 of the proposed channel estimator for a single path channel with $L=1$ . This allows us to discuss the impact of voltage level selection and angle steering on the estimators. The AoA is drawn uniformly at random from the sector $[-x^{\circ},x^{\circ}]$ for $x=10,30,60$ . The AoD is drawn randomly from the sector $[-75^{\circ},75^{\circ}]$ . The path gain is assumed to be unit modulus. We use $T_{1}=10$ and $T_{2}=1$ . Unless otherwise mentioned, we use $10^{6}$ independent channel realizations to compute NMSE and angle errors. In this subsection, we compare the estimation performance of the proposed technique with 1-bit spatial $\Sigma\Delta$ ADC, referred to as “ $\Sigma\Delta$ ”, with an equivalent channel estimation method applied on unquantized data (wherein the BS is assumed to have a very high-resolution ADC). We label it as “Unquantized” in the plots. Since estimates from “Unquantized” will be better than the same scheme applied on quantized data for a comparable number of snapshots, we use this as a baseline to illustrate the loss due to 1-bit $\Sigma\Delta$ ADC.

We illustrate the channel estimation performance for different SNRs in terms of $E_{\theta}$ in Fig. 4(a), ${\rm NMSE}(\hat{\boldsymbol{\alpha}})$ in Fig. 4(b) and $E_{\phi}$ in Fig. 4(c). Here, “Unquantized, $\theta\in[-10^{\circ},10^{\circ}]$ ” and “ $\Sigma\Delta,\theta\in[-10^{\circ},10^{\circ}]$ ” indicate that the AoA is drawn uniformly at random from the sector $[-10^{\circ},10^{\circ}]$ in each channel realization. We can see that “ $\Sigma\Delta$ ” performs similar to that of “Unquantized” for the path with AoA arriving close to the array broadside, whereas the gap between “Unquantized” and “ $\Sigma\Delta$ ” increases for the path arriving with angles away from the array broadside. This is mainly due to the quantization noise shaping at higher spatial frequencies. Performance degradation at higher SNRs is inevitable due to the larger choice of $b$ at high SNRs, which results in higher quantization noise [cf. (19)].

Next, in Fig. 5(a), Fig. 5(b), and Fig. 5(c), respectively, we illustrate the impact of clipping voltage level selection on ${\mathcal{J}}(\theta)$ , $E_{\theta}$ , and $E_{\phi}$ by comparing the proposed clipping voltage level from Section V-C1 with a fixed, SNR independent arbitrary clipping voltage level $c=1$ . While we observe that the Bartlett beampatterns as well as the AoA estimation errors are not sensitive to the choice of clipping voltage levels, we, however, can observe from Fig. 5(c) that not selecting a correct clipping voltage level leads to severe performance degradation in path gain estimation. In other words, Fig. 5(c) demonstrates that a blind application of least squares without an appropriate selection of the clipping voltage level does not result in satisfactory performance.

Recall in Fig. 3(c), we discussed the impact of clipping voltage level selection and angle steering on the beampatterns in Step 2 as observed at the 1-bit spatial $\Sigma\Delta$ ADC. We now extend that discussion in Fig. 6(a), where we focus on the second stage of the proposed codebook as observed at the receiver of an unquantized system and compare it to the beampattern as seen at the receiver with a 1-bit spatial $\Sigma\Delta$ ADC. We consider a scenario where the path arrives at the BS with an AoA of $\theta=30^{0}$ and an SNR of $10$ dB. We observe that the beampatterns are severely distorted whenever the steering angle, voltage level, or both are incorrectly selected. Nevertheless, with a careful selection of voltage levels and steering angle, the beampatterns at the receiver with a 1-bit spatial $\Sigma\Delta$ ADC are comparable to that of the unquantized system.

In Fig. 6(b) and Fig. 6(c), we demonstrate, respectively, the impact of angle steering and clipping voltage level selection on AoD estimation, where the AoA is drawn uniformly at random from the sector $[-30^{\circ},30^{\circ}]$ in each channel realization to compute $E_{\phi}$ . We can observe that failing to exploit angle steering leads to serious loss in performance. While clipping voltage selection was not crucial for AoA estimation, we observe that fixing the voltage level to an arbitrary value, such as $c=1$ , or varying it in an inappropriate manner, e.g., using the clipping voltage level designed for Step 1 in Step 2 (indicated as “ $\Sigma\Delta$ , $c$ from Step1”), do not provide satisfactory AoD estimation performance. In other words, a direct application of existing hierarchical codebook-based channel estimation methods, e.g., [3] without a careful selection of clipping voltage levels and phase shifts in the feedback loop will not provide reasonable performance for MIMO systems with 1-bit spatial $\Sigma\Delta$ ADCs.

VII-A2 Multipath channel

We now consider a multipath SU-MIMO channel with $L=\{2,3\}$ , which are typical at mmWave frequencies. We assume that the complex path gains $\alpha_{i}$ follow a truncated Gaussian distribution with $|\Re(\alpha_{i})|,|\Im(\alpha_{i})|\geq\tau$ for $i=1,\ldots,L$ . To ensure that all the $L$ paths are sufficiently strong, we choose $\tau=0.5$ . The AoDs are drawn uniformly at random from the sector $[-75^{\circ},75^{\circ}]$ with a minimum spacing of $0.1$ in the direction cosine space and the AoAs are drawn uniformly at random from the sector $[-x^{\circ},x^{\circ}]$ with a minimum spacing of $20^{\circ}$ , where $x^{\circ}=\{45^{\circ},60^{\circ}\}$ . We choose $T_{1}=10$ and $T_{2}=1$ . Recall that the proposed method with 1-bit spatial $\Sigma\Delta$ ADC, referred to as “ $\Sigma\Delta$ ”, requires reception of $T_{1}$ pilots in Step 1 and $2LT_{2}N_{s}$ pilots in Step 2, where the reception at each step is with a different clipping voltage level. Hence, designing a scheme that utilizes all the available $T_{1}+2LT_{2}N_{s}$ measurements for channel estimation with 1-bit spatial $\Sigma\Delta$ ADCs is not straightforward.

We compare the performance of the proposed method with amplitude retrieval based one-bit SU-MIMO channel estimation algorithm [19], referred to as “AR” in the plots. In “AR”, after the amplitudes are recovered from one-bit measurements, standard channel estimation algorithm can be used. Since “AR” does not involve any clipping voltage selection, here, we use all the $T_{1}+2LT_{2}N_{s}$ measurements to perform amplitude recovery as in [19], AoA and path gain estimation using Step 1 and AoD estimation using Step 2 as described in Section V. In addition to “Unquantized”, which serves as a benchmark for “ $\Sigma\Delta$ ” as it uses $T_{1}$ pilots in Step 1 and $2LT_{2}N_{s}$ pilots in Step 2, we also report the performance of an unquantized system, labelled as “Unquantized full data”, that uses all the $T_{1}+2LT_{2}N_{s}$ pilots in both Step 1 and Step 2. Thus, the total number of pilot transmissions in all the methods that we compare with are the same. We have observed the runtime of the “AR” algorithm is significantly higher than the proposed method. Due to this reason, we use $500$ independent Monte-Carlo (MC) experiments to compute the NMSE of “AR”, whereas $40000$ MC experiments are used to obtain plots for “Unquantized,” “Unquantized full data,” and “ $\Sigma\Delta$ ” methods. NMSE of “AR” is, in general, much higher than that of the other methods, which suggests that a fewer number of MC runs is sufficient to obtain curves of comparable precision.

In Fig. 7, we show the channel estimation NMSE, where $\theta\in[-45^{\circ},45^{\circ}]$ or $\theta\in[-60^{\circ},60^{\circ}]$ indicate the sector from which the AoAs are drawn in each channel realization. We can observe that the performance of “ $\Sigma\Delta$ ” is comparable to that of “Unquantized” and is better than that of “AR” except at very low SNRs. At extremely low SNRs, we observe that the performance of “AR” is slightly better than that of “Unquantized” and “ $\Sigma\Delta$ ” as the effective number of snapshots available to estimate the channel parameters is larger for “AR”. At high SNRs, on the other hand, higher quantization noise in 1-bit systems leads to a deteriorated performance of “AR”. Similarly, at high SNRs and for angles away from broadside, there is an inevitable gap between the NMSE of “ $\Sigma\Delta$ ” and “Unquantized” due to the increase in the quantization noise. Furthermore, the NMSE of channel estimation for $L=3$ is slightly larger than that of $L=2$ due to the larger number of parameters to be estimated in the latter case. As expected, the benchmark scheme “Unquantized full data” has the lowest NMSE due to the absence of quantization and efficient use of all available snapshots to carry out channel estimation. In essence, the proposed method achieves performance comparable to that of “Unquantized” and significantly better than that of “AR” for most of the SNRs, making it an attractive choice for massive MIMO systems.

VII-B The MU-MIMO setting

For the MU-MIMO setting, we consider $K=8$ users and a BS having $N_{\rm r}\in\{128,256\}$ antennas that are $d=1/8$ wavelengths apart. We use $L_{k}\in\{1,3\}$ for $k=1,\ldots,8$ , $T=1$ , and use $500$ independent channel realizations for computing NMSE. For each path, the AoAs are drawn uniformly at random from the sector $[-45^{\circ},45^{\circ}]$ with a minimum spacing of $20^{\circ}$ . We compare the channel estimation performance of the proposed method, referred to as “ $\Sigma\Delta$ proposed” with the following state-of-the-art techniques in MU-MIMO channel estimation: (A) Bussgang decomposition followed by computing LMMSE MU-MIMO channel estimation for 1-bit MIMO systems [6], referred to as “BLMMSE”, (B) MU-MIMO channel estimation with 1-bit spatial $\Sigma\Delta$ ADC based on the Bussgang decomposition [14], labelled as “ $\Sigma\Delta$ Bussgang”, (C) an LMMSE MU-MIMO channel estimator that uses unquantized data, referred to as “Unquantized LMMSE”, and (D) the proposed channel estimation algorithm from Section IV-B applied to unquantized data, referred to as “Unquantized angular”. “Unquantized angular” serves as the benchmark scheme for “ $\Sigma\Delta$ proposed” and illustrates the loss due 1-bit spatial $\Sigma\Delta$ quantization. We reemphasize that “Unquantized LMMSE”, “BLMMSE”, and “ $\Sigma\Delta$ Bussgang” require the channel correlation information to perform the Bussgang decomposition and for subsequent channel estimation, and that this amounts to knowing the angles that parameterize the angular channel model. However, for the sake of comparison, we generate an approximate channel correlation by assuming that each UE has $91$ paths, each separated by 1 degree in the range of $[-45^{\circ},45^{\circ}]$ with ${\boldsymbol{\alpha}_{k}}\sim{\mathcal{C}}{\mathcal{N}}(0,{\mathbf{I}})$ [cf. (13)].

The NMSE performance of different methods for $(L_{k},N_{\rm r})=(1,128)$ , $(L_{k},N_{\rm r})=(3,128)$ and $(L_{k},N_{\rm r})=(3,256)$ are presented in Fig. 8(a), Fig. 8(b), and Fig. 8(c), respectively. It can be observed that the proposed scheme performs better in terms of NMSE than existing methods, namely, “BLMMSE” and “ $\Sigma\Delta$ Bussgang”. At low-to-moderate SNRs, the performance of “ $\Sigma\Delta$ Proposed” is comparable to that of “Unquantized angular”. This corroborates the developed theory that 1-bit spatial $\Sigma\Delta$ ADCs have higher effective resolution, which can be leveraged for parametric estimation. At higher SNRs, the NMSE performance reduces for both “ $\Sigma\Delta$ Bussgang” and “ $\Sigma\Delta$ Proposed” due to the inevitable increase in quantization noise that is bound to occur at high SNRs. Nonetheless, even at high SNRs, the proposed method outperforms “ $\Sigma\Delta$ Bussgang” by a margin of about $8\leavevmode\nobreak\ {\rm dB}$ (respectively, $12\leavevmode\nobreak\ {\rm dB}$ ) for $N_{\rm r}=128$ (respectively, $N_{\rm r}=256$ ).

As expected, the performance of the parametric channel estimation techniques (i.e., “Unquantized angular” and “ $\Sigma\Delta$ Proposed”) is better than their non-parametric counterparts (i.e., “Unquantized LMMSE” and “ $\Sigma\Delta$ Bussgang”) as the parametric techniques exploit the structure in the angular channel model. As expected, the channel estimation performance of “ $\Sigma\Delta$ Proposed” and “Unquantized angular” is better in Fig. 8(a) with $L_{k}=1$ as compared to the multipath scenario in Fig. 8(b). Furthermore, in Fig. 8(a), we can see that “ $\Sigma\Delta$ Proposed” is about $8-12\leavevmode\nobreak\ {\rm dB}$ better when compared to the state-of-the-art method “ $\Sigma\Delta$ Bussgang”. Also, when $N_{\rm r}$ is doubled from $128$ in Fig. 8(b) to $256$ in Fig. 8(c), due to the improved resolution and quantization noise shaping of the larger antenna array in the latter, we see that the proposed technique significantly outperforms existing techniques.

In Fig. 8(d), we compare the performance of the proposed channel estimator when the true channel correlation is used to compute the Bussgang decomposition. In this setting, though not realizable in practice, “Unquantized LMMSE” is the optimal channel estimator. We can see that “Unquantized angular”, which is the proposed technique that works with unquantized data performs similar to that of the optimal estimator at low-to-moderate SNRs. Also, we can see that the performance of our method is comparable (in terms of channel estimation NMSE) to “ $\Sigma\Delta$ Bussgang” at moderate-to-high SNRs.

VIII Conclusions

In this paper, we have presented an algorithm for channel estimation with angular models in massive MIMO systems employing 1-bit spatial $\Sigma\Delta$ ADC. We have developed a quantization noise model for 1-bit spatial $\Sigma\Delta$ ADCs that is useful, in general, for large array processing applications. Although computing the complete quantization noise probability density function is difficult, the developed noise model allows us to compute its second-order statistics that can be used to prewhiten data when solving parametric estimation problems. When the quantization voltage levels and phase shifts in the feedback loop are carefully selected, the effective resolution of 1-bit spatial $\Sigma\Delta$ ADCs can be improved and hence are comparable to unquantized systems in most operating regimes of interest. We have developed a two-step channel estimation procedure to estimate the AoAs, AoDs, and path gains that characterize the MIMO channel. The proposed algorithm allows us to select the phase shifts and quantization voltage levels, which depend on the unknown channel parameters. Through numerical simulations, we have demonstrated that with the proposed channel estimation algorithm, MIMO systems with 1-bit spatial $\Sigma\Delta$ ADCs perform significantly better than MIMO systems with regular 1-bit quantization and are often on par with that of unquantized MIMO systems for low-to-moderate SNRs. [Proof of Lemma 1] The expressions in (6) are obtained by adapting the derivation in [22] to 1-bit spatial $\Sigma\Delta$ ADCs and are derived here for self containment.

From (1) and (2) with $\psi=0$ , we have the recursion

y_{n}(t)=x_{n}(t)+e_{n}(t)-e_{n-1}(t).

(29)

Let us define the normalized quantization error as

\varepsilon_{n}(t)=0.5b^{-1}e_{n}(t)+\mu,

(30)

where $\mu=0.5+\jmath 0.5$ and $0\leq\Re(\varepsilon_{n}(t)),\Im(\varepsilon_{n}(t))\leq 1$ as the quantization error $e_{n}(t)$ is bounded when the voltage levels are chosen as in (4).

Using (1), we can express $\varepsilon_{n}(t)$ and $r_{n}(t)$ in terms of the input and output of the quantizer as

\displaystyle\Re(\varepsilon_{n}(t))=\frac{1}{2b}\left(\Re({\mathcal{Q}}_{b}[r_{n}(t)])-\Re(r_{n}(t))\right)+0.5

(31)

with

\Re(r_{n}(t))=\Re(x_{n}(t))-2b\Re(\varepsilon_{n-1}(t))+b.

When $\Re(r_{n}(t))>0$ , or equivalently, when $\Re(\varepsilon_{n-1}(t))-0.5b^{-1}\Re(x_{n}(t))<0.5$ , we have ${\mathcal{Q}}_{b}[r_{n}(t)]=b$ . This allows us to constrain (31) as

0\leq\Re(\varepsilon_{n}(t))=\Re(\varepsilon_{n-1}(t))-0.5b^{-1}\Re(x_{n}(t))+0.5<1.

Thus, $\Re(\varepsilon_{n}(t))=\langle\Re(\varepsilon_{n-1}(t))-0.5b^{-1}\Re(x_{n}(t))+0.5\rangle$ as the fractional part function $\langle x\rangle=x$ for $0\leq x<1$ . Similarly, when $\Re(r_{n}(t))<0$ , or equivalently, when $\Re(\varepsilon_{n-1}(t))-0.5b^{-1}\Re(x_{n}(t))>0.5$ , we have ${\mathcal{Q}}_{b}[r_{n}(t)]=-b$ ,

\Re(\varepsilon_{n}(t))=\Re(\varepsilon_{n-1}(t))-0.5b^{-1}\Re(x_{n}(t))-0.5,

and

1\leq\Re(\varepsilon_{n-1}(t))-0.5b^{-1}\Re(x_{n}(t))+0.5<2,

(32)

where the lower and upper bounds are due to the fact that $0\leq\Re(\varepsilon_{n-1}(t))\leq 1$ and due to clipping $-b\leq\Re(x_{n}(t))\leq b$ . For $x\in[1,2)$ , we have $\langle x\rangle=x-1.$ Therefore, when $\Re(r_{n}(t))<0$ , from (32), we again have

\Re(\varepsilon_{n}(t))=\left<\Re(\varepsilon_{n-1}(t))-0.5b^{-1}\Re(x_{n}(t))+0.5\right>.

(33)

Recursively substituting for $\Re(\varepsilon_{i-1}(t))$ , $\Re(\varepsilon_{i-2}(t))$ , and so on in (33), and using the fact that $\langle\langle x\rangle+y\rangle=\langle x+y\rangle,\leavevmode\nobreak\ \forall x,y\in{\mathbb{R}}$ , we obtain

\displaystyle\Re(\varepsilon_{n}(t))

\displaystyle=\left\langle 0.5(n+1)-0.5b^{-1}\sum_{k=1}^{n}\Re(x_{k}(t))\right\rangle.

Using the above expression in (30) yields

\Re(e_{n}(t))=2b\left\langle 0.5(n+1)-0.5b^{-1}\sum_{k=1}^{n}\Re(x_{n}(t))\right\rangle-b.

Using the fact that $\langle-x+(n-1)\rangle=-\langle x\rangle+1$ , $\forall n\in\mathbb{Z},x\in\mathbb{R}$ , the above expression can be equivalently expressed as (6a). The expression for the imaginary part in (6b) can be derived along the same lines.

References

[1] R. S. P. Sankar and S. P. Chepuri, “Millimeter wave MIMO channel estimation with 1-bit spatial sigma-delta analog-to-digital converters,” in Proc. of the IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Toronto, Canada, Jun. 2021.
[2] T. S. Rappaport, S. Sun, R. Mayzus, H. Zhao, Y. Azar, K. Wang, G. N. Wong, J. K. Schulz, M. Samimi, and F. Gutierrez, “Millimeter wave mobile communications for 5G cellular: It will work!” IEEE Access, vol. 1, pp. 335–349, May 2013.
[3] A. Alkhateeb, O. E. Ayach, G. Leus, and R. W. Heath, “Channel estimation and hybrid precoding for millimeter wave cellular systems,” IEEE J. Sel. Topics Signal Process., vol. 8, no. 5, pp. 831–846, Oct. 2014.
[4] A. Ghosh, T. A. Thomas, M. C. Cudak, R. Ratasuk, P. Moorut, F. W. Vook, T. S. Rappaport, G. R. MacCartney, S. Sun, and S. Nie, “Millimeter-wave enhanced local area systems: A high-data-rate approach for future wireless networks,” IEEE J. Sel. Areas Commun., vol. 32, no. 6, pp. 1152–1163, June 2014.
[5] J. Mo and R. W. Heath, “Capacity analysis of one-bit quantized MIMO systems with transmitter channel state information,” IEEE Trans. Signal Process., vol. 63, no. 20, pp. 5498–5512, Jul. 2015.
[6] Y. Li, C. Tao, G. Seco-Granados, A. Mezghani, A. L. Swindlehurst, and L. Liu, “Channel estimation and performance analysis of one-bit massive MIMO systems,” IEEE Trans. Signal Process., vol. 65, no. 15, pp. 4075–4089, Aug. 2017.
[7] K. Roth, H. Pirzadeh, A. L. Swindlehurst, and J. A. Nossek, “A comparison of hybrid beamforming and digital beamforming with low-resolution ADCs for multiple users and imperfect CSI,” IEEE J. Sel. Topics Signal Process., vol. 12, no. 3, pp. 484–498, Mar 2018.
[8] P. M. Aziz, H. V. Sorensen, and J. Van Der Spiegel, “An overview of sigma-delta converters: How a 1-bit ADC achieves more than 16-bit resolution,” IEEE Signal Process. Mag., vol. 13, no. 1, pp. 61–84, Jan. 1996.
[9] R. M. Corey and A. C. Singer, “Spatial sigma-delta signal acquisition for wideband beamforming arrays,” in Proc. Int. ITG Workshop Smart Antennas, Munich, Germany, Mar. 2016.
[10] D. Barać and E. Lindqvist, “Spatial sigma-delta modulation in a massive MIMO cellular system,” Master’s thesis, Chalmers University of Technology, Sweden, June 2016.
[11] V. Venkateswaran and A.-J. van der Veen, “Multichannel $\Sigma\Delta$ ADCs with integrated feedback beamformers to cancel interfering communication signals,” IEEE Trans. Signal Process., vol. 59, no. 5, pp. 2211–2222, May 2011.
[12] M. Shao, W. Ma, Q. Li, and A. L. Swindlehurst, “One-bit sigma-delta MIMO precoding,” IEEE J. Sel. Topics Signal Process., vol. 13, no. 5, pp. 1046–1061, Sept. 2019.
[13] S. Rao, A. L. Swindlehurst, and H. Pirzadeh, “Massive MIMO channel estimation with 1-bit spatial sigma-delta ADCs,” in Proc. of the IEEE Int. Conf. on Acoustics, Speech and Signal Process. (ICASSP), Brighton, UK, May 2019.
[14] S. Rao, G. Seco-Granados, H. Pirzadeh, J. A. Nossek, and A. L. Swindlehurst, “Massive MIMO channel estimation with low-resolution spatial sigma-delta ADCs,” IEEE Access, vol. 9, pp. 109 320–109 334, Jul. 2021.
[15] H. Pirzadeh, G. Seco-Granados, S. Rao, and A. L. Swindlehurst, “Spectral efficiency of one-bit sigma-delta massive MIMO systems,” IEEE J. Sel. Areas Commun., vol. 38, no. 9, pp. 2215–2226, Sept. 2020.
[16] J. J. Bussgang, “Crosscorrelation functions of amplitude-distorted Gaussian signals,” Technical report, Research Laboratory of Electronics, Massachusetts Institute of Technology, Mar. 1952. [Online]. Available: http://hdl.handle.net/1721.1/4847
[17] Ö. T. Demir and E. Björnson, “The Bussgang decomposition of non-linear systems: Basic theory and MIMO extensions,” arXiv preprint arXiv:2005.01597, May 2020.
[18] S. Jacobsson, G. Durisi, M. Coldrey, U. Gustavsson, and C. Studer, “Throughput analysis of massive MIMO uplink with low-resolution ADCs,” IEEE Trans. Wireless Commun., vol. 16, no. 6, pp. 4038–4051, June 2017.
[19] C. Qian, X. Fu, and N. D. Sidiropoulos, “Amplitude retrieval for channel estimation of MIMO systems with one-bit ADCs,” IEEE Signal Process. Lett., vol. 26, no. 11, pp. 1698–1702, Nov. 2019.
[20] J. Mo, P. Schniter, N. G. Prelcic, and R. W. Heath, “Channel estimation in millimeter wave MIMO systems with one-bit quantization,” in Proc. of the Asilomar Conference on Signals, Systems and Computers, Pacific Grove, USA, Nov. 2014.
[21] Y. Zhang, M. Alrabeiah, and A. Alkhateeb, “Deep learning for massive MIMO with 1-bit ADCs: When more antennas need fewer pilots,” IEEE Wireless Commun. Lett., vol. 9, no. 8, pp. 1273–1277, Apr. 2020.
[22] R. M. Gray, W. Chou, and P. W. Wong, “Quantization noise in single-loop sigma-delta modulation with sinusoidal inputs,” IEEE Trans. Commun., vol. 37, no. 9, pp. 956–968, Sept. 1989.
[23] R. M. Gray, “Oversampled sigma-delta modulation,” IEEE Trans. Commun., vol. 35, no. 5, pp. 481–489, May 1987.
[24] H. L. Van Trees, Optimum Array Processing: Part IV of Detection, Estimation, and Modulation Theory. USA: John Wiley & Sons, Ltd, 2002.

Channel Estimation in MIMO Systems with One-bit Spatial Sigma-delta ADCs