Vandermonde Constrained Tensor Decomposition for Hybrid Beamforming in Multi-Carrier MIMO Systems
Abstract
Hybrid beamforming has evolved as a promising technology that offers the balance between system performance and design complexity in mmWave MIMO systems. Existing hybrid beamforming methods either impose unit-modulus constraints or a codebook constraint on the analog precoders/combiners, which in turn results in a performance-overhead tradeoff. This paper puts forth a tensor framework to handle the wideband hybrid beamforming problem, with Vandermonde constraints on the analog precoders/combiners. The proposed method strikes the balance between performance, overhead and complexity. Numerical results on a 3GPP link-level test bench reveal the efficacy of the proposed approach relative to the codebook-based method while attaining the same feedback overhead. Moreover, the proposed method is shown to achieve comparable performance to the unit-modulus approaches, with substantial reductions in overhead.
I introduction
Millimeter wave (mmWave) has emerged as a powerful technology, that can handle the unprecedented demands on wireless connectivity, through offering large available bandwidth [1]. However, the high propagation loss inherent to mmWave bands, if not mitigated, can severely impact the system performance. Large antenna arrays which achieve high beamforming gains are used to compensate the propagation loss [2].
Large scale antenna systems implementation, on the other hand, incurs several practical challenges including the high energy consumption and cost of radio frequency (RF) chains, as each antenna element requires a dedicated RF chain. Such hurdles limit the possibility of employing a fully digital beamforming design. As an efficient surrogate, hybrid (analog/digital) beamforming has been introduced in [3, 4] as means of attaining favorable complexity-performance tradeoff in mmWave multicarrier massive MIMO systems. Hybrid beamforming relies on using a small number of RF chains to design high-dimensional analog precoders (implemented with only phase shifters) together with a low-dimensional (digital) baseband precoder. The combination of analog and digital precoders has the potential to approach the performance of a purely digital solution while providing substantial savings in energy consumption and design complexity.
Although maximizing the system spectral efficiency in the case of digital beamforming design admits a simple algebraic solution via singular value decomposition (SVD) [5], hybrid beamforming yields a highly non-convex problem that requires joint optimization of the hybrid precoders and combiners [3]. A more tractable formulation is to transform the hybrid beamforming design to a matrix factorization problem. In particular, the optimal SVD-based digital solution is first derived to maximize the spectral efficiency. Then, the hybrid beamforming is posed towards factorizing the fully digital precoder (combiner) as the hybrid precoding (combining) components. The factorization is usually solved either under unit modulus constraints [3, 6] or with codebook constraints [7] on the analog precoder (combiner), to ensure that the analog precoder can be modeled using phase shifters. While considering the unit modulus constraints, in general, result in a much better solution compared to the codebook constraint [6], the resulting communication overhead of the latter is considerably lower [7], rendering it more appropriate for limited feedback systems [8]. Further, compared to the codebook constraints approach, the feedback overhead for unit magnitude constraints scales linearly with the number of Tx/Rx antennas, thereby precluding its use in massive MIMO systems.
This begs the question whether it is possible to achieve a comparable performance to the unit-modulus based methods while yielding the feedback associated with the codebook-based approaches? This is the central question that this paper seeks to address. We answer the stated question in the affirmative by modeling the wideband hybrid beamforming as a low rank tensor decomposition problem with Vandermonde constraints on the analog precoders/combiners. Invoking the so-called parallel factor (PARAFAC) analysis, to decompose the resulting tensor, we show that PARAFAC yields high-quality hybrid precoders/combiners, with identifiability guarantees on the resulting factors. This paper adds to the broad variety of tensors applications in wireless communications [9, 10, 11]. Different from all prior hybrid beamforming works that adopt the spectral efficiency formula for performance evaluation, this paper evaluates the practical impact of the proposed method by integrating hybrid beamforming to an end-to-end communication scenario with time-varying channels. Numerical results demonstrate that the end-to-end performance of the proposed approach considerably outperforms the codebook based method while achieving comparable performance to the unit modulus based approaches. Further, the proposed method yields significantly lower communication overhead compared to unit modulus approaches.
II System Model
Consider a downlink transmission in a multi-carrier MIMO system comprising a base station (BS) and a single user equipment (UE). The BS is equipped with transmit antennas and transmit radio frequency (RF) chains while the UE is equipped with receive antennas and receive RF chains. The BS aims at communicating data streams to the UE over subcarriers, where and [3]. The BS first employs a digital baseband precoding matrix on the transmitted symbols , , as shown in Fig. 1. Then, the data symbols are transformed to the time domain using N-point inverse fast Fourier transform (IFFT). After a cyclic prefix (CP) is added to the time-domain signal, the BS applies an analog precoder (implemented using analog phase shifters), i.e., . Notice that same is applied across all subcarriers, i.e., is frequency independent. Towards this end, the transmitted complex signal from the BS can be expressed as,
(1) |
It is assumed that i) , and ii) the total power budget constraint is satisfied by enforcing the constraint .
At the receiver, the UE first employs an analog combiner followed by a digital baseband combiner after CP removal and frequency transformation using N-point FFT. Similar to the unit modulus constraint on the entries of , it assumed that the -th entry of has a unit modulus, i.e., . Thus, the -dimensional complex baseband signal at the UE at the -th subcarrier is given by,
(2) |
where represents the downlink channel at the -th subcarrier, and is the additive white Guassian noise vector associated with the -th subcarrier,. It is assumed that the entries of are independent and identically distributed (i.i.d) random variables with zero mean and variance , i.e., . Throughout this work, we assume that the channel matrices across the subcarriers are perfectly known at the UE.
Remark 1.
It is worth pointing out that, in practical wireless systems, there is one representative channel matrix for each group of subcarriers or resource blocks, referred to as subband size, and hence, there is one baseband precoder/combiner for each subband as opposed to each subcarrier. The reason behind that is primarily to reduce the overhead associated with the channel and/or precoding-related feedback. In the hybrid beamforming context, this will reduce the overhead associated with the baseband precoders/combiners, and will also reduce the complexity as obviously smaller number of baseband precoders and combiners need to be computed. This fact will be utilized later in the simulations (Section VI) and also in the overhead computations associated with the proposed approach and the existing hybrid beamforming methods.

III Problem Definition
The wideband hybrid beamforming problem seeks to find the set of hybrid precoders and combiners that can maximize the spectral efficiency. Assuming the transmitted symbols follow a Gaussian distribution, the achievable spectral efficiency associated with the -th subcarrier can be expressed as [5]
(3) |
where represents the covariance matrix of the post-processing noise term in (2). The goal is then to design the hybrid precoders that aim at maximizing the overall spectral efficiency while satisfying the imposed constraints on the analog and digital precoders/combiners. Maximizing the spectral efficiency, though, yields a highly non tractable optimization problem that requires the hybrid precoders and combiners to be jointly optimized.
Instead of maximizing the spectral efficiency, one can decouple the precoders and combiners design, and formulate the hybrid beamforming problem as two separate low-rank matrix factorization problems [3, 6, 12]. The precoder problem aims at factorizing the optimal digital precoder to , where the columns of are the dominant right singular vectors of , . On the other hand, the combiner problem seeks to factorize to , where is the WMMSE solution, i.e., and . Interestingly, it has been shown in [12] that solving the factorization problems implicitly leads to maximizing the system spectral efficiency. Since both problems exhibit similar mathematical formulation, except that the precoder problem has an additional sum power constraint, we will focus on the precoder factorization problem. However, the proposed method may be easily applied to solve the combiner problem. From an optimization perspective, given the fully digital SVD-based precoder , the hybrid beamforming problem can be posed as [3, 6, 12]
(4a) | |||
(4b) | |||
(4c) |
where is the feasible set of the analog precoders. In the wideband hybrid beamforming literature, the feasible set either includes unit modulus constraints on the entries of [6] (denoted as ), or code-book based selection of the columns of [7] (denoted as ). The two feasible sets yield an interesting overhead-performance trade-off. While considering the feasible set results in much lower overhead relative to the set , the solution associated with performs much better than that of . The intuition is that provides a much wider search space compared to , i.e., , and hence, better performance is expected.
In this paper, we will introduce a new feasible set (denoted as ) to the wideband hybrid beamforming problem in (4) by enforcing a Vandermonde structure on the columns of , i.e., , and . Towards this end, the problem that this paper seeks to solve is the following low-rank matrix optimization problem,
(5a) | |||
(5b) |
Notice that the sum power constraint in (4c) is temporarily omitted as it has been shown that such a constraint can be satisfied via a simple normalization step to the resulting baseband precoders [3]. To our best knowledge, the formulation in (5) has not been considered before in the hybrid beamforming literature. Such a formulation strikes the balance between the obtained solution quality and the resulting overhead. In particular, the resulting solution achieves the same overhead associated with the set while achieving comparable performance to the solutions associated with the set. In the subsequent section, we will show that (5) can be reformulated as a tensor factorization problem where efficient tensor decomposition methods can be applied.
IV PARAFAC Decomposition
Before reformulating (5) as a tensor factorization problem and to facilitate our discussion, we briefly review some key concepts that will be used in the proposed tensor approach.
IV-A Tensor Preliminaries
A third order tensor is a three way array whose elements are indexed by three indices . The so-called Parallel Factor decomposition (PARAFAC), a.k.a Canonical Polyadic Decomposition (CPD), is one powerful tensor decomposition method. A tensor admits a PARAFAC decomposition if it can be written as the sum of vector outer products [13],
(6) |
where denotes the vector outer product, and is a positive integer that we refer to as the tensor rank or CPD rank (the smallest value such that (6) holds). The terms , and are the -th columns of the so-called low-rank factors , , and , respectively, of the tensor .
Different from the tensor format in (6), PARAFAC can also be written in slab format. Let represent the -th frontal slab of .111Note that we used the MATLAB notation to read the frontal slab of a three-way tensor. The PARAFAC decomposition of in the slab-format is given by
(7) |
where with the elements on the diagonal be the -th row of . Throughout this paper, we will use the notation to denote (7).
IV-B Identifiability
One distinctive property of tensors is that the PARAFAC model is essentially unique under mild conditions even if is greater than . The definition of essential uniqueness is presented as follows.
Definition 1.
The PARAFAC decomposition of a tensor is said to be essentially unique, , if and are identifiable up to scaling and permutation. This means that if , for some , , and , then there exists a permutation matrix and diagonal scaling matrices such that,
(8) |
If there is no structure imposed on the low rank factors, then a generic identifiability condition on PARAFAC uniqueness is given in [14]. If, however, one or more of the low rank factor matrices have a Vandermonde structure, then more relaxed uniqueness conditions based on the Kruskal rank can be found in [13, 15, 10]. The latest and the most relevant identifiability results to the problem considered herein is given as follows.
Theorem 1.
V Hybrid Beamforming via PARAFAC
In this section, it will be shown how the wideband hybrid beamforming problem in (5) can be reformulated as a tensor decomposition problem. Let us define the matrices and , then it can be easily seen that (5) can be expressed in more compact form as
(10a) | |||
(10b) |
Remark 2.
Notice that while (10) assumes a uniform linear array (ULA) structure on the columns of the analog beamformer , the proposed tensor method can be further extended to handle other array structures, for e.g., uniform planar array (UPA) [16]. In that sense, the proposed method can be used to recover azimuth and elevation estimates for each column of . This is in fact a big advantage of the proposed approach relative to the state-of-the-art. Owing to space limitations, we will present only the ULA structure here.
Let us construct the following two subarrays,
(11a) | |||
(11b) |
Then, it follows that by exploiting the Vandermonde structure of the columns of the matrix , the matrices and are displaced but otherwise identical subarrays, i.e.,
(12) |
where . Further, for consistency, let , where . Let be a matrix holding the diagonal of on its -th row, for and . Then, upon defining , and , for , we can write the following,
(13) | ||||
(14) |
From the PARAFAC decomposition slab format defined in (7), it is easy to see that (13) and (14) form a two-slab, i.e., , PARAFAC model with Vandermonde structure in one mode. Thus, solving (10) is tantamount to decomposing the tensor with its -th slab defined as , for and . From an optimization perspective, this can be expressed as
(15) |
Several algorithms have been developed to tackle the optimization problem (15) [17]. In this work, we adopt the trilinear alternating least square (TALS) algorithm implemented in the widely known Tensorlab MATLAB toolbox [18].
Considering the condition in (9) in the context of hybrid beamforming, one can easily see that with and given that is tall (i.e., ) and Vandermonde, the condition in (9) is always satisfied.The only requirement though to ensure essential uniqueness of is that needs to be tall and full rank. This requires the number of subcarriers multiplied by the number of streams be greater than or equal to the number of transmit RF chains. This renders our proposed method not applicable for single carrier systems, i.e., , with , otherwise, such a condition can be easily satisfied with a modest number of subcarriers.
Let , and be the resulting solution of (15). The goal now is to find and given . To do so, we first recover the phases from the columns of by simply reading the angles of first elements of the columns of .
Input: ,
Construct
Construct as , and
Decompose using TALS
for do
Form
Obtain , ,
Obtain
To obtain , we need to resolve the complex scaling ambiguity that is inherent to PARAFAC (see Definition 1 for the essential uniqueness of PARAFAC). Note that we ignore the permutation ambiguity, as in the hybrid beamforming context, finding the analog and baseband precoders up to a common permutation ambiguity is irrelevant since it merely amounts to shuffling the RF chains. The complex scale ambiguity though is important as it amounts to entirely changing the directions of the precoders. Fortunately, since the columns of both matrices and exhibit a Vandermonde structure, the column-wise scale ambiguity in both matrices can be resolved by simply dividing the elements of each column by the first element. Once the complex scale ambiguities associated with the columns of and , denoted as and , respectively, are resolved, it can be seen from (8) that the column-wise scale ambiguity of , denoted as , can be easily obtained as . The above procedures for solving the wideband hybrid beamforming problem using Vandermonde-constrained Two-slab PARAFAC (V-TPAR) are outlined in Algorithm 1.
The complexity of Algorithm 1 is incurred in decomposing the tensor using the iterative TALS algorithm. The per iteration complexity of TALS is equal to the cost of inverting an matrix. The overall complexity then depends on the total number of iterations which in turn depends on the problem and the size of the tensor (see [17] and references therein for convergence properties of TALS). As we will see later, for the considered problem, a few iterations of TALS seem to be sufficient to obtain hiqh-quality solution.

Parameter | Value |
---|---|
Carrier frequency | 28 GHz |
Subcarrier spacing | 60 kHz |
Modulation | 16-QAM |
Code rate | 0.49 |
Number of transmit antennas | 32 |
Number of receive antennas | 8 |
UE speed | 0.5 km/hr |
Delay spread | 300 ns |
Channel model | CDL-C |
VI Simulations
In this section, we will provide numerical results on 3GPP link-level channel model to assess the performance of the proposed method. The adopted simulation parameters are listed in Table I. We use the CDL-C channel model with the delay spread set to 300 ns. Both BS and UE are equipped with uniform linear array where the antenna elements are separated by a half wavelength. All results are averaged out over 200 realizations. The number of subbands is set to , i.e., , where each subband consists of one resource block (RB), i.e., 12 subcarriers. The channel matrix for each subband is obtained by averaging out the channels across the 12 subcarriers. For the proposed method implementation, we used the TALS algorithm implemented in the Tensorlab MATLAB toolbox. Finally, all simulations were performed on an Intel(R) Xeon(R) Gold 6234 CPU.

To benchmark the performance of the proposed method, we use the manifold optimization (MO) alternating minimization algorithm [6], the phase extraction (PE) alternating minimization algorithm [6] and the OMP algorithm [3] as baselines. Both MO and PE solve the wideband hybrid beamforming problem (4) with unit modulus constraints on the entries of the analog beamformers, while the OMP algorithm solve (4) with codebook constraint on the columns of the analog beamformers. For OMP, we use the DFT codebook for both and .
From the feedback overhead perspective, one can see from Table II that the Vandermonde feasible set (our proposed method) attains the same overhead of the codebook one (OMP). In particular, the number of parameters to feed back is independent of the number of transmit (receive) antennas and is equal to the number of transmit (receive) RF chains if the the analog precoders and combiners are computed at the UE (BS). On the other side, the unit-modulus feasible set (MO and PE) suffers from the large overhead that scales up with the number of transmit/receive antennas, thereby limiting their use in limited feedback systems.
feasible set | |||
---|---|---|---|
Num. of parameters | |||
Method | MO and PE | OMP | T-VPAR |
To evaluate the practical impact of the different hybrid beamforming algorithms, we report the coded BER in an end-to-end system. First, we consider a scenario with and , i.e., , while the rest of the parameters are as listed in Table I. It is known from [6] that when , PE achieves the same performance of MO at much lower complexity while the performance of the former degrades when . Fig. 2 shows the end-to-end coded BER performance of the different methods. One can see that, for this case, the proposed method achieves more than 1 dB SNR gain relative to OMP. More interestingly, the proposed approach outperforms the PE method with more than an order of magnitude reduction in BER at -8 dB. Further, when as shown in Fig. 3, the performance of the proposed method significantly outperforms OMP with roughly 4 dB SNR gain. Finally, one can see that both the tensor method and PE attain approximately the same performance, with 1 dB loss relative to MO.
Next, we simulated another scenario with . It can be seen that now PE achieves the same performance as MO while the proposed method incurs roughly 2 dB SNR loss, as Fig. 4 depicts. In addition, one can see that the proposed algorithm considerably outperforms the OMP algorithm with more than an order of magnitude reduction in BER when the SNR exceeds -10 dB.
Finally, to assess the complexity of the proposed tensor approach, Fig. 5 depicts the average run time of the proposed method relative to the considered baselines, when and and . We observe that the run time of the proposed method is comparable to PE while achieving more than an order of magnitude reduction in run time compared to MO in both setups. Finally, OMP features the lowest run time but this obviously comes at the expense of performance.

VII Conclusions
This paper has considered single user hybrid precoding and combining in wideband mmWave MIMO systems under Vandermonde constraints on the hybrid precoders and combiners. The problem is formulated as a tensor factorization problem where PARAFAC is invoked to find the Vandermonde-constrained analog beamformers and the set of baseband precoders – with identifiability guarantees. Numerical results on a 3GPP link-level test bench have revealed the superiority of the proposed method relative to the state-of-the-art. In particular, the proposed method has shown to be striking the balance between performance, overhead and complexity. As a future work, we aim at expanding the applicability of the proposed framework to other array structures such as uniform plannar array (UPA). Further, we plan to explore the impact of increasing the number of subarrays (multi-slab PARAFAC as opposed to two slab) on the estimation accuracy, and its trade-off with computational complexity.


References
- [1] F. Boccardi, R. W. Heath, A. Lozano, T. L. Marzetta, and P. Popovski, “Five disruptive technology directions for 5G,” IEEE Commun. Mag., vol. 52, no. 2, pp. 74–80, Feb. 2014.
- [2] S. Rangan, T. S. Rappaport, and E. Erkip, “Millimeter-wave cellular wireless networks: Potentials and challenges,” Proc. of the IEEE, vol. 102, no. 3, pp. 366–385, Feb. 2014.
- [3] O. El Ayach, S. Rajagopal, S. Abu-Surra, Z. Pi, and R. W. Heath, “Spatially sparse precoding in millimeter wave MIMO systems,” IEEE Trans. on Wir. Commun., vol. 13, no. 3, pp. 1499–1513, Nov. 2014.
- [4] A. Alkhateeb, O. El Ayach, G. Leus, and R. W. Heath, “Channel estimation and hybrid precoding for millimeter wave cellular systems,” IEEE J. of Sel. Topics in Sig. Process., vol. 8, no. 5, pp. 831–846, Oct. 2014.
- [5] A. Goldsmith, S. A. Jafar, N. Jindal, and S. Vishwanath, “Capacity limits of MIMO channels,” IEEE J. on Sel. Areas in Commun., vol. 21, no. 5, pp. 684–702, June 2003.
- [6] X. Yu, J.-C. Shen, J. Zhang, and K. B. Letaief, “Alternating minimization algorithms for hybrid precoding in millimeter wave MIMO systems,” IEEE J. of Sel. Topics in Sig. Process., vol. 10, no. 3, pp. 485–500, Apr. 2016.
- [7] A. Alkhateeb and R. W. Heath, “Frequency selective hybrid precoding for limited feedback millimeter wave systems,” IEEE Trans. on Commun., vol. 64, no. 5, pp. 1801–1818, May 2016.
- [8] D. J. Love, R. W. Heath, V. K. Lau, D. Gesbert, B. D. Rao, and M. Andrews, “An overview of limited feedback in wireless communication systems,” IEEE J. on Sel. Areas in Commun., vol. 26, no. 8, pp. 1341–1365, Oct. 2008.
- [9] A. L. de Almeida, G. Favier, and J. C. M. Mota, “PARAFAC-based unified tensor modeling for wireless communication systems with application to blind multiuser equalization,” Signal Processing, vol. 87, no. 2, pp. 337–351, Feb. 2007.
- [10] N. D. Sidiropoulos, G. B. Giannakis, and R. Bro, “Blind parafac receivers for DS-CDMA systems,” IEEE Trans. on Sig. Process., vol. 48, no. 3, pp. 810–823, June 2000.
- [11] M. S. Ibrahim, C. I. Kanatsoulis, and N. D. Sidiropoulos, “Downlink channel feedback for FDD massive MIMO systems via tensor compression and sampling,” in 54th Asilomar Conference on Sig., Sys., and Comp., Pacific Grove, CA, USA, Nov. 2020, pp. 27–31.
- [12] O. El Ayach, R. W. Heath, S. Abu-Surra, S. Rajagopal, and Z. Pi, “Low complexity precoding for large millimeter wave MIMO systems,” in IEEE Int. Conf. on Commun. (ICC), Ottawa, Canada, June 2012, pp. 3724–3729.
- [13] N. D. Sidiropoulos and R. Bro, “On the uniqueness of multilinear decomposition of N-way arrays,” Journal of Chemometrics, vol. 14, no. 3, pp. 229–239, Nov. 2000.
- [14] L. Chiantini and G. Ottaviani, “On generic identifiability of 3-tensors of small rank,” SIAM Journal on Matrix Analysis and Applications, vol. 33, no. 3, pp. 1018–1037, Sept. 2012.
- [15] L. De Lathauwer, “A link between the canonical decomposition in multilinear algebra and simultaneous matrix diagonalization,” SIAM Journal on Matrix Analysis and Applications, vol. 28, no. 3, pp. 642–666, Sept. 2006.
- [16] N. D. Sidiropoulos, R. Bro, and G. B. Giannakis, “Parallel factor analysis in sensor array processing,” IEEE Trans. on Sig. Process., vol. 48, no. 8, pp. 2377–2388, Aug. 2000.
- [17] N. D. Sidiropoulos, L. De Lathauwer, X. Fu, K. Huang, E. E. Papalexakis, and C. Faloutsos, “Tensor decomposition for signal processing and machine learning,” IEEE Trans. on Sig. Process., vol. 65, no. 13, pp. 3551–3582, Apr. 2017.
- [18] N. Vervliet, O. Debals, L. Sorber, M. Van Barel, and L. De Lathauwer, “Tensorlab v3.0,” Mar. 2016.