Interleaved Training for Massive MIMO Downlink via Exploring Spatial Correlation
Abstract
Interleaved training has been studied for single-user and multi-user massive MIMO downlink with either fully-digital or hybrid beamforming. However, the impact of channel correlation on its average training overhead is rarely addressed. In this paper, we explore the channel correlation to improve the interleaved training for single-user massive MIMO downlink. For the beam-domain interleaved training, we propose a modified scheme by optimizing the beam training codebook. The basic antenna-domain interleaved training is also improved by dynamically adjusting the training order of the base station (BS) antennas during the training process based on the values of the already trained channels. Exact and simplified approximate expressions of the average training length are derived in closed-form for the basic and modified beam-domain schemes and the basic antenna-domain scheme in correlated channels. For the modified antenna-domain scheme, a deep neural network (DNN)-based approximation is provided for fast performance evaluation. Analytical results and simulations verify the accuracy of our derived training length expressions and explicitly reveal the impact of system parameters on the average training length. In addition, the modified beam/antenna-domain schemes are shown to have a shorter average training length compared to the basic schemes.
Index Terms:
Massive MIMO, interleaved training, spatial correlation, conditional distribution, training overhead.I Introduction
Via exploiting the large number of spatial degrees-of-freedom provided by large-scale antenna arrays, massive MIMO systems can achieve significant performance improvement compared to conventional MIMO systems [1, 2]. One crucial practical issue for massive MIMO downlink is the acquisition of channel state information (CSI) at the base station (BS), especially for frequency-division-duplexing (FDD) systems with no uplink-downlink channel reciprocity [3]. Traditional downlink training and channel estimation schemes cause prohibitive training overhead due to the massive number of channel coefficients to be estimated [4].
Existing studies on the downlink CSI acquisition of massive MIMO can be divided into the following categories. In [5, 6, 7, 8], the channel statistics, e.g., spatial and/or temporal correlation, are utilized to conduct beamformed channel estimation. In [9, 10, 11, 12, 13, 14], compressive sensing algorithms are designed to exploit the channel sparsity in the angular domain and/or the common sparsity among users and/or subcarriers. In [15, 16, 17, 18], either the partial reciprocity between the uplink and downlink channels, e.g., with similar angle and delay of propagation paths, or their implicit relationships, e.g., both being the functions of user location, are used for channel training designs.
These aforementioned schemes aim to obtain the complete antenna-domain CSI with the smallest possible pilot overhead before the data transmissions, thus the training design and data transmission design are decoupled, which imposes limitations on the tradeoff between training overhead and performance. Further, only the throughput or diversity gain has been considered in these existing works. The quality-of-service (QoS) provided by the obtained CSI is not taken into consideration during the training process. Therefore, the training length or pilot overhead is fixed and does not adjust according to specific channel realizations. For massive MIMO systems, it is possible to use partial CSI to design the beamforming scheme for the data transmission period, especially with the outage probability performance measure. A new training method, namely the interleaved training, is proposed to dynamically adjust the training overhead according to the required QoS and specific channel realizations.
The idea of interleaved training was proposed in [19, 20] for the downlink of single-user full-digital massive antenna systems with independent and identically distributed (i.i.d.) channels, where the channels of different antennas are trained sequentially and the estimated CSI or indicator is fed back at the end of each training step. With each feedback, the BS decides to conduct the training of another antenna’s channel or to terminate the training process based on whether an outage occurs. Compared to traditional schemes, the interleaved training can achieve a significant reduction in training overhead with no degradation of outage performance.
The work in [21] applied the idea of interleaved training to beam-domain transmission, where joint beam-based interleaved training and data transmission schemes are proposed for massive MIMO systems with single and multiple users. In [22], an improved codebook is further designed for interleaved training in millimeter-wave hybrid massive MIMO downlink. In [23], a joint interleaved training and transmission design is proposed for large-intelligent surface (LIS) assisted systems under i.i.d. Rayleigh fading channels. Recently, an interleaved training design is proposed for multi-user massive MIMO downlink in [24, 25], in which analytical results on the training length and the transmission success rate are provided for the maximum-ratio transmission (MRT) precoding. Different from the single-user scheme, the multi-user scheme needs to judge whether the signal-to-interference-plus-noise (SINR) requirements of all users can be satisfied with partial CSI during the interleaved training procedure. The advantages of interleaved training are clearly demonstrated in the above studies.
Different from previous papers on interleaved training [19, 20, 21, 22, 23, 24, 25], in this paper, we focus on exploiting channel statistics to further improve the performance of interleaved training for single-user massive MIMO downlink systems. Both beam-domain and antenna-domain modified interleaved training designs are proposed, and we further provide analytical results on the average training length of the proposed schemes and their comparison with those of the basic schemes. Detailed contributions are summarized as follows.
-
•
In the modified beam-domain scheme, both the beam direction and the beam training order are optimized based on the channel correlation information. In the modified antenna-domain scheme, the BS antenna training order is dynamically adjusted during the training process according to the channel values of the already trained antennas. To this end, we derive the conditional distribution of the untrained BS channels given the values of the trained channels under general correlated channels. And for exponentially correlated channels, we demonstrate that the conditional distribution of any untrained antenna’s channel is only dependent on the channels of its nearest antennas on both sides in the already-trained antenna set, which significantly simplifies the complexity of the modified antenna-domain scheme.
-
•
Closed-form expressions of the average training length are derived for the basic and modified beam-domain interleaved training schemes with general correlated channels. For exponentially correlated channels, we further provide a simplified approximation of the average training length for the modified scheme when the number of BS antennas is large.
-
•
A closed-form average-training-length expression is also derived for the basic antenna-domain interleaved training with general correlated channels, and its simple approximation is given for exponentially correlated channels. For the average training length of the modified antenna-domain interleaved training, we propose a deep neural network (DNN)-based approximation to achieve fast performance evaluation.
-
•
Simulation results verify our derived analytical expressions and theoretical analysis on the impact of systems parameters, e.g., channel correlation, antenna number, and the requirement of the signal-noise-ratio (SNR), on the average training length of the basic and modified antenna/beam-domain interleaved training schemes under two typical correlated channels. In addition, simulations also demonstrate that our proposed modified antenna and beam-domain interleaved training schemes both outperform the basic interleaved training schemes.
The remainder of this paper is organized as follows. In Section II, we introduce the single-user massive MIMO downlink system with two typical channel correlation models and the basic antenna/beam-domain interleaved training scheme. In Section III, the modified beam-domain interleaved training is proposed along with related analytical results. In Section IV, we propose the modified antenna-domain interleaved training and conduct theoretical analysis. Simulations are provided in Section V. Section VI summarizes this work. Some proofs are included in the appendix.
Notation: Bold upper and bold lower case letters denote matrices and vectors. denotes the by dimensional complex space. denote the -dimensional identity matrix. The conjugate transpose, transpose, determinant, adjugate matrix, rank and inverse of are denoted by , , , , and . The vector denotes the -th column of the matrix . For a vector , is the -th element of and is the sub-vector composed of the -th element of for when the subscript is an index set . Similarly, is the -th element of and is the sub-matrix composed of the -th element of for and . Define as the set . represents the probability of event . represents the floor function. denotes the Euclidean norm of . is the diagonal matrix whose diagonal entries are elements of vector . denotes the probability density function (PDF) of a random variable (RV) . denotes the circularly symmetric complex Gaussian distribution with mean vector and covariance matrix . denotes the chi-squared distribution with being the degrees of freedom. denotes the noncentral chi-squared distribution with and being the degrees of freedom and the non-centrality parameter. is the first order Marcum Q-function. denotes the equality in distribution.
II System Model
We consider a massive MIMO downlink system with an -antenna BS and a single-antenna user equipment (UE). The downlink BS-UE channel, denoted as , is modeled as a circular-symmetric complex Gaussian vector following the distribution , where is the channel covariance matrix. One typical correlation model is the one-ring correlation model [5],[26], which is expressed as
(1) |
where is the angle interval of the channel power seen at the BS, represents the power angle spectrum (PAS), satisfying , and is the BS array response vector. For the uniform linear array (ULA), where is the antenna spacing ratio. Another typical correlation model is the exponential one [27], i.e.,
(2) |
where , satisfying , is the channel correlation between adjacent antennas. This is a simple single-parameter model commonly used for many communication problems, which is also physically reasonable in the sense that the correlation decreases with increasing distance between antennas, e.g., in the ULA.
The downlink transmission can be represented as
(3) |
where is the received signal at the user, is the antenna-domain beamformer at the BS with the unit norm, i.e., , is the transmitted symbol with unit average power, is the transmit power and is the normalized receive noise at the UE which follows The received SNR can be written as
(4) |
If the beam-domain transmission is conducted, can be decomposed into two parts: , where () is the external beamforming matrix, is the beam-domain beamformer, and is the number of beams. One typical is the normalized discrete Fourier transformation (DFT) matrix with . Define the -dimensional beam-domain channel as . We have with . Eq. (4) can be converted to
(5) |
For a given target data transmission rate , an outage event occurs if , or equivalently if , where is the normalized receive SNR threshold.
II-A General Framework of Interleaved Training and the Basic Training Scheme
In this subsection, we introduce the general antenna-domain and beam-domain interleaved training and give the basic interleaved training algorithms proposed in [20, 21] as the baseline of our study. For a uniform representation, we define
(6) |
In the general antenna/beam-domain interleaved training scheme, the BS trains the channel of one antenna/beam for each step, and the order of the antennas/beams during the training is determined according to a predefined criterion. After the -th training step, the UE knows with denoting the set of indices of the already trained BS antennas/beams 111The main purpose of interleaved training is to reduce the training overhead. In order to focus on the theoretical analysis and give more insights, we do not consider the error resulting from channel estimation and feedback quantization in our study.. To maximize the receive SNR based on this currently acquired CSI, the BS can conduct the following downlink beamforming
(7) |
The receive SNR of this beamformer is thus . Based on whether an outage occurs, i.e., , the UE decides to notify the BS to continue training by one bit , or feed back one bit and channel values of the already trained antennas/beams to the BS for transmission beamforming.
In the basic antenna/beam-domain interleaved training scheme as shown in Algorithm 1, the channel of one BS antenna or one DFT beam is trained for each step, and the antennas/beams are trained sequentially following their original indices, i.e., after the -th training step, the index set of the already trained antennas/beams is .
III Modified Beam-Domain Interleaved Training and Performance Analysis
In the basic beam-domain interleaved training scheme [21], the adopted DFT beams give no guarantee to align the effective propagation paths, and nor does it consider setting a higher training priority for beams with stronger average power. In the following, we explore the channel covariance matrix to improve the beam-domain interleaved training via addressing the above issues. In addition, we perform analysis on the average training length of the modified beam-domain interleaved training and compare it with that of the basic beam-domain interleaved training to reveal the advantages of the modified design. The methods of acquiring channel covariance matrix at the BS in massive MIMO systems can be referred to [28, 29, 30].
III-A Modified Beam-Domain Training Design
Recall that the channel covariance matrix is positive semi-definite and we denote its rank as . We consider the compact eigenvalue decomposition of : , where is an semi-unitary matrix and with . With the knowledge of and , the BS can set , implying that , and therefore is the -dimentional vector of the beam-domain channel coefficients. In the modified scheme, the BS trains the effective beams in turn, such that the beams are trained with decreasing average power. After steps of beam training, the BS obtains the beam-domain channels where . The BS conducts the beam-domain precoding according to Eq. (7). And an outage occurs if . With this beam ordering, the specific process of the modified beam-domain training scheme can be referred to as Algorithm 1. The modified scheme has both beam alignment and ordering through the eigenmatrix of the channel covariance matrix. This is the major difference from the basic one.
From the Toeplitz eigen-subspace approximation result in [31], the eigenvectors of the one-ring covariance matrix in Eq. (1) and those of the exponential covariance matrix in Eq. (2) can both be well approximated by the columns of a DFT matrix for . As increases asymptotically to infinity, both the modified scheme and the basic scheme use the DFT codebook for beam training and their difference then only lies in the order of the beams during training. In this case, since the modified scheme trains the beams with decreasing average power sequentially, it has a shorter average training length.
III-B Average Training Length Analysis
Considering that the difference between the basic beam-domain interleaved training and the modified one lies only in the use of the training beam codebook, we first give the analysis of the average training length for the beam-domain interleaved training scheme with any given beam codebook, based on which the average training length of the modified scheme and its comparison with that of the basic scheme are subsequently given.
III-B1 Analysis for the General Beam-Domain Scheme
Recall that the receiver SNR after the -th training step is . From Algorithm 1, we can see that the training stops after the -th training step with probability for and for . And the training stops after the -th training with probability . The average training length of the beam-domain interleaved training scheme can be expressed as
(8) | ||||
Theorem 1.
For an arbitrary beam codebook , recall that and define . Consider the compact eigenvalue decomposition where is an semi-unitary matrix, and is the rank of . Suppose that there are different eigenvalues with value of and repeated time of for . Define . The average training length of the general beam-domain interleaved training scheme under correlated channels can be expressed as
(9) | ||||
where , and
III-B2 Analysis for the Modified Beam-Domain Scheme
The derivation for the average training length of the modified beam-domain interleaved training scheme can be referred to Theorem 1 by setting the beam codebook as . Therefore, and , which leads to for . It is noteworthy that since the modified scheme uses the eigenvectors of the channel covariance matrix as the beam codebook, is independent of the training step index . Suppose that there are different values in the first eigenvalues of with value of and repeated times for . Define .
Corollary 1.
The average training length of the modified beam-domain interleaved training scheme can be expressed as:
(13) | ||||
where , and .
Proof.
The result can be directly obtained from Theorem 1. ∎
Corollary 2.
For channels with the exponential covariance matrix in Eq. (2) and , a large approximation of the average training length for the modified beam-domain interleaved training scheme can be written as
(14) |
where and
(15) |
for .
Proof.
The results in Eq. (9), Eq. (13) and Eq. (14) are in closed-form and can be used to evaluate the average training length for different system parameter values.
In the following, we discuss the impact of the antenna number on the average training length of the modified beam-domain interleaved training.
Corollary 3.
For the modified beam-domain interleaved training scheme, when the use of all beams can avoid an outage, the average training length is a non-increasing function of the antenna number , in both one-ring correlated channels with non-zero AS and exponentially correlated channels.
Proof.
Denote the eigenvalues of the channel covariance matrix in descending order as and for the number of antennas being and , respectively. Since the channel covariance matrix for BS antennas is a submatrix of that for BS antennas, we have either , e.g., for channels with the full-rank exponential covariance matrix in Eq. (2), or . From Eq. (8) and Eq. (10), the average training length can be expressed as . Then the difference between the average training lengths for systems with and BS antennas for the case of is
(16) |
where . From the Eigenvalue Interlacing Theorem [34], we have . Thus . The condition that the use of all beams can meet the transmission requirement leads to . Therefore, is non-increasing with increasing under this condition. For the case of , this conclusion still stands due to . ∎
Remark 1.
Numerical simulation based on Eq. (13) shows that the average training length increases with for small ; while for large , decreases with and converges to a constant value. This is because when is small, is the dominant term in in Eq. (16), which has a positive value. When is large, , therefore, as shown in Corollary 3, decreases with .
Next, we discuss the effect of the channel correlation on the average training length of the modified beam-domain interleaved training scheme for channels with the exponential covariance matrix. Numerical simulation based on Eq. (14) shows that for relatively small , higher channel correlation helps reduce the average training length of the modified beam-domain interleaved training. However, as continues to increase, an increase in the channel correlation may have the opposite effect. According to the derivative of eigenvalues in Eq. (15), i.e., , with respect to , larger eigenvalues increase, while smaller eigenvalues become smaller, as increases. For very large , for , and has the greatest impact on . In this case, smaller results in flatter eigenvalue distribution which provides lower and shorter . For small enough , for , and has the greatest impact on . In this case, larger results in higher and shorter .
IV Modified Antenna-Domain Interleaved Training and Performance Analysis
In this section, we first discuss the impact of channel correlation on the average training length of the basic antenna-domain interleaved training. Then we derive the conditional distribution of channels of un-trained BS antennas based on channel values of the already trained BS antennas during the interleaved training process, based on which we further propose the design of the modified antenna-domain interleaved training.
IV-A Average Training Length Analysis and Impact of Channel Correlation
In the following, we give closed-form expressions of the average training length of the basic antenna-domain interleaved training scheme under general correlated channels and exponentially correlated channels respectively.
Compared to Theorem 1, the only difference in the derivation on the average training length of the basic antenna-domain interleaved training is that the covariance matrix of the trained channels after training steps is and the vector of the trained channels can be represented as . Consider the compact eigenvalue decomposition: where and is the rank of . Then we have with . Suppose that there are different eigenvalues with value of and repeated times of for . Define .
Theorem 2.
The average training length of the basic antenna-domain interleaved training scheme under general correlated channels can be expressed as
(17) | ||||
where , , and
Proof.
Please refer to the proof of Theorem 1. ∎
To analyze the impact of channel correlation on the average training length, we consider two extreme cases, i.e., the i.i.d. channels with and the fully correlated channels with and for .
For the i.i.d. channels, we have , , , and ; and . Via substituting these into Eq. (17) in Theorem 2, we can obtain
(18) |
According to the result in [20, Theorem 2], we have for .
For the fully correlated channels, we have , , , and ; and . Via substituting these into Eq. (17) in Theorem 2, we can obtain
(19) |
And the behavior of for is given in the following corollary.
Corollary 4.
For , we have , where is the Euler’s constant.
Proof.
Define . For , we have . Meanwhile, we have . Therefore, we can obtain . Since and , we have for . ∎
Remark 2.
When increases asymptotically, under independent channels, the average training length of the basic scheme has the upper bound ; while under fully correlated channels, the average training length of the basic scheme is proportional to , which implies the negative effect of channel correlation to the average training length of the basic training scheme. Further, numerical calculations based on Eq. (18) and (19) show that when is small compared to , the basic scheme has a shorter average training length in the i.i.d. channels. This mainly benefits from the higher antenna diversity gain. After reaches about the same size as , the basic scheme has a shorter average training length in the fully correlated channels, where the consistency of antenna energy is more important than the diversity gain.
Corollary 5.
For channels with the exponential covariance matrix in Eq. (2) and almost all values of , the average training length of the basic antenna-domain interleaved training can be expressed as
(20) |
where and
(21) |
for and
(22) |
where .
Proof.
For the exponential covariance matrix, the approximations of can be written as Eq. (21) and Eq. (22) according to [33, Eq. (35), Eq. (43a-b), Eq. (49a-b)]. From the monotonicity of in and that of in , we have that for , are different from each other, while for , are different from each other, and is different from for . For intermediate values, are different from each other due to the monotonicity of in as well, and is different from for . Therefore, we have , , , for . Via substituting these into Eq. (17) in Theorem 2, Eq. (20) can be obtained. ∎
For almost all values of , we can conduct faster evaluation and analysis for the average training length of the basic antenna-domain interleaved training scheme with Eq. (20) compared to that with Eq. (17). For large values satisfying or intermediate values satisfying , the training length can still be calculated according to Eq. (17).
IV-B Derivations on the Conditional PDF of the Untrained Channels
On the one hand, simulation based on Eq. (17) shows that the channel correlation leads to an increase in the average training length of the basic antenna-domain interleaved training at the general rate requirement. On the other hand, if the channel correlation exists and the system knows it as a priori, we can use the correlation to improve the training efficiency. Specifically, the conditional PDF of the untrained channels can be derived for given values of the already trained channels. Based on this conditional PDF, the choice of the BS antenna for the next training step can be optimized. In this subsection, we derive the conditional PDF.
Lemma 1.
Given the channel values of the already trained BS antennas, , the conditional PDF of the un-trained channels follows where
(23) |
(24) | ||||
and . The conditional cumulative distribution function (CDF) of the power of the untrained channel is
(25) |
Proof.
is the covariance matrix of the vector of the trained channels , which is a submatrix of the overall channel covariance matrix . Recall that is a circular-symmetric complex Gaussian vector. Then from [35, Eq. (32)], the conditional mean in Eq. (23) and the conditional variance in Eq. (24) can be obtained. The CDF in Eq. (25) can be obtained from properties of noncentral chi-squared distribution, i.e., due to . Then, we have the conditional PDF and the conditional CDF
(26) |
Therefore, Eq. (25) can be obtained. ∎
Recall that denotes the set of indices of the already trained BS antennas and for simplicity of presentation, we assume that . If the index of an un-trained BS antenna satisfies , we denote the index of its nearest BS antennas in the trained set with a smaller index as , that is, . Thus is the index of the trained BS antenna which is the nearest to Antenna with a larger index than . Define and .
Corollary 6.
Under the exponential correlation model, we have
(27) |
and
(28) |
The conditional distribution of the channel of BS antenna is only related to channel values of the two nearest BS antennas on both sides, and .
Proof.
See Appendix A. ∎
The results in Corollary 6 can help significantly reduce the computational complexity for the conditional CDF of the untrained channel power for scenarios with an exponential correlation model.
IV-C Modified Antenna-Domain Interleaved Training Scheme
Based on the conditional PDF of the untrained channels in Lemma 1, we propose a modified antenna-domain interleaved training scheme where at the beginning of each training step, the BS antenna whose channel is to be trained is optimally selected. The basic idea is to use the channel values of the already trained antennas to calculate the probability of meeting the transmission requirement if each untrained BS antenna is selected. Then the antenna with the highest probability is chosen.
For the selection of the first antenna to be trained, we cannot use the same approach since no channel values have been obtained. Instead, we use the conditional variance in Eq. (24). Under the assumption that all antennas have the same average power, the first antenna to be trained can be the one resulting in the minimum overall conditional variance of other antennas, e.g., . It can be seen from Eq. (28) that under the exponential correlation model.
Recall that and are the set of indices of the BS antennas whose channels have been trained and the obtained channel vector of these BS antennas after training steps. At the beginning of the -th training step, based on the already obtained channel vector , the BS antenna to be trained for the -th step is selected by the BS as follows. For each untrained BS antenna , the BS calculates the probability that the obtainment of the channel of BS antenna in the -th training step can meet the transmission requirements as follows:
(29) | ||||
Then, the BS selects the one with the highest probability among all untrained antennas, i.e., the index of the BS antenna for the training step is
(30) |
The proposed modified antenna-domain interleaved training scheme is presented in Algorithm 2, where the major difference to the basic scheme is in Step 4 on the antenna selection.
IV-C1 Complexity Analysis
The complexity of Algorithm 2 is mainly generated by Step in the loop from Step to Step . Denote the training length for a random channel realization as satisfying . For the -th training step, where , the number of operations needed for calculating in Eq. (23) and Eq. (24) scales as , and the number of operations needed for the remaining matrix multiplications in Eq. (23) and Eq. (24) scales as . Therefore, the complexity for calculating the conditional mean and variance for a training process with steps is and an upper bound on the complexity of Algorithm 2 is since . For channels with the exponential covariance matrix, since the conditional mean and variance can be calculated according to Eq. (27) and Eq. (28) without matrix inversion and matrix multiplication, an upper bound of the algorithm complexity is .
IV-C2 Average Training Length
Similar to the derivation of Eq. (8), the average training length of the modified antenna-domain interleaved training scheme can be expressed as
(31) |
To derive the analytical or even closed-form expression of in Eq. (31), the key is to calculate . From Step in Algorithm 2 and Eq. (29) we know that one should first calculate the conditional mean and conditional variance for based on both and to decide the antenna index for the -th training step. This makes the derivation of the PDF of challenging because changes for each channel realization. In addition, the function involves a two-fold infinite series summation, resulting in an implicit relationship between and , . These all make the derivation of an analytical expression of in Eq. (31) intractable.
To circumvent the above difficulties, we introduce the deep neural network (DNN) with being the network parameter matrix to model the function of with respective to system parameters, e.g., the BS antenna number , the channel covariance matrix and the normalized SNR threshold . For channels with the exponential covariance matrix, one can use the correlation coefficient to replace the input parameter . The latter simulation results show that the function can be well-fitted by a DNN model with fully connected hidden layers. This deep learning-based approximation of the average training length can provide a faster performance evaluation of the modified antenna-domain interleaved training scheme compared to the Monte Carlo simulation.
V Simulation and Discussion
In this section, simulation results are shown for the proposed modified beam-domain and antenna-domain interleaved training schemes and their comparison with existing baseline schemes. The exponential correlation model in Eq. (2) is considered in Sections V-A to V-D. The one-ring correlation model in Eq. (1) is considered in Section V-E.
V-A Beam-Domain Interleaved Training Under the Exponential Correlation Model
Fig. 1 shows the average training lengths of the basic and modified beam-domain interleaved training schemes under the exponential correlation model, including the simulation values, the theoretical values in Eq. (9) of Theorem 1 and Eq. (13) of Corollary 1, and the approximate values in Eq. (14) of Corollary 2. We can see from Fig. 1a that the curves of simulation values and theoretical values match well for different scenarios. The curves of approximate values for and in Fig. 1b have some gap with the simulation curves, while the gap for the case of is relatively small. This is because that Eq. (15) is a large- eigenvalue approximation. These observations verify our derivations in Section III-B.
V-B Comparison of Basic and Modified Beam-Domain Interleaved Training Under the Exponential Correlation Model
Fig. 2 shows the average training lengths of the basic and modified beam-domain interleaved training with different antenna number for and . It can be seen that the modified scheme outperforms the basic scheme in the average training length for three combinations of and , i.e., 1) bit/s/Hz, dB and ; 2) bit/s/Hz, dB and ; 3) bit/s/Hz, dB and . And the advantage becomes larger as increases from to , showing that the modified beam-domain scheme exhibits greater performance advantages under more stringent transmission requirements. In addition, the average training lengths of both schemes first increase and then decrease as increases. For large enough , the training length levels off as increases. These results are consistent with the description in Corollary 3 and Remark 1.
Fig. 3 shows the average training lengths of the basic and modified beam-domain interleaved training schemes with different channel correlation levels for , bit/s/Hz and dB. Note that respectively. The basic scheme uses the DFT codebook. The shorter average training length of the modified scheme can also be observed in the figure, especially for relatively low transmit power, e.g., dB. And with increasing , the performance advantage of the modified scheme over the basic scheme enlarges for dB, while for dB, it first increases for and then decreases for .
V-C Antenna-Domain Interleaved Training Under the Exponential Correlation Model
Fig. 4 shows the average training lengths of the basic and modified antenna-domain interleaved training schemes under the exponential correlation model for , bit/s/Hz and , i.e., i.i.d. channels, , and . Fig. 4a shows the simulation values, the theoretical values provided by Corollary 2, i.e., Eq. (9), and the approximate values of the basic scheme in Eq. (20) of Corollary 5. We can see from the figure that all three curves match well for different scenarios. This verifies the results in Section IV-A. For bit/s/Hz, the average training length of the basic scheme increases when increases from to for dB, i.e., . For smaller and larger , the increase of , on the contrary, leads to a decrease of average training length for the basic scheme.
In Fig. 4b, we use a fully connected DNN containing four hidden layers (each with 4, 8, 16 and 32 Relu neurons) to provide an approximate average training length of the modified antenna-domain interleaved training scheme. The function for training the DNN model is the trainlm based the Levenberg-Marquardt algorithm, which has the fastest convergence speed for medium-sized DNN. The loss is the mean-square error (MSE). The dataset has samples of . The ratio of the training set, validation set, and test set is 0.7:0.15:0.15. As shown in the figure, the designed DNN fits the function of training overhead well and its prediction of for these unseen combinations of , and matches with the simulation results.
V-D Comparison of Basic and Modified Antenna-Domain Interleaved Training Under the Exponential Correlation Model
Fig. 5 shows the average training lengths of the basic and modified antenna-domain interleaved training schemes with different BS antenna number for and . It can be seen that the modified scheme outperforms the basic scheme in the average training length for three combinations of and , i.e., 1) bit/s/Hz, dB and ; 2) bit/s/Hz, dB and ; 3) bit/s/Hz, dB and . And the advantage becomes larger as increases from to . In addition, with increasing , the average training length of the basic scheme increases and converges, with faster convergence and a smaller value for compared to those for . However, the average training length of the modified scheme first increases and then decreases and finally levels off with increasing . This is because as increases, there are more untrained antennas available after each interleaved training step, which increases the diversity of untrained antennas’ conditional distributions. Furthermore, the performance advantage of the modified scheme over the basic scheme first increases and then levels off as increases.
Fig. 6 shows the average training lengths of the basic and modified antenna-domain interleaved training schemes with different channel correlation levels for , bit/s/Hz and dB. We can see from the figure that the modified scheme has a shorter average training length than the basic scheme. As the correlation coefficient increases, the average training length of the basic scheme increases for dB and dB, but decreases for dB. On the contrary, with increasing , the average training length of the modified scheme for dB first decreases for and then increases for larger .
V-E Antenna and Beam-Domain Interleaved Training Under the One-Ring Correlated Channels
In this section, simulations are given to demonstrate the applicability of partial analytical results to the one-ring correlation model in Eq. (1). The uniform PAS model [36], i.e., , is considered where denotes the mean angular of departure (AoD) and the angular spread (AS) is .
Fig. 7 shows the average training lengths of the basic and modified beam-domain interleaved training schemes with different transmit powers dB under the one-ring correlated channel model for , bit/s/Hz, , and . The theoretical values of the average training length in Eq. (9) of Theorem 1 and Eq. (13) of Corollary 1 match the simulation values well. And the modified scheme under three different ASs has an obvious performance advantage over the basic scheme. With decreasing AS or increasing channel correlation, this performance advantage enlarges for relatively high transmit power, e.g., dB, while decreases for low transmit power, e.g., dB.
Fig. 8 shows the average training lengths of the basic and modified antenna-domain interleaved training schemes with different transmit powers dB under the one-ring correlated channel model for , bit/s/Hz, , and . The modified scheme in the antenna-domain under three different ASs also outperforms the basic scheme, and the theoretical average training length in Eq. (9) matches the simulation value well.
VI Conclusion
In this paper, the channel spatial correlation was explored to improve the interleaved training for single-user massive MIMO downlink. Via optimizing the beam training codebook and the antenna training sequence based on the channel correlation, we respectively proposed the modified beam-domain and antenna-domain interleaved training schemes. For exponentially correlated channels, the conditional channel distribution of an untrained BS antenna given channel values of the already trained BS antennas was demonstrated to be only dependent on the channels of its nearest antennas on both sides in the already-trained antenna set, simplifying the complexity of the modified antenna-domain scheme significantly. Exact and approximate closed-form expressions were derived for the basic and modified beam/antenna-domain schemes in correlated channels. The impact of system parameters, e.g., the channel correlation, the antenna number, and the SNR requirement, on the average training length was explicitly revealed. Simulations verified our derivations and demonstrated the performance advantage of our proposed modified schemes.
In addition to spatial correlation, channel temporal correlation can be exploited to improve the channel acquisition efficiency in massive MIMO systems. Unlike spatial correlation, temporal correlation has causality constraints in the time dimension and we can only use the historical training result to extrapolate. However, the historical result is not complete due to the characteristics of the interleaved scheme. How to explore the temporal correlation in interleaved training is worth to be further studied.
Appendix A Appendix
A-A Proof of Corollary 6
Define We denote as for simplicity of presentation. Then we have where is by replacing its -th row with . Recall that where is the adjugate matrix of , i.e., with being the algebraic cofactor of . Therefore, we have .
Here we prove that and are the only two non-zero elements in , equivalently when and when . For the first part, it is suffice to show that the row vectors of are linearly dependent when and we show this by construction. Let , we have via straightforward calculations that .
Next we prove that when . Define for . Via splitting the -th element in , i.e., the -th element in , into , we can split the into the sum of two determinants and obtain the recurrence formula via expanding the first determinant in Eq. (32) by the second column, i.e.,
(32) | ||||
Note that the second determinant in Eq. (32) is zero since the first column of its matrix is times the second column. Then we calculate as follows:
(33) | ||||
Similar procedure can be used to calculate . The difference is that the -th element in is split into . Then we can obtain
(34) | ||||
for and . can also be calculated by splitting the -th element into . Then we split the determinant by the -th row into the sum of two determinants and leave the rest of the rows unchanged. Similarly, we calculate in this way. The difference is that the -th element in is split into . Then we obtain
(35) | ||||
References
- [1] T. L. Marzetta, “Noncooperative cellular wireless with unlimited numbers of base station antennas,” IEEE Trans. Wireless Commun., vol. 9, no. 11, pp. 3590–3600, 2010.
- [2] H. Q. Ngo, E. G. Larsson, and T. L. Marzetta, “Energy and spectral efficiency of very large multiuser MIMO systems,” IEEE Trans. Commun., vol. 61, no. 4, pp. 1436–1449, 2013.
- [3] C. Zhang, Y. Huang, Y. Jing, S. Jin, and L. Yang, “Sum-rate analysis for massive MIMO downlink with joint statistical beamforming and user scheduling,” IEEE Trans. Wireless Commun., vol. 16, no. 4, pp. 2181–2194, 2017.
- [4] M. Biguesh and A. B. Gershman, “Training-based MIMO channel estimation: A study of estimator tradeoffs and optimal training signals,” IEEE Trans. Signal Process., vol. 54, no. 3, pp. 884–893, 2006.
- [5] A. Adhikary, J. Nam, J.-Y. Ahn, and G. Caire, “Joint spatial division and multiplexing—The large-scale array regime,” IEEE Trans. Inf. Theory, vol. 59, no. 10, pp. 6441–6463, 2013.
- [6] J. Choi, D. J. Love, and P. Bidigare, “Downlink training techniques for FDD massive MIMO systems: Open-loop and closed-loop training with memory,” IEEE J. Sel. Topics Signal Process., vol. 8, no. 5, pp. 802–814, 2014.
- [7] W. Shen, L. Dai, B. Shim, S. Mumtaz, and Z. Wang, “Joint CSIT acquisition based on low-rank matrix completion for FDD massive MIMO systems,” IEEE Commun. Lett., vol. 19, no. 12, pp. 2178–2181, 2015.
- [8] Y.-X. Zhang, A.-A. Lu, and X. Gao, “Sum-rate-optimal statistical precoding for FDD massive MIMO downlink with deterministic equivalents,” IEEE Trans. Veh. Technol., vol. 71, no. 7, pp. 7359–7370, 2022.
- [9] X. Rao and V. K. Lau, “Distributed compressive CSIT estimation and feedback for FDD multi-user massive MIMO systems,” IEEE Trans. Signal Process., vol. 62, no. 12, pp. 3261–3271, 2014.
- [10] Z. Gao, L. Dai, Z. Wang, and S. Chen, “Spatially common sparsity based adaptive channel estimation and feedback for FDD massive MIMO,” IEEE Trans. Signal Process., vol. 63, no. 23, pp. 6169–6183, 2015.
- [11] W. Shen, L. Dai, Y. Shi, B. Shim, and Z. Wang, “Joint channel training and feedback for FDD massive MIMO systems,” IEEE Trans. Veh. Technol., vol. 65, no. 10, pp. 8762–8767, 2015.
- [12] K. Venugopal, A. Alkhateeb, N. G. Prelcic, and R. W. Heath Jr., “Channel estimation for hybrid architecture-based wideband millimeter wave systems,” IEEE J. Sel. Areas Commun., vol. 35, no. 9, pp. 1996–2009, 2017.
- [13] P. N. Alevizos, X. Fu, N. D. Sidiropoulos, Y. Yang, and A. Bletsas, “Limited feedback channel estimation in massive MIMO with non-uniform directional dictionaries,” IEEE Trans. Signal Process., vol. 66, no. 19, pp. 5127–5141, 2018.
- [14] M. Ke, Z. Gao, Y. Wu, X. Gao, and R. Schober, “Compressive sensing-based adaptive active user detection and channel estimation: Massive access meets massive MIMO,” IEEE Trans. Signal Process., vol. 68, pp. 764–779, 2020.
- [15] Y. Han, T.-H. Hsu, C.-K. Wen, K.-K. Wong, and S. Jin, “Efficient downlink channel reconstruction for FDD multi-antenna systems,” IEEE Trans. Wireless Commun., vol. 18, no. 6, pp. 3161–3176, 2019.
- [16] W. Peng, W. Li, W. Wang, X. Wei, and T. Jiang, “Downlink channel prediction for time-varying FDD massive MIMO systems,” IEEE J. Sel. Topics Signal Process., vol. 13, no. 5, pp. 1090–1102, 2019.
- [17] S. Kim, J. W. Choi, and B. Shim, “Downlink pilot precoding and compressed channel feedback for FDD-based cell-free systems,” IEEE Trans. Wireless Commun., vol. 19, no. 6, pp. 3658–3672, 2020.
- [18] Y. Yang, F. Gao, G. Y. Li, and M. Jian, “Deep learning-based downlink channel prediction for FDD massive MIMO system,” IEEE Commun. Lett., vol. 23, no. 11, pp. 1994–1998, 2019.
- [19] E. Koyuncu and H. Jafarkhani, “Interleaving training and limited feedback for point-to-point massive multiple-antenna systems,” in Proc. IEEE Int. Symp. Inf. Theor., Hong Kong, China, Jun. 2015, pp. 1242–1246.
- [20] E. Koyuncu, X. Zou, and H. Jafarkhani, “Interleaving channel estimation and limited feedback for point-to-point systems with a large number of transmit antennas,” IEEE Trans. Wireless Commun., vol. 17, no. 10, pp. 6762–6774, 2018.
- [21] C. Zhang, Y. Jing, Y. Huang, and L. Yang, “Interleaved training and training-based transmission design for hybrid massive antenna downlink,” IEEE J. Sel. Topics Signal Process., vol. 12, no. 3, pp. 541–556, 2018.
- [22] W. He, C. Zhang, and Y. Huang, “Interleaved training codebook design for millimeter-wave communication system,” in Proc. IEEE/CIC Int. Conf. Commun., Beijing, China, Aug. 2018, pp. 6–10.
- [23] C. Zhang, Y. Jing, Y. Huang, and X. You, “Interleaved training for intelligent surface-assisted wireless communications,” IEEE Signal Process Lett., vol. 27, pp. 1774–1778, 2020.
- [24] Y. Jing, S. ShahbazPanahi, and X. Yu, “SINR-based interleaved training design for multi-user massive MIMO downlink with MRT,” in Proc. IEEE Int. Conf. Commun., Seoul, South Korea, May 2022, pp. 237–242.
- [25] Y. Jing, X. Yu, and S. ShahbazPanahi, “Interleaved training scheme for multi-user massive MIMO downlink with user SINR constraint,” IEEE Trans. Commun., 2023.
- [26] L. You, X. Gao, X.-G. Xia, N. Ma, and Y. Peng, “Pilot reuse for massive MIMO transmission over spatially correlated Rayleigh fading channels,” IEEE Trans. Wireless Commun., vol. 14, no. 6, pp. 3352–3366, 2015.
- [27] S. L. Loyka, “Channel capacity of MIMO architecture using the exponential correlation matrix,” IEEE Commun. Lett., vol. 5, no. 9, pp. 369–371, 2001.
- [28] A. Decurninge, M. Guillaud, and D. T. Slock, “Channel covariance estimation in massive MIMO frequency division duplex systems,” in Proc. IEEE Globecom Workshops, San Diego, CA, USA, Dec. 2015, pp. 1–6.
- [29] D. Neumann, M. Joham, and W. Utschick, “Covariance matrix estimation in massive MIMO,” IEEE Signal Process Lett., vol. 25, no. 6, pp. 863–867, 2018.
- [30] K. Li, Y. Li, L. Cheng, Q. Shi, and Z.-Q. Luo, “Downlink channel covariance matrix reconstruction for FDD massive MIMO systems with limited feedback,” arXiv preprint arXiv:2204.00779, 2022.
- [31] U. Grenander and G. Szegö, Toeplitz forms and their applications. California, U.S.A.: California Univ. Press, 1958.
- [32] E. Björnson, D. Hammarwall, and B. Ottersten, “Exploiting quantized channel norm feedback through conditional statistics in arbitrarily correlated MIMO systems,” IEEE Trans. Signal Process., vol. 57, no. 10, pp. 4027–4041, 2009.
- [33] R. K. Mallik, “The exponential correlation matrix: Eigen-analysis and applications,” IEEE Trans. Wireless Commun., vol. 17, no. 7, pp. 4690–4705, 2018.
- [34] S.-G. Hwang, “Cauchy’s interlace theorem for eigenvalues of Hermitian matrices,” Amer. Math. Monthly, vol. 111, no. 2, pp. 157–159, 2004.
- [35] B. Picinbono, “Second-order complex random vectors and normal distributions,” IEEE Trans. Signal Process., vol. 44, no. 10, pp. 2637–2640, 1996.
- [36] L. Schumacher, K. I. Pedersen, and P. E. Mogensen, “From antenna spacings to theoretical capacities-guidelines for simulating MIMO systems,” in Proc. IEEE Int. Symp. Person. Indoor Mobile Radio Commun., vol. 2, Lisboa, Portugal, Sep. 2002, pp. 587–592.
![]() |
Cheng Zhang (Member, IEEE) received the B.Eng. degree from Sichuan University, Chengdu, China, in June 2009, the M.Sc. degree from the Xi’an Electronic Engineering Research Institute (EERI), Xi’an, China, in May 2012, and the Ph.D. degree from Southeast University (SEU), Nanjing, China, in Dec. 2018. From Nov. 2016 to Nov. 2017, he was a Visiting Student with the University of Alberta, Edmonton, AB, Canada. From June 2012 to Aug. 2013, he was a Radar Signal Processing Engineer with Xi’an EERI. Since Dec. 2018, he has been with SEU, where he is currently an Associate Professor, and supported by the Zhishan Young Scholar Program of SEU. His current research interests include space-time signal processing and machine learning-aided optimization for B5G/6G wireless communications. He has authored or co-authored more than 50 IEEE journal papers and conference papers. He was the recipient of the excellent Doctoral Dissertation of the China Education Society of Electronics in Dec. 2019, that of Jiangsu Province in Dec. 2020, and the Best Paper Awards of 2023 IEEE WCNC and 2023 IEEE WCSP. |
![]() |
Chang Liu (Student Member, IEEE) received the B.Eng. degree in information engineering with the School of Information Science and Engineering, Southeast University, Nanjing, China, in 2021, where he is currently pursuing the M.Sc. degree as well. His research interests mainly focus on low overhead massive MIMO channel acquisition. |
![]() |
Yindi Jing (Senior Member, IEEE) received the B.Eng. and M.Eng. degrees in automatic control from the University of Science and Technology of China, Hefei, China, in 1996 and 1999, respectively. She received the M.Sc. degree and the Ph.D. in electrical engineering from California Institute of Technology, Pasadena, CA, in 2000 and 2004, respectively. From October 2004 to August 2005, she was a postdoctoral scholar at the Department of Electrical Engineering of California Institute of Technology. From February 2006 to June 2008, she was a postdoctoral scholar at the Department of Electrical Engineering and Computer Science of the University of California, Irvine. In 2008, she joined the Electrical and Computer Engineering Department of the University of Alberta, where she is currently a professor. She was an Associate Editor for IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS and a Senior Area Editor for IEEE SIGNAL PROCESSING LETTERS. She was a member of the IEEE Signal Processing Society Signal Processing for Communications and Networking (SPCOM) Technical Committee and a member of the NSERC Discovery Grant Evaluation Group for Electrical and Computer Engineering. Her research interests are in wireless communications and signal processing. |
![]() |
Minjie Ding (Student Member, IEEE) received the B.Eng. degree in information engineering from the School of Electronic Information and Engineering, Nanjing University of Aeronautics and Astronautics, Nanjing, China, in 2020, where she is currently pursuing the M.Sc. degree in information and communication engineering with the School of Information Science and Engineering, Southeast University. Her research interests mainly focus on low overhead massive MIMO channel acquisition. |
![]() |
Yongming Huang (M’10-SM’16) received the B.S. and M.S. degrees from Nanjing University, Nanjing, China, in 2000 and 2003, respectively, and the Ph.D. degree in electrical engineering from Southeast University, Nanjing, in 2007. Since March 2007 he has been a faculty in the School of Information Science and Engineering, Southeast University, China, where he is currently a full professor. He has also been the Director of the Pervasive Communication Research Center, Purple Mountain Laboratories, since 2019. From 2008 to 2009, he was visiting the Signal Processing Lab, Royal Institute of Technology (KTH), Stockholm, Sweden. He has published over 200 peer-reviewed papers, hold over 80 invention patents. His current research interests include intelligent 5G/6G mobile communications and millimeter wave wireless communications. He submitted around 20 technical contributions to IEEE standards, and was awarded a certificate of appreciation for outstanding contribution to the development of IEEE standard 802.11aj. He served as an Associate Editor for the IEEE Transactions on Signal Processing and a Guest Editor for the IEEE Journal on Selected Areas in Communications. He is currently an Editor-at-Large for the IEEE Open Journal of the Communications Society. |