This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Using Loaded N-port Structures
to Achieve the Continuous-Space
Electromagnetic Channel Capacity Bound

Zixiang Han,  Shanpu Shen,  Yujie Zhang,  Shiwen Tang,  Chi-Yuk Chiu, and Ross Murch This work was supported by the Hong Kong research councils Collaborative Research Fund (CRF) grant C6012-20G. This paper has been submitted to IEEE Transactions on Wireless Communications. (Corresponding author: Shanpu Shen.)Z. Han, S. Shen, Y. Zhang, S. Tang, and C. Y. Chiu are with the Department of Electronic and Computer Engineering, The Hong Kong University of Science and Technology, Clear Water Bay, Kowloon, Hong Kong (e-mail: sshenaa@connect.ust.hk).R. Murch is with the Department of Electronic and Computer Engineering and the Institute for Advanced Study, The Hong Kong University of Science and Technology, Clear Water Bay, Kowloon, Hong Kong.
Abstract

A method for achieving the continuous-space electromagnetic channel capacity bound using loaded NN-port structures is described. It is relevant for the design of compact multiple-input multiple-output (MIMO) antennas that can achieve channel capacity bounds when constrained by size. The method is not restricted to a specific antenna configuration and a closed-form expression for the channel capacity limits are provided with various constraints. Furthermore, using loaded NN-port structures to represent arbitrary antenna geometries, an efficient optimization approach is proposed for finding the optimum MIMO antenna design that achieves the channel capacity bounds. Simulation results of the channel capacity bounds achieved using our MIMO antenna design with one square wavelength size are provided. These show that at least 18 ports can be supported in one square wavelength and achieve the continuous-space electromagnetic channel capacity bound. The results demonstrate that our method can link continuous-space electromagnetic channel capacity bounds to MIMO antenna design.

Index Terms:
Beamspace, channel capacity, continuous space, electromagnetic field, information theory

I Introduction

Information theory has been widely applied to the analysis and design of wireless communication systems to approach theoretical capacity limits [1]-[3]. However, the physical realization of wireless communication systems is based on the implementation of antennas and radio-frequency (RF) circuits [4], [5]. Therefore, the joint study of information theory and electromagnetic field theory can be utilized to further extend the theoretical channel capacity limits of the entire wireless communication system. This has led to the development of electromagnetic information theory (EIT) [6], [7]. Utilizing EIT, the concept of aerial degrees-of-freedom (ADoF) has been introduced to estimate the number of orthogonal basis functions required for describing electromagnetic fields around antenna systems [8]. The number of ADoF can be used to determine the channel capacity because it refers to the maximum number of parallel sub-channels that can be used for transmission [9]. To estimate the channel capacity bound for excitations restricted to a given volume or area, continuous current sources limited to a region, exciting the electromagnetic channel, have been analyzed using continuous-space approaches [10]-[12]. The results can be used to provide achievable bounds on the channel capacity given a specific volume or area. However, they have not provided the corresponding antenna designs to achieve those bounds.

Multiple-input multiple-output (MIMO) antenna systems, which exploit ADoF by using multiple antennas at transceivers, play a critical role in approaching the predicted channel capacity bounds in wireless communication systems [13]. As a result, an enormous number of MIMO antenna designs have been proposed [14]-[16]. A particular emphasis of these designs has been to devise approaches that achieve as many antennas as possible within a certain volume or area [17] and one example has provided up to 22 antennas per square wavelength [18]. These designs have to tradeoff strong mutual coupling effects with performance and this has led to design limits on the antennas possible per unit volume or area [19]. As a result, the analysis of the effects of mutual coupling on channel capacity performance have also been well studied [20], [21]. However there has been no direct link that can connect the design of MIMO antennas with the continuous-space approaches utilized in EIT [7], [22].

In this paper we introduce a method to link the antenna geometry and continuous-space electromagnetic channel capacity together by using loaded NN-port structures [23]-[25]. It is an attempt to bridge the gap between the practical design of MIMO antennas and their design limits predicted by EIT. This method is not restricted to a specific antenna configuration and uses the beamspace representation of MIMO systems. In beamspace MIMO, orthogonal radiation patterns also form independent sub-channels in a similar manner to the spatial separation of antennas that form spatial sub-channels [26]. Therefore, field distributions in the far-field can be linked with the capacity analysis in beamspace MIMO [27], [28] where the closed-form expression for the channel capacity limits [29], [30] are provided with various constraints. Furthermore, by using loaded NN-port structures to represent arbitrary antenna geometries, an efficient optimization approach is proposed for finding the optimum MIMO antenna design that achieves the channel capacity bounds. Simulation results of the channel capacity bounds and achieved capacity using our proposed antenna design with one wavelength square size are provided. These show the designed antenna can achieve capacity performance close to the fundamental bounds, demonstrating the effectiveness of the proposed method.

Organization: Section II formulates the MIMO system model for the loaded NN-port structures. Section III provides the derivation for the resultant channel capacity with different constraints. Section IV introduces the link with antenna design and describes efficient optimization approaches to obtain the optimum antenna configuration. In Section V, we provide numerical results of channel capacity for a proposed MIMO antenna design to demonstrate the potential of the technique. Section VI concludes the work.

Notation: Bold lower and upper case letters denote vectors and matrices respectively. Upper case letters in calligraphy denote sets. Letters not in bold font represent scalars. E{a}\mathrm{E}\left\{a\right\} denotes the expectation of scalar aa. [𝐚]i\left[\mathbf{a}\right]_{i} and 𝐚\left\|\mathbf{a}\right\| refer to the iith entry and l2l_{2}-norm of vector 𝐚\mathrm{\mathbf{a}}, respectively. 𝐀T\mathrm{\mathbf{A}}^{T}, 𝐀H\mathrm{\mathbf{A}}^{H}, [𝐀]i,j\left[\mathrm{\mathbf{A}}\right]_{i,j} , |𝐀|\left|\mathbf{A}\right| and Tr(𝐀)\mathrm{Tr}\left(\mathrm{\mathbf{A}}\right) refer to the transpose, conjugate transpose, (i,j)\left(i,j\right)th entry, determinant, and trace of a matrix 𝐀\mathrm{\mathbf{A}}, respectively. \mathbb{C} denotes complex number sets and j=1j=\sqrt{-1} denotes an imaginary number. 𝒞𝒩(μ,σ2)\mathcal{CN}(\mu,\sigma^{2}) denotes complex Gaussian distribution with mean μ\mu and variance σ2\sigma^{2}. 𝐔N\mathbf{U}_{N} denotes an N×NN\times N identity matrix. diag(a1,,aN)\left(a_{1},...,a_{N}\right) is a diagonal matrix with diagonal entries being a1,,aNa_{1},...,a_{N}. 𝐚,𝐛=𝐚H𝐛\left\langle\mathrm{\mathbf{a}},\mathrm{\mathbf{b}}\right\rangle=\mathrm{\mathbf{a}}^{H}\mathrm{\mathbf{b}} refers to the inner product of two vectors 𝐚\mathrm{\mathbf{a}} and 𝐛\mathrm{\mathbf{b}}. It should also be noted that the dependence of all variables on frequency is assumed and it is not explicitly shown for brevity.

II System Model

In Fig. 1(a) an arbitrary transmit element is shown, confined to a volume VV, with current distribution 𝐣(𝐫)\mathbf{j}\left(\mathbf{r}\right), where 𝐫\mathbf{r} is the spatial coordinate. When matched with the proper receive element and channel, and with the current distribution 𝐣(𝐫)\mathbf{j}\left(\mathbf{r}\right) appropriately set, the resulting system can achieve the continuous-space electromagnetic channel capacity bound [11]. Our objective in this work is to design the optimum MIMO antenna that can best approximate the required current distribution 𝐣(𝐫)\mathbf{j}\left(\mathbf{r}\right) in VV to approach the continuous-space electromagnetic channel capacity bound. The challenge is to find structures that can create the necessary current distribution 𝐣(𝐫)\mathbf{j}\left(\mathbf{r}\right) with the discrete feeding ports required in MIMO antenna design.

Loaded structures are one approach that allow the formation of any arbitrary current distribution on a surface [23] and to connect this with the discrete feeds required in MIMO antenna design, we can use NN-port loaded structures [24] as also shown in Fig. 1(b). In this figure, the NN discrete ports are denoted by black dots [24], [25]. That is the volume VV is discretized into ports each with currents i(𝐫1),i(𝐫2),,i(𝐫N)i\left(\mathbf{r}_{1}\right),i\left(\mathbf{r}_{2}\right),...,i\left(\mathbf{r}_{N}\right) [25]. By setting the separations between ports in the structure to be significantly less than a wavelength, it can approximate the required current distribution 𝐣(𝐫)\mathbf{j}\left(\mathbf{r}\right) by utilizing appropriate current excitations across the NN ports. The advantage of this approach is that the feeds for the final potential MIMO antenna are already incorporated at the beginning of the analysis through the NN-port loaded structure. In the final MIMO antenna design, only some of the ports will be utilized as feeds with the others loaded with reactances to form the required current distribution that achieves the channel capacity bound. This allows us to form antennas with the desired number of feeding ports while achieving performance close to the channel capacity bounds.

Refer to caption
Refer to caption
Figure 1: Illustration of a current distribution 𝐣(𝐫)\mathbf{j}\left(\mathbf{r}\right) in a confined volume VV represented by (a) continuous space and (b) approximated by a loaded NN-port discretized structure.

For analysis we write 𝐢=[i(𝐫1),i(𝐫2),,i(𝐫N)]TN×1\boldsymbol{\mathrm{i}}=\left[i\left(\mathbf{r}_{1}\right),i\left(\mathbf{r}_{2}\right),...,i\left(\mathbf{r}_{N}\right)\right]{}^{T}\in\mathbb{C}^{N\times 1} as the current at each of the ports in Fig. 1(b). Correspondingly the radiation patterns arising from each of the port currents are written as 𝐄T=[𝐞T,1,𝐞T,2,,𝐞T,N]K×N\mathbf{E}_{T}=\left[\mathbf{e}_{T,1},\mathbf{e}_{T,2},...,\mathbf{e}_{T,N}\right]\in\mathbb{C}^{K\times N} where 𝐞T,nK×1\mathbf{e}_{T,n}\in\mathbb{C}^{K\times 1} is the radiation pattern of the nnth port excited by a unit current with all the other ports open-circuited, over KK uniformly sampled three-dimensional (3D) spatial angles. We use these patterns and the results in [31] to decompose the radiation patterns into a natural set of orthonormal modal functions similarly to the Theory of Characteristic Modes (TCM) [24], [32]. These orthonormal modal functions can then be used for analysis of the channel capacity and interpreted as being related to the ADoF of the structure.

For an arbitrary excitation current 𝐢\boldsymbol{\mathrm{i}}, the resulting radiation pattern can be expressed as 𝐞=𝐄T𝐢\mathbf{e}=\mathbf{E}_{T}\boldsymbol{\mathrm{i}}, so that the radiated power is given by

1η𝐞,𝐞=1η𝐄T𝐢,𝐄T𝐢=1η𝐢H𝐄TH𝐄T𝐢=𝐢H𝐊T𝐢,\frac{1}{\text{$\eta$}}\left\langle\mathbf{e},\mathbf{e}\right\rangle=\frac{1}{\text{$\eta$}}\left\langle\mathbf{E}_{T}\boldsymbol{\mathrm{i}},\mathbf{E}_{T}\boldsymbol{\mathrm{i}}\right\rangle=\frac{1}{\text{$\eta$}}\boldsymbol{\mathrm{i}}^{H}\mathbf{E}_{T}^{H}\mathbf{E}_{T}\boldsymbol{\mathrm{i}}=\boldsymbol{\mathrm{i}}^{H}\mathbf{K}_{T}\boldsymbol{\mathrm{i}}, (1)

where η\eta is the free space impedance, 𝐊T\mathbf{K}_{T} is the correlation of the steering matrix at the transmitter side [31], which is defined as

𝐊T=1η𝐄TH𝐄T.\mathbf{K}_{T}=\frac{1}{\text{$\eta$}}\mathbf{E}_{T}^{H}\mathbf{E}_{T}. (2)

We can perform eigenvalue decomposition (EVD) on 𝐊T\mathbf{K}_{T} as

𝐊T=𝐐𝚲𝐐H,\mathbf{K}_{T}=\mathbf{Q}\mathbf{\Lambda}\mathbf{Q}^{H}, (3)

where Λ=diag(λ1,λ2,,λN)\Lambda=\mathrm{diag}\left(\lambda_{1},\lambda_{2},...,\lambda_{N}\right) is a real diagonal eigenvalue matrix. It is assumed that there is no inner resonance in the transmit element [24] so that the loaded NN-port structure can provide NN orthogonal basis. Therefore, 𝐊T\mathbf{K}_{T} is a positive definite matrix with all eigenvalues being positive λ1λ2λN>0\lambda_{1}\geqslant\lambda_{2}\geqslant...\geqslant\lambda_{N}>0. In practice some eigenvalues might be very small and this will affect ADoF as discussed later in the paper. 𝐐=[𝐪1,𝐪2,,𝐪N]\mathbf{Q}=\left[\mathbf{q}_{1},\mathbf{q}_{2},...,\mathbf{q}_{N}\right] is a unitary matrix whose column vectors 𝐪i\mathbf{q}_{i}, i=1,2,,Ni=1,2,...,N, are eigenvectors of 𝐊T\mathbf{K}_{T} and form a set of current basis. By selecting current 𝐢\boldsymbol{\mathrm{i}} to be 𝐪i\mathbf{q}_{i} and collecting the corresponding radiation patterns 𝐟T,i=𝐄T𝐪i\mathbf{f}_{T,i}=\mathbf{E}_{T}\mathbf{q}_{i}, i=1,2,,Ni=1,2,...,N, a set of orthogonal radiation patterns can be formed since the inner product of two radiation patterns excited by 𝐪i\mathbf{q}_{i} and 𝐪j\mathbf{q}_{j} is given by

1η𝐟T,i,𝐟T,j\displaystyle\frac{1}{\text{$\eta$}}\left\langle\mathbf{f}_{T,i},\mathbf{f}_{T,j}\right\rangle =1η𝐄T𝐪i,𝐄T𝐪j=1η𝐪iH𝐄TH𝐄T𝐪j\displaystyle=\frac{1}{\text{$\eta$}}\left\langle\mathbf{E}_{T}\mathbf{q}_{i},\mathbf{E}_{T}\mathbf{q}_{j}\right\rangle=\frac{1}{\text{$\eta$}}\mathbf{q}_{i}^{H}\mathbf{E}_{T}^{H}\mathbf{E}_{T}\mathbf{q}_{j}
=𝐪iH𝐐𝚲𝐐H𝐪j=δijλi,\displaystyle=\mathbf{q}_{i}^{H}\mathbf{Q}\mathbf{\Lambda}\mathbf{Q}^{H}\mathbf{q}_{j}=\delta_{ij}\lambda_{i}, (4)

where δij\delta_{ij} is the Kronecker delta function (0 if iji\neq j and 1 if i=ji=j). That is the radiation patterns excited by different column vectors in 𝐐\mathbf{Q} are orthogonal. The λi\lambda_{i}, i=1,2,,Ni=1,2,...,N, can be regarded as the radiation resistance of the iith radiation pattern since when excited by 𝐪i\mathbf{q}_{i} (whose l2l_{2}-norm is unity), the radiated power of 𝐟T,i\mathbf{f}_{T,i} is λi\lambda_{i}.

The orthonormal basis set with unity radiated power can be written as

𝐁T=𝐄T𝐐𝚲12\mathbf{B}_{T}=\mathbf{E}_{T}\mathbf{Q}\mathbf{\Lambda}^{-\frac{1}{2}} (5)

so that we have 1η𝐁TH𝐁T=𝐔N\frac{1}{\text{$\eta$}}\mathbf{B}_{T}^{H}\mathbf{B}_{T}=\mathbf{U}_{N} with 𝐁T=[𝐛T,1,𝐛T,2,,𝐛T,N]K×N\mathbf{B}_{T}=\left[\mathbf{b}_{T,1},\mathbf{b}_{T,2},...,\mathbf{b}_{T,N}\right]\in\mathbb{C}^{K\times N}. The diagonal entries in 𝚲12\mathbf{\Lambda}^{-\frac{1}{2}} are scaling factors to form the orthonormal basis. Therefore, for those basis with small radiation resistances, large currents are required to radiate patterns with unity power.

By replicating the approach for the receive volume, we respectively denote the steering matrices of the MM receive antennas as 𝐄R=[𝐞R,1,𝐞R,2,,𝐞R,M]K×M\mathbf{E}_{R}=\left[\mathbf{e}_{R,1},\mathbf{e}_{R,2},\ldots,\mathbf{e}_{R,M}\right]\in\mathbb{C}^{K\times M} where 𝐞R,mK×1\mathbf{e}_{R,m}\in\mathbb{C}^{K\times 1}, m=1,2,,Mm=1,2,...,M.

The final step is to connect the transmitter and receiver using an appropriate channel model. The beamspace domain or virtual channel representation [26], [33] is a well established approach that decomposes the channel into orthogonal beams. This model fits naturally with the modal decompositions we have formed at the transmitter and receiver. Using the beamspace approach, the equivalent channel matrix in the angular domain can be expressed as

𝐇=1η𝐄RH𝐇v𝐄T\mathbf{H}=\frac{1}{\text{$\eta$}}\mathbf{E}_{R}^{H}\mathbf{H}_{v}\mathbf{E}_{T} (6)

where 𝐇vK×K\mathbf{H}_{v}\in\mathbb{C}^{K\times K} is the virtual channel whose entries refer to the channel gain from each angle of departure (AoD) to each angle of arrival (AoA) [33]. By taking (6) we can write the overall system model as

𝐲\displaystyle\mathbf{y} =1η𝐄RH𝐇v𝐄T𝐢+𝐧,\displaystyle=\frac{1}{\text{$\eta$}}\mathbf{E}_{R}^{H}\mathbf{H}_{v}\mathbf{E}_{T}\boldsymbol{\mathrm{i}}+\mathbf{n}, (7)

where 𝐧M×1\mathbf{n}\in\mathbb{C}^{M\times 1} is the additive Gaussian noise and satisfies the complex Gaussian distribution 𝒞𝒩(0,σ2𝐔M)\mathcal{CN}\left(0,\sigma^{2}\mathbf{U}_{M}\right) with noise power σ2\sigma^{2} and 𝐲M×1\mathbf{y}\in\mathbb{C}^{M\times 1} is the received signal.

III Channel Capacity Formulation

Using (6) and (7), channel capacity of the system in the beamspace domain can be written as

C\displaystyle C =log2|𝐔M+𝐇𝐑𝐢𝐇Hσ2|\displaystyle=\mathrm{log_{2}}\left|\mathbf{U}_{M}+\frac{\mathbf{H}\mathbf{R}_{\mathbf{i}}\mathbf{H}^{H}}{\sigma^{2}}\right| (8)

where 𝐑𝐢=E{𝐢𝐢H}\mathbf{R}_{\mathbf{i}}=\mathrm{E}\left\{\mathbf{i}\mathbf{i}^{H}\right\} is the covariance matrix of current.

We assume the MM receive antennas (NMN\leqslant M for tractability) are ideally isolated so that the steering matrix 𝐄R\mathbf{E}_{R} is an exactly orthonormal basis set of receive antennas satisfying 1η𝐄RH𝐄R=𝐔M\frac{1}{\text{$\eta$}}\mathbf{E}_{R}^{H}\mathbf{E}_{R}=\mathbf{U}_{M}. Using (5), we can rewrite the channel matrix (6) as

𝐇=1η𝐄RH𝐇v𝐁TΛ12𝐐H=𝐇iidΛ12𝐐H,\mathbf{H}=\frac{1}{\text{$\eta$}}\mathbf{E}_{R}^{H}\mathbf{H}_{v}\mathbf{B}_{T}\Lambda^{\frac{1}{2}}\mathbf{Q}^{H}=\mathbf{H}_{\mathrm{iid}}\Lambda^{\frac{1}{2}}\mathbf{Q}^{H}, (9)

where 𝐇iid=1η𝐄RH𝐇v𝐁TM×N\mathbf{H}_{\mathrm{iid}}=\frac{1}{\text{$\eta$}}\mathbf{E}_{R}^{H}\mathbf{H}_{v}\mathbf{B}_{T}\in\mathbb{C}^{M\times N} with each entry following i.i.d. distribution due to the orthonormality among radiation patterns in 𝐄R\mathbf{E}_{R} and 𝐁T\mathbf{B}_{T}. The capacity (8) is then rewritten using (9) as

C\displaystyle C =log2|𝐔M+𝐇iidΛ12𝐐H𝐑𝐢𝐐Λ12𝐇iidHσ2|.\displaystyle=\mathrm{log_{2}}\left|\mathbf{U}_{M}+\frac{\mathbf{H}_{\mathrm{iid}}\Lambda^{\frac{1}{2}}\mathbf{Q}^{H}\mathbf{R}_{\mathbf{i}}\mathbf{Q}\Lambda^{\frac{1}{2}}\mathbf{H}_{\mathrm{iid}}^{H}}{\sigma^{2}}\right|. (10)

Channel capacity is constrained by physical limits consisting of 1) the radiated power constraint Tr(1η𝐄T𝐑𝐢𝐄TH)=Tr(𝐑𝐢𝐊T)Prad\mathrm{Tr}\left(\frac{1}{\text{$\eta$}}\mathbf{E}_{T}\mathbf{R}_{\mathbf{i}}\mathbf{E}_{T}^{H}\right)=\mathrm{Tr}\left(\mathbf{R}_{\mathbf{i}}\mathbf{K}_{T}\right)\leqslant P_{\mathrm{rad}} with PradP_{\mathrm{rad}} being the upper bound of radiated power and 2) the maximum currents that can exist at the input to the ports of the reactively loaded structure, Tr(𝐑𝐢)Iin2\mathrm{Tr}\left(\mathbf{R}_{\mathbf{i}}\right)\leqslant I_{\mathrm{in}}^{2} with IinI_{\mathrm{in}} being the upper bound of the current norm. In practice, both constraints need to be applied to obtain an implementable antenna. A constraint on the input power (it would be the same as PradP_{\mathrm{rad}} if the antenna was lossless) cannot be formulated since an expression for the input impedances are not known as we do not yet know the final feed arrangement.

In the following subsections, the formulation of capacity with two constraints are derived, which is general and can be applied to arbitrary NN-port antenna systems.

III-A Capacity with Radiated Power Constraint

We firstly constrain the radiated power for the derivation of channel capacity. This optimization problem can be formulated as

max𝐑𝐢\displaystyle\underset{\mathbf{R}_{\mathbf{i}}}{\mathrm{max}} log2|𝐔M+𝐇iidΛ12𝐐H𝐑𝐢𝐐Λ12𝐇iidHσ2|\displaystyle\quad\mathrm{log_{2}}\left|\mathbf{U}_{M}+\frac{\mathbf{H}_{\mathrm{iid}}\Lambda^{\frac{1}{2}}\mathbf{Q}^{H}\mathbf{R}_{\mathbf{i}}\mathbf{Q}\Lambda^{\frac{1}{2}}\mathbf{H}_{\mathrm{iid}}^{H}}{\sigma^{2}}\right| (11)
s.t.\displaystyle\mathrm{s.t.} Tr(𝐑𝐢𝐊T)Prad.\displaystyle\quad\mathrm{Tr}\left(\mathbf{R}_{\mathbf{i}}\mathbf{K}_{T}\right)\leqslant P_{\mathrm{rad}}. (12)

By decomposing arbitrary radiation pattern 𝐞\mathbf{e} onto the set of orthonormal radiation pattern basis in 𝐁T\mathbf{B}_{T}, we have

𝐞=𝐄T𝐢=𝐁TΛ12𝐐H𝐢=𝐁T𝜷\mathbf{e}=\mathbf{E}_{T}\boldsymbol{\mathrm{i}}=\mathbf{B}_{T}\Lambda^{\frac{1}{2}}\mathbf{Q}^{H}\mathbf{i}=\mathbf{B}_{T}\boldsymbol{{\beta}} (13)

with

𝜷=Λ12𝐐H𝐢\boldsymbol{{\beta}}=\Lambda^{\frac{1}{2}}\mathbf{Q}^{H}\mathbf{i} (14)

so that each entry in 𝜷\boldsymbol{{\beta}} refers to the magnitude allocated to each orthonormal pattern basis. The radiated power constraint (12) can be transformed to

Tr(𝐑𝐢𝐊T)\displaystyle\mathrm{Tr}\left(\mathbf{R}_{\mathbf{i}}\mathbf{K}_{T}\right) =Tr(Λ12𝐐H𝐑𝐢𝐐Λ12)\displaystyle=\mathrm{Tr}\left(\Lambda^{\frac{1}{2}}\mathbf{Q}^{H}\mathbf{R}_{\mathbf{i}}\mathbf{Q}\Lambda^{\frac{1}{2}}\right)
=Tr(E{𝜷𝜷H})=Tr(𝐑β)Prad,\displaystyle=\mathrm{Tr}\left(\mathrm{E}\left\{\boldsymbol{{\beta}}\boldsymbol{{\beta}}^{H}\right\}\right)=\mathrm{Tr}\left(\mathbf{R}_{\beta}\right)\leqslant P_{\mathrm{rad}}, (15)

where 𝐑β=E{𝜷𝜷H}\mathbf{R}_{\beta}=\mathrm{E}\left\{\boldsymbol{{\beta}}\boldsymbol{{\beta}}^{H}\right\} is the covariance matrix of orthonormal pattern basis magnitude. Accordingly, the optimization problem (11) and (12) is re-formulated as

max𝐑β\displaystyle\underset{\mathbf{R}_{\beta}}{\mathrm{max}} log2|𝐔M+𝐇iid𝐑β𝐇iidHσ2|\displaystyle\quad\mathrm{log_{2}}\left|\mathbf{U}_{M}+\frac{\mathbf{H}_{\mathrm{iid}}\mathbf{R}_{\beta}\mathbf{H}_{\mathrm{iid}}^{H}}{\sigma^{2}}\right| (16)
s.t.\displaystyle\mathrm{s.t.} Tr(𝐑β)Prad.\displaystyle\quad\mathrm{Tr}\left(\mathbf{R}_{\beta}\right)\leqslant P_{\mathrm{rad}}. (17)

It can be noticed that the problem (16) and (17) is the same as the capacity maximization problem of ideal M×NM\times N MIMO. However, due to some extremely small eigenvalues in Λ\Lambda, the norm of current 𝐢\mathbf{i} could be extremely large and not implementable. In other words, the maximum current is unconstrained.

III-B Capacity with Current Constraint

Next, we consider imposing a constraint on the current norm Tr(𝐑𝐢)\mathrm{Tr}\left(\mathbf{R}_{\mathbf{i}}\right). The optimization problem of channel capacity can then be formulated as

max𝐑𝐢\displaystyle\underset{\mathbf{R}_{\mathbf{i}}}{\mathrm{max}} log2|𝐔M+𝐇iidΛ12𝐐H𝐑𝐢𝐐Λ12𝐇iidHσ2|\displaystyle\quad\mathrm{log_{2}}\left|\mathbf{U}_{M}+\frac{\mathbf{H}_{\mathrm{iid}}\Lambda^{\frac{1}{2}}\mathbf{Q}^{H}\mathbf{R}_{\mathbf{i}}\mathbf{Q}\Lambda^{\frac{1}{2}}\mathbf{H}_{\mathrm{iid}}^{H}}{\sigma^{2}}\right| (18)
s.t.\displaystyle\mathrm{s.t.} Tr(𝐑𝐢)Iin2.\displaystyle\quad\mathrm{Tr}\left(\mathbf{R}_{\mathbf{i}}\right)\leqslant I_{\mathrm{in}}^{2}. (19)

By decomposing the current 𝐢\mathbf{i} onto the set of current basis 𝐪i\mathbf{q}_{i}, i=1,2,,Ni=1,2,...,N, we can obtain the coefficient for the decomposition as γi=𝐪i,𝐢=𝐪iH𝐢\gamma_{i}=\left\langle\mathbf{q}_{i},\mathbf{i}\right\rangle=\mathbf{q}_{i}^{H}\mathbf{i} which refers to the magnitude allocated to the iith current basis. We collect the coefficient γi\gamma_{i} into a vector 𝜸=[γ1,γ2,,γN]T\boldsymbol{{\gamma}}=\left[\gamma_{1},\gamma_{2},...,\gamma_{N}\right]^{T} which can be related to 𝐢\mathbf{i} by

𝜸=𝐐H𝐢.\boldsymbol{{\gamma}}=\mathbf{Q}^{H}\mathbf{i}. (20)

𝜸\boldsymbol{{\gamma}} has the same norm as 𝐢\mathbf{i} so that the current constraint (19) can be transformed to

Tr(𝐑𝐢)\displaystyle\mathrm{Tr}\left(\mathbf{R}_{\mathbf{i}}\right) =Tr(𝐐H𝐑𝐢𝐐)\displaystyle=\mathrm{Tr}\left(\mathbf{Q}^{H}\mathbf{R}_{\mathbf{i}}\mathbf{Q}\right)
=Tr(E{𝜸𝜸H})=Tr(𝐑γ)Iin2,\displaystyle=\mathrm{Tr}\left(\mathrm{E}\left\{\boldsymbol{{\gamma}}\boldsymbol{{\gamma}}^{H}\right\}\right)=\mathrm{Tr}\left(\mathbf{R}_{\gamma}\right)\leqslant I_{\mathrm{in}}^{2}\text{,} (21)

where 𝐑γ=E{𝜸𝜸H}\mathbf{R}_{\gamma}=\mathrm{E}\left\{\boldsymbol{{\gamma}}\boldsymbol{{\gamma}}^{H}\right\} is the covariance matrix of current basis magnitude. Then the problem (18) and (19) can be transformed to

max𝐑γ\displaystyle\underset{\mathbf{R}_{\gamma}}{\mathrm{max}} log2|𝐔M+𝐇iidΛ12𝐑γΛ12𝐇iidHσ2|\displaystyle\quad\mathrm{log_{2}}\left|\mathbf{U}_{M}+\frac{\mathbf{H}_{\mathrm{iid}}\Lambda^{\frac{1}{2}}\mathbf{R}_{\gamma}\Lambda^{\frac{1}{2}}\mathbf{H}_{\mathrm{iid}}^{H}}{\sigma^{2}}\right| (22)
s.t.\displaystyle\mathrm{s.t.} Tr(𝐑γ)Iin2.\displaystyle\quad\mathrm{Tr}\left(\mathbf{R}_{\gamma}\right)\leqslant I_{\mathrm{in}}^{2}. (23)

The optimal solution for the currents can be found by performing equal power (EP) or water-filling (WF) allocation directly on 𝐑γ\mathbf{R}_{\gamma}.

It can be observed that due to the existence of Λ12\Lambda^{\frac{1}{2}} in (22), the radiated power of each orthogonal basis in 𝐁T\mathbf{B}_{T} is not equal and proportional to the corresponding eigenvalues in Λ\Lambda. Therefore, in this formulation, the WF method tends to allocate more power to those basis with larger eigenvalues since it increases radiated power and resulting system capacity. In other words, the radiated power is unconstrained.

III-C Capacity with Dual Constraint

Finally, we consider dual constraints imposed by the current norm and radiated power simultaneously. That is, in practical setups the radiated power is limited and the norm of current 𝐢\mathbf{i} must also be restricted. This optimization problem is formulated as

max𝐑𝐢\displaystyle\underset{\mathbf{R}_{\mathbf{i}}}{\mathrm{max}} log2|𝐔M+𝐇iidΛ12𝐐H𝐑𝐢𝐐Λ12𝐇iidHσ2|\displaystyle\quad\mathrm{log_{2}}\left|\mathbf{U}_{M}+\frac{\mathbf{H}_{\mathrm{iid}}\Lambda^{\frac{1}{2}}\mathbf{Q}^{H}\mathbf{R}_{\mathbf{i}}\mathbf{Q}\Lambda^{\frac{1}{2}}\mathbf{H}_{\mathrm{iid}}^{H}}{\sigma^{2}}\right| (24)
s.t.\displaystyle\mathrm{s.t.} Tr(𝐑𝐢𝐊T)Prad,\displaystyle\quad\mathrm{Tr}\left(\mathbf{R}_{\mathbf{i}}\mathbf{K}_{T}\right)\leqslant P_{\mathrm{rad}}, (25)
Tr(𝐑𝐢)Iin2.\displaystyle\quad\mathrm{Tr}\left(\mathbf{R}_{\mathbf{i}}\right)\leqslant I_{\mathrm{in}}^{2}. (26)

We follow the transformation in (14) as well as the formulation in (16) and (17), so the problem (24) to (26) can be transformed to

max𝐑β\displaystyle\underset{\mathbf{R}_{\beta}}{\mathrm{max}} log2|𝐔M+𝐇iid𝐑β𝐇iidHσ2|\displaystyle\quad\mathrm{log_{2}}\left|\mathbf{U}_{M}+\frac{\mathbf{H}_{\mathrm{iid}}\mathbf{R}_{\beta}\mathbf{H}_{\mathrm{iid}}^{H}}{\sigma^{2}}\right| (27)
s.t.\displaystyle\mathrm{s.t.} Tr(𝐑β𝚲1)Iin2,\displaystyle\quad\mathrm{Tr}\left(\mathbf{R}_{\beta}\mathbf{\Lambda}^{-1}\right)\leqslant I_{\mathrm{in}}^{2}, (28)
Tr(𝐑β)Prad.\displaystyle\quad\mathrm{Tr}\left(\mathbf{R}_{\beta}\right)\leqslant P_{\mathrm{rad}}. (29)

It can be observed that the capacity formulation is only related to Λ\Lambda, which is a diagonal matrix with diagonal entries being the eigenvalues of 𝐊T\mathbf{K}_{T}.

To solve the problem (27) to (29), we firstly consider EP allocation method. Although the system provides NN orthogonal radiation pattern basis, some eigenvalues in Λ\Lambda are too small so that large currents are required to radiate their corresponding basis (5) which cannot be used for practical MIMO antenna design. To avoid generating large currents, i.e. satisfying the current constraint (29), PradP_{\mathrm{rad}} should be allocated to those basis with largest radiation resistance (i.e. larger eigenvalues in Λ\Lambda). Therefore, we equally allocate power PradP_{\mathrm{rad}} to the first NeffN_{\mathrm{eff}} (NeffN)\left(N_{\mathrm{eff}}\leqslant N\right) basis and drop the remaining NNeffN-N_{\mathrm{eff}} basis. These NeffN_{\mathrm{eff}} basis can be effectively used with maximum current norm Iin2I_{\mathrm{in}}^{2} and radiated power PradP_{\mathrm{rad}}, and thus can be regarded as the effective aerial degrees-of-freedom (EADoF) of the transmitter [28].

The diagonal entries in 𝐑β\mathbf{R}_{\beta} are then given by

[𝐑β]i,i={PradNeff,i=1,2,,Neff,0,otherwise.\left[\mathbf{R}_{\beta}\right]_{i,i}=\begin{cases}\frac{P_{\mathrm{rad}}}{N_{\mathrm{eff}}},&i=1,2,...,N_{\mathrm{eff}},\\ 0,&\mathrm{otherwise.}\end{cases} (30)

The constraint (29) can then be written as

Tr(𝐑βΛ1)=i[=1]NeffPradNeffλi1Iin2.\mathrm{Tr}\left(\mathbf{R}_{\beta}\Lambda^{-1}\right)=\stackrel{{\scriptstyle[}}{{i}}=1]{N_{\mathrm{eff}}}{\sum}\frac{P_{\mathrm{rad}}}{N_{\mathrm{eff}}}\mathbf{\lambda}_{i}^{-1}\leqslant I_{\mathrm{in}}^{2}. (31)

We define ϵ=Iin2Prad\epsilon=\frac{I_{\mathrm{in}}^{2}}{P_{\mathrm{rad}}} in (31) and it can be interpreted loosely as a conductance which has the unit of Ω1\Omega^{-1}. In essence for a given PradP_{\mathrm{rad}}, a larger IinI_{\mathrm{in}} (ϵ\epsilon high) indicates that the antenna will have a smaller input resistance overall and if it is too small the antenna will not be implementable. Alternatively if IinI_{\mathrm{in}} is too low (ϵ\epsilon low), the input impedance will be required to be too high and also not implementable. Therefore we should set it to be in a range centered around 1/50 Ω1\Omega^{-1} so that it is in line with the antennas desired input impedance.

Then the EADoF NeffN_{\mathrm{eff}} is related to ϵ\epsilon for a fixed SNR as

i[=1]Neffλi1Neffϵ\stackrel{{\scriptstyle[}}{{i}}=1]{N_{\mathrm{eff}}}{\sum}\frac{\mathbf{\lambda}_{i}^{-1}}{N_{\mathrm{eff}}}\leqslant\epsilon (32)

where the left term refers to the average of the reciprocal of eigenvalues for the first NeffN_{\mathrm{eff}} basis. It can be observed from (32) that a larger ϵ\epsilon indicates that the number of available orthogonal basis, i.e. NeffN_{\mathrm{eff}}, and the consequent capacity bound can be larger. The maximum NeffN_{\mathrm{eff}} can be obtained by solving (32) for a fixed ϵ\epsilon. That is EADoF NeffN_{\mathrm{eff}} is dependent on ϵ\epsilon (the maximum allowable current norm given PradP_{\mathrm{rad}}), indicating the number of basis that can be used with the constraint of current norm and radiation power.

Next we consider using the WF method to maximize capacity in problem (27) to (29). We start with the channel gain of NN sub-channels in channel matrix 𝐇iid\mathbf{H}_{\mathrm{iid}}. By performing EVD on 𝐇iid𝐇iidH\mathbf{H}_{\mathrm{iid}}\mathbf{H}_{\mathrm{iid}}^{H}, we obtain the channel gain matrix 𝐒=diag(s1,s2,,sN)\mathbf{S}=\mathrm{diag}\left(s_{1},s_{2},...,s_{N}\right) which is a real diagonal matrix with sis_{i}, i=1,2,,Ni=1,2,...,N, being the channel gain of the iith sub-channel. Then we use the Lagrangian method whose function is given by

L=\displaystyle L= log2|𝐔N+𝐒σ2𝐑β|+μradTr(𝐑βPradN𝐔N)\displaystyle-\mathrm{log_{2}}\left|\mathbf{U}_{N}+\frac{\mathbf{S}}{\sigma^{2}}\mathbf{R}_{\beta}\right|+\mu_{\mathrm{rad}}\mathrm{Tr}\left(\mathbf{R}_{\beta}-\frac{P_{\mathrm{rad}}}{N}\mathbf{U}_{N}\right)
+μinTr(𝐑βΛ1Iin2N𝐔N),\displaystyle+\mu_{\mathrm{in}}\mathrm{Tr}\left(\mathbf{R}_{\beta}\Lambda^{-1}-\frac{I_{\mathrm{in}}^{2}}{N}\mathbf{U}_{N}\right), (33)

where μrad\mu_{\mathrm{rad}} and μin\mu_{\mathrm{in}} are Lagrangian multipliers. Taking the partial derivative of LL with respect to 𝐑β\mathbf{R}_{\beta}, we have

log2e𝐒(σ2𝐔N+𝐒𝐑β)1\displaystyle-\mathrm{log_{2}}e\cdot\mathbf{S}\left(\sigma^{2}\mathbf{U}_{N}+\mathbf{S}\mathbf{R}_{\beta}\right)^{-1} +μrad𝐔N+μinΛ1=𝟎.\displaystyle+\mu_{\mathrm{rad}}\mathbf{U}_{N}+\mu_{\mathrm{in}}\Lambda^{-1}=\mathbf{0}. (34)

Combining the solution of 𝐑β\mathbf{R}_{\beta} in (34), two constraints (28), (29) can be written as [21]

Tr(log2e(μrad𝐔N+μinΛ1)1σ2𝐒1)=Prad,\mathrm{Tr}\left(\mathrm{log_{2}}e\left(\mu_{\mathrm{rad}}\mathbf{U}_{N}+\mu_{\mathrm{in}}\Lambda^{-1}\right)^{-1}-\sigma^{2}\mathbf{S}^{-1}\right)=P_{\mathrm{rad}}, (35)
Tr(log2e(μradΛ+μin𝐔N)1σ2(Λ𝐒)1)\displaystyle\mathrm{Tr}\left(\mathrm{log_{2}}e\left(\mu_{\mathrm{rad}}\Lambda+\mu_{\mathrm{in}}\mathbf{U}_{N}\right)^{-1}-\sigma^{2}\left(\Lambda\mathbf{S}\right)^{-1}\right) =Iin2.\displaystyle=I_{\mathrm{in}}^{2}. (36)

The optimal multipliers μrad\mu_{\mathrm{rad}}^{\star} and μin\mu_{\mathrm{in}}^{\star} in (36) and (35) can be found via binary search [21] and the optimal 𝐑β\mathbf{R}_{\beta} can be solved in (34).

It should be noted that the above closed-form expressions for system capacity are formulated without considering the physical realization and practical excitation for MIMO antennas. Therefore, the MIMO antenna structure should be carefully designed and optimized to achieve the capacity bounds with dual constraints, which will be described in the next section.

IV Loaded NN-port Structure and Analysis

In the previous section, we have provided an optimization formulation to obtain the required port currents on the loaded NN-port structure for capacity maximization with various constraints. In this section we link those currents to MIMO antenna design with only Q<<NQ<<N feeding ports in the loaded NN-port structures.

IV-A Network Analysis

To begin to construct the MIMO antenna, we divide the NN-ports in Fig. 1(b) into QQ active feeding ports and NQN-Q loaded ports. The advantage of this approach is that the feeds for the final potential QQ-port MIMO antenna are incorporated at the beginning of the analysis through the NN-port structure. The equivalent circuit model for the NN-port network with QQ active feeding ports (numbered 1 to QQ) and NQN-Q parasitic loaded ports (numbered Q+1Q+1 to NN) is shown in Fig. 2. The loaded ports are either lossless with inductive and capacitive reactance or open. The active feeding ports are connected to a matching network to achieve the necessary 50 Ω\Omega input impedance. By optimizing the NQN-Q load reactances, we wish to generate QQ orthogonal radiation patterns from the designated QQ feeding ports so that the capacity (8) can be maximized. In essence we will attempt to find loads that create the required current distributions for approaching the capacity bounds. The design procedure for selecting which ports are fed and which are loaded to achieve the optimum channel capacity using (10) is described later.

Refer to caption
Figure 2: Equivalent circuit model of the NN-port network with QQ feeding ports and NQN-Q parasitic ports.

In Fig. 2, the qqth feeding port is excited by a voltage source vqv_{q}, q=1,2,,Qq=1,2,...,Q, and the (Q+p)\left(Q+p\right)th parasitic port is loaded with reactance xQ+pLx_{Q+p}^{L}, p=1,2,,NQp=1,2,...,N-Q. We group the voltage source excitation vqv_{q}, q=1,2,,Qq=1,2,...,Q, into a vector as 𝐯A=[v1,v2,,vQ]TQ×1\mathbf{v}_{\mathrm{A}}=\left[v_{1},v_{2},...,v_{Q}\right]{}^{T}\in\mathbb{C}^{Q\times 1} and define 𝐯P=[0,0,,0]T(NQ)×1\mathbf{v}_{\mathrm{P}}=\left[0,0,...,0\right]{}^{T}\in\mathbb{C}^{\left(N-Q\right)\times 1} as a (NQ)\left(N-Q\right)-dimension zero vector due to there being no excitation at the parasitic ports. We also group current at the feeding and parasitic ports into vectors as 𝐢A=[i1,i2,,iQ]TQ×1\mathbf{i}_{\mathrm{A}}=\left[i_{1},i_{2},...,i_{Q}\right]{}^{T}\in\mathbb{C}^{Q\times 1} and 𝐢P=[iQ+1,iQ+2,,iN]T(NQ)×1\mathbf{i}_{\mathrm{P}}=\left[i_{Q+1},i_{Q+2},...,i_{N}\right]{}^{T}\in\mathbb{C}^{\left(N-Q\right)\times 1}, respectively. The voltage and current in the NN-port network are then related by

[𝐯A𝐯P]=[𝐙A𝐙AP𝐙PA𝐙P+j𝐗L][𝐢A𝐢P],\left[\begin{array}[]{c}\mathbf{v}_{\mathrm{A}}\\ \mathbf{v}_{\mathrm{P}}\end{array}\right]=\left[\begin{array}[]{ll}\mathbf{Z}_{\mathrm{A}}&\mathbf{Z}_{\mathrm{AP}}\\ \mathbf{Z}_{\mathrm{PA}}&\mathbf{Z}_{\mathrm{P}}+j\mathrm{\mathbf{X}}^{L}\end{array}\right]\left[\begin{array}[]{c}\mathbf{i}_{\mathrm{A}}\\ \mathbf{i}_{\mathrm{P}}\end{array}\right], (37)

where 𝐗L=diag(xQ+1L,xQ+2L,,xNL)\mathrm{\mathbf{X}}^{L}=\mathrm{diag}\mathnormal{\left(x_{Q+\text{1}}^{L},x_{Q+\text{2}}^{L},\cdots,x_{N}^{L}\right)} represents the load reactance connected to each parasitic port. 𝐙AQ×Q\mathbf{Z}_{\mathrm{A}}\in\mathbb{C}^{Q\times Q} and 𝐙P(NQ)×(NQ)\mathbf{Z}_{\mathrm{P}}\in\mathbb{C}^{\left(N-Q\right)\times\left(N-Q\right)} are the impedance sub-matrix of the QQ feeding ports and the NQN-Q parasitic ports, respectively. 𝐙APQ×(NQ)\mathbf{Z}_{\mathrm{AP}}\in\mathbb{C}^{Q\times\left(N-Q\right)} and 𝐙PA(NQ)×Q\mathbf{Z}_{\mathrm{PA}}\in\mathbb{C}^{\left(N-Q\right)\times Q} are the sub-matrix referring to the mutual impedance between QQ feeding and NQN-Q parasitic ports with 𝐙AP=𝐙PAT\mathbf{Z}_{\mathrm{AP}}=\mathbf{Z}_{\mathrm{PA}}^{T}. From (37) we can obtain the relationship between the current on the feeding and parasitic ports as

𝐢P=(𝐙P+j𝐗L)1𝐙PA𝐢A.\mathbf{i}_{\mathrm{P}}=-\left(\mathbf{Z}_{\mathrm{P}}+j\mathrm{\mathbf{X}}^{L}\right)^{-1}\mathbf{Z}_{\mathrm{PA}}\mathbf{i}_{\mathrm{A}}. (38)

Our interest is obtaining QQ orthogonal far-field radiation patterns to form the beamspace MIMO system. To that end, we collect the radiation pattern of each feeding port 𝐞T,q(Ω)\mathrm{\mathbf{e}}_{T,q}\left(\Omega\right), q=1,2,,Qq=1,2,...,Q, in the antenna system into matrix form 𝐄T=[𝐞T,1(Ω),𝐞T,2(Ω),,𝐞T,Q(Ω)]\mathrm{\mathbf{E}}_{T}=\left[\mathrm{\mathbf{e}}_{T,1}\left(\Omega\right),\mathrm{\mathbf{e}}_{T,2}\left(\Omega\right),...,\mathrm{\mathbf{e}}_{T,Q}\left(\Omega\right)\right], which can be given as

𝐄T(Ω,𝐗L)=𝐄A(Ω)𝐄P(Ω)(𝐙P+j𝐗L)1𝐙PA,\mathrm{\mathbf{E}}_{T}\left(\Omega,\mathrm{\mathbf{X}}^{L}\right)=\mathbf{E}_{\mathrm{A}}\left(\Omega\right)-\mathbf{E}_{\mathrm{P}}\left(\Omega\right)\left(\mathbf{Z}_{\mathrm{P}}+j\mathrm{\mathbf{X}}^{L}\right)^{-1}\mathbf{Z}_{\mathrm{PA}}, (39)

where Ω=(θ,ϕ)\Omega=\left(\theta,\phi\right) denotes the spatial angle with θ\theta and ϕ\phi representing the elevation and azimuth angles in spherical coordinates, respectively. 𝐄A(Ω)=[𝐞A,1(Ω),,𝐞A,Q(Ω)]\mathbf{E}_{\mathrm{A}}\left(\Omega\right)=\left[\mathbf{e}_{\mathrm{A},1}\left(\Omega\right),\ldots,\mathbf{e}_{\mathrm{A},Q}\left(\Omega\right)\right] collects QQ open-circuit radiation patterns of the feeding ports with 𝐞A,q(Ω)\mathbf{e}_{\mathrm{A},q}\left(\Omega\right), q=1,2,,Qq=1,2,...,Q being the open-circuit radiation pattern of the qqth feeding port excited by a unit current when all the other feeding and parasitic ports are open-circuit. Similarly, 𝐄P(Ω)=[𝐞P,Q+1(Ω),,𝐞P,N(Ω)]\mathbf{E}_{\mathrm{P}}\left(\Omega\right)=\left[\mathbf{e}_{\mathrm{P},Q+1}\left(\Omega\right),\ldots,\mathbf{e}_{\mathrm{P},N}\left(\Omega\right)\right] collects NQN-Q open-circuit radiation patterns of the parasitic ports with 𝐞P,Q+p(Ω)\mathbf{e}_{\mathrm{P},Q+p}\left(\Omega\right), p=1,2,,NQp=1,2,...,N-Q, being the open-circuit radiation pattern of the (Q+p)\left(Q+p\right)th parasitic port excited by a unit current when all the other feeding and parasitic ports are open-circuited. It can be observed that 𝐄T\mathrm{\mathbf{E}}_{T} consists of the original open-circuit radiation pattern of the QQ feeding ports 𝐄A\mathbf{E}_{\mathrm{A}} and a perturbation term affected by the load reactance 𝐗L\mathrm{\mathbf{X}}^{L} across the parasitic ports.

We use a full electromagnetic solver, CST studio suite [34], to simulate the open-circuit radiation patterns of all NN ports of the loaded NN-port structure. It should be noted that the simulation only needs to be performed once because any radiation pattern excited by any current distribution can then be found by using (39) [35]. This reduces the computation complexity enormously as full electromagnetic simulation is not needed during the load optimization and capacity simulation stages.

IV-B Optimization

To obtain the optimal design of the MIMO antenna, we wish to select QQ active feeding ports and optimize the load reactances at the NQN-Q parasitic ports. The objective is to generate a set of orthogonal radiation patterns in 𝐄T\mathrm{\mathbf{E}}_{T} at QQ feeding ports so that the resulting channel capacity can be maximized.

To achieve an optimization result that also meets practical design constraints, we need to note that not all ports would be suitable for use as feeding. For example, ports positioned on the edges of the structure are more likely to be accessible for feeds than those completely surrounded by ports. For this reason, we partition the NN ports into two sets for optimization. Assume SS (S>Q)\left(S>Q\right) out of NN ports are feasible feeding ports and whose indices in the set are given by 𝒮={1,2,,S}\mathcal{S}=\left\{1,2,...,S\right\}. The objective is therefore to find the optimum QQ ports from the SS feasible ports where the set of indices for the QQ feeding ports is given by 𝒫={p1,p2,,pQ}𝒮\mathcal{P}=\left\{p_{1},p_{2},...,p_{Q}\right\}\subset\mathcal{S} . In addition, we must find the optimum loads for the remaining NSN-S ports so that together with the SQS-Q unselected feeding ports, the load reactance matrix 𝐗L\mathbf{X}^{L} can meet the requirements of orthogonality for the QQ selected feeding ports.

We use the correlation coefficient between the radiation patterns of two feeding ports, i.e. 𝐞T,j(Ω,𝐗L)\text{$\mathrm{\mathbf{e}}_{T,j}$}\left(\Omega,\mathrm{\mathbf{X}}^{L}\right) of the jjth feeding port and 𝐞T,k(Ω,𝐗L)\text{$\mathrm{\mathbf{e}}_{T,k}$}\left(\Omega,\mathrm{\mathbf{X}}^{L}\right) of the kkth feeding port, to evaluate their similarity, and this is defined as

ρjk(𝐗L)=𝐞T,j(Ω,𝐗L),𝐞T,k(Ω,𝐗L)𝐞T,j(Ω,𝐗L)𝐞T,k(Ω,𝐗L)\rho_{jk}\left(\mathrm{\mathbf{X}}^{L}\right)=\frac{\left\langle\text{$\mathrm{\mathbf{e}}_{T,j}$}\left(\Omega,\mathrm{\mathbf{X}}^{L}\right),\text{$\mathrm{\mathbf{e}}_{T,k}$}\left(\Omega,\mathrm{\mathbf{X}}^{L}\right)\right\rangle}{\left\|\mathrm{\mathbf{e}}_{T,j}\left(\Omega,\mathrm{\mathbf{X}}^{L}\right)\right\|\left\|\mathrm{\mathbf{e}}_{T,k}\left(\Omega,\mathrm{\mathbf{X}}^{L}\right)\right\|} (40)

where ρjk\rho_{jk} satisfies 0|ρjk|10\leq\left|\rho_{jk}\right|\leq 1. When |ρjk|=0\left|\rho_{jk}\right|=0, 𝐞T,j\mathrm{\mathbf{e}}_{T,j} and 𝐞T,k\mathrm{\mathbf{e}}_{T,k} are orthogonal to each other. Leveraging the correlation coefficient (40), we can formulate the optimization problem as

min𝒫,𝐗L\displaystyle\underset{\mathcal{P},\>\mathbf{X}^{L}}{\mathrm{min}}\>\>\>\> k[𝒫]jkj[𝒫]|ρjk(𝐗L)|.\displaystyle\stackrel{{\scriptstyle[}}{{k}}\in\mathcal{P}]{j\neq k}{\sum}\stackrel{{\scriptstyle[}}{{j}}\in\mathcal{P}]{}{\sum}\left|\rho_{jk}\left(\mathrm{\mathbf{X}}^{L}\right)\right|. (41)

It should be noted that the optimization variables 𝒫\mathcal{P} and 𝐗L\mathbf{X}^{L} are highly coupled with each other in this problem. This is because altering feeding port indices greatly changes the impedance matrix 𝐙A\mathbf{Z}_{\mathrm{A}}, 𝐙AP\mathbf{Z}_{\mathrm{AP}} and 𝐙P\mathbf{Z}_{\mathrm{P}} in (37), making the problem (41) complicated to solve. To overcome this difficulty and meet these constraints, we propose the use of the alternating optimization method to iteratively optimize the load reactances and selection of the feeding ports [36], [37]. For simplicity, the SQS-Q unselected feeding ports are left open in the entire optimization process to ease the computational burden. However they could also be included in the load optimization process if deemed necessary.

To start with, we randomly select QQ feeding ports, out of SS, as the initial guess at iteration 0. The set consisting of the indices for QQ initial random feeding ports is given by 𝒫(0)={p1(0),p2(0),,pQ(0)}𝒮\mathcal{P}^{\left(0\right)}=\left\{p_{1}^{\left(0\right)},p_{2}^{\left(0\right)},...,p_{Q}^{\text{$\left(0\right)$}}\right\}\subset\mathcal{S} with pq(0)p_{q}^{\left(0\right)} being the index of the qqth initial feeding port. Presetting the SQS-Q unused feeding ports to be open, we then need to optimize NSN-S load reactances at the parasitic ports where the initial guess for the load reactance matrix is given as 𝐗L(0)=diag(,,,0,,0)\mathbf{X}^{L\left(0\right)}=\mathrm{diag}\mathnormal{\left(\infty,...,\infty,\text{0},...,\text{0}\right)} with the first SQS-Q diagonal entries being fixed as infinity (open).

At the iith iteration, we wish to find the load reactances 𝐗L(i)\mathbf{X}^{L\left(i\right)} to produce near orthogonal radiation patterns with feeding ports indices set 𝒫(i1)\mathcal{P}^{\left(i-1\right)}. This can be performed by solving the optimization problem

min𝐗L(i)\displaystyle\underset{\mathbf{X}^{L\left(i\right)}}{\mathrm{min}}\>\>\>\> k[𝒫(i1)]jkj[𝒫(i1)]|ρjk(𝐗L(i))|.\displaystyle\stackrel{{\scriptstyle[}}{{k}}\in\mathcal{P}^{\left(i-1\right)}]{j\neq k}{\sum}\stackrel{{\scriptstyle[}}{{j}}\in\mathcal{P}^{\left(i-1\right)}]{}{\sum}\left|\rho_{jk}\left(\mathrm{\mathbf{X}}^{L\left(i\right)}\right)\right|. (42)

This makes the correlation coefficient norm between any two patterns in 𝐄T\mathrm{\mathbf{E}}_{T} as close to zero as possible. To solve the unconstrained optimization problem (42), we can use the quasi-Newton method [38] which guarantees convergence to a stationary point of the problem (42). The optimal load reactances at the iith iteration are denoted by 𝐗L(i)\mathbf{X}^{L\left(i\right)}.

We then use 𝐗L(i)\mathbf{X}^{L\left(i\right)} from (42) to optimize the indices of the feeding ports 𝒫(i)\mathcal{P}^{\left(i\right)}, that is,

min𝒫(i)\displaystyle\underset{\mathcal{P}^{\left(i\right)}}{\mathrm{min}}\>\>\>\> k[𝒫(i)]jkj[𝒫(i)]|ρjk(𝐗L(i))|,\displaystyle\stackrel{{\scriptstyle[}}{{k}}\in\mathcal{P}^{\left(i\right)}]{j\neq k}{\sum}\stackrel{{\scriptstyle[}}{{j}}\in\mathcal{P}^{\left(i\right)}]{}{\sum}\left|\rho_{jk}\left(\mathrm{\mathbf{X}}^{L\left(i\right)}\right)\right|, (43)

which is an NP-hard optimization problem. To solve this problem, we propose a low-complexity approach which is based on sequentially optimizing each entry in 𝒫(i)\mathcal{P}^{\left(i\right)}, i.e. p1(i)p_{1}^{\left(i\right)}, p2(i)p_{2}^{\left(i\right)},…, pQ(i)p_{Q}^{\left(i\right)}, one-by-one. Specifically, we define 𝒫q(i)={p1(i),,pq(i),pq+1(i1),pq+2(i1),,pQ(i1)}𝒮\mathcal{P}_{q}^{\left(i\right)}=\left\{p_{1}^{\left(i\right)},...,p_{q}^{\text{$\left(i\right)$}},p_{q+1}^{\left(i-1\right)},p_{q+2}^{\left(i-1\right)},...,p_{Q}^{\left(i-1\right)}\right\}\subset\mathcal{S} as the intermediate state of feeding port indices at the iith iteration where the indices of the first qq feeds have been optimized and the remaining QqQ-q feeds not optimized. When optimizing index pq(i)p_{q}^{\left(i\right)} for the qqth feeding port, we fix the other Q1Q-1 indices and select the optimal pq(i)p_{q}^{\left(i\right)} from the remaining SQ+1S-Q+1 indices in 𝒮\mathcal{S} to minimize the correlation coefficient (40) among all QQ ports in 𝒫q(i)\mathcal{P}_{q}^{\left(i\right)}, which can be formulated as

minpq(i)\displaystyle\underset{p_{q}^{\left(i\right)}}{\mathrm{min}}\>\>\>\> k[𝒫q(i)]jkj[𝒫q(i)]|ρjk(𝐗L(i))|,\displaystyle\stackrel{{\scriptstyle[}}{{k}}\in\mathcal{P}_{q}^{\left(i\right)}]{j\neq k}{\sum}\stackrel{{\scriptstyle[}}{{j}}\in\mathcal{P}_{q}^{\left(i\right)}]{}{\sum}\left|\rho_{jk}\left(\mathrm{\mathbf{X}}^{L\left(i\right)}\right)\right|, (44)
s.t.\displaystyle\mathrm{s.t.}\>\>\>\> pq(i)𝒮(𝒫q1(i)pq(i1)),\displaystyle p_{q}^{\left(i\right)}\in\mathcal{S}\setminus\left(\mathcal{P}_{q-1}^{\left(i\right)}\setminus p_{q}^{\left(i-1\right)}\right), (45)

with qq sequentially taken as q=1,2,,Qq=1,2,...,Q. It should be noted that 𝒫q1(i)=𝒫(i1)\mathcal{P}_{q-1}^{\left(i\right)}=\mathcal{P}^{\left(i-1\right)} when q=1q=1.

Algorithm 1 The alternating optimization method

Input: 𝒫(0)\mathcal{P}^{\text{$\left(0\right)$}}, 𝐗L(0)\mathbf{X}^{L\mathrm{\left(0\right)}} (initial values described in text);

   1: Initialization: i=0i=0;

   2: repeat

          i=i+1i=i+1;

   3:     Find 𝐗L(i)\mathbf{X}^{L\left(i\right)} with 𝒫(i1)\mathcal{P}^{\text{$\left(i-1\right)$}} by (42);

   4:     for q=1:Qq=1\colon Q

   5:         Find pq(i)p_{q}^{\left(i\right)} subject to (45) with 𝐗L(i)\mathbf{X}^{L\left(i\right)} by (44);

   6:         Update 𝒫q(i)\mathcal{P}_{q}^{\left(i\right)};

   7:     end

   8:     𝒫(i)=𝒫Q(i)\mathcal{P}^{\left(i\right)}=\mathcal{P}_{Q}^{\left(i\right)}

   9: until 𝒫(i)=𝒫(i1)\mathcal{P}^{\left(i\right)}=\mathcal{P}^{\left(i-1\right)};

Output: 𝐗L=𝐗L(i)\mathbf{X}^{L\star}=\mathbf{X}^{L\left(i\right)}, 𝒫=𝒫(i)\mathcal{P}^{\star}=\mathcal{P}^{\left(i\right)};

By iteratively optimizing the load reactances and selecting feeding ports, the objective function (42) and (44) can converge to a local optimal solution [37]. Therefore, we obtain the optimal load reactances 𝐗L\mathbf{X}^{L\star} of the loaded NN-port structures with QQ feeds with indices of 𝒫\mathcal{P}^{\star}. Algorithm 1 summarizes the overall algorithm for optimizing the load reactances and feeding ports. The performance of the algorithm in finding optimum MIMO antenna configuration will be shown in the next section.

V Numerical Results

In this section, we firstly compare our proposed method with previous work [11] to verify its accuracy and correctness. After that we propose a MIMO antenna that can approach the continuous channel capacity bound.

For channel capacity simulations with the current constraint, we use the same SNR definition as in [11] which is the ratio of the current norm Iin2I_{\mathrm{in}}^{2} to the noise power σ2\sigma^{2}. While for capacity simulation with the radiated power constraint and also dual constraints, we use the conventional SNR definition, i.e. the ratio of maximum radiated power PradP_{\mathrm{rad}} to the noise power σ2\sigma^{2}.

V-A Comparison of Channel Capacity

Refer to caption
Refer to caption
Figure 3: Discretization of a one square wavelength 2D square planar surface using an NN-port loaded structure where the ports are indicated by the red marks. In example a) there are 2×22\times 2 sub-elements with 4 ports, and for example b) there are 10×1010\times 10 sub-elements with 180 ports.

In the first set of simulation results we wish to verify our approach by comparing with previous results [11]. The previous results consider a continuous source current on a 2D square planar surface with one square wavelength size at both the transmitter and receiver. The 2D surface is discretized into an NN-port structure as shown in Fig 3 where discretizations with 4 and 180 ports are shown. This corresponds to dividing the one wavelength square surface into 4 and 100 sub-elements respectively similarly to the discretizations of previous work [11].

To compare our method to previous work [11], we only need to find the required currents at all NN ports and do not need to consider finding NQN-Q loads or QQ feeding ports. We can therefore use the results in (22) and (23) directly without considering the loads and feeds to find the port currents.

The same channel setup utilized previously [11] is also invoked and includes single polarization in a rich scattering environment with Rayleigh fading and 2D uniform power angular spectrum (PAS) on the azimuth plane [31]. In addition, we use the current norm constraint so that the capacity optimization problem is formulated as (22) and (23), with WF to find the currents. This is similar to equation (21) in [11].

Simulation results of the system capacity are shown in Fig. 4 for various levels of discretizations. In the figure the number of discretizations per dimension is used to align with previous work [11]. That is 10 sub-elements per wavelength correspond to 100 sub-elements in total. It can be observed that by using our method, the system capacity approaches that using the method in [11] when the number of sub-elements per wavelength is more than four. The capacity gap (for 4 or less sub-elements per square wavelength) arises due to the current on the discrete ports being an approximation to the continuous current on the surface, which is related to the position of the ports in the NN-port structure. Therefore, we can conclude that our method can obtain the continuous-space electromagnetic channel capacity bound if the dimension of the sub-element and the separation distance between ports is less than a wavelength (from Fig. 3 greater than 5 sub-elements per wavelength).

Refer to caption
Figure 4: Capacity comparison with method in [11] at SNR=20\mathrm{SNR}=20 dB. Note that the discretizations on the horizontal axis are sub-elements per wavelength. That is 10 sub-elements per wavelength corresponds to 100 sub-elements in total.

V-B MIMO Antenna Capacity Bound

Next, we use the technique in Section IV to design a practical MIMO antenna that approaches the capacity bounds. The frequency considered in the following is 2.4 GHz so that the wavelength is 125 mm.

The first step is to propose a structure that is implementable with QQ feeds but general enough to allow the formation of arbitrary current distributions for capacity maximization. As such our proposed NN-port structure is shown in Fig. 5 where a 2D square copper planar surface with one square wavelength size is again utilized. The copper surface is mounted on a substrate where the copper has electric conductivity of 5.8×1075.8\times 10^{7} S/m while the substrate is made with Rogers 5880C which has permittivity of 2.2 and loss tangent of 0.0009. We discretize the 2D copper surface into a 11×1111\times 11 sub-element array where the size of the sub-element is 10×10mm210\times 10\>\mathrm{mm}^{2}. To implement the feeds, an additional copper plane is placed underneath as a ground plane so that a conventional feed can be connected between the ground and the center of a sub-element as shown in the elevation view in Fig. 5. The NN ports of the structure are defined as the ports across each pair of adjacent sub-elements on the surface and also between the ground and the center of each sub-element. That is there are 220 ports on the surface and 121 ports between the ground and sub-elements making up a total of N=341N=341 ports. The QQ feeding ports need to be selected from the 121 ports between the ground and sub-elements.

Refer to caption
Figure 5: Plan and elevation views of the proposed NN-port structure. A 2D square copper surface with one square wavelength size on top of a ground plane is utilized.

While the ultimate aim is to select QQ feeding ports from the ground to sub-element ports, we firstly need to find the required currents on the NN ports to optimize capacity and approach the capacity bounds similarly as in Section V.A. This allows us to first determine the capacity performance of the structure with ideal port currents. The final load reactances and feeds arrangements of the antenna that can approximate these currents are found in the next subsection.

In the capacity simulations, we obtain them as a function of PP (P=1,2,3P=1,2,3, etc.) basis with the highest eigenvalues (out of N=341)N=341) for transmission. This allows us to determine the number of EADoF provided by the structure. In the simulations, we use PP ideally isolated antennas at the receiver side. We also assume a rich scattering environment so that entries in 𝐇v\mathbf{H}_{v} satisfy [𝐇v]ij𝒞𝒩(0,1)\left[\mathbf{H}_{v}\right]_{ij}\sim\mathcal{CN}\left(0,1\right) for i,j=1,2,,Ki,j=1,2,\ldots,K. In addition, dual polarization in a rich scattering environment with Rayleigh fading and 3D uniform PAS over full sphere is utilized. For these reasons the capacity results obtained for this channel will be much greater than that obtained in Section V.A which used only one polarization, a 2D PAS and a receiver antenna that was the same as the transmitter.

V-B1 Capacity Bound with Individual Constraints

Refer to caption
Figure 6: Capacity of P×PP\times P MIMO system using the NN-port structure in Fig. 5 with the constraints of radiated power and current norm at SNR=20\mathrm{SNR}=20 dB.

We simulate the capacity of the proposed structure when we use the PP basis functions with the highest eigenvalues to create a P×PP\times P system with the two individual constraints of current norm and radiated power. The simulated capacity with the two constraints using EP allocation and WF method are shown in Fig. 6. These are benchmarked with the results of an ideal P×PP\times P MIMO system where the transmitter is equipped with PP spatially isolated antennas. It can be observed that if we only consider the constraint of radiated power, the simulated capacity of the structure is the same as that of ideal MIMO. However, the current norm in this case can be extremely large and not implementable.

On the other hand, if we only consider the constraint of current norm, the capacity of the loaded structure is higher than the ideal MIMO systems for the first few basis. This is because those basis with large eigenvalues have larger radiated power than the ideal MIMO system. However, when we consider more basis, e.g. more than 50, the increase in capacity is negligible since those basis have small eigenvalues and only radiate limited power. Therefore, in this case the WF method can allocate more power to these 50 basis with large eigenvalues to improve the capacity.

V-B2 Capacity Bound with Dual Constraint

Refer to caption
Figure 7: Results relating the dual constraints ratio, ϵ\epsilon to the EADoF NeffN_{\mathrm{eff}}. For ϵ=0.02\epsilon=0.02 it can be observed Neff18.N_{\mathrm{eff}}\approx 18.

Next we consider imposing dual constraints for the simulation of channel capacity and this leads us to consider ϵ\epsilon and NeffN_{\mathrm{eff}}. In Fig. 7, we plot the minimum ϵ\epsilon for the first NeffN_{\mathrm{eff}} basis as calculated in (32). It can be noticed that ϵ\epsilon is an increasing function of NeffN_{\mathrm{eff}} as expected.

For an ideal lossless antenna, the input impedance is Zin=50ΩZ_{\mathrm{in}}=50\ \Omega, i.e. Prad=ZinIin2P_{\mathrm{rad}}=Z_{\mathrm{in}}I_{\mathrm{in}}^{2}. Therefore a region of interest on Fig. 7, is when ϵ=Iin2/Prad=1/Zin=0.02\epsilon=I_{\mathrm{in}}^{2}/P_{\mathrm{rad}}=1/Z_{\mathrm{in}}=0.02. In this region NeffN_{\mathrm{eff}} is approximately 18 and therefore provides an estimate of the minimum EADoF to expect. In essence we can expect to achieve at least 18 ports in the one square wavelength area. We consider it as a lower bound in practice, depending on the exact relation between Iin2I_{\mathrm{in}}^{2} and PradP_{\mathrm{rad}}. We may be able to achieve more ports because the internal currents in the antenna can be higher than those at the feeds.

In Fig. 8 we plot the resulting capacity when we use the first PP basis functions to create a P×PP\times P system for various ϵ\epsilon using EP and WF. It can be observed again that the capacity of the MIMO system using the NN-port structure increases as ϵ\epsilon becomes larger. When considering basis PP smaller than NeffN_{\mathrm{eff}}, the system capacity of MIMO using the loaded structure is the same as that of ideal P×PP\times P MIMO. However when PP approaches NeffN_{\mathrm{eff}}, the capacity using EP allocation starts to decrease since equality in dual constraints cannot be achieved. The WF method allocates most power to the first NeffN_{\mathrm{eff}} basis so that the capacity slightly increases when adding more basis to the P×PP\times P MIMO system. In addition, the ϵ\epsilon value points where the WF and EP lines separate indicate the number of available basis NeffN_{\mathrm{eff}} (i.e. EADoF) provided by the ϵ\epsilon value structure as calculated in (32) and approximately match with the results in Fig. 7.

The results in Fig. 7 and and Fig. 8 are useful references in designing MIMO antenna for approaching the capacity bound. The use of ϵ\epsilon provides us with an estimate of the number of feeding ports or NeffN_{\mathrm{eff}}. In particular, we can expect to be able to provide at least 18 ports in the one square wavelength size antenna structure shown in Fig. 5.

Refer to caption
Figure 8: Capacity of P×PP\times P MIMO using using the NN-port structure in Fig. 5 with dual constraints at SNR=20\mathrm{SNR}=20 dB.

V-C MIMO Antenna Design

To provide a useful antenna, we need to select feeding ports and the loads for all the ports in the NN-port loaded structure. The feeding ports and loads must be selected to achieve a close match to the optimum currents found in the previous section for a feasible PP. This then would provide the optimum antenna configuration that can achieve the channel capacity bound with dual constraints.

As discussed previously for an ideal lossless antenna, the input impedance is Zin=50ΩZ_{\mathrm{in}}=50\ \Omega and therefore ϵIin2/Prad=1/Zin=0.02\epsilon\geqslant I_{\mathrm{in}}^{2}/P_{\mathrm{rad}}=1/Z_{\mathrm{in}}=0.02 so that NeffN_{\mathrm{eff}} is at least 18. Expecting to have higher currents internally on the antenna, we aim for 20 ports in our design. That is we will need to re-create the currents obtained when P=20P=20 from the previous section.

To reduce the computational effort in finding the loads, we preset the sub-elements in the center row and center column of the 2D copper surface as being connected together (shorted) so there is a single large cross element centered on the surface. That is ports {56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 6, 17, 28, 39, 50, 72, 83, 94, 105, 116} in Fig. 5 are shorted together. The cross element separates the other sub-elements into four sub-square arrays. The cross is also isolated from the other sub-elements so that the ports between the cross and the other adjacent sub-elements are open without loads. In effect, we are presetting some of the ports to be shorted or open on the surface. This reduces the computational load of the optimization by reducing the number of load reactances to be found from 220 to 160. In addition, to reduce the search space for the Q=20Q=20 feeds we also restrict it to the S=64S=64 ports between the ground plane and the sub-elements on the edge of each sub-square array (instead of all 121 possible locations). For those ports underneath the surface that were not selected as feeds, we set them to be open.

Refer to caption
Figure 9: Return loss, S11, of the 20 feeding ports in the loaded structure in Fig. 5.

To find the optimum reactances across the 160 ports on the surface and the indices of the 20 feeding ports underneath, we use the method as described in Section IV so that the excited radiation patterns of the 20 feeding ports are closest to being orthogonal (42). The final indices of the 20 feeding ports after the alternating optimization are given by

{1,3,5,7,9,11,29,33,49,55,67,69,75,89,95,111,113,117,119,121}\left\{1,3,5,7,9,11,29,33,49,55,67,69,75,89,95,111,113,117,119,121\right\}

and their physical location can be found from Fig. 5. The 20 feeding ports are then each fed by a feeding probe across the common ground and the center of the selected sub-elements as shown in the elevation view in Fig. 5. The impedance matching for the 20 feeding ports is performed separately with a conventional T-type matching network to achieve matching to 50 Ω\Omega. The return loss (S11) results of the 20 ports are provided in Fig. 9 where the reflected power at 2.4 GHz is around -15 dB and the bandwidth is around 40 MHz.

By using the 20 excited radiation patterns from the resultant loaded structure with 20 feeds, and a beamspace channel model (7), the capacity of the 20×2020\times 20 MIMO system is found and the simulated results are shown in Fig. 10. These are also benchmarked with the capacity bound of the loaded structure without feeds obtained by WF method in Section III.C and the capacity of ideal MIMO. It can be observed that the MIMO antenna using the loaded structure with 20 feeds achieves performance close to ideal MIMO and the capacity bound predicted by Fig. 8 for loaded structure without feeds. The capacity gap is due to the small correlations among the radiation patterns of the feeding ports and power loss from mutual coupling. However, the key advantage of the proposed method is that the optimum antenna structure for the MIMO transmitter with size constraint can be found with only minor compromise on system capacity.

Refer to caption
Figure 10: Capacity of 20×2020\times 20 MIMO system for the loaded structure in Fig. 5 with and without feeds.

VI Conclusion

In this paper, we have proposed a novel method for designing antennas that approach the continuous-space electromagnetic channel capacity bounds. The method can link continuous-space electromagnetic channel capacity bounds to MIMO antenna design. The method is not restricted to a specific antenna configuration and uses loaded NN-port structures to discretize the continuous space and represent arbitrary antenna geometries. It is useful for designing compact MIMO antennas that can approach channel capacity bounds when constrained by size.

In the proposed method, we derive the closed-form expressions for the channel capacity limits using a beamspace channel model with a current constraint, radiated power constraint as well as with dual constraints. We also introduce a method for antenna design using the loaded ports structure and provide an efficient alternating optimization approach for finding the optimum MIMO antenna configuration to generate orthogonal radiation patterns. This can be used to construct a beamspace MIMO system that approaches the channel capacity bounds.

Simulation results of the channel capacity using our proposed method matches well with previous work. Furthermore, by optimizing the load reactance and feeding port positions in our proposed antenna design with one square wavelength size, the achieved capacity performance is close to the continuous-space channel capacity bounds, demonstrating the effectiveness of the proposed method. In particular we show that at least 18 ports can be supported in a one wavelength square structure to achieve the continuous-space electromagnetic channel capacity bound. It is also shown that the limit on the number of ports is constrained by the maximum current that the antenna can handle. An example design for a 20-port antenna in a one square wavelength area that achieves the capacity bounds is also provided. One challenge of the final antenna is the limited bandwidth and this can be addressed by increasing the height in the antenna design.

References

  • [1] C. E. Shannon, “A mathematical theory of communication,” The Bell system technical journal, vol. 27, no. 3, pp. 379–423, 1948.
  • [2] V. Tarokh, N. Seshadri, and A. Calderbank, “Space-time codes for high data rate wireless communication: performance criterion and code construction,” IEEE Trans. Inf. Theory, vol. 44, no. 2, pp. 744–765, 1998.
  • [3] E. Telatar, “Capacity of multi-antenna Gaussian channels,” Eur. Trans. Telecommun., vol. 10, no. 6, pp. 585–595, 1999.
  • [4] D. M. Pozar, Microwave engineering.   John wiley & sons, 2011.
  • [5] C. A. Balanis, Antenna theory: analysis and design.   John wiley & sons, 2015.
  • [6] M. D. Migliore, “On electromagnetics and information theory,” IEEE Trans. Antennas and Propagation, vol. 56, no. 10, pp. 3188–3200, 2008.
  • [7] F. K. Gruber and E. A. Marengo, “New aspects of electromagnetic information theory for wireless and antenna systems,” IEEE Trans. Antennas Propag., vol. 56, no. 11, pp. 3470–3484, 2008.
  • [8] O. M. Bucci and G. Franceschetti, “On the degrees of freedom of scattered fields,” IEEE Trans. Antennas Propag., vol. 37, no. 7, pp. 918–926, 1989.
  • [9] A. S. Poon, R. W. Brodersen, and D. N. Tse, “Degrees of freedom in multiple-antenna channels: A signal space approach,” IEEE Trans. Inf. Theory, vol. 51, no. 2, pp. 523–536, 2005.
  • [10] M. D. Migliore, “On the role of the number of degrees of freedom of the field in MIMO channels,” IEEE Trans. Antennas Propag., vol. 54, no. 2, pp. 620–628, 2006.
  • [11] M. A. Jensen and J. W. Wallace, “Capacity of the continuous-space electromagnetic channel,” IEEE Trans. Antennas Propag., vol. 56, no. 2, pp. 524–531, 2008.
  • [12] W. Jeon and S.-Y. Chung, “Capacity of continuous-space electromagnetic channels with lossy transceivers,” IEEE Trans. Inf. Theory, vol. 64, no. 3, pp. 1977–1991, 2017.
  • [13] R. D. Murch and K. B. Letaief, “Antenna systems for broadband wireless access,” IEEE Commun. Mag., vol. 40, no. 4, pp. 76–83, 2002.
  • [14] C.-Y. Chiu, J.-B. Yan, and R. D. Murch, “Compact three-port orthogonally polarized MIMO antennas,” IEEE Antennas Wirel. Propag. Lett., vol. 6, pp. 619–622, 2007.
  • [15] J. Ren, W. Hu, Y. Yin, and R. Fan, “Compact printed MIMO antenna for UWB applications,” IEEE Antennas Wirel. Propag. Lett., vol. 13, pp. 1517–1520, 2014.
  • [16] G. Zhai, Z. N. Chen, and X. Qing, “Enhanced isolation of a closely spaced four-element MIMO antenna system using metamaterial mushroom,” IEEE Trans. Antennas Propag., vol. 63, no. 8, pp. 3362–3370, 2015.
  • [17] L. Lu, G. Y. Li, A. L. Swindlehurst, A. Ashikhmin, and R. Zhang, “An overview of massive MIMO: Benefits and challenges,” IEEE J. Sel. Top. Signal Process., vol. 8, no. 5, pp. 742–758, 2014.
  • [18] S. Soltani and R. D. Murch, “A compact planar printed MIMO antenna design,” IEEE Trans. Antennas Propag., vol. 63, no. 3, pp. 1140–1149, 2015.
  • [19] S. Shen and R. D. Murch, “Impedance matching for compact multiple antenna systems in random RF fields,” IEEE Trans. Antennas Propag., vol. 64, no. 2, pp. 820–825, 2015.
  • [20] J. W. Wallace and M. A. Jensen, “Mutual coupling in MIMO wireless systems: A rigorous network theory analysis,” IEEE Trans. Wirel. Commun., vol. 3, no. 4, pp. 1317–1325, 2004.
  • [21] L. Sun, P. Li, M. R. McKay, and R. D. Murch, “Capacity of MIMO systems with mutual coupling: Transmitter optimization with dual power constraints,” IEEE Trans. Signal Process., vol. 60, no. 2, pp. 848–861, 2011.
  • [22] V. Shyianov, M. Akrout, F. Bellili, A. Mezghani, and R. W. Heath, “Achievable rate with antenna size constraint: Shannon meets Chu and Bode,” IEEE Trans. Commun., 2021.
  • [23] R. Harrington and J. Mautz, “Control of radar scattering by reactive loading,” IEEE Trans. Antennas Propag., vol. 20, no. 4, pp. 446–454, 1972.
  • [24] J. Mautz and R. Harrington, “Modal analysis of loaded N-port scatterers,” IEEE Trans. Antennas Propag., vol. 21, no. 2, pp. 188–199, 1973.
  • [25] R. Harrington and J. Mautz, “Pattern synthesis for loaded n-port scatterers,” IEEE Trans. Antennas Propag., vol. 22, no. 2, pp. 184–190, 1974.
  • [26] A. M. Sayeed, “Deconstructing multiantenna fading channels,” IEEE Trans. Signal Process., vol. 50, no. 10, pp. 2563–2579, 2002.
  • [27] V. I. Barousis, A. G. Kanatas, and A. Kalis, “Beamspace-domain analysis of single-RF front-end MIMO systems,” IEEE Trans. Veh. Technol., vol. 60, no. 3, pp. 1195–1199, 2011.
  • [28] V. Barousis and A. G. Kanatas, “Aerial degrees of freedom of parasitic arrays for single RF front-end MIMO transceivers,” Progress Electromagn. Research, vol. 35, pp. 287–306, 2011.
  • [29] A. Kalis, A. G. Kanatas, and C. B. Papadias, “A novel approach to MIMO transmission using a single RF front end,” IEEE J. Sel. Areas Commun., vol. 26, no. 6, pp. 972–980, 2008.
  • [30] O. N. Alrabadi, C. B. Papadias, A. Kalis, and R. Prasad, “A universal encoding scheme for MIMO transmission using a single active element for PSK modulation schemes,” IEEE Trans. Wirel. Commun., vol. 8, no. 10, pp. 5133–5142, 2009.
  • [31] Z. Han, S. Shen, Y. Zhang, C.-Y. Chiu, and R. Murch, “A pattern correlation decomposition method for analysis of ESPAR in single-RF MIMO systems,” IEEE Trans. Wirel. Commun., 2021.
  • [32] Z. Han, Y. Zhang, S. Shen, Y. Li, C.-Y. Chiu, and R. Murch, “Characteristic mode analysis of ESPAR for single-RF MIMO systems,” IEEE Trans. Wirel. Commun., vol. 20, no. 4, pp. 2353–2367, 2021.
  • [33] K. Maliatsos and A. G. Kanatas, “Modifications of the IST-WINNER channel model for beamspace processing and parasitic arrays,” in in Proc. 7th Eur. Conf. Antennas and Propag.   IEEE, 2013, pp. 989–993.
  • [34] CST Microwave Studio 2019, http://www.cst.com.
  • [35] Y. Zhang, S. Shen, Z. Han, C.-Y. Chiu, and R. Murch, “Compact MIMO systems utilizing a pixelated surface: Capacity maximization,” IEEE Trans. Veh. Technol., vol. 70, no. 9, pp. 8453–8467, 2021.
  • [36] Y. Bingbing, R. Wenbo, Y. Bolin, and L. Yang, “An indoor positioning algorithm and its experiment research based on RFID,” International Journal on Smart Sensing & Intelligent Systems, vol. 7, no. 2, 2014.
  • [37] J. C. Bezdek and R. J. Hathaway, “Convergence of alternating optimization,” Neural, Parallel & Scientific Computations, vol. 11, no. 4, pp. 351–368, 2003.
  • [38] P. E. Gill and W. Murray, “Quasi-Newton methods for unconstrained optimization,” IMA J. Appl. Math., vol. 9, no. 1, pp. 91–108, 1972.