This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Intelligent Reflecting Surface Assisted Multi-Cluster AirComp via Dynamic Beamforming

Yapeng Zhao, Qingqing Wu, Wen Chen, Celimuge Wu, and Octavia A. Dobre,  Y. Zhao is with the State Key Laboratory of Internet of Things for Smart City, University of Macau, Macao 999078, China, and also with the Institute for Signal Processing and Systems at Shanghai Jiao Tong University, 200240, China (email: yc17435@connect.um.edu.mo). Q. Wu and W. Chen is with the Department of Electronic Engineering, Shanghai Jiao Tong University, 200240, China (e-mail: qingqingwu@sjtu.edu.cn; wenchen@sjtu.edu.cn). C. Wu is with Graduate School of Informatics and Engineering, The University of Electro-Communications, Tokyo,182-8585, Japan (e-mail: Celimuge@uec.ac.jp). O. A. Dobre is with the Faculty of Engineering and Applied Science, Memorial University, St. John’s, NL A1B 3X5, Canada (e-mail: odobre@mun.ca).
Abstract

This paper studies an multi-cluster over-the-air computation (AirComp) system, where an intelligent reflecting surface (IRS) assists the signal transmission from devices to an access point (AP). The clusters are activated to compute heterogeneous functions in a time-division manner. Specifically, two types of IRS beamforming (BF) schemes are proposed to reveal the performance-cost tradeoff. One is the cluster-adaptive BF scheme, where each BF pattern is dedicated to one cluster, and the other is the dynamic BF scheme, which is applied to any number of IRS BF patterns. By deeply exploiting their inherent properties, both generic and low-complexity algorithms are proposed in which the IRS BF patterns, time and power resource allocation are jointly optimized. Numerical results show that IRS can significantly enhance the function computation performance, and demonstrate that the dynamic IRS BF scheme with half of the total IRS BF patterns can achieve near-optimal performance which can be deemed as a cost-efficient approach for IRS-aided multi-cluster AirComp systems.

Index Terms:
Intelligent reflecting surface (IRS), over-the-air computation (AirComp), dynamic beamforming (BF), computation rate maximization.

I Introduction

The sixth generation (6G) network aims to integrate communication, sensing, computing, and intelligence together, thus constructing native multi-functional systems that serve as powerful engines to build an intelligent world [1, 2, 3]. A series of advanced services are progressively conceived and experimented in such multi-functional systems, such as extended reality (XR), edge artificial intelligence (AI), and autonomous driving [4, 5], either of which needs to collect and process enormous data from distributed devices. Conventionally, the raw data collection and process are regarded as separate procedures and designed in isolation.

In practice, these intelligent services demand the functional computation results of the raw data, such as the maximum reading of several temperature monitors, the average of local models, and the minimum distance between cars and barricades. The recently aroused over-the-air computation (AirComp), deemed as a task-oriented multiple-access (MA) strategy, has shown its high efficiency in integrating communication and computation through leveraging the signal superposition characteristic of the wireless channel [6]. The co-channel interference is treated as a catalyst in AirComp for computation, whereas traditional task-agnostic MA techniques consider it to be detrimental. AirComp was conceived [7, 8] and validated [9] to be a promising scalable function computation strategy in a series of works. Specifically, the transceiver design that exhibits a uniform-forcing structure to compensate individual channel fading was proposed in [10], and the multiple-input multiple-output (MIMO) AirComp [7] was exploited for simultaneously multi-modal sensing. However, the harsh wireless propagation environment still significantly weakens the signal strength and thus deteriorates the computation performance.

Intelligent reflecting surface (IRS), a promising technology for future wireless networks, has shown the potential to overcome this detrimental effect by adapting the phase shifts of low-cost passive elements. Several works have exhibited the effectiveness of IRSs in the single-cluster AirComp system, e.g., the authors in [11, 12] employed an IRS to strengthen channels thus reduce the distortion. If the single-cluster AirComp is intuitively extended to a multi-cluster case like the conventional communication systems [13, 6], the considered configuration times of IRS passive beamforming (BF) is the same as the number of participated clusters, otherwise its BF remains static during a transmission frame, i.e., each cluster is assigned with a dedicated BF pattern or all clusters employ the same pattern. It is observed that the two policies are rigid in that they provide the upper and lower bounds on system performance with no tradeoff. Although harnessing more BF patterns provides more degrees of freedom (DoFs) which can configure a suitable propagation environment for devices, thereby further enhancing the system performance, it also adds extra signaling overhead. Hence, a flexible dynamic IRS BF scheme is desired to balance the system performance and cost. Recently, a novel dynamic IRS beamforming (DIBF) technique has been proposed to offer promising DoF thus enabling more flexible resource allocation in half-duplex and full-duplex wireless powered communication networks (WPCNs) [14, 15], mobile edge computing (MEC) with binary offloading [16], etc. DIBF proactively develops favorable time-selective channels by harnessing the unique character of IRSs that the BF patterns can be tuned multiple times within a given channel coherence time. Additionally, a good balance between performance, associated tuning costs, and signaling overhead can be achieved by flexibly managing the amount of reconfigurations.

In this work, we analyze an IRS-aided AirComp system comprising of multiple clusters by taking the performance-cost tradeoff into account. The previous studies concentrated on the transceiver design in one dedicated time slot (TS) that aim to minimize the function computation error measured by the mean square error (MSE). We study in this paper how to boost the computation capability within a given period in which the IRS can be configured multiple times. Specifically, clusters are activated in a time-division manner to eliminate the inter-cluster interference thereby computing heterogeneous functions, and we present two types of BF schemes to unleash the fundamental performance-cost tradeoff. The cluster-adaptive BF method, where each BF is allocated to one cluster, is first taken into consideration to provide the upper bound for comparison. The performance-cost tradeoff is then explored using the general dynamic IRS BF method, which is be applied to any number of BF patterns. Simulation results verify the theoretical findings and show how IRS improves the performance of multi-cluster AirComp. In particular, it is observed that the proposed DIBF design is sufficient to acquire near-optimal performance with a minimal computation rate decrease (less than 6%6\%) by halving the number of BF patterns, which sheds light on balancing the performance-cost via limiting the number of BF patterns.

Refer to caption
Figure 1: An IRS-assisted multi-cluster AirComp system.
Refer to caption
Figure 2: Transmission protocol of the multi-cluster AirComp with DIBF.

II System Model and Problem Formulation

As depicted in Fig. 1, the considered multi-cluster AirComp system composes of a single antenna access point (AP), an IRS equipped with NN passive elements, and KK devices. Due to the different computational requirements, the KK devices are divided into LL clusters. Each cluster consists of KK_{\ell} devices and computes a desired type-\ell function named ff_{\ell}. We denote 𝒦{1,,K}\mathcal{K}_{\ell}\triangleq\{1,\ldots,K_{\ell}\} as the set of devices within \ell-th cluster, where 𝒦i𝒦j=,ij,i,j,={1,,L}\mathcal{K}_{i}\cap\mathcal{K}_{j}=\varnothing,\forall i\neq j,\;i,j\in\mathcal{L},\;\mathcal{L}=\{1,\ldots,L\}. The AP computes heterogeneous functions via AirComp from each cluster in a time-division manner. Specifically, the AP computes ff_{\ell} with the raw data from all the KK_{\ell} devices in cluster \ell. Let sk,s_{k,\ell} denote one dedicated reading of the device kk in cluster \ell. The target type-\ell function computed at AP is expressed as

f=ϕ(k𝒦φk(sk,)),\displaystyle f_{\ell}=\phi_{\ell}\left(\sum\nolimits_{k\in\mathcal{K}_{\ell}}\varphi_{k}(s_{k,\ell})\right), (1)

where φk()\varphi_{k}(\cdot) is the pre-processing function at device kk, and ϕ()\phi_{\ell}(\cdot) is the post-processing function at the AP. We adopt the digital AirComp to perform function computation, in which the nest lattice coding strategy is harnessed to achieve the integer combinations of transmitted codewords and combat the channel noise meanwhile. Specifically, the reading of each device is first processed by φk()\varphi_{k}(\cdot), then the result is quantized and mapped into a vector, and finally, it is encoded into a nested lattice codeword (see [17, 13] for more details). We assume that each symbol xkx_{k} of the transmitted signal has been normalized into a mutual independent symbol with zero mean and unit power. Hence, the target recovered data at the AP of cluster \ell when computing a type-\ell function is given by y=k𝒦xk,y_{\ell}=\sum_{k\in\mathcal{K}_{\ell}}x_{k,\ell}.

The baseband channels from the IRS to AP, from the device kk to IRS, and from the device kk to AP are denoted by 𝒈H1×N\bm{g}^{\mathrm{H}}\in\mathbb{C}^{1\times N}, 𝒉,krN×1\bm{h}^{r}_{\ell,k}\in\mathbb{C}^{N\times 1} and h,kd{h}^{d}_{\ell,k}\in\mathbb{C}, respectively. In this paper, we assume that all the involved channels are perfectly known by applying the state-of-the-art channel estimation techniques [18]. All devices in each cluster transmit concurrently with the symbol-level synchronization assumed. In a channel coherence interval TtT_{t}, the duration required to compute a function is further divided into JJ TSs. In addition, tjt_{j}, where 𝚯j\mathbf{\Theta}_{j} is employed in, denotes the time interval for the jj-th TS. Then, tjt_{j} is divided to LL sub-TSs for the function computation in different clusters. In sub-TS t,jt_{\ell,j}, the recovered signal from cluster \ell to compute ff_{\ell} is given by

y^,j\displaystyle\hat{y}_{\ell,j} =a,j(k𝒦h,j,keb,j,kx,j,k+z)\displaystyle=a_{\ell,j}\bigg{(}\sum_{k\in\mathcal{K}_{\ell}}{h}^{e}_{\ell,j,k}b_{\ell,j,k}{x}_{\ell,j,k}+{z}_{\ell}\bigg{)}
=k𝒦x,j,k+k𝒦(a,jh,j,keb,j,k1)x,j,k+a,jz,\displaystyle=\!\!\sum_{k\in\mathcal{K}_{\ell}}\!{x}_{\ell,j,k}\!+\!\!\!\sum_{k\in\mathcal{K}_{\ell}}\!\!\big{(}a_{\ell,j}{h}^{e}_{\ell,j,k}b_{\ell,j,k}\!-\!1\big{)}{x}_{\ell,j,k}\!+\!a_{\ell,j}z_{\ell}, (2)

where b,j,k,a,jb_{\ell,j,k},a_{\ell,j}\in\mathbb{C} are the transmitter and the receiver scalars, respectively. Besides, h,j,ke=h,kd+𝒈H𝚯j𝒉,kr=h,kd+𝒗jHdiag(𝒈H)𝐡,kr{h}^{e}_{\ell,j,k}={h}^{d}_{\ell,k}+\bm{g}^{\mathrm{H}}_{\ell}\bm{\Theta}_{j}\bm{h}^{r}_{\ell,k}={h}^{d}_{\ell,k}+\bm{v}_{j}^{\mathrm{H}}\mathrm{diag}(\bm{g}_{\ell}^{\mathrm{H}})\mathbf{h}^{r}_{\ell,k} denotes the composite device-AP channel coefficient of device kk in cluster \ell, where 𝚯j=diag(eȷθj,1,,eȷθj,N)\bm{\Theta}_{j}=\operatorname{diag}(e^{\jmath\theta_{j,1}},\ldots,e^{\jmath\theta_{j,N}}) denotes the phase shift matrix of IRS, in which θj,n[0,2π),n𝒩\theta_{j,n}\in[0,2\pi),\forall n\in\mathcal{N} denotes the phase shift on the incident signal of nn-th element [19], 𝒗j=[vj,1,,vj,N]H\bm{v}_{j}=[v_{j,1},\ldots,v_{j,N}]^{\mathrm{H}} with vj,n=eȷθnv_{j,n}=e^{\jmath\theta_{n}}, and z𝒞𝒩(0,σ2)z_{\ell}\in\mathbb{C}\sim\mathcal{CN}(0,\sigma_{\ell}^{2}) denotes the additional white Gaussian noise.

The landmark work in [17] established the fundamentals of compute-and-forward strategy from the information theory perspective. Accordingly, the subsequent work [13] proposed the computation rate of AirComp, defined as the achieved number of computed function values per channel use (in num/Hz), which can be expressed as

r,jc=1m+log2Klog2+(1MSE(y^,j,y,j)),\displaystyle r_{\ell,j}^{\mathrm{c}}=\frac{1}{m_{\ell}+\log_{2}K_{\ell}}\log_{2}^{+}\left(\frac{1}{\mathrm{MSE}(\hat{y}_{\ell,j},y_{\ell,j})}\right), (3)

with mm_{\ell} being the number of the optimal quantization bits for the type-\ell function computation, KK_{\ell} being the number of involved devices, and log2+()max{log2(),0}\log_{2}^{+}(\cdot)\triangleq\max\{\log_{2}(\cdot),0\}. It can be observed from (3) that the computation rate maximization problem can be equivalently transformed into the MSE minimization problem. By adopting the uniform-forcing transceiver proposed in [10], the solution to the MSE minimization problem is

a,j=1η,j,b,j,k=η,j(h,j,ke)H|h,j,ke|2,η,j=P0mink|h,j,ke|2,\displaystyle\!a_{\ell,j}\!\!=\!\!\frac{1}{\sqrt{\eta_{\ell,j}}},b_{\ell,j,k}\!\!=\!\!\sqrt{\eta_{\ell,j}}\frac{(h_{\ell,j,k}^{e})^{\mathrm{H}}}{|h_{\ell,j,k}^{e}|^{2}},\eta_{\ell,j}\!\!=\!\!P_{0}\min_{k}|h_{\ell,j,k}^{e}|^{2},\! (4)

where P0P_{0} denotes the maximum allowed transmit power. Similarly, for the case when devices are energy-limited, we have

η,j=mink(p,j,k|h,j,ke|2),\displaystyle\eta_{\ell,j}=\min_{k}(p_{\ell,j,k}|h_{\ell,j,k}^{e}|^{2}), (5)

where p,j,kp_{\ell,j,k} is the allocated power of device kk for current transmission. The corresponding computation rate of cluster \ell under BF pattern 𝚯j\mathbf{\Theta}_{j} is given by

r,jc=1m+log2Klog2+(mink(p,j,k|h,j,ke|2)σ2).\displaystyle r_{\ell,j}^{\mathrm{c}}=\frac{1}{m_{\ell}+\log_{2}K_{\ell}}\log_{2}^{+}\left(\frac{\min_{k}(p_{\ell,j,k}|h_{\ell,j,k}^{e}|^{2})}{\sigma^{2}_{\ell}}\right). (6)

We aim to boost the computation capability within TtT_{t} in which IRS can be configured multiple times. As shown in Fig. 2, the two proposed IRS BF schemes depend on how the IRS sets its BF during the time interval, and are specified as follows.

II-A Cluster-adaptive IRS BF

Each cluster is mapped with a unique IRS BF pattern as well as the TSs in the cluster-adaptive BF scheme, which provide the performance upper bound, i.e., arbitrary BF schemes with J>KJ>K cannot further improve the performance [14]. Accordingly, the computation rate maximization problem is given by

(P1):max{t},{𝒗},{p,k}\displaystyle\!\!\!\!\!\text{(P1)}:\!\!\mathop{\max}\limits_{\scriptstyle\{t_{\ell}\},\{\bm{v}_{\ell}\},\atop\scriptstyle\{p_{\ell,k}\}}\;\;\!\!\! =1Lwtm+log2Klog2+(mink(p,k|h,ke|2)σ2)\displaystyle\sum\limits_{\ell=1}^{L}\!\!\frac{w_{\ell}t_{\ell}}{m_{\ell}\!+\!\log_{2}K_{\ell}}\!\log_{2}^{+}\!\left(\!\!\frac{\min_{k}(p_{\ell,k}|h_{\ell,k}^{e}|^{2})}{\sigma^{2}_{\ell}}\!\!\right) (7a)
s.t.\displaystyle{\rm{s.t.}}\;\;\;\;\;\; t0,p,k0,,k,\displaystyle t_{\ell}\geq 0,\ p_{\ell,k}\geq 0,\ \forall\ell,k, (7b)
=1LtTt,\displaystyle\sum\nolimits_{\ell=1}^{L}t_{\ell}\leq T_{t}, (7c)
tp,kEmax,,k,\displaystyle t_{\ell}p_{\ell,k}\leq E_{\max},\ \forall\ell,k, (7d)
|v,n|=1,,n,\displaystyle|v_{\ell,n}|=1,\ \forall\ell,n, (7e)

where ww_{\ell} denotes the weight of cluster \ell. Since the weight ww_{\ell}, quantization bit mm_{\ell}, and the number of devices KK_{\ell} of each cluster do not affect the scheme design, we set that w=1,m=m~,K=K~,w_{\ell}=1,m_{\ell}=\tilde{m},K_{\ell}=\tilde{K},\forall\ell. Note that (P1) is generally a non-convex optimization problem with non-convex norm-one constraint (7e) and the coupled variables in (7a) and (7d).

II-B Dynamic IRS BF

For the proposed general framework that the BF at IRS can be reconfigured JJ times during TtT_{t}, the clusters can flexibly choose one or multiple BF patterns for computation. The corresponding problem is formulated as

(P2):max{t,j},{𝒗j},{p,j,k}\displaystyle\!\!\!\text{(P2)}:\mathop{\max}\limits_{\scriptstyle\{t_{\ell,j}\},\{\bm{v}_{j}\},\atop\scriptstyle\{p_{\ell,j,k}\}}\;\;\!\!\! =1Lj=1Jwt,jr,jc\displaystyle\sum\nolimits_{\ell=1}^{L}\sum\nolimits_{j=1}^{J}w_{\ell}t_{\ell,j}r_{\ell,j}^{\mathrm{c}} (8a)
s.t.\displaystyle{\rm{s.t.}}\;\;\;\;\;\; t,j0,p,j,k0,,j,k,\displaystyle t_{\ell,j}\geq 0,\ p_{\ell,j,k}\geq 0,\ \forall\ell,j,k, (8b)
=1Lj=1Jt,jTt,\displaystyle\sum\nolimits_{\ell=1}^{L}\sum\nolimits_{j=1}^{J}t_{\ell,j}\leq T_{t}, (8c)
j=1Jt,jp,j,kEmax,,k,\displaystyle\sum\nolimits_{j=1}^{J}t_{\ell,j}p_{\ell,j,k}\leq E_{\max},\ \forall\ell,k, (8d)
|vj,n|=1,j,n.\displaystyle|v_{j,n}|=1,\ \forall j,n. (8e)

Similar to (P1), we assume that w=1,m=m~,K=K~,w_{\ell}=1,m_{\ell}=\tilde{m},K_{\ell}=\tilde{K},\forall\ell. Note that (P2) is still non-convex since there exist highly-coupled variables in both (8a) and (8d), and constraint (8e) is generally non-convex.

III Proposed Algorithms

III-A Proposed Algorithm for (P1)

(P1) is shown as a maximin problem, which can be converted to a traditional maximization problem by intuitively introducing the auxiliary variables to substitute the inner minimization part. Then, multiple techniques can be adopted, such as alternating optimization. However, it is still inefficient and may suffer from performance loss. In this section, by analyzing the inherent properties of (P1), we propose an efficient algorithm correspondingly.

Proposition 1.

The optimal solution of (P1) satisfies,

mink(p,k|h,ke|2)=Emax/tmink|h,ke|2,.\displaystyle\min\nolimits_{k}(p_{\ell,k}^{*}|h^{e}_{\ell,k}|^{2})=E_{\max}/t_{\ell}\min\nolimits_{k}|h^{e}_{\ell,k}|^{2},\ \forall\ell. (9)

Proof: Suppose that 𝒮={{t},{𝒗},{p,k}}\mathcal{S}^{*}=\big{\{}\{t_{\ell}^{*}\},\{\bm{v}_{\ell}^{*}\},\{p_{\ell,k}^{*}\}\big{\}} achieves the optimal solution to (P1), we have p,kEmax/t,,kp_{\ell,k}^{*}\leq E_{\max}/t_{\ell}^{*},\ \forall\ell,k. The maximum allowed transmit power for the devices in cluster \ell becomes P0=Emax/t,P_{0}=E_{\max}/t_{\ell}^{*},\ \forall\ell. By combing it with (4), Proposition 1 is obtained. \hfill\blacksquare

Inspired by Proposition 1, we introduce the auxiliary variables {Γ}\{\Gamma_{\ell}\} that satisfy mink|h,ke|2Γ,\min_{k}|h^{e}_{\ell,k}|^{2}\geq\Gamma_{\ell},\ \forall\ell and reformulate (P1) as

max{t},{𝒗},{Γ}\displaystyle\mathop{\max}\limits_{\scriptstyle\{t_{\ell}\},\{\bm{v}_{\ell}\},\atop\scriptstyle\{\Gamma_{\ell}\}}\;\;\!\!\! =1Ltm~+log2K~log2+(EmaxΓtσ2)\displaystyle\sum\nolimits_{\ell=1}^{L}\frac{t_{\ell}}{\tilde{m}+\log_{2}\tilde{K}}\log_{2}^{+}\left(\frac{E_{\max}\Gamma_{\ell}}{t_{\ell}\sigma^{2}_{\ell}}\right) (10a)
s.t.\displaystyle{\rm{s.t.}}\;\;\;\; |h,ke|2Γ,,k,\displaystyle|h_{\ell,k}^{e}|^{2}\geq\Gamma_{\ell},\ \forall\ell,k, (10b)
(7b),(7c),(7e).\displaystyle\eqref{e_c_P1_t1},\eqref{e_c_P1_t2},\eqref{e_c_P1_v}. (10c)

Note that the objective function is jointly concave with tt_{\ell} and Γ\Gamma_{\ell}, thus the difficulty of problem (10) consists of the non-convex constraints (10b) and (7e).

Let 𝒗¯H=[𝒗H 1]\overline{\bm{v}}_{\ell}^{\mathrm{H}}=[\bm{v}_{\ell}^{\mathrm{H}}\ 1] and 𝒒k=[diag(𝒈H)𝐡,krh,kd]T\bm{q}_{k}=[\mathrm{diag}(\bm{g}_{\ell}^{\mathrm{H}})\mathbf{h}^{r}_{\ell,k}\ {h}^{d}_{\ell,k}]^{\mathrm{T}}. Exploiting the matrix lifting technique, i.e., define 𝐕=𝒗¯𝒗¯H,𝐐,k=𝒒,k𝒒,kH\mathbf{V}_{\ell}=\overline{\bm{v}}_{\ell}\overline{\bm{v}}_{\ell}^{\mathrm{H}},\mathbf{Q}_{\ell,k}=\bm{q}_{\ell,k}\bm{q}_{\ell,k}^{\mathrm{H}}, we have |h,ke|2=Tr(𝐕𝐐,k)|h_{\ell,k}^{e}|^{2}=\mathrm{Tr}(\mathbf{V}_{\ell}\mathbf{Q}_{\ell,k}). Besides, we convert the rank-one constraint of {𝐕}\{\mathbf{V}_{\ell}\} to a difference-of-convex form and add it to objective function as

max{t},{𝐕},{Γ}\displaystyle\!\!\!\!\!\mathop{\max}\limits_{\scriptstyle\{t_{\ell}\},\{\mathbf{V}_{\ell}\},\atop\scriptstyle\{\Gamma_{\ell}\}}\;\;\!\!\! =1Ltlog2+(EmaxΓtσ2)m~+log2K~12ρ=1L(Tr(𝐕)𝐕2)\displaystyle\!\!\sum_{\ell=1}^{L}\!\!\frac{t_{\ell}\log_{2}^{+}\left(\frac{E_{\max}\Gamma_{\ell}}{t_{\ell}\sigma^{2}_{\ell}}\right)}{\tilde{m}+\log_{2}\tilde{K}}\!-\!\frac{1}{2\rho}\!\!\sum_{\ell=1}^{L}(\mathrm{Tr}(\mathbf{V}_{\ell})\!\!-\!\|\mathbf{V}_{\ell}\|_{2})\!\!\! (11a)
s.t.\displaystyle{\rm{s.t.}}\;\;\;\; Tr(𝐕𝐐,k)Γ,,k,\displaystyle\mathrm{Tr}(\mathbf{V}_{\ell}\mathbf{Q}_{\ell,k})\geq\Gamma_{\ell},\ \forall\ell,k, (11b)
𝐕0,,\displaystyle\mathbf{V}_{\ell}\succeq 0,\ \forall\ell, (11c)
(7b),(7c),\displaystyle\eqref{e_c_P1_t1},\eqref{e_c_P1_t2}, (11d)

where ρ>0\rho>0 is the penalty factor, and 𝐕2\|\mathbf{V}_{\ell}\|_{2} denotes the spectral norm of 𝐕\mathbf{V}_{\ell}. The factor ρ\rho is harnessed to penalize the violation of the rank-one constraint. Note that 1/2ρ1/2\rho\rightarrow\infty when ρ0\rho\rightarrow 0, hence the obtained solution satisfies the rank-one constraint. For any given ρ\rho and any given point 𝐕(i)\mathbf{V}_{\ell}^{(i)} in the ii-th iteration, by linearizing Tr(𝐕)𝐕2\mathrm{Tr}(\mathbf{V}_{\ell})\!-\!\|\mathbf{V}_{\ell}\|_{2} to Tr((𝐈𝝀(𝐕(i))𝝀H(𝐕(i)))𝐕)\mathrm{Tr}((\mathbf{I}-\bm{\lambda}(\mathbf{V}_{\ell}^{(i)})\bm{\lambda}^{\mathrm{H}}(\mathbf{V}_{\ell}^{(i)}))\mathbf{V}_{\ell}) via the successive convex approximation (SCA) technique,111𝝀H()\bm{\lambda}^{\mathrm{H}}(\cdot) is the eigenvector corresponding to the biggest eigenvalue. problem (11) is converted to a convex optimization problem. Hence, by decreasing the value of ρ\rho and updating 𝐕(i)\mathbf{V}_{\ell}^{(i)} in each iteration, the convergence of proposed algorithm for (P1) can be finally reached [15].

III-B Proposed Algorithms for (P2)

For (P2), we reveal the following proposition to simplify the original problem and then propose an efficient algorithm.

Proposition 2.

The optimal solution of (P2) satisfies

mink(p,j,k|h,j,ke|2)=e,j/t,jmink|h,j,ke|2,,\displaystyle\min_{k}(p_{\ell,j,k}^{*}|{h}^{e}_{\ell,j,k}|^{2})=e_{\ell,j}/t_{\ell,j}^{*}\min_{k}|{h}^{e}_{\ell,j,k}|^{2},\ \forall\ell, (12)

where e,je_{\ell,j} denotes the maximum energy of the devices in cluster \ell allocated for IRS BF pattern jj, which satisfies j=1Je,jEmax,\sum\nolimits_{j=1}^{J}e_{\ell,j}\leq E_{\max},\ \forall\ell.

Proof: Suppose that 𝒮={{t,j},{𝒗j},{p,j,k}}\mathcal{S}^{*}=\big{\{}\{t_{\ell,j}^{*}\},\{\bm{v}_{j}^{*}\},\{p_{\ell,j,k}^{*}\}\big{\}} achieves the optimal solution to (P2), we have p,j,kt,je,j,,j,kp_{\ell,j,k}^{*}t_{\ell,j}^{*}\leq e_{\ell,j},\ \forall\ell,j,k. The maximum allowed transmit power for the devices in cluster \ell under BF pattern 𝒗j\bm{v}_{j}^{*} becomes P0=e,j/t,j,P_{0}=e_{\ell,j}/t_{\ell,j}^{*},\ \forall\ell. By combing it with (4), Proposition 2 is obtained. \hfill\blacksquare

According to Proposition 2, (P2) can be reformulated as

max{t,j},{𝒗j},{e,j}\displaystyle\mathop{\max}\limits_{\scriptstyle\{t_{\ell,j}\},\{\bm{v}_{j}\},\atop\scriptstyle\{e_{\ell,j}\}}\;\;\!\!\! =1Lj=1Jt,jlog2+(e,jmink|h,j,ke|2t,jσ2)m~+log2K~\displaystyle\sum\nolimits_{\ell=1}^{L}\sum\nolimits_{j=1}^{J}\frac{t_{\ell,j}\log_{2}^{+}\!\left(\!\frac{e_{\ell,j}\min_{k}|h^{e}_{\ell,j,k}|^{2}}{t_{\ell,j}\sigma^{2}_{\ell}}\!\right)}{\tilde{m}+\log_{2}\tilde{K}} (13a)
s.t.\displaystyle{\rm{s.t.}}\;\;\;\;\;\; j=1Je,jEmax,\displaystyle\sum\nolimits_{j=1}^{J}e_{\ell,j}\leq E_{\max}, (13b)
(8b),(8c),(8e).\displaystyle\ \eqref{e_c_P2_t1},\eqref{e_c_P2_t2},\eqref{e_c_P2_v}. (13c)

By introducing the auxiliary variables {γ,j}\{\gamma_{\ell,j}\} that satisfy mink|h,j,ke|2γ,j,\min_{k}|{h}^{e}_{\ell,j,k}|^{2}\geq\gamma_{\ell,j},\ \forall\ell, we have

max{t,j},{γ,j},{e,j},{𝒗j}\displaystyle\mathop{\max}\limits_{\scriptstyle\{t_{\ell,j}\},\{\gamma_{\ell,j}\},\atop\scriptstyle\{e_{\ell,j}\},\{\bm{v}_{j}\}}\;\;\!\!\! =1Lj=1Jt,jlog2+(e,jγ,jt,jσ2)m~+log2K~\displaystyle\sum\nolimits_{\ell=1}^{L}\sum\nolimits_{j=1}^{J}\frac{t_{\ell,j}\log_{2}^{+}\left(\frac{e_{\ell,j}\gamma_{\ell,j}}{t_{\ell,j}\sigma^{2}_{\ell}}\right)}{\tilde{m}+\log_{2}\tilde{K}} (14a)
s.t.\displaystyle{\rm{s.t.}}\;\;\;\;\;\; |h,j,ke|2γ,j,,j,k\displaystyle|{h}^{e}_{\ell,j,k}|^{2}\geq\gamma_{\ell,j},\ \forall\ell,j,k (14b)
(8b),(8c),(8e),(13b).\displaystyle\eqref{e_c_P2_t1},\eqref{e_c_P2_t2},\eqref{e_c_P2_v},\eqref{e_c_P21_e}. (14c)

Introducing slack variables S,j=e,jγ,j,,jS_{\ell,j}=e_{\ell,j}\gamma_{\ell,j},\ \forall\ell,j, it yields

max{t,j},{γ,j},{S,j},{e,j},{𝒗j}\displaystyle\mathop{\max}\limits_{\scriptstyle\{t_{\ell,j}\},\{\gamma_{\ell,j}\},\atop\scriptstyle\{S_{\ell,j}\},\{e_{\ell,j}\},\{\bm{v}_{j}\}}\;\;\!\!\! =1Lj=1Jt,jlog2+(S,jt,jσ2)m~+log2K~\displaystyle\sum\nolimits_{\ell=1}^{L}\sum\nolimits_{j=1}^{J}\frac{t_{\ell,j}\log_{2}^{+}\left(\frac{S_{\ell,j}}{t_{\ell,j}\sigma^{2}_{\ell}}\right)}{\tilde{m}+\log_{2}\tilde{K}} (15a)
s.t.\displaystyle{\rm{s.t.}}\;\;\;\;\;\; S,je,jγ,j,,j,\displaystyle S_{\ell,j}\leq e_{\ell,j}\gamma_{\ell,j},\ \forall\ell,j, (15b)
(8b),(8c),(8e),(13b),(14b).\displaystyle\eqref{e_c_P2_t1},\eqref{e_c_P2_t2},\eqref{e_c_P2_v},\eqref{e_c_P21_e},\eqref{e_c_h2}. (15c)

Since the objective value can always be increased by raising S,jS_{\ell,j} until the constraint (15b) becomes active, it can be seen that the constraint (15b) is satisfied with equality for the optimal solution to problem (15).

Observing that the objective function of problem (15) is show as a concave function with respect to the variables S,jS_{\ell,j} and t,jt_{\ell,j}, but the constraints (8e), (14b), and (15b) are still non-convex. First, we convert (15b) into

S,j((e,j+γ,j)2(e,j2+γ,j2))/2,,j,\displaystyle S_{\ell,j}\leq\big{(}(e_{\ell,j}+\gamma_{\ell,j})^{2}-(e_{\ell,j}^{2}+\gamma_{\ell,j}^{2})\big{)}/2,\ \forall\ell,j, (16)

and then apply SCA technique to linearize the right-hand sides of (16) with given points {e,j(i),γ,j(i)}\{e_{\ell,j}^{(i)},\gamma_{\ell,j}^{(i)}\} in the ii-th iteration as

((e,j(i)+γ,j(i))2+2(e,j(i)+γ,j(i))(e,j+γ,je,j(i)γ,j(i))\displaystyle\big{(}(e_{\ell,j}^{(i)}+\gamma_{\ell,j}^{(i)})^{2}+2(e_{\ell,j}^{(i)}+\gamma_{\ell,j}^{(i)})(e_{\ell,j}+\gamma_{\ell,j}-e_{\ell,j}^{(i)}-\gamma_{\ell,j}^{(i)})
(e,j2+γ,j2))/2.\displaystyle\quad-(e_{\ell,j}^{2}+\gamma_{\ell,j}^{2})\big{)}/2. (17)

Second, we convert (14b) to

|𝒗¯jH𝒒k|2γ,j,,j,k,\displaystyle|\overline{\bm{v}}_{j}^{\mathrm{H}}\bm{q}_{k}|^{2}\geq\gamma_{\ell,j},\ \forall\ell,j,k, (18)

and linearize it as

2{(𝒗¯j(i))H𝐐,j,k𝒗¯j}(𝒗¯j(i))H𝐐,j,k𝒗¯j(i)γ,j,\displaystyle\vspace{-2pt}2\Re\{(\overline{\bm{v}}_{j}^{(i)})^{\mathrm{H}}\mathbf{Q}_{\ell,j,k}\overline{\bm{v}}_{j}\}-(\overline{\bm{v}}_{j}^{(i)})^{\mathrm{H}}\mathbf{Q}_{\ell,j,k}\overline{\bm{v}}_{j}^{(i)}\geq\gamma_{\ell,j},\vspace{-2pt} (19)

in which 𝒗¯j(i)\overline{\bm{v}}_{j}^{(i)} is the given point in iteration ii. Furthermore, problem (15) is approximated as a convex optimization problem by loosening (8e) to |v¯j,n|1,j,n|\overline{v}_{j,n}|\leq 1,\ \forall j,n which can be addressed by the standard solvers, the corresponding unit-modulus BF patterns are then obtained by subtracting the phases.

The proposed algorithm for tackling the dynamic IRS BF scheme is shown as solving a series of convex problems. Specifically, problem (15) has five kinds of variables, the last of which has dimension JNJN, and the others are LJLJ. Hence, the corresponding computational complexity is given by 𝒪(IiterJ3.5(4L+N)3.5){\cal O}\left(I_{\mathrm{iter}}J^{3.5}(4L+N)^{3.5}\right) via the interior-point method [20], in which IiterI_{\mathrm{iter}} stands for the quantity of iterations required to attain convergence. Note that the amount of constraints and optimization variables scale linearly with the number of participated clusters LL and BF patterns JJ, it can be observed that the related computational complexity is extraordinarily large.

Remark 1.

By analyzing (P2), we can observe that the optimal association between IRS BF patterns {𝚯j}\{\mathbf{\Theta}_{j}\} and clusters is binary [14], i.e., each cluster only needs to select one BF pattern to compute the function.

Low-complexity algorithm for (P2): Inspired by Remark 1 and the cluster-adaptive BF scheme, we can propose an efficient low-complexity algorithm. We assume that each cluster is assigned with a specific BF as the cluster-adaptive scheme presented in Section III-A.The clusters are then arranged in descending order by their respective computation rates. Each of the first J1J\!-1 clusters is given with a dedicated BF pattern and the remaining LJ+1L\!-\!J+1 clusters are assumed to employ the same BF pattern 𝚯J\mathbf{\Theta}_{J}. Finally, we jointly optimize the time allocation and 𝚯J\mathbf{\Theta}_{J}. By relaxing the norm-one constraints of IRS BF, the computational complexity of the first phase is given by 𝒪(LN3.5){\cal O}(LN^{3.5}) [20]. The maximum computation complexity of the sorting algorithm is given by 𝒪(L2){\cal O}(L^{2}). With given IRS BF, we have η=Emax/t|h,mine|2, 1J1\eta_{\ell}=E_{\max}/t_{\ell}|h^{e}_{\ell,\min}|^{2},\;1\leq\ell\leq J-1 where |h,mine|2=mink|h,ke|2|h^{e}_{\ell,\min}|^{2}=\min_{k}|h^{e}_{\ell,k}|^{2}. Reformulate the problem as (14), the remaining variables to be optimized is 𝒗J,{t},{γ}\bm{v}_{J},\{t_{\ell}\},\{\gamma_{\ell}\}, where the dimensions are given by N,L,LJ+1N,L,L-J+1, respectively. By linearizing the non-convex constraint respect to {γ}\{\gamma_{\ell}\} via SCA technique and relaxing the norm-one constraint, the problem can be converted to a convex one. Hence, the overall computational complexity of this low-complexity algorithm is given by 𝒪(LN3.5+L2+Iiter(2LJ+N+1)3.5){\cal O}(LN^{3.5}+L^{2}+I_{\textrm{iter}}(2L-J+N+1)^{3.5}) [20].

IV Simulation Results

In this section, we provide numerical results to demonstrate the efficiency of the proposed schemes and the insights for IRS-aided multiple-cluster AirComp system. The AP and the IRS are located at (0,0,10)(0,0,10) meter (m) and (10,0,10)(10,0,10) m, respectively. The placement of each device is random within a radius of 1010 m that centered at (10,10,0)(10,10,0) m. The path-loss exponents of the AP-devices channels are set to 3.3, whereas the path-loss exponents for the AP-IRS and IRS-devices channels are set to 2.3. Additionally, the signal attenuation at a reference distance of 11 m is set as 3030 dB. Unless otherwise stated, other parameters are set as: K=K~=5,K_{\ell}=\tilde{K}=5,\ \forall\ell, L=5L=5, Emax=0.01E_{\max}=0.01 J, σz2=80\sigma_{z}^{2}=-80 dBm, and Tt=0.1T_{t}=0.1 s. For comparison, we consider the following schemes: 1) Upper bound: relaxing the rank-one constraint in problem (11), which serves as a performance upper bound; 2) Cluster-adaptive BF: the approach in Section III-A to solve (P1); 3) Dynamic BF: the approach in Section III-B to solve (P2); 4) Low-complexity algorithm: the proposed low-complexity algorithm for (P2).

Refer to caption
Figure 3: Computation rate versus the number of IRS elements NN.
Refer to caption
Figure 4: Computation rate versus the number of IRS BF patterns JJ.

In Fig. 3, we analyze the system performance versus the number of IRS elements NN. It can be seen that the dynamic IRS BF designs in the case of L=JL=J can perform almost as well as the special case with cluster-adaptive BF and nearly as the upper bound. Additionally, it can be seen that the low-complexity algorithm is capable of achieving the same performance as the joint optimization strategy. This shows the applicability of the proposed DIBF design. In comparison to the cases that BFs are randomly given and without IRS, it is observed that the proposed algorithms can greatly enhance computation rate, with the performance gap widening as the number of IRS elements NN rises. Besides, the computation rate of IRS-assisted AirComp with optimized IRS phase monotonically increases with respect to the number of IRS elements NN since more elements can reflect more captured signal energy.

As shown in Fig. 4, we depict the computation rate versus the number of available BF patterns JJ. It can be seen that the low-complexity algorithm for DIBF can achieve similar performance as the previously proposed joint optimization algorithm, making it more desirable in practice. In addition, one can see that the dynamic IRS BF scheme’s performance improvement progressively reaches saturation as JJ rises. Employing a total of J=7J=7 (J=4)J=4) BF patterns is virtually able to attain the maximum performance for L=10L=10 (L=5L=5), and adding JJ further only results in a marginal performance enhancement. It is further noted that the proposed DIBF design can achieve nearly optimal computation rate with a negligible performance loss (less than 6%6\%) with only half the number of the maximum BF patterns, demonstrating its potential to meet the performance-cost tradeoff by managing the number of BF patterns.

V Conclusion

In this paper, we studied the IRS-aided multi-cluster AirComp system and proposed two types of IRS BF schemes, which struck a balance between the system performance and cost. By jointly optimizing the time allocation for various clusters, the phase shifts at IRS, and the power allocation at devices, the sum computation rate maximization problems for the two scenarios were addressed. We proposed both general and low-complexity algorithms to tackle the optimization problem with any number of IRS BF patterns, and thus provided considerable flexibility in balancing between the performance gain of DIBF and its consequent system cost. Numerical results demonstrated that IRS can significantly improve the function computation performance, and presented the performance-cost tradeoff for IRS-aided AirComp. In particular, the dynamic IRS BF scheme was validated to be a cost-efficient strategy to achieve close-to-optimal performance with only half number of IRS BF patterns, and the proposed low-complexity algorithm can achieve satisfactory performance compared to the generic design.

References

  • [1] X. You et al., “Towards 6G wireless communication networks: Vision, enabling technologies, and new paradigm shifts,” Sci. China Inf. Sci., vol. 64, no. 1, pp. 1–74, Jan. 2021.
  • [2] K. B. Letaief, Y. Shi, J. Lu, and J. Lu, “Edge artificial intelligence for 6G: Vision, enabling technologies, and applications,” IEEE J. Sel. Areas Commun., vol. 40, no. 1, pp. 5–36, Jan. 2022.
  • [3] X. Li et al., “Integrated sensing, communication, and computation over-the-air: MIMO beamforming design,” IEEE Trans. Wireless Commun., early access, doi: 10.1109/TWC.2022.3233795.
  • [4] W. Saad, M. Bennis, and M. Chen, “A vision of 6G wireless systems: Applications, trends, technologies, and open research problems,” IEEE Netw., vol. 34, no. 3, pp. 134–142, May 2020.
  • [5] Y. Shi, K. Yang, T. Jiang, J. Zhang, and K. B. Letaief, “Communication-efficient edge AI: Algorithms and systems,” IEEE Commun. Surv. Tutorials, vol. 22, no. 4, pp. 2167–2191, 4th Quat., 2020.
  • [6] Z. Wang, Y. Zhao, Y. Zhou, Y. Shi, C. Jiang, and K. B. Letaief, “Over-the-air computation: Foundations, technologies, and applications,” 2022, arXiv: 2210.10524. [Online]. Available: https://arxiv.org/abs/2210.10524
  • [7] G. Zhu and K. Huang, “MIMO over-the-air computation for high-mobility multimodal sensing,” IEEE Internet Things J., vol. 6, no. 4, pp. 6089–6103, Aug. 2019.
  • [8] Y. Zhao et al., “Performance-oriented design for intelligent reflecting surface assisted federated learning,” IEEE Trans. Commun., early access, doi: 10.1109/TCOMM.2023.3283799.
  • [9] H. Guo et al., “Over-the-air aggregation for federated learning: Waveform superposition and prototype validation,” J. Commun. Inf. Networks, vol. 6, no. 4, pp. 429–442, Dec. 2021.
  • [10] L. Chen, X. Qin, and G. Wei, “A uniform-forcing transceiver design for over-the-air function computation,” IEEE Wirel. Commun. Lett., vol. 7, no. 6, pp. 942–945, Dec. 2018.
  • [11] Z. Wang, J. Qiu, Y. Zhou, Y. Shi, L. Fu, W. Chen, and K. B. Letaief, “Federated learning via intelligent reflecting surface,” IEEE Trans. Wirel. Commun., vol. 21, no. 2, pp. 808–822, Jul. 2022.
  • [12] X. Zhai, G. Han, Y. Cai, and L. Hanzo, “Beamforming design based on two-stage stochastic optimization for RIS-assisted over-the-air computation systems,” IEEE Internet Things J., vol. 9, no. 7, pp. 5474–5488, Apr. 2022.
  • [13] M. Goldenbaum, H. Boche, and S. Stanczak, “Nomographic functions: Efficient computation in clustered gaussian sensor networks,” IEEE Trans. Wirel. Commun., vol. 14, no. 4, pp. 2093–2105, Apr. 2015.
  • [14] Q. Wu, X. Zhou, W. Chen, J. Li, and X. Zhang, “IRS-aided WPCNs: A new optimization framework for dynamic IRS beamforming,” IEEE Trans. Wirel. Commun., vol. 21, no. 7, pp. 4725–4739, Jul. 2022.
  • [15] M. Hua and Q. Wu, “Joint dynamic passive beamforming and resource allocation for IRS-aided full-duplex WPCN,” IEEE Trans. Wirel. Commun., vol. 21, no. 7, pp. 4829–4843, Jul. 2022.
  • [16] G. Chen, Q. Wu, R. Liu, J. Wu, and C. Fang, “IRS aided MEC systems with binary offloading: A unified framework for dynamic IRS beamforming,” IEEE J. Sel. Areas Commun., vol. 41, no. 2, pp. 349–365, Feb. 2023.
  • [17] B. Nazer and M. Gastpar, “Compute-and-forward: Harnessing interference through structured codes,” IEEE Trans. Inf. Theory, vol. 57, no. 10, pp. 6463–6486, Oct. 2011.
  • [18] Q. Wu, S. Zhang, B. Zheng, C. You, and R. Zhang, “Intelligent reflecting surface-aided wireless communications: A tutorial,” IEEE Trans. Commun., vol. 69, no. 5, pp. 3313–3351, Jan. 2021.
  • [19] Q. Wu and R. Zhang, “Intelligent reflecting surface enhanced wireless network via joint active and passive beamforming,” IEEE Trans. Wirel. Commun., vol. 18, no. 11, pp. 5394–5409, Aug. 2019.
  • [20] S. J. Wright, Primal-dual interior-point methods.   SIAM, 1997.