An Online Scheduling Algorithm for Energy Minimization in Wireless Powered Mobile Edge Computing Networks

Xingqiu He1, Yuhang Shen2, Xiong Wang3, Sheng Wang4, Shizhong Xu5 and Jing Ren6 School of Information and Communication Engineering, University of Electronic Science and Technology of China
E-mail: 1hexqiu@gmail.com, 2shenyuhang@std.uestc.edu.cn, 345 6 {wangxiong, wsh_keylab, xsz, renjing}@uestc.edu.cn

Abstract

The integration of Mobile Edge Computing (MEC) and Wireless Power Transfer (WPT), which is usually referred to as Wireless Powered Mobile Edge Computing (WP-MEC), has been recognized as a promising technique to enhance the lifetime and computation capacity of wireless devices (WDs). Compared to the conventional battery-powered MEC networks, WP-MEC brings new challenges to the computation scheduling problem because we have to jointly optimize the resource allocation in WPT and computation offloading. In this paper, we consider the energy minimization problem for WP-MEC networks with multiple WDs and multiple access points. We design an online algorithm by transforming the original problem into a series of deterministic optimization problems based on the Lyapunov optimization theory. To reduce the time complexity of our algorithm, the optimization problem is relaxed and decomposed into several independent subproblems. After solving each subproblem, we adjust the computed values of variables to obtain a feasible solution. Extensive simulations are conducted to validate the performance of the proposed algorithm.

Index Terms:

wireless power transfer, mobile edge computing, Lyapunov optimization, energy minimization

I Introduction

With the rapid development in recent years, the Internet of Things (IoT) technology has played an important role in the intelligent and autonomous control of many industrial and commercial systems, such as smart grid and smart cities [1]. Due to the stringent size constraint and production cost consideration, the ubiquitously deployed IoT devices usually have restricted computation capability and finite battery capacity, which severely degrades the quality of service experienced by users. To handle the two fundamental performance limitations, Wireless Powered Mobile Edge Computing (WP-MEC) has been proposed as a novel paradigm that combines the advantages of Wireless Power Transfer (WPT) and Mobile Edge Computing (MEC).

As a promising approach that provides sustainable energy supply for wireless devices (WDs), WPT utilizes dedicated energy transmitters to broadcast radio frequency (RF) signals. The received RF signals can be converted to electricity by energy harvesting circuits and used to charge WDs continuously. On the other hand, MEC is a newly emerged computing paradigm that enables WDs to offload their computation tasks to nearby edge servers located at the edge of radio access networks. As the integration of both techniques, WP-MEC charges WDs with WPT and alleviates WDs’ computation workloads with MEC. As a result, the WDs’ device lifetime and computation capacity are simultaneously improved, which leads to significantly enhanced user experiences.

In this paper, we study the computation offloading and system resource allocation in WP-MEC networks. Compared with the computation scheduling in conventional battery-powered MEC networks, our problem is much more challenging because 1) the optimal control decisions depend on the remaining energy in the battery and 2) WPT and computation offloading need to share the same limited system resources, such as time and frequency.

I-A Prior Works

Computation scheduling in conventional MEC networks has been extensively studied and was systematically summarized in [2] and [3]. Recently, the advancements in WPT technology bring in the possibility of building wireless powered MEC networks. To the best of our knowledge, the works in [4] and [5] are the first that simultaneously study energy harvesting and computation offloading in MEC networks, but they only consider simple networks with only one WD and one access point (AP). Their model was extended by subsequent researches to incorporate more WDs. The works in [6] and [7] consider the WP-MEC networks with two near-far WDs and try to resolve the so-called “double-near-far” effect, which occurs because a farther device harvests less energy from the AP but spends more power to communicate in longer distances. The authors in [8] and [9] aim to maximize the overall computation rate of all WDs in the networks, where an unmanned aerial vehicle was utilized to transmit energy in [9]. Due to the existence of binary offloading variables, the considered problems are generally formulated as mixed integer programming (MIP) problems and require a prohibitively long time to solve. In order to make real-time control decisions in fast fading environments, a deep reinforcement learning-based algorithm is proposed in [10] to obtain near-optimal solutions in large-scale WP-MEC networks. To the best of our knowledge, [11] is the only work that considers WP-MEC networks with multiple APs, where an approximation algorithm is derived to maximize the ratio of computation tasks completed before their deadlines.

The computation offloading considered in [4, 5, 6, 7, 8, 9, 10, 11] operate in binary mode, i.e. the tasks are non-splittable and are either processed by WDs or fully offloaded to APs. In addition to binary offloading, the scheduling problem is also investigated under partial offloading, where tasks can be divided into smaller parts and executed on WDs and APs concurrently. The work in [12] aims to optimize the energy consumption of multiple WDs and one AP, where the AP is assumed to have multiple antennas. The authors in [13] examined the computation efficiency maximization problem under both binary offloading and partial offloading modes. They also studied the problem under non-orthogonal multiple access (NOMA) in addition to the widely used time division multiple access (TDMA) scheme.

The researches described above generally focus on the one-shot optimization where the scheduling problem is only considered in a specific time interval. However, in practical settings, the WP-MEC networks are operated under sustainable manners and the control decisions at different times are mutually dependent. For example, the energy harvested in the current time frame, if not fully consumed, may be stored in the battery for later use. Inspired by this fact, many recent works try to solve the scheduling problem in online settings. A one-WD-one-AP model is considered in [14] where the energy consumption is minimized by optimizing the power of the energy transmitter and the offloading decisions of WDs. The model is extended to include multiple WDs in [15] where the long-term system throughput is maximized. In [16], the authors designed an online algorithm based on the Lyapunov optimization theory and presented a theoretical tradeoff between energy efficiency and delay.

I-B Our Contributions

In this paper, we study the computation scheduling problem in WP-MEC networks with multiple WDs and multiple APs. Different from [11], we formulate our problem under an online setting where both channel states and computation data arrivals fluctuate over time. This brings new challenges in algorithm design as decision variables are coupled along the timeline. To facilitate the process of tasks, the partial offloading mode is adopted in our model. Our objective is to minimize the long-term energy consumption by jointly optimizing the resource allocation in WPT and wireless communication stages. To avoid the mutual interference of concurrently emitted energy waves [17], only one AP is allowed to broadcast RF energy at the same time. By choosing different APs for WPT in turn, we also alleviate the “double near-far” effect because each WD has a chance to harvest energy from a closer AP. Our main contributions are summarized as follows.

•

We formulate the energy minimization problem for WP-MEC networks with partial offloading. To the best of our knowledge, this is the first work that considers multiple WDs and multiple APs under online settings.
•

We design an online algorithm by transforming the original problem into a series of deterministic optimization problems in each time slot based on the Lyapunov optimization theory. To reduce the time complexity of our algorithm, we propose a relax-then-adjust technique where the optimization problem is first relaxed and decomposed into several independent subproblems. After solving each subproblem, we adjust the computed values of variables to obtain a feasible solution.
•

To solve the non-convex computation offloading subproblem obtained in the previous step, we propose an iterative algorithm based on the Alternating Minimization method. We demonstrate that each iterative step can be solved in polynomial time and the algorithm converges to local optima.
•

Extensive simulations are conducted to validate the performance of our algorithm. Numerical results show that the total energy consumption is significantly reduced under various settings.

The rest of the paper is organized as follows. In Section II, we introduce the system model and the problem formulation. An online algorithm that jointly optimizes the WPT and computation scheduling is proposed in Section III. The simulations and related numerical results are presented in Section IV and we conclude our paper in Section V.

II System Model and Problem Formulation

Refer to caption — Figure 1: An example of system model and time allocation.

As shown in Fig. 1, we consider a WP-MEC network consisting of $N$ WDs and $M$ APs, where each AP is integrated with an RF energy transmitter and a MEC server. APs are assumed to have a stable power supply and broadcast RF energy to WDs. The energy harvested by each WD is stored in a rechargeable battery, which is used to power its computing and communication operations. Similar to [11], we assume that WPT and wireless communications (for offloading) of the same AP cannot be performed simultaneously and the TDMA protocol is applied to avoid mutual interference, but WPT and wireless communications of different APs can be operated simultaneously over orthogonal frequency bands.

The time horizon is divided into slots with equal length $T$ , where each slot consists of four phases, i.e., WPT, computation offloading, edge computing, and result downloading, as illustrated in Fig. 1. Note that we have assumed WDs have simultaneous wireless information and power transfer (SWIPT) abilities, as shown in [18]. However, our model can be easily tailored for WDs without SWIPT capabilities, as discussed in Section II-C. Since MEC servers have strong computation capacities compared with WDs and the computation results are of small data sizes, the time consumption for edge computing and result downloading is negligible [11, 12, 13, 15, 16]. Therefore, we only consider the duration of the first two phases in our model. Let $h^{D}_{ij}(t)$ and $h^{U}_{ij}(t)$ denote the downlink and uplink channel gain between WD $i$ and AP $j$ on slot $t$ . If WD $i$ beyonds the communication scape of AP $j$ , the corresponding channel gain is zero. As in [14, 15, 16], we assume all channels follow quasi-static flat-fading, i.e., the channel state remains constant within each time slot, but may vary across different slots. Our goal is to minimize the total energy consumption for processing the data of WDs.

II-A Wireless Power Transfer and Energy Harvesting Model

According to [17], when multiple APs transfer RF power simultaneously, their energy waves may interfere with each other and lead to a possible energy cancellation. To improve the efficiency of RF energy transfer, we select only one AP to broadcast RF energy during each time slot. By choosing different APs in turn, we also alleviate the “double near-far” effect because each WD has a chance to harvest energy from a close AP. We use $a^{T}_{j}(t)\in\{0,1\}$ to indicate whether AP $j$ broadcasts RF energy in slot $t$ and let $P^{T}_{j}(t)$ and $\tau^{T}_{j}(t)$ be the corresponding transmission power and transmission time. Then $a^{T}_{j}(t)$ should satisfy

\sum_{j=1}^{M}a^{T}_{j}(t)\leq 1

(1)

and the energy consumption of AP $j$ during the WPT is

E^{T}_{j}(t)=a^{T}_{j}(t)P^{T}_{j}(t)\tau^{T}_{j}(t).

(2)

As in [16, 19], we assume the energy harvested from noise is negligible and adopt a linear model to characterize the energy harvesting circuit of WDs. In particular, the energy harvested by WD $i$ during the $t$ -th time slot is

E^{H}_{i}(t)=\sum_{j=1}^{M}\mu_{i}a^{T}_{j}(t)P^{T}_{j}(t)h^{D}_{ij}(t)\tau^{T}_{j}(t)

(3)

where $\mu_{j}\in(0,1)$ is the energy conversion efficiency of WD $i$ . The harvested energy is stored in the battery of WDs. Let $B^{max}_{i}$ and $B_{i}(t)$ be the capacity and remaining power of the battery in WD $i$ , then the update rule of $B_{i}(t)$ is

B_{i}(t+1)=\min\left[B_{i}(t)-E^{L}_{i}(t)-E^{O}_{i}(t)+E^{H}_{i}(t),B^{max}_{i}\right]

where $E^{L}_{i}(t)$ and $E^{O}_{i}(t)$ are the energy consumption for local computation and wireless communication, as explained in the next subsection. Due to the energy causality constraint, the energy consumption cannot exceed the available energy in the battery, so we must ensure

E^{L}_{i}(t)+E^{O}_{i}(t)\leq B_{i}(t).

(4)

II-B Computation Scheduling Model

In each time slot $t$ , let $A_{i}(t)$ be the amount of computation data arrived at WD $i$ . Without loss of generality, we assume $A_{i}(t)$ is i.i.d. with average rate $\mathbb{E}[A_{i}(t)]=\lambda_{i}$ . The arrived data can be either processed locally or offloaded to the APs. The length of queueing data $Q_{i}(t)$ at WD $i$ evolves according to the following equation

Q_{i}(t+1)=Q_{i}(t)-D^{L}_{i}(t)-D^{O}_{i}(t)+A_{i}(t)

where $D^{L}_{i}(t)$ and $D^{O}_{i}(t)$ are the amount of locally processed data and offloaded data, respectively. Note that the new data may arrive at the end of slot $t$ and cannot be processed until the beginning of next slot, so $D^{L}_{i}(t)$ and $D^{O}_{i}(t)$ must satisfy

D^{L}_{i}(t)+D^{O}_{i}(t)\leq Q_{i}(t).

(5)

II-B1 Local Computation

As in previous researches, we assume the local computation and WPT can be performed simultaneously and each WD $i$ adopts the Dynamic Voltage and Frequency Scaling technique [20] to control its CPU frequency $f_{i}(t)$ . Let $\tau^{L}_{i}(t)$ be the local computation time of WD $i$ in slot $t$ , then we can express $D^{L}_{i}(t)$ with the following equation

D^{L}_{i}(t)=\frac{f_{i}(t)\tau^{L}_{i}(t)}{\phi_{i}}

(6)

where $\phi_{i}$ is the number of CPU cycles required to process one bit of computation data. According to [6], the energy consumption for the local computation of WD $i$ is

E^{L}_{i}(t)=\kappa_{i}f^{3}_{i}(t)\tau^{L}_{i}(t)

(7)

where $\kappa_{i}$ is the energy efficiency coefficient of the chip equipped with WD $i$ . By substituting (7) into (6) we can obtain

D^{L}_{i}(t)=\sqrt[3]{\frac{E^{L}_{i}(t){\tau^{L}_{i}(t)}^{2}}{\kappa_{i}}}\cdot\frac{1}{\phi_{i}}

which is an increasing function with respect to the computation time $\tau^{L}_{i}(t)$ under fixed energy consumption $E^{L}_{i}(t)$ . Therefore, we can simply set

\tau^{L}_{i}(t)=T

to maximize the computation data processed locally.

II-B2 Computation Offloading

Let $\tau_{i}(t)$ and $P_{i}(t)$ denote the offloading time and the transmit power of WD $i$ , respectively. Suppose each WD can only communicate with at most one AP within one time slot. We use the binary variable $a_{ij}(t)\in\{0,1\}$ to indicate whether WD $i$ is communicating with AP $j$ , then $a_{ij}(t)$ should satisfy

\sum_{j=1}^{M}a_{ij}(t)\leq 1

(8)

and the amount of computation data offloaded from WD $i$ to APs can be expressed as

D^{O}_{i}=\sum_{j=1}^{M}\frac{a_{ij}(t)B\tau_{i}(t)}{v_{i}}\log_{2}\left(1+\frac{P_{i}(t)h^{U}_{ij}(t)}{\sigma_{j}^{2}}\right)

where $B$ is the spectrum bandwidth, $\sigma^{2}_{j}$ is the noise power of AP $j$ , and $v_{i}>1$ indicates the communication overhead induced by encryption and packet header [8, 13]. The energy consumption for the computation offloading of WD $i$ is

E^{O}_{i}=\sum_{j=1}^{M}a_{ij}(t)P_{i}(t)\tau_{i}(t)=P_{i}(t)\tau_{i}(t).

Similar to [12], we assume the energy consumption for the computation at AP $j$ is proportional to the total data it received from WDs

E^{C}_{j}(t)=\sum_{i=1}^{N}\eta a_{ij}(t)\phi_{i}D^{O}_{i}(t)

where $\eta$ is the energy consumption per CPU cycle of APs.

II-C Problem Formulation

To reduce the carbon footprint, in this paper, we aim to minimize the energy consumption for processing computation data arrived at WDs by jointly optimizing the control decisions for WPT, local computation, and data offloading. A similar objective is also considered in [7, 16, 5]. Based on our model, the considered problem can be formulated as follows.

$\displaystyle\min\quad$	$\displaystyle\lim_{H\to\infty}\frac{1}{H}\sum_{t=0}^{H-1}\sum_{j=1}^{M}\mathbb{E}\left\{E^{T}_{j}(t)+E^{C}_{j}(t)\right\}$
$\displaystyle s.t.\quad$	$\displaystyle\eqref{cons:a^T_j},\eqref{cons:energy_causality},\eqref{cons:Q_i},\eqref{cons:a_ij}$
	$\displaystyle a^{T}_{j}(t)\tau^{T}_{j}(t)+\sum_{i=1}^{N}a_{ij}(t)\tau_{i}(t)\leq T$	$\displaystyle\forall j,\forall t$	(9)
	$\displaystyle 0\leq\tau^{T}_{j}(t)\leq T,0\leq P^{T}_{j}\leq P^{T,max}_{j}$	$\displaystyle\forall j,\forall t$	(10)
	$\displaystyle 0\leq\tau_{i}(t)\leq T,0\leq P_{i}(t)\leq P^{max}_{i}$	$\displaystyle\forall i,\forall t$	(11)
	$\displaystyle 0\leq f_{i}(t)\leq f^{max}_{i}$	$\displaystyle\forall i,\forall t$	(12)
	$\displaystyle Q_{i}(t)\mbox{ is stable}$	$\displaystyle\forall i,\forall t$

where $P^{max}_{i}$ and $f^{max}_{i}$ is the maximum offloading power and computation capacity of WD $i$ . Constraint (9) ensures the time allocation is feasible under the TDMA protocol. The objective reflects APs’ long-term energy consumption. Since WDs harvest energy from APs, this is also the total energy consumption in the system.

For WDs without SWIPT capabilities, we only need to replace constraint (9) with

\sum_{j^{\prime}=1}^{M}a^{T}_{j^{\prime}}\tau^{T}_{j^{\prime}}(t)+\sum_{i=1}^{N}a_{ij}(t)\tau_{i}(t)\leq T\quad\forall j,\forall t

because the wireless communication is prohibited during WPT and thus the available communication time of all APs (not just the one that transfers energy) must exclude the WPT time. It should be noted that the algorithm proposed in the next section can also be conveniently adapted to this situation.

III An Online Algorithm for Wireless Power Transfer and Computation Scheduling

In this section, we design an online algorithm that jointly optimizes WPT and computation scheduling based on the Lyapunov optimization. A relax-then-adjust technique is proposed to decompose the original problem into smaller subproblems so that the algorithm’s complexity is significantly reduced.

III-A Algorithm Design with Lyapunov Optimization

For convenience of description, we first define the battery shortage $B^{-}_{i}(t)$ as

B^{-}_{i}(t)=B^{max}_{i}-B_{i}(t).

According to the constraint (4), the energy consumed by WD $i$ on slot $t$ cannot exceed the remaining energy in its battery. From a long-term perspective, this implies the time-average harvested energy by WD $i$ is equal to or greater than its time-average energy consumption, so $B^{-}_{i}(t)$ is also stable in the long run. Let $\bm{\Theta}(t)=[\bm{Q}(t),\bm{B}^{-}(t)]$ be the combined queue vector, where $\bm{Q}(t)=(Q_{1}(t),Q_{2}(t),\dots,Q_{N}(t))$ and $\bm{B}^{-}(t)=(B^{-}_{1}(t),B^{-}_{2}(t),\dots,B^{-}_{N}(t))$ . According to the Lyapunov optimization theory, we start by defining the quadratic Lyapunov function

L(t)=\frac{1}{2}\sum_{i=1}^{N}\left[Q_{i}(t)^{2}+B^{-}_{i}(t)^{2}\right]

and the conditional Lyapunov drift

\Delta L(t)=\mathbb{E}\left\{L(t+1)-L(t)|\bm{\Theta}(t)\right\}.

Next, we combine $\Delta L(t)$ with the objective function and form the following drift-plus-penalty term

\Delta_{V}L(t)=\Delta L(t)+V\mathbb{E}\left\{\sum_{j=1}^{M}\left(E^{T}_{j}(t)+E^{C}_{j}(t)\right)|\bm{\Theta}(t)\right\}

where $V$ is a tunable parameter that controls the trade-off between the energy consumption and the queueing delay of computation data. The following lemma provides an upper bound for $\Delta_{V}L(t)$ .

Lemma 1

On every slot $t$ and for any value of $\bm{\Theta}(t)$ , the drift-plus-penalty term always satisfies

$\displaystyle\Delta_{V}L(t)\leq$	$\displaystyle C-\sum_{i=1}^{N}Q_{i}(t)\mathbb{E}\left\{D^{L}_{i}(t)+D^{O}_{i}(t)-A_{i}(t)\|\bm{\Theta}(t)\right\}$
	$\displaystyle-\sum_{i=1}^{N}B^{-}_{i}(t)\mathbb{E}\left\{E^{H}_{i}(t)-E^{L}_{i}(t)-E^{O}_{i}(t)\|\bm{\Theta}(t)\right\}$
	$\displaystyle+V\mathbb{E}\left\{\sum_{j=1}^{M}\left(E^{T}_{j}(t)+E^{C}_{j}(t)\right)\|\bm{\Theta}(t)\right\}$	(13)

where $C$ is a constant defined in the proof.

The proof follows a standard procedure [21] and is omitted for brevity. Based on the Lyapunov optimization theory, we can obtain an approximately optimal algorithm of our problem by minimizing the right-hand side of (13) in every slot $t$ . The resulting time-average energy consumption decreases at the rate of $O(1/V)$ and the time-average queueing delay increases at the rate of $O(V)$ , presenting a $O(1/V)$ - $O(V)$ tradeoff between the two metrics. As a result, we can approach the optimal energy consumption arbitrarily close by increasing the value of $V$ . However, in our problem, directly solving the minimization problem is difficult because the decision variables are coupled in constraints such as (4) and (5). To reduce the complexity of our algorithm, we propose a relax-then-adjust technique to decompose the original minimization problem into independent subproblems. The details are presented in the next subsection.

III-B Relax-Then-Adjust

In our problem, the objective is to minimize the system energy consumption used to process the computation data of WDs. When the amount of arrived workload is fixed, this is equivalent to maximizing the energy efficiency, which is defined as the ratio of total energy consumption to the corresponding aggregate accomplished computation data. Inspired by this interpretation, we define the marginal energy efficiency of local computation and computation offloading, denoted by $\epsilon^{L}_{i}(t)$ and $\epsilon^{O}_{i}(t)$ respectively, as follows

$\displaystyle\epsilon^{L}_{i}(t)$	$\displaystyle=\frac{\partial E^{L}_{i}(t)}{\partial D^{L}_{i}(t)}=3\kappa_{i}\phi_{i}f^{2}_{i}(t)$
$\displaystyle\epsilon^{O}_{i}(t)$	$\displaystyle=\frac{\partial\left(E^{O}_{i}(t)+\eta\phi_{i}D^{O}_{i}(t)\right)}{\partial D^{O}_{i}(t)}$
	$\displaystyle=\sum_{j=1}^{M}\frac{a_{ij}(t)v_{i}\ln 2}{B}\cdot\left(\frac{\sigma^{2}_{j}}{h^{U}_{ij}(t)}+P_{i}(t)\right)+\eta\phi_{i}$	(14)

where we have assumed the time allocation $\tau_{i}(t)$ is given in deriving (14). According to the optimality conditions, the two marginal energy efficiency should be equal when the overall energy efficiency is maximized. Based on this fact, we can first relax constraint (4) and (5) and then adjust the values of $f_{i}(t)$ and $P_{i}(t)$ to obtain a feasible solution that makes $\epsilon^{L}_{i}(t)=\epsilon^{O}_{i}(t)$ . A similar technique is also used to decouple the WPT and computation offloading variables in constraint (9). After relaxing constraints (4), (5), and (9), the optimization problem of minimizing the right-hand side of (13) can be decomposed into three subproblems by classifying variables into independent groups. In our algorithm, we first compute solutions for these subproblems and then adjust their values according to the optimality conditions. The details of our algorithm are described as follows.

III-B1 WPT

We compute the WPT-related control variables by solving the following subproblem

\displaystyle\min\quad

\displaystyle\sum_{j=1}^{M}\left(V-\sum_{i=1}^{N}B^{-}_{i}(t)\mu_{i}h^{D}_{ij}(t)\right)a^{T}_{j}(t)P^{T}_{j}(t)\tau^{T}_{j}(t)

(15)

subject to constraints (1) and (10). Note that we have relaxed the time allocation constraint (9). This problem is derived by substituting (2) and (3) into (13) and group WPT-related terms together. The next two subproblems are derived in a similar way. Let $c^{T}_{j}(t)=V-\sum_{i=1}^{N}B^{-}_{i}(t)\mu_{i}h^{D}_{ij}(t)$ be the coefficient of $a^{T}_{j}(t)P^{T}_{j}(t)\tau^{T}_{j}(t)$ . If $c^{T}_{j}(t)\geq 0$ for all $j$ , then the optimal value is obtained by setting all $a^{T}_{j}(t)$ to $0$ . Otherwise, find $j^{*}=\operatorname*{arg\,min}c^{T}_{j}(t)P^{T,max}_{j}$ and the solution is $a^{T}_{j^{*}}(t)=1,P^{T}_{j^{*}}(t)=P^{T,max}_{j^{*}},\tau^{T}_{j^{*}}(t)=T$ .

III-B2 Local Computation

The only variable for local computation is the CPU frequency of WDs. By relaxing constraints (4) and (5), we get the following subproblem

\min\sum_{i=1}^{N}B^{-}_{i}(t)\kappa_{i}f^{3}_{i}(t)T-\sum_{i=1}^{N}Q_{i}(t)\frac{f_{i}(t)T}{\phi_{i}}

(16)

where $f_{i}(t)$ is subject to constraint (12). This problem can be further divided into $N$ subproblems because the variable $f_{i}(t)$ is independent with each other. As a result, the problem can be solved analytically and the optimal value of $f_{i}(t)$ is obtained at either the boundary points or the stationary point of (16), which is given by

f_{i}(t)=\min\left\{f^{max}_{i},\sqrt{\frac{Q_{i}(t)}{3\kappa_{i}\phi_{i}B^{-}_{i}(t)}}\right\}.

III-B3 Computation Offloading

The optimal time allocation of computation offloading can be derived by solving

$\displaystyle\min\quad$	$\displaystyle\sum_{j=1}^{M}\sum_{i=1}^{N}V\eta a_{ij}(t)\phi_{i}D^{O}_{i}(t)-\sum_{i=1}^{N}Q_{i}(t)D^{O}_{i}(t)$
	$\displaystyle+\sum_{i=1}^{N}B^{-}_{i}(t)P_{i}(t)\tau_{i}(t)$	(17)
$\displaystyle s.t.\quad$	$\displaystyle\eqref{cons:a_ij},\eqref{cons:var_offloading}$
	$\displaystyle\sum_{i=1}^{N}a_{ij}(t)\tau_{i}(t)\leq T\qquad\forall j,\forall t$	(18)

where we have used a relaxed time allocation constraint (18) instead of (9). This MIP problem is non-convex and has no efficient algorithms in general. In the next subsection, we will devise a heuristic algorithm for this problem based on the Alternating Minimization method.

III-B4 Adjust Variable Values

If the variable values computed in previous steps satisfy constraint (4), (5), and (9), then we can skip this step. Otherwise, we have to adjust their values to obtain a feasible solution. As described earlier, our intuition is to equalize the marginal benefits of variables constrained by the same resource.

Let us first consider the case where WDs have sufficient queueing data so constraint (5) is redundant. Without loss of generality, we can assume the energy causality constraint (4) is tight. Let $\epsilon^{L}_{i}(t)=\epsilon^{O}_{i}(t)$ , we can express the local CPU frequency $f_{i}(t)$ with respect to the offloading power $P_{i}(t)$

f_{i}(t)=\sqrt{\sum_{j=1}^{M}\frac{a_{ij}(t)v_{i}\ln 2}{3\kappa_{i}\phi_{i}B}\cdot\left(\frac{\sigma^{2}_{j}}{h^{U}_{ij}(t)}+P_{i}(t)\right)+\frac{\eta}{3\kappa_{i}}}.

(19)

Suppose AP $j^{*}$ is the one that broadcasts RF energy in time slot $t$ and let $\mathcal{N}^{*}(t)$ denote the set of WDs that offload data to AP $j^{*}$ , i.e., $a_{ij^{*}}(t)=1$ and $\tau_{i}(t)>0$ for all $i\in\mathcal{N}^{*}(t)$ . If WD $i$ does not belong to $\mathcal{N}^{*}(t)$ , which means it communicates with some other AP $j$ such that $a^{T}_{j}=0$ , then the time allocation constraint (9) of $\tau_{i}(t)$ reduces to (18). Therefore, the time allocation computed in Section III-B3 is feasible and no need for adjustment. Thus, we only need to adjust the values of $f_{i}(t)$ and $P_{i}(t)$ by solving

E^{L}_{i}(t)+E^{O}_{i}(t)=B_{i}(t),

(20)

which results in

P_{i}(t)=\frac{B_{i}(t)-\kappa_{i}f^{3}_{i}(t)T}{\tau_{i}(t)}.

(21)

For the rest WDs belongs to $\mathcal{N}^{*}(t)$ , we first re-allocate the WPT time $\tau^{T}_{j^{*}}(t)$ and offloading time $\tau_{i}(t),i\in\mathcal{N}^{*}(t)$ . According to (15) and (17), the marginal cost of $\tau^{T}_{j^{*}}(t)$ is $c^{T}_{j^{*}}(t)P^{T,max}_{j^{*}}$ and the marginal cost of $\tau_{i}(t)$ for all $i\in\mathcal{N}^{*}(t)$ is

		$\displaystyle\frac{\partial\left(V\eta\phi_{i}D^{O}_{i}(t)-Q_{i}(t)D^{O}_{i}(t)+B^{-}_{i}(t)P_{i}(t)\tau_{i}(t)\right)}{\partial\tau_{i}(t)}$
	$\displaystyle=$	$\displaystyle\frac{\left(V\eta\phi_{i}-Q_{i}(t)\right)B}{v_{i}}\log_{2}\left(1+\frac{P_{i}(t)h^{U}_{ij^{}}(t)}{\sigma^{2}_{j^{}}}\right)+B^{-}_{i}(t)P_{i}(t).$		(22)

By substituting (21) into (22), the marginal cost of $\tau_{i}(t)$ is a function of itself. Due to the optimality condition, we can compute the new time allocation by equalizing the marginal cost of $\tau_{i}(t)$ and $\tau^{T}_{j^{*}}(t)$ . After that, the values of $P_{i}(t)$ and $f_{i}(t)$ for WDs in $\mathcal{N}^{*}(t)$ are adjusted just like other WDs.

For cases where the remaining energy in the battery is adequate to handle all the queueing data in WDs, the data constraint (5) is tight and the energy constraint (4) is redundant. As a result, we can repeat the above procedures by replacing (20) with

D^{L}_{i}(t)+D^{O}_{i}(t)=Q_{i}(t).

III-C A Heuristic Algorithm for Computation Offloading

In this subsection, we propose a heuristic algorithm for the optimization problem of computation offloading based on the Alternating Minimization (AM) method. The main idea underlying AM is to replace the difficult joint optimization with a sequence of easier optimization involving grouped subsets of the variables. In our problem, we partition the decision variables into two groups: the time allocation variables $a_{ij}(t)\tau_{i}(t)$ and the transmission power variables $P_{i}(t)$ . According to the AM method, the original problem can be solved iteratively by solving the following subproblems in each step $k$

	$\displaystyle P^{k}(t)=\operatorname*{arg\,min}_{P(t)}g\left(a^{k}(t)\tau^{k}(t),P(t)\right)$		(23)
	$\displaystyle a^{k+1}(t)\tau^{k+1}(t)=\operatorname*{arg\,min}_{a(t)\tau(t)}g\left(a(t)\tau(t),P^{k}(t)\right)$		(24)

where $g(t)$ is the objective function in (17). Theoretical analysis guarantees that the computed solution converges to local minima [22].

III-C1 Transmission Power Selection

When the time allocation is given, minimizing (17) can be decomposed into $N$ independent subproblems and we can obtain an analytical solution as in Section III-B2. If $\phi_{i}\eta>Q_{i}(t)$ , then the optimal value of $P_{i}(t)$ is $P^{max}_{i}$ . Otherwise, set

P_{i}(t)=\min\left\{P^{max}_{i},\frac{(Q_{i}(t)-\phi_{i}\eta)B}{B^{-}_{i}(t)v_{i}\ln 2}-\frac{\sigma^{2}_{j^{\prime}}}{h^{U}_{ij^{\prime}}(t)}\right\}

where $j^{\prime}$ is the AP that WD $i$ communicate with.

III-C2 Time Allocation

Since the objective function is linear with respect to $\tau_{i}(t)$ , the marginal cost of time allocation is constant. According to the optimality condition, there is an optimal solution in which each AP $j$ is devoted to at most one WD $j$ . Therefore, we can assume $\tau_{i}(t)=T$ without loss of generality. Then the problem is to determine the value of $a_{ij}(t)$ , which turns to be a standard assignment problem and can be solved within $O(M^{2}N+N^{2}\log N)$ by the Hungarian algorithm [23].

IV Simulation Results

In this section, we evaluate the proposed algorithm through simulations and compare its performance with the following two benchmark schemes:

•

Local Computation Only (LCO) scheme: the computation data of WDs are processed locally;
•

Fully Offloading (FO) scheme: the computation data of WDs are fully offloaded to the APs.

Similar to our algorithm, we optimize the control decisions for these two schemes based on the Lyapunov optimization, so their performance is also associated with the parameter $V$ .

We consider a WPMEC network with $N=30$ WDs and $M=5$ APs. The simulation settings are selected based on the works in [16] and [11]. The maximum CPU frequency and battery capacity of each WD are $f^{max}_{i}=0.5GHz$ and $B^{max}_{i}=30kJ$ , respectively. As in [11], we adopt a simplified Rayleigh fading channel model and the uplink gain from WD $i$ to AP $j$ is $h^{U}_{ij}=\theta^{U}d^{-3}_{ij}\bar{h}_{ij}$ , where $\theta^{U}=6.25\times 10^{-4}$ (i.e. $-32dB$ ), $d_{ij}$ is the distance between WD $i$ and AP $j$ , and $\bar{h}_{ij}$ is a random variable drawn from the standard complex normal distribution $\mathcal{CN}(0,1)$ . We set the downlink gain as $h^{D}_{ij}=2h^{U}_{ij}$ . The other parameters used in our simulations are $B=1MHz$ , $\mu_{i}=0.51$ , $\kappa_{i}=10^{-28}$ , $v_{i}=1.1$ , $\eta=8.2nJ$ , $\phi_{i}=1000\ cycles/bit$ , and $\sigma^{2}_{j}=10^{-9}W$ .

The impact of control parameter $V$ on the system energy consumption and average delay are demonstrated in Fig. 3 and Fig. 3, respectively. In all algorithms, the energy consumption decreases with $V$ and the average delay grows with $V$ , which is in accordance with the energy-delay tradeoff of Lyapunov optimization. As shown in Fig. 3, the energy consumption of the proposed algorithm is slightly higher than LCO when $V$ is small, but is significantly better than the other two when $V$ is large. Meanwhile, the average delay induced by our algorithm outperforms the two benchmarks in all cases. This is because our algorithm performs local computation and computation offloading simultaneously, thus resulting in higher process rate and better energy efficiency.

In addition to the varying parameter $V$ , we also conducted simulations under distinct network scales. The system energy consumption under different number of WDs and different number of APs are presented in Fig. 5 and Fig. 5. The default value of $V$ is set to $3$ in all simulations. The growth of $N$ means there are more data to be processed in the system, thus results in higher energy consumption. When $N=40$ , APs already operate at the peak WPT power in FO but we still observed an extremely large average delay. We analysized the run-time data and found that the generated workload exceeds the system capacity. This also explains why the energy consumption of FO does not increase when $N$ grows from $40$ to $50$ . In Fig. 5, the energy consumption of FO decreases with respect to $M$ because WDs can choose closer APs to offload their computation tasks. However, the improvement on LCO is very limited. In both situations, our algorithm can utilize the advantages of both local computation and computation offloading, thus achieves the minimal energy consumption and average delay.

V Conclusions

In this paper, we investigate the computation scheduling problem in WP-MEC networks with multiple WDs and multiple APs. To minimize energy consumption, we propose an online algorithm that jointly optimizes the resource allocation in WPT and computation scheduling. The time complexity of our algorithm is significantly reduced via the relax-then-adjust technique and the Alternating Minimization method. Simulations are conducted to validate the performance of our algorithm.

References

[1] A. Al-Fuqaha, M. Guizani, M. Mohammadi, M. Aledhari, and M. Ayyash, “Internet of things: A survey on enabling technologies, protocols, and applications,” IEEE communications surveys & tutorials, vol. 17, no. 4, pp. 2347–2376, 2015.
[2] Y. Mao, C. You, J. Zhang, K. Huang, and K. B. Letaief, “A survey on mobile edge computing: The communication perspective,” IEEE Communications Surveys & Tutorials, vol. 19, no. 4, pp. 2322–2358, 2017.
[3] P. Mach and Z. Becvar, “Mobile edge computing: A survey on architecture and computation offloading,” IEEE Communications Surveys & Tutorials, vol. 19, no. 3, pp. 1628–1656, 2017.
[4] Y. Mao, J. Zhang, and K. B. Letaief, “Dynamic computation offloading for mobile-edge computing with energy harvesting devices,” IEEE Journal on Selected Areas in Communications, vol. 34, no. 12, pp. 3590–3605, 2016.
[5] C. You, K. Huang, and H. Chae, “Energy efficient mobile cloud computing powered by wireless energy transfer,” IEEE Journal on Selected Areas in Communications, vol. 34, no. 5, pp. 1757–1771, 2016.
[6] X. Hu, K.-K. Wong, and K. Yang, “Wireless powered cooperation-assisted mobile edge computing,” IEEE Transactions on Wireless Communications, vol. 17, no. 4, pp. 2375–2388, 2018.
[7] L. Ji and S. Guo, “Energy-efficient cooperative resource allocation in wireless powered mobile edge computing,” IEEE Internet of Things Journal, vol. 6, no. 3, pp. 4744–4754, 2018.
[8] S. Bi and Y. J. Zhang, “Computation rate maximization for wireless powered mobile-edge computing with binary computation offloading,” IEEE Transactions on Wireless Communications, vol. 17, no. 6, pp. 4177–4190, 2018.
[9] F. Zhou, Y. Wu, R. Q. Hu, and Y. Qian, “Computation rate maximization in uav-enabled wireless-powered mobile-edge computing systems,” IEEE Journal on Selected Areas in Communications, vol. 36, no. 9, pp. 1927–1941, 2018.
[10] L. Huang, S. Bi, and Y. J. Zhang, “Deep reinforcement learning for online computation offloading in wireless powered mobile-edge computing networks,” IEEE Transactions on Mobile Computing, 2019.
[11] T. Zhu, J. Li, Z. Cai, Y. Li, and H. Gao, “Computation scheduling for wireless powered mobile edge computing networks,” in IEEE INFOCOM 2020-IEEE Conference on Computer Communications. IEEE, 2020, pp. 596–605.
[12] F. Wang, J. Xu, X. Wang, and S. Cui, “Joint offloading and computing optimization in wireless powered mobile-edge computing systems,” IEEE Transactions on Wireless Communications, vol. 17, no. 3, pp. 1784–1797, 2017.
[13] F. Zhou and R. Q. Hu, “Computation efficiency maximization in wireless-powered mobile edge computing networks,” IEEE Transactions on Wireless Communications, vol. 19, no. 5, pp. 3170–3184, 2020.
[14] F. Wang, J. Xu, and S. Cui, “Optimal energy allocation and task offloading policy for wireless powered mobile edge computing systems,” IEEE Transactions on Wireless Communications, vol. 19, no. 4, pp. 2443–2459, 2020.
[15] H. Wu, X. Lyu, and H. Tian, “Online optimization of wireless powered mobile-edge computing for heterogeneous industrial internet of things,” IEEE Internet of Things Journal, vol. 6, no. 6, pp. 9880–9892, 2019.
[16] S. Mao, S. Leng, S. Maharjan, and Y. Zhang, “Energy efficiency and delay tradeoff for wireless powered mobile-edge computing systems with multi-access schemes,” IEEE Transactions on Wireless Communications, vol. 19, no. 3, pp. 1855–1867, 2019.
[17] M. Y. Naderi, P. Nintanavongsa, and K. R. Chowdhury, “Rf-mac: A medium access control protocol for re-chargeable sensor networks powered by wireless energy harvesting,” IEEE Transactions on Wireless Communications, vol. 13, no. 7, pp. 3926–3937, 2014.
[18] T. D. P. Perera, D. N. K. Jayakody, S. K. Sharma, S. Chatzinotas, and J. Li, “Simultaneous wireless information and power transfer (swipt): Recent advances and future challenges,” IEEE Communications Surveys & Tutorials, vol. 20, no. 1, pp. 264–302, 2017.
[19] X. Lu, P. Wang, D. Niyato, D. I. Kim, and Z. Han, “Wireless networks with rf energy harvesting: A contemporary survey,” IEEE Communications Surveys & Tutorials, vol. 17, no. 2, pp. 757–789, 2014.
[20] J. M. Rabaey, A. P. Chandrakasan, and B. Nikolić, Digital integrated circuits: a design perspective. Pearson Education Upper Saddle River, NJ, 2003, vol. 7.
[21] M. J. Neely, “Stochastic network optimization with application to communication and queueing systems,” Synthesis Lectures on Communication Networks, vol. 3, no. 1, pp. 1–211, 2010.
[22] J. C. Bezdek and R. J. Hathaway, “Convergence of alternating optimization,” Neural, Parallel & Scientific Computations, vol. 11, no. 4, pp. 351–368, 2003.
[23] H. W. Kuhn, “The hungarian method for the assignment problem,” Naval research logistics quarterly, vol. 2, no. 1-2, pp. 83–97, 1955.

$\displaystyle\Delta_{V}L(t)\leq$	$\displaystyle C-\sum_{i=1}^{N}Q_{i}(t)\mathbb{E}\left\{D^{L}_{i}(t)+D^{O}_{i}(t)-A_{i}(t)\|\bm{\Theta}(t)\right\}$
	$\displaystyle-\sum_{i=1}^{N}B^{-}_{i}(t)\mathbb{E}\left\{E^{H}_{i}(t)-E^{L}_{i}(t)-E^{O}_{i}(t)\|\bm{\Theta}(t)\right\}$
	$\displaystyle+V\mathbb{E}\left\{\sum_{j=1}^{M}\left(E^{T}_{j}(t)+E^{C}_{j}(t)\right)\|\bm{\Theta}(t)\right\}$	(13)