
Joint User Scheduling and Computing Resource Allocation Optimization in Asynchronous Mobile Edge Computing Networks

Yihan Cang, Ming Chen, Yijin Pan, Zhaohui Yang,
Ye Hu, Haijian Sun, and Mingzhe Chen
The work of Ming Chen was supported by the National Natural Science Foundation of China (NSFC) under grant 61871128, 61960206005 and 61960206006, by the National Key Research and Development Program under grant 2018YFB1801905, and by Fundamental Research on Foreword Leading Technology of Jiangsu Province under grant BK20192002. The work of Y. Pan was supported by Chongqing Natural Science Joint Fund Project under Grant No. CSTB2023NSCQ-LZX0121. (Corresponding author: Ming Chen). Y. Cang, Ming Chen and Y. Pan are with the National Mobile Communications Research Laboratory, Southeast University, Nanjing 210096, China (e-mails: yhcang@seu.edu.cn, chenming@seu.edu.cn, panyj@seu.edu.cn). Ming Chen is also with the Purple Mountain Laboratories, Nanjing 211100, China. Z. Yang is with College of Information Science and Electronic Engineering, Zhejiang University, Hangzhou 310027, China, and with International Joint Innovation Center, Zhejiang University, Haining 314400, China, and also with Zhejiang Provincial Key Laboratory of Info. Proc., Commun. & Netw. (IPCAN), Hangzhou 310027, China (e-mail: yang_zhaohui@zju.edu.cn). Y. Hu is with the Department of Industrial and System Engineering, University of Miami, Coral Gables, FL, 33146, (e-mail: yehu@miami.edu). H. Sun is with the School of Electrical and Computer Engineering, The University of Georgia, Athens, GA 30602 USA (e-mail: hsun@uga.edu). M. Chen is with the Department of Electrical and Computer Engineering and Institute for Data Science and Computing, University of Miami, Coral Gables, FL, 33146, USA (e-mail: mingzhe.chen@miami.edu).
Abstract

In this paper, the problem of joint user scheduling and computing resource allocation in asynchronous mobile edge computing (MEC) networks is studied. In such networks, edge devices will offload their computational tasks to an MEC server, using the energy they harvest from this server. To get their tasks processed on time using the harvested energy, edge devices will strategically schedule their task offloading, and compete for the computational resource at the MEC server. The MEC server then executes these tasks asynchronously based on their arrival times. This joint user scheduling, time and computation resource allocation problem is posed as an optimization framework whose goal is to find the optimal scheduling and allocation strategy that minimizes the energy consumption of these mobile computing tasks. To solve this mixed-integer non-linear programming problem, the generalized Benders decomposition method is adopted, which decomposes the original problem into a primal problem and a master problem. Specifically, the primal problem is related to computation resource and time slot allocation, for which the optimal closed-form solution is obtained. The master problem, over the discrete user scheduling variables, is constructed by adding optimality cuts or feasibility cuts according to whether the primal problem is feasible; the result is a standard mixed-integer linear programming problem that can be solved efficiently. By iteratively solving the primal problem and the master problem, the optimal scheduling and resource allocation scheme is obtained. Simulation results demonstrate that the proposed asynchronous computing framework reduces energy consumption by 87.17% compared with the conventional synchronous computing counterpart.

Index Terms:
Mobile edge computing, asynchronous computing, user scheduling, wireless power transfer.

I Introduction

Mobile edge computing (MEC) provides powerful computing ability to edge devices [1, 2]. Numerous works have investigated MEC systems from the perspective of resource allocation. In [3], computing offloading and service caching are jointly optimized in an MEC-enabled smart grid to minimize system cost. The work in [4] proposed a reverse auction-based offloading and resource allocation scheme in MEC. With the aid of machine learning, a multi-agent deep deterministic policy gradient (MADDPG) algorithm is designed to maximize energy efficiency in [5]. However, deploying computing resources at edge servers of a wireless network faces several challenges. First, due to the limited energy of edge servers, they may not be able to provide sufficient computation resource according to devices’ requirements [6, 7]. Second, executing all offloaded tasks synchronously requires edge servers to wait for the arrival of the task with the maximum transmission delay, which may not be efficient. Meanwhile, the task scheduling sequence is non-negligible in synchronous task offloading, and will also impact the network load and task completion [8].

To address the first challenge, wireless power transfer (WPT) technology, which exploits the energy carried by radio frequency (RF) signals, has emerged [9]. Instead of relying on solar and wind sources, ambient RF signals can serve as a viable new source for energy scavenging. Harvesting energy from the environment provides perpetual energy supplies to wireless devices for task offloading [10]. Thus, WPT has been regarded as a promising paradigm for MEC scenarios. Combining WPT with MEC, the authors in [11] proposed a multi-user wireless-powered MEC framework aiming at minimizing the total energy consumption under latency constraints. In [12], considering binary computation offloading, the weighted sum computation rate of all wireless devices was maximized by optimizing computation mode selection and transmission time allocation. The work in [13] proposed a multiple intelligent reflecting surfaces (IRSs) assisted wireless powered MEC system, where the IRSs are deployed to assist both the downlink WPT from the access point (AP) to the wireless devices and the uplink computation offloading. However, the above works [14, 15, 16, 17, 11, 12, 13] assumed that all computational tasks offloaded by users arrive at the server at the same time and that the server then starts to process all tasks simultaneously, which is inefficient and even impractical given users’ dynamic computational task processing requests [18].

Currently, only a few existing works [18, 19, 20, 21, 22, 23] have optimized MEC networks under dynamic computation requests. The work in [18] designed a Whittle-index-based offloading algorithm to maximize the long-term reward for asynchronous MEC networks where computational tasks arrive randomly. In [19], the authors studied the co-channel interference caused by asynchronous task uploading in NOMA MEC systems. The work in [20] investigated energy-efficient asynchronous task offloading for an MEC system where computational tasks with various latency requirements arrive at different time slots. The task scheduling problem for MEC systems with task interruptions and insertions was studied in [21]. However, the above works [18, 19, 20, 21] that focused on asynchronous task offloading neglected how the asynchronous task arrival affects the computation at the MEC server. The authors in [22] used a sequential computation method to solve the energy consumption minimization problem under asynchronous task arrivals. The work in [23] designed a computation strategy that only allows a task to be executed after the completion of the previous tasks. Yet, the works in [22] and [23] are still constrained by their limited usage of the server computation capacity, and cannot act as resource-efficient asynchronous task offloading solutions.

The sequential computation strategy [24] has been shown to have the potential to improve the computation resource efficiency and task execution punctuality in an asynchronous MEC network. However, since the computation resource allocation at the server depends on the arrival of the offloaded tasks, the sequential scheduling of the tasks will inevitably affect system performance, a fact that has been widely ignored [22, 25, 26].

The main contribution of this paper is a novel asynchronous MEC framework that jointly schedules tasks and allocates computation resource with optimized system energy efficiency. In brief, our key contributions include:

  • We develop a novel framework to manage computation resource for sequential computation in asynchronous MEC networks. In particular, we consider an MEC network in which the edge devices sequentially harvest energy for transmission, offload their computational tasks to an MEC server, and then compete for computation resource at the server to get their tasks accomplished. To achieve highly energy-efficient task execution, a policy needs to be designed for determining the optimal task scheduling sequence, time allocation, and computational resource allocation. We pose this joint scheduling and resource allocation problem in an optimization framework and seek the strategy which minimizes the energy consumption of the tasks.

  • Then, a generalized Benders decomposition (GBD) based algorithm is proposed to solve the formulated mixed-integer non-linear programming (MINLP) problem, which is decomposed into a primal problem that allocates computation resource and time, and a master problem that schedules user tasks. By iteratively solving the primal problem and the master problem, the optimal scheduling and resource allocation scheme is obtained.

  • To show the effectiveness of the proposed algorithm, we prove that the optimal energy-efficient scheduling and resource allocation scheme also optimizes the task punctuality. Our analytical results also show that the optimal allocation scheme for a given offloading task follows a specific pattern: the computation frequency allocated to each task remains constant initially, then gradually decreases before eventually reaching zero. Notably, the allocations of all tasks decrease simultaneously, at a time instant that is given in closed form in terms of their required central processing unit (CPU) cycles. Leveraging these identified properties, we introduce a computation resource allocation algorithm that offers a low-complexity solution.

Simulation results demonstrate that the proposed asynchronous computing framework reduces energy consumption by 87.87% compared with the conventional synchronous computing counterpart. Moreover, the computational complexity of the proposed computation resource allocation algorithm is reduced by 100 times compared with the conventional interior point method.

The rest of this paper is structured as follows. Section II elaborates the system model and problem formulation. In Section III, we investigate the properties of asynchronous frequency allocation with given time allocation and user scheduling. The joint optimization of user scheduling, time allocation, and computation resource allocation is presented in Section IV. Simulation results are presented in Section V. Finally, Section VI draws the conclusions.

II System Model and Problem Formulation

Refer to caption
Figure 1: An illustration of the flow chart of the ordered TDMA system with asynchronous computing.

Consider an MEC network consisting of one MEC server and a set $\mathcal{K}=\{1,2,\cdots,K\}$ of energy-harvesting-enabled edge devices. Within this network, each device $k\in\mathcal{K}$ needs to execute an $A_{k}$-bit computational task, and will offload its computational task to the MEC server. As shown in Fig. 1, the devices need to first harvest energy from the server to enable such offloading. Then, using the time division multiple access (TDMA) technique, the devices need to schedule their offloading toward the MEC server. In other words, the computational tasks offloaded by devices will arrive at the MEC server asynchronously. To this end, the MEC server will process each device’s computational task in an asynchronous manner. In particular, the server will process devices’ computational tasks according to the time that it receives each of these computational tasks.

The server and edge devices must complete their computational tasks within a time period $T$, which is divided into $(K+2)$ time slots. The duration of each time slot $n\in\{0,\cdots,N+1\}$ is represented by $\Delta t_{n}$, with $N=K$. Each device uses one time slot to offload its computational task. Let $a_{k,n}$ be the index to indicate whether device $k$ offloads its task to the server at time slot $t_{n}$. In particular, if device $k$ uses time slot $t_{n}$ to offload its computational task, we have $a_{k,n}=1$; otherwise, $a_{k,n}=0$. Since each device uses only one time slot and each time slot can only be allocated to one device, we have $\sum_{k=1}^{K}a_{k,n}=1,\ \forall n\in\{1,\cdots,N\}$, and $\sum_{n=1}^{N}a_{k,n}=1,\ \forall k\in\mathcal{K}$. Meanwhile, when $a_{k,n}=1$, device $k$ will harvest energy from time slot $t_{0}$ to $t_{n-1}$. Once task $k$ arrives at the MEC server, i.e., at time slot $t_{n+1}$, the server will process this computational task.
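In effect, the two scheduling constraints above make $\boldsymbol{A}$ a $K\times K$ permutation matrix. The minimal Python check below illustrates this; the function name and sample matrix are ours, not from the paper:

```python
import numpy as np

def is_valid_schedule(A):
    """Check the user scheduling constraints: A is a K x N binary matrix
    with exactly one device per slot (column sums equal 1) and exactly
    one slot per device (row sums equal 1); here N = K."""
    A = np.asarray(A)
    binary = np.isin(A, (0, 1)).all()
    one_per_slot = (A.sum(axis=0) == 1).all()    # sum_k a_{k,n} = 1
    one_per_device = (A.sum(axis=1) == 1).all()  # sum_n a_{k,n} = 1
    return bool(binary and one_per_slot and one_per_device)

# Example: device 1 offloads in slot 2, device 2 in slot 1, device 3 in slot 3.
A = [[0, 1, 0],
     [1, 0, 0],
     [0, 0, 1]]
print(is_valid_schedule(A))
```

Any feasible schedule is therefore one of the $K!$ permutations, which is what makes problem (2) combinatorial.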

The task computation process of the server and a device jointly completing a computational task kk consists of three stages: 1) energy harvesting, 2) task offloading, and 3) remote computing. Next, we first introduce the process of the energy harvesting, task offloading, and remote computing stages. Then, the problem formulation is given.

II-A Energy Harvesting Model

The path loss model is given by $\bar{h}_{k}=A\left(\frac{c}{4\pi f_{c}d_{k}}\right)^{\ell}$, where $A$ represents the antenna gain, $c$ denotes the speed of light, $f_{c}$ is the carrier frequency, $\ell$ denotes the path-loss factor, and $d_{k}$ represents the distance between device $k$ and the server [27]. The instant channel gain between device $k$ and the server, denoted by $h_{k}$, follows an i.i.d. Rician distribution with line-of-sight (LoS) link gain equal to $\gamma\bar{h}_{k}$, where $\gamma$ is the Rician factor. If device $k$ offloads its task at time slot $t_{n}$ (i.e., $a_{k,n}=1$), the harvested energy of device $k$ is $E_{k}^{H}=\sum_{i=0}^{n-1}\Delta t_{i}h_{k}\eta P_{0}$, where $\eta$ is the energy harvesting efficiency of each device, which is assumed to be equal for all devices [28], and $P_{0}$ denotes the transmit power of the server. Since each device has only a single time slot for task offloading (i.e., there exists only one $n\in\mathcal{N}$ such that $a_{k,n}=1$ for a certain device $k$), the energy harvested by device $k$ can be reformulated as $E_{k}^{H}=\sum_{n=1}^{K}\sum_{i=0}^{n-1}a_{k,n}\Delta t_{i}h_{k}\eta P_{0}$ $(\forall k\in\mathcal{K})$.
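Both expressions evaluate directly; the sketch below (variable names and the illustrative numbers are our own, not from the paper) computes $\bar{h}_{k}$ and $E_{k}^{H}$ for a device that offloads in slot $t_{n}$:

```python
import math

def avg_channel_gain(antenna_gain, f_c, d_k, ell, c=3.0e8):
    """Path loss: h_bar_k = A * (c / (4*pi*f_c*d_k))^ell."""
    return antenna_gain * (c / (4.0 * math.pi * f_c * d_k)) ** ell

def harvested_energy(dt, n, h_k, eta, P0):
    """E_k^H = sum_{i=0}^{n-1} dt[i] * h_k * eta * P0 for a device that
    offloads in slot t_n (it harvests during slots t_0 .. t_{n-1})."""
    return sum(dt[:n]) * h_k * eta * P0

# Illustrative numbers only.
h = avg_channel_gain(antenna_gain=3.0, f_c=915e6, d_k=5.0, ell=2.0)
E = harvested_energy(dt=[0.1, 0.2, 0.3], n=2, h_k=h, eta=0.6, P0=1.0)
```

Note that a device scheduled later harvests over more slots, which is exactly the coupling between scheduling and energy availability that problem (2) exploits.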

II-B Tasks Offloading Model

Based on the monomial offloading power model [20, 29], the transmit power of device $k$ at its offloading time slot $t_{n}$ is

$$p_{k,n}=\frac{\lambda(r_{k,n})^{3}}{h_{k}}=\frac{\lambda(A_{k})^{3}}{h_{k}(\Delta t_{n})^{3}},\quad\forall k\in\mathcal{K},\forall n\in\{1,\cdots,N\},\qquad(1)$$

where $r_{k,n}=A_{k}/\Delta t_{n}$ is the transmission rate, $\lambda>0$ is the energy coefficient related to the bandwidth and the noise power, and the order $3$ is the monomial order associated with the coding scheme. Since the transmit power of devices comes from harvested energy, we have $\sum_{n=1}^{N}a_{k,n}\Delta t_{n}p_{k,n}\leq E_{k}^{H}$ $(\forall k\in\mathcal{K})$.
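Equation (1) and the causality constraint translate directly into code. A small sketch (names and numbers are ours) that also illustrates the cubic rate-power trade-off driving the time allocation:

```python
def offload_power(lam, A_k, h_k, dt_n):
    """Monomial model (1): p_{k,n} = lambda * r^3 / h_k with r = A_k / dt_n."""
    r = A_k / dt_n  # transmission rate in bits per second
    return lam * r ** 3 / h_k

def energy_causality_ok(lam, A_k, h_k, dt_n, E_harvested):
    """Offloading energy p_{k,n} * dt_n must not exceed the harvested energy."""
    return offload_power(lam, A_k, h_k, dt_n) * dt_n <= E_harvested

# Halving the slot doubles the rate, so the power grows by 2^3 = 8 times:
p_fast = offload_power(lam=1e-3, A_k=1e3, h_k=0.5, dt_n=0.5)
p_slow = offload_power(lam=1e-3, A_k=1e3, h_k=0.5, dt_n=1.0)
assert abs(p_fast - 8 * p_slow) < 1e-9 * p_fast
```

This cubic growth is why stretching the offloading slots (within the deadline $T$) saves transmit energy.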

II-C Computing Model

Refer to caption
Figure 2: Asynchronous computation resource allocation.

The MEC server is equipped with multiple CPUs such that the computational tasks offloaded from different devices can be executed in parallel [30]. Let $I_{k}$ be the computation intensity of task $k$ in terms of CPU cycles per bit. As shown in Fig. 2, to sufficiently utilize asynchronous computing, the computation resource of the server will be reallocated to the computational tasks offloaded from devices at each time slot from $t_{2}$ to $t_{N+1}$. Intuitively, the first uploaded task can occupy the whole computation capacity of the server before the second offloaded task arrives, while all the tasks compete for computation resource at time slot $t_{N+1}$. At an arbitrary time slot $m\in\{2,\cdots,N+1\}$, $f_{n,m}$, $\forall n\in\{1,\cdots,m-1\}$, is set to be the computation resource allocated to the task that arrives at the server at time slot $t_{n}$. Given these definitions, we have $\sum_{n=1}^{m-1}f_{n,m}\leq F_{\max}$ $(\forall m=2,\cdots,N+1)$, where $F_{\max}$ represents the maximum computation capacity of the MEC server. To complete the task computation for each device $k$, we have $\sum_{n=1}^{N}\sum_{m=n+1}^{N+1}a_{k,n}f_{n,m}\Delta t_{m}\geq F_{k}$ $(\forall k\in\mathcal{K})$, where $F_{k}=A_{k}I_{k}$ represents the computation cycles of device $k$. Besides, the energy consumption of the MEC server for computing all tasks can be formulated as $E_{MEC}=\sum_{n=1}^{N}\sum_{m=n+1}^{N+1}\kappa f_{n,m}^{3}\Delta t_{m}$, where $\kappa$ denotes the energy coefficient of the MEC server.
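The two computing-model constraints can be verified mechanically for a candidate allocation. Below is a minimal checker under our own naming, with tasks indexed by arrival order (task $n$ arrives at slot $t_{n}$ and may be served in slots $t_{n+1},\cdots,t_{N+1}$):

```python
def computing_constraints_ok(f, dt, F_req, F_max, tol=1e-9):
    """f[(n, m)]: CPU frequency given at slot t_m (m > n) to the task that
    arrived at slot t_n; dt[m]: duration of slot t_m; F_req[n-1]: required
    cycles of the n-th arriving task.  Checks the per-slot capacity
    constraint and the task-completion constraint."""
    N = len(F_req)
    for m in range(2, N + 2):  # slots t_2 .. t_{N+1}
        if sum(f.get((n, m), 0.0) for n in range(1, m)) > F_max + tol:
            return False       # capacity F_max exceeded in slot t_m
    for n in range(1, N + 1):
        done = sum(f.get((n, m), 0.0) * dt[m] for m in range(n + 1, N + 2))
        if done + tol < F_req[n - 1]:
            return False       # task n not finished by t_{N+1}
    return True

# Toy instance: two tasks, unit slots t_2 and t_3, server capacity 2.
f = {(1, 2): 1.0, (1, 3): 1.0, (2, 3): 1.0}
dt = [0.0, 0.0, 1.0, 1.0]
print(computing_constraints_ok(f, dt, F_req=[2.0, 1.0], F_max=2.0))
```

A dictionary keyed by $(n,m)$ keeps the triangular structure of $\boldsymbol{f}$ explicit without padding a square matrix.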

II-D Problem Formulation

Our goal is to minimize the MEC server’s energy consumption of completing the tasks offloaded by all devices, which is formulated as an optimization problem as

$$\min_{\Delta\boldsymbol{t},\boldsymbol{A},\boldsymbol{f}}\ \sum_{n=1}^{N}\sum_{m=n+1}^{N+1}\kappa f_{n,m}^{3}\Delta t_{m},\qquad(2)$$
$$\text{s.t.}\ \sum_{n=1}^{N}a_{k,n}\frac{\lambda(A_{k})^{3}}{h_{k}(\Delta t_{n})^{2}}\leq\sum_{n=1}^{N}\sum_{i=0}^{n-1}a_{k,n}\Delta t_{i}h_{k}\eta P_{0},\quad\forall k\in\mathcal{K},\qquad(2a)$$
$$\sum_{n=1}^{m-1}f_{n,m}\leq F_{\max},\quad\forall m=2,\cdots,N+1,\qquad(2b)$$
$$\sum_{n=1}^{N}\sum_{m=n+1}^{N+1}a_{k,n}f_{n,m}\Delta t_{m}\geq A_{k}I_{k},\quad\forall k\in\mathcal{K},\qquad(2c)$$
$$\sum_{i=0}^{N+1}\Delta t_{i}\leq T,\qquad(2d)$$
$$\sum_{k=1}^{K}a_{k,n}=1,\quad\forall n\in\{1,\cdots,N\},\qquad(2e)$$
$$\sum_{n=1}^{N}a_{k,n}=1,\quad\forall k\in\mathcal{K},\qquad(2f)$$
$$a_{k,n}\in\{0,1\},\quad\forall k\in\mathcal{K},\forall n\in\{1,\cdots,N\},\qquad(2g)$$

where $\Delta\boldsymbol{t}=[\Delta t_{0},\cdots,\Delta t_{N+1}]^{T}$, $\boldsymbol{f}=(f_{n,m})_{\forall n\in\{1,\cdots,N\},m\in\{n+1,\cdots,N+1\}}$, and $\boldsymbol{A}=(a_{k,n})_{K\times N}$. In (2), (2a) is the energy consumption causality constraint; (2b) represents the computation resource allocation constraint; (2c) ensures the completion of task computing; (2d) implies that the execution time of all devices should be less than $T$; and (2e)-(2g) are user scheduling constraints. Since the discrete user scheduling variables $a_{k,n}$ and the continuous resource allocation variables $\Delta t_{i}$, $f_{n,m}$ are highly coupled, problem (2) is a standard MINLP problem which is difficult to solve. To handle this issue, we first analyze the optimal computation resource allocation with given user scheduling and time allocation in Section III, based on which an efficient low-complexity computation frequency optimization algorithm is proposed. Finally, in Section IV, we propose a GBD-based algorithm to jointly optimize user scheduling and resource allocation so as to solve problem (2).

Footnote 1: For multi-server edge computing systems, new indicator variables can be introduced to denote the association between tasks and servers. The energy minimization problem can then be formulated as an MINLP problem containing two kinds of binary optimization variables, for task-server association and scheduling sequence, respectively. Despite being more complex, the problem can be solved efficiently using conventional MINLP methods such as convex relaxation and branch-and-bound, or the latest approaches using machine learning (see, e.g., [27]). It is worth noting that, with a given task-server association, the proposed algorithm in this work is still applicable to the scheduling and resource allocation optimization for each server. The detailed transmission protocol and algorithm procedure are left for future work.

III Analysis and Algorithm of the Optimal Computation Resource Allocation

In this section, we first analyze the properties of the optimal computation resource allocation, and then a low-complexity computation resource allocation algorithm is accordingly proposed. For ease of notation, we use $F_{n}$ $(\forall n\in\{1,\cdots,N\})$ to represent the computation cycles needed to complete the task that arrives at the server with order $n$. With a given time slot allocation vector $\Delta\boldsymbol{t}$ and user scheduling matrix $\boldsymbol{A}$, problem (2) is simplified as follows:

$$\min_{\{f_{n,m}\}}\ \sum_{n=1}^{N}\sum_{m=n+1}^{N+1}\kappa f_{n,m}^{3}\Delta t_{m},\qquad(3)$$
$$\text{s.t.}\ \sum_{n=1}^{m-1}f_{n,m}\leq F_{\max},\quad\forall m=2,\cdots,N+1,\qquad(3a)$$
$$\sum_{m=n+1}^{N+1}f_{n,m}\Delta t_{m}\geq F_{n},\quad\forall n\in\{1,\cdots,N\},\qquad(3b)$$
$$f_{n,m}\geq 0,\quad\forall n\in\{1,\cdots,N\},\forall m=n+1,\cdots,N+1.\qquad(3c)$$

Before solving problem (3), we provide the feasibility condition as follows.

Proposition 1.

Problem (3) is feasible if and only if $F_{\max}\geq\max_{n\in\{1,\cdots,N\}}\frac{\sum_{i=n}^{N}F_{i}}{\sum_{i=n+1}^{N+1}\Delta t_{i}}$.

Proof.

Please refer to Appendix A. ∎
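Proposition 1 is straightforward to evaluate numerically: for every arrival order $n$, the cycles still outstanding, $\sum_{i=n}^{N}F_{i}$, must fit into the computing time remaining after $t_{n}$, namely $F_{\max}\sum_{i=n+1}^{N+1}\Delta t_{i}$. A sketch under our own naming:

```python
def is_feasible(F, dt, F_max):
    """Proposition 1 feasibility test: F_max must cover the worst-case
    backlog  max_n (sum_{i=n}^{N} F_i) / (sum_{i=n+1}^{N+1} dt_i).
    F: cycles F_1..F_N as a 0-indexed list; dt: durations dt_0..dt_{N+1}."""
    N = len(F)
    worst = max(
        sum(F[i - 1] for i in range(n, N + 1))
        / sum(dt[i] for i in range(n + 1, N + 2))
        for n in range(1, N + 1)
    )
    return F_max >= worst

# Two tasks of 2 and 1 cycles, unit computing slots t_2 and t_3:
print(is_feasible([2.0, 1.0], [0.0, 0.0, 1.0, 1.0], F_max=2.0))
```

In this toy instance the binding order is $n=1$ (backlog $3$ cycles over $2$ seconds), so any $F_{\max}\geq 1.5$ is feasible.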

Denote $\{\alpha_{m}\}$, $\{\beta_{n}\}$, and $\{\gamma_{n,m}\}$ as the non-negative Lagrangian multipliers associated with the maximum frequency constraints (3a), the task computation completion constraints (3b), and the non-negative frequency constraints (3c), respectively. The optimal computation resource allocation is given by the following proposition.

Proposition 2.

Given the optimal $\{\alpha^{*}_{m}\}$ and $\{\beta^{*}_{n}\}$, the optimal solution of problem (3) is given by

$$f^{*}_{n,m}=\sqrt{\left[\frac{\beta^{*}_{n}}{3\kappa}-\frac{\alpha^{*}_{m}}{3\kappa\Delta t_{m}}\right]^{+}},\quad\forall n\in\mathcal{K},\forall m\in\{2,\cdots,K+1\}.\qquad(4)$$
Proof.

Since (4) can be directly obtained from the Karush-Kuhn-Tucker (KKT) conditions (Slater's condition holds for problem (3)), the proof is omitted here. ∎

According to Proposition 2, we can use the sub-gradient method to obtain the optimal $\{\alpha^{*}_{m}\}$ and $\{\beta^{*}_{n}\}$ so as to acquire the optimal computation resource allocation. To further reduce the computational complexity and provide some design insights, the properties of the optimal solution of problem (3) are summarized in the following theorem.

Theorem 3.

Denote $\digamma(i)=\sum_{n=1}^{i-2}\frac{F_{n}}{\sum_{m=n+1}^{K+1}\Delta t_{m}}+\frac{\sum_{n=i-1}^{K}F_{n}}{\sum_{m=i}^{K+1}\Delta t_{m}}$ $(2\leq i\leq K+1)$. The optimal computation resource has the following properties:

  • 1)

    The optimal solution of problem (3) satisfies $f_{n,n+1}^{*}=\cdots=f_{n,i}^{*}>\cdots>f^{*}_{n,j}=\cdots=f_{n,K+1}^{*}=0$ $(n+1\leq i<j\leq K+1)$, where $t_{i}$ is referred to as the “transition point”.

  • 2)

    The optimal $\{\alpha^{*}_{m}\}$ satisfies $0=\frac{\alpha^{*}_{2}}{\Delta t_{2}}=\cdots=\frac{\alpha^{*}_{i}}{\Delta t_{i}}<\frac{\alpha^{*}_{i+1}}{\Delta t_{i+1}}<\cdots<\frac{\alpha_{K+1}^{*}}{\Delta t_{K+1}}$.

  • 3)

    The transition point is $t_{i}$ $(3\leq i\leq K+1)$ if and only if $\digamma(i-1)\leq F_{\max}<\digamma(i)$.

Proof.

The proofs of 1), 2) and 3) are provided in Appendix B, C, D, respectively. ∎

According to property 1) in Theorem 3, the optimal frequency allocation scheme for a certain offloading task always follows a specific pattern: the frequency allocated to each device remains constant initially, then gradually decreases and eventually reaches zero. This property motivates us to deduce the condition under which $f^{*}_{n,i}>f^{*}_{n,i+1}$. Property 2) in Theorem 3 implies that the computation resource of the server is redundant at time slots from $t_{2}$ to $t_{i}$, while the maximum computation resource $F_{\max}$ is utilized at slots from $t_{i+1}$ to $t_{K+1}$.

According to property 1) in Theorem 3, unless $f^{*}_{n,n+1}=\cdots=f^{*}_{n,K+1}$ $(\forall n\in\mathcal{K})$, there always exists a special time slot $t_{\varkappa}$, called the “transition point”, such that $f^{*}_{n,n+1}=\cdots=f^{*}_{n,\varkappa-1}>f^{*}_{n,\varkappa}\geq\cdots\geq f^{*}_{n,K+1}$ $(3\leq\varkappa\leq K+1)$. The transition point indicates the number of time slots for which the computation resource remains the same. The computation resource decreases for all tasks at the transition point. The method to find the transition point when it exists is given by property 3) in Theorem 3.

Property 3) in Theorem 3 also shows that the transition point is impacted by the computation ability of the server. We can directly determine the transition point $t_{\varkappa}$ using property 3) in Theorem 3, without the need to solve problem (3). After determining the transition point, we have $\alpha^{*}_{m}=0$ $(2\leq m\leq\varkappa-1)$ according to property 2) in Theorem 3.
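Property 3) thus yields a direct search for the transition point: scan $i=3,\cdots,K+1$ and return the first $i$ with $\digamma(i-1)\leq F_{\max}<\digamma(i)$. A sketch under our own naming; `None` flags the abundant-resource case with no transition point:

```python
def digamma_fn(i, F, dt):
    """Theorem 3's function digamma(i) for 2 <= i <= K+1.
    F: cycles F_1..F_K (0-indexed list); dt: durations dt_0..dt_{K+1}."""
    K = len(F)
    head = sum(F[n - 1] / sum(dt[m] for m in range(n + 1, K + 2))
               for n in range(1, i - 1))
    tail = (sum(F[n - 1] for n in range(i - 1, K + 1))
            / sum(dt[m] for m in range(i, K + 2)))
    return head + tail

def transition_point(F, dt, F_max):
    """Return the transition index varkappa (3 <= varkappa <= K+1), or
    None when the server capacity is abundant and f* stays constant."""
    K = len(F)
    for i in range(3, K + 2):
        if digamma_fn(i - 1, F, dt) <= F_max < digamma_fn(i, F, dt):
            return i
    return None
```

For the toy instance $F=(2,1)$, $\Delta t_{2}=\Delta t_{3}=1$ we get $\digamma(2)=1.5$ and $\digamma(3)=2$, so $F_{\max}=1.6$ yields $\varkappa=3$ while $F_{\max}=2.5$ yields no transition point.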

Refer to caption
Figure 3: Properties of the optimal computation resource allocation.

Fig. 3 depicts an illustration of the properties of the optimal computation resource allocation. As can be seen, before the transition point $t_{\varkappa}$, the optimal $\alpha^{*}_{m}=0$ and $f^{*}_{n,m}$ remains unchanged for $m=2,\cdots,\varkappa-1$. Based on Theorem 3, a low-complexity algorithm is proposed in Algorithm 1. First, we check the feasibility of problem (3) according to Proposition 1. Then, we determine the transition point $t_{\varkappa}$ based on property 3) in Theorem 3. If there is no transition point, which means that the computation resource of the server is abundant, we can directly obtain the optimal solution $f^{*}_{n,n+1}=\cdots=f^{*}_{n,K+1}=\frac{F_{n}}{\sum_{m=n+1}^{K+1}\Delta t_{m}}$ for $n=1,\cdots,K$; otherwise, we obtain the transition point $t_{\varkappa}$ and have $\alpha^{*}_{m}=0$ for $m=2,\cdots,\varkappa-1$. Hence, we only need to find the optimal $\alpha^{*}_{m}$ $(m=\varkappa,\cdots,K+1)$ and $\beta^{*}_{n}$ $(n=1,\cdots,K)$. Note that with given $\beta^{*}_{n}$ $(n=1,\cdots,K)$, we can obtain the optimal $\alpha^{*}_{m}$ by solving the following $(K-\varkappa+2)$ equalities

$$G(m,\alpha_{m})\triangleq\sum_{n=1}^{m-1}\sqrt{\left[\frac{\beta_{n}}{3\kappa}-\frac{\alpha_{m}}{3\kappa\Delta t_{m}}\right]^{+}}=F_{\max},\quad(m=\varkappa,\cdots,K+1),\qquad(5)$$

since the maximum frequency is utilized at time slots from $t_{\varkappa}$ to $t_{K+1}$. Since $G(m,\alpha_{m})$ decreases with respect to $\alpha_{m}$, the bisection method is adopted. It should be noticed that $G(m,\alpha_{m})$ achieves its maximum value of $\sum_{n=1}^{m-1}\sqrt{\frac{\beta_{n}}{3\kappa}}$ when $\alpha_{m}=0$ and its minimum value of $0$ when $\frac{\alpha_{m}}{\Delta t_{m}}\geq\max_{n=1,\cdots,m-1}\beta_{n}$. Therefore, the upper bound of $\frac{\alpha_{m}}{\Delta t_{m}}$ is set as $\frac{\alpha_{m}^{ub}}{\Delta t_{m}}=\max_{n=1,\cdots,m-1}\beta_{n}$. For the lower bound, we set $\frac{\alpha_{m}^{lb}}{\Delta t_{m}}=\frac{\alpha_{m-1}}{\Delta t_{m-1}}$ according to property 2) in Theorem 3. After obtaining $\alpha_{m}$ for $m=\varkappa,\cdots,K+1$, $\beta_{n}$ is updated by a sub-gradient method [31], where $\phi_{n}$ is the dynamically chosen step size. By repeating Steps 5 to 13 until the objective of (3) converges, we obtain the optimal $\alpha^{*}_{m}$ for $m=2,\cdots,K+1$ and $\beta^{*}_{n}$ for $n=1,\cdots,K$.
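The inner bisection can be sketched as follows; we bisect directly on the ratio $a=\alpha_{m}/\Delta t_{m}$, which is what (4) and (5) depend on (names and the toy numbers below are ours, not from the paper):

```python
def solve_alpha_ratio(betas_active, F_max, kappa, lo=0.0, tol=1e-12):
    """Bisection on a = alpha_m / Delta t_m so that
    G(a) = sum_n sqrt(max((beta_n - a) / (3*kappa), 0)) = F_max,
    following Eq. (5); betas_active holds beta_n for the tasks n < m."""
    hi = max(betas_active)  # G vanishes at this ratio (upper bound)
    while hi - lo > tol:
        mid = (lo + hi) / 2.0
        G = sum((max(b - mid, 0.0) / (3.0 * kappa)) ** 0.5
                for b in betas_active)
        if G > F_max:
            lo = mid        # allocation still too large: raise alpha
        else:
            hi = mid
    return (lo + hi) / 2.0
```

For instance, with $3\kappa=1$ and active duals $\{4,1\}$, (5) becomes $\sqrt{4-a}+\sqrt{1-a}=F_{\max}$; for $F_{\max}=2$ the root is $a=0.9375$.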

The complexity of Algorithm 1 is $\mathcal{O}\left(\frac{(K+2-\varkappa)}{\sqrt{\epsilon_{1}}}\log_{2}(\frac{1}{\epsilon_{0}})\right)$, where $\epsilon_{0}$ denotes the accuracy of the bisection method and $\epsilon_{1}$ is the accuracy of the objective of problem (3). Compared with the complexity of $\mathcal{O}\left(\left(K^{2}+K\right)^{3.5}\right)$ of the interior point method, the complexity of the proposed algorithm is significantly reduced. Moreover, when $\varkappa$ is large, the complexity can be further reduced since more of the $\alpha^{*}_{m}$ are zero.

Algorithm 1: Optimal Computation Frequency Allocation Algorithm

1. If $F_{\max}\geq\max_{n\in\{1,\cdots,K\}}\frac{\sum_{i=n}^{K}F_{i}}{\sum_{i=n+1}^{K+1}\Delta t_{i}}$, go to Step 2; otherwise, problem (3) is infeasible.
2. According to Theorem 3, if there is no transition point, the optimal solution is given by $f^{*}_{n,n+1}=\cdots=f^{*}_{n,K+1}=\frac{F_{n}}{\sum_{m=n+1}^{K+1}\Delta t_{m}}$ for $n=1,\cdots,K$; otherwise, obtain the transition point $t_{\varkappa}$ and let $\alpha^{*}_{m}=0$ for $m=2,\cdots,\varkappa-1$.
3. Initialize $\beta_{n}=\left(\frac{F_{n}}{\sum_{m=n+1}^{K+1}\Delta t_{m}}\right)^{2}$ for $n=1,\cdots,K$ and the required precision $\epsilon_{0}$.
4. repeat
5.   for $m=\varkappa,\cdots,K+1$ do
6.     Let $\frac{\alpha_{m}^{lb}}{\Delta t_{m}}=\frac{\alpha_{m-1}}{\Delta t_{m-1}}$ and $\frac{\alpha_{m}^{ub}}{\Delta t_{m}}=\max_{n=1,\cdots,m-1}\beta_{n}$.
7.     while $\frac{\alpha_{m}^{ub}}{\Delta t_{m}}-\frac{\alpha_{m}^{lb}}{\Delta t_{m}}>\epsilon_{0}$ do
8.       Set $\frac{\alpha_{m}}{\Delta t_{m}}\leftarrow\left(\frac{\alpha_{m}^{lb}}{\Delta t_{m}}+\frac{\alpha_{m}^{ub}}{\Delta t_{m}}\right)/2$.
9.       Calculate $f_{n,m}=\sqrt{\left[\frac{\beta_{n}}{3\kappa}-\frac{\alpha_{m}}{3\kappa\Delta t_{m}}\right]^{+}}$.
10.      If $\sum_{n=1}^{m-1}f_{n,m}>F_{\max}$, let $\frac{\alpha_{m}^{lb}}{\Delta t_{m}}\leftarrow\frac{\alpha_{m}}{\Delta t_{m}}$; otherwise, let $\frac{\alpha_{m}^{ub}}{\Delta t_{m}}\leftarrow\frac{\alpha_{m}}{\Delta t_{m}}$.
11.    end while
12.  end for
13.  Update $\beta_{n}\leftarrow\left[\beta_{n}+\phi_{n}\left(F_{n}-\sum_{m=n+1}^{K+1}f_{n,m}\Delta t_{m}\right)\right]^{+}$ for $n=1,\cdots,K$.
14. until the objective of (3) converges.
15. Output the optimal $\{f^{*}_{n,m}\}$.

IV Joint User Scheduling and Resource Allocation Algorithm

In this section, we employ the GBD method to solve problem (2). The core idea of the GBD method is to decompose the original MINLP problem into a primal problem related to the continuous variables and a master problem associated with the integer variables, which are solved iteratively (interested readers may refer to [29, 32, 33, 34] for details). Specifically, for problem (2), the primal problem is a joint communication and computation resource optimization problem with fixed user scheduling. The master problem optimizes user scheduling by utilizing the optimal solutions and dual variables of the primal problem. Next, we describe the detailed procedures.

IV-A Primal Problem

With given user scheduling 𝑨\boldsymbol{A}, problem (2) is reduced to the following optimization problem:

$$\min_{\Delta\boldsymbol{t},\boldsymbol{f}}\ \sum_{n=1}^{K}\sum_{m=n+1}^{K+1}\kappa f_{n,m}^{3}\Delta t_{m},\qquad(6)$$
$$\text{s.t.}\ \frac{\lambda(A_{\pi_{n}})^{3}}{h_{\pi_{n}}(\Delta t_{n})^{2}}\leq\sum_{i=0}^{n-1}\Delta t_{i}h_{\pi_{n}}\eta P_{0},\quad\forall n\in\mathcal{K},\qquad(6a)$$
$$\sum_{m=n+1}^{K+1}f_{n,m}\Delta t_{m}\geq A_{\pi_{n}}I_{\pi_{n}},\quad\forall n\in\mathcal{K},\qquad(6b)$$
$$\text{(2b)},\ \text{(2d)},\qquad(6c)$$

where πn\pi_{n} denotes the index of the nn-th offloading device, i.e., we have πn=k\pi_{n}=k if ak,n=1a_{k,n}=1. Since the user scheduling scheme 𝑨\boldsymbol{A} is known, the value of πn,(n𝒦)\pi_{n},(\forall n\in\mathcal{K}) can be deduced and substituted into problem (2). Since problem (6) is non-convex due to the constraints (6a), (6b) and the objective, we introduce xn,m=fn,mΔtm(n=1,,K,m=n+1,,K+1)x_{n,m}=f_{n,m}\Delta t_{m}(\forall n=1,\cdots,K,\forall m=n+1,\cdots,K+1) to represent the computation amount of the nn-th offloading task in time slot mm. Hence, problem (6) is equivalent to

minΔ𝒕,𝒙\displaystyle\min_{\Delta\boldsymbol{t},\boldsymbol{x}}\quad n=1Km=n+1K+1κ(xn,m)3(Δtm)2,\displaystyle\sum_{n=1}^{K}\sum_{m=n+1}^{K+1}\kappa\frac{(x_{n,m})^{3}}{(\Delta t_{m})^{2}}, (7)
s.t.\displaystyle s.t.\quad n=1m1xn,mFmaxΔtm,m=2,,K+1,\displaystyle\sum_{n=1}^{m-1}x_{n,m}\leq F_{\max}\Delta t_{m},\forall m=2,\cdots,K+1, (7a)
m=n+1K+1xn,mAπnIπn,n𝒦,\displaystyle\sum_{m=n+1}^{K+1}x_{n,m}\geq A_{\pi_{n}}I_{\pi_{n}},\quad\forall n\in\mathcal{K}, (7b)
(2d),(6a),\displaystyle\text{(\ref{P1}d)},\text{(\ref{P2}a)}, (7c)

where 𝒙=(xn,m)n𝒦,m{n+1,,K+1}\boldsymbol{x}=(x_{n,m})_{\forall n\in\mathcal{K},m\in\{n+1,\cdots,K+1\}} is the collection of xn,mx_{n,m}. It can be proved that problem (7) is convex by using the perspective-function technique [35]. To further provide useful insights and reduce computational complexity, we utilize the block coordinate descent (BCD) method to iteratively optimize the time allocation and the computation resource. Since a low-complexity computation resource allocation algorithm with given time allocation has been provided in Algorithm 1, we next propose a time allocation algorithm with fixed computation resource allocation.
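The BCD alternation described above can be organized as a short driver loop. Here `update_frequencies` and `update_time_slots` stand in for Algorithm 1 and the Proposition-4 update respectively; they are hypothetical callables, shown only to fix the control flow, not the paper's solver.

```python
def bcd_solve(objective, update_frequencies, update_time_slots,
              t_init, tol=1e-5, max_iter=100):
    """Block coordinate descent: alternately optimize the computation
    frequencies f (time slots fixed) and the time slots Delta t
    (frequencies fixed) until the objective stops improving."""
    t = t_init
    f = update_frequencies(t)
    prev = objective(f, t)
    for _ in range(max_iter):
        t = update_time_slots(f)      # Proposition 4 step (placeholder)
        f = update_frequencies(t)     # Algorithm 1 step (placeholder)
        cur = objective(f, t)
        if abs(prev - cur) < tol:
            break
        prev = cur
    return f, t
```

Each block update cannot increase the convex objective of problem (7), so the sequence of objective values converges.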

The Lagrangian function of problem (7) with respect to Δ𝒕\Delta\boldsymbol{t} is given by

=\displaystyle\mathcal{L}= n=1Km=n+1K+1κ(xn,m)3(Δtm)2+n=1Kρn(λ(Aπn)3hπn(Δtn)2i=0n1ΔtihπnηP0)\displaystyle\sum_{n=1}^{K}\sum_{m=n+1}^{K+1}\!\!\kappa\frac{(x_{n,m})^{3}}{(\Delta t_{m})^{2}}\!+\!\sum_{n=1}^{K}\rho_{n}\!\left(\!\frac{\lambda(A_{\pi_{n}})^{3}}{h_{\pi_{n}}(\Delta t_{n})^{2}}\!-\!\sum_{i=0}^{n-1}\Delta t_{i}h_{\pi_{n}}\eta P_{0}\right)
+ξ(i=0K+1ΔtiT)+m=2K+1ωm(n=1m1xn,mFmaxΔtm),\displaystyle+\xi\left(\sum_{i=0}^{K+1}\Delta t_{i}-T\right)+\sum_{m=2}^{K+1}\omega_{m}\left(\sum_{n=1}^{m-1}x_{n,m}-F_{\max}\Delta t_{m}\right), (8)

where ρn\rho_{n}, ωm\omega_{m} and ξ\xi are dual variables related to constraints (6a), (7a) and (2d), respectively. Taking the derivative with respect to Δ𝒕\Delta\boldsymbol{t}, we have

Δt0\displaystyle\frac{\partial\mathcal{L}}{\partial\Delta t_{0}^{*}} =n=1KρnhπnηP0+ξ=0,\displaystyle=-\sum_{n=1}^{K}\rho^{*}_{n}h_{\pi_{n}}\eta P_{0}+\xi^{*}=0, (9)
Δt1\displaystyle\frac{\partial\mathcal{L}}{\partial\Delta t^{*}_{1}} =ρ12λ(Aπ1)3hπ1(Δt1)3n=2KρnhπnηP0+ξ=0,\displaystyle=-\rho^{*}_{1}\frac{2\lambda(A_{\pi_{1}})^{3}}{h_{\pi_{1}}(\Delta t^{*}_{1})^{3}}-\sum_{n=2}^{K}\rho^{*}_{n}h_{\pi_{n}}\eta P_{0}+\xi^{*}=0, (10)
Δti\displaystyle\frac{\partial\mathcal{L}}{\partial\Delta t^{*}_{i}} =n=1i12κ(xn,i)3(Δti)3ρi2λ(Aπi)3hπi(Δti)3n=i+1KρnhπnηP0\displaystyle=-\sum_{n=1}^{i-1}2\kappa\frac{(x_{n,i})^{3}}{(\Delta t^{*}_{i})^{3}}-\rho^{*}_{i}\frac{2\lambda(A_{\pi_{i}})^{3}}{h_{\pi_{i}}(\Delta t^{*}_{i})^{3}}-\sum_{n=i+1}^{K}\rho^{*}_{n}h_{\pi_{n}}\eta P_{0}
+ξωiFmax=0,(2iK1),\displaystyle\quad\ +\xi^{*}-\omega^{*}_{i}F_{\max}=0,\quad(2\leq i\leq K-1), (11)
ΔtK\displaystyle\frac{\partial\mathcal{L}}{\partial\Delta t^{*}_{K}} =n=1K12κ(xn,K)3(ΔtK)3ρK2λ(AπK)3hπK(ΔtK)3+ξωKFmax=0,\displaystyle\!=\!-\!\!\sum_{n=1}^{K-1}\!2\kappa\frac{(x_{n,K})^{3}}{(\Delta t^{*}_{K})^{3}}\!-\!\rho^{*}_{K}\frac{2\lambda(A_{\pi_{K}})^{3}}{h_{\pi_{K}}(\Delta t^{*}_{K})^{3}}\!+\!\xi^{*}\!-\!\omega^{*}_{K}F_{\max}\!=\!0, (12)
ΔtK+1\displaystyle\frac{\partial\mathcal{L}}{\partial\Delta t^{*}_{K+1}} =n=1K2κ(xn,K+1)3(ΔtK+1)3+ξωK+1Fmax=0.\displaystyle=-\sum_{n=1}^{K}2\kappa\frac{(x_{n,K+1})^{3}}{(\Delta t^{*}_{K+1})^{3}}+\xi^{*}-\omega^{*}_{K+1}F_{\max}=0. (13)

By solving the above equations, the optimal Δti\Delta t^{*}_{i} (i)\left(\forall i\right) is obtained, as given in the following proposition.

Proposition 4.

The optimal Δ𝒕\Delta\boldsymbol{t}^{*} is given by

Δt0=Ti=1K+1Δti,\displaystyle\Delta t_{0}^{*}=T-\sum_{i=1}^{K+1}\Delta t_{i}^{*}, (14)
Δti=2κhπin=1i1(xn,i)3+2ρiλ(Aπi)3ξhπin=i+1Kρn(hπn)2ηP0ωihπiFmax3,\displaystyle\Delta t^{*}_{i}=\sqrt[3]{\frac{2\kappa h_{\pi_{i}}\sum_{n=1}^{i-1}(x_{n,i})^{3}+2\rho^{*}_{i}\lambda(A_{\pi_{i}})^{3}}{\xi^{*}h_{\pi_{i}}-\sum_{n=i+1}^{K}\rho^{*}_{n}(h_{\pi_{n}})^{2}\eta P_{0}-\omega^{*}_{i}h_{\pi_{i}}F_{\max}}},
(i{2,,K1}),\displaystyle\quad\quad\quad\quad\quad\quad\quad\quad\quad(\forall i\in\{2,\cdots,K-1\}), (15)
ΔtK=2κhπKn=1K1(xn,K)3+2ρKλ(AπK)3ξhπKωKhπKFmax3,\displaystyle\Delta t^{*}_{K}=\sqrt[3]{\frac{2\kappa h_{\pi_{K}}\sum_{n=1}^{K-1}(x_{n,K})^{3}+2\rho_{K}^{*}\lambda(A_{\pi_{K}})^{3}}{\xi^{*}h_{\pi_{K}}-\omega^{*}_{K}h_{\pi_{K}}F_{\max}}}, (16)
ΔtK+1=2κn=1K(xn,K+1)3ξωK+1Fmax3,\displaystyle\Delta t_{K+1}^{*}=\sqrt[3]{\frac{2\kappa\sum_{n=1}^{K}(x_{n,K+1})^{3}}{\xi^{*}-\omega^{*}_{K+1}F_{\max}}}, (17)

and Δt1\Delta t_{1}^{*} is the null point of Ψ(x)\Psi(x), where Ψ(x)=(Ti=1K+1Δti)x2(hπ1)2ηP0λ(Aπ1)3\Psi(x)=\left(T-\sum_{i=1}^{K+1}\Delta t_{i}\right)x^{2}(h_{\pi_{1}})^{2}\eta P_{0}-\lambda(A_{\pi_{1}})^{3}, x(0,23(Ti=2K+1Δti)]x\in\Big{(}0,\frac{2}{3}\left(T-\sum_{i=2}^{K+1}\Delta t_{i}\right)\Big{]}.

Proof.

The proof of Proposition 4 is provided in Appendix H. ∎
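As an illustration of Proposition 4, the last slot in (17) and the root of Psi(x) that defines Delta t*_1 can be evaluated as below. All dual variables and constants are placeholder inputs, and a root of Psi is assumed to exist in the stated interval; this is a sketch under those assumptions, not the paper's solver.

```python
import math

def dt_last(x_last, kappa, xi, omega_last, F_max):
    """Eq. (17): Delta t*_{K+1} = cbrt(2*kappa*sum_n x_{n,K+1}^3
    / (xi - omega_{K+1}*F_max))."""
    num = 2 * kappa * sum(x ** 3 for x in x_last)
    return (num / (xi - omega_last * F_max)) ** (1.0 / 3.0)

def dt1_root(T, dt_rest, h1, eta, P0, lam, A1, tol=1e-10):
    """Bisection for the null point of
    Psi(x) = (T - sum_i Delta t_i) * x^2 * h1^2 * eta * P0 - lam * A1^3,
    where Delta t_1 = x and dt_rest = sum_{i=2}^{K+1} Delta t_i.
    Psi is increasing on (0, 2/3*(T - dt_rest)]."""
    def psi(x):
        return (T - x - dt_rest) * x ** 2 * h1 ** 2 * eta * P0 - lam * A1 ** 3

    lo, hi = tol, 2.0 / 3.0 * (T - dt_rest)
    while hi - lo > tol:
        mid = (lo + hi) / 2
        if psi(mid) < 0:
            lo = mid
        else:
            hi = mid
    return hi
```

The cubic (T - dt_rest - x) * x^2 peaks exactly at x = 2/3*(T - dt_rest), which is why Psi is monotone on the domain stated in the proposition and plain bisection suffices.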

Through iteratively optimizing the time allocation and the computation resource allocation, we can obtain the optimal solution of the primal problem (7). If problem (7) is infeasible, however, we formulate the corresponding 1\ell_{1}-minimization problem as follows:

minΔ𝒕,𝒙,𝜻>0,𝜾>0\displaystyle\min_{\Delta\boldsymbol{t},\boldsymbol{x},\boldsymbol{\zeta}>0,\boldsymbol{\iota}>0} k=1K(ζk+ιk),\displaystyle\sum_{k=1}^{K}\left(\zeta_{k}+\iota_{k}\right), (18)
s.t.\displaystyle s.t.\quad\ n=1Kak,nλ(Ak)3hk(Δtn)2ζk+n=1Ki=0n1ak,nΔtihkηP0,\displaystyle\sum_{n=1}^{K}a_{k,n}\frac{\lambda(A_{k})^{3}}{h_{k}(\Delta t_{n})^{2}}\!\leq\!\zeta_{k}\!+\!\sum_{n=1}^{K}\sum_{i=0}^{n-1}a_{k,n}\Delta t_{i}h_{k}\eta P_{0},
k𝒦,\displaystyle\quad\quad\quad\quad\quad\quad\quad\quad\quad\quad\quad\quad\forall k\in\mathcal{K}, (18a)
ιk+n=1Km=n+1K+1ak,nxn,mAkIk,k𝒦,\displaystyle\iota_{k}\!+\!\sum_{n=1}^{K}\sum_{m=n+1}^{K+1}\!a_{k,n}x_{n,m}\!\geq\!A_{k}I_{k},\forall k\in\mathcal{K}, (18b)
(2d),(7a).\displaystyle\text{(\ref{P1}d)},\text{(\ref{P2.1}a)}. (18c)

Since problem (18) is convex and always feasible, we can use the interior point method to obtain the optimal solution and corresponding dual variables.

Furthermore, since the user scheduling is fixed, the solution of the primal problem always provides a performance upper bound for problem (2). The upper bound is then updated as UB(j)min{UB(j1),f(j)}UB^{(j)}\leftarrow\min\{UB^{(j-1)},f^{(j)}\}, where f(j)f^{(j)} denotes the objective value of the primal problem (6) at the jj-th iteration. As can be seen, the upper bound is non-increasing as the iteration proceeds. Subsequently, we construct the master problem using the solutions and dual variables of the primal problem (7) and the feasibility problem (18).

IV-B Master Problem

At each iteration, an optimality cut or a feasibility cut is added to the master problem, depending on whether the primal problem is feasible. Denote 𝒥1\mathcal{J}_{1} and 𝒥2\mathcal{J}_{2} as the sets of iteration indices for which the primal problem is feasible and infeasible, respectively. Specifically, the optimality cut for each feasible iteration j𝒥1j\in\mathcal{J}_{1} is defined as

θ(𝑨,𝝆(j),𝜷(j))=n=1Km=n+1K+1κ(xn,m(j))3(Δtm(j))2\displaystyle\theta(\boldsymbol{A},\boldsymbol{\rho}^{(j)},\boldsymbol{\beta}^{(j)})=\sum_{n=1}^{K}\sum_{m=n+1}^{K+1}\kappa\frac{\left(x^{(j)}_{n,m}\right)^{3}}{\left(\Delta t^{(j)}_{m}\right)^{2}}
+k=1Kρk(j)(n=1Kak,nλ(Ak)3hk(Δtn(j))2n=1Ki=0n1ak,nΔti(j)hkηP0)\displaystyle+\sum_{k=1}^{K}\rho^{(j)}_{k}\left(\sum_{n=1}^{K}a_{k,n}\frac{\lambda(A_{k})^{3}}{h_{k}\left(\Delta t^{(j)}_{n}\right)^{2}}-\sum_{n=1}^{K}\sum_{i=0}^{n-1}a_{k,n}\Delta t^{(j)}_{i}h_{k}\eta P_{0}\right)
+k=1Kβk(j)(AkIkn=1Km=n+1K+1ak,nxn,m(j)),\displaystyle+\sum_{k=1}^{K}\beta_{k}^{(j)}\left(A_{k}I_{k}-\sum_{n=1}^{K}\sum_{m=n+1}^{K+1}a_{k,n}x^{(j)}_{n,m}\right), (19)

where ρk(j)\rho_{k}^{(j)} and βk(j)\beta_{k}^{(j)} represent the dual variables of the primal problem at the jj-th iteration, and xn,m(j)x_{n,m}^{(j)} and Δtm(j)\Delta t^{(j)}_{m} denote the solution of the primal problem at the jj-th iteration. The terms irrelevant to 𝑨\boldsymbol{A} are omitted based on the complementary slackness theorem [31]. Similarly, the feasibility cut for each infeasible iteration j𝒥2j\in\mathcal{J}_{2} is defined as

θ^(𝑨,𝝆^(j),𝜷^(j))=k=1Kβ^k(j)(AkIkn=1Km=n+1K+1ak,nx^n,m(j))\displaystyle\hat{\theta}(\boldsymbol{A},\hat{\boldsymbol{\rho}}^{(j)},\hat{\boldsymbol{\beta}}^{(j)})=\sum_{k=1}^{K}\hat{\beta}_{k}^{(j)}\left(A_{k}I_{k}-\sum_{n=1}^{K}\sum_{m=n+1}^{K+1}a_{k,n}\hat{x}^{(j)}_{n,m}\right)
+k=1Kρ^k(j)(n=1Kak,nλ(Ak)3hk(Δt^n(j))2n=1Ki=0n1ak,nΔt^i(j)hkηP0),\displaystyle+\sum_{k=1}^{K}\hat{\rho}^{(j)}_{k}\left(\sum_{n=1}^{K}a_{k,n}\frac{\lambda(A_{k})^{3}}{h_{k}\left(\Delta\hat{t}^{(j)}_{n}\right)^{2}}-\sum_{n=1}^{K}\sum_{i=0}^{n-1}a_{k,n}\Delta\hat{t}^{(j)}_{i}h_{k}\eta P_{0}\right), (20)

where ρ^k(j)\hat{\rho}_{k}^{(j)} and β^k(j)\hat{\beta}_{k}^{(j)} represent the dual variables of the feasibility problem at the jj-th iteration, and x^n,m(j)\hat{x}_{n,m}^{(j)} and Δt^m(j)\Delta\hat{t}^{(j)}_{m} denote the solution of the feasibility problem at the jj-th iteration. Therefore, the master problem is formulated as

min𝑨,ψ\displaystyle\min_{\boldsymbol{A},\psi}\quad ψ,\displaystyle\psi, (21)
s.t.\displaystyle s.t.\quad θ(𝑨,𝝆(j),𝜷(j))ψ,j𝒥1,\displaystyle\theta(\boldsymbol{A},\boldsymbol{\rho}^{(j)},\boldsymbol{\beta}^{(j)})\leq\psi,\quad\forall j\in\mathcal{J}_{1}, (21a)
θ^(𝑨,𝝆^(j),𝜷^(j))0,j𝒥2,\displaystyle\hat{\theta}(\boldsymbol{A},\hat{\boldsymbol{\rho}}^{(j)},\hat{\boldsymbol{\beta}}^{(j)})\leq 0,\quad\forall j\in\mathcal{J}_{2}, (21b)
(2e)(6g).\displaystyle\text{(\ref{P1}e)}-\text{(\ref{P2}g)}. (21c)

In particular, (21a) and (21b) denote the sets of hyperplanes spanned by the optimality cuts and feasibility cuts accumulated up to the jj-th iteration, respectively. The two types of cuts are exploited to reduce the search region for the global optimal solution [36]. The master problem (21) is a standard mixed-integer linear programming (MILP) problem, which can be solved by numerical solvers such as Gurobi [37] and Mosek [38]. Since the master problem is a relaxation of the MINLP problem (2), solving it provides a performance lower bound for problem (2), given by LB(j)ψLB^{(j)}\leftarrow\psi. Because each iteration adds a cut (optimality cut or feasibility cut) that narrows the feasible region of the master problem, the lower bound is non-decreasing. Consequently, the upper bound obtained from the primal problem is non-increasing and the lower bound obtained from the master problem is non-decreasing with the iteration index, so the two bounds are guaranteed to converge [29]. Therefore, by iteratively solving the primal problem and the master problem, we obtain the optimal solution once the upper and lower bounds are sufficiently close [33, 36]. The detailed algorithm is summarized in Algorithm 2.

IV-C Complexity Analysis

The complexity of solving problem (2) by Algorithm 2 lies in solving the primal problem, the feasibility problem, and the master problem at each iteration. For the primal problem, we iteratively update the time allocation variables and the frequency variables. The frequency optimization method is given in Algorithm 1, whose complexity is 𝒪(K+2ϰϵ1log2(1/ϵ0))\mathcal{O}\left(\frac{K+2-\varkappa}{\sqrt{\epsilon_{1}}}\log_{2}(1/\epsilon_{0})\right) as analyzed in Section III. The time allocation is optimized according to Proposition 4, whose complexity is estimated as 𝒪(Klog2(T))\mathcal{O}\left(K\log_{2}(T)\right). Therefore, the total complexity of solving the primal problem is 𝒪(K+2ϰϵ1log2(1/ϵ0)Klog2(T)L1)\mathcal{O}\left(\frac{K+2-\varkappa}{\sqrt{\epsilon_{1}}}\log_{2}(1/\epsilon_{0})K\log_{2}(T)L_{1}\right), where L1L_{1} denotes the number of BCD iterations for the primal problem. For the feasibility problem, the complexity of the interior point method is 𝒪(((K+1)K2+3K+1)3.5)\mathcal{O}\left(\left(\frac{(K+1)K}{2}+3K+1\right)^{3.5}\right). For the master problem, the computational complexity of the Branch and Bound (BnB) method is 𝒪(2K)\mathcal{O}\left(2^{K}\right) [39].
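To make the complexity orders above concrete, the following sketch evaluates the three dominant terms for K = 10; the transition point varkappa and the accuracies are assumed example values (the accuracies match those used in the simulations).

```python
import math

K = 10
varkappa = 5        # assumed transition point index (example value)
eps0 = eps1 = 1e-5  # solution accuracies, as in the simulations

# Algorithm 1: O((K + 2 - varkappa)/sqrt(eps1) * log2(1/eps0))
alg1 = (K + 2 - varkappa) / math.sqrt(eps1) * math.log2(1 / eps0)

# Feasibility problem via the interior point method:
# O(((K+1)K/2 + 3K + 1)^3.5)
feas = ((K + 1) * K / 2 + 3 * K + 1) ** 3.5

# Master problem via branch and bound: O(2^K)
master = 2 ** K

print(f"Algorithm 1 term:     {alg1:.3g}")
print(f"Interior-point term:  {feas:.3g}")
print(f"BnB term:             {master}")
```

For these values the interior-point term dominates the per-iteration cost, which is consistent with the complexity reduction reported in Fig. 7.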

1
2Initialize arbitrary feasible user scheduling 𝑨(j)\boldsymbol{A}^{(j)}, and set j=1j=1, UB=+UB=+\infty, LB=LB=-\infty, 𝒥1=𝒥2=\mathcal{J}_{1}=\mathcal{J}_{2}=\emptyset;
3repeat
4      
5      if problem (7) is feasible then
6            
7            repeat
8                  
9                  Obtain the optimal computation resource 𝒇(j)\boldsymbol{f}^{(j)} and dual variable βk(j)\beta_{k}^{(j)} according to Algorithm 1;
10                  Obtain the optimal time allocation Δ𝒕(j)\Delta\boldsymbol{t}^{(j)} and dual variable ρk(j)\rho_{k}^{(j)};
11            until the objective of (7) converges;
12            Update UBUB and 𝒥1\mathcal{J}_{1};
13      else
14             Solve feasibility problem (18) and update 𝒥2\mathcal{J}_{2};
15            Obtain the corresponding optimal solution 𝒙^(j)\hat{\boldsymbol{x}}^{(j)} and Δ𝒕^(j)\Delta\hat{\boldsymbol{t}}^{(j)} as well as dual variables ρ^k(j)\hat{\rho}_{k}^{(j)} and β^k(j)\hat{\beta}_{k}^{(j)};
16       end if
17      
18      Solve master problem (21) by adding optimality cuts (IV-B) and feasibility cuts (IV-B);
19      Set jj+1j\leftarrow j+1;
20      Update 𝑨(j)\boldsymbol{A}^{(j)} and LBLB;
21until UBUB and LBLB are sufficiently close;
Output the optimal 𝒇\boldsymbol{f}^{*}, Δ𝒕\Delta\boldsymbol{t}^{*} and 𝑨\boldsymbol{A}^{*}.
Algorithm 2 Joint User Scheduling and Resource Allocation Algorithm
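Algorithm 2's outer loop can be summarized by the following driver. `solve_primal` is assumed to return an objective and a cut when the primal problem is feasible, and `(None, cut)` otherwise, while `solve_master` returns the new scheduling and the bound psi; both are hypothetical interfaces that merely mirror steps 3–20, not the paper's implementation.

```python
def gbd(solve_primal, solve_master, A_init, tol=1e-5, max_iter=50):
    """Generalized Benders decomposition driver: a feasible primal yields
    a non-increasing upper bound and an optimality cut; an infeasible
    primal yields a feasibility cut; the master MILP yields the next
    scheduling and a non-decreasing lower bound."""
    A, UB, LB = A_init, float("inf"), float("-inf")
    opt_cuts, feas_cuts = [], []
    best = A
    for _ in range(max_iter):
        obj, cut = solve_primal(A)
        if obj is not None:            # primal feasible: optimality cut
            opt_cuts.append(cut)
            if obj < UB:
                UB, best = obj, A
        else:                          # primal infeasible: feasibility cut
            feas_cuts.append(cut)
        A, psi = solve_master(opt_cuts, feas_cuts)
        LB = psi
        if UB - LB < tol:
            break
    return best, UB, LB
```

The loop terminates when the non-increasing upper bound and the non-decreasing lower bound are sufficiently close, exactly as in line 21 of Algorithm 2.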

V Simulations

In this section, we perform simulations to validate the proposed scheme and algorithm. There are K=10K=10 devices around the server. The task size AkA_{k} and the computation intensity obey uniform distributions on [10,50][10,50] Kbits and [500,1500][500,1500] cycles/bit, respectively. The transmit power of the BS is P0=3P_{0}=3 W. The energy coefficient of the MEC server and the energy conversion factor of the devices are set as κ=1026\kappa=10^{-26} and η=0.51\eta=0.51, respectively. We set the energy constant of transmission to λ=1025\lambda=10^{-25}. Furthermore, the maximum computation resource is Fmax=1F_{\max}=1 GHz and the allowable delay is T=1T=1 second. In the channel model, we set the antenna gain A=3A=3, carrier frequency fc=915f_{c}=915 MHz, path-loss factor =3\ell=3, speed of light c=3×108c=3\times 10^{8} m/s, and Rician factor γ=0.3\gamma=0.3. The following benchmark schemes are considered:

  • JSORA[26]: The joint sensing-and-offloading resource allocation algorithm, where the frequency allocated to each device remains unchanged during its computation duration, i.e., fn,n+1==fn,K+1f_{n,n+1}=\cdots=f_{n,K+1} (n𝒦)(\forall n\in\mathcal{K}).

  • JCCRM-Sync[14, 15, 16, 17, 11, 12, 13]: The joint communication and computation resource management algorithm adopted by most of the literature, where the server does not begin computing until all tasks are received.

  • Random scheduling scheme[25, 22]: We randomly set the user scheduling for offloading.

  • Exhaustive search: We randomly choose multiple initial points for Algorithm 2 and select the smallest result as output. The results of the exhaustive search method can be regarded as the global optimal solutions.

For fairness, all accuracies used in the simulations are set to 10510^{-5}.
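A minimal sketch of the simulation setup: drawing the task sizes and computation intensities from the stated uniform distributions and collecting the stated constants. The draws are random, so only the ranges can be checked.

```python
import random

K = 10
random.seed(0)

# Task size A_k ~ U[10, 50] Kbits; intensity I_k ~ U[500, 1500] cycles/bit
A = [random.uniform(10e3, 50e3) for _ in range(K)]   # bits
I = [random.uniform(500, 1500) for _ in range(K)]    # cycles/bit
cycles = [a * i for a, i in zip(A, I)]               # required CPU cycles

P0 = 3.0          # BS transmit power (W)
kappa = 1e-26     # server energy coefficient
eta = 0.51        # device energy conversion factor
lam = 1e-25       # transmission energy constant
F_max = 1e9       # maximum computation resource (Hz)
T = 1.0           # allowable delay (s)
```

With these draws, the heterogeneity of task sizes and intensities that the scheduling exploits is immediately visible in `cycles`.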

Refer to caption
Figure 4: Energy consumption versus number of devices.
Refer to caption
Figure 5: Energy consumption versus maximum delay.

Fig. 4 demonstrates the energy consumption of different schemes under different numbers of devices. We observe that the energy consumption of the proposed scheme as well as the benchmark schemes increases with the number of devices. This is because devices have to compete for fixed communication and computation resources. As the number of devices increases, the average transmission time and computation time of each device decrease, so the average computation resource per task grows. Therefore, more server energy is required to finish the devices’ tasks within the required delay. Moreover, as can be seen in Fig. 4, the gap between the proposed algorithm and the exhaustive search scheme is small, which indicates that the proposed algorithm achieves close-to-optimal solutions. Compared with the JSORA scheme, the random scheduling scheme and the JCCRM-Sync scheme, the proposed scheme achieves 30.88%30.88\%, 19.51%19.51\% and 87.87%87.87\% energy reductions, respectively. This is because the proposed algorithm takes full advantage of the flexibility of asynchronous computing and user scheduling. In particular, compared with the proposed scheme, the JCCRM-Sync scheme wastes the idle computation resource from time slots t2t_{2} to tKt_{K}. Similarly, the JSORA scheme cannot make full use of the computation resource from time slots t2t_{2} to tKt_{K}; hence, its performance is better than JCCRM-Sync but worse than the proposed scheme. Additionally, random scheduling, as adopted in most of the existing literature, cannot well exploit the heterogeneity of task sizes and computation intensities in MEC networks.

In Fig. 5, we depict the energy consumption of different schemes versus the maximum allowable delay. As can be seen, the energy consumption of all schemes decreases as the maximum delay becomes large. This is because, with a larger delay, the server has more time to finish the tasks, so less computation resource is needed and the energy consumption can be reduced. From Fig. 5, it can be verified that the proposed algorithm outperforms the benchmark schemes in terms of energy consumption over the considered delay region, especially in resource-scarce scenarios. Indeed, Fig. 4 and Fig. 5 show that the energy gap between the proposed algorithm and the benchmark schemes shrinks when resources are abundant, because the drawbacks of the benchmark schemes can be partially compensated by the additional resources. Furthermore, it should be noticed that the JSORA scheme is equivalent to the proposed scheme when the computation resource is abundant, according to Theorem 3.

Refer to caption
(a) Transition point: t3t_{3}
Refer to caption
(b) Transition point: t5t_{5}
Refer to caption
(c) Transition point: t6t_{6}
Refer to caption
(d) No transition point
Figure 6: Illustrations of different transition points under different maximum frequencies.

Fig. 6 illustrates a specific case of the frequency allocated to each device in each computation slot under different maximum computation frequencies when K=5K=5. It can be seen that as the maximum frequency FmaxF_{\max} becomes large, the transition point is gradually postponed, and finally no transition point exists when the computation resource is sufficiently large, which is in accordance with Theorem 3. Specifically, in each subfigure, before the transition point the frequency allocated to each device being computed remains unchanged and the maximum frequency constraint is inactive. From the transition point to the end, the frequency allocated to each device decreases and the maximum frequency of the server is fully used. This verifies Theorem 3.

Refer to caption
Figure 7: Comparisons of computational complexity with K=10K=10.

Fig. 7 depicts the computational complexity of the proposed Algorithm 1 and the interior point method under different transition points. As can be seen, the computational complexity of Algorithm 1 is significantly lower than that of the interior point method, by more than 100100 times on average. As the transition point becomes larger, the complexity further decreases. For example, when the transition point ϰ=11\varkappa=11, the complexity of Algorithm 1 is reduced by 10001000 times. This is because the proposed computation resource allocation algorithm fully utilizes the properties in Theorem 3, especially when the computation resource of the server is abundant.

Refer to caption
Figure 8: Energy consumption versus minimum task size.

To test the compatibility of the proposed algorithm under different task scale differences, the energy consumption versus the minimum task size AminA_{\min} is shown in Fig. 8, where the task size obeys a uniform distribution on [Amin,Amax][A_{\min},A_{\max}] with fixed mean Amin+Amax2=30\frac{A_{\min}+A_{\max}}{2}=30 Kbits. With a large AminA_{\min}, the task scale difference is small. In Fig. 8, the proposed scheme and the exhaustive search scheme achieve nearly the same performance and outperform the other schemes. One can observe that the energy consumption increases as the task scale difference gets large. This is because the resources have to be tilted towards the devices with large task sizes, thus resulting in more energy consumption.

VI Conclusion

In this paper, we have investigated a joint user scheduling and resource optimization framework for MEC networks with asynchronous computing. A joint user scheduling, communication and computation resource management problem has been solved, aiming to minimize the energy consumption of the server under the delay constraint. Simulations verified that the proposed algorithm yields significant performance gains compared with benchmark schemes. This work establishes a new principle of asynchronous computing and verifies its superiority over the synchronous counterpart. For future work, we will generalize the proposed asynchronous computing framework to scenarios with heterogeneous task deadlines so as to further unlock its potential. As another direction, the extension to online algorithm design that accommodates newly arriving devices deserves further investigation.

Appendix A Proof of Proposition 1

The feasibility problem of (3) can be constructed as

minfn,m\displaystyle\min_{f_{n,m}}\quad maxm{2,,K+1}n=1m1fn,m,\displaystyle\max_{m\in\{2,\cdots,K+1\}}\sum_{n=1}^{m-1}f_{n,m}, (A.1)
s.t.\displaystyle s.t.\quad m=n+1K+1fn,mΔtmFn,n𝒦,\displaystyle\sum_{m=n+1}^{K+1}f_{n,m}\Delta t_{m}\geq F_{n},\quad\forall n\in\mathcal{K}, (A.1a)
fn,m0,n𝒦,m=n+1,,K+1.\displaystyle f_{n,m}\geq 0,\quad\forall n\in\mathcal{K},\forall m=n+1,\cdots,K+1. (A.1b)

If the optimal objective of problem (A.1) is less than or equal to FmaxF_{\max}, problem (3) is feasible; otherwise, it is infeasible. Subsequently, we analyze the optimal solution of problem (A.1). First, when K=1K=1, i.e., there exists only one task, the optimal objective of problem (A.1) is FKΔtK+1\frac{F_{K}}{\Delta t_{K+1}}. When K=2K=2, we consider two cases: 1) If F1Δt2F2Δt3\frac{F_{1}}{\Delta t_{2}}\leq\frac{F_{2}}{\Delta t_{3}}, the optimal scheme is to compute task 22 after task 11 is finished. Therefore, the optimal solution is f1,2=F1Δt2f_{1,2}=\frac{F_{1}}{\Delta t_{2}} and f2,3=F2Δt3f_{2,3}=\frac{F_{2}}{\Delta t_{3}}, and the optimal objective is F2Δt3\frac{F_{2}}{\Delta t_{3}}. 2) If F1Δt2>F2Δt3\frac{F_{1}}{\Delta t_{2}}>\frac{F_{2}}{\Delta t_{3}}, part of task 11 can be processed in parallel with task 22. Hence, the optimal solution is given by f1,2=f1,3+f2,3=F1+F2Δt2+Δt3f_{1,2}=f_{1,3}+f_{2,3}=\frac{F_{1}+F_{2}}{\Delta t_{2}+\Delta t_{3}}. Since F1+F2Δt2+Δt3>F2Δt3\frac{F_{1}+F_{2}}{\Delta t_{2}+\Delta t_{3}}>\frac{F_{2}}{\Delta t_{3}}, the optimal objective is F1+F2Δt2+Δt3\frac{F_{1}+F_{2}}{\Delta t_{2}+\Delta t_{3}}. In conclusion, when K=2K=2, the optimal objective is max{F1+F2Δt2+Δt3,F2Δt3}\max\{\frac{F_{1}+F_{2}}{\Delta t_{2}+\Delta t_{3}},\frac{F_{2}}{\Delta t_{3}}\}. Similarly, by recursion, we can deduce that when there exist KK tasks, the optimal objective is max{F1++FKΔt2++ΔtK+1,F2++FKΔt3++ΔtK+1,,FKΔtK+1}\max\{\frac{F_{1}+\cdots+F_{K}}{\Delta t_{2}+\cdots+\Delta t_{K+1}},\frac{F_{2}+\cdots+F_{K}}{\Delta t_{3}+\cdots+\Delta t_{K+1}},\cdots,\frac{F_{K}}{\Delta t_{K+1}}\}. That completes the proof. \blacksquare
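The recursion above yields a simple feasibility test: problem (3) is feasible if and only if the largest suffix average max over i of (F_i + ... + F_K)/(Delta t_{i+1} + ... + Delta t_{K+1}) does not exceed F_max. A direct sketch:

```python
def min_peak_frequency(F, dt):
    """F[n] = required cycles of the n-th offloading task (n = 1..K,
    0-indexed here); dt[m] = slot lengths Delta t_2 .. Delta t_{K+1}.
    Returns max over i of sum(F[i:]) / sum(dt[i:]), the optimal
    objective of problem (A.1)."""
    best = 0.0
    for i in range(len(F)):
        best = max(best, sum(F[i:]) / sum(dt[i:]))
    return best

def is_feasible(F, dt, F_max):
    """Problem (3) is feasible iff the minimal peak frequency fits."""
    return min_peak_frequency(F, dt) <= F_max
```

The two K = 2 cases in the proof are recovered directly: with F_1/Delta t_2 > F_2/Delta t_3 the first suffix average dominates, otherwise the last ratio does.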

Appendix B Proof of Property 1) in Theorem 3

Before that, we give the following two corollaries to facilitate the proof.

Corollary 5.

[Row property] The optimal computation resource of each task is non-increasing during its computation period, i.e., fn,n+1fn,n+2fn,K+1f_{n,n+1}^{*}\geq f_{n,n+2}^{*}\geq\cdots\geq f_{n,K+1}^{*} (n𝒦)(\forall n\in\mathcal{K}).

Proof.

Please refer to Appendix E. ∎

Corollary 6.

[Column property] Denote by FnF_{n} the sum computation cycles of the nn-th offloading device in time slots tmt_{m} and tm+1t_{m+1}, for m=3,,Km=3,\cdots,K and n=1,,m1n=1,\cdots,m-1. If Fn>0F_{n}>0 holds for all n=1,,m1n=1,\cdots,m-1, then the optimal frequency shifts δnfn,mfn,m+1\delta^{*}_{n}\triangleq f^{*}_{n,m}-f^{*}_{n,m+1} (n=1,,m1)(n=1,\cdots,m-1) are either all zero or all positive, i.e., they share the same zero or positive characteristic.

Proof.

Please refer to Appendix F. ∎

First, applying the KKT conditions gives

3κ(fn,m)2Δtm+αmβnΔtmγn,m=0,\displaystyle 3\kappa(f^{*}_{n,m})^{2}\Delta t_{m}+\alpha_{m}-\beta_{n}\Delta t_{m}-\gamma_{n,m}=0, (B.1)
αm(n=1m1fn,mFmax)=0,\displaystyle\alpha_{m}\left(\sum_{n=1}^{m-1}f^{*}_{n,m}-F_{\max}\right)=0, (B.2)
βn(Fnm=n+1K+1fn,mΔtm)=0,\displaystyle\beta_{n}\left(F_{n}-\sum_{m=n+1}^{K+1}f^{*}_{n,m}\Delta t_{m}\right)=0, (B.3)
γn,mfn,m=0,n𝒦,\displaystyle\gamma_{n,m}f^{*}_{n,m}=0,\quad\forall n\in\mathcal{K}, (B.4)
αm0,βn0,γn,m0.\displaystyle\alpha_{m}\!\geq\!0,\beta_{n}\!\geq\!0,\gamma_{n,m}\!\geq\!0. (B.5)

Based on (B.1), we obtain that

fn,m=βn3κ+γn,mαm3κΔtm.\displaystyle f^{*}_{n,m}=\sqrt{\frac{\beta_{n}}{3\kappa}+\frac{\gamma_{n,m}-\alpha_{m}}{3\kappa\Delta t_{m}}}. (B.6)

In case of fn,m>0f_{n,m}>0 (n𝒦,m=n+1,,K+1)(\forall n\in\mathcal{K},\forall m=n+1,\cdots,K+1), we have γn,m=0\gamma_{n,m}=0 according to (B.4). Furthermore, βn>0\beta_{n}>0 is derived from (B.6). According to Corollary 5, the optimal solution satisfies fn,mfn,m+1f^{*}_{n,m}\geq f^{*}_{n,m+1}. Thus, we have αmΔtmαm+1Δtm+1\frac{\alpha_{m}}{\Delta t_{m}}\leq\frac{\alpha_{m+1}}{\Delta t_{m+1}}. Assume that there exists a certain i{3,,K}i\in\{3,\cdots,K\} such that fn,i=fn,i+1f^{*}_{n,i}=f^{*}_{n,i+1}. We have αiΔti=αi+1Δti+1\frac{\alpha_{i}}{\Delta t_{i}}=\frac{\alpha_{i+1}}{\Delta t_{i+1}}. If αiΔti=αi+1Δti+1>0\frac{\alpha_{i}}{\Delta t_{i}}=\frac{\alpha_{i+1}}{\Delta t_{i+1}}>0, i.e., αi>0\alpha_{i}>0 and αi+1>0\alpha_{i+1}>0, we should have n=1i1fn,i=n=1ifn,i+1=Fmax\sum_{n=1}^{i-1}f_{n,i}=\sum_{n=1}^{i}f_{n,i+1}=F_{\max} according to (B.2). Furthermore, due to fn,m>0f_{n,m}>0 (n𝒦,m=n+1,,K+1)(\forall n\in\mathcal{K},\forall m=n+1,\cdots,K+1), the computation cycles fn,iΔti+fn,i+1Δti+1>0f_{n,i}\Delta t_{i}+f_{n,i+1}\Delta t_{i+1}>0 (n=1,,i1)(\forall n=1,\cdots,i-1). According to Corollary 6, we have fn,i=fn,i+1f_{n,i}=f_{n,i+1} (n=1,,i1)(n=1,\cdots,i-1). Hence, it can be derived that fi,i+1=0f_{i,i+1}=0, which contradicts the fact that fi,i+1f_{i,i+1} is positive. Therefore, we have αi=αi+1=0\alpha_{i}=\alpha_{i+1}=0, i.e., αiΔti=αi+1Δti+1=0\frac{\alpha_{i}}{\Delta t_{i}}=\frac{\alpha_{i+1}}{\Delta t_{i+1}}=0. Since α2Δt2αiΔti\frac{\alpha_{2}}{\Delta t_{2}}\leq\cdots\leq\frac{\alpha_{i}}{\Delta t_{i}} and α2,,αi10\alpha_{2},\cdots,\alpha_{i-1}\geq 0, we can further obtain that α2==αi+1=0\alpha_{2}=\cdots=\alpha_{i+1}=0. This indicates that if there exists a certain i{3,,K}i\in\{3,\cdots,K\} such that fn,i=fn,i+1>0f^{*}_{n,i}=f^{*}_{n,i+1}>0, we have fn,n+1=fn,n+2==fn,i+1f^{*}_{n,n+1}=f^{*}_{n,n+2}=\cdots=f^{*}_{n,i+1}.

Additionally, if there exists a certain i<jK+1i<j\leq K+1 such that fn,j=0f^{*}_{n,j}=0, we can deduce that fn,j=fn,j+1==fn,K+1=0f^{*}_{n,j}=f^{*}_{n,j+1}=\cdots=f_{n,K+1}^{*}=0 since fn,jfn,j+1fn,K+1f^{*}_{n,j}\geq f^{*}_{n,j+1}\geq\cdots\geq f^{*}_{n,K+1}.

Combining the above two cases, we complete the proof. \blacksquare
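The closed form (B.6) can be verified against the stationarity condition (B.1) directly: substituting f*_{n,m} back makes 3*kappa*(f*)^2*Delta t + alpha - beta*Delta t - gamma vanish. A numeric sketch with assumed multiplier values:

```python
import math

def f_star(beta, gamma, alpha, kappa, dt):
    """Eq. (B.6): f* = sqrt(beta/(3*kappa) + (gamma - alpha)/(3*kappa*dt)).
    Requires the radicand to be non-negative."""
    return math.sqrt(beta / (3 * kappa) + (gamma - alpha) / (3 * kappa * dt))

def stationarity_residual(beta, gamma, alpha, kappa, dt):
    """Eq. (B.1): 3*kappa*f*^2*dt + alpha - beta*dt - gamma,
    which should evaluate to zero at f = f*."""
    f = f_star(beta, gamma, alpha, kappa, dt)
    return 3 * kappa * f ** 2 * dt + alpha - beta * dt - gamma
```

Algebraically, 3*kappa*(f*)^2*dt = beta*dt + gamma - alpha, so the residual cancels term by term, confirming (B.6) solves (B.1).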

Appendix C Proof of Property 2) in Theorem 3

According to Appendix B, we can obtain that α2==αi=0\alpha^{*}_{2}=\cdots=\alpha^{*}_{i}=0. Moreover, since fn1,n>fn1,n+10f^{*}_{n-1,n}>f^{*}_{n-1,n+1}\geq 0 (inK)(i\leq n\leq K), we have βn13κ+γn1,nαn3κΔtn>βn13κ+γn1,n+1αn+13κΔtn+1\sqrt{\frac{\beta_{n-1}}{3\kappa}+\frac{\gamma_{n-1,n}-\alpha_{n}}{3\kappa\Delta t_{n}}}>\sqrt{\frac{\beta_{n-1}}{3\kappa}+\frac{\gamma_{n-1,n+1}-\alpha_{n+1}}{3\kappa\Delta t_{n+1}}} according to (B.6). Since γn1,n=0\gamma_{n-1,n}=0 and γn1,n+1>0\gamma_{n-1,n+1}>0, we can deduce that αnΔtn>γn1,n+1αn+1Δtn+1>αn+1Δtn+1-\frac{\alpha_{n}}{\Delta t_{n}}>\frac{\gamma_{n-1,n+1}-\alpha_{n+1}}{\Delta t_{n+1}}>-\frac{\alpha_{n+1}}{\Delta t_{n+1}}, i.e., αnΔtn<αn+1Δtn+1\frac{\alpha_{n}}{\Delta t_{n}}<\frac{\alpha_{n+1}}{\Delta t_{n+1}} (inK)(i\leq n\leq K). Overall, we can conclude that 0=α2Δt2==αiΔti<αi+1Δti+1<<αK+1ΔtK+10=\frac{\alpha^{*}_{2}}{\Delta t_{2}}=\cdots=\frac{\alpha^{*}_{i}}{\Delta t_{i}}<\frac{\alpha^{*}_{i+1}}{\Delta t_{i+1}}<\cdots<\frac{\alpha^{*}_{K+1}}{\Delta t_{K+1}}. \blacksquare

Appendix D Proof of Property 3) in Theorem 3

We first prove the “only if” part. According to property 1) in Theorem 3, if tit_{i} is the transition point, we have fn,n+1==fn,i1>fn,i>fn,j==fn,K+1=0f_{n,n+1}^{*}=\cdots=f_{n,i-1}^{*}>f_{n,i}^{*}>\cdots>f^{*}_{n,j}=\cdots=f_{n,K+1}^{*}=0 (n+1i<jK+1)(n+1\leq i<j\leq K+1) and n=1m1fn,m=Fmax\sum_{n=1}^{m-1}f_{n,m}=F_{\max} (imK+1)(i\leq m\leq K+1). Since m=n+1K+1fn,mΔtm=Fn\sum_{m=n+1}^{K+1}f^{*}_{n,m}\Delta t_{m}=F_{n}, we can obtain that m=iK+1fn,mΔtm<Fn(m=iK+1Δtm)m=n+1K+1Δtm\sum_{m=i}^{K+1}f^{*}_{n,m}\Delta t_{m}<\frac{F_{n}(\sum_{m=i}^{K+1}\Delta t_{m})}{\sum_{m=n+1}^{K+1}\Delta t_{m}} (1ni2)(1\leq n\leq i-2). Thus, we have (m=iK+1Δtm)Fmax=n=1i2m=iK+1fn,mΔtm+n=i1Km=n+1K+1fn,mΔtm<n=1i2Fn(m=iK+1Δtm)m=n+1K+1Δtm+n=i1KFn(\sum_{m=i}^{K+1}\Delta t_{m})F_{\max}=\sum_{n=1}^{i-2}\sum_{m=i}^{K+1}f^{*}_{n,m}\Delta t_{m}+\sum_{n=i-1}^{K}\sum_{m=n+1}^{K+1}f^{*}_{n,m}\Delta t_{m}<\sum_{n=1}^{i-2}\frac{F_{n}(\sum_{m=i}^{K+1}\Delta t_{m})}{\sum_{m=n+1}^{K+1}\Delta t_{m}}+\sum_{n=i-1}^{K}F_{n}, i.e., Fmax<ϝ(i)F_{\max}<\digamma(i).

Similarly, if ti1t_{i-1} is the transition point, we have Fmax<ϝ(i1)F_{\max}<\digamma(i-1). Since ϝ(i)ϝ(i1)=Fi2m=i1K+1Δtm+n=i1KFnm=iK+1Δtmn=i2KFnm=i1K+1Δtm=n=i1KFnm=iK+1Δtmn=i1KFnm=i1K+1Δtm>0\digamma(i)-\digamma(i-1)=\frac{F_{i-2}}{\sum_{m=i-1}^{K+1}\Delta t_{m}}+\frac{\sum_{n=i-1}^{K}F_{n}}{\sum_{m=i}^{K+1}\Delta t_{m}}-\frac{\sum_{n=i-2}^{K}F_{n}}{\sum_{m=i-1}^{K+1}\Delta t_{m}}=\frac{\sum_{n=i-1}^{K}F_{n}}{\sum_{m=i}^{K+1}\Delta t_{m}}-\frac{\sum_{n=i-1}^{K}F_{n}}{\sum_{m=i-1}^{K+1}\Delta t_{m}}>0, we should have ϝ(i1)Fmax<ϝ(i)\digamma(i-1)\leq F_{\max}<\digamma(i).

For the “if” part, if Fmax<ϝ(i)F_{\max}<\digamma(i), we can deduce that (m=iK+1Δtm)Fmax<n=1i2Fn(m=iK+1Δtm)m=n+1K+1Δtm+n=i1KFn(\sum_{m=i}^{K+1}\Delta t_{m})F_{\max}<\sum_{n=1}^{i-2}\frac{F_{n}(\sum_{m=i}^{K+1}\Delta t_{m})}{\sum_{m=n+1}^{K+1}\Delta t_{m}}+\sum_{n=i-1}^{K}F_{n}. Moreover, since n=1i2m=iK+1fn,mΔtm+n=i1KFn(m=iK+1Δtm)Fmax\sum_{n=1}^{i-2}\sum_{m=i}^{K+1}f^{*}_{n,m}\Delta t_{m}+\sum_{n=i-1}^{K}F_{n}\leq(\sum_{m=i}^{K+1}\Delta t_{m})F_{\max}, we have n=1i2m=iK+1fn,mΔtm+n=i1KFn<n=1i2Fn(m=iK+1Δtm)m=n+1K+1Δtm+n=i1KFn\sum_{n=1}^{i-2}\sum_{m=i}^{K+1}f^{*}_{n,m}\Delta t_{m}+\sum_{n=i-1}^{K}F_{n}<\sum_{n=1}^{i-2}\frac{F_{n}(\sum_{m=i}^{K+1}\Delta t_{m})}{\sum_{m=n+1}^{K+1}\Delta t_{m}}+\sum_{n=i-1}^{K}F_{n}, i.e., n=1i2m=iK+1fn,mΔtm<n=1i2Fn(m=iK+1Δtm)m=n+1K+1Δtm\sum_{n=1}^{i-2}\sum_{m=i}^{K+1}f^{*}_{n,m}\Delta t_{m}<\sum_{n=1}^{i-2}\frac{F_{n}(\sum_{m=i}^{K+1}\Delta t_{m})}{\sum_{m=n+1}^{K+1}\Delta t_{m}}. Since fn,n+1fn,i1fn,K+1f_{n,n+1}^{*}\geq\cdots\geq f_{n,i-1}^{*}\geq\cdots\geq f_{n,K+1}^{*} (n+1i<K+1)(n+1\leq i<K+1), we can obtain that m=iK+1fn,mΔtmFn(m=iK+1Δtm)m=n+1K+1Δtm\sum_{m=i}^{K+1}f^{*}_{n,m}\Delta t_{m}\leq\frac{F_{n}(\sum_{m=i}^{K+1}\Delta t_{m})}{\sum_{m=n+1}^{K+1}\Delta t_{m}} (1ni2)(1\leq n\leq i-2). Therefore, to let n=1i2m=iK+1fn,mΔtm<n=1i2Fn(m=iK+1Δtm)m=n+1K+1Δtm\sum_{n=1}^{i-2}\sum_{m=i}^{K+1}f^{*}_{n,m}\Delta t_{m}<\sum_{n=1}^{i-2}\frac{F_{n}(\sum_{m=i}^{K+1}\Delta t_{m})}{\sum_{m=n+1}^{K+1}\Delta t_{m}} hold, we should have m=iK+1fn,mΔtm<Fn(m=iK+1Δtm)m=n+1K+1Δtm\sum_{m=i}^{K+1}f^{*}_{n,m}\Delta t_{m}<\frac{F_{n}(\sum_{m=i}^{K+1}\Delta t_{m})}{\sum_{m=n+1}^{K+1}\Delta t_{m}} (1ni2)(1\leq n\leq i-2). Further, it can be deduced that m=i1K+1fn,mΔtm<Fn(m=i1K+1Δtm)m=n+1K+1Δtm\sum_{m=i-1}^{K+1}f^{*}_{n,m}\Delta t_{m}<\frac{F_{n}(\sum_{m=i-1}^{K+1}\Delta t_{m})}{\sum_{m=n+1}^{K+1}\Delta t_{m}} (1ni3)(1\leq n\leq i-3).

Additionally, since $\digamma(i-1)\leq F_{\max}$, we can deduce that $\sum_{n=1}^{i-3}\frac{F_{n}(\sum_{m=i-1}^{K+1}\Delta t_{m})}{\sum_{m=n+1}^{K+1}\Delta t_{m}}+\sum_{n=i-2}^{K}F_{n}\leq(\sum_{m=i-1}^{K+1}\Delta t_{m})F_{\max}$. Thus, we have $\sum_{n=1}^{i-3}\sum_{m=i-1}^{K+1}f^{*}_{n,m}\Delta t_{m}+\sum_{n=i-2}^{K}F_{n}<\sum_{n=1}^{i-3}\frac{F_{n}(\sum_{m=i-1}^{K+1}\Delta t_{m})}{\sum_{m=n+1}^{K+1}\Delta t_{m}}+\sum_{n=i-2}^{K}F_{n}\leq(\sum_{m=i-1}^{K+1}\Delta t_{m})F_{\max}$, which indicates that the computation resource is abundant from $t_{i-1}$ to $t_{K+1}$. Therefore, we can deduce that $f^{*}_{n,n+1}=\cdots=f^{*}_{n,i-1}$ $(n\leq i-2)$. Assume that $t_{\tilde{i}}$ $(\tilde{i}>i)$ is the transition point. Then $\digamma(\tilde{i}-1)\leq F_{\max}$. However, since $\digamma(\tilde{i}-1)-\digamma(i)\geq 0$ while $F_{\max}<\digamma(i)$, this is impossible, which contradicts the assumption. Therefore, we can conclude that $t_{i}$ is the transition point.

Combining the proofs of the “if” and “only if” parts, we complete the proof. $\blacksquare$
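As a sanity check on the monotonicity used above, the following snippet evaluates $\digamma(i)$ on a random instance, under the definition of $\digamma(i)$ inferred from the inequalities in this appendix; the values of $K$, $F_{n}$ and $\Delta t_{m}$ are arbitrary test data, not from the paper.

```python
import random

# digamma(i) = sum_{n=1}^{i-2} F_n / sum_{m=n+1}^{K+1} dt_m
#            + (sum_{n=i-1}^{K} F_n) / (sum_{m=i}^{K+1} dt_m)
# (definition inferred from the inequalities in the proof above).
def digamma(i, F, dt, K):
    head = sum(F[n] / sum(dt[m] for m in range(n + 1, K + 2))
               for n in range(1, i - 1))
    tail = sum(F[n] for n in range(i - 1, K + 1)) / \
        sum(dt[m] for m in range(i, K + 2))
    return head + tail

random.seed(0)
K = 6
F = {n: random.uniform(0.5, 2.0) for n in range(1, K + 1)}    # F_n, n = 1..K
dt = {m: random.uniform(0.1, 1.0) for m in range(1, K + 2)}   # dt_m, m = 1..K+1
vals = [digamma(i, F, dt, K) for i in range(2, K + 2)]
print(all(b > a for a, b in zip(vals, vals[1:])))  # True: strictly increasing
```

The strict increase holds for any positive data, which matches the difference $\digamma(i)-\digamma(i-1)>0$ computed in the proof.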

Appendix E Proof of Corollary 5

To find the optimal computation resource allocation scheme, we first investigate the property of the most energy-efficient scheme without the maximum frequency restriction in Lemma 7, whose proof is provided in Appendix G.

Lemma 7.

For arbitrary $\Delta t_{m}$ and $\Delta t_{m+1}$ and given computation cycles $F_{n}$ in time slots $t_{m}$ and $t_{m+1}$, the scheme $f^{*}_{n,m}=f^{*}_{n,m+1}$ consumes the least energy among all solutions satisfying $f_{n,m}\Delta t_{m}+f_{n,m+1}\Delta t_{m+1}=F_{n}$.
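Lemma 7 can be illustrated numerically. The sketch below, with assumed values for $\kappa$, $F_{n}$, $\Delta t_{m}$, $\Delta t_{m+1}$, grid-searches the splits satisfying the cycle constraint and confirms that the equal-frequency split attains the minimum energy $\kappa f^{3}\Delta t$ per slot:

```python
kappa, F_n, dt_m, dt_m1 = 1e-3, 5.0, 0.7, 1.3    # assumed test values

def energy(f_m):
    # enforce f_m*dt_m + f_m1*dt_m1 = F_n, then sum kappa*f^3*dt per slot
    f_m1 = (F_n - f_m * dt_m) / dt_m1
    return kappa * (f_m ** 3 * dt_m + f_m1 ** 3 * dt_m1)

f_eq = F_n / (dt_m + dt_m1)                      # equal-frequency scheme
k_best = min(range(1001), key=lambda k: energy(f_eq + (k - 500) / 1000.0))
print(k_best == 500)  # True: the equal split is the grid minimizer
```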

For Corollary 5, we first prove that $f_{1,2}^{*}\geq f_{1,3}^{*}$ with given computation cycle $F_{1}$ and given $f_{2,3}$. Denote the sum of computation cycles of the first offloading device in $t_{2}$ and $t_{3}$ as $F_{1}$. By relaxing the maximum computation resource constraint, the energy consumption is minimized when $f_{1,2}=f_{1,3}=\frac{F_{1}}{\Delta t_{2}+\Delta t_{3}}$ according to Lemma 7. Since constraint (3a) must be satisfied, we have

$f_{1,2}\leq F_{\max},\quad f_{1,3}+f_{2,3}\leq F_{\max}.$ (E.1)

We consider the following two cases; in any other situation, $f_{1,2}$ and $f_{1,3}$ admit no feasible solution for the given $F_{1}$.

Case 1: $\frac{F_{1}}{\Delta t_{2}+\Delta t_{3}}+f_{2,3}\leq F_{\max}$. In this case, $f_{1,2}=f_{1,3}=\frac{F_{1}}{\Delta t_{2}+\Delta t_{3}}$ satisfies (E.1). Since $f_{1,2}=f_{1,3}=\frac{F_{1}}{\Delta t_{2}+\Delta t_{3}}$ is the most energy-efficient solution, the optimal solution in this case is $f^{*}_{1,2}=f^{*}_{1,3}=\frac{F_{1}}{\Delta t_{2}+\Delta t_{3}}$.

Case 2: $\frac{F_{1}}{\Delta t_{2}+\Delta t_{3}}+f_{2,3}>F_{\max}$. Obviously, $f_{1,2}=f_{1,3}=\frac{F_{1}}{\Delta t_{2}+\Delta t_{3}}$ is infeasible in this case. We then prove that $f_{1,2}<f_{1,3}$ is also impossible. Since $f_{1,2}\Delta t_{2}+f_{1,3}\Delta t_{3}=F_{1}$, $f_{1,2}$ increases as $f_{1,3}$ decreases. If $f_{1,2}<f_{1,3}$, we can deduce that $f_{1,3}>\frac{F_{1}}{\Delta t_{2}+\Delta t_{3}}$. Therefore, we have $f_{1,3}+f_{2,3}>\frac{F_{1}}{\Delta t_{2}+\Delta t_{3}}+f_{2,3}>F_{\max}$, which violates the maximum frequency constraint. As a consequence, the optimal solution satisfies $f_{1,2}^{*}>f_{1,3}^{*}$. According to Lemma 7, the energy consumption $E$ increases with $\delta_{1}=f_{1,2}-f_{1,3}$ in the considered region $0<\delta_{1}\leq F_{1}/\Delta t_{2}$. Therefore, to achieve the lowest energy consumption, we should make $\delta_{1}$ as small as possible. Hence, we obtain $f^{*}_{1,3}=F_{\max}-f_{2,3}$ and $f^{*}_{1,2}=\frac{F_{1}-f^{*}_{1,3}\Delta t_{3}}{\Delta t_{2}}$. The corresponding optimal value is $\delta_{1}^{*}=\frac{F_{1}-(F_{\max}-f_{2,3})(\Delta t_{2}+\Delta t_{3})}{\Delta t_{2}}$.
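The boundary solution of Case 2 can be checked numerically. In the sketch below, all constants are assumed test values chosen so that the instance falls in Case 2; a grid search over feasible splits confirms that the optimum sits at $f^{*}_{1,3}=F_{\max}-f_{2,3}$:

```python
kappa = 1e-3
dt2, dt3, F1, f23, Fmax = 1.0, 1.0, 4.0, 1.0, 2.8   # assumed test values
assert F1 / (dt2 + dt3) + f23 > Fmax                # this instance is Case 2

def E1(f13):
    f12 = (F1 - f13 * dt3) / dt2                    # device-1 cycle constraint
    return kappa * (f12 ** 3 * dt2 + f13 ** 3 * dt3)

# feasible f13: f13 + f23 <= Fmax and the implied f12 <= Fmax
hi = int(round((Fmax - f23) * 1000))
feas = [k / 1000.0 for k in range(hi + 1)
        if (F1 - (k / 1000.0) * dt3) / dt2 <= Fmax]
best = min(feas, key=E1)
print(abs(best - (Fmax - f23)) < 1e-6)  # True: cap-boundary point is optimal
```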

Summarizing the above two cases, we obtain $f^{*}_{1,2}\geq f^{*}_{1,3}$. Subsequently, we prove that in $t_{m}$ and $t_{m+1}$ $(m=3,\cdots,K)$, we always have $f^{*}_{n,m}\geq f^{*}_{n,m+1}$ for all $n=1,\cdots,m-1$.

Denote the sum of computation cycles of the $n$-th offloading device in time slots $t_{m}$ and $t_{m+1}$ by $F_{n}$, i.e.,

$\left\{\begin{aligned}&f_{1,m}\Delta t_{m}+f_{1,m+1}\Delta t_{m+1}=F_{1},\\&\quad\quad\quad\quad\quad\cdots\\&f_{n,m}\Delta t_{m}+f_{n,m+1}\Delta t_{m+1}=F_{n},\\&\quad\quad\quad\quad\quad\cdots\\&f_{m-1,m}\Delta t_{m}+f_{m-1,m+1}\Delta t_{m+1}=F_{m-1}.\end{aligned}\right.$ (E.2)

According to Lemma 7, when $f_{n,m}=f_{n,m+1}=\frac{F_{n}}{\Delta t_{m}+\Delta t_{m+1}}$ for all $n=1,\cdots,m-1$, the minimum energy consumption of each offloading device is achieved, and thus the total energy consumption is minimized. Moreover, the following constraints should be satisfied:

$\sum_{n=1}^{m-1}f_{n,m}\leq F_{\max},\quad\sum_{n=1}^{m}f_{n,m+1}\leq F_{\max}.$ (E.3)

We consider two cases.

Case 1: $\sum_{n=1}^{m-1}\frac{F_{n}}{\Delta t_{m}+\Delta t_{m+1}}+f_{m,m+1}\leq F_{\max}$. In this case, we can deduce that $f_{n,m}=f_{n,m+1}=\frac{F_{n}}{\Delta t_{m}+\Delta t_{m+1}}$ for all $n=1,\cdots,m-1$ satisfies (E.3). Thus, the optimal solution in this case is $f^{*}_{n,m}=f^{*}_{n,m+1}=\frac{F_{n}}{\Delta t_{m}+\Delta t_{m+1}}$ for all $n=1,\cdots,m-1$.

Case 2: $\sum_{n=1}^{m-1}\frac{F_{n}}{\Delta t_{m}+\Delta t_{m+1}}+f_{m,m+1}>F_{\max}$. Obviously, $f_{n,m}=f_{n,m+1}=\frac{F_{n}}{\Delta t_{m}+\Delta t_{m+1}}$ is infeasible in this case. We then prove that $\sum_{n=1}^{m-1}f^{*}_{n,m}>\sum_{n=1}^{m-1}f^{*}_{n,m+1}$ by contradiction. Summing all the equalities in (E.2), we have $\Delta t_{m}\sum_{n=1}^{m-1}f_{n,m}+\Delta t_{m+1}\sum_{n=1}^{m-1}f_{n,m+1}=\sum_{n=1}^{m-1}F_{n}$. Thus, $\sum_{n=1}^{m-1}f_{n,m}$ decreases as $\sum_{n=1}^{m-1}f_{n,m+1}$ increases. If $\sum_{n=1}^{m-1}f_{n,m}<\sum_{n=1}^{m-1}f_{n,m+1}$, it can be inferred that $\sum_{n=1}^{m-1}f_{n,m+1}>\sum_{n=1}^{m-1}\frac{F_{n}}{\Delta t_{m}+\Delta t_{m+1}}$. We further have $\sum_{n=1}^{m-1}f_{n,m+1}+f_{m,m+1}>\sum_{n=1}^{m-1}\frac{F_{n}}{\Delta t_{m}+\Delta t_{m+1}}+f_{m,m+1}>F_{\max}$, which violates constraint (E.3). Similarly, if $\sum_{n=1}^{m-1}f_{n,m}=\sum_{n=1}^{m-1}f_{n,m+1}$, we obtain $\sum_{n=1}^{m-1}f_{n,m}=\sum_{n=1}^{m-1}f_{n,m+1}=\sum_{n=1}^{m-1}\frac{F_{n}}{\Delta t_{m}+\Delta t_{m+1}}$. Hence, we have $\sum_{n=1}^{m-1}f_{n,m+1}+f_{m,m+1}=\sum_{n=1}^{m-1}\frac{F_{n}}{\Delta t_{m}+\Delta t_{m+1}}+f_{m,m+1}>F_{\max}$, which also violates constraint (E.3). As a consequence, the optimal solution in this case satisfies $\sum_{n=1}^{m-1}f^{*}_{n,m}>\sum_{n=1}^{m-1}f^{*}_{n,m+1}$.

Next, we prove that $f^{*}_{n,m}\geq f^{*}_{n,m+1}$ for all $n=1,\cdots,m-1$. With given $\Delta t_{m}$ and $\Delta t_{m+1}$, we denote the energy consumption of the $n$-th offloading device by $E_{n}(F_{n},\delta_{n})$, where $\delta_{n}=f_{n,m}-f_{n,m+1}$. According to (G.2), $E_{n}(F_{n},\delta_{n})$ is expressed by

$E_{n}(F_{n},\delta_{n})=\kappa\frac{(F_{n}+\delta_{n}\Delta t_{m+1})^{3}}{(\Delta t_{m}+\Delta t_{m+1})^{3}}\Delta t_{m}+\kappa\frac{(F_{n}-\delta_{n}\Delta t_{m})^{3}}{(\Delta t_{m}+\Delta t_{m+1})^{3}}\Delta t_{m+1},\quad n=1,\cdots,m-1,$ (E.4)

which decreases when $-F_{n}/\Delta t_{m+1}\leq\delta_{n}\leq 0$ and increases when $0<\delta_{n}\leq F_{n}/\Delta t_{m}$.
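This shape of $E_{n}(F_{n},\delta_{n})$ is easy to verify numerically; the check below uses assumed values for $\kappa$, $F_{n}$ and the slot lengths:

```python
kappa, F_n, dt_m, dt_m1 = 1e-3, 3.0, 0.6, 1.1    # assumed test values

def E(d):
    # energy (E.4) as a function of delta_n = f_{n,m} - f_{n,m+1}
    c = (dt_m + dt_m1) ** 3
    return kappa * ((F_n + d * dt_m1) ** 3 * dt_m +
                    (F_n - d * dt_m) ** 3 * dt_m1) / c

neg = [-F_n / dt_m1 * (1 - k / 200.0) for k in range(201)]  # [-Fn/dt_m1, 0]
pos = [F_n / dt_m * (k / 200.0) for k in range(201)]        # [0, Fn/dt_m]
dec = all(E(a) >= E(b) for a, b in zip(neg, neg[1:]))       # decreasing left of 0
inc = all(E(a) <= E(b) for a, b in zip(pos, pos[1:]))       # increasing right of 0
print(dec, inc)  # True True
```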

Furthermore, since $\sum_{n=1}^{m-1}f_{n,m+1}\leq F_{\max}-f_{m,m+1}$, we have

$\sum_{n=1}^{m-1}f_{n,m}-\sum_{n=1}^{m-1}f_{n,m+1}\geq\frac{\sum_{n=1}^{m-1}F_{n}-\Delta t_{m+1}\sum_{n=1}^{m-1}f_{n,m+1}}{\Delta t_{m}}-\sum_{n=1}^{m-1}f_{n,m+1}\geq\frac{\sum_{n=1}^{m-1}F_{n}-(\Delta t_{m}+\Delta t_{m+1})\left(F_{\max}-f_{m,m+1}\right)}{\Delta t_{m}}.$ (E.5)

According to (E.5), we have $\sum_{n=1}^{m-1}\delta_{n}\geq\frac{\sum_{n=1}^{m-1}F_{n}-(\Delta t_{m}+\Delta t_{m+1})\left(F_{\max}-f_{m,m+1}\right)}{\Delta t_{m}}\triangleq\Omega$. Next, we use contradiction to prove that $\delta_{n}^{*}\geq 0$ for all $n=1,\cdots,m-1$. Assume that, in the optimal solution, there exists some $\delta^{*}_{n}<0$. We can suitably decrease the positive $\delta^{*}_{n}$ and increase the negative one to zero while keeping $\sum_{n=1}^{m-1}\delta_{n}$ unchanged. In this case, the total energy consumption is strictly reduced, which contradicts the optimality. This completes the proof that $\delta^{*}_{n}\geq 0$, i.e., $f^{*}_{n,m}\geq f^{*}_{n,m+1}$ for all $n=1,\cdots,m-1$.

In summary, since we have proven $f_{1,2}^{*}\geq f_{1,3}^{*}$ and $f_{n,m}^{*}\geq f_{n,m+1}^{*}$ for $m=3,\cdots,K$ and $n=1,\cdots,m-1$, we can deduce Corollary 5. $\blacksquare$

Appendix F Proof of Corollary 6

In the case of $\sum_{n=1}^{m-1}\frac{F_{n}}{\Delta t_{m}+\Delta t_{m+1}}+f_{m,m+1}\leq F_{\max}$, the optimal $\delta^{*}_{n}$ $(n=1,\cdots,m-1)$ are all zeros according to Corollary 5. Therefore, we only need to consider the case of $\sum_{n=1}^{m-1}\frac{F_{n}}{\Delta t_{m}+\Delta t_{m+1}}+f_{m,m+1}>F_{\max}$. In this case, we first prove that $\sum_{n=1}^{m-1}\delta^{*}_{n}=\Omega$ at the optimum. Assume instead that $\sum_{n=1}^{m-1}\delta^{*}_{n}>\Omega$. We could then suitably reduce the positive $\delta^{*}_{n}$ such that the energy consumption is further reduced, which contradicts the optimality. Therefore, we can construct the following energy consumption minimization problem:

$\min_{\boldsymbol{\delta}}\ \sum_{n=1}^{m-1}E_{n}(F_{n},\delta_{n}),$ (F.1)
$\mathrm{s.t.}\ \sum_{n=1}^{m-1}\delta_{n}=\Omega,$ (F.1a)
$\quad\ \ 0\leq\delta_{n}\leq F_{n}/\Delta t_{m},\quad\forall n=1,\cdots,m-1,$ (F.1b)

where $\boldsymbol{\delta}=[\delta_{1},\cdots,\delta_{m-1}]^{T}$.

Based on (G.3), the second derivative of $E_{n}(F_{n},\delta_{n})$ with respect to $\delta_{n}$ is given by

$\frac{\mathrm{d}^{2}E_{n}(F_{n},\delta_{n})}{\mathrm{d}(\delta_{n})^{2}}=\frac{6\kappa\Delta t_{m}\Delta t_{m+1}}{(\Delta t_{m}+\Delta t_{m+1})^{2}}\left[\delta_{n}(\Delta t_{m+1}-\Delta t_{m})+F_{n}\right].$ (F.2)
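Because $E_{n}$ is cubic in $\delta_{n}$, a central finite difference reproduces (F.2) essentially exactly; the snippet below checks this with assumed test values:

```python
kappa, F_n, dt_m, dt_m1 = 1e-3, 3.0, 0.6, 1.1    # assumed test values

def E(d):
    # energy (E.4)/(G.2) as a function of delta_n
    c = (dt_m + dt_m1) ** 3
    return kappa * ((F_n + d * dt_m1) ** 3 * dt_m +
                    (F_n - d * dt_m) ** 3 * dt_m1) / c

def d2E(d):
    # closed-form second derivative (F.2)
    return (6 * kappa * dt_m * dt_m1 / (dt_m + dt_m1) ** 2
            * (d * (dt_m1 - dt_m) + F_n))

h, d = 1e-4, 0.8
fd = (E(d + h) - 2 * E(d) + E(d - h)) / h ** 2   # central second difference
print(abs(fd - d2E(d)) < 1e-6)  # True
```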

We can infer that the second derivative of $E_{n}(F_{n},\delta_{n})$ is always positive in the considered region $-F_{n}/\Delta t_{m+1}\leq\delta_{n}\leq F_{n}/\Delta t_{m}$, regardless of whether $\Delta t_{m}$ is larger than, smaller than, or equal to $\Delta t_{m+1}$. Hence, $E_{n}(F_{n},\delta_{n})$ is convex with respect to $\delta_{n}$, and problem (F.1) is convex. The partial Lagrangian problem is expressed as

$\min_{\boldsymbol{\delta}}\ \sum_{n=1}^{m-1}E_{n}(F_{n},\delta_{n})+\Upsilon\left(\sum_{n=1}^{m-1}\delta_{n}-\Omega\right),$ (F.3)
$\mathrm{s.t.}\ 0\leq\delta_{n}\leq F_{n}/\Delta t_{m},\quad\forall n=1,\cdots,m-1,$ (F.3a)

where $\Upsilon$ is the dual variable with respect to constraint (F.1a). Problem (F.3) can be decomposed into $(m-1)$ parallel subproblems:

$\min_{\delta_{n}}\ E_{n}(F_{n},\delta_{n})+\Upsilon\delta_{n},$ (F.4)
$\mathrm{s.t.}\ 0\leq\delta_{n}\leq F_{n}/\Delta t_{m}.$ (F.4a)

Denote the objective of (F.4) by $J_{n}$. Taking the derivative of $J_{n}$ with respect to $\delta_{n}$, we have

$\frac{\mathrm{d}J_{n}}{\mathrm{d}\delta_{n}}=\frac{\mathrm{d}E_{n}(F_{n},\delta_{n})}{\mathrm{d}\delta_{n}}+\Upsilon=\frac{3\kappa\Delta t_{m}\Delta t_{m+1}}{(\Delta t_{m}+\Delta t_{m+1})^{2}}\delta_{n}\left(\delta_{n}(\Delta t_{m+1}-\Delta t_{m})+2F_{n}\right)+\Upsilon.$ (F.5)

It can be deduced that $\frac{\mathrm{d}E_{n}(F_{n},\delta_{n})}{\mathrm{d}\delta_{n}}$ is non-negative when $0\leq\delta_{n}\leq F_{n}/\Delta t_{m}$. If $\Upsilon\geq 0$, we have $\frac{\mathrm{d}J_{n}}{\mathrm{d}\delta_{n}}\geq 0$. Thus, the optimum would be achieved at $\delta_{n}=0$ for all $n=1,\cdots,m-1$, which contradicts (F.1a). Hence, we must have $\Upsilon<0$. Since $\frac{\mathrm{d}^{2}J_{n}}{\mathrm{d}(\delta_{n})^{2}}=\frac{\mathrm{d}^{2}E_{n}}{\mathrm{d}(\delta_{n})^{2}}>0$ in the region $[0,F_{n}/\Delta t_{m}]$, $\frac{\mathrm{d}J_{n}}{\mathrm{d}\delta_{n}}$ monotonically increases. Moreover, we have $\frac{\mathrm{d}J_{n}}{\mathrm{d}\delta_{n}}|_{\delta_{n}=0}=\Upsilon<0$. Therefore, we consider the following two cases.

Case 1: $\frac{\mathrm{d}J_{n}}{\mathrm{d}\delta_{n}}|_{\delta_{n}=F_{n}/\Delta t_{m}}\leq 0$, i.e., $\Upsilon\leq-\frac{3\kappa\Delta t_{m+1}F_{n}^{2}}{(\Delta t_{m}+\Delta t_{m+1})\Delta t_{m}}$. In this case, $\frac{\mathrm{d}J_{n}}{\mathrm{d}\delta_{n}}\leq 0$ in the region $0\leq\delta_{n}\leq F_{n}/\Delta t_{m}$. Therefore, the optimal solution is $\delta_{n}^{*}=F_{n}/\Delta t_{m}$.

Case 2: $\frac{\mathrm{d}J_{n}}{\mathrm{d}\delta_{n}}|_{\delta_{n}=F_{n}/\Delta t_{m}}>0$, i.e., $\Upsilon>-\frac{3\kappa\Delta t_{m+1}F_{n}^{2}}{(\Delta t_{m}+\Delta t_{m+1})\Delta t_{m}}$. In this case, $\frac{\mathrm{d}J_{n}}{\mathrm{d}\delta_{n}}$ has a unique null point in the region $0\leq\delta_{n}\leq F_{n}/\Delta t_{m}$. Thus, $J_{n}$ first decreases and then increases. By solving $\frac{\mathrm{d}J_{n}}{\mathrm{d}\delta_{n}}=0$, we obtain

$\delta_{n}^{*}=\left\{\begin{aligned}&\frac{-F_{n}+\sqrt{F_{n}^{2}-\Upsilon\Xi(\Delta t_{m+1}-\Delta t_{m})}}{\Delta t_{m+1}-\Delta t_{m}},&&\text{if }\Delta t_{m}\neq\Delta t_{m+1},\\&-\frac{\Upsilon\Xi}{2F_{n}},&&\text{if }\Delta t_{m}=\Delta t_{m+1},\end{aligned}\right.$ (F.6)

where $\Xi=\frac{(\Delta t_{m}+\Delta t_{m+1})^{2}}{3\kappa\Delta t_{m}\Delta t_{m+1}}$. Meanwhile, the optimal $\Upsilon^{*}$ should satisfy constraint (F.1a). Obviously, both cases above satisfy $\delta^{*}_{n}>0$ $(n=1,\cdots,m-1)$, completing the proof. $\blacksquare$
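The structure of this solution resembles water-filling: one can bisect on the dual variable $\Upsilon<0$ until the deltas given by (F.6), capped at $F_{n}/\Delta t_{m}$ in Case 1, sum to $\Omega$ as required by (F.1a). The sketch below uses assumed test values for $\kappa$, the slot lengths, $F_{n}$ and $\Omega$ (feasibility $0<\Omega<\sum_{n}F_{n}/\Delta t_{m}$ is assumed):

```python
kappa, dt_m, dt_m1 = 1e-3, 0.8, 1.2               # assumed test values
F = [2.0, 3.0, 1.5]                               # F_n for n = 1..m-1
Omega = 2.0                                       # assumed 0 < Omega < sum(F)/dt_m
Xi = (dt_m + dt_m1) ** 2 / (3 * kappa * dt_m * dt_m1)

def delta(Fn, U):
    if U <= -3 * kappa * dt_m1 * Fn ** 2 / ((dt_m + dt_m1) * dt_m):
        return Fn / dt_m                          # Case 1: cap is optimal
    if dt_m == dt_m1:
        return -U * Xi / (2 * Fn)                 # equal-slot branch of (F.6)
    return (-Fn + (Fn ** 2 - U * Xi * (dt_m1 - dt_m)) ** 0.5) / (dt_m1 - dt_m)

lo, hi = -1.0, 0.0                                # sum(delta) at lo must exceed Omega
while sum(delta(Fn, lo) for Fn in F) < Omega:
    lo *= 2
for _ in range(100):                              # bisection on Upsilon
    mid = (lo + hi) / 2
    if sum(delta(Fn, mid) for Fn in F) > Omega:
        lo = mid
    else:
        hi = mid
d = [delta(Fn, hi) for Fn in F]
print(abs(sum(d) - Omega) < 1e-6, all(x >= 0 for x in d))  # True True
```

Each $\delta_{n}(\Upsilon)$ is continuous and non-increasing in $\Upsilon$, so the bisection converges to the unique $\Upsilon^{*}$ satisfying (F.1a).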

Appendix G Proof of Lemma 7

[Figure 9: Illustrations of three cases for the analysis of energy consumption in Appendix G. (a) $\Delta t_{m}>\Delta t_{m+1}$; (b) $\Delta t_{m}<\Delta t_{m+1}$; (c) $\Delta t_{m}=\Delta t_{m+1}$.]

Denote $\delta_{n}=f_{n,m}-f_{n,m+1}$. Since $f_{n,m}$ and $f_{n,m+1}$ must be non-negative, we can deduce that $-F_{n}/\Delta t_{m+1}\leq\delta_{n}\leq F_{n}/\Delta t_{m}$. Hence, we obtain

$f_{n,m}=\frac{F_{n}+\delta_{n}\Delta t_{m+1}}{\Delta t_{m}+\Delta t_{m+1}},\quad f_{n,m+1}=\frac{F_{n}-\delta_{n}\Delta t_{m}}{\Delta t_{m}+\Delta t_{m+1}}.$ (G.1)

Therefore, the energy consumption in time slots $t_{m}$ and $t_{m+1}$ is given by

$E_{n}=\kappa f_{n,m}^{3}\Delta t_{m}+\kappa f_{n,m+1}^{3}\Delta t_{m+1}=\kappa\frac{(F_{n}+\delta_{n}\Delta t_{m+1})^{3}}{(\Delta t_{m}+\Delta t_{m+1})^{3}}\Delta t_{m}+\kappa\frac{(F_{n}-\delta_{n}\Delta t_{m})^{3}}{(\Delta t_{m}+\Delta t_{m+1})^{3}}\Delta t_{m+1}.$ (G.2)

Taking the first derivative of $E_{n}$ with respect to $\delta_{n}$, we have

$\frac{\mathrm{d}E_{n}}{\mathrm{d}\delta_{n}}=\frac{3\kappa\Delta t_{m}\Delta t_{m+1}}{(\Delta t_{m}+\Delta t_{m+1})^{2}}\delta_{n}\left(\delta_{n}(\Delta t_{m+1}-\Delta t_{m})+2F_{n}\right).$ (G.3)

Equation (G.3) has two null points: $0$ and $\frac{2F_{n}}{\Delta t_{m}-\Delta t_{m+1}}$. We consider the following three cases.

Case 1: $\Delta t_{m}>\Delta t_{m+1}$. In this case, we have $0<\frac{2F_{n}}{\Delta t_{m}-\Delta t_{m+1}}$. The energy consumption decreases when $\delta_{n}<0$ or $\delta_{n}>\frac{2F_{n}}{\Delta t_{m}-\Delta t_{m+1}}$ and increases when $0\leq\delta_{n}\leq\frac{2F_{n}}{\Delta t_{m}-\Delta t_{m+1}}$, as shown in Fig. 9(a). Since we can easily prove that $\frac{2F_{n}}{\Delta t_{m}-\Delta t_{m+1}}>F_{n}/\Delta t_{m}$, the minimum energy consumption is obtained when $\delta_{n}=0$, i.e., $f^{*}_{n,m}=f^{*}_{n,m+1}$.

Case 2: $\Delta t_{m}<\Delta t_{m+1}$. In this case, we have $\frac{2F_{n}}{\Delta t_{m}-\Delta t_{m+1}}<0$. The energy consumption increases when $\delta_{n}<\frac{2F_{n}}{\Delta t_{m}-\Delta t_{m+1}}$ or $\delta_{n}>0$ and decreases when $\frac{2F_{n}}{\Delta t_{m}-\Delta t_{m+1}}\leq\delta_{n}\leq 0$, as shown in Fig. 9(b). Similarly, since we can prove that $\frac{2F_{n}}{\Delta t_{m}-\Delta t_{m+1}}<-F_{n}/\Delta t_{m+1}$, the minimum energy consumption is obtained when $\delta_{n}=0$.

Case 3: $\Delta t_{m}=\Delta t_{m+1}$. In this case, the quadratic term in (G.3) vanishes and $\delta_{n}=0$ is the only null point. Therefore, the energy consumption decreases when $\delta_{n}<0$ and increases when $\delta_{n}>0$, as shown in Fig. 9(c). Hence, $\delta_{n}=0$ minimizes the energy consumption.

In summary, $f^{*}_{n,m}=f^{*}_{n,m+1}=\frac{F_{n}}{\Delta t_{m}+\Delta t_{m+1}}$ is the most energy-efficient solution. $\blacksquare$
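The three cases above can be checked at once; the snippet below (with assumed $\kappa$, $F_{n}$ and slot lengths) grid-minimizes (G.2) over the feasible interval and confirms that $\delta_{n}=0$ wins in every case:

```python
kappa, F_n = 1e-3, 2.0                           # assumed test values
mins = []
for dt_m, dt_m1 in [(1.2, 0.8), (0.8, 1.2), (1.0, 1.0)]:  # Cases 1-3
    def E(d, a=dt_m, b=dt_m1):
        # energy (G.2) as a function of delta_n
        return kappa * ((F_n + d * b) ** 3 * a +
                        (F_n - d * a) ** 3 * b) / (a + b) ** 3
    lo, hi = -F_n / dt_m1, F_n / dt_m            # feasible interval for delta_n
    grid = [lo + k * (hi - lo) / 400 for k in range(401)]
    mins.append(min(grid, key=E))
print(all(abs(d) < 1e-9 for d in mins))  # True: delta_n = 0 in all three cases
```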

Appendix H Proof of Proposition 4

According to Theorem 3, we have $x_{i-1,i}>0$ $(\forall i\in\{2,\cdots,K+1\})$. Hence, according to (11), we have $\xi^{*}-\sum_{n=i+1}^{K}\rho^{*}_{n}h_{\pi_{n}}\eta P_{0}-\omega^{*}_{i}F_{\max}=\sum_{n=1}^{i-1}2\kappa\frac{(x_{n,i})^{3}}{(\Delta t^{*}_{i})^{3}}+\rho^{*}_{i}\frac{2\lambda(A_{\pi_{i}})^{3}}{h_{\pi_{i}}(\Delta t^{*}_{i})^{3}}>0$. Therefore, it can be derived that $\Delta t^{*}_{i}=\sqrt[3]{\frac{2\kappa h_{\pi_{i}}\sum_{n=1}^{i-1}(x_{n,i})^{3}+2\rho^{*}_{i}\lambda(A_{\pi_{i}})^{3}}{\xi^{*}h_{\pi_{i}}-h_{\pi_{i}}\sum_{n=i+1}^{K}\rho^{*}_{n}h_{\pi_{n}}\eta P_{0}-\omega^{*}_{i}h_{\pi_{i}}F_{\max}}}$ $(\forall i\in\{2,\cdots,K-1\})$. Similarly, based on (12) and (13), we have $\Delta t^{*}_{K}=\sqrt[3]{\frac{2\kappa h_{\pi_{K}}\sum_{n=1}^{K-1}(x_{n,K})^{3}+2\rho_{K}^{*}\lambda(A_{\pi_{K}})^{3}}{\xi^{*}h_{\pi_{K}}-\omega^{*}_{K}h_{\pi_{K}}F_{\max}}}$ and $\Delta t_{K+1}^{*}=\sqrt[3]{\frac{2\kappa\sum_{n=1}^{K}(x_{n,K+1})^{3}}{\xi^{*}-\omega^{*}_{K+1}F_{\max}}}$.

Besides, according to (13), we have $\xi^{*}=\sum_{n=1}^{K}2\kappa\frac{(x_{n,K+1})^{3}}{(\Delta t^{*}_{K+1})^{3}}+\omega^{*}_{K+1}F_{\max}>0$ since $\sum_{n=1}^{K}(x_{n,K+1})^{3}>(x_{K,K+1})^{3}=(A_{\pi_{K}}I_{\pi_{K}})^{3}>0$. Therefore, we can obtain that $\sum_{i=0}^{K+1}\Delta t^{*}_{i}=T$. Furthermore, based on (10), we have $\xi^{*}=\sum_{n=1}^{K}\rho^{*}_{n}h_{\pi_{n}}\eta P_{0}$. Since $\xi^{*}$ is positive, there exists at least one $n\in\mathcal{K}$ such that $\rho_{n}^{*}>0$. This indicates that, for the energy causality constraints (6a), at least one device runs out of energy after offloading, i.e., this device uses all the harvested energy for transmission. Substituting (9) into (10), we have $\rho^{*}_{1}\frac{2\lambda(A_{\pi_{1}})^{3}}{h_{\pi_{1}}(\Delta t^{*}_{1})^{3}}=\rho^{*}_{1}h_{\pi_{1}}\eta P_{0}$. If $\rho_{1}^{*}>0$, we can deduce that $\Delta t^{*}_{1}=\sqrt[3]{\frac{2\lambda}{(h_{\pi_{1}})^{2}\eta P_{0}}}A_{\pi_{1}}$. Moreover, since $\frac{\lambda(A_{\pi_{1}})^{3}}{h_{\pi_{1}}(\Delta t_{1})^{2}}=\Delta t_{0}h_{\pi_{1}}\eta P_{0}$, we have $\Delta t_{0}^{*}=\sqrt[3]{\frac{\lambda}{4(h_{\pi_{1}})^{2}\eta P_{0}}}A_{\pi_{1}}$. If $\rho^{*}_{1}=0$, $\frac{\mathrm{d}\mathcal{L}}{\mathrm{d}\Delta t^{*}_{1}}=0$ holds for arbitrary $\Delta t_{1}$ satisfying $\frac{\lambda(A_{\pi_{1}})^{3}}{h_{\pi_{1}}(\Delta t_{1})^{2}}\leq\Delta t_{0}h_{\pi_{1}}\eta P_{0}$. That means any pair of $\Delta t_{0}$ and $\Delta t_{1}$ satisfying $\frac{\lambda(A_{\pi_{1}})^{3}}{h_{\pi_{1}}(\Delta t_{1})^{2}}\leq\Delta t_{0}h_{\pi_{1}}\eta P_{0}$ and $\Delta t_{0}+\Delta t_{1}=T-\sum_{i=2}^{K+1}\Delta t^{*}_{i}$ is optimal.
Hence, according to $\Delta t_{0}+\Delta t_{1}=T-\sum_{i=2}^{K+1}\Delta t_{i}$, we should have $\frac{\lambda(A_{\pi_{1}})^{3}}{h_{\pi_{1}}(\Delta t_{1})^{2}}\leq\left(T-\sum_{i=2}^{K+1}\Delta t_{i}-\Delta t_{1}\right)h_{\pi_{1}}\eta P_{0}$, i.e., $\Psi(\Delta t_{1})\triangleq\left(T-\sum_{i=2}^{K+1}\Delta t_{i}-\Delta t_{1}\right)(\Delta t_{1})^{2}(h_{\pi_{1}})^{2}\eta P_{0}-\lambda(A_{\pi_{1}})^{3}\geq 0$. Taking the derivative of $\Psi(\Delta t_{1})$, we have $\Psi^{\prime}(\Delta t_{1})=\Delta t_{1}\left(2T-2\sum_{i=2}^{K+1}\Delta t_{i}-3\Delta t_{1}\right)(h_{\pi_{1}})^{2}\eta P_{0}$, which has two null points $\Delta t_{1}=0$ and $\Delta t_{1}=\frac{2}{3}\left(T-\sum_{i=2}^{K+1}\Delta t_{i}\right)$. Thus, $\Psi(\Delta t_{1})$ increases in the region $\left(0,\frac{2}{3}\left(T-\sum_{i=2}^{K+1}\Delta t_{i}\right)\right]$ and decreases in $\left(\frac{2}{3}\left(T-\sum_{i=2}^{K+1}\Delta t_{i}\right),T-\sum_{i=2}^{K+1}\Delta t_{i}\right)$. Additionally, we can obtain that $\Psi(0)=\Psi\left(T-\sum_{i=2}^{K+1}\Delta t_{i}\right)$. Therefore, we can choose $\Delta t^{*}_{1}$ as the unique null point of $\Psi(\Delta t_{1})$ in the range $\Delta t_{1}\in\left(0,\frac{2}{3}\left(T-\sum_{i=2}^{K+1}\Delta t_{i}\right)\right]$ without loss of generality. $\blacksquare$
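Since $\Psi$ increases on $\left(0,\frac{2}{3}R\right]$ with $R=T-\sum_{i=2}^{K+1}\Delta t_{i}$, its null point there can be found by bisection. The sketch below uses assumed test values for all constants ($h_{\pi_{1}}$, $\eta$, $P_{0}$, $\lambda$, $A_{\pi_{1}}$, $R$):

```python
h1, eta, P0, lam, A1 = 0.9, 0.7, 1.0, 1e-2, 0.5  # assumed test values
R = 1.0                                          # residual time T - sum dt_i

def Psi(t1):
    return (R - t1) * t1 ** 2 * h1 ** 2 * eta * P0 - lam * A1 ** 3

assert Psi(2 * R / 3) >= 0                       # a root exists in (0, 2R/3]
lo, hi = 0.0, 2 * R / 3                          # Psi(0) < 0 <= Psi(2R/3)
for _ in range(100):                             # bisection on the increasing branch
    mid = (lo + hi) / 2
    lo, hi = (mid, hi) if Psi(mid) < 0 else (lo, mid)
t1_star = hi
print(abs(Psi(t1_star)) < 1e-9)                  # True: Psi(t1*) ~ 0
```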
