Single Threshold Packet Scheduling Policy for AoI Minimization in Resource-Constrained Network
Abstract
This paper investigates the tradeoff between the average age of information (AoI) and the transmission cost for networks with stochastic packet arrival and random erasure channel. Specifically, we model the resource-constrained AoI minimization problem as a constrained Markov decision process (CMDP) and propose a low-complexity single threshold packet scheduling policy for it. The key advantage of the proposed policy is its tractability and convenience for implementation. The AoI distribution and long-term average transmission cost of the proposed policy are derived as closed-form functions of the selected threshold. Furthermore, we show that the proposed policy reduces to the optimal policies under special settings and achieves close-to-optimal performance under general settings.
I Introduction
Data freshness is of paramount importance in real-time communication applications, such as automatic driving, remote sensing and control. Age of information (AoI) is a widely used metric to quantize the information freshness at the receiver [1]. The pioneer works [2][3][4] mainly considered the average AoI performance under different queuing policies, e.g., the first-come-first-serve (FCFS) and the last-generated-first-serve (LGFS) queues. It was pointed out in [5] that the preemptive LGFS (pLGFS) achieves the minimum average AoI in point-to-point networks with stochastic packet arrival. The analysis of AoI under different queuing policies was extended to the discrete-time networks in [4]. To capture the effect of AoI on the real applications, general age penalty functions and the corresponding optimal policies were investigated in [6]. A typical example of the age penalty function is the age violation probability raised in [7]. Evaluating the general age penalty function usually requires not only the average AoI, but also the complete AoI distribution. Reference [8] and [9] analyzed the AoI distribution under various packet arrival models and queuing policies, for both continuous and discrete queues.
The tradeoff between the AoI and other network performance metrics has also attracted a lot of research interest, such as the tradeoff between AoI and energy consumption [10], and the tradeoff between AoI and packet delivery ratio[11]. Besides, optimization of AoI under resource constraint is of particular interest for wireless sensor networks with limited power and/or spectrum. The authors in [12] considered a resource-constrained network with “generate-at-will” packet arrival model and random erasure channel. They proved that the threshold structure packet scheduling policies can achieve the optimal average AoI under both the automatic repeat request (ARQ) and hybrid ARQ transmission protocols. Reference [13] and [14] considered the AoI minimization in two-hop relaying networks, with total resource constraint applied on the source and relay nodes. A double threshold method was shown to achieve the near optimal AoI performance for the two-hop network when the source packet is generated at will [13].
When the packets can be generated at will by the transmitter, the design of the packet scheduling policy only needs to consider the receiving status of the previous transmissions. However, the practical data arrival may be controlled by independent component of the circuit and subject to certain constraint, e.g., the sensing power limitation. In this paper, we extend the existing works by considering the resource-constrained network with stochastic packet arrival. We formulate the optimization of AoI under resource constraint as a constrained Markov decision process (CMDP), and propose a lower bound of the AoI as a function of the long-term average transmission cost. Furthermore, we propose a low-complexity single threshold packet scheduling policy and derive the AoI distribution, average AoI and average transmission cost as the closed-form functions of a single threshold. With the closed-form performance characterizations, the optimal threshold can be easily determined for any given constraint and the proposed policy can be implemented in a very efficient manner. We show that when the resource constraint is relaxed or the packet arrival rate is high enough to mimic the “generate-at-will” model, the proposed single threshold policy reduces to the corresponding optimal schemes presented in [5] and [13], respectively. Finally, extensive simulations are conducted to verify the analytical results and compare the performance of the proposed policy with exiting benchmarks.
II System Model
As shown in Fig. 1, we consider a slotted point-to-point communication system, where the updating packets are generated from the source at the beginning of each time slot. The arrival of the packets to the transmitter follows Bernoulli process with arrival rate and the arriving time cost is neglected. Each packet arrived at the transmitter is stored in a unit-size buffer, and sent by the transmitter to the receiver through an erasure channel with erasure probability . If the transmission is successful, the packet will reach the receiver side at the end of the same time slot and an instantaneous error-free feedback will be received by the transmitter at the same time. The packet is removed from the buffer if it is successfully delivered or preempted by the newly arrived packet.
Note that each packet transmission, regardless whether it is successful, will cost certain amount of transmission power and occupy the channel spectrum and time. Hence, it is desired to minimize the transmission cost, while maintain the acceptable data freshness at the receiver.

II-A Data Freshness
At the transmitter side, the data freshness is only affected by the packet arrival rate . We assume that the transmitter data freshness at the th time slot is denoted by and measured at the beginning of each time slot. If there is a packet arrival at the th time slot, we have . Otherwise, we have
(1) |
where is the arrival time of the latest packet observed at the th time slot.
At the receiver side, the data freshness is not only affected by the packet arrival process, but also by the transmission policy and channel quality. We assume that the data freshness at the receiver is measured by the observed AoI at the end of each time slot. Denoted by the AoI observed at the th time slot. Following the conventional definition of AoI, we have:
(2) |
where is the arrival slot of the latest packet successfully received by the end of th slot. For consistence, we initialize and adopt the average AoI as the measurement of long-term data freshness, i.e.,
(3) |
where is the observation interval.
II-B Long-term Average Transmission Cost
The transmission cost is proportional to the number of transmissions made by the transmitter. We denote the transmitter action at the th time slot by , where if the transmitter chooses to send the packet in the buffer and if the buffer is empty or the transmitter chooses to remain silent even though a packet is waiting in the buffer. The long-term average transmission cost is defined as
(4) |
This paper aims to investigate the tradeoff between the data freshness and the transmission cost . The conventional pLGFS without resource constraint [5] achieves the minimum AoI with the maximum transmission cost. Hence, we have
(5) |
where the equality holds when the pLGFS transmission policy is adopted. On the other hand, when the packet is generated-at-will, which is equivalent to the proposed network model with , the optimal tradeoff between the data freshness and transmission cost has been presented in [12], which is reproduced as following
(6) | |||
(7) |
where is the system design parameter and various tradeoff between and can be achieved by changing . This paper aims to generalize the result of [12] to the network with stochastic arrival.
III Problem Formulation and Analysis
To investigate the tradeoff between data freshness and transmission cost, we consider the AoI minimization problem subject to the long-term transmission cost constraint, i.e.,
(8) | ||||
s.t., | (9) |
where is the maximum long-term average transmission cost. By varying the constraint and solving , we can obtain the trade-off between AoI and transmission cost.
III-A CMDP Modeling
The problem can be modeled as a CMDP as in [12] and [13]. For the network model considered in this paper, the CMDP consists of the following key elements:
-
•
State: Under the assumption of random packet arrival and erasure channel, the optimal action of the transmitter at a typical time slot is affected by the data freshness observed by the transmitter at the beginning of this time slot and that observed by the receiver at the end of the previous time slot. Hence, the state of the environment can be characterized by .
-
•
Action: At the th time slot, the transmitter can choose to transmit the packet in the buffer or remain silent, corresponding to and , respectively.
-
•
Reward: Since the objective is to minimize the average AoI observed by the receiver, we define the immediate reward at the th time slot as the negative of the receiver AoI, i.e., .
-
•
Cost: Each transmission incurs a unit cost and hence we can define the cost at the th time slot, denoted by , as the action taken, i.e., .
The state transition probability is related with the random packet arrival, channel status and action taken, which is illustrated in Fig. 2.

In summary, solving is equivalent to find the optimal policy for the above CMDP. The classical relative value iteration algorithm (RVIA)[15] can be used to find the optimal policy of CMDP by reducing the original infinite state space to finite size as approximation. However, as the optimal policy derived by RVIA has a multiple threshold structure [13], its implementation complexity is high and tractability is limited. In this paper, we derive a lower bound on the AoI under a given resource constraint and propose a low-complexity single threshold policy, which allows us to have a near optimal solution to problem in closed form, and hence investigate the tradeoff between AoI and transmission cost easily.
III-B Lower Bound on the AoI
Suppose there are packets successfully delivered to the receiver during the observation interval . We denote by the number of slots between the receiving time of the th packet and the th packet, and let be the number of time slots before receiving the first packet. For notational convenience, we further denote by the remaining number of time slots after receiving the last packet, i.e., .
Let be the age of the th packet when it is successfully delivered, where . The receiver AoI during the time slots between the delivery of the th and th packet are , respectively. Hence, the long-term average AoI is given by
(10) |
where follows from the fact that and .
Denote the expectation of by , i.e., . By Jensen’s inequality, we have
(11) |
Note that the term is the long-term throughput of the network, which is bounded by the packet arrival rate and the capacity of the network. Specifically, we have
(13) | |||
(14) |
where in (14) is the effective capacity of the network under the resource constraint. Substituting (13) and (14) into (12) renders the lower bound as
(15) |
IV Single Threshold Packet Scheduling Policy
In this section, we propose a low-complexity single threshold packet scheduling policy which achieves near optimal AoI at a given resource constraint, and allows us to derive the closed-form expressions for the AoI distribution, average AoI and transmission cost as functions of the threshold. Hence, the tradeoff between the receiver AoI and transmission cost can be easily obtained by varying the threshold.
IV-A Single Threshold Policy
First, note that the transmitter will naturally choose to remain silent if the buffer at the transmitter is empty, i.e., the latest arrived packet has been successfully delivered, which happens with probability . For all the best-effort transmission policies, we have
(16) |
If the buffer is not empty, the optimal action at the th time slot is determined based on the state . If the transmitter decides to send a packet, it will receive a reward of with probability , and a reward of with probability . On the other hand, if the transmitter choose to keep silent, it will receive a reward of . The expected reward gain by sending the packet is hence given by
(17) |
which is obtained at the constant cost of . From (17), the expected AoI improvement by sending a packet is proportional to . Therefore, we propose a low-complexity single threshold policy in Definition 1, which maximizes the expected reward gain under a given cost constraint.
Definition 1 (Single threshold policy).
At the th time slot, if the buffer is not empty, the transmitter chooses to transmit the packet if and only if is above certain threshold , where is a positive integer. Mathematically,
(18) |
Compared to the double threshold policy proposed in [13], the single threshold policy can be easily analyzed and it allows us to investigate the tradeoff between AoI and transmission cost more conveniently. Furthermore, the transmitter only needs to make the transmission decision at the time slot when new packet arrives. Specifically, assume a new packet arrives at the th time slot, we have . If the transmission is not activated, i.e., , there will be no transmission activated until the next packet arrival, since both and grow linearly with time, resulting no change in their difference. On the other hand, if the transmission is activated at the th time slot, i.e., , this packet will be repeated until it is successfully delivered or preempted by the newly arrival packet, since the value will not change until the packet is successfully delivered.
Denote by the probability that the AoI observed at the end of a typical time slot is , i.e., . Further denote by the probability that the receiver AoI is greater or equal to , i.e.,
(19) |
Based on the above analysis, the event occurs in the following two cases.
-
•
Case 1: The previous packet arrival occurs at time slots before, the transmission is activated when that packet arrives, and that packet has been successfully delivered during the subsequent time slots. Case 1 occurs with probability
(20) -
•
Case 2: The previous packet arrival occurs at time slots before, where , and the observed AoI is when that packet arrives. Furthermore, no transmission is activated when the previous packet arrives or the transmission is activated, but all the attempts are erased. Case 2 occurs with probability
(21)
By combining the above two independent cases, we have
(22) |
From (19)-(22), we can solve for the distribution of , as given in Lemma 1. The detailed derivation are omitted for brevity.
Lemma 1.
Consider a point-to-point communication system with Bernoulli packet arrival at rate and random packet erasure channel with erasure probability . If the single threshold policy with threshold is adopted, the probability mass function (PMF) of the receiver AoI is given by
(23) |
where .
Given the distribution of the receiver AoI, we are now ready to derive the long-term average AoI and transmission cost. Specifically, the average AoI is a function of threshold , given by
(24) |
Note that the transmission is activated when the buffer is not empty and the difference of data freshness between the receiver and the transmitter is greater than or equal to the threshold . These two independent events occur with probability and , respectively. Hence, the long-term average transmission cost is also a function of threshold , given by
(25) |
By substituting derived from Lemma 1 into (24) and (25), we can obtain the closed-form expressions of the average AoI and transmission cost, as given in Lemma 2.
Lemma 2.
Consider a point-to-point communication system with Bernoulli packet arrival at rate and random packet erasure channel with erasure probability . If the single threshold policy with threshold is adopted, the long-term average receiver AoI and transmission cost are given by
(26) | |||
(27) |
where is defined in Lemma 1 and .
IV-B Selection of the Threshold
It is observed that the average AoI in (26) is a monotonically increasing function with the threshold , while the transmission cost in (27) is a monotonically decreasing function with . To achieve the minimum AoI subject to cost constraint, we can find the maximum value of that meets the cost constraint. Mathematically, we have
(28) |
The optimal threshold is solved from
(29) |
where is the natural number set. The proposed single threshold policy with the optimal threshold in (29) is referred as the “deterministic single threshold policy”.
Note that the integer constraint on the threshold may lead to under-use of the resource, i.e., . In such case, the AoI performance of the proposed transmission policy can be improved by introducing a “random policy” where the transmitter uses the threshold with probability and uses the threshold with probability . The probability is determined by solving
(30) |
The combination of the deterministic and the random single threshold policy mentioned above can achieve arbitrary point on the tradeoff curve between the AoI and transmission cost, and hence provide a complete solution to the problem .
Note that if there is no resource constraint, we have and the proposed scheme reduces to the conventional pLGFS with performance given in (5). Furthermore, if we increase the packet arrival rate to , the network degenerates to the one with generate-at-will arrival model. The AoI and transmission cost equation in (26) and (27) reduces to (6) and (7), respectively, which are proven to be optimal in [12].
V Numerical Results
In this section, numerical results are presented to verify the analytical results and validate the performance of the proposed single threshold packet scheduling policy.
First, we verify the analytical PMF of the AoI presented in Lemma 1 with the simulation results in Fig. 3. The simulation results are obtained with the observation interval time slots. It is observed that the analytical results match very well with the simulation results and the AoI tends to be larger with the increase of the threshold . Next, the long-term average AoI and transmission cost presented in Lemma 2 are verified with the simulation results in Fig. 4. As expected, the average AoI monotonically increases with the threshold, while the transmission cost monotonically decreases with the threshold.


To validate the performance of the proposed single threshold packet scheduling policy, we compare the achievable tradeoff between the AoI and transmission cost with the lower bound in (15) and that achieved by the following benchmark schemes:
-
•
Random Transmission: When the buffer is not empty, the transmitter chooses to activate the transmission with probability . Random transmission can be viewed as pLGFS with effective erasure probability . The average AoI and transmission cost are given by
(31) To ensure that the resource constraint is met, the transmission probability is set by solving , which renders
One of the advantages of the random transmission policy is that it does not rely on the feedback information.
-
•
Double Threshold Policy [13]: As suggested in [13], a double threshold method may further enhance the AoI performance. With double threshold policy, the transmitter chooses to send the packet in the buffer if the age of that packet is below certain threshold and the difference of data freshness between the transmitter and the receiver is above certain threshold, i.e.,
Since [13] considered the generate-at-will arrival model, the analytical results cannot be directly applied to the case with stochastic packet arrival. Hence, we choose the optimal threshold by exhaustive search in Fig. 5.
It is observed from Fig. 5 that the random transmission policy has the poorest performance, i.e., it has the largest AoI at the same amount of transmission cost. This is consistent with expectation since random transmission policy has neglected the environment status. The performance of the double threshold method is dependent on the selection of the thresholds. When the resource constraint is tight, it is desired to choose a small . On the other hand, a larger value of is preferred when the resource constraint is loose. As compared with the double threshold policy, the proposed policy offers the benefit of simple implementation and closed-form performance expressions, which allow us to select the optimal threshold easily. It is observed from Fig. 5 that the proposed single threshold policy has very close performance with the double threshold policy when and exhaustive search of , and outperforms the double threshold policy when . The performance of the proposed single threshold policy is also very close to the lower bound, which further validates its optimality. Finally, note that all the schemes converge to the pLGFS when the maximum transmission cost is large enough, which is optimal with the absence of resource constraint.

VI Conclusion
This paper investigates the tradeoff between the long-term average AoI and transmission cost for a network with stochastic packet arrival and random erasure channel. We formulate the AoI minimization problem under the resource constraint as a CMDP and propose a low-complexity single threshold packet scheduling policy. The achievable AoI and transmission cost of the proposed scheme are expressed in simple closed-form. Simulation results show that the proposed single threshold policy has very close performance with the lower bound and the complex double threshold method with the optimal threshold pairs obtained via exhaustive search. Furthermore, the proposed policy degenerates to the optimal policies proposed in the literature under special cases.
Note that most of the packet scheduling policies discussed in this paper rely on the instantaneous feedback from the receiver, which may not be available in practice. In the future, we will extend the proposed scheme to the network where the feedback information is delayed or even absent, and investigate the tradeoff among more network performance metrics, e.g., the tradeoff between the receiver AoI and the long-term network throughput under resource constraint.
Acknowledgement
This work was supported by the Natural Science Foundation of China with grant number 6504009712 and the Natural Science Foundation of Jiangsu Province with grant number BK20200354.
References
- [1] A. Kosta, N. Pappas, and V. Angelakis, “Age of information: A new concept, metric, and tool,” Foundations and Trends in Networking, vol. 12, no. 3, pp. 162–259, 2017.
- [2] S. Kaul, R. Yates, and M. Gruteser, “Real-time status: How often should one update?,” in 2012 Proceedings IEEE INFOCOM, pp. 2731–2735, IEEE, 2012.
- [3] S. K. Kaul, R. D. Yates, and M. Gruteser, “Status updates through queues,” in 2012 46th Annual Conference on Information Sciences and Systems (CISS), pp. 1–6, IEEE, 2012.
- [4] V. Tripathi, R. Talak, and E. Modiano, “Age of information for discrete time queues,” arXiv preprint arXiv:1901.10463, 2019.
- [5] A. M. Bedewy, Y. Sun, and N. B. Shroff, “Optimizing data fresheness, throughput, and delay in multi-server information-update systems,” in IEEE Intern. Symp. Inform. Theory, 2016.
- [6] Y. Sun, E. Uysal-Biyikoglu, R. D. Yates, C. E. Koksal, and N. B. Shroff, “Update or wait: How to keep your data fresh,” IEEE Trans. Information Theory, vol. 63, pp. 7492–7508, Nov. 2017.
- [7] R. Devassy, G. Durisi, G. C. Ferrante, O. Simeone, and E. Uysal, “Reliable transmission of short packets through queues and noisy channels under latency and peak-age violation guarantees,” IEEE Journal on Selected Areas in Communications, vol. 37, no. 4, pp. 721–734, 2019.
- [8] A. Kosta, N. Pappas, A. Ephremides, and V. Angelakis, “The age of information in a discrete time queue: Stationary distribution and non-linear age mean analysis,” IEEE Journal on Selected Areas in Communications, vol. 39, no. 5, pp. 1352–1364, 2021.
- [9] Y. Inoue, H. Masuyama, T. Takine, and T. Tanaka, “A general formula for the stationary distribution of the age of information and its application to single-server queues,” IEEE Transactions on Information Theory, vol. 65, no. 12, pp. 8305–8324, 2019.
- [10] J. Gong, J. Zhu, X. Chen, and X. Ma, “Sleep, sense or transmit: Energy-age tradeoff for status update with two-thresholds optimal policy,” IEEE Transactions on Wireless Communications, 2021.
- [11] X. Xu and Y. Zeng, “Packet encoding for data freshness and transmission efficiency with delayed feedback,” in IEEE Wireless Communications and Networking Conference (WCNC), IEEE, 2021.
- [12] E. T. Ceran, D. Gündüz, and A. György, “Average age of information with hybrid ARQ under a resource constraint,” IEEE Transactions on Wireless Communications, vol. 18, no. 3, pp. 1900–1913, 2019.
- [13] Y. Gu, Q. Wang, H. Chen, Y. Li, and B. Vucetic, “Optimizing information freshness in two-hop status update systems under a resource constraint,” IEEE Journal on Selected Areas in Communications, vol. 39, no. 5, pp. 1380–1392, 2021.
- [14] A. Zakeri, M. Moltafet, M. Leinonen, and M. Codreanu, “Minimizing AoI in resource-constrained multi-source relaying systems with stochastic arrivals,” arXiv preprint arXiv:2109.05106, 2021.
- [15] M. L. Puterman, Markov decision processes: discrete stochastic dynamic programming. John Wiley & Sons, 2014.