RRO: A Regularized Routing Optimization Algorithm for Enhanced Throughput and Low Latency with Efficient Complexity
Abstract
In the rapidly evolving landscape of wireless networks, achieving enhanced throughput with low latency for data transmission is crucial for future communication systems. While low complexity OSPF-type solutions have shown effectiveness in lightly-loaded networks, they often falter in the face of increasing congestion. Recent approaches have suggested utilizing backpressure and deep learning techniques for route optimization. However, these approaches face challenges due to their high implementation and computational complexity, surpassing the capabilities of networks with limited hardware devices.
A key challenge is developing algorithms that improve throughput and reduce latency while keeping complexity levels compatible with OSPF. In this collaborative research between Ben-Gurion University and Ceragon Networks Ltd., we address this challenge by developing a novel approach, dubbed Regularized Routing Optimization (RRO). The RRO algorithm offers both distributed and centralized implementations with low complexity, making it suitable for integration into 5G and beyond technologies, where no significant changes to the existing protocols are needed. It increases throughput while ensuring latency remains sufficiently low through regularized optimization. We analyze the computational complexity of RRO and prove that it converges with a level of complexity comparable to OSPF. Extensive simulation results across diverse network topologies demonstrate that RRO significantly outperforms existing methods.
Index Terms:
Routing algorithms, low-complexity algorithms, enhanced-throughput, low-latency.I Introduction
The rapid evolution of communication network technology in 5G and beyond has led to a surge in demand for wireless communication services. Despite this progress, spectrum scarcity remains a significant constraint in meeting this growing demand. Hence, the development of efficient algorithms for data transmission in communication networks, which utilize available spectral resources and manage data transmissions effectively, has become a critical challenge in modern communication networks, such as 5G and beyond wireless and mobile networks [1, 2, 3, 4], cognitive radio networks [5, 2, 6, 7, 8], distributed [9, 10] and federated learning systems [11, 12].
Recent applications in 5G and beyond technology require high demands on the network for higher throughput and low latency, especially in complex environments where multi-flow transmissions across network links interfere with each other, such as IoT, wireless sensor networks, drone networks, and autonomous vehicles.
Low-complexity algorithms, like the shortest-path algorithms in OSPF-type solutions, are effective in lightly-loaded networks but struggle as congestion and mutual interference increase. Recent approaches advocate for route optimization using backpressure to alleviate congestion on heavily utilized preferred links. However, they face challenges such as frequent data exchanges between nodes and high storage and computational complexity as the network scales. Furthermore, deep learning methods have been proposed to boost throughput with low latency but require significant computational resources, exceeding the capabilities of networks with limited hardware devices. Therefore, a critical challenge in communication network research is to develop algorithms that achieve higher throughput with lower latency while keeping complexity levels compatible with OSPF.
In this collaborative research between Ben-Gurion University and Ceragon Networks Ltd., we address this challenge. In this context, this paper delves into the routing problem within communication networks with mutual interference environment. The network under consideration is represented as a directed connected graph , where denotes the set of nodes, each corresponding to a user, and represents the communication links. Each directed link , connecting a transmitter to a receiver (where ), encapsulates the notion of neighboring nodes in this graph. A key characteristic of these links is their susceptibility to mutual interference, which arises when radio frequency signals overlap in the same spatial region, thereby affecting the link capacities adversely. We consider a network with distinct flows, with each flow originating from a source node and directed towards a destination node , both of which are elements of . The primary objective articulated in this research is the strategic allocation of routes for these multiple data flows. Such allocation must be optimized to enhance the network performance, reflecting a comprehensive approach to managing the inherent mutual interference. This interference, characteristic of concurrent transmissions along proximal routes, can significantly diminish the effective capacity of the involved links.
Hence, the challenge addressed here is to enhance the efficiency of spectral resource utilization by routing data flows through paths that enhance throughput and reduce delay, while simultaneously mitigating the detrimental effects of mutual interference between links across the network. This must be achieved using a low-complexity algorithm compatible with OSPF.
I-A Related Works
Solving the shortest-path problem is a prevalent approach for data routing in contemporary communication networks [13, 14, 15]. This method aims to find the shortest path between a source and a destination node in a network, reducing the transmission delay and improving network performance. Shortest-path routing algorithms are widely used in various network protocols, such as OSPF and RIP. These protocols rely on the shortest-path algorithm to determine the most efficient routes for data transmission across the network. By selecting paths with the least number of hops or the lowest accumulated cost, these protocols aim to achieve efficient data routing and network operation. For example, the widely adopted OSPF routing protocol employs this principle to route data flows efficiently across the network, where the computation of the shortest paths is often facilitated by the Dijkstra algorithm [16, 14, 15]. More recent studies focus on online learning algorithms for shortest path computations under unknown link states that need to be inferred [17, 18, 19].
However, while shortest path-based routing is effective under various circumstances, it presents a significant drawback: It tends to increase congestion along the shortest routes, especially in networks experiencing high traffic loads. This congestion can, in turn, degrade overall network performance, as the most direct routes become highly congested with excessive data flow.
To address this limitation, backpressure routing emerges as an alternative routing aproach [20]. Backpressure routing, derived from network utility maximization principles, dynamically adjusts data routes based on real-time congestion levels, contrasting sharply with traditional methods that focus on minimizing path cost [21]. This strategy reduces congestion by maximizing differential queue backlog between nodes. It is known to achieve maximal throughput theoretically in a non-interference environment [14]. Related studies on backpressure routing can be found in [22, 23, 24, 25, 26, 14, 27, 28, 19] and references therein.
Despite the efficiency of backpressure routing in terms of achievable throughput, it suffers from several main drawbacks. Firstly, backpressure routing tends to reduce congestion by transmitting packets through less congested nodes. While this approach is beneficial when congestion levels are high, it is inefficient when congestion is low. In such cases, packets travel along longer paths when shorter paths could have been used, leading to a significant increase in delay. Secondly, in an interference environment where links interfere with each other, transmitting data through longer paths increases the interference levels across the network, thereby reducing its overall capacity. Thirdly, from a practical implementation perspective, backpressure routing requires frequent message exchanges between neighboring nodes regarding their instantaneous queue states to make routing decisions. This necessity for constant updates for each transmission creates significant overhead for the communication protocol. These drawbacks have so far prevented the operational implementation of backpressure routing in real-world systems.
To address the limitation of increased delay in low congestion levels by backpressure routing, while incorporating the benefits of traditional shortest path routing, recent studies have suggested the development of hybrid routing algorithms. These approaches leverage the strengths of both methods, using shorter paths when congestion is low to avoid the delays characteristic of backpressure routing, and longer paths during high congestion to prevent the issues associated with shortest path routing [29, 19]. This balance aims to optimize network performance across varying congestion levels, thereby increasing throughput while limiting path costs and adapting to dynamic network conditions. However, these methods still suffer from high implementation complexity, as they require frequent message exchanges between neighboring nodes regarding their instantaneous queue states to make backpressure-type routing decisions. Additionally, they necessitate storing a large number of buffers for each flow to optimize the route for each packet. Consequently, these methods are not practically appealing for a variety of network applications.
In this paper, inspired by the hybrid approach as in [29, 19], we aim to strike a balance between increasing throughput and limiting path costs. However, a significant advantage of our approach is that we reduce congestion by seeking routes with high capacity, resulting in increased throughput without relying on backpressure. This avoids the drawbacks of backpressure-based routing discussed earlier. Additionally, our method maintains a complexity level compatible with OSPF.
Moreover, inspired by the recent success of deep learning and AI in solving complex decision-making tasks, deep learning methods have been proposed to solve related spectrum access [30, 31, 32, 33, 34], scheduling [35, 36, 37, 38, 39], and routing [40, 41, 42, 43, 44, 45] problems. Specifically, recent studies developed deep learning based methods to compute flow routes in communication networks to increase throughput with low latency [40, 41, 42, 43, 44, 45]. However, these methods demand high computational resources that exceed the capabilities of networks with limited hardware devices, making them unsuitable in many cases. Therefore, a key challenge in communication network research is to devise algorithms that achieve enhanced throughput with low latency while maintaining a complexity level compatible with OSPF.
Furthermore, combining low-complexity algorithms with deep-learning techniques could offer a versatile solution for future networks. This hybrid approach would allow for the flexibility to use deep learning when higher processing time is permissible and low-complexity algorithms when quick decisions are needed. Thus, these solutions do not compete but rather complement each other, offering a balanced and adaptive strategy to meet varying network demands.
I-B Main Results
In this paper, we concentrate on developing low-complexity algorithms that are compatible with OSPF. Our aim is to enhance network performance by achieving higher throughput and lower latency without increasing computational demands. These algorithms are designed to be efficient and scalable, ensuring they can be implemented in real-world networks with limited hardware resources. By aligning with the principles of OSPF, our approach maintains compatibility with existing network infrastructure, facilitating easy integration and adoption. Through this research, we seek to provide practical solutions that address the pressing challenges in modern communication networks.
Our motivation to balance enhanced throughput with low latency shares similarities with the approaches in [29, 19]. However, our algorithmic solution is fundamentally different and has significant advancements compared to [29, 19]. Firstly, it does not use backpressure for data transmission. Instead, it computes the balanced path to enhance capacity with low path cost at the source node and routes the packet through the computed path. This avoids the need for frequent message exchanges between neighboring nodes regarding their instantaneous queue states, as required by backpressure transmissions. By leveraging the advantages of 5G and beyond technologies, our method enables both distributed and centralized path computation, similar to current OSPF implementations. Consequently, no changes to the communication protocol are needed compared to current deployments. Secondly, the implementation complexity of our approach is much simpler as it does not require storing a large amount of packet buffers when making decisions, unlike [29, 19]. Thirdly, the computational complexity of our approach is very low and is compatible with OSPF, as we will describe later.
Specifically, in terms of algorithm development, we present a novel algorithm called Regularized Routing Optimization (RRO). The RRO algorithm offers both distributed and centralized implementations with low complexity, making it suitable for integration into 5G and beyond technologies with centralized node deployments. The RRO optimization process is implemented on a computation unit, either a source node of the flow in a distributed implementation or a centralized unit in a centralized implementation. This process utilizes the topology, link weights (such as link capacity), and an interference map, similar to the OSPF protocol used in 5G networks. The innovative design of the regularized optimization strikes a balance between the interference level over a route and the path length to yield the corresponding path allocation for each flow. It achieves this by searching for paths with high throughput while ensuring that latency remains sufficiently low.
Secondly, in terms of complexity analysis, we prove that RRO requires a low-complexity running time of , where , and denote the numbers of edges and nodes, respectively. This maintains the same complexity order as OSPF, as required.
Finally, we extensively evaluated RRO’s performance through simulations across various network topologies, including random deployments, NSFNET, and GEANT2 networks. The results consistently demonstrated that RRO significantly outperforms existing methods with compatible complexity. These simulations showcased RRO’s ability to handle diverse and complex network conditions efficiently. The improvements were evident in terms of both throughput and latency, proving RRO’s robustness and adaptability. Additionally, the algorithm’s performance was stable across different scenarios, underscoring its potential for real-world applications in 5G and beyond technologies. This comprehensive evaluation highlights RRO as a superior routing solution that is very well-suited for modern network environments.
II Network Model and Problem Statement
We consider a routing problem in communication networks operating in an environment with mutual interferences between links. The communication network topology is conceptualized as a directed connected graph , where each vertex represents a distinct node or user within the communication network. The set of edges (i.e. links) of the graph corresponds to a set of communication channels between pairs of users. An edge exists between vertices and if and only if there is a communication link between user and user . It is assumed that each communication channel is bidirectional, allowing for symmetrical data exchange between connected users. Links cause mutual interference due to the overlapping of their radio frequency signals in the same spatial region.
Let be a set of data flows, where each flow (say ) is indicated by source node and destination node . A path (i.e., route) between and is denoted by , where , and . A length of path is defined by the number of hops through the path: . Let be a selected path for flow , be the selected path vector for all flows, and be the selected path vector for all flows excluding flow .
Let be the data-rate of flow . Note that depends on the selected route of flow as well as the selected routes of other flows that interfere with flow . We denote the set of flows that interfere with flow (i.e., interfering neighbors) by . All other flows are transmitted through paths that cause interference below the noise floor (due to geographical distance, directional antennas, etc.). As a result, depends on the selected route of flow , and the selected routes of flows , . The cases of no-interference environment, and full-interference environment applies as special cases of our model, by setting , and , respectively.
Let a congestion function be a monotonic decreasing function of the data rate [14, 46]. Detailed computations of the data rate and congestion functions are discussed in Section III.
The objective is to strike a balance between routing data through short paths to reduce the latency, and through low-congested paths to increase the throughput. Specifically, assume that paths were selected for flows , given by path vector . Then, the objective is to solve for flow the following optimization problem:
(1) |
where is a weight parameter. Increasing tends to allocate a shorter path to flow at the expense of using more congested links, whereas decreasing tends to assign a path with lower congestion to flow , at the expense of using longer paths.
The solution is required to be of low complexity and compatible with OSPF. The solution is updated by each flow , periodically, given the current path allocation vector of other flows. In Section III, we develop both centralized and distributed implementations for solving (1).
Remark 1
The optimization problem we address shares similarities with the one in [29, 19] and subsequent studies, which combine shortest path and backpressure routing to mitigate congestion with low latency. However, there are significant differences between our optimization problem and algorithmic solutions compared to those in [29, 19]. Specifically, the problem in [29, 19] balances path length and queue size (a direct measure of congestion), leading to a complex backpressure-based routing algorithm. This approach suffers from high implementation complexity, as discussed in Section I. In contrast, our method reduces congestion by balancing path length with a congestion function that is monotonically decreasing with the data rate. This innovative approach allows us to develop a low-complexity end-to-end path solution at the source node without requiring backpressure transmissions. Additionally, the computational complexity of our method is very low and compatible with OSPF, as detailed in Section III.
III The Regularized Routing Optimization (RRO) Algorithm
In this section, we introduce the RRO algorithm. This algorithm supports both distributed and centralized implementations with low complexity, making it well-suited for integration into 5G and next-generation technologies that use centralized node deployments. The RRO optimization process is executed on a computation unit, which can be either the source node in a distributed setup or a centralized unit in a centralized setup. This process leverages network topology, link weights (such as link capacity), and an interference map, similar to the OSPF protocol used in 5G networks. We start by presenting the distributed implementation of RRO, then discuss the necessary adjustments for the centralized implementation. The pseudocode of the algorithm is given in Algorithm 1.
III-A Distributed RRO
In the distributed implementation, each flow updates its route distributedly from time to time, as currently employed by OSPF mechanism, given the current path allocation vector of all other flows. Once a flow selects a path it broadcasts this information to all other source nodes in the network, to have each source node holds the updated path allocation vector of all other flows.
We now describe the basic steps of RRO with respect to flow , computed by its corresponding source node . In the description below, we often omit the subscript when the notations become cumbersome, to simplify the presentation. Assume that paths were selected for flows , given by path vector .
Initial computation step at source node : Based on the interference map, and the
current path allocation vector of all other flows, the source node computes the achievable rates of all links in the network: , assuming it selects link in its updated path. Specifically, the achievable rate over link , , is computed as follows:
(2) |
where is the bandwidth of link , and
(3) |
denotes the Signal to Interference plus Noise Ratio (SINR) at the reciever of link . Here, is the received power of the signal, is the power spectrum density (PSD) of the additive white Gaussian noise (AWGN), and
(4) |
is the cumulative interference power at the receiver of link , aggregated across all links within its interference range . Here, is the interference power at the receiver of link caused by flow , greater than zero if active by strategy . The transmission power control mechanism considered in this paper is independent, where a predetermined transmission power is assigned to each link based on factors such as geographical distance and channel state. Consequently, the search space of the optimization problem is solely determined by the selected paths that affect the throughput.
Regularized Routing Optimization step at source node : In this step, source node computes the updated path given the current path allocation vector .
Note that the rate of flow transmitted through path is determined by the link with the lowest rate across the links in its path, say link . The achievable rate of flow is thus given by:
(5) |
Therefore, the RRO algorithm aims at searching path with enhanced data rate, while limiting the path length to strike a trade-off between enhanced throughput and low latency.
To allow this optimization procedure, we first apply linear transformation to transform the computed link rates into corresponding weights, where a link with higher capacity is assigned a lower weight, and vice versa. Let , and . Then, RRO computes:
(6) |
where the values were computed in the initial computation step. As a result, is a mapping function from the link rate into , preserving the significance of order in the inverse direction.
Next, the congestion function is set to be:
(7) |
Then, using this construction, the RRO algorithm can use in (1) as a regularized parameter of flow , to compute a low-complexity regularized optimization to update the selected path iteratively over the nodes. Increasing increases the penalty of using long paths, with the price of increasing the congestion over the selected path. The procedure applies a Dijkstra-type update, were at each iteration, each node updates the index of its predecessor in the updated path through the network, and the updated regularized cost. The procedure starts with storing all nodes in a set , and initializing all regularized distances used in the computation to infinity, except the distance at the source node which is set to zero.
First, the source node is removed from . Then, the regularized distance of all its neighbors is updated based on the following computation:
(8) |
and the index of the predecessor of is set to be . Then, RRO updates the hop counter (in units of ) of node such that and set the maximal weight on the path from to to be . In the next iteration, the node with the smallest regularized distance in is selected (i.e., the neighbor of with the smallest regularized distance): . Assume node is selected for the next iteration. Then, it is removed from . Next, the regularized distance is computed with respect to all its neighbors:
(9) |
If is smaller than the current value of , then RRO updates the value of to be , and the predecessor of is set to be . The procedure continues until convergence for flow (convergence to smallest regularized cost, and complexity analysis is provided in Section IV)
Then, the updated path is generated by the stored predecessors
(10) |
Note that throughout the routine RRO searches for path with minimal regularized cost, consisting of the maximal value between and plus the regularization parameter . Decreasing assigns more weight on reducing congestion to increase throughput, determined by the link with lowest data rate. while increasing assigns more weight on minimizing the path length (i.e., tending to shortest path selection).
Notification Step: In the final step, the source node must inform all other source nodes of its selected route by broadcasting a control signal containing this information. This process does not significantly complicate the system compared to the current OSPF protocol, which already requires extensive message exchanges between nodes regarding network topologies and link weights before computing the shortest path. Therefore, it does not increase the implementation complexity compared to current deployments.
The procedure continues as each source node periodically updates its selected route. The pseudocode is given in Algorithm 1
III-B Centralized RRO
The centralized implementation of RRO follows similar steps to the distributed implementation, with the adjustments discussed in the following section. The key difference in the centralized implementation is that a central unit receives the input information regarding the network topology and interference map, and computes all routes for all nodes. It determines the desired path for each flow one by one and notifies each flow of its updated path. This approach aligns with 5G and beyond technology, which supports central unit deployments.
Initial computation step at the centralized unit: This step is similar to that in the distributed implementation, but it is performed at the centralized unit for flow .
Regularized Routing Optimization step at the centralized unit:
This step is similar to that in the distributed implementation, but it is performed at the centralized unit for flow .
Notification Step: The centralized node must inform only source node of its selected route by
broadcasting a control signal containing this information.
The procedure continues as the centralized node repeat these steps for all flows to periodically update their selected routes. The pseudocode is given in Algorithm 1.
IV Convergence and Complexity Analysis of the RRO Algorithm
In this section, we analyze the convergence of RRO to the most efficient path in terms of regularized path cost, and the computational complexity of the algorithm. We denote the path that solves (1), with congestion function given in (7) as regularized most efficient path (RMEP).
Theorem 1
proof Throughout the proof, we refer to flow . For simplicity, we often omit the subscript . Let represent a subset of vertices, where initially . This subset comprises vertices for which the true regularized most efficient paths from source have already been determined. Let represent the true RMEP of node , signifying that this value remains invariant under any further relaxations. Correspondingly, the true RMEP from to node , defined as an ordered sequence of vertices, is represented by .
We shall prove the correctness of the RRO algorithm by induction, establishing the non-existence of any alternative RMEP.
For the base base case, initially, the RMEP to is set to 0, and to . Clearly, as the RMEP from to itself is zero.
For the inductive hypothesis, assume that after iterations (where ), the algorithm has correctly computed the RMEP for every node in .
Next, in the inductive step, for the -th iteration, where , consider node as having the minimum distance, , among all nodes yet to be visited. We aim to demonstrate that the algorithm continues to compute the correct RMEP, i.e., that is, . We proceed by contradiction, assuming the existence of a hypothetical alternative path with a cost . We analyze the implications of this assumption under two distinct scenarios that together cover all possibilities: (i) comprising nodes both within and outside ; (ii) consists solely of nodes from within .
By examining these scenarios, we assert the non-existence of a valid that would invalidate the algorithm’s current RMEP calculation, thereby upholding the correctness of the computed paths at each iteration.
We start by contradicting case (i). At the iteration, we add a node to . The following conditions are subsequently upheld: (a) , (according to the algorithm operation); (b) , (according to induction hypothesis).
Let be the first edge taken from , where and , Since we applied relaxation to when it was visited, we get:
(11) |
where:
(12) |
Moreover, since is only a subpath of , we must have:
(13) |
Note that has a cost of , and therefore . However, since we must have . Hence, a contradiction arises.
Next, we contradict case (ii). Let be , where . According to the operations of the algorithm, we get:
(14) |
Furthermore, we observe the following:
(15) |
However, once is visited, it should have set to at most . Hence,
(16) |
yielding a contradiction.
As a result, no such path can exist, confirming the algorithm’s correctness by induction. ∎
As previously noted, our aim is to devise an algorithm for routing problems that is not only simple to implement but also maintains the same level of complexity as traditional, implemented in current deployments. Here, we establish that the complexity of our algorithm is commensurate with OSPF that implements the Dijkstra’s algorithm when constructing the routing table.
Theorem 2
(Complexity Analysis of RRO Algorithm) The RRO algorithm exhibits time complexity of and space complexity of .
Proof: To analyze the complexity of the RRO algorithm, we examine the computational cost associated with each step.
In the initial computation step, the source node calculates the achievable rates for all network links, resulting in time and space complexities of .
Next, the initialization of predecessors and RMEP lists incurs time and space complexities of .
Next, initializing the queue requires . Extracting the minimum from traditionally requires per extraction, summing to for all vertices. However, utilizing a Fibonacci heap reduces this to , as shown in [47].
Finally, in the relaxation process, each neighbor’s check and update operation requires complexity of . As a result, considering each edge once results in a total complexity of .
Combining these findings, when allocating over data flows, the overall time complexity is , with the Q operations being the primary contributor. The total space complexity is , accounted for by the initial computations and the storage needs of the queue, predecessors, and RMEP lists. ∎
V Numerical Analysis
In this section, we present extensive experimental results to evaluate the performance of the proposed RRO algorithm under various wireless network topologies, namely NSFNET, GEANT2, and large-scale random network deployments. We conducted comparisons among well-known algorithms with compatible complexity: (i) Open Shortest Path First (OSPF): The OSPF protocol is extensively utilized for data routing and are implemented in numerous real-world systems[14]. In this context, we have implemented the well-established OSPF version that selects the route with the fewest number of hops between the source and the destination. This algorithm is widely employed in communication networks where minimizing latency is a critical objective[48]. (ii) Interference Minimization Algorithm (IMA): Routing over paths with minimal interference is commonly used in environments where links experience mutual interference. In this context, we have implemented an interference minimization strategy that is compatible with OSPF’s complexity, selecting paths that minimize absorbed interference to enhance throughput efficiency [49]. (iii) Randomized Greedy Algorithm (RGA): RGA is a heuristic method that selects the best path allocation from independent trials of random choices from the action set. This algorithm is commonly used in machine learning research to assess a learning algorithm’s capability to identify effective strategies and outperform exhaustive search within a constrained number of trials[43]. All selected algorithms exhibit the same complexity as our algorithm.
In the implementation of all tested algorithms, consistent knowledge of the wireless network state and the interference map was utilized to optimize the objective and determine the selected paths. For each simulation experiment, the results were averaged over randomly independent configurations, which included variations in flow demands and link capacities. In our analysis, we evaluated and presented the achievable data rate, averaged across data flows, the average delay experienced by packets as they traverse from the source node to the destination node, the log-rate of each flow as a fairness metric (i.e., proportional fairness), and the maximal queue length as a metric to evaluate the algorithms’ efficacy in supporting data transmissions.
We categorized the simulated networks based on the node-to-link ratio. Specifically, networks where are classified as lightly-loaded, those where as moderately-loaded, and networks with as heavily-loaded. This classification helped us systematically assess the network performance under varying load conditions.
V-A Results for the NSFNET Network

We begin our evaluation by simulating the well-known NSFNET network topology [50], which comprises nodes and edges, as illustrated in Fig. 1(a). The results are presented in Figs. 1(b)-1(e). It can be seen that RRO consistently outperforms all other evaluated algorithms, with an average data rate outperforming IMA by to , as presented in Fig. 1(b). It is important to highlight that the superior data rate performance of the RRO algorithm does not compromise packet delay, as depicted in Fig. 1(c). Additionally, RRO demonstrates significantly higher fairness values compared to other algorithms, as illustrated in Fig. 1(d). Finally, in Fig. 1(e) we present the maximum queue length for , indicating that RRO’s queues stabilize quickly, unlike other methods which diverge and fail to support high loads.
V-B Results for the GEANT2 Network
Next, we simulate the well-known GEANT2 network topology [51], which comprises nodes and edges, as illustrated in Fig 2(a). The results are presented in Figs. 2(b)-2(e). Once again, the RRO algorithm consistently surpasses all other evaluated algorithms, achieving an average data rate significant improvement, as presented in Fig. 2(b). We observe a significant improvement in lightly loaded networks, with the performance gap gradually narrowing. Ultimately, when the network is heavily loaded (), the relative advantage of RRO compared to the other methods becomes less significant in terms of average data rate. In terms of average delay and fairness values, significant improvements were observed, where RRO continues to demonstrate substantial improvements even under extremely heavy loads settings, as illustrated in Fig. 2(c) and in Fig. 2(d), respectively.

Finally, Fig. 2(e) displays the maximum queue length observed in the temporal dynamics for a heavily loaded network with data flows The findings reveal that the queue achieves stability quickly under RRO, unlike other methods which fail to support high loads.
V-C Results for Large-Scale Network deployments
Finally, we conducted simulations on random large-scale network deployments. First, we present the results on a random medium-size network configuration (presented in Fig. 3(a)) which comprises nodes and random edges between nodes. The results are presented in Figs. 3(b)-3(e). Across all scenarios, RRO consistently outperforms all other algorithms, particularly in lightly and moderately loaded networks. In terms of average flow data rates (Fig. 3(b)), RRO surpasses the second-best performer, IMA, by margins ranging from (where ) to (where ). Furthermore, RRO demonstrates superior performance in reducing flow delays (Fig. 3(c)) and enhancing fairness (Fig. 3(d)), even under extreme network loads. We also evaluated the queue length for each algorithm where (Fig. LABEL:fig:random_topologyY_buffer). The findings indicate that the maximum queue length stabilizes quickly by RRO, whereas other algorithms fail to support high loads.
To further substantiate the robustness of the RRO algorithm in accommodating diverse network configurations characterized by varying numbers of flow demands , nodes , and edges , and to validate its applicability in 5G topologies, which typically comprise dense networks with numerous edges, additional experiments were conducted using randomized settings. The outcomes of these experiments are detailed in Table I. The simulation results demonstrate that our method excels in both heavily-loaded and moderately-loaded dense networks (lines 1 and 2, respectively), where . Continuing in line 3, a scenario is presented where , indicative of a highly connected dense network. Additionally, a scenario with a moderate degree of connectivity under heavy load, where , is examined in line 4. The outcomes, detailed in Table I, confirm that RRO significantly surpasses all other algorithms in terms of average flow rate, packet delay and fairness across all tested network configurations.
These results underscore the notable superiority of RRO over alternative algorithms in all tested scenarios, showcasing its consistent performance advantage. The robustness and efficacy of RRO are evident, presenting a compelling case for its adoption in diverse network environments.

Average Flow Rates [Mbps] | Average Delay [time-steps] | Fairness | ||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
RGA | OSPF | IMA | RRO | RGA | OSPF | IMA | RRO | RGA | OSPF | IMA | RRO | |||
100 | 70 | 130 | 3.839 | 5.310 | 6.679 | 9.389 | 108.608 | 74.396 | 43.543 | 18.395 | -0.728 | -0.557 | -0.164 | 0.565 |
50 | 60 | 120 | 5.425 | 6.530 | 8.468 | 13.582 | 91.985 | 79.509 | 54.112 | 18.664 | -0.655 | -0.644 | -0.265 | 0.730 |
40 | 40 | 100 | 9.757 | 11.287 | 18.972 | 23.790 | 47.586 | 39.699 | 18.073 | 2.75 | -0.345 | -0.340 | 0.455 | 1.260 |
50 | 30 | 50 | 3.027 | 6.113 | 6.799 | 7.689 | 127.832 | 92.847 | 72.830 | 43.028 | -1.011 | -0.783 | -0.556 | 0.092 |
30 | 25 | 40 | 6.413 | 10.929 | 12.432 | 14.804 | 78.615 | 39.376 | 34.426 | 10.826 | -0.604 | -0.263 | -0.017 | 0.825 |
VI Conclusion
Achieving enhanced throughput with low latency for data transmission is crucial for future communication systems. Low-complexity OSPF-type solutions are effective in lightly-loaded networks but struggle with increased congestion. We addressed this challenge by developing a novel approach: Low-complexity Regularized Routing Optimization (RRO) in communication networks. The RRO algorithm offered both distributed and centralized implementations with low complexity, making it suitable for integration into 5G and beyond technologies. It increased throughput while ensuring latency remained sufficiently low through regularized optimization. We analyzed the convergence complexity of RRO and proved that it converged with a complexity order comparable to OSPF. Our simulation results demonstrated significant performance improvements over existing methods.
These findings position RRO as a highly effective solution for routing data in modern communication networks, offering both enhanced efficiency and low complexity.
References
- [1] G. K. Papageorgiou, K. Voulgaris, K. Ntougias, D. K. Ntaikos, M. M. Butt, C. Galiotto, N. Marchetti, V. Frascolla, H. Annouar, A. Gomes et al., “Advanced dynamic spectrum 5g mobile networks employing licensed shared access,” IEEE Communications Magazine, vol. 58, no. 7, pp. 21–27, 2020.
- [2] K. Cohen, “Machine learning for spectrum access and sharing,” Machine Learning for Future Wireless Communications, Wiley-IEEE Press, pp. 1–25, 2020.
- [3] S. Iyer, A. Patil, S. Bhairanatti, S. Halagatti, and R. J. Pandya, “A survey on technological trends to enhance spectrum-efficiency in 6g communications,” Transactions of the Indian National Academy of Engineering, vol. 7, no. 4, pp. 1093–1120, 2022.
- [4] S. V. Patil, S. Kore, and T. K. Harhare, “A case study: Spectrum efficient multiple access techniques for 5g networks,” in 2023 International Conference for Advancement in Technology (ICONAT). IEEE, 2023, pp. 1–4.
- [5] S. W. H. Shah, A. N. Mian, and J. Crowcroft, “Statistical QoS guarantees for licensed-unlicensed spectrum interoperable D2D communication,” IEEE Access, vol. 8, pp. 27 277–27 290, 2020.
- [6] A. Kaur and K. Kumar, “A comprehensive survey on machine learning approaches for dynamic spectrum access in cognitive radio networks,” Journal of Experimental & Theoretical Artificial Intelligence, vol. 34, no. 1, pp. 1–40, 2022.
- [7] T. Gafni and K. Cohen, “Distributed learning over markovian fading channels for stable spectrum access,” IEEE Access, vol. 10, pp. 46 652–46 669, 2022.
- [8] A. Bavistale, A. Dhokne, A. Kukade, A. Kumbhare, A. Talokar, and P. Jaronde, “Energy and spectrum efficient cognitive radio sensor networks,” in 2023 11th International Conference on Emerging Trends in Engineering & Technology-Signal and Information Processing (ICETET-SIP). IEEE, 2023, pp. 1–4.
- [9] T. Sery and K. Cohen, “On analog gradient descent learning over multiple access fading channels,” IEEE Transactions on Signal Processing, vol. 68, pp. 2897–2911, 2020.
- [10] R. Paul, Y. Friedman, and K. Cohen, “Accelerated gradient descent learning over multiple access fading channels,” IEEE Journal on Selected Areas in Communications, vol. 40, no. 2, pp. 532–547, 2021.
- [11] T. Gafni, N. Shlezinger, K. Cohen, Y. C. Eldar, and H. V. Poor, “Federated learning: A signal processing perspective,” IEEE Signal Processing Magazine, vol. 39, no. 3, pp. 14–41, 2022.
- [12] S. Salgia, T. Gabay, Q. Zhao, and K. Cohen, “A communication-efficient adaptive algorithm for federated learning under cumulative regret,” IEEE Transactions on Signal Processing, 2024.
- [13] M. Heal and M. Fayed, “Ad hoc networks routing: Shortest path is enough,” in Proceedings of the International Conference on Wireless Networks (ICWN), 2012, pp. 451–457.
- [14] R. Srikant and L. Ying, Communication networks: an optimization, control, and stochastic networks perspective. Cambridge University Press, 2013.
- [15] H. Gong, L. Fu, X. Fu, L. Zhao, K. Wang, and X. Wang, “Distributed multicast tree construction in wireless sensor networks,” IEEE Transactions on Information Theory, vol. 63, no. 1, pp. 280–296, 2016.
- [16] E. W. Dijkstra, “A note on two problems in connexion with graphs,” Numerische mathematik, vol. 1, no. 1, pp. 269–271, 1959.
- [17] P. Tehrani and Q. Zhao, “Distributed online learning of the shortest path under unknown random edge weights,” in IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2013, pp. 3138–3142.
- [18] K. Liu and Q. Zhao, “Adaptive shortest-path routing under unknown and stochastically varying link states,” in 10th International Symposium on Modeling and Optimization in Mobile, Ad Hoc and Wireless Networks (WiOpt), 2012, pp. 232–237.
- [19] O. Amar, I. Sarfati, and K. Cohen, “An online learning approach to shortest path and backpressure routing in wireless networks,” IEEE Access, 2023.
- [20] Z. Jiao, B. Zhang, C. Li, and H. T. Mouftah, “Backpressure-based routing and scheduling protocols for wireless multihop networks: A survey,” IEEE Wireless Communications, vol. 23, no. 1, pp. 102–110, 2016.
- [21] L. Tassiulas and A. Ephremides, “Stability properties of constrained queueing systems and scheduling policies for maximum throughput in multihop radio networks,” IEEE Transactions on Automatic Control, vol. 37, no. 12, pp. 1936–1948, 1992.
- [22] A. L. Stolyar, “Maximizing queueing network utility subject to stability: Greedy primal-dual algorithm,” Queueing Systems, vol. 50, no. 4, pp. 401–457, 2005.
- [23] A. Eryilmaz and R. Srikant, “Joint congestion control, routing, and MAC for stability and fairness in wireless networks,” IEEE Journal on Selected Areas in Communications, vol. 24, no. 8, pp. 1514–1524, 2006.
- [24] L. Bui, R. Srikant, and A. Stolyar, “Novel architectures and algorithms for delay reduction in back-pressure scheduling and routing,” in IEEE International Conference on Computer Communications (INFOCOM), 2009, pp. 2936–2940.
- [25] J. Ryu, L. Ying, and S. Shakkottai, “Back-pressure routing for intermittently connected networks,” in 2010 Proceedings IEEE INFOCOM. IEEE, 2010, pp. 1–5.
- [26] E. Athanasopoulou, L. X. Bui, T. Ji, R. Srikant, and A. Stolyar, “Back-pressure-based packet-by-packet adaptive routing in communication networks,” IEEE/ACM Transactions on Networking, vol. 21, no. 1, pp. 244–257, 2013.
- [27] A. Sinha and E. Modiano, “Optimal control for generalized network-flow problems,” IEEE/ACM Transactions on Networking, vol. 26, no. 1, pp. 506–519, 2017.
- [28] C. Joo, “On the performance of back-pressure scheduling schemes with logarithmic weight,” IEEE transactions on wireless communications, vol. 10, no. 11, pp. 3632–3637, 2011.
- [29] L. Ying, S. Shakkottai, A. Reddy, and S. Liu, “On combining shortest-path and back-pressure routing over multihop wireless networks,” IEEE/ACM Transactions on Networking, vol. 19, no. 3, pp. 841–854, 2010.
- [30] S. Wang, H. Liu, P. H. Gomes, and B. Krishnamachari, “Deep reinforcement learning for dynamic multichannel access in wireless networks,” IEEE Transactions on Cognitive Communications and Networking, vol. 4, no. 2, pp. 257–265, 2018.
- [31] Y. Yu, T. Wang, and S. C. Liew, “Deep-reinforcement learning multiple access for heterogeneous wireless networks,” IEEE Journal on Selected Areas in Communications, vol. 37, no. 6, pp. 1277–1290, 2019.
- [32] O. Naparstek and K. Cohen, “Deep multi-user reinforcement learning for distributed dynamic spectrum access,” IEEE Transactions on Wireless Communications, vol. 18, no. 1, pp. 310–323, 2019.
- [33] Y. Bokobza, R. Dabora, and K. Cohen, “Deep reinforcement learning for simultaneous sensing and channel access in cognitive networks,” IEEE Transactions on Wireless Communications, 2023.
- [34] Y. Cohen, T. Gafni, R. Greenberg, and K. Cohen, “Sinr-aware deep reinforcement learning for distributed dynamic channel allocation in cognitive interference networks,” arXiv preprint arXiv:2402.17773, 2024.
- [35] I. Budhiraja, N. Kumar, and S. Tyagi, “Deep-reinforcement-learning-based proportional fair scheduling control scheme for underlay d2d communication,” IEEE Internet of Things Journal, vol. 8, no. 5, pp. 3143–3156, 2020.
- [36] B. Gahtan, R. Cohen, A. Bronstein, and G. Kedar, “Deep reinforcement learning for scheduling and power allocation in a 5g urban mesh,” ArXiv, vol. abs/2210.01423, 2022.
- [37] B. Gahtan, R. Cohen, A. M. Bronstein, and G. Kedar, “Using deep reinforcement learning for mmwave real-time scheduling,” in 2023 14th International Conference on Network of the Future (NoF). IEEE, 2023, pp. 71–79.
- [38] H. Szostak and K. Cohen, “Decentralized anomaly detection via deep multi-agent reinforcement learning,” in 2022 58th Annual Allerton Conference on Communication, Control, and Computing (Allerton). IEEE, 2022, pp. 1–4.
- [39] ——, “Deep multi-agent reinforcement learning for decentralized active hypothesis testing,” arXiv preprint arXiv:2309.08477, 2023.
- [40] J. Reis, M. Rocha, T. K. Phan, D. Griffin, F. Le, and M. Rio, “Deep neural networks for network routing,” in 2019 International Joint Conference on Neural Networks (IJCNN), 2019, pp. 1–8.
- [41] J. Reis, T. K. Phan, M. Kheirkhah, F. Yang, D. Griffin, M. Rocha, and M. Rio, “R2l: routing with reinforcement learning,” in 2021 International Joint Conference on Neural Networks (IJCNN), 2021, pp. 1–7.
- [42] P. Almasan, J. Suárez-Varela, A. Badia-Sampera, K. Rusek, P. Barlet-Ros, and A. Cabellos-Aparicio, “Deep reinforcement learning meets graph neural networks: An optical network routing use case,” CoRR, vol. abs/1910.07421, 2019.
- [43] R. Paul, K. Cohen, and G. Kedar, “Multi-flow transmission in wireless interference networks: A convergent graph learning approach,” IEEE Transactions on Wireless Communications, 2023.
- [44] O. S. Eyobu and K. Edwinah, “A deep learning-based routing approach for wireless mesh backbone networks,” IEEE Access, 2023.
- [45] S. Yamin and H. H. Permuter, “Multi-agent reinforcement learning for network routing in integrated access backhaul networks,” Ad Hoc Networks, vol. 153, p. 103347, 2024.
- [46] K. Cohen and A. Leshem, “Distributed game-theoretic optimization and management of multichannel aloha networks,” IEEE/ACM Transactions on Networking, vol. 24, no. 3, pp. 1718–1731, 2015.
- [47] M. Fredman and R. Tarjan, “Fibonacci heaps and their uses in improved network optimization algorithms,” in 25th Annual Symposium onFoundations of Computer Science, 1984., 1984, pp. 338–346.
- [48] B. Becker, C. Oberli, T. Meuser, and R. Steinmetz, “On maximizing the probability of achieving deadlines in communication networks,” Journal of Sensor and Actuator Networks, vol. 13, no. 1, p. 9, 2024.
- [49] S. Thelen, K. Volbert, and D. W. Nunes, “A survey on algorithmic problems in wireless systems.” in SENSORNETS, 2023, pp. 101–111.
- [50] X. Hei, J. Zhang, B. Bensaou, and C.-C. Cheung, “Wavelength converter placement in least-load-routing-based optical networks using genetic algorithms,” Journal of Optical Networking, vol. 3, pp. 363–378, 05 2004.
- [51] F. Barreto, E. Wille, and L. Nacamura, “Fast emergency paths schema to overcome transient link failures in ospf routing,” International Journal of Computer Networks & Communications, vol. 4, 04 2012.