This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

A Computationally Efficient Bi-level Coordination Framework for CAVs at Unsignalized Intersections

Jiping Luo, Tingting Zhang,  and Qinyu Zhang, 
Abstract

In this paper, we investigate cooperative vehicle coordination for connected and automated vehicles (CAVs) at unsignalized intersections. To support high traffic throughput while reducing computational complexity, we present a novel collision region model and decompose the optimal coordination problem into two sub-problems: centralized priority scheduling and distributed trajectory planning. Then, we propose a bi-level coordination framework which includes: (i) a Monte Carlo Tree Search (MCTS)-based high-level priority scheduler aims to find high-quality passing orders to maximize traffic throughput, and (ii) a priority-based low-level trajectory planner that generates optimal collision-free control inputs. Simulation results demonstrate that our bi-level strategy achieves near-optimal coordination performance, comparable to state-of-the-art centralized strategies, and significantly outperform the traffic signal control systems in terms of traffic throughput. Moreover, our approach exhibits good scalability, with computational complexity scaling linearly with the number of vehicles. Video demonstrations can be found online at https://youtu.be/WYAKFMNnQfs.

Index Terms:
connected and automated vehicles (CAVs), intersection management, cooperative vehicle control.

I INTRODUCTION

Benefiting from advances in autonomous driving and vehicular communications[1, 2, 3], connected and automated vehicles (CAVs) have emerged as a key enabler of intelligent transportation systems (ITS)[4, 5, 6, 7]. By allowing vehicles to share information and make decisions cooperatively via vehicle-to-everything (V2X) communication links, cooperative vehicle coordination will be a promising alternative to traditional traffic signal control systems, and is envisioned to make everyday traveling safer, greener, and more efficient[8, 9, 10].

Over the last two decades, centralized coordination strategies have been widely studied to enable optimal maneuvers for CAVs at unsignalized intersections. In this paradigm, a coordination center, usually fulfilled at the road side unit (RSU), collects the driving status (e.g., velocities, accelerations, positions, routes, etc) of all involved vehicles and plans safe controls for them[11, 12]. Generally, the central goal is to maximize traffic throughput subjecting to dynamics/kinematics, actuator limits, and collision avoidance constraints (including rear-end and lateral safety). Since all vehicles are considered as one system, centralized strategies could generate globally optimal trajectories that are collision-free and deadlock-free. However, centralized strategies confront challenges due to the contradiction between stringent real-time actuation requirements and the high computational complexity required for optimizations [13, 14].

At intersections, traffic throughput and safety are closely intertwined and need to be jointly considered. One classical formulation for safety is the collision set (CS) model[15, 16, 17], as shown in Fig. 1(a). In this model, the whole intersection area was defined as a collision set (i.e., blue area) where any two vehicles from conflicting routes cannot exist in the area simultaneously. By indicating vehicle priorities using auxiliary binary variables [16, 17, 18], the optimal coordination problem can be formulated as a Mixed-integer Quadratic Programming (MIQP) problems. Although optimal passing orders and vehicle trajectories can be obtained simultaneously, such centralized CS based approaches are still far from satisfactory due to the following two limitations. Firstly, excessive safety redundancy is reserved for only one vehicle at a time, thus reducing the utilization of intersection spatial resources and traffic efficiency. Secondly, MIQPs are computationally prohibitive or even intractable since the time complexity increases exponentially with the number of vehicles. To approach these gaps, several collision region models and low-complexity coordination algorithms have been proposed in recent years.

Refer to caption
(a) collision set
Refer to caption
(b) cross-collision points
Refer to caption
(c) rectangular sub-zones
Refer to caption
(d) space-time resource searching
Figure 1: Description of collision region models for a single-lane intersection.

To address the underutilization issue of the intersection spatial resources, several collision region models have been proposed. The authors in [19, 18, 20] defined multiple cross-collision points (CCPs) inside the intersection area. The circle area centering the CCP show an approximate area where any two conflicting vehicles are not allowed to enter at the same time, as shown in Fig. 1(b). Similarly, the intersection area can also be divided into multiple rectangular sub-zones according to the combination of each pair of orthogonal lanes[21, 14], as shown in Fig. 1(c). However, all the aforementioned collision region models regard vehicles as mass points and provide excessive safety redundancy compared to the vehicle geometry. More recently, a three-dimensional collision region model called space-time resource searching (STRS) was proposed in [22]. This model stretches the intersection capacity to its limit by allocating non-overlapping space-time resources to vehicles in the XYT domains, as shown in Fig. 1(d). Although these models demonstrate remarkable coordination efficiency advantages compared to the CS model, the corresponding centralized coordination strategies often suffer from overly long problem-solving times due to the need to avoid collisions at each sub-region.

To address the key computational challenges in real-world implementations, in our previous contributions[23], we proposed a deep reinforcement learning (DRL)-based strategy to generate cooperative trajectories for a batch of vehicles in real-time. Although near-optimal traffic efficiency can be achieved in sub-static coordination scenarios, the introduction of vehicle grouping and virtual vehicle padding methods may hinder this strategy for high-traffic loads. A number of research efforts have developed distributed111We follow the definition from [24], where ”distributed control” refers to vehicles exchanging local states and decisions to collaboratively accomplish a task, whereas in ”decentralized control”, vehicles independently determine their actions without exchanging information. strategies to reduce computation time. In these methods, vehicles are firstly prioritized according to some basic rules, such as first-in-first-out (FIFO)[25], distance to intersection[26], time to react[27, 28], etc. Then each vehicle solves its part of optimization problem sequentially to avoid collisions with high-priority neighbors. In this way, the optimal coordination problem is decomposed into a sequence of low-complexity trajectory planning sub-problems and the exponential complexity can be avoided. However, due to the lack of global information, such rule-based priority assignment methods may result in inefficient coordination performances.

To achieve a good trade-off between complexity and optimality, a number of research efforts have recently developed bi-level coordination strategies. The main idea behind these strategies is to use a centralized layer to assign collision-free time slots to vehicles, along with a decentralized layer that follows the predetermined vehicle motion patterns [29, 30, 31]. For instance, the authors in [29] derived fuel-efficient piece-wise analytic speed profiles, based on which formed an efficient centralized problem to resolve collisions and improve throughput. Another method is the virtual platoon [30, 31], which simplifies trajectory planning by projecting vehicles from different lanes onto a single virtual lane and constructing virtual platoons on this lane to maintain typical platooning behavior. Similar methods are employed in [32, 33], which leverage vehicle platooning schemes for dealing with mixed traffic intersection coordination. However, these strategies require predetermined rules about vehicle motion patterns, thereby limiting degrees of freedom available for vehicle maneuvering and reducing coordination efficiency.

In this paper, we aim to develop a high degree of freedom bi-level coordination framework that combines the strengths of centralized and distributed strategies. Our aim is to improve traffic throughput while reducing computational complexity. The main contributions are summarized as follows.

  • We propose a novel bi-level coordination framework that decomposes the optimal coordination problem into two sub-problems: centralized priority scheduling and distributed trajectory planning. The centralized coordinator only assigns priorities to vehicles, whereas each vehicle thereafter plan and share its own trajectory in sequence to avoids collisions with high-priority vehicles.

  • To fully utilize the spatial resources and improve coordination efficiency, we introduce a novel collision region model that leverages collision-checking algorithms to pre-compute collision boundaries for any two conflicting paths. Additionally, we propose a low-complexity distributed trajectory planner that enables vehicles to cross intersections safely and efficiently.

  • To tackle the challenging combinatorial priority scheduling problem, we propose a Monte Carlo tree search (MCTS)-based high-level planner that efficiently identifies near-optimal priority sequences within a limited computational budget.

  • Our proposed bi-level strategy has been evaluated through simulations, demonstrating comparable coordination performance to state-of-the-art centralized strategies while significantly reducing computational complexity.

The remained of this paper is organized as follows. The system model and the bi-level coordination framework are presented in Section II. The low-level distributed trajectory planner and the high-level priority scheduler are presented in Section III and Section IV, respectively. Section V presents our simulation results to confirm the capability and scalability of the proposed strategy. Finally, we conclude this paper and discuss future directions in Section VI.

II System Model

II-A Intersection Model

In this paper, we consider a typical single-lane intersection scenario which consists of 4 roads, i.e., down road, right road, up road, and left road, as shown in Fig. 2. On each road, there are three possible paths that vehicles can follow: go straight, turn left and turn right, and there is a total of 12 different paths within the intersection area. The central area where multiple paths cross and merge is called the conflict area (CA). A centralized coordinator is deployed at the RSU to store road structure information (e.g., intersection’s geometric parameters, pre-defined paths, etc), collect vehicles’ status information (e.g., positions, velocities, accelerations, and routes, etc), and give instructions to vehicles to cross the intersection safely and efficiently. CDL,RD\mathrm{C}_{\mathrm{DL,RD}} represents the collision region of two conflicting paths: ego vehicle’s path DL\mathrm{D}\rightarrow\mathrm{L} and high-priority vehicle’s RD\mathrm{R}\rightarrow\mathrm{D}. The calculation of all collision regions will be discussed in the next subsection.

Refer to caption
Figure 2: A typical signal-free intersection scenario.

Without loss of generality, we assume that vehicles must follow one of the 12 pre-defined paths according to their driving directions and only the longitudinal movements of vehicles are controlled. Hence, the dynamics of each vehicle i𝒩i\in\mathcal{N} can be modeled as a double integrator, i.e.,

s˙i(t)=vi(t)\displaystyle\dot{s}_{i}(t)=v_{i}(t) (1)
v˙i(t)=ui(t)\displaystyle\dot{v}_{i}(t)=u_{i}(t) (2)

where si(t)s_{i}(t) denotes the position along its path, vi(t)v_{i}(t) and ui(t)u_{i}(t) are the longitudinal speed and acceleration. Considering the practical constraints in the vehicle dynamics, we do have:

[si(0),vi(0\displaystyle[s_{i}(0),v_{i}(0 ),ui(0)]=[si0,vi0,ui0]\displaystyle),u_{i}(0)]=[s_{i}^{0},v_{i}^{0},u_{i}^{0}] (3)
0\displaystyle 0 vi(t)vmax\displaystyle\leq v_{i}(t)\leq v_{\mathrm{max}} (4)
umin\displaystyle u_{\mathrm{min}} ui(t)umax\displaystyle\leq u_{i}(t)\leq u_{\mathrm{max}} (5)

here [si0,vi0,ui0][s_{i}^{0},v_{i}^{0},u_{i}^{0}] is the initial state of vehicle ii, which should be collected by the coordination center at the beginning.

II-B Collision Region Model

Refer to caption
Figure 3: An illustration of the proposed collision region model.

Collision region modeling is essential for coordination safety and efficiency. To fully utilize the spatial resources of the intersection area, we propose a novel collision region model by leveraging collision-checking algorithms. Fig. 3 demonstrates the calculation of collision region between two conflicting routes DL\mathrm{D}\rightarrow\mathrm{L} and RD\mathrm{R}\rightarrow\mathrm{D}. It is worth noting that CDL,RD\mathrm{C}_{\mathrm{DL,RD}} and CRD,DL\mathrm{C}_{\mathrm{RD,DL}} are two different collision regions since the ego vehicle is on different paths. Take CDL,RD\mathrm{C}_{\mathrm{DL,RD}} as an example, we can locate the ego vehicle at a given point segos_{\mathrm{ego}} on path DL, and let conflicting vehicle travel at a constant speed along path RD. We then use collision-checking algorithms such as Separating Axis Theorem (SAT)[34] to check if a collision occurs. By iteratively locating the ego vehicle from one side to another along path DL, we can get the exact collision boundary sDL,RDins_{\mathrm{DL,RD}}^{\mathrm{in}} that the ego vehicle cannot enter before the conflicting vehicle passes through. In this case, the ego vehicle can enter the collision region CDL,RD=[sDL,RDin,sDL,RDout]\mathrm{C}_{\mathrm{DL,RD}}=[s_{\mathrm{DL,RD}}^{\mathrm{in}},s_{\mathrm{DL,RD}}^{\mathrm{out}}] if and only if the high-priority vehicle pass its collision region CRD,DL=[sRD,DLin,sRD,DLout]\mathrm{C}_{\mathrm{RD,DL}}=[s_{\mathrm{RD,DL}}^{\mathrm{in}},s_{\mathrm{RD,DL}}^{\mathrm{out}}]. This calculation process is time-consuming but can be pre-computed since road structures and paths are fixed. The collision table 𝒯collision\mathcal{T}_{\mathrm{collision}} consisting of all collision regions can be obtained from Algorithm 1. Herein, each pair of paths intersect with each other at most once and we do not consider the self-collision of two identical paths.

Input: pre-defined paths 𝒫route\mathcal{P}_{\mathrm{route}}
Output: collision table 𝒯collision\mathcal{T}_{\mathrm{collision}}
1 for pego𝒫routep_{\mathrm{ego}}\in\mathcal{P}_{\mathrm{route}} do
2       for pconflict𝒫route,pegopconflictp_{\mathrm{conflict}}\in\mathcal{P}_{\mathrm{route}},\,p_{\mathrm{ego}}\neq p_{\mathrm{conflict}} do
3             asegoinit,bsegoenda\leftarrow s_{\mathrm{ego}}^{\mathrm{init}},b\leftarrow s_{\mathrm{ego}}^{\mathrm{end}} ;
4             while |ab|eps|a-b|\geq eps do
5                   Locate ego vehicle at c(a+b)/2c\leftarrow(a+b)/2 ;
6                   Let conflicting vehicle travel along pconflictp_{\mathrm{conflict}};
7                   Check if collision occurs using SAT;
8                   if collide then bcb\leftarrow c;
9                   else aca\leftarrow c;
10                  
11            segoin=as_{\mathrm{ego}}^{\mathrm{in}}=a;
12             Set asegoend,bsegoina\leftarrow s_{\mathrm{ego}}^{\mathrm{end}},b\leftarrow s_{\mathrm{ego}}^{\mathrm{in}}, repeat line 4-9;
13             segoout=as_{\mathrm{ego}}^{\mathrm{out}}=a;
14             𝒯collision[pego][pconflict]=[segoin,segoout]\mathcal{T}_{\mathrm{collision}}[p_{\mathrm{ego}}][p_{\mathrm{conflict}}]=[s_{\mathrm{ego}}^{\mathrm{in}},s_{\mathrm{ego}}^{\mathrm{out}}];
15            
16      
Algorithm 1 Offline Collision Table Generation.

II-C Centralized Coordination Strategies

In this section, we first introduce a classical centralized coordination strategy based on the CS model[16], as shown in Fig. 1(a). Considering that any two conflicting vehicles must cross the collision set one after another, the lateral collision-free constraints can be written as:

[ti(siin),ti(siout)][tj(sjin),tj(sjout)]=,j𝒩coni\displaystyle[t_{i}(s_{i}^{\mathrm{in}}),t_{i}(s_{i}^{\mathrm{out}})]\cap[t_{j}(s_{j}^{\mathrm{in}}),t_{j}(s_{j}^{\mathrm{out}})]=\emptyset,\forall j\in\mathcal{N}_{\mathrm{con}}^{i} (6)

where 𝒩coni\mathcal{N}_{\mathrm{con}}^{i} denotes the set of all conflicting vehicles of vehicle ii, sins^{\mathrm{in}} and souts^{\mathrm{out}} are the entry and exit point of the collision set, respectively. For a given initial configuration, the exact time that a vehicle occupies the collision set is determined by the passing order and trajectory planning results. However, the optimal ordering for a given initial configuration can only be found by evaluating all feasible passing order alternatives in a structured manner. Consequently, constraint (6) is essentially NP-hard, and it is necessary to seek low-complexity relaxations. One common approach is introducing auxiliary binary variables, i.e., δi(t),γi(t){0,1}\delta_{i}(t),\gamma_{i}(t)\in\{0,1\}, to indicate the priority of each vehicle. For instance, δi(t)=1,γi(t)=1\delta_{i}(t)=1,\gamma_{i}(t)=1 indicates that vehicle ii is inside the CA, while δi(t)=0,γi(t)=1\delta_{i}(t)=0,\gamma_{i}(t)=1 or δi(t)=1,γi(t)=0\delta_{i}(t)=1,\gamma_{i}(t)=0 means that vehicle ii has not yet reached the CA or has already passed the CA. δi(t)=0,γi(t)=0\delta_{i}(t)=0,\gamma_{i}(t)=0 is invalid state. Thus, we can replace constraint (6) using the following linear inequalities:

sioutγi(t)Msi(t)siin+δi(t)M\displaystyle s_{i}^{\mathrm{out}}-\gamma_{i}(t)M\leq s_{i}(t)\leq s_{i}^{\mathrm{in}}+\delta_{i}(t)M (7)
δi(t)+γi(t)+δj(t)+γj(t)3\displaystyle\delta_{i}(t)+\gamma_{i}(t)+\delta_{j}(t)+\gamma_{j}(t)\leq 3 (8)

where MM is a sufficiently large number.

To guarantee no rear-end collision occurs between vehicles on the same lane, we impose the following rear-end safety constraint:

sj(t)si(t)Lsafe\displaystyle s_{j}(t)-s_{i}(t)\geq L_{\mathrm{safe}} (9)

where j𝒩preij\in\mathcal{N}_{\mathrm{pre}}^{i} is the preceding vehicles in front of vehicle ii, LsafeL_{\mathrm{safe}} is the longitudinal safety gap.

Thus, the centralized-CS problem with the aim of maximizing traffic throughput can be formulated as:

𝒫centralizedCS:\displaystyle\mathscr{P}_{\mathrm{centralized-CS}}:\,\, minui(t),δi(t),γi(t)i=1Nt=0T1(vi(t)vmax)2\displaystyle\min_{u_{i}(t),\delta_{i}(t),\gamma_{i}(t)}\,\,\sum_{i=1}^{N}\sum_{t=0}^{T-1}(v_{i}(t)-v_{\mathrm{max}})^{2}
s.t.\displaystyle\mathrm{s.t.}\,\,\, (1)(5),i𝒩\displaystyle\,\,\eqref{eq:kinematic_1}-\eqref{eq:kinematic_4},\,\,\,\forall i\in\mathcal{N}
(7)(8),i𝒩,j𝒩coni\displaystyle\,\,\eqref{eq:cslateral_1}-\eqref{eq:cslateral_2},\,\,\,\forall i\in\mathcal{N},j\in\mathcal{N}_{\mathrm{con}}^{i}
(9),i𝒩,j𝒩prei\displaystyle\,\,\eqref{eq:csrear_end},\,\,\,\forall i\in\mathcal{N},j\in\mathcal{N}_{\mathrm{pre}}^{i}

Similarly, we can formulated a centralized optimal coordination problem (OCP) based on our collision region model described in Section II-B. The lateral collision avoidance constraints are as follows,

si,coutγi,c(t)Msi(t)si,cin+δi,c(t)M\displaystyle s_{i,c}^{\mathrm{out}}-\gamma_{i,c}(t)M\leq s_{i}(t)\leq s_{i,c}^{\mathrm{in}}+\delta_{i,c}(t)M (10)
δi,c(t)+γi,c(t)+δj,c(t)+γj,c(t)3\displaystyle\delta_{i,c}(t)+\gamma_{i,c}(t)+\delta_{j,c}(t)+\gamma_{j,c}(t)\leq 3 (11)

where c𝒞coni,jc\in\mathcal{C}_{\mathrm{con}}^{i,j} is the corresponding collision region between vehicle ii and vehicle jj, si,cins_{i,c}^{\mathrm{in}} and si,couts_{i,c}^{\mathrm{out}} are the entry and exit point of collision region cc. As a result, the centralized optimal coordination problem (OCP) can be formulated as follows,

𝒫centralizedOCP:\displaystyle\mathscr{P}_{\mathrm{centralized-OCP}}: minui(t),δi,c(t),γi,c(t)i=1Nt=0T1(vi(t)vmax)2\displaystyle\,\,\min_{u_{i}(t),\delta_{i,c}(t),\gamma_{i,c}(t)}\,\,\sum_{i=1}^{N}\sum_{t=0}^{T-1}(v_{i}(t)-v_{\mathrm{max}})^{2}
s.t.\displaystyle\mathrm{s.t.}\,\,\, (1)(5),i𝒩\displaystyle\,\,\eqref{eq:kinematic_1}-\eqref{eq:kinematic_4},\,\,\,\forall i\in\mathcal{N}
(9),i𝒩,j𝒩prei\displaystyle\,\,\eqref{eq:csrear_end},\,\,\,\forall i\in\mathcal{N},j\in\mathcal{N}_{\mathrm{pre}}^{i}
(10)(11),i𝒩,j𝒩coni,c𝒞coni,j\displaystyle\,\,\eqref{eq:ccplateral_1}-\eqref{eq:ccplateral_2},\,\,\,\forall i\in\mathcal{N},j\in\mathcal{N}_{\mathrm{con}}^{i},c\in\mathcal{C}_{\mathrm{con}}^{i,j}

Problems 𝒫centralizedCS\mathscr{P}_{\mathrm{centralized-CS}} and 𝒫centralizedOCP\mathscr{P}_{\mathrm{centralized-OCP}} can be cast into Mixed-integer Quadratic Programming (MIQP) problems, which can return optimal inputs uu and priority sequence γ,δ\gamma,\delta simultaneously. However, MIQPs are generally time-consuming and the problem size may grow exponentially with the number of vehicles.

II-D Priority-based Bi-level Coordination Framework

Although the aforementioned centralized strategies are able to find global optima by jointly optimizing cooperative trajectories and priority sequence, they scale poorly and are somehow not directly applicable in industrial practice. In this work, we propose an efficient bi-level coordination framework, as shown in Fig. 4.

The centralized node (high-level planner) is deployed to collect system-wide state information of all involved vehicles. Based on the initial configuration, the high-level planner only solves the priority scheduling problem and then assigns priority to vehicles. Note that adjacent non-conflicting vehicles may be assigned with the same priority, while conflicting vehicles must be sequentially arranged. The low-level planner of each vehicle thereafter solves its local trajectory planning problem in sequence to avoid collisions with high-priority vehicles. Then, the optimized trajectory is transmitted to low-priority neighbors for their planning problems.

Refer to caption
Figure 4: The proposed bi-level coordination framework.
Refer to caption
Figure 5: S-T graph based low-complexity trajectory planner.
Refer to caption
Figure 6: Calculation of the simplified speed profiles.

The key issue to be addressed is decomposing the optimal coordination problem into the priority scheduling and distributed trajectory planning. The quality (i.e., traffic efficiency) of a given priority sequence can only be evaluated after all vehicles have planned their trajectories, which means priority scheduling is closely intertwined with trajectory planning. Rule-base approaches[25, 26, 27, 28] greedily find a feasible priority sequence without evaluating the quality of the solution which may, however, greatly sacrifice traffic efficiency. Enumeration-based approaches[22] find the globally optimal solution by evaluating all possible alternatives. However, they are not applicable in practice since the priority scheduling itself is a combinatorial optimization problem and iterative trajectory planning is also time-consuming.

In the following two sections, we will proposed a promising scheme to address these challenges. We start our exposition with the low-level trajectory planner in Section III and then we discuss the high-level priority scheduler in Section IV.

III Low-level Distributed Trajectory Planner

In this section, we consider that the solution of the high-level problem is given. We now focus on the low-level distributed trajectory planning problem. Our proposed low-level planner follows the principle that each vehicle needs to give way to high-priority vehicles. Take the coordination scenario described in Fig. 2 as an example. For a given priority sequence (L1,R1,U1,D1,D2,U2)(\mathrm{L}_{1},\mathrm{R}_{1},\mathrm{U}_{1},\mathrm{D}_{1},\mathrm{D}_{2},\mathrm{U}_{2}), vehicle L1\mathrm{L}_{1} with the highest priority can directly cross the CA, while vehicle R1\mathrm{R}_{1} needs to give way to vehicle L1\mathrm{L}_{1} since potential merge collision may occur. Now that we focus on vehicle D1\mathrm{D}_{1}, it may conflict with both vehicle R1\mathrm{R}_{1} and vehicle U1\mathrm{U}_{1}. In this case, R1\mathrm{R}_{1} and U1\mathrm{U}_{1} first plan their own trajectories and then update occupation time of all collision regions along their paths. We use a collision time table 𝒯time\mathcal{T}_{\mathrm{time}} which can be stored in the coordinator or shared among vehicles to update occupation times. For example, vehicle R1\mathrm{R}_{1} predicts that it will occupy collision region CRD,DL=[sRD,DLlow,sRD,DLhigh]\mathrm{C}_{\mathrm{RD,DL}}=[s_{\mathrm{RD,DL}}^{\mathrm{low}},s_{\mathrm{RD,DL}}^{\mathrm{high}}] during time duration [tRD,DLleft,tRD,DLright][t_{\mathrm{RD,DL}}^{\mathrm{left}},t_{\mathrm{RD,DL}}^{\mathrm{right}}], then the collision time table can be updated as 𝒯time[DL][RD]=[tRD,DLleft,tRD,DLright]\mathcal{T}_{\mathrm{time}}[\mathrm{DL}][\mathrm{RD}]=[t_{\mathrm{RD,DL}}^{\mathrm{left}},t_{\mathrm{RD,DL}}^{\mathrm{right}}]. Similarly, vehicle U1\mathrm{U}_{1} will occupy collision region CUD,DL=[sUD,DLlow,sUD,DLhigh]\mathrm{C}_{\mathrm{UD,DL}}=[s_{\mathrm{UD,DL}}^{\mathrm{low}},s_{\mathrm{UD,DL}}^{\mathrm{high}}] during time duration [tUD,DLleft,tUD,DLright][t_{\mathrm{UD,DL}}^{\mathrm{left}},t_{\mathrm{UD,DL}}^{\mathrm{right}}], so 𝒯time[DL][UD]\mathcal{T}_{\mathrm{time}}[\mathrm{DL}][\mathrm{UD}] can be set as [tUD,DLleft,tUD,DLright][t_{\mathrm{UD,DL}}^{\mathrm{left}},t_{\mathrm{UD,DL}}^{\mathrm{right}}].

All the blocked regions along the path can be marked in the S-T graph, as shown in Fig. 6. The red and green areas represent the time-spatial resources occupied by high-priority vehicles R1\mathrm{R}_{1} and U1\mathrm{U}_{1}, respectively. Thus, the local trajectory planning is equivalent to finding a speed profile that does not intersect with these blocked areas. The solid line means vehicle D1\mathrm{D}_{1} gives way to vehicle U1\mathrm{U}_{1} and enters the collision region CDL,UD\mathrm{C}_{\mathrm{DL,UD}} right after vehicle U1\mathrm{U}_{1} exits. It should be noted that though in this case vehicle D1\mathrm{D}_{1} could preempt the collision region CDL,UD\mathrm{C}_{\mathrm{DL,UD}} (the dashed line), this is unacceptable since vehicles must follow the given priority sequence in order to avoid possible deadlocks. This is also a clear indication that priority scheduling is essential for coordination efficiency.

Since ego vehicle must give way to high-priority vehicles, the speed profile of ego vehicle must below the blocked areas to avoid lateral collisions. Denote 𝒞ts\mathcal{C_{\mathrm{ts}}} the set of coordinates of the lower right corner of occupied resource blocks in the S-T graph, i.e.,

𝒞ts={(tright,slow)|\displaystyle\mathcal{C_{\mathrm{ts}}}=\big{\{}(t_{\mathrm{right}},s_{\mathrm{low}})\big{|} (tleft,tright)𝒯times[pego],\displaystyle(t_{\mathrm{left}},t_{\mathrm{right}})\in\mathcal{T}_{\mathrm{times}}[p_{\mathrm{ego}}],
(slow,shigh)𝒯collision[pego]}\displaystyle(s_{\mathrm{low}},s_{\mathrm{high}})\in\mathcal{T}_{\mathrm{collision}}[p_{\mathrm{ego}}]\big{\}} (12)

the lateral safety constraints can be expressed using the following linear inequalities:

s(tright)slow,(tright,slow)𝒞ts\displaystyle s(t_{\mathrm{right}})\leq s_{\mathrm{low}},\forall(t_{\mathrm{right}},s_{\mathrm{low}})\in\mathcal{C_{\mathrm{ts}}} (13)

In general, the local trajectory planning problem of each vehicle i𝒩i\in\mathcal{N} can be summarized as follows,

𝒫distributedi:\displaystyle\mathscr{P}_{\mathrm{distributed}}^{i}:\,\, minui(t)i=1Nt=0T1(vi(t)vmax)2\displaystyle\min_{u_{i}(t)}\,\,\sum_{i=1}^{N}\sum_{t=0}^{T-1}(v_{i}(t)-v_{\mathrm{max}})^{2}
s.t.\displaystyle\mathrm{s.t.} (1)(5),(9),(13)\displaystyle\,\,\,\eqref{eq:kinematic_1}-\eqref{eq:kinematic_4},\,\eqref{eq:csrear_end},\,\eqref{eq:mctslateral}

This problem can be cast as a low-complexity Quadratic Programming (QP) problem with only a few linear collision-free constraints. In this way, the centralized MIQP problem 𝒫centralizedOCP\mathscr{P}_{\mathrm{centralized-OCP}} is decomposed into a sequence of distributed QP problems, i.e., {𝒫distributedi|i𝒩}\{\mathscr{P}_{\mathrm{distributed}}^{i}|i\in\mathcal{N}\}.

To further improve the overall traffic efficiency, in the next section, we present the high-level optimization problem in which we seek the optimal priority sequence (i.e., the order to solve the above sub-problems).

IV High-level Centralized Priority Scheduler

IV-A Simplified Speed Profiles

The high-level planner aims at finding an optimal priority sequence from a systematic point of view. The priority assignment problem can be mathematically written as the following combinatorial optimization problem:

𝒫priority:\displaystyle\mathscr{P}_{\mathrm{priority}}:\,\, min𝐨𝒪tleave(𝐨)\displaystyle\min_{\mathbf{o}\in\mathcal{O}}\,\,t_{\mathrm{leave}}(\mathbf{o})

where tleave(𝐨)t_{\mathrm{leave}}(\mathbf{o}) denotes the total passing time the vehicles follow a given priority sequence 𝐨\mathbf{o}, 𝒪\mathcal{O} is the set of all possible priority sequences. However, the calculation of tleave(o)t_{\mathrm{leave}}(o) is time-consuming. Specifically, for a given priority sequence oo, tleave(o)t_{\mathrm{leave}}(o) can only be obtained after all vehicles have planned their trajectories. This direct evaluation approach takes over-long problem-solving time in real-world implementations. For example, if a vehicle needs τc\tau_{c} seconds to plan its local trajectory, the evaluation of a given priority sequence of NN vehicles consumes NτcN\tau_{c} seconds. To find the optimal priority sequence, we need to traverse all N!N! solutions, which requires about N!NτcN!N\tau_{c} seconds. To address this issue, we present an efficient evaluation method, as shown in Fig. 6.

The speed profiles in Fig. 6 are approximated by straight lines, i.e.,

si(t)={s0i,t0ttaccivmax(ttacci)+s0i,taccittendi\displaystyle s_{i}(t)=\begin{cases}s_{0}^{i},&t_{0}\leq t\leq t_{\mathrm{acc}}^{i}\\ v_{\mathrm{max}}(t-t_{\mathrm{acc}}^{i})+s_{0}^{i},&t_{\mathrm{acc}}^{i}\leq t\leq t_{\mathrm{end}}^{i}\end{cases} (14)

where taccit_{\mathrm{acc}}^{i} and tendit_{\mathrm{end}}^{i} are the starting time and existing time, respectively. The straight line with slop vmaxv_{\mathrm{max}} can be denoted as s=vmaxt+bs=v_{\mathrm{max}}t+b and must below all the blocked regions, such that

bi=min{svmaxt|(t,s)𝒞ts}\displaystyle b_{i}=\min\{s-v_{\mathrm{max}}t|(t,s)\in\mathcal{C}_{\mathrm{ts}}\} (15)

To avoid rear-end collisions with preceding vehicle on the same lane, we have

bi=min{bi,bpreLsafe}\displaystyle b_{i}^{*}=\min\{b_{i},b_{\mathrm{pre}}^{*}-L_{\mathrm{safe}}\} (16)

where bpreb_{\mathrm{pre}}^{*} is the optimal parameter of preceding vehicle, LsafeL_{\mathrm{safe}} is the safety distance. Hence, taccit_{\mathrm{acc}}^{i} and tendit_{\mathrm{end}}^{i} can be calculated as follows,

tacci=si0bivmax,tendi=siendbivmax\displaystyle t_{\mathrm{acc}}^{i}=\frac{s_{i}^{0}-b_{i}^{*}}{v_{\mathrm{max}}},\,\,\,t_{\mathrm{end}}^{i}=\frac{s^{\mathrm{end}}_{i}-b_{i}^{*}}{v_{\mathrm{max}}} (17)

Also, the occupation time of the collision regions along its path can be calculated in the same way. For a given priority sequence 𝐨\mathbf{o}, tleave(𝐨)t_{\mathrm{leave}}(\mathbf{o}) can be calculated as:

tleave(𝐨)=maxi𝒩{tendi|𝐨}\displaystyle t_{\mathrm{leave}}(\mathbf{o})=\max_{i\in\mathcal{N}}\{t_{\mathrm{end}}^{i}\,|\,\mathbf{o}\} (18)

Though the simplified speed profiles do not satisfy kinematic constraints (1) - (2), they can evaluate the quality of a priority sequence simply using simple calculations.

Refer to caption
Figure 7: Construction of the MCTS based solution tree.

IV-B Monte Carlo Tree Search Based Priority Scheduler

Problem 𝒫priority\mathscr{P}_{\mathrm{priority}} is still challenging and it is typically impossible to traverse all priority sequences within the limited computational budget, especially when a large number of vehicles within the coordination area. In the following, we present an MCTS based approach which could efficiently find high-quality sequences within a short planning time. The effectiveness of MCTS has been shown in many sequential decision-making problems such as games, planning, and control[35, 36, 37].

The key idea of MCTS is to iteratively build a search tree by estimating the values of tree nodes through random simulations [35]. The computational budget (e.g., planning time constraint or iteration constraint) is pre-defined, such that MCTS can always be stopped and return a solution, though might not be optimal. Four steps are applied per search iteration: selection, expansion, simulation, and backpropagation, as shown in Fig. 7. Firstly, select the most valuable and expandable node, i.e., vehicle R1\mathrm{R}_{1}, according to a certain tree search policy. Secondly, expand a new node that can be selected in the next round, i.e., U1\mathrm{U}_{1}. Thirdly, simulate by randomly sampling a feasible priority sequence, e.g., (L1,R1,U1,D1,D2,U2)(\mathrm{L}_{1},\mathrm{R}_{1},\mathrm{U}_{1},\mathrm{D}_{1},\mathrm{D}_{2},\mathrm{U}_{2}) shown in Fig. 2. Calculate the cost tleavet_{\mathrm{leave}} according to (15) - (18) and convert it into reward rr using a certain reward function. Then, backpropagate the current reward rr to the selected node U1\mathrm{U}_{1} and its ancestors (root,L1,R1)(\mathrm{root},\mathrm{L}_{1},\mathrm{R}_{1}).

In this paper, we adopt the commonly used tree search policy called Upper Confidence Bounds for Trees (UCT), which can be expressed as:

c=maxc𝒜pRcnc+ϵlnnpnc\displaystyle c^{*}=\max_{c\in\mathcal{A}_{p}}\,\frac{R_{c}}{n_{c}}+\epsilon\sqrt{\frac{\ln n_{p}}{n_{c}}} (19)

where pp is a parent node, 𝒜p\mathcal{A}_{p} is the set of possible children (the first uncoordinated vehicle on each lane), and cc is a child of parent node pp. RcR_{c}, ncn_{c}, npn_{p} represent the cumulative rewards of child cc, the number of visits of child rr, and the number of visits of parent pp, respectively. The first term of (19) is called the exploitation term, which means a child with higher cumulative rewards is more likely to be selected. While the second term is called the exploration term, which encourages agent to select less visited nodes that may result in better results. The constant ϵ>0\epsilon>0 makes a trade-off between exploitation and exploration.

The reward function is essential for the MCTS algorithm. Generally, the first term of (19) should be within [0,1][0,1]. This can be easily guaranteed in problems such as games since the reward is set as {win=1,draw=0.5,loss=0}\{\mathrm{win=1},\mathrm{draw=0.5},\mathrm{loss=0}\}. In our priority scheduling problem, we normalize the reward of each node cc whose parent is node pp to [0,1][0,1] with the following formulation,

rpc=fpcfp,minfp,maxfp,min\displaystyle r_{p}^{c}=\frac{f_{p}^{c}-f_{p,\mathrm{min}}}{f_{p,\mathrm{max}}-f_{p,\mathrm{min}}} (20)

where fpc=tleavecf_{p}^{c}=-t_{\mathrm{leave}}^{c} is the instant reward obtained after each simulation step, fp,minf_{p,\mathrm{min}} and fp,maxf_{p,\mathrm{max}} are the minimal and maximal reward among all children nodes of node pp respectively. Note that fp,maxf_{p,\mathrm{max}} and fp,minf_{p,\mathrm{min}} are dynamically determined as the MCTS algorithm progresses. Specifically, at the current simulation step, if the parent node pp has no child nodes, then we set fp,min=fp,max=fsimf_{p,\mathrm{min}}=f_{p,\mathrm{max}}=f_{\mathrm{sim}}, where fsimf_{\mathrm{sim}} is a temporal reward obtained by randomly sampling a feasible sequence from parent node pp. On the other hand, if pp already has existing children, then fp,minf_{p,\mathrm{min}} and fp,maxf_{p,\mathrm{max}} are set to be the minimal and maximal reward among these children. The proposed MCTS-based priority scheduler is summarized in Algorithm 2.

Input: 𝒩\mathcal{N}, 𝒯collision\mathcal{T}_{\mathrm{collision}}, 𝒯time\mathcal{T}_{\mathrm{time}}
Output: priority sequence 𝐨\mathbf{o}^{*}
1 for Pre-defined computational budget do
2       node \leftarrow root, cc^{*} = \emptyset;
3       while node is not terminal do
4             if node is expandable then
5                   Expand a child cc for node;
6                   cc^{*} \leftarrow cc, break loop;
7            else
8                  Find a best child cc acoording to (19);
9                   node \leftarrow cc, continue;
10                  
11            
12      Run a simulation from cc^{*};
13       Calculate reward according to (15) - (18) and (20);
14       Backpropogate rewards of current branch;
15      
16Set ϵ=0\epsilon=0. Starting at the root node, recursively apply (19) to descend through the tree until finding an optimal priority sequence 𝐨\mathbf{o}^{*};
Algorithm 2 MCTS Based Priority Scheduler.
Refer to caption
Figure 8: A capture of coordination animation under CS strategy (tleave=18.1t_{\mathrm{leave}}=18.1s).
Refer to caption
Figure 9: A capture of coordination animation under the proposed bi-level strategy (tleave=14.9t_{\mathrm{leave}}=14.9s).
Refer to caption
Figure 10: Comparison of coordination efficiency.
Refer to caption
Figure 11: Comparison of computational complexity.

V Simulation Results and Experiments

V-A Simulation Settings

In this section, we conduct simulations to evaluate the efficiency and adaptability of the proposed bi-level coordination strategy. To compare the coordination efficiency in sub-static scenarios, both the centralized-CS and centralized-OCP strategies described in Section II-C are considered as benchmarks. In addition, we implement traditional traffic signal control and a FIFO-based distributed strategy to validate traffic throughput in continuous traffic flows with different arrival rates. In traffic signal control, signal timing is predetermined using historical traffic data, and only non-conflicting vehicles are allowed to enter the CA during each green light phase.

All experiments are conducted with Python 3.8 on a personal computer with Intel i7-10700F 2.9 GHz CPU and 16 GB of RAM. We construct a signalized intersection using the Simulation of Urban MObility (SUMO) platform and use Python to interact with SUMO via the TraCI library. The centralized-CS problem 𝒫centralized\mathscr{P}_{\mathrm{centralized}}, the centralized-OCP problem 𝒫centralizedOCP\mathscr{P}_{\mathrm{centralized-OCP}}, and the distributed trajectory planning problem {𝒫distributedi|i𝒩}\{\mathscr{P}_{\mathrm{distributed}}^{i}|i\in\mathcal{N}\} are solved using the MOSEK solver. We also use Python API to access the MOSEK optimizer. The chosen parameters for the vehicles and the road structure are shown in Table I. Videos of our simulation results can be found at the supplementary materials, or online at https://youtu.be/WYAKFMNnQfs.

TABLE I: Simulation Parameters
Parameters Value
radius of the coordination area 100 m
lane width 10 m
left-turning radius 15 m
right-turning radius 10 m
total simulation time TtotalT_{\mathrm{total}} 40 s
slot duration Δt\Delta t 0.1 s
maximum velocity vmaxv_{\mathrm{max}} 15 m/s
maximum acceleration umaxu_{\mathrm{max}} 5 m/s2\mathrm{m/}\mathrm{s}^{2}
minimum acceleration uminu_{\mathrm{min}} - 5 m/s2\mathrm{m/}\mathrm{s}^{2}
vehicle geometry 2 m ×\times 4 m
safety box 4 m ×\times 8 m
safety distance LsafeL_{\mathrm{safe}} 8 m
big-M parameter 10000
UCT factor ϵ\epsilon 2\sqrt{2}
MCTS iteration constraint 10000

V-B Comparison of Different Coordination Strategies

We first consider the coordination scenario illustrated in Fig. 2 for comparison of the centralized-CS and our bi-level strategies. Fig. 8 shows a capture of coordination animation under the centralized-CS strategy. We observe that vehicle L1\mathrm{L}_{1} crosses the coordination area first, followed by high-priority vehicles D1\mathrm{D}_{1} and D2\mathrm{D}_{2}, causing the remaining vehicles to decelerate or stop before the entry point. Once D1\mathrm{D}_{1} and D2\mathrm{D}_{2} exit the coordination area, vehicles U1\mathrm{U}_{1} and U2\mathrm{U}_{2} enter, with vehicle R1\mathrm{R}_{1} being the last to cross. The resulting priority sequence is (L1,D1,D2,U1,U2,R1)(\mathrm{L}_{1},\mathrm{D}_{1},\mathrm{D}_{2},\mathrm{U}_{1},\mathrm{U}_{2},\mathrm{R}_{1}), and the optimal total travel time (i.e., the time when the last vehicle leaves the coordination area) is tleave=18.1t_{\mathrm{leave}}=18.1s.

Fig. 9 shows an animation of the coordination process under the proposed bi-level strategy. The priority sequence obtained by the high-level planner is (L1,U1,D1,R1,U2,D1)(\mathrm{L}_{1},\mathrm{U}_{1},\mathrm{D}_{1},\mathrm{R}_{1},\mathrm{U}_{2},\mathrm{D}_{1}). The vehicles then plan their trajectories sequentially based on this priority configuration. The collision set can accommodate up to 4 vehicles simultaneously, suggesting that the proposed collision region model can fully utilize time-spatial resources. The resulting total travel time is tleave=14.9t_{\mathrm{leave}}=14.9s, which represents a 17.7% increase in traffic efficiency compared to the centralized-CS strategy. We also evaluate the priority sequence (L1,R1,U1,D1,D2,U2)(\mathrm{L}_{1},\mathrm{R}_{1},\mathrm{U}_{1},\mathrm{D}_{1},\mathrm{D}_{2},\mathrm{U}_{2}) shown in Fig. 2, which results in a total travel time of approximately 16.7s. This highlights the significance of assigning priorities for coordination efficiency.

Fig. 11 demonstrates the coordination efficiency of these three strategies with respect to the number of vehicles. We observe that our strategy significantly outperforms the centralized-CS strategy, as the collision model we propose allows vehicles to share the CA simultaneously. As the number of vehicles increases, the efficiency of the centralized-CS strategy decreases because more vehicles have to yield to conflicting vehicles. The centralized-OCP strategy, which integrates our collision region model, can always achieve optimal coordination performance. Notably, our bi-level strategy achieves near-optimal coordination performance, with a small gap observed between our approach and the centralized-OCP strategy. This gap may be attributed to (i) sub-optimal priority sequences caused by MCTS and (ii) sub-optimal trajectories caused by sequential optimization.

Fig. 11 shows the maximum computation time with respect to the number of vehicles for these strategies. Our results show that when solving a local trajectory planning problem (i.e., 𝒫distributedi,i𝒩\mathscr{P}_{\mathrm{distributed}}^{i},i\in\mathcal{N}) over the total planing horizon Ttotal=40sT_{\mathrm{total}}=40s (i.e., Ttotal/Δt=400T_{\mathrm{total}}/\Delta t=400 slots), the MOSEK solver typically takes about 0-1.5s, and the computation time increases linearly with the number of vehicles. It is worth noting that in real-world deployments, receding horizon control methods (i.e., model predictive control) can be employed in our framework to solve the coordination problem at each time slot over a short planning horizon, thus reducing computation time. Additionally, the MCTS-based high-level planner always returns a priority sequence within 0.1s. In contrast, the centralized strategies suffer from high computational complexity, with computation time increasing exponentially with the number of vehicles. This is because 𝒫centralizedCS\mathscr{P}_{\mathrm{centralized-CS}} and 𝒫centralizedOCP\mathscr{P}_{\mathrm{centralized-OCP}} are MIQP problems, which are computationally prohibitive in large problem sizes. For instance, when dealing with 12 vehicles, the centralized-CS strategy takes approximately 13.2 hours to find a solution, while our approach achieves near-optimal results in only 14.1 seconds.

V-C Evaluation of Throughput in Continuous Traffic Flow

The proposed bi-level coordination strategy is also applied to continuous traffic flow. The vehicles arrival is assumed to be a Poisson process. In Fig. 12, we vary the arrival rates from 100 to 3000 vehicles/h/lane to compare the traffic throughput of the traffic signal control and our strategy under different traffic loads. For each arrival rate, the traffic throughput is averaged over the number of coordinated vehicles of 100 independent trials, and each trial simulates a 1-hour traffic process. To present the upper bound of throughput under a certain strategy, Fig. 12 does not consider left-turning vehicles. It can be seen that, in low traffic loads (100 to 800 veh/h/lane), both traffic signal control and the proposed strategy can provide enough coordination capacity for incoming traffic volumes. However, the traffic signal control soon reaches its capacity (about 800 veh/h/lane) when the arrival rate is 1200 veh/h/lane. In contrast, the proposed strategy reaches its limit when the arrival rate is about 2800 veh/h/lane and can provide a much higher serving rate of up to 1400 veh/h/lane.

Refer to caption
Figure 12: Traffic throughput without left-turning vehicles.

Fig. 13 further examines the influence of different left-turning probabilities on traffic throughput. For each traffic pattern, we simulate a 1-hour traffic process with a arrival rate of 3000 veh/h/lane. It is evident that compared to traffic signal control, our strategy can significantly improve traffic throughput, regardless of the traffic patterns. We observe that an increase in left-turning vehicles results in a decline in throughput. This is because left-turning paths occupy more collision regions and intersect with many paths, leading to more vehicles giving way to left-turning ones. Furthermore, we compare the efficiency of our MCTS-based priority scheduler with the FIFO method. For fairness, the distributed-FIFO strategy uses the same trajectory planner as ours. We observe that our MCTS-based scheduler has an approximately 12% improvement in traffic throughput compared to the FIFO method.

Refer to caption
Figure 13: Traffic throughput with different left-turning probabilities.

VI Conclusion

This work presented a priority-based bi-level coordination framework for intersection control of CAVs. Two main issues to be addressed were traffic throughput and computational complexity. To reduce computation time, we decomposed the optimal coordination problem into the priority scheduling and trajectory planning. An MCTS based priority scheduler was proposed to efficiently find a near-optimal priority sequence. Based on the given priority configuration, a low-complexity distributed trajectory planning problem with the aim of minimizing travel time was also formulated. Simulation results have shown that the proposed strategy achieves comparable coordination performance as the centralized-OCP strategy and exhibits good scalability.

Future research directions involve an extension of the presented framework to accommodate system uncertainties. We plan to explore robust controllers based on model predictive control (MPC), control barrier function (CBF), and covariance steering (CS). Moreover, we will take into account realistic coordination scenarios and incorporate non-cooperative traffic participants. Addressing these challenges will make the priority scheduling and distributed trajectory planning problems more interesting but also more challenging to solve.

References

  • [1] M. H. C. Garcia, A. Molina-Galan, M. Boban, J. Gozalvez, B. Coll-Perales, T. Şahin, and A. Kousaridas, “A tutorial on 5g nr v2x communications,” IEEE Communications Surveys & Tutorials, vol. 23, no. 3, pp. 1972–2026, 2021.
  • [2] P. Papadimitratos, A. D. La Fortelle, K. Evenssen, R. Brignolo, and S. Cosenza, “Vehicular communication systems: Enabling technologies, applications, and future outlook on intelligent transportation,” IEEE Communications Magazine, vol. 47, no. 11, pp. 84–95, 2009.
  • [3] B. Paden, M. Čáp, S. Z. Yong, D. Yershov, and E. Frazzoli, “A survey of motion planning and control techniques for self-driving urban vehicles,” IEEE Transactions on Intelligent Vehicles, vol. 1, no. 1, pp. 33–55, 2016.
  • [4] B. Häfner, V. Bajpai, J. Ott, and G. A. Schmitt, “A survey on cooperative architectures and maneuvers for connected and automated vehicles,” IEEE Communications Surveys & Tutorials, vol. 24, no. 1, pp. 380–403, 2022.
  • [5] A. Gholamhosseinian and J. Seitz, “A comprehensive survey on cooperative intersection management for heterogeneous connected vehicles,” IEEE Access, vol. 10, pp. 7937–7972, 2022.
  • [6] M. Khayatian, M. Mehrabian, E. Andert, R. Dedinsky, S. Choudhary, Y. Lou, and A. Shirvastava, “A survey on intersection management of connected autonomous vehicles,” ACM Transactions on Cyber-Physical Systems, vol. 4, no. 4, pp. 1–27, 2020.
  • [7] Z. Zhong, M. Nejad, and E. E. Lee, “Autonomous and semiautonomous intersection management: A survey,” IEEE Intelligent Transportation Systems Magazine, vol. 13, no. 2, pp. 53–70, 2021.
  • [8] L. Chen and C. Englund, “Cooperative intersection management: A survey,” IEEE Transactions on Intelligent Transportation Systems, vol. 17, no. 2, pp. 570–586, 2016.
  • [9] J. Rios-Torres and A. A. Malikopoulos, “A survey on the coordination of connected and automated vehicles at intersections and merging at highway on-ramps,” IEEE Transactions on Intelligent Transportation Systems, vol. 18, no. 5, pp. 1066–1077, 2017.
  • [10] R. Hult, G. R. Campos, E. Steinmetz, L. Hammarstrand, P. Falcone, and H. Wymeersch, “Coordination of cooperative autonomous vehicles: Toward safer and more efficient road transportation,” IEEE Signal Processing Magazine, vol. 33, no. 6, pp. 74–84, 2016.
  • [11] J. Luo, T. Zhang, R. Hao, D. Li, C. Chen, Z. Na, and Q. Zhang, “Cooperative trajectory planning at unsignalized intersections using deep reinforcement learning,” in 2022 IEEE/CIC International Conference on Communications in China (ICCC Workshops), 2022, pp. 227–232.
  • [12] C. Chen, J. Luo, T. Liang, and T. Zhang, “Re-planning optimization of cooperative vehicle coordination at road intersections,” in 2022 IEEE 95th Vehicular Technology Conference: (VTC2022-Spring), 2022, pp. 1–6.
  • [13] A. Colombo and D. Del Vecchio, “Efficient algorithms for collision avoidance at intersections,” in Proceedings of the 15th ACM international conference on Hybrid Systems: Computation and Control, 2012, pp. 145–154.
  • [14] R. Hult, M. Zanon, S. Gros, H. Wymeersch, and P. Falcone, “Optimisation-based coordination of connected, automated vehicles at intersections,” Vehicle System Dynamics, vol. 58, no. 5, pp. 726–747, 2020.
  • [15] J. Lee and B. Park, “Development and evaluation of a cooperative vehicle intersection control algorithm under the connected vehicles environment,” IEEE transactions on intelligent transportation systems, vol. 13, no. 1, pp. 81–90, 2012.
  • [16] R. Hult, G. R. Campos, P. Falcone, and H. Wymeersch, “An approximate solution to the optimal coordination problem for autonomous vehicles at intersections,” in 2015 American Control Conference (ACC).   IEEE, 2015, pp. 763–768.
  • [17] M. Wang, T. Zhang, L. Gao, and Q. Zhang, “High throughput dynamic vehicle coordination for intersection ground traffic,” in 2018 IEEE 88th Vehicular Technology Conference (VTC-Fall), 2018, pp. 1–6.
  • [18] C. Liu, Y. Zhang, T. Zhang, X. Wu, L. Gao, and Q. Zhang, “High throughput vehicle coordination strategies at road intersections,” IEEE Transactions on Vehicular Technology, vol. 69, no. 12, pp. 14 341–14 354, 2020.
  • [19] M. A. S. Kamal, J.-i. Imura, T. Hayakawa, A. Ohata, and K. Aihara, “A vehicle-intersection coordination scheme for smooth flows of traffic without using traffic lights,” IEEE Transactions on Intelligent Transportation Systems, vol. 16, no. 3, pp. 1136–1147, 2014.
  • [20] Y. Mo, M. Wang, T. Zhang, and Q. Zhang, “Autonomous cooperative vehicle coordination at road intersections,” Journal of Communications and Information Networks, vol. 4, no. 1, pp. 78–87, 2019.
  • [21] P. Dai, K. Liu, Q. Zhuge, E. H.-M. Sha, V. C. S. Lee, and S. H. Son, “Quality-of-experience-oriented autonomous intersection control in vehicular networks,” IEEE Transactions on Intelligent Transportation Systems, vol. 17, no. 7, pp. 1956–1967, 2016.
  • [22] Y. Zhang, R. Hao, T. Zhang, X. Chang, Z. Xie, and Q. Zhang, “A trajectory optimization-based intersection coordination framework for cooperative autonomous vehicles,” IEEE Transactions on Intelligent Transportation Systems, pp. 1–15, 2021.
  • [23] J. Luo, T. Zhang, R. Hao, D. Li, C. Chen, Z. Na, and Q. Zhang, “Real-time cooperative vehicle coordination at unsignalized road intersections,” IEEE Transactions on Intelligent Transportation Systems, vol. 24, no. 5, pp. 5390–5405, 2023.
  • [24] R. R. Negenborn and J. M. Maestre, “Distributed model predictive control: An overview and roadmap of future research opportunities,” IEEE Control Systems Magazine, vol. 34, no. 4, pp. 87–97, 2014.
  • [25] K.-D. Kim, “Collision free autonomous ground traffic: A model predictive control approach,” in 2013 ACM/IEEE International Conference on Cyber-Physical Systems (ICCPS), 2013, pp. 51–60.
  • [26] B. Fankhauser, L. Makarem, and D. Gillet, “Collision-free intersection crossing of mobile robots using decentralized navigation functions on predefined paths,” in 2011 IEEE 5th International Conference on Cybernetics and Intelligent Systems (CIS), 2011, pp. 392–397.
  • [27] G. Rodrigues de Campos, P. Falcone, R. Hult, H. Wymeersch, and J. Sjöberg, “Traffic coordination at road intersections: Autonomous decision-making algorithms using model-based heuristics,” IEEE Intelligent Transportation Systems Magazine, vol. 9, no. 1, pp. 8–21, 2017.
  • [28] M. Kloock, P. Scheffe, S. Marquardt, J. Maczijewski, B. Alrifaee, and S. Kowalewski, “Distributed model predictive intersection control of multiple vehicles,” in 2019 IEEE Intelligent Transportation Systems Conference (ITSC), 2019, pp. 1735–1740.
  • [29] A. A. Malikopoulos, L. Beaver, and I. V. Chremos, “Optimal time trajectory and coordination for connected and automated vehicles,” Automatica, vol. 125, p. 109469, 2021.
  • [30] C. Chen, Q. Xu, M. Cai, J. Wang, B. Xu, X. Wu, J. Wang, K. Li, and C. Qi, “A graph-based conflict-free cooperation method for intelligent electric vehicles at unsignalized intersections,” in 2021 IEEE International Intelligent Transportation Systems Conference (ITSC), 2021, pp. 52–57.
  • [31] X. Cong, B. Yang, F. Gao, C. Chen, and Y. Tang, “Virtual platoon based cavs cooperative driving at unsignalized intersection,” in 2022 IEEE 25th International Conference on Intelligent Transportation Systems (ITSC).   IEEE, 2022, pp. 53–58.
  • [32] M. Faris, P. Falcone, and J. Sjöberg, “Optimization-based coordination of mixed traffic at unsignalized intersections based on platooning strategy,” in 2022 IEEE Intelligent Vehicles Symposium (IV).   IEEE, 2022, pp. 977–983.
  • [33] Z. Deng, K. Yang, W. Shen, and Y. Shi, “Cooperative platoon formation of connected and autonomous vehicles: Toward efficient merging coordination at unsignalized intersections,” IEEE Transactions on Intelligent Transportation Systems, 2023.
  • [34] C. Ericson, Real-time collision detection.   Crc Press, 2004.
  • [35] C. B. Browne, E. Powley, D. Whitehouse, S. M. Lucas, P. I. Cowling, P. Rohlfshagen, S. Tavener, D. Perez, S. Samothrakis, and S. Colton, “A survey of monte carlo tree search methods,” IEEE Transactions on Computational Intelligence and AI in Games, vol. 4, no. 1, pp. 1–43, 2012.
  • [36] D. Silver, A. Huang, C. J. Maddison, A. Guez, L. Sifre, G. Van Den Driessche, J. Schrittwieser, I. Antonoglou, V. Panneershelvam, M. Lanctot et al., “Mastering the game of go with deep neural networks and tree search,” nature, vol. 529, no. 7587, pp. 484–489, 2016.
  • [37] D. Lenz, T. Kessler, and A. Knoll, “Tactical cooperative planning for autonomous highway driving using monte-carlo tree search,” in 2016 IEEE Intelligent Vehicles Symposium (IV), 2016, pp. 447–453.