This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Distributionally Robust Day-ahead Scheduling for Power-traffic Network under a Potential Game Framework

Haoran Deng, Bo Yang, Chao Ning, Cailian Chen and Xinping Guan Corresponding author: Bo Yang.The authors are with the Department of Automation, Shanghai Jiao Tong University, Shanghai 200240, China, the Key Laboratory of System Control and Information Processing, Ministry of Education of China, Shanghai 200240, China, and also with Shanghai Engineering Research Center of Intelligent Control and Management, Shanghai 200240, China.
Abstract

Widespread utilization of electric vehicles (EVs) incurs more uncertainties and impacts on the scheduling of the power-transportation coupled network. This paper investigates optimal power scheduling for a power-transportation coupled network in the day-ahead energy market considering multiple uncertainties related to photovoltaic (PV) generation and the traffic demand of vehicles. The crux of this problem is to model the coupling relation between the two networks in the day-ahead scheduling stage and consider the intra-day spatial uncertainties of the source and load. Meanwhile, the flexible load with a certain adjustment margin is introduced to ensure the balance of supply and demand of power nodes and consume the renewable energy better. Furthermore, we show the interactions between the power system and EV users from a potential game-theoretic perspective, where the uncertainties are characterized by an ambiguity set. In order to ensure the individual optimality of the two networks in a unified framework in day-ahead power scheduling, a two-stage distributionally robust centralized optimization model is established to carry out the equilibrium of power-transportation coupled network. On this basis, a combination of the duality theory and the Benders decomposition is developed to solve the distributionally robust optimization (DRO) model. Simulations demonstrate that the proposed approach can obtain individual optimal and less conservative strategies.

Index Terms:
Power-traffic coupled network, day-ahead power scheduling, potential game, uncertainty, distributionally robust optimization (DRO).

I Introduction

In the past decade, transportation electrification grows rapidly as a result of the increasing electric vehicles (EVs) charging demand with a positive role in alleviating environmental pollution of EVs for metropolis [1]. Public fast charging stations (FCSs) become the main source of the energy for EVs gradually. The spatial-temporal distribution characteristics of EVs flow can lead to the uncertainty of charging load in the power network (PN), which will result in a direct impact on the power scheduling strategy [2]. In addition, with the large-scale application of renewable energy, the stability of the power network is impacted seriously by the uncertainty of generated energy. As a result, the day-ahead energy dispatch of the power-transportation coupled network gains more and more research interest. Under this context, due to the complex coupling characteristics of the two networks, an urgent need for day-ahead energy scheduling while ensuring individual optimality of this coupled network considering multiple uncertainties is required.

There are the characteristics of mutual coupling between the transportation network (TN) and the power distribution network. Specifically, the traveling and charging plans of EVs can be affected by the congestion time and FCS electricity price strategies, and in turn, the EV route choices and traffic flow distribution can redistribute the charging load of PN. In the power-transportation coupled network engineering literature [3, 4, 5], the system modeling considering the interaction of coupled network in an equilibrium modeling framework has been widely researched. In addition, the traffic side is impacted by electricity price on traffic assignment, and further, the power flow is redispatched according to the temporal and spatial fluctuations of the charging load caused by the price. As a result, the two networks achieve a dynamic balance of supply and demand through the electricity price decisions. In this context, researchers proposed some schemes towards a socially optimum operating point taking into account travel time and the variations of electricity prices for battery charging in spatial space, while the traffic congestion is considered simultaneously [6, 7, 8]. In addition, EV drivers in the coupled network are strategic and not just electricity price takers. In this connection, game theory provides an effective method to investigate the coupled relation between traffic assignment and power dispatch. Extensive game-theoretic approaches about the traffic assignment [9], [10] and EV charging problems [11, 12, 13] have been developed. Among the game theoretic approach, there is a special one called potential game, which can reflect incentive of all individuals to change their strategy by using a single global function called the potential function. Thus the individual objectives can be achieved by optimizing the potential function. The authors in [14] and [15] analyzed the interdependency between the EV travelers and power operators by establishing the potential game-theoretic model to achieve the co-optimization of the payoff for the two networks. Nevertheless, the electricity is not generated instantaneously and random demands of FCSs must be met immediatly. Therefore, advance planning is required to ensure the intra-day balance of supply and demand. In the day-ahead power scheduling problem, the spatial-temporal interaction characteristics and the individual selfishness of the coupled network need to be considered from the game perspective. However, the intra-day uncertainties exist in the coupled network, which need to be taken into account in the day-ahead scheduling. Authors in [16] and [17] considered the uncertainties of demands and renewable power in the day-ahead scheduling problem of PN to reduce intra-day scheduling costs effectively.

For the power-transportation coupled network, uncertainty is generally considered in a stochastic optimization (SO) or robust optimization (RO) framework, which is computationally tractable. In [17, 18, 19], the co-operation problem of MGs was formulated as a two-stage SO problem which was reformulated as a mixed-integer linear programing (MILP) framework. Authors in [20] and [21] established a two-stage RO model considering the uncertainties of wind power generation and electric demand for the coupled network, which was solved by a delayed constraint generation algorithm and the column-and-constraint generation (C&CG) method, respectively. In [22], the authors introduced an adaptive three-stage robust model considering the uncertain load and power production with bounded intervals. However, the accurate probability distribution of uncertain parameters in SO is unobtainable in practical applications. Meanwhile, the RO model in the above literature considering the upper/lower boundary values of uncertain variables can result in over-conservative decision-making [23], due to the probability distribution information of uncertain variables is not incorporated in the optimization problem.

In the actual system decision-making problem, it is difficult to obtain the probability distribution of uncertain parameters accurately, and a part of the probability distribution information can be inferred by decision makers from limited data. In this regard, researchers have proposed a distributionally robust optimization (DRO) method to analyze the uncertainties in system optimization problems, and the risk aversion caused by the unknown probability distribution of uncertainty can be characterized more adequately. The construction of ambiguity set plays a crucial role in the DRO problem. In general, ambiguity sets are divided into the moment information [24, 25, 26], statistical distance [27, 28], and data-driven approaches [29, 30]. Among them, the moment-based ambiguity set is commonly applied, since it possesses better solvability and can reflect the characteristics of the uncertainty probability distribution accurately.

Based on the existing literature, the key of modeling is to construct the coupling relation of the two networks considering source-load uncertainties in this study. In the optimization problem modeling, the decision-making selfishness of each individual in the coupled network should be considered to ensure that the individual revenue is optimal. Meanwhile, in the day-ahead scheduling optimization problem, it is necessary to ensure that the obtained strategies can provide appropriate margin for the intra-day selfish behavior of individuals. Therefore, the application of the game theory is essential. In addition, in order to reduce the conservativeness of day-ahead scheduling strategy and improve the economic performance, it is necessary to characterize the uncertainties in the DRO framework. As a result, how to establish the coupling relation of the two networks with game theory in the day-ahead scheduling problem under multiple uncertainties is extremely challenging. Meanwhile, it is difficult to investigate the impact of uncertainties on game-theoretic model and derive the effective method for solving the DRO problem. Table I represents a summary of the literature review and the contributions of this paper. The major contributions of this paper are as follows:

1) We propose a novel day-ahead power scheduling approach for power-transportation coupled network based on game-theory. Specifically, the spatial and temporal distribution models of microgrids (MGs) loads and EV traffic flow in response to the electricity price strategies are established, respectively. A potential function is found to establish the existence of Nash equilibrium and an equivalent formulation of a centralized optimization problem under multiple uncertainties, which is shown that the decision-making is individual optimal.

2) Under the above theoretical framework, the centralized optimization problem can be transformed into a two-stage DRO model. In this way, the master-subproblem framework can be obtained through the duality theory and robust counterpart conversion method, which can be solved by the Benders decomposition approach effectively.

3) The proposed scheduling approach is simulated on the system modified from an IEEE 33-bus system with PVs. Simulations show that the proposed approach has less conservativeness and better economic performance for the day-ahead scheduling profile, which can adapt to the intra-day individual optimal traveling and charging strategies. Meanwhile, the configuration of the power network can improve the flexibility of system energy dispatch, which is closer to the actual application.

The rest of the paper is organized as follows. In Section II, the basic structure and mathematical model of the coupled network are introduced. In Section III, we propose a game-theoretic model with uncertainties for the coupled network and present the equilibrium properties. Section IV and V define the ambiguity set and present a DRO two-stage optimization model, and then the corresponding solution methodology is introduced. Numerical results and conclusions are analyzed in Sections VI and VII, respectively.

TABLE I: Comparison of the proposed model in this paper with recent studies
Ref. Netwok Model Multiple Method
TN PN Potential Game Relation Uncertainties SO RO DRO
[8] ×\times ×\times
[14] ×\times
[15] ×\times
[17, 19] ×\times ×\times
[2, 20] ×\times
[21, 22] ×\times ×\times
[24, 25, 27, 28, 29, 30] ×\times ×\times
[26] ×\times
Our paper

II Problem formulation and model

In this paper, daily 24-hour profiles are employed to represent the dynamic characteristics of active network management. The hour set can be defined as 𝒯\mathcal{T}, t𝒯\forall t\in\mathcal{T}, and 𝒯={1,2,,24}\mathcal{T}=\{1,2,\cdots,24\}.

II-A Transportation System Modeling

Assumptions: Herein, we make some important modeling assumptions for the transportation system. 1) It is a non-atomic measure for each traveler to control a negligible traffic flow and the influence of a single vehicle is infinitesimal. 2) The monetary value of travel time for EV travelers is homogeneous, which is represented by ω\omega. According to Ref. [2], it is widely accepted that the value of ω\omega is constant. 3) The heterogeneous information of vehicles is neglected, such as the state of charging, unit energy consumption, and capacity of the battery. 4) The gasoline vehicles (GVs) and EVs without plenty of electricity to destinations are neglected. 5) We assume that EV travelers can acquire the traffic congestion information for each path and the electricity price of each FCS, and then each traveler chooses the minimum cost route spontaneously to travel and charge. Every EV has to pass through and charge at one FCS before reaching the destination.

A TN can be represented by 𝒢TN=(𝒩,)\mathcal{G}_{TN}=(\mathcal{N},\mathcal{L}), where 𝒩\mathcal{N} and 𝒩×𝒩\mathcal{L}\subseteq\mathcal{N}\times\mathcal{N} denote the set of nodes and links, respectively. A pair (a,b)(a,b)\in\mathcal{L} denotes the link from node a to node b.

1) Traffic demand

In the transportation network, each vehicle has a pair of origin and destination and travels between them, which are named O-D pairs. The set of O-D pairs is denoted by \mathcal{R}, specifically, (o,d)(o,d)\in\mathcal{R}. The traffic demand (also called trip rate) of O-D pairs in time slot t can be described by 𝒒tod\boldsymbol{{q}}_{t}^{od} (1), which defines the number of vehicles in each time slot intending to travel from o to d.

𝒒tod=[q1od,,q24od]T\boldsymbol{q}_{t}^{od}=\begin{bmatrix}q_{1}^{od},&\cdots,&q_{24}^{od}\end{bmatrix}^{\rm T} (1)

2) O-D flow

Each O-D pair is connected by several paths, which consists of certain links [31]. The set of feasible routes p is denoted by p𝒫od,𝒫od={1,,Nod}p\in\mathcal{P}_{od},\mathcal{P}_{od}=\{1,\cdots,N_{od}\}, where NodN_{od} denotes the number of the routes for each O-D pair. Let fp,todf_{p,t}^{od} be the traffic flow of path p, and the traffic demand balance equation is described by (2). Eq. (3) denotes the non-negativity of path flows.

qtod=p=1Nodfp,todq_{t}^{od}=\sum_{p=1}^{N_{od}}f_{p,t}^{od} (2)
fp,tod0f_{p,t}^{od}\geq 0 (3)

3) Link flow

In the transportation system, the set of all links with FCS is represented by C\mathcal{L}_{C}\subseteq\mathcal{L}. The traffic link flow is equal to the total number of vehicles of the paths through it, which is described by (4).

xl,t(fp,tod)=odp𝒫odfp,todδl,podx_{l,t}(f_{p,t}^{od})=\sum_{od\in\mathcal{R}}\sum_{p\in\mathcal{P}_{od}}f_{p,t}^{od}\delta_{l,p}^{od} (4)

where an indicator variable δl,pod\delta_{l,p}^{od} is noted as reflecting the link-path relation. The value of δl,pod\delta_{l,p}^{od} is set to 11, if link lpl\in\mathcal{L}_{p}, where p\mathcal{L}_{p} is a set of links belonging to path pp; otherwise, δl,pod\delta_{l,p}^{od} is set to 0.

In this paper, we assume that FCSs are located on some links. EVs can choose to refill in FCSs or pass by the bypass link. The bus index of distribution systems is marked as jj, and we define the following constraints to ensure that all EVs are charged through the TN.

jxj,t=odpfp,tod\sum_{j}x_{j,t}=\sum_{od}\sum_{p}f_{p,t}^{od} (5)
fp,todjδj,podxj,tf_{p,t}^{od}\leq\sum_{j}\delta_{j,p}^{od}x_{j,t} (6)
xj,todpδj,podfp,todx_{j,t}\leq\sum_{od}\sum_{p}\delta_{j,p}^{od}f_{p,t}^{od} (7)

where xj,tx_{j,t} denotes the number of EVs charging at bus jj, and δj,pod\delta_{j,p}^{od} is noted as reflecting the bus-path relation. Specifically, Eq. (5) denotes the balance relationship of the traffic flows of charging link and path. Constraint (6) sets that the traffic flow must be less or equal to the total traffic flows of all links passing the FCS. Constraint (7) denotes that the number of charging EVs for one link must be less or equal to the total traffic flows passed by the link.

4) Traffic expense

EV travelers choose their driving routes mainly based on the total travel time, which is affected by the congestion level. The travel time can be described by a latency function tl,t(xl,t)t_{l,t}(x_{l,t}) (8), which is also called the Bureau of Public Roads (BPR) function [32] and can reflect the delayed travel time in links accurately,

tl,t(xl,t)=tl0[1+0.15(xl.t/Cl)4](l)t_{l,t}(x_{l,t})=t_{l}^{0}\begin{bmatrix}1+0.15(x_{l.t}/C_{l})^{4}\end{bmatrix}\ (l\in\mathcal{L}) (8)
xl,tClx_{l,t}\leq C_{l} (9)

where tl0t_{l}^{0} is free-speed time and ClC_{l} is the link flow when tl,t=1.15tl0t_{l,t}=1.15t_{l}^{0}, i.e., the link capacity. For constraint (9), if xl,t>Clx_{l,t}>C_{l}, the travel time will be penalized by a quick growth. According to (8), the congestion time of each vehicle can be derived by (10).

tl,tde(xl,t)=tl,ttl0=0.15tl0(xl,t/Cl)4t_{l,t}^{de}\left(x_{l,t}\right)=t_{l,t}-t_{l}^{0}=0.15t_{l}^{0}\left(x_{l,t}/C_{l}\right)^{4} (10)

Furthermore, we assume that the charging demand of each EV in time slot tt is uniform, in other words, the power demand of EVs for each bus is proportional to the link flow. λj,t\lambda_{j,t} is defined as the electricity price of FCS on bus jj, and then the charging energy cost of a single vehicle is λj,tet\lambda_{j,t}e_{t}, where ete_{t} is the charging demand of one vehicle. It is noted that the charging time of EV is neglected in travel cost due to the same charging efficiency. The total expense by a single EV on path p from o to d in time slot t can be calculated as

costt,pod=lωtl,tde(xl,t)δl,pod+jλj,tetδj,pod\operatorname{cost}_{t,p}^{od}=\sum_{l\in\mathcal{L}}\omega t_{l,t}^{de}\left(x_{l,t}\right)\delta_{l,p}^{od}+\sum_{j\in\mathcal{B}}\lambda_{j,t}e_{t}\delta_{j,p}^{od} (11)

where ω\omega is the monetary value of travel time and \mathcal{B} is the set of buses.

5) Wardropian traffic assignment

The balanced distribution of traffic flow is analyzed by the Wardrop user equilibrium (UE) principle, which can achieve a stable situation of traffic network that no vehicle can decrease the total cost by changing its driving route unilaterally. In other words, a stable situation occurs when total travel costs on all active routes are equal. The UE principle, which is also called Wardrop’s first principle, is closer to the real situation than the second principle [33].

Then a TAP considering the charging cost is described as follows, in which each traveler wishes to minimize his travel expense.

TAP:minfp,tod0xl,t(fp,tod)costt,pod(θ)𝑑θ s.t. (2)(11)\begin{split}&{\rm TAP:}\min_{f_{p,t}^{od}}\int_{0}^{x_{l,t}(f_{p,t}^{od})}cost_{t,p}^{od}(\theta)d\theta\\ &\text{ s.t. }\ (2)-(11)\end{split} (12)

According to classic transportation theory, the UE traffic state can be characterized as (13).

{fp,tod(costt,podutod)=0,costt,podutod0\left\{\begin{array}[]{lr}f_{p,t}^{od}(cost_{t,p}^{od}-u_{t}^{od})=0,&\\ cost_{t,p}^{od}-u_{t}^{od}\geq 0\end{array}\right. (13)

where utodu_{t}^{od} is the minimal travel cost for each vehicle between an O-D pair.

Proposition 1: The solution to problem (12) is an UE flow pattern satisfying (13).

Proof: The proof of this equivalency can be acquired by generalizing the method in [34]. In detail, the Lagrangian function of TAP for the equality constraint (2) and inequality constraint (3) can be formulated as

LTN(fp,tod,utod,\displaystyle L_{\rm TN}(f_{p,t}^{od},u_{t}^{od}, vt,pod)=0xl,t(fp,tod)costt,pod(θ)dθ+\displaystyle v_{t,p}^{od})=\int_{0}^{x_{l,t}(f_{p,t}^{od})}cost_{t,p}^{od}(\theta)d\theta+
od[utod(qtodp𝒫odfp,tod)p𝒫odvt,podfp,tod]\displaystyle\sum_{od\in\mathcal{R}}\begin{bmatrix}u_{t}^{od}(q_{t}^{od}-\sum\limits_{p\in\mathcal{P}^{od}}f_{p,t}^{od})-\sum\limits_{p\in\mathcal{P}^{od}}v_{t,p}^{od}f_{p,t}^{od}\end{bmatrix}

where utodu_{t}^{od} and vt,podv_{t,p}^{od} are the vectors of Lagrangian multipliers. The optimal solution of (12) satisfies the Karush-Kuhn-Tucher conditions as follows:

Stationarity: LTNfp,tod=0\frac{\partial L_{\mathrm{TN}}}{\partial f_{p,t}^{od}}=0.

Complementary slackness: vt,podfp,tod=0v_{t,p}^{od}f_{p,t}^{od}=0.

Further considering that

LTNfp,tod\displaystyle\frac{\partial L_{\mathrm{TN}}}{\partial f_{p,t}^{od}} =ωltl,t(xl,t)xl,t(fp,tod)fp,tod+jλj,tetδj,podutodvt,pod\displaystyle=\omega\sum_{l}t_{l,t}\left(x_{l,t}\right)\frac{\partial x_{l,t}\left(f_{p,t}^{od}\right)}{\partial f_{p,t}^{od}}+\sum_{j}\lambda_{j,t}e_{t}\delta_{j,p}^{od}-u_{t}^{od}-v_{t,p}^{od}
=lωtl,tde(xl,t)δl,pod+jλj,tetδj,podutodvt,pod\displaystyle=\sum_{l}\omega t_{l,t}^{de}\left(x_{l,t}\right)\delta_{l,p}^{od}+\sum_{j}\lambda_{j,t}e_{t}\delta_{j,p}^{od}-u_{t}^{od}-v_{t,p}^{od}
=costp,todutodvt,pod\displaystyle=cost_{p,t}^{od}-u_{t}^{od}-v_{t,p}^{od}
=0\displaystyle=0

Then vt,pod=costt,podutod0v_{t,p}^{od}=cost_{t,p}^{od}-u_{t}^{od}\geq 0 and vt,podfp,tod=0v_{t,p}^{od}f_{p,t}^{od}=0 can be obtained, which is equivalent to (13).∎

It is observed that satisfying condition (13) is equivalent to reaching the traffic UE state, and thus it can be considered as a complementary constraint for the optimization problem (12), whose UE state of the optimal solution can be ensured [35]. Meanwhile, the constraint (13) can be linearized as follows by the big-M method [36].

{0fp,todM(1wp,tod),0costt,podutodMwp,tod\left\{\begin{array}[]{lr}0\leq f_{p,t}^{od}\leq M(1-w_{p,t}^{od}),&\\ 0\leq cost_{t,p}^{od}-u_{t}^{od}\leq Mw_{p,t}^{od}\end{array}\right. (14)

where MM is a large enough constant and wp,todw_{p,t}^{od} is the introduced binary variable, i.e., wp,tod{0,1}w_{p,t}^{od}\in\{0,1\}.

Furthermore, since the bivariate terms exist in Eq. (10), a piecewise linearization method [2] is adopted to transform it as (15).

tl,tde=hH(gl,hΔxl,h,t)xl,t=hHΔxl,h,t0Δxl,h,txlmax/H\begin{split}&t_{l,t}^{de}=\sum_{h}^{H}(g_{l,h}\Delta x_{l,h,t})\\ &x_{l,t}=\sum_{h}^{H}\Delta x_{l,h,t}\\ &0\leq\Delta x_{l,h,t}\leq x_{l}^{max}/H\end{split} (15)

where H is the number of linear segments; gl,hg_{l,h} is the linear segment slope; and Δxl,h,t\Delta x_{l,h,t} is the link flow of segment h.

Consequently, the linear optimization problem for TN can be constructed as follows:

minFTN(fp,tod)t(lωtl,tdexl,t+jλj,tetxj,t)\displaystyle\min\ F_{\rm TN}(f_{p,t}^{od})\triangleq\sum_{t}(\sum_{l}\omega t_{l,t}^{de}x_{l,t}+\sum_{j}\lambda_{j,t}e_{t}x_{j,t}) (16)
s.t. (2)(7),(9)(11),(14),(15)\displaystyle\text{ s.t. }\ (2)-(7),(9)-(1),(4),(5)
l,t𝒯,od,p𝒫od,j\displaystyle\qquad{\forall}l\in\mathcal{L},t\in\mathcal{T},od\in\mathcal{R},p\in\mathcal{P}_{od},j\in\mathcal{B}

The objective of the traffic network optimization problem (16) is to find the optimal traffic flow distribution at a certain electricity price.

II-B Power System Modeling

In this study, there are a dispatchable generator (DG), energy storage (ES) unit and photovoltaic (PV) generation unit locating inside each FCS, which can be regarded as a MG [37]. Without loss of generality, we assume that each bus of power distribution network is connected to a MG. Demands of each MG include flexible demand response load and the charging load of EV users, which is depicted in Fig. 1. The application of ES can improve the flexibility of system power dispatch. The operating costs of MGs can be reduced by adjusting the electricity consumption schedule of the flexible demand response loads. Meanwhile, the demand response load can also obtain certain benefits by providing this service. Note that we assume MGs can sell their redundant energy to the main grid.

In the electrical power network, the index of MGs coupled with transportation network is denoted by jj, which is also connected on the bus j={1,2,,Nb}j\in\mathcal{B}=\{1,2,\cdots,N_{b}\}, where NbN_{b} is the number of buses or MGs.

Refer to caption
Figure 1: Illustration of MG electrical network.

It should be noted that the adjacent node of bus jj is denoted by i𝒩i\in\mathcal{N}, and subscript jiji indicates the transmission line from jj to ii. The objective is to maximize the operation revenue of MG jj. The constraints consist of the following five parts.

1) Transaction with the main grid

It is necessary for MGs to purchase power from the main grid when the power generation units cannot meet the load demand. On the contrary, the MGs can sell the surplus energy of ES or PV generation to the main grid to obtain some revenue. The transaction cost with the main grid in time slot tt can be expressed as

CtG=λtG(pj,tbuypj,tsell)C_{t}^{\rm G}=\lambda_{t}^{\rm G}\left(p_{j,t}^{\rm buy}-p_{j,t}^{\rm sell}\right) (17)

where pj,tbuyp_{j,t}^{\rm buy} and pj,tsellp_{j,t}^{\rm sell} are the power purchased and sold by MG in period tt, respectively. λtG\lambda_{t}^{\rm G} is the day-ahead transaction price of the main grid. The interactive power between MG and main grid must satisfy the constraints as follows.

0pj,tbuyuj,tPmaxG0\leq p_{j,t}^{\rm buy}\leq u_{j,t}P_{\max}^{\rm{G}} (18)
0pj,tsell (1uj,t)PmaxG0\leq p_{j,t}^{\text{sell }}\leq\left(1-u_{j,t}\right)P_{\max}^{\rm{G}} (19)

where PmaxGP_{\max}^{\rm G} represents the maximum value of the exchange power between the MG and the main grid. It can be influenced by the capacity of the transformer at the connection between the distribution network and the MG. uj,tu_{j,t} is a binary variable representing the electricity purchase and sale status of MG jj. The MG purchases electricity from the main grid when the value of uj,tu_{j,t} is 11, otherwise, the MG sells electricity to the main grid.

2) The generation constraint of DG

The power generation cost of DG in MG jj can be calculated by Eq. (20).

CtDG=a(pj,tDG)2+bpj,tDG+cC_{t}^{\rm DG}=a\left(p_{j,t}^{\rm DG}\right)^{2}+bp_{j,t}^{\rm DG}+c (20)

where aa, bb and cc denote the cost parameters, and pj,tDGp_{j,t}^{\rm DG} is the output power of DG in time slot tt. The ramp rate constraint is not considered of DG, since that the power response speed is faster than that of hourly scheduling. The output power constraint is considered as following,

Pj,minDGpj,tDGPj,maxDGP_{j,\min}^{\rm DG}\leq p_{j,t}^{\rm DG}\leq P_{j,\max}^{\rm DG} (21)

where the feasible range of DG power output is specified.

3) The ES

The cost of ES can be expressed as the charge and discharge cost by Eq. (22).

CtES=λES(ηCpj,tES,C+pj,tES,D/ηD)C_{t}^{\rm ES}=\lambda^{\rm ES}\left(\eta_{C}p_{j,t}^{\rm ES,C}+p_{j,t}^{\rm ES,D}/\eta_{D}\right) (22)

where λES\lambda^{\rm ES} denotes the unit charge and discharge cost, pj,tES,Cp_{j,t}^{\rm ES,C} and pj,tES,Dp_{j,t}^{\rm ES,D} represent the charge and discharge power of the ES inverter, respectively. ηC\eta_{C} and ηD\eta_{D} are the charge and discharge efficiency of the ES unit. In addition, the operation constraints of the ES units include functions (23)-(27) as follows.

Ej,t+1=Ej,t+ηCpj,tES,CΔtpj,tES,DΔt/ηDE_{j,t+1}=E_{j,t}+\eta_{C}p_{j,t}^{\rm ES,C}\Delta t-p_{j,t}^{\rm ES,D}\Delta t/\eta_{D} (23)
0pj,tES,Cvj,tPj,maxES0\leq p_{j,t}^{\rm ES,C}\leq v_{j,t}P_{j,\max}^{\rm ES} (24)
0pj,tES,D(1vj,t)Pj,maxES0\leq p_{j,t}^{\rm ES,D}\leq\left(1-v_{j,t}\right)P_{j,\max}^{\rm ES} (25)
Ej,0=Ej,TE_{j,0}=E_{j,\rm T} (26)
SOCj,minEj,t/ELSOCj,maxSOC_{j,\min}\leq E_{j,t}/E_{\rm L}\leq SOC_{j,\max} (27)

The Eq. (23) represents the remaining capacity of the ES in each period tt. Δt\Delta t denotes the time interval, whose value is 1h1h. Constraints (24) and (25) are the charge and discharge power limits of the ES, respectively. Pj,maxESP_{j,\max}^{\rm ES} is the maximum charge and discharge power, which is mainly affected by the capacity limitation of inverters. vj,tv_{j,t} represents the charging and discharging state of the ES, The value of vj,tv_{j,t} is 11 when the state of the ES is charging, otherwise, it is 0. Constraints (26) and (27) describe the SOCSOC limits, where ELE_{\rm L} is the rated capacity of the ES device. TT is the scheduling period, and the value is 24h24h. It is noted that the cost of ES equipment life loss is ignored in day-ahead power dispatch.

4) The flexible demand response load

The MG can adjust the power consumption plan of the demand response load flexibly. Meanwhile, users need to be compensated from the MG. The operating cost CtDRC_{t}^{\rm DR} in the time slot tt can be expressed as

CtDR=λDR|pj,tLpj,tLE|C_{t}^{\rm DR}=\lambda_{\rm DR}\left|p_{j,t}^{\rm L}-p_{j,t}^{\rm LE}\right| (28)

where λDR\lambda_{\rm DR} represents the unit dispatch cost of demand response load. pj,tLEp_{j,t}^{\rm LE} and pj,tLp_{j,t}^{\rm L} are the expected power consumption and the actual dispatched power of the MG to the demand response load in time slot tt, respectively. The auxiliary variable pj,tL,up_{j,t}^{\rm{L},\emph{u}} and pj,tL,dp_{j,t}^{\rm{L},\emph{d}} can be introduced to substitute the absolute value terms in Eq. (28), which can be replaced by constraints (29)-(31).

pj,tLpj,tLE+pj,tL,upj,tL,d=0p_{j,t}^{\rm L}-p_{j,t}^{\rm LE}+p_{j,t}^{\rm{L},\emph{u}}-p_{j,t}^{\rm{L},\emph{d}}=0 (29)
pj,tL,u,pj,tL,d0p_{j,t}^{\rm{L},\emph{u}},p_{j,t}^{\rm{L},\emph{d}}\geq 0 (30)
CtDR=λDR(pj,tL,u+pj,tL,d)C_{t}^{\rm DR}=\lambda_{\rm DR}\left(p_{j,t}^{\rm{L},\emph{u}}+p_{j,t}^{\rm{L},\emph{d}}\right) (31)

The power dispatch of demand response load satisfies the following constraints.

Pj,tL,minpj,tLPj,tL,maxP_{j,t}^{\rm L,\min}\leq p_{j,t}^{\rm L}\leq P_{j,t}^{\rm L,\max} (32)
tpj,tLΔt=PjL\sum_{t}p_{j,t}^{\rm L}\Delta t=P_{j}^{\rm L} (33)

where PjLP_{j}^{\rm L} is the total power demand for one scheduling period, Pj,tL,maxP_{j,t}^{\rm L,\max} and Pj,tL,minP_{j,t}^{\rm L,\min} denote the maximum and minimum power demand in time slot tt.

5) The power flow constraint

For the power flow constraint of this network, the DC approximation of AC optimal power flow is applied to simplify this model. The fixed voltage magnitudes and small phase angle differences are assumed in the network. The reactive part of electrical power is neglected.

i𝒩(j)pji,t=pj,tbuypj,tsell+pj,tPV+pj,tDG+pj,tES,Dpj,tES,Cpj,td\sum_{i\in\mathcal{N}(j)}p_{ji,t}=p_{j,t}^{\rm buy}-p_{j,t}^{\rm sell}+p_{j,t}^{\rm PV}+p_{j,t}^{\rm DG}+p_{j,t}^{\rm ES,D}-p_{j,t}^{\rm ES,C}-p_{j,t}^{d} (34)
pj,td=pj,tL+xj,tet/Δtp_{j,t}^{d}=p_{j,t}^{\rm L}+x_{j,t}e_{t}/\Delta t (35)
pji,t=bji(θjθi)p_{ji,t}=b_{ji}\left(\theta_{j}-\theta_{i}\right) (36)
Fjimaxpji,tFjimax-F_{ji}^{\max}\leq p_{ji,t}\leq F_{ji}^{\max} (37)

Eq. (34) models the active power flow, which is determined by Kirchhoff’s first law and the power flow equations and pj,tPVp_{j,t}^{\rm PV} is the PV power output. The power on each node has to be balanced. In addition, the electricity demands of MG jj include the flexible demand response load and the charging demand, which depend on the traffic flow of the pertaining link. The active power flow on transmission line jiji is the multiply by voltage angle (in radians) differences θjθi\theta_{j}-\theta_{i} and susceptance bjib_{ji} as Eq. (36), where bjib_{ji} is the power distribution factor for line jiji and bji>0b_{ji}>0. For the power flow limits (37), FjimaxF_{ji}^{\max} is the transmission capacity for line jiji.

Based on the above five categories constraint, let vector 𝒑j,t{pj,tbuy,pj,tsell,pj,tDG,pj,tES,D,pj,tES,C,pj,td}\boldsymbol{p}_{j,t}\triangleq\left\{p_{j,t}^{\rm buy},p_{j,t}^{\text{\rm sell}},p_{j,t}^{\rm DG},p_{j,t}^{\rm ES,D},p_{j,t}^{\rm ES,C},p_{j,t}^{d}\right\} be the decision vector of MG jj in time slot tt. The objective of MG jj is to maximize the operation revenue, and the optimization problem is shown as follows. Note that the cost of PV generation is ignored.

maxRj(𝒑j,t)t(CtDG+CtES+CtDR+CtG)Δt\displaystyle\max R_{j}\left(\boldsymbol{p}_{j,t}\right)\triangleq\sum_{t}-\left(C_{t}^{\rm DG}+C_{t}^{\rm ES}+C_{t}^{\rm DR}+C_{t}^{\rm G}\right)\Delta t (38)
s.t. (17)(27),(29)(37)\displaystyle\text{ s.t. }(7)-(7),(9)-(7)
t𝒯,i,j\displaystyle t\in\mathcal{T},i,j\in\mathcal{B}

In this power-transportation coupled network, each EV traveler intends to minimize its total travel cost by route selection autonomously under the uncertainty of traffic demand, and the traffic flow distribution can be analyzed by the convex optimization model (16). Meanwhile, each MG operator determines the day-ahead power schedule according to the charging load caused by traffic flows under the uncertainty of PV generation. As a result, the supply and demand of the power network are matched. In addition, the uncertainties of this network will be discussed in Section IV.

III Game-theoretic analysis

In this section, we construct the EV charging behavior as a potential game to optimize the costs of MGs and the payoffs of drivers simultaneously. In addition, obtaining a Nash equilibrium of this game is equivalent to solving a centralized optimization problem, which is transformed from Section II [38], [39]. Based this theory framework, the individual optimality of the coupled network can be ensured.

III-A Game-theory with uncertainty

In the power network, MGs determine the power schedule and maximize their operation revenues by providing energy at the electricity prices. In the transportation network, selecting the lowest-cost routes and charging in FCSs for EV drivers is determined by the electricity prices and the degree of traffic congestion. Meanwhile, the uncertain variables in the two networks are pj,tPV{p}_{j,t}^{\rm PV} and qtod{q}_{t}^{od}, respectively. There are interconnections between the strategies and revenues of the EV travelers and MGs, and thus it can be interpreted as a noncooperative game with uncertainties.

Definition 1: The game can be defined by a triplet Ξ={{𝒬},{{vod}od,{𝒫j}j},{(FTNv)v𝒬,(Rj)j}}\Xi=\Big{\{}\{\mathcal{Q}\cup\mathcal{B}\},\{\{\mathcal{F}_{v}^{od}\}_{od\in\mathcal{R}},\{\mathcal{P}_{j}\}_{j\in\mathcal{B}}\},\{(-F_{{\rm TN}v})_{v\in\mathcal{Q}},(R_{j})_{j\in\mathcal{B}}\}\\ \Big{\}}, and uncertain variables of the coupled network exist in the strategy sets. The components of game Ξ\Xi are described as follows:

1) Customer Side (EVs)

The players noted as 𝒬\mathcal{Q} in the traffic network are the EV travelers corresponding to the travel demand in different O-D pairs. Player v belongs to 𝒬[0,qtod]\mathcal{Q}\triangleq[0,{q}_{t}^{od}], which is an interval for the number of players. The strategy set consists of all routes connecting the O-D pair, in other words, the traffic flow on each path can be regarded as the strategy of traffic side, i.e., vod={𝒇v,1od,𝒇v,2od,,𝒇v,Nodod}\mathcal{F}_{v}^{od}=\left\{\boldsymbol{f}_{v,1}^{od},\boldsymbol{f}_{v,2}^{od},\cdots,\boldsymbol{f}_{v,N_{od}}^{od}\right\} , where 𝒇vod\boldsymbol{f}_{v}^{od} is the vector of the traffic flow on the O-D pair in a scheduling cycle. Each EV traveler aims to selfishly choose its route and FCS to maximize its traveling utility with uncertain travel demand, i.e., max𝒇vodvod(tcostt,pod)\max\limits_{\boldsymbol{f}_{v}^{od}\in\mathcal{F}_{v}^{od}}\left(-\sum\limits_{t}\operatorname{cost}_{t,p}^{od}\right).

2) Supplier Side (MGs)

The players noted as \mathcal{B} in the power network are MG operators, player j belongs to {1,2,,Nb}\mathcal{B}\triangleq\{1,2,\cdots,N_{b}\}, where NbN_{b} is the number of MGs. The power decision vector for MG jj is 𝒑j\boldsymbol{p}_{j} in one scheduling cycle. 𝒫j={𝒑1,𝒑2,,𝒑Nj}\mathcal{P}_{j}=\{\boldsymbol{p}_{1},\boldsymbol{p}_{2},\cdots,\boldsymbol{p}_{N_{j}}\} is the strategy set, where NjN_{j} is the number of MG jj’s power generation strategies. Then the revenue maximization problem for MG jj with uncertain PV generation can be described by max𝒑j𝒫jRj\max\limits_{\boldsymbol{p}_{j}\in\mathcal{P}_{j}}R_{j}.

The above game-theoretic perspective of this coupled network is shown in Fig. 2, where q~tod\tilde{q}_{t}^{od} and p~j,tPV\tilde{p}_{j,t}^{\rm PV} denote the uncertain parameters of traffic demand and PV power output. The game is designed to derive the global optimums for EVs and MGs with the constraints of the optimization problem (16) and (38), respectively. It is noted that the number of player v is uncertain due to the uncertainty of traffic demand, which leads to the existence of an uncertain vector in the strategies of the transportation network. Meanwhile, when qtodq_{t}^{od} is determined, fp,todf_{p,t}^{od} is variational with the changing of the route selection and can be the decision variable of EV travelers. Note that the power network revenue function does not contain the uncertain variable pjPVp_{j}^{\rm PV}, which is only reflected in constraints.

Refer to caption
Figure 2: The game-theoretic perspective of the power-transportation coupled network.

III-B Potential function construction

Due to the autonomous decision-making of the vehicle, a potential game is formulated. Under the above mathematical framework, the whole problem turns out to be seeking the Nash equilibrium of the game. The corresponding Nash equilibrium is a strategy profile on which no traveler can improve its utility by unilaterally changing its route at a certain electricity price, i.e., the Wardrop UE, and no MG can benefit more by switching to another strategy except the optimal power strategy.

Proposition 2: The Nash equilibrium can be obtained by solving an equivalent centralized optimization problem, and the game is a potential game with potential function as follows:

Φ(fp,tod,𝒑j,t)=t[lωtl,tdexl,t(fp,tod)+j(CtDG\displaystyle\Phi\left(f_{p,t}^{od},\boldsymbol{p}_{j,t}\right)=-\sum_{t}[\sum_{l}\omega t_{l,t}^{de}x_{l,t}\left(f_{p,t}^{od}\right)+\sum_{j}(C_{t}^{\rm DG} (39)
+CtES+CtDR+CtG)Δt]\displaystyle\phantom{=\;\;}+C_{t}^{\rm ES}+C_{t}^{\rm DR}+C_{t}^{\rm G})\Delta t]

where the variables satisfy the constraints of the optimization problems (16) and (38).

Proof:

1) Variable Declaration

There are two kinds of decision vectors as follows. In transportation network, 𝒇vod=(fv,1od,fv,2od,,fv,24od)\boldsymbol{f}_{v}^{od}=(f_{v,1}^{od},f_{v,2}^{od},\cdots,f_{v,24}^{od}) is the traffic flow in the route selected by EV vv. 𝒇vod\boldsymbol{f}_{-v}^{od} is the vector of the traffic flow in the route selected by all EVs except EV vv, defined as 𝒇vod=(𝒇vod,,𝒇v1od,𝒇v+1od,,𝒇qsodod)\boldsymbol{f}_{-v}^{od}=(\boldsymbol{f}_{v}^{od},\cdots,\boldsymbol{f}_{v-1}^{od},\boldsymbol{f}_{v+1}^{od},\cdots,\boldsymbol{f}_{q_{s}^{od}}^{od}). (𝒇v,𝒇vod)(\boldsymbol{f}_{-v},\boldsymbol{f}_{v}^{od}) and (𝒇vod,𝒇^vod)(\boldsymbol{f}_{-v}^{od},\hat{\boldsymbol{f}}_{v}^{od}) are the arbitrary two strategies of EV vv. In the same way, in the power network, 𝒑i\boldsymbol{p}_{i} is the power profile of MG ii. 𝒑i\boldsymbol{p}_{-i} is the power profiles of all MGs except MG ii defined as 𝒑i=(𝒑1,,𝒑i1,𝒑i+1,,𝒑Nb)\boldsymbol{p}_{-i}=(\boldsymbol{p}_{1},\cdots,\boldsymbol{p}_{i-1},\boldsymbol{p}_{i+1},\cdots,\boldsymbol{p}_{N_{b}}). (𝒑i,𝒑i)(\boldsymbol{p}_{-i},\boldsymbol{p}_{i}) and (𝒑i,𝒑^i)(\boldsymbol{p}_{-i},\hat{\boldsymbol{p}}_{i}) are the arbitrary two strategies of MG ii.

2) Transportation Network Side

We first let LTPL_{\mathrm{T}-\mathrm{P}} be the partial Lagrangian of Φ-\Phi associated with constraints (2) and (34),which are shown in Eq. (40). It should be noted that the charging price is determined by the marginal cost of power production of FCSs, viz. the dual variables of power balance constraints (34).

 

LTP(fp,tod,𝒑j,t,utod,λj,t)=t[lωtl,tdexl,t(fp,tod)+j(CtDG+CtES+CtDR+CtG)Δt]+\displaystyle L_{\mathrm{T}-\mathrm{P}}\left(f_{p,t}^{od},\boldsymbol{p}_{j,t},u_{t}^{od},\lambda_{j,t}\right)=\sum_{t}\left[\sum_{l}\omega\mathrm{t}_{l,t}^{de}x_{l,t}\left(f_{p,t}^{od}\right)+\sum_{j}\left(C_{t}^{\rm DG}+C_{t}^{\rm ES}+C_{t}^{\rm DR}+C_{t}^{\rm G}\right)\Delta t\right]+ (40)
tod[utod(pfp,todqtod)]tjλj,t[pj,tbuypj,tsell+pj,tPV+pj,tDG+pj,tES,Dpj,tES,C(pj,tL+xj,tet/Δt)i𝒩(j)pji,t]\displaystyle\sum_{t}\sum_{od}\left[u_{t}^{od}\left(\sum_{p}f_{p,t}^{od}-q_{t}^{od}\right)\right]-\sum_{t}\sum_{j}\lambda_{j,t}\left[p_{j,t}^{\rm buy}-p_{j,t}^{\rm sell}+p_{j,t}^{\rm PV}+p_{j,t}^{\rm DG}+p_{j,t}^{\rm ES,D}-p_{j,t}^{\rm ES,C}-\left(p_{j,t}^{\rm L}+x_{j,t}e_{t}/\Delta t\right)-\sum_{i\in\mathcal{N}(j)}p_{ji,t}\right]

 

The Lagrangian for traffic side can be expressed as

LT(fp,tod,utod)=t[lωtl,tdexl,t(fp,tod)+jλj,tetxl,t(fp,tod)]\displaystyle L_{\rm T}\left(f_{p,t}^{od},u_{t}^{od}\right)=\sum_{t}\left[\sum_{l}\omega t_{l,t}^{de}x_{l,t}\left(f_{p,t}^{od}\right)+\sum_{j}\lambda_{j,t}e_{t}x_{l,t}\left(f_{p,t}^{od}\right)\right] (41)
+tod[utod(pfp,todqtod)]\displaystyle\phantom{=\;\;}+\sum_{t}\sum_{od}\left[u_{t}^{od}\left(\sum_{p}f_{p,t}^{od}-q_{t}^{od}\right)\right]

By calculating the Lagrangian partial derivative, (40) and (41) yields

LTPfp,tod=FTNfp,tod+utod=LTfp,tod\frac{\partial L_{\mathrm{T}-\mathrm{P}}}{\partial f_{p,t}^{od}}=\frac{\partial F_{\mathrm{TN}}}{\partial f_{p,t}^{od}}+u_{t}^{od}=\frac{\partial L_{\mathrm{T}}}{\partial f_{p,t}^{od}} (42)

In addition, let 𝒇od=(f1,1od,,fi,24od,,fqtod,1od,,fqtod,24od)\boldsymbol{f}^{od}=(f_{1,1}^{od},\cdots,f_{i,24}^{od},\cdots,f_{q_{t}^{od},1}^{od},\\ \cdots,f_{q_{t}^{od},24}^{od}) denote the travel flow profile vector of all EVs in the O-D pair. Considering the definition of ordinal potential game [39], for two strategy profiles 𝒇od=(𝒇vod,𝒇vod)\boldsymbol{f}^{od}=(\boldsymbol{f}_{-v}^{od},\boldsymbol{f}_{v}^{od}) and 𝒇od^=(𝒇vod,𝒇^vod)\hat{\boldsymbol{f}^{od}}=(\boldsymbol{f}_{-v}^{od},\hat{\boldsymbol{f}}_{v}^{od}), if [FTNv(𝒇vod,𝒇vod)][FTNv(𝒇vod,𝒇vod^)]0[-F_{{\rm TN}_{v}}(\boldsymbol{f}_{-v}^{od},\boldsymbol{f}_{v}^{od})]-[-F_{{\rm TN}_{v}}(\boldsymbol{f}_{-v}^{od},\hat{\boldsymbol{f}_{v}^{od}})]\geq 0, then Φ(𝒇vod,𝒇vod,𝒑i)Φ(𝒇vod,𝒇vod^,𝒑i)\Phi(\boldsymbol{f}_{-v}^{od},\boldsymbol{f}_{v}^{od},\boldsymbol{p}_{i})-\Phi(\boldsymbol{f}_{-v}^{od},\hat{\boldsymbol{f}_{v}^{od}},\boldsymbol{p}_{i})\geq0 can be derived due to the monotonicity and non-negativity of FTNF_{\rm TN}. Thus Φ\Phi is a potential function for EV travelers.

3) Power network Side

In the power network side, the Lagrangian can be expressed as

LP(𝒑j,t,λj,t)=tj(CtDG+CtES+CtDR+CtG)Δt\displaystyle L_{\rm P}\left(\boldsymbol{p}_{j,t},\lambda_{j,t}\right)=\sum_{t}\sum_{j}\left(C_{t}^{\rm DG}+C_{t}^{\rm ES}+C_{t}^{\rm DR}+C_{t}^{\rm G}\right)\Delta t- (43)
tjλj,t[pj,tbuypj,tsell+pj,tPV+pj,tDG+pj,tES,Dpj,tES,C\displaystyle\sum_{t}\sum_{j}\lambda_{j,t}[p_{j,t}^{\rm buy}-p_{j,t}^{\rm sell}+p_{j,t}^{\rm PV}+p_{j,t}^{\rm DG}+p_{j,t}^{\rm ES,D}-p_{j,t}^{\rm ES,C}
(pj,tL+xj,tet/Δt)i𝒩(j)pji,t]\displaystyle-\left(p_{j,t}^{\rm L}+x_{j,t}e_{t}/\Delta t\right)-\sum_{i\in\mathcal{N}(j)}p_{ji,t}]

The necessary and sufficient condition for potential function being obtained and the objective function of power network getting the minimum value simultaneously can be expressed as (44), which can be satisfied obviously.

LTP𝒑j,t=LP𝒑j,t\frac{\partial L_{\mathrm{T}-\mathrm{P}}}{\partial\boldsymbol{p}_{j,t}}=\frac{\partial L_{\mathrm{P}}}{\partial\boldsymbol{p}_{j,t}} (44)

In addition, for two strategy profiles 𝒑=(𝒑i,𝒑i)\boldsymbol{p}=(\boldsymbol{p}_{-i},\boldsymbol{p}_{i}) and 𝒑^=(𝒑i,𝒑^i)\hat{\boldsymbol{p}}=(\boldsymbol{p}_{-i},\hat{\boldsymbol{p}}_{i}), if Ri(𝒑i,𝒑i)Ri(𝒑i,𝒑i^)R_{i}(\boldsymbol{p}_{-i},\boldsymbol{p}_{i})-R_{i}(\boldsymbol{p}_{-i},\hat{\boldsymbol{p}_{i}})\geq0, it can be proofed readily that Φ(𝒑i,𝒑i,𝒇vod)Φ(𝒑i,𝒑i^,𝒇vod){\Phi}(\boldsymbol{p}_{-i},\boldsymbol{p}_{i},\boldsymbol{f}_{v}^{od})-{\Phi}(\boldsymbol{p}_{-i},\hat{\boldsymbol{p}_{i}},\boldsymbol{f}_{v}^{od})\geq0. Thus Φ\Phi is a potential function for MGs. ∎

In the potential game Ξ\Xi, the utility and revenue function for each player can be mapped to the potential function Φ(fp,tod,𝒑j,t){\Phi}(f_{p,t}^{od},\boldsymbol{p}_{j,t}). The Nash equilibrium of game Ξ\Xi is equivalent to the set of optimal solutions of the potential function (39), where the feasible set of decision variables can be defined by constraints of the optimization problems (16) and (38). Note that the potential function is concave with respect to decision variables, which guarantees the existence and uniqueness of the Nash equilibrium. Therefore, we can determine the optimal strategy of the coupled network by locating the local optima of the above potential function [38].

IV Ambiguity set and reformulation of DRO problem

The exogenous uncertainties of the power-transportation coupled network include the PV power output and charging demand of FCS which is led by the uncertainty of the traffic demand. Furthermore, these uncertainties can be transformed into the source-load uncertainties of the power network, which are built as moment-based ambiguity sets incorporating the probability distribution information to reduce the conservativeness by RO in this study. In this framework, the uncertainties of the coupled network can be captured by a two-stage DRO model, in which all game players seek to optimize their profits. Generally, the decision variables in the DRO model include two-stage strategies. The first-stage strategies are mainly the design strategies, such as price setting and power purchase plan in the day-ahead, which must be made “here-and-now” before the realization of uncertainties. The second-stage strategies are the operational decisions and can also be called a “wait-and-see” pattern, which is determined after the uncertainty realization of the coupled network.

IV-A Uncertainty Handling

The uncertainty of traffic demand for each O-D pair in time slot tt can be characterized as a box uncertainty set [20] in (45).

q~todBOX(q¯tod,q¯tod){q¯todq~todq¯tod,od}\tilde{q}_{t}^{od}\in{\rm BOX}(\underline{q}_{t}^{od},\overline{q}_{t}^{od})\triangleq\{\underline{q}_{t}^{od}\leq\tilde{q}_{t}^{od}\leq\overline{q}_{t}^{od},\forall od\in\mathcal{R}\} (45)

The traffic flow space-time distribution led by the uncertainty of traffic demand can be transformed into uncertainties of the charging demand of FCSs connected to the buses, which is illustrated in Fig. 3.

Refer to caption
Figure 3: Illustration of uncertain variables conversion in a transportation system.

According to the Wardrop UE Principle, there is a unique solution for the convex optimization problem (12), and the box uncertainty set f~p,todBOX(f¯p,tod,f¯p,tod)\tilde{f}_{p,t}^{od}\in{\rm BOX}(\underline{f}_{p,t}^{od},\overline{f}_{p,t}^{od}) can be obtained by [20]

f¯p,tod={argmin0xl,t(fp,tod)costt,pod(θ)dθs.t.q¯tod=p𝒫odfp,tod,(3)(11)f¯p,tod={argmin0xl,t(fp,tod)costt,pod(θ)dθs.t.q¯tod=p𝒫odfp,tod,(3)(11)\begin{split}&\overline{f}_{p,t}^{od}=\left\{\begin{array}[]{lr}\mathop{\arg\min}\int_{0}^{x_{l,t}(f_{p,t}^{od})}\rm cost_{t,p}^{od}(\theta)d\theta\\ \text{s.t.}\ \overline{q}_{t}^{od}=\sum\limits_{p\in\mathcal{P}^{od}}f_{p,t}^{od},(3)-(11)\end{array}\right.\\ &\underline{f}_{p,t}^{od}=\left\{\begin{array}[]{lr}\mathop{\arg\min}\int_{0}^{x_{l,t}(f_{p,t}^{od})}\rm cost_{t,p}^{od}(\theta)d\theta\\ \text{s.t.}\ \underline{q}_{t}^{od}=\sum\limits_{p\in\mathcal{P}^{od}}f_{p,t}^{od},(3)-(11)\end{array}\right.\end{split} (46)

The link flow is monotonically increasing for the path flow according to Eq. (5). Then the box set of the link flow can be derived as

x~j,tBOX(xj,t(f¯p,tod),xj,t(f¯p,tod))\tilde{x}_{j,t}\in{\rm BOX}(x_{j,t}(\underline{f}_{p,t}^{od}),x_{j,t}(\overline{f}_{p,t}^{od})) (47)

Due to the charging demand for each FCS being regarded as a linear relation with the link flow, the uncertainty box set of charging load in MG jj can be represented as

e~j,tBOX(xj,t(f¯p,tod)et,xj,t(f¯p,tod)et)\tilde{e}_{j,t}\in{\rm BOX}(x_{j,t}(\underline{f}_{p,t}^{od})e_{t},x_{j,t}(\overline{f}_{p,t}^{od})e_{t}) (48)

In addition, it is assumed that the available energy of PV generation fluctuates with the interval between p¯j,tPV\underline{p}_{j,t}^{\rm PV} and p¯j,tPV\overline{p}_{j,t}^{\rm PV} , and then the uncertainty set of PV power output can also be characterized by a box set as follows:

p~j,tPVBOX(p¯j,tPV,p¯j,tPV)\tilde{p}_{j,t}^{\rm PV}\in{\rm BOX}(\underline{p}_{j,t}^{\rm PV},\overline{p}_{j,t}^{\rm PV}) (49)

IV-B Ambiguity Set Construction

There are two common methods to establish an ambiguity set generally, including moment-based [25] and distance metric approach [40]. In this study, the moment-based method is adopted to construct a ambiguity set considering the mean and support information.

According to Ref. [26], the uncertain variable e~j,t\tilde{e}_{j,t} and p~j,tPV\tilde{p}_{j,t}^{\rm PV} can be written as

{e~j,t=ej,t,pr+ej,t,deα~j,tp~j,tPV=pj,t,prPV+pj,t,dePVβ~j,t\left\{\begin{array}[]{lr}\tilde{e}_{j,t}=e_{j,t,pr}+e_{j,t,de}\tilde{\alpha}_{j,t}\\ \tilde{p}_{j,t}^{\rm PV}=p_{j,t,pr}^{\rm PV}+p_{j,t,de}^{\rm PV}\tilde{\beta}_{j,t}\end{array}\right. (50)

where ej,t,pre_{j,t,pr} and pj,t,prPVp_{j,t,pr}^{\rm PV} are the predicted charging demand and predicted PV generation. Parameters ej,t,dee_{j,t,de} and pj,t,dePVp_{j,t,de}^{\rm PV} denote the maximum deviations relative to the predicted values, satisfying Eq. (51). Random variables α~j,t\tilde{\alpha}_{j,t} and β~j,t\tilde{\beta}_{j,t} take values within [1,1][-1,1] indicating the degree of fluctuation relative to the predicted values. We assume that 𝜶~t=[α~1,t,α~2,t,,α~j,t,,α~Nb,t]Nb\boldsymbol{\tilde{\alpha}}_{t}=[\tilde{\alpha}_{1,t},\tilde{\alpha}_{2,t},\cdots,\tilde{\alpha}_{j,t},\cdots,\tilde{\alpha}_{N_{b},t}]\in\mathbb{R}^{N_{b}} and 𝜷~t=[β~1,t,β~2,t,,β~j,t,,β~Nb,t]Nb\boldsymbol{\tilde{\beta}}_{t}=[\tilde{\beta}_{1,t},\tilde{\beta}_{2,t},\cdots,\tilde{\beta}_{j,t},\cdots,\tilde{\beta}_{N_{b},t}]\in\mathbb{R}^{N_{b}} are variable vectors, and then the uncertain vector 𝝈~t\boldsymbol{\tilde{\sigma}}_{t} is defined as 𝝈~t={𝜶~tT𝜷~tT}(2Nb)×1T\boldsymbol{\tilde{\sigma}}_{t}=\{\boldsymbol{\tilde{\alpha}}_{t}^{\rm T}\ \boldsymbol{\tilde{\beta}}_{t}^{\rm T}\}_{(2N_{b})\times 1}^{\rm T}(t𝒯)(t\in\mathcal{T}).

ej,t,de=max{ej,t,prxj,t(f¯p,tod)et,xj,t(f¯p,tod)etej,t,pr}\displaystyle e_{j,t,de}=\max\left\{e_{j,t,pr}-x_{j,t}(\underline{f}_{p,t}^{od})e_{t},x_{j,t}(\overline{f}_{p,t}^{od})e_{t}-e_{j,t,pr}\right\} (51)
pj,t,dePV=max{pj,t,prPVp¯j,tPV,p¯j,tPVpj,t,prPV}\displaystyle p_{j,t,de}^{\rm PV}=\max\left\{p_{j,t,pr}^{\rm PV}-\underline{p}_{j,t}^{\rm PV},\overline{p}_{j,t}^{\rm PV}-p_{j,t,pr}^{\rm PV}\right\}

In this way, the ambiguity set 𝒫t\mathscr{P}_{t} is defined as [41]

𝒫t={tNb×Nb:𝔼t[𝝈~t]=𝟎,\displaystyle\mathscr{P}_{t}=\left\{\mathbb{P}_{t}\in\mathbb{R}^{N_{b}}\times\mathbb{R}^{N_{b}}:\mathbb{E}_{\mathbb{P}_{t}}\left[\tilde{\boldsymbol{\sigma}}_{t}\right]=\mathbf{0},\right. (52)
t[𝝈~t𝒞m,t]=Pm,t,m}\displaystyle\phantom{=\;\;}\left.\mathbb{P}_{t}\left[\tilde{\boldsymbol{\sigma}}_{t}\in\mathscr{C}_{m,t}\right]=P_{m,t},\forall m\in\mathcal{M}\right\}

where 𝒫t\mathscr{P}_{t} denotes that the joint probability distribution on Nb×Nb\mathbb{R}^{N_{b}}\times\mathbb{R}^{N_{b}} of uncertain vector 𝜶~t\boldsymbol{\tilde{\alpha}}_{t} and 𝜷~t\boldsymbol{\tilde{\beta}}_{t} is t\mathbb{P}_{t}, which is obtained by the historical data of the PV power output and the traffic demand. The first item in (52) indicates that the expectation of the uncertain vector in time slot tt is equal to 𝟎\boldsymbol{0}. The second item implies that the probability of confidence set 𝒞m,t\mathscr{C}_{m,t} occurring is Pm,tP_{m,t} with ={1,,M0}\mathcal{M}=\{1,\cdots,M_{0}\}, and M0M_{0} is the number of confidence sets. Meanwhile, the value of Pm,tP_{m,t} belongs to [0,1][0,1] for all mm\in\mathcal{M} and PM0,t=1P_{M_{0},t}=1 is assumed.

According to the Ref. [41], the confidence set 𝒞m,t\mathscr{C}_{m,t} is defined as

𝒞m,t={𝝈~tNb×Nb:𝝈t1,𝝈t1Γm,t,\displaystyle\mathscr{C}_{m,t}=\left\{\tilde{\boldsymbol{\sigma}}_{t}\in\mathbb{R}^{N_{b}}\times\mathbb{R}^{N_{b}}:\left\|\boldsymbol{\sigma}_{t}\right\|_{\infty}\leq 1,\left\|\boldsymbol{\sigma}_{t}\right\|_{1}\leq\Gamma_{m,t},\right. (53)
t𝒯,m}\displaystyle\phantom{=\;\;}\left.\forall t\in\mathcal{T},\forall m\in\mathcal{M}\right\}

where the first item denotes that all the elements in 𝝈~t\boldsymbol{\tilde{\sigma}}_{t} take values between -1 and 1, and the second item implies that the sum of absolute values for all components in 𝝈~t\boldsymbol{\tilde{\sigma}}_{t} take values lower than the budget of uncertainty defined as Γm,t\Gamma_{m,t}, which is added to avoid that 𝝈~t\boldsymbol{\tilde{\sigma}}_{t} always achieves boundary values to reduce the conservativeness of this model. For obtaining tractable DRO, we assume that Γm,t\Gamma_{m,t} is strictly increasing with the increase of mm and 𝒞m,t\mathscr{C}_{m,t}\subset𝒞m+1,t\mathscr{C}_{m+1,t}. Furthermore, the probabilities in (52) satisfy Pm,t{P}_{m,t}\leqPm+1,t{P}_{m+1,t} for m{1,,\forall m\in\{1,\cdots,M0M_{0}1}-1\}. It is noted that if Γm,t\Gamma_{m,t} is given, Pm,t{P}_{m,t} is the probability for the uncertain vector 𝝈~t\boldsymbol{\tilde{\sigma}}_{t} occurring in 𝒞m,t\mathscr{C}_{m,t}.

IV-C Reformulation of the DRO Problem

The purpose of this study is to optimize the cost of power generation from the perspective of distribution network and MG, and to achieve the optimal social benefit simultaneously, viz. to ensure the optimality of traffic scheduling. Under the potential game-theory framework, the centralized optimization method is employed to carry out the day-ahead power scheduling and ensure the individual optimality. Due to the existence of uncertain variables, the Nash equilibrium state of potential function (39) can be derived by an equivalent optimization problem as follows:

minsupt𝒫t𝔼t{tTlωtl,tdexl,t(fp,tod)+tTjB[λtG(pj,tbuy\displaystyle\min\sup_{\mathbb{P}_{t}\in\mathscr{P}_{t}}\mathbb{E}_{\mathbb{P}_{t}}\{\sum_{t\in T}\sum_{l\in\mathcal{L}}\omega t_{l,t}^{de}x_{l,t}\left(f_{p,t}^{od}\right)+\sum_{t\in T}\sum_{j\in B}[\lambda_{t}^{\rm G}(p_{j,t}^{\rm buy}- (54)
pj,tsell)+a(pj,tDG)2+bpj,tDG+c+λES(ηCpj,tES,C\displaystyle\phantom{=\;\;}p_{j,t}^{\rm sell})+a\left(p_{j,t}^{\rm DG}\right)^{2}+bp_{j,t}^{\rm DG}+c+\lambda^{\rm ES}(\eta_{C}p_{j,t}^{\rm ES,C}
+pj,tES,D/ηD)+λDR|pj,tLpj,tLE|]Δt}\displaystyle\phantom{=\;\;}+p_{j,t}^{\rm ES,D}/\eta_{D})+\lambda_{\rm DR}\left|p_{j,t}^{\rm L}-p_{j,t}^{\rm LE}\right|]\Delta t\}
s.t.(2)(7),(9)(11),(14),(15),(17)(27),(29)(37)\displaystyle\phantom{=\;\;}\text{s.t.}(2)-(7),(9)-(1),(4),(5),(7)-(7),(9)-(7)

In the following, a two-stage distributionally robust optimization model is proposed with the underlying compact matrix form:

min𝒙𝒄T𝒙+min𝒚tt𝒯supt𝒫t𝔼t(𝒃T𝒚t)\displaystyle\min_{\boldsymbol{x}}\boldsymbol{c}^{\mathrm{T}}\boldsymbol{x}+\min_{\boldsymbol{y}_{t}}\sum_{t\in\mathcal{T}}\sup_{\mathbb{P}_{t}\in\mathscr{P}_{t}}\mathbb{E}_{\mathbb{P}_{t}}(\boldsymbol{b}^{\mathrm{T}}\boldsymbol{y}_{t}) (55)
s.t. 𝑨𝒙𝒉\displaystyle\text{ s.t. }\boldsymbol{Ax}\leq\boldsymbol{h}
𝑩t𝒙+𝑪t𝒚t𝒅t+𝑫t𝝈t\displaystyle\qquad\boldsymbol{B}_{t}\boldsymbol{x}+\boldsymbol{C}_{t}\boldsymbol{y}_{t}\leq\boldsymbol{d}_{t}+\boldsymbol{D}_{t}\boldsymbol{\sigma}_{t}

where 𝑨\boldsymbol{A}, 𝑩t\boldsymbol{B}_{t}, 𝑪t\boldsymbol{C}_{t}, and 𝑫t\boldsymbol{D}_{t} denote constant matrices; 𝒉\boldsymbol{h} and 𝒅t\boldsymbol{d}_{t} are constant vectors. Vectors 𝒙\boldsymbol{x} and 𝒚t\boldsymbol{y}_{t} represent decision variables and are listed as follows:

𝒙={uj,t,vj,t,pj,tbuy}\displaystyle\boldsymbol{x}=\left\{u_{j,t},v_{j,t},p_{j,t}^{\rm buy}\right\} (56)
𝒚t={xl,t,xj,t,fp,tod,pj,tsell,pj,tDG,pj,tES,D,pj,tES,C,pj,tL}\displaystyle\boldsymbol{y}_{t}=\left\{x_{l,t},x_{j,t},f_{p,t}^{od},p_{j,t}^{\rm sell},p_{j,t}^{\rm DG},p_{j,t}^{\rm ES,D},p_{j,t}^{\rm ES,C},p_{j,t}^{\rm L}\right\}

The variable 𝒙\boldsymbol{x} is a vector of the first-stage decision consisting of the variables determined in the day-ahead and the indicator variables. The variable 𝒚t\boldsymbol{y}_{t} is a vector of the second-stage decision, which includes link flow, path flow, and decision variables of power network.

From the objective of (55), the second item differentiates with classical robust approaches, and the physical meaning of this item is to minimize the worst situation depending on the distribution t\mathbb{P}_{t}. Combined with (52), the second item in the objective of (55) can be converted as follows [37]:

min𝒚tsupt𝒫t𝔼t(𝒃T𝒚t)=\displaystyle\min_{\boldsymbol{y}_{t}}\sup_{\mathbb{P}_{t}\in\mathscr{P}_{t}}\mathbb{E}_{\mathbb{P}_{t}}(\boldsymbol{b}^{\mathrm{T}}\boldsymbol{y}_{t})= supt𝒫t𝔼t(min𝒃T𝒚t)=\displaystyle\sup_{\mathbb{P}_{t}\in\mathscr{P}_{t}}\mathbb{E}_{\mathbb{P}_{t}}(\min\boldsymbol{b}^{\mathrm{T}}\boldsymbol{y}_{t})= (57)
max𝒞M0,t(min𝒃T𝒚t)𝑑P(𝝈t)\displaystyle\max\int\limits_{\mathscr{C}_{M_{0},t}}(\min\boldsymbol{b}^{\mathrm{T}}\boldsymbol{y}_{t})dP\left(\boldsymbol{\sigma}_{t}\right)

Similarly, constraints in the ambiguity set (52) about the probability of 𝝈~t\tilde{\boldsymbol{\sigma}}_{t} can be transformed as

𝒞M0,t𝝈t𝑑P(𝝈t)=𝟎,𝒞m,t𝑑P(𝝈t)=Pm,t\int_{\mathscr{C}_{M_{0},t}}\boldsymbol{\sigma}_{t}dP\left(\boldsymbol{\sigma}_{t}\right)=\mathbf{0},\ \int_{\mathscr{C}_{m,t}}dP\left(\boldsymbol{\sigma}_{t}\right)=P_{m,t} (58)

As a result, we can reformulate (55) as follows [42]:

min𝒙𝒄T𝒙+min𝒚tt𝒯\displaystyle\min_{\boldsymbol{x}}\boldsymbol{c}^{\mathrm{T}}\boldsymbol{x}+\min_{\boldsymbol{y}_{t}}\sum_{t\in\mathcal{T}} supt𝒫t𝔼t(𝒃T𝒚t)\displaystyle\sup_{\mathbb{P}_{t}\in\mathscr{P}_{t}}\mathbb{E}_{\mathbb{P}_{t}}(\boldsymbol{b}^{\mathrm{T}}\boldsymbol{y}_{t})
s.t. 𝒞M0,t𝝈t𝑑P(𝝈t)=𝟎:𝜼t\displaystyle\text{ s.t. }\int\limits_{\mathscr{C}_{M_{0},t}}\boldsymbol{\sigma}_{t}dP\left(\boldsymbol{\sigma}_{t}\right)=\mathbf{0}:\ \boldsymbol{\eta}_{t}
𝒞m,t𝑑P(𝝈t)=Pm,t:γmt\displaystyle\qquad\int\limits_{\mathscr{C}_{m,t}}dP\left(\boldsymbol{\sigma}_{t}\right)=P_{m,t}:\ \gamma_{mt}
𝑩t𝒙+𝑪t𝒚t𝒅t+𝑫t𝝈t:𝝂t\boldsymbol{B}_{t}\boldsymbol{x}+\boldsymbol{C}_{t}\boldsymbol{y}_{t}\leq\boldsymbol{d}_{t}+\boldsymbol{D}_{t}\boldsymbol{\sigma}_{t}:\ \boldsymbol{\nu}_{t}\!\!\!\!\!\!\!\!\!\!
𝑨𝒙𝒉\boldsymbol{Ax}\leq\boldsymbol{h}~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~

where 𝜼t\boldsymbol{\eta}_{t}, 𝝂t\boldsymbol{\nu}_{t} and γmt\gamma_{mt} denote the dual vectors for the corresponding constraints.

V Solution Methodology

Generally, the two-stage optimization problem (55) can be derived as a master problem (MP) and a subproblem (SP). It should be noted that the term including the uncertainties in (55) has a minimax form, supt𝒫t𝔼t(min𝒃T𝒚t)\sup\limits_{\mathbb{P}_{t}\in\mathscr{P}_{t}}\mathbb{E}_{\mathbb{P}_{t}}(\min\boldsymbol{b}^{\mathrm{T}}\boldsymbol{y}_{t}), which is classified as a bi-level programming problem. For solving the above problem, duality theory can be used to transform the bi-level problem into a single-level model with corresponding dual variables.

Proposition 3: Concerning the ambiguity set 𝒫t\mathscr{P}_{t}, the bilevel programming problem supt𝒫t𝔼t(min𝒃T𝒚t)\sup\limits_{\mathbb{P}_{t}\in\mathscr{P}_{t}}\mathbb{E}_{\mathbb{P}_{t}}(\min\boldsymbol{b}^{\mathrm{T}}\boldsymbol{y}_{t}) can be reformulated as the following optimization problem:

minm=0M0γmtPm,t\displaystyle\min\sum_{m=0}^{M_{0}}\gamma_{mt}P_{m,t} (59)
s.t.s=m1M0γst\displaystyle\text{s.t.}\sum_{s=m-1}^{M_{0}}\gamma_{st}\geq
max𝝈mt,𝝂mt[(𝝈mt)T𝜼t+(𝝂mt)T(𝑩t𝒙𝒅t𝑫t𝝈mt)]\displaystyle\qquad\max\limits_{\boldsymbol{\sigma}_{mt},\boldsymbol{\nu}_{mt}}\left[-(\boldsymbol{\sigma}_{mt})^{\mathrm{T}}\boldsymbol{\eta}_{t}+\left(\boldsymbol{\nu}_{mt}\right)^{\mathrm{T}}\left(\boldsymbol{B}_{t}\boldsymbol{x}-\boldsymbol{d}_{t}-\boldsymbol{D}_{t}\boldsymbol{\sigma}_{mt}\right)\right]
𝒃=(𝑪t)T𝝂t\displaystyle\qquad\boldsymbol{b}=\left(\boldsymbol{C}_{t}\right)^{\mathrm{T}}\boldsymbol{\nu}_{t}

Proof: Concerning the confidence set 𝒞m,t\mathscr{C}_{m,t}\subset𝒞m+1,t\mathscr{C}_{m+1,t} for all mm\in\mathcal{M}, the objective (57) can be discretized into [26]

max𝒞M0,t(min𝒃T𝒚t)𝑑P(𝝈t)\displaystyle\max\int\limits_{\mathscr{C}_{M_{0},t}}(\min\boldsymbol{b}^{\mathrm{T}}\boldsymbol{y}_{t})dP\left(\boldsymbol{\sigma}_{t}\right) (60)
=max[𝒞1,t(min𝒃T𝒚t)dP(𝝈t)+𝒞2,t\𝒞1,t(min𝒃T𝒚t)dP(𝝈t)\displaystyle=\max\left[\int\limits_{\mathscr{C}_{1,t}}(\min\boldsymbol{b}^{\mathrm{T}}\boldsymbol{y}_{t})dP\left(\boldsymbol{\sigma}_{t}\right)+\int\limits_{\mathscr{C}_{2,t}\backslash\mathscr{C}_{1,t}}(\min\boldsymbol{b}^{\mathrm{T}}\boldsymbol{y}_{t})dP\left(\boldsymbol{\sigma}_{t}\right)\right.
++𝒞M0,t\𝒞M01,t(min𝒃T𝒚t)dP(𝝈t)]\displaystyle\phantom{=\;\;}\left.+\cdots+\int\limits_{\mathscr{C}_{M_{0,t}}\backslash\mathscr{C}_{M_{0}-1,t}}(\min\boldsymbol{b}^{\mathrm{T}}\boldsymbol{y}_{t})dP\left(\boldsymbol{\sigma}_{t}\right)\right]
=max[𝒞1,t(min𝒃T𝒚t)dP(𝝈t)\displaystyle=\max\left[\int\limits_{\mathscr{C}_{1,t}}(\min\boldsymbol{b}^{\mathrm{T}}\boldsymbol{y}_{t})dP\left(\boldsymbol{\sigma}_{t}\right)\right.
+s=1M1𝒞s+1,t\𝒞s,t(min𝒃T𝒚t)dP(𝝈t)]\displaystyle\phantom{=\;\;}\left.+\sum_{s=1}^{M-1}\int\limits_{\mathscr{C}_{s+1,t}\backslash\mathscr{C}_{s,t}}(\min\boldsymbol{b}^{\mathrm{T}}\boldsymbol{y}_{t})dP\left(\boldsymbol{\sigma}_{t}\right)\right]

Similarly, the constraints (58) can be transformed as:

{𝒞1,t𝝈t𝑑P(𝝈t)+s=1M01𝒞s+1,t\𝒞s,t𝝈t𝑑P(𝝈t)=𝟎:𝜼t𝒞1,t𝑑P(𝝈t)+s=1m1𝒞s+1,t,𝒞s,t𝑑P(𝝈t)=Pm,t:γmt\displaystyle\left\{\begin{array}[]{l}\int\limits_{\mathscr{C}_{1,t}}\boldsymbol{\sigma}_{t}dP\left(\boldsymbol{\sigma}_{t}\right)+\sum\limits_{s=1}^{M_{0}-1}\int\limits_{\mathscr{C}_{s+1,t}\backslash\mathscr{C}_{s,t}}\boldsymbol{\sigma}_{t}dP\left(\boldsymbol{\sigma}_{t}\right)=\mathbf{0}:\boldsymbol{\eta}_{t}\\ \int\limits_{\mathscr{C}_{1,t}}dP\left(\boldsymbol{\sigma}_{t}\right)+\sum\limits_{s=1}^{m-1}\int\limits_{\mathscr{C}_{s+1,t},\mathscr{C}_{s,t}}dP\left(\boldsymbol{\sigma}_{t}\right)=P_{m,t}:\gamma_{mt}\end{array}\right. (61)
(𝜼tNb×Nb,m)\displaystyle\left(\boldsymbol{\eta}_{t}\in\mathbb{R}^{N_{b}}\times\mathbb{R}^{N_{b}},\forall m\in\mathcal{M}\right)

where 𝜼t\boldsymbol{\eta}_{t} and γmt\gamma_{mt} represent the dual vector and variable corresponding constraints, and 𝜼t𝟎\boldsymbol{\eta}_{t}\geq\boldsymbol{0}, γmt0\gamma_{mt}\geq 0.

Based on the dual theory, the maximization problem (60) and (61) can be written as an equivalent minimization problem as follows:

minm=0M0γmtPm,t\displaystyle\min\sum_{m=0}^{M_{0}}\gamma_{mt}P_{m,t} (62)
s.t. (𝝈t)T𝜼t+s=1M0γstmin𝒃T𝒚t,𝝈t𝒞1,t\displaystyle\text{ s.t. }\left(\boldsymbol{\sigma}_{t}\right)^{\mathrm{T}}\boldsymbol{\eta}_{t}+\sum_{s=1}^{M_{0}}\gamma_{st}\geq\min\boldsymbol{b}^{\mathrm{T}}\boldsymbol{y}_{t},\forall\boldsymbol{\sigma}_{t}\in\mathscr{C}_{1,t}
(𝝈t)T𝜼t+s=m1M0γstmin𝒃T𝒚t,𝝈t𝒞s+1,t\𝒞s,t,\displaystyle\left(\boldsymbol{\sigma}_{t}\right)^{\mathrm{T}}\boldsymbol{\eta}_{t}+\sum_{s=m-1}^{M_{0}}\gamma_{st}\geq\min\boldsymbol{b}^{\mathrm{T}}\boldsymbol{y}_{t},\forall\boldsymbol{\sigma}_{t}\in\mathscr{C}_{s+1,t}\backslash\mathscr{C}_{s,t},
m\[1]\displaystyle\ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \forall m\in\mathcal{M}\backslash[1]

Then the constraints in (62) can be transformed as

s=m1M0γstmax𝝈t𝒞m,t[min𝒃T𝒚t(𝝈t)T𝜼t],m\[1]\sum_{s=m-1}^{M_{0}}\gamma_{st}\geq\max_{\boldsymbol{\sigma}_{t}\in\mathscr{C}_{m,t}}[\min\boldsymbol{b}^{\mathrm{T}}\boldsymbol{y}_{t}-\left(\boldsymbol{\sigma}_{t}\right)^{\mathrm{T}}\boldsymbol{\eta}_{t}],\forall m\in\mathcal{M}\backslash[1] (63)

Combining with (55) and (63), the second stage of the bi-level programming problem can be reformulated as:

max𝝈t𝒞m,t[min𝒃T𝒚t(𝝈t)T𝜼t]\displaystyle\max_{\boldsymbol{\sigma}_{t}\in\mathscr{C}_{m,t}}[\min\boldsymbol{b}^{\mathrm{T}}\boldsymbol{y}_{t}-\left(\boldsymbol{\sigma}_{t}\right)^{\mathrm{T}}\boldsymbol{\eta}_{t}] (64)
s.t. 𝑩t𝒙+𝑪t𝒚t𝒅t+𝑫t𝝈t:𝝂t\displaystyle\text{ s.t. }\boldsymbol{B}_{t}\boldsymbol{x}+\boldsymbol{C}_{t}\boldsymbol{y}_{t}\leq\boldsymbol{d}_{t}+\boldsymbol{D}_{t}\boldsymbol{\sigma}_{t}:\ \boldsymbol{\nu}_{t}

where m\[1]\forall m\in\mathcal{M}\backslash[1], and 𝝂t\boldsymbol{\nu}_{t} is the dual vector of the second stage constraints, 𝝂t𝟎\boldsymbol{\nu}_{t}\geq\boldsymbol{0}.

According to the duality theory, (64) can be converted as a single-level optimization model as follows:

max𝝈t,𝝂t[(𝝈t)T𝜼t+𝝂tT(𝑩t𝒙𝒅t𝑫t𝝈t)]\displaystyle\max_{\boldsymbol{\sigma}_{t},\boldsymbol{\nu}_{t}}\left[-\left(\boldsymbol{\sigma}_{t}\right)^{\mathrm{T}}\boldsymbol{\eta}_{t}+\boldsymbol{\nu}_{t}^{\mathrm{T}}\left(\boldsymbol{B}_{t}\boldsymbol{x}-\boldsymbol{d}_{t}-\boldsymbol{D}_{t}\boldsymbol{\sigma}_{t}\right)\right] (65)
s.t. 𝒃=(𝑪t)T𝝂t\displaystyle\text{ s.t. }\boldsymbol{b}=\left(\boldsymbol{C}_{t}\right)^{\mathrm{T}}\boldsymbol{\nu}_{t}

Assuming two uncertain vectors 𝝈1,t\boldsymbol{\sigma}_{1,t} and 𝝈2,t\boldsymbol{\sigma}_{2,t} being defined as 𝟎𝝈1,t\boldsymbol{0}\leq\boldsymbol{\sigma}_{1,t},𝝈2,t𝟏\boldsymbol{\sigma}_{2,t}\leq\boldsymbol{1}, and 𝝈1,t,𝝈2,tNb×Nb\boldsymbol{\sigma}_{1,t},\boldsymbol{\sigma}_{2,t}\in\mathbb{R}^{N_{b}}\times\mathbb{R}^{N_{b}}, the constraints of confidence set 𝒞m,t\mathscr{C}_{m,t} can be converted as [43]

{𝝈t=𝝈1,t𝝈2,t𝟏(𝝈1,t+𝝈2,t)Γm,t\left\{\begin{array}[]{l}\boldsymbol{\sigma}_{t}=\boldsymbol{\sigma}_{1,t}-\boldsymbol{\sigma}_{2,t}\\ \mathbf{1}^{\prime}\cdot\left(\boldsymbol{\sigma}_{1,t}+\boldsymbol{\sigma}_{2,t}\right)\leq\Gamma_{m,t}\end{array}\right. (66)

Then the bilinear term in the constraint of Proposition 3 can be written as

max𝝈t𝒞m,t(𝜼tT𝒗tT𝑫t)𝝈t\displaystyle\max_{\boldsymbol{\sigma}_{t}\in\mathscr{C}_{m,t}}\left(-\boldsymbol{\eta}_{t}^{\mathrm{T}}-\boldsymbol{v}_{t}^{\mathrm{T}}\boldsymbol{D}_{t}\right)\boldsymbol{\sigma}_{t} (67)
s.t. 𝝈t=𝝈1,t𝝈2,t\displaystyle\text{ s.t. }\boldsymbol{\sigma}_{t}=\boldsymbol{\sigma}_{1,t}-\boldsymbol{\sigma}_{2,t}
𝟎𝝈1,t𝟏:𝝉1,t\displaystyle\qquad\mathbf{0}\leq\boldsymbol{\sigma}_{1,t}\leq\mathbf{1}:\boldsymbol{\tau}_{1,t}
𝟎𝝈2,t𝟏:𝝉2,t\displaystyle\qquad\mathbf{0}\leq\boldsymbol{\sigma}_{2,t}\leq\mathbf{1}:\boldsymbol{\tau}_{2,t}
𝟏(𝝈1,t+𝝈2,t)Γm,t:ρt\displaystyle\qquad\mathbf{1}^{\prime}\cdot\left(\boldsymbol{\sigma}_{1,t}+\boldsymbol{\sigma}_{2,t}\right)\leq\Gamma_{m,t}:\rho_{t}

where 𝝉1,t,𝝉2,t\boldsymbol{\tau}_{1,t},\boldsymbol{\tau}_{2,t} and ρt\rho_{t} represent the dual vectors and variable corresponding to the constraints, respectively.

Based on the dual theory, the maximization problem (67) can be written as an equivalent minimization problem as follows:

min𝟏(𝝉1,t+𝝉2,t)ρtΓm,t\displaystyle\min-\mathbf{1}^{\prime}\cdot\left(\boldsymbol{\tau}_{1,t}+\boldsymbol{\tau}_{2,t}\right)-\rho_{t}\Gamma_{m,t} (68)
s.t. 𝝂tT𝑫t+𝜼tT+𝝉1,t+𝟏ρt𝟎\displaystyle\text{ s.t. }\boldsymbol{\nu}_{t}^{\mathrm{T}}\boldsymbol{D}_{t}+\boldsymbol{\eta}_{t}^{\mathrm{T}}+\boldsymbol{\tau}_{1,t}+\mathbf{1}\cdot\rho_{t}\geq\mathbf{0}
𝜼tT𝝂tT𝑫t+𝝉2,t+𝟏ρt𝟎\displaystyle\qquad-\boldsymbol{\eta}_{t}^{\mathrm{T}}-\boldsymbol{\nu}_{t}^{\mathrm{T}}\boldsymbol{D}_{t}+\boldsymbol{\tau}_{2,t}+\mathbf{1}\cdot\rho_{t}\geq\mathbf{0}
𝝉1,t,𝝉2,t𝟎,ρt0\displaystyle\qquad\boldsymbol{\tau}_{1,t},\boldsymbol{\tau}_{2,t}\geq\mathbf{0},\rho_{t}\geq 0

Combined with (68), the single-level optimization model in the constraint of Proposition 3 can be converted as follows:

max𝝂tT(𝑩t𝒙𝒅t)+𝟏T(𝝉1,t+𝝉2,t)+ρtΓm,t\displaystyle\max\boldsymbol{\nu}_{t}^{\mathrm{T}}\left(\boldsymbol{B}_{t}\boldsymbol{x}-\boldsymbol{d}_{t}\right)+\mathbf{1}^{\mathrm{T}}\cdot\left(\boldsymbol{\tau}_{1,t}+\boldsymbol{\tau}_{2,t}\right)+\rho_{t}\Gamma_{m,t} (69)
s.t. 𝒃=(𝑪t)T𝝂t\displaystyle\text{ s.t. }\boldsymbol{b}=\left(\boldsymbol{C}_{t}\right)^{\mathrm{T}}\boldsymbol{\nu}_{t}
𝝂tT𝑫t+𝜼tT+𝝉1,t+𝟏ρt𝟎\displaystyle\qquad\boldsymbol{\nu}_{t}^{\mathrm{T}}\boldsymbol{D}_{t}+\boldsymbol{\eta}_{t}^{\mathrm{T}}+\boldsymbol{\tau}_{1,t}+\mathbf{1}\cdot\rho_{t}\geq\mathbf{0}
𝜼tT𝝂tT𝑫t+𝝉2,t+𝟏ρt𝟎\displaystyle\qquad-\boldsymbol{\eta}_{t}^{\mathrm{T}}-\boldsymbol{\nu}_{t}^{\mathrm{T}}\boldsymbol{D}_{t}+\boldsymbol{\tau}_{2,t}+\mathbf{1}\cdot\rho_{t}\geq\mathbf{0}
𝝉1,t,𝝉2,t𝟎,ρt0,𝝂t𝟎\displaystyle\qquad\boldsymbol{\tau}_{1,t},\boldsymbol{\tau}_{2,t}\geq\mathbf{0},\rho_{t}\geq 0,\boldsymbol{\nu}_{t}\geq\mathbf{0}

The complementary slackness conditions of the optimal problem (68) can be written as

{(𝝂tT𝑫t+𝜼tT+𝝉1,t+𝟏ρt)𝝈1,t=𝟎(𝜼tT𝝂tT𝑫t+𝝉2,t+𝟏ρt)𝝈2,t=𝟎(𝝈1,t𝟏)T𝝉1,t=𝟎(𝝈2,t𝟏)T𝝉2,t=𝟎\left\{\begin{array}[]{l}\left(\boldsymbol{\nu}_{t}^{\mathrm{T}}\boldsymbol{D}_{t}+\boldsymbol{\eta}_{t}^{\mathrm{T}}+\boldsymbol{\tau}_{1,t}+\mathbf{1}\cdot\rho_{t}\right)\boldsymbol{\sigma}_{1,t}=\mathbf{0}\\ \left(-\boldsymbol{\eta}_{t}^{\mathrm{T}}-\boldsymbol{\nu}_{t}^{\mathrm{T}}\boldsymbol{D}_{t}+\boldsymbol{\tau}_{2,t}+\mathbf{1}\cdot\rho_{t}\right)\boldsymbol{\sigma}_{2,t}=\mathbf{0}\\ \left(\boldsymbol{\sigma}_{1,t}-\mathbf{1}\right)^{\mathrm{T}}\boldsymbol{\tau}_{1,t}=\mathbf{0}\\ \left(\boldsymbol{\sigma}_{2,t}-\mathbf{1}\right)^{\mathrm{T}}\boldsymbol{\tau}_{2,t}=\mathbf{0}\end{array}\right. (70)

The nonlinear constraints in (70) can be converted by the big-M method as follows:

{𝟎𝝉1,tM𝜺1,t𝟎𝝉2,tM𝜺2,t𝟎𝝂tT𝑫t+𝜼tT+𝝉1,t+𝟏ρtM𝜾1,t𝟎𝜼tT𝝂tT𝑫t+𝝉2,t+𝟏ρtM𝜾2,t𝜺1,t𝝈1,t𝟏𝜾1,t𝜺2,t𝝈2,t𝟏𝜾2,t\left\{\begin{array}[]{l}\mathbf{0}\leq\boldsymbol{\tau}_{1,t}\leq M\boldsymbol{\varepsilon}_{1,t}\\ \mathbf{0}\leq\boldsymbol{\tau}_{2,t}\leq M\boldsymbol{\varepsilon}_{2,t}\\ \mathbf{0}\leq\boldsymbol{\nu}_{t}^{\mathrm{T}}\boldsymbol{D}_{t}+\boldsymbol{\eta}_{t}^{\mathrm{T}}+\boldsymbol{\tau}_{1,t}+\mathbf{1}\cdot\rho_{t}\leq M\boldsymbol{\iota}_{1,t}\\ \mathbf{0}\leq-\boldsymbol{\eta}_{t}^{\mathrm{T}}-\boldsymbol{\nu}_{t}^{\mathrm{T}}\boldsymbol{D}_{t}+\boldsymbol{\tau}_{2,t}+\mathbf{1}\cdot\rho_{t}\leq M\boldsymbol{\iota}_{2,t}\\ \boldsymbol{\varepsilon}_{1,t}\leq\boldsymbol{\sigma}_{1,t}\leq\mathbf{1}-\boldsymbol{\iota}_{1,t}\\ \boldsymbol{\varepsilon}_{2,t}\leq\boldsymbol{\sigma}_{2,t}\leq\mathbf{1}-\boldsymbol{\iota}_{2,t}\end{array}\right. (71)

Then the subproblem of the optimal problem (55) can be expressed as the following MILP problem:

SP:max𝝂tT(𝑩t𝒙𝒅t)+𝟏T(𝝉1,t+𝝉2,t)+ρtΓm,t\displaystyle\operatorname{SP}:\max\boldsymbol{\nu}_{t}^{\mathrm{T}}\left(\boldsymbol{B}_{t}\boldsymbol{x}-\boldsymbol{d}_{t}\right)+\mathbf{1}^{\mathrm{T}}\cdot\left(\boldsymbol{\tau}_{1,t}+\boldsymbol{\tau}_{2,t}\right)+\rho_{t}\Gamma_{m,t} (72)
s.t. 𝒃=(𝑪t)T𝝂t\displaystyle\text{ s.t. }\boldsymbol{b}=\left(\boldsymbol{C}_{t}\right)^{\mathrm{T}}\boldsymbol{\nu}_{t}
𝟎𝝉1,tM𝜺1,t\displaystyle\qquad\mathbf{0}\leq\boldsymbol{\tau}_{1,t}\leq M\boldsymbol{\varepsilon}_{1,t}
𝟎𝝉2,tM𝜺2,t\displaystyle\qquad\mathbf{0}\leq\boldsymbol{\tau}_{2,t}\leq M\boldsymbol{\varepsilon}_{2,t}
𝟎𝝂tT𝑫t+𝜼tT+𝝉1,t+𝟏ρtM𝜾1,t\displaystyle\qquad\mathbf{0}\leq\boldsymbol{\nu}_{t}^{\mathrm{T}}\boldsymbol{D}_{t}+\boldsymbol{\eta}_{t}^{\mathrm{T}}+\boldsymbol{\tau}_{1,t}+\mathbf{1}\cdot\rho_{t}\leq M\boldsymbol{\iota}_{1,t}
𝟎𝜼tT𝝂tT𝑫t+𝝉2,t+𝟏ρtM𝜾2,t\displaystyle\qquad\mathbf{0}\leq-\boldsymbol{\eta}_{t}^{\mathrm{T}}-\boldsymbol{\nu}_{t}^{\mathrm{T}}\boldsymbol{D}_{t}+\boldsymbol{\tau}_{2,t}+\mathbf{1}\cdot\rho_{t}\leq M\boldsymbol{\iota}_{2,t}
𝜺1,t𝝈1,t𝟏𝜾1,t\displaystyle\qquad\boldsymbol{\varepsilon}_{1,t}\leq\boldsymbol{\sigma}_{1,t}\leq\mathbf{1}-\boldsymbol{\iota}_{1,t}
𝜺2,t𝝈2,t𝟏𝜾2,t\displaystyle\qquad\boldsymbol{\varepsilon}_{2,t}\leq\boldsymbol{\sigma}_{2,t}\leq\mathbf{1}-\boldsymbol{\iota}_{2,t}
𝟏(𝝈1,t+𝝈2,t)Γm,t\displaystyle\qquad\mathbf{1}^{\prime}\cdot\left(\boldsymbol{\sigma}_{1,t}+\boldsymbol{\sigma}_{2,t}\right)\leq\Gamma_{m,t}

where 𝜺1,t\boldsymbol{\varepsilon}_{1,t}, 𝜺2,t\boldsymbol{\varepsilon}_{2,t}, 𝜾1,t\boldsymbol{\iota}_{1,t} and 𝜾2,t\boldsymbol{\iota}_{2,t} denote binary variables.

Next, we implement the Benders decomposition algorithm to solve the DRO model (55) [42]. According to Ref. [44], the Benders cut is expressed as

s=m1M0γst(𝝈mt)T𝜼t+(𝝂mt)T(𝑩t𝒙𝒅t𝑫t𝝈mt)\sum_{s=m-1}^{M_{0}}\gamma_{st}\geq-\left(\boldsymbol{\sigma}_{mt}^{*}\right)^{\mathrm{T}}\boldsymbol{\eta}_{t}+\left(\boldsymbol{\nu}_{mt}^{*}\right)^{\mathrm{T}}\left(\boldsymbol{B}_{t}\boldsymbol{x}-\boldsymbol{d}_{t}-\boldsymbol{D}_{t}\boldsymbol{\sigma}_{mt}^{*}\right) (73)

where 𝝈t\boldsymbol{\sigma}_{t}^{*} and 𝝂t\boldsymbol{\nu}_{t}^{*} are the optimal solution of the SP\rm SP. Then the MP\rm MP can be written as

MP:min(𝒄T𝒙+t𝒯m=0M0γmtPm,t)\mathrm{MP}:\min\left(\boldsymbol{c}^{\mathrm{T}}\boldsymbol{x}+\sum_{t\in\mathcal{T}}\sum_{m=0}^{M_{0}}\gamma_{mt}P_{m,t}\right)~~~~~~~~~~~~~~~~~~~~~~~~
s.t. 𝑨𝒙𝒉\displaystyle\text{ s.t. }\boldsymbol{A}\boldsymbol{x}\leq\boldsymbol{h} (74)
s=m1M0γst(𝝈mtk)T𝜼t+(𝝂mtk)T(𝑩t𝒙𝒅t𝑫t𝝈mtk)\displaystyle\sum_{s=m-1}^{M_{0}}\gamma_{st}\geq-\left(\boldsymbol{\sigma}_{mt}^{*k}\right)^{\mathrm{T}}\boldsymbol{\eta}_{t}+\left(\boldsymbol{\nu}_{mt}^{*k}\right)^{\mathrm{T}}\left(\boldsymbol{B}_{t}\boldsymbol{x}-\boldsymbol{d}_{t}-\boldsymbol{D}_{t}\boldsymbol{\sigma}_{mt}^{*k}\right)

where 𝝈mtk\boldsymbol{\sigma}_{mt}^{*k} and 𝝂mtk\boldsymbol{\nu}_{mt}^{*k} are the optimal solution of the SP\rm SP at iteration kk.

On this basis, the solving procedure of the DRO model (55) with a master-subproblem framework relying on the Benders decomposition algorithm is presented in Algorithm 1, and the parameter setting of the ambiguity set can affect the precision of optimal solution.

Algorithm 1 Benders decomposition algorithm on DRO model.
Result: Obtain the optimal day-ahead power scheduling strategy
 1. Initialization. Let the lower bound LB=LB=-\infty, upper bound UB=+UB=+\infty and iteration number k=0k=0. Choose a convergence tolerance level ξ>0\xi>0. Fix a feasible solution (𝒙k,γmtk,𝜼tk)(\boldsymbol{x}^{*k},\gamma_{mt}^{*k},\boldsymbol{\eta}_{t}^{*k}) of MP\rm MP.
 2. Solve the SP\rm SP related to (𝒙k,γmtk,𝜼tk)(\boldsymbol{x}^{*k},\gamma_{mt}^{*k},\boldsymbol{\eta}_{t}^{*k}), to get the objective value s=m1M0γstk\sum\limits_{s=m-1}^{M_{0}}\gamma_{st}^{*k} and optimal solution (𝝈mtk,𝝂mtk)(\boldsymbol{\sigma}_{mt}^{*k},\boldsymbol{\nu}_{mt}^{*k}).
 3. Obtain the objective value OB=𝒄T𝒙k+t𝒯m=0M0s=m1M0γstkPm,tOB=\boldsymbol{c}^{\mathrm{T}}\boldsymbol{x}^{*k}+\sum\limits_{t\in\mathcal{T}}\sum\limits_{m=0}^{M_{0}}\sum\limits_{s=m-1}^{M_{0}}\gamma_{st}^{*k}P_{m,t} of MP\rm MP with respect to 𝒙k\boldsymbol{x}^{*k} and s=m1M0γstk\sum\limits_{s=m-1}^{M_{0}}\gamma_{st}^{*k}, and let UBUB==min{OB,UB}\text{min}\{OB,UB\}.
 4. Update the Benders cut related to (𝝈mtk,𝝂mtk)(\boldsymbol{\sigma}_{mt}^{*k},\boldsymbol{\nu}_{mt}^{*k}) and solve MP\rm MP. Then update the optimal solution and objective value to (𝒙(k+1),γmt(k+1),𝜼t(k+1))(\boldsymbol{x}^{*(k+1)},\gamma_{mt}^{*(k+1)},\boldsymbol{\eta}_{t}^{*(k+1)}) and z(k+1)z^{*(k+1)}, respectively. Let LB=z(k+1)LB=z^{*(k+1)}.
 5. If (UBLB)/UBξ(UB-LB)/UB\leq\xi, terminate the procedure and return 𝒙k\boldsymbol{x}^{*k} as the optimal solution. Otherwise, let k=k+1k=k+1 and go to Step 2.

VI Case Study

In this section, numerical analysis is performed for the 24-h scheduling in an 8-MG power system coupled with a benchmark urban TN, as depicted in Fig. 4 and Fig. 5. The distribution network is modified from an IEEE 33-bus system with PVs. 8-MG is located in the power system, and is connected with one bus, respectively. The detailed data can be referred to [45]. The arrows of each link represent the direction of vehicles permitted to drive. This urban TN is widely utilized in the research related to power and transportation coupled networks (e.g., [8], [20], [46]). Relevant parameters of traffic links are listed in Table II. It is noted that the proposed approach is extensible, which can be applied to the urban TN with bidirectional-link and multiple O-D pairs. Owing to the spatial limitation of this paper, only three O-D pairs in this transportation system are considered, and the parameters are listed in Table III, in which the traffic demand is the average value from the historical data [20] and the basic value (p.u.) is 100 vehicles per hour. The simulations are implemented on a laptop with an Intel Core i9-10885H CPU 2.40 GHz using MATLAB with YALMIP and CPLEX 12.9.0 solver.

Refer to caption
Figure 4: Topology of TN with FCS.
Refer to caption
Figure 5: Topology of distribution network.
TABLE II: Parameters for the 20-links transportation system
link Ca(p.u.)C_{a}(\rm p.u.) tl0(min)t_{l}^{0}(min) link Ca(p.u.)C_{a}(\rm p.u.) tl0(min)t_{l}^{0}(min)
1 18 6 11 13.8 12
2 8.5 6 12 17.5 6
3 9.8 5 13 8.9 5
4 20 10 14 9.76 5
5 13.5 12 15 7.9 5
6 19 10 16 17 6.5
7 14 11 17 8.2 6.5
8 20 9 18 9.15 5.9
9 13.2 11 19 8.97 5.8
10 20 10 20 18.2 6.1
TABLE III: Parameters for O-D pairs
O-D pair From node To node Average traffic demand (p.u.)
1-6 N1 N6 15
3-11 N3 N11 12
4-12 N4 N12 15

In the power network, MG11-MG88 serves FCSs of C11-C88, respectively. We assume that the charging power of each EV is a constant 0.015MWh0.015\rm MWh and the monetary value of travel time ω\omega is 10$/h. The relevant parameters of MGs are listed in Table IV, where two types of ES and DG devices are set according to the internal and external circulation flows of the transportation network. The ladder electricity price shown in Fig. 6 is used as the trading price between the main grid and MGs.

TABLE IV: Operation parameters of MGs
Unit Parameter Value
Exchange power with main grid PmaxGP_{\max}^{\rm G}(MW)(\rm MW) 30
DG_I\rm DG\_\rm I PmaxDG(MW)P_{\max}^{\rm DG}(\rm MW) 10
PminDG(MW)P_{\min}^{\rm DG}(\rm MW) 0.5
a/b/c($/MW)a/b/c(\$/\rm MW) 0.1/106/00.1/106/0
DG_II\rm DG\_\rm II PmaxDG(MW)P_{\max}^{\rm DG}(\rm MW) 20
PminDG(MW)P_{\min}^{\rm DG}(\rm MW) 2
a/b/c($/MW)a/b/c(\$/\rm MW) 0.1/106/00.1/106/0
ES_I\rm ES\_\rm I λES($/MW)\lambda^{\rm{ES}}(\$/\rm{MW}) 60
PmaxES(MW)P_{\max}^{\rm{ES}}(\rm{MW}) 10
SOCmaxEL(MWh)SOC_{\max}\cdot E_{\rm{L}}(\rm{MWh}) 3030
SOCminEL(MWh)SOC_{\min}\cdot E_{\rm{L}}(\rm{MWh}) 55
ES_II\rm ES\_\rm II λES($/MW)\lambda^{\rm{ES}}(\$/\rm{MW}) 60
PmaxES(MW)P_{\max}^{\rm{ES}}(\rm{MW}) 15
SOCmaxEL(MWh)SOC_{\max}\cdot E_{\rm{L}}(\rm{MWh}) 4040
SOCminEL(MWh)SOC_{\min}\cdot E_{\rm{L}}(\rm{MWh}) 77
Demand response λDR($/MW)\lambda_{\rm{DR}}(\$/\rm{MW}) 50
Refer to caption
Figure 6: The trading price between the main grid and MGs.

In this paper, there are three kinds of cases being adopted, including the DRO model proposed by this paper, the RO model solved by C&CG method [47], and the deterministic model (DM) without uncertain variables, to research the influence on the conservativeness and economic performance of scheduling profile. In the DRO model case, the number of the confidence sets is set as 6, i.e., M0=5M_{0}=5. It should be noted that the model complexity and the computing time will increase with the rising of M0M_{0}. While if a small value is assigned to M0M_{0}, the statistical features of uncertain parameters cannot be represented adequately. Hence, the value of M0M_{0}, the corresponding uncertainty budgets Γm,t\Gamma_{m,t} and probabilities Pm,tP_{m,t} are set by the distribution information of historical data. Meanwhile, the maximum allowable fluctuation deviation of the traffic demand and PV generation is set according to the previous historical prediction deviation, and the box sets are set as {0.9qt,avodq~tod1.1qt,avod,od}\{0.9{q}_{t,av}^{od}\leq\widetilde{q}_{t}^{od}\leq 1.1{q}_{t,av}^{od},\forall od\in\mathcal{R}\} and {0.85pj,t,avPVp~j,tPV1.15pj,t,avPV,j}\{0.85{p}_{j,t,av}^{\rm PV}\leq\widetilde{p}_{j,t}^{PV}\leq 1.15{p}_{j,t,av}^{\rm PV},\forall j\in\mathcal{B}\}, respectively. Then the charging demand of FCSs can be obtained by (45)-(48). In the RO case, the uncertainty set intervals are the same as the DRO model. In the DM case, the uncertainties of this coupled network are not considered. The average value in all time slots of PV generation is adopted. The UE traffic assignment is determined as a certain variable by solving (12) with the average traffic demand computed from historical data. Meanwhile, the DM optimization model is solved using a mixed integer linear programming method. The UE link flow pattern obtained by (12) is shown in Fig.7, in which the flow of 10th10^{th}, 13th13^{th}, 15th15^{th}, 18th18^{th}, and 20th20^{th} links is zero.

Refer to caption
Figure 7: The link flow with the average traffic demand in the total time slot.

Taking MG11 and MG44 as examples, the optimization results are shown in Fig. 8-12. Note that the two types of ES and EG devices are employed by MG11 and MG44, respectively. The Fig. 8 and Fig. 10 reveal the energy scheduling optimization results of MG11 and MG44 by the DRO method, including the power output of DG, the exchange power of MG with main grid, the PV output, charging and discharging power of ES. It should be noted that when the MG purchases electricity from the main grid, the value of power is positive, otherwise, it is negative. Besides, the power value is negative when ES is charging, otherwise, it is positive. The actual dispatching of demand response loads and expected loads in all time slot are shown in Fig. 9 and Fig. 11.

Refer to caption
Figure 8: Power scheduling profile by DRO for MG11.
Refer to caption
Figure 9: Actual and expected electricity plan of demand response load by DRO for MG11.
Refer to caption
Figure 10: Power scheduling profile by DRO for MG44.
Refer to caption
Figure 11: Actual and expected electricity plan of demand response load by DRO for MG44.

In Fig. 8 and Fig. 10, it can be seen that the PV output power is 0 in 1h~6h and 19h~24h, and the load in the MG is completely supplied by DG, ES and the power purchase from main grid. In this time period, the day-ahead transaction price is lower than the unit power generation cost of DG, and thus DG operates at the minimum output power point. During the rest of the time period, DG operates at the maximum power point to increase the power sold to the main grid, since the day-ahead transaction price is higher than the unit power generation cost of DG. Under the ladder electricity price mechanism, ES is charged during non-peak electricity consumption time period, and discharged at night or in the early morning. Then ES reserves the electricity during the valley electricity price period and sells it during the peak electricity price period. Therefore, the peak shaving and valley filling can be realized, which enhances the flexibility of the PN scheduling and makes the experimental results closer to the actual situation. In Fig. 9 and Fig. 11, the demand response load, which is similar to the traditional load, is mainly concentrated in the peak period of electricity price. Under the premise of satisfying the total power demand and the minimum power consumption in each time period, MG supplies excess power to demand response users during non-peak electricity consumption time period, which is stored to supply the power shortage during peak electricity consumption time period. This mechanism reduces the power purchase by MG from the main grid during the peak electricity price period and the cost of electricity consumption.

The situation of the power purchase and the power sold from/to the main grid of MG11 and MG44 under the three kinds of cases in one scheduling period are shown in Fig. 12 and Fig.13. In the DRO and RO methods, the uncertainty of traffic demand influences the UE state, so further causes the uncertainty of charging demand to MG and generates the reservation of charging power for EVs. Moreover, the worst case of uncertainties is considered in the RO method. As a result, the day-ahead power purchase from the main grid of MG applying DRO method is less than that of RO, and higher than DM on the general trend. In addition, the power outputs of PV generation by DRO method are more than the ones by RO, and thus the power sold to the main grid of MG using DRO method is greater than RO and lower than DM overall. In the DRO model proposed by this paper, the probability distribution for uncertain parameters can be captured based on the historical data. In contrast, the RO model adopts the boundary information of uncertainty box sets to obtain the optimal solution in the worst case increasing the conservativeness of strategies. Meanwhile, the DRO model can avoid the dilemma of selling more surplus electricity back to the main grid.

Refer to caption
Figure 12: The exchange power of MG11 with main grid by DRO, RO and DM.
Refer to caption
Figure 13: The exchange power of MG44 with main grid by DRO, RO and DM.

Table V provides the total cost , total power purchase and power sold from/to the main grid of the 8-MG system in three cases. As is shown in this table, the operating costs of the DRO and RO model are more than that of DM. Nonetheless, it is not mean that the scheme of the DM optimization method is better than the robust method. The reason is that this scheme corresponds to the generation and consumption plan submitted by the MG in the day-ahead market, and the imbalance between the planned generation and consumption caused by the uncertainties in the real-time market. The power purchase price in the real-time market is generally higher than the day-ahead market, which results in the increase of transaction cost for the MG. The robust approach considers the worse situation of the higher-level charging load and the lower-level PV generation, which leads to the more conservative strategies and the higher operating cost for the day-ahead schedule. Meanwhile, the uncertain factors can change the flow patterns and the route choices of vehicles, and the results of the game between EVs and MG operators are changed, which is neglected in DM. In addition, as is shown in Fig.12 and Fig. 13, the increase of operating costs is mainly generated by the increase of the power purchase and the decrease of power sold from//to the main grid. According to this point, the scheduling profile obtained by the robust optimization method has stronger robustness to resist the risk of real-time market loads and price fluctuation. Similarly, it can also be observed that the DRO method can obtain an excellent scheduling strategy with more economic performance compared with RO, which indicates that the DRO model proposed by this paper can reduce the conservativeness of strategies and total cost.

TABLE V: Simulation results in different cases
Model RO DRO DM
PbuyP^{buy} (×103MW)(\times 10^{3}\rm MW) 0.7199 0.6813 0.6488
PsellP^{sell} (×103MW)(\times 10^{3}\rm MW) 1.0267 1.0657 1.2893
Total cost (×104$)(\times 10^{4}\$) 11.1954 9.0474 5.9251

Furthermore, about the performance of the above three cases, the total computing times of the DRO, RO, and DM are 187s, 96s, and 24s, respectively. As expected, the DM has a minimal calculation time. It should be noted that the DRO requires a longer computing time than RO, due to there being more auxiliary variables in the computing process of DRO. However, it can attain more practical scheduling and can adapt to multiple uncertainties.

Finally, we compare the optimization results of our method with the traditional operation case. In the traditional operation case, the individual selfishness and the game relationship between the MGs and the EV drivers are not considered. The optimization goal is modified to optimize the total benefit of the system in (75).

minFTN(fp,tod)+Rj(𝒑j,t)\displaystyle\min\ F_{\rm TN}(f_{p,t}^{od})+R_{j}\left(\boldsymbol{p}_{j,t}\right) (75)
s.t.(2)(7),(9)(11),(14),(15),(17)(27),(29)(37)\displaystyle\text{s.t.}(2)-(7),(9)-(1),(4),(5),(7)-(7),(9)-(7)

In the traditional operation case, the solution method and parameter setting are the same as the DRO model of this paper. According to the simulation calculation, the total cost and the power sold from the main grid are 9.4760×104$9.4760\times 10^{4}\$ and 0.7023×103MW0.7023\times 10^{3}\rm MW. It can be seen that the system operation cost of applying the traditional operation method is slightly lower than the method proposed by this paper, nevertheless, it does not take the intra-day game relationship and the individual selfishness of the coupled system in the optimization problem into account. It can lead to insufficient real-time power dispatching and the unmet EV charging requirements. The framework in this paper fully considers the individual selfishness without sacrificing the total cost, and the individual objectives can be achieved by optimizing the potential function.

VII Conclusion

The purpose of this study is to investigate day-ahead power scheduling for the distribution network while considering the coupling of PN and TN, in which the model construction of intra-day source-load uncertainties and the coupling expression of the two networks are the cruxes. The coupling characteristics of conflicting revenues and interaction accurately are characterized by the game theory, and then we prove that the game relation fulfills the characteristics of potential game. Furthermore, the DRO model considering the probability distribution of uncertain parameters is designed to derive the equilibrium of the potential game in the worst situation over ambiguity sets. It can reduce the conservativeness of the optimization model and ensure the safety and economy of power strategies effectively. Therefore, we propose the two-stage DRO model in a unified centralized optimization framework to carry out the day-ahead power scheduling and ensure the individual optimality. Simulation results confirm that the capability of DRO method based on potential game-theory in this paper is superior to the robust optimization and deterministic optimization approach. Meanwhile, the individual optimal strategies can be obtained by the approach of this paper without sacrificing the total cost. The limitation of this study is that there are some assumptions in the TN, which restrict the practical application of this method. As a result, the future extension of this investigation is to take the endogenous uncertainty for the travelers’ charging demand into account to enhance the application value in practice.

References

  • [1] J. A. P. Lopes, F. J. Soares, and P. M. R. Almeida, “Integration of electric vehicles in the electric power system,” Proc. IEEE, vol. 99, no. 1, pp. 168–183, Jan. 2011.
  • [2] S. Xie, Z. Hu, and J. Wang, “Two-stage robust optimization for expansion planning of active distribution systems coupled with urban transportation networks,” Appl. Energy, vol. 261, pp. 114412, 2020.
  • [3] F. He, D. Wu, Y. Yin, and Y. Guan, “Optimal deployment of public charging stations for plug-in hybrid electric vehicles,” Transp. Res. Part B Methodol., vol. 47, pp. 87-101, Jan. 2013.
  • [4] X. Dong, Y. Mu, X. Xu, H. Jia, et al, “A charging pricing strategy of electric vehicle fast charging stations for the voltage control of electricity distribution networks,” Appl. Energy, vol. 225, pp. 857–868, 2018.
  • [5] K. Mahmud, and G. E. Town, “A review of computer tools for modeling electric vehicle energy requirements and their impact on power distribution networks,” Appl. Energy, vol.172, no. 3, pp. 37–59, 2016.
  • [6] M. Alizadeh, H. Wai, M. Chowdhury, A. Goldsmith, et al, “Optimal Pricing to Manage Electric Vehicles in Coupled Power and Transportation Networks,” IEEE Trans. Control Netw. Syst., vol. 4, no. 4, pp. 863 – 875, Dec. 2017.
  • [7] W. Wei, L. Wu, J. Wang, and S. Mei, “Network equilibrium of coupled transportation and power distribution systems,” IEEE Trans. Smart Grid, vol. 9, no. 6, pp. 6764–6779, Nov. 2018.
  • [8] W. Gan, M. Shahidehpour, M. Yan, J. Guo, et al, “Coordinated Planning of Transportation and Electric Power Networks with the Proliferation of Electric Vehicles,” IEEE Trans. Smart Grid, vol. 11, no. 5, pp. 4005–4015, Sep. 2020.
  • [9] J.R. Correa, and N.E. Stier-Moses, “Wardrop equilibria,” In Cochran J, editor. Wiley Encyclopedia of operations research and management science, Feb. 2011.
  • [10] D. Acemoglu, and A. Ozdaglar, “Competition in parallel-serial networks,” IEEE J. Sel. Areas Commun., vol. 25, no. 6, pp. 1180–1192, Aug. 2007.
  • [11] W. Lee, L. Xiang, R. Schober, and V. W. S. Wong, “Electric vehicle charging stations with renewable power generators: a game theoretical analysis,” IEEE Trans. Smart Grid, vol. 6, no. 2, pp. 608–617, Sep. 2015.
  • [12] C. Wu, H. Mohsenian-Rad, and J. Huang, “Vehicle-to-aggregator interaction game,” IEEE Trans. Smart Grid, vol. 3, no. 1, pp. 434–442, Mar. 2012.
  • [13] W. Tushar, W. Saad, H. V. Poor, and D. B. Smith, “Economics of electric vehicle charging: a game theoretic approach,” IEEE Trans. Smart Grid, vol. 3, no. 4, pp. 1767–1778, Dec. 2012.
  • [14] S. Bahrami, and V. W.S. Wong, “A Potential Game Framework for Charging PHEVs in Smart Grid,” in 2015 IEEE Pacific Rim Conference on Communications, Computers and Signal Processing Conf. (PacRim), pp. 28-33.
  • [15] Z. Zhou, S. J. Moura, H. Zhang, X. Zhang, Q. Guo, and H. Sun, “Power-traffic network equilibrium incorporating behavioral theory: A potential game perspective,” Appl. Energy, vol. 289, no. 1, 116703, May 2021.
  • [16] M. S. Javadi, M. Gough, S. A. Mansouri, A. Ahmarinejad, E. Nematbakhsh, S. F. Santos, and J. P. Catalao, “A two-stage joint operation and planning model for sizing and siting of electrical energy storage devices considering demand response programs,” International Journal of Electrical Power & Energy Systems , vol. 138, 107912, Jun. 2022.
  • [17] A. Rezaee Jordehi, V. Sohrabi Tabar, S. A. Mansouri, F. Sheidaei, A. Ahmarinejad, S. Pirouzi, “Two-stage stochastic programming for scheduling microgrids with high wind penetration including fast demand response providers and fast-start generators,” Sustainable Energy, Grids and Networks, vol. 31, 100694, Sep. 2022.
  • [18] S. A. Mansouri, A. Ahmarinejad, F. Sheidaei, M. S. Javadi, A. Rezaee Jordehi, A. Esmaeel Nezhad, J. P. S. Catalao, “A multi-stage joint planning and operation model for energy hubs considering integrated demand response programs,” International Journal of Electrical Power & Energy Systems, vol. 140, 108103, Sep. 2022.
  • [19] S. A. Mansouri, A. Ahmarinejad, E. Nematbakhsh, M. S. Javadi, A. Esmaeel Nezhade, J. P. S. Catalao, “A sustainable framework for multi-microgrids energy management in automated distribution network by considering smart homes and high penetration of renewable energy resources”, Energy, vol. 245, 123228, Apr. 2022.
  • [20] W. Wei, S. Mei, L. Wu, J. Wang, and Y. Fang, “Robust operation of distribution networks coupled with urban transportation infrastructures,” IEEE Trans. Power Syst., vol. 32, no. 3, pp. 2118-2130, May 2017.
  • [21] C. He, L. Wu, T. Liu, W. Wei, and C. Wang, “Co-optimization scheduling of interdependent power and gas systems with electricity and gas uncertainties,” Energy, vol. 159, pp. 1003-1015, Sep. 2018.
  • [22] S. Dehghan, and N. Amjady, “Robust transmission and energy storage expansion planning in wind farm-integrated power systems considering transmission switching,” IEEE Trans. Sustain. Energy, vol. 7, no. 2, pp. 765–774, May 2016.
  • [23] R. Zhou, X. Min, X. Tong, R. Chen, X. Li, and Z. Liu, “Distributional robust optimization under moment uncertainty of environment and economic dispatch for power system,” Proceedings of the CSEE, vol. 35, no. 13, pp. 3248-3256, Aug. 2015.
  • [24] A. Zare, C. Chung, J. Zhan, and S. O. Faried, “A distributionally robust chance-constrained MILP model for multistage distribution system planning with uncertain renewables and loads,” IEEE Trans. Power Syst., vol. 33, no. 5, pp. 5248–5262, Sep. 2018.
  • [25] D. Pozo, A. Street, and V. Alexandre, “An ambiguity-averse model for planning the transmission grid under uncertainty on renewable distributed generation,” in 2018 Proc. Power Syst. Comput. Conf. (PSCC), Dublin, Ireland, pp. 1–7.
  • [26] Y. Zhang, W. Liu, Z. Huang, F. Zheng, J. Le, and S. Zhu, “Distributionally robust coordination optimization scheduling for electricity-gas-transportation coupled system considering multiple uncertainties,” Renewable Energy, vol. 163, pp. 2037-2052, Jan. 2021.
  • [27] Y. Cao, D. Li, Y. Zhang, Q. Tang, A. Khodaei, H. Zhang, and Z. Han, “Optimal energy management for multi-microgrid under a transactive energy framework with distributionally robust optimization,” IEEE Trans. Smart Grid, early access, Sep. 2021.
  • [28] C. Wang, R. Gao, W. Wei, “Risk-based distributionally robust optimal gas-power flow with wasserstein distance,” IEEE Trans. on Power Syst., vol. 34, no. 3, pp. 2190-2204, Dec. 2018.
  • [29] C. Zhao, and Y. Guan, “Data-driven stochastic unit commitment for integrating wind generation,” IEEE Trans. Power Syst., vol. 31, no. 4, pp. 2587-2596, Jul. 2015.
  • [30] C. Ning, F. You, “Data-driven adaptive robust unit commitment under wind power uncertainty: A Bayesian nonparametric approach,” IEEE Trans. on Power Syst., vol. 34, no. 3, pp. 2409-2418, Jan. 2019.
  • [31] W. Wei, L. Wu, J. Wang, and S. Mei, “Expansion planning of urban electrified transportation networks: A mixed-integer convex programming approach,” IEEE Trans. Transport. Electrific., vol. 3, no. 1, pp. 210–224, Mar. 2017.
  • [32] “Bureau of Public Roads,” Traffic Assignment Manual, U.S. Dept. Commerce, Washington, DC, USA, 1964.
  • [33] M. J. Beckmann, C. B. McGuire, and C. B. Winsten, “Studies in the Economics of Transportation,” in New Haven, CT, USA: Yale Univ. Press, Dec. 1956.
  • [34] Y. Sheffi, “Urban transportation networks: Equilibrium Analysis with Mathematical Programming Methods,” Englewood Cliffs, NJ: Prentice-Hall, pp. 339, 1986.
  • [35] W. Wei, J. Wang, and L. Wu, “Quantifying the impact of road capacity loss on urban electrified transportation networks: an optimization based approach,” Int. J. Transp. Sci. Technol., vol. 5, no. 4, pp. 268–288, Dec. 2016.
  • [36] J. Fortuny-Amat, and B. Mccarl, “A representation and economic interpretation of a two-level programming problem,” J. Oper. Res. Soc., vol. 32, no. 9, pp. 783-792, Sep. 1981.
  • [37] H. Xin, X. Jiang, Y. Jian, Y. Li, et al, “Distributed robust energy management of a multimicrogrid system in the real-time energy market,” IEEE Trans. Sustain. Energy, vol. 10, no. 1, pp. 396–406, 2019.
  • [38] D. Monderer, and L. Shapley, “Potential games,” Games Econ Behav., vol. 14, no. 1, pp. 124–143, May 1996.
  • [39] Q. La, Y. Chew, and B. Soong, “Potential game theory applications in radio resource allocation,” Springer, 2016.
  • [40] P. M. Esfahani and D. Kuhn, “Data-driven distributionally robust optimization using the wasserstein metric: Performance guarantees and tractable reformulations,” Math. Program., vol. 171, pp. 115–166, 2018.
  • [41] W. Wiesemann, D. Kuhn, and M. Sim, “Distributionally Robust Convex Optimization,” Oper. Res. the J. of the Oper. Res., 2014.
  • [42] M. Zugno, and A. J. Conejo, “A Robust Optimization Approach to Energy and Reserve Dispatch in Electricity Markets,” Euro. J. of Oper. Res., vol. 247, no. 2, pp. 659–671, Dec. 2015.
  • [43] R. Li, M. Wang, M. Yang, X. Han, Q. Wu and W. Wang, “A distributionally robust model for reserve optimization considering contingency probability uncertainty,” Int. J. Electr. Power Energy Syst., vol. 134, 107174, Jan. 2022.
  • [44] A.M.Geoffrion, “Generalized Benders Decomposition,” J. of Opti. Theory and Appl., vol. 10, no. 4, Oct.1972.
  • [45] H. Gao, J. Liu, L. Wang’. “Robust coordinated optimization of active and reactive power in active distribution systems,”IEEE Trans. on Smart Grid, vol. 9, no. 5, pp. 4436-4447, Jan. 2017.
  • [46] Y. Liu, Y. Wang, Y. Li, H. Gooi, and H. Xin, “Multi-Agent Based Optimal Scheduling and Trading for Multi-Microgrids Integrated with Urban Transportation Networks,” IEEE Trans. on Power Syst., vol. 36, no. 3, pp. 2197 – 2210, Dec. 2020.
  • [47] B. Zeng, and L. Zhao, “Solving two-stage robust optimization problems using a column-and-constraint generation method,” Oper. Res. Lett., vol. 41, no. 5, pp. 457-461, Sep.2013.