Operational risk quantification of power grids using graph neural network surrogates of the DC OPF
Abstract
A DC OPF surrogate modeling framework is developed for Monte Carlo (MC) sampling-based risk quantification in power grid operation. MC simulation necessitates solving a large number of DC OPF problems corresponding to the samples of stochastic grid variables (power demand and renewable generation), which is computationally prohibitive. Computationally inexpensive surrogates of OPF provide an attractive alternative for expedited MC simulation. Graph neural network (GNN) surrogates of DC OPF, which are especially suitable to graph-structured data, are employed in this work. Previously developed DC OPF surrogate models have focused on accurate operational decision-making and not on risk quantification. Here, risk quantification-specific aspects of DC OPF surrogate evaluation is the main focus. To this end, the proposed GNN surrogates are evaluated using realistic joint probability distributions, quantification of their risk estimation accuracy, and investigation of their generalizability. Four synthetic grids (Case118, Case300, Case1354pegase, and Case2848rte) are used for surrogate model performance evaluation. It is shown that the GNN surrogates are sufficiently accurate for predicting the (bus-level, branch-level and system-level) grid state and enable fast as well as accurate operational risk quantification for power grids. The article thus develops tools for fast reliability and risk quantification in real-world power grids using GNN-based surrogates.
Index Terms:
Risk assessment, system reliability, DC OPF surrogates, graph neural networkI Introduction
The growing participation of renewable energy resources (RES), bulk storage, natural gas turbines and other flexible generation sources (rooftop solar, vehicle to grid, etc.) leads to increasing volatility and uncertainty in the power grid [1, 2, 3]. Quantitative risk assessment and management tools are needed to safely operate the grid under these conditions, specifically: a) advanced decision-making algorithms that explicitly consider the uncertainty in grid variables, and b) explicit reliability and risk quantification methods.
The first category includes stochastic optimization (i.e., stochastic programming, robust, chance-constrained, etc.) [4, 5] or dynamic reserve determination algorithms [6, 7, 8]. These methods model the grid uncertainty using various uncertainty representation methods (scenarios, intervals, etc.) and allow operators to make unit commitment and dispatch decisions under uncertainty. Due to the high computational cost of solving these complex stochastic optimization problems, the grid uncertainty can only be approximately captured (using, e.g., only up to 50 scenarios) for real-world power grids. Grid security constraints are satisfied for the chosen (approximate) uncertainty representation, and the risk underlying the optimal solution (the decision) is never explicitly communicated. The risk quantification is thus implicit and approximate [9]. Nevertheless, these methods are vital for optimal, safe, and reliable operation of the grid.
The second category of tools involves explicit, Monte Carlo (MC) sampling-based risk quantification given a commitment decision and a probabilistic forecast of grid variables [9, 5, 10]. These methods provide more accurate (high-resolution) risk estimates than those used in the current decision-making algorithms, because they use numerous (thousands or more) MC samples. During day-to-day operations, the risk estimates obtained using these methods can improve grid operators’ situational awareness for better decision-making. With heightened situational awareness, operators can tune the hyper-parameters (penalties, constraints, number of samples) of intra-day decision making algorithms (e.g., look-ahead commitment, or LAC) that ensure grid reliability. The real-time risk estimate provides a quantitative basis for emergency decisions such as purchasing additional power, asking consumers to shed loads etc. [11]; such decisions are currently not based on rigorous risk quantification. The risk quantification can also enable risk versus cost analysis for competing methods from the first category to help arrive at the most suitable decision-making algorithm.
The main challenge in the adoption of sampling-based, explicit risk quantification for power grid operation is the computational cost of obtaining decisions (grid state) corresponding to numerous future scenarios of input variables [12]. This requires solving thousands of security-constrained (operational or cost) optimization problems such as the DC OPF problem [13, 14, 15, 16, 17]. Using the state-of-the-art solution algorithms (e.g., alternating directional method of multiplier (ADMM) [18, 19, 20]), it is possible to solve DC OPF within 0.1 for small to medium size power grids [21]. However, it still takes up to a second to solve DC OPF for large grids ( 1000 buses) [22]. As a result, it may take more than thirty minutes to solve thousands of DC OPF problems to obtain samples of grid state corresponding to the samples of inputs. This is much longer than the typical decision-making window of five minutes for SCED (security-constrained economic dispatch). Therefore, it is necessary to develop fast surrogate models or optimization proxies that significantly ease the computational burden of sampling-based risk assessment.
The need for fast decision-making has led to significant research in developing machine learning-based methods that accelerate the OPF solution process [23, 24, 25, 26, 27, 28, 19]. For instance, Deka and Misra [24] leverage neural networks to learn active constraints that can maintain the optimality and achieve speedup by reducing the problem size. Biagoni et al. [19] developed RNN model that can predict the warm-start point to expedite the iteration process of the ADMM algorithm. These methods inevitably involve an iterative solution process and may increase the computational cost. To overcome this issue, end-to-end prediction (surrogate) models are also developed. These models are trained to learn the mapping between input (e.g., load) and output (generator dispatch) [29, 30, 31]. We refer the reader to [32, 25, 33] for further details of various end-to-end OPF surrogate models.
These previous efforts on OPF surrogate development focus on expediting the OPF solution for making operational decisions (setting power dispatch). When such models are to be used for risk assessment, the risk assessment surrogate needs to be evaluated for various additional criteria not typically used in evaluating a decision-making surrogate. These include: (a) using realistic marginal distribution types for the input variables; (b) considering the joint probability distribution of, and hence the correlation between, input variables; (c) quantifying the accuracy of reliability/risk estimates and (d) investigating the generalizability of the model for test scenarios differing from the training data. Previously reported studies on OPF decision-making surrogates have not considered these criteria for model evaluation. In this article, we discuss a methodology for rigorously evaluating different surrogates developed specifically for enabling fast reliability/risk assessment.
The proposed methodology could be used to evaluate any end-to-end risk surrogate model, including the ones previously reported in the literature. Here, we investigate graph neural network (GNN) surrogate models. GNN models are chosen because they can effectively deal with graph-structured data encountered in a power grid operation [34, 35, 36, 37, 38, 39, 40]. Multiple GNN surrogates are trained using a supervised learning approach to predict quantities of interest (QoIs) at different levels: bus-level (active power output), branch-level (transmission line flow) and system-level (operating reserve, load shedding, and total cost). A joint probabilistic forecast is used to generate MC samples of power demand and renewable (wind) generation, which are provided as input to an OPF solver. Using the OPF input-output data for these samples, GNN surrogates are constructed for fast prediction of the bus-, branch-, and system-level QoIs. Predictions of QoIs are used for grid operational risk quantification using risk metrics relevant for day-to-day power grid operation. The reliability and risk estimate obtained using the OPF solver is considered as the ground truth, and the performance of the GNN surrogate is evaluated by comparing GNN-based reliability and risk estimates against OPF-based reliability and risk estimates. To evaluate the generalization capability, a distance metric is used to quantify the distance between the training and testing data distributions, and the effect of this distance on the accuracy of GNN-based risk assessment is quantified. The results provide insights into the GNN surrogate’s performance under different potential grid variable distributions (forecasts) and enhance the robustness of the decision-making process. The main contributions of this work are as follows:
-
1.
Investigation of the predictive capability of the DC OPF surrogate, i.e., GNN, for stochastic grid variable states drawn from realistic probabilistic forecasts. This is the first effort to evaluate surrogate models using realistic (forecast) probability distributions.
-
2.
Evaluation of the DC OPF surrogate’s accuracy in estimating power grid operational reliability and risk.
-
3.
Extension of a previously developed zonal and system-level reliability and risk assessment framework to quantify reliability and risk at the transmission line level.
-
4.
Evaluation of the generalization capability of the DC OPF surrogates in the context of power grid operational risk quantification. Various forecasts of joint probability distributions of renewable generation and load variables are considered to mimic a wide range of operational scenarios. A statistical distance metric is used to measure how well the training data covers the forecast scenario, in order to evaluate the quality of the GNN model prediction for scenarios that significantly differ from the training data. The error in the GNN-based reliability and risk quantification is examined as a function of this distance.
The remainder of this article is organized as follows. In Section II, the formulation of DC OPF is described and the methodology for GNN-based risk quantification is discussed. Numerical experiments are described in Section III and the results are summarized in Section IV. Concluding remarks are provided in Section V.
II Methodology
In this section, we provide details of the DC OPF formulation and its GNN surrogates used in this work, the training data generation process, and method to evaluate the GNN surrogate model’s performance in estimating the grid reliability and risk.
II-A DC optimal power flow (OPF)
The optimal power flow (OPF) problem is central to power system operation and plays a crucial role in economic dispatch, unit commitment, stability and reliability assessment, demand response, etc. The objective of the OPF problem is to minimize overall power generation cost while satisfying the grid security constraints. OPF in the full form (AC OPF) is nonlinear and nonconvex, which limits its usage in practice [41, 42]. Typically, a simplified, linearized version of the OPF (DC OPF) is used. DC OPF ignores reactive power and voltage magnitude in the formulation. This guarantees convexity in the feasible domain and ensures that the solution exists [24, 15]. DC OPF is routinely employed in electricity market clearing and power transmission management [43, 15, 24]. For this reason, DC OPF is the focus of this work. We use OPF to refer to DC OPF in the following text unless otherwise specified. The OPF formulation used in this work is given by:
s.t. | ||||
(1) | ||||
where is marginal price of power generation, and denote power generation and demand, and denote the minimum and maximum allowed power generation at bus , respectively, and represent admittance matrix and voltage angle vector, , are line reactance and line flow limit from bus to bus , and is voltage angle for bus .
II-B GNN surrogate development
The GNN surrogates used in this article are constructed using three GCN layers followed by a single, fully connected readout layer. GCN layers are used to update the node representation while the readout layer is only used to output the QoI with the correct dimension. Since we consider QoIs at multiple levels (bus-, branch- and system-level), it is necessary to develop separate GNN surrogates. Three separate GNN surrogates are developed and each of them predicts the QoIs at one of the three levels. The structure of these GNN surrogates are depicted in Fig. 1.

The loss function is constructed using the mean squared error (MSE) loss and a penalty term. The MSE loss is given by:
(2) |
where and represent the training data value and predicted value, respectively. A penalty term is added to ensure that the predictions satisfy OPF inequality constraints. The penalty term is expressed as:
(3) |
where is an auxiliary variable defined as:
(4) |
with and denoting the minimum and maximum allowed value of the QoI. Using weighted averaging, the final loss function can be represented as:
(5) |
Here and are weights and . Equal weights () could be used to give equal importance to both constraints.
II-C Training data generation
The stochastic variabilities in the power demand and renewable generation sources (supply) are the main sources of uncertainty in a power grid [44, 7, 8, 4]. The probability distribution that quantifies such variability is not known at the time of training data generation, since it is to be obtained through forecasting for a future time of interest. The training data for the GNN surrogate is therefore generated by considering uniform distributions for the demand and renewable generation , where and denote the set of load buses and the set of renewable generators, respectively. The lower and upper bounds of the uniform distributions can be decided based on generation capacities and historical load data. The uniform distributions used to generate the training data are uncorrelated. Given a portfolio of stochastic grid variables (i.e., sample values of and ), (dispatch-able) power generation and transmission line power flow are uniquely determined by solving the DC OPF problem discussed in Section II-A ( and represent the set of (dispatch-able) generators and transmission lines, respectively). The grid state data (generator dispatch and transmission line flows) thus obtained constitute the training data set.
Some of the QoIs needed for reliability and risk assessment, such as operating reserve and load shedding, are not directly available from the OPF solution. Operating reserve refers to the unused generation capacity that can be brought online in the specified time window, whereas load shedding refers to the amount of demand not met by the supply. For a given OPF solution, the operating reserve () is taken as the residual (dispatch-able) power generation capacity after satisfying the net load. This can be computed as:
(6) |
where represents the maximum allowed generation capacity of dispatch-able generators. In case of load shedding, the supply is not able to meet the demand and the OPF solution cannot be obtained. Here, we use the slack bus to provide the required additional power. Power supply from the slack bus () is taken as load shedding ():
(7) |
The total cost, which is a system-level QoI, can also be calculated using the generator dispatch and the bid information.

II-D Testing data generation and GNN surrogate evaluation
Two testing datasets (I and II) are used to evaluate the GNN surrogate performance. Dataset I is generated using the same uncorrelated uniform distributions as the training data, but is separate and independent of the training data. This type of test dataset has been used in the majority of the GNN surrogate literature in the power systems domain [7, 4]. Note, however, that the stochastic grid variables in a real power grid do not follow a uniform distribution. Previous studies suggest that the load follows a truncated normal distribution and different renewable generation sources are described by various other probability distributions [5]. Wind power, for instance, is roughly proportional to the cube of wind speed, and wind speed is typically assumed to follow a Weibull distribution [5]. In addition, grid variables are not independent but exhibit correlation [45, 46, 47]. The GNN surrogates are expected to handle such complicated yet more realistic statistical description (probability distributions and dependence structure) of the stochastic grid variables. Therefore, a second test dataset (Dataset II) is generated to test GNN surrogate performance with respect to this key requirement.
To this end, the power grid is partitioned into multiple zones. Each zone contains a certain proportion of load buses, dispatch-able and RES generators. The joint PDF (forecast) of zonal (aggregated) stochastic grid variables, i.e., load and RES power generation, is represented using a Gaussian copula [48] : , for the sake of illustration, as:
(8) |
where is the number of distinct zones, is the random vector containing independent uniform variables, and represent the standard normal CDF and multivariate normal CDF with zero mean and correlation matrix , respectively. In this work, we simply assume a matrix and use the resulting copula for drawing samples of correlated grid variables (). Note that in the real-world application of GNN surrogates, samples drawn from the joint forecast PDF of stochastic variables will be used and will therefore correctly incorporate the correlations. Here, the Gaussian copula (with an assumed matrix) is used only for demonstrating and evaluating GNN-based reliability and risk quantification. Given a sample of , a sample of correlated stochastic grid variables can be generated as:
(9) |
where represents the marginal CDF of stochastic variable in zone .
Without loss of generality, wind turbines are assumed to be the renewable generators in our illustration. Samples of zonal (aggregated) load () and wind generation () are drawn using the copula. The relative contribution of each load bus/wind turbine is assumed to be a constant in each zone. Consider zone containing load buses and wind turbines; then the bus-level load and wind power for zone are calculated as:
(10) |
where and represent the relative contribution of each load bus and wind generator bus to the respective zonal (aggregated) values, and and denote the aggregated values for zone . Thus it is only necessary to draw samples of and for each zone, not at each bus. In this manner, a sample of stochastic grid variables with zonal correlation is obtained and used to solve the OPF problem. The procedure used to generate Dataset II is given below:
-
i.
Draw a sample and for each zone using Eq. 9
-
ii.
Calculate load and wind power for load buses and wind generator buses in each zone using Eq. 10.
-
iii.
Run a numerical solver Pandapower to get OPF solution corresponding to the given sample of stochastic variables.
-
iv.
Repeat Step i to iii multiple times to generate sufficient amount of test data samples for Dataset II.
Note that Dataset II enables the evaluation of GNN surrogate performance for reliability and risk estimation for realistic conditions (correlated grid variables with non-uniform probability distributions). The results from GNN surrogates-based reliability and risk assessment are compared with the ground truth (i.e., OPF-based reliability and risk assessment). The procedure for Dataset II generation and GNN-based reliability and risk assessment evaluation is illustrated in Fig. 2.
II-E Reliability and risk assessment
The systematic risk assessment framework for grid-level QoIs proposed by Stover et al. [9] is employed for evaluating and communicating the hours-ahead operational risk. Specifically, risk is defined by a risk triplet: a credible adverse event, the likelihood of the event occurring and the consequence of its occurrence. Level 2 and 3 metrics defined in [9] are considered in this work. Level 2 metrics quantify the probability of failure corresponding to a chosen failure mode (reserve inadequacy, loss of load, etc.). Level 3 metrics consider the consequence cost of the adverse event in addition to the probability of failure. We consider reserve inadequacy as the failure mode for zonal and system level reliability and risk assessment. At the transmission line level, we consider flow beyond a certain threshold as the failure event. This event is a surrogate for transmission line overloading.
II-E1 Reserve adequacy
To ascertain operating reserve adequacy, we use the minimum reserve requirement (MRR) as the reliability threshold. MRR is typically set as the generation capacity of the single largest generator in the power grid [9]. Therefore, the probability of reserve inadequacy is given by:
(11) |
and is computed as
(12) |
where denotes the number of samples, if holds for -th sample and otherwise. Note that MRRs at system and zone levels are considered in this work.
Next, risk is computed as the product of probability of failure and the consequence (monetary cost). A constant consequence cost, , is assumed for insufficient reserve; thus the corresponding risk can be calculated as:
(13) |
II-E2 Branch overloading
The risk assessment framework developed by Stover et al. [9] is extended to include reliability and risk quantification for transmission line flow. This extension can help grid operators identify the risk of congestion in the power grid. The electric current in one or more branches exceeding a certain threshold is considered as a failure event for this purpose. The probability of overloading branch is defined as:
(14) |
and computed as
(15) |
where with denoting the number of branches. if holds for branch in the -th sample, otherwise 0. Moreover, the conditional probability of overloading a transmission line , given that transmission line is overloaded, is also considered. This probability is defined and computed as:
(16) |
where and . represents the set of samples with .
The overall risk of branch overloading, given that branch is overloaded, can computed by combining Eq. 14 and 16 as:
(17) |
where and are the consequence costs of overloading of lines and respectively. This formula includes the probability of overloading branch and the conditional probability of overloading any other branch, given that branch is overloaded; thus it provides a meaningful estimate of overall branch overloading risk. This overall risk can be used by operators to identify the most critical branches in the grid, given a probabilistic forecast of stochastic grid variables.
In summary, this section outlines the procedure of using GNN surrogates to estimate power grid operational reliability and risk. This includes GNN surrogate development, training/test data generation and reliability/ risk quantification. A system-level reliability and risk assessment framework is extended to branch level to account for line flow constraints. The performance of GNN surrogates is evaluated based on both simple (uniform) and complex (realistic) probabilistic forecasts of grid variables.
III Numerical Experiments
The proposed GNN surrogate-based risk quantification method is evaluated on four benchmark power grids: Case118 [49], Case300 [50], Case1354pegase [51] and Case2848rte [52]. Without loss of generality, we consider wind power as renewable source and approximately 20% generators are assumed to be wind turbines. Due to the low cost and environmental friendliness, renewable energy sources, whenever available, would preferably be fully deployed to meet demand in real-world application. To enforce this, we always use the wind generator output first towards meeting the demand, and other sources are deployed to satisfy the remaining demand.
For training data and testing Dataset I, bus-level load and wind power generation are randomly drawn from independent and identical uniform distributions. A sample of the stochastic variables is given as input to the numerical solver Pandapower to obtain the OPF solution (bus-level, branch-level and system-level QoIs). A total of 1000 samples (70% for training and 30% for Dataset I) are generated using the OPF solver. Although these samples share the same grid topology, they are distinct graphs because feature vector of each node is different. Supervised learning is used during the training process and the hyper-parameters are carefully tuned to promote model performance.
For the synthesis of Dataset II, the grids are partitioned into zones, and each zone contains a specified number of load buses and generators (both renewable and other dispatchable generators). Following the procedure outlined in Section II-D, the joint PDF approximation is established using marginal PDFs via Eq. 8 and 9. We consider demand following a truncated normal (TN) distribution, while wind power generation is converted from wind speed which is assumed to follow a Weibull (WB) distribution. The conversion follows the power rating curve in [5]. Details of zonal partitioning of the four grids, zonal correlations, the parameters used to define the marginal distributions, the training loss versus the number of training samples, as well as other important details of the numerical experiments can be found in the Supplementary Information (SI) provided with this article. Note that the training loss behavior suggests that it is not necessary to use more samples in the training process.
IV Results
The prediction accuracy of GNN surrogates for bus-level, branch-level and system-level QoIs is quantified using relative error with respect to a reference solution, which is obtained using numerical OPF solver Pandapower [53]. The accuracy of GNN-based reliability and risk quantification w.r.t. reserve adequacy at the system and zonal level is examined, while twenty critical branches are selected to demonstrate the results of branch overloading analysis.
IV-A GNN surrogate prediction accuracy
Magnitude () | [0.1, 1) | [1, 10) | [10, 100) | [100, 300) | ||||
(MW) | ||||||||
Grids () | PG | PF | PG | PF | PG | PF | PG | PF |
Case118 | 25.5 | 40.3 | 2.3 | 3 | 1.5 | 1.7 | 0.5 | 0.6 |
Case300 | 24.2 | 38.7 | 2.1 | 2.5 | 1.8 | 1.5 | 0.3 | 0.4 |
Case1354 | 23.7 | 42.5 | 2.6 | 3 | 2 | 2.1 | 1 | 0.8 |
Case2848 | 26.1 | 36.2 | 2.2 | 2.7 | 2.5 | 2.3 | 1.3 | 1.5 |
We first evaluate GNN prediction for bus-level (power generation, or PG) and branch-level (power flow, or PF) QoIs. The results are shown in Table I. In Table I, the maximum relative percentage error is reported for four magnitudes of reference (true, or OPF-based) values, i.e., [0.1, 1), [1, 10), [10, 100), and [100, 300] MW. For power generation and flow magnitude ranging between 0.1 and 1, the maximum prediction error are and for active power and transmission line flow prediction, respectively. This is because a small magnitude of reference value results in high relative error; however, note that these correspond to small absolute error. Such errors are not likely to cause constraint violation and significant discrepancy in the objective (cost) function. With regard to the other three categories, maximum prediction error drops below . Taking the active power prediction in Case2848 as example, the maximum error declines to when the magnitude is in the range [10, 100), and further drops to in the range [100, 300). Similar results are observed for other grids. These maximum percentage errors demonstrate that GNN can provide accurate prediction for bus-level and branch-level predictions.
System-level prediction results are summarized in Table II. The mean relative error in reserve prediction is for all grids. Even lower error () is observed for load shedding prediction. The average error in cost prediction ranges from to for different grids. For comparison, results from recent DC OPF surrogate modeling investigations are also shown in Table II. Slightly better accuracy has been reported in [54] and [50]. However, these previous studies have not reported results for the larger grids, Case1354 and Case2848. Our work is mainly concerned with risk quantification and not with decision-making, hence the small errors in cost prediction for small grids are acceptable. GNN surrogates demonstrate consistently good performance for grids of different sizes. This is particularly important for practical usage. Overall the results suggest that GNN surrogates are capable of predicting system-level QoIs with excellent accuracy.
The efficiency (speed up) achieved using GNN surrogates compared to the original OPF model is also shown in Table II. For small grids (i.e., Case118 and Case300), GNN surrogates are 250 and 300 times faster than numerical DC OPF solver. Similar speedup has been reported by previous studies [54, 50]. For the two large grids, the GNN surrogate model achieves more than 500 times speed up compared to the original PDF. These results encourage investigation regarding the accuracy of reliability/risk estimates obtained using GNN surrogates. If this speed up can support sufficiently accurate reliability/risk estimation, then the proposed risk quantification methodology could be used by grid operators for real-time decision-making.
Grid | Mean relative % error | Mean relative % error, Cost | Speed up | |||||||
Reserve | Shedding | [50] | [25] | [54] | This work | [50] | [25] | [54] | This work | |
Case118 | 0.20 | 0.15 | 0.2 | 0.56 | 0.1 | 0.2 | 281 | 208 | 350 | 250 |
Case300 | 0.20 | 0.17 | 0.1 | 0.66 | 0.2 | 0.3 | 318 | 126 | 200 | 300 |
Case1354 | 0.21 | 0.17 | 0.3 | 500 | ||||||
Case2848 | 0.22 | 0.18 | 0.4 | 800 |
IV-B GNN surrogates for reliability and risk assessment
IV-B1 Reserve adequacy
Reliability assessment results for system-level reserve adequacy are shown in Table III. The GNN surrogate-based prediction of probability of inadequate reserve demonstrates excellent accuracy. Taking Case118 as example, zone III is the most vulnerable to reserve shortage with , followed by zones II and I at 0.16 and 0.12. The corresponding GNN-based predictions are 0.20, 0.11 and 0.16, respectively. Risk assessment results are reported in Table IV. GNN-based risk estimates also show remarkable agreement with the OPF-based risk estimate. The reliability/risk estimation accuracy and speed up (Table II) indicate that GNN surrogates can enable MC sampling-based real-time operational risk estimation by quickly providing thousands of (approximate) OPF solutions, given the updated forecast of the stochastic grid variables.
Grid | I | II | III | IV | V | VI | VII | VIII | IX | X | XI | XII | XIII | XIV | XV | XVI | |
Case118 | 0.12 | 0.16 | 0.21 | ||||||||||||||
0.11 | 0.16 | 0.20 | |||||||||||||||
Case300 | 0.15 | 0.74 | 0.26 | 0.53 | |||||||||||||
0.14 | 0.75 | 0.27 | 0.53 | ||||||||||||||
Case1354 | 0.85 | 0.18 | 0.05 | 0.36 | 0.28 | 0.53 | 0.31 | 0.30 | |||||||||
0.84 | 0.17 | 0.06 | 0.36 | 0.28 | 0.54 | 0.31 | 0.30 | ||||||||||
Case2848 | 0.59 | 0.90 | 0.89 | 0.82 | 0.04 | 0.69 | 0.38 | 0.52 | 0.66 | 0.19 | 0.27 | 0.72 | 0.78 | 0.85 | 0.78 | 0.04 | |
0.58 | 0.90 | 0.89 | 0.81 | 0.04 | 0.70 | 0.38 | 0.51 | 0.66 | 0.19 | 0.28 | 0.73 | 0.78 | 0.85 | 0.78 | 0.04 |
Grid | I | II | III | IV | V | VI | VII | VIII | IX | X | XI | XII | XIII | XIV | XV | XVI | |
Case118 | 1.10 | 1.58 | 2.44 | ||||||||||||||
1.04 | 1.49 | 2.40 | |||||||||||||||
Case300 | 2.15 | 2.87 | 4.16 | 2.06 | |||||||||||||
2.09 | 2.78 | 4.12 | 2.09 | ||||||||||||||
Case1354 | 23.5 | 10.9 | 16.9 | 22.9 | 15.7 | 29.0 | 13.2 | 17.8 | |||||||||
23.9 | 10.9 | 15.6 | 21.8 | 16.3 | 28.0 | 12.8 | 16.0 | ||||||||||
Case2848 | 47.6 | 56.9 | 56.7 | 54.5 | 31.1 | 50.8 | 41.4 | 45.6 | 49.7 | 35.8 | 38.2 | 51.6 | 53.5 | 55.5 | 53.3 | 31.1 | |
46.1 | 57.9 | 55.7 | 53.5 | 32.5 | 52.6 | 41.6 | 44.3 | 50.8 | 35.8 | 38.7 | 53.0 | 53.3 | 55.5 | 54.0 | 31.7 |
IV-B2 Branch overloading
Results of branch overloading analysis are shown in Fig. 3. Overall, GNN surrogates provide accurate prediction for the probability of overloading in individual transmission line. Herein we provide detailed analysis for the results of Case118. It is noticed that there are two branches with , indicating they will always approach the maximum allowed capacity and thus operators should give special attention to these lines to ensure the safety of grid operation. There are four branches with , and they have been identified as such by the GNN surrogate-based assessment. The probability of multiple branch overloading is shown in Fig. 4. Given a heavily loaded critical branch, the conditional probability of overloading all the remaining branches is calculated. Overall, the GNN predictions are in good agreement with the ground truth. It is noticed that overloading does not happen for most branches as the value of is almost zero. Moreover, there are a few branches (e.g., ) that experience heavier loading more often than other branches. It is therefore necessary to monitor the flow in these lines during daily operation of the grid.



The overall risk of branch overloading, as defined in Eq. 16, is also shown in Fig. 5. It is observed that ranges between and . Interestingly, it is found that branches with the highest (marginal) overloading probability do not carry the highest overall (system-wide) overloading risk. This indicates that there will be significant risk of overall (system-wide) branch overloading if the branches that rarely experience overloading experience high power flow. The comparison of OPF-based and GNN-based risk estimates indicates that GNN surrogates are sufficiently accurate for predicting this complex (conditional) risk, and can be used for real-time risk assessment.
IV-B3 GNN surrogate accuracy for different grid forecasts
The GNN surrogate trained using the training data with simple underlying probability distributions (uniform, uncorrelated variables) needs to be exercised for grid variable forecasts that follow more realistic probability distributions (non-uniform, correlated variables) for reliability and risk assessment. It is thus important to quantify the error in reliability or risk estimation as a function of a measure of distance between the training and the forecast distributions. To this end, multiple joint forecast distributions are generated by modifying the marginal distribution parameters. Samples of correlated grid variables are obtained from each of these distributions and are used to quantify reliability and risk. The similarity between the training data and a candidate (forecast) supply/demand regime (as represented by the corresponding joint probability distributions) is quantified using a statistical distance metric (). If the samples of a random vector are given by and those of a random vector are given by , then is defined as:
(18) |
where and are the number of samples of and , respectively, denotes the expected value, and is a vector norm. A higher value of indicates a higher degree of dissimilarity between and . In the context of GNN surrogate evaluation, represents training data whereas denotes testing data. The (ensemble-based) distance () between GNN training data and the forecast distribution is computed using Eq. 18. The mean absolute percentage error (MAPE) in the estimation of branch overloading probability for different values of is shown in Fig. 6. The MAPE % when , however, the value of MAPE gradually increases after that till it reaches about % for . The methodology used in this exercise could be implemented by grid operators to quantify the error bounds on their GNN-based real-time risk estimates.

V Conclusion
We investigated the utility of DC OPF surrogate modeling for power grid operational risk assessment. We developed a methodology for evaluating such risk surrogates and demonstrated the methodology for GNN surrogates. To this end, we solved a large number of optimal power flow (OPF) problems to obtain the surrogate model training data, and learned multiple GNN surrogates that predict bus-level, branch-level and system-level QoIs. We considered inadequate zonal and system reserve, as well as branch overloading as the relevant failure modes for reliability and risk assessment. The surrogate model performance for predicting the QoI values as well as for quantifying the reliability and risk was assessed using two types of test datasets. The first dataset used simple statistical description (uniform, uncorrelated) for stochastic grid variables (renewable generation and load). This is the prevailing practice in literature on OPF surrogate models. The second dataset used realistic, complex statistical description of stochastic grid variables. Specifically, we used realistic marginal probability distributions and employed a Gaussian copula to capture the dependence structure of stochastic grid variables. This has not been previously reported in the power grid literature, and GNN surrogate performance evaluation under this complicated joint (forecast) distribution is needed before real-world operational risk assessment of power grids. We demonstrated the proposed GNN surrogate-based reliability and risk estimation methodology on the Case118, Case300, Case1354pegase and Case2848rte power grids. By comparing OPF-based and GNN-based reliability and risk estimation, we showed that GNN surrogates can provide accurate estimate of the grid’s operational risk at a desired time instant and are also capable of dealing with correlations between variables in distinct zones. We also developed a methodology that could be used to estimate the error in surrogate-based reliability and risk estimation as a function of the statistical distance between training data and grid forecast data. The excellent accuracy and lower computational cost of the GNN surrogates as compared to numerical OPF solvers indicate that GNNs can be good surrogates for computationally expensive numerical OPF solvers, and could be deployed for real-time risk assessment.
The proposed methodology has only considered static reliability and risk assessment, for a given unit commitment. It can be extended to capture temporal evolution of grid variables. To that end, it is necessary to examine generator on/off status, ramping rate, wear and tear cost, etc., and consider the change in committed generator units. In addition, advanced learning methods such as reinforcement learning maybe helpful to improve GNN prediction accuracy, especially for branch-level and system-level QoIs, and thus enhance real-time reliability and risk quantification.
VI Acknowledgement
This work is partly funded by ARPA-E PERFORM award DE-AR0001280. The support is gratefully acknowledged.
References
- [1] M. Milligan, P. Donohoo, D. Lew, E. E. Kirby, B. Holttinen, H. Lannoye, E. Flynn, D. O’malley, M. Miller, and N. Eriksen, “Operating reserves and wind power integration: An international comparison,” National Renewable Energy Lab.(NREL), 2010.
- [2] Ž. B. Rejc and M. Čepin, “Estimating the additional operating reserve in power systems with installed renewable energy sources,” Int. J. Electr. Power Energy Syst., vol. 62, pp. 654–664, Nov. 2014.
- [3] P. J. Heptonstall and R. J. K. Gross, “A systematic review of the costs and impacts of integrating variable renewables into power grids,” Nat. Energy, vol. 6, no. 1, pp. 72–83, Nov. 2020.
- [4] Q. P. Zheng, J. Wang, and A. L. Liu, “Stochastic optimization for unit commitment—a review,” IEEE Trans. Power Syst., vol. 30, no. 4, pp. 1913–1924, Jul. 2015.
- [5] Y.-Y. Hong and G. F. D. G. Apolinario, “Uncertainty in unit commitment in power systems: A review of models, methods, and applications,” Energies, vol. 14, no. 20, p. 6658, Oct. 2021.
- [6] H. Holttinen, M. Milligan, E. Ela, N. Menemenlis, J. Dobschinski, B. Rawn, R. J. Bessa, D. Flynn, E. Gomez Lazaro, and N. Detlefsen, “Methodologies to determine operating reserves due to increased wind power,” in 2013 IEEE Power & Energy Society General Meeting. IEEE, 2013.
- [7] B. Mohandes, M. S. E. Moursi, N. Hatziargyriou, and S. E. Khatib, “A review of power system flexibility with high penetration of renewables,” IEEE Trans. Power Syst., vol. 34, no. 4, pp. 3140–3155, Jul. 2019.
- [8] E. Ela, B. Kirby, E. Lannoye, M. Milligan, D. Flynn, B. Zavadil, and O. Malley, Evolution of operating reserve determination in wind power integration studies. InIEEE PES general meeting. IEEE, 2010.
- [9] O. Stover, P. Karve, and S. Mahadevan, “Reliability and risk metrics to assess operational adequacy and flexibility of power grids,” Reliab. Eng. Syst. Saf., vol. 231, no. 109018, p. 109018, Mar. 2023.
- [10] M. Čepin, Assessment of power system reliability: methods and applications. Springer Science & Business Media, 2011.
- [11] P. Dehghanian and M. Kezunovic, “Probabilistic decision making for the bulk power system optimal topology control,” IEEE Transactions on Smart Grid, vol. 7, no. 4, pp. 2071–2081, 2016.
- [12] G. Shafiullah, A. M. Oo, A. S. Ali, and P. Wolfs, “Potential challenges of integrating large-scale wind energy into the power grid–a review,” Renewable and sustainable energy reviews, vol. 20, pp. 306–321, 2013.
- [13] J. Momoh, R. Koessler, M. Bond, B. Stott, D. Sun, A. Papalexopoulos, and P. Ristanovic, “Challenges to optimal power flow,” IEEE Transactions on Power systems, vol. 12, no. 1, pp. 444–455, 1997.
- [14] H. W. Dommel and W. F. Tinney, “Optimal power flow solutions,” IEEE Transactions on power apparatus and systems, no. 10, pp. 1866–1876, 1968.
- [15] A. G. Bakirtzis and P. N. Biskas, “A decentralized solution to the dc-opf of interconnected power systems,” IEEE Transactions on Power Systems, vol. 18, no. 3, pp. 1007–1013, 2003.
- [16] P. N. Biskas, A. G. Bakirtzis, N. I. Macheras, and N. K. Pasialis, “A decentralized implementation of dc optimal power flow on a network of computers,” IEEE Transactions on Power Systems, vol. 20, no. 1, pp. 25–33, 2005.
- [17] Y. Xu, J. Hu, W. Gu, W. Su, and W. Liu, “Real-time distributed control of battery energy storage systems for security constrained dc-opf,” IEEE Transactions on Smart Grid, vol. 9, no. 3, pp. 1580–1589, 2016.
- [18] Y. Wang, L. Wu, and S. Wang, “A fully-decentralized consensus-based admm approach for dc-opf with demand response,” IEEE Transactions on Smart Grid, vol. 8, no. 6, pp. 2637–2647, 2016.
- [19] D. Biagioni, P. Graf, X. Zhang, A. S. Zamzam, K. Baker, and J. King, “Learning-accelerated admm for distributed dc optimal power flow,” IEEE Control Systems Letters, vol. 6, pp. 1–6, 2020.
- [20] L. Yang, J. Luo, Y. Xu, Z. Zhang, and Z. Dong, “A distributed dual consensus admm based on partition for dc-dopf with carbon emission trading,” IEEE Transactions on Industrial Informatics, vol. 16, no. 3, pp. 1858–1872, 2019.
- [21] T. Erseghe, “Distributed optimal power flow using admm,” IEEE transactions on power systems, vol. 29, no. 5, pp. 2370–2380, 2014.
- [22] M. P. Abraham and A. A. Kulkarni, “Admm-based algorithm for solving dc-opf in a large electricity network considering transmission losses,” IET Generation, Transmission & Distribution, vol. 12, no. 21, pp. 5811–5823, 2018.
- [23] S. Pineda, J. M. Morales, and A. Jiménez-Cordero, “Data-driven screening of network constraints for unit commitment,” IEEE Transactions on Power Systems, vol. 35, no. 5, pp. 3695–3705, 2020.
- [24] D. Deka and S. Misra, “Learning for dc-opf: Classifying active sets using neural nets,” in 2019 IEEE Milan PowerTech. IEEE, 2019, pp. 1–6.
- [25] T. Zhao, X. Pan, M. Chen, A. Venzke, and S. H. Low, “Deepopf+: A deep neural network approach for dc optimal power flow for ensuring feasibility,” in 2020 IEEE International Conference on Communications, Control, and Computing Technologies for Smart Grids (SmartGridComm). IEEE, 2020, pp. 1–6.
- [26] A. Velloso and P. Van Hentenryck, “Combining deep learning and optimization for preventive security-constrained dc optimal power flow,” IEEE Transactions on Power Systems, vol. 36, no. 4, pp. 3618–3628, 2021.
- [27] Y. Chen, L. Zhang, and B. Zhang, “Learning to solve dcopf: A duality approach,” Electric Power Systems Research, vol. 213, p. 108595, 2022.
- [28] L. Zhang, Y. Chen, and B. Zhang, “A convex neural network solver for dcopf with generalization guarantees,” IEEE Transactions on Control of Network Systems, vol. 9, no. 2, pp. 719–730, 2021.
- [29] R. Dobbe, O. Sondermeijer, D. Fridovich-Keil, D. Arnold, D. Callaway, and C. Tomlin, “Toward distributed energy services: Decentralizing optimal power flow with machine learning,” IEEE Transactions on Smart Grid, vol. 11, no. 2, pp. 1296–1306, 2019.
- [30] X. Lei, Z. Yang, J. Yu, J. Zhao, Q. Gao, and H. Yu, “Data-driven optimal power flow: A physics-informed machine learning approach,” IEEE Transactions on Power Systems, vol. 36, no. 1, pp. 346–354, 2020.
- [31] M. K. Singh, V. Kekatos, and G. B. Giannakis, “Learning to solve the ac-opf using sensitivity-informed deep neural networks,” IEEE Transactions on Power Systems, vol. 37, no. 4, pp. 2833–2846, 2021.
- [32] X. Pan, M. Chen, T. Zhao, and S. H. Low, “Deepopf: A feasibility-optimized deep neural network approach for ac optimal power flow problems,” IEEE Systems Journal, vol. 17, no. 1, pp. 673–683, 2022.
- [33] F. Hasan, A. Kargarian, and A. Mohammadi, “A survey on applications of machine learning for optimal power flow,” in 2020 IEEE Texas Power and Energy Conference (TPEC). IEEE, 2020, pp. 1–6.
- [34] Y. Zhang, P. M. Karve, and S. Mahadevan, “Power grid operational risk assessment using graph neural network surrogates,” arXiv preprint arXiv:2311.12309, 2023.
- [35] V. Bolz, J. RueB, and A. Zell, “Power flow approximation based on graph convolutional networks,” in 2019 18th IEEE International Conference On Machine Learning And Applications (ICMLA). IEEE, Dec. 2019.
- [36] Q. Yang, A. Sadeghi, G. Wang, G. B. Giannakis, and J. Sun, “Power system state estimation using gauss-newton unrolled neural networks with trainable priors,” in 2020 IEEE International Conference on Communications, Control, and Computing Technologies for Smart Grids (SmartGridComm). IEEE, Nov. 2020.
- [37] D. Wang, K. Zheng, Q. Chen, G. Luo, and X. Zhang, “Probabilistic power flow solution with graph convolutional network,” in 2020 IEEE PES Innovative Smart Grid Technologies Europe (ISGT-Europe). IEEE, Oct. 2020.
- [38] P. Xu, Y. Pei, X. Zheng, and J. Zhang, “A simulation-constraint graph reinforcement learning method for line flow control,” in 2020 IEEE 4th Conference on Energy Internet and Energy System Integration (EI2). IEEE, Oct. 2020.
- [39] W. Miao, H. Wu, P. Chen, and J. Jing, “Intelligent auxiliary operation and maintenance system of power communication network based on knowledge graph,” J. Phys. Conf. Ser., vol. 1684, no. 1, p. 012105, Nov. 2020.
- [40] J. Q. James, D. J. Hill, L. Vo, and Y. Hou, “Synchrophasor recovery and prediction: A graph-based deep learning approach,” IEEE Internet of Things Journal, vol. 6, no. 5, pp. 7348–7359, 2019.
- [41] C. Duan, W. Fang, L. Jiang, L. Yao, and J. Liu, “Distributionally robust chance-constrained approximate AC-OPF with wasserstein metric,” IEEE Trans. Power Syst., vol. 33, no. 5, pp. 4924–4936, Sep. 2018.
- [42] X. Pan, M. Chen, T. Zhao, and S. H. Low, “Deepopf: A feasibility-optimized deep neural network approach for ac optimal power flow problems,” 2020.
- [43] O. D. Montoya, W. Gil-González, and A. Garces, “Sequential quadratic programming models for solving the OPF problem in DC grids,” Electric Power Syst. Res., vol. 169, pp. 18–23, Apr. 2019.
- [44] H. Holttinen, M. Milligan, E. Ela, N. Menemenlis, J. Dobschinski, B. Rawn, R. J. Bessa, D. Flynn, E. Gomez Lazaro, and N. Detlefsen, “Methodologies to determine operating reserves due to increased wind power,” in 2013 IEEE Power & Energy Society General Meeting. IEEE, 2013.
- [45] G. Li, C.-C. Liu, C. Mattson, and J. Lawarree, “Day-ahead electricity price forecasting in a grid environment,” IEEE Trans. Power Syst., vol. 22, no. 1, pp. 266–274, Feb. 2007.
- [46] M. Aien, M. Fotuhi-Firuzabad, and M. Rashidinejad, “Probabilistic optimal power flow in correlated hybrid wind–photovoltaic power systems,” IEEE Trans. Smart Grid, vol. 5, no. 1, pp. 130–138, Jan. 2014.
- [47] M. B. Amor, E. Billette de Villemeur, M. Pellat, and P.-O. Pineau, “Influence of wind power on hourly electricity prices and GHG (greenhouse gas) emissions: Evidence that congestion matters from ontario zonal data,” Energy (Oxf.), vol. 66, pp. 458–469, Mar. 2014.
- [48] J. Tastu, P. Pinson, and H. Madsen, “Space-time scenarios of wind power generation produced using a gaussian copula with parametrized precision matrix,” Technical University of Denmark, Tech. Rep., 2013.
- [49] R. D. Zimmerman, C. E. Murillo-Sánchez, and D. Gan, “Matpower: A matlab power system simulation package,” Manual, Power Systems Engineering Research Center, Ithaca NY, vol. 1, pp. 10–7, 1997.
- [50] X. Pan, T. Zhao, M. Chen, and S. Zhang, “Deepopf: A deep neural network approach for security-constrained dc optimal power flow,” IEEE Transactions on Power Systems, vol. 36, no. 3, pp. 1725–1735, 2020.
- [51] S. Fliscounakis, P. Panciatici, F. Capitanescu, and L. Wehenkel, “Contingency ranking with respect to overloads in very large power systems taking into account uncertainty, preventive, and corrective actions,” IEEE Transactions on Power Systems, vol. 28, no. 4, pp. 4909–4917, 2013.
- [52] C. Josz, S. Fliscounakis, J. Maeght, and P. Panciatici, “Ac power flow data in matpower and qcqp format: itesla, rte snapshots, and pegase,” arXiv preprint arXiv:1603.01533, 2016.
- [53] L. Thurner, A. Scheidler, F. Schäfer, J.-H. Menke, J. Dollichon, F. Meier, S. Meinecke, and M. Braun, “pandapower—an open-source python tool for convenient modeling, analysis, and optimization of electric power systems,” IEEE Transactions on Power Systems, vol. 33, no. 6, pp. 6510–6521, 2018.
- [54] H. Wu and Z. Xu, “Fast dc optimal power flow based on deep convolutional neural network,” in 2022 IEEE 5th International Electrical and Energy Conference (CIEEC). IEEE, 2022, pp. 2508–2512.