This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Deep Learning Hydrodynamic Forecasting for Flooded Region Assessment in Near-Real-Time
(DL Hydro-FRAN)

Francisco Haces-Garcia National Center for Airborne Laser Mapping Department of Civil and Environmental Engineering, University of Houston Corresponding Author: fhacesgarcia@uh.edu Natalya Maslennikova National Center for Airborne Laser Mapping Department of Civil and Environmental Engineering, University of Houston Craig L. Glennie National Center for Airborne Laser Mapping Department of Civil and Environmental Engineering, University of Houston Hanadi S. Rifai Department of Civil and Environmental Engineering, University of Houston Vedhus Hoskere Department of Civil and Environmental Engineering, University of Houston Nima Ekhtari National Center for Airborne Laser Mapping Department of Civil and Environmental Engineering, University of Houston
(April 2023)

1 Abstract

Hydrodynamic flood modeling improves hydrologic and hydraulic prediction of storm events. However, the computationally intensive numerical solutions required for high-resolution hydrodynamics have historically prevented their implementation in near-real-time flood forecasting. This study examines whether several Deep Neural Network (DNN) architectures are suitable for optimizing hydrodynamic flood models. Several pluvial flooding events were simulated in a low-relief high-resolution urban environment using a 2D HEC-RAS hydrodynamic model. These simulations were assembled into a training set for the DNNs, which were then used to forecast flooding depths and velocities. The DNNs’ forecasts were compared to the hydrodynamic flood models, and showed good agreement, with a median RMSE of around 2 mm for cell flooding depths in the study area. The DNNs also improved forecast computation time significantly, with the DNNs providing forecasts between 34.2 and 72.4 times faster than conventional hydrodynamic models. The study area showed little change between HEC-RAS’ Full Momentum Equations and Diffusion Equations, however, important numerical stability considerations were discovered that impact equation selection and DNN architecture configuration. Overall, the results from this study show that DNNs can greatly optimize hydrodynamic flood modeling, and enable near-real-time hydrodynamic flood forecasting.

2 Introduction

Flooding is an intensifying hazard that poses significant risks to life and property. Over 73% of US-based flooding property damage occurs in cities [1], where rapid urbanization complicates flood assessment by regularly deprecating regulatory floodmaps and exacerbating storm effects [2, 3, 4]. Climate change has also caused the intensification of storm events [5], which has given rise to further concerns of aggravating urban flooding. Flood modeling is a useful tool to assess the potential impacts of flood events for which real-world data is unavailable or inadequate. Thus, the development of reliable, timely, and accurate flood models is essential to mitigating the hazardous impacts of storm events.

Flood modeling techniques are varied, with different methods implemented depending on data availability and analysis scale. Geospatial terrain-based analyses are common within the literature for the analysis of large areas. Such methods include Height Above Nearest Drainage techniques [6, 7] and simple bathtub models [8, 9] to assess the flooding extents of a given storm event. However, these methods have been found to be inaccurate in low-relief environments, in which a small increase in elevation can cause a large increase in inundated area [10, 11, 12]. Hydrodynamic flood modeling, the study of fluids in motion, generally improves hydrologic and hydraulic prediction in low-relief high-resolution settings with complex hydrodynamics [11, 13].

Hydrodynamic modeling relies on numerically approximating Systems of Partial Differential Equations (SPDEs) to spatially evaluate flooding. The spatial discretization of hydrodynamic models, however, is a computationally laborious task [14] that scales exponentially with the resolution of models [15], and linearly with flow velocity (as demonstrated by the Courant–Friedrichs–Lewy condition for timestepping in fluid dynamics[16]). Since modern advances in data collection (particularly in remote sensing) have enabled fine scale hydraulic modeling, efficiently solving these spatial discretizations is crucial to enable efficient flood forecasting and modeling in rapidly changing urban environments. This study examines whether Deep Neural Networks can be implemented to bridge this gap.

Recent advances in computational capabilities and algorithm design have burgeoned Deep Neural Networks (DNNs) for hydrologic and hydraulic prediction. Literature has shown the usefulness of various DNN architectures to solve specific problems, including within hydrology and hydrodynamics [17]. While Feedforward DNNs are useful for a wide variety of general deterministic tasks, Bayesian DNNs are homologously useful for stochastic regression. Notably for flood modeling as a Physics-based process, Physics-based DNNs (PhyDNNs) encode process-specific information into a DNN, and have been shown to be useful tools for various physical problems [18]. Moreover, Long-Short Term Memory models have been useful in timeseries prediction, particularly in hydrology (e.g., [19, 20]). Ample literature exists on the use of DNNs for the prediction of riverine streamflow in reaches [21, 22]. However, the application of DNNs to high-resolution 2D hydrodynamics remains largely unstudied, despite continued advances in the design, applicability, and accessibility of DNNs. To address this research gap, this study examines the effectiveness of various types of DNNs for high-resolution 2D hydrodynamics.

3 Methods

3.1 Hydrodynamic modeling

One of the most significant challenges to develop DNN-based 2D hydrodynamics are the vast data requirements for training a DNN [23, 24, 25]. Data collection during flood events largely relies on either in-situ gauges or satellite remote sensing (such as syntethic aperture radar imagery). Unfortunately, in-situ gauges are generally only applied in riverine systems, limiting their usefulness to overland flow modeling. Moreover, satellite remote sensing datasets suffer from temporal and spatial resolution issues, and cloud cover during storm events often prevents the application of many optical sensors. To mitigate these data sparsity issues, a series of 2D hydrodynamic models were used over the study area to simulate rainfall events and provide training observations for the DNNs.

Hydrodynamic flood models commonly solve the Shallow Water Equations (SWEs), also known as the Saint Venant Equations [26], to simulate a storm event. The SWEs (which are an SPDE) are derived from the Navier-Stokes equations with several simplifying assumptions for the case of hydrodynamic flow modeling. Such assumptions commonly include fluid incompressibility and constant momentum in the z-axis. A form of the SWEs is shown in equations 1, 2, and 3, where hh is water elevation, uu and vv are velocity in the xx and yy directions, mm is mass, and gg is the gravitational acceleration. SS and FF are the momentum sources and sinks in the xx and yy directions, which can include friction components, Coriolis force, and turbulence effects, among others. Equation (1) represents a mass balance, and equations (2) and (3) demonstrate the momentum balance in the x and y directions.

ht+(hu)x+(hv)x=mt\frac{\partial h}{\partial t}+\frac{\partial(hu)}{\partial x}+\ \frac{\partial(hv)}{\partial x}=\frac{\partial m}{\partial t} (1)
(hu)t+x(hu2+g2h2)+(huv)y=S\frac{\partial(hu)}{\partial t}+\frac{\partial}{\partial x}\left(hu^{2}+\frac{g}{2}h^{2}\right)+\ \frac{\partial(huv)}{\partial y}=S (2)
(hv)t+(huv)x+y(hv2+g2h2)=F\frac{\partial(hv)}{\partial t}+\frac{\partial(huv)}{\partial x}+\ \frac{\partial}{\partial y}\left(hv^{2}+\frac{g}{2}h^{2}\right)=F (3)

The Hydrologic Engineering Center’s River Analysis System (HEC-RAS) is one of the state-of-the-art hydrodynamic modeling systems available to the public. It is one of the most widely applied hydrodynamic software packages, and is often implemented to design regulatory floodplains (e.g., [27, 28]). HEC-RAS can solve both 1D and 2D hydrodynamics, with two different formulations available: the Diffusion Wave Equations and the Full Momentum Equations. The Diffusion Wave Equations (DEs) assume that changes in momentum are mainly due to gravity and friction, whereas the Full Momentum Equations (FMEs) add wave propagations, turbulence, wind, and Coriolis forces. Such terms are expressed within SS and FF in Equations 2 and 3, and their specific derivations are available within the HEC-RAS Hydraulic Reference Manual [29]

Numerical instability is a common problem in 2D hydrodynamic models [30], particularly those with large amounts of wetting and drying fronts such as those with low topographic relief [31, 32, 33]. One of the main contributors to instability is the highly nonlinear behavior of hyperbolic SPDEs (such as the SWEs) at the borders of the spatial discretization’s processing cells, which can induce rarefactions, contact discontinuities, and shocks [34]. Nonlinear behavior can cause numerical solvers to lose accuracy, requiring both higher-precision numerical solving schemes and rigorous stability assessments of solutions.

3.2 Study Area

The study area for this research consisted of a highly-urbanized low-relief area in Houston, TX, US, as shown in Figure 1. Two datasets derived from an Airborne Lidar Survey (ALS) collected by the National Center for Airborne Laser Mapping (Houston, TX, US) were used. The first dataset is a Digital Terrain Model with a 1-m resolution, which was gridded from the ALS, then cropped into an approximately square 1km by 1km tile. The second dataset, a high-resolution gridded land cover map, was developed in [35]. From this land cover map, bottom roughness maps (Manning’s n) were obtained using standardized coefficients from [36].

The topography and Manning’s roughness maps were used to develop a 2D hydrodynamic model in HEC-RAS 5.0.7. To standardize the training dataset generation for this study, HEC-RAS was automated using the HECRASController as described in [37]. Experiments were performed with rainfall intensities of one, two, and three inches per hour for the entire model domain. For additional context, 1.90 and 3.64 in/hrin/hr for 11 hour represent storm events with average recurrence intervals of 1 and 10 years for the selected study area respectively [38]. Model outputs were exported every 5 seconds, and training datasets were created using both the Diffusion Wave and Full Momentum HEC-RAS formulations. HEC-RAS results are saved as HDF files, which have a nested file structure of sequentially indexed cells having various attributes, including the raw input values, the intermediate solver calculations, and the modeled value of face velocities and water depth at each timsestep.

Refer to caption
Figure 1: Model domain for this study, encompassing the University of Houston Campus in Houston, TX, US. Navigation and satellite views are shown, as well as the elevation and Manning’s roughness datasets used to develop the hydrodynamic models.

3.3 Dataset Construction

Beyond the current state of a given cell (which is comprised of the water depth, and face velocities), each cell’s attributes influence its hydrodynamic behavior. Thus, the input datasets to generate predictions from the DNNs were comprised of the current cell state (referred to as the state vector), and the attributes of the cell and its surroundings (i.e., the topographic and roughness values for each cells and its neighbors). These inputs were supplemented with the ”change” array, which indicates the rainfall intensity for each timestep, along with how far into the future the DNN should predict for each lookahead timestep. The expected results were the cell state (water depth and face velocities) for a given number of lookahead timesteps. The input and output datasets are detailed in Table 1.

Table 1: Description of Input and Output Datasets for Training and Testing of DNNs.
Description Size in Input Size in Output
State Vector Water Depth 1 1*
North Face Velocity 1 1*
South Face Velocity 1 1*
East Face Velocity 1 1*
West Face Velocity 1 1*
Change Array Rainfall Intensity 1 0
Timestep for Future Prediction 1* 0
Attributes Topography of Cell and Neighbors 9 raw + 1 normalized 0
Manning’s Roughness of Cell and Neighbors 9 raw + 1 normalized 0
Slopes (X, Y, X-Y) of Cell and Neighbors 939*3 raw + 2 normalized 0

n = 9810/1000994 cells, with 50% used for training and 50% for validation, and each cell having approximately 2100 timesteps

Input vectors have a shape of (5+2l+49)×1(5+2*l+49)\times 1 and output vectors have a shape of 5l×15*l\times 1.

* per lookahead timestep

3.4 DNN Architectures

Various DNN architectures were compared in both their fitting and predictive capabilities for 2D hydrodynamics. These DNNs were developed in PyTorch, and the tested architectures were selected to provide representative performance results for different types of DNNs. All the DNN code used for this research, as well as the training, testing, and validation datasets, are available upon request.

A feedforward DNN was constructed as shown in Figure 2. Feedforward DNNs can be used for a wide variety of Deep Learning tasks, and are the foundational networks upon which most other architectures are based. To examine whether stochastic prediction could improve fitting and prediction of flood models, bayesian DNNs were also tested. Bayesian DNNs introduce stochastic components to various elements of traditional feedforward DNNs. Equation 4 (which represents the Bayesian DNNs implemented in this research) illustrates such a stochastic component for a DNN layer, with yy being the layer outputs, xx the inputs, μW\mu_{W} and μB\mu_{B} the mean value for the weight and bias, σW\sigma_{W} and σB\sigma_{B} are the variances for these values, and ηr1\eta_{r_{1}} and ηr2\eta_{r_{2}} random numbers chosen every time the layer is activated. The principal operational difference between a feedfoward DNN and a bayesian DNN is that the former is deterministic and the latter is stochastic. Thus, a feedforward DNN will always produce the same output for a given input, but a bayesian DNN’s output may vary. For this study, Bayesian DNNs where implemented through the TorchBNN package [39], and their construction is shown in Figure 2.

A PhyNet-style [40] PhyDNN was also implemented. PhyDNN first incorporates a common layer, which is split into four processing trains (corresponding to mass fluxes in each cardinal direction). These processing trains are then concatenated to calculate the state for each lookahead timestep (vnv_{n}, vsv_{s}, vev_{e}, vwv_{w}, and hh). PhyDNN is a more complex DNN architecture that encodes some physical knowledge of the underlying hydrodynamics into the learning process, and is also tested as a representative Phyiscs-informed DNN. Finally, a Long-Short Term Memory (LSTM) DNN was developed. LSTMs are recurrent DNNs which selectively block pathways during training to simultaneously foster long and short term learning, LSTMs have been shown to be very useful in timeseries prediction, and thus were also tested in this research. The architectures implemented in this study are described in Figure 2, which also shows their respective hyperparameters available for tuning.

y=μW+eσWηr1x+μB+eσBηr2y=\mu_{W}+e^{\sigma_{W}}*\eta_{r_{1}}*x+\mu_{B}+e^{\sigma_{B}}*\eta_{r_{2}} (4)
Refer to caption
Figure 2: DNN Architectures Implemented for in this study

4 Results

4.1 Hydrodynamic Modeling

Figure 3 displays one of the HEC-RAS 2D hydrodynamic models developed to train the DNNs for both the Full Momentum Equations (FMEs) and the Diffusion Equations (DEs). Although the run results are markedly similar throughout most of the study area, higher slope areas have a tendency to differ between FMEs and DEs, as can be appreciated in the northeastern sector of the study area. Because the FMEs are contain more terms to describe hydrodynamics than the DEs, HEC-RAS users are encouraged to use the FMEs where there are differences between the two, where the DEs can be used when both equation sets are similar to optimize computation time [36].

It should be noted that the FMEs are much more computationally intensive, and can lead to more numerically unstable solutions. For training dataset generation, the DEs were able to converge on solutions for all cells at 5 second computation intervals, but the FMEs required a slightly shorter computation timestep for all cells because of 2 numerically unstable cells (out of 1,000,994). Results are reported at 5 second intervals for all cases.

Refer to caption
Figure 3: Comparison of HEC-RAS formulations. Note the different plotting scales for each column.

4.2 Architecture Training

Refer to caption
Figure 4: Performance of different architectures during training. The HEC-RAS equation set has an important impact on the fitting of the DNNs.

Hyperparameter tuning was performed using grid search to ensure each layer in the DNNs had the optimal sizes, and the selected hyperparameters are shown in Table 2. Training was performed using the Adam optimizer [41] on the full training set of stratified samples (n=9810/1000994). After initial training, it was discovered that high-flow cells were preventing the appropriate fitting of dry and near-dry states, worsening DNN prediction. To mitigate this challenge, a refinement step to improve the performance of the DNNs was performed. The DNNs were fine-tuned with a low learning rate and the training dataset was trimmed to low-flow cells (those with less than 50 mm of flood height and 50 mm/s of flow velocity). Each of the DNNs was provided 6 hours of Graphics Processing Unit (GPU) time on a 16 GB NVIDIA V100 for initial training on the full training dataset, and an additional 4 hours for fine-tuning. As is standard practice, the batch sizes were tuned in all cases to maximize the usage of GPU memory for each training epoch.

Table 2: DNN Training Configurations
Batch Size Learning Rates Hyperparameters
BNN 100000 1e-3, 5e-5 d=10, H=1000
DNN 100000 1e-3, 5e-5 d=10, H=1000
LSTM 150000 1e-3, 5e-5 d=5, H=500
PhyDNN 50000 1e-3, 5e-5 d=5, H=1000, dsd_{s} = 5

Analysis of performances for each architecture at the hyperparameter tuning stage revealed that the network sizes at which prediction performance stopped improving (and networks began to overfit) was similar for both the DEs and FMEs. This can be attributed to the similarities between DEs and FMEs as shown in Figure 3, however, in other study areas, the optimal network hyperparameters could differ between equation sets. Prediction capabilities varied for all the DNN architectures, with the FMEs showing lower prediction performance for all the velocity test sets. Moreover, as shown in Figure 4, fitting for water depth was similar in the FMEs and DEs for each DNN architecture. The steep improvement in fit metrics around epoch 40 can be attributed to the fine-tuning step, which enabled the DNNs to better fit the vast majority of cells while simultaneously removing high-error cells from the training dataset.

Starting flood models from completely dry states (as is required for pluvial flood modeling in non-riverine urban environments) can exacerbate numerical instability in flood models. Such effects are present in the HEC-RAS solutions underlying the training dataset. Specifically, both the FMEs and DEs have mass creation issues early in the modelled flow event. To assess whether this numerical instability posed challenges to DNN forecasting, all the DNNs were also trained with a training dataset excluding the first 10 minutes of each simulated event, at which point numerical stability (or steady-state) for mass had been reached.

4.3 Forecasting Capabilities

The forecasting capabilities for each DNN were tested on a 1 in/hr 1-hour rainfall event for the entire study area. The different architecture’s predictions were compared to the HEC-RAS predictions using both Root Mean Squared Error (RMSE) and Normalized Nash-Suttcliffe Efficiency (NNSE) for the entire 1-hour timeseries. Prediction performances for the various experiment runs are shown in Figure 5. The fine-tuning step improved fit in all cases, with improvements in the RMSE of up to an order of magnitude, and in the NNSE up to 50%. The decrease in NNSE for the steady-state networks in most cases suggests that the removal of the initial wetting front from the training set made it more challenging for the various architectures to replicate the shape of the hydrograph at each cell.

The geospatial patterns for the fit metrics of each DNN are shown in Figure 6, which shows the Coefficient of Variance (CVCV) for each cell, calculated as CV=RMSE/μtrueCV={RMSE}/{\mu_{true}}. The CV is shown instead of the RMSE because it provides a relative error performance metric, and is preferred over the NNSE because it is better suited for nearly-dry areas which have low variance in their timeseries. Common themes persist throughout the geospatial patterns, with all DNNs performing well in dry locations, and localized areas of high error.

It should be noted that some of the most significant clusters of high error occur near buildings (see Figure 1). This is likely due to limitations for building handling in the underlying flood model. HEC-RAS offers two options to handle built-up structures in 2D hydrodynamic models: hydraulic breaklines, or terrain conditioning. Hydraulic breaklines restrict flow across the edges of cells, whereas terrain conditioning either raises or lowers built up structures to the elevation of their surroundings. Neither of these accurately represents hydrodynamic phenomena surrounding buildings, and thus, fitting issues are expected around buildings. Some recent literature has explored novel techniques to shift pluvial flows in buildings to gutters [42], however, such an implementation is not currently available in widely distributed hydrodynamic models.

Refer to caption
Refer to caption
Figure 5: Cumulative Density Performance Plots of NNSE (top) and RMSE (bottom) for different architectures during prediction in entire study area.
Refer to caption
Figure 6: Geospatial patterns for Coefficient of Variance for each of the DNN architectures

5 Discussion

The results presented demonstrate the capabilities of DNNs to perform hydrodynamic forecasting of pluvial inundation events. The BNN architecture lags behind in forecasting performance with respect to the DNN, PhyDNN, and LSTM. Results shown in Figure 4 suggest that there are some differences in forecasting capabilities across these last three, however, no significant differences in performance are found by the forecasts in Figure 5. Interestingly, although results shown in Figure 5 suggest only marginal differences between the full training set and the steady-state training set, the geospatial patterns in Figure 6 reveal that the localized errors displayed in the FMEs are reduced by the steady-state dataset for the LSTM and PhyDNN. These are corroborated by the coefficients of variance shown in Table 3, which show that there is an improvement in fitting for the LSTM and PhyDNN by using just numerically stable portions of the hydrodynamic flood models.

Table 3: Extreme statistics for water depth in DNN forecasts within study area
NNSE
(Higher is better)
RMSE Coefficient of Variance
(Lower is better)
Percentiles 90 99 90 99
Training Set Full SS Full SS Full SS Full SS
DEs BNN 0.54 0.50 0.66 0.51 48.17 47.96 70.85 70.76
DEs DNN 0.61 0.61 0.76 0.76 45.88 44.50 69.38 68.92
DEs LSTM 0.60 0.60 0.74 0.75 45.07 46.71 69.66 71.67
DEs PhyDNN 0.62 0.60 0.79 0.77 46.67 48.50 72.48 75.88
FMEs BNN 0.50 0.54 0.50 0.65 47.96 48.17 70.77 70.87
FMEs DNN 0.64 0.62 0.80 0.79 41.10 44.02 71.61 68.64
FMEs LSTM 0.66 0.63 0.83 0.80 43.36 44.80 85.28 73.10
FMEs PhyDNN 0.66 0.66 0.85 0.84 46.36 45.59 84.38 74.69

Colorbar included for clarity, and is stretched to the minimum and maximum of each percentile.

5.1 Computational Efficiency

Hydrodynamic forecasting through the DNNs presented in this manuscript is much faster than conventional hydrodynamic forecasting, as shown in Table 4, chiefly because of the implementation of GPUs. The tested DNN architectures are between 34 and 72 times faster than the HEC-RAS DEs, which themselves are about twice as fast as the HEC-RAS FMEs. These values do not include training time, which does not need to be repeated for every forecasted flood event or location. As discussed above, the computational efficiency of the various DNN architectures greatly depends on the lookahead timesteps. Under ideal conditions, DNN forecast processing time would be approximately inversely proportional to the amount of lookahead timesteps in a forward pass. Since 12 lookahead timesteps on a GPU predict at least 34 times faster than HEC-RAS, it can be surmized that even the edge case of 1 lookahead timestep would provide more efficient forecasts than HEC-RAS models on GPUs, however, as will be discussed, there are important stability considerations to the selection of lookahead timesteps.

Table 4: Processing time of studied hydrodynamic modeling approaches
Method Processing Hardware
Mean Processing
Time (min)
Relative
Speedup
HEC-RAS FMEs Intel Core i7-8700K and Ryzen 7 3700x 409.9 0.51
HEC-RAS DEs 210.7 1.00
BNN 219.0 0.96
DNN 341.9 0.62
LSTM 30.5 6.91
PhyDNN 419.1 0.50
BNN NVIDIA P1000 and RTX 3070 4.0 52.23
DNN 4.4 47.40
LSTM 8.2 25.76
PhyDNN 13.2 15.92

The efficiency benchmarks shown in Table 4 calculated using representative consumer hardware, however, significant speedups would be available if server-grade hardware (such as that used for training) were used for forecasting. For HEC-RAS models, increases in the amount of available processing cores would further exploit multithreading capabilities, improving solution times. Conversely, the DNNs would benefit from GPUs with additional memory for both training and prediction. At any rate, the computational efficiencies of the DNNs on GPUs shown in this study vastly improve what is otherwise available for hydrodynamic forecasting, and enable near real time flood forecasting.

5.2 Lookahead Timesteps for DNNs

As shown in Table 1, all the DNN architectures can be trained to predict a flexible number of lookahead timesteps, which can then be used to forecast any given length of time. The results presented in this manuscript were obtained using 12 DNN lookahead timesteps, meaning each forward pass of the DNN architectures predicts 60 seconds into the future (12timesteps5stimestep12timesteps*5\frac{s}{timestep}). In cases where one forward pass is not enough to meet the desired length of the forecast, the last prediction of the forward pass is successively used to generate subsequent predictions through each DNN architecture.

Selecting the appropriate number of lookahead timesteps poses an interesting problem: it is computationally more efficient to increase the amount of lookahead timesteps such that fewer forward passes are required for a given forecast. However, as shown in Figure 7, doing so has important implications on prediction accuracy. The uncertainty associated with each forward pass compounds throughout forecasts, thus selecting an appropriate lookahead must balance accuracy and computational efficiency considerations.

Refer to caption
Figure 7: Best validation performance by training epoch for various lookahead timesteps in PhyDNN for a low-flow dataset.

Since the performance between the 25s and 60s lookahead were comparable in this study area, whereas the 120s lookahead presented much more erratic performance, 60s was selected for the DNNs. However, the low flow velocities in this study area allow longer timesteps in hydrodynamic models, and thus more stable prediction behavior for the DNNs. In areas with more complex hydrodynamic phenomena, shorter lookaheads may be required to reach acceptable prediction accuracies. However, decreasing the amount of prediction timesteps may not always improve hydrodynamic prediction.

5.3 Stability Considerations

Hydrodynamic flood models are prone to numerical instability, which is exacerbated by the configurations necessary for high-resolution pluvial modeling. These configurations include the lack of a ramp-up stabilization period for the model domain (which contains no water, and therefore cannot be ramped up), the large amount of cell boundaries which require nonlinear solutions to SPDEs, and the flat terrain which provokes a large number of wetting and drying fronts. One method to examine the stability of flood models is to examine the mass conservation of the model domain. Such an analysis is presented in Figure 8 for the hydrodynamic model used to train the DNNs in this study.

Refer to caption
Figure 8: Mass change over time for 1 in/hr HEC-RAS simulations

As is evident from 8, the FMEs display an erratic pattern of mass addition to the model domain throughout the forecast, suggesting they are numerically unstable. The inability to ramp up the model shows its effects at the start of the timeseries, with both the DEs and FMEs adding around 830% the amount of mass to the study domain as was input at that time. Upon stabilization, the DEs add approximately 150% of the mass inputs to the domain. These instabilities are not apparent from the spatial patterns displayed in Figure 3. Such patterns of numerical instability provide important considerations for the selection of lookahead timesteps. Predicting fewer timesteps may allow the DNNs to better replicate hydrodynamic models, however, these models may be producing highly numerically unstable predictions which do not accurately depict flooding patterns. As shown in Figure 8, predicting multiple timesteps may provide a smoother fitting target, which could allow the DNNs to better resemble real-world flooding scenarios.

6 Conclusions

This study examined whether DNNs can be used to provide rapid pluvial inundation forecasts in low-relief urban areas. To train the DNNs, 2D hydrodynamic models in HEC-RAS were developed for a 11 km2km^{2} hyper-urbanized area in Houston, TX based on a research-grade lidar survey collected by the National Center for Airborne Laser Mapping (Houston, TX, US). Four different DNN architectures (DNN, BNN, LSTM, and PhyDNN) were compared for their fitting and forecast performance to the HEC-RAS Difussion Wave Equations and the Full Momentum Equations.

The location-agnostic DNNs were capable of forecasting water depths with median RMSEs ranging between 1-3 mm across a 1-hour rainfall event for the study area, while simultaneously predicting between 34 and 72 times faster than HEC-RAS on consumer-grade hardware at a 1-m resolution. The BNN lagged behind the DNN, LSTM, and PhyDNN in training and forecast performance (for example, 99th percentile values for NNSE of other architectures were between 0.7 and 0.85, where they were between 0.5 and 0.66 for BNNs). Although some marginal differences were found in the training performance of the DNN, LSTM, and PhyDNN, these did not translate into large differences in forecasts.

Numerical instability in hydrodynamic models was found to play an important role in the training of hydrodynamic DNNs, as suggested by a mass conservation issue within HEC-RAS. Starting the model from a dry state (as is required for pluvial inundation hydrodynamics) was found to exacerbate this issue, and restricting the training dataset for the DNNs to a mass steady-state region improved localized errors in some of the DNN forecasts (reducing the Coefficient of Variance of the RMSE by around 10% for LSTM and PhyDNN). Beyond considerations for the forecast accuracy for DNNs, these numerical instabilities carry important considerations for the architectural setup choices of DNNs for hydrodynamics.

This research has shown that DNNs are an efficient and viable method to forecast 2D hydrodynamic flooding, a first within the literature to the knowledge of the authors. Underlying limitations of hydrodynamic models are transferred to the DNNs attempting to reproduce them, revealing the need for more accurate hydrodynamic models and real-world validation data for such models. Notwithstanding these limitations, DNNs were shown vastly optimize hydrodynamic prediction, enabling future avenues of research into probabilistic hydrodynamics and near real time flood forecasting, in continued efforts to mitigate the deleterious impacts of storm events.

7 Acknowledgements

Partial funding for the first and third authors, along with the high resolution ALS DEM were provided by a facility grant from the National Science Foundation (#1830734).

References

  • [1] Engineering National Academies of Sciences and Medicine “Framing the challenge of urban flooding in the United States” National Academies Press, 2019
  • [2] H..L. Huong and A. Pathirana “Urbanization and climate change impacts on future urban flooding in Can Tho city, Vietnam” In Hydrology and Earth System Sciences 17.1, 2013, pp. 379–394 DOI: 10.5194/hess-17-379-2013
  • [3] P.. Zope, T.. Eldho and V. Jothiprakash “Impacts of land use-land cover change and urbanization on flooding: A case study of Oshiwara River Basin in Mumbai, India” In Catena 145 Elsevier B.V., 2016, pp. 142–154 DOI: 10.1016/j.catena.2016.06.009
  • [4] Wei Zhang, Gabriele Villarini, Gabriel A. Vecchi and James A. Smith “Urbanization exacerbated the rainfall and flooding caused by hurricane Harvey in Houston” In Nature 563.7731 Springer US, 2018, pp. 384–388 DOI: 10.1038/s41586-018-0676-z
  • [5] Jose A Marengo et al. “Trends in extreme rainfall and hydrogeometeorological disasters in the Metropolitan Area of São Paulo: a review” In Annals of the New York Academy of Sciences 1472.1 Wiley Online Library, 2020, pp. 5–20
  • [6] A.D. Nobre et al. “Height Above the Nearest Drainage – a hydrologically relevant new terrain model” In Journal of Hydrology 404.1, 2011, pp. 13–29 DOI: https://doi.org/10.1016/j.jhydrol.2011.03.051
  • [7] Irene Garousi-Nejad, David G. Tarboton, Mahyar Aboutalebi and Alfonso F. Torres-Rua “Terrain Analysis Enhancements to the Height Above Nearest Drainage Flood Inundation Mapping Method” In Water Resources Research 55.10, 2019, pp. 7983–8009 DOI: https://doi.org/10.1029/2019WR024837
  • [8] Lauren Lyn Williams and Melanie Lück-Vogel “Comparative assessment of the GIS based bathtub model and an enhanced bathtub model for coastal inundation” In Journal of Coastal Conservation 24.2 Springer, 2020, pp. 23
  • [9] Joko Sampurno, Randy Ardianto and Emmanuel Hanert “Integrated machine learning and GIS-based bathtub models to assess the future flood risk in the Kapuas River Delta, Indonesia” In Journal of Hydroinformatics 25.1 IWA Publishing, 2023, pp. 113–125
  • [10] J.. Johnson, D. Munasinghe, D. Eyelade and S. Cohen “An integrated evaluation of the National Water Model (NWM)–Height Above Nearest Drainage (HAND) flood mapping methodology” In Natural Hazards and Earth System Sciences 19.11, 2019, pp. 2405–2420 DOI: 10.5194/nhess-19-2405-2019
  • [11] J Bootsma “Evaluating methods to assess the coastal flood hazard on a global scale: a comparative analysis between the Bathtub approach and the LISFLOOD-AC model”, 2022
  • [12] Michalis I Vousdoukas et al. “Developments in large-scale coastal flood hazard mapping” In Natural Hazards and Earth System Sciences 16.8 Copernicus GmbH, 2016, pp. 1841–1853
  • [13] Tim Neumann and Kai Ahrendt “Comparing the” bathtub method” with Mike 21 HD flow model for modelling storm surge inundation” In Ecologic Institute, Berlin, Germany, 2013
  • [14] Jin Teng et al. “Enhancing the capability of a simple, computationally efficient, conceptual flood inundation model in hydrologically complex terrain” In Water Resources Management 33 Springer, 2019, pp. 831–845
  • [15] Emrah Yalcin “Assessing the impact of topography and land cover data resolutions on two-dimensional HEC-RAS hydrodynamic model simulations for urban flood hazard analysis” In Natural Hazards 101.3 Springer, 2020, pp. 995–1017
  • [16] Richard Courant, Kurt Friedrichs and Hans Lewy “On the partial difference equations of mathematical physics” In IBM journal of Research and Development 11.2 IBM, 1967, pp. 215–234
  • [17] Sujan Ghimire et al. “Streamflow prediction using an integrated methodology based on convolutional neural network and long short-term memory networks” In Scientific Reports 11.1 Nature Publishing Group UK London, 2021, pp. 17497
  • [18] Valeriy Gavrishchaka, Olga Senyukova and Mark Koepke “Synergy of physics-based reasoning and machine learning in biomedical applications: towards unlimited deep learning with limited data” In Advances in Physics: X 4.1 Taylor & Francis, 2019, pp. 1582361
  • [19] Minglei Fu et al. “Deep learning data-intelligence model based on adjusted forecasting window scale: application in daily streamflow simulation” In IEEE Access 8 IEEE, 2020, pp. 32632–32651
  • [20] Kyeungwoo Cho and Yeonjoo Kim “Improving streamflow prediction in the WRF-Hydro model with LSTM networks” In Journal of Hydrology 605 Elsevier, 2022, pp. 127297
  • [21] Maryam Rahimzad et al. “Performance comparison of an LSTM-based deep learning model versus conventional machine learning algorithms for streamflow forecasting” In Water Resources Management 35.12 Springer, 2021, pp. 4167–4187
  • [22] Xuan-Hien Le, Hung Viet Ho and Giha Lee “River streamflow prediction using a deep neural network: a case study on the Red River, Vietnam” In Korean Journal of Agricultural Science 46.4 Institute of Agricultural Science, CNU, 2019, pp. 843–856
  • [23] Muhammed Sit et al. “A comprehensive review of deep learning applications in hydrology and water resources” In Water Science and Technology 82.12, 2020, pp. 2635–2670 DOI: 10.2166/wst.2020.369
  • [24] Imme Ebert-Uphoff et al. “A VISION FOR THE DEVELOPMENT OF BENCHMARKS TO BRIDGE GEOSCIENCE AND DATA SCIENCE” In 17th International Workshop on Climate informatics URL: https://par.nsf.gov/biblio/10143795
  • [25] Fazlul Karim et al. “A Review of Hydrodynamic and Machine Learning Approaches for Flood Inundation Modeling” In Water 15.3, 2023 DOI: 10.3390/w15030566
  • [26] Adhémar Jean-Claude Barré de Saint-Venant “Théorie du mouvement non-permanent des eaux, avec application aux crues des rivières et à l’introduction des marées dans leur lit” In Comptes rendus de l’Académie des Sciences de Paris 73.147-154, 1871, pp. 237–240
  • [27] Indiana Department of Natural Resources “The General Guidelines for the Hydrologic-Hydraulic Assessment of Floodplains in Indiana”, 2014, pp. 1–41
  • [28] FEMA “Find the Right HEC-RAS Model”
  • [29] Gary W. Brunner and CEIWR-HEC “HEC-RAS River Analysis System Hydraulic Reference Manual Version 5.0” In Hydrologic Engineering Center, 2016, pp. 547
  • [30] Hossein Ghalkhani et al. “Application of surrogate artificial intelligent models for real-time flood routing” In Water and Environment Journal 27.4 Wiley Online Library, 2013, pp. 535–548
  • [31] Alina Chertock et al. “Well-balancing via flux globalization: Applications to shallow water equations with wet/dry fronts” In Journal of Scientific Computing 90 Springer, 2022, pp. 1–21
  • [32] P Garcıa-Navarro, P Brufau, J Burguete and J Murillo “The shallow water equations: An example of hyperbolic system” In Monografıas de la Real Academia de Ciencias de Zaragoza 31.01, 2008
  • [33] Alexander Kurganov, Yongle Liu and Vladimir Zeitlin “Numerical dissipation switch for two-dimensional central-upwind schemes” In ESAIM: Mathematical Modelling and Numerical Analysis 55.3 EDP Sciences, 2021, pp. 713–734
  • [34] Alina Chertock et al. “Local characteristic decomposition based central-upwind scheme” In Journal of Computational Physics 473 Elsevier, 2023, pp. 111718
  • [35] Nima Ekhtari, Craig Glennie and Juan Carlos Fernandez-Diaz “Classification of airborne multispectral lidar point clouds for land cover mapping” In IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing 11.6 IEEE, 2018, pp. 2068–2078
  • [36] Gary W. Brunner and CEIWR-HEC “HEC-RAS HEC-RAS 2D User ’ s Manual”, 2021
  • [37] Christopher Goodell “Breaking the Hec-Ras Code: A User’s Guide to Automating Hec-Ras” h2ls., 2014
  • [38] “POINT PRECIPITATION FREQUENCY (PF) ESTIMATES WITH 90% CONFIDENCE INTERVALS AND SUPPLEMENTARY INFORMATION NOAA Atlas 14, Volume 11, Version 2, LAT 29.7207 LONG -95.3430” National OceanicAtmospheric Administration URL: https://hdsc.nws.noaa.gov/hdsc/pfds/pfds_map_cont.html?bkmrk=tx
  • [39] “TorchBNN V1.2 - Bayesian-Neural-Network-Pytorch” Python Software Foundation URL: https://pypi.org/project/torchbnn/
  • [40] Nikhil Muralidhar et al. “Phynet: Physics guided neural networks for particle drag force prediction in assembly” In Proceedings of the 2020 SIAM International Conference on Data Mining, 2020, pp. 559–567 SIAM
  • [41] Diederik P. Kingma and Jimmy Ba “Adam: A Method for Stochastic Optimization”, 2017 arXiv:1412.6980 [cs.LG]
  • [42] Alina Chertock, Shumo Cui, Alexander Kurganov and Tong Wu “Well-balanced positivity preserving central-upwind scheme for the shallow water system with friction terms” In International Journal for numerical methods in fluids 78.6 Wiley Online Library, 2015, pp. 355–383