Verification of a real-time ensemble-based method for updating earth model based on GAN

Kristian Fossum krfo@norceresearch.no Sergey Alyaev Jan Tveranger Ahmed H. Elsheikh NORCE Norwegian Research Centre, Bergen, Norway School of Energy, Geoscience, Infrastructure and Society, Heriot-Watt University, Edinburgh, United Kingdom

Abstract

The complexity of geomodelling workflows is a limiting factor for quantifying and updating uncertainty in real-time during drilling. We propose Generative Adversarial Networks (GANs) for parametrization and generation of geomodels, combined with Ensemble Randomized Maximum Likelihood (EnRML) for rapid updating of subsurface uncertainty. This real-time ensemble method combined with a highly non-linear model arising from neural-network modeling sequences might produce inaccurate and/or biased posterior solutions. This paper illustrates the predictive ability of EnRML on several examples where we assimilate local extra-deep electromagnetic logs. Statistical verification with MCMC confirms that the proposed workflow can produce reliable results required for geosteering wells.

keywords:

Geosteering, Machine Learning, Deep Neural Network, Generative Adversarial Network, Ensemble randomized maximum likelihood

journal: Journal of Computational Science

1 Introduction

The process of drilling wells for hydrocarbon production represents a major cost in petroleum reservoir development. However, drilling of new wells is necessary to increase the total oil recovery. To maximize the value for each drilled well it is necessary to optimize the placement of the well within the reservoir structure. An optimally placed well will mobilize more of the petroleum resources, and reduce the need for injected water – reducing the environmental impact of oil production.

To place a well in its optimal position, operators apply geosteering. Here, the well trajectory is adjusted while drilling in response to real-time measurement of the geology surrounding the drill bit. The value of geosteering has been well documented in the literature [1, 2, 3].

The main objective with geosteering is to utilize the information in the measurements to make optimal decisions. Hence, geosteering can be seen as a sequential decision process under uncertainty and should be treated in a probabilistic framework [4]. Recently, a workflow based on the Ensemble Kalman Filter (EnKF) [5] has been employed to condition the geological model on measurements acquired while drilling [6, 7]. In the EnKF, the uncertainty is represented by an ensemble of equiprobable realizations. This workflow has then been combined with a global optimization method and applied as a Decision Support System (DSS)[8].

The DSS framework provides high-quality decisions on synthetic cases and outperformed most geosteering experts in a controlled experiment [9]. However, several practical challenges must be addressed before it can be applied to real operations [8]. These include modeling modern commercial tools so that real measurements can be processed, and a real-time earth model that can handle realistic geological complexity. The forward deep neural network (FDNN) trained on synthetic data for extra-deep electromagnetic measurements [10] enabled the real-time ensemble-based update of layered models in 1.5D [11] and 3D [12]. Moreover, [13] showed that the model errors present in the FDNN approximation can be alleviated during the ensemble-based inversion for the layered case.

Fossum et al. [14] proposed a new modeling sequence which combines the FDNN with a generative adversarial network (GAN) to produce complex geological realizations in real time to aid geosteering, see Figure 1. The premise of the GAN is that it allows the earth model to be represented by a Gaussian distribution, while all produced realizations maintain geological realism. This allows EnKF-like methods to update not only continuous properties but also complex geological structures, which is required for geosteering. However, the workflow implementation in [14] converged only when the starting uncertainty was small. [15] improved the results and demonstrated visually convincing structural ahead-of-bit prediction on a selected example. It is known, however, that approximate ensemble-based methods, such as the EnKF and its derivatives, can be sensitive to non-linearities present in complex modeling sequences, and thus their predictions may be biased.

In this paper we present a robust and improved implementation of the framework presented in [14] that is able to account for an appropriate starting uncertainty. Further, we aim to test the convergence properties of the iterative ensemble randomised maximum likelihood (EnRML) method when updating the GAN-based geomodels with FDNN approximation of measurements – denoted the GAN-FDNN modeling sequence. The EnRML probabilistic output is compared to a gold standard Markov Chain Monte-Carlo (MCMC) solution for the same problem using various metrics. The numerical examples demonstrate that the EnRML – applied to the GAN-FDNN modeling sequence – generates posterior samples with excellent predictive capabilities that are good approximations to the true posterior solution.

Figure 1: The proposed DSS workflow. Green boxes highlight the new modeling sequence introduced in [12]. The gray boxes indicate the decision optimization, which is not explored in detail here.

To construct a reference earth model we generate realizations of a fluvial geological environment using commercial software. These realizations are then sub-sampled to form a training dataset for the offline training of a Generative Adversarial Network (GAN). The GAN is then used, online, to generate plausible geological realizations from a low-dimensional Gaussian input vector. The complete earth modeling loop is described in Section 2. For modeling the extra-deep EM measurements we use a forward deep neural network (FDNN) trained on a dataset generated using a commercial simulator (Section 3). In Section 4 we discuss the exact and the approximate data assimilation (DA) methods. The two numerical experiments, designed to test the applicability of our proposed method, are described in Section 5 together with the numerical results. Finally, we summarize and conclude the paper in Section 6.

2 Earth modeling using GAN

GANs are a class of unsupervised machine learning methods which can learn to generate new formatted data with the same statistics as the training set. Motivated by successful applications of GANs for modeling channelized structures for reservoir-simulation workflows [16, 17, 18, 19, 20], we use a GAN for efficient earth modeling.

The GAN consists of two deep neural networks (DNNs): a generator and a discriminator. The generator takes a random low-dimensional Gaussian vector as input and generates a realization of formatted data, here a geological realization. The discriminator takes the formatted data and outputs the probability of it being ’real’, i.e., belonging to the training set. During training, the two DNNs compete in a min-max game and are trained simultaneously. On each training step the generator creates (fake) geological realizations from random vectors. The fake geological realizations are combined with random samples of the real earth model and fed to the discriminator. The loss function for the generator is proportional to the number of ’fakes’ correctly identified by the discriminator. The loss function for the discriminator is proportional to the total number of misjudged data samples. In our study we use an adapted Wasserstein GAN [21] with hierarchical deep convolutional networks [22] for the generator and the discriminator, see [21] for implementation details.

For geosteering we want to reproduce likely geological realizations of facies and porosity distributions on a 2D vertical geological section along the well, in order to identify the oil-bearing sands ahead of the bit. For training of the GAN we use a reference earth model that is large compared to the area of prediction, and which should provide a realistic test case for the present study in terms of scale and actual geological features and properties. The reference earth model is constructed using commercial software that models a synthetic structural framework, a facies model setup derived from outcrop analogue data, and synthetic petrophysical properties of individual facies derived from published literature. The resulting model measures 4000 m x 1000 m x 200 m (xyz) with cell dimensions set to 10 m x 10 m x 0.5 m, yielding a regular grid of size 400x100x400, see Figure 2.

The constructed facies model represents a low net/gross fluvial depositional system. It was chosen since it provides complex 3D architectures comprising a limited number of facies, which form contrasting geometries, see Figure 2. Input numbers for statistical generation of facies and geometries are derived from a well-documented outcrop of the Cretaceous lower Williams Fork Formation (Mesa Verde Group) at Coal Canyon, Colorado, USA [23, 24, 25].

Key parameters of the facies model setup are listed in Table 1. The model is not intended as a rendering of the outcrop itself and is consequently simplified compared to descriptions of the original outcrop [26, 27, 28, 24]. The model contains three facies: Background/shale, Channels and Crevasse splays. The probability distribution of channel width in the model is adapted to include “narrow channel bodies”, and stacking of channels accounts for multi-story channels which comprise more than 80% of the observed channel bodies. The flow direction of the channel system is set towards $45\pm 10$ degrees. No trends were used to condition the spatial distribution of channels. The details of this synthetic model are also described in [15].

Table 1: Parameter settings for facies models.

Volumetric fraction	Value	Tolerance	Comments
Channel system volume fraction	0.3	0.05
Channel positioning	1		No trends
Crevasse volume fraction	0.1	0.03	Of channel system vol. frac.

Channel geometry	Value	SD	Min.	Max.
Thickness	4.2	1.5
Width	155	50	20	500
Correlation W/T	36
Amplitude	400	50
Sinuosity	1.3
Azimuth	45	10

Form/repulsion	Setting
Cross-section geometry	Parabolic, basic variability
Channel form	Rigid
Repulsion	None

The geological realization is parameterized by a vector of 60 independent parameters. For each 60-dimensional vector, the generator outputs a 64x64 grid with three values in each grid block. For a grid block (with dimensions 10.0m along-well and 0.5m thickness) the three values, ’channels’, represent the probability of the grid-block belonging to the respective facies class: Background/Channel/Crevasse. Our generator is also predicting porosity/resistivity distribution within the geo-bodies, but in this study only the facies classes are used.

For training, the original 3D earth model is sampled as 64x64 2D images with three channels. The facies index from the training set is converted into a one-hot three-dimensional vector. That is, the vector represents the probability of each facies: the value at the true index is set to one and the other channels to zero. During evaluation, the resistivity of the facies with the highest probability is applied.
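The encoding and decoding steps described above can be sketched in a few lines of NumPy. This is only an illustration under stated assumptions: the facies order (0 = background, 1 = channel, 2 = crevasse) and the function names are hypothetical, and the resistivity values are the ones listed in Section 3.

```python
import numpy as np

# Assumed facies order: 0 = background, 1 = channel, 2 = crevasse.
# Resistivities in Ohm m, taken from the list in Section 3.
FACIES_RESISTIVITY = np.array([4.0, 171.0, 55.0])


def one_hot_facies(facies_index, n_facies=3):
    """Convert an integer facies image (H, W) to one-hot channels (H, W, n_facies)."""
    return np.eye(n_facies)[facies_index]


def resistivity_from_probabilities(facies_prob):
    """Pick the most probable facies in each cell and look up its resistivity."""
    most_probable = np.argmax(facies_prob, axis=-1)
    return FACIES_RESISTIVITY[most_probable]


# Hypothetical 2x2 facies image, for illustration only.
facies = np.array([[0, 1], [2, 0]])
encoded = one_hot_facies(facies)
resistivity = resistivity_from_probabilities(encoded)
```

Applied to a generator output, `resistivity_from_probabilities` would receive the 64x64x3 probability image instead of an exact one-hot array.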

3 Forward DNN model of extra-deep EM logs

To maintain real-time performance of a data assimilation workflow the forward model should be fast and support batch, preferably parallel execution. Proprietary forward models provided by measurement instrument vendors provide the most accurate results, but they are often not sufficiently fast, and not always optimized for batch execution. In [10], the authors developed a DNN approximation of such a forward model [29], which we abbreviate FDNN.

The model approximates the output of the ultra-deep electromagnetic well-bore logging instrument. The instrument is configured to transmit four shallow and nine pairs of deep directional measurements, and has sensitivity to boundaries up to 30 meters to the side from the well bore. We emphasize that the tool provides information around, but not ahead of the drilling position. An illustration of the deep measurements depth of detection is provided in Figure 11.

The input to the FDNN model is a layered geological medium with up to three boundaries above and three below the measurement instrument, together with the resistivity values of all seven layers. In this study we assume that the layer resistivity is isotropic and that the well is aligned with the horizontal axis.

We produce one synthetic set of measurements for every horizontal position (one per column of cells) of the gridded model which we ’drill’ through. We choose the most probable facies for each computational cell within the considered column and use the corresponding resistivity value (same as in [15]):

1. Background, $R=4.0$ Ohm m;
2. Channel, $R=171.0$ Ohm m;
3. Crevasse, $R=55.0$ Ohm m.

We find the boundaries between layers composed of pixels with equal resistivities and use the boundaries and the layers’ resistivities as the input to the forward model.
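A minimal sketch of this boundary extraction for a single column is given below. The function name and the depth convention (metres measured from the top of the column) are assumptions made for illustration; selecting the three nearest boundaries above and below the tool, as required by the FDNN input, is left out.

```python
import numpy as np


def column_to_layers(resistivity_column, cell_thickness=0.5):
    """Collapse a 1D column of cell resistivities into layers.

    Returns (boundaries, layer_resistivities), where boundaries are the
    depths, in metres from the top of the column, at which the resistivity
    changes between two neighbouring cells.
    """
    column = np.asarray(resistivity_column, dtype=float)
    # Index k is the first cell of a new layer when cell k differs from cell k-1.
    change = np.flatnonzero(column[1:] != column[:-1]) + 1
    boundaries = change * cell_thickness
    layer_resistivities = column[np.concatenate(([0], change))]
    return boundaries, layer_resistivities


# Hypothetical column with seven cells of 0.5 m thickness.
boundaries, layers = column_to_layers([4.0, 4.0, 171.0, 171.0, 171.0, 4.0, 55.0])
```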

4 Data assimilation during geosteering

The DSS for geosteering [8] uses the data assimilation loop (see Figure 1, left) to condition the earth model to measurements made while drilling. The fundamental idea is that if a poorly known earth model can be made consistent with measurements in the statistical sense, it will contain non-biased forecasts and, hence, provide a better basis for decisions (see Figure 1, right).

In this paper, the emphasis is placed on the data assimilation part of the DSS. Specifically, we investigate real-time data assimilation with the EnRML method utilizing a modeling sequence based on the two neural networks described above: a GAN generator for complex earth modeling and an FDNN model for the synthetic extra-deep EM logs. The EnRML method is an iterative ensemble smoother which has received a lot of attention for history matching of subsurface multi-phase flow problems, see [30] and references therein. The method uses an ensemble approximation to the sensitivity matrix, and provides a fast and approximate solution to the Bayesian problem. The method can only be shown to converge for Gaussian posterior distributions. However, the method is known to also sample accurately from moderately non-Gaussian posterior distributions. To assess the statistical convergence of the EnRML in this context we compare the method to samples generated by a Markov Chain Monte Carlo (MCMC) algorithm. Since the MCMC, when properly converged, generates independent and identically distributed samples from the Bayesian distribution, several metrics can be used to assess the statistical error of the EnRML method. The EnRML and the MCMC methods are described in detail in the rest of the section.

4.1 EnRML

The EnRML [31] has recently become one of the most successful methods for automatic history matching of petroleum reservoirs. The EnRML is derived by minimization of an objective function using an ensemble approximation of the sensitivity matrix. Since a wide range of methods can be applied for the minimization, the EnRML can be formulated in many different ways. In this study we utilize the approximate form of the Levenberg-Marquardt method, introduced in [32].

Based on Bayes’ theorem, the objective function to be minimized is

S\left(\vec{m}\right)=\frac{1}{2}\left(g\left(\vec{m}\right)-\vec{d}_{obs}^{*}\right)^{T}C_{\vec{d}}^{-1}\left(g\left(\vec{m}\right)-\vec{d}_{obs}^{*}\right)+\frac{1}{2}\left(\vec{m}-\vec{m}_{prior}\right)^{T}C_{\vec{m}}^{-1}\left(\vec{m}-\vec{m}_{prior}\right).

(1)

Here, $\vec{d}_{obs}^{*}$ is the noisy observed data, $g\left(\vec{m}\right)$ is the modelling sequence depending on the parameter $\vec{m}$ , and $\vec{m}_{prior}$ is a sample from the prior distribution of the parameters (this is the rough plus smooth sampling approach given in [33, chap.10]). Iteration number $i$ of the Levenberg-Marquardt method is given as

\begin{aligned}\delta\vec{m}_{i}=&-\left[\left(1+\lambda_{i}\right)C^{-1}_{\vec{m}}+G^{T}_{i}C^{-1}_{\vec{d}}G_{i}\right]^{-1}\\&\times\left[C^{-1}_{\vec{m}}\left(\vec{m}_{i}-\vec{m}_{prior}\right)+G^{T}_{i}C_{\vec{d}}^{-1}\left(g\left(\vec{m}\right)-\left(\vec{d}_{obs}+\vec{\epsilon}\right)\right)\right]\end{aligned}

(2)-(3)

where $\lambda_{i}$ is the Levenberg-Marquardt multiplier, $G$ is the sensitivity of data to the parameters, and $\vec{\epsilon}\sim\mathcal{N}\left(0,C_{\vec{d}}\right)$ is a realization of the measurement noise.
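To make the roles of the terms concrete, the objective function and a single Levenberg-Marquardt step can be written out as below. This is a sketch only: it assumes that the exact sensitivity $G$ and the precision matrices are available, which is not the case in the ensemble setting, and the linear model `A` is a hypothetical example chosen so that the result is easy to check by hand.

```python
import numpy as np


def objective(m, g, d_obs, C_d_inv, m_prior, C_m_inv):
    """Evaluate S(m): data misfit plus prior (model) mismatch."""
    r_d = g(m) - d_obs
    r_m = m - m_prior
    return 0.5 * r_d @ C_d_inv @ r_d + 0.5 * r_m @ C_m_inv @ r_m


def lm_step(m_i, G_i, g, d_pert, C_d_inv, m_prior, C_m_inv, lam):
    """One Levenberg-Marquardt step with perturbed data d_pert = d_obs + eps."""
    hessian = (1.0 + lam) * C_m_inv + G_i.T @ C_d_inv @ G_i
    gradient = C_m_inv @ (m_i - m_prior) + G_i.T @ C_d_inv @ (g(m_i) - d_pert)
    return -np.linalg.solve(hessian, gradient)


# Hypothetical linear model g(m) = A m, used only to exercise the functions.
A = np.array([[1.0, 2.0], [0.0, 1.0], [1.0, 0.0]])


def g(m):
    return A @ m


m_prior = np.zeros(2)
d_pert = np.array([1.0, 2.0, 3.0])
step = lm_step(m_prior, A, g, d_pert, np.eye(3), m_prior, np.eye(2), lam=0.0)
s_before = objective(m_prior, g, d_pert, np.eye(3), m_prior, np.eye(2))
s_after = objective(m_prior + step, g, d_pert, np.eye(3), m_prior, np.eye(2))
```

For a linear model and $\lambda_{i}=0$ the step lands on the minimizer in one iteration; a positive $\lambda_{i}$ shortens the step, which is what makes the scheme more robust for non-linear $g$.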

In the ensemble framework, we approximate $C_{\vec{m}}$ and $G$ using the ensemble. To this end we define

\tilde{G}=C^{1/2}_{sc}\Delta\vec{d}\left(\Delta\vec{m}\right)^{-1}

(4)

\tilde{C}_{\vec{m}}=\Delta\vec{m}\Delta\vec{m}^{T}

(5)

where

\Delta\vec{m}=\left[\vec{m}_{1},\dots,\vec{m}_{j},\dots,\vec{m}_{N}\right]\left(I_{N}-\frac{1}{N}11^{T}\right)/\sqrt{N-1},

(6)

\Delta\vec{d}=C_{sc}^{-1/2}\left[g\left(\vec{m}_{1}\right),\dots,g\left(\vec{m}_{j}\right),\dots,g\left(\vec{m}_{N}\right)\right]\left(I_{N}-\frac{1}{N}11^{T}\right)/\sqrt{N-1},

(7)

$N$ denotes the ensemble size, and $C_{sc}$ is a diagonal matrix for scaling the data, typically containing the measurement variance on the diagonal. We obtain the approximate version of the Levenberg-Marquardt update by inserting the ensemble approximations of $G$ and $C_{\vec{m}}$, neglecting the updates from the model mismatch term, substituting the prior precision matrix $C^{-1}_{\vec{m}}$ with $\tilde{C}^{-1}_{\vec{m}_{i}}$, and rewriting the equation using the Sherman-Woodbury-Morrison matrix inversion formula [34]. This gives the following update equation

\delta\vec{m}_{i}=-\tilde{C}_{\vec{m}_{i}}\tilde{G}_{i}^{T}\left[\left(1+\lambda_{i}\right)C_{\vec{d}}+\tilde{G_{i}}\tilde{C}_{\vec{m}_{i}}\tilde{G}^{T}_{i}\right]^{-1}\left(g\left(\vec{m}\right)-\left(\vec{d}_{obs}+\vec{\epsilon}\right)\right).

(8)
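The anomaly matrices in (6) and (7) amount to centring the ensemble, scaling by $1/\sqrt{N-1}$ and, for the data, dividing by the square root of the diagonal of $C_{sc}$. A minimal sketch, with hypothetical function names and with ensemble members stored as columns:

```python
import numpy as np


def scaled_anomalies(X):
    """Centre the columns of X (one column per member) and scale by 1/sqrt(N-1)."""
    N = X.shape[1]
    return (X - X.mean(axis=1, keepdims=True)) / np.sqrt(N - 1)


def ensemble_anomalies(M, D, c_sc_diag):
    """Return (delta_m, delta_d) as defined in (6) and (7).

    M: (N_m, N) parameter ensemble; D: (N_d, N) predicted data g(m_j);
    c_sc_diag: (N_d,) diagonal of the scaling matrix C_sc.
    """
    delta_m = scaled_anomalies(M)
    delta_d = scaled_anomalies(D / np.sqrt(c_sc_diag)[:, None])
    return delta_m, delta_d


# Small random ensemble, for illustration only.
rng = np.random.default_rng(0)
M = rng.normal(size=(3, 50))
D = rng.normal(size=(2, 50))
c_sc = np.array([4.0, 0.25])
delta_m, delta_d = ensemble_anomalies(M, D, c_sc)
```

With this scaling, `delta_m @ delta_m.T` is the usual sample covariance of the parameters, which is the ensemble approximation in (5).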

The update equation is simplified, and made computationally more stable, by inserting the truncated singular value decomposition of $\Delta\vec{d}$

\Delta\vec{d}=U_{p}S_{p}V_{p}^{T},

(9)

where the subscript $p$ indicates the number of retained singular values when they are sorted in descending order. In this work, we choose $p$ such that the sum of the first $p$ singular values reaches 99% of the sum of all the singular values. Further, we substitute $C_{\vec{d}}$ with the ensemble approximation $\tilde{C}_{\vec{d}}$

\tilde{C}_{\vec{d}}=\Delta\vec{\epsilon}\Delta\vec{\epsilon}^{T},

(10)

where

\Delta\vec{\epsilon}=\left[\vec{\epsilon}_{1},\dots,\vec{\epsilon}_{j},\dots,\vec{\epsilon}_{N}\right]\left(I_{N}-\frac{1}{N}11^{T}\right)/\sqrt{N-1}.

(11)
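The 99% truncation rule for the singular value decomposition in (9) can be sketched as follows. The function name and the `energy` argument are hypothetical, and the rule is implemented as "the smallest $p$ whose cumulative sum reaches the requested fraction", which is one reading of the criterion stated above.

```python
import numpy as np


def truncated_svd(delta_d, energy=0.99):
    """Return U_p, s_p, V_p, keeping the leading singular values whose
    cumulative sum first reaches the fraction `energy` of the total sum."""
    U, s, Vt = np.linalg.svd(delta_d, full_matrices=False)
    cumulative = np.cumsum(s) / np.sum(s)
    # First index at which the cumulative fraction reaches `energy`.
    p = min(int(np.searchsorted(cumulative, energy)) + 1, s.size)
    return U[:, :p], s[:p], Vt[:p].T


# Hypothetical matrix with known singular values 10, 1, 0.05 and 0.01.
U_p, s_p, V_p = truncated_svd(np.diag([10.0, 1.0, 0.05, 0.01]))
```

In this example the two leading singular values already carry more than 99% of the total sum, so only two are retained.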

Inserting (4), (5), (9), and (10) into (8) gives

\begin{aligned}\delta\vec{m}_{i}&=-\Delta\vec{m}_{i}V_{p}\left[\left(1+\lambda_{i}\right)S_{p}^{-1}U_{p}^{T}C^{-1/2}_{sc}\Delta\vec{\epsilon}\Delta\vec{\epsilon}^{T}C_{sc}^{-1/2}U_{p}S_{p}^{-T}+I\right]^{-1}\left(U_{p}S^{-1}_{p}\right)^{T}C^{-1/2}_{sc}\left(g\left(\vec{m}\right)-\left(\vec{d}_{obs}+\vec{\epsilon}\right)\right)\\&=-\Delta\vec{m}_{i}V_{p}Z\left[\left(1+\lambda_{i}\right)\zeta+I\right]^{-1}\left(U_{p}S^{-1}_{p}Z\right)^{T}C^{-1/2}_{sc}\left(g\left(\vec{m}\right)-\left(\vec{d}_{obs}+\vec{\epsilon}\right)\right),\end{aligned}

(12)

where $Z$ and $\zeta$ are the eigenvectors and eigenvalues of

S_{p}^{-1}U_{p}^{T}C^{-1/2}_{sc}\Delta\vec{\epsilon}\Delta\vec{\epsilon}^{T}C_{sc}^{-1/2}U_{p}S_{p}^{-T}.

After each application of (8) we assess convergence, and we continue iterating until the scheme has converged. Here, we consider the method to be converged when the relative change in the data misfit (the first term in (1)) falls below a given threshold, or when the maximum number of iterations is reached. In the numerical examples, the threshold is $2\times 10^{-2}$ and the maximum number of iterations is $10$.
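A compact NumPy sketch of the update (12) and of the stopping rule is given below. It is not the authors' implementation: the function names and the array layout (members as columns) are assumptions, localization is omitted, and the adaptation of $\lambda_{i}$ between iterations is left out. The last lines compare the result with the untruncated form of the update, obtained by keeping all singular values, as a consistency check.

```python
import numpy as np


def enrml_update(delta_m, delta_d, delta_eps, residual, c_sc_diag, lam, energy=0.99):
    """Parameter update (12) for all ensemble members at once.

    delta_m:   (N_m, N) parameter anomalies, as in (6)
    delta_d:   (N_d, N) scaled data anomalies, as in (7)
    delta_eps: (N_d, N) anomalies of the noise realizations, as in (11)
    residual:  (N_d, N) columns g(m_j) - (d_obs + eps_j)
    c_sc_diag: (N_d,) diagonal of C_sc
    """
    U, s, Vt = np.linalg.svd(delta_d, full_matrices=False)
    p = min(int(np.searchsorted(np.cumsum(s) / np.sum(s), energy)) + 1, s.size)
    U_p, s_p, V_p = U[:, :p], s[:p], Vt[:p].T

    inv_sqrt_sc = 1.0 / np.sqrt(c_sc_diag)
    US_inv = U_p / s_p  # U_p S_p^{-1}, each column k divided by s_k
    # X X^T is the matrix whose eigenvalues zeta and eigenvectors Z enter (12).
    X = US_inv.T @ (inv_sqrt_sc[:, None] * delta_eps)
    zeta, Z = np.linalg.eigh(X @ X.T)

    projected = (US_inv @ Z).T @ (inv_sqrt_sc[:, None] * residual)
    return -delta_m @ V_p @ Z @ (projected / ((1.0 + lam) * zeta + 1.0)[:, None])


def has_converged(misfit_old, misfit_new, iteration, tol=2e-2, max_iter=10):
    """Stop on a small relative change of the data misfit or on the iteration cap."""
    relative_change = abs(misfit_old - misfit_new) / misfit_old
    return relative_change < tol or iteration >= max_iter


def _anomalies(X):
    return (X - X.mean(axis=1, keepdims=True)) / np.sqrt(X.shape[1] - 1)


# Consistency check on random inputs (illustration only).
rng = np.random.default_rng(0)
N_m, N_d, N = 6, 4, 30
c_sc = np.full(N_d, 0.25)
dm = _anomalies(rng.normal(size=(N_m, N)))
dd = _anomalies(rng.normal(size=(N_d, N)) / np.sqrt(c_sc)[:, None])
de = _anomalies(rng.normal(scale=0.5, size=(N_d, N)))
res = rng.normal(size=(N_d, N))
update = enrml_update(dm, dd, de, res, c_sc, lam=1.0, energy=1.0)
A = (de @ de.T) / np.sqrt(np.outer(c_sc, c_sc))
direct = -dm @ dd.T @ np.linalg.solve(2.0 * A + dd @ dd.T, res / np.sqrt(c_sc)[:, None])
```

Working in the eigenbasis `Z` replaces the inversion of an $N_{d}\times N_{d}$ matrix by an element-wise division over the $p$ retained directions, which is why the truncated form is both cheaper and more stable.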

4.2 MCMC

A reliable method for sampling from a complex posterior distribution is the MCMC technique. MCMC relates to the general framework of methods introduced in [35] and [36] for Monte Carlo (MC) integration. Firstly, one designs a Markov chain that produces samples from the desired posterior distribution. Secondly, one utilizes these samples for MC integration. In this section, the adaptive Metropolis-Hastings method, which is the method utilized in the numerical study, is introduced. For more information on MCMC we refer the reader to [37], and references therein.

Suppose we want samples from the un-normalized posterior distribution $F$ , which is the general case with the Bayesian method where the normalizing factor often is very difficult to calculate. Assume that the current element of the chain is $\vec{m}$ , and one proposes a move to $\vec{m}^{*}$ . The proposal is sampled from the proposal distribution $q\left(\vec{m}^{*}|\vec{m}\right)$ . The move is performed with probability

b\left(\vec{m},\vec{m}^{*}\right)=\min\left(1,r\left(\vec{m},\vec{m}^{*}\right)\right)

(13)

where the Hastings ratio is defined as

r\left(\vec{m},\vec{m}^{*}\right)=\frac{F\left(\vec{m}^{*}\right)q\left(\vec{m}|\vec{m}^{*}\right)}{F\left(\vec{m}\right)q\left(\vec{m}^{*}|\vec{m}\right)}.

(14)

This is the basis for the Metropolis-Hastings method, and it can be shown that the method generates samples from the posterior distribution $F$ .
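A single Metropolis-Hastings move following (13) and (14) can be sketched as below. The function signature is an assumption made for illustration, and the computation is done with logarithms of $F$ and $q$ to avoid numerical underflow. The short demonstration targets a one-dimensional standard normal distribution with a symmetric random-walk proposal, for which the proposal terms cancel.

```python
import numpy as np


def metropolis_hastings_step(m, log_F, propose, log_q, rng):
    """One Metropolis-Hastings move.

    log_F(m):        log of the un-normalized target density F.
    propose(m, rng): draws m_star from q(. | m).
    log_q(a, b):     log q(a | b), the proposal density of a given b.
    Returns (new_state, accepted).
    """
    m_star = propose(m, rng)
    log_r = (log_F(m_star) + log_q(m, m_star)) - (log_F(m) + log_q(m_star, m))
    accept = np.log(rng.uniform()) < min(0.0, log_r)
    return (m_star, True) if accept else (m, False)


def log_target(m):
    return -0.5 * float(m @ m)  # standard normal, un-normalized


def random_walk(m, rng):
    return m + rng.normal(size=m.shape)


def symmetric_log_q(a, b):
    return 0.0  # symmetric proposal: the ratio of q terms equals one


rng = np.random.default_rng(1)
state = np.zeros(1)
chain = np.empty(10000)
for k in range(chain.size):
    state, _ = metropolis_hastings_step(state, log_target, random_walk, symmetric_log_q, rng)
    chain[k] = state[0]
```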

The Metropolis-Hastings algorithm requires a choice of proposal distribution, and some distributions work better than others. The optimal choice would be to draw proposals directly from the posterior $F$. However, this is not possible since we cannot sample from this distribution. Since the MCMC converges for any proposal distribution that fulfills some general conditions, one idea is to gradually adapt the proposal distribution using previous samples from the chain. This adaptive approach gives a gradually better proposal distribution as the chain evolves. To this end we select the following mixture distribution as our proposal distribution

\vec{m}^{*}\sim\left(1-\beta\right)\mathcal{N}\left(\vec{m},\left(\frac{2.38^{2}}{N_{m}}\right)\tilde{C}_{\vec{m}}\right)+\beta\mathcal{N}\left(\vec{m},Q_{\vec{m}}\right).

(15)

Here $\tilde{C}_{\vec{m}}$ is the empirical covariance matrix calculated utilizing all the preceding iterations of the Markov Chain, $Q_{\vec{m}}$ is some fixed non-singular matrix and $0<\beta<1$ . Note that $\beta=1$ until $\tilde{C}_{\vec{m}}$ is well defined. Efficient on-line updating of $\tilde{C}_{\vec{m}}$ is achieved by the recursion given in [38]. This sampling method was applied in [39, 40]. It is well known that the MCMC requires a certain burn-in period since the initial samples are not from the posterior distribution. Hence, it is necessary to monitor the convergence of the method. In this work, convergence is monitored by assessing the maximum root statistic of the multivariate potential scale reduction factor [41].
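The mixture proposal (15) can be sketched as follows. The running covariance below uses a generic Welford-type recursion and is only a stand-in: it is not claimed to be the recursion of [38]. All names are hypothetical.

```python
import numpy as np


class RunningCovariance:
    """Online mean and covariance of the chain (Welford-type update)."""

    def __init__(self, dim):
        self.n = 0
        self.mean = np.zeros(dim)
        self.scatter = np.zeros((dim, dim))

    def update(self, x):
        self.n += 1
        delta = x - self.mean
        self.mean = self.mean + delta / self.n
        self.scatter = self.scatter + np.outer(delta, x - self.mean)

    def covariance(self):
        return self.scatter / (self.n - 1)


def adaptive_proposal(m, chain_cov, Q, beta, rng):
    """Draw m_star from the two-component mixture (15).

    chain_cov: empirical covariance of the chain so far, or None while it is
               not yet well defined (the fixed component is then always used,
               which corresponds to beta = 1).
    Q:         fixed non-singular covariance matrix.
    """
    if chain_cov is None or rng.uniform() < beta:
        return rng.multivariate_normal(m, Q)
    return rng.multivariate_normal(m, (2.38**2 / m.size) * chain_cov)


# Illustration with random vectors standing in for chain states.
rng = np.random.default_rng(0)
samples = rng.normal(size=(200, 3))
tracker = RunningCovariance(3)
for x in samples:
    tracker.update(x)
m_star = adaptive_proposal(samples[-1], tracker.covariance(), 0.01 * np.eye(3), 0.05, rng)
```

Both mixture components are centred at the current state, so the proposal is symmetric for a fixed covariance and the $q$ terms in (14) cancel.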

5 Numerical Experiments

The numerical experiments investigate how the GAN-FDNN modeling sequence can be applied in the data assimilation part of DSS when data assimilation using EnRML is applied for real-time uncertainty reduction. We design two synthetic experiments that focus on the reduction of uncertainty ahead of measurements and the ability of the algorithm to predict the sand channels in the unexplored part of the geomodel.

To quantify the quality of the EnRML approximation, when applied to the GAN-FDNN modeling sequence, we compare the posterior ensemble of EnRML with true samples from the posterior – acquired by the MCMC algorithm. The comparison is done by evaluating several metrics, including visual comparison of standard deviation, and mean, in every point of the domain, point-wise Kolmogorov-Smirnov two-sample test, and visual inspection of kernel density estimates of the marginal distribution of GAN-input vector $m_{i}$ .

We perform two numerical tests. In both tests we utilize the generative neural network, introduced in Section 2, to represent the earth model with uncertainty. Hence, our goal is to condition the poorly known 60-dimensional input vector, $\vec{m}$, to measurements. The prior realizations of the earth model are generated by applying the generative network to parameters sampled from a multivariate Gaussian distribution, $\vec{m}\sim\mathcal{N}\left(\hat{\vec{m}_{0}},C_{\vec{m}}\right)$. The distribution is slightly shifted to simulate conditioning on pre-drill information. The vector $\hat{\vec{m}_{0}}$ represents the shifted mean and is defined by

\hat{m_{0}}_{i}=\left\{\begin{array}[]{ll}0,&i=[20..44],\\ 0.25{m_{0}}_{i},&\textrm{otherwise},\end{array}\right.

(16)

where $\vec{m}_{0}$ is the synthetic truth from [15], see Figure 6. We use a diagonal (uncorrelated) covariance matrix with marginal variance $C_{m_{i}}=1$ for all parameters, as in the GAN training. Figure 3 shows six earth model realizations generated from the prior model. From the figure, we observe that this setup provides significant variation in the earth model, which is reflected in the relatively flat mean and standard deviation derived from the full ensemble of 500 realizations, see Figure 5. At the same time, all the realizations are consistent with the chosen channelized geological setting. We emphasize that we use the same prior for both numerical experiments.
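The construction of the shifted prior in (16) can be sketched as below. Two assumptions are made loudly here: the index range $[20..44]$ is read as 1-based and inclusive, and the vector `m_0` is a placeholder, since the actual synthetic truth is not reproduced in this text.

```python
import numpy as np


def shifted_prior_mean(m_0, lo=20, hi=44, shrink=0.25):
    """Prior mean from (16): zero for 1-based indices lo..hi, shrink * m_0 elsewhere."""
    m_hat = shrink * np.asarray(m_0, dtype=float)
    m_hat[lo - 1:hi] = 0.0  # assumed 1-based, inclusive range lo..hi
    return m_hat


def sample_prior(m_hat, n_members, rng):
    """Draw an ensemble with columns m_j ~ N(m_hat, I)."""
    return m_hat[:, None] + rng.standard_normal((m_hat.size, n_members))


# Placeholder for the synthetic truth; the values 1..60 are for illustration only.
m_0 = np.arange(1.0, 61.0)
m_hat = shifted_prior_mean(m_0)
prior_ensemble = sample_prior(m_hat, 500, np.random.default_rng(0))
```

Each column of `prior_ensemble` would then be passed through the GAN generator to obtain one prior earth model realization.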

We conduct two numerical experiments, that differ with respect to the synthetic truth. In the first experiment, the truth model from [15] is applied. In the second experiment, the synthetic truth depicted in Figure 11 is applied. Hence, for experiment 2, the prior is biased towards a wrong model (indicating erroneous pre-drill information) making the data assimilation problem harder.

The numerical study is performed in the same manner for both experiments. Firstly, we sample the true posterior with the MCMC. Here, 8 Markov chains, starting from different initial points, were run for $10^{6}$ steps. At that point, based on assessing the multivariate potential scale reduction factor, the MCMC was found to be converged. Samples from the posterior were then extracted by removing the burn-in phase, and by thinning. For each of the 8 chains, the first half of the chain was removed, and every 100th iteration from the second half of the chain was retained, leaving $4\times 10^{4}$ samples from the posterior distribution. Secondly, we estimate the posterior distribution using the EnRML method introduced in Section 4.1. Due to the fast simulation time, we utilized an ensemble size of $N=500$ , and in addition, we applied the correlation-based localization technique introduced in [42]. Finally, we assess the result from the EnRML by comparison with the samples from the MCMC.
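The burn-in removal and thinning described above can be sketched as below; the function name and the array layout are assumptions. The final line checks the sample count stated in the text without allocating the full chains.

```python
import numpy as np


def posterior_samples(chains, thin=100):
    """Discard the first half of every chain and keep every `thin`-th state.

    chains: array of shape (n_chains, n_steps, n_params).
    Returns an array of shape (n_samples, n_params).
    """
    n_steps = chains.shape[1]
    kept = chains[:, n_steps // 2::thin, :]
    return kept.reshape(-1, chains.shape[2])


# Small stand-in for the 8 chains (1000 steps instead of 10**6).
demo = posterior_samples(np.zeros((8, 1000, 2)))

# Sample count for 8 chains of 10**6 steps, thinned by 100 after burn-in.
n_retained = 8 * len(range(10**6 // 2, 10**6, 100))
```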

5.1 Example 1 – Verification of convergence on an example from literature

The first numerical example tests the sampling capabilities of the EnRML on an example from the literature. The synthetic true log is generated from the true model shown in Figure 6. Hence, for this case, the prior mean is slightly shifted towards the true model.

Figure 7 shows the mean and standard deviation of the posterior resistivity model. Compared to the prior mean and standard deviation (shown in Figure 5) it is clear that the uncertainty around the well is significantly reduced. Moreover, the same reduction is observed for both the EnRML and the MCMC. Apart from slightly sharper boundaries for the MCMC, the EnRML approximation to the posterior mean and standard deviation is almost indistinguishable from the true posterior mean and standard deviation.

A similar conclusion can be drawn from the plots showing the estimated point-wise probability of the sand facies, plotted in Figure 8. Around the well, the EnRML and the MCMC are almost indistinguishable. Ahead of the bit, there are small differences. However, the predictive capability, as described in [15], is also present in the MCMC solution. Hence, this is a true feature of the posterior solution with the GAN-FDNN modeling sequence.

To evaluate the statistical distance between the samples from the EnRML and the samples from MCMC we perform a Kolmogorov-Smirnov two-sample test. This is a non-parametric test of equality for one-dimensional probability distributions. The earth model is a 2D image, and not one-dimensional. Hence, we perform the test on the marginal distribution for each cell. The p-values from the test of the null hypothesis H0 of equal distributions are shown in Figure 9. The significance level of 0.05 is marked by the black contour line. Hence, wherever the p-value exceeds 0.05, the marginal distributions cannot be distinguished and H0 is not rejected.
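A cell-wise two-sample Kolmogorov-Smirnov test can be sketched with NumPy alone, as below. The p-value uses the classical asymptotic Kolmogorov series, which is an assumption made for this illustration; the text does not state which variant was used, and the asymptotic form is only approximate for small samples or heavily tied data. Function names are hypothetical.

```python
import numpy as np


def ks_two_sample(x, y, n_terms=100):
    """Two-sample KS statistic D and its asymptotic p-value."""
    x, y = np.sort(x), np.sort(y)
    pooled = np.concatenate([x, y])
    # Empirical CDFs of both samples evaluated at every pooled observation.
    cdf_x = np.searchsorted(x, pooled, side="right") / x.size
    cdf_y = np.searchsorted(y, pooled, side="right") / y.size
    d = float(np.max(np.abs(cdf_x - cdf_y)))
    lam = np.sqrt(x.size * y.size / (x.size + y.size)) * d
    if lam < 0.2:
        # The alternating series converges poorly here; the p-value is ~1.
        return d, 1.0
    k = np.arange(1, n_terms + 1)
    p = 2.0 * np.sum((-1.0) ** (k - 1) * np.exp(-2.0 * (k * lam) ** 2))
    return d, float(np.clip(p, 0.0, 1.0))


def cellwise_p_values(ens_a, ens_b):
    """ens_a: (n_a, H, W), ens_b: (n_b, H, W). Returns an (H, W) map of p-values."""
    _, H, W = ens_a.shape
    p = np.empty((H, W))
    for i in range(H):
        for j in range(W):
            p[i, j] = ks_two_sample(ens_a[:, i, j], ens_b[:, i, j])[1]
    return p


# Hand-checkable example: the largest gap between the two empirical CDFs is 0.5.
d_small, p_small = ks_two_sample(np.array([1.0, 2.0, 3.0, 4.0]), np.array([3.0, 4.0, 5.0, 6.0]))
```

Cells where the returned p-value is below 0.05 correspond to the regions enclosed by the contour line in the p-value maps.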

As a final evaluation of the results, we plot a selection of marginal and bi-variate elements of the input vector $\vec{m}$. To highlight the effect of the pre-drill information, we selected two elements that were shifted ($m_{1}$ and $m_{52}$) and two that were not shifted ($m_{23}$ and $m_{37}$). In Figure 10 the kernel density estimate of the selected elements is plotted along the diagonal, the scatter plot of the pairwise elements is given in the top corner, while the contours of the 2D kernel density estimate of the pairwise elements are given in the lower corner. The true model is given as a black line in the 1D plots and a black star in the 2D plots.

The numerical experiment shows that the EnRML can successfully approximate the true posterior solution for the GAN-FDNN modeling sequence. The numerical results show a convincing similarity between the exact samples from the posterior, acquired by the MCMC, and the approximate samples, acquired by EnRML. The EnRML provides good approximations of both the posterior earth model and the posterior input vector. Moreover, from inspection of selected elements from $\vec{m}$ it is clear that the posterior distribution can be well approximated by a Gaussian.

5.2 Example 2 – Prediction of a sand-channel sequence

The second numerical example tests the ability of the workflow to predict the targets ahead of measurements in the case where the well is already landed into a sand channel, and when the pre-drill information, embedded in the prior model, is biased toward a wrong solution. The synthetic truth for this example with the depth of detection is shown in Figure 11.

Figure 12 illustrates the posterior mean and standard deviation of the resistivity model. It is clear that conditioning to measurements resolves the sand channel ahead of the well position. Moreover, the EnRML does a reasonably good job in approximating the true posterior, despite the prior being slightly misspecified.

Similarly, the estimated point-wise probability of the sand facies, plotted in Figure 13, shows that the EnRML provides excellent predictive capabilities as the sand facies is correctly forecasted to the right of the geomodel, more than 500 meters ahead of the bit. There are slightly larger differences between the EnRML and MCMC in this example. However, the approximate posterior is still very close to the MCMC posterior, especially around the well.

The measure of statistical distance between the samples from the EnRML and the samples from MCMC indicates similar performance. The p-values from the test of the null hypothesis H0 of equal distributions are shown in Figure 14. The significance level of 0.05 is marked by the black contour line. Hence, wherever the p-value exceeds 0.05, the marginal distributions cannot be distinguished and H0 is not rejected. Compared to example 1, there are more areas where H0 is rejected. This demonstrates that, for the more challenging experiment, there is a larger statistical distance between the approximate posterior from the EnRML and the true posterior.

As a final evaluation of the results, we plot a selection of marginal and bi-variate elements of the input vector $\vec{m}$ . Similar to experiment 1, we highlight the effect of the biased pre-drill information by selecting two elements that were shifted ( $m_{14}$ and $m_{53}$ ) and two that were not shifted ( $m_{20}$ and $m_{342}$ ). In Figure 15 the kernel density estimate of the selected elements is plotted along the diagonal, the scatter plot of the pairwise elements is given in the top corner, while the contours of the 2D Kernel density estimate of the pairwise elements are given in the lower corner. The true model is given as a black line in the 1D plots and a black star for the 2D plots. The effect of the biased prior can be observed in the MCMC results, where the marginal posterior is bi-modal.

The numerical experiment shows that the EnRML can successfully approximate the true posterior solution for the GAN-FDNN modeling sequence, even when the prior model is slightly biased. The numerical results show convincing similarity between the exact samples from the posterior, acquired by the MCMC, and the approximate samples, acquired by EnRML. There is, however, a larger discrepancy than was observed in example 1. From inspection of selected elements of $\vec{m}$ , we observe that the biased prior results in a more non-Gaussian posterior distribution. Despite this, we conclude that the EnRML provides good approximations of both the posterior earth model and the posterior input vector.

6 Conclusions

In this paper, we have demonstrated that two essential parts of an ensemble-based DSS system, the earth model and the simulated extra-deep EM logs, can be substituted with neural networks. For the earth model, we utilize a GAN trained with images from a realistic geological setting; for the simulated logs, we use a forward deep neural network (FDNN) trained on a large set of simulations from a commercial tool. The setup redistributes the computational cost from online to offline calculations, enabling complex earth models and deep-sensing EM logs to be part of real-time ensemble updates.

The numerical results illustrate that the GAN-FDNN modeling sequence provides excellent probabilistic predictions ahead of drilling, capturing both continuous and discrete features when conditioning only to measurements with sideways sensitivity. Moreover, the numerical results show that the computationally efficient EnRML algorithm closely approximates the true Bayesian posterior, as confirmed by comparison with the MCMC algorithm. This conclusion holds even when the prior model is slightly biased towards a wrong solution.

The proposed approach has several benefits. Firstly, a GAN provides great flexibility for defining the geological setting. Here, we consider three different facies, but one can easily imagine including features like faults and pinch-outs as well as smoothly varying properties. Secondly, we only need to condition a few parameters with Gaussian distribution to the measurements, which is very beneficial for the ensemble-based DA approach. Thirdly, since we utilize a neural network model to generate the simulated log, the computational cost of simulating a single ensemble member is on the order of milliseconds. Hence, the proposed approach can utilize a large ensemble for the DA part.

The numerical experiments illustrated that the posterior has predictive capability for both MCMC and the faster EnRML method. Future work will integrate the DA developed in this paper with the decision framework developed in [8], allowing DSS under a much more complex geological setting. Furthermore, the method can be extended to account for model errors present in machine learning approximations in real-time [13].

Acknowledgments

This work is part of the Center for Research-based Innovation DigiWells: Digital Well Center for Value Creation, Competitiveness and Minimum Environmental Footprint (NFR SFI project no. 309589, https://DigiWells.no). The center is a cooperation of NORCE Norwegian Research Centre, the University of Stavanger, the Norwegian University of Science and Technology (NTNU), and the University of Bergen. It is funded by Aker BP, ConocoPhillips, Equinor, Lundin Energy, TotalEnergies, Vår Energi, Wintershall Dea, Kongsberg Digital, Odfjell Drilling, Sekal, and the Research Council of Norway.

Part of the work was performed within the project ’Geosteering for IOR’ (NFR-Petromaks2 project no. 268122) which is funded by the Research Council of Norway, Aker BP, Equinor, Vår Energi and Baker Hughes Norway.

We would like to thank Emerson Roxar for providing an academic licence for RMS 11.1, used for the geo-modelling in this study.

References

[1] A. Al-Fawwaz, O. Al-Yosef, D. Al-Qudaihy, Y. Al-Shobaili, H. Al-Faraj, C. Maeso, I. Roberts, Increased Net to Gross Ratio as the Result of an Advanced Well Placement Process Utilizing Real-Time Density Images, in: IADC/SPE Asia Pacific Drilling Technology Conference and Exhibition, Society of Petroleum Engineers, 2004, pp. 151–160. doi:10.2118/87979-MS.
URL http://www.onepetro.org/doi/10.2118/87979-MS
[2] A. I. Guevara, J. Sandoval, M. Guerrero, C. A. Manrique, Milestone in Production Using Proactive Azimuthal Deep-Resistivity Sensor Combined With Advanced Geosteering Techniques: Tarapoa Block, Ecuador, in: SPE Latin America and Caribbean Petroleum Engineering Conference, Vol. 2, Society of Petroleum Engineers, 2012, pp. 1508–1520. doi:10.2118/153580-MS.
URL http://www.onepetro.org/doi/10.2118/153580-MS
[3] S. Janwadkar, M. Thomas, S. Privott, R. Tehan, L. Carlson, W. Spear, A. Setiadarma, Reservoir-Navigation System and Drilling Technology Maximize Productivity and Drilling Performance in the Granite Wash, US Midcontinent, SPE Drilling & Completion 27 (01) (2012) 22–31. doi:10.2118/140073-PA.
URL https://onepetro.org/DC/article/27/01/22/198159/Reservoir-Navigation-System-and-Drilling
[4] K. Kullawan, R. Bratvold, J. E. Bickel, A decision analytic approach to geosteering operations, SPE Drilling & Completion 29 (01) (2014) 36–46.
[5] G. Evensen, Sequential data assimilation with a nonlinear quasi-geostrophic model using Monte Carlo methods to forecast error statistics, J. Geophys. Res. 99 (C5) (1994) 10143. doi:10.1029/94JC00572.
URL http://doi.wiley.com/10.1029/94JC00572
[6] Y. Chen, R. J. Lorentzen, E. H. Vefring, Optimization of Well Trajectory Under Uncertainty for Proactive Geosteering, SPE Journal 20 (02) (2015) 368–383. doi:10.2118/172497-PA.
URL https://onepetro.org/SJ/article/20/02/368/206467/Optimization-of-Well-Trajectory-Under-Uncertainty
[7] X. Luo, P. Eliasson, S. Alyaev, A. Romdhane, E. Suter, E. Querendez, E. Vefring, An ensemble-based framework for proactive geosteering, SPWLA 56th Annual Logging Symposium 2015.
[8] S. Alyaev, E. Suter, R. B. Bratvold, A. Hong, X. Luo, K. Fossum, A decision support system for multi-target geosteering, Journal of Petroleum Science and Engineering 183 (August) (2019) 106381. arXiv:1903.03933, doi:10.1016/j.petrol.2019.106381.
URL https://doi.org/10.1016/j.petrol.2019.106381
[9] S. Alyaev, S. Ivanova, A. Holsaeter, R. B. Bratvold, M. Bendiksen, An interactive sequential-decision benchmark from geosteering, Applied Computing and Geosciences 12 (2021) 100072.
[10] S. Alyaev, M. Shahriari, D. Pardo, Á. J. Omella, D. S. Larsen, N. Jahani, E. Suter, Modeling extra-deep electromagnetic logs using a deep neural network, Geophysics 86 (3) (2021) E269–E281.
[11] N. Jahani, J. A. Garrido, S. Alyaev, K. Fossum, E. Suter, C. Torres-Verdin, Ensemble-based well log interpretation and uncertainty quantification for geosteering, arXiv preprint arXiv:2103.05384.
[12] K. Fossum, S. Alyaev, E. Suter, G. Tossi, M. Mele, Reducing 3d uncertainty by an ensemble-based geosteering workflow: an example from the goliat field, in: 3rd EAGE/SPE Geosteering Workshop, no. 1, European Association of Geoscientists & Engineers, 2021, pp. 1–5.
[13] M. H. Rammay, S. Alyaev, A. H. Elsheikh, Probabilistic model-error assessment of deep learning proxies: an application to real-time inversion of borehole electromagnetic measurements, Geophysical Journal International.
[14] K. Fossum, S. Alyaev, J. Tveranger, A. Elsheikh, Deep learning for prediction of complex geology ahead of drilling, in: International Conference on Computational Science, Springer, Cham, 2021, pp. 466–479.
[15] S. Alyaev, J. Tveranger, K. Fossum, A. H. Elsheikh, Probabilistic forecasting for geosteering in fluvial successions using a generative adversarial network, First Break 39 (7) (2021) 45–50.
[16] S. Chan, A. H. Elsheikh, Parametric generation of conditional geological realizations using generative neural networks, Computational Geosciences 23 (5) (2019) 925–952. arXiv:1807.05207, doi:10.1007/s10596-019-09850-7.
URL http://link.springer.com/10.1007/s10596-019-09850-7
[17] S. Chan, A. H. Elsheikh, Parametrization of Stochastic Inputs Using Generative Adversarial Networks With Application in Geology, Frontiers in Water 2 (March) (2020) 1–21. arXiv:1904.03677, doi:10.3389/frwa.2020.00005.
URL https://www.frontiersin.org/article/10.3389/frwa.2020.00005/full
[18] E. Laloy, N. Linde, D. Jacques, Approaching geoscientific inverse problems with vector-to-image domain transfer networks, Advances in Water Resources 152 (2021) 103917.
[19] S. M. Razak, B. Jafarpour, Conditioning generative adversarial networks on nonlinear data for subsurface flow model calibration and uncertainty quantification, Computational Geosciences 26 (1) (2022) 29–52.
[20] T. Zhang, X. Ji, A. Zhang, Reconstruction of fluvial reservoirs using multiple-stage concurrent generative adversarial networks, Computational Geosciences 25 (6) (2021) 1983–2004.
[21] M. Arjovsky, S. Chintala, L. Bottou, Wasserstein GAN, arXiv preprint arXiv:1701.07875.
URL http://arxiv.org/abs/1701.07875
[22] A. Radford, L. Metz, S. Chintala, Unsupervised representation learning with deep convolutional generative adversarial networks, arXiv preprint arXiv:1511.06434.
[23] M. J. Pranter, A. C. Hewlett, R. D. Cole, H. Wang, J. Gilman, Fluvial architecture and connectivity of the Williams Fork Formation: use of outcrop analogues for stratigraphic characterization and reservoir modelling, Geological Society, London, Special Publications 387 (1) (2014) 57–83. doi:10.1144/sp387.1.
[24] M. J. Pranter, N. K. Sommer, Static connectivity of fluvial sandstones in a lower coastal-plain setting: An example from the Upper Cretaceous lower Williams Fork Formation, Piceance Basin, Colorado, AAPG Bulletin 95 (6) (2011) 899–923. doi:10.1306/12091010008.
[25] S. M. Trampush, E. A. Hajek, K. M. Straub, E. P. Chamberlin, Identifying autogenic sedimentation in fluvial-deltaic stratigraphy: Evaluating the effect of outcrop-quality data on the compensation statistic, Journal of Geophysical Research: Earth Surface 122 (1) (2017) 91–113. doi:10.1002/2016JF004067.
[26] R. D. Cole, S. Cumella, Sand-body architecture in the lower Williams Fork Formation (Upper Cretaceous), Coal Canyon, Colorado, with comparison to the Piceance Basin subsurface, Mountain Geologist 42 (2005) 85–107.
[27] H. Panjaitan, Sand-body dimensions in outcrop and subsurface, Lower Williams Fork Formation, Piceance Basin, Colorado, Master’s thesis, Colorado School of Mines (2006).
[28] M. J. Pranter, R. D. Cole, H. Panjaitan, N. K. Sommer, Sandstone-body dimensions in a lower coastal-plain depositional setting: Lower Williams Fork formation, Coal Canyon, Piceance Basin, Colorado, American Association of Petroleum Geologists Bulletin 93 (10) (2009) 1379–1401. doi:10.1306/06240908173.
[29] M. Sviridov, A. Mosin, Y. Antonov, M. Nikitenko, S. Martakov, M. B. Rabinovich, Baker Hughes, BP, New Software for Processing of LWD Extradeep Resistivity and Azimuthal Resistivity Data, SPE Reservoir Evaluation & Engineering 17 (May). doi:10.2118/160257-PA.
[30] D. S. Oliver, K. Fossum, T. Bhakta, I. Sandø, G. Nævdal, R. J. Lorentzen, 4D seismic history matching, Journal of Petroleum Science and Engineering 207 (2021) 109119. doi:10.1016/j.petrol.2021.109119.
URL https://linkinghub.elsevier.com/retrieve/pii/S0920410521007750
[31] Y. Gu, D. S. Oliver, An Iterative Ensemble Kalman Filter for Multiphase Fluid Flow Data Assimilation, SPE Journal 12 (4) (2007) 438–46. doi:10.2118/108438-PA.
URL http://www.spe.org/ejournals/jsp/journalapp.jsp?pageType=Preview&jid=ESJ&mid=SPE-108438-PA&pdfChronicleId=090147628014cce3
[32] Y. Chen, D. S. Oliver, Levenberg–Marquardt forms of the iterative ensemble smoother for efficient history matching and uncertainty quantification, Comput. Geosci. doi:10.1007/s10596-013-9351-5.
URL http://link.springer.com/10.1007/s10596-013-9351-5
[33] D. S. Oliver, A. C. Reynolds, N. Liu, Inverse Theory for Petroleum Reservoir Characterization and History Matching, Cambridge University Press, 2008.
URL http://books.google.com/books?hl=en&lr=&id=oPuy3OfbtfIC&oi=fnd&pg=PR11&dq=Inverse+Theory+for+Petroleum+Reservoir+Charaterization+and+History+Matching&ots=9fIWq86eCn&sig=nrnDTbgY36_iqIEemjzFNw9Xu54
[34] G. H. Golub, C. F. Van Loan, Matrix Computations, Johns Hopkins series in the mathematical sciences, The Johns Hopkins University Press, Baltimore, 1983.
[35] N. Metropolis, A. W. Rosenbluth, M. N. Rosenbluth, A. H. Teller, E. Teller, Equation of State Calculations by Fast Computing Machines, The Journal of Chemical Physics 21 (6) (1953) 1087. doi:10.1063/1.1699114.
URL http://link.aip.org/link/JCPSA6/v21/i6/p1087/s1&Agg=doi
[36] W. K. Hastings, Monte Carlo Sampling Methods Using Markov Chains and Their Applications, Biometrika 57 (1) (1970) 97. doi:10.2307/2334940.
URL http://www.jstor.org/stable/2334940?origin=crossref
[37] S. P. Brooks, A. Gelman, G. L. Jones, X.-L. Meng (Eds.), Handbook of Markov Chain Monte Carlo, Chapman and Hall / CRC, 2011.
[38] S. Dasgupta, D. Hsu, On-Line Estimation with the Multivariate Gaussian Distribution, in: Learning Theory, Vol. 4539 LNAI, Springer Berlin Heidelberg, Berlin, Heidelberg, 2007, pp. 278–292.
URL https://doi.org/10.1007/978-3-540-72927-3_21
[39] K. Fossum, T. Mannseth, Parameter sampling capabilities of sequential and simultaneous data assimilation: II. Statistical analysis of numerical results, Inverse Problems 30 (11) (2014) 114003. doi:10.1088/0266-5611/30/11/114003.
URL http://stacks.iop.org/0266-5611/30/i=11/a=114003?key=crossref.2e2654e480ea9cb9eab6d8c245d1dc46
[40] K. Fossum, T. Mannseth, Assessment of ordered sequential data assimilation, Computational Geosciences 19 (4). doi:10.1007/s10596-015-9492-9.
URL http://link.springer.com/10.1007/s10596-015-9492-9
[41] S. P. Brooks, A. Gelman, General Methods for Monitoring Convergence of Iterative Simulations, J. Comput. Graph. Stat. 7 (4) (1998) 434. doi:10.2307/1390675.
URL http://www.jstor.org/stable/1390675?origin=crossref
[42] X. Luo, T. Bhakta, G. Nævdal, Correlation-Based Adaptive Localization With Applications to Ensemble-Based 4D-Seismic History Matching, SPE Journal 23 (2) (2018) 396–427. doi:10.2118/185936-PA.