This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

\floatsetup

[figure]style=plain,subcapbesideposition=top

Jet mixing enhancement with Bayesian optimization, deep learning, and persistent data topology

Yiqing Li\aff1    Bernd R. Noack\aff1,2\corresp bernd.noack@hit.edu.cn    Tianyu Wang\aff1    Guy Y. Cornejo Maceda\aff1\corresp yoslan@hit.edu.cn   
Ethan Pickering\aff3
   Tamir Shaqarin\aff4    and Artur Tyliszczak\aff5 \aff1 Chair of Artificial Intelligence and Aerodynamics, School of Mechanical Engineering and Automation, Harbin Institute of Technology, 518055 Shenzhen, P. R. China \aff2 Guangdong Provincial Key Laboratory of Intelligent Morphing Mechanisms and Adaptive Robotics, Harbin Institute of Technology, 518055 Shenzhen, P. R. China \aff3 Independent Scholar \aff4 Department of Mechanical Engineering, Tafila Technical University, 66110 Tafila, Jordan \aff5 Faculty of Mechanical Engineering and Computer Science, Czestochowa University of Technology, 42-201 Czestochowa, Poland
Abstract

We optimize the jet mixing using large eddy simulations (LES) at a Reynolds number of 30003000. Key methodological enablers consist of Bayesian optimization, a surrogate model enhanced by deep learning, and persistent data topology for physical interpretation. The mixing performance is characterized by an equivalent jet radius (ReqR_{\rm eq}) derived from the streamwise velocity in a plane located 88 diameters downstream. The optimization is performed in a 22-dimensional actuation space that comprises most known excitations. The plant benefits from a 22-dimensional actuation space that comprises most known excitations. This search space parameterizes distributed actuation imposed on the bulk flow and at the periphery of the nozzle in the streamwise and radial directions. The momentum flux measures the energy input of the actuation. The optimization quadruples the jet radius ReqR_{\rm eq} with a 77-armed blooming jet after around 570570 evaluations. The control input requires 2%2\% momentum flux of the main flow, which is one order of magnitude lower than an ad hoc dual-mode excitation. Intriguingly, a pronounced suboptimum in the search space is associated with a double-helix jet, a new flow pattern. This jet pattern results in a mixing improvement comparable to the blooming jet. A state-of-the-art Bayesian optimization converges towards this double helix solution. The learning is accelerated and converges to another better optimum by including surrogate model trained along the optimization. Persistent data topology extracts the global and many local minima in the actuation space. These minima can be identified with flow patterns beneficial to the mixing.

1 Introduction

Jet flows are ubiquitous in nature and technology and belong to a handful of configurations described in any fluid mechanics textbook. Jet mixing plays a pivotal role in many engineering applications, e.g. fuel injection in engines, combustor cooling, chemical mixing, printing, and noise generation (Jordan & Colonius, 2013), just to name a few. Hence, jet mixing optimization plays an important part in academic research and engineering applications.

Laminar jets are affected by the Kelvin-Helmholtz instability of the initial shear layer (Ball et al., 2012). The jet shear layer rolls up into pronounced vortex rings. Excitation at the nozzle exit provides authority over the vortex formation, e.g. allows to speed up the vortex formation, to promote or mitigate vortex pairing, and to influence the far-field coherent structures. Vortex pairing in streamwise direction promotes larger mixing regions observed as the orderly ‘vortical puffs’ with axisymmetric excitation (Crow & Champagne, 1971). More importantly, a significant increase in the spreading angle can be obtained by vortex splitting evolving along several branches (Lee & Reynolds, 1985).

The actuation may promote axisymmetric, helical, dual-mode, flapping, and bifurcating dynamics. In particular, acoustic excitation of bulk affects the jet spreading via controlled vortex pairing (Crow & Champagne, 1971; Hussain & Zaman, 1980). Jet mixing is more effectively augmented with helical forcing (Mankbadi & Liu, 1981; Corke & Kusek, 1993). Bifurcating, trifurcating, and blooming jets appear with a spreading angle up to 8080^{\circ} when axisymmetric and helical modes are combined (dual-mode) with different frequency ratios (Lee & Reynolds, 1985). The flapping mode is composed of counter-rotating helical modes, and the combination of axisymmetric and flapping modes is referred to as bifurcating mode. Both the flapping and the bifurcating mode can produce bifurcating jets with an impressive jet spreading (Parekh, 1989; Danaila & Boersma, 2000; da Silva & Métais, 2002).

The world of multiple-mode actuation for mixing optimization holds considerable promise and is still to be explored. The radial excitation with three flapping modes, including 99 parameters, is optimized by Evolution Strategies (Koumoutsakos et al., 2001). Only one dominant flapping mode remains after 400400 Direct Numerical Simulations (DNS) at Re=500Re=500. In the sequel, the bifurcating mode using axial forcing is optimized for ReRe up to 15001500 using the amplitudes and two Strouhal numbers as control parameters (Hilgers & Boersma, 2001). In an adjoint-based optimization study at Re=2000Re=2000, radial forcing is found to be more effective than axial actuation in dual-mode forcing (Shaabani-Ardali et al., 2020). In an experiment at Re=8000Re=8000, jet mixing is manipulated with periodic operation of six radial minijets. In 200 evaluations, Bayesian optimization minimizes a streamwise centerline velocity tuning 12 parameters, the frequency, amplitudes, and phase differences (Blanchard et al., 2021). The optimal mixing is facilitated by combining flapping and helical forcing, like machine learning control for the same configuration (Zhou et al., 2020). Moreover, the control performance also benefits from the deployment of more actuators and richer actuation space. For example, an intelligent nozzle with eighteen electromagnetic flap actuators (Suzuki et al., 1999), and 8-channel localized arc filament plasma actuators (Utkin et al., 2006) have been developed for jet control.

In flow control, machine learning techniques have recently gained attention due to their successful applications (Duriez et al., 2017; Brunton et al., 2020). Examples are genetic programming and variants (Cornejo Maceda et al., 2021), reinforcement learning (Rabault et al., 2019; Nair & Goza, 2023; Sonoda et al., 2023; Vignon et al., 2023a, b; Xu & Zhang, 2023; Guastoni et al., 2023), and Bayesian optimization (Blanchard et al., 2021). These methods encode the input-output relations in various forms without requiring prior knowledge. Function regression solvers like genetic programming and deep reinforcement learning can provide a large model capacity for exploration. However, deriving the optimal solution in finite time can not be guaranteed. Alternatively, a predefined control law can be tuned to near-optimal by parameter optimizers like Bayesian optimization (BO), genetic algorithm (GA) and particle swarm optimization (PSO), to name a few. Pino et al. (2023) compares genetic programming, deep reinforcement learning, and BO in increasingly complex control problems. The authors highlight BO’s potential to balance both sample efficiency and the performance of the final solution. With the recent advances in the design of the acquisition function (Blanchard & Sapsis, 2021) and the surrogate models (Pickering et al., 2022), BO is moving forward in conquering high-dimensional search spaces. This work leverages these advancements to optimize and understand high-dimensional jet forcing modes.

The present study builds on a jet mixing plant employing Large Eddy Simulations (LES) and a rich streamwise and radial actuation space at the nozzle exit. This plant can reproduce virtually all previously considered actuated jet dynamics as elements of a high-dimensional search space. High-dimensional optimization constitutes a challenge that is tackled by a Bayesian optimizer enhanced by deep learning.

The paper is organized as follows. The configuration, actuation and metrics are defined in § 2. The optimizer and numerical solver are presented in § 3. We discuss the learning process and the optimized solutions in § 4. Finally, § 5 concludes the findings with outlook.

2 Setup and problem definition

2.1 Configuration and actuation

Refer to caption
Figure 1: Problem setup including the jet configuration with the designed actuation (left) and the deep learning enhanced Bayesian optimization (right). 𝐛\mathbf{b} is the actuation parameter. 𝐮^b\hat{\mathbf{u}}_{b} is the actuation command at the 9 red points indicated in the dashed box.

The configuration is a jet flow exiting a circular nozzle of diameter DD in figure 1. The flow is described in a Cartesian coordinate system (x,y,z)(x,y,z) where xx represents the streamwise direction and the origin coincides with the center of the nozzle. The computational domain starts from the exit and covers a rectangular region with size 12D×16D×12D12D\times 16D\times 12D. The actuation 𝒖b(r,θ,t)\bm{u}_{b}(r,\theta,t) is imposed with the mean streamwise velocity um(r)u_{m}(r) as the inlet velocity profile u(r,t)u(r,t),

𝒖(r,t)=um(r)𝒆x+𝒖b(r,θ,t),\bm{u}(r,t)=u_{m}(r)\bm{e}_{x}+\bm{u}_{b}(r,\theta,t),

where rr measures the radial distance from the centerline, and θ\theta is the azimuthal angle. The mean streamwise component has a hyperbolic-tangent profile,

um(r)=Uj+uc2Ujuc2tanh(14Rδ2(rRRr)),u_{m}(r)=\frac{U_{\rm j}+u_{c}}{2}-\frac{U_{\rm j}-u_{c}}{2}\tanh\left(\frac{1}{4}\frac{R}{\delta_{2}}\left(\frac{r}{R}-\frac{R}{r}\right)\right),

where UjU_{\rm j} is the jet centerline velocity, uc=0.03Uju_{c}=0.03U_{\rm j} is the co-flow velocity to mimic a natural suction process, and δ2=R/20\delta_{2}=R/20 is the momentum boundary layer thickness of the initial shear layer. At the side boundaries, we impose the vertical velocity equals ucu_{c}, and the remaining velocity components equal zero. The pressure at the side boundaries is computed from the Neumann condition 𝐧p=0{\mathbf{n}}\cdot\nabla p=0 with 𝐧{\mathbf{n}} as the vector normal to the boundary. At the outlet plane, the velocity is computed from a convective boundary condition 𝒖/t+V~C𝒖/n=0\partial\bm{u}/\partial t+\tilde{V}_{C}\partial\bm{u}/\partial n=0, where V~C\tilde{V}_{C} is the instantaneous convection velocity VCV_{C} limited to positive values: V~C=max(VC,0)\tilde{V}_{C}=\max(V_{C},0). VCV_{C} is the velocity averaged over the outlet plane. The pressure at the outflow equals zero. Such defined outflow boundary condition ensures stable simulations and has negligible impact on the turbulent flow structures leaving the computational domain (Tyliszczak & Geurts, 2014; Tyliszczak, 2018).

As introduced in § 1, the jet control techniques for mixing enhancement are usually designed according to the instability mode, described by their azimuthal wavenumber at order 0 (axisymmetric mode) or 11 (helical mode). The perturbation is either axial or radial. We combine both axial and radial perturbation and define the actuation 𝒖b\bm{u}_{b} with a general expression of θ\theta and tt, without assumption on the forcing mode. Therefore, the term 𝒖b\bm{u}_{b} includes an axisymmetric streamwise bulk forcing uα(r,t)u^{\alpha}(r,t), and a peripheral forcing with the streamwise component uβ(r,θ,t)u^{\beta}(r,\theta,t), and the radial uγ(r,θ,t)u^{\gamma}(r,\theta,t):

𝒖b=(uα+uβ)𝒆x+uγ𝒆r.\bm{u}_{b}=\left(u^{\alpha}+u^{\beta}\right)\bm{e}_{x}+u^{\gamma}\bm{e}_{r}. (1)

The forcing components are the product of a perturbation fm(θ,t)f^{m}(\theta,t) and a radial profile gm(r)g^{m}(r): um(r,θ,t)=fm(θ,t)gm(r)u^{m}(r,\theta,t)=f^{m}(\theta,t)g^{m}(r), m=α,β,γm=\alpha,\beta,\gamma with gα(r)=1g^{\alpha}(r)=1 for rRr\leq R and 0 for r>Rr>R, and gβ(r)=gγ(r)=exp(1000(Rr)2.5)g^{\beta}(r)=g^{\gamma}(r)=\exp(-1000(R-r)^{2.5}). The profiles of the three forcing components are depicted in figure 1. The perturbation terms fm(θ,t)f^{m}(\theta,t) are defined as the sum and product of space- and time-harmonic functions:

fα(t)\displaystyle f^{\alpha}(t) =\displaystyle= i=LLαiΘi(ωαt)\displaystyle\sum_{i=-L}^{L}\alpha_{i}\>\Theta_{i}(\omega_{\alpha}t) (2)
fβ(θ,t)\displaystyle f^{\beta}(\theta,t) =\displaystyle= i,j=L,,LβijΘi(θ)Θj(ωβt)\displaystyle\sum_{i,j=-L,\ldots,L}\beta_{ij}\>\Theta_{i}(\theta)\>\Theta_{j}(\omega_{\beta}t) (3)
fγ(θ,t)\displaystyle f^{\gamma}(\theta,t) =\displaystyle= i,j=L,,LγijΘi(θ)Θj(ωγt),\displaystyle\sum_{i,j=-L,\ldots,L}\gamma_{ij}\>\Theta_{i}(\theta)\>\Theta_{j}(\omega_{\gamma}t), (4)

where αi\alpha_{i}, βij\beta_{ij}, γij\gamma_{ij} and ωα\omega_{\alpha}, ωβ\omega_{\beta}, ωγ\omega_{\gamma} are the actuation amplitudes and angular frequencies, respectively. Θi(ϕ)\Theta_{i}(\phi) is the harmonic function basis defined as: Θi(ϕ)=sin(iϕ)\Theta_{i}(\phi)=\sin(i\phi) for i>0i>0, Θi(ϕ)=1\Theta_{i}(\phi)=1 for i=0i=0, and Θi(ϕ)=cos(iϕ)\Theta_{i}(\phi)=\cos(i\phi) for i<0i<0. The forcing ansatz can approximate any periodic function of θ\theta and tt as the expansion order increases. In this study, focus is placed on the first order expansion (L=1L=1) of (2-4). Thus, the control law is parameterized by a 2222-dimensional vector 𝐛\mathbf{b},

𝐛=[Stα,Stβ,Stγ,α1,{βij}i,j=1,0,1,{γij}i,j=1,0,1],\mathbf{b}=\left[St_{\alpha},St_{\beta},St_{\gamma},\alpha_{1},\left\{\beta_{ij}\right\}_{i,j=-1,0,-1},\left\{\gamma_{ij}\right\}_{i,j=-1,0,-1}\right]^{\intercal}\in\mathcal{B}, (5)

where the Strouhal numbers Stm=ωmD/2πUj,m=α,β,γSt_{m}=\omega_{m}D/2\pi U_{\rm j},m=\alpha,\beta,\gamma. Note that α0\alpha_{0} is set to 0 as a constant bulk flow can be incorporated into the steady profile. In addition, α1=0\alpha_{-1}=0 can be assumed by a translation in time. The range of StmSt_{m} is set as [0.1,1][0.1,1] to include the Strouhal number of the preferred mode at Stp=0.30.64St_{p}=0.3-0.64 (Crow & Champagne, 1971; Gutmark & Ho, 1983; Sadeghi & Pollard, 2012), and the range of axisymmetric mode Stα[0.15,0.8]St_{\alpha}\in[0.15,0.8] where bifurcating and blooming jets are observed (Lee & Reynolds, 1985; Parekh, 1989; Tyliszczak, 2018). The actuation amplitudes are limited to 0.1α1,βij,γij0.1-0.1\leq\alpha_{1},\beta_{ij},\gamma_{ij}\leq 0.1, lower than 0.150.15 used by Danaila & Boersma (2000); Gohil et al. (2015); Tyliszczak (2018), and 0.50.5 by Koumoutsakos et al. (2001).

This high-dimensional search space allows the actuation to emulate various forcing modes, such as axisymmetric, helical, flapping, bifurcating, dual-mode, and harmonic waves discussed in §  1. In short, the forcing can be axisymmetric, statistically axisymmetric, and non-axisymmetric.

2.2 Mixing and actuation performance

The mixing process in turbulent jets is typically characterized by quantities such as the decay of centerline velocity and its fluctuations, or the entrainment (Nathan et al., 2006). However, these quantities only measure the local statistics and cannot reflect the mixing process of non-axisymmetric jets, such as bifurcating jets and asymmetric jets. For example, the velocity in these jets may locally drop to zero due to a jet-splitting phenomenon, rather than as a result of enhanced mixing. The entrainment is rather a measure of the amount of surrounding fluid entrained into the jet vicinity, without guaranteeing that it mixes with the jet. In asymmetric jets, the amount of fluid flowing towards the jet may characterize significant radial non-uniformity not revealed by the entrainment. Considering the non-axisymmetric forcing in search spaces, we define a new metric, the equivalent mixing radius ReqR_{\rm eq}, to estimate the spatial uniformity of the streamwise velocity. ReqR_{\rm eq} is defined as the normalized streamwise velocity variance computed at a given yzy-z cross-section:

Req=2σ,σ2=ϱ(y,z)[(yyc)2+(zzc)2]𝑑y𝑑zϱ(y,z)𝑑y𝑑z,R_{\rm eq}=\sqrt{2}\sigma,~{}\sigma^{2}=\frac{\int\!\!\!\int\!\varrho(y,z)\>\left[(y-y_{c})^{2}+(z-z_{c})^{2}\right]dydz}{\int\!\!\!\int\varrho(y,z)dydz}, (6)

with ϱ(y,z)=(u(x=X0,y,z)tuc)/(Ujuc)\varrho(y,z)=(\langle{u(x=X_{0},y,z)}\rangle_{t}-u_{c})/(U_{\rm j}-u_{c}), X0=8DX_{0}=8D, and (yc,zc)(y_{c},z_{c}) the jet center, as an analog to the center of mass: yc=yϱ(y,z)𝑑y𝑑z/ϱ(y,z)𝑑y𝑑zy_{c}=\int\!\!\!\int y\varrho(y,z)dydz/\int\!\!\!\int\varrho(y,z)dydz and zc=zϱ(y,z)𝑑y𝑑z/ϱ(y,z)𝑑y𝑑zz_{c}=\int\!\!\!\int z\varrho(y,z)dydz/\int\!\!\!\int\varrho(y,z)dydz. The 2\sqrt{2} coefficient is chosen so that the equivalent mixing radius of a top flat jet flow of radius RR is RR.

The amplitude and mass flow rate have been adopted to evaluate the control input. Inspired by Parekh (1989), we define the momentum flux PP of the actuation to estimate the energy input from a practical perspective. The momentum flux is time-averaged and normalized by the jet axial momentum flux at the inlet:

P=ub2𝑑AtπR2Uj2.P=\frac{\langle{\int\!\!\!\int u^{2}_{b}dA}\rangle_{t}}{\pi\>R^{2}\>U_{\rm j}^{2}}. (7)

3 Methodology

3.1 Deep learning-enhanced Bayesian optimization

The optimization problem to maximize the mixing as a response to the actuation input parameterized by 𝐛\mathbf{b} is formulated as

𝐛=argmin𝐛J(𝐛),\mathbf{b}^{*}=\underset{\mathbf{b}\in\mathcal{B}}{\arg\min}\;J(\mathbf{b}), (8)

where =[0.1,1]3×[0.1,0.1]1922\mathcal{B}=[0.1,1]^{3}\times[-0.1,0.1]^{19}\subset\mathbb{R}^{22}. Cost function JJ is defined as the inverse of the equivalent mixing radius and normalized by the unforced case, J=Req,0/ReqJ=R_{\rm eq,0}/R_{\rm eq}. Better mixing with large ReqR_{\rm eq} is related to the decrease of JJ. The assumed optimization goal leads to the spatial uniformity of u(x=X0,y,z)t\langle{u(x=X_{0},y,z)}\rangle_{t}.

To optimize this 2222-dimensional search space, we employ techniques inspired by Bayesian optimization (BO) (Williams & Rasmussen, 2006). BO has shown to be advantageous in optimizing expensive black-box functions by systematically reducing uncertainty in the black-box mapping and incorporating prior assumptions of the cost function (Shahriari et al., 2015). Through a sequential approach, BO identifies the next actuation to evaluate, or “data point” to acquire, for the purpose of finding the optimum. This is generally achieved via a surrogate model trained on all the queried data and an acquisition function (Williams & Rasmussen, 2006). A sketch of the method used in this study is shown in figure 1. The algorithm is initialized with the evaluation of a set 𝒟0\mathcal{D}_{0} of N0N_{0} actuation vectors in \mathcal{B} generated by Latin hypercube sampling. N0N_{0} is equal to ND+1N_{D}+1 with NDN_{D} the dimension of the search space \mathcal{B}. We recall that for this study, ND=22N_{D}=22. 𝒟0\mathcal{D}_{0} includes all the evaluated parameter vectors and their cost {𝐛i,Ji}i=1N0\{\mathbf{b}_{i},J_{i}\}_{i=1}^{N_{0}}. A surrogate model J~\tilde{J} is trained on the available data to approximate the latent objective function JJ. After the initialization, the algorithm explores the search space \mathcal{B} one new query at a time. At each iteration, BO determines the optimal actuation to implement next by minimizing an acquisition function a(𝐛)a(\mathbf{b}). The acquisition function leverages the surrogate model J~\tilde{J} and available data 𝒟n1\mathcal{D}_{n-1} to guide the data selection in the search space. After each query, the data set is enriched by the new data point {𝐛n,Jn}\{\mathbf{b}_{n},J_{n}\} into 𝒟n\mathcal{D}_{n} to further refine the surrogate model. When the query budget is met, the algorithm ends with the best design vector 𝐛\mathbf{b}^{*} recorded during the optimization.

The two key elements in BO are the choice of the surrogate model and the sequential strategy (Blanchard & Sapsis, 2021). We focus on the former for a better estimation of the high-dimensional flow control system. Gaussian processes (GP) serve as a successful surrogate model in moderate dimensions and can provide closed-form solutions with the posterior distribution. However, the computation of the posterior costs 𝒪(n3)\mathcal{O}(n^{3}) where nn is the number of observations due to the inverse of the covariance matrix. The number of evaluations required to effectively cover the domain grows exponentially with the dimensionality. This makes GP difficult to scale to large training sets for high-dimensional problems. Recently, the deep operator network (DeepONet) has shown small generalization error for systems where functions are acted upon by an operator (Lu et al., 2021). Different from GP that parameterizes the input, DeepONet can map input functions, which are then transformed by an operator, to an output function or scalar with improved accuracy. This means that DeepONet does not fall victim to the scaling difficulties of GP when training. Therefore, DeepONet is capable of learning from infinite-dimensional functions. Empirically, DeepONet’s utility as an operator surrogate model for Bayesian-inspired experimental design has been shown to significantly outperform GP in several infinite-dimensional systems that exhibit extreme events in Pickering et al. (2022), ranging from stochastic pandemic spikes, to catastrophic structural failure, to rogue wave identification. Through a study of the Bayesian optimizer based on GP (BO) and DeepONet (BO-DeepONet) for the defined 22-dimensional problem (see §4.2), we propose a new algorithm, deep learning-enhanced Bayesian optimization (BO-DL). By incorporating both GP and DeepONet as the surrogate model, BO-DL presents a better explorative capability and faster convergence. We alternate between DeepONet and GP every 1010 iterations to query the next sample. The 1010 is chosen empirically to balance the characteristics of the two models and combine their advantages. If the interval is too long, such as 100100, the models will be more independent than interacting with each other. On the other hand, if the interval is too short, such as 11, the exploitation may be interrupted by uncertainty due to the exchange of models. In GP implementations, the parameter space of a stochastic process is used for both regression and searching. Instead, DeepONet performs regression in the functional space, leveraging the typically disregarded basis functions associated with the parameterization. Here, the function 𝐮^b\hat{\mathbf{u}}_{b} is designed as the actuation command at 99 points located at the jet exit, including the centerline and 88 equidistant points on the periphery.

The acquisition function employed is the likelihood-weighted lower confidence bound proposed by Blanchard & Sapsis (2021), with superiority in finding rare extreme behaviors,

a(𝐛)=μ(𝐛)κσ(𝐛)w(𝐛),w(𝐛)=p𝐛(𝐛)pμ(μ(𝐛)).a(\mathbf{b})=\mu(\mathbf{b})-\kappa\sigma(\mathbf{b})w(\mathbf{b}),~{}w(\mathbf{b})=\frac{p_{\mathbf{b}}(\mathbf{b})}{p_{\mu}(\mu(\mathbf{b}))}. (9)

Here, κ\kappa balances exploration (large κ\kappa) and exploitation (small κ\kappa), and is chosen as 11. The mean model μ(𝐛)\mu(\mathbf{b}) and standard variance model σ(𝐛)\sigma(\mathbf{b}) are estimated by the mean and variance over a 22-ensemble of trained DeepONet. For the case of GP, these can be calculated in closed form using standard expressions from GP regression (Williams & Rasmussen, 2006). The likelihood ratio w(𝐛)w(\mathbf{b}) measures relevance by weighting the uncertainty of the point (the input density p𝐛p_{\mathbf{b}}) against its expected impact on the cost function (the output density pμp_{\mu}).

3.2 Governing equations and numerical solver

We consider an incompressible flow described by the Navier-Stokes equations in the framework of LES:

u¯jxj=0,\frac{\partial\overline{{u}}_{j}}{\partial x_{j}}=0, (10)
u¯it+u¯iu¯jxj=1ρp¯xi+τijxj+τijfxj,\frac{\partial\overline{u}_{i}}{\partial t}+\frac{\partial{\overline{u}_{i}\overline{u}_{j}}}{\partial x_{j}}=-\frac{1}{{\rho}}\frac{\partial\bar{p}}{\partial x_{i}}+\frac{\partial\tau_{ij}}{\partial x_{j}}+\frac{\partial\tau_{ij}^{f}}{\partial x_{j}}, (11)

where ui{u}_{i} represents the velocity components, pp denotes pressure and ρ\rho is density. The overbar denotes spatially filtered variables, f¯(𝐱,t)=Ωf(𝐱,t)𝒢(𝐱𝐱,Δ)𝑑𝐱\bar{f}(\mathbf{x},t)=\int_{\Omega}f(\mathbf{x}^{\prime},t)\mathcal{G}(\mathbf{x}-\mathbf{x}^{\prime},\Delta)d\mathbf{x}^{\prime} and 𝒢\mathcal{G} is the filter function that fulfils the condition Ω𝒢(𝐱,Δ)𝑑𝐱=1\int_{\Omega}\mathcal{G}(\mathbf{x},\Delta)d\mathbf{x}=1. A local filter width equals the cube root of the computational cell volume, Δ=(ΔxΔyΔz)1/3\Delta=(\Delta x\Delta y\Delta z)^{1/3}. The stress tensor includes the large-scale term τij\tau_{ij} and the sub-grid term τijf\tau_{ij}^{f} defined as

τij=2νSij,τijf=(u¯iu¯juiuj¯),\tau_{ij}=2\nu S_{ij},\quad\tau_{ij}^{f}=\left({\overline{u}_{i}\overline{u}_{j}}-\overline{{u}_{i}{u}_{j}}\right), (12)

where ν\nu is the kinematic viscosity and Sij=12(u¯ixj+u¯jxi)S_{ij}=\frac{1}{2}\left(\frac{\partial\bar{u}_{i}}{\partial x_{j}}+\frac{\partial\bar{u}_{j}}{\partial x_{i}}\right) is the rate of strain tensor of the resolved velocity field. In this work, the sub-filter tensor is modeled by an eddy-viscosity model with τijf=2νtSij+τkkfδij/3\tau_{ij}^{f}=2\nu_{t}S_{ij}+\tau^{f}_{kk}\delta_{ij}/3. The diagonal terms τkkf\tau^{f}_{kk} are added to the pressure, resulting in the so-called modified pressure P¯=p¯ρτkkfδij/3\bar{P}=\bar{p}-\rho\tau^{f}_{kk}\delta_{ij}/3. The Vreman subgrid-scale model is used for the low computational cost and very good accuracy in simulating jet flows (Wawrzak et al., 2015; Boguslawski et al., 2019).

The simulations are conducted with an in-house high-order LES solver SAILOR. The solution algorithm is based on the projection method for the pressure-velocity coupling for half-staggered meshes where the pressure nodes are shifted half a cell size from the velocity nodes (Tyliszczak, 2014, 2015a). A predictor-corrector method (Adams-Bashforth/Adams-Moulton) is applied for the time integration. Derivative approximations and interpolation on staggered nodes are defined using 6th and 10th order compact difference formulas. The SAILOR solver was used in the jet studies with similar dynamic scales as the present work, such as jets undergoing laminar/turbulent transition (Boguslawski et al., 2019) and excited jets (Tyliszczak & Geurts, 2014; Tyliszczak, 2018). The applied high-order discretization schemes led to grid-independent results already at relatively coarse meshes.

4 Results

4.1 LES validation with the bifurcating jet

The Reynolds number \Rey=UjD/ν\Rey=U_{\rm j}D/\nu is decided as 30003000 for this study. This allows for the use of a relatively coarse computational mesh to obtain reliable and fast simulations as the database. Two meshes are employed in this study. A coarse mesh with 80×160×8080\times 160\times 80 nodes is used for the learning process and a refined mesh with 192×336×192192\times 336\times 192 nodes for the validation and flow analysis of selected cases. The mesh points are compacted in the axial direction towards the inlet using an exponential function and radially towards the jet axis by a tangent hyperbolic function. In the region 1.2D<y,z<1.2D-1.2D<y,z<1.2D, the mesh spacing is almost uniform and equal to Δy=Δz=0.05D\Delta y=\Delta z=0.05D (46 nodes) on the coarse mesh and Δy=Δz=0.02D\Delta y=\Delta z=0.02D (115 nodes) on the dense one. In the axial direction, the sizes of the cells in the direct inlet vicinity are Δx=0.067D\Delta x=0.067D and Δx=0.032D\Delta x=0.032D for the coarse and dense mesh, respectively. The time step varies according to the Courant-Friedrichs-Lewy condition, with the CFL number equal to 0.5. The jet impulsively injects into quiescent flow and becomes fully developed after 100D/Uj100D/U_{\rm j} time units. The time-averaging procedure then starts and lasts for 500D/Uj500D/U_{\rm j} time units for the statistics to converge. A single simulation on the coarse mesh takes 2020 CPU-hours. The whole optimization process with 10001000 converged simulations lasts around 2121 days, using 40 CPUs of an AMD EPYC 7742 (2.25GHz) processor. On the dense mesh, the single simulation run takes approximately 576576 CPU-hours. The parallel computation is carried out with the Open MPI interface.

As presented in 3.2, the numerical code employed has been well validated against experimental and numerical data for a series of studies of jet dynamics and control. Here, the code with the assumed perturbation design (2-4) is verified to obtain a well-documented flow pattern of an excited jet, the bifurcating jet at \Rey=4300\Rey=4300 (Lee & Reynolds, 1985). The well-known jet is shown in figure 2, produced by the combined axisymmetric and flapping excitation at the frequencies 0.4<Stα<0.60.4<St_{\alpha}<0.6 and Stβ=Stγ=Stα/2St_{\beta}=St_{\gamma}=St_{\alpha}/2. Based on the current control definition, the bifurcating jet is reproduced by

fα(t)=α1Θ1(ωαt)\displaystyle f^{\alpha}(t)=\alpha_{1}\>\Theta_{1}(\omega_{\alpha}t) (13)

that produces the axisymmetric excitation and

fβ(θ,t)=β1,1Θ1(θ)Θ1(ωβt),fγ(θ,t)=γ1,1Θ1(θ)Θ1(ωγt),\displaystyle f^{\beta}(\theta,t)=\beta_{-1,1}\>\Theta_{-1}(\theta)\>\Theta_{1}(\omega_{\beta}t),\,\quad f^{\gamma}(\theta,t)=\gamma_{-1,1}\>\Theta_{-1}(\theta)\>\Theta_{1}(\omega_{\gamma}t), (14)

as the flapping mode, simulating the orbital motion of the nozzle tip in the experiment. We take α1=0.17\alpha_{1}=0.17, Stα=0.5St_{\alpha}=0.5 and assume Stβ=Stγ=Stα/2St_{\beta}=St_{\gamma}=St_{\alpha}/2, β1,12+γ1,12=α12\beta_{-1,1}^{2}+\gamma_{-1,1}^{2}=\alpha_{1}^{2} with β1,1=α1cos(20)\beta_{-1,1}=\alpha_{1}\cos(20^{\circ}). This type of excitation is also used in the previous LES simulations of the bifurcating jet (da Silva & Métais, 2002; Tyliszczak & Geurts, 2014).

Refer to caption
Figure 2: Validation of the LES solver on the bifurcating jet at \Rey=4300\Rey=4300 (Lee & Reynolds, 1985). Instantaneous isosurfaces of Q-parameter (Q=0.5Q=0.5) in the bifurcating (a1) and bisecting plane (a2), colored by the radial velocity uru_{r}. Radial profiles of the time-averaged axial velocity in the bifurcating planes (b1-b3). The square symbols represent the experimental results of Lee & Reynolds (1985), and the lines represent the simulation results obtained in this study.

Figure 2(b1-b3) shows the time-averaged axial velocity profiles along the radius in the bifurcating plane at the distance x/D=5.0, 6.5, 8.0x/D=5.0,\,6.5,\,8.0 from the inlet. These results were obtained for \Rey=4300\Rey=4300, as in Lee & Reynolds (1985), and for \Rey=3000\Rey=3000 assumed in the present study. The effect of the Reynolds number on the velocity profiles is small. We attribute such behavior to a dominating role of the perturbation.The location and level of two peaks, which are associated with the split jet arms, are well predicted by the numerical solutions. The impact of the mesh density on the solution is also negligible, owing to the employed high-order numerical method. This also holds for the optimized jet, see figure 6 of § 4.4.

4.2 Bayesian Optimization with different surrogate models

Refer to caption
Figure 3: (a) Prediction of JJ by GP (cross) and DeepONet (dot). Learning curves of JminJ_{\rm min} using BO (b1), BO-DeepONet (b2) and BO-DL (c). (d) Average learning curves of Bayesian optimizers with/without deep leaning.

We first study the capability of the surrogate model, GP, and DeepONet, to predict the cost function JJ as a response to the excitation input 𝐛\mathbf{b}. Then, the Bayesian optimizers based on each of the two surrogate models (BO and BO-DeepONet) are tested on our plant. Finally, the performance of the proposed method deep learning-enhanced Bayesian optimization (BO-DL) in 3.1 is illustrated.

A kk-fold cross-validation training (k=5k=5) of GP and DeepONet model is performed over 10001000 data points with 80/2080/20 train/test split. The data are extracted randomly out of the database from realizations of Bayesian optimizers. Figure 3(a) shows the prediction J~(𝐛)\tilde{J}(\mathbf{b}) versus the truth J(𝐛)J(\mathbf{b}) obtained. The distribution of points along the diagonal shows that DeepONet achieves a lower prediction error than GP. This is further explained by the correlation coefficient R=0.89R=0.89 for DeepONet, and 0.710.71 for GP. The average error of the kk tests is measured by the mean squared error (MSE). The MSE for GP model is 0.010.01, 1%1\% of the range of JJ value.The prediction of DeepONet model is superior, with a MSE equal to 0.0050.005. The learning process of Bayesian optimizer with GP (BO) and DeepONet (BO-DeepONet) is shown in figure 3(b1-2). In figure 3(b1), the learning curve of BO displays a plateau after the initial samples (triangles). After 160160 samples, new optima are found and followed by continuous exploitation of the samples near the learning curve. The final solution is reached with J=0.274J=0.274 after 745745 evaluations. When DeepONet is employed (figure 3b2), a better solution J=0.256J=0.256 is found quickly within 300300 samples. This may be attributed to DeepONet’s capibility to generalize better for previously unseen data than GP as the cross-validation indicates (Lu et al., 2021). After m=300m=300, the newly tested parameters cover the entire range of JJ, but no further improvement is observed in the learning curve. This suggests that the optimizer focuses on exploration of the search space rather than exploitation like BO. Based on the above observations, a joint surrogate model is proposed for this study to combine the advantages of GP in local exploitation and DeepONet in exploring new minima. The Bayesian optimizer based on this new model is described in § 3.1 and referred to as BO-DL. The learning process of BO-DL is given in figure 3(b3) with the samples queried by GP and DeepONet. As indicated by the data points on the learning curve, the queries made by DeepONet (red dots) discovers a new minimum with significant reduction of JJ, and GP (blue dots) continues to descend. The best solution is obtained at J=0.237J=0.237 within 600600 evaluations.

The average performance of three Bayesian optimizers above is further studied. Each optimizer is employed for three realizations with a fixed budget of 10001000 evaluations. Figure 3(c) reports the average value of the current optimum JminJ_{min} from each optimizer with the standard deviation (shaded region) of three runs. The learning curve starts from Jmin=0.45J_{min}=0.45, the lowest cost value after initialization of 2323 samples, including the unforced case and the other 2222 controlled cases from Latin hypercube sampling in the search spaces. The unforced case (J=1J=1) is omitted for better visibility of the data. The maximum cost of the controlled flow is around 0.70.7. With around 750750, 580580 and 570570 queries, the average lowest costs JminJ_{min} achieved by BO, BO-DeepONet, and BO-DL are J=0.274J=0.274 (diamond), J=0.268J=0.268 (square), and J=0.237J=0.237 (star), respectively. On average, BO-DeepONet shows the fastest learning speed (dotted line) but with the largest variation. This is owing to DeepONet’s capability of predicting the potential minima with a small generalization error. Although the descent of BO is the slowest, the optimal results of the three realizations are consistent. This indicates that GP provides better interpolation around the minima than DeepONet due to its deterministic nature. By combining GP and DeepONet, BO-DL not only demonstrates a comparable learning speed as BO-DeepONet but also inherits the small variance of the final solution from BO. Finally, among the three optimizers, BO-DL derives the best solution. In addition, the warm-up phase during queries 0 to 100100 appears to be significantly shortened, denoted by the rectangle in figure 3(c).

The computational cost of the BO loop is also noteworthy. With BO, the computation of the posterior costs 𝒪(N3)\mathcal{O}(N^{3}), where NN is the number of observations (Williams & Rasmussen, 2006). This makes the algorithm quite slow, even after only a few hundred observations. The experience of this study shows that the computation of BO increases from 1010 CPU-seconds to 600600 CPU-seconds after 10001000 iterations on an AMD EPYC 7742 (2.25GHz) processor. The BO-DeepONet procedure scales much more favorably. Initially, the first iteration takes 120120 CPU-seconds, increasing only to 180180 CPU-seconds after 10001000 iterations. The combination of the two models in BO-DL compromises the cost to an average level.

For the high-dimensional physical problem, we show that Bayesian optimization can benefit not only from more accurate surrogate models but also from combining the advantages of parametric and non-parametric predictors. The proposed BO-DL holds a fast convergence and efficient exploration with a GP-DeepONet surrogate model. Compared with GP, the proposed surrogate model can provide more accurate predictions by leveraging the hidden functional input with DeepONet and scales better as both data size and dimensionality increase. In addition, a comparison between the Bayesian methods and bio-inspired approaches is given in Appendix A. It is shown that the optimizer with a surrogate model employed, particularly a deep-learning model, shows more advantage in the current problem.

4.3 Exploration and characterization of the search space

Refer to caption
Figure 4: Learning process of BO and BO-DL on the proximity map. The unforced case (filled square), the local minima (unfilled symbols), and the final solutions explored by BO (blue-filled diamond) and BO-DL (red-filled star) are highlighted, with related jet patterns.

In this section, we explore the learning processes of the BO and BO-DL in the 2222-dimensional space with persistent data topology (Wang et al., 2023a, b). This data analysis identifies the cost function minima and their depth, i.e. their persistence to noise, and was inspired by Edelsbrunner & Harer (2008). The analysis includes the identification of local minima in the high-dimensional actuation space, a dimension reduction to a two-dimensional proximity map and corresponding data visualization, and as shown in figure 4. The 2222-dimensional data obtained by both BO and BO-DL are projected on a 2D proximity map by classical multidimensional scaling. The feature coordinates γij\gamma_{ij} are chosen to optimally preserve the dissimilarity between control parameters defined by the Euclidean distance Dij=|𝐛i𝐛j|D_{ij}=|\mathbf{b}_{i}-\mathbf{b}_{j}|. The map features two large basins of attraction with low values of JJ, as well as small basins distributed around the border. A point 𝐛0\mathbf{b}_{0} is supposed as a local minimum 𝐦\mathbf{m}, if there exists a neighborhood \mathcal{B} of 𝐛0\mathbf{b}_{0} that satisfies J(𝐛0)min𝐛J(𝐛)J(\mathbf{b}_{0})\leq\min_{\mathbf{b}\in\mathcal{B}}J(\mathbf{b}). \mathcal{B} is an open set which should include KK nearest neighbours of 𝐛0\mathbf{b}_{0} measured by Euclidean distance, KND+1K\geq N_{D}+1. Note that the local minima are assumed based on the obtained discrete data and may change with additional data. A total of 5757 local minima are extracted from the data, with 3636 found by BO-DL and 2121 by BO. In the proximity map, the unforced case is represented by a black square where both algorithms begin. The other symbols denote the derived minima 𝐦\mathbf{m} found by BO (blue) and BO-DL (red). The final BO and BO-DL solutions highlighted by the filled diamond (J=0.27J=0.27) and the filled star (J=0.24J=0.24) are located in the large basins of attractions. Most of the minima queried by BO are located in the center of the map, whereas BO-DL also explores outward regions. Forced by the control commands corresponding to these minima, different jet patterns are observed, corroborated with the control modes. The axial puff (circles) are close to the unforced case. The bifurcating type (cross) distributes widely in the cost range. The lower the JJ value is, the closer to the helix (filled diamond) basin. The jets bifurcating to one side are away from the center, surrounded by the other unidentified patterns (triangles). Helix (diamond) and blooming (star) jets feature the most substantial performance, but the latter is only detected by BO-DL. Among the 2020 minima explored by BO, the identified patterns include 66 helix, 55 flapping, and 44 axial puff. Among the 3737 minima explored by BO-DL, the identified patterns include 66 flapping, 55 asymmetric flapping, 22 helix, 11 blooming, and 11 axial puff. In addition, most (2222) of the 2727 unidentified patterns are detected by BO-DL.

BO-DL explores not only more minima than BO but also more diverse flow patterns beneficial to the mixing. This is probably owing to DeepONet’s capability to extrapolate the mapping from the high-dimensional actuation to the mixing response more accurately. Two solutions with large basins of attractions in the search space are revealed — the optimal solution with a 77-armed blooming jet generated, and the suboptimal with a double-helix shape.

4.4 Discussion of the optimized solutions

Refer to caption
Figure 5: Comparison between three mixing jet solutions: (a) the ad hoc solution with the best mixing in Tyliszczak (2018), (b) the best solution learned with BO, and (c) the best solution learned with BO-DL. Top — forcing modes with the associated mixing and actuation metric. Middle — (contour online) bottom view of instantaneous isosurfaces of Q=0.5Q=0.5 colored by the radial velocity uru_{r}. Bottom — (contour online) contour plots of the streamwise velocity on cross-sectional planes at x=2Dx=2D, 4D4D, 6D6D, 8D8D.

Here, we include three solutions for the discussion: an ad hoc forcing with the best mixing in Tyliszczak (2018), BO optimized solution, and the optimal solution of BO-DL. The forcing command, the instantaneous snapshots, and the mean flow fields are presented in figure 5. The forcing commands are expressed by the operators in an order of constant, spatial-periodic, temporal-periodic, and traveling waves in figure 5(a1), 5(b1) and 5(c1). The axial excitation combining axisymmetric and helical modes has been widely employed to study the bifurcating and blooming jets since Lee & Reynolds (1985). A parametric study of the blooming jets with this type of excitation was performed in Tyliszczak (2018), under the same Reynolds number as this study. Among various multi-armed jets, the one with 11 arms led to the best mixing performance. The excitation was imposed on the axial velocity and combined the axisymmetric mode with Strouhal number Sta=0.45St_{a}=0.45 and the helical mode with Sth=0.164St_{h}=0.164 at the same amplitude, 15%15\% the bulk jet velocity (figure 5a1). The BO solution contains mainly the axisymmetric mode at an amplitude of 8%8\% with Stα=0.497St_{\alpha}=0.497 for the bulk, a helical mode at an amplitude of 7%7\%, and a flapping mode at an amplitude of 4%4\% with Stγ=0.232St_{\gamma}=0.232 for radial components in the periphery. After removing the expansions with negligible amplitudes, less than 1%1\% of the bulk jet, the control law approximately reads

fα(t)\displaystyle f^{\alpha}(t) =0.08sin(2π×0.497Ujt/D)\displaystyle=0.08\sin(2\pi\times 0.497U_{\rm j}\>t/D) (15)
fβ(θ,t)\displaystyle f^{\beta}(\theta,t) 0\displaystyle\approx 0
fγ(θ,t)\displaystyle f^{\gamma}(\theta,t) 0.04cos(θ)sin(2π×0.232Ujt/D)\displaystyle\approx 0.04\cos(\theta)\sin(2\pi\times 0.232U_{\rm j}\>t/D)
0.07cos(θ2π×0.232Ujt/D).\displaystyle\quad-0.07\cos(\theta-2\pi\times 0.232U_{\rm j}\>t/D).

Because the removed terms hold an amplitude lower than the turbulence intensity at the jet outlet, the approximation hardly changes the flow patterns, with the relative cost difference being less than 1%1\%. The BO-DL solution contains mainly the axisymmetric mode at an amplitude of 6%6\% with Stα=0.519St_{\alpha}=0.519 for the bulk, a helical mode at an amplitude of 8%8\% with Stγ=0.223St_{\gamma}=0.223 for radial components in the periphery. The simplified control law reads

fα(t)\displaystyle f^{\alpha}(t) =0.06sin(0.519t)\displaystyle=0.06\sin(0.519t) (16)
fβ(θ,t)\displaystyle f^{\beta}(\theta,t) 0\displaystyle\approx 0
fγ(θ,t)\displaystyle f^{\gamma}(\theta,t) 0.08cos(θ+2π×0.223Ujt/D).\displaystyle\approx 0.08\cos(\theta+2\pi\times 0.223U_{\rm j}\>t/D).

Two significant factors to be noted are the axisymmetric forcing Strouhal number StαSt_{\alpha}, and the frequency ratio between the axial and helical modes, α=Stα/Stγ\alpha=St_{\alpha}/St_{\gamma}. For both BO and BO-DL solutions, the axisymmetric forcing Strouhal number falls into the range 0.4Stα0.60.4\lesssim St_{\alpha}\lesssim 0.6 to observe bifurcating and blooming jets, and coincides with around Stα=0.5St_{\alpha}=0.5 where the peak spreading occurs (Lee & Reynolds, 1985; Shaabani-Ardali et al., 2020; Gohil et al., 2015). The BO-DL actuation takes a frequency ratio of 2.342.34, which very well agrees with a theoretically derived value α=7/3\alpha=7/3 Tyliszczak (2015b); Gohil et al. (2015). Interestingly, the ratio of the BO solution which produces a helix jet (α=2.14\alpha=2.14) also falls into this range. Moreover, different from the ad hoc excitation using only the axial forcing, the radial component in the periphery plays an important role in solutions optimized by both BO and BO-DL. Shaabani-Ardali et al. (2020) also concludes radial forcing is the dominant component of helical modes to maximize the spreading angle of a bifurcating jet. We extend the importance of radial forcing to the jet spreading globally. From an estimate of the momentum flux, the solutions in this study take only 2.8%2.8\% (BO-DL) and 4.8%4.8\% (BO) of the main jet, one order lower than the ad hoc excitation (25%25\%). One reason is the low amplitudes, and another is the forcing applied into the local boundary region (see § 2.1) rather than the whole jet, which leads to a more efficient control. This represents the physical reality of small actuators installed on the wall of the inlet nozzle, like flap arrays in Suzuki et al. (1999), only affecting the boundary layers.

The flow structures are presented by the bottom view of the instantaneous Q-parameter isosurfaces (figure 5a2, b2, and c2). The arms of the ad hoc blooming jet are not explicitly observed due to the interaction between the closely aligned vortex rings. A double-helix jet is formulated by the BO solution. The jet bifurcates into two branches, which rotate with a specific frequency (see figure 8) and then experiences continuous bifurcation along the azimuth until the vortex rings break. This type of jet has not been reported in the literature so far. We reserve it for future investigation. The BO-DL optimized jet produces a 77-armed blooming jet, with the vortex rings eventually propagating along 77 different trajectories. The contour slices of the time-averaged streamwise velocity also confirm the spreading observed from the vortex rings. The 11 branches generated by the ad hoc forcing can be traced in x=8Dx=8D. The BO-optimized jet shows more continuous distribution along the circumference due to the azimuthal bifurcation of two helix-shaped arms. The blooming jet features the earliest and furthest spreading. This leads to the largest effective mixing radius, 4.22Req,04.22R_{\rm eq,0} at x=8Dx=8D, followed by BO optimized jet with Req=3.65Req,0R_{\rm eq}=3.65R_{\rm eq,0}, and the ad hoc forced jet with Req=2.77Req,0R_{\rm eq}=2.77R_{\rm eq,0}.

Refer to caption
Figure 6: Mean profiles (a) and fluctuations (b) of the axial velocity for the unforced flow, the helical (BO solution), and the blooming jet (BO-DL solution). The solid and dashed lines denote the results calculated separately by the coarse and dense meshes.
Refer to caption
Figure 7: Axial velocity spectrum for the unforced (a1-a3), the helical (b1-b3), and the blooming jet (c1-c3) at x=0Dx=0D, 2D2D, 4D4D, and 6D6D.
Refer to caption
Figure 8: Instantaneous isosurfaces of Q-parameter (Q=0.5Q=0.5) for the helical jet, colored by the radial velocity, in a half period of the arm rotation. Tr=62.5D/UjT_{r}=62.5D/U_{\rm j}.

Figure 6 shows the axial profiles of the time-averaged centerline velocity and its fluctuation for the unforced and forced jets. The results obtained on the coarse and dense meshes agree well, except for slight discrepancies in the region 5D<x<8D5D<x<8D. Compared with the unforced case in figure 6(a), the length of the potential core shortens significantly from 7.5D7.5D to 2D2D for both helical and blooming jets. Beyond the potential core, the velocity drops steeply and even reaches small negative values in the blooming jet. As a similar behavior observed by Tyliszczak (2018), this is the effect of jet splitting resulting in a local pressure drop. As a result, the reversal flow amplifies the local turbulence intensity to the peak at x5Dx\approx 5D, as indicated in figure 6(b). The initial fluctuation level in both jets corresponds to the imposed forcing amplitudes of the bulk forcing term. A small decrease at x<1Dx<1D is caused by the lack of energy in a range of low-wave numbers (Kempf et al., 2005). For the controlled jets, the waves of the fluctuation profiles around the peak are attributed to the interactions between the Kelvin-Helmholtz instability and the forcing disturbance. Further downstream, the fluctuations drop nearly to zero along with a low mean velocity. The fluctuation profile for the unforced jet shows a very low turbulence level until x7Dx\approx 7D, and then slowly increases to the maximum around x=10Dx=10D.

Figure 7 shows the amplitude spectra of the centerline velocity at four localizations along the axis, x=0Dx=0D, 2D2D, 4D4D, and 6D6D. These results are presented versus the Strouhal number StD=ωD/2πUjSt_{D}=\omega D/2\pi U_{\rm j}. The spectrum of the unforced jet is nearly flat at the inlet as the imposed turbulent signal does not contain any characteristic frequency. The high-frequency components (StD>1St_{D}>1) are dampened downstream, and a broadband peak emerges around StD=0.52St_{D}=0.52. This falls within the range of the preferred mode frequency Stp=0.30.64St_{p}=0.3-0.64 (Crow & Champagne, 1971; Gutmark & Ho, 1983; Sadeghi & Pollard, 2012). Note that the optimal StαSt_{\alpha} predicted by BO-DL for the blooming jet perfectly matches the current preferred mode Stp=0.52St_{p}=0.52. This finding is consistent with previous studies (Tyliszczak & Geurts, 2014; Gohil et al., 2015; Tyliszczak, 2018) which concluded that the jet splitting phenomenon is most pronounced when StαSt_{\alpha} is equal to StpSt_{p}. The initial spectra of the helix and blooming jets characterize a distinct peak at StαSt_{\alpha}. The peaks related to the helical forcing StγSt_{\gamma} can be observed from x=2Dx=2D. The high-frequency harmonics also appear due to the interactions between generated toroidal vortices. In the case of the helical jet, the peak at StD0.032St_{D}\approx 0.032 is also noteworthy. We find that this frequency coincides with the azimuthal motion of the helical arms, with a period TrT_{r} equal to 62.5D/Uj62.5D/U_{\rm j}. Figure 8 shows the snapshots of the helical jet, depicting the positions of its arms during the period of 31.25D/Uj31.25D/U_{\rm j}, which corresponds to the detected StD0.032St_{D}\approx 0.032. The relationship between the frequency of the rotation and the one associated with forcing terms is left for future study.

5 Conclusions and outlook

We perform a global optimization of the jet control modes, parameterized in a 2222-dimensional search space. The forcing includes axial and radial components that are defined to approximate a general periodic function of time and azimuthal angle. The design space allows the actuation to emulate various forcing modes that have been studied. This high-dimensional problem for jet mixing improvement is tackled by Bayesian Optimization (BO). We advance BO by incorporating a deep-learning enhanced surrogate model. This surrogate model combines the non-parametric method GP for fast local descent and the parametric method Deep Operator Network (DeepONet) for efficient exploration of the search space. The proposed optimizer BO-DL is more efficient in searching for minima and more scalable to large datasets. To further understand the optimized high-dimensional solutions, we propose a topological analysis of the optimization data. The achieved control landscape features two persistent (pronounced) minima, a global minimum corresponding to a 7-armed blooming jet generated, and a suboptimal parameter with a double-helix shape that performs comparably. Intriguingly, many of the less persistent minima also correspond to known actuated jet mixing mechanisms.

Compared with the unforced jet, both the helical and the blooming jet shorten the length of the potential core substantially from 7.5D7.5D to 2D2D. The valley of the mean centerline velocity is located around 6D6D in the downwash, corresponding to the peak of the fluctuation profiles. The reversal flow in the blooming jet amplifies the local turbulence intensity, and leads to even negative velocity in the centerline. Both of the optimized control laws show the radial component dominates the non-axisymmetric forcing mode. The optimized forcing for helical jet is a triple-mode that combines the axisymmetric bulk component, a helical, and a flapping mode in the periphery. The 77-armed blooming jet is produced by a dual-mode forcing with only axisymmetric and helical modes. The better performance of the latter is attributed to the exact match between the axisymmetric forcing Strouhal number and the preferred mode frequency found by BO-DL. The forced flows are characterized by a distinct peak at the Strouhal number of the axisymmetric mode, and the effect of the helical forcing appears later. Intriguingly, a peak at the low Strouhal number in the helical jet coincides with the azimuthal motion of the helical arms.

This study emphasizes the importance of effective exploration for machine learning-based optimization in flow control, particularly in high-dimensional design spaces. The proposed BO-DL enhances the explorative feature of BO by improving the model accuracy and increasing the solvable model capacity. Therefore, BO-DL can serve as an alternative to classical BO when there is a need for greater complexity. In addition to parallelizing GP and DeepONet in the Bayesian framework, we can also incrementally increase the model complexity. For example, we can use the controller obtained by GP to accelerate the learning process of the DeepONet. Furthermore, DeepONet can also be employed as a function approximator like DRL, which deserves future study under a different framework — Bayesian experimental design. As an add-on, the proposed persistent data topology analysis can help to characterize the control landscape from the discrete data produced by different optimizers. Persistent data minima indicate literature known and unknown mixing mechanisms. Finally, we expect the proposed BO-DL and topological data analysis for effective learning and characterization of the search spaces could contribute to more flow control problems.

Acknowledgements

This work is supported by the National Natural Science Foundation of China under grants 12172109 and 12302293, by the Guangdong Basic and Applied Basic Research Foundation under grant 2022A1515011492, and by the Shenzhen Science and Technology Program under grant JCYJ20220531095605012. This work is also supported by the Polish National Science Center under grant 2018/31/B/ST8/00762. We gratefully acknowledge inspiring and formative contributions of Antoine Blanchard and Themis Sapsis to the employed Bayesian optimization algorithm and its interpretation. We express our sincere thanks to Zhutao Jiang for testing our optimization algorithmss in a companion jet mixing experiment and deepening our physical insights and to Thomas Weise for enriching our roadmap on optimization algorithms and to Herbert Edelsbrunner and Hans-Christian Hege for a discussion of topological data analyses.

Declaration of interests

The authors report no conflict of interest.

Appendix A Comparison of Bayesian optimization with two bio-inspired optimizers

Refer to caption
Figure 9: Learning curves of JminJ_{\rm min} using BO, BO-DL, PSO and GA.
Refer to caption
Figure 10: Landscape with visited data (a) and derived local minima (b) of BO (blue), BO-DL (red), PSO (magenta) and GA (green) from unknown in the whole space.

The Bayesian optimizers, BO and BO-DL, employed in this study, have been compared with two popular biologically inspired methods (Wahde, 2008): Particle Swarm Optimization (PSO) and Genetic Algorithm (GA). Here, a variant of PSO, Particle Swarm Optimization through Targeted, Position‑Mutated, Elitism (PSO-TPME) is employed (Shaqarin & Noack, 2023), and GA is realized following Wright (1991). Figure 9 displays the learning curve of one realization of the four methods. The learning curves give an indication of the learning speed of each method. PSO converges to a solution with the cost J=0.283J=0.283 slightly higher than BO (J=0.273J=0.273), and GA ends with a even higher cost J=0.3J=0.3. Figure 10 presents all the evaluated points (a), and the local minima (b) derived from the combined database. The tested solutions during the search are denoted by the colored dots in figure 10 (a). The derived minima are denoted by the filled circles with corresponding colors in figure 10 (b). The converged solutions are depicted by stars. PSO and GA fall into the local minima in the upper left and the upper right corner, respectively. Interestingly, the search process of these methods show different features. PSO moves all the particles (magenta dots) towards the best region detected. Finally, all particles accumulate in the upper left region and get stuck. The minima (magenta circles) are found along the direction of gradient descent. GA searches the minima in one neighborhood but is extremely inefficient in exploring further regions. Most of the exploration away from the global minimum in the right upper region in figure 9 (a) ends with no local minima in figure 9 (b). Owing to the prediction by GP, BO converges to the region with a lower cost quickly. Moreover, with the deep-learning enhanced surrogate model, BO-DL not only obtains the best minimum (red star) but also reveals more potential minima in a wider neighbourhood (red circles). In high-dimensional search spaces, exploration based on accurate estimators is more efficient than random exploration. For the current problem, the optimizer based on a surrogate model, particularly a deep-learning model, shows more advantages than bio-inspired optimizers.

References

  • Ball et al. (2012) Ball, C. G., Fellouah, H. & Pollard, A. 2012 The flow field in turbulent round free jets. Prog. Aerosp. Sci. 50, 1–26.
  • Blanchard & Sapsis (2021) Blanchard, A. & Sapsis, T. 2021 Bayesian optimization with output-weighted optimal sampling. J. Comp. Phys. 425, 109901.
  • Blanchard et al. (2021) Blanchard, A. B., Cornejo Maceda, G. Y., Fan, D., Li, Y., Zhou, Y., Noack, B. R. & Sapsis, T. P. 2021 Bayesian optimization for active flow control. Acta Mech. Sin. pp. 1–3.
  • Boguslawski et al. (2019) Boguslawski, A., Wawrzak, K. & Tyliszczak, A. 2019 A new insight into understanding the crow and champagne preferred mode: a numerical study. J. Fluid Mech. 869, 385–416.
  • Brunton et al. (2020) Brunton, S. L., Noack, B. R. & Koumoutsakos, P. 2020 Machine learning for fluid mechanics. Ann. Rev. Fluid Mech. 52, 477–508.
  • Corke & Kusek (1993) Corke, T. C. & Kusek, S. M. 1993 Resonance in axisymmetric jets with controlled helical-mode input. J. Fluid Mech. 249, 307–336.
  • Cornejo Maceda et al. (2021) Cornejo Maceda, G. Y., Li, Y., Lusseyran, F., Morzyński, M. & Noack, B. R. 2021 Stabilization of the fluidic pinball with gradient-based machine learning control. J. Fluid Mech. 917, A42:1–43.
  • Crow & Champagne (1971) Crow, S. C. & Champagne, F. H. 1971 Orderly structure in jet turbulence. J. Fluid Mech. 48 (3), 547–591.
  • Danaila & Boersma (2000) Danaila, I. & Boersma, B. J. 2000 Direct numerical simulation of bifurcating jets. Phys. Fluids 12 (5), 1255–1257.
  • Duriez et al. (2017) Duriez, T., Brunton, S. L. & Noack, B. R. 2017 Machine Learning Control — Taming Nonlinear Dynamics and Turbulence, Fluid Mechanics and Its Applications, vol. 116. Springer-Verlag.
  • Edelsbrunner & Harer (2008) Edelsbrunner, H. & Harer, J. 2008 Persistent homology — a survey. In Surveys on Discrete and Computational Geometry: Twenty Years Later (ed. J. E. Goodman, J. Pach & R. Pollack), , vol. 458, pp. 257–282. AMS Bookstore.
  • Gohil et al. (2015) Gohil, T. B., Saha, A. K. & Muralidhar, K. 2015 Simulation of the blooming phenomenon in forced circular jets. J. Fluid Mech. 783, 567–604.
  • Guastoni et al. (2023) Guastoni, L., Rabault, J., Schlatter, P. & Azizpour, H. Vinuesa, R. 2023 Deep reinforcement learning for turbulent drag reduction in channel flows. Eur. Phys. J. E 46, 27.
  • Gutmark & Ho (1983) Gutmark, E. & Ho, C.M. 1983 Preferred modes and the spreading rates of jets. Phys. Fluids 26, 2932–2938.
  • Hilgers & Boersma (2001) Hilgers, A. & Boersma, B. J. 2001 Optimization of turbulent jet mixing. Fluid Dyn. Res. 29 (6), 345.
  • Hussain & Zaman (1980) Hussain, A. K. M. F. & Zaman, K. B. M. Q. 1980 Vortex pairing in a circular jet under controlled excitation. Part 2. Coherent structure dynamics. J. Fluid Mech. 101 (3), 493–544.
  • Jordan & Colonius (2013) Jordan, P. & Colonius, T. 2013 Wave packets and turbulent jet noise. Annu. Rev. Fluid Mech. 45 (1), 173–195.
  • Kempf et al. (2005) Kempf, A., Klein, M. & Janicka, J. 2005 Efficient generation of initial- and inflow-conditions for transient turbulent flows in arbitrary geometries. Flow Turbul. Combust. 74, 67–84.
  • Koumoutsakos et al. (2001) Koumoutsakos, P., Freund, J. & Parekh, D. 2001 Evolution strategies for automatic optimization of jet mixing. AIAA J. 39 (5), 967–969.
  • Lee & Reynolds (1985) Lee, M. & Reynolds, W. C. 1985 Bifurcating and blooming jets. Tech. Rep.. Thermosciences Division, Department of Mechanical Engineering, Stanford University.
  • Lu et al. (2021) Lu, L., Jin, P., Pang, G., Zhang, Z. & Karniadakis, G. E. 2021 Learning nonlinear operators via DeepONet based on the universal approximation theorem of operators. Nat. Mach. Intell. 3 (3), 218–229.
  • Mankbadi & Liu (1981) Mankbadi, R. & Liu, J. T. C. 1981 A study of the interactions between large-scale coherent structures and fine-grained turbulence in a round jet. Philos. Trans. Royal Soc. A 298 (1443), 541–602.
  • Nair & Goza (2023) Nair, N. J. & Goza, A. 2023 Bio-inspired variable-stiffness flaps for hybrid flow control, tuned via reinforcement learning. J. Fluid Mech. 956, R4.
  • Nathan et al. (2006) Nathan, G. J., Mi, J., Alwahabi, Z. T., Newbold, G. J. R. & Nobes, D. S. 2006 Impacts of a jet’s exit flow pattern on mixing and combustion performance. Prog. Energy Combust. Sci. 32 (5), 496–538.
  • Parekh (1989) Parekh, D. E. 1989 Bifurcating jets at high Reynolds numbers. Stanford University.
  • Pickering et al. (2022) Pickering, E., Guth, S., Karniadakis, G. E. & Sapsis, T. P. 2022 Discovering and forecasting extreme events via active learning in neural operators. Nat. Comput. Sci. 2 (12), 823–833.
  • Pino et al. (2023) Pino, F., Schena, L., Rabault, J. & Mendez, M. A. 2023 Comparative analysis of machine learning methods for active flow control. J. Fluid Mech. 958, A39.
  • Rabault et al. (2019) Rabault, J., Kuchta, M., Jensen, A., Réglade, U. & Cerardi, N. 2019 Artificial neural networks trained through deep reinforcement learning discover control strategies for active flow control. J. Fluid Mech. 865, 281–302.
  • Sadeghi & Pollard (2012) Sadeghi, H. & Pollard, A. 2012 Effects of passive control rings positioned in the shear layer and potential core of a turbulent round jet. Phys. Fluids 24, 115103.
  • Shaabani-Ardali et al. (2020) Shaabani-Ardali, L., Sipp, D. & Lesshafft, L. 2020 Optimal triggering of jet bifurcation: an example of optimal forcing applied to a time-periodic base flow. J. Fluid Mech. 885, A34.
  • Shahriari et al. (2015) Shahriari, B., Swersky, K., Wang, Z., Adams, R. P. & De Freitas, N. 2015 Taking the human out of the loop: A review of Bayesian optimization. Proc. IEEE 104 (1), 148–175.
  • Shaqarin & Noack (2023) Shaqarin, T. & Noack, B. R. 2023 A fast-converging particle swarm optimization through targeted, position-mutated, elitism (PSO-TPME). Int. J. Comput. Intell. Syst. 16, 6.
  • da Silva & Métais (2002) da Silva, C. B. & Métais, O. 2002 Vortex control of bifurcating jets: A numerical study. Phys. Fluids 14 (11), 3798–3819.
  • Sonoda et al. (2023) Sonoda, T., Liu, Z., Itoh, T. & Hasegawa, Y. 2023 Reinforcement learning of control strategies for reducing skin friction drag in a fully developed turbulent channel flow. J. Fluid Mech. 960, A30.
  • Suzuki et al. (1999) Suzuki, H., Kasagi, N. & Suzuki, Y. 1999 Active control of an axisymmetric jet with an intelligent nozzle. In First Symposium on Turbulence and Shear Flow Phenomena. Begel House Inc.
  • Tyliszczak (2014) Tyliszczak, A. 2014 A high-order compact difference algorithm for half-staggered grids for laminar and turbulent incompressible flows. J. Comp. Phys. 276, 438–467.
  • Tyliszczak (2015a) Tyliszczak, A. 2015a LES-CMC study of an excited hydrogen flame. Combust. Flame 162 (10), 3864–3883.
  • Tyliszczak (2015b) Tyliszczak, A. 2015b Multi-armed jets: A subset of the blooming jets. Phys. Fluids 27 (4), 041703.
  • Tyliszczak (2018) Tyliszczak, A. 2018 Parametric study of multi-armed jets. Int. J. Heat Fluid Flow 73, 82–100.
  • Tyliszczak & Geurts (2014) Tyliszczak, A. & Geurts, B. J. 2014 Parametric analysis of excited round jets-numerical study. Flow Turbul. Combust. 93, 221–247.
  • Utkin et al. (2006) Utkin, Y. G., Keshav, S., Kim, J. H., Kastner, J., Adamovich, I. V. & Samimy, M. 2006 Development and use of localized arc filament plasma actuators for high-speed flow control. J. Phys. D: Appl. Phys. 40 (3), 685.
  • Vignon et al. (2023a) Vignon, C., Rabault, J., Vasanth, J., Alcántara-Ávila, F., Mortensen, M. & Vinuesa, R. 2023a Effective control of two-dimensional Rayleigh–Bénard convection: Invariant multi-agent reinforcement learning is all you need. Phys. Fluids 35 (6), 065146.
  • Vignon et al. (2023b) Vignon, C., Rabault, J. & Vinuesa, R. 2023b Recent advances in applying deep reinforcement learning for flow control: Perspectives and future directions. Phys. Fluids 35 (3), 031301.
  • Wahde (2008) Wahde, M. 2008 Biologically inspired optimization methods: an introduction. WIT press.
  • Wang et al. (2023a) Wang, T., Cornejo Maceda, G. Y. & Noack, B. R. 2023a xPDT: A Toolkit for Persistent Data Topology. Universitätsbibliothek der Technischen Universität Braunschweig, Germany.
  • Wang et al. (2023b) Wang, T., Yang, Y., Chen, X., Li, P., Iollo, A., Cornejo Maceda, G. Y. & Noack, B. R. 2023b Topologically assisted optimization for rotor design. Phys. Fluids 35, 055105.
  • Wawrzak et al. (2015) Wawrzak, K., Boguslawski, A. & Tyliszczak, A. 2015 LES predictions of self-sustained oscillations in homogeneous density round free jet. Flow Turbul. Combust. 95, 437–459.
  • Williams & Rasmussen (2006) Williams, C. K. & Rasmussen, C. E. 2006 Gaussian processes for machine learning. MIT press.
  • Wright (1991) Wright, A. H. 1991 Genetic algorithms for real parameter optimization. In Foundations of genetic algorithms, , vol. 1, pp. 205–218. Elsevier.
  • Xu & Zhang (2023) Xu, D. & Zhang, M. 2023 Reinforcement-learning-based control of convectively unstable flows. J. Fluid Mech. 954, A37.
  • Zhou et al. (2020) Zhou, Y., Fan, D., Zhang, B., Li, R. & Noack, B. R. 2020 Artificial intelligence control of a turbulent jet. J. Fluid Mech. 897, A27.