This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

\newdateformat

monthyeardate\monthname[\THEMONTH] \THEDAY, \THEYEAR

Navigation in Shallow Water
Using Passive Acoustic Ranging

Junsu Jang and Florian Meyer

Email: {jujang, flmeyer}@ucsd.edu
Scripps Institution of Oceanography and Department of Electrical and Computer Engineering University of California San Diego, La Jolla, CA
Abstract

Passive acoustics can provide a variety of capabilities with applications in oceanographic research and maritime situational awareness. In this paper, we develop a method for the navigation of autonomous underwater vehicles (AUVs) in shallow water. Our approach relies on passively recorded signals from acoustic sources of opportunities (SOOs). By making use of the waveguide invariant, a measurement of the range to the SOO is extracted from the spectrogram of a single hydrophone. Range extraction requires knowledge of the range rate, i.e., the radial velocity between the SOO and AUV, computed from the pressure fields at different time intervals. A particle-based navigation filter fuses the range measurements with the AUV’s internal velocity and heading measurements. As a result, the position error, which would otherwise increase over time, can be bounded. The ability to compute the range rate and range measurements from the pressure field measured using a single hydrophone is demonstrated on real data from the SWellEx-96 experiment. The capability of the developed navigation filter is shown based on synthetic data generated by the normal mode program KRAKEN.

Index Terms:
Navigation, Bayes Filtering, Passive Acoustics, Marine Robotics, Particle Filtering.

I Introduction

Autonomous underwater vehicles (AUVs) are widely used in scientific, commercial, and military applications due to their ability to perform complex tasks effectively. However, navigation of small and inexpensive AUVs remains a challenge. This is because underwater, the radio signals of the Global Positioning System (GPS) are unavailable. Navigation based on sensors that do not require wireless links, such as inertial sensors, relies on integrating acceleration or velocity measurements. This strategy, referred to as dead-reckoning, is subject to position errors that grow over time. As a result, the AUV resurfaces regularly to perform GPS localization [1]. There exist a variety of active acoustic underwater ranging techniques [2][3], but they have limited range and require deployments of multiple transponders.

In this paper, we investigate navigation by means of a passive acoustic ranging technique. Our approach only relies on a simple acoustic sensor hosted by the AUV. Since acoustic signals emitted by a source of opportunity (SOO) are used, no additional transponders are deployed. The proposed method is applicable in shallow water with frequent ship traffic, such as coastal waterways. The SOOs are typically commercial vessels whose propeller cavitation causes the ship noise to be used as acoustic signal [4]. In addition, this type of vessel is equipped with an automatic identification system (AIS) for broadcasting its current and predicted future position information[5].

Refer to caption
Figure 1: Bayesian network representing the proposed Bayes filter for AUV navigation. The state of the AUV at discrete time nn, 𝒙n\bm{x}_{n}, consists of the AUV’s position, heading, and velocity in a global coordinate system. A measurement, 𝒛n\bm{z}_{n}, consists of measurements of the range with respect to an SOO, the local velocity, and the heading. The range measurement is computed from the acoustic signal of an SOO in shallow water using the WI. The WI-based ranging relies on a range rate measurement computed in a preprocessing stage that follows the approach presented in [6].

Our method extracts range measurements from the acoustic signal recorded by a single hydrophone. In a shallow-water waveguide, the signal emitted from a broadband moving source leads to an interference pattern in the spectrogram of a single hydrophone. In particular, minima and maxima of signal intensity are slightly shifted with respect to frequency, ff, and range, rr [7]. Loci of constant intensities form a diagonal striation pattern characterized by a scalar parameter called the waveguide invariant (WI), β\beta[8]. The WI encapsulates the dispersive propagation characteristics of the waveguide[9].

Parameter estimation problems in underwater acoustics that rely on striation patterns described by the WI include ranging and source localization [10, 11, 12, 13], geoacoustic inversion[14], and internal wave characterization[15]. In [16], the WI is treated as a probability distribution, and the impacts of environmental parameters (depth of the source and receiver, sound speed, etc…) on the WI distribution are investigated.

The WI-based ranging with respect to SOOs that exhibit tonal signals has been explored in[17, 18] A single hydrophone is considered, and the application of the WI-based ranging to AUV navigation is discussed. Here, a statistical model for intensities along hypothetical striations is established, and the maximum likelihood (ML) estimation of the WI parameter and range is performed. A nonlinear least square-based approach for receiver localization is also proposed.

Range estimation based on the WI requires knowledge of the range rate, which can be time-varying. They are, however, assumed known in the aforementioned ranging approaches. A technique for estimating the range rate from the acoustic measurements of a single hydrophone has been presented in [6]. Assuming a narrowband source, this approach relies on normal mode theory[7] to extract range rate from the spectra of consecutive snapshots of data. A combination of the range rate estimation with the WI-based ranging is also discussed. The range rate estimation has been extended to scenarios with broadband sources in [19, 20].

The fundamental question addressed in this paper is the feasibility of underwater self-localization in shallow water based on the range measurements extracted from the data of a simple acoustic receiver. In the considered scenario, tonal signals are emitted by a moving SOO. We develop a self-localization method that relies on the WI-based ranging to bound the position error of an AUV in a shallow-water environment with frequent ship traffic. Our Bayesian filter fuses the range measurements with the velocity and heading measurements provided by the internal sensors of the AUV. It offers an opportunity for an AUV, which would otherwise solely perform dead-reckoning, to recalibrate its position information without resurfacing. The WI-based ranging relies on an acoustically range-independent underwater environment, passively recorded tonal signals emitted by the SOO, and the knowledge of the potentially time-varying range rate.

Within our approach, a range rate measurement is computed in a preparatory step [6] by using the same tonal signal as used for the WI-based ranging. Only a single hydrophone is required to obtain a range measurement if a single SOO is present. This makes the proposed method appealing for inexpensive AUV platforms with limited acoustic sensing capabilities. We illustrate the accuracy of the range rate and range measurements using real acoustic data recorded during the SWellEx-96 experiment[21]. We also demonstrate the capabilities of the proposed Bayesian navigation filter using acoustic data simulated in a realistic shallow water environment generated by the normal mode program KRAKEN. In particular, we show how the position error can be bounded and thus reduced compared to a reference method that relies on dead-reckoning.

The key contributions of this paper are summarized as follows:

  • We review the approaches for computing measurements of the range rate, the range, and the WI from the spectrogram recorded by a single hydrophone that is mobile with respect to an SOO.

  • We develop a Bayesian filter for navigation in shallow water that fuses information provided by the sensors for dead reckoning with the range measurements computed from the spectrogram.

  • We demonstrate passive acoustic ranging using real data from the SWellEx-96 experiment and present the capability of the developed navigation filter using synthetic data generated by the normal mode program KRAKEN.

Notation: Random variables are displayed in serif, italic fonts, and vectors and matrices are denoted by bold lowercase and uppercase letters, respectively. For example, a random variable and a random vector are denoted by xx and 𝒙\bm{x}, respectively. Furthermore, 𝒙\|\bm{x}\| and 𝒙T{\bm{x}}^{\mathrm{T}} denote the Euclidean norm and the transpose of vector 𝒙\bm{x}, respectively; \propto indicates equality up to a constant factor; 𝓅(x)\mathpzc{p}(\bm{x}) denotes the probability density function (PDF) of random vector 𝒙\bm{x} (this is a short notation for 𝓅x(x)\mathpzc{p}_{\bm{x}}(\bm{x})); 𝓅(x|y)\mathpzc{p}(\bm{x}|\bm{y}) denotes the conditional PDF of random vector 𝒙\bm{x} conditioned on random vector 𝒚\bm{y} (this is a short notation for 𝓅x|y(x|y)\mathpzc{p}_{\bm{x}|\bm{y}}(\bm{x}|\bm{y})). 𝒙0:k\bm{x}_{0:k} is short for [𝒙0T,,𝒙kT]T[\bm{x}_{0}^{\mathrm{T}},\dots,\bm{x}_{k}^{\mathrm{T}}]^{\mathrm{T}}. 𝑰n\bm{I}_{n} denotes the n×nn\hskip-0.85358pt\times\hskip-0.85358ptn identity matrix. |𝒮||\mathcal{S}| denotes the cardinality of set 𝒮\mathcal{S}. The operator denotes the complex conjugate, and 𝒿=1\mathpzc{j}=\sqrt{-1} is the imaginary unit.

II WI-Based Ranging

In this section, we will present our approach to extracting the range measurements from acoustic data by exploiting the striation pattern described by the WI. First, we discuss the preprocessing stage that will provide the range rate measurements subsequently used for the WI-based ranging. We also establish an initialization stage for the computation of the WI. In what follows, it is assumed that consecutive snapshots of data samples have been extracted from the raw acoustic signal and that the discrete Fourier transform (DFT) of length NDFTN_{\mathrm{DFT}} and overlap α\alpha has been performed for each snapshot. The time between snapshots is tΔ=(1α)NDFTTΔt_{\Delta}=(1-\alpha)N_{\mathrm{DFT}}\hskip 0.85358ptT_{\Delta}, where TΔT_{\Delta} is the sampling interval of the raw acoustic signal. Furthermore, n=0,1,2,n=0,1,2,\dots denotes the discrete-time indexes of snapshots.

Refer to caption

(a)

Refer to caption

(b)

Refer to caption

(c)

Refer to caption

(d)

Figure 2: An example of a spectrogram in the (r,f)(r,f) domain that represents a shallow water waveguide with β=1.07\beta=1.07 is presented in (a). Some expected striations are shown in red. Intensity functions at two different frequencies f0f_{0} and f1f_{1} with f0<f1f_{0}<f_{1} are depicted as a function of range in (b) and (c), where I[r;f1]I[r;f_{1}] is shifted up by 1.5×\times10-6 from its original values for visualization purposes. These intensities correspond to two different rows in the spectrogram in (a). As described by (4), the intensity function I[ri;f1]I[r_{i};f_{1}] is a stretched version of I[ri;f0]I[r_{i};f_{0}]. Based on the true β\beta and rnr_{n}, a comparison of the two intensities after stretching I[ri;f0]I[r_{i};f_{0}] by means of (6) is shown in (c). Here, J[ri;f0,β,rn]J[r_{i};f_{0},\beta,r_{n}] denotes the stretched version of I[ri;f0]I[r_{i};f_{0}]. As a result, the intensities are now aligned in range. The normalized objective function obtained by the sum of cross-correlation coefficients for pairs of demeaned intensity functions is provided in (d).

II-A Computation of Range Rate Measurements

At each time step nn, we compute a measurement of the range rate, r˙n\dot{r}_{n}, following the “difference” method proposed in [6]. Here, the complex DFT samples at source frequency ff and from consecutive snapshots of data result in the signal p[n,f]p[n,f]. This signal represents the acoustic pressure field as a function of range, rr, and frequency, ff. To compute a range rate measurement, we consider two segments of snapshots before and after time nn, i.e., pn,1[k,f]=p[nk,f]p_{n,1}[k,f]=p[n\hskip-0.85358pt-\hskip-0.85358ptk,f] and pn,2[k,f]=p[n+k,f]p_{n,2}[k,f]=p[n\hskip-0.85358pt+\hskip-0.85358ptk,f] with k{0,,L}k\hskip-0.85358pt\in\hskip-0.85358pt\{0,\dots,L\}. Both segments include the sample at time nn and are L+1L+1 snapshots long, i.e., a total of 2L+12L+1 snapshots are considered.

A measurement of r˙n\dot{r}_{n} can now be extracted from the real part of the point-wise product of pn,1[k,f]p_{n,1}[k,f] with pn,2[k,f]p^{*}_{n,2}[k,f], i.e., Ic[k;n,f]={pn,1[k,f]pn,2[k,f]}I_{\mathrm{c}}[k;n,f]=\mathcal{R}\big{\{}p_{n,1}[k,f]\hskip 0.85358ptp^{*}_{n,2}[k,f]\big{\}}. Following [6], the result of this product can be approximated as Ic[k;n,f]I~c[k;n,f]I_{\mathrm{c}}[k;n,f]\hskip-0.85358pt\approx\hskip-0.85358pt\tilde{I}_{\mathrm{c}}[k;n,f], where I~c[k;n,f]\tilde{I}_{\mathrm{c}}[k;n,f] is given by

I~c[k;n,f]cos[wk].\tilde{I}_{\mathrm{c}}[k;n,f]\hskip 0.85358pt\propto\hskip 0.85358pt\cos{[wk]}.\vskip-1.42262pt (1)

Here, we introduced w4πfr˙ntΔ/vp¯w\hskip-0.85358pt\triangleq\hskip-0.85358pt4\pi f\dot{r}_{n}t_{\Delta}/\overline{v_{\mathrm{p}}} where vp¯\overline{v_{\mathrm{p}}} is the average horizontal phase speed [7]. Note that vp¯\overline{v_{\mathrm{p}}} can be further approximated by the sound speed in the water, vv, which is assumed to be known [6]. Eq. (1) motivates the computation of a range rate measurement, yr˙ny_{\dot{r}_{n}}, from the location of largest magnitude in the frequency domain of Ic[k;n,f]I_{c}[k;n,f]. In particular, let Fc[w;n,f]F_{c}[w;n,f] be the DFT of Ic[k;n,f]I_{c}[k;n,f]. A measurement of the range rate can then be obtained as

yr˙n=argmaxr˙n|Fc[4πfr˙ntΔ/v;n,f]|2.y_{\dot{r}_{n}}=\operatorname*{arg\,max}_{\dot{r}_{n}}|F_{\mathrm{c}}[4\pi f\dot{r}_{n}\hskip 0.85358ptt_{\Delta}/v;n,f]|^{2}.\vskip-4.2679pt (2)

Note that the measurement yr˙ny_{\dot{r}_{n}} represents the average range rate over the 2L+12L+1 snapshots.

Unfortunately, the approach discussed above is not directly applicable to real-time navigation. This is because at time nn, the required signal pn,1[k,f]p_{n,1}[k,f] consists of samples that lie in the future. The most recent range rate measurement that can be computed at time nn is actually the outdated yr˙nLy_{\dot{r}_{n-L}}. Thus, for real-time navigation, we use yr˙nLy_{\dot{r}_{n-L}} to predict a real-time measurement at time nn, denoted as zr˙nz_{\dot{r}_{n}}. In particular, an average range acceleration measurement, yr¨nLy_{\ddot{r}_{n-L}}, is extracted from the range rate measurements yr˙n2L,,yr˙nLy_{\dot{r}_{n-2L}},\dots,y_{\dot{r}_{n-L}}. The real-time range rate measurement zr˙nz_{\dot{r}_{n}}, is then obtained using a linear prediction, i.e.,

zr˙n=yr˙nL+yr¨nLtΔL.z_{\dot{r}_{n}}=y_{\dot{r}_{n-L}}+y_{\ddot{r}_{n-L}}t_{\Delta}\hskip 0.85358ptL. (3)

Also note that the time interval tΔLt_{\Delta}L has to be several minutes in length [6]. Only in this way a range rate measurement can be computed according to (2). Thus, It is assumed that the source signal is coherent within this time interval. Furthermore, note that a phase offset is introduced when the time interval between the snapshots, tΔt_{\Delta}, is not an integer multiple of 1/f1/f[22]. This phase offset results in a shift of the peak location of Fc[w;n,f]F_{\mathrm{c}}[w;n,f]. Hence, the phase compensation approach proposed in [22] is applied to obtain accurate range rate measurements.

II-B Computation of Range Measurements

In a shallow-water waveguide, the signal emitted from a broadband moving source leads to an interference pattern in the spectrogram of a single hydrophone. In particular, let us consider an observation interval [tstart,tstop][t_{\mathrm{start}},t_{\mathrm{stop}}] and let tstartttstopt_{\mathrm{start}}\leqslant t\leqslant t_{\mathrm{stop}} be an arbitrary time in the observation interval. We denote the ranges at times tstartt_{\mathrm{start}}, tt, and tstopt_{\mathrm{stop}} by rstartr_{\mathrm{start}}, rr, and rstopr_{\mathrm{stop}}, respectively. Assuming that the source’s range rate r˙\dot{r} is fixed and known during the observation interval, we can compute a spectrogram in the (r,f)(r,f) domain. Note that, since the range is typically unknown, the rr-axis of this spectrogram is a function of a reference range, e.g., with rstopr_{\mathrm{stop}} being the reference range, the range at time tt is equal to r=rstop(tstopt)r˙r=r_{\mathrm{stop}}-(t_{\mathrm{stop}}-t)\dot{r}.

This (r,f)(r,f) spectrogram shows an inference pattern with striations of constant intensity. Each striation can be mathematically described as[7]

f0f1=(r0r1)β\frac{f_{0}}{f_{1}}=\Big{(}\frac{r_{0}}{r_{1}}\Big{)}^{\beta}\vskip 0.0pt (4)

where r0,r1[rstart,rstop]r_{0},r_{1}\in[r_{\mathrm{start}},r_{\mathrm{stop}}] and f0,f1f_{0},f_{1} are arbitrary source frequencies. According to (4), the same intensity as in (r0,f0)(r_{0},f_{0}) can be found in any other point of the spectrogram, (r1,f1)(r_{1},f_{1}), where r1=r0(f1/f0)1/βr_{1}=r_{0}(f_{1}/f_{0})^{1/\beta}. The parameter β\beta is known as the WI. Since in typical shallow water waveguides, we have a β\beta close to 11, striations often appear as diagonal lines. An example with β=1.07\beta=1.07 and a known rstopr_{\mathrm{stop}} is shown in Fig. 2(a). The convenience of exploiting the striation pattern described by the WI for ranging is that in an approximately range-independent waveguide, all parameters that characterize the environment are summarized by the single scalar WI, β\beta. As discussed in [17], striations are also visible when tonal signals are recorded, e.g., when a large container ship emits mechanical noise[23].

At time step nn, for the WI-based ranging, we aim to compute a measurement of the range between the source and the receiver, rnr_{n}, by exploiting the WI[7][11][24]. Each column in the spectrogram corresponds to discrete time step i{0,,n}i\hskip-0.85358pt\in\hskip-0.85358pt\{0,\dots,n\}. By making use of the range rate measurements zr˙lz_{\dot{r}_{l}}, l{0,,n1}l\in\{0,\dots,n-1\}, computed as discussed in the previous Sec. II-A, the following ranges can be assigned to each column of the spectrogram

ri=rnl=in1zr˙ltΔ,i{0,,n}.r_{i}=r_{n}\hskip 0.85358pt\hskip 0.85358pt-\hskip-0.85358pt\hskip-0.85358pt\hskip-0.85358pt\hskip-0.85358pt\sum_{l=i}^{n-1}z_{\dot{r}_{l}}t_{\Delta},\hskip 14.22636pti\in\{0,\dots,n\}. (5)

Here, we have used rnr_{n} as the reference range.

Next, we develop an objective function for the computation of rnr_{n} by pairwise comparing nonlinearly transformed intensity functions for different frequencies. In what follows, it is assumed that the WI, β\beta, is known. A method for the computation of a WI measurement zβz_{\beta} will be discussed in Sec. II-C. Let f0f_{0} and f1f_{1} be the two source frequencies. For these two frequencies, the intensities as a function of the ranges, rir_{i}, i{0,,n}i\in\{0,\dots,n\}, are denoted as I[ri;f0]I[r_{i};f_{0}] and I[ri;f1]I[r_{i};f_{1}], respectively. Each intensity function is represented by a single row in the spectrogram.

For the WI-based ranging, we compute cross-correlation coefficients for the most recent NN ranges, i.e., ri,i{nN+1,,n}r_{i},i\in\{n\hskip-0.85358pt-\hskip-0.85358ptN\hskip-0.85358pt+\hskip-0.85358pt1,\dots,n\}. Without loss of generality, let us assume that f0<f1f_{0}<f_{1}. Based on (4) and assuming a positive β\beta, the acoustic intensity at f1f_{1} is a stretched function of that of f0f_{0}. An example is shown in Fig. 2(b). First, we perform a similar stretching for f0f_{0} by using (4) to convert the ranges of the last MM intensity samples as follows

ri=ri(f1f0)1/β,i{nM+1,,n}.r^{\prime}_{i}=r_{i}\Big{(}\frac{f_{1}}{f_{0}}\Big{)}^{1/\beta}\hskip-0.85358pt\hskip-0.85358pt\hskip-0.85358pt\hskip-0.85358pt,\hskip 14.22636pti\in\{n-M+1,\dots,n\}.\vskip 1.42262pt (6)

This results in the stretched intensity function I[ri;f0,β]I^{\prime}[r^{\prime}_{i};f_{0},\beta]. Note that MNM\hskip-0.85358pt\geqslant\hskip-0.85358ptN has to be chosen such that the resulting stretched intensity function is defined on the entire interval [rnN+1,rn][r_{n-N+1},r_{n}]. Finally, we perform a linear interpolation to again obtain intensity for ranges rir_{i}, i{nN+1,,n}i\in\{n-N+1,\dots,n\}. This interpolation implies a cutoff of ranges outside the interval [rnN+1,rn][r_{n-N+1},r_{n}]. As can easily be verified, the resulting intensity function depends on the value of the reference range rnr_{n} and is thus denoted by J[ri;f0,β,rn]J[r_{i};f_{0},\beta,r_{n}]. An example of how an intensity function J[ri;f0,β,rn]J[r_{i};f_{0},\beta,r_{n}] matches I[ri;f1]I[r_{i};f_{1}] after stretching and interpolation by the correct β\beta and rnr_{n}, is shown in Fig. 2(c).

Given the correct β\beta and rnr_{n}, the intensity pattern, e.g., the locations of minima and maxima of J[ri;f0,β,rn]J[r_{i};f_{0},\beta,r_{n}], typically matches the intensity pattern of I[ri;f1]I[r_{i};f_{1}] well. However, the absolute intensity values of the two intensity functions can be quite different, and there is a need for normalization. Therefore, the cross-correlation coefficient is used to measure similarity to develop an objective function. Let us denote the demeaned versions of I[ri;f1]I[r_{i};f_{1}] and J[ri;f0,β,rn]J[r_{i};f_{0},\beta,r_{n}] as I^[ri;f1]\hat{I}[r_{i};f_{1}] and J^[ri;f0,β,rn]\hat{J}[r_{i};f_{0},\beta,r_{n}], respectively. As an objective function for the computation of a range measurement zr˙nz_{\dot{r}_{n}}, we now introduce the sum of the cross-correlation coefficients for all available pairs of demeaned intensity functions and the most recent NN samples, i.e.,

g(rn;β)=(f0,f1)1Zf0,f1i=nN+1nJ^[ri;f0,β,rn]I^[ri;f1],\displaystyle g(r_{n};\beta)=\sum_{(f_{0},f_{1})\in\mathcal{F}}\hskip 1.70717pt\frac{1}{Z}_{f_{0},f_{1}}\hskip 1.42262pt\sum_{i=n-N+1}^{n}\hskip-8.53581pt\hat{J}[r_{i};f_{0},\beta,r_{n}]\hskip 0.85358pt\hat{I}[r_{i};f_{1}],

where \mathcal{F} is the set of all pairs of source frequencies with f0<f1f_{0}<f_{1} and Zf0,f1Z_{f_{0},f_{1}} is a normalization factor given by

Zf0,f1=[i=nN+1nJ^2[ri;f0,β,rn]i=nN+1nI^2[ri;f1]]12.Z_{f_{0},f_{1}}=\Biggl{[}\hskip 2.84526pt\sum_{i=n-N+1}^{n}\hat{J}^{2}[r_{i};f_{0},\beta,r_{n}]\sum_{i=n-N+1}^{n}\hskip-0.85358pt\hskip-0.85358pt\hat{I}^{2}[r_{i};f_{1}]\Biggr{]}^{\frac{1}{2}}\hskip-0.85358pt\hskip-0.85358pt\hskip-0.85358pt. (8)

Finally, a range measurement is obtained by finding the location of the maximum of the objective function (LABEL:eq:ri), i.e.,

zrn=argmaxrng(rn;zβ).z_{r_{n}}=\operatorname*{arg\,max}_{r_{n}}g(r_{n};z_{\beta}).\vskip-1.42262pt (9)

Here, zβz_{\beta} is the WI measurement to be discussed in the next Sec. II-C. An example of the objective function maximizing is shown in Fig. 2(d). To reduce the computational complexity, unlike the likelihood function used in [18], the proposed objective function, (LABEL:eq:ri), does not consider noise statistics. However, as demonstrated in Sec. IV, it still leads to a competitive-ranging performance.

II-C Computation of the WI Measurement

A WI measurement, zβz_{\beta}, can be computed in an initialization stage by following a similar approach as for the computation of zrnz_{r_{n}}. For initialization, it is assumed that the range with respect to the source, rnr_{n}, is known. The initialization stage is performed while the AUV remains at the surface and can record GPS measurements. In particular, for a fixed rnr_{n}, we obtain the WI measurement by finding the location of the maximum of β\beta in the objective function (LABEL:eq:ri), i.e.,

zβ=argmaxβg(rn;β).z_{\beta}=\operatorname*{arg\,max}_{\beta}g(r_{n};\beta).\vskip-2.84526pt (10)

It is assumed that β\beta remains unchanged during the initialization stage and the next dive of the AUV. Depending on the sound speed profile, β\beta can change as a function of the receiver depth [16]. A method that can refine β\beta online is subject to future research.

III Statistical Model and Navigation Filter

In what follows, we introduce the statistical model for an AUV navigation and develop the proposed navigation filter. For notational simplicity, it is assumed that the navigation filter uses the same time scale as the computation of range rate and range measurements discussed in Sec. II, i.e., each discrete time step nn has a duration of tΔt_{\Delta}.

Refer to caption
Figure 3: Relevant quantities for the considered AUV navigation scenario at time nn. The state of the AUV consists of position, 𝒑n(W)=[xn(W)yn(W)]T\bm{p}_{n}^{\mathrm{(W)}}=\big{[}x_{n}^{\mathrm{(W)}}\hskip 0.85358pt\hskip 0.85358pt\hskip 0.85358pty_{n}^{\mathrm{(W)}}\big{]}^{\mathrm{T}}\hskip-0.85358pt\hskip-0.85358pt\hskip-0.85358pt, velocity, 𝒗n(W)=[x˙n(W)y˙n(W)]T\bm{v}_{n}^{\mathrm{(W)}}=\big{[}\dot{x}_{n}^{\mathrm{(W)}}\hskip 0.85358pt\hskip 0.85358pt\hskip 0.85358pt\dot{y}_{n}^{\mathrm{(W)}}\big{]}^{\mathrm{T}}\hskip-0.85358pt\hskip-0.85358pt, and heading, θn(W)\theta^{\mathrm{(W)}}_{n}. The measurements required for AUV navigation are heading, zφnz_{\varphi_{n}}, (not shown), velocity in the body reference frame, z𝒗n(B)=[x˙n(B)y˙n(B)]Tz_{\bm{v}_{n}^{\mathrm{(B)}}}=\big{[}\dot{x}_{n}^{\mathrm{(B)}}\hskip 0.85358pt\hskip 0.85358pt\hskip 0.85358pt\dot{y}_{n}^{\mathrm{(B)}}\big{]}^{\mathrm{T}}\hskip-0.85358pt\hskip-0.85358pt\hskip-0.85358pt, and range between the SOO and the AUV, zrnz_{r_{n}}. The position of the SOO, 𝒑s,n\bm{p}_{\mathrm{s},n}, is assumed known based on its initial location and predicted course transmitted over the AIS.

The state of the AUV at discrete time nn is denoted as 𝒙n=[𝒑n(W)𝒗n(W)ϑn(W)]T\bm{x}_{n}=\big{[}\bm{p}_{n}^{\mathrm{(W)}}\hskip 0.85358pt\hskip 0.85358pt\hskip 0.85358pt\bm{v}_{n}^{\mathrm{(W)}}\hskip 0.85358pt\hskip 0.85358pt\hskip 0.85358pt\stheta^{\mathrm{(W)}}_{n}\big{]}^{\mathrm{T}}\hskip-0.85358pt\hskip-0.85358pt\hskip-0.85358pt, where 𝒑n(W)=[xn(W)yn(W)]T2\bm{p}_{n}^{\mathrm{(W)}}=\big{[}x_{n}^{\mathrm{(W)}}\hskip 0.85358pt\hskip 0.85358pt\hskip 0.85358pty_{n}^{\mathrm{(W)}}\big{]}^{\mathrm{T}}\hskip-0.85358pt\in\hskip-0.85358pt\mathbb{R}^{2} is position, 𝒗n(W)=[x˙n(W)y˙n(W)]T2\bm{v}_{n}^{\mathrm{(W)}}=\big{[}\dot{x}_{n}^{\mathrm{(W)}}\hskip 0.85358pt\hskip 0.85358pt\hskip 0.85358pt\dot{y}_{n}^{\mathrm{(W)}}\big{]}^{\mathrm{T}}\hskip-0.85358pt\in\hskip-0.85358pt\mathbb{R}^{2} is the velocity, and θn(W)[0,360)\theta^{\mathrm{(W)}}_{n}\hskip-0.85358pt\in\hskip-0.85358pt[0^{\circ},360^{\circ}) is the heading (Fig. 3). All three quantities are defined with respect to a global (“world”) 2-D Cartesian coordinate system. In particular, the x- and y-axes are positive along the east and north directions, and the heading of 0°  points east and increases counterclockwise. The superscripts (W) and (B) indicate the “world” and “body” reference frames, respectively. The heading is relevant for the measurement model where the local velocity measurement has to be rotated from the body reference frame to the world reference frame.

The state transition between time steps follows a nearly constant velocity motion model, i.e.,

𝒙n=[10tΔ00010tΔ0001000001000001]𝒙n1+𝒖n\bm{x}_{n}=\begin{bmatrix}1&0&t_{\Delta}&0&0\\ 0&1&0&t_{\Delta}&0\\ 0&0&1&0&0\\ 0&0&0&1&0\\ 0&0&0&0&1\end{bmatrix}\bm{x}_{n-1}+\bm{u}_{n}\vskip 1.42262pt (11)

where tΔt_{\Delta} is the length of each discrete time step and 𝒖n5\bm{u}_{n}\hskip-0.85358pt\in\hskip-0.85358pt\mathbb{R}^{5} is a zero-mean multivariate Gaussian random vector with covariance matrix given by

𝚺u=𝑼[σ𝒖12000σ𝒖12000σ𝒖22]𝑼T,𝑼=[tΔ22000tΔ220tΔ000tΔ000tΔ].\bm{\varSigma}_{u}=\bm{U}\hskip-0.85358pt\hskip-0.85358pt\hskip-0.85358pt\hskip-0.85358pt\begin{bmatrix}\sigma_{\bm{u}_{1}}^{2}&0&0\\ 0&\sigma_{\bm{u}_{1}}^{2}&0\\ 0&0&\sigma_{\bm{u}_{2}}^{2}\end{bmatrix}\hskip-0.85358pt\hskip-0.85358pt\bm{U}^{\hskip-0.85358pt\mathrm{T}}\hskip-0.85358pt\hskip-0.85358pt\hskip-0.85358pt,\quad\bm{U}\hskip-0.85358pt=\hskip-0.85358pt\begin{bmatrix}\frac{t_{\Delta}^{2}}{2}&0&0\\ 0&\frac{t_{\Delta}^{2}}{2}&0\\ t_{\Delta}&0&0\\ 0&t_{\Delta}&0\\ 0&0&t_{\Delta}\end{bmatrix}\hskip-0.85358pt\hskip-0.85358pt\hskip-0.85358pt.\vskip 0.0pt

Here, σ𝒖12\sigma_{\bm{u}_{1}}^{2} and σ𝒖22\sigma_{\bm{u}_{2}}^{2} are the variance of a random linear acceleration and turn rate, respectively. The driving noise 𝒖n\bm{u}_{n} is assumed statistically independent across time nn. The state-transition model in (11) induces the state-transition PDF 𝓅(x𝓃|x𝓃1)\mathpzc{p}(\bm{x}_{n}|\bm{x}_{n-1}) that will be used in the prediction step of the proposed navigation filter. At the time n=0n\hskip-0.85358pt=\hskip-0.85358pt0, it is assumed that the prior PDF of the AUV state, i.e., 𝓅(x0)\mathpzc{p}(\bm{x}_{0}) is available.

The measurements at time nn are denoted as 𝒛n=[zrnz𝒗n(B)zφn]T\bm{z}_{n}\hskip-0.85358pt=\hskip-0.85358pt\big{[}z_{r_{n}}\hskip 0.85358pt\hskip 0.85358pt\hskip 0.85358ptz_{\bm{v}_{n}^{\mathrm{(B)}}}\hskip 0.85358pt\hskip 0.85358ptz_{\varphi_{n}}\big{]}^{\mathrm{T}}, where zrnz_{r_{n}} is the range measurement that is computed as discussed in Sec. II, z𝒗n(B)=[x˙n(B)y˙n(B)]Tz_{\bm{v}_{n}^{\mathrm{(B)}}}=\big{[}\dot{x}_{n}^{\mathrm{(B)}}\hskip 0.85358pt\hskip 0.85358pt\hskip 0.85358pt\dot{y}_{n}^{\mathrm{(B)}}\big{]}^{\mathrm{T}} is the velocity in the local “body” reference frame of the vehicle, and zφnz_{\varphi_{n}} is a compass reading, i.e., a noisy measurement of the AUV heading. As a statistical model for the range measurements, we use

zrn\displaystyle z_{r_{n}} =𝒑n(W)𝒑s,n+γn.\displaystyle=\|\bm{p}_{n}^{\mathrm{(W)}}-\bm{p}_{\mathrm{s},n}\|+\gamma_{n}. (12)

Here, 𝒑s,n\bm{p}_{\mathrm{s},n} is the position of the SOO in the global reference frame that is assumed known, and γn\gamma_{n} is a zero-mean Gaussian measurement noise with variance σγ2\sigma^{2}_{\gamma}. The measurement noise γn\gamma_{n} is assumed statistically independent across nn. The initial position, 𝒑s,0\bm{p}_{\mathrm{s},0}, and velocity, 𝒑v,0\bm{p}_{v,0}, of the SOO are known, i.e., received over the automatic identification system (AIS) as the AUV starts to dive. Furthermore, it is assumed that the SOO stays on course, and thus, the position of the SOO at time nn can be predicted reliably, i.e., 𝒑s,n=𝒑s,0+ntΔ𝒑v,0\bm{p}_{\mathrm{s},n}=\bm{p}_{\mathrm{s},0}+n\hskip 0.85358pt\hskip 0.85358ptt_{\Delta}\hskip 0.85358pt\hskip 0.85358pt\bm{p}_{v,0}. Future work will include an improved model for passive acoustic ranging that consists of a statistical characterization of the processing performed in Sec. II and takes the uncertainty of knowing the position of the SOO into account.

The velocity measurements provided by the Doppler velocity log (DVL) of the AUV are modeled as

z𝒗n(B)\displaystyle z_{\bm{v}_{n}^{\mathrm{(B)}}} =RWB(ϑn(W))𝒗n(W)+𝝃n,\displaystyle={}_{B\hskip-0.85358pt}R_{W}(\stheta^{\mathrm{(W)}}_{n})\hskip 0.85358pt\hskip 0.85358pt\bm{v}_{n}^{\mathrm{(W)}}+\bm{\xi}_{n}, (13)

where RWB(ϑn(W)){}_{B\hskip-0.85358pt}R_{W}\big{(}\stheta^{\mathrm{(W)}}_{n}\big{)} is the rotation matrix from the world to the body reference frame, i.e.,

RWB(ϑn(W))=[cos(θn(W))sin(θn(W))sin(θn(W))cos(θn(W))]{}_{B\hskip-0.85358pt}R_{W}\big{(}\stheta^{\mathrm{(W)}}_{n}\big{)}=\begin{bmatrix}\hskip 9.6739pt\cos{\big{(}\theta^{\mathrm{(W)}}_{n}\big{)}}&\sin{\big{(}\theta^{\mathrm{(W)}}_{n}\big{)}}\\ -\sin{\big{(}\theta^{\mathrm{(W)}}_{n}\big{)}}&\cos{\big{(}\theta^{\mathrm{(W)}}_{n}\big{)}}\end{bmatrix}\hskip-0.85358pt\hskip-0.85358pt\vskip 0.0pt

and 𝝃n\bm{\xi}_{n} is a zero-mean multivariate Gaussian measurement noise with covariance matrix 𝑰2σ𝝃2\bm{I}_{2}\sigma_{\bm{\xi}}^{2}. The measurement noise is assumed to be statistically independent across nn.

Finally, the compass readings are modeled as

zφn\displaystyle z_{\varphi_{n}} =θn+ηn\displaystyle=\theta_{n}+\eta_{n} (14)

where ηn\eta_{n} is again a zero-mean Gaussian measurement noise with variance ση2\sigma^{2}_{\eta}. The measurement noise ηn\eta_{n} is also statistically independent across nn.

The measurement models in (12), (13), and (14) induce the following likelihood function at time nn, i.e.,

𝓅(z𝓃|x𝓃)=𝓅(𝓏𝓇𝓃|p𝓃(W))𝓅(𝓏v𝓃(B)|v𝓃(W),θ𝓃(W))𝓅(𝓏φ𝓃|θ𝓃(W)).\displaystyle\mathpzc{p}\big{(}\bm{z}_{n}\big{|}\bm{x}_{n}\big{)}=\mathpzc{p}\big{(}z_{r_{n}}\big{|}\bm{p}_{n}^{\mathrm{(W)}}\big{)}\hskip 0.85358pt\mathpzc{p}\big{(}z_{\bm{v}_{n}^{\mathrm{(B)}}}\big{|}\bm{v}_{n}^{\mathrm{(W)}}\hskip-0.85358pt\hskip-0.85358pt,\theta_{n}^{\mathrm{(W)}}\big{)}\hskip 0.85358pt\mathpzc{p}\big{(}z_{\varphi_{n}}\big{|}\theta_{n}^{\mathrm{(W)}}\big{)}.

This likelihood function will be used in the update step of the proposed navigation filter.

III-A Estimation Problem and Navigation Filter

At time nn, we aim to estimate the state 𝒙n\bm{x}_{n} of the AUV from all measurements 𝒛1:n\bm{z}_{1:n}. Given the conditional PDF of the state, p(𝒙n|𝒛1:n)p(\bm{x}_{n}|\bm{z}_{1:n}), the minimum mean-squared error (MMSE) estimate [25] of the state can be obtained as

𝒙^nMMSE=𝒙n𝓅(x𝓃|z1:𝓃)dx𝓃.\hat{\bm{x}}^{\mathrm{MMSE}}_{n}=\int\hskip-0.85358pt\bm{x}_{n}\hskip 0.85358pt\mathpzc{p}(\bm{x}_{n}|\bm{z}_{1:n})\hskip 0.85358pt\mathrm{d}\bm{x}_{n}. (16)

A Bayes filter[26], which consists of prediction and update steps, is applied to compute the conditional PDF p(𝒙n|𝒛1:n)p(\bm{x}_{n}|\bm{z}_{1:n}). The prediction step uses the Chapman-Kolomorogov equation, which involves the state-transition PDF. The update step is based on Baye’s rule and the likelihood function in (LABEL:eq:likelihood). Due to the nonlinearities in our measurement model, we follow the particle filtering approach [26] to compute a particle-based approximation of (16).

IV Results

The computed range rate and range results are presented based on the simulated and real data. Then, the navigation capability is demonstrated using the simulated data. Finally, the relevant hyperparameters to the simulation are summarized in Table. I.

Hyperparameters Value
Prior Position STD (m) 500500
Driving Noise STD, σ𝒖1\sigma_{\bm{u}_{1}} (m s-1) 0.020.02
Turning Rate Noise STD, σ𝒖2\sigma_{\bm{u}_{2}} (° ) 0.500.50
Measurement Noise (Heading) STD, ση\sigma_{\eta} (° ) 0.500.50
Measurement Noise (Range) STD, σγ\sigma_{\gamma} (m) 150150
Measurement Noise (Velocity) STD, σ𝝃\sigma_{\bm{\xi}} (m s-1) 0.020.02
Average Acoustic SNR for navigation(dB) 1212
Number of Particles 500,000500,\hskip-0.85358pt000
TABLE I: Hyperparameters used to simulate the proposed AUV navigation method.

For the investigation using the real data, the acoustic recordings on the first element of the Horizontal Line Array South (HLAS) are processed from the S5 event of the SWellEx-96 experiment (Fig. 6(a)). The acoustic data from 23:52 to 00:29 in Greenwich Mean Time (GMT) are used. The sampling frequency is 3276.8 Hz, and the spectrogram was generated using a window length of 3276 samples (no overlap, Hanning window).

IV-A Simulation Environment and Data

As for the simulation, a scenario where a moving AUV, equipped with a single hydrophone, a DVL, and a compass, records a sound from a moving SOO in a shallow-water environment is considered. Initially, the SOO is at (2000,0)(2000,0) moving north at 10 knots (5.14{\sim}5.14 m s-1). The AUV starts at the origin, moving northeast at 3knots (1.54{\sim}1.54 m s-1), i.e., 𝒙0=[0,0,1.1,1.1,45]T\bm{x}_{0}=[0,0,1.1,1.1,45]^{\mathrm{T}}, for 20 minutes. Its driving noise, σ𝒖1\sigma_{\bm{u}_{1}}, is 0.02 m s-1, which accounts for random motion due to the control and the current in the ocean, and the turning rate noise, σ𝒖2\sigma_{\bm{u}_{2}}, is 0.5° .

The simulation environment follows the geoacoustic model from the S5 event of the SWellEx-96 experiment with the vertical line array (VLA)[21] (Fig. 4). The source depth was chosen as zs=9z_{s}\hskip-0.85358pt=\hskip-0.85358pt9. This is the same depth as the towed source at the shallow depth in the SWellEx-96 experiment above the thermocline. Rouseff and Spindel [16] discuss the impact of different factors, including the source depth, on the “distribution” of the WI. The receiver depth was arbitrarily chosen close to the 34th element of the VLA at a depth of zr=150z_{r}\hskip-0.85358pt=\hskip-0.85358pt150 m. The acoustic signals are simulated with an acoustic modal simulator, KRAKEN, which generates the pressure field in the range and frequency domain.

Refer to caption
Figure 4: Acoustic environment used for performance evaluation of the proposed AUV navigation method. In the considered range-independent shallow-water scenario, the sound speed profile and seafloor properties are adopted from the SWellEx-96 experiment event S5.

IV-B Range Rate and Range Computation

The range rate computation uses 4 min long segments, while the range computation uses a 10 min long signal. In the range rate computation, simulation and real data use the signals at 109109 Hz. For the range computation, the acoustic signals of four tones (109,127,145,163109,127,145,163 Hz) from a source at a shallow depth are used in both simulation and real data. When the first range rate measurement becomes available, another 1010 min long acoustic recording is necessary to compute the range. Recall that using the 44 min long spectrogram, the first range rate can be computed after 22 min. Hence, the first range measurement becomes available after 1212 min.

IV-B1 Simulation

Monte Carlo experiments with different signal-to-noise ratios (SNRs) of the acoustic recording under the scenario described in Sec. IV-A are performed to characterize the performance of the range rate computation and the range computation. The SNRs considered are 33, 55, 1010, 1515 and 2020 dB, and 300300 Monte Carlo experiments were conducted for each SNR. Here, circularly symmetric Gaussian noise is added to the source signal.

The range rates computed from the simulated data are compared to the true range rate at every 30 s interval, and their root-mean-square errors (RMSEs) sampled for each SNR are calculated (Fig. 5(a) and (b)). The RMSE converges to a biased value of 0.17{\sim}0.17 m s-1 as the SNR increases. This bias is introduced mainly by two factors. The first factor is the limited time interval sampling, which results in a range rate resolution of 0.110.11 ms1\mathrm{m\,s^{-1}}. The second factor is a combination of the wavenumber variation [6] and the mismatch between the sound speed of the water, vv, and the horizontal phase speed, vp¯\overline{v_{p}}.

Based on these computed range rates, the ranges between the source and receiver are calculated as described in Sec. II-B (Fig. 5(c)). The computed WI for the simulated environment is zβ,sim=1.11z_{\beta,\mathrm{sim}}\hskip-0.85358pt=\hskip-0.85358pt1.11. The computed ranges are also compared to the true range at every 30 s interval to calculate the range RMSE over the Monte Carlo experiments for each SNR (Fig. 5(d)).

Refer to caption

(a)

Refer to caption

(b)

Refer to caption

(c)

Refer to caption

(d)

Figure 5: An example of range rate computed in a simulated scenario is shown in (a). This range rate computation requires data with a duration of 44 min. The range rate RMSE at higher SNRs is biased due to the limited resolution and the approximations made by the range rate computation method. The RMSE of the computed range rate as a function of the SNR is presented in (b). An example of a range computed in a simulated scenario is shown in (c). Since the range computation is feasible 1010 min after the range rate computation becomes available, the curve for the computed range starts after 1212 min. The RMSE of the computed range, which relies on the computed range rate as a function of the SNR, is depicted in (d).

Refer to caption

(a)

Refer to caption

(b)

Refer to caption

(c)

Figure 6: A spectrogram with tonal signals from the moving source is shown in (a). This spectrogram relies on the acoustic data recorded by the first element of HLAS during the SWellEx-96 experiment event S5. The red lines indicate the striations related to β=1.15\beta=1.15. A single tone at 109109 Hz is used for computing the range rate measurement, and four tones at frequencies 109,127,145109,127,145, and 163163 Hz are used to compute the range measurements. The range rate and range measurements are compared with their respective ground truth in (b) and (c). The average SNR of the acoustic signal is 88 dB in the considered dataset.

IV-B2 Real Data

The geoacoustic properties are equivalent to the simulation environment, except for the seafloor and receiver depths. The receiver is located on the seafloor, which is 198 m deep. The noise variance for the SNR calculation is obtained as the power at which the tones were not present.

The computed range rates are in good accordance with the true ranges rate but appear to be biased by 0.100.10 m s-1 (Fig. 6(b)). On top of the two factors for the bias discussed in Sec. IV-B1, there is also a model mismatch. The bathymetry along the event S5 slowly changes; therefore, it is a mildly range-dependent environment. The true range rate was computed by taking a moving average of the range difference between the receiver and the ship’s location, which was recorded with a GPS every minute. In addition, the computed range rate considers the phase offset discussed in [22]. In particular, given the sampling frequency of 3276.83276.8 Hz, the phase offset due to taking a segment of length 32763276 samples is 0.360.36 m s-1, which is removed from the computed range rates.

Using zβ,real=1.15z_{\beta,\mathrm{real}}\hskip-0.85358pt=\hskip-0.85358pt1.15, which was computed using the same data, the range RMSE compared to the actual range is 217217 m (Fig. 6(b)). The average SNR of the data used is calculated to be 88 dB. Note also that the range computed using the average of the moving averaged true range rate (2.392.39 m s-1) yields an RMSE of 104104 m. In comparison, the range RMSE based on the simulation is approximately 109109 m at average SNR of 1010 dB in Fig. 5. Given the bias in the computed range rate, it is promising that the range computed using the SWellEx-96 event S5 data is close to the expected error.

Even though no real acoustic recordings from an AUV are available in this work, the following tracking simulation results show that our proposed method has a strong potential for bounding the error for self-localization in underwater navigation scenarios with just a single hydrophone in addition to the sensors for a dead-reckoning navigation.

IV-C Navigation

300300 Monte Carlo experiments of the given the 20 min long scenario were performed to characterize the position error and covariance. The average SNR of the acoustic signal is 1212 dB. Assuming that the AUV has traveled based on dead-reckoning for some time, the initial position estimate is sampled from a normal distribution with a standard deviation of 500500 m. The corresponding WI parameter is computed as zβ,sim=1.11z_{\beta,\mathrm{sim}}\hskip-0.85358pt=\hskip-0.85358pt1.11. To avoid particle degeneracy[27], 500,000500,000 particles were used, and a Gaussian kernel with a standard deviation (STD) of 1010 m was added to the position states of the particles at every resampling stage for roughening. Using a MacBook Pro with an Apple M1 Pro chip and 3232 GB memory, the particle-based estimation at each time step takes a median value of 0.11 s.

Each measurement noise STD reflects the error specifications of real sensors used in the AUV navigation. The STD of the heading and the speed over the ground are modeled after OceanServer OS5000[28] and Teledyne RD Instruments Explorer Doppler Velocity Log[29], respectively.

As discussed above, the first range measurement can be computed after 1212 min of acoustic data are available. As a result, sharp drops in the position error and the sample covariance of the particles after 1212 min are observed in Fig. 7. Furthermore, the navigation simulation results show that the position error is reduced once incorporated and keeps it bounded.

Refer to caption

(a)

Refer to caption

(b)

Refer to caption

(c)

Figure 7: An example of an AUV navigation scenario is shown in (a). This scenario is related to a single simulation run. Evolution in time is indicated by the shade of dots that show position estimates. The error ellipses represent the sample standard deviation of the position estimates along its major and minor axes. It can be seen that the ellipses become thin once the navigation filter incorporates the range measurements. The RMSE of the position estimates is compared to the true positions in (b). The average sample covariance of the particles representing the posterior distribution of the AUV position is presented in (c). Both (b) and (c) are based on 300300 Monte Carlo experiments. The strong reduction in RMSE and the covariance at 1212 min indicates the beginning of the WI-based ranging.

V Conclusion

We developed a Bayesian method for the navigation of autonomous underwater vehicles (AUVs) in shallow water. Our approach relies on passively recorded signals from acoustic sources of opportunities (SOOs). A measurement of the range with respect to the SOO is extracted from the spectrogram of a single hydrophone by exploiting striation patterns described by the waveguide invariant. A particle-based navigation filter processes the resulting range measurements and the AUV’s internal velocity and heading measurements. As a result, the position error, which would otherwise increase over time, can be bounded. The ability to compute the range rate and range measurements from the pressure field measured using a single hydrophone is demonstrated on real data from the SWellEx-96 experiment. The achieved ranging accuracy shows potential for real-world applications. In particular, by adding just a single hydrophone to a navigation system that only relies on dead-reckoning, the system can limit its position uncertainty if a suitable SOO is available. Future research directions include an extension to tracking of SOOs [30, 31, 32] and model adaptation using deep learning [33].

Acknowledgement

This work was supported by the Defense Advanced Research Projects Agency (DARPA) under Grant D22AP00151. We thank Dr. H. Akins, Dr. A. H. Young, Prof. S. T. Rakotonarivo, Prof. A. H. Harms and Prof. W. A. Kuperman for interesting discussions and comments.

References

  • [1] J. González-García, A. Gómez-Espinosa, E. Cuan-Urquizo, L. G. García-Valdovinos, T. Salgado-Jiménez, and J. A. E. Cabello, “Autonomous underwater vehicles: Localization, navigation, and communication for collaborative missions,” Appl. Sci., vol. 10, no. 4, 2020.
  • [2] R. P. Stokey and T. C. Austin, “Sequential long-baseline navigation for REMUS, an autonomous underwater vehicle,” in Proc. Information Systems for Navy Divers and Autonomous Underwater Vehicles Operating in Very Shallow Water and Surf Zone Regions, vol. 3711. SPIE, 1999, pp. 212–219.
  • [3] Y. Chen, D. Zheng, P. A. Miller, and J. A. Farrell, “Underwater inertial navigation with long baseline transceivers: A near-real-time approach,” IEEE Trans. Control Syst. Technol., vol. 24, no. 1, pp. 240–251, 2016.
  • [4] M. McKenna, D. Ross, S. Wiggins, and J. Hildebrand, “Underwater radiated noise from modern commercial ships,” J. Acoust. Soc. Am., vol. 131, pp. 92–103, 01 2012.
  • [5] H. M. Perez, R. Chang, R. Billings, and T. L. Kosub, “Automatic identification systems (AIS) data use in marine vessel emission estimation,” in Proc. 18th Annu. Int. Emission Inventory Conf., 2009.
  • [6] S. T. Rakotonarivo and W. A. Kuperman, “Model-independent range localization of a moving source in shallow water,” J. Acoust. Soc. Am., vol. 132, no. 4, pp. 2218–2223, 2012.
  • [7] F. B. Jensen, W. A. Kuperman, M. B. Porter, and H. Schmidt, Computational Ocean Acoustics, 2nd ed. Springer Publishing Company, Incorporated, 2011.
  • [8] S. Chuprov, “Interference structure of a sound field in a layered ocean,” Ocean Acoustics, Current State, pp. 71–99, 1982.
  • [9] G. L. D’Spain and W. A. Kuperman, “Application of waveguide invariants to analysis of spectrograms from shallow water environments that vary in range and azimuth,” J. Acoust. Soc. Am., vol. 106, no. 5, pp. 2454–2468, 1999.
  • [10] A. M. Thode, “Source ranging with minimal environmental information using a virtual receiver and waveguide invariant theory,” J. Acoust. Soc. Am., vol. 108, no. 4, pp. 1582–1594, 2000.
  • [11] K. L. Cockrell and H. Schmidt, “Robust passive range estimation using the waveguide invariant,” J. Acoust. Soc. Am., vol. 127, no. 5, pp. 2780–2789, 2010.
  • [12] A. Turgut, M. Orr, and D. Rouseff, “Broadband source localization using horizontal-beam acoustic intensity striations,” J. Acoust. Soc. Am., vol. 127, no. 1, pp. 73–83, 2010.
  • [13] C. Cho, H. C. Song, and W. S. Hodgkiss, “Robust source-range estimation using the array/waveguide invariant and a vertical array,” J. Acoust. Soc. Am., vol. 139, no. 1, pp. 63–69, 2016.
  • [14] C. Gervaise, B. G. Kinda, J. Bonnel, Y. Stéphan, and S. Vallez, “Passive geoacoustic inversion with a single hydrophone using broadband ship noise,” J. Acoust. Soc. Am., vol. 131, no. 3, pp. 1999–2010, 2012.
  • [15] B. Katsnelson, A. Lunkov, and I. Ostrovsky, “Interference pattern of the sound field in the presence of an internal kelvin wave in a stratified lake,” J. Acoust. Soc. Am., vol. 139, no. 2, pp. 881–889, 2016.
  • [16] D. Rouseff and R. Spindel, “Modeling the waveguide invariant as a distribution,” J. Acoust. Soc. Am., vol. 621, 06 2002.
  • [17] A. H. Young, J. Soli, and G. Hickman, “Self-localization technique for unmanned underwater vehicles using sources of opportunity and a single hydrophone,” in Proc. IEEE OCEANS, 2017, pp. 1–6.
  • [18] A. H. Young, H. A. Harms, G. W. Hickman, J. S. Rogers, and J. L. Krolik, “Waveguide-invariant-based ranging and receiver localization using tonal sources of opportunity,” IEEE J. Ocean. Eng., vol. 45, no. 2, pp. 631–644, 2020.
  • [19] X. Song, F. Liu, and Y. Shao, “Radial source velocity estimation using multiple line spectrum signals based on compressive sensing,” in Proc. IEEE WCSP, 2018.
  • [20] A. Zhao, P. Song, J. Hui, J. Li, and K. Tang, “Passive estimation of target velocity based on cross-spectrum histogram,” J. Acoust. Soc. Am., vol. 151, no. 5, pp. 2967–2974, 2022.
  • [21] J. Murray and D. Ensberg, “Swellex-96: S5 event,” http://swellex96.ucsd.edu/s5.htm, 1996, online; Accessed on 28-Feb-2023.
  • [22] C. Wang, J. Wang, and P. Du, “Estimation of moving target speed using weak line spectrum of single-hydrophon,” in Proc. IEEE Int. Conf. on Signal Process., Commun. and Comput., 2017, pp. 1–5.
  • [23] R. J. Urick, Principles of Underwater Sound, 3rd ed. Peninsula Publishing, 2013.
  • [24] A. Harms, J. L. Odom, and J. L. Krolik, “Ocean acoustic waveguide invariant parameter estimation using tonal noise sources,” in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., 2015, pp. 4001–4004.
  • [25] S. M. Kay, Fundamentals of Statistical Signal Processing: Estimation Theory. Upper Saddle River, NJ: Prentice-Hall, 1993.
  • [26] M. S. Arulampalam, S. Maskell, N. Gordon, and T. Clapp, “A tutorial on particle filters for online nonlinear/non-Gaussian Bayesian tracking,” IEEE Trans. Signal Process., vol. 50, no. 2, pp. 174–188, Feb. 2002.
  • [27] T. Li, M. Bolic, and P. M. Djuric, “Resampling methods for particle filtering: Classification, implementation, and strategies,” IEEE Signal Process. Mag., vol. 32, no. 3, pp. 70–86, 2015.
  • [28] OceanServer Technology, Inc., “Digital Compass Users Guide, OS5000 Series, Rev 4.0,” 2010.
  • [29] T. R. Instruments, “A Teledyne RD Instruments Navigation Datasheet,” Jul. 2013.
  • [30] F. Meyer, T. Kropfreiter, J. L. Williams, R. Lau, F. Hlawatsch, P. Braca, and M. Z. Win, “Message passing algorithms for scalable multitarget tracking,” Proc. IEEE, vol. 106, no. 2, pp. 221–259, 2018.
  • [31] J. Jang, F. Meyer, E. R. Snyder, S. M. Wiggins, S. Baumann-Pickering, and J. A. Hildebrand, “Bayesian detection and tracking of odontocetes in 3-D from their echolocation clicks,” J. Acoust. Soc. Am., vol. 153, no. 5, p. 2690, May 2023.
  • [32] F. Meyer and J. L. Williams, “Scalable detection and tracking of geometric extended objects,” IEEE Trans. Signal Process., vol. 69, pp. 6283–6298, 2021.
  • [33] M. Liang and F. Meyer, “Neural enhanced belief propagation for multiobject tracking,” IEEE Trans. Signal Process., 2023, to appear.