
An Efficient High-Dimensional Sparse Fourier Transform
(This manuscript was submitted to IEEE Transactions on Aerospace and Electronic Systems for review on Sept. 1st, 2016.)

Shaogang Wang, Vishal M. Patel, and Athina Petropulu are with the Department of Electrical and Computer Engineering, Rutgers University, Piscataway, NJ, USA. Email: {shaogang.wang, vishal.m.patel, athinap}@rutgers.edu.
Abstract

We propose RSFT, an extension of the one-dimensional Sparse Fourier Transform (SFT) algorithm to higher dimensions that can be applied to real, noisy data. The RSFT allows for off-grid frequencies. Furthermore, by incorporating Neyman-Pearson detection, the frequency detection stages in RSFT do not require knowledge of the exact sparsity of the signal and are more robust to noise. We analyze the asymptotic performance of RSFT, and study the tradeoff between computational complexity and worst case signal SNR. We show that by choosing proper parameters, the optimal tradeoff can be achieved. We discuss the application of RSFT to short range ubiquitous radar signal processing, and demonstrate its feasibility via simulations.

Index Terms:
Array signal processing, sparse Fourier transform, detection and estimation, radar signal processing.

I Introduction

The high dimensional FFT (N-D FFT) is used in many applications that require a multidimensional Discrete Fourier Transform (DFT), such as radar and imaging. However, its complexity grows with the dimension, leading to costly hardware and slow system reaction. The recently developed Sparse Fourier Transform (SFT) [1, 2, 3, 4, 5, 6], designed for signals that contain a small number of frequencies, enjoys low complexity, and thus is ideally suited to big data scenarios [7, 8, 9]. One such application is radar target detection. Radar returns are typically sparse in the target space, i.e., the number of targets is far smaller than the number of resolution cells. This observation motivates the application of SFT to radar signal processing.

The current literature on multi-dimensional extensions of the SFT includes [10, 5, 3, 6]. Those works mainly address sample complexity, i.e., using the fewest time domain samples to reconstruct the signal frequencies. To detect the significant frequencies in approximately sparse settings, i.e., when the signal is corrupted by additive noise, the aforementioned methods assume that the exact sparsity is known, and compare the frequency amplitudes with a predefined threshold. However, in many real applications, the exact signal sparsity may be unknown or subject to change. For example, in the radar case, the number of targets to be detected is typically unknown and usually varies over time. Also, setting an appropriate detection threshold in noisy cases is not trivial, since it governs the tradeoff between probability of detection and false alarm rate. These issues have not been studied in the SFT literature.

One of the constraints in the aforementioned SFT algorithms is the assumption that the signal's discrete frequencies are all on-grid. In reality, however, the signal frequencies, when discretized, can fall between grid points, depending on the grid size. The consequence of off-grid frequencies is leakage into other frequency bins, which effectively reduces the sparsity of the signal. To refine the estimation of off-grid frequencies starting from initial SFT-based estimates, [9] proposed a gradient descent method. However, the method of [9] has to enumerate all possible gradient directions for each frequency and compute the approximation error for each guessed direction, which increases the computational complexity. Moreover, as in the other aforementioned SFT methods, the thresholding for frequency detection is not addressed in [9]. The off-grid frequency problem in the context of SFT was also studied in [11], where it was assumed that the signal and the noise are well separated in the frequency domain by predefined gaps. However, this restrictive assumption limits the applicability of that work.

In this paper, we set up an SFT-based framework for sparse signal detection in a high dimensional frequency domain and propose a new algorithm, namely the Realistic Sparse Fourier Transform (RSFT), which addresses the shortcomings discussed above. This paper makes the following contributions.

  1.

    The RSFT algorithm does not require knowledge of the number of frequencies to be estimated. Also, it does not need the frequencies to be on-grid and does not require the signal and noise to be separated in the frequency domain. Our method reduces the leakage from off-grid frequencies by applying a window to the input time domain data (see Section III-A for details). The design of this window trades off frequency resolution, leakage suppression, and computational efficiency. We should point out that, unlike the work of [9], which recovers the exact off-grid frequency locations, our work aims to recover gridded locations of off-grid frequencies with less computation.

  2.

    We extend the RSFT to an arbitrary fixed number of dimensions, so that it can replace the N-D FFT in sparse settings and thus enable computational savings.

  3.

    We put RSFT into a Neyman-Pearson (NP) detection framework. Based on the signal model and other design specifications, we give the (asymptotically) optimal thresholds for the two detection stages of the RSFT (see Section IV for details). Since the output of the first detection stage serves as the input of the second stage, the two detection stages are interconnected. The detection thresholds are jointly found by formulating and solving an optimization problem, whose objective minimizes the worst case Signal to Noise Ratio (SNR) (hence the system is more sensitive to weak signals), and whose constraints connect the probability of detection and false alarm rate of both stages.

  4.

    We provide a quantitative measure of the tradeoff between computational complexity and worst case signal SNR for systems that use RSFT, which serves as a concrete design reference for system engineers.

  5.

    We investigate the use of RSFT in multi-dimensional radar signal processing.

A technique closely related to SFT is compressed sensing (CS). Compressed sensing-based methods recover signals by exploiting their sparse features [12]. The application of CS methods in MIMO radar is discussed extensively in [13, 14]. At a high level, CS differs from SFT in that it assumes the signal can be sparsely represented by an overdetermined dictionary, and formulates the recovery as an optimization problem with sparsity constraints (l0l_{0} or l1l_{1} norm). This problem is usually solved by convex optimization, which runs in polynomial time in NN for an NN-dimensional signal. On the other hand, the SFT method finds an (approximately) sparse Fourier representation of a signal in a mean square error sense (l2l_{2} norm). The sample and computational complexities of SFT are sub-linear in NN for a wide range of NN-dimensional signals [6].

A preliminary version of this work appeared in [15]. This paper extends [15] with a detailed analysis of the RSFT algorithm as well as extensive experimental results.

I-A Notation

We use lower-case (upper-case) bold letters to denote vectors (matrices). ()T(\cdot)^{T} and ()H(\cdot)^{H} respectively denote the transpose and conjugate transpose of a matrix or a vector. \|\cdot\| is Euclidean norm for a vector. [𝐚]i[\mathbf{a}]_{i} is the ithi_{th} element of vector 𝐚\mathbf{a}. All operations on indices in this paper are taken modulo NN denoted by []N[\cdot]_{N}. We use \lfloor\cdot\rfloor to denote rounding to floor. [S][S] refers to the set of indices {0,,S1}\{0,...,S-1\}, and [S]a[S]\setminus a is for eliminating element aa from set [S][S]. We use {0,1}B\{0,1\}^{B} to denote the set of BB-dimensional binary vectors. We use diag()\mathop{\mathrm{diag}}\nolimits(\cdot) to denote forming a diagonal matrix from a vector and use 𝔼{}\mathbb{E}\{\cdot\} to denote expectation. The DFT of signal 𝐬{\mathbf{s}} is denoted as 𝐬^\hat{{\mathbf{s}}}. We also assume that the signal length in each dimension is an integer power of 2.

This paper is organized as follows. A brief background on the SFT algorithm is given in Section II. Details of the proposed RSFT algorithm are given in Section III. Section IV presents the derivation of the optimal threshold design for the RSFT algorithm. Then in Section V we provide some numerical results to verify the theoretical findings. An application of the RSFT algorithm in radar signal processing is presented in Section VI and finally, concluding remarks are made in Section VII.

II Preliminaries

II-A Basic Techniques

As opposed to the FFT, which computes the coefficients of all NN discrete frequency components of an NN-sample long signal, the SFT [2] computes only the KK frequency components of a KK-sparse signal. Before outlining the SFT algorithm, we provide some key definitions and properties, extracted and reformulated from [2].

Definition 1.

(Permutation): Define the transform Pσ,τP_{\sigma,\tau} such that, given 𝐱N,σ,τ[N]\mathbf{x}\in\mathbb{C}^{N},\sigma,\tau\in[N], and σ\sigma invertible

σσ1=[1]N.\sigma\sigma^{-1}=[1]_{N}. (1)

Then, the following transformation is called permutation

(Pσ,τx)i=xσi+τ.(P_{\sigma,\tau}x)_{i}=x_{\sigma i+\tau}. (2)

The permutation has the following property.

Property 1.

A modular reordering of the data in time domain results in a modular dilation and phase rotation in the frequency domain, i.e.,

(Pσ,τx)^σi=x^iejτi2πN.\widehat{(P_{\sigma,\tau}x)}_{\sigma i}=\hat{x}_{i}e^{-j\tau i\frac{2\pi}{N}}. (3)
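As a sanity check, Property 1 can be verified numerically. The sketch below assumes numpy's forward-DFT sign convention (kernel e^{-j 2 pi k n / N}); under the opposite convention, the sign of the phase term flips.

```python
import numpy as np

# Numerical check of Property 1 under numpy's DFT sign convention.
rng = np.random.default_rng(0)
N = 16
sigma, tau = 5, 3                      # sigma = 5 is invertible mod 16
sigma_inv = pow(sigma, -1, N)          # 13, since 5 * 13 = 65 = 1 (mod 16)
assert (sigma * sigma_inv) % N == 1

x = rng.standard_normal(N) + 1j * rng.standard_normal(N)
y = x[(sigma * np.arange(N) + tau) % N]        # (P_{sigma,tau} x)_i = x_{sigma i + tau}

X, Y = np.fft.fft(x), np.fft.fft(y)
i = np.arange(N)
# bin sigma*i of the permuted signal holds bin i of the original,
# rotated by a phase that is linear in i
assert np.allclose(Y[(sigma * i) % N], X * np.exp(2j * np.pi * tau * i / N))
```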
Definition 2.

(Aliasing): Let 𝐱N,𝐲B\mathbf{x}\in\mathbb{C}^{N},\mathbf{y}\in\mathbb{C}^{B}, with N,BN,B powers of 2 and B<NB<N. For L=N/BL={N}/{B}, a time-domain aliased version of 𝐱\mathbf{x} is defined as

yi=j=0L1xi+Bj,i[B].y_{i}=\sum_{j=0}^{L-1}x_{i+Bj},\quad i\in[B]. (4)
Property 2.

Aliasing in time domain results in downsampling in frequency domain, i.e.,

y^i=x^iL.\hat{y}_{i}=\hat{x}_{iL}. (5)
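Property 2 is easy to confirm numerically: summing the LL length-BB segments of a vector and taking a BB-point FFT returns every LLth bin of the full NN-point FFT. A minimal numpy check:

```python
import numpy as np

# Aliasing in time <-> subsampling in frequency (Property 2).
rng = np.random.default_rng(1)
N, B = 1024, 64
L = N // B
x = rng.standard_normal(N) + 1j * rng.standard_normal(N)

y = x.reshape(L, B).sum(axis=0)        # y_i = sum_j x_{i + B j}
assert np.allclose(np.fft.fft(y), np.fft.fft(x)[::L])
```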
Definition 3.

(Mapping): Let i,σ[N]i,\sigma\in[N], where σ\sigma satisfies (1). We define the mapping (i,σ)\mathcal{M}(i,\sigma) such that

[B](i,σ)BN[iσ]N.[B]\ni\mathcal{M}(i,\sigma)\equiv\lfloor\frac{B}{N}[i\sigma]_{N}\rfloor. (6)
Definition 4.

(Reverse-mapping): Let σ1[N]\sigma^{-1}\in[N], where σ1\sigma^{-1} satisfies (1), and j[B]j\in[B]. Define the reverse-mapping (j,σ1)\mathcal{R}(j,\sigma^{-1}) such that

(j,σ1){[σ1u]Nu𝕊},\begin{split}&\mathcal{R}(j,\sigma^{-1})\equiv\{[\sigma^{-1}u]_{N}\mid u\in\mathbb{S}\},\end{split} (7)

where

𝕊{v[N]jNBv<(j+1)NB}.\begin{split}\mathbb{S}\equiv\{v\in[N]\mid j\frac{N}{B}\leq v<(j+1)\frac{N}{B}\}.\end{split} (8)
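The mapping and reverse-mapping are consistent in the sense that every index ii lies in the reverse-mapped set of its own bucket, i.e., ii is contained in the set returned by reverse-mapping its mapped bin. A small numpy sketch checking this exhaustively (the helper names are ours):

```python
import numpy as np

# Consistency of Definitions 3 and 4: i is always in R(M(i, sigma), sigma^{-1}).
N, B = 1024, 64
L = N // B
sigma = 383                            # any odd sigma is invertible mod a power of 2
sigma_inv = pow(sigma, -1, N)

def mapping(i):                        # M(i, sigma) = floor(B/N * [i sigma]_N)
    return (B * ((i * sigma) % N)) // N

def reverse_mapping(j):                # R(j, sigma^{-1}) per Definition 4
    return {(sigma_inv * u) % N for u in range(j * L, (j + 1) * L)}

for i in range(N):
    assert i in reverse_mapping(mapping(i))
```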

II-B SFT Algorithm

At a high level, the SFT algorithm in [2] runs two loops, namely the Location loop and the Estimation loop. The former finds the indices of the KK most significant frequencies of the input signal, while the latter estimates the corresponding Fourier coefficients. Here, we emphasize Location over Estimation, since the former is more relevant to the radar application that we consider: the Location step provides frequency locations, which in the radar case are directly related to target parameters.

In the Location loop, a permutation procedure reorders the input data in the time domain, which also reorders the frequencies. The permutation causes closely spaced frequencies to appear in well separated locations with high probability. Then, a flat-window [2] is applied to the permuted signal to extend each single frequency into a (nearly) boxcar, for a reason that will become apparent shortly. The windowed data are aliased, as in Definition 2. The frequency domain equivalent of this aliasing is undersampling by N/BN/B (see Property 2). The flat-window applied at the previous step ensures that no peaks are lost due to this effective undersampling in the frequency domain. After this stage, an FFT of length BB is employed.

The permutation and the aliasing procedures effectively map the signal frequencies from the NN-dimensional space into a reduced BB-dimensional space, where the first-stage-detection procedure finds the significant frequencies’ peaks; their indices are then reverse mapped into the original NN-dimensional frequency space. However, the reverse mapping yields not only the true location of each significant frequency, but also N/B{N}/{B} ambiguous locations per frequency. To remove the ambiguity, multiple iterations of Location with randomized permutations are performed. Finally, the second-stage-detection procedure locates the KK most significant frequencies from the data accumulated over the iterations. More details about the SFT algorithm can be found in [2].

III The RSFT Algorithm

In this section, we address some problems that were not considered in the original SFT algorithm, namely the leakage from off-grid frequencies and the optimal detection threshold design for both detection stages. We also extend the RSFT to an arbitrary fixed number of dimensions.

III-A Leakage Suppression for Off-grid Frequencies

In real world applications, the frequencies are continuous and can take any value in [0,2π)[0,2\pi). When such frequencies are mapped onto a grid, leakage occurs from the off-grid frequencies, which can diminish the sparsity of the signal. As the leakage from strong frequency components can mask the contributions of weak frequency components, it becomes difficult to determine the frequency domain peaks after permutation (see Fig. 1 (c)). To address this problem, we propose to multiply the received time domain signal with a window before permutation. We call this procedure pre-permutation windowing. The idea is to confine the leakage within a finite number of frequency bins, as illustrated in Fig. 1.

The choice of the pre-permutation window is determined by the required resolution, the computational complexity, and the degree of leakage suppression. Specifically, the side-lobe level should be lower than the noise level after windowing (see Fig. 4). However, the larger the side-lobe attenuation, the wider the main lobe, and thus the lower the frequency resolution. Meanwhile, a broader main lobe results in an increased computational load, which will be discussed in Section IV-D.
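The leakage behavior described above can be illustrated numerically. The sketch below uses numpy's Blackman window (roughly -58 dB side-lobes) as a stand-in for the Dolph-Chebyshev window, and compares the far side-lobe level of a worst-case off-grid tone with and without windowing; the tone location and bin ranges are illustrative choices.

```python
import numpy as np

# Leakage of a worst-case (half-bin offset) tone, with and without a window.
N = 1024
n = np.arange(N)
omega = (100 + 0.5) * 2 * np.pi / N            # tone exactly between two bins
x = np.exp(1j * omega * n)

def peak_to_far_leakage_db(s):
    S = np.abs(np.fft.fft(s))
    far = np.delete(S, np.arange(90, 112))     # exclude the main-lobe neighborhood
    return 20 * np.log10(S.max() / far.max())

no_win = peak_to_far_leakage_db(x)             # rectangular window: heavy leakage
win = peak_to_far_leakage_db(np.blackman(N) * x)
assert win > no_win + 20                       # windowing confines the leakage
```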

Figure 1: Effect of pre-permutation windowing. The signal contains two significant frequency components, one of which is 15dB15dB stronger than the other. A Dolph-Chebyshev window is applied to the time-domain signal. The windowed signal after permutation appears sparser in the frequency domain than the permuted signal without windowing. (a) Spectrum of signal without windowing. (b) Spectrum of windowed signal. (c) Spectrum of signal without windowing after permutation. (d) Spectrum of windowed signal after permutation.

III-B Signal Detection Without Knowing the Exact Sparsity and Optimal Threshold Design

In the SFT, detection of the significant frequencies is needed in two stages. When the number of significant frequencies is known and they are all on-grid, detection can be accomplished by finding the KK highest spectral amplitudes.

In reality, however, the exact sparsity KK is usually unknown. Moreover, even if KK were known, due to the leakage caused by off-grid signals, the KK highest spectral peaks might not correctly represent the signal frequencies. Finally, additive noise generates false alarms, which further complicates detection.

To solve the detection problem, we propose to use NP detection in both detection stages, which does not require knowing KK. However, the optimal thresholds still depend on the number of significant frequencies as well as their SNR. In light of this, we provide an optimal design based on bounds on the sparsity and the signal SNR level. The optimal threshold design is presented in Section IV.

III-C High Dimensional Extensions

In the following, we elaborate on the main stages of the high dimensional extension of the RSFT.

III-C1 Windowing

In the pre-permutation windowing and flat-windowing stages, the window for each dimension is designed separately. The high dimensional window is then generated by combining the 1-D windows. For instance, in the 2-D case, if 𝐰x\mathbf{w}_{x} and 𝐰y\mathbf{w}_{y} are the windows in the xx and yy dimensions, respectively, a 2D window can be computed as

𝐖xy=𝐰x𝐰yH.\mathbf{W}_{xy}=\mathbf{w}_{x}\mathbf{w}_{y}^{H}. (9)

Fig. 2 shows a compound 2-D window, which combines a 64-point and a 1024-point Dolph-Chebyshev window, both of which have 60dB60dB side-lobe attenuation in the frequency domain. A Dolph-Chebyshev window allows us to trade off frequency resolution and side-lobe attenuation easily. Other windows offering the same flexibility include the Gaussian, Kaiser-Bessel, and Blackman-Harris windows; see [16] for details. We apply these windows to the data by point-wise multiplication.
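A minimal sketch of the compound window construction in (9), with np.blackman standing in for the Dolph-Chebyshev windows of Fig. 2; it also checks that applying the 2-D window equals windowing each dimension separately:

```python
import numpy as np

# 2-D window as the outer product of two 1-D windows, as in (9).
wx = np.blackman(64)                   # stand-in for the 64-point Dolph-Chebyshev
wy = np.blackman(1024)                 # stand-in for the 1024-point Dolph-Chebyshev
Wxy = np.outer(wx, wy.conj())          # W_xy = w_x w_y^H (real here, so conj is a no-op)
assert Wxy.shape == (64, 1024)

# separability: windowing the 2-D data equals windowing rows then columns
rng = np.random.default_rng(2)
X = rng.standard_normal((64, 1024))
assert np.allclose(Wxy * X, (X * wy) * wx[:, None])
```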

Figure 2: Compound Window in 2-D. (a) Top: a 6464-points Dolph-Chebyshev window; bottom: a 10241024-points Dolph-Chebyshev window. (b) The 2D window.

III-C2 Permutation

The permutation parameters for each dimension are generated randomly according to (2). The permutation is then carried out on each dimension sequentially. An example for the 2-D case is illustrated in Fig. 3.

Figure 3: Permutation and Aliasing in 2D. (a) Original 2D data forms a 4×84\times 8 matrix. (b) Permutation in xx-dimension, σx=3,τx=0\sigma_{x}=3,\tau_{x}=0. (c) Permutation in yy-dimension, σy=3,τy=0\sigma_{y}=3,\tau_{y}=0. After permutation, data is divided into four 2×42\times 4 sub-matrices. (d) Aliasing by adding sub-matrices from (c).

III-C3 Aliasing

The aliasing stage compresses the high dimensional data into a much smaller size. In 2-D, as shown in Fig. 3, a periodic extension of the Nx×NyN_{x}\times N_{y} data matrix is created with period BxB_{x} in the xx dimension and ByB_{y} in the yy dimension, where Bx<<NxB_{x}<<N_{x} and By<<NyB_{y}<<N_{y}, and the basic period, i.e., a Bx×ByB_{x}\times B_{y} sub-matrix, is extracted.
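The 2-D aliasing step can be sketched with a reshape-and-sum, and checked against Property 2 applied in both dimensions (the sizes here are illustrative):

```python
import numpy as np

# 2-D aliasing: folding the Nx x Ny data onto a Bx x By basic period
# subsamples the 2-D spectrum by Lx = Nx/Bx and Ly = Ny/By.
rng = np.random.default_rng(3)
Nx, Ny, Bx, By = 32, 64, 8, 16
Lx, Ly = Nx // Bx, Ny // By
X = rng.standard_normal((Nx, Ny)) + 1j * rng.standard_normal((Nx, Ny))

# sum the Lx * Ly sub-matrices of size Bx x By
Y = X.reshape(Lx, Bx, Ly, By).sum(axis=(0, 2))
assert Y.shape == (Bx, By)
assert np.allclose(np.fft.fft2(Y), np.fft.fft2(X)[::Lx, ::Ly])
```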

III-C4 First-stage-detection and reverse-mapping

We carry out first-stage detection after taking the squared magnitude of the N-D FFT of the aliased data. Each data point is compared with a pre-determined threshold, and the indices of the points passing the threshold are reverse mapped to the original space. The combination of the reverse mapped indices from each dimension provides the tentative locations of the original frequency components.

III-C5 Accumulation and second-stage-detection

The accumulation stage collects the tentative frequency locations found in the reverse mapping of each iteration, and the number of occurrences of each location is computed after running TT iterations. The second stage detection finds, in each dimension, the indices whose number of occurrences passes the threshold of this stage.

Based on the above discussion, we summarize the RSFT in Algorithm 1. Compared to the original SFT, we add a pre-permutation windowing process on the input data and incorporate NP detection in both detection stages.

Depending on the application, the iterations can be applied to the same input data with different permutations. Alternatively, each iteration can process a different segment (see the signal model in (10)) of the input data, as indicated in Algorithm 1; this effectively reduces the variance of the RSFT estimate, provided that the signal is stationary.

The Estimation procedure in the RSFT is similar to the original Estimation procedure, except that the recovered Fourier coefficients should be divided by the spectrum of the pre-permutation window, so that the distortion due to pre-permutation windowing is compensated. Moreover, Location alone is sufficient for our radar application, since it provides the locations of significant frequencies, which are directly related to the parameters to be estimated. We also set τ=0\tau=0 in each permutation, since the random phase rotation does not affect the performance of a detector that operates on the magnitude of the signal in the intermediate stage.

Algorithm 1 RSFT algorithm

Input: complex signal 𝐫\mathbf{r} in any fixed dimension
Output: 𝐨\mathbf{o}, sparse frequency locations of input signal

1:procedure RSFT(𝐫\mathbf{r})
2:  Generate a set of σ\sigma randomly for each dimension
3:  𝐚¯0\bar{\mathbf{a}}\leftarrow 0
4:  for i0i\leftarrow 0 to TT do
5:   Pre-Permutation Windowing: 𝐲𝐖𝐫\mathbf{y}\leftarrow\mathbf{W}\mathbf{r}
6:   Permutation: 𝐩\mathbf{p} \leftarrow Pσ,0𝐲P_{\sigma,0}\mathbf{y}
7:   Flat-windowing: 𝐳\mathbf{z} \leftarrow 𝐖¯𝐩\overline{\mathbf{W}}\mathbf{p}
8:   Aliasing: 𝐟Aliasing(𝐳\mathbf{f}\leftarrow\mathop{\mathrm{Aliasing}}(\mathbf{z})
9:   N-D FFT: 𝐟^\hat{\mathbf{f}} \leftarrow NDFFT(𝐟)\mathop{\mathrm{NDFFT}}(\mathbf{f})
10:   First-stage-detection: 𝐜NPdet1(|𝐟^|2\mathbf{c}\leftarrow\mathop{\mathrm{NPdet1}}(|\hat{\mathbf{f}}|^{2})
11:   Reverse-mapping: 𝐚i\mathbf{a}_{i} \leftarrow Reverse(𝐜)\mathop{\mathrm{Reverse}}(\mathbf{c})
12:   Accumulation: 𝐚¯𝐚¯+𝐚i\bar{\mathbf{a}}\leftarrow\bar{\mathbf{a}}+\mathbf{a}_{i}
13:  end for
14:  Second-stage-detection: 𝐨NPdet2(𝐚¯)\mathbf{o}\leftarrow\mathop{\mathrm{NPdet2}}(\bar{\mathbf{a}})
15:  return 𝐨\mathbf{o}
16:end procedure
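For concreteness, the following is a simplified 1-D sketch of the Location part of Algorithm 1. It makes several simplifying assumptions that are ours, not the paper's: an ideal boxcar spectrum stands in for the flat window of [2], a Hann window stands in for the Dolph-Chebyshev pre-permutation window, and fixed fraction-of-peak and fraction-of-iterations rules stand in for the two NP thresholds derived in Section IV.

```python
import numpy as np

# Simplified 1-D RSFT Location sketch (illustrative thresholds and windows).
rng = np.random.default_rng(4)
N, B, T = 1024, 64, 10
L = N // B
n = np.arange(N)
w = np.hanning(N)                          # pre-permutation window (stand-in)
H = np.zeros(N)                            # flat window, built in the frequency domain:
H[:L + 1] = 1.0                            # ideal boxcar passband of half-width L bins
H[-L:] = 1.0
w_flat = np.fft.ifft(H)                    # its time-domain counterpart

def rsft_locate(r, thr1=0.01, thr2=0.7):
    votes = np.zeros(N)
    for _ in range(T):
        sigma = 2 * int(rng.integers(0, N // 2)) + 1   # odd => invertible mod N
        sigma_inv = pow(sigma, -1, N)
        p = (w * r)[(sigma * n) % N]                   # window, then permute (tau = 0)
        f = (w_flat * p).reshape(L, B).sum(axis=0)     # flat-window, then alias
        F = np.abs(np.fft.fft(f)) ** 2
        for j in np.flatnonzero(F > thr1 * F.max()):   # first-stage detection
            u = np.arange(j * L, (j + 1) * L)          # bucket j's frequency region
            votes[(sigma_inv * u) % N] += 1            # reverse map and accumulate
    return np.flatnonzero(votes >= thr2 * T)           # second-stage detection

k0 = 200
hits = rsft_locate(np.exp(2j * np.pi * k0 * n / N))    # one on-grid tone, noise free
assert k0 in hits and len(hits) <= 4
```

Note that with a pre-permutation window, the window's main-lobe neighbors of the true bin (here k0±1) are typically reported as well, which is the resolution loss discussed in Section III-A.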

IV Optimal Thresholds Design in the RSFT

The main challenge in implementing the RSFT is setting the thresholds in the two stages of NP detection. Applying NP detection in the RSFT is not a straightforward extension of the SFT, in that the two stages are interconnected and thus need to be studied jointly. In the following, we show that the two asymptotically optimal thresholds are jointly found via an optimization process. The analysis is carried out in 1-D; the generalization to higher dimensions is straightforward.

IV-A Signal Model and Problem Formulation

We model the continuous-time signal as a superposition of KK sinusoids in additive white noise. We sample the signal uniformly in both the I and Q channels with a sampling frequency above the Nyquist rate. Assume the total sampling time is divided into TT consecutive segments of equal length, each containing NN samples, with K<<NK<<N (i.e., the signal is sparse in the frequency domain). Then for each time segment s[T]s\in[T], we have

𝐫s=i[K]bi,s𝐯(ωi)+𝐧s,\mathbf{r}_{s}=\sum_{i\in[K]}b_{i,s}\mathbf{v}(\omega_{i})+\mathbf{n}_{s}, (10)

where 𝐯(ωi)\mathbf{v}(\omega_{i}) denotes the ithi_{th} complex sinusoid, with frequency ωi[0,2π)\omega_{i}\in[0,2\pi), i.e.,

𝐯(ωi)=[1ejωiej(N1)ωi]T.\mathbf{v}(\omega_{i})=[1\quad e^{j\omega_{i}}\;\cdots\;e^{j(N-1)\omega_{i}}]^{T}. (11)

We further assume that ωi\omega_{i} is an unknown deterministic quantity that remains constant during the whole process, while the complex amplitude of the sinusoid, bi,sb_{i,s}, takes a random value in each segment. More specifically, we model bi,sb_{i,s} as a circularly symmetric Gaussian process with distribution bi,s𝒞𝒩(0,σbi2)b_{i,s}\sim\mathcal{CN}(0,\sigma_{bi}^{2}). Likewise, the noise 𝐧s\mathbf{n}_{s} is distributed as 𝐧s𝒞𝒩(𝟎,σn2𝐈)\mathbf{n}_{s}\sim\mathcal{CN}(\mathbf{0},\sigma_{n}^{2}\mathbf{I}), where 𝟎\mathbf{0} is the NN-dimensional zero vector and 𝐈N×N\mathbf{I}\in\mathbb{R}^{N\times N} is the identity matrix. We also assume that the sinusoids and the noise are mutually uncorrelated. In addition, neighboring sinusoids are resolvable in the frequency domain, i.e., the frequency spacing of neighboring sinusoids is greater than ηm2πN\eta_{m}\frac{2\pi}{N}, where ηm\eta_{m} is the 6.0-dB bandwidth [16] of the window applied to 𝐫s\mathbf{r}_{s}. Note that the signal model in (10) is also commonly used in the array signal processing literature for Uniform Linear Array (ULA) settings (see, e.g., [17]), in which the time samples in each segment are replaced by spatial samples from the array elements, and a sample segment is referred to as a time snapshot.
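A sketch of a generator for the model (10); the frequencies, variances, and segment counts below are arbitrary illustrative values, not values from the paper:

```python
import numpy as np

# Segments drawn from the model (10).
rng = np.random.default_rng(5)
N, T, K = 256, 100, 3
omegas = np.array([0.5, 1.3, 2.9])         # fixed deterministic frequencies in [0, 2*pi)
sigma_b = np.array([1.0, 0.7, 0.3])        # per-sinusoid amplitude std
sigma_n = 0.1                              # noise std
n = np.arange(N)
V = np.exp(1j * np.outer(n, omegas))       # columns are v(omega_i)

def segment():
    # b_{i,s} ~ CN(0, sigma_b^2), redrawn per segment; n_s ~ CN(0, sigma_n^2 I)
    b = (rng.standard_normal(K) + 1j * rng.standard_normal(K)) * sigma_b / np.sqrt(2)
    noise = (rng.standard_normal(N) + 1j * rng.standard_normal(N)) * sigma_n / np.sqrt(2)
    return V @ b + noise

R = np.stack([segment() for _ in range(T)])          # T x N, one row per segment
expected_power = np.sum(sigma_b ** 2) + sigma_n ** 2 # E{|r_n|^2}, cross terms vanish
assert abs(np.mean(np.abs(R) ** 2) - expected_power) < 0.5
```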

We want to detect and estimate each ωi\omega_{i} from the input signal. From a non-parametric and data-independent perspective, this is a classic spectral analysis and detection problem that can be solved by any FFT-based spectrum estimation method, for example, the Bartlett method followed by an NP detection procedure (see Appendix A). In that case, the FFT computes the signal spectrum on NN frequency bins, and detection is carried out on each bin to determine whether a significant frequency is present. In what follows, we define the detection and estimation problem related to the design of the RSFT.

Let SNRi=σbi2/σn2SNR_{i}=\sigma_{bi}^{2}/\sigma_{n}^{2} be the SNR of the ithi_{th} sinusoid. Let us define the worst case SNR, i.e., SNRminSNR_{min}, as

SNRminmini[K](SNRi).SNR_{min}\triangleq\mathop{\mathrm{min}}_{i\in[K]}(SNR_{i}). (12)

Let PdP_{d} denote the probability of detection for the sinusoid with SNRminSNR_{min}, and PfaP_{fa} the corresponding probability of false alarm on each frequency bin.

Problem 1.

For the signal defined in (10), find the optimal thresholds of the first and the second stage of detection in an asymptotic sense, such that they minimize SNRminSNR_{min} for given Pd,Pfa,N,B,T,KP_{d},P_{fa},N,B,T,K. Also, characterize the tradeoff between computational complexity and SNRminSNR_{min} as a function of various parameters.

In the following, we investigate the two stages of detection separately, then summarize the solution into an optimization problem.

IV-B First Stage Detection

The first stage detection is performed on each data segment. After pre-permutation windowing, permutation and flat-windowing, the input signal can be expressed as

𝐳=𝐖¯𝐏σs𝐖𝐫,\mathbf{z}=\overline{\mathbf{W}}\mathbf{P}_{\sigma_{s}}\mathbf{W}\mathbf{r}, (13)

where σs\sigma_{s} is the permutation parameter for the sths_{th} segment, drawn uniformly at random; 𝐏σs\mathbf{P}_{\sigma_{s}} is the permutation matrix, which acts as (2) with τ=0\tau=0; 𝐖=diag(𝐰)\mathbf{W}=\mathop{\mathrm{diag}}\nolimits(\mathbf{w}) and 𝐖¯=diag(𝐰¯)\overline{\mathbf{W}}=\mathop{\mathrm{diag}}\nolimits(\bar{\mathbf{w}}), where 𝐰\mathbf{w} and 𝐰¯\bar{\mathbf{w}} are the pre-permutation window and the flat-window, respectively.

Regarding the design of the flat-window, we take its frequency domain main-lobe to have width 2π/B2\pi/B, and choose its length in the time domain as NN. As indicated in [2], it is possible to use fewer data in flat-windowing by choosing the length of 𝐰¯\bar{\mathbf{w}} to be less than NN, i.e., dropping some samples in each segment after the permutation. However, a reduced-length window in the time domain results in longer transition regions in the frequency domain, and consequently the detection performance of the system degrades, since a larger transition region allows more noise to enter the estimation process.

The time domain aliasing can be described as

𝐟=i[L](𝐖¯i𝐏σs𝐖𝐫)=𝐕σs𝐫,\mathbf{f}=\sum_{i\in[L]}\left(\overline{\mathbf{W}}_{i}\mathbf{P}_{\sigma_{s}}\mathbf{W}\mathbf{r}\right)=\mathbf{V}_{\sigma_{s}}\mathbf{r}, (14)

where L=N/BL=N/B and 𝐖¯i\overline{\mathbf{W}}_{i} is the ithi_{th} sub-matrix of 𝐖¯\overline{\mathbf{W}}, comprising the iBthiB_{th} to the ((i+1)B1)th((i+1)B-1)_{th} rows of 𝐖¯\overline{\mathbf{W}}, and 𝐕σs=i[L]𝐖¯i𝐏σs𝐖\mathbf{V}_{\sigma_{s}}=\sum_{i\in[L]}\overline{\mathbf{W}}_{i}\mathbf{P}_{\sigma_{s}}\mathbf{W}.
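The matrix form (14) can be checked against the step-by-step pipeline; the windows below are arbitrary stand-ins, since (14) holds for any diagonal 𝐖\mathbf{W} and 𝐖¯\overline{\mathbf{W}}:

```python
import numpy as np

# The matrix V = sum_i Wbar_i P W reproduces window-permute-flat-window-alias.
rng = np.random.default_rng(6)
N, B = 64, 8
L = N // B
sigma = 11
w = np.hanning(N)                      # pre-permutation window (stand-in)
wbar = np.hamming(N)                   # flat-window (stand-in)

W = np.diag(w)
Wbar = np.diag(wbar)
P = np.zeros((N, N))
P[np.arange(N), (sigma * np.arange(N)) % N] = 1.0   # (P y)_i = y_{sigma i}

V = sum(Wbar[i * B:(i + 1) * B, :] @ P @ W for i in range(L))   # B x N

r = rng.standard_normal(N) + 1j * rng.standard_normal(N)
z = wbar * (w * r)[(sigma * np.arange(N)) % N]       # step-by-step pipeline
assert np.allclose(V @ r, z.reshape(L, B).sum(axis=0))
```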

The FFT operation on the aliased data 𝐟\mathbf{f} can be expressed as

𝐟^=𝐃𝐕σs𝐫,\hat{\mathbf{f}}=\mathbf{D}\mathbf{V}_{\sigma_{s}}\mathbf{r}, (15)

where 𝐃B×B\mathbf{D}\in\mathbb{C}^{B\times B} is the DFT matrix. For the kthk_{th} entry of 𝐟^\hat{\mathbf{f}}, we have

[𝐟^]k=𝐮kH𝐕σs𝐫,k[B],[\hat{\mathbf{f}}]_{k}=\mathbf{u}^{H}_{k}\mathbf{V}_{\sigma_{s}}\mathbf{r},\;k\in[B], (16)

where 𝐮k\mathbf{u}_{k} is the kthk_{th} column of 𝐃\mathbf{D}, i.e., 𝐮k=[1ejkΔωBejk(B1)ΔωB]T\mathbf{u}_{k}=[1\quad e^{jk\Delta\omega_{B}}\;\cdots\;e^{jk(B-1)\Delta\omega_{B}}]^{T}, and ΔωB=2π/B\Delta\omega_{B}=2\pi/B.

Substituting (10) into (16), and pulling the mthm_{th} sinusoid, which we assume is the weakest one with SNR equal to SNRminSNR_{min}, out of the summation, we obtain

[𝐟^]k=bm𝐮kH𝐕σs𝐯(ωm)+j[K]m(bj𝐮kH𝐕σs𝐯(ωj))+𝐮kH𝐕σs𝐧.\begin{split}[\hat{\mathbf{f}}]_{k}&=b_{m}\mathbf{u}^{H}_{k}\mathbf{V}_{\sigma_{s}}\mathbf{v}(\omega_{m})\\ &+\sum_{j\in[K]\setminus m}\left(b_{j}\mathbf{u}^{H}_{k}\mathbf{V}_{\sigma_{s}}\mathbf{v}(\omega_{j})\right)\\ &+\mathbf{u}^{H}_{k}\mathbf{V}_{\sigma_{s}}\mathbf{n}.\end{split} (17)

Since [𝐟^]k[\hat{\mathbf{f}}]_{k} is a linear combination of bi,[𝐧]j,i[K],j[N]b_{i},[\mathbf{n}]_{j},i\in[K],j\in[N], it holds that

[𝐟^]k𝒞𝒩(0,σfk2),[\hat{\mathbf{f}}]_{k}\sim\mathcal{CN}(0,\sigma_{fk}^{2}), (18)

where

σfk2=σbm2α(k,σs,ωm)+j[K]m(σbj2α(k,σs,ωj))+σn2β(σs),\begin{split}\sigma_{fk}^{2}&=\sigma_{bm}^{2}\alpha(k,\sigma_{s},\omega_{m})\\ &+\sum_{j\in[K]\setminus m}\left(\sigma_{bj}^{2}\alpha(k,\sigma_{s},\omega_{j})\right)+\sigma_{n}^{2}\beta(\sigma_{s}),\end{split} (19)

and

α(k,σs,ω)=|𝐮kH𝐕σs𝐯(ω)|2β(σs)=𝐖¯𝐏σs𝐰2.\begin{split}&\alpha(k,\sigma_{s},\omega)=|\mathbf{u}^{H}_{k}\mathbf{V}_{\sigma_{s}}\mathbf{v}(\omega)|^{2}\\ &\beta(\sigma_{s})=\|\overline{\mathbf{W}}\mathbf{P}_{\sigma_{s}}\mathbf{w}\|^{2}.\end{split} (20)

It is easy to see that σfk2\sigma_{fk}^{2} is a weighted sum of the variances of the signal components and the noise.

We now investigate the pthp_{th} bin, to which ωm\omega_{m} is mapped, and make the following claim.

Claim 1.

For a complex sinusoid signal, i.e., 𝐯(ω)\mathbf{v}(\omega), after pre-permutation windowing, permutation with σs\sigma_{s}, flat windowing, aliasing and FFT, the highest amplitude of signal spectrum appears in [B][B] at location

p(ω,σs)=BN[σsωΔωN]N.p(\omega,\sigma_{s})=\lfloor{\frac{B}{N}[\sigma_{s}\lfloor\frac{\omega}{\Delta\omega_{N}}\rfloor]_{N}}\rfloor. (21)

where ΔωN=2π/N\Delta\omega_{N}=2\pi/N.

Proof.

If we applied the DFT to 𝐯(ω)\mathbf{v}(\omega), the highest amplitude of the spectrum would appear at the grid point closest to ω\omega, i.e., ωΔωN\lfloor\frac{\omega}{\Delta\omega_{N}}\rfloor. Pre-permutation windowing does not change the position of the highest peak, provided the window is symmetric. After permutation, the peak location is modularly dilated by σs\sigma_{s} and becomes [σsωΔωN]N[\sigma_{s}\lfloor\frac{\omega}{\Delta\omega_{N}}\rfloor]_{N}. Finally, after flat-windowing and aliasing, the signal is ideally downsampled in the frequency domain, and the data length changes from NN to BB. The BB-point DFT then exhibits its highest peak on grid point pp, as desired. A visualization of this process is shown in Fig. 4. ∎
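A numerical spot check of the windowing-plus-permutation part of (21) (aliasing then samples the peak, per Property 2). np.blackman stands in for the pre-permutation window, and the fractional offset is chosen below one half so that the floor in (21) coincides with the nearest grid point:

```python
import numpy as np

# After windowing and permutation (tau = 0), the N-point spectrum of a tone
# peaks at [sigma * floor(omega / dw)]_N.
N = 1024
n = np.arange(N)
k0 = 300
omega = (k0 + 0.3) * 2 * np.pi / N             # slightly off-grid tone
x = np.blackman(N) * np.exp(1j * omega * n)    # pre-permutation windowing

for sigma in (5, 101, 383):                    # a few invertible (odd) dilations
    p = x[(sigma * n) % N]                     # permutation with tau = 0
    peak = int(np.argmax(np.abs(np.fft.fft(p))))
    assert peak == (sigma * k0) % N
```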

Figure 4: Windowing, permutation and aliasing. The frequency representation of the signal from pre-permutation windowing to aliasing is presented. Only one significant frequency is shown for conciseness.

Assuming that only ωm\omega_{m} maps to bin pp, and that the side-lobes (leakage) are far below the noise level (owing to the two stages of windowing, which attenuate the leakage to a desired level), the effect of leakage from the other sinusoids can be ignored. Then we can approximate the variance of [𝐟^]p[\hat{\mathbf{f}}]_{p} as

σfp2σbm2α(p,σs,ωm)+σn2β(σs).\begin{split}\sigma_{fp}^{2}&\approx\sigma_{bm}^{2}\alpha(p,\sigma_{s},\omega_{m})+\sigma_{n}^{2}\beta(\sigma_{s}).\end{split} (22)

In case multiple frequencies are mapped to the same bin (a collision), (22) underestimates the variance. The probability of a collision is small, since K<<BK<<B.

A bin u[B]u\in[B] to which no significant frequency is mapped contains only noise, and the corresponding variance of [𝐟^]u[\hat{\mathbf{f}}]_{u} is

σfu2σn2β(σs).\begin{split}\sigma_{fu}^{2}&\approx\sigma_{n}^{2}\beta(\sigma_{s}).\end{split} (23)

Hence, the hypothesis test for the first-stage-detection on [𝐟^]j,j[B][\hat{\mathbf{f}}]_{j},j\in[B] is formulated as

  • 0\mathcal{H}0: no significant frequency is mapped to it.

  • 1\mathcal{H}1: at least one significant frequency is mapped to it, with worst case SNR equal to SNRminSNR_{min}.

The log likelihood ratio test (LLRT) is

logPfj|H1(x)Pfj|H0(x)01γ,\log{P_{f_{j}|H1}(x)\over P_{f_{j}|H0}(x)}\mathop{\gtrless}_{\mathcal{H}0}^{\mathcal{H}1}\gamma^{\prime}, (24)

where Pfj|H1(x)P_{f_{j}|H1}(x) and Pfj|H0(x)P_{f_{j}|H0}(x) are the probability density functions (PDFs) of [𝐟^]j[\hat{\mathbf{f}}]_{j} under 1\mathcal{H}1 and 0\mathcal{H}0, respectively, and γ\gamma^{\prime} is a threshold.

Substituting the PDF of [𝐟^]j[\hat{\mathbf{f}}]_{j} under both hypotheses into (24), and after some manipulations, we get

|[𝐟^]j|201γlogσfu2σfp21σfu21σfp2.|[\hat{\mathbf{f}}]_{j}|^{2}\mathop{\gtrless}_{\mathcal{H}0}^{\mathcal{H}1}{{\gamma^{\prime}}-{{\log{\sigma^{2}_{f_{u}}\over\sigma^{2}_{f_{p}}}}}\over{{1\over\sigma^{2}_{f_{u}}}-{1\over\sigma^{2}_{f_{p}}}}}. (25)

Hence, |[𝐟^]j|2|[\hat{\mathbf{f}}]_{j}|^{2} is a sufficient statistic for the first stage detection. Since [𝐟^]j[\hat{\mathbf{f}}]_{j} has a circularly symmetric Gaussian distribution, |[𝐟^]j|2|[\hat{\mathbf{f}}]_{j}|^{2} is exponentially distributed with cumulative distribution function (CDF)

F|[𝐟^]j|2(x,ζ2)={1exζ2,x00,x<0,F_{|[\hat{\mathbf{f}}]_{j}|^{2}}(x,\zeta^{2})=\begin{cases}1-e^{-{x\over\zeta^{2}}},\;x\geq 0\\ 0,\;x<0\;,\end{cases} (26)

where ζ2\zeta^{2} equals σfu2\sigma^{2}_{f_{u}} under 0{\mathcal{H}0} and σfp2\sigma^{2}_{f_{p}} under 1{\mathcal{H}1}.

Based on (26), in the first stage of detection, the false alarm rate on each of the BB bins and the probability of detection of the weakest sinusoid can be derived to be equal to

P~fa(σs)=eγσn2β(σs),P~d(ωm,σs)=P~faβ(σs)α(p,ωm,σs)SNRmin+β(σs),\begin{split}&\tilde{P}_{fa}(\sigma_{s})=e^{-{\gamma\over\sigma^{2}_{n}\beta(\sigma_{s})}},\\ &\tilde{P}_{d}(\omega_{m},\sigma_{s})=\tilde{P}_{fa}^{\beta(\sigma_{s})\over\alpha(p,\omega_{m},\sigma_{s})SNR_{min}+\beta(\sigma_{s})},\end{split} (27)

where γ\gamma is the detection threshold. Both P~fa\tilde{P}_{fa} and P~d\tilde{P}_{d} depend on the permutation σs\sigma_{s}. Taking expectation with respect to σs\sigma_{s}, we have

P¯fa=eγσn2β¯,P¯d(ωm)=P¯faβ¯α¯(p,ωm)SNRmin+β¯,\begin{split}&\bar{P}_{fa}=e^{-{\gamma\over\sigma^{2}_{n}\bar{\beta}}},\\ &\bar{P}_{d}(\omega_{m})=\bar{P}_{fa}^{\bar{\beta}\over\bar{\alpha}(p,\omega_{m})SNR_{min}+\bar{\beta}},\end{split} (28)

where P¯d(ωm)=𝔼{P~d(σs,ωm)}\bar{P}_{d}(\omega_{m})=\mathbb{E}\{\tilde{P}_{d}(\sigma_{s},\omega_{m})\}, P¯fa=𝔼{P~fa(σs)}\bar{P}_{fa}=\mathbb{E}\{\tilde{P}_{fa}(\sigma_{s})\}, α¯(p,ωm)=𝔼{α~(p,ωm,σs)}\bar{\alpha}(p,\omega_{m})=\mathbb{E}\{\tilde{\alpha}(p,\omega_{m},\sigma_{s})\}, and β¯=𝔼{β~(σs)}\bar{\beta}=\mathbb{E}\{\tilde{\beta}(\sigma_{s})\}.
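The first-stage design of (27)-(28) can be sketched as follows. The values of α̅, β̅ and σn² below are placeholders (in RSFT they come from the two windows and the average over permutations); the Monte Carlo check verifies the exponential-tail relation behind the threshold.

```python
import numpy as np

# Sketch of the first-stage design, Eqs. (27)-(28): invert the exponential
# tail to get the threshold gamma from a target false-alarm rate, then
# evaluate the detection probability of the weakest sinusoid.
# alpha_bar, beta_bar, sigma_n2 are placeholder values.
def first_stage_design(P_fa, snr_min, alpha_bar=0.8, beta_bar=1.0, sigma_n2=1.0):
    gamma = -sigma_n2 * beta_bar * np.log(P_fa)  # P_fa = exp(-gamma/(sigma_n2*beta))
    P_d = P_fa ** (beta_bar / (alpha_bar * snr_min + beta_bar))
    return gamma, P_d

gamma, P_d = first_stage_design(P_fa=1e-2, snr_min=10.0)  # snr_min in linear scale

# Monte Carlo check: under H0, |f|^2 is exponential with mean sigma_n2 * beta
rng = np.random.default_rng(1)
rate = np.mean(rng.exponential(scale=1.0, size=2_000_000) > gamma)
print(round(rate, 3))   # ~0.01
```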

IV-C Second Stage Detection

Let 𝐜σs{0,1}B\mathbf{c}_{\sigma_{s}}\in\{0,1\}^{B} denote the output of the first-stage-detection for the sths_{th} segment, with permutation factor σs\sigma_{s}. Each entry in 𝐜σs\mathbf{c}_{\sigma_{s}} is a Bernoulli random variable, i.e., for j[B]j\in[B],

[𝐜σs]j{Bernoulli(P~fa(σs)),under0,Bernoulli(P~d(ωm,σs)),under1.{[\mathbf{c}_{\sigma_{s}}]_{j}}\sim\begin{cases}\mathrm{Bernoulli}\left(\tilde{P}_{fa}(\sigma_{s})\right),under\;\mathcal{H}0,\\ \mathrm{Bernoulli}\left(\tilde{P}_{d}(\omega_{m},\sigma_{s})\right),under\;\mathcal{H}1.\end{cases} (29)

Note that under 1\mathcal{H}1, we assume that [𝐜σs]j[\mathbf{c}_{\sigma_{s}}]_{j} corresponds to the weakest sinusoid. For the other K1K-1 co-existing sinusoids, since their SNR may be greater than SNRminSNR_{min}, their probability of detection may also be greater than P~d(ωm,σs)\tilde{P}_{d}(\omega_{m},\sigma_{s}) (see Claim 3).

The reverse-mapping stage hashes the BB-dimensional 𝐜σs\mathbf{c}_{\sigma_{s}} back to the NN-dimensional 𝐚σs\mathbf{a}_{\sigma_{s}}. According to Definition 4, it holds that

[𝐚σs]i=[𝐜σs]j,i[N],j[B],i(j,σs1).[\mathbf{a}_{\sigma_{s}}]_{i}=[\mathbf{c}_{\sigma_{s}}]_{j},\;i\in[N],j\in[B],i\in\mathcal{R}(j,\sigma_{s}^{-1}). (30)

After accumulation of TT iterations, each entry in the accumulated output is a summation of TT Bernoulli variables with different success rates. Define 𝐚¯\bar{\mathbf{a}} as the accumulated output; then for its ith,i[N]i_{th},i\in[N] entry, we have

[𝐚¯]i=s[T][𝐚σs]i=i(j,σs1),s[T][𝐜σs]j.[\bar{\mathbf{a}}]_{i}=\sum_{s\in[T]}[\mathbf{a}_{\sigma_{s}}]_{i}=\sum_{i\in\mathcal{R}(j,\sigma_{s}^{-1}),s\in[T]}[\mathbf{c}_{\sigma_{s}}]_{j}. (31)

Note that in (31), each term inside the sum corresponds to a different segment, i.e., [𝐜σs]j[\mathbf{c}_{\sigma_{s}}]_{j} is from the sths_{th} segment. Since σs\sigma_{s} is drawn randomly for each segment, jj may take different values, and relates to ii via a reverse-mapping. Fig. 5 gives a graphical illustration of the mapping and reverse-mapping.

Now, the hypothesis test for the second-stage-detection on [𝐚¯]i,i[N][\bar{\mathbf{a}}]_{i},i\in[N] is formulated as

  • ¯0\overline{\mathcal{H}}0: no significant frequency exists.

  • ¯1\overline{\mathcal{H}}1: there exists a significant frequency, whose SNR is at least SNRminSNR_{min}.

In the following, we investigate the statistics of [𝐚¯]i[\bar{\mathbf{a}}]_{i} under both hypotheses in an asymptotic sense. Before that, however, we take a closer look at the mapping and the reverse mapping by providing the following properties.

Refer to caption
Figure 5: Mapping and Reverse Mapping. A significant frequency may map to different locations in different iterations of the first stage detection, due to the different permutations. The detected frequencies, including the false alarms in the first stage, are reverse mapped to the original dimension. The true location of the significant frequency is recovered, and the ambiguous frequencies are also generated. The occurrence on the true location grows steadily during accumulation, provided the SNR is high enough, and thus the true location can be recovered in the second stage of detection with proper thresholding. However, false alarms may also occur in the second stage detection, due to both ambiguous frequencies and false alarms from the first stage of detection.
Property 3.

(Reversibility): Let j[B],i,σ,σ1[N]j\in[B],i,\sigma,\sigma^{-1}\in[N]. σ\sigma and σ1\sigma^{-1} satisfy Eq. (1). If j=(i,σ)j=\mathcal{M}(i,\sigma), then it holds that

i(j,σ1).i\in\mathcal{R}(j,\sigma^{-1}). (32)
Property 4.

(Distinctiveness): Let i,j[B],iji,j\in[B],i\neq j. If σ1[N]\sigma^{-1}\in[N] and satisfies Eq. (1), then it holds that

(i,σ1)(j,σ1)=.\mathcal{R}(i,\sigma^{-1})\cap\mathcal{R}(j,\sigma^{-1})=\emptyset. (33)

The proofs of these properties are provided in Appendix B. The two properties simply reveal the following facts: a mapped location can be recovered by reverse mapping (with ambiguities). Also, when applying reverse mapping to two distinct locations with the same permutation parameter, the resulting locations are also distinct.
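The two properties can also be checked numerically. The sketch below assumes the mapping ℳ(i,σ) = ⌊B[σi]_N / N⌋, consistent with (21), and builds the reverse mapping ℛ via σ⁻¹; the values of N, B and σ are hypothetical.

```python
# Numerical check of Properties 3 and 4, assuming a mapping consistent
# with Eq. (21); N, B and sigma are small hypothetical values.
N, B = 64, 8
sigma = 5
sigma_inv = pow(sigma, -1, N)       # 13, since 5 * 13 = 65 = 1 (mod 64)

def M(i, s):
    return (B * ((s * i) % N)) // N  # mapping from [N] to [B]

def R(j):
    # reverse mapping: all i in [N] that land in bin j, built via sigma^{-1}
    return {(sigma_inv * m) % N for m in range(N) if (B * m) // N == j}

for i in range(N):
    assert i in R(M(i, sigma))       # Property 3 (reversibility)
for j1 in range(B):
    for j2 in range(j1 + 1, B):
        assert not R(j1) & R(j2)     # Property 4 (distinctiveness)
print("ok")
```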

Under ¯1\overline{\mathcal{H}}1, assuming that [𝐚¯]i[\bar{\mathbf{a}}]_{i} corresponds to the mthm_{th} sinusoid, i.e., the weakest sinusoid, each term inside the sum of (31) has distribution [𝐜σs]jBernoulli(P~d(ωm,σs)),s[T][\mathbf{c}_{\sigma_{s}}]_{j}\sim\mathrm{Bernoulli}\left(\tilde{P}_{d}(\omega_{m},\sigma_{s})\right),s\in[T]. We then have the following claim.

Claim 2.

Under ¯1\overline{\mathcal{H}}1, and as TT\to\infty,

[𝐚¯]iN(μa1(ωm),σa12(ωm)),[\bar{\mathbf{a}}]_{i}\sim N(\mu_{a1}(\omega_{m}),\sigma^{2}_{a1}(\omega_{m})), (34)

where μa1(ωm)=TP¯d(ωm)\mu_{a1}(\omega_{m})=T\bar{P}_{d}(\omega_{m}), σa12(ωm)TP¯d(ωm)(1P¯d(ωm))\sigma^{2}_{a1}(\omega_{m})\leq T\bar{P}_{d}(\omega_{m})(1-\bar{P}_{d}(\omega_{m})).

Proof.

Since 0<P~d(ωm,σs)(1P~d(ωm,σs))<10<\tilde{P}_{d}(\omega_{m},\sigma_{s})(1-\tilde{P}_{d}(\omega_{m},\sigma_{s}))<1, for any δ>0\delta>0, the Lyapunov condition [18] holds, i.e.,

limT1σa1(ωm)2+δs[T]𝔼{|[𝐜σs]j(ωm)P~d(ωm,σs)|2+δ}limT1σa1(ωm)2+δs[T]𝔼{|[𝐜σs]j(ωm)P~d(ωm,σs)|2}=limT1σa1(ωm)δ=0.\begin{split}&\lim_{T\to\infty}\frac{1}{\sigma_{a1}(\omega_{m})^{2+\delta}}\sum_{s\in[T]}\mathbb{E}\{|[\mathbf{c}_{\sigma_{s}}]_{j}(\omega_{m})-\tilde{P}_{d}(\omega_{m},\sigma_{s})|^{2+\delta}\}\\ &\leq\lim_{T\to\infty}\frac{1}{\sigma_{a1}(\omega_{m})^{2+\delta}}\sum_{s\in[T]}\mathbb{E}\{|[\mathbf{c}_{\sigma_{s}}]_{j}(\omega_{m})-\tilde{P}_{d}(\omega_{m},\sigma_{s})|^{2}\}\\ &=\lim_{T\to\infty}\frac{1}{\sigma_{a1}(\omega_{m})^{\delta}}=0.\end{split} (35)

Therefore, [𝐚¯]i[\bar{\mathbf{a}}]_{i} conforms to the Normal distribution as indicated in (34). It also holds that

σa12(ωm)=s[T]P~d(ωm,σs)(1P~d(ωm,σs))=TP¯d(ωm)(1P¯d(ωm))s[T](P~d(ωm,σs)P¯d(ωm))2,\begin{split}\sigma^{2}_{a1}(\omega_{m})&=\sum_{s\in[T]}\tilde{P}_{d}(\omega_{m},\sigma_{s})(1-\tilde{P}_{d}(\omega_{m},\sigma_{s}))\\ &=T\bar{P}_{d}(\omega_{m})(1-\bar{P}_{d}(\omega_{m}))\\ &-\sum_{s\in[T]}(\tilde{P}_{d}(\omega_{m},\sigma_{s})-\bar{P}_{d}(\omega_{m}))^{2},\end{split} (36)

from which we get that σa12(ωm)TP¯d(ωm)(1P¯d(ωm))\sigma^{2}_{a1}(\omega_{m})\leq T\bar{P}_{d}(\omega_{m})(1-\bar{P}_{d}(\omega_{m})), with the equality holding when P~d(ωm,σs)=P¯d(ωm)\tilde{P}_{d}(\omega_{m},\sigma_{s})=\bar{P}_{d}(\omega_{m}).
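A quick simulation illustrates Claim 2 and the variance identity (36); the per-segment probabilities below are hypothetical stand-ins for P̃d(ωm, σs).

```python
import numpy as np

# The accumulated count is a sum of T independent Bernoulli variables with
# segment-dependent success rates; its variance never exceeds
# T * p_bar * (1 - p_bar). The probabilities p are hypothetical.
rng = np.random.default_rng(0)
T = 200
p = rng.uniform(0.3, 0.6, size=T)        # per-segment detection probabilities

mean_th = p.sum()                        # T * p_bar
var_th = (p * (1 - p)).sum()             # exact variance, Eq. (36)
p_bar = p.mean()
assert var_th <= T * p_bar * (1 - p_bar) # upper bound of Claim 2

# Empirical check that the sum concentrates around (mean_th, var_th)
samples = (rng.random((20_000, T)) < p).sum(axis=1)
print(round(samples.mean(), 1), round(samples.var(), 1))
```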

The distribution of [𝐚¯]i[\bar{\mathbf{a}}]_{i} under ¯0\overline{\mathcal{H}}0 is more complicated, and we have the following claim.

Claim 3.

Under ¯0\overline{\mathcal{H}}0, and as TT\to\infty,

[𝐚¯]iN(μa0(ωm),σa02(ωm)),[\bar{\mathbf{a}}]_{i}\sim N(\mu_{a0}(\omega_{m}),\sigma^{2}_{a0}(\omega_{m})), (37)

where

μa0(ωm)=FηpP¯d(ωm)+(TF)P¯fa,σa02(ωm)FηpP¯d(ωm)(1ηpP¯d(ωm))+(TF)P¯fa(1P¯fa),\begin{split}\mu_{a0}(\omega_{m})&=F\eta_{p}\bar{P}_{d}(\omega_{m})+(T-F)\bar{P}_{fa},\\ \sigma^{2}_{a0}(\omega_{m})&\leq F\eta_{p}\bar{P}_{d}(\omega_{m})(1-\eta_{p}\bar{P}_{d}(\omega_{m}))\\ &+(T-F)\bar{P}_{fa}(1-\bar{P}_{fa}),\end{split} (38)

and F=TKηmBF=\frac{TK\eta_{m}}{B}, where ηm\eta_{m} is the 6.0-dB bandwidth of the pre-permutation window 𝐰\mathbf{w}. ηp[1,1Pd¯(ωm)]\eta_{p}\in[1,\frac{1}{\bar{P_{d}}(\omega_{m})}] is a calibration factor of the probability of detection for the other K1K-1 co-existing sinusoids.

Proof.

Under ¯0\overline{\mathcal{H}}0, each term in (31) may be distributed differently. To illustrate this, we consider a location i[N]i\in[N] in the frequency domain of the input signal, which does not contain a significant frequency, as shown in Fig. 5. Let j=(i,σs)j=\mathcal{M}(i,\sigma_{s}) be the mapping. There are two cases for jj: 1) jj does not contain a significant frequency; or 2) jj contains at least one significant frequency, with its SNR at least SNRminSNR_{min}. In the former case, [𝐜σs]jBernoulli(P~fa(σs)){[\mathbf{c}_{\sigma_{s}}]_{j}}\sim\mathrm{Bernoulli}\left(\tilde{P}_{fa}(\sigma_{s})\right), i.e., [𝐜σs]j{[\mathbf{c}_{\sigma_{s}}]_{j}} is under 0\mathcal{H}0. In the latter case, [𝐜σs]jBernoulli(P~d(ωm,σs)){[\mathbf{c}_{\sigma_{s}}]_{j}}\sim\mathrm{Bernoulli}\left(\tilde{P}_{d}(\omega_{m},\sigma_{s})\right), i.e., [𝐜σs]j{[\mathbf{c}_{\sigma_{s}}]_{j}} is under 1\mathcal{H}1. Since the permutation is uniformly random, on average, the number of [𝐜]j[\mathbf{c}]_{j} under 1\mathcal{H}1 is F=TKηmBF=\frac{TK\eta_{m}}{B}, and the number of [𝐜]j[\mathbf{c}]_{j} under 0\mathcal{H}0 is TFT-F. The parameter ηm\eta_{m} reflects the fact that sparsity is affected by the pre-permutation windowing. Since we assume that 𝐯(ωm)\mathbf{v}(\omega_{m}) has the minimum SNR, i.e., SNRminSNR_{min}, other sinusoids with higher SNR will have larger P¯d\bar{P}_{d}. Hence we multiply P¯d(ωm)\bar{P}_{d}(\omega_{m}) by ηp\eta_{p} to calibrate the success rate of [𝐜σs]j[\mathbf{c}_{\sigma_{s}}]_{j} under 1\mathcal{H}1. If all the sinusoids' SNRs were equal to SNRminSNR_{min}, then ηp=1\eta_{p}=1; on the other hand, if the co-existing sinusoids' SNRs were sufficiently high so that their P¯d\bar{P}_{d} approaches 11, then ηp=1Pd¯(ωm)\eta_{p}=\frac{1}{\bar{P_{d}}(\omega_{m})}. Finally, the result follows immediately by applying the Lyapunov CLT. ∎

Remark 1.

From Claims 2 and 3, we notice that for the second stage detection, the LLRT is obtained based on two Normal distributions. The test statistic under ¯1\overline{\mathcal{H}}1 is “stable”, since it only depends on P¯d(ωm)\bar{P}_{d}(\omega_{m}). However, under ¯0\overline{\mathcal{H}}0, the distribution depends on the number of co-existing sinusoids, as well as on each sinusoid's SNR. A larger KK and a higher SNR will “push” the distribution under ¯0\overline{\mathcal{H}}0 closer to the distribution under ¯1\overline{\mathcal{H}}1, hence degrading the detection performance. In order to compensate for this, a larger SNRminSNR_{min} is required.

A natural extension of Remark 1 is Remark 2, which gives the condition under which the RSFT will reach its limit.

Remark 2.

Assuming that PdPfaP_{d}\geq P_{fa}, the RSFT will fail if KηmBK\eta_{m}\geq B no matter how large the SNRminSNR_{min} is.

Proof.

Assuming ηp=1\eta_{p}=1 and substituting Kηm=BK\eta_{m}=B into FF yields F=TF=T, which means that the distributions under both hypotheses are the same; hence the two hypotheses cannot be discriminated. If ηp>1\eta_{p}>1, the assumption of PdPfaP_{d}\geq P_{fa} will be violated as KηmK\eta_{m} approaches BB. ∎

Based on the above discussion, the optimal threshold design in Problem 1 can be solved by the following optimization problem, i.e.,

Minimize{μ,P¯fa,P¯d}SNRminSubjecttoP¯d(ωm)=P¯faβ¯α¯(p,ωm)SNRmin+β¯Pfa=μga0(u)𝑑uPd=μga1(u)𝑑u0P¯fa1, 0P¯d1μ[T],\begin{split}&Minimize_{\{\mu,\bar{P}_{fa},\bar{P}_{d}\}}\quad SNR_{min}\\ &Subject\;to\\ &\quad\quad\bar{P}_{d}(\omega_{m})=\bar{P}_{fa}^{\bar{\beta}\over\bar{\alpha}(p,\omega_{m})SNR_{min}+\bar{\beta}}\\ &\quad\quad P_{fa}=\int_{\mu}^{\infty}g_{a_{0}}(u)du\\ &\quad\quad P_{d}=\int_{\mu}^{\infty}g_{a_{1}}(u)du\\ &\quad\quad 0\leq\bar{P}_{fa}\leq 1,\;0\leq\bar{P}_{d}\leq 1\\ &\quad\quad\mu\in[T],\end{split} (39)

where ga0(u),ga1(u)g_{a_{0}}(u),g_{a_{1}}(u) are the asymptotic PDFs222We take the upper bounds of the variances in both distributions. It is shown in Section V-E that the actual variances are close to their upper bounds. of [𝐚¯]i[\bar{\mathbf{a}}]_{i} (which corresponds to the weakest sinusoid) under ¯0\overline{\mathcal{H}}0 and ¯1\overline{\mathcal{H}}1, respectively. Since both of them are Normal distributions, with a fixed threshold μ\mu, we can solve for P¯d(ωm),P¯fa\bar{P}_{d}(\omega_{m}),\bar{P}_{fa}, and then compute SNRminSNR_{min}. By enumerating μ[T]\mu\in[T], the minimum worst case SNR, i.e., SNRminSNR_{min}^{\ast}, can be found, and the corresponding μ\mu^{\ast} is the optimal threshold for the second stage of detection. The optimal threshold for the first stage of detection, i.e., γ\gamma^{\ast}, can then be calculated via (28).
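The enumeration over μ can be sketched as follows. This is only a sketch of the search in (39): α̅ and β̅ are placeholder constants, ηp = 1 is assumed, and the Gaussian tails use the variance upper bounds.

```python
import numpy as np
from math import erfc, sqrt, log

def Q(x):                       # Gaussian tail probability
    return 0.5 * erfc(x / sqrt(2))

def solve_p(target, mu, mean, std):
    """Bisection for p such that Q((mu - mean(p)) / std(p)) == target."""
    lo, hi = 1e-9, 1 - 1e-9
    for _ in range(80):
        mid = 0.5 * (lo + hi)
        if Q((mu - mean(mid)) / std(mid)) < target:
            lo = mid            # tail too small -> larger p shifts mass right
        else:
            hi = mid
    return 0.5 * (lo + hi)

def snr_min_for(mu, T, F, P_d=0.9, P_fa=1e-6, alpha_bar=0.8, beta_bar=1.0):
    # P_d constraint under H1_bar, then P_fa constraint under H0_bar
    pd = solve_p(P_d, mu, lambda p: T * p, lambda p: sqrt(T * p * (1 - p)))
    pfa = solve_p(P_fa, mu,
                  lambda q: F * pd + (T - F) * q,
                  lambda q: sqrt(F * pd * (1 - pd) + (T - F) * q * (1 - q)))
    if pfa >= pd:
        return np.inf           # constraints cannot be met at this threshold
    return beta_bar * (log(pfa) / log(pd) - 1) / alpha_bar  # from Eq. (28)

T, K, eta_m, B = 50, 4, 1.4, 64
F = T * K * eta_m / B
snrs = [snr_min_for(mu, T, F) for mu in range(1, T)]
mu_opt = 1 + int(np.argmin(snrs))
print(mu_opt, round(min(snrs), 2))
```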

Remark 3.

In Claim 3, we set a parameter ηp\eta_{p} to calibrate the distribution of [𝐚¯]i[\bar{\mathbf{a}}]_{i} under ¯0\overline{\mathcal{H}}0. By setting ηp\eta_{p} to 11 or 1Pd¯(ωm)\frac{1}{\bar{P_{d}}(\omega_{m})}, we obtain, respectively, the lower and upper bounds of SNRminSNR_{min}^{\ast} over the range of SNRs of the other co-existing sinusoids. If KK is the maximum budget of signal sparsity, solving (39) provides the optimal thresholds for the worst case. If the actual signal sparsity were less than KK, PfaP_{fa} would be lower than the expected value, while PdP_{d} would be unchanged, according to Remark 1.

By averaging over the permutation, asymptotically, SNRminSNR_{min}^{\ast} does not depend on the permutation. However, it still depends on ωm\omega_{m}, and the following claim characterizes this relationship.

Claim 4.

The dependence of SNRminSNR_{min}^{\ast} on ωm\omega_{m} is due to the off-grid loss[16] from off-grid frequencies. SNRminSNR_{min}^{\ast} attains its minimum when ωm\omega_{m} is on-grid, i.e. ωm=kΔωN,k[N]\omega_{m}=k\Delta\omega_{N},k\in[N]. When ωm\omega_{m} is in the middle of two grid points, i.e., ωm=(k+12)ΔωN\omega_{m}=(k+\frac{1}{2})\Delta\omega_{N}, SNRminSNR_{min}^{\ast} attains its maximum.

Proof.

Assume 𝐫=𝐯(ωm)\mathbf{r}=\mathbf{v}(\omega_{m}). Since the pre-permutation window 𝐰\mathbf{w} is symmetric, if we applied the DFT to the pre-permuted data, the amplitude of the spectrum would attain its maximum and minimum respectively when ωm\omega_{m} is on-grid or in the middle between two grid points. The subsequent permutation operation does not change the amplitude of the spectrum. Also, since the flat-window is used, the downsampling in the frequency domain, which is a result of aliasing, does not affect the amplitude either. An on-grid frequency generates the highest amplitude, while a frequency in the middle between grid points has the lowest amplitude. As a result, the two detection stages require the lowest SNR for on-grid frequencies, and the highest SNR for frequencies lying in the middle between grid points. ∎
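The off-grid (scalloping) loss behind this claim can be observed directly; in the sketch below a Hann window is used as a stand-in for the Dolph-Chebyshev pre-permutation window.

```python
import numpy as np

# Peak spectral amplitude of a windowed sinusoid, on-grid versus half-way
# between grid points; the gap is the worst-case off-grid loss.
N = 1024
n = np.arange(N)
w = np.hanning(N)                # stand-in for the pre-permutation window

def peak_amp(frac):              # sinusoid at bin 100 + frac
    x = w * np.exp(2j * np.pi * (100 + frac) * n / N)
    return np.abs(np.fft.fft(x)).max()

loss_db = 20 * np.log10(peak_amp(0.0) / peak_amp(0.5))
print(round(loss_db, 2))         # Hann scalloping loss, about 1.4 dB
```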

IV-D Tradeoff between Worst Case SNR and Complexity

IV-D1 Comparison to Bartlett Method

We compare the complexity of the RSFT with that of the FFT-based Bartlett method (see Appendix A) by counting the number of operations in both algorithms, as shown in Table I and Table II. The RSFT has complexity equal to

𝒪(T(N+B+BlogB+KηmNBηp)+N),\mathcal{O}\left(T(N+B+B\log B+\frac{K\eta_{m}N}{B\eta_{p}})+N\right), (40)

while the Bartlett method has complexity equal to 𝒪(TN(1+logN)+N)\mathcal{O}\left(TN(1+\log N)+N\right). Fig. 6 compares the RSFT's complexity to that of the Bartlett method for various BB and KK. One can see that the RSFT-enabled savings are remarkable when BB is chosen properly. Specifically, from Fig. 6, the lowest complexity for KK equal to 5,50,1005,50,100 is achieved when BB equals 32,64,12832,64,128, respectively. Note that the core operation in RSFT is still FFT-based, but on a reduced dimension space. By leveraging existing high performance FFT libraries such as FFTW [19], the implementation of the RSFT algorithm could be further improved.

Remark 4.

The complexity of RSFT depends linearly on N,T,K,1/ηpN,T,K,1/\eta_{p} and ηm\eta_{m}; hence it is beneficial to choose a pre-permutation window with a small ηm\eta_{m}, provided the attenuation of the side-lobes is sufficient. We can also choose the optimal BB from (40) to minimize the computation. However, there are two additional constraints on BB: one is that BB should be a power of 2; the other is that Kηm<BK\eta_{m}<B, as stated in Remark 2.

TABLE I: Computational Complexity of Bartlett Method
Procedure Number of Operations
Windowing TNTN
FFT TN2logNT\frac{N}{2}\log N
Square TNTN
Detection NN
Complexity 𝒪(TN(1+logN)+N)\mathcal{O}\left(TN(1+\log N)+N\right)
TABLE II: Computational Complexity of RSFT
Procedure Number of Operations
Pre-Permutation Win TNTN
Permutation TNTN
Flat-Win TNTN
Aliasing TB(N/B1)TB(N/B-1)
FFT TB2logBT\frac{B}{2}\log B
Square TBTB
First-Stage-Detection TBTB
Reverse-Mapping TKηmNBηp\frac{TK\eta_{m}N}{B\eta_{p}}
Second-stage-Detection NN
Complexity 𝒪(T(N+B+BlogB+KηmNBηp)+N)\mathcal{O}\left(T(N+B+B\log B+\frac{K\eta_{m}N}{B\eta_{p}})+N\right)
Refer to caption
Figure 6: Comparison of Complexity. N=1024,T=50,ηm=1.4,ηp=1,B={8,16,32,64,128,256,512,1024}N=1024,T=50,\eta_{m}=1.4,\eta_{p}=1,B=\{8,16,32,64,128,256,512,1024\}.
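The operation counts of Tables I and II can be turned into a small search for the complexity-minimizing B. The sketch below uses ηp = 1 and ηm = 1.4 as in Fig. 6; the exact optimum depends on these assumed constants, but the optimal B grows with the sparsity K.

```python
import math

# Operation counts from Table I (Bartlett) and Table II (RSFT).
def bartlett_ops(N, T):
    return T * N * (1 + 0.5 * math.log2(N)) + N

def rsft_ops(N, T, B, K, eta_m=1.4, eta_p=1.0):
    return (T * (3 * N                           # two windowings + permutation
                 + B * (N // B - 1)              # aliasing
                 + 0.5 * B * math.log2(B)        # FFT
                 + 2 * B                         # square + first-stage detection
                 + K * eta_m * N / (B * eta_p))  # reverse mapping
            + N)                                 # second-stage detection

N, T = 1024, 50
Bs = [2 ** b for b in range(3, 11)]              # B = 8 ... 1024
best = {K: min(Bs, key=lambda B: rsft_ops(N, T, B, K)) for K in (5, 50, 100)}
print(best, bartlett_ops(N, T))
```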

IV-D2 Worst Case SNR and Complexity Trade Off

The reduced complexity of RSFT is achieved at the cost of an increased SNRminSNR_{min}, which decreases the ability to detect weak signals. The tradeoff between SNRminSNR_{min} and complexity for various choices of parameters is shown in Fig. 7. The performance of the Bartlett method is also shown as a reference. From Fig. 7 we can see that BB plays a central role in trading off SNRminSNR_{min} against complexity. A proper choice of BB can enhance the computational efficiency significantly with a reasonable increase of SNRminSNR_{min}. Also, since the sparsity KK affects both SNRminSNR_{min} and complexity, a less sparse signal will worsen both. When B=NB=N, the complexity of RSFT is larger than that of the Bartlett method, due to the additional processing in the algorithm; moreover, it cannot achieve the same SNRminSNR_{min} as the Bartlett method does.

Refer to caption
Figure 7: Worst Case SNR and Complexity Trade off. N=1024,T=50,ηm=1.4,ηp=1,B={8,16,32,64,128,256,512,1024},Pd=0.9,Pfa=106,K={5,10,100},ωm=ΔωN/2N=1024,T=50,\eta_{m}=1.4,\eta_{p}=1,B=\{8,16,32,64,128,256,512,1024\},P_{d}=0.9,P_{fa}=10^{-6},K=\{5,10,100\},\omega_{m}=\Delta\omega_{N}/2. The red dot shows the performance of the Bartlett method, which serves as a reference.

V Numerical Results

In this section, we verify our theoretical findings via simulations. We use the following common parameters for the various settings, unless stated otherwise: N=1024,T=50,B=64,ηm=1.8,Pd=0.9,Pfa=106,ωm=64.5ΔωN0.4N=1024,T=50,B=64,\eta_{m}=1.8,P_{d}=0.9,P_{fa}=10^{-6},\omega_{m}=64.5\Delta\omega_{N}\approx 0.4. We use a Dolph-Chebyshev window with 40dB40dB attenuation as the pre-permutation window. The flat-window is also based on this window, and we set its passband width to 1/B1/B.

V-A Lower and Upper Bounds of SNRminSNR_{min}^{\ast} for Fixed Sparsity

According to Remark 3, we can calculate the lower and upper bounds of SNRminSNR_{min}^{\ast} and their corresponding thresholds for fixed sparsity. Figs. 8 and 9 show the thresholding of RSFT for both bounds. We mark the amplitude of ωm\omega_{m} with a magenta dot in each figure.

Refer to caption
(a)
Refer to caption
(b)
Refer to caption
(c)
Refer to caption
(d)
Figure 8: Lower Bound of Optimal Thresholds for Fixed Sparsity. K=4,SNRmin10.1dB,γ3.0×103,μ/T0.3K=4,SNR_{min}^{\ast}\approx-10.1dB,\gamma^{\ast}\approx 3.0\times 10^{-3},\mu^{\ast}/T\approx 0.3 (a) Spectrum from Bartlett method. (b) First-stage-detection. (c) Second-stage-detection. Data and threshold are normalized by TT. (d) Final output of RSFT.
Refer to caption
(a)
Refer to caption
(b)
Refer to caption
(c)
Refer to caption
(d)
Figure 9: Upper Bound of Optimal Thresholds for Fixed Sparsity. K=4,SNRmin9.1dB,γ2.5×103,μ/T0.4K=4,SNR_{min}^{\ast}\approx-9.1dB,\gamma^{\ast}\approx 2.5\times 10^{-3},\mu^{\ast}/T\approx 0.4 The co-existing sinusoids' SNR is 10dB10dB higher than SNRminSNR_{min}. (a) Spectrum from Bartlett method. (b) First-stage-detection. (c) Second-stage-detection. Data and threshold are normalized by TT. (d) Final output of RSFT.

V-B Unknown Signal Sparsity

Since we do not assume knowledge of the exact sparsity of the signal, we use a guess for KK. Fig. 10 shows a case where the optimal design targets K=10K=10; when the true sparsity is K=3K=3, the system yields the same PdP_{d} but a better PfaP_{fa}, since the noise level is much lower than expected.

Refer to caption
(a)
Refer to caption
(b)
Figure 10: Unknown Sparsity. The threshold is optimal for K=10K=10 (a) Second-stage-detection when K=10K=10. (b) Second-stage-detection when K=3K=3 with the same threshold.

V-C Dependency on Frequency

Fig. 11 shows the dependency of SNRminSNR_{min}^{\ast} on frequency, which verifies Claim 4.

Refer to caption
Figure 11: Dependence of SNRminSNR_{min}^{\ast} on Frequency. The fluctuation of SNRminSNR_{min}^{\ast} across the spectrum is mainly due to the off-grid loss of off-grid frequencies.

V-D The Receiver Operating Characteristic (ROC) Curve

In this section, we use ROC curves to characterize the performance of RSFT with various parameters. Fig. 12 shows the impact of different values of BB on detection. A smaller BB lowers the detection performance, and in order to compensate for it, a higher SNRminSNR_{min} is required. The ROC curve for the Bartlett method is calculated by (50) and is also shown in Fig. 12.

Fig. 13 illustrates the relationship between detection performance and signal sparsity. With the other parameters fixed, the sparser the signal, the better the detection performance.

Fig. 14 shows the impact of the SNR of the co-existing sinusoids: the higher their SNR, the worse the detection performance.

Refer to caption
Figure 12: ROC Corresponding to Different Values of BB and SNRminSNR_{min}. ηp=1,K=4\eta_{p}=1,K=4.
Refer to caption
Figure 13: ROC Corresponding to Different Values of KK. ηp=1,B=256,SNRmin=20dB\eta_{p}=1,B=256,SNR_{min}=-20dB.
Refer to caption
Figure 14: ROC Corresponding to Different Values of SNR for Co-Existing Sinusoids. ηp={1,1/P¯d(ωm)},B=256,K=50,SNRmin=20dB\eta_{p}=\{1,1/\bar{P}_{d}(\omega_{m})\},B=256,K=50,SNR_{min}=-20dB.

V-E The Variance and Its Upper Bound for [𝐚¯]i[\bar{\mathbf{a}}]_{i}

In solving (39), we take the upper bound of the variance for [𝐚¯]i[\bar{\mathbf{a}}]_{i} under both hypotheses. In this section, we show by simulation that the actual variance of [𝐚¯]i[\bar{\mathbf{a}}]_{i} is close to its upper bound. In what follows, we study σa12(ωm)\sigma_{a1}^{2}(\omega_{m}); the case for σa02(ωm)\sigma_{a0}^{2}(\omega_{m}) can be similarly studied.

As shown in (36), the discrepancy of σa12(ωm)\sigma_{a1}^{2}(\omega_{m}) from its upper bound is due to the P~d\tilde{P}_{d}’s dependence on σs\sigma_{s}, which is caused by β\beta and α\alpha’s dependence on σs\sigma_{s} (see (27)). For NN a power of 2, a valid σs\sigma_{s} can be any odd integer in [N][N][2]. Fig. 15 shows β(σs)\beta(\sigma_{s}) and α(p,ωm,σs)\alpha(p,\omega_{m},\sigma_{s}) as functions of σs\sigma_{s}. The symmetry of the plot is due to the symmetry of pre-permutation window and the flat-window, as well as the modulo property of the permutation. Another observation is that most of β(σs)\beta(\sigma_{s}) and α(p,ωm,σs)\alpha(p,\omega_{m},\sigma_{s}) have similar values. As a result, P~d(ωm,σs)\tilde{P}_{d}(\omega_{m},\sigma_{s}) has similar value for different permutations, and this is the reason for σa12(ωm)\sigma_{a1}^{2}(\omega_{m}) being close to its upper bound. The Monte Carlo simulation in Fig. 16 shows that the approximation error, i.e., TP¯d(ωm)(1P¯d(ωm))σa12(ωm)σa12(ωm)\frac{T\bar{P}_{d}(\omega_{m})(1-\bar{P}_{d}({\omega_{m}}))-\sigma_{a1}^{2}(\omega_{m})}{\sigma_{a1}^{2}(\omega_{m})} decreases as TT grows, and even for a small TT, such as T=10T=10, the error is as small as about 1.6%1.6\%.

Refer to caption
Figure 15: β\beta and α(ωm)\alpha(\omega_{m})’s dependence on permutation. B=64B=64.
Refer to caption
Figure 16: Approximation Error for the Variance. B=64,T={10,50,100,200}B=64,T=\{10,50,100,200\}. Each value is averaged over 100 Monte Carlo runs.

VI RSFT for Ubiquitous Radar Signal Processing

The RSFT algorithm can greatly reduce the complexity of certain high dimensional problems. This can be significant in many applications, since lower complexity means faster reaction time and more economical hardware. However, in order to apply RSFT, the signal to be processed should meet the following requirements:

  • It should be sparse in some domain.

  • It should be sampled uniformly, whether in the temporal or the spatial domain.

  • The SNR should be moderately high.

While many applications satisfy these requirements, in what follows, we discuss an example in Short Range Ubiquitous Radar[20] (SRUR) signal processing.

VI-A Short Range Ubiquitous Radar

A ubiquitous radar or SIMO radar can see targets everywhere at any time without steering its beams as a traditional phased array radar does. In SRUR, a broad transmitting beam pattern is achieved by an omnidirectional transmitter, and multiple narrow beams are formed simultaneously after reception of the reflected signal. The beam patterns of a ubiquitous radar with a Uniform Linear Array (ULA) configuration are shown in Fig. 17.

Refer to caption
Figure 17: Ubiquitous Radar System Structure and Transmitting / Receiving Beam Patterns. A broad beam pattern is transmitted with an omnidirectional antenna, while multiple narrow beams are formed simultaneously by the receiving array. Each receiving channel is mixed with a coupled signal from the transmitter to de-chirp the LFMCW signal before A/D conversion.

An SRUR with range coverage of several kilometers could be important in both military and civilian vehicular applications. For instance, in an active protection system [21], sensors on the protected vehicle have to detect and locate the warheads from a closely fired rocket-propelled grenade (RPG) within milliseconds. Among other sensors, SRUR's simultaneous wide angle coverage, high precision of measurement and all-weather operation make it the ideal sensor for such situations.

Refer to caption
Figure 18: LFMCW Waveform. The signal frequency changes linearly in time with a repetition interval TpT_{p}. A burst contains MM repetitions. The range over which the frequency changes is the bandwidth of the system. The received signal is a delayed version of the transmitted signal.

In order to achieve high range resolution and cover near ranges, SRUR utilizes an LFMCW waveform, as shown in Fig. 18. Mathematically, the transmitted waveform can be expressed as

s(t,v)=Acos(2πfc(tvTp)+πρ(tvTp)2),s(t,v)=A\cos\left(2\pi f_{c}(t-vT_{p})+\pi\rho(t-vT_{p})^{2}\right), (41)

where TpT_{p} is the repetition interval (RI), v[M]v\in[M] denotes the vthv_{th} RI, AA is the amplitude of the signal, fcf_{c} is the carrier frequency and ρ\rho is the chirp rate. Furthermore, without loss of generality, we assume that the initial phase of the signal is zero.

Upon reception, a de-chirp process is implemented by mixing the received signal with the transmitted signal, followed by a lowpass filter. The received signal is a delayed version of the transmitted one, so by mixing the two signals, the range information of the targets is linearly encoded in the frequency difference. Hence, for the ith,i[N]i_{th},i\in[N] receiving channel, the de-chirped signal is expressed as

ri=k[K]a[k](s)cos(2π(fr[k]+fd[k])(tvTp)+iπsinθ[k])+n(t),\begin{split}r_{i}&=\sum_{k\in[K]}a^{[k]}(s)\cos\left(2\pi(f_{r}^{[k]}+f_{d}^{[k]})(t-vT_{p})+i\pi\sin\theta^{[k]}\right)\\ &+n(t),\end{split} (42)

which is a superposition of KK sinusoids and additive noise n(t)n(t). For the kthk_{th} sinusoid, a[k](s),s[T]a^{[k]}(s),s\in[T] represents its amplitude, which can be modeled as a Gaussian random process. More specifically, the amplitude is assumed static within a burst, and independent between each of the TT bursts. This assumption is consistent with the Swerling-I target model[22], which represents a slow fluctuation of the target RCS. fr[k],fd[k]f_{r}^{[k]},f_{d}^{[k]} are the frequency components with respect to the target's range and velocity, respectively, i.e.,

fr[k]=2ρrt[k]c,fd[k]=2vt[k]λ,f_{r}^{[k]}=\frac{2\rho r_{t}^{[k]}}{c},\quad f_{d}^{[k]}=\frac{2v_{t}^{[k]}}{\lambda}, (43)

where rt[k],vt[k],cr_{t}^{[k]},v_{t}^{[k]},c are the kthk_{th} target's range, velocity and the speed of wave propagation, respectively.
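For instance, with the parameters later listed in Table III, Eq. (43) gives the maximum beat and Doppler frequencies, which fall within the sampling bandwidth:

```python
# Eq. (43) with the parameters of Table III, evaluated at the stated
# maximum range (1.5 km) and maximum velocity (300 m/s).
c, lam = 3e8, 0.03          # propagation speed, wavelength
rho = 3e12                  # chirp rate (Hz/s)
fs = 41e6                   # IQ sampling frequency

f_r = 2 * rho * 1.5e3 / c   # beat frequency from range: 30 MHz
f_d = 2 * 300.0 / lam       # Doppler frequency from velocity: 20 kHz
print(f_r, f_d)
assert f_r + f_d < fs       # within the sampling bandwidth, so unambiguous
```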

The DOA of the kthk_{th} target, i.e., θ[k]\theta^{[k]}, is defined as the angle between the line of sight (from the array center to the target) and the array normal. Assuming that the element-wise spacing is λ/2\lambda/2, under the narrowband signal assumption, θ[k]\theta^{[k]} will cause a phase increase at the neighboring array element equal to πsinθ[k]\pi\sin\theta^{[k]}. We omit the constant phase term in each sinusoid of (42), since it is irrelevant to the performance of the algorithm.

After A/D conversion of each receiving channel, we can use the processing scheme shown in Fig. 19 to detect the targets as well as estimate their range, velocity and DOA. More specifically, grid-based versions of fr[k],fd[k],πsinθ[k]f_{r}^{[k]},f_{d}^{[k]},\pi\sin\theta^{[k]} can be calculated by applying a 3-D FFT on the windowed data cube; then, after accumulation of TT iterations, the NP detection procedure described above can be performed.

VI-B RSFT-based SRUR Signal Processing

Although the analog de-chirp processing significantly reduces the number of samples in SRUR, real-time processing with the 3-D FFT remains challenging. The RSFT algorithm is well suited to reducing the computational complexity of SRUR, since: 1) the number of targets is usually much smaller than the number of spatial resolution cells, which implies that the signal is sparse after proper translation; 2) with a ULA and digitization at each receiving element, the signal is uniformly sampled in both the spatial and temporal domains; and 3) the short-range coverage implies that a moderately high SNR is easy to achieve.

The RSFT-based SRUR processing architecture is shown in Fig. 20. Compared to the conventional processing, the 3-D FFT is replaced with a 3-D RSFT, in which the aliasing procedure reduces the data cube dimensions from $R\times N\times M$ to $B\times C\times D$. The 3-D FFT operating on the smaller data cube saves computation time significantly.
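The aliasing step can be illustrated in 1-D (our own sketch; in the 3-D RSFT the same folding is applied along each axis). Folding $N$ samples into $B$ bins makes the cheap $B$-point FFT equal to the $N$-point FFT subsampled every $N/B$ bins, so sparse peaks survive in the smaller cube:

```python
import numpy as np

# 1-D illustration of aliasing: summing every B-th time sample gives a
# length-B signal whose FFT equals the length-N FFT subsampled at
# multiples of N/B. Sizes and the tone location are illustrative.
N, B = 1024, 64                                   # B divides N
x = np.exp(2j * np.pi * 160 * np.arange(N) / N)   # one on-grid tone at bin 160

folded = x.reshape(N // B, B).sum(axis=0)     # fold N samples into B bins
small_fft = np.fft.fft(folded)                # B-point FFT on the small signal
full_fft = np.fft.fft(x)                      # N-point FFT, for reference only

# the tone at bin 160 shows up at bin 160*B/N = 10 of the aliased spectrum
peak = int(np.argmax(np.abs(small_fft)))
```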

Refer to caption
Figure 19: Conventional FFT-based Processing Scheme for SRUR.
Refer to caption
Figure 20: RSFT Based Processing Scheme for SRUR.

VI-C Simulations

In this section, we verify the feasibility of RSFT-based SRUR processing and compare it to SFT-based processing via simulations. The main parameters of the system are listed in Table III. The design guarantees non-ambiguous measurements of the targets' range and velocity, assuming the maximum range and velocity are less than $1.5\,km$ and $300\,m/s$, respectively.

TABLE III: SRUR Parameters
Parameter | Symbol | Value
Number of range bins | $R$ | $2048$
Number of receiving elements | $N$ | $64$
Number of repetition intervals (RI) | $M$ | $32$
Wavelength | $\lambda$ | $0.03\,m$
Wave propagation speed | $c$ | $3\times 10^{8}\,m/s$
Bandwidth | $B_{w}$ | $150\,MHz$
Repetition interval | $T_{p}$ | $5\times 10^{-5}\,s$
Maximum range | $R_{max}$ | $1.5\times 10^{3}\,m$
Chirp rate | $\rho$ | $3\times 10^{12}\,Hz/s$
Sampling frequency (IQ) | $f_{s}$ | $41\,MHz$
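The non-ambiguity claim can be checked directly from the Table III values; the sketch below is ours and assumes IQ sampling at $f_{s}$ for the fast-time (range) frequency and one sample per repetition interval for Doppler:

```python
# Sanity check of the unambiguous range/velocity claim from Table III.
rho, c, lam = 3e12, 3e8, 0.03     # chirp rate, propagation speed, wavelength
f_s, T_p = 41e6, 5e-5             # IQ sampling frequency, repetition interval
R_max, v_max = 1.5e3, 300.0       # claimed unambiguous range and velocity

f_r_max = 2 * rho * R_max / c     # largest beat frequency, Eq. (43): 30 MHz
f_d_max = 2 * v_max / lam         # largest Doppler frequency: 20 kHz

assert f_r_max < f_s                        # 30 MHz fits in the 41 MHz IQ band
assert round(f_d_max) <= round(1.0 / T_p)   # 20 kHz matches the 20 kHz repetition rate
```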

We generate signals from $4$ targets according to (42). The range, velocity and DOA of the targets can be arbitrarily chosen within the unambiguous space, which implies that the corresponding frequency components do not necessarily lie on the grid. The targets' parameters used in the simulation are listed in Table IV. For Targets 3 and 4, we set the same parameters except that their DOAs are $4^{\circ}$ apart, which is close to the theoretical angular resolution of Bartlett beamforming after windowing. To compare the RSFT and the SFT in different scenarios, we adopt two SNR settings. In the first, all targets have the same SNR of $-10\,dB$. In the second, we assign different SNRs to different targets, which is closer to a realistic scenario.

TABLE IV: Target Parameters
Target | Range ($m$) | Velocity ($m/s$) | DOA ($^{\circ}$) | SNR ($dB$)
1 | $1000$ | $100$ | $30$ | $-10/0$
2 | $500$ | $50$ | $0$ | $-10/-10$
3 | $350$ | $240$ | $-16$ | $-10/-20$
4 | $350$ | $240$ | $-20$ | $-10/-20$

The SFT from [2] is 1-dimensional. In order to reconstruct targets in the 3-D space, we extend the SFT to higher dimensions with the techniques described in Section III-C. Another obstacle in applying the SFT is that it needs to know the number of peaks to count in its detection stages. In our radar example, even when the exact number of targets is known, it is still not clear how to determine the number of peaks to count, due to the large number of peaks caused by leakage. In the experiment, we gradually increase the number of counted peaks until all the targets are recovered. For the equal-SNR setting, all the targets are recovered after about 20 peaks are counted, while for the second SNR setting we need to count nearly 200 peaks to recover the weakest targets (Targets 3 and 4). Figs. 21 and 22 show the target reconstruction results for the two settings, respectively. The former shows that both the SFT- and RSFT-based methods can perfectly recover all the targets when their SNRs are equal. From Targets 3 and 4 we can see that the SFT-based method achieves better resolution than its RSFT counterpart, since the former does not require a pre-permutation window. For the second scenario, the SFT-based method recovers the side-lobes of the stronger targets, while the RSFT-based method recovers only the (extended) main-lobes of all the targets.

The simulation shows that the RSFT-based approach is better than its SFT counterpart in a realistic scenario, in which the signal has a reasonable dynamic range. We also emphasize that in a real radar system, determining the number of peaks to count for the SFT-based method lacks a theoretical foundation, while the thresholding approach in the RSFT is consistent with conventional FFT-based processing, both being based on the NP criterion.

Refer to caption
Figure 21: Target Reconstruction via 3-D SFT and RSFT with The Same SNR for 4 Targets. Both SFT and RSFT based methods can reconstruct all the targets. The former has better resolution without pre-permutation windowing.
Refer to caption
Figure 22: Target Reconstruction via 3-D SFT and RSFT with Different SNR for 4 Targets. The SFT-based processing recovers the side-lobes of the stronger targets, while the RSFT-based method only recovers the main-lobes of targets.

VII Conclusion

In this paper, we have addressed practical problems of applying the SFT in real-life applications and, based on that, have proposed a modified algorithm, the RSFT. The optimal design of the RSFT parameters has been analyzed, and the relationship between system sensitivity and computational complexity has been investigated. Our analysis has also revealed some interesting properties of the RSFT; for example, the detection performance depends not only on the frequency under examination but also on other co-existing significant frequencies, which is very different from traditional FFT-based processing. The analysis has shown that the RSFT provides engineers with an extra degree of freedom in trading off the system's ability to detect weak signals against its complexity. Finally, the application context of the RSFT has been discussed, and a specific example of short range ubiquitous radar signal processing has been presented.

Appendix A Bartlett Method Analysis

Detecting each sinusoid and estimating the corresponding frequencies in (10) can be achieved by Bartlett spectrum analysis followed by NP detection. A window is applied to each data segment to reduce the leakage of off-grid frequencies. To enhance computational efficiency, the FFT is adopted (see Algorithm 2). In the $N$-dimensional case, each step of Algorithm 2 is carried out in $N$ dimensions.

Analyses of the Bartlett method can be found in the literature [23]; however, they usually focus on bias, variance and frequency resolution, while the detection performance in connection with an NP detector has not been thoroughly studied. In [24], the performance of detecting a single sinusoid is discussed and a theoretical analysis is provided for an on-grid frequency. However, the signal typically contains multiple significant frequencies, which are off-grid. In what follows, we analyze the asymptotic performance of Algorithm 2 based on the signal model of (10), which covers the multiple-frequency case and does not restrict the frequencies to be on-grid. Moreover, in contrast to the signal model in [24], which assumes that the sinusoid has a deterministic amplitude, we model the complex amplitude of each sinusoid as a circularly symmetric Gaussian random variable. This modeling reflects the stochastic nature of each sinusoid and is consistent with the Swerling-I target model in the radar case, since the squared magnitude of a circularly symmetric Gaussian random variable has an exponential distribution.

Algorithm 2 Bartlett Method

Input: complex signal $\mathbf{r}$ in any fixed dimension, consisting of $T$ segments $\mathbf{r}_{s}, s\in[T]$
Output: $\mathbf{o}$, frequency-domain representation of the input signal

1:procedure Bartlett($\mathbf{r}$)
2:  $\mathbf{x}\leftarrow 0$
3:  for $s=0\to T-1$ do
4:   Windowing: $\mathbf{u}\leftarrow\mathbf{W}\mathbf{r}_{s}$
5:   N-D FFT: $\hat{\mathbf{u}}\leftarrow\mathop{\mathrm{NDFFT}}(\mathbf{u})$
6:   Accumulation: $\mathbf{x}\leftarrow\mathbf{x}+|\hat{\mathbf{u}}|^{2}$
7:  end for
8:  Detection: $\mathbf{o}\leftarrow\mathop{\mathrm{NPdet}}(\mathbf{x})$
9:  return $\mathbf{o}$
10:end procedure
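A minimal 1-D sketch of Algorithm 2 (ours; the window, sizes, fixed threshold, and the Swerling-I-style random amplitude are illustrative assumptions) is:

```python
import numpy as np

# 1-D sketch of Algorithm 2: window each of T segments, FFT, accumulate
# squared magnitudes, then threshold. The fixed threshold gamma stands
# in for the NP detector, which would set it from the desired P_fa.
def bartlett(segments, window, gamma):
    x = np.zeros(len(window))
    for r_s in segments:
        u_hat = np.fft.fft(window * r_s)   # windowing + FFT
        x += np.abs(u_hat) ** 2            # accumulation
    x /= len(segments)
    return np.flatnonzero(x > gamma)       # detection

rng = np.random.default_rng(1)
N, T = 256, 16
n = np.arange(N)
tone = np.exp(2j * np.pi * 40.3 * n / N)   # one off-grid sinusoid
segs = []
for _ in range(T):
    b = (rng.standard_normal() + 1j * rng.standard_normal()) / np.sqrt(2)
    noise = 0.05 * (rng.standard_normal(N) + 1j * rng.standard_normal(N))
    segs.append(b * tone + noise)          # Swerling-I-style random amplitude b
peaks = bartlett(segs, np.hanning(N), gamma=2000.0)
```

The detected bins cluster around bin 40, i.e., the main lobe of the off-grid frequency.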

The analysis of Algorithm 2 follows a similar path to the analysis of the RSFT, and we thus use the same notation as in Section IV. Our goal is to derive the relationship between $P_{d}$ and $P_{fa}$, which also involves the worst-case signal SNR, $SNR_{min}$.

After windowing and FFT, the signal becomes

\hat{\mathbf{u}}_{s}=\overline{\mathbf{D}}\mathbf{W}\mathbf{r}_{s},\;s\in[T], (44)

where $\overline{\mathbf{D}}\in\mathbb{C}^{N\times N}$ is the DFT matrix.

Substituting (10) into (44), for the $k$-th, $k\in[N]$, entry of $\hat{\mathbf{u}}_{s}$ we get

\begin{split}[\hat{\mathbf{u}}_{s}]_{k}&=b_{m,s}\mathbf{d}^{H}_{k}\mathbf{W}\mathbf{v}(\omega_{m})\\ &+\sum_{j\in[K]\setminus m}b_{j,s}\mathbf{d}^{H}_{k}\mathbf{W}\mathbf{v}(\omega_{j})\\ &+\mathbf{d}^{H}_{k}\mathbf{W}\mathbf{n}_{s},\end{split} (45)

where $\mathbf{d}_{k}$ is the $k$-th column of $\overline{\mathbf{D}}$, i.e., $\mathbf{d}_{k}=[1\;\; e^{jk\Delta\omega_{N}}\;\cdots\; e^{jk(N-1)\Delta\omega_{N}}]^{T}$, and $\Delta\omega_{N}=2\pi/N$.

Since $[\hat{\mathbf{u}}_{s}]_{k}$ is a linear combination of $b_{i,s}$ and $[\mathbf{n}_{s}]_{j}$, $i\in[K], j\in[N]$, it is a circularly symmetric Gaussian scalar with distribution

[\hat{\mathbf{u}}_{s}]_{k}\sim\mathcal{CN}(0,\sigma_{uk}^{2}). (46)

The hypothesis test on each frequency bin is formulated as

  • $\mathcal{H}^{\prime}_{0}$: no significant frequency exists.

  • $\mathcal{H}^{\prime}_{1}$: a significant frequency exists, with SNR at least equal to $SNR_{min}$.

We assume that, due to windowing, the side-lobes of the significant frequencies are far below the noise level; then, under $\mathcal{H}^{\prime}_{1}$ and $\mathcal{H}^{\prime}_{0}$, respectively, we have the following approximations for $\sigma_{uk}^{2}$:

\begin{split}&\sigma_{uk|H1}^{2}\approx\sigma_{bm}^{2}\alpha^{\prime},\\ &\sigma_{uk|H0}^{2}\approx\sigma_{n}^{2}\beta^{\prime},\end{split} (47)

where $\alpha^{\prime}=|\mathbf{d}^{H}_{k}\mathbf{W}\mathbf{v}(\omega_{m})|^{2}$ and $\beta^{\prime}=\|\mathbf{w}\|^{2}$.

The LLRT yields the sufficient statistic

l_{k}={1\over T}\sum_{s\in[T]}|[\hat{\mathbf{u}}_{s}]_{k}|^{2}\mathop{\gtrless}_{\mathcal{H}^{\prime}_{0}}^{\mathcal{H}^{\prime}_{1}}\gamma. (48)

We study its asymptotic performance. Assuming $T$ is moderately large, by the central limit theorem the test statistic is approximately normally distributed under both hypotheses, i.e.,

\begin{split}&l_{k|H0}\sim\mathcal{N}\left(\sigma_{uk|H0}^{2},{\sigma_{uk|H0}^{4}\over T}\right),\\ &l_{k|H1}\sim\mathcal{N}\left(\sigma_{uk|H1}^{2},{\sigma_{uk|H1}^{4}\over T}\right).\end{split} (49)

Finally, we can relate $P_{d}$ and $P_{fa}$ as

P_{d}=1-\Phi\left({\beta^{\prime}\Phi^{-1}(1-P_{fa})+\sqrt{T}(\beta^{\prime}-SNR_{min}\alpha^{\prime})\over SNR_{min}\alpha^{\prime}}\right), (50)

where $\Phi(\cdot)$ is the CDF of the standard normal distribution. An example ROC curve computed with (50) is shown in Fig. 12.
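Equation (50) is straightforward to evaluate numerically; a small sketch (the parameter values below are illustrative, not taken from the paper) is:

```python
from statistics import NormalDist

# P_d as a function of P_fa from Eq. (50). alpha_p and beta_p play the
# roles of alpha' and beta'; all numeric values here are illustrative.
_nd = NormalDist()

def p_d(p_fa, snr_min, alpha_p, beta_p, T):
    num = beta_p * _nd.inv_cdf(1.0 - p_fa) + (T ** 0.5) * (beta_p - snr_min * alpha_p)
    return 1.0 - _nd.cdf(num / (snr_min * alpha_p))

# more averaging (larger T) raises P_d at a fixed P_fa, as expected
lo = p_d(1e-3, snr_min=1.0, alpha_p=2.0, beta_p=1.0, T=4)
hi = p_d(1e-3, snr_min=1.0, alpha_p=2.0, beta_p=1.0, T=64)
```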

Appendix B Proof of Properties of Mapping and Reverse Mapping

B-A Proof of Property 3

Proof.

According to Definition 3, the mapping can be split into two stages: 1) apply modular multiplication to $i$, i.e., $k=[\sigma i]_{N}\in[N]$; and 2) convert $k$ into $j\in[B]$ with $j=\lfloor kB/N\rfloor$.

Similarly, according to Definition 4, the reverse-mapping can also be split into two stages: 1) dilate $j\in[B]$ into $\mathbb{S}\equiv\{v\in[N]\mid j\frac{N}{B}\leq v<(j+1)\frac{N}{B}\}\subset[N]$; and 2) apply inverse modular multiplication to $\mathbb{S}$, i.e., $\mathcal{R}(j,\sigma^{-1})\equiv\{[u\sigma^{-1}]_{N}\mid u\in\mathbb{S}\}$.

The first stage of the reverse-mapping is the inverse operation of the second stage of the mapping, and as a result, $k\in\mathbb{S}$. Hence $i=[k\sigma^{-1}]_{N}\in\mathcal{R}(j,\sigma^{-1})$, as desired. ∎
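The two stages above are easy to verify exhaustively for small sizes; the following sketch (our own, with illustrative $N$, $B$, $\sigma$; $\sigma$ must be invertible modulo $N$, and $B$ is taken to divide $N$) checks Property 3 by brute force:

```python
# Mapping (Definition 3) and reverse-mapping (Definition 4), with a
# brute-force check of Property 3. N, B, sigma are illustrative values.
N, B = 64, 8
sigma = 13                        # gcd(13, 64) = 1, so sigma is invertible
sigma_inv = pow(sigma, -1, N)     # modular inverse (Python 3.8+)

def fwd(i):
    k = (sigma * i) % N           # stage 1: modular multiplication
    return k * B // N             # stage 2: k -> j = floor(k*B/N)

def rev(j):
    S = range(j * N // B, (j + 1) * N // B)      # stage 1: dilate j into S
    return {(u * sigma_inv) % N for u in S}      # stage 2: inverse mod-mult

# Property 3: every i lands inside the reverse map of its image
assert all(i in rev(fwd(i)) for i in range(N))
```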

B-B Proof of Property 4

Proof.

We use the two stages of the reverse-mapping from the proof of Property 3. The first stage of the reverse-mapping for $i$ and $j$ yields $\mathbb{S}_{1}\equiv\{v\in[N]\mid i\frac{N}{B}\leq v<(i+1)\frac{N}{B}\}$ and $\mathbb{S}_{2}\equiv\{v\in[N]\mid j\frac{N}{B}\leq v<(j+1)\frac{N}{B}\}$, respectively. It is not difficult to verify that $\mathbb{S}_{1}\cap\mathbb{S}_{2}=\emptyset$, provided that $i\neq j$.

In what follows, we prove that the second stage of the reverse-mapping also gives distinct results. Assume that there exist $m\in\mathbb{S}_{1}$, $n\in\mathbb{S}_{2}$ such that $[m\sigma^{-1}]_{N}=[n\sigma^{-1}]_{N}$. Modular multiplication of both sides by $\sigma$ yields $m=n$, which contradicts $\mathbb{S}_{1}\cap\mathbb{S}_{2}=\emptyset$. Hence both stages of the reverse-mapping guarantee that the results are distinct for $i\neq j$. ∎
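Property 4 can likewise be checked by brute force under the same illustrative setup ($N=64$, $B=8$, $\sigma=13$; our values, not the paper's):

```python
# Reverse maps of distinct bins are disjoint (Property 4); since
# multiplication by sigma^{-1} is a bijection on [N], they also
# partition [N]. N, B, sigma are illustrative values.
N, B, sigma = 64, 8, 13
sigma_inv = pow(sigma, -1, N)     # modular inverse of sigma

def rev(j):
    return {(u * sigma_inv) % N for u in range(j * N // B, (j + 1) * N // B)}

sets = [rev(j) for j in range(B)]
disjoint = all(sets[a].isdisjoint(sets[b]) for a in range(B) for b in range(a + 1, B))
covers = set().union(*sets) == set(range(N))
```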

Acknowledgment

The authors would like to thank Dr. Predrag Spasojevic and Dr. Anand Sarwate from Rutgers University for initial support of this work. The work of SW was jointly supported by the China Scholarship Council and the Shanghai Institute of Spaceflight Electronics Technology. The work of VMP was partially supported by ARO grant W911NF-16-1-0126.

References

  • [1] H. Hassanieh, P. Indyk, D. Katabi, and E. Price, “Nearly optimal sparse fourier transform,” in Proceedings of the forty-fourth annual ACM symposium on Theory of computing, pp. 563–578, ACM, 2012.
  • [2] H. Hassanieh, P. Indyk, D. Katabi, and E. Price, “Simple and practical algorithm for sparse fourier transform,” in Proceedings of the Twenty-third Annual ACM-SIAM Symposium on Discrete Algorithms, SODA ’12, pp. 1183–1194, SIAM, 2012.
  • [3] F. Ong, S. Pawar, and K. Ramchandran, “Fast and efficient sparse 2d discrete fourier transform using sparse-graph codes,” arXiv preprint arXiv:1509.05849, 2015.
  • [4] A. C. Gilbert, P. Indyk, M. Iwen, and L. Schmidt, “Recent developments in the sparse fourier transform: A compressed fourier transform for big data,” Signal Processing Magazine, IEEE, vol. 31, no. 5, pp. 91–100, 2014.
  • [5] B. Ghazi, H. Hassanieh, P. Indyk, D. Katabi, E. Price, and L. Shi, “Sample-optimal average-case sparse fourier transform in two dimensions,” in Communication, Control, and Computing (Allerton), 2013 51st Annual Allerton Conference on, pp. 1258–1265, IEEE, 2013.
  • [6] P. Indyk and M. Kapralov, “Sample-optimal fourier sampling in any constant dimension,” in Foundations of Computer Science (FOCS), 2014 IEEE 55th Annual Symposium on, pp. 514–523, IEEE, 2014.
  • [7] H. Hassanieh, F. Adib, D. Katabi, and P. Indyk, “Faster gps via the sparse fourier transform,” in Proceedings of the 18th annual international conference on Mobile computing and networking, pp. 353–364, ACM, 2012.
  • [8] H. Hassanieh, L. Shi, O. Abari, E. Hamed, and D. Katabi, “Ghz-wide sensing and decoding using the sparse fourier transform,” in INFOCOM, 2014 Proceedings IEEE, pp. 2256–2264, IEEE, 2014.
  • [9] L. Shi, H. Hassanieh, A. Davis, D. Katabi, and F. Durand, “Light field reconstruction using sparsity in the continuous fourier domain,” ACM Transactions on Graphics (TOG), vol. 34, no. 1, p. 12, 2014.
  • [10] A. Rauh and G. R. Arce, “Sparse 2d fast fourier transform,” Proceedings of the 10th International Conference on Sampling Theory and Applications, 2013.
  • [11] P. Boufounos, V. Cevher, A. C. Gilbert, Y. Li, and M. J. Strauss, “What’s the frequency, kenneth?: Sublinear fourier sampling off the grid,” in Approximation, Randomization, and Combinatorial Optimization. Algorithms and Techniques, pp. 61–72, Springer, 2012.
  • [12] E. J. Candè and M. B. Wakin, “An introduction to compressive sampling,” Signal Processing Magazine, IEEE, vol. 25, no. 2, pp. 21–30, 2008.
  • [13] C.-Y. Chen and P. Vaidyanathan, “Compressed sensing in mimo radar,” in Signals, Systems and Computers, 2008 42nd Asilomar Conference on, pp. 41–44, IEEE, 2008.
  • [14] Y. Yu, A. P. Petropulu, and H. V. Poor, “Mimo radar using compressive sampling,” Selected Topics in Signal Processing, IEEE Journal of, vol. 4, no. 1, pp. 146–163, 2010.
  • [15] S. Wang, V. M. Patel, and A. Petropulu, “RSFT: a realistic high dimensional sparse fourier transform and its application in radar signal processing,” in Milcom 2016 Track 1 - Waveforms and Signal Processing, (Baltimore, USA), Available at: http://www.rci.rutgers.edu/ vmp93/Conference_pub/MILCOM2016_SFT.pdf, Nov. 2016.
  • [16] F. J. Harris, “On the use of windows for harmonic analysis with the discrete fourier transform,” Proceedings of the IEEE, vol. 66, no. 1, pp. 51–83, 1978.
  • [17] H. L. Van Trees, Optimum array processing: part IV of detection, estimation, and modulation, ch. 5, pp. 349–350. Wiley, New York, 2002.
  • [18] R. B. Ash and C. Doleans-Dade, Probability and measure theory, p. 309. Academic Press, 2000.
  • [19] M. Frigo and S. G. Johnson, “Fftw: An adaptive software architecture for the fft,” in Acoustics, Speech and Signal Processing, 1998. Proceedings of the 1998 IEEE International Conference on, vol. 3, pp. 1381–1384, IEEE, 1998.
  • [20] M. Skolnik, “Systems aspects of digital beam forming ubiquitous radar,” tech. rep., DTIC Document, 2002.
  • [21] D. A. Schade, T. C. Winant, J. Alforque, J. Faul, K. B. Groves, V. Horvatich, M. A. Middione, C. Tarantino, and J. R. Turner, “Fast acting active protection system,” Apr. 10 2007. US Patent 7,202,809.
  • [22] M. I. Skolnik, “Radar handbook,” 1970.
  • [23] P. Stoica and R. L. Moses, Spectral analysis of signals, pp. 49–50. Pearson/Prentice Hall Upper Saddle River, NJ, 2005.
  • [24] H. So, Y. Chan, Q. Ma, and P. Ching, “Comparison of various periodograms for sinusoid detection and frequency estimation,” Aerospace and Electronic Systems, IEEE Transactions on, vol. 35, no. 3, pp. 945–952, 1999.