Wavelet Canonical Coherence for Nonstationary Signals

Haibo Wu
Statistics Program
King Abdullah University of Science and Technology, Saudi Arabia
haibo.wu@kaust.edu.sa
&Marina I. Knight
Department of Mathematics
University of York, UK
marina.knight@york.ac.uk
&Keiland W. Cooper
Department of Neurobiology and Behavior
Center for the Neurobiology of Learning and Memory
University of California, Irvine
kwcooper@uci.edu
&Norbert J. Fortin
Department of Neurobiology and Behavior
Center for the Neurobiology of Learning and Memory
University of California, Irvine
norbert.fortin@uci.edu
&Hernando Ombao
Statistics Program
King Abdullah University of Science and Technology, Saudi Arabia
hernando.ombao@kaust.edu.sa

Abstract

Understanding the evolving dependence between two clusters of multivariate signals is fundamental in neuroscience and other domains where sub-networks in a system interact dynamically over time. Despite the growing interest in multivariate time series analysis, existing methods for between-clusters dependence typically rely on the assumption of stationarity and lack the temporal resolution to capture transient, frequency-specific interactions. To overcome this limitation, we propose scale-specific wavelet canonical coherence (WaveCanCoh), a novel framework that extends canonical coherence analysis to the nonstationary setting by leveraging the multivariate locally stationary wavelet model. The proposed WaveCanCoh enables the estimation of time-varying canonical coherence between clusters, providing interpretable insight into scale-specific time-varying interactions between clusters. Through extensive simulation studies, we demonstrate that WaveCanCoh accurately recovers true coherence structures under both locally stationary and general nonstationary conditions. Application to local field potential (LFP) activity data recorded from the hippocampus reveals distinct dynamic coherence patterns between correct and incorrect memory-guided decisions, illustrating the capacity of the method to detect behaviorally relevant neural coordination. These results highlight WaveCanCoh as a flexible and principled tool for modeling complex cross-group dependencies in nonstationary multivariate systems. The code for WaveCanCoh is available at: https://github.com/mhaibo/WaveCanCoh.git.

1 Introduction

Assessing the dependence structure between node clusters in a network is one of the most critical aspects of network time series analysis. Many models and frameworks have been developed to capture between-clusters association (e.g., correlation, coherence, and causality). Most existing methods characterize the dependence between two clusters through the dependence between (many) node pairs. However, in many scenarios, the primary interest lies in understanding the dependence structure between two groups of multivariate time series rather than individual processes. Figure 1 illustrates this perspective using brain activity signals. In this experiment, local field potential (LFP) activity was recorded from multiple electrodes implanted in different subregions of the hippocampus of rodents (rats) as they performed a complex sequence memory task. Instead of focusing on coherence between individual channels (electrodes), the main goal is to quantify time-varying functional interactions between groups of electrodes in order to understand how information processing differs in these two subregions. Similar challenges arise in other domains. For instance, in finance, understanding the dependence between entire market sectors (e.g., technology and energy) can be more informative than analyzing associations between individual stocks. These scenarios require a framework capable of capturing dynamic coherence between sets of nonstationary multivariate signals. In this paper, we propose a novel framework called scale-specific wavelet canonical coherence (WaveCanCoh) to characterize time-localized and scale-specific coherence between two clusters of multivariate time series. By leveraging the time-frequency localization properties of wavelets, WaveCanCoh is well-suited for analyzing nonstationary multivariate signals in neuroscience, finance, and other fields where transient, cross-group interactions are of scientific interest.

Refer to caption — Figure 1: Schematic overview of the experimental and analytical motivation. Local field potential (LFP) activity was recorded from multiple electrodes in the hippocampus as rats performed an odor sequence memory task. Rather than focusing on individual electrode pairs, our goal is to characterize the dynamic dependence between two hippocampal subregions, each represented by a group of electrodes, using wavelet-based multivariate coherence.

Canonical variate analysis (CVA) (Hotelling (1936)) provides a method for measuring the correlation between two vector variables, its application to time series data having started in the 1950s and 1960s, primarily in the econometrics and signal processing fields. Box and Tiao (1977) and Geweke (1982) extend CVA to time series for forecasting and causality detection. Brillinger (2001) provides a spectral domain formulation of canonical correlation, useful for frequency-domain time series analysis. This approach enables the analysis of canonical coherence between two sets of time series across different frequency bands. Many modern studies have been developed within this framework (e.g., Talento et al. (2024) and Viduarre et al. (2019)), and related methods have been widely applied across various fields, including neuroscience, finance, speech processing, and machine learning.

However, previous methods rely on the assumption that time series are (weakly) stationary, meaning their statistical properties (e.g., expectation and covariance, spectral signature) remain constant over time. In practice, this assumption often does not hold for time series that arise in practice. Moreover, two sets of time series commonly exhibit time-varying global coherence, which can sometimes be crucial for analysis. Thus, a method capable of handling nonstationary time series is necessary. Wavelet analysis is a widely used tool for studying nonstationary time series, as its localization property allows for the examination of localized correlations between two time series across both time and frequency domains. Wavelets are particularly effective for capturing transient properties of nonstationary signals due to their compact support, which can be compressed or stretched to adapt to the dynamic characteristics of the signal. Wavelet coherence has been well-defined and extensively studied in previous research, with applications spanning various fields (Embleton et al., 2022; Grinsted et al., 2004). To the best of our knowledge, no existing work has extended classical canonical coherence to the wavelet domain to measure the time-evolving canonical coherence between two groups of multivariate time series.

The key novelty of this paper lies in the development of a comprehensive and rigorous framework based on wavelets for measuring canonical coherence between two sets of nonstationary multivariate time series. Specifically, our main contributions include: (1) we define scale-specific wavelet canonical coherence (WaveCanCoh) and introduce its use as a tool to quantify the coherence between two sets of multivariate time series, (2) we provide a complete and theoretically justified algorithm for its estimation, and (3) we apply our method to LFP activity data from multiple electrodes to quantify dynamic interaction patterns among different subregions in the hippocampus. Multivariate locally stationary wavelet processes (MvLSW) underpin our construction, and the reader is directed to Nason et al. (2000) and Park et al. (2014) for details on their construction. Our framework not only captures the time-varying coherence between two sets of signals but also determines the contribution of each individual channel to the global coherence. Compared to previous models, the proposed approach provides a detailed, localized characterization of interactions within the multivariate time series. Our findings on the LFP activity data offer new insights into the functional relationship between hippocampal subregions, demonstrating the potential of our method to advance the study of functional brain connectivity.

The format of the paper is as follows. Section 2 overviews the current methodology for assessing time series canonical coherence. Section 3 provides a brief overview of MvLSW processes, supporting a detailed introduction to our proposed WaveCanCoh framework. Its estimation, and that of related parameters, is tackled in Section 4. Section 5 validates the proposed framework and demonstrates its performance through simulation. In Section 6 we apply the WaveCanCoh method on LFP activity data collected from rats to investigate the dynamic interactions of different subregions in the hippocampus during memory tasks. Section 7 concludes the paper.

2 Related works

First, we provide a brief introduction to classical canonical correlation analysis for time series, with the primary goal to characterize dependence between two clusters of time series, where each cluster features several nodes. In particular, we consider two multivariate time series with dimensions $p$ and $q$ respectively, denoted by $\mathbf{X}_{t}=\left(X_{t}^{(1)},\ldots,X_{t}^{(p)}\right)^{\top}$ and $\mathbf{Y}_{t}=\left(Y_{t}^{(1)},\ldots,Y_{t}^{(q)}\right)^{\top}$ , for $t=\{1,\ldots,T\}$ . Typically, $\{\mathbf{X}_{t}\}$ and $\{\mathbf{Y}_{t}\}$ are assumed to be zero-mean weakly stationary time series. Letting $\mathbf{Z}_{t}$ denote the concatenated ( $p+q$ ) dimension time series, $\mathbf{Z}_{t}=\left(X_{t}^{(1)},\ldots,X_{t}^{(p)},Y_{t}^{(1)},\ldots,Y_{t}^{(q)}\right)^{\top}$ , its covariance matrix at lag $\tau$ is

\displaystyle\boldsymbol{\Sigma}_{\mathbf{ZZ}}(\tau)=\left(\begin{array}[]{cc}\boldsymbol{\Sigma}_{\mathbf{XX}}(\tau)&\boldsymbol{\Sigma}_{\mathbf{XY}}(\tau)\\ \boldsymbol{\Sigma}_{\mathbf{YX}}(\tau)&\boldsymbol{\Sigma}_{\mathbf{YY}}(\tau)\end{array}\right),

where $\boldsymbol{\Sigma}_{\mathbf{XX}}(\cdot)$ , $\boldsymbol{\Sigma}_{\mathbf{YY}}(\cdot)$ are the autocovariance matrices of $\{\mathbf{X}_{t}\}$ and $\{\mathbf{Y}_{t}\}$ respectively, and $\boldsymbol{\Sigma}_{\mathbf{XY}}(\cdot)$ , $\boldsymbol{\Sigma}_{\mathbf{YX}}(\cdot)$ are their cross-covariances. The canonical correlation between $\{\mathbf{X}_{t}\}$ and $\{\mathbf{Y}_{t}\}$ at lag $\tau$ , $\boldsymbol{\rho}(\tau)$ , is defined as

\displaystyle\boldsymbol{\rho}(\tau)=\max_{\mathbf{a},\mathbf{b}}\text{ }\frac{\mathbf{a}^{\top}\boldsymbol{\Sigma}_{\mathbf{XY}}(\tau)\mathbf{b}}{\sqrt{\mathbf{a}^{\top}\boldsymbol{\Sigma}_{\mathbf{XX}}(\tau)\mathbf{a}}{\sqrt{\mathbf{b}^{\top}\boldsymbol{\Sigma}_{\mathbf{YY}}(\tau)\mathbf{b}}}},

(1)

where $\mathbf{a}\in\mathbb{R}^{p}$ and $\mathbf{b}\in\mathbb{R}^{q}$ are called canonical correlation vectors, subject to standardized constraints $\mathbf{a}^{\top}\boldsymbol{\Sigma}_{\mathbf{XX}}\mathbf{a}=1$ , $\mathbf{b}^{\top}\boldsymbol{\Sigma}_{\mathbf{YY}}\mathbf{b}=1$ (Brillinger, 2001). Thus, the canonical correlation can be rewritten as

\displaystyle\boldsymbol{\rho}{(\tau)}=\max_{\mathbf{a},\mathbf{b}}\text{ }\left(\mathbf{a}^{\top}\boldsymbol{\Sigma}_{\mathbf{XY}}(\tau)\mathbf{b}\right).

(2)

Remark 1: In the preceding definitions for the cross-covariance matrices, we have $\boldsymbol{\Sigma}_{\mathbf{XY}}(\tau)=\boldsymbol{\Sigma}_{\mathbf{YX}}(-\tau)=\mathbb{E}\left[\mathbf{X}_{t}\mathbf{Y}_{t-\tau}^{\top}\right]$ , which measure the lagged covariance between $\{\mathbf{X}_{t}\}$ and $\{\mathbf{Y}_{t}\}$ , i.e., past values of $\mathbf{Y}$ may be associated to present values of $\mathbf{X}$ for $\tau>0$ , and vice-versa when $\tau<0$ . The case of $\tau=0$ illustrates contemporaneous relationships.

The solution for $\mathbf{a}$ and $\mathbf{b}$ in equation (1) can be obtained from the eigenvectors of the following matrices, respectively (in what follows, $\tau$ is dropped for brevity),

	$\displaystyle\boldsymbol{\Sigma}_{\mathbf{XX}}^{-1}\boldsymbol{\Sigma}_{\mathbf{XY}}\boldsymbol{\Sigma}_{\mathbf{YY}}^{-1}\boldsymbol{\Sigma}_{\mathbf{YX}}\mbox{, and}$		(3)
	$\displaystyle\boldsymbol{\Sigma}_{\mathbf{YY}}^{-1}\boldsymbol{\Sigma}_{\mathbf{YX}}\boldsymbol{\Sigma}_{\mathbf{XX}}^{-1}\boldsymbol{\Sigma}_{\mathbf{XY}}.$		(4)

Here, $\mathbf{a}$ and $\mathbf{b}$ are the eigenvectors corresponding to the largest eigenvalue, $\lambda$ , of matrices (3) and (4), respectively, and for largest canonical correlation coefficient we have $\boldsymbol{\rho}=\sqrt{\lambda}$ (Mardia et al., 1979). The framework above yields the canonical correlation between $\{\mathbf{X}_{t}\}$ and $\{\mathbf{Y}_{t}\}$ . However, in many practical cases, such as EEG analysis, canonical correlation in the spectral domain is more meaningful than in the time domain, as components at different frequencies (or scales) reveal crucial information about neural dynamics and functional connectivity (Ombao and Pinto, 2024) , Brillinger (2001) extends the time-domain canonical correlation into the spectral domain). Namely, suppose the spectral matrix of $\mathbf{Z}_{t}=\left(\mathbf{X}_{t}^{\top},\mathbf{Y}_{t}^{\top}\right)^{\top}$ is

\displaystyle\mathbf{f_{ZZ}}(\boldsymbol{\omega})=\left[\begin{array}[]{ll}\mathbf{f_{XX}}(\boldsymbol{\omega})&\mathbf{f_{XY}}(\boldsymbol{\omega})\\ \mathbf{f_{YX}}(\boldsymbol{\omega})&\mathbf{f_{YY}}(\boldsymbol{\omega})\end{array}\right],

where $\mathbf{f_{XX}}(\boldsymbol{\omega})$ is the $p\times p$ autospectral matrix of $\{\mathbf{X}_{t}\}$ , $\mathbf{f_{YY}}(\boldsymbol{\omega})$ is the $q\times q$ autospectral matrix of $\{\mathbf{Y}_{t}\}$ , and $\mathbf{f_{XY}}(\boldsymbol{\omega})$ is the $p\times q$ cross-spectral matrix between $\{\mathbf{X}_{t}\}$ and $\{\mathbf{Y}_{t}\}$ . Given vectors $\mathbf{a}\in\mathbb{C}^{p}$ and $\mathbf{b}\in\mathbb{C}^{q}$ , such that $\mathbf{a}^{\top}\mathbf{f_{XX}}(\boldsymbol{\omega})\mathbf{a}=\mathbf{b}^{\top}\mathbf{f_{YY}}(\boldsymbol{\omega})\mathbf{b}=1$ , the canonical coherence at frequency $\boldsymbol{\omega}$ is

\displaystyle\boldsymbol{\rho(\omega)}=\max_{\mathbf{a},\mathbf{b}}\left|\frac{\mathbf{a}^{\top}\mathbf{f_{XY}}(\boldsymbol{\omega})\mathbf{b}}{\sqrt{\mathbf{a}^{\top}\mathbf{f_{XX}}(\boldsymbol{\omega})\mathbf{a}}{\sqrt{\mathbf{b}^{\top}\mathbf{f_{YY}}(\boldsymbol{\omega})\mathbf{b}}}}\right|^{2}.

(5)

By solving the maximization problem in equation (5), the canonical coherence vectors $\mathbf{a}$ and $\mathbf{b}$ are determined, leading to the quantification of the canonical coherence at frequency $\boldsymbol{\omega}$ .

Note the classic canonical coherence in (5) completely ignores temporal dynamics, a consequence of the stationarity assumption where dependence between clusters is imposed to remain constant over time, whilst most real-world data, such as EEG, exhibit nonstationarity (Huang et al., 2004; Knight et al., 2024; West et al., 1999). Hence the lack of time-localization information in the above approach may result in misleading results and a novel method capable of capturing time-varying canonical coherence is needed.

3 Wavelet canonical coherence (WaveCanCoh)

Our WaveCanCoh framework is built upon the multivariate locally stationary wavelet (MvLSW) process (Nason et al. (2000), Park et al. (2014)), which is a model based on wavelet analysis for time series. A brief overview of wavelets and highlight of their differences from Fourier-based methods are provided in Appendix A.

A new representation for discretely sampled nonstationary time series based on discrete non-decimated wavelets is the locally stationary wavelet process introduced by Nason et al. (2000), later extended to a multivariate framework in Park et al. (2014). A $P$ -variate stochastic process with time evolving second-order structure, $\mathbf{X}_{t}=\left(X_{t}^{(1)},X_{t}^{(2)},\ldots,X_{t}^{(P)}\right)^{\top}$ , where $t=1,\ldots,T$ , can be represented with the MvLSW formulation

\displaystyle\mathbf{X}_{t}=\sum_{j=1}^{\infty}\sum_{k\in\mathbb{Z}}\mathbf{V}_{j}(k/T)\psi_{j,k}(t)\mathbf{z}_{j,k},

where $\mathbf{V}_{j}(k/T)$ is a $P\times P$ transfer function matrix assumed to have a lower-triangular form; $\left\{\psi_{j,k}\right\}_{j,k}$ is a set of discrete non-decimated wavelets; $\left\{\mathbf{z}_{j,k}\right\}_{j,k}$ is a set of $P\times 1$ uncorrelated random vectors with (column) mean vector $\mathbf{0}$ and $P\times P$ identity covariance matrix. Since the wavelet basis $\psi_{j,k}(t)$ is localized in both time and frequency, the transfer matrix $\mathbf{V}_{j}(k/T)$ provides a measure of the time-varying contribution to the variance among channels at a specific scale $j$ and rescaled time $u=k/T$ , thus enabling the statistical properties of the process $\{\mathbf{X}_{t}\}$ to change smoothly over time.

The time-varying statistical properties of $\{\mathbf{X}_{t}\}$ can be captured through the localized, scale-specific local wavelet spectral matrix (LWS, Park et al. (2014)), $\mathbf{S}_{j}(u)$ , defined at scale $j$ and rescaled time $u\in(0,1)$ , as

\displaystyle\mathbf{S}_{j}(u)=\mathbf{V}_{j}(u)\mathbf{V}_{j}^{\top}(u).

(6)

Note $\mathbf{S}_{j}(u)$ is a $P\times P$ symmetric, positive semi-definite matrix and its $(p,q)$ entry, $S_{j}^{(p,q)}(u)$ , denotes the cross-spectrum between channels $p$ and $q$ . We now extend the LWS matrix construction from a single set of multivariate time series to a cross-group LSW matrix, between $\mathbf{X}_{t}=\left(X_{t}^{(1)},\ldots,X_{t}^{(P)}\right)^{\top}$ and $\mathbf{Y}_{t}=\left(Y_{t}^{(1)},\ldots,Y_{t}^{(Q)}\right)^{\top}$ . Denoting $\mathbf{Z}_{t}=\left(\mathbf{X}_{t}^{\top},\mathbf{Y}_{t}^{\top}\right)^{\top}$ , the LWS matrix of $\{\mathbf{Z}_{t}\}$ at scale $j$ and rescaled time $u$ , $\mathbf{S}_{j;\mathbf{ZZ}}(u)$ , is

\displaystyle\mathbf{S}_{j;\mathbf{ZZ}}(u)=\mathbf{V}_{j;\mathbf{Z}}(u)\mathbf{V}_{j;\mathbf{Z}}^{\top}(u)=\left[\begin{array}[]{ll}\mathbf{S}_{j;\mathbf{XX}}(u)&\mathbf{S}_{j;\mathbf{XY}}(u)\\ \mathbf{S}_{j;\mathbf{YX}}(u)&\mathbf{S}_{j;\mathbf{YY}}(u)\end{array}\right].

(9)

In equation (9), $\mathbf{V}_{j;\mathbf{Z}}(u)$ denotes the $(P+Q)\times(P+Q)$ transfer function matrix of the MvLSW process $\{\mathbf{Z}_{t}\}$ , and $\mathbf{S}_{j;\mathbf{ZZ}}(u)$ is its corresponding LWS matrix. The main diagonal blocks $\mathbf{S}_{j;\mathbf{XX}}(u)$ $(P\times P)$ and $\mathbf{S}_{j;\mathbf{YY}}(u)$ $(Q\times Q)$ denote the auto-LWS matrices of the $\{\mathbf{X}_{t}\}$ and $\{\mathbf{Y}_{t}\}$ processes, respectively, while $\mathbf{S}_{j;\mathbf{XY}}(u)$ and $\mathbf{S}_{j;\mathbf{YX}}(u)$ denote their cross-LWS matrices.

Remark 2: In equation (9), $\mathbf{S}_{j;\mathbf{XY}}(u)$ is a $P\times Q$ matrix at each time point, and the $(p,q)$ element gives the cross-spectrum between channel $p$ of $\{\mathbf{X}_{t}\}$ and channel $q$ of $\{\mathbf{Y}_{t}\}$ . Moreover, it is easy to show that $\mathbf{S}_{j;\mathbf{XY}}(u)=\mathbf{S}_{j;\mathbf{YX}}^{\top}(u)$ .

The LSW matrix quantifies the localized contributions to the process variance for individual and cross-channels, which motivates us to next define the localized canonical coherence between two sets of locally stationary time series at a specific scale (corresponding to a determined frequency band).

Definition 1 (Localized Scale-specific Wavelet Canonical Coherence)
Let $\mathbf{X}_{t}=\left(X_{t}^{(1)},\ldots,X_{t}^{(P)}\right)^{\top}$ and $\mathbf{Y}_{t}=\left(Y_{t}^{(1)},\ldots,Y_{t}^{(Q)}\right)^{\top}$ , where $t=1,\ldots,T$ , be (jointly) multivariate locally stationary time series. We define the localized scale-specific wavelet canonical coherence (WaveCanCoh) between $\{\mathbf{X}_{t}\}$ and $\{\mathbf{Y}_{t}\}$ , at scale $j$ and rescaled time $u$ , as

\displaystyle\boldsymbol{\rho}_{j;\mathbf{XY}}(u)=\max_{\mathbf{a}_{j}(u),\mathbf{b}_{j}(u)}\left\{\mathbf{a}_{j}^{\top}(u)\mathbf{S}_{j;\mathbf{XY}}(u)\mathbf{b}_{j}(u)\right\}^{2},

(10)

where $\mathbf{a}^{\top}_{j}(u)=\left(\mathbf{a}_{j}^{(p)}(u)\right)_{p=1}^{P}$ is a $1\times P$ vector and $\mathbf{b}^{\top}_{j}(u)=\left(\mathbf{b}_{j}^{(q)}(u)\right)_{q=1}^{Q}$ is a $1\times Q$ vector, representing the localized canonical coherence vectors of $\{\mathbf{X}_{t}\}$ and $\{\mathbf{Y}_{t}\}$ , respectively. The constraints here are $\mathbf{a}_{j}^{\top}(u)\mathbf{S}_{j;\mathbf{XX}}(u)\mathbf{a}_{j}(u)=1$ and $\mathbf{b}_{j}^{\top}(u)\mathbf{S}_{j;\mathbf{YY}}(u)\mathbf{b}_{j}(u)=1$ .

Remark 3: The WaveCanCoh time-dependent trace $\boldsymbol{\rho}_{j;\mathbf{XY}}(\cdotp)$ measures the ‘global’ coherence between $\{\mathbf{X}_{t}\}$ and $\{\mathbf{Y}_{t}\}$ at scale $j$ , and takes values between 0 and 1. A value close to 1 indicates strong linear dependence, while a value close to 0 shows little to no linear dependence. Furthermore, $\mathbf{a}_{j}^{(p)}(\cdotp)$ and $\mathbf{b}_{j}^{(q)}(\cdotp)$ represent the localized contributions from the $(p,q)$ channels to $\boldsymbol{\rho}_{j;\mathbf{XY}}(\cdotp)$ .

The canonical coherence vectors $\mathbf{a}_{j}(\cdotp)$ , $\mathbf{b}_{j}(\cdotp)$ can be obtained by maximizing (10) and the solution can be found by solving the eigenvalue and eigenvector problem associated with the following matrices

	$\displaystyle\mathbf{M}_{j;\mathbf{a}}(u)$	$\displaystyle=\mathbf{S}_{j,\mathbf{XX}}^{-1}(u)\mathbf{S}_{j,\mathbf{XY}}(u)\mathbf{S}_{j,\mathbf{YY}}^{-1}(u)\mathbf{S}_{j,\mathbf{YX}}(u),$		(11)
	$\displaystyle\mathbf{M}_{j;\mathbf{b}}(u)$	$\displaystyle=\mathbf{S}_{j,\mathbf{YY}}^{-1}(u)\mathbf{S}_{j,\mathbf{YX}}(u)\mathbf{S}_{j,\mathbf{XX}}^{-1}(u)\mathbf{S}_{j,\mathbf{XY}}(u).$		(12)

Denote by $\Lambda_{j;\mathbf{a}}^{(k)}(u)$ the $k$ -th largest eigenvalue of matrix $\mathbf{M}_{j;\mathbf{a}}(u)$ in equation (11), and by $\Lambda_{j;\mathbf{b}}^{(l)}(u)$ the $l$ -th largest eigenvalue of matrix $\mathbf{M}_{j;\mathbf{b}}(u)$ in equation (12), for $k,l=1,\ldots,\min(P,Q)$ . An important observation is that $\mathbf{M}_{j;\mathbf{a}}(u)$ and $\mathbf{M}_{j;\mathbf{b}}(u)$ share the same eigenvalues (see Appendix B for details), hence denoting by $\Lambda_{j}^{(1)}(u)=\Lambda_{j;\mathbf{a}}^{(1)}(u)=\Lambda_{j;\mathbf{b}}^{(1)}(u)$ their largest eigenvalue, the canonical coherence between $\{\mathbf{X}_{t}\}$ and $\{\mathbf{Y}_{t}\}$ at rescaled time $u$ , as defined in equation (10), becomes

\displaystyle\boldsymbol{\rho}_{j;\mathbf{XY}}(u)=\Lambda_{j}^{(1)}(u).

(13)

The eigenvectors of $\mathbf{M}_{j;\mathbf{a}}(u)$ and $\mathbf{M}_{j;\mathbf{b}}(u)$ corresponding to $\Lambda_{j}^{(1)}(u)$ provide the solutions to the canonical directions of $\{\mathbf{X}_{t}\}$ and $\{\mathbf{Y}_{t}\}$ , respectively. Proof: see Appendix B.

An important extension of our framework is the incorporation of leading-lag relationships into the scale-specific wavelet canonical coherence (WaveCanCoh). To account for potential causal effects, we define a lagged joint process, $\mathbf{Z}_{t}(h)=\left(\mathbf{X}^{\top}_{t},\mathbf{Y}^{\top}_{t+h}\right)^{\top}$ , where $h$ is the value of lag, and $t=1,\ldots,T-h$ for $h>0$ . We define the LWS matrix of $\{\mathbf{Z}_{t}(h)\}$ at scale $j$ , as

\displaystyle\mathbf{S}_{j;\mathbf{ZZ}}(u,h)=\mathbf{V}_{j;\mathbf{Z}}(u,h)\mathbf{V}_{j;\mathbf{Z}}^{\top}(u,h)=\left[\begin{array}[]{ll}\mathbf{S}_{j;\mathbf{XX}}(u)&\mathbf{S}_{j;\mathbf{XY}}(u,h)\\ \mathbf{S}_{j;\mathbf{YX}}(u+(h/T),-h)&\mathbf{S}_{j;\mathbf{YY}}(u+(h/T))\end{array}\right]

(16)

where $\mathbf{S}_{j;\mathbf{XY}}(u,h)$ denotes the cross-LWS matrix between $\mathbf{X}_{[uT]}$ and $\mathbf{Y}_{[uT]+h}$ , capturing the interaction between current values of $\mathbf{X}$ and future values of $\mathbf{Y}$ . Based on this construction, we can define and estimate the lagged version of WaveCanCoh, enabling us to infer potential causal relationships between two groups of time series.

Definition 2 (Causal Localized Scale-specific Wavelet Canonical Coherence)
The causal localized scale-specific canonical coherence (Causal-WaveCanCoh) between $\{\mathbf{X}_{t}\}$ and $\{\mathbf{Y}_{t}\}$ with lag $h$ (or, $\mathbf{X}_{t}\rightarrow\mathbf{Y}_{t+h}$ ), at scale $j$ and rescaled time $u$ , is defined as

\displaystyle\boldsymbol{\rho}_{j;\mathbf{XY}}(u,h)=\max_{\mathbf{a}_{j}(u),\mathbf{b}_{j}(u)}\left\{\mathbf{a}_{j}^{\top}(u)\mathbf{S}_{j;\mathbf{XY}}(u,h)\mathbf{b}_{j}(u+(h/T))\right\}^{2},

(17)

where the notations and constraints are the same as in the standard WaveCanCoh framework. This extension allows for a scale- and time-specific evaluation of causal overall association between two sets of time series, enhancing interpretability in dynamic, multivariate, and nonstationary settings.

The framework above allows us to capture the time-varying overall association between two sets of multivariate time series, as well as the time-varying contributions from each individual channel within these sets. However, a natural consideration is how to project this time-varying coherence at each scale $j$ into the frequency domain in a manner consistent with the Fourier-based method described in Section 2, as a key concern in many analyses is to interpret the results in the frequency domain. As mentioned earlier, each scale in the wavelet analysis corresponds approximately, but not exactly, to a specific frequency band. This correspondence is governed by the unique filtering mechanism of wavelets, an explanation for this corresponding relationship is provided in Appendix A.

4 Estimation procedure

In Section 3, we developed a rigorous framework that allowed us to introduce the localized, scale-specific wavelet canonical coherence. In this section, we propose a well-behaved estimation procedure for quantifying the canonical coherence and corresponding canonical vectors. We start by estimating the local wavelet spectrum (LWS) matrices in equations (11) and (12) in the spirit of Park et al. (2014), given by

\displaystyle\widehat{\mathbf{S}}_{j,k}=\sum_{l=1}^{J}A_{jl}^{-1}\tilde{\mathbf{I}}_{l,k},\mbox{ where }\tilde{\mathbf{I}}_{l,k}=\frac{1}{2M+1}\sum_{m=-M}^{M}\mathbf{I}_{l,k+m}\mbox{ is the smoothed periodogram },

(18)

$k$ represents the shift of the wavelet function and is equivalent to time $k=[uT]$ in our context, and $M$ is the half-width of the rectangular smoothing kernel, controlling the amount of temporal smoothing. The matrix $\mathbf{I}_{l,k}$ is the raw periodogram at scale $l$ and time $k$ , obtained as

\displaystyle\mathbf{I}_{l,k}=\mathbf{d}_{l,k}\mathbf{d}_{l,k}^{\top},\text{ where }\mathbf{d}_{l,k}=\sum_{t=0}^{T}\mathbf{X}_{t}\psi_{l,k}(t)\mbox{ is the empirical wavelet coefficient vector}.

This multistep procedure yields consistent estimators of the LWS matrices under the asymptotic conditions $T,M\rightarrow\infty$ and $M/T\rightarrow 0$ (Park et al., 2014). We propose the following estimator for the scale-specific wavelet canonical coherence (WaveCanCoh)

\displaystyle\widehat{\boldsymbol{\rho}}_{j;\mathbf{XY}}(u)=\widehat{\Lambda}_{j}^{(1)}(u),

(19)

where $\widehat{\Lambda}_{j}^{(1)}(u)$ is the largest eigenvalue of $\widehat{\mathbf{M}}_{j;\mathbf{a}}(u)$ and $\widehat{\mathbf{M}}_{j;\mathbf{b}}(u)$ , defined as

	$\displaystyle\widehat{\mathbf{M}}_{j;\mathbf{a}}(u)$	$\displaystyle=\widehat{\mathbf{S}}_{j,\mathbf{XX}}^{-1}(u)\widehat{\mathbf{S}}_{j,\mathbf{XY}}(u)\widehat{\mathbf{S}}_{j,\mathbf{YY}}^{-1}(u)\widehat{\mathbf{S}}_{j,\mathbf{YX}}(u),$		(20)
	$\displaystyle\widehat{\mathbf{M}}_{j;\mathbf{b}}(u)$	$\displaystyle=\widehat{\mathbf{S}}_{j,\mathbf{YY}}^{-1}(u)\widehat{\mathbf{S}}_{j,\mathbf{YX}}(u)\widehat{\mathbf{S}}_{j,\mathbf{XX}}^{-1}(u)\widehat{\mathbf{S}}_{j,\mathbf{XY}}(u).$		(21)

The estimated localized, scale-specific canonical direction vectors $\widehat{\mathbf{a}}_{j}(u)$ and $\widehat{\mathbf{b}}_{j}(u)$ are the eigenvectors of $\widehat{\mathbf{M}}_{j;\mathbf{a}}(u)$ and $\widehat{\mathbf{M}}_{j;\mathbf{b}}(u)$ respectively, associated with $\widehat{\Lambda}_{j}^{(1)}(u)$ . These quantities provide estimates of the time-varying global coherence and the channel-specific contributions at scale $j$ . The proposed estimators are consistent with the true quantities they aim to approximate, provided certain asymptotic conditions are met. These include increasing sample size and appropriate smoothing bandwidth, ensuring reliable estimation in the limit. Proof: See Appendix B.

Algorithm 1 summarizes the estimation procedure for WaveCanCoh and its results can be further used to investigate the temporal channel contributions to the global association, at a particular scale.

Algorithm 1 Proposed WaveCanCoh estimation algorithm for nonstationary time series

Suppose the observed data are two sets of multivariate locally stationary time series, denoted as

\mathbf{X}_{t}=\left(X_{t}^{(1)},\ldots,X_{t}^{(P)}\right)^{\top}

and

\mathbf{Y}_{t}=\left(Y_{t}^{(1)},\ldots,Y_{t}^{(Q)}\right)^{\top}

, observed for

t=\{1,\ldots,T\}

1. Fuse: fuse the data to a new

(P+Q)

-variate time series denoted as

\mathbf{Z}_{t}

, with

\mathbf{Z}_{t}=\left(\mathbf{X}_{t}^{\top},\mathbf{Y}_{t}^{\top}\right)^{\top}

(This fused representation allows for the joint analysis of the two multivariate processes within a unified framework. The Causal-WaveCanCoh can also be estimated by appropriately incorporating the leading-lag into the existing WaveCanCoh estimation procedure.)

2. Spectral estimation: estimate the LWS matrix of

\{\mathbf{Z}_{t}\}

using equation (18). Denote the estimator as

\widehat{\mathbf{S}}_{j;\mathbf{ZZ}}(u)

for any rescaled time

u\in(0,1)

. The estimated auto- and cross-LWS between

\{\mathbf{X}_{t}\}

and

\{\mathbf{Y}_{t}\}

, denoted as

\widehat{\mathbf{S}}_{j;\mathbf{XX}}(u)

\widehat{\mathbf{S}}_{j;\mathbf{YY}}(u)

\widehat{\mathbf{S}}_{j;\mathbf{XY}}(u)

\widehat{\mathbf{S}}_{j;\mathbf{YX}}(u)

, can be obtained by partitioning

\widehat{\mathbf{S}}_{j;\mathbf{ZZ}}(u)

into four submatrices as illustrated in equation (9).

3. Eigendecomposition: compute the matrices

\widehat{\mathbf{M}}_{j;\mathbf{a}}(u)

and

\widehat{\mathbf{M}}_{j;\mathbf{b}}(u)

in equations (20)- (21), then perform their eigendecompositions and obtain their (common) largest eigenvalue

\widehat{\Lambda}_{j}^{(1)}(u)

. This will serve as the estimated WaveCanCoh in (19),

\widehat{\boldsymbol{\rho}}_{j;\mathbf{XY}}(u)=\widehat{\Lambda}_{j}^{(1)}(u)

, while its corresponding eigenvectors of

\widehat{\mathbf{M}}_{j;\mathbf{a}}(u)

and

\widehat{\mathbf{M}}_{j;\mathbf{b}}(u)

give the canonical direction vectors,

\widehat{\mathbf{a}}_{j}(u)

and

\widehat{\mathbf{b}}_{j}(u)

5 Simulation study

In this section, we implement the proposed framework using simulated data under two distinct scenarios, one adhering to the MvLSW assumptions underpinning our method, while the other introduces nonstationarity without strictly satisfying the MvLSW assumptions. These setups allow us to validate both the theoretical soundness and empirical performance of the proposed approach, as well as to assess its robustness and practical applicability in real-world scenarios where model assumptions may be violated. To further evaluate performance, we also compare the results with those obtained from the classical Fourier-based canonical coherence approach.

MvLSW-based simulation. We generate the multivariate time series $\{\mathbf{Z}_{t}\}$ from a MvLSW process with $P=6,\,Q=4$ , observed across $T=1024$ time points. The process is constructed using non-decimated Haar wavelets, with non-zero spectral structure specified at scale $j=2$ , as detailed in Appendix C.1. We impose a weaker dependence structure between $\{\mathbf{X}_{t}\}$ and $\{\mathbf{Y}_{t}\}$ in the interval $0<u<0.5$ , and a stronger dependence in the interval $0.5<u<1$ , allowing us to examine the framework’s sensitivity to changes in cross-group coherence. Using the process realization (Figure 9) we estimate WaveCanCoh using Algorithm 1. To assess the estimation accuracy and account for variability, we replicate the simulation and estimation process 1000 times. At each time point, we compute the average of the estimated WaveCanCoh across the replicates and construct a 95% Wald confidence interval using the empirical variance. Figure 2 (left) demonstrates that the proposed WaveCanCoh method accurately tracks the true coherence structure and effectively reflects its time-varying nature, while the estimated canonical direction vectors in Figure 3 map the temporal and individual channel heterogeneity in their roles within the multivariate dependence structure.

Mixture of AR(2)-based processes. To evaluate the robustness and generality of the proposed framework, we investigate WaveCanCoh using synthetic data generated from a mixture of AR(2) processes. Unlike the MvLSW-based simulations, this setting introduces nonstationary dynamics without any wavelet-based structure, providing a more flexible and realistic test scenario. To the best of our knowledge, WaveCanCoh is the first framework designed to estimate time-varying canonical coherence between two multivariate time series groups. However, to benchmark its performance, we compare it against a method based on the time-varying Cramér representation (Dahlhaus, 1997), with canonical coherence estimated via STFT-based localized spectra (Allen and Rabiner, 1977). We simulate 500 replicates of ${\mathbf{X}_{t}}\in\mathbb{R}^{4}$ and ${\mathbf{Y}_{t}}\in\mathbb{R}^{3}$ , with $T=1024$ , each formed by mixing five latent AR(2) sources tuned to neural frequency bands. In the first half, shared gamma ( $30-50Hz$ ) bands induce cross-group coherence, while the second half contains no shared structure (see details in Appendix C.2). Figure 2 (right) illustrates that while both methods detect the existence of coherence in the first half, only WaveCanCoh captures its sharp drop and true behaviour in the second half, thus demonstrating its advantage in identifying transient, localized changes that global Fourier-based methods fail to detect.

6 Local field potential (LFP) data analysis

To demonstrate the practical utility of our proposed WaveCanCoh framework, we analyze LFP activity recorded from the hippocampus of rats engaged in a sequence memory task (Allen et al., 2016; Shahbaba et al., 2022). The data were recorded using a 22-electrode microdrive implanted in the CA1 subregion to capture high-resolution LFP signals across all channels at a sampling rate of $1000Hz$ . In this task, rats were tested on their memory of a sequence of five odors (odors ABCDE). Each odor was presented for $\sim$ 1.2 second ( $s$ ) and a variable delay of $\sim$ 5 $s$ separated each odor (see Figure 1). For each trial (i.e., each odor presentation), the rat had to judge whether the odor was presented "in sequence" (e.g., ABC…) or "out of sequence" (e.g., ABD…) and indicate their decision by holding their nosepoke response until a tone signal (at 1.2 $s$ ) or withdrawing before the signal, respectively. Correct-response trials (i.e., correct "in sequence" or "out of sequence" decisions) were rewarded. LFP activity data were recorded over a 4 $s$ period ( $T=4000$ time points) per trial, with $t=0$ marking the moment the rat initiated a nosepoke to receive the odor stimulus. This paradigm provides a well-controlled setting to investigate dynamic, time-varying functional interactions in the hippocampus during memory-guided decisions. We employ WaveCanCoh framework to quantify frequency-specific functional coherence between two groups of hippocampal electrodes (T1, T2, T4, T5 and T13–T17), and to examine how coherence patterns differ between correct- and incorrect-response trials ("in sequence" trials only). Specifically, we analyze LFP data from the rat Mitt, which included 40 correct-response trials and 32 incorrect-response trials. Figure 4 presents the estimated wavelet canonical coherence at scale $j=5$ , corresponding to the $15.625-31.25Hz$ frequency band. The results, averaged across trials for each condition, reveal dynamic changes in inter-regional coherence, with a pronounced peak around the time of odor stimulus delivery ( $t=0$ ). Notably, distinct patterns emerge between correct and incorrect trials, suggesting that coherent activity in this frequency band may play a role in supporting successful memory retrieval and decision making. More results for several other scales can be found in Figure 10 in Appendix D.1.

To further interpret the coherence patterns, Figure 5 provides a spatial summary of the canonical coherence between the two electrode groups at several selected time points. The double-headed arrows represent the magnitude of estimated coherence between the two regions, while the numbers in the circles reflect the individual channel contributions to the global coherence, derived from the elements of the canonical vectors $\mathbf{a}_{5}(\cdotp)$ and $\mathbf{b}_{5}(\cdotp)$ .

The results highlight that both the strength and structure of inter-regional coherence vary dynamically over time and differ across trial outcomes, reflecting the nonstationary nature of neural interactions during task performance. Specifically, incorrect-response trials exhibit lower coherence at time points immediately following the odor stimulus delivery and are often driven by a few dominant channels, while correct trials show higher coherence values and relatively balanced contributions across channels. These findings illustrate the necessity of a framework like WaveCanCoh, which simultaneously identifies time-varying and scale-specific dependence structures between multivariate time series. Traditional stationary or pairwise approaches would fail to capture such nuanced dynamics, highlighting the need for a method like WaveCanCoh to extract such complex relationships in brain activity. The directed interactions between brain regions are also explored in Appendix E with proposed Causal-WaveCanCoh framework.

To further validate the existence of significant differences in the activity between correct- and incorrect-response trials, we propose a time-specific detection procedure based on the permutation test to determine temporally localized differences in the wavelet canonical coherence at a given scale between conditions, while maintaining the nonparametric nature of the statistical inference. The detailed inference procedure steps are shown in Algorithm 2 (Appendix D.2) and the results of the time-specific permutation test for WaveCanCoh across five wavelet scales and four selected time points are summarized in Table 1 (Appendix D.3). Statistically significant differences in canonical coherence between correct and incorrect trials emerge at time points following odor sampling ( $t=0$ ), predominantly at intermediate wavelet scales corresponding approximately to the $8-62Hz$ frequency range. On the other hand, no significant differences between the two groups are observed prior to stimulus onset, indicating that the coherence patterns distinguishing inter-regional communication among trial types are tightly linked to task engagement. These findings demonstrate the effectiveness of the proposed WaveCanCoh framework and associated permutation test in capturing localized, frequency-specific differences in neural coordination between behavioral conditions, thus offering a powerful tool for analyzing complex brain interactions.

7 Conclusions

In this paper we introduced a novel methodological framework, scale-specific wavelet canonical coherence (WaveCanCoh), designed to quantify the dynamic multiscale coherence between two sets of nonstationary multivariate time series. Our primary contributions include the rigorous definition of WaveCanCoh within the multivariate locally stationary wavelet (MvLSW) framework and the development of a comprehensive estimation and theoretically-backed inference procedure based on wavelet analysis. We validated our proposed methodology through simulation studies, demonstrating its accuracy in tracking true coherence structures. The application to local field potential (LFP) activity recorded from subregions of the hippocampus effectively showcased the capability of WaveCanCoh to identify nuanced spatio-temporal coherence patterns associated with cognitive performance, while our permutation-based inference procedure provided a robust, nonparametric approach for detecting significant coherence differences between conditions. Compared to existing stationary and Fourier-based canonical coherence methods, WaveCanCoh is shown to offer significant advantages, particularly through its ability to adaptively capture transient and time-localized interactions, hence it is highly suitable for analyzing signals in neuroscience and other fields with data exhibiting dynamic cross-group interactions.

References

Allen and Rabiner (1977) J.B. Allen and L.R. Rabiner. A unified approach to short-time fourier analysis and synthesis. Proceedings of the IEEE, 65(11):1558–1564, 1977.
Allen et al. (2016) Timothy A. Allen, Daniel M. Salz, Sam McKenzie, and Norbert J. Fortin. Nonspatial sequence coding in ca1 neurons. The Journal of Neuroscience, 36(5):1547–1563, 2016.
Box and Tiao (1977) G. E. P. Box and G. C. Tiao. A canonical analysis of multiple time series. Biometrika, 64(2):355–365, August 1977.
Brillinger (2001) D. R. Brillinger. Time Series: Data Analysis and Theory. SIAM, 2001.
Dahlhaus (1997) Rainer Dahlhaus. Fitting time series models to nonstationary processes. The Annals of Statistics, 25(1):1–37, February 1997.
Daubechies (1992) Ingrid Daubechies. Ten lectures on wavelets. SIAM, 1992.
Embleton et al. (2022) Jonathan Embleton, Marina I. Knight, and Hernando Ombao. Multiscale spectral modelling for nonstationary time series within an ordered multiple-trial experiment. Annals of Applied Statistics, 16(4):2774–2803, 2022.
Geweke (1982) John Geweke. Measurement of linear dependence and feedback between multiple time series. Journal of the American Statistical Association, 77(378):304–313, June 1982.
Grinsted et al. (2004) A. Grinsted, J. C. Moore, and S. Jevrejeva. Application of the cross wavelet transform and wavelet coherence to geophysical time series. Nonlinear Processes in Geophysics, 11(5-6):561–566, 2004.
Hotelling (1936) Harold Hotelling. Relations between two sets of variates. Biometrika, 28(3/4):321–377, 1936.
Huang et al. (2004) Hsiao-Yun Huang, Hernando Ombao, and David S. Stoffer. Discrimination and classification of nonstationary time series using the slex model. Journal of the American Statistical Association, 99(467):763–774, 2004.
Knight et al. (2024) Marina I. Knight, Matthew A. Nunes, and Jessica K. Hargreaves. Adaptive wavelet domain principal component analysis for nonstationary time series. Journal of Computational and Graphical Statistics, 33(3):941–954, 2024.
Mardia et al. (1979) K. V. Mardia, J. T. Kent, and J. M. Bibby. Multivariate Analysis. Academic Press, 1979.
Nason (2008) G. P. Nason. Wavelet Methods in Statistics with R. Springer New York, NY, 2008.
Nason et al. (2000) Guy P. Nason, Rainer Sachs, and Gerald Kroisandt. Wavelet processes and adaptive estimation of the evolutionary wavelet spectrum. Journal of the Royal Statistical Society: Series B (Statistical Methodology), 62(2):271–292, 2000.
Ombao and Pinto (2024) Hernando Ombao and Marco Pinto. Spectral dependence. Econometrics and Statistics, 32:122–159, 2024.
Park et al. (2014) Timothy Park, Idris Eckley, and Hernando Ombao. Estimating time-evolving partial coherence between signals via multivariate locally stationary wavelet processes. IEEE Transactions on Signal Processing, 62:5240–5250, 2014.
Shahbaba et al. (2022) Babak Shahbaba, Lu Li, Francesco Agostinelli, Meenakshi Saraf, Keiland W. Cooper, Daniel Haghverdian, George A. Elias, Pierre Baldi, and Norbert J. Fortin. Hippocampal ensembles represent sequential relationships among an extended sequence of nonspatial events. Nature Communications, 13(1):787, February 2022.
Talento et al. (2024) Mara Sherlin D. Talento, Sarbojit Roy, and Hernando C. Ombao. Kencoh: A ranked-based canonical coherence. arXiv preprint arXiv:2412.10521, December 2024.
Viduarre et al. (2019) C. Viduarre, G. Nolte, I.E.J. de Vries, M. Gómez, T.W. Boonstra, K.-R. Müller, A. Villringer, and V.V. Nikulin. Canonical maximization of coherence: A novel tool for investigation of neuronal interactions between two datasets. NeuroImage, 201:116009, November 2019.
West et al. (1999) Mike West, Raquel Prado, and Andrew D. Krystal. Evaluation and comparison of eeg traces: Latent structure in non-stationary time series. Journal of the American Statistical Association, 94(446):1083–1095, 1999.

Appendix

Appendix A Basic introduction to wavelets

Wavelet analysis provides a powerful framework for studying signals with both time- and scale-varying structure, making it particularly well-suited for nonstationary data. Unlike traditional Fourier-based methods, which decompose signals into global sinusoidal bases and therefore assume stationarity, wavelets enable localized, adaptive decompositions by projecting signals onto functions that are compact in both time and scale. This localization is achieved through two core operations: scaling, which adjusts the width of the wavelet to analyze different resolution levels, and shifting, which moves the wavelet across time to detect when features occur. Specifically, the wavelet functions at scale $j$ and shift $k$ , denoted by $\psi_{j,k}$ , are derived from a mother wavelet $\psi$ and defined as

\displaystyle\psi_{j,k}(t)=2^{-j/2}\psi\left(\frac{t-2^{j}k}{2^{j}}\right),\quad j=1,\ldots,J,

where $J$ represents the number of scales. Smaller scale $j$ corresponds to finer (high-frequency) resolution, and larger $j$ captures coarser (low-frequency) trends. A similar construction applies to the father wavelet, denoted by $\phi_{j,k}$ , which serves as a scaling function (see Daubechies (1992) and Nason (2008) for more details on wavelets). Figure 6 illustrates the effect of scaling and shifting operations on the wavelet function. This ability to isolate both short-lived and long-term features distinguishes wavelet methods from Fourier analysis, allowing for nuanced investigation of nonstationary signals such as neural activity, where structure evolves dynamically across time and scale.

The correspondence between wavelet scale and signal frequency arises from the principle of multi-resolution analysis, which provides the foundational framework for wavelet construction and signal representation. Briefly, the discrete wavelet transform (DWT) decomposes a signal into frequency subbands via a dyadic filter bank architecture. At each level, the signal is passed through a pair of conjugate quadrature filters: a low-pass filter $h[n]$ and a high-pass filter $g[n]$ , followed by downsampling by a factor of two. The low-pass branch yields approximation coefficients that retain coarse-scale information, while the high-pass branch produces detail coefficients that capture localized high-frequency variations. Specifically, given a discrete signal $x[n]$ , the approximation and detail coefficients at level $j$ are obtained by

\displaystyle a_{j}[n]=\sum_{k}h[k]a_{j-1}[2n-k],\quad d_{j}[n]=\sum_{k}g[k]a_{j-1}[2n-k],

where $a_{j-1}$ is the approximation from the previous level (with $a_{0}=x$ ). This process is iterated on the low-pass output, producing a multiscale representation in which each level isolates a specific frequency band. For a signal sampled at rate $f_{s}$ , the detail coefficients at level $j$ correspond approximately to the frequency interval $\left[\frac{f_{s}}{2^{j+1}},\frac{f_{s}}{2^{j}}\right]$ . The orthogonality between subbands ensures perfect reconstruction and energy preservation, and the hierarchical filter bank provides a localized time-frequency analysis with increasingly coarse temporal resolution at lower frequencies. The explanation above offers an intuitive understanding of how components at each wavelet scale can be approximated to true frequency bands (see Figure 7 for an intuitive illustration). This approximation enables our framework to capture the overall association between two sets of locally stationary time series in both the temporal and frequency domains (more details can be found in Daubechies (1992)).

Appendix B Theoretical proofs

Proof of solution to equation (10). Suppose that $\mathbf{X}_{t}$ and $\mathbf{Y}_{t}$ are two multivariate time series and for each time point $t$ , the goal is to find the vectors $\mathbf{a}_{j}(u)$ and $\mathbf{b}_{j}(u)$ that maximize the coherence,

\displaystyle\boldsymbol{\rho}_{j;\mathbf{XY}}(u)=\left\{\mathbf{a}_{j}^{\top}(u)\mathbf{S}_{j;\mathbf{XY}}(u)\mathbf{b}_{j}(u)\right\}^{2}

subject to the normalization constraints described in the main paper

\displaystyle\mathbf{a}_{j}^{\top}(u)\mathbf{S}_{j;\mathbf{XX}}(u)\mathbf{a}_{j}(u)=1,\quad\mathbf{b}_{j}^{\top}(u)\mathbf{S}_{j;\mathbf{YY}}(u)\mathbf{b}_{j}(u)=1.

Assuming rescaled time $u$ and scale $j$ are fixed, we suppress them for clarity. To solve the above optimization problem, we set up the Lagrangian as

\displaystyle\mathcal{L}\left(\mathbf{a},\mathbf{b},\lambda_{1},\lambda_{2}\right)=\frac{1}{2}\left(\mathbf{a}^{\top}\mathbf{S}_{XY}\mathbf{b}\right)^{2}-\frac{\lambda_{1}}{2}\left(\mathbf{a}^{\top}\mathbf{S}_{XX}\mathbf{a}-1\right)-\frac{\lambda_{2}}{2}\left(\mathbf{b}^{\top}\mathbf{S}_{YY}\mathbf{b}-1\right)

Differentiating the Lagrangian with respect to $\mathbf{a}$ and $\mathbf{b}$ , respectively and requiring the partial derivatives to equal zero, we obtain

	$\displaystyle\frac{\partial\mathcal{L}}{\partial\mathbf{a}}$	$\displaystyle=\left(\mathbf{S}_{XY}\mathbf{b}\right)\cdot\left(\mathbf{S}_{XY}\mathbf{b}\right)^{\top}\mathbf{a}-\lambda_{1}\mathbf{S}_{XX}\mathbf{a}=\mathbf{0}$
		$\displaystyle\Rightarrow\left(\mathbf{S}_{XY}\mathbf{b}\right)\cdot\left(\mathbf{S}_{XY}\mathbf{b}\right)^{\top}\mathbf{a}=\lambda_{1}\mathbf{S}_{XX}\mathbf{a},\qquad(\mathrm{i})$
	$\displaystyle\frac{\partial\mathcal{L}}{\partial\mathbf{b}}$	$\displaystyle=\left(\mathbf{S}_{YX}\mathbf{a}\right)\cdot\left(\mathbf{S}_{YX}\mathbf{a}\right)^{\top}\mathbf{b}-\lambda_{2}\mathbf{S}_{YY}\mathbf{b}=\mathbf{0}$
		$\displaystyle\Rightarrow\left(\mathbf{S}_{YX}\mathbf{a}\right)\cdot\left(\mathbf{S}_{YX}\mathbf{a}\right)^{\top}\mathbf{b}=\lambda_{2}\mathbf{S}_{YY}\mathbf{b}.\qquad\mathrm{(ii)}$

With $\mathrm{(i)}$ , $\mathrm{(ii)}$ and since $\left(\mathbf{a}^{\top}\mathbf{S}_{XY}\mathbf{b}\right)$ is a real-valued quantity, we further obtain

\displaystyle\begin{array}[]{l}(\mathbf{a}^{\top}\mathbf{S}_{XY}\mathbf{b})\cdot(\mathbf{a}^{\top}\mathbf{S}_{XY}\mathbf{b})=\lambda_{1}\mathbf{a}^{\top}\mathbf{S}_{XX}\mathbf{a},\mbox{ and}\\ (\mathbf{b}^{\top}\mathbf{S}_{YX}\mathbf{a})\cdot(\mathbf{a}^{\top}\mathbf{S}_{XY}\mathbf{b})=\lambda_{2}\mathbf{b}^{\top}\mathbf{S}_{YY}\mathbf{b}.\end{array}

Recalling the constraints $\mathbf{a}^{\top}\mathbf{S}_{XX}\mathbf{a}=1$ and $\mathbf{b}^{\top}\mathbf{S}_{YY}\mathbf{b}=1$ , it immediately follows that

\displaystyle\lambda_{1}=\lambda_{2}=\left(\mathbf{a}^{\top}\mathbf{S}_{XY}\mathbf{b}\right)^{2}:\,=\lambda.

By substituting the above back into $\mathrm{(i)}$ , $\mathrm{(ii)}$ and assuming $\lambda$ to be non-zero, we obtain

\displaystyle\begin{array}[]{l}\mathbf{a}=\frac{1}{\sqrt{\lambda}}\mathbf{S}_{XX}^{-1}\mathbf{S}_{XY}\mathbf{b}\mbox{ and}\\ \mathbf{b}=\frac{1}{\sqrt{\lambda}}\mathbf{S}_{YY}^{-1}\mathbf{S}_{YX}\mathbf{a},\end{array}

which plugged into $\mathrm{(ii)}$ , $\mathrm{(i)}$ , respectively, yield

\displaystyle\begin{array}[]{l}\mathbf{S}_{XY}\mathbf{S}_{YY}^{-1}\mathbf{S}_{YX}\ \mathbf{a}=\lambda\mathbf{S}_{XX}\mathbf{a}\\ \mathbf{S}_{YX}\mathbf{S}_{XX}^{-1}\mathbf{S}_{XY}\mathbf{b}=\lambda\mathbf{S}_{YY}\mathbf{b}\mbox{ or, equivalently,}\\ \end{array}

\displaystyle\begin{array}[]{l}\mathbf{S}_{XX}^{-1}\mathbf{S}_{XY}\mathbf{S}_{YY}^{-1}\mathbf{S}_{YX}\mathbf{a}=\lambda\mathbf{a},\\ \mathbf{S}_{YY}^{-1}\mathbf{S}_{YX}\mathbf{S}_{XX}^{-1}\mathbf{S}_{XY}\mathbf{b}=\lambda\mathbf{b}.\\ \end{array}

Hence $\lambda=\left(\mathbf{a}^{\top}\mathbf{S}_{XY}\mathbf{b}\right)^{2}$ is an eigenvalue for both matrices

	$\displaystyle\mathbf{S}_{XX}^{-1}\mathbf{S}_{XY}\mathbf{S}_{YY}^{-1}\mathbf{S}_{YX},\mbox{ and}$
	$\displaystyle\mathbf{S}_{YY}^{-1}\mathbf{S}_{YX}\mathbf{S}_{XX}^{-1}\mathbf{S}_{XY}$

whose corresponding eigenvectors are $\mathbf{a}$ and $\mathbf{b}$ , respectively. Thus, the defined canonical coherence $\boldsymbol{\rho}=\max\left(\mathbf{a}^{\top}\mathbf{S}_{XY}\mathbf{b}\right)^{2}$ is the largest eigenvalue of above matrices. The case when $\lambda=0$ illustrates the canonical coherence between $\mathbf{X}$ and $\mathbf{Y}$ is 0, which is not a meaningful scenario for this problem. We recall the above equations hold for every time $u$ and scale $j$ .

Proof of consistency of WaveCanCoh estimator. We aim to establish the consistency of the matrix estimators

\displaystyle\widehat{\mathbf{M}}_{j;\mathbf{a}}=\widehat{\mathbf{S}}_{j,\mathbf{XX}}^{-1}\widehat{\mathbf{S}}_{j,\mathbf{XY}}\widehat{\mathbf{S}}_{j,\mathbf{YY}}^{-1}\widehat{\mathbf{S}}_{j,\mathbf{YX}},\quad\widehat{\mathbf{M}}_{j;\mathbf{b}}=\widehat{\mathbf{S}}_{j,\mathbf{Y}\mathbf{Y}}^{-1}\widehat{\mathbf{S}}_{j,\mathbf{YX}}\widehat{\mathbf{S}}_{j,\mathbf{XX}}^{-1}\widehat{\mathbf{S}}_{j,\mathbf{XY}}

According to Nason et al. (2000) and Park et al. (2014), the smoothed periodogram-based estimators of the local wavelet spectral (LWS) matrices are consistent. Specifically, as the number of time points $T\to\infty$ and the smoothing parameter $M\to\infty$ with $M/T\to 0$ , we have

\displaystyle\widehat{\mathbf{S}}_{j,\mathbf{XX}}\xrightarrow{P}\mathbf{S}_{j,\mathbf{XX}},\quad\widehat{\mathbf{S}}_{j,\mathbf{XY}}\xrightarrow{P}\mathbf{S}_{j,\mathbf{XY}},\quad\widehat{\mathbf{S}}_{j,\mathbf{YX}}\xrightarrow{P}\mathbf{S}_{j,\mathbf{YX}},\quad\widehat{\mathbf{S}}_{j,\mathbf{YY}}\xrightarrow{P}\mathbf{S}_{j,\mathbf{YY}}.

Assuming the spectral matrices and their estimators are non-singular, it follows by the continuous mapping theorem that

\displaystyle\widehat{\mathbf{S}}_{j,\mathbf{XX}}^{-1}\xrightarrow{P}\mathbf{S}_{j,\mathbf{XX}}^{-1},\quad\widehat{\mathbf{S}}_{j,\mathbf{Y}\mathbf{Y}}^{-1}\xrightarrow{P}\mathbf{S}_{j,\mathbf{YY}}^{-1}.

Since matrix multiplication is continuous with respect to convergence in probability, we obtain the consistency of the matrix estimators by Slutsky’s theorem, namely

\displaystyle\widehat{\mathbf{M}}_{j;\mathbf{a}}\xrightarrow{P}\mathbf{M}_{j;\mathbf{a}},\quad\widehat{\mathbf{M}}_{j;\mathbf{b}}\xrightarrow{P}\mathbf{M}_{j;\mathbf{b}}.

Consequently, the estimated wavelet canonical coherence and associated canonical vectors, derived from the largest eigenvalue and corresponding eigenvectors of $\widehat{\mathbf{M}}_{j;a}$ and $\widehat{\mathbf{M}}_{j;b}$ , also converge in probability to their population counterparts, following arguments akin to those in Knight et al. (2024).

Appendix C Details of the simulation setup

This appendix provides the full specification of the simulation experiments described in Section 5 of the main text.

C.1 MvLSW-based simulation

We simulate a $P+Q=6+4$ dimensional multivariate time series $\mathbf{Z}_{t}=[\mathbf{X}_{t}^{\top},\mathbf{Y}_{t}^{\top}]^{\top}\in\mathbb{R}^{10}$ from a multivariate locally stationary wavelet (MvLSW) process over $T=1024$ time points. The wavelet spectrum $\mathbf{S}_{j;\mathbf{ZZ}}(u)$ is non-zero only at scale $j=2$ and is structured as a $10\times 10$ block matrix

\mathbf{S}_{j=2;\mathbf{ZZ}}(u)=\begin{bmatrix}\mathbf{S}_{j;\mathbf{XX}}&\mathbf{S}_{j;\mathbf{XY}}(u)\\ \mathbf{S}_{j;\mathbf{YX}}(u)&\mathbf{S}_{j;\mathbf{YY}}\end{bmatrix},\quad u=\frac{t}{T}\in(0,1).

Auto-spectral block: $\mathbf{S}_{j;\mathbf{XX}}\in\mathbb{R}^{6\times 6}$

\mathbf{S}_{j;\mathbf{XX}}=\begin{bmatrix}8&1&1&0&0&0\\ 1&8&0&0&0&1\\ 1&0&8&0&0&0\\ 0&0&0&8&1&0\\ 0&0&0&1&8&0\\ 0&1&0&0&0&8\end{bmatrix}

Auto-spectral block: $\mathbf{S}_{j;\mathbf{YY}}\in\mathbb{R}^{4\times 4}$

\mathbf{S}_{j;\mathbf{YY}}=\begin{bmatrix}6&0&1&0\\ 0&6&1&1\\ 1&1&6&0\\ 0&1&0&6\end{bmatrix}

Cross-spectral block: $\mathbf{S}_{j;\mathbf{XY}}(u)\in\mathbb{R}^{6\times 4}$

This block is time-varying. For $u<0.5$ , each specified cross-group pair has coherence value 1; for $u\geq 0.5$ , the same entries increase to 2.

Let

c(u)=\begin{cases}1&\text{if }u<0.5\\ 2&\text{if }u\geq 0.5\end{cases}

then

\mathbf{S}_{j;\mathbf{XY}}(u)=\begin{bmatrix}c(u)&0&0&c(u)\\ 0&c(u)&0&0\\ 0&0&0&c(u)\\ c(u)&0&0&0\\ 0&0&0&0\\ 0&0&0&0\\ \end{bmatrix},\quad\mathbf{S}_{j;\mathbf{YX}}(u)=\mathbf{S}_{j;\mathbf{XY}}^{\top}(u)

The entries of the cross-spectral matrix determine how individual channels contribute to the global coherence structure between $\{\mathbf{X}_{t}\}$ and $\{\mathbf{Y}_{t}\}$ . Figure 8 visualizes the spectral structure $\mathbf{S}_{j=2;\mathbf{ZZ}}(u)$ , and Figure 9 shows example realizations of the simulated processes $\{\mathbf{X}_{t}\}$ and $\{\mathbf{Y}_{t}\}$ .

C.2 Mixture of AR(2)-based simulation

Fourier-based LSP method for comparison

As a benchmark, we implement a method based on the time-varying Cramér representation for locally stationary processes (LSP), introduced in Dahlhaus (1997). A locally stationary process $\{\mathbf{X}_{t,T}\}$ can be expressed as

\mathbf{X}_{t,T}=\int_{-0.5}^{0.5}\mathbf{A}(u,\omega)e^{2\pi i\omega t}\,dZ(\omega),\quad u=t/T,

where $\mathbf{A}(u,\omega)$ is a smoothly varying transfer function and $Z(\omega)$ is a complex orthogonal increment process with $\mathbb{E}[|dZ(\omega)|^{2}]=d\omega$ . The local spectral density is then defined as $\mathbf{f}(u,\omega)=|\mathbf{A}(u,\omega)|^{2}$ . We estimate $\mathbf{f}(u,\omega)$ via the Short-Time Fourier Transform (STFT) with a Gaussian smoothing kernel and compute canonical coherence over $\omega\in[25,50]Hz$ .

Simulation setting

We simulate 500 independent replicates of two multivariate processes, $\{\mathbf{X}_{t}\}\in\mathbb{R}^{4}$ and $\{\mathbf{Y}_{t}\}\in\mathbb{R}^{3}$ , for $t=1,\ldots,T$ , with $T=1024$ and sampling rate $f_{s}=100Hz$ . Each process is generated from mixture of $K=5$ latent AR(2) sources, $\mathbf{Z}^{(\mathbf{X})}_{t},\mathbf{Z}^{(\mathbf{Y})}_{t}\in\mathbb{R}^{K}$ , peaking at different frequency bands (delta, theta, alpha, beta, gamma), respectively. For the first half of the time series ( $t\leq T/2$ ), the observed processes are given by:

\mathbf{X}_{t}=B^{(1)}\mathbf{Z}^{(\mathbf{X})}_{t},\qquad\mathbf{Y}_{t}=C^{(1)}\mathbf{Z}^{(\mathbf{Y})}_{t},

and for the second half ( $t>T/2$ ):

\mathbf{X}_{t}=B^{(2)}\mathbf{Z}^{(\mathbf{X})}_{t},\qquad\mathbf{Y}_{t}=C^{(2)}\mathbf{Z}^{(\mathbf{Y})}_{t},

where the mixing matrices are:

B^{(1)}=\begin{bmatrix}0&0&0&0&0.95\\ 0&0&0&0&0.90\\ b_{3,1}^{(1)}&b_{3,2}^{(1)}&0&0&0\\ b_{4,1}^{(1)}&b_{4,2}^{(1)}&b_{4,3}^{(1)}&0&0\end{bmatrix},\quad B^{(2)}=\begin{bmatrix}0&b_{1,2}^{(2)}&b_{1,3}^{(2)}&0&0\\ 0&0&0.80&0&0\\ 0.90&0&0&0&0\\ 0&b_{4,2}^{(2)}&b_{4,3}^{(2)}&0&0\end{bmatrix}

C^{(1)}=\begin{bmatrix}0&0&0&0&0.95\\ 0&0&0&0&0.90\\ 0&c_{3,2}^{(1)}&c_{3,3}^{(1)}&0&0\end{bmatrix},\quad C^{(2)}=\begin{bmatrix}0&0&0&0.90&0\\ 0&0&c_{2,3}^{(2)}&0&0\\ c_{3,1}^{(2)}&0&0&0&0\end{bmatrix}

For each row of the mixing matrices, a selected subset of frequency bands is assigned non-zero weights that are randomly drawn to sum to a predefined total (e.g., 0.95, 0.90, or 1.0), introducing controlled variability across replicates while preserving the intended contribution structure. The latent sources $\mathbf{Z}^{(\mathbf{X})}_{t}$ and $\mathbf{Z}^{(\mathbf{Y})}_{t}$ are partially shared in the gamma band (component 5) during the first regime, with mixing weights $\alpha=0.7$ and $\beta=0.6$ , respectively:

Z^{(\mathbf{X})}_{t,5}=\alpha Z^{(\text{shared})}_{t,5}+(1-\alpha)Z^{(\mathbf{X},\text{private})}_{t,5},\qquad Z^{(\mathbf{Y})}_{t,5}=\beta Z^{(\text{shared})}_{t,5}+(1-\beta)Z^{(\mathbf{Y},\text{private})}_{t,5},\quad t\leq T/2

where $Z^{(\text{shared})}_{t,5}$ is a common latent process. Specifically, each channel, say $p$ , of $\{\mathbf{Z}_{t}^{\text{(shared)}}\}$ , $\{\mathbf{Z}_{t}^{(\mathbf{X},\text{private})}\}$ and $\{\mathbf{Z}_{t}^{(\mathbf{Y},\text{private})}\}$ is generated from the following AR(2) process independently,

\displaystyle Z_{t,p}=\phi_{1}Z_{t-1,p}+\phi_{2}Z_{t-2,p}+w_{t},

where $\{w_{t}\}$ is white noise and the coefficients are $\phi_{1}=2cos(2\pi\eta)/e^{s}$ , $\phi_{2}=-1/e^{2s}$ . For each component $p=1,\ldots,5$ , we use the frequency vector, $\boldsymbol{\eta}=\{\eta^{(1)},\ldots,\eta^{(5)}\}=\{0.02,0.06,0.10,0.175,0.375\}$ , and the sharpness parameter $\mathbf{s}=\{s^{(1)},\ldots,s^{(5)}\}=\{0.03,0.03,0.03,0.05,0.05\}$ , where smaller $s$ yields narrower frequency bands.

This design induces time-varying coherence between the first two channels of $\mathbf{X}$ and $\mathbf{Y}$ during the first regime, primarily through the shared gamma component, while maintaining independence during the second regime. The abrupt transition in mixing structure at $t=T/2$ provides a controlled setting for evaluating the sensitivity of coherence estimation methods to sudden changes in cross-dependence.

Appendix D Additional analysis for the LFP data in Section 6

D.1 Additional results

We display the estimated wavelet canonical coherence between two electrode clusters across additional scales. Figure 10 shows that cross-group associations vary significantly over time, with the stimulus clearly eliciting a scale-dependent response, indicative of the heterogeneous brain activity across different frequency bands. These results emphasize the importance of capturing scale-specific, time-varying coherence.

D.2 Permutation test: Algorithm

Algorithm 2 below gives the detailed procedure used in Section 6 for detecting significant differences in canonical coherence between correct- and incorrect-response trials, at selected time points.

Algorithm 2 Time-specific window permutation test for wavelet coherence analysis

Assume the time-localized, scale-specific WaveCanCoh

\rho_{\text{correct/ incorrect }}^{(r)}(j,u)

is available for each trial

r

, along with the trial label ‘correct/ incorrect’. We want to test for significant differences between correct- and incorrect-response trial groups at specific time points

t^{*}

and given scale

j

1.Window definition: For each time point

t^{*}

, define a window of size

w

centered at

t^{*}

t_{\text{start }}=t^{*}-w/2,\quad t_{\text{end }}=t^{*}+w/2

2.Test statistic: For each time point

t^{*}

and given scale

j

, compute:

T_{\mathrm{obs}}\left(j,t^{*}\right)=\sum_{t=t_{\text{start }}}^{t_{\text{end }}}\left(\operatorname{median}_{r}\left(\rho_{\text{correct }}^{(r)}(j,t/T)\right)-\operatorname{median}_{r}\left(\rho_{\text{incorrect }}^{(r)}(j,t/T)\right)\right)^{2}

3.Permutation: For each

t^{*}

and scale

j

•

Combine all $\rho^{(r)}(j,t/T)$ values from both correct- and incorrect-response trials across the time window.

•

Perform $n_{\text{perm}}$ random permutations. For each permutation $i$ :

–

Randomly assign trials into two new groups of the same sizes as the original groups.

–

Compute the permuted statistic:

T_{\text{perm}}^{(i)}\left(j,t^{*}\right)=\sum_{t=t_{\text{start }}}^{t_{\text{end }}}\left(\operatorname{median}_{r}\left(\rho_{\text{perm},1}^{(r)}(j,t/T)\right)-\operatorname{median}_{r}\left(\rho_{\text{perm},2}^{(r)}(j,t/T)\right)\right)^{2}

where $\rho_{\text{perm,}1}^{(r)}$ and $\rho_{\text{perm,}2}^{(r)}$ are the permuted trial groups.

4.Calculate

p

-value:

p\left(j,t^{*}\right)=\frac{1}{n_{\mathrm{perm}}}\sum_{i=1}^{n_{\mathrm{perm}}}\mathbb{I}\left(T_{\mathrm{perm}}^{(i)}\left(j,t^{*}\right)\geq T_{\mathrm{obs}}\left(j,t^{*}\right)\right)

D.3 Permutation test: LFP results

Table 1 reports the permutation test results on the LFP data, with the value in each cell representing the difference in the median wavelet coherence between correct and incorrect trials at the corresponding scale $j$ and time $t^{*}=-1.0s,-0.5s,0.5s,1.0s$ . The values in parentheses denote the permutation $p$ -values obtained using the windowed test procedure (window size $=0.2s$ ), with the number of permutation $n_{perm}=1000$ , and the detailed test results are shown in Figure 11, revealing significant differences between correct- and incorrect-response trials at scales $j=4$ to $7$ following the odor stimulation.

While our permutation-based inference procedure provides a robust, nonparametric approach for detecting significant coherence differences between conditions, nevertheless this framework does exhibit certain limitations, e.g., it inherently focuses on predefined wavelet scales rather than on arbitrary frequency bands, potentially restricting analyses that require precise frequency localization.

Table 1: Differences in median wavelet canonical coherence between correct- and incorrect-response trials across time points and scales. Each cell reports the median difference at scale

j

and time

t^{*}

, with

p

-values obtained from the time-specific permutation test shown in parentheses.

	-1.0	-0.5	0.5	1.0
3 ( $62.5-125Hz$ )	0.222 (0.786)	0.068 (0.704)	-0.178 (0.355)	0.106 (0.886)
4 ( $31.25-62.5Hz$ )	-0.090 (0.180)	-0.021 (0.506)	0.023 (0.001**)	0.027 (0.691)
5 ( $15.63-31.25Hz$ )	0.006 (0.977)	0.229 (0.999)	0.334 (0.002**)	0.016 (0.079)
6 ( $7.81-15.63Hz$ )	-0.025 (0.239)	-0.049(0.111)	0.039 (0.001**)	0.002 (0.025**)
7 ( $<7.81Hz$ )	-0.014 (0.999)	0.058 (0.640)	0.012 (0.059)	-0.030 (0.489)

Appendix E Causal-WaveCanCoh analysis

To further explore directed interactions between brain regions, we implement the Causal-WaveCanCoh framework (equation (17)) on LFP activity data, which extends WaveCanCoh by introducing a lead-lag structure to evaluate time-lagged canonical coherence. Specifically, we define $\mathbf{X}_{t}=\left({X^{(\text{T1})},X^{(\text{T2})},X^{(\text{T4})},X^{(\text{T5})}}\right)$ and $\mathbf{Y}_{t}=\left({Y^{(\text{T13})},\ldots,Y^{(\text{T17})}}\right)$ as the multivariate signals corresponding to the two investigated distinct hippocampal regions, and we conduct the analysis in both directions, namely $\mathbf{X}_{t}\to\mathbf{Y}_{t+h}$ and $\mathbf{Y}_{t}\to\mathbf{X}_{t+h}$ , for lags $h=0,\,10,\,20,\,30,\,40,\,50$ , corresponding to time shifts from 0 (contemporaneous dependence, used as reference point) to 0.05 seconds.

Figure 12 shows the average scale-specific estimated causal canonical coherence for both directions across the odor presentation time ( $0s-1.2s$ ) and trial type (correct- vs incorrect-response trials). The results reveal lag-dependent and scale-driven patterns of directed coherence that identify a stronger association between the two hippocampal regions at scale $j=5$ (corresponding to the frequency band $15.625-31.25Hz$ ). This notably occurs in both directions, with the activity in T13-T17 leading that of T1-T5 in correct-response trials after $h>10$ . The strength and behavior of coherence vary across scales, as well as across correct- and incorrect-response trial groups, which can be captured by our Causal-WaveCanCoh framework.

Wavelet Canonical Coherence for Nonstationary Signals

Abstract

1 Introduction

2 Related works

3 Wavelet canonical coherence (WaveCanCoh)

4 Estimation procedure

5 Simulation study

6 Local field potential (LFP) data analysis

7 Conclusions

References

Appendix

Appendix A Basic introduction to wavelets

Appendix B Theoretical proofs

Appendix C Details of the simulation setup

C.1 MvLSW-based simulation

Auto-spectral block: 𝐒j;𝐗𝐗∈ℝ6×6\mathbf{S}_{j;\mathbf{XX}}\in\mathbb{R}^{6\times 6}

Auto-spectral block: 𝐒j;𝐘𝐘∈ℝ4×4\mathbf{S}_{j;\mathbf{YY}}\in\mathbb{R}^{4\times 4}

Cross-spectral block: 𝐒j;𝐗𝐘​(u)∈ℝ6×4\mathbf{S}_{j;\mathbf{XY}}(u)\in\mathbb{R}^{6\times 4}

C.2 Mixture of AR(2)-based simulation

Fourier-based LSP method for comparison

Simulation setting

Appendix D Additional analysis for the LFP data in Section 6

D.1 Additional results

D.2 Permutation test: Algorithm

D.3 Permutation test: LFP results

Appendix E Causal-WaveCanCoh analysis

Auto-spectral block: $\mathbf{S}_{j;\mathbf{XX}}\in\mathbb{R}^{6\times 6}$

Auto-spectral block: $\mathbf{S}_{j;\mathbf{YY}}\in\mathbb{R}^{4\times 4}$

Cross-spectral block: $\mathbf{S}_{j;\mathbf{XY}}(u)\in\mathbb{R}^{6\times 4}$