This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Wavelet Canonical Coherence for Nonstationary Signals

Haibo Wu
Statistics Program
King Abdullah University of Science and Technology, Saudi Arabia
haibo.wu@kaust.edu.sa
&Marina I. Knight
Department of Mathematics
University of York, UK
marina.knight@york.ac.uk
&Keiland W. Cooper
Department of Neurobiology and Behavior
Center for the Neurobiology of Learning and Memory
University of California, Irvine
kwcooper@uci.edu
&Norbert J. Fortin
Department of Neurobiology and Behavior
Center for the Neurobiology of Learning and Memory
University of California, Irvine
norbert.fortin@uci.edu
&Hernando Ombao
Statistics Program
King Abdullah University of Science and Technology, Saudi Arabia
hernando.ombao@kaust.edu.sa
Abstract

Understanding the evolving dependence between two clusters of multivariate signals is fundamental in neuroscience and other domains where sub-networks in a system interact dynamically over time. Despite the growing interest in multivariate time series analysis, existing methods for between-clusters dependence typically rely on the assumption of stationarity and lack the temporal resolution to capture transient, frequency-specific interactions. To overcome this limitation, we propose scale-specific wavelet canonical coherence (WaveCanCoh), a novel framework that extends canonical coherence analysis to the nonstationary setting by leveraging the multivariate locally stationary wavelet model. The proposed WaveCanCoh enables the estimation of time-varying canonical coherence between clusters, providing interpretable insight into scale-specific time-varying interactions between clusters. Through extensive simulation studies, we demonstrate that WaveCanCoh accurately recovers true coherence structures under both locally stationary and general nonstationary conditions. Application to local field potential (LFP) activity data recorded from the hippocampus reveals distinct dynamic coherence patterns between correct and incorrect memory-guided decisions, illustrating the capacity of the method to detect behaviorally relevant neural coordination. These results highlight WaveCanCoh as a flexible and principled tool for modeling complex cross-group dependencies in nonstationary multivariate systems. The code for WaveCanCoh is available at: https://github.com/mhaibo/WaveCanCoh.git.

1 Introduction

Assessing the dependence structure between node clusters in a network is one of the most critical aspects of network time series analysis. Many models and frameworks have been developed to capture between-clusters association (e.g., correlation, coherence, and causality). Most existing methods characterize the dependence between two clusters through the dependence between (many) node pairs. However, in many scenarios, the primary interest lies in understanding the dependence structure between two groups of multivariate time series rather than individual processes. Figure 1 illustrates this perspective using brain activity signals. In this experiment, local field potential (LFP) activity was recorded from multiple electrodes implanted in different subregions of the hippocampus of rodents (rats) as they performed a complex sequence memory task. Instead of focusing on coherence between individual channels (electrodes), the main goal is to quantify time-varying functional interactions between groups of electrodes in order to understand how information processing differs in these two subregions. Similar challenges arise in other domains. For instance, in finance, understanding the dependence between entire market sectors (e.g., technology and energy) can be more informative than analyzing associations between individual stocks. These scenarios require a framework capable of capturing dynamic coherence between sets of nonstationary multivariate signals. In this paper, we propose a novel framework called scale-specific wavelet canonical coherence (WaveCanCoh) to characterize time-localized and scale-specific coherence between two clusters of multivariate time series. By leveraging the time-frequency localization properties of wavelets, WaveCanCoh is well-suited for analyzing nonstationary multivariate signals in neuroscience, finance, and other fields where transient, cross-group interactions are of scientific interest.

Refer to caption
Figure 1: Schematic overview of the experimental and analytical motivation. Local field potential (LFP) activity was recorded from multiple electrodes in the hippocampus as rats performed an odor sequence memory task. Rather than focusing on individual electrode pairs, our goal is to characterize the dynamic dependence between two hippocampal subregions, each represented by a group of electrodes, using wavelet-based multivariate coherence.

Canonical variate analysis (CVA) (Hotelling (1936)) provides a method for measuring the correlation between two vector variables, its application to time series data having started in the 1950s and 1960s, primarily in the econometrics and signal processing fields. Box and Tiao (1977) and Geweke (1982) extend CVA to time series for forecasting and causality detection. Brillinger (2001) provides a spectral domain formulation of canonical correlation, useful for frequency-domain time series analysis. This approach enables the analysis of canonical coherence between two sets of time series across different frequency bands. Many modern studies have been developed within this framework (e.g., Talento et al. (2024) and Viduarre et al. (2019)), and related methods have been widely applied across various fields, including neuroscience, finance, speech processing, and machine learning.

However, previous methods rely on the assumption that time series are (weakly) stationary, meaning their statistical properties (e.g., expectation and covariance, spectral signature) remain constant over time. In practice, this assumption often does not hold for time series that arise in practice. Moreover, two sets of time series commonly exhibit time-varying global coherence, which can sometimes be crucial for analysis. Thus, a method capable of handling nonstationary time series is necessary. Wavelet analysis is a widely used tool for studying nonstationary time series, as its localization property allows for the examination of localized correlations between two time series across both time and frequency domains. Wavelets are particularly effective for capturing transient properties of nonstationary signals due to their compact support, which can be compressed or stretched to adapt to the dynamic characteristics of the signal. Wavelet coherence has been well-defined and extensively studied in previous research, with applications spanning various fields (Embleton et al., 2022; Grinsted et al., 2004). To the best of our knowledge, no existing work has extended classical canonical coherence to the wavelet domain to measure the time-evolving canonical coherence between two groups of multivariate time series.

The key novelty of this paper lies in the development of a comprehensive and rigorous framework based on wavelets for measuring canonical coherence between two sets of nonstationary multivariate time series. Specifically, our main contributions include: (1) we define scale-specific wavelet canonical coherence (WaveCanCoh) and introduce its use as a tool to quantify the coherence between two sets of multivariate time series, (2) we provide a complete and theoretically justified algorithm for its estimation, and (3) we apply our method to LFP activity data from multiple electrodes to quantify dynamic interaction patterns among different subregions in the hippocampus. Multivariate locally stationary wavelet processes (MvLSW) underpin our construction, and the reader is directed to Nason et al. (2000) and Park et al. (2014) for details on their construction. Our framework not only captures the time-varying coherence between two sets of signals but also determines the contribution of each individual channel to the global coherence. Compared to previous models, the proposed approach provides a detailed, localized characterization of interactions within the multivariate time series. Our findings on the LFP activity data offer new insights into the functional relationship between hippocampal subregions, demonstrating the potential of our method to advance the study of functional brain connectivity.

The format of the paper is as follows. Section 2 overviews the current methodology for assessing time series canonical coherence. Section 3 provides a brief overview of MvLSW processes, supporting a detailed introduction to our proposed WaveCanCoh framework. Its estimation, and that of related parameters, is tackled in Section 4. Section 5 validates the proposed framework and demonstrates its performance through simulation. In Section 6 we apply the WaveCanCoh method on LFP activity data collected from rats to investigate the dynamic interactions of different subregions in the hippocampus during memory tasks. Section 7 concludes the paper.

2 Related works

First, we provide a brief introduction to classical canonical correlation analysis for time series, with the primary goal to characterize dependence between two clusters of time series, where each cluster features several nodes. In particular, we consider two multivariate time series with dimensions pp and qq respectively, denoted by 𝐗t=(Xt(1),,Xt(p))\mathbf{X}_{t}=\left(X_{t}^{(1)},\ldots,X_{t}^{(p)}\right)^{\top} and 𝐘t=(Yt(1),,Yt(q))\mathbf{Y}_{t}=\left(Y_{t}^{(1)},\ldots,Y_{t}^{(q)}\right)^{\top}, for t={1,,T}t=\{1,\ldots,T\}. Typically, {𝐗t}\{\mathbf{X}_{t}\} and {𝐘t}\{\mathbf{Y}_{t}\} are assumed to be zero-mean weakly stationary time series. Letting 𝐙t\mathbf{Z}_{t} denote the concatenated (p+qp+q) dimension time series, 𝐙t=(Xt(1),,Xt(p),Yt(1),,Yt(q))\mathbf{Z}_{t}=\left(X_{t}^{(1)},\ldots,X_{t}^{(p)},Y_{t}^{(1)},\ldots,Y_{t}^{(q)}\right)^{\top}, its covariance matrix at lag τ\tau is

𝚺𝐙𝐙(τ)=(𝚺𝐗𝐗(τ)𝚺𝐗𝐘(τ)𝚺𝐘𝐗(τ)𝚺𝐘𝐘(τ)),\displaystyle\boldsymbol{\Sigma}_{\mathbf{ZZ}}(\tau)=\left(\begin{array}[]{cc}\boldsymbol{\Sigma}_{\mathbf{XX}}(\tau)&\boldsymbol{\Sigma}_{\mathbf{XY}}(\tau)\\ \boldsymbol{\Sigma}_{\mathbf{YX}}(\tau)&\boldsymbol{\Sigma}_{\mathbf{YY}}(\tau)\end{array}\right),

where 𝚺𝐗𝐗()\boldsymbol{\Sigma}_{\mathbf{XX}}(\cdot), 𝚺𝐘𝐘()\boldsymbol{\Sigma}_{\mathbf{YY}}(\cdot) are the autocovariance matrices of {𝐗t}\{\mathbf{X}_{t}\} and {𝐘t}\{\mathbf{Y}_{t}\} respectively, and 𝚺𝐗𝐘()\boldsymbol{\Sigma}_{\mathbf{XY}}(\cdot), 𝚺𝐘𝐗()\boldsymbol{\Sigma}_{\mathbf{YX}}(\cdot) are their cross-covariances. The canonical correlation between {𝐗t}\{\mathbf{X}_{t}\} and {𝐘t}\{\mathbf{Y}_{t}\} at lag τ\tau, 𝝆(τ)\boldsymbol{\rho}(\tau), is defined as

𝝆(τ)=max𝐚,𝐛 𝐚𝚺𝐗𝐘(τ)𝐛𝐚𝚺𝐗𝐗(τ)𝐚𝐛𝚺𝐘𝐘(τ)𝐛,\displaystyle\boldsymbol{\rho}(\tau)=\max_{\mathbf{a},\mathbf{b}}\text{ }\frac{\mathbf{a}^{\top}\boldsymbol{\Sigma}_{\mathbf{XY}}(\tau)\mathbf{b}}{\sqrt{\mathbf{a}^{\top}\boldsymbol{\Sigma}_{\mathbf{XX}}(\tau)\mathbf{a}}{\sqrt{\mathbf{b}^{\top}\boldsymbol{\Sigma}_{\mathbf{YY}}(\tau)\mathbf{b}}}}, (1)

where 𝐚p\mathbf{a}\in\mathbb{R}^{p} and 𝐛q\mathbf{b}\in\mathbb{R}^{q} are called canonical correlation vectors, subject to standardized constraints 𝐚𝚺𝐗𝐗𝐚=1\mathbf{a}^{\top}\boldsymbol{\Sigma}_{\mathbf{XX}}\mathbf{a}=1, 𝐛𝚺𝐘𝐘𝐛=1\mathbf{b}^{\top}\boldsymbol{\Sigma}_{\mathbf{YY}}\mathbf{b}=1 (Brillinger, 2001). Thus, the canonical correlation can be rewritten as

𝝆(τ)=max𝐚,𝐛 (𝐚𝚺𝐗𝐘(τ)𝐛).\displaystyle\boldsymbol{\rho}{(\tau)}=\max_{\mathbf{a},\mathbf{b}}\text{ }\left(\mathbf{a}^{\top}\boldsymbol{\Sigma}_{\mathbf{XY}}(\tau)\mathbf{b}\right). (2)

Remark 1: In the preceding definitions for the cross-covariance matrices, we have 𝚺𝐗𝐘(τ)=𝚺𝐘𝐗(τ)=𝔼[𝐗t𝐘tτ]\boldsymbol{\Sigma}_{\mathbf{XY}}(\tau)=\boldsymbol{\Sigma}_{\mathbf{YX}}(-\tau)=\mathbb{E}\left[\mathbf{X}_{t}\mathbf{Y}_{t-\tau}^{\top}\right], which measure the lagged covariance between {𝐗t}\{\mathbf{X}_{t}\} and {𝐘t}\{\mathbf{Y}_{t}\}, i.e., past values of 𝐘\mathbf{Y} may be associated to present values of 𝐗\mathbf{X} for τ>0\tau>0, and vice-versa when τ<0\tau<0. The case of τ=0\tau=0 illustrates contemporaneous relationships.

The solution for 𝐚\mathbf{a} and 𝐛\mathbf{b} in equation (1) can be obtained from the eigenvectors of the following matrices, respectively (in what follows, τ\tau is dropped for brevity),

𝚺𝐗𝐗1𝚺𝐗𝐘𝚺𝐘𝐘1𝚺𝐘𝐗, and\displaystyle\boldsymbol{\Sigma}_{\mathbf{XX}}^{-1}\boldsymbol{\Sigma}_{\mathbf{XY}}\boldsymbol{\Sigma}_{\mathbf{YY}}^{-1}\boldsymbol{\Sigma}_{\mathbf{YX}}\mbox{, and} (3)
𝚺𝐘𝐘1𝚺𝐘𝐗𝚺𝐗𝐗1𝚺𝐗𝐘.\displaystyle\boldsymbol{\Sigma}_{\mathbf{YY}}^{-1}\boldsymbol{\Sigma}_{\mathbf{YX}}\boldsymbol{\Sigma}_{\mathbf{XX}}^{-1}\boldsymbol{\Sigma}_{\mathbf{XY}}. (4)

Here, 𝐚\mathbf{a} and 𝐛\mathbf{b} are the eigenvectors corresponding to the largest eigenvalue, λ\lambda, of matrices (3) and (4), respectively, and for largest canonical correlation coefficient we have 𝝆=λ\boldsymbol{\rho}=\sqrt{\lambda} (Mardia et al., 1979). The framework above yields the canonical correlation between {𝐗t}\{\mathbf{X}_{t}\} and {𝐘t}\{\mathbf{Y}_{t}\}. However, in many practical cases, such as EEG analysis, canonical correlation in the spectral domain is more meaningful than in the time domain, as components at different frequencies (or scales) reveal crucial information about neural dynamics and functional connectivity (Ombao and Pinto, 2024) , Brillinger (2001) extends the time-domain canonical correlation into the spectral domain). Namely, suppose the spectral matrix of 𝐙t=(𝐗t,𝐘t)\mathbf{Z}_{t}=\left(\mathbf{X}_{t}^{\top},\mathbf{Y}_{t}^{\top}\right)^{\top} is

𝐟𝐙𝐙(𝝎)=[𝐟𝐗𝐗(𝝎)𝐟𝐗𝐘(𝝎)𝐟𝐘𝐗(𝝎)𝐟𝐘𝐘(𝝎)],\displaystyle\mathbf{f_{ZZ}}(\boldsymbol{\omega})=\left[\begin{array}[]{ll}\mathbf{f_{XX}}(\boldsymbol{\omega})&\mathbf{f_{XY}}(\boldsymbol{\omega})\\ \mathbf{f_{YX}}(\boldsymbol{\omega})&\mathbf{f_{YY}}(\boldsymbol{\omega})\end{array}\right],

where 𝐟𝐗𝐗(𝝎)\mathbf{f_{XX}}(\boldsymbol{\omega}) is the p×pp\times p autospectral matrix of {𝐗t}\{\mathbf{X}_{t}\}, 𝐟𝐘𝐘(𝝎)\mathbf{f_{YY}}(\boldsymbol{\omega}) is the q×qq\times q autospectral matrix of {𝐘t}\{\mathbf{Y}_{t}\}, and 𝐟𝐗𝐘(𝝎)\mathbf{f_{XY}}(\boldsymbol{\omega}) is the p×qp\times q cross-spectral matrix between {𝐗t}\{\mathbf{X}_{t}\} and {𝐘t}\{\mathbf{Y}_{t}\}. Given vectors 𝐚p\mathbf{a}\in\mathbb{C}^{p} and 𝐛q\mathbf{b}\in\mathbb{C}^{q}, such that 𝐚𝐟𝐗𝐗(𝝎)𝐚=𝐛𝐟𝐘𝐘(𝝎)𝐛=1\mathbf{a}^{\top}\mathbf{f_{XX}}(\boldsymbol{\omega})\mathbf{a}=\mathbf{b}^{\top}\mathbf{f_{YY}}(\boldsymbol{\omega})\mathbf{b}=1, the canonical coherence at frequency 𝝎\boldsymbol{\omega} is

𝝆(𝝎)=max𝐚,𝐛|𝐚𝐟𝐗𝐘(𝝎)𝐛𝐚𝐟𝐗𝐗(𝝎)𝐚𝐛𝐟𝐘𝐘(𝝎)𝐛|2.\displaystyle\boldsymbol{\rho(\omega)}=\max_{\mathbf{a},\mathbf{b}}\left|\frac{\mathbf{a}^{\top}\mathbf{f_{XY}}(\boldsymbol{\omega})\mathbf{b}}{\sqrt{\mathbf{a}^{\top}\mathbf{f_{XX}}(\boldsymbol{\omega})\mathbf{a}}{\sqrt{\mathbf{b}^{\top}\mathbf{f_{YY}}(\boldsymbol{\omega})\mathbf{b}}}}\right|^{2}. (5)

By solving the maximization problem in equation (5), the canonical coherence vectors 𝐚\mathbf{a} and 𝐛\mathbf{b} are determined, leading to the quantification of the canonical coherence at frequency 𝝎\boldsymbol{\omega}.

Note the classic canonical coherence in (5) completely ignores temporal dynamics, a consequence of the stationarity assumption where dependence between clusters is imposed to remain constant over time, whilst most real-world data, such as EEG, exhibit nonstationarity (Huang et al., 2004; Knight et al., 2024; West et al., 1999). Hence the lack of time-localization information in the above approach may result in misleading results and a novel method capable of capturing time-varying canonical coherence is needed.

3 Wavelet canonical coherence (WaveCanCoh)

Our WaveCanCoh framework is built upon the multivariate locally stationary wavelet (MvLSW) process (Nason et al. (2000), Park et al. (2014)), which is a model based on wavelet analysis for time series. A brief overview of wavelets and highlight of their differences from Fourier-based methods are provided in Appendix A.

A new representation for discretely sampled nonstationary time series based on discrete non-decimated wavelets is the locally stationary wavelet process introduced by Nason et al. (2000), later extended to a multivariate framework in Park et al. (2014). A PP-variate stochastic process with time evolving second-order structure, 𝐗t=(Xt(1),Xt(2),,Xt(P))\mathbf{X}_{t}=\left(X_{t}^{(1)},X_{t}^{(2)},\ldots,X_{t}^{(P)}\right)^{\top}, where t=1,,Tt=1,\ldots,T, can be represented with the MvLSW formulation

𝐗t=j=1k𝐕j(k/T)ψj,k(t)𝐳j,k,\displaystyle\mathbf{X}_{t}=\sum_{j=1}^{\infty}\sum_{k\in\mathbb{Z}}\mathbf{V}_{j}(k/T)\psi_{j,k}(t)\mathbf{z}_{j,k},

where 𝐕j(k/T)\mathbf{V}_{j}(k/T) is a P×PP\times P transfer function matrix assumed to have a lower-triangular form; {ψj,k}j,k\left\{\psi_{j,k}\right\}_{j,k} is a set of discrete non-decimated wavelets; {𝐳j,k}j,k\left\{\mathbf{z}_{j,k}\right\}_{j,k} is a set of P×1P\times 1 uncorrelated random vectors with (column) mean vector 𝟎\mathbf{0} and P×PP\times P identity covariance matrix. Since the wavelet basis ψj,k(t)\psi_{j,k}(t) is localized in both time and frequency, the transfer matrix 𝐕j(k/T)\mathbf{V}_{j}(k/T) provides a measure of the time-varying contribution to the variance among channels at a specific scale jj and rescaled time u=k/Tu=k/T, thus enabling the statistical properties of the process {𝐗t}\{\mathbf{X}_{t}\} to change smoothly over time.

The time-varying statistical properties of {𝐗t}\{\mathbf{X}_{t}\} can be captured through the localized, scale-specific local wavelet spectral matrix (LWS, Park et al. (2014)), 𝐒j(u)\mathbf{S}_{j}(u), defined at scale jj and rescaled time u(0,1)u\in(0,1), as

𝐒j(u)=𝐕j(u)𝐕j(u).\displaystyle\mathbf{S}_{j}(u)=\mathbf{V}_{j}(u)\mathbf{V}_{j}^{\top}(u). (6)

Note 𝐒j(u)\mathbf{S}_{j}(u) is a P×PP\times P symmetric, positive semi-definite matrix and its (p,q)(p,q) entry, Sj(p,q)(u)S_{j}^{(p,q)}(u), denotes the cross-spectrum between channels pp and qq. We now extend the LWS matrix construction from a single set of multivariate time series to a cross-group LSW matrix, between 𝐗t=(Xt(1),,Xt(P))\mathbf{X}_{t}=\left(X_{t}^{(1)},\ldots,X_{t}^{(P)}\right)^{\top} and 𝐘t=(Yt(1),,Yt(Q))\mathbf{Y}_{t}=\left(Y_{t}^{(1)},\ldots,Y_{t}^{(Q)}\right)^{\top}. Denoting 𝐙t=(𝐗t,𝐘t)\mathbf{Z}_{t}=\left(\mathbf{X}_{t}^{\top},\mathbf{Y}_{t}^{\top}\right)^{\top}, the LWS matrix of {𝐙t}\{\mathbf{Z}_{t}\} at scale jj and rescaled time uu, 𝐒j;𝐙𝐙(u)\mathbf{S}_{j;\mathbf{ZZ}}(u), is

𝐒j;𝐙𝐙(u)=𝐕j;𝐙(u)𝐕j;𝐙(u)=[𝐒j;𝐗𝐗(u)𝐒j;𝐗𝐘(u)𝐒j;𝐘𝐗(u)𝐒j;𝐘𝐘(u)].\displaystyle\mathbf{S}_{j;\mathbf{ZZ}}(u)=\mathbf{V}_{j;\mathbf{Z}}(u)\mathbf{V}_{j;\mathbf{Z}}^{\top}(u)=\left[\begin{array}[]{ll}\mathbf{S}_{j;\mathbf{XX}}(u)&\mathbf{S}_{j;\mathbf{XY}}(u)\\ \mathbf{S}_{j;\mathbf{YX}}(u)&\mathbf{S}_{j;\mathbf{YY}}(u)\end{array}\right]. (9)

In equation (9), 𝐕j;𝐙(u)\mathbf{V}_{j;\mathbf{Z}}(u) denotes the (P+Q)×(P+Q)(P+Q)\times(P+Q) transfer function matrix of the MvLSW process {𝐙t}\{\mathbf{Z}_{t}\}, and 𝐒j;𝐙𝐙(u)\mathbf{S}_{j;\mathbf{ZZ}}(u) is its corresponding LWS matrix. The main diagonal blocks 𝐒j;𝐗𝐗(u)\mathbf{S}_{j;\mathbf{XX}}(u) (P×P)(P\times P) and 𝐒j;𝐘𝐘(u)\mathbf{S}_{j;\mathbf{YY}}(u) (Q×Q)(Q\times Q) denote the auto-LWS matrices of the {𝐗t}\{\mathbf{X}_{t}\} and {𝐘t}\{\mathbf{Y}_{t}\} processes, respectively, while 𝐒j;𝐗𝐘(u)\mathbf{S}_{j;\mathbf{XY}}(u) and 𝐒j;𝐘𝐗(u)\mathbf{S}_{j;\mathbf{YX}}(u) denote their cross-LWS matrices.

Remark 2: In equation (9), 𝐒j;𝐗𝐘(u)\mathbf{S}_{j;\mathbf{XY}}(u) is a P×QP\times Q matrix at each time point, and the (p,q)(p,q) element gives the cross-spectrum between channel pp of {𝐗t}\{\mathbf{X}_{t}\} and channel qq of {𝐘t}\{\mathbf{Y}_{t}\}. Moreover, it is easy to show that 𝐒j;𝐗𝐘(u)=𝐒j;𝐘𝐗(u)\mathbf{S}_{j;\mathbf{XY}}(u)=\mathbf{S}_{j;\mathbf{YX}}^{\top}(u).

The LSW matrix quantifies the localized contributions to the process variance for individual and cross-channels, which motivates us to next define the localized canonical coherence between two sets of locally stationary time series at a specific scale (corresponding to a determined frequency band).

Definition 1 (Localized Scale-specific Wavelet Canonical Coherence)
Let 𝐗t=(Xt(1),,Xt(P))\mathbf{X}_{t}=\left(X_{t}^{(1)},\ldots,X_{t}^{(P)}\right)^{\top} and 𝐘t=(Yt(1),,Yt(Q))\mathbf{Y}_{t}=\left(Y_{t}^{(1)},\ldots,Y_{t}^{(Q)}\right)^{\top}, where t=1,,Tt=1,\ldots,T, be (jointly) multivariate locally stationary time series. We define the localized scale-specific wavelet canonical coherence (WaveCanCoh) between {𝐗t}\{\mathbf{X}_{t}\} and {𝐘t}\{\mathbf{Y}_{t}\}, at scale jj and rescaled time uu, as

𝝆j;𝐗𝐘(u)=max𝐚j(u),𝐛j(u){𝐚j(u)𝐒j;𝐗𝐘(u)𝐛j(u)}2,\displaystyle\boldsymbol{\rho}_{j;\mathbf{XY}}(u)=\max_{\mathbf{a}_{j}(u),\mathbf{b}_{j}(u)}\left\{\mathbf{a}_{j}^{\top}(u)\mathbf{S}_{j;\mathbf{XY}}(u)\mathbf{b}_{j}(u)\right\}^{2}, (10)

where 𝐚j(u)=(𝐚j(p)(u))p=1P\mathbf{a}^{\top}_{j}(u)=\left(\mathbf{a}_{j}^{(p)}(u)\right)_{p=1}^{P} is a 1×P1\times P vector and 𝐛j(u)=(𝐛j(q)(u))q=1Q\mathbf{b}^{\top}_{j}(u)=\left(\mathbf{b}_{j}^{(q)}(u)\right)_{q=1}^{Q} is a 1×Q1\times Q vector, representing the localized canonical coherence vectors of {𝐗t}\{\mathbf{X}_{t}\} and {𝐘t}\{\mathbf{Y}_{t}\}, respectively. The constraints here are 𝐚j(u)𝐒j;𝐗𝐗(u)𝐚j(u)=1\mathbf{a}_{j}^{\top}(u)\mathbf{S}_{j;\mathbf{XX}}(u)\mathbf{a}_{j}(u)=1 and 𝐛j(u)𝐒j;𝐘𝐘(u)𝐛j(u)=1\mathbf{b}_{j}^{\top}(u)\mathbf{S}_{j;\mathbf{YY}}(u)\mathbf{b}_{j}(u)=1.

Remark 3: The WaveCanCoh time-dependent trace 𝛒j;𝐗𝐘()\boldsymbol{\rho}_{j;\mathbf{XY}}(\cdotp) measures the ‘global’ coherence between {𝐗t}\{\mathbf{X}_{t}\} and {𝐘t}\{\mathbf{Y}_{t}\} at scale jj, and takes values between 0 and 1. A value close to 1 indicates strong linear dependence, while a value close to 0 shows little to no linear dependence. Furthermore, 𝐚j(p)()\mathbf{a}_{j}^{(p)}(\cdotp) and 𝐛j(q)()\mathbf{b}_{j}^{(q)}(\cdotp) represent the localized contributions from the (p,q)(p,q) channels to 𝛒j;𝐗𝐘()\boldsymbol{\rho}_{j;\mathbf{XY}}(\cdotp).

The canonical coherence vectors 𝐚j()\mathbf{a}_{j}(\cdotp), 𝐛j()\mathbf{b}_{j}(\cdotp) can be obtained by maximizing (10) and the solution can be found by solving the eigenvalue and eigenvector problem associated with the following matrices

𝐌j;𝐚(u)\displaystyle\mathbf{M}_{j;\mathbf{a}}(u) =𝐒j,𝐗𝐗1(u)𝐒j,𝐗𝐘(u)𝐒j,𝐘𝐘1(u)𝐒j,𝐘𝐗(u),\displaystyle=\mathbf{S}_{j,\mathbf{XX}}^{-1}(u)\mathbf{S}_{j,\mathbf{XY}}(u)\mathbf{S}_{j,\mathbf{YY}}^{-1}(u)\mathbf{S}_{j,\mathbf{YX}}(u), (11)
𝐌j;𝐛(u)\displaystyle\mathbf{M}_{j;\mathbf{b}}(u) =𝐒j,𝐘𝐘1(u)𝐒j,𝐘𝐗(u)𝐒j,𝐗𝐗1(u)𝐒j,𝐗𝐘(u).\displaystyle=\mathbf{S}_{j,\mathbf{YY}}^{-1}(u)\mathbf{S}_{j,\mathbf{YX}}(u)\mathbf{S}_{j,\mathbf{XX}}^{-1}(u)\mathbf{S}_{j,\mathbf{XY}}(u). (12)

Denote by Λj;𝐚(k)(u)\Lambda_{j;\mathbf{a}}^{(k)}(u) the kk-th largest eigenvalue of matrix 𝐌j;𝐚(u)\mathbf{M}_{j;\mathbf{a}}(u) in equation (11), and by Λj;𝐛(l)(u)\Lambda_{j;\mathbf{b}}^{(l)}(u) the ll-th largest eigenvalue of matrix 𝐌j;𝐛(u)\mathbf{M}_{j;\mathbf{b}}(u) in equation (12), for k,l=1,,min(P,Q)k,l=1,\ldots,\min(P,Q). An important observation is that 𝐌j;𝐚(u)\mathbf{M}_{j;\mathbf{a}}(u) and 𝐌j;𝐛(u)\mathbf{M}_{j;\mathbf{b}}(u) share the same eigenvalues (see Appendix B for details), hence denoting by Λj(1)(u)=Λj;𝐚(1)(u)=Λj;𝐛(1)(u)\Lambda_{j}^{(1)}(u)=\Lambda_{j;\mathbf{a}}^{(1)}(u)=\Lambda_{j;\mathbf{b}}^{(1)}(u) their largest eigenvalue, the canonical coherence between {𝐗t}\{\mathbf{X}_{t}\} and {𝐘t}\{\mathbf{Y}_{t}\} at rescaled time uu, as defined in equation (10), becomes

𝝆j;𝐗𝐘(u)=Λj(1)(u).\displaystyle\boldsymbol{\rho}_{j;\mathbf{XY}}(u)=\Lambda_{j}^{(1)}(u). (13)

The eigenvectors of 𝐌j;𝐚(u)\mathbf{M}_{j;\mathbf{a}}(u) and 𝐌j;𝐛(u)\mathbf{M}_{j;\mathbf{b}}(u) corresponding to Λj(1)(u)\Lambda_{j}^{(1)}(u) provide the solutions to the canonical directions of {𝐗t}\{\mathbf{X}_{t}\} and {𝐘t}\{\mathbf{Y}_{t}\}, respectively. Proof: see Appendix B.

An important extension of our framework is the incorporation of leading-lag relationships into the scale-specific wavelet canonical coherence (WaveCanCoh). To account for potential causal effects, we define a lagged joint process, 𝐙t(h)=(𝐗t,𝐘t+h)\mathbf{Z}_{t}(h)=\left(\mathbf{X}^{\top}_{t},\mathbf{Y}^{\top}_{t+h}\right)^{\top}, where hh is the value of lag, and t=1,,Tht=1,\ldots,T-h for h>0h>0. We define the LWS matrix of {𝐙t(h)}\{\mathbf{Z}_{t}(h)\} at scale jj, as

𝐒j;𝐙𝐙(u,h)=𝐕j;𝐙(u,h)𝐕j;𝐙(u,h)=[𝐒j;𝐗𝐗(u)𝐒j;𝐗𝐘(u,h)𝐒j;𝐘𝐗(u+(h/T),h)𝐒j;𝐘𝐘(u+(h/T))]\displaystyle\mathbf{S}_{j;\mathbf{ZZ}}(u,h)=\mathbf{V}_{j;\mathbf{Z}}(u,h)\mathbf{V}_{j;\mathbf{Z}}^{\top}(u,h)=\left[\begin{array}[]{ll}\mathbf{S}_{j;\mathbf{XX}}(u)&\mathbf{S}_{j;\mathbf{XY}}(u,h)\\ \mathbf{S}_{j;\mathbf{YX}}(u+(h/T),-h)&\mathbf{S}_{j;\mathbf{YY}}(u+(h/T))\end{array}\right] (16)

where 𝐒j;𝐗𝐘(u,h)\mathbf{S}_{j;\mathbf{XY}}(u,h) denotes the cross-LWS matrix between 𝐗[uT]\mathbf{X}_{[uT]} and 𝐘[uT]+h\mathbf{Y}_{[uT]+h}, capturing the interaction between current values of 𝐗\mathbf{X} and future values of 𝐘\mathbf{Y}. Based on this construction, we can define and estimate the lagged version of WaveCanCoh, enabling us to infer potential causal relationships between two groups of time series.

Definition 2 (Causal Localized Scale-specific Wavelet Canonical Coherence)
The causal localized scale-specific canonical coherence (Causal-WaveCanCoh) between {𝐗t}\{\mathbf{X}_{t}\} and {𝐘t}\{\mathbf{Y}_{t}\} with lag hh (or, 𝐗t𝐘t+h\mathbf{X}_{t}\rightarrow\mathbf{Y}_{t+h}), at scale jj and rescaled time uu, is defined as

𝝆j;𝐗𝐘(u,h)=max𝐚j(u),𝐛j(u){𝐚j(u)𝐒j;𝐗𝐘(u,h)𝐛j(u+(h/T))}2,\displaystyle\boldsymbol{\rho}_{j;\mathbf{XY}}(u,h)=\max_{\mathbf{a}_{j}(u),\mathbf{b}_{j}(u)}\left\{\mathbf{a}_{j}^{\top}(u)\mathbf{S}_{j;\mathbf{XY}}(u,h)\mathbf{b}_{j}(u+(h/T))\right\}^{2}, (17)

where the notations and constraints are the same as in the standard WaveCanCoh framework. This extension allows for a scale- and time-specific evaluation of causal overall association between two sets of time series, enhancing interpretability in dynamic, multivariate, and nonstationary settings.

The framework above allows us to capture the time-varying overall association between two sets of multivariate time series, as well as the time-varying contributions from each individual channel within these sets. However, a natural consideration is how to project this time-varying coherence at each scale jj into the frequency domain in a manner consistent with the Fourier-based method described in Section 2, as a key concern in many analyses is to interpret the results in the frequency domain. As mentioned earlier, each scale in the wavelet analysis corresponds approximately, but not exactly, to a specific frequency band. This correspondence is governed by the unique filtering mechanism of wavelets, an explanation for this corresponding relationship is provided in Appendix A.

4 Estimation procedure

In Section 3, we developed a rigorous framework that allowed us to introduce the localized, scale-specific wavelet canonical coherence. In this section, we propose a well-behaved estimation procedure for quantifying the canonical coherence and corresponding canonical vectors. We start by estimating the local wavelet spectrum (LWS) matrices in equations (11) and (12) in the spirit of Park et al. (2014), given by

𝐒^j,k=l=1JAjl1𝐈~l,k, where 𝐈~l,k=12M+1m=MM𝐈l,k+m is the smoothed periodogram ,\displaystyle\widehat{\mathbf{S}}_{j,k}=\sum_{l=1}^{J}A_{jl}^{-1}\tilde{\mathbf{I}}_{l,k},\mbox{ where }\tilde{\mathbf{I}}_{l,k}=\frac{1}{2M+1}\sum_{m=-M}^{M}\mathbf{I}_{l,k+m}\mbox{ is the smoothed periodogram }, (18)

kk represents the shift of the wavelet function and is equivalent to time k=[uT]k=[uT] in our context, and MM is the half-width of the rectangular smoothing kernel, controlling the amount of temporal smoothing. The matrix 𝐈l,k\mathbf{I}_{l,k} is the raw periodogram at scale ll and time kk, obtained as

𝐈l,k=𝐝l,k𝐝l,k, where 𝐝l,k=t=0T𝐗tψl,k(t) is the empirical wavelet coefficient vector.\displaystyle\mathbf{I}_{l,k}=\mathbf{d}_{l,k}\mathbf{d}_{l,k}^{\top},\text{ where }\mathbf{d}_{l,k}=\sum_{t=0}^{T}\mathbf{X}_{t}\psi_{l,k}(t)\mbox{ is the empirical wavelet coefficient vector}.

This multistep procedure yields consistent estimators of the LWS matrices under the asymptotic conditions T,MT,M\rightarrow\infty and M/T0M/T\rightarrow 0 (Park et al., 2014). We propose the following estimator for the scale-specific wavelet canonical coherence (WaveCanCoh)

𝝆^j;𝐗𝐘(u)=Λ^j(1)(u),\displaystyle\widehat{\boldsymbol{\rho}}_{j;\mathbf{XY}}(u)=\widehat{\Lambda}_{j}^{(1)}(u), (19)

where Λ^j(1)(u)\widehat{\Lambda}_{j}^{(1)}(u) is the largest eigenvalue of 𝐌^j;𝐚(u)\widehat{\mathbf{M}}_{j;\mathbf{a}}(u) and 𝐌^j;𝐛(u)\widehat{\mathbf{M}}_{j;\mathbf{b}}(u), defined as

𝐌^j;𝐚(u)\displaystyle\widehat{\mathbf{M}}_{j;\mathbf{a}}(u) =𝐒^j,𝐗𝐗1(u)𝐒^j,𝐗𝐘(u)𝐒^j,𝐘𝐘1(u)𝐒^j,𝐘𝐗(u),\displaystyle=\widehat{\mathbf{S}}_{j,\mathbf{XX}}^{-1}(u)\widehat{\mathbf{S}}_{j,\mathbf{XY}}(u)\widehat{\mathbf{S}}_{j,\mathbf{YY}}^{-1}(u)\widehat{\mathbf{S}}_{j,\mathbf{YX}}(u), (20)
𝐌^j;𝐛(u)\displaystyle\widehat{\mathbf{M}}_{j;\mathbf{b}}(u) =𝐒^j,𝐘𝐘1(u)𝐒^j,𝐘𝐗(u)𝐒^j,𝐗𝐗1(u)𝐒^j,𝐗𝐘(u).\displaystyle=\widehat{\mathbf{S}}_{j,\mathbf{YY}}^{-1}(u)\widehat{\mathbf{S}}_{j,\mathbf{YX}}(u)\widehat{\mathbf{S}}_{j,\mathbf{XX}}^{-1}(u)\widehat{\mathbf{S}}_{j,\mathbf{XY}}(u). (21)

The estimated localized, scale-specific canonical direction vectors 𝐚^j(u)\widehat{\mathbf{a}}_{j}(u) and 𝐛^j(u)\widehat{\mathbf{b}}_{j}(u) are the eigenvectors of 𝐌^j;𝐚(u)\widehat{\mathbf{M}}_{j;\mathbf{a}}(u) and 𝐌^j;𝐛(u)\widehat{\mathbf{M}}_{j;\mathbf{b}}(u) respectively, associated with Λ^j(1)(u)\widehat{\Lambda}_{j}^{(1)}(u). These quantities provide estimates of the time-varying global coherence and the channel-specific contributions at scale jj. The proposed estimators are consistent with the true quantities they aim to approximate, provided certain asymptotic conditions are met. These include increasing sample size and appropriate smoothing bandwidth, ensuring reliable estimation in the limit. Proof: See Appendix B.

Algorithm 1 summarizes the estimation procedure for WaveCanCoh and its results can be further used to investigate the temporal channel contributions to the global association, at a particular scale.

Algorithm 1 Proposed WaveCanCoh estimation algorithm for nonstationary time series
Suppose the observed data are two sets of multivariate locally stationary time series, denoted as 𝐗t=(Xt(1),,Xt(P))\mathbf{X}_{t}=\left(X_{t}^{(1)},\ldots,X_{t}^{(P)}\right)^{\top} and 𝐘t=(Yt(1),,Yt(Q))\mathbf{Y}_{t}=\left(Y_{t}^{(1)},\ldots,Y_{t}^{(Q)}\right)^{\top}, observed for t={1,,T}t=\{1,\ldots,T\}.
1. Fuse: fuse the data to a new (P+Q)(P+Q)-variate time series denoted as 𝐙t\mathbf{Z}_{t}, with 𝐙t=(𝐗t,𝐘t)\mathbf{Z}_{t}=\left(\mathbf{X}_{t}^{\top},\mathbf{Y}_{t}^{\top}\right)^{\top}.
(This fused representation allows for the joint analysis of the two multivariate processes within a unified framework. The Causal-WaveCanCoh can also be estimated by appropriately incorporating the leading-lag into the existing WaveCanCoh estimation procedure.)
2. Spectral estimation: estimate the LWS matrix of {𝐙t}\{\mathbf{Z}_{t}\} using equation (18). Denote the estimator as 𝐒^j;𝐙𝐙(u)\widehat{\mathbf{S}}_{j;\mathbf{ZZ}}(u) for any rescaled time u(0,1)u\in(0,1). The estimated auto- and cross-LWS between {𝐗t}\{\mathbf{X}_{t}\} and {𝐘t}\{\mathbf{Y}_{t}\}, denoted as 𝐒^j;𝐗𝐗(u)\widehat{\mathbf{S}}_{j;\mathbf{XX}}(u), 𝐒^j;𝐘𝐘(u)\widehat{\mathbf{S}}_{j;\mathbf{YY}}(u), 𝐒^j;𝐗𝐘(u)\widehat{\mathbf{S}}_{j;\mathbf{XY}}(u), 𝐒^j;𝐘𝐗(u)\widehat{\mathbf{S}}_{j;\mathbf{YX}}(u), can be obtained by partitioning 𝐒^j;𝐙𝐙(u)\widehat{\mathbf{S}}_{j;\mathbf{ZZ}}(u) into four submatrices as illustrated in equation (9).
3. Eigendecomposition: compute the matrices 𝐌^j;𝐚(u)\widehat{\mathbf{M}}_{j;\mathbf{a}}(u) and 𝐌^j;𝐛(u)\widehat{\mathbf{M}}_{j;\mathbf{b}}(u) in equations (20)- (21), then perform their eigendecompositions and obtain their (common) largest eigenvalue Λ^j(1)(u)\widehat{\Lambda}_{j}^{(1)}(u). This will serve as the estimated WaveCanCoh in (19), 𝝆^j;𝐗𝐘(u)=Λ^j(1)(u)\widehat{\boldsymbol{\rho}}_{j;\mathbf{XY}}(u)=\widehat{\Lambda}_{j}^{(1)}(u), while its corresponding eigenvectors of 𝐌^j;𝐚(u)\widehat{\mathbf{M}}_{j;\mathbf{a}}(u) and 𝐌^j;𝐛(u)\widehat{\mathbf{M}}_{j;\mathbf{b}}(u) give the canonical direction vectors, 𝐚^j(u)\widehat{\mathbf{a}}_{j}(u) and 𝐛^j(u)\widehat{\mathbf{b}}_{j}(u).

5 Simulation study

In this section, we implement the proposed framework using simulated data under two distinct scenarios, one adhering to the MvLSW assumptions underpinning our method, while the other introduces nonstationarity without strictly satisfying the MvLSW assumptions. These setups allow us to validate both the theoretical soundness and empirical performance of the proposed approach, as well as to assess its robustness and practical applicability in real-world scenarios where model assumptions may be violated. To further evaluate performance, we also compare the results with those obtained from the classical Fourier-based canonical coherence approach.

MvLSW-based simulation. We generate the multivariate time series {𝐙t}\{\mathbf{Z}_{t}\} from a MvLSW process with P=6,Q=4P=6,\,Q=4, observed across T=1024T=1024 time points. The process is constructed using non-decimated Haar wavelets, with non-zero spectral structure specified at scale j=2j=2, as detailed in Appendix C.1. We impose a weaker dependence structure between {𝐗t}\{\mathbf{X}_{t}\} and {𝐘t}\{\mathbf{Y}_{t}\} in the interval 0<u<0.50<u<0.5, and a stronger dependence in the interval 0.5<u<10.5<u<1, allowing us to examine the framework’s sensitivity to changes in cross-group coherence. Using the process realization (Figure 9) we estimate WaveCanCoh using Algorithm 1. To assess the estimation accuracy and account for variability, we replicate the simulation and estimation process 1000 times. At each time point, we compute the average of the estimated WaveCanCoh across the replicates and construct a 95% Wald confidence interval using the empirical variance. Figure 2 (left) demonstrates that the proposed WaveCanCoh method accurately tracks the true coherence structure and effectively reflects its time-varying nature, while the estimated canonical direction vectors in Figure 3 map the temporal and individual channel heterogeneity in their roles within the multivariate dependence structure.

Refer to caption
Refer to caption
Figure 2: Left: Estimated wavelet canonical coherence at scale j=2j=2 over 1000 MvLSW replications. The solid line shows the average estimated coherence, the dashed line is the true coherence computed from the specified spectrum. Right: Estimated canonical coherence over 500 replicates of the AR(2) mixture using WaveCanCoh and LSP at scale j=1j=1 and ω[25,50]Hz\omega\in[25,50]Hz, respectively. Shaded areas indicate the corresponding 95% Wald confidence interval.
Refer to caption
Refer to caption
Figure 3: Estimated time-varying canonical direction vectors 𝐚^2()\widehat{\mathbf{a}}_{2}(\cdotp) (left) and 𝐛^2()\widehat{\mathbf{b}}_{2}(\cdotp) (right).

Mixture of AR(2)-based processes. To evaluate the robustness and generality of the proposed framework, we investigate WaveCanCoh using synthetic data generated from a mixture of AR(2) processes. Unlike the MvLSW-based simulations, this setting introduces nonstationary dynamics without any wavelet-based structure, providing a more flexible and realistic test scenario. To the best of our knowledge, WaveCanCoh is the first framework designed to estimate time-varying canonical coherence between two multivariate time series groups. However, to benchmark its performance, we compare it against a method based on the time-varying Cramér representation (Dahlhaus, 1997), with canonical coherence estimated via STFT-based localized spectra (Allen and Rabiner, 1977). We simulate 500 replicates of 𝐗t4{\mathbf{X}_{t}}\in\mathbb{R}^{4} and 𝐘t3{\mathbf{Y}_{t}}\in\mathbb{R}^{3}, with T=1024T=1024, each formed by mixing five latent AR(2) sources tuned to neural frequency bands. In the first half, shared gamma (3050Hz30-50Hz) bands induce cross-group coherence, while the second half contains no shared structure (see details in Appendix C.2). Figure 2 (right) illustrates that while both methods detect the existence of coherence in the first half, only WaveCanCoh captures its sharp drop and true behaviour in the second half, thus demonstrating its advantage in identifying transient, localized changes that global Fourier-based methods fail to detect.

6 Local field potential (LFP) data analysis

To demonstrate the practical utility of our proposed WaveCanCoh framework, we analyze LFP activity recorded from the hippocampus of rats engaged in a sequence memory task (Allen et al., 2016; Shahbaba et al., 2022). The data were recorded using a 22-electrode microdrive implanted in the CA1 subregion to capture high-resolution LFP signals across all channels at a sampling rate of 1000Hz1000Hz. In this task, rats were tested on their memory of a sequence of five odors (odors ABCDE). Each odor was presented for \sim1.2 second (ss) and a variable delay of \sim5ss separated each odor (see Figure 1). For each trial (i.e., each odor presentation), the rat had to judge whether the odor was presented "in sequence" (e.g., ABC…) or "out of sequence" (e.g., ABD…) and indicate their decision by holding their nosepoke response until a tone signal (at 1.2ss) or withdrawing before the signal, respectively. Correct-response trials (i.e., correct "in sequence" or "out of sequence" decisions) were rewarded. LFP activity data were recorded over a 4ss period (T=4000T=4000 time points) per trial, with t=0t=0 marking the moment the rat initiated a nosepoke to receive the odor stimulus. This paradigm provides a well-controlled setting to investigate dynamic, time-varying functional interactions in the hippocampus during memory-guided decisions. We employ WaveCanCoh framework to quantify frequency-specific functional coherence between two groups of hippocampal electrodes (T1, T2, T4, T5 and T13–T17), and to examine how coherence patterns differ between correct- and incorrect-response trials ("in sequence" trials only). Specifically, we analyze LFP data from the rat Mitt, which included 40 correct-response trials and 32 incorrect-response trials. Figure 4 presents the estimated wavelet canonical coherence at scale j=5j=5, corresponding to the 15.62531.25Hz15.625-31.25Hz frequency band. The results, averaged across trials for each condition, reveal dynamic changes in inter-regional coherence, with a pronounced peak around the time of odor stimulus delivery (t=0t=0). Notably, distinct patterns emerge between correct and incorrect trials, suggesting that coherent activity in this frequency band may play a role in supporting successful memory retrieval and decision making. More results for several other scales can be found in Figure 10 in Appendix D.1.

Refer to caption
Figure 4: Estimated wavelet canonical coherence between two hippocampal subregions (T1, T2, T4, T5 vs. T13–T17) in one subject (Mitt) at scale j=5(15.62531.25Hz)j=5\,(15.625\text{\textendash}31.25Hz). The estimates are averaged across 40 correct- and 32 incorrect-response trials, using a rectangular smoothing window of 0.2s0.2s.

To further interpret the coherence patterns, Figure 5 provides a spatial summary of the canonical coherence between the two electrode groups at several selected time points. The double-headed arrows represent the magnitude of estimated coherence between the two regions, while the numbers in the circles reflect the individual channel contributions to the global coherence, derived from the elements of the canonical vectors 𝐚5()\mathbf{a}_{5}(\cdotp) and 𝐛5()\mathbf{b}_{5}(\cdotp).

Refer to caption
Figure 5: Spatio-temporal illustration of WaveCanCoh between hippocampal regions (T1, T2, T4, T5 and T13–T17) in rat Mitt at scale j=5(15.62531.25Hz)j=5\,(15.625\text{\textendash}31.25Hz). Arrows indicate the magnitude of inter-region coherence at selected time points. Numbers inside circles represent the channel-wise relative contributions to the canonical coherence for each region, for correct- and incorrect-responses (negative values denote the corresponding channels have contributions to negative correlations among the two regions).

The results highlight that both the strength and structure of inter-regional coherence vary dynamically over time and differ across trial outcomes, reflecting the nonstationary nature of neural interactions during task performance. Specifically, incorrect-response trials exhibit lower coherence at time points immediately following the odor stimulus delivery and are often driven by a few dominant channels, while correct trials show higher coherence values and relatively balanced contributions across channels. These findings illustrate the necessity of a framework like WaveCanCoh, which simultaneously identifies time-varying and scale-specific dependence structures between multivariate time series. Traditional stationary or pairwise approaches would fail to capture such nuanced dynamics, highlighting the need for a method like WaveCanCoh to extract such complex relationships in brain activity. The directed interactions between brain regions are also explored in Appendix E with proposed Causal-WaveCanCoh framework.

To further validate the existence of significant differences in the activity between correct- and incorrect-response trials, we propose a time-specific detection procedure based on the permutation test to determine temporally localized differences in the wavelet canonical coherence at a given scale between conditions, while maintaining the nonparametric nature of the statistical inference. The detailed inference procedure steps are shown in Algorithm 2 (Appendix D.2) and the results of the time-specific permutation test for WaveCanCoh across five wavelet scales and four selected time points are summarized in Table 1 (Appendix D.3). Statistically significant differences in canonical coherence between correct and incorrect trials emerge at time points following odor sampling (t=0t=0), predominantly at intermediate wavelet scales corresponding approximately to the 862Hz8-62Hz frequency range. On the other hand, no significant differences between the two groups are observed prior to stimulus onset, indicating that the coherence patterns distinguishing inter-regional communication among trial types are tightly linked to task engagement. These findings demonstrate the effectiveness of the proposed WaveCanCoh framework and associated permutation test in capturing localized, frequency-specific differences in neural coordination between behavioral conditions, thus offering a powerful tool for analyzing complex brain interactions.

7 Conclusions

In this paper we introduced a novel methodological framework, scale-specific wavelet canonical coherence (WaveCanCoh), designed to quantify the dynamic multiscale coherence between two sets of nonstationary multivariate time series. Our primary contributions include the rigorous definition of WaveCanCoh within the multivariate locally stationary wavelet (MvLSW) framework and the development of a comprehensive estimation and theoretically-backed inference procedure based on wavelet analysis. We validated our proposed methodology through simulation studies, demonstrating its accuracy in tracking true coherence structures. The application to local field potential (LFP) activity recorded from subregions of the hippocampus effectively showcased the capability of WaveCanCoh to identify nuanced spatio-temporal coherence patterns associated with cognitive performance, while our permutation-based inference procedure provided a robust, nonparametric approach for detecting significant coherence differences between conditions. Compared to existing stationary and Fourier-based canonical coherence methods, WaveCanCoh is shown to offer significant advantages, particularly through its ability to adaptively capture transient and time-localized interactions, hence it is highly suitable for analyzing signals in neuroscience and other fields with data exhibiting dynamic cross-group interactions.

References

  • Allen and Rabiner (1977) J.B. Allen and L.R. Rabiner. A unified approach to short-time fourier analysis and synthesis. Proceedings of the IEEE, 65(11):1558–1564, 1977.
  • Allen et al. (2016) Timothy A. Allen, Daniel M. Salz, Sam McKenzie, and Norbert J. Fortin. Nonspatial sequence coding in ca1 neurons. The Journal of Neuroscience, 36(5):1547–1563, 2016.
  • Box and Tiao (1977) G. E. P. Box and G. C. Tiao. A canonical analysis of multiple time series. Biometrika, 64(2):355–365, August 1977.
  • Brillinger (2001) D. R. Brillinger. Time Series: Data Analysis and Theory. SIAM, 2001.
  • Dahlhaus (1997) Rainer Dahlhaus. Fitting time series models to nonstationary processes. The Annals of Statistics, 25(1):1–37, February 1997.
  • Daubechies (1992) Ingrid Daubechies. Ten lectures on wavelets. SIAM, 1992.
  • Embleton et al. (2022) Jonathan Embleton, Marina I. Knight, and Hernando Ombao. Multiscale spectral modelling for nonstationary time series within an ordered multiple-trial experiment. Annals of Applied Statistics, 16(4):2774–2803, 2022.
  • Geweke (1982) John Geweke. Measurement of linear dependence and feedback between multiple time series. Journal of the American Statistical Association, 77(378):304–313, June 1982.
  • Grinsted et al. (2004) A. Grinsted, J. C. Moore, and S. Jevrejeva. Application of the cross wavelet transform and wavelet coherence to geophysical time series. Nonlinear Processes in Geophysics, 11(5-6):561–566, 2004.
  • Hotelling (1936) Harold Hotelling. Relations between two sets of variates. Biometrika, 28(3/4):321–377, 1936.
  • Huang et al. (2004) Hsiao-Yun Huang, Hernando Ombao, and David S. Stoffer. Discrimination and classification of nonstationary time series using the slex model. Journal of the American Statistical Association, 99(467):763–774, 2004.
  • Knight et al. (2024) Marina I. Knight, Matthew A. Nunes, and Jessica K. Hargreaves. Adaptive wavelet domain principal component analysis for nonstationary time series. Journal of Computational and Graphical Statistics, 33(3):941–954, 2024.
  • Mardia et al. (1979) K. V. Mardia, J. T. Kent, and J. M. Bibby. Multivariate Analysis. Academic Press, 1979.
  • Nason (2008) G. P. Nason. Wavelet Methods in Statistics with R. Springer New York, NY, 2008.
  • Nason et al. (2000) Guy P. Nason, Rainer Sachs, and Gerald Kroisandt. Wavelet processes and adaptive estimation of the evolutionary wavelet spectrum. Journal of the Royal Statistical Society: Series B (Statistical Methodology), 62(2):271–292, 2000.
  • Ombao and Pinto (2024) Hernando Ombao and Marco Pinto. Spectral dependence. Econometrics and Statistics, 32:122–159, 2024.
  • Park et al. (2014) Timothy Park, Idris Eckley, and Hernando Ombao. Estimating time-evolving partial coherence between signals via multivariate locally stationary wavelet processes. IEEE Transactions on Signal Processing, 62:5240–5250, 2014.
  • Shahbaba et al. (2022) Babak Shahbaba, Lu Li, Francesco Agostinelli, Meenakshi Saraf, Keiland W. Cooper, Daniel Haghverdian, George A. Elias, Pierre Baldi, and Norbert J. Fortin. Hippocampal ensembles represent sequential relationships among an extended sequence of nonspatial events. Nature Communications, 13(1):787, February 2022.
  • Talento et al. (2024) Mara Sherlin D. Talento, Sarbojit Roy, and Hernando C. Ombao. Kencoh: A ranked-based canonical coherence. arXiv preprint arXiv:2412.10521, December 2024.
  • Viduarre et al. (2019) C. Viduarre, G. Nolte, I.E.J. de Vries, M. Gómez, T.W. Boonstra, K.-R. Müller, A. Villringer, and V.V. Nikulin. Canonical maximization of coherence: A novel tool for investigation of neuronal interactions between two datasets. NeuroImage, 201:116009, November 2019.
  • West et al. (1999) Mike West, Raquel Prado, and Andrew D. Krystal. Evaluation and comparison of eeg traces: Latent structure in non-stationary time series. Journal of the American Statistical Association, 94(446):1083–1095, 1999.

Appendix

Appendix A Basic introduction to wavelets

Wavelet analysis provides a powerful framework for studying signals with both time- and scale-varying structure, making it particularly well-suited for nonstationary data. Unlike traditional Fourier-based methods, which decompose signals into global sinusoidal bases and therefore assume stationarity, wavelets enable localized, adaptive decompositions by projecting signals onto functions that are compact in both time and scale. This localization is achieved through two core operations: scaling, which adjusts the width of the wavelet to analyze different resolution levels, and shifting, which moves the wavelet across time to detect when features occur. Specifically, the wavelet functions at scale jj and shift kk, denoted by ψj,k\psi_{j,k}, are derived from a mother wavelet ψ\psi and defined as

ψj,k(t)=2j/2ψ(t2jk2j),j=1,,J,\displaystyle\psi_{j,k}(t)=2^{-j/2}\psi\left(\frac{t-2^{j}k}{2^{j}}\right),\quad j=1,\ldots,J,

where JJ represents the number of scales. Smaller scale jj corresponds to finer (high-frequency) resolution, and larger jj captures coarser (low-frequency) trends. A similar construction applies to the father wavelet, denoted by ϕj,k\phi_{j,k}, which serves as a scaling function (see Daubechies (1992) and Nason (2008) for more details on wavelets). Figure 6 illustrates the effect of scaling and shifting operations on the wavelet function. This ability to isolate both short-lived and long-term features distinguishes wavelet methods from Fourier analysis, allowing for nuanced investigation of nonstationary signals such as neural activity, where structure evolves dynamically across time and scale.

Refer to caption
Figure 6: Illustration of scaling and shifting of wavelet functions. Smaller scales (top) capture high-frequency, localized features, while larger scales (bottom) capture broader, low-frequency structures.

The correspondence between wavelet scale and signal frequency arises from the principle of multi-resolution analysis, which provides the foundational framework for wavelet construction and signal representation. Briefly, the discrete wavelet transform (DWT) decomposes a signal into frequency subbands via a dyadic filter bank architecture. At each level, the signal is passed through a pair of conjugate quadrature filters: a low-pass filter h[n]h[n] and a high-pass filter g[n]g[n], followed by downsampling by a factor of two. The low-pass branch yields approximation coefficients that retain coarse-scale information, while the high-pass branch produces detail coefficients that capture localized high-frequency variations. Specifically, given a discrete signal x[n]x[n], the approximation and detail coefficients at level jj are obtained by

aj[n]=kh[k]aj1[2nk],dj[n]=kg[k]aj1[2nk],\displaystyle a_{j}[n]=\sum_{k}h[k]a_{j-1}[2n-k],\quad d_{j}[n]=\sum_{k}g[k]a_{j-1}[2n-k],

where aj1a_{j-1} is the approximation from the previous level (with a0=xa_{0}=x). This process is iterated on the low-pass output, producing a multiscale representation in which each level isolates a specific frequency band. For a signal sampled at rate fsf_{s}, the detail coefficients at level jj correspond approximately to the frequency interval [fs2j+1,fs2j]\left[\frac{f_{s}}{2^{j+1}},\frac{f_{s}}{2^{j}}\right]. The orthogonality between subbands ensures perfect reconstruction and energy preservation, and the hierarchical filter bank provides a localized time-frequency analysis with increasingly coarse temporal resolution at lower frequencies. The explanation above offers an intuitive understanding of how components at each wavelet scale can be approximated to true frequency bands (see Figure 7 for an intuitive illustration). This approximation enables our framework to capture the overall association between two sets of locally stationary time series in both the temporal and frequency domains (more details can be found in Daubechies (1992)).

Refer to caption
Figure 7: Illustration of mapping relationship between wavelet scales and their approximated frequency bands.

Appendix B Theoretical proofs

Proof of solution to equation (10). Suppose that 𝐗t\mathbf{X}_{t} and 𝐘t\mathbf{Y}_{t} are two multivariate time series and for each time point tt, the goal is to find the vectors 𝐚j(u)\mathbf{a}_{j}(u) and 𝐛j(u)\mathbf{b}_{j}(u) that maximize the coherence,

𝝆j;𝐗𝐘(u)={𝐚j(u)𝐒j;𝐗𝐘(u)𝐛j(u)}2\displaystyle\boldsymbol{\rho}_{j;\mathbf{XY}}(u)=\left\{\mathbf{a}_{j}^{\top}(u)\mathbf{S}_{j;\mathbf{XY}}(u)\mathbf{b}_{j}(u)\right\}^{2}

subject to the normalization constraints described in the main paper

𝐚j(u)𝐒j;𝐗𝐗(u)𝐚j(u)=1,𝐛j(u)𝐒j;𝐘𝐘(u)𝐛j(u)=1.\displaystyle\mathbf{a}_{j}^{\top}(u)\mathbf{S}_{j;\mathbf{XX}}(u)\mathbf{a}_{j}(u)=1,\quad\mathbf{b}_{j}^{\top}(u)\mathbf{S}_{j;\mathbf{YY}}(u)\mathbf{b}_{j}(u)=1.

Assuming rescaled time uu and scale jj are fixed, we suppress them for clarity. To solve the above optimization problem, we set up the Lagrangian as

(𝐚,𝐛,λ1,λ2)=12(𝐚𝐒XY𝐛)2λ12(𝐚𝐒XX𝐚1)λ22(𝐛𝐒YY𝐛1)\displaystyle\mathcal{L}\left(\mathbf{a},\mathbf{b},\lambda_{1},\lambda_{2}\right)=\frac{1}{2}\left(\mathbf{a}^{\top}\mathbf{S}_{XY}\mathbf{b}\right)^{2}-\frac{\lambda_{1}}{2}\left(\mathbf{a}^{\top}\mathbf{S}_{XX}\mathbf{a}-1\right)-\frac{\lambda_{2}}{2}\left(\mathbf{b}^{\top}\mathbf{S}_{YY}\mathbf{b}-1\right)

Differentiating the Lagrangian with respect to 𝐚\mathbf{a} and 𝐛\mathbf{b}, respectively and requiring the partial derivatives to equal zero, we obtain

𝐚\displaystyle\frac{\partial\mathcal{L}}{\partial\mathbf{a}} =(𝐒XY𝐛)(𝐒XY𝐛)𝐚λ1𝐒XX𝐚=𝟎\displaystyle=\left(\mathbf{S}_{XY}\mathbf{b}\right)\cdot\left(\mathbf{S}_{XY}\mathbf{b}\right)^{\top}\mathbf{a}-\lambda_{1}\mathbf{S}_{XX}\mathbf{a}=\mathbf{0}
(𝐒XY𝐛)(𝐒XY𝐛)𝐚=λ1𝐒XX𝐚,(i)\displaystyle\Rightarrow\left(\mathbf{S}_{XY}\mathbf{b}\right)\cdot\left(\mathbf{S}_{XY}\mathbf{b}\right)^{\top}\mathbf{a}=\lambda_{1}\mathbf{S}_{XX}\mathbf{a},\qquad(\mathrm{i})
𝐛\displaystyle\frac{\partial\mathcal{L}}{\partial\mathbf{b}} =(𝐒YX𝐚)(𝐒YX𝐚)𝐛λ2𝐒YY𝐛=𝟎\displaystyle=\left(\mathbf{S}_{YX}\mathbf{a}\right)\cdot\left(\mathbf{S}_{YX}\mathbf{a}\right)^{\top}\mathbf{b}-\lambda_{2}\mathbf{S}_{YY}\mathbf{b}=\mathbf{0}
(𝐒YX𝐚)(𝐒YX𝐚)𝐛=λ2𝐒YY𝐛.(ii)\displaystyle\Rightarrow\left(\mathbf{S}_{YX}\mathbf{a}\right)\cdot\left(\mathbf{S}_{YX}\mathbf{a}\right)^{\top}\mathbf{b}=\lambda_{2}\mathbf{S}_{YY}\mathbf{b}.\qquad\mathrm{(ii)}

With (i)\mathrm{(i)}, (ii)\mathrm{(ii)} and since (𝐚𝐒XY𝐛)\left(\mathbf{a}^{\top}\mathbf{S}_{XY}\mathbf{b}\right) is a real-valued quantity, we further obtain

(𝐚𝐒XY𝐛)(𝐚𝐒XY𝐛)=λ1𝐚𝐒XX𝐚, and(𝐛𝐒YX𝐚)(𝐚𝐒XY𝐛)=λ2𝐛𝐒YY𝐛.\displaystyle\begin{array}[]{l}(\mathbf{a}^{\top}\mathbf{S}_{XY}\mathbf{b})\cdot(\mathbf{a}^{\top}\mathbf{S}_{XY}\mathbf{b})=\lambda_{1}\mathbf{a}^{\top}\mathbf{S}_{XX}\mathbf{a},\mbox{ and}\\ (\mathbf{b}^{\top}\mathbf{S}_{YX}\mathbf{a})\cdot(\mathbf{a}^{\top}\mathbf{S}_{XY}\mathbf{b})=\lambda_{2}\mathbf{b}^{\top}\mathbf{S}_{YY}\mathbf{b}.\end{array}

Recalling the constraints 𝐚𝐒XX𝐚=1\mathbf{a}^{\top}\mathbf{S}_{XX}\mathbf{a}=1 and 𝐛𝐒YY𝐛=1\mathbf{b}^{\top}\mathbf{S}_{YY}\mathbf{b}=1, it immediately follows that

λ1=λ2=(𝐚𝐒XY𝐛)2:=λ.\displaystyle\lambda_{1}=\lambda_{2}=\left(\mathbf{a}^{\top}\mathbf{S}_{XY}\mathbf{b}\right)^{2}:\,=\lambda.

By substituting the above back into (i)\mathrm{(i)}, (ii)\mathrm{(ii)} and assuming λ\lambda to be non-zero, we obtain

𝐚=1λ𝐒XX1𝐒XY𝐛 and𝐛=1λ𝐒YY1𝐒YX𝐚,\displaystyle\begin{array}[]{l}\mathbf{a}=\frac{1}{\sqrt{\lambda}}\mathbf{S}_{XX}^{-1}\mathbf{S}_{XY}\mathbf{b}\mbox{ and}\\ \mathbf{b}=\frac{1}{\sqrt{\lambda}}\mathbf{S}_{YY}^{-1}\mathbf{S}_{YX}\mathbf{a},\end{array}

which plugged into (ii)\mathrm{(ii)}, (i)\mathrm{(i)}, respectively, yield

𝐒XY𝐒YY1𝐒YX𝐚=λ𝐒XX𝐚𝐒YX𝐒XX1𝐒XY𝐛=λ𝐒YY𝐛 or, equivalently,\displaystyle\begin{array}[]{l}\mathbf{S}_{XY}\mathbf{S}_{YY}^{-1}\mathbf{S}_{YX}\ \mathbf{a}=\lambda\mathbf{S}_{XX}\mathbf{a}\\ \mathbf{S}_{YX}\mathbf{S}_{XX}^{-1}\mathbf{S}_{XY}\mathbf{b}=\lambda\mathbf{S}_{YY}\mathbf{b}\mbox{ or, equivalently,}\\ \end{array} 𝐒XX1𝐒XY𝐒YY1𝐒YX𝐚=λ𝐚,𝐒YY1𝐒YX𝐒XX1𝐒XY𝐛=λ𝐛.\displaystyle\begin{array}[]{l}\mathbf{S}_{XX}^{-1}\mathbf{S}_{XY}\mathbf{S}_{YY}^{-1}\mathbf{S}_{YX}\mathbf{a}=\lambda\mathbf{a},\\ \mathbf{S}_{YY}^{-1}\mathbf{S}_{YX}\mathbf{S}_{XX}^{-1}\mathbf{S}_{XY}\mathbf{b}=\lambda\mathbf{b}.\\ \end{array}

Hence λ=(𝐚𝐒XY𝐛)2\lambda=\left(\mathbf{a}^{\top}\mathbf{S}_{XY}\mathbf{b}\right)^{2} is an eigenvalue for both matrices

𝐒XX1𝐒XY𝐒YY1𝐒YX, and\displaystyle\mathbf{S}_{XX}^{-1}\mathbf{S}_{XY}\mathbf{S}_{YY}^{-1}\mathbf{S}_{YX},\mbox{ and}
𝐒YY1𝐒YX𝐒XX1𝐒XY\displaystyle\mathbf{S}_{YY}^{-1}\mathbf{S}_{YX}\mathbf{S}_{XX}^{-1}\mathbf{S}_{XY}

whose corresponding eigenvectors are 𝐚\mathbf{a} and 𝐛\mathbf{b}, respectively. Thus, the defined canonical coherence 𝝆=max(𝐚𝐒XY𝐛)2\boldsymbol{\rho}=\max\left(\mathbf{a}^{\top}\mathbf{S}_{XY}\mathbf{b}\right)^{2} is the largest eigenvalue of above matrices. The case when λ=0\lambda=0 illustrates the canonical coherence between 𝐗\mathbf{X} and 𝐘\mathbf{Y} is 0, which is not a meaningful scenario for this problem. We recall the above equations hold for every time uu and scale jj.

Proof of consistency of WaveCanCoh estimator. We aim to establish the consistency of the matrix estimators

𝐌^j;𝐚=𝐒^j,𝐗𝐗1𝐒^j,𝐗𝐘𝐒^j,𝐘𝐘1𝐒^j,𝐘𝐗,𝐌^j;𝐛=𝐒^j,𝐘𝐘1𝐒^j,𝐘𝐗𝐒^j,𝐗𝐗1𝐒^j,𝐗𝐘\displaystyle\widehat{\mathbf{M}}_{j;\mathbf{a}}=\widehat{\mathbf{S}}_{j,\mathbf{XX}}^{-1}\widehat{\mathbf{S}}_{j,\mathbf{XY}}\widehat{\mathbf{S}}_{j,\mathbf{YY}}^{-1}\widehat{\mathbf{S}}_{j,\mathbf{YX}},\quad\widehat{\mathbf{M}}_{j;\mathbf{b}}=\widehat{\mathbf{S}}_{j,\mathbf{Y}\mathbf{Y}}^{-1}\widehat{\mathbf{S}}_{j,\mathbf{YX}}\widehat{\mathbf{S}}_{j,\mathbf{XX}}^{-1}\widehat{\mathbf{S}}_{j,\mathbf{XY}}

According to Nason et al. (2000) and Park et al. (2014), the smoothed periodogram-based estimators of the local wavelet spectral (LWS) matrices are consistent. Specifically, as the number of time points TT\to\infty and the smoothing parameter MM\to\infty with M/T0M/T\to 0, we have

𝐒^j,𝐗𝐗𝑃𝐒j,𝐗𝐗,𝐒^j,𝐗𝐘𝑃𝐒j,𝐗𝐘,𝐒^j,𝐘𝐗𝑃𝐒j,𝐘𝐗,𝐒^j,𝐘𝐘𝑃𝐒j,𝐘𝐘.\displaystyle\widehat{\mathbf{S}}_{j,\mathbf{XX}}\xrightarrow{P}\mathbf{S}_{j,\mathbf{XX}},\quad\widehat{\mathbf{S}}_{j,\mathbf{XY}}\xrightarrow{P}\mathbf{S}_{j,\mathbf{XY}},\quad\widehat{\mathbf{S}}_{j,\mathbf{YX}}\xrightarrow{P}\mathbf{S}_{j,\mathbf{YX}},\quad\widehat{\mathbf{S}}_{j,\mathbf{YY}}\xrightarrow{P}\mathbf{S}_{j,\mathbf{YY}}.

Assuming the spectral matrices and their estimators are non-singular, it follows by the continuous mapping theorem that

𝐒^j,𝐗𝐗1𝑃𝐒j,𝐗𝐗1,𝐒^j,𝐘𝐘1𝑃𝐒j,𝐘𝐘1.\displaystyle\widehat{\mathbf{S}}_{j,\mathbf{XX}}^{-1}\xrightarrow{P}\mathbf{S}_{j,\mathbf{XX}}^{-1},\quad\widehat{\mathbf{S}}_{j,\mathbf{Y}\mathbf{Y}}^{-1}\xrightarrow{P}\mathbf{S}_{j,\mathbf{YY}}^{-1}.

Since matrix multiplication is continuous with respect to convergence in probability, we obtain the consistency of the matrix estimators by Slutsky’s theorem, namely

𝐌^j;𝐚𝑃𝐌j;𝐚,𝐌^j;𝐛𝑃𝐌j;𝐛.\displaystyle\widehat{\mathbf{M}}_{j;\mathbf{a}}\xrightarrow{P}\mathbf{M}_{j;\mathbf{a}},\quad\widehat{\mathbf{M}}_{j;\mathbf{b}}\xrightarrow{P}\mathbf{M}_{j;\mathbf{b}}.

Consequently, the estimated wavelet canonical coherence and associated canonical vectors, derived from the largest eigenvalue and corresponding eigenvectors of 𝐌^j;a\widehat{\mathbf{M}}_{j;a} and 𝐌^j;b\widehat{\mathbf{M}}_{j;b}, also converge in probability to their population counterparts, following arguments akin to those in Knight et al. (2024).

Appendix C Details of the simulation setup

This appendix provides the full specification of the simulation experiments described in Section 5 of the main text.

C.1 MvLSW-based simulation

We simulate a P+Q=6+4P+Q=6+4 dimensional multivariate time series 𝐙t=[𝐗t,𝐘t]10\mathbf{Z}_{t}=[\mathbf{X}_{t}^{\top},\mathbf{Y}_{t}^{\top}]^{\top}\in\mathbb{R}^{10} from a multivariate locally stationary wavelet (MvLSW) process over T=1024T=1024 time points. The wavelet spectrum 𝐒j;𝐙𝐙(u)\mathbf{S}_{j;\mathbf{ZZ}}(u) is non-zero only at scale j=2j=2 and is structured as a 10×1010\times 10 block matrix

𝐒j=2;𝐙𝐙(u)=[𝐒j;𝐗𝐗𝐒j;𝐗𝐘(u)𝐒j;𝐘𝐗(u)𝐒j;𝐘𝐘],u=tT(0,1).\mathbf{S}_{j=2;\mathbf{ZZ}}(u)=\begin{bmatrix}\mathbf{S}_{j;\mathbf{XX}}&\mathbf{S}_{j;\mathbf{XY}}(u)\\ \mathbf{S}_{j;\mathbf{YX}}(u)&\mathbf{S}_{j;\mathbf{YY}}\end{bmatrix},\quad u=\frac{t}{T}\in(0,1).

Auto-spectral block: 𝐒j;𝐗𝐗6×6\mathbf{S}_{j;\mathbf{XX}}\in\mathbb{R}^{6\times 6}

𝐒j;𝐗𝐗=[811000180001108000000810000180010008]\mathbf{S}_{j;\mathbf{XX}}=\begin{bmatrix}8&1&1&0&0&0\\ 1&8&0&0&0&1\\ 1&0&8&0&0&0\\ 0&0&0&8&1&0\\ 0&0&0&1&8&0\\ 0&1&0&0&0&8\end{bmatrix}

Auto-spectral block: 𝐒j;𝐘𝐘4×4\mathbf{S}_{j;\mathbf{YY}}\in\mathbb{R}^{4\times 4}

𝐒j;𝐘𝐘=[6010061111600106]\mathbf{S}_{j;\mathbf{YY}}=\begin{bmatrix}6&0&1&0\\ 0&6&1&1\\ 1&1&6&0\\ 0&1&0&6\end{bmatrix}

Cross-spectral block: 𝐒j;𝐗𝐘(u)6×4\mathbf{S}_{j;\mathbf{XY}}(u)\in\mathbb{R}^{6\times 4}

This block is time-varying. For u<0.5u<0.5, each specified cross-group pair has coherence value 1; for u0.5u\geq 0.5, the same entries increase to 2.

Let

c(u)={1if u<0.52if u0.5c(u)=\begin{cases}1&\text{if }u<0.5\\ 2&\text{if }u\geq 0.5\end{cases}

then

𝐒j;𝐗𝐘(u)=[c(u)00c(u)0c(u)00000c(u)c(u)00000000000],𝐒j;𝐘𝐗(u)=𝐒j;𝐗𝐘(u)\mathbf{S}_{j;\mathbf{XY}}(u)=\begin{bmatrix}c(u)&0&0&c(u)\\ 0&c(u)&0&0\\ 0&0&0&c(u)\\ c(u)&0&0&0\\ 0&0&0&0\\ 0&0&0&0\\ \end{bmatrix},\quad\mathbf{S}_{j;\mathbf{YX}}(u)=\mathbf{S}_{j;\mathbf{XY}}^{\top}(u)

The entries of the cross-spectral matrix determine how individual channels contribute to the global coherence structure between {𝐗t}\{\mathbf{X}_{t}\} and {𝐘t}\{\mathbf{Y}_{t}\}. Figure 8 visualizes the spectral structure 𝐒j=2;𝐙𝐙(u)\mathbf{S}_{j=2;\mathbf{ZZ}}(u), and Figure 9 shows example realizations of the simulated processes {𝐗t}\{\mathbf{X}_{t}\} and {𝐘t}\{\mathbf{Y}_{t}\}.

Refer to caption
Figure 8: Visualization of the specified block structure of 𝐒2;𝐙𝐙(u)\mathbf{S}_{2;\mathbf{ZZ}}(u).
Refer to caption
Refer to caption
Figure 9: Example realization of {𝐗t}\{\mathbf{X}_{t}\} and {𝐘t}\{\mathbf{Y}_{t}\} generated from stated MvLSW process {𝐙t}\{\mathbf{Z}_{t}\}.

C.2 Mixture of AR(2)-based simulation

Fourier-based LSP method for comparison

As a benchmark, we implement a method based on the time-varying Cramér representation for locally stationary processes (LSP), introduced in Dahlhaus (1997). A locally stationary process {𝐗t,T}\{\mathbf{X}_{t,T}\} can be expressed as

𝐗t,T=0.50.5𝐀(u,ω)e2πiωt𝑑Z(ω),u=t/T,\mathbf{X}_{t,T}=\int_{-0.5}^{0.5}\mathbf{A}(u,\omega)e^{2\pi i\omega t}\,dZ(\omega),\quad u=t/T,

where 𝐀(u,ω)\mathbf{A}(u,\omega) is a smoothly varying transfer function and Z(ω)Z(\omega) is a complex orthogonal increment process with 𝔼[|dZ(ω)|2]=dω\mathbb{E}[|dZ(\omega)|^{2}]=d\omega. The local spectral density is then defined as 𝐟(u,ω)=|𝐀(u,ω)|2\mathbf{f}(u,\omega)=|\mathbf{A}(u,\omega)|^{2}. We estimate 𝐟(u,ω)\mathbf{f}(u,\omega) via the Short-Time Fourier Transform (STFT) with a Gaussian smoothing kernel and compute canonical coherence over ω[25,50]Hz\omega\in[25,50]Hz.

Simulation setting

We simulate 500 independent replicates of two multivariate processes, {𝐗t}4\{\mathbf{X}_{t}\}\in\mathbb{R}^{4} and {𝐘t}3\{\mathbf{Y}_{t}\}\in\mathbb{R}^{3}, for t=1,,Tt=1,\ldots,T, with T=1024T=1024 and sampling rate fs=100Hzf_{s}=100Hz. Each process is generated from mixture of K=5K=5 latent AR(2) sources, 𝐙t(𝐗),𝐙t(𝐘)K\mathbf{Z}^{(\mathbf{X})}_{t},\mathbf{Z}^{(\mathbf{Y})}_{t}\in\mathbb{R}^{K}, peaking at different frequency bands (delta, theta, alpha, beta, gamma), respectively. For the first half of the time series (tT/2t\leq T/2), the observed processes are given by:

𝐗t=B(1)𝐙t(𝐗),𝐘t=C(1)𝐙t(𝐘),\mathbf{X}_{t}=B^{(1)}\mathbf{Z}^{(\mathbf{X})}_{t},\qquad\mathbf{Y}_{t}=C^{(1)}\mathbf{Z}^{(\mathbf{Y})}_{t},

and for the second half (t>T/2t>T/2):

𝐗t=B(2)𝐙t(𝐗),𝐘t=C(2)𝐙t(𝐘),\mathbf{X}_{t}=B^{(2)}\mathbf{Z}^{(\mathbf{X})}_{t},\qquad\mathbf{Y}_{t}=C^{(2)}\mathbf{Z}^{(\mathbf{Y})}_{t},

where the mixing matrices are:

B(1)=[00000.9500000.90b3,1(1)b3,2(1)000b4,1(1)b4,2(1)b4,3(1)00],B(2)=[0b1,2(2)b1,3(2)00000.80000.9000000b4,2(2)b4,3(2)00]B^{(1)}=\begin{bmatrix}0&0&0&0&0.95\\ 0&0&0&0&0.90\\ b_{3,1}^{(1)}&b_{3,2}^{(1)}&0&0&0\\ b_{4,1}^{(1)}&b_{4,2}^{(1)}&b_{4,3}^{(1)}&0&0\end{bmatrix},\quad B^{(2)}=\begin{bmatrix}0&b_{1,2}^{(2)}&b_{1,3}^{(2)}&0&0\\ 0&0&0.80&0&0\\ 0.90&0&0&0&0\\ 0&b_{4,2}^{(2)}&b_{4,3}^{(2)}&0&0\end{bmatrix}
C(1)=[00000.9500000.900c3,2(1)c3,3(1)00],C(2)=[0000.90000c2,3(2)00c3,1(2)0000]C^{(1)}=\begin{bmatrix}0&0&0&0&0.95\\ 0&0&0&0&0.90\\ 0&c_{3,2}^{(1)}&c_{3,3}^{(1)}&0&0\end{bmatrix},\quad C^{(2)}=\begin{bmatrix}0&0&0&0.90&0\\ 0&0&c_{2,3}^{(2)}&0&0\\ c_{3,1}^{(2)}&0&0&0&0\end{bmatrix}

For each row of the mixing matrices, a selected subset of frequency bands is assigned non-zero weights that are randomly drawn to sum to a predefined total (e.g., 0.95, 0.90, or 1.0), introducing controlled variability across replicates while preserving the intended contribution structure. The latent sources 𝐙t(𝐗)\mathbf{Z}^{(\mathbf{X})}_{t} and 𝐙t(𝐘)\mathbf{Z}^{(\mathbf{Y})}_{t} are partially shared in the gamma band (component 5) during the first regime, with mixing weights α=0.7\alpha=0.7 and β=0.6\beta=0.6, respectively:

Zt,5(𝐗)=αZt,5(shared)+(1α)Zt,5(𝐗,private),Zt,5(𝐘)=βZt,5(shared)+(1β)Zt,5(𝐘,private),tT/2Z^{(\mathbf{X})}_{t,5}=\alpha Z^{(\text{shared})}_{t,5}+(1-\alpha)Z^{(\mathbf{X},\text{private})}_{t,5},\qquad Z^{(\mathbf{Y})}_{t,5}=\beta Z^{(\text{shared})}_{t,5}+(1-\beta)Z^{(\mathbf{Y},\text{private})}_{t,5},\quad t\leq T/2

where Zt,5(shared)Z^{(\text{shared})}_{t,5} is a common latent process. Specifically, each channel, say pp, of {𝐙t(shared)}\{\mathbf{Z}_{t}^{\text{(shared)}}\}, {𝐙t(𝐗,private)}\{\mathbf{Z}_{t}^{(\mathbf{X},\text{private})}\} and {𝐙t(𝐘,private)}\{\mathbf{Z}_{t}^{(\mathbf{Y},\text{private})}\} is generated from the following AR(2) process independently,

Zt,p=ϕ1Zt1,p+ϕ2Zt2,p+wt,\displaystyle Z_{t,p}=\phi_{1}Z_{t-1,p}+\phi_{2}Z_{t-2,p}+w_{t},

where {wt}\{w_{t}\} is white noise and the coefficients are ϕ1=2cos(2πη)/es\phi_{1}=2cos(2\pi\eta)/e^{s},   ϕ2=1/e2s\phi_{2}=-1/e^{2s}. For each component p=1,,5p=1,\ldots,5, we use the frequency vector, 𝜼={η(1),,η(5)}={0.02,0.06,0.10,0.175,0.375}\boldsymbol{\eta}=\{\eta^{(1)},\ldots,\eta^{(5)}\}=\{0.02,0.06,0.10,0.175,0.375\}, and the sharpness parameter 𝐬={s(1),,s(5)}={0.03,0.03,0.03,0.05,0.05}\mathbf{s}=\{s^{(1)},\ldots,s^{(5)}\}=\{0.03,0.03,0.03,0.05,0.05\}, where smaller ss yields narrower frequency bands.

This design induces time-varying coherence between the first two channels of 𝐗\mathbf{X} and 𝐘\mathbf{Y} during the first regime, primarily through the shared gamma component, while maintaining independence during the second regime. The abrupt transition in mixing structure at t=T/2t=T/2 provides a controlled setting for evaluating the sensitivity of coherence estimation methods to sudden changes in cross-dependence.

Appendix D Additional analysis for the LFP data in Section 6

D.1 Additional results

We display the estimated wavelet canonical coherence between two electrode clusters across additional scales. Figure 10 shows that cross-group associations vary significantly over time, with the stimulus clearly eliciting a scale-dependent response, indicative of the heterogeneous brain activity across different frequency bands. These results emphasize the importance of capturing scale-specific, time-varying coherence.

Refer to caption
Refer to caption
Refer to caption
Figure 10: Estimated wavelet canonical coherence between two hippocampal regions (electrodes T1, T2, T4, T5 vs. T13–T17) in rat Mitt at scale j=4(31.2562.5Hz)j=4\,(31.25\text{\textendash}62.5Hz), j=6(7.8115.63Hz)j=6\,(7.81\text{\textendash}15.63Hz) and j=7(3.907.81Hz)j=7\,(3.90\text{\textendash}7.81Hz). The estimates are averaged across 40 correct- and 32 incorrect-response trials, using a rectangular smoothing window of 0.2 seconds.

D.2 Permutation test: Algorithm

Algorithm 2 below gives the detailed procedure used in Section 6 for detecting significant differences in canonical coherence between correct- and incorrect-response trials, at selected time points.

Algorithm 2 Time-specific window permutation test for wavelet coherence analysis
Assume the time-localized, scale-specific WaveCanCoh ρcorrect/ incorrect (r)(j,u)\rho_{\text{correct/ incorrect }}^{(r)}(j,u) is available for each trial rr, along with the trial label ‘correct/ incorrect’. We want to test for significant differences between correct- and incorrect-response trial groups at specific time points tt^{*} and given scale jj.
1.Window definition: For each time point tt^{*}, define a window of size ww centered at tt^{*}:
tstart =tw/2,tend =t+w/2t_{\text{start }}=t^{*}-w/2,\quad t_{\text{end }}=t^{*}+w/2
2.Test statistic: For each time point tt^{*} and given scale jj, compute:
Tobs(j,t)=t=tstart tend (medianr(ρcorrect (r)(j,t/T))medianr(ρincorrect (r)(j,t/T)))2T_{\mathrm{obs}}\left(j,t^{*}\right)=\sum_{t=t_{\text{start }}}^{t_{\text{end }}}\left(\operatorname{median}_{r}\left(\rho_{\text{correct }}^{(r)}(j,t/T)\right)-\operatorname{median}_{r}\left(\rho_{\text{incorrect }}^{(r)}(j,t/T)\right)\right)^{2}
3.Permutation: For each tt^{*} and scale jj:
  • Combine all ρ(r)(j,t/T)\rho^{(r)}(j,t/T) values from both correct- and incorrect-response trials across the time window.

  • Perform npermn_{\text{perm}} random permutations. For each permutation ii:

    • Randomly assign trials into two new groups of the same sizes as the original groups.

    • Compute the permuted statistic:

      Tperm(i)(j,t)=t=tstart tend (medianr(ρperm,1(r)(j,t/T))medianr(ρperm,2(r)(j,t/T)))2T_{\text{perm}}^{(i)}\left(j,t^{*}\right)=\sum_{t=t_{\text{start }}}^{t_{\text{end }}}\left(\operatorname{median}_{r}\left(\rho_{\text{perm},1}^{(r)}(j,t/T)\right)-\operatorname{median}_{r}\left(\rho_{\text{perm},2}^{(r)}(j,t/T)\right)\right)^{2}

      where ρperm,1(r)\rho_{\text{perm,}1}^{(r)} and ρperm,2(r)\rho_{\text{perm,}2}^{(r)} are the permuted trial groups.

4.Calculate pp-value:
p(j,t)=1npermi=1nperm𝕀(Tperm(i)(j,t)Tobs(j,t))p\left(j,t^{*}\right)=\frac{1}{n_{\mathrm{perm}}}\sum_{i=1}^{n_{\mathrm{perm}}}\mathbb{I}\left(T_{\mathrm{perm}}^{(i)}\left(j,t^{*}\right)\geq T_{\mathrm{obs}}\left(j,t^{*}\right)\right)

D.3 Permutation test: LFP results

Table 1 reports the permutation test results on the LFP data, with the value in each cell representing the difference in the median wavelet coherence between correct and incorrect trials at the corresponding scale jj and time t=1.0s,0.5s,0.5s,1.0st^{*}=-1.0s,-0.5s,0.5s,1.0s. The values in parentheses denote the permutation pp-values obtained using the windowed test procedure (window size =0.2s=0.2s), with the number of permutation nperm=1000n_{perm}=1000, and the detailed test results are shown in Figure 11, revealing significant differences between correct- and incorrect-response trials at scales j=4j=4 to 77 following the odor stimulation.

While our permutation-based inference procedure provides a robust, nonparametric approach for detecting significant coherence differences between conditions, nevertheless this framework does exhibit certain limitations, e.g., it inherently focuses on predefined wavelet scales rather than on arbitrary frequency bands, potentially restricting analyses that require precise frequency localization.

Table 1: Differences in median wavelet canonical coherence between correct- and incorrect-response trials across time points and scales. Each cell reports the median difference at scale jj and time tt^{*}, with pp-values obtained from the time-specific permutation test shown in parentheses.
jj t(s)t^{*}(s) -1.0 -0.5 0.5 1.0
3    (62.5125Hz62.5-125Hz) 0.222 (0.786) 0.068 (0.704) -0.178 (0.355) 0.106 (0.886)
4    (31.2562.5Hz31.25-62.5Hz) -0.090 (0.180) -0.021 (0.506) 0.023 (0.001**) 0.027 (0.691)
5   (15.6331.25Hz15.63-31.25Hz) 0.006 (0.977) 0.229 (0.999) 0.334 (0.002**) 0.016 (0.079)
6    (7.8115.63Hz7.81-15.63Hz) -0.025 (0.239) -0.049(0.111) 0.039 (0.001**) 0.002 (0.025**)
7     (<7.81Hz<7.81Hz) -0.014 (0.999) 0.058 (0.640) 0.012 (0.059) -0.030 (0.489)
Refer to caption
Refer to caption
Refer to caption
Refer to caption
Refer to caption
Figure 11: Permutation test distributions (TpermT_{perm}) obtained using Algorithm 2 for different time points across scales j=3j=3 to j=7j=7. Red vertical lines indicate observed test statistics (TobsT_{obs}); pp-values are shown in each panel, corresponding to the results reported in Table 1.

Appendix E Causal-WaveCanCoh analysis

To further explore directed interactions between brain regions, we implement the Causal-WaveCanCoh framework (equation (17)) on LFP activity data, which extends WaveCanCoh by introducing a lead-lag structure to evaluate time-lagged canonical coherence. Specifically, we define 𝐗t=(X(T1),X(T2),X(T4),X(T5))\mathbf{X}_{t}=\left({X^{(\text{T1})},X^{(\text{T2})},X^{(\text{T4})},X^{(\text{T5})}}\right) and 𝐘t=(Y(T13),,Y(T17))\mathbf{Y}_{t}=\left({Y^{(\text{T13})},\ldots,Y^{(\text{T17})}}\right) as the multivariate signals corresponding to the two investigated distinct hippocampal regions, and we conduct the analysis in both directions, namely 𝐗t𝐘t+h\mathbf{X}_{t}\to\mathbf{Y}_{t+h} and 𝐘t𝐗t+h\mathbf{Y}_{t}\to\mathbf{X}_{t+h}, for lags h=0, 10, 20, 30, 40, 50h=0,\,10,\,20,\,30,\,40,\,50, corresponding to time shifts from 0 (contemporaneous dependence, used as reference point) to 0.05 seconds.

Figure 12 shows the average scale-specific estimated causal canonical coherence for both directions across the odor presentation time (0s1.2s0s-1.2s) and trial type (correct- vs incorrect-response trials). The results reveal lag-dependent and scale-driven patterns of directed coherence that identify a stronger association between the two hippocampal regions at scale j=5j=5 (corresponding to the frequency band 15.62531.25Hz15.625-31.25Hz). This notably occurs in both directions, with the activity in T13-T17 leading that of T1-T5 in correct-response trials after h>10h>10. The strength and behavior of coherence vary across scales, as well as across correct- and incorrect-response trial groups, which can be captured by our Causal-WaveCanCoh framework.

Refer to caption
Refer to caption
Figure 12: Average causal wavelet canonical coherence across time lags h=0h=0 to 5050 at scales j=4j=4 to 77, comparing correct- and incorrect-response trials. Top: 𝐗t𝐘t+h\mathbf{X}_{t}\rightarrow\mathbf{Y}_{t+h}. Bottom: 𝐘t𝐗t+h\mathbf{Y}_{t}\rightarrow\mathbf{X}_{t+h}.