This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

11institutetext: Yashwanth Ramamurthi 22institutetext: International Institute of Information Technology, Bangalore
22email: yashwanth@iiitb.ac.in
33institutetext: Amit Chattopadhyay 44institutetext: International Institute of Information Technology, Bangalore
44email: a.chattopadhyay@iiitb.ac.in

A Topological Distance Measure between Multi-Fields for Classification and Analysis of Shapes and Data

Yashwanth Ramamurthi    Amit Chattopadhyay
Abstract

Distance measures play an important role in shape classification and data analysis problems. Topological distances based on Reeb graphs and persistence diagrams have been employed to obtain effective algorithms in shape matching and scalar data analysis. In the current paper, we propose an improved distance measure between two multi-fields by computing a multi-dimensional Reeb graph (MDRG) each of which captures the topology of a multi-field through a hierarchy of Reeb graphs in different dimensions. A hierarchy of persistence diagrams is then constructed by computing a persistence diagram corresponding to each Reeb graph of the MDRG. Based on this representation, we propose a novel distance measure between two MDRGs by extending the bottleneck distance between two Reeb graphs. We show that the proposed measure satisfies the pseudo-metric and stability properties. We examine the effectiveness of the proposed multi-field topology-based measure on two different applications: (1) shape classification and (2) detection of topological features in a time-varying multi-field data. In the shape classification problem, the performance of the proposed measure is compared with the well-known topology-based measures in shape matching. In the second application, we consider a time-varying volumetric multi-field data from the field of computational chemistry where the goal is to detect the site of stable bond formation between Pt and CO molecules. We demonstrate the ability of the proposed distance in classifying each of the sites as occurring before and after the bond stabilization.

Keywords:
Multi-Field, Topology, Multi-Dimensional Reeb graph, Distance Measure, Shape Classification, Feature-Detection.

1 Introduction

The computations of topological similarity between shapes or data reveal critical phenomena to experts in various domains. A lot of research has been done in developing similarity or distance measures using scalar topology. The development of these measures were based on descriptors in scalar topology such as Reeb graph, Morse-Smale Complex, extremum graph and persistence diagram. Scalar topology based methods have found applications in several areas such as shape matching 2001-Hilaga-MRG ; 2007-Tam-DeformableModelRetrieval , protein structure classification 2004-Zhang-Dual-Contour-Tree , symmetry detection 2014-Thomas-Multiscale and periodicity detection in time-varying data 2018-Sridharamurthy-Edit-Distance-Merge-Trees .

Persistent homology theory 2002-Edelsbrunner-Persistence ; 2007-Zomorodian-ComputingPersistentHomology ; 2009-Chazal-ProximityOfPersistenceModules has provided a simple yet powerful technique to encode the topological information of a scalar field into a bar code or persistence diagram. Distance measures based on persistence diagrams have been shown to be effective in the classification of signals 2017-Marchese-Signal-Classification and determining the crystal structure of materials 2020-Maroulas-Distance-Topological-Classification . The bottleneck distance between persistence diagrams has proven to be effective in the topological analysis of shapes and data 2014-Li-Persistence-Based-Structural-Recognition ; 2016-MultiscaleMapper . Recently, Bauer et al.2014-Bauer-DistanceBetweenReebGraphs developed a functional distortion distance between Reeb graphs and showed that the bottleneck distance between the persistence diagrams of Reeb graphs is a pseudo-metric.

Multi-field topology is getting wider attention because of its ability to capture and classify richer topological features than scalar topology, as has been shown in various data analysis applications in computational physics and computational chemistry 2012-Duke-Nuclear-Scission ; 2019-Agarwal-histogram ; 2021-Ramamurthi-MRS . However, the application of multi-field topology in the classification and analysis of shapes and data, requires further development in the representation of such topological features and finding effective distance measures between such representations. In the current work, we propose a novel distance measure between two shapes or multi-fields based on their multi-dimensional Reeb graphs (MDRG).

The MDRG is a topological descriptor which captures the topology of a multi-field using a series of Reeb graphs in different dimensions. To compare two MDRGs, we compute the persistence diagrams corresponding to the component Reeb graphs and find a distance between two such collections of Reeb graphs using the bottleneck distance. We show that the proposed distance measure is stable and satisfies the pseudo-metric property. We validate the effectiveness of the proposed distance measure in the classification of shapes and detection of topological features in time-varying volumetric data. In the current paper, our contributions are as follows.

  • We propose a novel distance measure between two MDRGs based on the bottleneck distance between the component Reeb graphs of the MDRGs.

  • We show the proposed distance measure satisfies the stability and pseudo-metric properties.

  • We show the effectiveness of using pairs of eigenfunctions from the Laplace-Beltrami operator, in the classification of shapes, compared to other well-known shape descriptors, viz. HKS, WKS and SIHKS, for different topology-based techniques, using the SHREC 20102010 dataset.

  • We demonstrate the performance of multi-field topology over scalar topology by showing the effectiveness of the proposed measure in detecting stable bond formation in the Pt-CO molecular orbitals data and classifying sites based on their occurrence before and after bond stabilization.

Outline. In Section 2, we discuss topological tools and their applications in shape matching and data analysis. Section 3 discusses the necessary background required to understand the proposed distance measure. In Section 4, we describe the proposed distance measure between MDRGs and prove its pseudometric and stability properties. In Section 5, we show the experimental results of the proposed measure in (i) shape classification and (ii) topological feature detection in a time-varying dataset from computational chemistry. Finally, in Section 6 we draw a conclusion and provide future directions.

2 Related Work

Several shape classification and data analysis techniques have been developed based on the scalar topology tools such as contour Tree 2004-Zhang-Dual-Contour-Tree , Reeb Graph 2001-Hilaga-MRG , and Merge tree 2018-Sridharamurthy-Edit-Distance-Merge-Trees . Li et al.2014-Li-Persistence-Based-Structural-Recognition computed the persistence diagrams of shapes corresponding to the spectral descriptors HKS, WKS, SIHKS and compared the technique with the bag-of-features model. Kleiman et al.2017-Kleiman-StructureBasedShapeCorrespondence present a method for computing region-level correspondence between a pair of shapes by obtaining a shape graph corresponding to each shape and then matching the nodes of shape graphs. Poulenard et al.2018-Poulenard-TopologicalFunctionOptimization present a characterization of functional maps based on persistence diagrams along with an optimization scheme to improve the computational efficiency.

An increase of interest in the Laplace-Beltrami (LB) operator resulted in the evolution of various signatures to capture the features in a shape 1997-Rosenberg-Laplacian ; 2009-Reuter-LaplaceBeltrami . Reuter et al.2006-Reuter-Shape-DNA proposed the shape DNA, where a shape is represented using the eigenvalues of the Laplace-Beltrami operator and was able to identify the shapes with similar poses. However, non-isometric shapes can have the same set of eigenvalues. This drawback is overcome by the Global Point Signature (GPS) 2007-Rustamov-GPS , where the eigenfunctions are used along with the eigenvalues. However, this introduced the sign ambiguity problem to the eigenfunctions (see Section 5.1.1 for more details), which was solved using the Heat Kernel Signature (HKS) 2009-Sun-HKS . However, the HKS consists of information from low frequencies. The suppression of high frequencies makes it difficult to detect microscopic features. This limitation was overcome in the Wave Kernel Signature (WKS)2011-Aubry-WKS by using band-pass filters instead of low-pass filters. Bronstein et al.2010-Bronstein-SIHKS propose a scale invariant version of the Heat Kernel Signature (SIHKS) by using logarithmic sampling and Fourier transform. Recently, Zihao et al.2020-Wang-Shape-Retrieval proposed a shape descriptor based on the probability distributions of the eigenfunctions of the LB operator and showed its effectiveness over SIHKS. In this paper, we measure distance between shapes by directly comparing the eigenfunctions of the LB operator and show its performance with respect to HKS, WKS and SIHKS.

Recently, tools for capturing multi-field topology have been studied using the Jacobi set 2004-Edelsbrunner-JacobiSets , Reeb Space 2008-Edelsbrunner-ReebSpaces , Mapper 2007-Singh-Mapper , Joint Contour Net 2014-Carr-JCN , Multi-Dimensional Reeb Graph 2014-Chattopadhyay-ExtractingJacobiStructures ; 2016-Chattopadhyay-MultivariateTopologySimplification , etc. Techniques for comparing multi-fields have been developed such as the bottleneck distance between persistence diagrams of multiscale mappers 2016-MultiscaleMapper , distance between fiber-component distributions 2019-Agarwal-histogram and similarity between multi-resolution Reeb spaces (MRSs) 2021-Ramamurthi-MRS . In the current paper, we capture the topological features in a multi-field by computing the persistence diagrams of the Reeb graphs in an MDRG and develop a distance measure between two such MDRGs based on the bottleneck distance between the component Reeb graphs of the MDRGs. We evaluate the effectiveness of the proposed distance measure in (i) the shape classification problem using the SHREC 20102010 dataset 2010-SHREC and (ii) analysis and classification of topological features in a volumetric time-varying data from the field of computational chemistry.

3 Background

In this section, we discuss the necessary background required to understand the proposed distance measure and feature descriptors used in the comparison of shapes.

3.1 Multi-Field Topology

Let 𝕄\mathbb{M} be a triangulated mesh of a compact dd-manifold \mathcal{M}. A multi-field on 𝕄\mathbb{M} with rr component scalar fields is defined as a continuous map 𝐟=(f1,f2,,fr):𝕄r\mathbf{f}=(f_{1},f_{2},...,f_{r}):\mathbb{M}\rightarrow\mathbb{R}^{r}. For a value 𝐜r\mathbf{c}\in\mathbb{R}^{r}, its inverse 𝐟1(𝐜)\mathbf{f}^{-1}(\mathbf{c}) is called a fiber. Each connected component of a fiber is a fiber-component 2004-Saeki-Topology-of-Singular-Fibers ; 2014-Saeki-Visualizing-Multivariate-Data . In particular, for a scalar field f:𝕄f:\mathbb{M}\rightarrow\mathbb{R}, the inverse of ff corresponding to a point in the range is called a level set and each connected component of a level set is called a contour. The Reeb space of 𝐟\mathbf{f}, denoted by 𝒮𝐟\mathcal{RS}_{\mathbf{f}}, is the quotient space by contracting each fiber-component to a point 2008-Edelsbrunner-ReebSpaces . In particular, the Reeb space of a scalar field f:𝕄f:\mathbb{M}\rightarrow\mathbb{R} is known as the Reeb graph 𝒢f\mathcal{RG}_{f}, which is obtained by contracting each contour to a point and f~:𝕄𝒢f\tilde{f}:\mathbb{M}\rightarrow\mathcal{RG}_{f} is the corresponding quotient map. 2017-Saeki-Theory-of-Singular-Fibers . Carr et al.2014-Carr-JCN proposed the Joint Contour Net (JCN) data-structure, which is a quantized approximation of the Reeb space. For construction of the JCN of 𝐟\mathbf{f} , the range of 𝐟\mathbf{f} is subdivided into a finite set of rr-dimensional intervals. For each interval, the inverse of 𝐟\mathbf{f} is a quantized fiber and each connected component of a quantized fiber is a quantized fiber-component or joint contour. The JCN is a graph where a node corresponds to a joint contour and an edge between two nodes corresponds to the adjacency of the corresponding joint contours in the domain.

3.2 Multi Dimensional Reeb Graph

A multi-dimensional Reeb graph (MDRG) of a multi-field 𝐟=(f1,,fr):𝕄r\mathbf{f}=(f_{1},...,f_{r}):\mathbb{M}\rightarrow\mathbb{R}^{r}, proposed by Chattopadhyay et al.2014-Chattopadhyay-ExtractingJacobiStructures ; 2016-Chattopadhyay-MultivariateTopologySimplification , is the decomposition of the Reeb space (or a JCN) into a series of Reeb graphs in different dimensions. To construct the MDRG of 𝐟\mathbf{f}, the Reeb graph 𝒢f1\mathcal{RG}_{f_{1}} of the function f1f_{1} is computed. Each point p𝒢f1p\in\mathcal{RG}_{f_{1}} represents a contour CpC_{p} of f1f_{1}. For every p𝒢f1p\in\mathcal{RG}_{f_{1}} a Reeb graph corresponding to the field f2f_{2} is computed by restricting f2f_{2} on CpC_{p}. This procedure is repeated by computing the Reeb graphs for the function fif_{i} by restricting fif_{i} on the contours corresponding to points in the Reeb graphs of fi1f_{i-1}, where 2ir2\leq i\leq r. In practice, the MDRG is constructed from a quantized approximation of the Reeb Space (JCN). Therefore, similar to the JCN, the MDRG also depends on the subdivision of the range. Each node in a Reeb graph of the MDRG corresponds to a quantized contour and the adjacency between quantized contours is represented by an edge in the Reeb graph. Figure 1 shows the JCN and MDRG corresponding to a bivariate field. In the current paper, we compute persistence diagrams corresponding to Reeb graphs in the MDRG and propose a distance to compare the collection of persistence diagrams of two MDRGs based on the bottleneck distance between Reeb graphs 2014-Bauer-DistanceBetweenReebGraphs .

Refer to caption
Figure 1: (a) A piecewise linear bivariate data over a double torus: The black curves depict the contours of the first field (f1)(f_{1}) and the mesh is colored based on the values of the second field (f2)(f_{2}), (b) JCN at 5×55\times 5 resolution: the coloring of the nodes are is based on the values of the first field, (c) MDRG computed using the algorithm in 2016-Chattopadhyay-MultivariateTopologySimplification : the coloring of the nodes in the first (second) dimension is based on the values of f1f_{1} (f2f_{2})

3.3 Persistence Diagram

In this paper, we give a brief introduction to the notion of persistence diagrams and refer the readers to 2010-Edelsbrunner-book ; 2007-Zomorodian-ComputingPersistentHomology for further details on persistent homology.

Let ff be a continuous real-valued function defined on 𝕄\mathbb{M}. For a value aa\in\mathbb{R}, the sublevel set 𝕄a\mathbb{M}_{\leq a} consists of the points in 𝕄\mathbb{M} with ff-value less than or equal to aa, i.e. 𝕄a=f1(,a]\mathbb{M}_{\leq a}=f^{-1}(-\infty,a]. For aba\leq b, the ll-th homology groups of the sublevel sets 𝕄a\mathbb{M}_{\leq a} and 𝕄b\mathbb{M}_{\leq b} are connected by the inclusion map fla,b:Hl(𝕄a)Hl(𝕄b)f^{a,b}_{l}:H_{l}(\mathbb{M}_{\leq a})\rightarrow H_{l}(\mathbb{M}_{\leq b}). A value aa\in\mathbb{R} is a homological critical value of ff if l\exists l\in\mathbb{Z}^{*} such that flaδ,a+δf^{a-\delta,a+\delta}_{l} is not an isomorphism for all sufficiently small δ>0\delta>0, where \mathbb{Z}^{*} is the set of non-negative integers. We assume that ff is tame, i.e, the number of homological critical values of ff is finite and the homology groups Hl(𝕄a)H_{l}(\mathbb{M}_{\leq a}) are finite-dimensional l\forall l\in\mathbb{Z}^{*}. Here, aa is any homological critical value of ff. Let a0<a1<<aNa_{0}<a_{1}<...<a_{N} be the homological critical values of ff. In this paper, we consider homology with coefficients in 2\mathbb{Z}_{2}, which is the group of integers modulo 22. Therefore, Hl(𝕄ai)H_{l}(\mathbb{M}_{\leq_{a_{i}}}) is a vector space for 0iN0\leq i\leq N. We have the following sequence of vector spaces,

=Hl(𝕄a0)Hl(𝕄a1)Hl(𝕄aN)=Hl(𝕄).\emptyset=H_{l}(\mathbb{M}_{\leq a_{0}})\rightarrow H_{l}(\mathbb{M}_{\leq a_{1}})\rightarrow\cdots\rightarrow H_{l}(\mathbb{M}_{\leq a_{N}})=H_{l}(\mathbb{M}). (1)

where the homomorphisms flai,ai+1f_{l}^{a_{i},a_{i+1}} are induced by the inclusions 𝕄ai𝕄ai+1\mathbb{M}_{\leq a_{i}}\subseteq\mathbb{M}_{\leq a_{i+1}}. A homology class γ\gamma is born at aa if γHl(𝕄a)\gamma\in H_{l}(\mathbb{M}_{a}) but γIm flaδ,a\gamma\notin\text{Im }f_{l}^{a-\delta,a} for 0<δba0<\delta\leq b-a. Further, a class γ\gamma which is born at aa dies at bb if fla,bδ(γ)Im flaδ,bδf_{l}^{a,b-\delta}(\gamma)\notin\text{Im }f_{l}^{a-\delta,b-\delta} for any δ>0\delta>0, but fla,b(γ)Im flaδ,bf_{l}^{a,b}(\gamma)\in\text{Im }f_{l}^{a-\delta,b}. Such birth and death events are recorded by persistent homology. The ll-th ordinary persistence diagram is a multiset of points in ¯2\mathbb{\overline{R}}^{2}, where ¯={,+}\mathbb{\overline{R}}=\mathbb{R}\cup\{-\infty,+\infty\}. Each point (a,b)(a,b) in the llth ordinary persistence diagram corresponds to a ll-homology class which is born at aa and dies at bb. The multiplicity of a point (a,b)(a,b) with aba\leq b is defined in terms of the ranks of the homomorphism fla,bf_{l}^{a,b} and the points along the diagonal have infinite multiplicity.

In general, not all homology classes die during the sequence in equation (1). Such homology classes are is said to be essential. An essential homology class of dimension ll is denoted by a point (ai,)(a_{i},\infty) in the llth ordinary persistence diagram, where aia_{i} is the critical value corresponding to the birth of the homology class. By appending a sequence of relative homology groups to equation (1), we obtain the following sequence:

\displaystyle\emptyset =Hl(𝕄a0)Hl(𝕄a1)Hl(𝕄aN)=Hl(𝕄)\displaystyle=H_{l}(\mathbb{M}_{\leq a_{0}})\rightarrow H_{l}(\mathbb{M}_{\leq a_{1}})\rightarrow\cdots\rightarrow H_{l}(\mathbb{M}_{\leq a_{N}})=H_{l}(\mathbb{M})
=Hl(𝕄,𝕄aN)Hl(𝕄,𝕄aN1)\displaystyle=H_{l}(\mathbb{M},\mathbb{M}_{\geq a_{N}})\rightarrow H_{l}(\mathbb{M},\mathbb{M}_{\geq a_{N-1}})\rightarrow\cdots
Hl(𝕄,𝕄a0)=.\displaystyle\hskip 14.22636pt\cdots\rightarrow H_{l}(\mathbb{M},\mathbb{M}_{\geq a_{0}})=\emptyset. (2)

where 𝕄ai\mathbb{M}_{\geq a_{i}} denotes the super-level set of ff, 𝕄ai=f1[ai,)\mathbb{M}_{\geq a_{i}}=f^{-1}[a_{i},\infty) and Hl(𝕄,𝕄ai)H_{l}(\mathbb{M},\mathbb{M}_{\geq a_{i}}) is a relative homology group 2010-Edelsbrunner-book . Essential homology classes are created in the ordinary part and are destroyed in the relative part of the sequence in equation (2). The birth and death of essential homology classes are encoded by the extended persistence diagram. An essential homology class of dimension ll which is born at Hl(𝕄ai)H_{l}(\mathbb{M}_{\leq a_{i}}) and dies at Hl(𝕄,𝕄aj)H_{l}(\mathbb{M},\mathbb{M}_{\geq a_{j}}) is denoted by the point (ai,aj)(a_{i},a_{j}) in the llth extended persistence diagram.

3.4 Bottleneck Distance between Persistence Diagrams

The bottleneck distance between persistence diagrams XX and YY is defined as follows:

dB(X,Y)=infη:XYmaxxX𝐱η(x)d_{B}(X,Y)=\inf_{\eta:X\rightarrow Y}\max_{x\in X}\|\mathbf{x}-\eta(x)\|_{\infty} (3)

where η\eta ranges over bijections between XX and YY. For each point (a,b)(a,b) in XX, we add its nearest diagonal point (a+b2,a+b2)(\frac{a+b}{2},\frac{a+b}{2}) in YY and vice-versa. The number of points in XX and YY are now equal. To compute the bottleneck distance, a bijection η:XY\eta:X\rightarrow Y is constructed such that maxxXxη(x)\underset{x\in X}{\max}\|x-\eta(x)\|_{\infty} is minimized.

3.5 Reeb Graph and Persistence Diagram

Let ff be a real-valued continuous function defined on 𝕄\mathbb{M}. The Reeb graph of ff, denoted by 𝒢f\mathcal{RG}_{f}, is the quotient space of contours of ff. The function ff induces a real-valued function f¯:𝒢f\bar{f}:\mathcal{RG}_{f}\rightarrow\mathbb{R}, which takes each point in p𝒢fp\in\mathcal{RG}_{f} to the value of ff corresponding to its contour f~1(p)\tilde{f}^{-1}(p) in the domain, f=f¯f~f=\bar{f}\circ\tilde{f}.

The persistence of topological features of 𝒢f\mathcal{RG}_{f} are encoded in the persistence diagrams Dg0(𝒢f)\mathrm{Dg}_{0}(\mathcal{RG}_{f}), Dg0(𝒢f)\mathrm{Dg}_{0}(\mathcal{RG}_{-f}), ExDg0(𝒢f)\mathrm{ExDg}_{0}(\mathcal{RG}_{f}) and ExDg1(𝒢f)\mathrm{ExDg}_{1}(\mathcal{RG}_{f}). The 0-dimensional persistent homological features corresponding to the sub-level set and super-level set filtrations of ff are captured in Dg0(𝒢f)\mathrm{Dg}_{0}(\mathcal{RG}_{f}) and Dg0(𝒢f)\mathrm{Dg}_{0}(\mathcal{RG}_{-f}) respectively. ExDg0(𝒢f)\mathrm{ExDg}_{0}(\mathcal{RG}_{f}) encodes the range of ff and ExDg1(𝒢f)\mathrm{ExDg}_{1}(\mathcal{RG}_{f}) captures the 11-cycles or loops in 𝒢f\mathcal{RG}_{f}.

To compute the persistence diagrams of 𝒢f\mathcal{RG}_{f}, we require 𝒢f\mathcal{RG}_{f} to be the Reeb graph of a Morse function. If ff is a Morse function, i.e. all its critical points are non-degenerate and are at different levels, then the critical nodes of 𝒢f\mathcal{RG}_{f} have distinct values of f¯\bar{f} and belong to one of the following five-types: (i) a minimum (with down-degree =0=0, up-degree =1=1), (ii) a maximum (with up-degree =0=0, down-degree =1=1), (iii) a down-fork (with down-degree =2=2, up-degree =1=1) and (iv) a up-fork (with up-degree =2=2, down-degree =1=1). A regular node has up-degree = 11 and down-degree =1=1. A down-fork (similarly, up-fork) node is called an essential down-fork node when it contributes to a loop (cycle) of the Reeb graph. Otherwise it is called an ordinary down-fork node. To ensure that 𝒢f\mathcal{RG}_{f} is the Reeb graph of a Morse function, we first eliminate degenerate critical nodes in 𝒢f\mathcal{RG}_{f} by breaking them into non-degenerate critical nodes 2019-Tu-PropogateAndPair . After eliminating degenerate critical nodes, we ensure that the critical nodes of f¯\bar{f} are at different levels. If two critical nodes are at the same level, then the value of one of the nodes is increased/decreased by a small value ϵ\epsilon. After removing degenerate critical nodes and ensuring that critical nodes are at different levels, 𝒢f\mathcal{RG}_{f} becomes the Reeb graph of a Morse function.

The points in Dg0(f)\mathrm{Dg}_{0}(f) are computed by pairing ordinary down-forks with minima, ordinary up-forks with maxima and the global minimum with global maximum. Let uu be an ordinary down-fork of 𝒢f\mathcal{RG}_{f}. The two lower branches of uu correspond to two different components C1C_{1} and C2C_{2} in (𝒢f)f¯(u)(\mathcal{RG}_{f})_{\leq\bar{f}(u)}. Let x1x_{1} and x2x_{2} be the global minimum of C1C_{1} and C2C_{2} respectively. Let us assume that f¯(x1)<f¯(x2)\bar{f}(x_{1})<\bar{f}(x_{2}). Then a 0-dimensional homology class is born at x2x_{2} and dies at uu. x2x_{2} is paired with uu and point (f¯(x2),f¯(u))(\bar{f}(x_{2}),\bar{f}(u)) is created in Dg0(𝒢f)\mathrm{Dg}_{0}(\mathcal{RG}_{f}) (see Figure 2(b)). A symmetric procedure is applied on 𝒢f\mathcal{RG}_{-f} to obtain points in Dg0(𝒢f)\mathrm{Dg}_{0}(\mathcal{RG}_{-f}) by pairing up-forks with maxima. Let x,y𝒢fx,y\in\mathcal{RG}_{f} correspond to the global minimum and maximum of f¯\bar{f} respectively. We pair xx with yy, giving rise to the point (f¯(x),f¯(y))(\bar{f}(x),\bar{f}(y)) in ExDg0(𝒢f)\mathrm{ExDg}_{0}(\mathcal{RG}_{f}) (see Figure 2(c)).

We are interested in the persistent features which are born and die within the range of f¯\bar{f}. In Figure 2(b), the point (1,)Dg0(𝒢f)(1,\infty)\in\mathrm{Dg}_{0}(\mathcal{RG}_{f}) corresponds to the unique homology class born at the global minimum of ff and persists throughout the sub-level set filtration. However, the point (1,12)ExDg0(𝒢f)(1,12)\in\mathrm{ExDg}_{0}(\mathcal{RG}_{f}) encodes both the minimum and maximum of f¯\bar{f} (see Figure 2(c)). Therefore, Dg0(𝒢f)\mathrm{Dg}_{0}(\mathcal{RG}_{f}) is combined with ExDg0(𝒢f)\mathrm{ExDg}_{0}(\mathcal{RG}_{f}) to obtain PD0(𝒢f)\mathrm{PD}_{0}(\mathcal{RG}_{f}). This persistence diagram consists of the points in Dg0(𝒢f)\mathrm{Dg}_{0}(\mathcal{RG}_{f}) except the point with infinite persistence (1,)(1,\infty), instead the point (1,12)(1,12) in ExDg0(𝒢f)\mathrm{ExDg}_{0}(\mathcal{RG}_{f}) is included. Thus PD0(𝒢f):=Dg0(𝒢f)ExDg0(𝒢f){(1,)}\mathrm{PD}_{0}(\mathcal{RG}_{f}):=\mathrm{Dg}_{0}(\mathcal{RG}_{f})\cup\mathrm{ExDg}_{0}(\mathcal{RG}_{f})\setminus\{(1,\infty)\} (see Figure 2(d)).

To compute the points in the 11st extended persistence diagram ExDg1(𝒢f)\mathrm{ExDg}_{1}(\mathcal{RG}_{f}), we pair essential down-forks with essential up-forks. Let uu be an essential down-fork of 𝒢f\mathcal{RG}_{f}. Of all the cycles born at uu, let γ\gamma be a cycle having the largest minimum value of f¯\bar{f}. Let vγv\in\gamma be the node corresponding to the minimum on γ\gamma. It can be seen that vv is an essential up-fork 2006-Agarwal-ExtremeElevation . uu is paired with vv giving rise to the point (f¯(u),f¯(v))(\bar{f}(u),\bar{f}(v)) in ExDg1(f)\mathrm{ExDg}_{1}(f) (see Figure 2(e)). In this paper, we compute the persistence diagrams PD0\mathrm{PD}_{0} and ExDg1\mathrm{ExDg}_{1} of the Reeb graphs in an MDRG.

Refer to caption
Figure 2: (a) Reeb graph of a scalar field ff. (b) 0th ordinary persistence diagram Dg0(𝒢f)\mathrm{Dg}_{0}(\mathcal{RG}_{f}). (c) 0th extended persistence diagram ExDg0(𝒢f)\mathrm{ExDg}_{0}(\mathcal{RG}_{f}). (d) PD0(𝒢f):=Dg0(𝒢f)ExDg0(𝒢f){(1,)}\mathrm{PD}_{0}(\mathcal{RG}_{f}):=\mathrm{Dg}_{0}(\mathcal{RG}_{f})\cup\mathrm{ExDg}_{0}(\mathcal{RG}_{f})\setminus\{(1,\infty)\}. (e) 11st extended persistence diagram ExDg1(𝒢f)\mathrm{ExDg}_{1}(\mathcal{RG}_{f}). The points in ordinary (extended) persistence diagrams are denoted by circular points (squares)

Next, we provide a brief description of the Laplace Beltrami operator and show its utility in computing descriptors of a shape.

3.6 Laplace Beltrami Operator

Let \mathcal{M} be a Riemannian 22-manifold in 3\mathbb{R}^{3}. For a twice-differentiable function f:f:\mathcal{M}\rightarrow\mathbb{R}, the Laplace Beltrami (LB) operator Δ\Delta_{\mathcal{M}} is defined as follows:

Δf=div(grad f)\Delta_{\mathcal{M}}f=div(grad\text{ }f) (4)

where grad ff is the gradient of ff and div is the divergence on the manifold 1984-chavel-eigenvalues . Let ψ\psi be a diffeormorphism from an open set UU\subset\mathcal{M} to 2\mathbb{R}^{2} with

gij\displaystyle g_{ij} =iψ,jψ,G=(gij),\displaystyle=\langle\partial_{i}\psi,\partial_{j}\psi\rangle,G=(g_{ij}),
W\displaystyle W =|G|,(gij)=G1,\displaystyle=\sqrt{|G|},(g^{ij})=G^{-1},

where the matrix GG is a Riemannian tensor on \mathcal{M} and |G||G| is the determinant of GG. The LB operator can now be written as follows 1997-Rosenberg-Laplacian :

Δf=1Wi,j=12i(gijWjf).\Delta_{\mathcal{M}}f=\frac{1}{W}\sum_{i,j=1}^{2}\partial_{i}(g^{ij}W\partial_{j}f). (5)

The Riemannian metric gg determines the intrinsic properties of \mathcal{M}, which are independent of the embedding of \mathcal{M}. Further, since the definition of gg is based on inner products which are rotation invariant, gg is also invariant to rotations of \mathcal{M}.

3.6.1 Discretization of the Laplace Beltrami operator

Let 𝕄\mathbb{M} be a triangulation of the surface \mathcal{M}, with 𝒱={v1,v2,,vn}\mathcal{V}=\{v_{1},v_{2},...,v_{n}\} as the set of vertices. Two vertices in 𝕄\mathbb{M} are said to be adjacent if they are connected by an edge. The set of adjacent vertices of viv_{i} is denoted by N(vi)N(v_{i}). The LB operator at a vertex viv_{i} is written as follows:

Δ𝕄=1sivjN(vi)cotαij+cotβij2[f(vj)f(vi)].\Delta_{\mathbb{M}}=\frac{1}{s_{i}}\sum_{v_{j}\in N(v_{i})}\frac{\cot\alpha_{ij}+\cot\beta_{ij}}{2}[f(v_{j})-f(v_{i})]. (6)

where, sis_{i} is the area of the Voronoi cell (shaded region) shown in Figure 3.

Refer to caption
Figure 3: Definition of the angles αij\alpha_{ij} and βij\beta_{ij} and the area of the Voronoi cell (shaded region) corresponding to vertex viv_{i}

A weight function is defined as follows:

mij={cotαij+cotβij2if vi and vj are adjacent0otherwise.m_{ij}=\begin{cases}\frac{\cot\alpha_{ij}+\cot\beta_{ij}}{2}&\text{if $v_{i}$ and $v_{j}$ are adjacent}\\ 0&\text{otherwise.}\end{cases} (7)

When ff is restricted to the vertices of 𝕄\mathbb{M}, its discrete version is f\vec{f} with fi=f(vi)f_{i}=f(v_{i}). The discrete LB operator of ff is written as Δ𝕄fLf\Delta_{\mathbb{M}}f\approx L\vec{f}. The entries of the matrix LL are given by

Lij={kmiksi if i=j,mijsi if vi and vj are adjacent,0 otherwise.L_{ij}=\begin{cases}\sum_{k}\frac{m_{ik}}{s_{i}}\text{ if $i=j$},\\ -\frac{m_{ij}}{s_{i}}\text{ if $v_{i}$ and $v_{j}$ are adjacent},\\ 0\text{ otherwise.}\end{cases}

The eigenvalues and eigenfunctions of the discrete version of the LB operator are computed by solving the eigenvalue problem Lv=λvL\vec{v}=\lambda\vec{v}. We note, the matrix LL is not symmetric since sis_{i} may not be the same for all vertices of 𝕄\mathbb{M}. Therefore, the set of eigenvalues of this problem is not guaranteed to be real 2007-Rustamov-Laplace-Beltrami-Eigenfunctions . However, the matrix LL can be factorized as L=S1ML=S^{-1}M, where SS is a diagonal matrix with Sii=siS_{ii}=s_{i} and MM is defined as follows:

Mij={imij if i=j,mij if vi and vj are adjacent,0 otherwise.M_{ij}=\begin{cases}\sum_{i}m_{ij}\text{ if $i=j$},\\ -m_{ij}\text{ if $v_{i}$ and $v_{j}$ are adjacent},\\ 0\text{ otherwise.}\end{cases} (8)

The eigenvalue problem Lv=λvL\vec{v}=\lambda\vec{v} is rewritten as the generalized eigenvalue problem S1Mv=λvS^{-1}M\vec{v}=\lambda\vec{v}, or

Mv=λSv.M\vec{v}=\lambda S\vec{v}. (9)

Since the matrices SS and MM are symmetric and SS is positive-definite, the eigenvalues and eigenvectors are real. Further, the eigenvalues are non-negative and the eigenvectors are orthogonal in terms of SS-inner product:

u,vS=uTSv.\langle\vec{u},\vec{v}\rangle_{S}=\vec{u}^{T}S\vec{v}.

In this paper, we obtain the feature descriptors of a shape based on the eigenfunctions of LB operator.

4 Proposed Distance between MDRGs

In this section, we propose a distance measure between shapes by defining a distance measure between two MDRGs based on the bottleneck distance between Reeb graphs 2014-Bauer-DistanceBetweenReebGraphs . First we discuss our measure for bivariate fields.

4.1 Distance between Bivariate Fields

Let 𝐟=(f1,f2),𝐠=(g1,g2)\mathbf{f}=(f_{1},f_{2}),\,\mathbf{g}=(g_{1},g_{2}) be two piecewise linear bivariate fields defined on 𝕄\mathbb{M} and let 𝕄𝐟\mathbb{MR}_{\mathbf{f}} and 𝕄𝐠\mathbb{MR}_{\mathbf{g}} be the corresponding MDRGs. To compute the proposed distance between 𝕄𝐟\mathbb{MR}_{\mathbf{f}} and 𝕄𝐠\mathbb{MR}_{\mathbf{g}}, we compute a sum of bottleneck distances between the component Reeb graphs in each dimension of the MDRGs by considering all possible bijections between the component Reeb graphs.

In the first dimension of the MDRGs, there is only one Reeb graph corresponding to each of the fields f1f_{1} and g1g_{1}, denoted by 𝒢f1\mathcal{RG}_{f_{1}} and 𝒢g1\mathcal{RG}_{g_{1}}, respectively. Let, Range(f1)=[minx𝕄f1(x),maxx𝕄f1(x)]Range(f_{1})=[\underset{x\in\mathbb{M}}{\min}f_{1}(x),\underset{x\in\mathbb{M}}{\max}f_{1}(x)] and Range(g1)=[minx𝕄g1(x),maxx𝕄g1(x)]Range(g_{1})=[\underset{x\in\mathbb{M}}{\min}g_{1}(x),\underset{x\in\mathbb{M}}{\max}g_{1}(x)] be the ranges of the functions f1f_{1} and g1g_{1}. We define R(f1,g1)=Range(f1)Range(g1)R(f_{1},g_{1})=Range(f_{1})\cup Range(g_{1}) as the union of the ranges of f1f_{1} and g1g_{1}. Now for each cR(f1,g1)c\in R(f_{1},g_{1}), in the second dimension, the MDRGs 𝕄𝐟\mathbb{MR}_{\mathbf{f}} and 𝕄𝐠\mathbb{MR}_{\mathbf{g}} may have multiple Reeb graphs corresponding to f2f_{2} and g2g_{2} restricted on f11(c)f_{1}^{-1}(c) and g11(c)g_{1}^{-1}(c), respectively. Let 𝕊f1c={𝒢f2p~|f1¯(p)=c}\mathbb{S}_{f_{1}}^{c}=\{\mathcal{RG}_{\widetilde{f_{2}^{p}}}|\bar{f_{1}}(p)=c\} be the set of Reeb graphs from the second dimension of 𝕄𝐟\mathbb{MR}_{\mathbf{f}} and 𝕊g1c={𝒢g2p~|g1¯(p)=c}\mathbb{S}_{g_{1}}^{c}=\{\mathcal{RG}_{\widetilde{g_{2}^{p}}}|\bar{g_{1}}(p)=c\} be the set of Reeb graphs from the second dimension of 𝕄𝐠\mathbb{MR}_{\mathbf{g}}. We consider all possible bijections ψc\psi_{c} between 𝕊f1c\mathbb{S}_{f_{1}}^{c} and 𝕊g1c\mathbb{S}_{g_{1}}^{c}. Then we define a distance between 𝕄𝐟\mathbb{MR}_{\mathbf{f}} and 𝕄𝐠\mathbb{MR}_{\mathbf{g}} as follows.

d𝕄(𝕄𝐟,𝕄𝐠)=dB(Dg(𝒢f1),Dg(𝒢g1))+1|R(f1,g1)|cR(f1,g1)infψc:𝕊f1c𝕊g1csup𝒢𝕊f1cdB(Dg(𝒢),Dg(ψc(𝒢)))d_{\mathbb{MR}}\left(\mathbb{MR}_{\mathbf{f}},\mathbb{MR}_{\mathbf{g}}\right)=d_{B}\left(\mathrm{Dg}\left(\mathcal{RG}_{f_{1}}\right),\mathrm{Dg}\left(\mathcal{RG}_{g_{1}}\right)\right)+\\ \frac{1}{|R(f_{1},g_{1})|}\underset{c\in R(f_{1},g_{1})}{\int}\underset{\psi_{c}:\mathbb{S}_{f_{1}}^{c}\rightarrow\mathbb{S}_{g_{1}}^{c}}{\inf}\sup_{\mathcal{RG}\in\mathbb{S}_{f_{1}}^{c}}d_{B}\bigl{(}\mathrm{Dg}\left(\mathcal{RG}\right),\\ \hskip 170.71652pt\mathrm{Dg}\left(\psi_{c}\left(\mathcal{RG}\right)\right)\bigr{)}\\ (10)

where Dg(𝒢)\mathrm{Dg}(\mathcal{RG}) is a persistence diagram of the Reeb graph 𝒢\mathcal{RG}, ψc\psi_{c} ranges over bijections between 𝕊f1c\mathbb{S}_{f_{1}}^{c} and 𝕊g1c\mathbb{S}_{g_{1}}^{c}. If Range(f1)Range(g1)Range(f_{1})\cap Range(g_{1})\neq\emptyset, then |R(f1,g1)||R(f_{1},g_{1})| is the length of the interval R(f1,g1)R(f_{1},g_{1}), otherwise |R(f1,g1)|=|Range(f1)|+|Range(g1)||R(f_{1},g_{1})|=|Range(f_{1})|+|Range(g_{1})|.

In the current paper, corresponding to each of the Reeb graph in an MDRG, we construct two persistence diagrams, namely PD0\mathrm{PD}_{0} and ExDg1\mathrm{ExDg}_{1}, as discussed in Section 3.5. We compute the distance between MDRGs defined in equation (LABEL:eqn:bottleneck-distance-mdrgs) based on PD0\mathrm{PD}_{0} and ExDg1\mathrm{ExDg}_{1}, which are denoted by d𝕄0(𝕄𝐟,𝕄𝐠)d_{\mathbb{MR}}^{0}(\mathbb{MR}_{\mathbf{f}},\mathbb{MR}_{\mathbf{g}}) and d𝕄1(𝕄𝐟,𝕄𝐠)d_{\mathbb{MR}}^{1}(\mathbb{MR}_{\mathbf{f}},\mathbb{MR}_{\mathbf{g}}), respectively. We consider the proposed distance measure between 𝕄𝐟\mathbb{MR}_{\mathbf{f}} and 𝕄𝐠\mathbb{MR}_{\mathbf{g}} as the weighted sum of d𝕄0(𝕄𝐟,𝕄𝐠)d_{\mathbb{MR}}^{0}\left(\mathbb{MR}_{\mathbf{f}},\mathbb{MR}_{\mathbf{g}}\right), d𝕄0(𝕄𝐟,𝕄𝐠)d_{\mathbb{MR}}^{0}\left(\mathbb{MR}_{-\mathbf{f}},\mathbb{MR}_{-\mathbf{g}}\right) and d𝕄1(𝕄𝐟,𝕄𝐠)d_{\mathbb{MR}}^{1}\left(\mathbb{MR}_{\mathbf{f}},\mathbb{MR}_{\mathbf{g}}\right).

dT(𝕄𝐟,𝕄𝐠)=w0d𝕄0(𝕄𝐟,𝕄𝐠)+w1d𝕄0(𝕄𝐟,𝕄𝐠)+w2d𝕄1(𝕄𝐟,𝕄𝐠)d_{T}(\mathbb{MR}_{\mathbf{f}},\mathbb{MR}_{\mathbf{g}})=w_{0}\cdot d_{\mathbb{MR}}^{0}\left(\mathbb{MR}_{\mathbf{f}},\mathbb{MR}_{\mathbf{g}}\right)\\ +w_{1}\cdot d_{\mathbb{MR}}^{0}\left(\mathbb{MR}_{-\mathbf{f}},\mathbb{MR}_{-\mathbf{g}}\right)+w_{2}\cdot d_{\mathbb{MR}}^{1}\left(\mathbb{MR}_{\mathbf{f}},\mathbb{MR}_{\mathbf{g}}\right) (11)

where, 0w0,w1,w210\leq w_{0},w_{1},w_{2}\leq 1 and w0+w1+w2=1w_{0}+w_{1}+w_{2}=1. Here, d𝕄0(𝕄𝐟,𝕄𝐠)d_{\mathbb{MR}}^{0}\left(\mathbb{MR}_{\mathbf{f}},\mathbb{MR}_{\mathbf{g}}\right) and d𝕄0(𝕄𝐟,𝕄𝐠)d_{\mathbb{MR}}^{0}\left(\mathbb{MR}_{-\mathbf{f}},\mathbb{MR}_{-\mathbf{g}}\right) respectively compute the distance between persistence diagrams of sub-level set and super-level set filtrations of the Reeb graphs in the MDRGs and d𝕄1(𝕄𝐟,𝕄𝐠)d_{\mathbb{MR}}^{1}\left(\mathbb{MR}_{\mathbf{f}},\mathbb{MR}_{\mathbf{g}}\right) is the distance between the 11-st extended persistence diagrams of the Reeb graphs in 𝕄𝐟\mathbb{MR}_{\mathbf{f}} and 𝕄𝐠\mathbb{MR}_{\mathbf{g}} respectively. Next, we discuss how to compute the proposed distance measure for quantized fields.

4.1.1 Computational Aspects for Quantized Fields

We note, the MDRG of a bivariate field can be computed from the corresponding JCN, which is a quantized approximation of the Reeb space. For constructing the JCN of a bivariate field, the range of each of the component scalar fields is quantized or subdivided into a finite number of slabs. Algorithm 1 outlines the computation of the persistence diagrams by computing an MDRG corresponding to a bivariate field 𝐟=(f1,f2)\mathbf{f}=(f_{1},f_{2}) and quantization levels q1q_{1} and q2q_{2}. ConstructJCN (in line 2) computes the JCN of 𝐟\mathbf{f} corresponding to q1q_{1} and q2q_{2}. ConstructMDRG (in line 33) constructs the MDRG from the JCN. Finally, ComputePDReebGraph computes the persistence diagram corresponding to a component Reeb graph (lines 66-88).

Algorithm 1 ComputePDsMDRG

Input: 𝐟=(f1,f2),q1,q2\mathbf{f}=(f_{1},f_{2}),\;q_{1},\;q_{2}
Output: Persistence diagrams of the component Reeb graphs in the MDRG

1:  % Compute the JCN and MDRG of the bivariate field
2:  JCN𝐟\mathrm{JCN}_{\mathbf{f}} = ConstructJCN(𝐟,q1,q2)(\mathbf{f},q_{1},q_{2})
3:  𝕄𝐟\mathbb{MR}_{\mathbf{f}} = ConstructMDRG(JCN𝐟)(\mathrm{JCN}_{\mathbf{f}})
4:  % Compute the persistence diagrams of the component Reeb graphs in the MDRG
5:  Dg(𝒢f1)\mathrm{Dg}(\mathcal{RG}_{f_{1}}) = ComputePDReebGraph(𝒢f1\mathcal{RG}_{f_{1}})
6:  for p𝒢f1p\in\mathcal{RG}_{f_{1}} do
7:     Dg(𝒢f2p~)\mathrm{Dg}(\mathcal{RG}_{\widetilde{f_{2}^{p}}}) = ComputePDReebGraph(𝒢f2p~\mathcal{RG}_{\widetilde{f_{2}^{p}}})
8:  end for
9:  return  {Dg(𝒢f1),{Dg(𝒢f2p~)|p𝒢f1}}\left\{\mathrm{Dg}(\mathcal{RG}_{f_{1}}),\left\{\mathrm{Dg}(\mathcal{RG}_{\widetilde{f_{2}^{p}}})\middle|p\in\mathcal{RG}_{f_{1}}\right\}\right\}

Algorithm 2 gives the pseudo-code for computing the distance between two multi-fields using the proposed distance measure. Let 𝐟=(f1,g2)\mathbf{f}=(f_{1},g_{2}) and 𝐠=(g1,g2)\mathbf{g}=(g_{1},g_{2}) be two bivariate fields defined on a compact dd-manifold 𝕄\mathbb{M} with d2d\geq 2. Let q1q_{1} and q2q_{2} be the number of slabs (quantization levels) into which R(f1,g1)R(f_{1},g_{1}) and R(f2,g2)R(f_{2},g_{2}) are subdivided respectively, for constructing the JCNs of 𝐟\mathbf{f} and 𝐠\mathbf{g}. Then let c1,c2,,cq1c_{1},c_{2},\ldots,c_{q_{1}} be the quantized range values corresponding to the subdivision of R(f1,g1)R(f_{1},g_{1}). The distance between MDRGs defined in equation (LABEL:eqn:bottleneck-distance-mdrgs) can be written as:

d𝕄(𝕄𝐟,𝕄𝐠)=dB(Dg(𝒢f1),Dg(𝒢g1))+1q11iq1infψci:𝕊f1ci𝕊g1cisup𝒢𝕊f1cidB(Dg(𝒢),Dg(ψci(𝒢)))d_{\mathbb{MR}}\left(\mathbb{MR}_{\mathbf{f}},\mathbb{MR}_{\mathbf{g}}\right)=d_{B}\left(\mathrm{Dg}\left(\mathcal{RG}_{f_{1}}\right),\mathrm{Dg}\left(\mathcal{RG}_{g_{1}}\right)\right)+\\ \frac{1}{q_{1}}\underset{1\leq i\leq q_{1}}{\sum}\underset{\psi_{c_{i}}:\mathbb{S}_{f_{1}}^{c_{i}}\rightarrow\mathbb{S}_{g_{1}}^{c_{i}}}{\inf}\sup_{\mathcal{RG}\in\mathbb{S}_{f_{1}}^{c_{i}}}d_{B}\left(\mathrm{Dg}\left(\mathcal{RG}\right),\mathrm{Dg}\left(\psi_{c_{i}}\left(\mathcal{RG}\right)\right)\right) (12)

where ψci\psi_{c_{i}} ranges over all possible bijections between 𝕊f1ci\mathbb{S}_{f_{1}}^{c_{i}} and 𝕊g1ci\mathbb{S}_{g_{1}}^{c_{i}}. Since the number of Reeb graphs in 𝕊f1ci\mathbb{S}_{f_{1}}^{c_{i}} and 𝕊g1ci\mathbb{S}_{g_{1}}^{c_{i}} need not be equal, it may not be possible to construct proper bijections between 𝕊f1ci\mathbb{S}_{f_{1}}^{c_{i}} and 𝕊g1ci\mathbb{S}_{g_{1}}^{c_{i}}. Therefore, we introduce dummy Reeb graphs (without vertices and edges) in 𝕊f1ci\mathbb{S}_{f_{1}}^{c_{i}} or 𝕊g1ci\mathbb{S}_{g_{1}}^{c_{i}} to make the cardinality of 𝕊f1c\mathbb{S}_{f_{1}}^{c} and 𝕊g1ci\mathbb{S}_{g_{1}}^{c_{i}} equal. Then the optimal bijection ψci\psi_{c_{i}} is constructed using the Hungarian algorithm by computing a cost matrix 1955-Kuhn-Hungarian-Algorithm (lines 88-1212 of Algorithm 2). We note, the bijection ψci\psi_{c_{i}} can map a Reeb graph in 𝕊f1ci\mathbb{S}_{f_{1}}^{c_{i}} to a dummy Reeb graph. The persistence diagram corresponding to a dummy Reeb graph is empty, i.e, it does not contain any point. The bottleneck distance between persistence diagrams is computed as described in Section 3.4.

Algorithm 2 ComputeDistance

Input: 𝐟,𝐠,q1,q2\mathbf{f},\;\mathbf{g},\;q_{1},\;q_{2}
Output: d𝕄(𝕄𝐟,𝕄𝐠)d_{\mathbb{MR}}(\mathbb{MR}_{\mathbf{f}},\mathbb{MR}_{\mathbf{g}})

1:  % Compute the Persistence Diagrams
2:  PD𝐟PD_{\mathbf{f}} = ComputePDsMDRG(𝐟,q1,q2\mathbf{f},q_{1},q_{2})
3:  PD𝐠PD_{\mathbf{g}} = ComputePDsMDRG(𝐠,q1,q2\mathbf{g},q_{1},q_{2})
4:  % Compute the distance between MDRGs
5:  Range(f1)Range(f_{1}) = [minx𝕄f1(x),maxx𝕄f1(x)][\underset{x\in\mathbb{M}}{\min}f_{1}(x),\underset{x\in\mathbb{M}}{\max}f_{1}(x)]
6:  Range(g1)Range(g_{1}) = [minx𝕄g1(x),maxx𝕄g1(x)][\underset{x\in\mathbb{M}}{\min}g_{1}(x),\underset{x\in\mathbb{M}}{\max}g_{1}(x)]
7:  R(f1,g1)R(f_{1},g_{1}) = Range(f1)Range(g1)Range(f_{1})\cup Range(g_{1})
8:  for i=1,2,,q1i=1,2,\ldots,q_{1} do
9:     𝕊f1ci\mathbb{S}_{f_{1}}^{c_{i}} = {𝒢f2p~|f1¯(p)=ci}\{\mathcal{RG}_{\widetilde{f_{2}^{p}}}|\bar{f_{1}}(p)=c_{i}\}
10:     𝕊g1ci\mathbb{S}_{g_{1}}^{c_{i}} = {𝒢g2p~|g1¯(p)=ci}\{\mathcal{RG}_{\widetilde{g_{2}^{p}}}|\bar{g_{1}}(p)=c_{i}\}
11:     Compute optimal bijection ψci\psi_{c_{i}} between elements of 𝕊f1ci\mathbb{S}_{f_{1}}^{c_{i}} and 𝕊g1ci\mathbb{S}_{g_{1}}^{c_{i}} (by computing the bottleneck distances between the corresponding persistence diagrams in PD𝐟PD_{\mathbf{f}} and PD𝐠PD_{\mathbf{g}}), using the Hungarian algorithm
12:  end for
13:  d𝕄(𝕄𝐟,𝕄𝐠)\displaystyle d_{\mathbb{MR}}\left(\mathbb{MR}_{\mathbf{f}},\mathbb{MR}_{\mathbf{g}}\right) = dB(Dg(𝒢f1),Dg(𝒢g1))+d_{B}\left(\mathrm{Dg}\left(\mathcal{RG}_{f_{1}}\right),\mathrm{Dg}\left(\mathcal{RG}_{g_{1}}\right)\right)+1q11iq1sup𝒢𝕊f1cidB(Dg(𝒢),Dg(ψci(𝒢)))\hskip 39.83368pt\displaystyle\frac{1}{q_{1}}\underset{1\leq i\leq q_{1}}{\sum}\sup_{\mathcal{RG}\in\mathbb{S}_{f_{1}}^{c_{i}}}d_{B}\left(\mathrm{Dg}\left(\mathcal{RG}\right),\mathrm{Dg}\left(\psi_{c_{i}}\left(\mathcal{RG}\right)\right)\right)
14:  return  d𝕄(𝕄𝐟,𝕄𝐠)d_{\mathbb{MR}}\left(\mathbb{MR}_{\mathbf{f}},\mathbb{MR}_{\mathbf{g}}\right)

4.2 Properties of the Proposed Distance Measure

In this sub-section, we discuss the properties of the proposed distance between MDRGs. Let 𝐟=(f1,f2),𝐠=(g1,g2)\mathbf{f}=(f_{1},f_{2}),\mathbf{g}=(g_{1},g_{2}) be two bivariate fields. First, we show that dT(𝕄𝐟,𝕄𝐠)d_{T}(\mathbb{MR}_{\mathbf{f}},\mathbb{MR}_{\mathbf{g}}) is a pseudo-metric.

Theorem 4.1

dTd_{T} satisfies the properties of a pseudo-metric.

Proof

Non-negativity: For two MDRGs 𝕄𝐟\mathbb{MR}_{\mathbf{f}} and 𝕄𝐠\mathbb{MR}_{\mathbf{g}}, we have dT(𝕄𝐟,𝕄𝐠)0d_{T}(\mathbb{MR}_{\mathbf{f}},\mathbb{MR}_{\mathbf{g}})\geq 0. Thus the non-negativity property is satisfied.

Symmetry: For two MDRGs 𝕄𝐟\mathbb{MR}_{\mathbf{f}} and 𝕄𝐠\mathbb{MR}_{\mathbf{g}}, we have dT(𝕄𝐟,𝕄𝐠)=dT(𝕄𝐠,𝕄𝐟)d_{T}(\mathbb{MR}_{\mathbf{f}},\mathbb{MR}_{\mathbf{g}})=d_{T}(\mathbb{MR}_{\mathbf{g}},\mathbb{MR}_{\mathbf{f}}). Thus the symmetry property holds.

Identity: Bauer etal.etal. 2014-Bauer-DistanceBetweenReebGraphs showed that two distinct Reeb graphs can have the same persistence diagram and therefore the bottleneck distance between Reeb graphs is a pseudo-metric. Since we compute dT(𝕄𝐟,𝕄𝐠)d_{T}(\mathbb{MR}_{\mathbf{f}},\mathbb{MR}_{\mathbf{g}}) by calculating the bottleneck distance between the component Reeb graphs of the MDRGs 𝕄𝐟\mathbb{MR}_{\mathbf{f}} and 𝕄𝐠\mathbb{MR}_{\mathbf{g}}, it follows that the bottleneck distance between two non-identical MDRGs 𝕄𝐟\mathbb{MR}_{\mathbf{f}} and 𝕄𝐠\mathbb{MR}_{\mathbf{g}} can be zero. Therefore, the identity property does not hold.

Triangle inequality: Let us consider three MDRGs 𝕄𝐟,𝕄𝐠\mathbb{MR}_{\mathbf{f}},\mathbb{MR}_{\mathbf{g}} and 𝕄𝐡\mathbb{MR}_{\mathbf{h}}, corresponding to bivariate fields 𝐟=(f1,f2),𝐠=(g1,g2)\mathbf{f}=(f_{1},f_{2}),\mathbf{g}=(g_{1},g_{2}) and 𝐡=(h1,h2)\mathbf{h}=(h_{1},h_{2}), respectively. We first prove the triangle inequality property for d𝕄0d_{\mathbb{MR}}^{0} between MDRGs. We need to show that d𝕄0(𝕄𝐟,𝕄𝐡)d𝕄0(𝕄𝐟,𝕄𝐠)+d𝕄0(𝕄𝐠,𝕄𝐡)d_{\mathbb{MR}}^{0}(\mathbb{MR}_{\mathbf{f}},\mathbb{MR}_{\mathbf{h}})\leq d_{\mathbb{MR}}^{0}(\mathbb{MR}_{\mathbf{f}},\mathbb{MR}_{\mathbf{g}})+d_{\mathbb{MR}}^{0}(\mathbb{MR}_{\mathbf{g}},\mathbb{MR}_{\mathbf{h}}).

For cR(f1,g1)c\in R(f_{1},g_{1}), let ψc\psi_{c}^{\prime} be the bijection between 𝕊f1c\mathbb{S}_{f_{1}}^{c} and 𝕊g1c\mathbb{S}_{g_{1}}^{c} achieving d𝕄0(𝕄𝐟,𝕄𝐠)d_{\mathbb{MR}}^{0}(\mathbb{MR}_{\mathbf{f}},\mathbb{MR}_{\mathbf{g}}). Similarly, let ψc′′\psi_{c}^{\prime\prime} be the bijection between 𝕊g1c\mathbb{S}_{g_{1}}^{c} and 𝕊h1c\mathbb{S}_{h_{1}}^{c} achieving d𝕄0(𝕄𝐠,𝕄𝐡)d_{\mathbb{MR}}^{0}(\mathbb{MR}_{\mathbf{g}},\mathbb{MR}_{\mathbf{h}}). Let ψc′′′\psi_{c}^{\prime\prime\prime} be the bijection between 𝕊f1c\mathbb{S}_{f_{1}}^{c} and 𝕊h1c\mathbb{S}_{h_{1}}^{c} obtained by the composition of ψc′′\psi_{c}^{\prime\prime} and ψc\psi_{c}^{\prime}, i.e, ψc′′′=ψc′′ψc\psi_{c}^{\prime\prime\prime}=\psi_{c}^{\prime\prime}\circ\psi_{c}^{\prime}.

d𝕄0(𝕄𝐟,𝕄𝐡)\displaystyle d_{\mathbb{MR}}^{0}\left(\mathbb{MR}_{\mathbf{f}},\mathbb{MR}_{\mathbf{h}}\right)
=dB(PD0(𝒢f1),PD0(𝒢h1))+\displaystyle=d_{B}\left(\mathrm{PD}_{0}\left(\mathcal{RG}_{f_{1}}\right),\mathrm{PD}_{0}\left(\mathcal{RG}_{h_{1}}\right)\right)+
1|R(f1,h1)|cR(f1,h1)infψc:𝕊f1c𝕊h1csup𝒢𝕊f1cdB(PD0(𝒢),\displaystyle\frac{1}{|R(f_{1},h_{1})|}\underset{c\in R(f_{1},h_{1})}{\int}\underset{\psi_{c}:\mathbb{S}_{f_{1}}^{c}\rightarrow\mathbb{S}_{h_{1}}^{c}}{\inf}\sup_{\mathcal{RG}\in\mathbb{S}_{f_{1}}^{c}}d_{B}\bigl{(}\mathrm{PD}_{0}\left(\mathcal{RG}\right),
PD0(ψc(𝒢)))\displaystyle\hskip 170.71652pt\mathrm{PD}_{0}\left(\psi_{c}\left(\mathcal{RG}\right)\right)\bigr{)}
dB(PD0(𝒢f1),PD0(𝒢h1))+\displaystyle\leq d_{B}\left(\mathrm{PD}_{0}\left(\mathcal{RG}_{f_{1}}\right),\mathrm{PD}_{0}\left(\mathcal{RG}_{h_{1}}\right)\right)+
1|R(f1,h1)|cR(f1,h1)sup𝒢𝕊f1cdB(PD0(𝒢),\displaystyle\hskip 11.38092pt\frac{1}{|R(f_{1},h_{1})|}\underset{c\in R(f_{1},h_{1})}{\int}\sup_{\mathcal{RG}\in\mathbb{S}_{f_{1}}^{c}}d_{B}\bigl{(}\mathrm{PD}_{0}\left(\mathcal{RG}\right),
PD0(ψc′′′(𝒢)))\displaystyle\hskip 142.26378pt\mathrm{PD}_{0}\left(\psi_{c}^{\prime\prime\prime}\left(\mathcal{RG}\right)\right)\bigr{)}
dB(PD0(𝒢f1),PD0(𝒢g1))+dB(PD0(𝒢g1),\displaystyle\leq d_{B}\left(\mathrm{PD}_{0}\left(\mathcal{RG}_{f_{1}}\right),\mathrm{PD}_{0}\left(\mathcal{RG}_{g_{1}}\right)\right)+d_{B}\bigl{(}\mathrm{PD}_{0}\left(\mathcal{RG}_{g_{1}}\right),
PD0(𝒢h1))+\displaystyle\hskip 162.18062pt\mathrm{PD}_{0}\left(\mathcal{RG}_{h_{1}}\right)\bigr{)}+
1|R(f1,h1)|cR(f1,h1)sup𝒢𝕊f1c(dB(PD0(𝒢),\displaystyle\hskip 11.38092pt\frac{1}{|R(f_{1},h_{1})|}\underset{c\in R(f_{1},h_{1})}{\int}\sup_{\mathcal{RG}\in\mathbb{S}_{f_{1}}^{c}}\Bigg{(}d_{B}\bigl{(}\mathrm{PD}_{0}\left(\mathcal{RG}\right),
PD0(ψc(𝒢)))+\displaystyle\hskip 142.26378pt\mathrm{PD}_{0}\left(\psi_{c}^{\prime}\left(\mathcal{RG}\right)\right)\bigr{)}+
dB(ψc(PD0(𝒢)),PD0(ψc′′ψc(𝒢))))\displaystyle\hskip 56.9055ptd_{B}\left(\psi_{c}^{\prime}\left(\mathrm{PD}_{0}\left(\mathcal{RG}\right)\right),\mathrm{PD}_{0}\left(\psi_{c}^{\prime\prime}\circ\psi_{c}^{\prime}\left(\mathcal{RG}\right)\right)\right)\Bigg{)}
(since dBd_{B} between Reeb graphs is a pseudo-metric)
dB(PD0(𝒢f1),PD0(𝒢g1))+dB(PD0(𝒢g1),\displaystyle\leq d_{B}\left(\mathrm{PD}_{0}\left(\mathcal{RG}_{f_{1}}\right),\mathrm{PD}_{0}\left(\mathcal{RG}_{g_{1}}\right)\right)+d_{B}\bigl{(}\mathrm{PD}_{0}\left(\mathcal{RG}_{g_{1}}\right),
PD0(𝒢h1))+\displaystyle\hskip 162.18062pt\mathrm{PD}_{0}\left(\mathcal{RG}_{h_{1}}\right)\bigr{)}+
1|R(f1,g1)|cR(f1,g1)sup𝒢𝕊f1cdB(PD0(𝒢),\displaystyle\hskip 11.38092pt\frac{1}{|R(f_{1},g_{1})|}\underset{c\in R(f_{1},g_{1})}{\int}\sup_{\mathcal{RG}\in\mathbb{S}_{f_{1}}^{c}}d_{B}\bigl{(}\mathrm{PD}_{0}\left(\mathcal{RG}\right),
PD0(ψc(𝒢)))+\displaystyle\hskip 142.26378pt\mathrm{PD}_{0}\left(\psi_{c}^{\prime}\left(\mathcal{RG}\right)\right)\bigr{)}+
1|R(g1,h1)|cR(g1,h1)sup𝒢𝕊g1cdB(PD0(𝒢),\displaystyle\hskip 11.38092pt\frac{1}{|R(g_{1},h_{1})|}\underset{c\in R(g_{1},h_{1})}{\int}\sup_{\mathcal{RG}\in\mathbb{S}_{g_{1}}^{c}}d_{B}\bigl{(}\mathrm{PD}_{0}\left(\mathcal{RG}\right),
PD0(ψc′′(𝒢)))\displaystyle\hskip 142.26378pt\mathrm{PD}_{0}\left(\psi_{c}^{\prime\prime}\left(\mathcal{RG}\right)\right)\bigr{)}
=d𝕄0(𝕄𝐟,𝕄𝐠)+d𝕄0(𝕄𝐠,𝕄𝐡).\displaystyle=d_{\mathbb{MR}}^{0}(\mathbb{MR}_{\mathbf{f}},\mathbb{MR}_{\mathbf{g}})+d_{\mathbb{MR}}^{0}(\mathbb{MR}_{\mathbf{g}},\mathbb{MR}_{\mathbf{h}}).

Using a similar argument, we can prove the triangle inequality for d𝕄1d_{\mathbb{MR}}^{1}. We now show the triangle inequality property for dT(𝕄𝐟,𝕄𝐡)d_{T}(\mathbb{MR}_{\mathbf{f}},\mathbb{MR}_{\mathbf{h}}).

dT(𝕄𝐟,𝕄𝐡)\displaystyle d_{T}(\mathbb{MR}_{\mathbf{f}},\mathbb{MR}_{\mathbf{h}})
=w0d𝕄0(𝕄𝐟,𝕄𝐡)+w1d𝕄0(𝕄𝐟,𝕄𝐡)\displaystyle=w_{0}\cdot d_{\mathbb{MR}}^{0}\left(\mathbb{MR}_{\mathbf{f}},\mathbb{MR}_{\mathbf{h}}\right)+w_{1}\cdot d_{\mathbb{MR}}^{0}\left(\mathbb{MR}_{-\mathbf{f}},\mathbb{MR}_{-\mathbf{h}}\right)
+w2d𝕄1(𝕄𝐟,𝕄𝐡)\displaystyle\hskip 85.35826pt+w_{2}\cdot d_{\mathbb{MR}}^{1}\left(\mathbb{MR}_{\mathbf{f}},\mathbb{MR}_{\mathbf{h}}\right)
w0(d𝕄0(𝕄𝐟,𝕄𝐠)+d𝕄0(𝕄𝐠,𝕄𝐡))+\displaystyle\leq w_{0}\left(d_{\mathbb{MR}}^{0}\left(\mathbb{MR}_{\mathbf{f}},\mathbb{MR}_{\mathbf{g}}\right)+d_{\mathbb{MR}}^{0}\left(\mathbb{MR}_{\mathbf{g}},\mathbb{MR}_{\mathbf{h}}\right)\right)+
w1(d𝕄0(𝕄𝐟,𝕄𝐠)+d𝕄0(𝕄𝐠,𝕄𝐡))+\displaystyle\hskip 11.38092ptw_{1}\left(d_{\mathbb{MR}}^{0}\left(\mathbb{MR}_{-\mathbf{f}},\mathbb{MR}_{-\mathbf{g}}\right)+d_{\mathbb{MR}}^{0}\left(\mathbb{MR}_{-\mathbf{g}},\mathbb{MR}_{-\mathbf{h}}\right)\right)+
w2(d𝕄1(𝕄𝐟,𝕄𝐠)+d𝕄1(𝕄𝐠,𝕄𝐡))\displaystyle\hskip 11.38092ptw_{2}\left(d_{\mathbb{MR}}^{1}\left(\mathbb{MR}_{\mathbf{f}},\mathbb{MR}_{\mathbf{g}}\right)+d_{\mathbb{MR}}^{1}\left(\mathbb{MR}_{\mathbf{g}},\mathbb{MR}_{\mathbf{h}}\right)\right)
=w0d𝕄0(𝕄𝐟,𝕄𝐠)+w1d𝕄0(𝕄𝐟,𝕄𝐠)\displaystyle=w_{0}\cdot d_{\mathbb{MR}}^{0}\left(\mathbb{MR}_{\mathbf{f}},\mathbb{MR}_{\mathbf{g}}\right)+w_{1}\cdot d_{\mathbb{MR}}^{0}\left(\mathbb{MR}_{-\mathbf{f}},\mathbb{MR}_{-\mathbf{g}}\right)
+w2d𝕄1(𝕄𝐟,𝕄𝐠)+w0d𝕄0(𝕄𝐠,𝕄𝐡)\displaystyle\hskip 11.38092pt+w_{2}\cdot d_{\mathbb{MR}}^{1}\left(\mathbb{MR}_{\mathbf{f}},\mathbb{MR}_{\mathbf{g}}\right)+w_{0}\cdot d_{\mathbb{MR}}^{0}\left(\mathbb{MR}_{\mathbf{g}},\mathbb{MR}_{\mathbf{h}}\right)
+w1d𝕄0(𝕄𝐠,𝕄𝐡)+w2d𝕄1(𝕄𝐠,𝕄𝐡)\displaystyle\hskip 11.38092pt+w_{1}\cdot d_{\mathbb{MR}}^{0}\left(\mathbb{MR}_{-\mathbf{g}},\mathbb{MR}_{-\mathbf{h}}\right)+w_{2}\cdot d_{\mathbb{MR}}^{1}\left(\mathbb{MR}_{\mathbf{g}},\mathbb{MR}_{\mathbf{h}}\right)
=dT(𝕄𝐟,𝕄𝐠)+dT(𝕄𝐠,𝕄𝐡).\displaystyle=d_{T}(\mathbb{MR}_{\mathbf{f}},\mathbb{MR}_{\mathbf{g}})+d_{T}(\mathbb{MR}_{\mathbf{g}},\mathbb{MR}_{\mathbf{h}}).

Next, we prove the stability of the proposed measure dTd_{T}. Let 𝐟=(f1,f2):𝕄2\mathbf{f}=(f_{1},f_{2}):\mathbb{M}\rightarrow\mathbb{R}^{2} and 𝐠=(g1,g2):𝕄2\mathbf{g}=(g_{1},g_{2}):\mathbb{M}\rightarrow\mathbb{R}^{2} be two bivariate fields defined on a compact metric space 𝕄\mathbb{M} such that f1,f2,g1f_{1},f_{2},g_{1} and g2g_{2} are tame functions. Let 𝕄𝐟\mathbb{MR}_{\mathbf{f}} and 𝕄𝐠\mathbb{MR}_{\mathbf{g}} be the MDRGs of 𝐟\mathbf{f} and 𝐠\mathbf{g}, respectively. Then we prove the following lemma.

Lemma 1
d𝕄0(𝕄𝐟,𝕄𝐠)f1g1+12max{Amp(f2),Amp(g2)}d_{\mathbb{MR}}^{0}(\mathbb{MR}_{\mathbf{f}},\mathbb{MR}_{\mathbf{g}})\leq\|f_{1}-g_{1}\|_{\infty}+\frac{1}{2}\max\{Amp(f_{2}),Amp(g_{2})\}

where Amp(f)Amp(f) denotes the amplitude of a real-valued function f:𝕄f:\mathbb{M}\rightarrow\mathbb{R} and is defined as follows:

Amp(f)=maxx𝕄f(x)miny𝕄f(y).Amp(f)=\max_{x\in\mathbb{M}}f(x)-\min_{y\in\mathbb{M}}f(y).
Proof

The term dB(PD0(𝒢f1),PD0(𝒢g1))d_{B}\left(\mathrm{PD}_{0}\left(\mathcal{RG}_{f_{1}}\right),\mathrm{PD}_{0}\left(\mathcal{RG}_{g_{1}}\right)\right) in d𝕄0(𝕄𝐟,𝕄𝐠)d_{\mathbb{MR}}^{0}(\mathbb{MR}_{\mathbf{f}},\mathbb{MR}_{\mathbf{g}}) of equation (1010) is bounded above by f1g1\|f_{1}-g_{1}\|_{\infty} (see 2007-Cohen-Steiner-Bottleneck for more details). We now prove a bound on the term cR(f1,g1)infψ:𝕊f1c𝕊g1csup𝒢𝕊f1cdB(PD0(𝒢),PD0(ψ(𝒢)))\underset{c\in R(f_{1},g_{1})}{\int}\underset{\psi:\mathbb{S}_{f_{1}}^{c}\rightarrow\mathbb{S}_{g_{1}}^{c}}{\inf}\underset{\mathcal{RG}\in\mathbb{S}_{f_{1}}^{c}}{\sup}d_{B}\left(\mathrm{PD}_{0}\left(\mathcal{RG}\right),\mathrm{PD}_{0}\left(\psi\left(\mathcal{RG}\right)\right)\right).

Let 𝒢𝕊f1c\mathcal{RG}\in\mathbb{S}_{f_{1}}^{c}. Now, every point in the persistence diagram PD0(𝒢)\mathrm{PD}_{0}\left(\mathcal{RG}\right) corresponds to a 0-dimensional class whose birth and death lies between minx𝒢f2p~¯(x)\underset{x\in\mathcal{RG}}{\min}\overline{\widetilde{f_{2}^{p}}}(x) and maxx𝒢f2p~¯(x)\underset{x\in\mathcal{RG}}{\max}\overline{\widetilde{f_{2}^{p}}}(x). Therefore, the maximum persistence of a point in PD0(𝒢)\mathrm{PD}_{0}\left(\mathcal{RG}\right) is at most the amplitude of f2p~¯\overline{\widetilde{f_{2}^{p}}} in 𝒢\mathcal{RG}.

maxxPD0(𝒢)pers(x)\displaystyle\max_{x\in\mathrm{PD}_{0}\left(\mathcal{RG}\right)}pers(x) maxp𝒢f2p~f2p~¯(p)minp𝒢f2p~f2p~¯(p).\displaystyle\leq\max_{p\in\mathcal{RG}_{\widetilde{f_{2}^{p}}}}\overline{\widetilde{f_{2}^{p}}}(p)-\min_{p\in\mathcal{RG}_{\widetilde{f_{2}^{p}}}}\overline{\widetilde{f_{2}^{p}}}(p).
maxp𝕄f2(p)minp𝕄f2(p).\displaystyle\leq\max_{p\in\mathbb{M}}f_{2}(p)-\min_{p\in\mathbb{M}}f_{2}(p).
=Amp(f2).\displaystyle=Amp(f_{2}). (13)

Similarly, for 𝒢𝕊g2c\mathcal{RG}^{\prime}\in\mathbb{S}_{g_{2}}^{c}, the following bound is obtained on the maximum persistence of a point in the persistence diagram PD0(𝒢)\mathrm{PD}_{0}\left(\mathcal{RG}^{\prime}\right).

maxxPD0(𝒢)pers(x)\displaystyle\max_{x\in\mathrm{PD}_{0}\left(\mathcal{RG}^{\prime}\right)}pers(x) Amp(g2).\displaystyle\leq Amp(g_{2}). (14)

Let (a,b)PD0(𝒢)(a,b)\in\mathrm{PD}_{0}(\mathcal{RG}) and (c,d)(c,d) be its matching pair in PD0(𝒢)\mathrm{PD}_{0}(\mathcal{RG}^{\prime}) corresponding to dB(PD0(𝒢),PD0(𝒢))d_{B}\left(\mathrm{PD}_{0}\left(\mathcal{RG}\right),\mathrm{PD}_{0}\left(\mathcal{RG}^{\prime}\right)\right). Then (a,b)(c,d)\|(a,b)-(c,d)\|_{\infty} is at most max{a+b2,c+d2}\max\{\frac{a+b}{2},\frac{c+d}{2}\}. Otherwise, we can match (a,b)(a,b) to (a+b2,a+b2)(\frac{a+b}{2},\frac{a+b}{2}), (c,d)(c,d) to (c+d2,c+d2)(\frac{c+d}{2},\frac{c+d}{2}) and obtain the required bound. Therefore, for any matched pair of points (x,x)(x,x^{\prime}), xx\|x-x^{\prime}\|_{\infty} is bounded by max{12pers(x),12pers(x)}\max\{\frac{1}{2}pers(x),\frac{1}{2}pers(x^{\prime})\}. We now obtain a bound on dB(PD0(𝒢),PD0(𝒢))d_{B}\left(\mathrm{PD}_{0}\left(\mathcal{RG}\right),\mathrm{PD}_{0}\left(\mathcal{RG}^{\prime}\right)\right) as follows:

dB(PD0(𝒢),PD0(𝒢))\displaystyle d_{B}\left(\mathrm{PD}_{0}\left(\mathcal{RG}\right),\mathrm{PD}_{0}\left(\mathcal{RG}^{\prime}\right)\right)
max{maxxPD0(𝒢)12pers(x),maxxPD0(𝒢)12pers(x)}\displaystyle\leq\max\left\{\max_{x\in\mathrm{PD}_{0}\left(\mathcal{RG}\right)}\frac{1}{2}pers(x),\max_{x\in\mathrm{PD}_{0}\left(\mathcal{RG}^{\prime}\right)}\frac{1}{2}pers(x)\right\}
12max{Amp(f2),Amp(g2)}\displaystyle\leq\frac{1}{2}\max\{Amp(f_{2}),Amp(g_{2})\}
(from equations (13) and (14)).\displaystyle\hskip 28.45274pt(\text{from equations (\ref{eqn:bound-persistence-f}})\text{ and }(\ref{eqn:bound-persistence-g})).
d𝕄0(𝕄𝐟,𝕄𝐠)\displaystyle d_{\mathbb{MR}}^{0}(\mathbb{MR}_{\mathbf{f}},\mathbb{MR}_{\mathbf{g}})
=dB(PD0(𝒢f1),PD0(𝒢g1))+\displaystyle=d_{B}\left(\mathrm{PD}_{0}\left(\mathcal{RG}_{f_{1}}\right),\mathrm{PD}_{0}\left(\mathcal{RG}_{g_{1}}\right)\right)+
1|R(f1,g1)|cR(f1,g1)infψ:𝕊f1c𝕊g1csup𝒢𝕊f1cdB(PD0(𝒢),\displaystyle\hskip 5.69046pt\frac{1}{|R(f_{1},g_{1})|}\underset{c\in R(f_{1},g_{1})}{\int}\underset{\psi:\mathbb{S}_{f_{1}}^{c}\rightarrow\mathbb{S}_{g_{1}}^{c}}{\inf}\sup_{\mathcal{RG}\in\mathbb{S}_{f_{1}}^{c}}d_{B}\bigl{(}\mathrm{PD}_{0}\left(\mathcal{RG}\right),
PD0(ψ(𝒢)))\displaystyle\hskip 173.56198pt\mathrm{PD}_{0}\left(\psi\left(\mathcal{RG}\right)\right)\bigr{)}
f1g1+1|R(f1,g1)|cR(f1,g1)12max{Amp(f2),\displaystyle\leq\|f_{1}-g_{1}\|_{\infty}+\frac{1}{|R(f_{1},g_{1})|}\underset{c\in R(f_{1},g_{1})}{\int}\frac{1}{2}\max\{Amp(f_{2}),
Amp(g2)}.\displaystyle\hskip 182.09746ptAmp(g_{2})\}.
=f1g1+12max{Amp(f2),Amp(g2)}.\displaystyle=\|f_{1}-g_{1}\|_{\infty}+\frac{1}{2}\max\{Amp(f_{2}),Amp(g_{2})\}.
Lemma 2
d𝕄1(𝕄𝐟,𝕄𝐠)3f1g1+12max{Amp(f2),Amp(g2)}.d_{\mathbb{MR}}^{1}(\mathbb{MR}_{\mathbf{f}},\mathbb{MR}_{\mathbf{g}})\leq 3\|f_{1}-g_{1}\|_{\infty}+\frac{1}{2}\max\{Amp(f_{2}),Amp(g_{2})\}.
Proof

The term dB(ExPD0(𝒢f1),ExPD0(𝒢g1))d_{B}\left(ExPD_{0}\left(\mathcal{RG}_{f_{1}}\right),ExPD_{0}\left(\mathcal{RG}_{g_{1}}\right)\right) in d𝕄1(𝕄𝐟,𝕄𝐠)d_{\mathbb{MR}}^{1}(\mathbb{MR}_{\mathbf{f}},\mathbb{MR}_{\mathbf{g}}) is bounded above by 3f1g13\|f_{1}-g_{1}\|_{\infty} (see Theorems 4.14.1 and 4.34.3 in 2014-Bauer-DistanceBetweenReebGraphs for more details). The rest of the proof is similar to that of Lemma 1.

The following theorem gives the stability of dT(𝕄𝐟,𝕄𝐠)d_{T}(\mathbb{MR}_{\mathbf{f}},\mathbb{MR}_{\mathbf{g}}).

Theorem 4.2

Stability:

dT(𝕄𝐟,𝕄𝐠)3f1g1+12max{Amp(f2),Amp(g2)}.d_{T}(\mathbb{MR}_{\mathbf{f}},\mathbb{MR}_{\mathbf{g}})\leq 3\|f_{1}-g_{1}\|_{\infty}+\frac{1}{2}\max\{Amp(f_{2}),Amp(g_{2})\}.
Proof
dT(𝕄𝐟,𝕄𝐠)\displaystyle d_{T}(\mathbb{MR}_{\mathbf{f}},\mathbb{MR}_{\mathbf{g}})
=w0d𝕄0(𝕄𝐟,𝕄𝐠)+w1d𝕄0(𝕄𝐟,𝕄𝐠)+\displaystyle=w_{0}\cdot d_{\mathbb{MR}}^{0}(\mathbb{MR}_{\mathbf{f}},\mathbb{MR}_{\mathbf{g}})+w_{1}\cdot d_{\mathbb{MR}}^{0}(\mathbb{MR}_{-\mathbf{f}},\mathbb{MR}_{-\mathbf{g}})+
w2d𝕄1(𝕄𝐟,𝕄𝐠)\displaystyle\hskip 91.04872ptw_{2}\cdot d_{\mathbb{MR}}^{1}(\mathbb{MR}_{\mathbf{f}},\mathbb{MR}_{\mathbf{g}})
w0(f1g1+12max{Amp(f2),Amp(g2)})+\displaystyle\leq w_{0}\left(\|f_{1}-g_{1}\|_{\infty}+\frac{1}{2}\max\{Amp(f_{2}),Amp(g_{2})\}\right)+
w1(f1g1+12max{Amp(f2),Amp(g2)})+\displaystyle\hskip 11.38092ptw_{1}\left(\|f_{1}-g_{1}\|_{\infty}+\frac{1}{2}\max\{Amp(f_{2}),Amp(g_{2})\}\right)+
w2(3f1g1+12max{Amp(f2),Amp(g2)})\displaystyle\hskip 11.38092ptw_{2}\left(3\|f_{1}-g_{1}\|_{\infty}+\frac{1}{2}\max\{Amp(f_{2}),Amp(g_{2})\}\right)
(using Lemmas 1, 2)
3f1g1+12max{Amp(f2),Amp(g2)}\displaystyle\leq 3\|f_{1}-g_{1}\|_{\infty}+\frac{1}{2}\max\{Amp(f_{2}),Amp(g_{2})\}
(since w0+w1+w2=1).\displaystyle\hskip 85.35826pt(\text{since }w_{0}+w_{1}+w_{2}=1).

4.3 Complexity Analysis

In this sub-section, we analyze the time and space complexities of computing the distance between two bivariate fields based on the corresponding MDRGs. First, we give the complexity for constructing the MDRG of a bivariate field and computing the persistence diagrams of the component Reeb graphs of the MDRG. Then, we analyze the complexity of computing the proposed distance measure between two MDRGs.

4.3.1 Algorithm 1: Computing the MDRG and Persistence Diagrams of its Component Reeb Graphs

Time Complexity. Let 𝐟=(f1,f2)\mathbf{f}=(f_{1},f_{2}) be a bivariate field defined on a compact dd-manifold. To construct the MDRG of 𝐟\mathbf{f}, denoted by 𝕄𝐟\mathbb{MR}_{\mathbf{f}}, we require a JCN as input. The construction of the JCN takes 𝒪(2|E1|+|E1|α(|E1|))\mathcal{O}(2|E_{1}|+|E_{1}|\alpha(|E_{1}|)) time, where |E1||E_{1}| is the number of edges in the JCN and α\alpha is the inverse Ackermann function 1975-Tarjan-UF (line 22, Algorithm 1). The time complexity of constructing the MDRG from the JCN using the algorithm in 2016-Chattopadhyay-MultivariateTopologySimplification is 𝒪(2|V1|(|V1|+|E1|α(|V1|)+|V1|log(|V1|)))\mathcal{O}(2|V_{1}|(|V_{1}|+|E_{1}|\alpha(|V_{1}|)+|V_{1}|\log(|V_{1}|))), where |V1||V_{1}| is the number of vertices of the JCN (line 33, Algorithm 1). We note, the Reeb graph 𝒢f1\mathcal{RG}_{f_{1}} has at most |V1||V_{1}| vertices and |E1||E_{1}| edges. Therefore, 𝒢f1\mathcal{RG}_{f_{1}} has at most |V1|+|E1||V_{1}|+|E_{1}| simplices (vertices and edges). The computation of the persistence diagram of 𝒢f1\mathcal{RG}_{f_{1}} take 𝒪((|V1|+|E1|)3)\mathcal{O}((|V_{1}|+|E_{1}|)^{3}) time 2010-Edelsbrunner-book (line 55, Algorithm 1). Next, we see the time complexity of computing the persistence diagrams of the Reeb graphs of f2f_{2} in 𝕄𝐟\mathbb{MR}_{\mathbf{f}} (lines 66-88, Algorithm 1).

We note, each node/edge in a Reeb graph 𝒢f2p~\mathcal{RG}_{\widetilde{f_{2}^{p}}} (p𝒢f1p\in\mathcal{RG}_{f_{1}}) corresponds to a unique node/edge in the JCN 2016-Chattopadhyay-MultivariateTopologySimplification . Therefore, the total number of simplices (vertices and edges) in the second-dimensional Reeb graphs of 𝕄𝐟\mathbb{MR}_{\mathbf{f}} is at most the total number of simplices in the JCN. Thus, the construction of the persistence diagrams of the Reeb graphs of f2f_{2}, i.e. 𝒢f2p~(p𝒢f1)\mathcal{RG}_{\widetilde{f_{2}^{p}}}(p\in\mathcal{RG}_{f_{1}}) take 𝒪((|V|+|E|)3)\mathcal{O}((|V|+|E|)^{3}) time. The total time for constructing the MDRG of 𝐟\mathbf{f} and computing the persistence diagrams of its component Reeb graphs (in Algorithm 1) is 𝒪(2|V1|(|V1|+|E1|α(|V1|)+|V1|log(|V1|)))+𝒪((|V1|+|E1|)3)+𝒪((|V1|+|E1|)3)=𝒪(2(|V1|+|E1|)3)\mathcal{O}(2|V_{1}|(|V_{1}|+|E_{1}|\alpha(|V_{1}|)+|V_{1}|\log(|V_{1}|)))+\mathcal{O}((|V_{1}|+|E_{1}|)^{3})+\mathcal{O}((|V_{1}|+|E_{1}|)^{3})=\mathcal{O}(2(|V_{1}|+|E_{1}|)^{3}). Next, we analyze the space complexity of computing 𝕄𝐟\mathbb{MR}_{\mathbf{f}} and persistence diagrams of the Reeb graphs in 𝕄𝐟\mathbb{MR}_{\mathbf{f}}.

Space Complexity. The space complexity of constructing the JCN of 𝐟\mathbf{f} depends on the number of edges it contains (|E1|)(|E_{1}|) and the number of fragments in the domain corresponding to the quantization of 𝐟\mathbf{f}, denoted by N𝐟N_{\mathbf{f}}. Since |E1|=𝒪((4+d)N𝐟)|E_{1}|=\mathcal{O}((4+d)N_{\mathbf{f}}) 2014-Carr-JCN , the space complexity of computing the JCN is 𝒪(|E1|)\mathcal{O}(|E_{1}|). The number of nodes/edges in 𝒢f1\mathcal{RG}_{f_{1}} is bounded by the number of nodes/edges in the JCN. Thus the Reeb graph 𝒢f1\mathcal{RG}_{f_{1}} takes 𝒪(|V1|+|E1|)\mathcal{O}(|V_{1}|+|E_{1}|) space. Similarly, based on the bound obtained on the number of simplices in the Reeb graphs 𝒢f2p~(p𝒢f1)\mathcal{RG}_{\widetilde{f_{2}^{p}}}(p\in\mathcal{RG}_{f_{1}}), the Reeb graphs in the second dimension of 𝕄𝐟\mathbb{MR}_{\mathbf{f}} take 𝒪(|V1|+|E1|)\mathcal{O}(|V_{1}|+|E_{1}|) space. Thus, the space complexity of 𝕄𝐟\mathbb{MR}_{\mathbf{f}} is 𝒪(2(|V1|+|E1|))\mathcal{O}(2(|V_{1}|+|E_{1}|)). Next, we analyze the space complexity for the persistence diagrams of the Reeb graphs in 𝕄𝐟\mathbb{MR}_{\mathbf{f}}. We note, the persistence diagram of a Reeb graph is obtained by pairing its critical nodes. Therefore, the number of points in a persistence diagram is at most half the number of vertices in the corresponding Reeb graph. The total number of nodes in the component Reeb graphs of 𝕄𝐟\mathbb{MR}_{\mathbf{f}} is at most 2|V1|2|V_{1}|. Thus, the total number of points in the persistence diagrams of the Reeb graphs in 𝕄𝐟\mathbb{MR}_{\mathbf{f}} is |V1||V_{1}|. The total space complexity of computing 𝕄𝐟\mathbb{MR}_{\mathbf{f}} and the persistence diagrams of its component Reeb graphs is 𝒪(2(|V1|+|E1|)+|V1|)=𝒪(3|V1|+2|E1|)\mathcal{O}(2(|V_{1}|+|E_{1}|)+|V_{1}|)=\mathcal{O}(3|V_{1}|+2|E_{1}|).

4.3.2 Algorithm 2: Computing the Distance between MDRGs

Time Complexity. Let 𝐟=(f1,f2),𝐠=(g1,g2)\mathbf{f}=(f_{1},f_{2}),\mathbf{g}=(g_{1},g_{2}) be two bivariate fields and 𝕄𝐟,𝕄𝐠\mathbb{MR}_{\mathbf{f}},\mathbb{MR}_{\mathbf{g}} be the corresponding MDRGs. Let (|V1|,|E1|)(|V_{1}|,|E_{1}|) and (|V2|,|E2|)(|V_{2}|,|E_{2}|) be the number of vertices and edges in the JCNs of 𝐟\mathbf{f} and 𝐠\mathbf{g} respectively. Then the total number of points in the persistence diagrams of the component Reeb graphs of 𝕄𝐟\mathbb{MR}_{\mathbf{f}} and 𝕄𝐠\mathbb{MR}_{\mathbf{g}} are at most |V1||V_{1}| and |V2||V_{2}| respectively (see Section 4.3.1 for more details). Therefore, the total time for computing dB(Dg(𝒢f1),Dg(𝒢g1))d_{B}\left(\mathrm{Dg}\left(\mathcal{RG}_{f_{1}}\right),\mathrm{Dg}\left(\mathcal{RG}_{g_{1}}\right)\right) and dB(Dg(𝒢f2p~),Dg(𝒢g2p~))d_{B}\left(\mathrm{Dg}\left(\mathcal{RG}_{\widetilde{f_{2}^{p}}}\right),\mathrm{Dg}\left(\mathcal{RG}_{\widetilde{g_{2}^{p^{\prime}}}}\right)\right) p𝒢f1,p𝒢g1\forall p\in\mathcal{RG}_{f_{1}},p^{\prime}\in\mathcal{RG}_{g_{1}} using the Hungarian algorithm is 𝒪((|V1|+|V2|)3)\mathcal{O}((|V_{1}|+|V_{2}|)^{3}) 1955-Kuhn-Hungarian-Algorithm .

Let q1q_{1} be the number of slabs into which R(f1,g1)R(f_{1},g_{1}) is subdivided, for constructing the JCNs of 𝐟\mathbf{f} and 𝐠\mathbf{g}. Then let c1,c2,,cq1c_{1},c_{2},\ldots,c_{q_{1}} be the quantized range values corresponding to the subdivision of R(f1,g1)R(f_{1},g_{1}). For each cic_{i}, we need to compute a map ψci:𝕊f1ci𝕊g1ci\psi_{c_{i}}:\mathbb{S}_{f_{1}}^{c_{i}}\rightarrow\mathbb{S}_{g_{1}}^{c_{i}} which achieves the minimal value of sup𝒢𝕊f1cidB(Dg(𝒢),Dg(ψci(𝒢)))\sup_{\mathcal{RG}\in\mathbb{S}_{f_{1}}^{c_{i}}}d_{B}\left(\mathrm{Dg}\left(\mathcal{RG}\right),\mathrm{Dg}(\psi_{c_{i}}(\mathcal{RG}))\right) (lines 88-1212, Algorithm 2). To do so, we need to compute dB(Dg(𝒢),Dg(𝒢))d_{B}\left(\mathrm{Dg}\left(\mathcal{RG}\right),\mathrm{Dg}\left(\mathcal{RG}^{\prime}\right)\right) for each Dg𝕊f1ci\mathrm{Dg}\in\mathbb{S}_{f_{1}}^{c_{i}} and Dg𝕊g1ci\mathrm{Dg}^{\prime}\in\mathbb{S}_{g_{1}}^{c_{i}}. The total time for computing 1iq1𝒢𝕊f1ci,𝒢𝕊g1cidB(Dg(𝒢),Dg(𝒢))\displaystyle\bigcup_{1\leq i\leq q_{1}}\bigcup_{\mathcal{RG}\in\mathbb{S}_{f_{1}}^{c_{i}},\mathcal{RG}^{\prime}\in\mathbb{S}_{g_{1}}^{c_{i}}}d_{B}\left(\mathrm{Dg}\left(\mathcal{RG}\right),\mathrm{Dg}\left(\mathcal{RG}^{\prime}\right)\right) is bounded by the time complexity for computing the bottleneck distance between all pairs of Reeb graphs in the second dimension of 𝕄𝐟\mathbb{MR}_{\mathbf{f}} and 𝕄𝐠\mathbb{MR}_{\mathbf{g}}, which is 𝒪((|V1|+|V2|)3)\mathcal{O}((|V_{1}|+|V_{2}|)^{3}). After this step, the computation of all the optimal bijections ψci(i{1,2,,q1})\psi_{c_{i}}(i\in\{1,2,\ldots,q_{1}\}) using Hungarian algorithm take 𝒪((|V1|+|V2|)3)\mathcal{O}((|V_{1}|+|V_{2}|)^{3}) time 1955-Kuhn-Hungarian-Algorithm . The total time for computing the distance between the collections of persistence diagrams of 𝕄𝐟\mathbb{MR}_{\mathbf{f}} and 𝕄𝐠\mathbb{MR}_{\mathbf{g}} (in Algorithm 2) is 𝒪((|V1|+|V2|)3)+𝒪((|V1|+|V2|)3)=𝒪(2(|V1|+|V2|)3)\mathcal{O}((|V_{1}|+|V_{2}|)^{3})+\mathcal{O}((|V_{1}|+|V_{2}|)^{3})=\mathcal{O}(2(|V_{1}|+|V_{2}|)^{3}). Next, we analyze the space complexity of the proposed distance between MDRGs.

Space Complexity. The distance between 𝕄𝐟\mathbb{MR}_{\mathbf{f}} and 𝕄𝐠\mathbb{MR}_{\mathbf{g}} is computed based on the bottleneck distance between the persistence diagrams of the component Reeb graphs in the respective MDRGs. To compute the bottleneck distance between the persistence diagrams of two Reeb graphs, we construct an optimal bijection between the points in the persistence diagrams using the Hungarian algorithm. For two persistence diagrams consisting of N1N_{1} and N2N_{2} points respectively, the Hungarian algorithm takes 𝒪((N1+N2)2)\mathcal{O}((N_{1}+N_{2})^{2}) space. Based on the bound obtained on the number of points in the persistence diagrams of 𝒢f1\mathcal{RG}_{f_{1}} and 𝒢g1\mathcal{RG}_{g_{1}}, the space complexity for computing dB(Dg(𝒢f1),Dg(𝒢g1))d_{B}\left(\mathrm{Dg}\left(\mathcal{RG}_{f_{1}}\right),\mathrm{Dg}\left(\mathcal{RG}_{g_{1}}\right)\right) is 𝒪((12|V1|+12|V2|)2)\mathcal{O}((\frac{1}{2}|V_{1}|+\frac{1}{2}|V_{2}|)^{2}). Similarly, the total space complexity for the computation of dB(Dg(𝒢f2p~),Dg(𝒢g2p~))d_{B}\left(\mathrm{Dg}\left(\mathcal{RG}_{\widetilde{f_{2}^{p}}}\right),\mathrm{Dg}\left(\mathcal{RG}_{\widetilde{g_{2}^{p^{\prime}}}}\right)\right) for all combinations of p𝒢f1p\in\mathcal{RG}_{f_{1}} and p𝒢g1p^{\prime}\in\mathcal{RG}_{g_{1}}, is 𝒪((12|V1|+12|V2|)2)\mathcal{O}((\frac{1}{2}|V_{1}|+\frac{1}{2}|V_{2}|)^{2}). Thus the computation of distance between MDRGs takes 𝒪(12(|V1|2+|V2|2)+|V1||V2|)\mathcal{O}(\frac{1}{2}(|V_{1}|^{2}+|V_{2}|^{2})+|V_{1}||V_{2}|) space.

4.4 Generalization for more than two fields

Although, in this paper, we deal with MDRGs of bivariate fields, the proposed distance measure can be extended for MDRGs corresponding to more than 22 fields. If 𝐟=(f1,f2,..,fr):𝕄r\mathbf{f}=(f_{1},f_{2},..,f_{r}):\mathbb{M}\rightarrow\mathbb{R}^{r} and 𝐠=(g1,g2,,gr):𝕄r\mathbf{g}=(g_{1},g_{2},...,g_{r}):\mathbb{M}\rightarrow\mathbb{R}^{r} are multi-fields defined on a triangulated mesh 𝕄\mathbb{M} of dimension drd\geq r, then the distance between the MDRGs 𝕄𝐟\mathbb{MR}_{\mathbf{f}} and 𝕄𝐠\mathbb{MR}_{\mathbf{g}} is defined as follows:

d𝕄,rk(𝕄𝐟,𝕄𝐠)=dB(Dgk(𝒢f1),Dgk(𝒢g1))+i=1r11|R(fi,gi)|cR(fi,gi)infψc(i):𝕊fic𝕊gicsup𝒢𝕊ficdB(Dgk(𝒢),Dgk(ψi(𝒢))).d_{\mathbb{MR},r}^{k}\left(\mathbb{MR}_{\mathbf{f}},\mathbb{MR}_{\mathbf{g}}\right)=d_{B}\left(\mathrm{Dg}_{k}\left(\mathcal{RG}_{f_{1}}\right),\mathrm{Dg}_{k}\left(\mathcal{RG}_{g_{1}}\right)\right)+\\ \sum_{i=1}^{r-1}\frac{1}{|R(f_{i},g_{i})|}\underset{c\in R(f_{i},g_{i})}{\int}\underset{\psi^{(i)}_{c}:\mathbb{S}_{f_{i}}^{c}\rightarrow\mathbb{S}_{g_{i}}^{c}}{\inf}\sup_{\mathcal{RG}\in\mathbb{S}_{f_{i}}^{c}}d_{B}\bigl{(}\mathrm{Dg}_{k}(\mathcal{RG}),\\ \hskip 173.56198pt\mathrm{Dg}_{k}(\psi_{i}(\mathcal{RG}))\bigr{)}. (15)

Here, 𝕊fic\mathbb{S}_{f_{i}}^{c} and 𝕊gic\mathbb{S}_{g_{i}}^{c} denote the collections of Reeb graphs in the (i+1)th(i+1)^{th} dimension of 𝕄𝐟\mathbb{MR}_{\mathbf{f}} and 𝕄𝐠\mathbb{MR}_{\mathbf{g}} respectively, ψc(i)\psi^{(i)}_{c} varies over bijections between 𝕊fic\mathbb{S}_{f_{i}}^{c} and 𝕊gic\mathbb{S}_{g_{i}}^{c}, and R(fi,gi)R(f_{i},g_{i}) represents the union of the ranges of fif_{i} and gig_{i}. Similar to the case of bivariate fields, we denote the distance between MDRGs based on the persistence diagrams PD0\mathrm{PD}_{0} and ExDg1\mathrm{ExDg}_{1} by d𝕄,r0(𝕄𝐟,𝕄𝐠)d_{\mathbb{MR},r}^{0}(\mathbb{MR}_{\mathbf{f}},\mathbb{MR}_{\mathbf{g}}) and d𝕄,r1(𝕄𝐟,𝕄𝐠)d_{\mathbb{MR},r}^{1}(\mathbb{MR}_{\mathbf{f}},\mathbb{MR}_{\mathbf{g}}), respectively. We now define a distance between 𝕄𝐟\mathbb{MR}_{\mathbf{f}} and 𝕄𝐠\mathbb{MR}_{\mathbf{g}} as follows.

dT,r(𝕄𝐟,𝕄𝐠)=d𝕄,r0(𝕄𝐟,𝕄𝐠)+d𝕄,r0(𝕄𝐟,𝕄𝐠)+d𝕄,r1(𝕄𝐟,𝕄𝐠)d_{T,r}(\mathbb{MR}_{\mathbf{f}},\mathbb{MR}_{\mathbf{g}})=d_{\mathbb{MR},r}^{0}(\mathbb{MR}_{\mathbf{f}},\mathbb{MR}_{\mathbf{g}})\\ +d_{\mathbb{MR},r}^{0}(\mathbb{MR}_{-\mathbf{f}},\mathbb{MR}_{-\mathbf{g}})+d_{\mathbb{MR},r}^{1}(\mathbb{MR}_{\mathbf{f}},\mathbb{MR}_{\mathbf{g}}) (16)

where, 0w0,w1,w210\leq w_{0},w_{1},w_{2}\leq 1 and w0+w1+w2=1w_{0}+w_{1}+w_{2}=1. Here, d𝕄,r0(𝕄𝐟,𝕄𝐠)d_{\mathbb{MR},r}^{0}\left(\mathbb{MR}_{\mathbf{f}},\mathbb{MR}_{\mathbf{g}}\right) and d𝕄,r0(𝕄𝐟,𝕄𝐠)d_{\mathbb{MR},r}^{0}\left(\mathbb{MR}_{-\mathbf{f}},\mathbb{MR}_{-\mathbf{g}}\right) respectively compute the distance between persistence diagrams of sub-level set and super-level set filtrations of the Reeb graphs in the MDRGs and d𝕄,r1(𝕄𝐟,𝕄𝐠)d_{\mathbb{MR},r}^{1}\left(\mathbb{MR}_{\mathbf{f}},\mathbb{MR}_{\mathbf{g}}\right) is the distance between the 11-st extended persistence diagrams of the Reeb graphs in 𝕄𝐟\mathbb{MR}_{\mathbf{f}} and 𝕄𝐠\mathbb{MR}_{\mathbf{g}} respectively. We note, dT,rd_{T,r} is a pseudo-metric.

Theorem 4.3

dT,rd_{T,r} satisfies the properties of a pseudo-metric.

The proof of this theorem is similar to that of Theorem 4.1. Next, we show the stability of dT,rd_{T,r}.

Theorem 4.4

Stability:

dT,r(𝕄𝐟,𝕄𝐠)\displaystyle d_{T,r}(\mathbb{MR}_{\mathbf{f}},\mathbb{MR}_{\mathbf{g}}) 3f1g1+\displaystyle\leq 3\|f_{1}-g_{1}\|_{\infty}+
12i=2rmax{Amp(fi),Amp(gi)}.\displaystyle\indent\frac{1}{2}\sum_{i=2}^{r}\max\{Amp(f_{i}),Amp(g_{i})\}.

The bound on dT,r(𝕄𝐟,𝕄𝐠)d_{T,r}(\mathbb{MR}_{\mathbf{f}},\mathbb{MR}_{\mathbf{g}}) consists of two terms. The proof of the first and second terms are similar to the proofs of Lemmas 1 and 2 respectively.

Next, we show the experimental results of the proposed method on two datasets.

5 Experimental Results

In this section, we show the application of the proposed distance measure in shape matching and detecting topological features in a time varying multi-field data.

5.1 Classification and Analysis of Shape Data

In this subsection, we show the effectiveness of the proposed distance between MDRGs in matching shapes. First, we describe the feature descriptors corresponding to a shape for computing the MDRGs.

5.1.1 Feature Descriptors of a Shape

We obtain scalar fields based on the eigenfunctions of discrete Laplace-Beltrami operator (see equation (9)) for computing the MDRG. The eigenvalues λi\lambda_{i}s of the discrete LB operator are sorted in the non-decreasing order. The first eigenvalue λ0\lambda_{0} is zero and the corresponding eigenfunctions are constant 2006-Reuter-Shape-DNA . Therefore, we consider the eigenvalues λi\lambda_{i} for i>0i>0. The ability of the eigenfunctions in capturing the geometric and topological properties of a shape and their invariance to isometry of shapes 2006-Levy-Laplace-Beltrami-Eigenfunctions ; 2007-Rustamov-Laplace-Beltrami-Eigenfunctions make them suitable shape descriptors.

To remove the effect of scale, each eigenfunction is normalized by its corresponding eigenvalue 2007-Rustamov-Laplace-Beltrami-Eigenfunctions . For a triangulated shape with vertices v1,v2,,vnv_{1},v_{2},...,v_{n}, the normalized eigenfunction corresponding to eigenvalue λi\lambda_{i} is an nn-dimensional vector ϕi^=(ϕi(v1)λi,ϕi(v2)λi,,ϕi(vn)λi)\hat{\phi_{i}}=\left(\frac{\phi_{i}(v_{1})}{\sqrt{\lambda_{i}}},\frac{\phi_{i}(v_{2})}{\sqrt{\lambda_{i}}},...,\frac{\phi_{i}(v_{n})}{\sqrt{\lambda_{i}}}\right), where ϕi(vj)\phi_{i}(v_{j}) is the value of the ithi^{th} eigenfunction at the vertex vjv_{j}.

Sign Ambiguity Problem: We now encounter the sign ambiguity problem while dealing with the eigenfunctions of the LB operator. If ϕi\phi_{i} is an eigenfunction corresponding to the eigenvalue λi\lambda_{i} , then ϕi-\phi_{i} is also an eigenfunction for λi\lambda_{i}. We have to pick one among them as the eigenfunction for λi\lambda_{i}. If we arbitrarily choose either of ϕi\phi_{i} or ϕi-\phi_{i}, then there is a possibility of two similar shapes having eigenfunctions which are negative of each other (See Figure 4). Therefore, for each eigenvalue λi\lambda_{i}, we need to systematically pick one of ϕi\phi_{i} or ϕi-\phi_{i}. This is known as the sign ambiguity problem 2007-mateus-shape-matching , for which Umeyama etal.etal. 1988-Umeyama-EigenDecomposition proposed a solution by taking the absolute values of eigenfunctions. In this paper, we adopt this solution and choose the functions |ϕ^i|\left|\hat{\phi}_{i}\right| as shape descriptors.

Refer to caption
Refer to caption
Figure 4: Eigenfunction ϕ3\phi_{3} of similar shapes

We show the effectiveness of using absolute values of eigenfunctions as feature descriptors with respect to HKS 2009-Sun-HKS , WKS 2011-Aubry-WKS and SIHKS 2010-Bronstein-SIHKS . The descriptors HKS and SIHKS depend on the number of time parameters 2009-Sun-HKS ; 2010-Bronstein-SIHKS and WKS depends on the number of energy distributions 2011-Aubry-WKS . Following 2014-Li-Persistence-Based-Structural-Recognition , we have fixed the number of time parameters as 1010 for HKS, 1717 for SIHKS, and the number of energy distributions as 1010 for WKS. The number of eigenfunctions is chosen as 200200 for constructing HKS, WKS and SIHKS. The distance between two shapes with respect to a distance measure dd is defined as the sum of distances computed at each time (for HKS, SIHKS) or energy distribution (for WKS). For example, the distance between shapes S1S_{1} and S2S_{2} using HKS is given by:

dHKS=t=110d(ht(S1),ht(S2))d_{\mathrm{HKS}}=\sum_{t=1}^{10}d(h_{t}(S_{1}),h_{t}(S_{2})) (17)

where dd is the distance measure and hth_{t} is the HKS at time tt.

5.1.2 SHREC 20102010 Dataset

We show the experimental results of the proposed measure in the Shape Retrieval Contest (SHREC) 20102010 dataset and compare the results with various topological distance measures in the literature. The SHREC 20102010 dataset 2010-SHREC consists of 200200 watertight shapes classified into 1010 categories. A sample of the shapes are shown in Figure 5. Each of the meshes is simplified into 20002000 faces 2014-Li-Persistence-Based-Structural-Recognition . We compare the performance of the proposed distance between MDRGs with the distance between MRSs 2021-Ramamurthi-MRS and the distance between histograms corresponding to fiber-component distributions 2019-Agarwal-histogram . We evaluate the efficiency of distance measures by the standard evaluation techniques NN, FT, ST, E-measure, and DCG (see 2010-SHREC for details). These techniques take a distance matrix consisting of distances between all pairs of shapes and return a value between 0 and 11, where higher values indicate better efficiency.

Refer to caption
Figure 5: Collection of shapes from SHREC 20102010

We first show the significance of bivariate fields over scalar fields using the proposed distance measure dTd_{T}. Table 1 shows the results of dTd_{T} between MDRGs using a pair of eigenfunctions (bivariate field) and the bottleneck distance between Reeb graphs using individual eigenfunctions (scalar field). The bivariate field using MDRG produces a better result than the Reeb graphs of the individual scalar fields, thereby indicating its significance.

Table 1: Performance Results. Comparison of the the proposed distance between MDRGs for the bivariate field (|ϕ^1|,|ϕ^2|)\left(\left|\hat{\phi}_{1}\right|,\left|\hat{\phi}_{2}\right|\right) with the bottleneck distance between Reeb graphs corresponding to the scalar fields |ϕ^1|\left|\hat{\phi}_{1}\right| and |ϕ^2|\left|\hat{\phi}_{2}\right|, respectively
Descriptor NN 11-Tier 22-Tier e-Measure DCG
|ϕ^1|\left|\hat{\phi}_{1}\right| 0.81000.8100 0.54610.5461 0.69500.6950 0.49630.4963 0.81720.8172
|ϕ^2|\left|\hat{\phi}_{2}\right| 0.78000.7800 0.44370.4437 0.59240.5924 0.41310.4131 0.77640.7764
(|ϕ^1|,|ϕ^2|)\left(\left|\hat{\phi}_{1}\right|,\left|\hat{\phi}_{2}\right|\right) 0.86000.8600 0.57840.5784 0.71870.7187 0.51590.5159 0.84120.8412

The efficiency in computing the distance between shapes will be higher when the shapes are compared based on many eigenfunctions. However, a 33D shape is a 22-dimensional manifold. If more than 22 fields are applied on a shape, then the corresponding Reeb space is not defined. Therefore, we generalize the distance between shapes S1S_{1} and S2S_{2} for more than two eigenfunctions as follows.

d(S1,S2)=i=1E1dT(𝕄|ϕ^i|,|ϕ^i+1|(S1),𝕄|ϕ^i|,|ϕ^i+1|(S2))d(S_{1},S_{2})=\sum_{i=1}^{E-1}d_{T}\left(\mathbb{MR}_{\left|\hat{\phi}_{i}\right|,\left|\hat{\phi}_{i+1}\right|}(S_{1}),\mathbb{MR}_{\left|\hat{\phi}_{i}\right|,\left|\hat{\phi}_{i+1}\right|}(S_{2})\right) (18)

where EE is the number of eigenfunctions and 𝕄|ϕ^i|,|ϕ^i+1|(Sk)\mathbb{MR}_{\left|\hat{\phi}_{i}\right|,\left|\hat{\phi}_{i+1}\right|}(S_{k}) is the MDRG for the shape SkS_{k} with |ϕ^i|\left|\hat{\phi}_{i}\right| and |ϕ^i+1|\left|\hat{\phi}_{i+1}\right| as the component scalar fields. Table 2 shows the performance of the proposed method for varying number of eigenfunctions. The results clearly indicate the increase in efficiency with increase in the number of eigenfunctions.

Table 2: Performance of the proposed distance between MDRGs for varying number of eigenfunctions
NN NN 11-Tier 22-Tier e-Measure DCG
22 0.86000.8600 0.57840.5784 0.71870.7187 0.51590.5159 0.84120.8412
44 0.92000.9200 0.63180.6318 0.75760.7576 0.54350.5435 0.87000.8700
66 0.94500.9450 0.67290.6729 0.80580.8058 0.57350.5735 0.89200.8920
88 0.94500.9450 0.70000.7000 0.84160.8416 0.59630.5963 0.91270.9127
1010 0.96500.9650 0.75180.7518 0.86950.8695 0.62630.6263 0.93670.9367
1212 0.96500.9650 0.75870.7587 0.87340.8734 0.63080.6308 0.94100.9410

Table 3 shows the performance of HKS, WKS, SIHKS and absolute values of eigenfunctions as shape descriptors for various distance measures. The distance measures for HKS, WKS and SIHKS are computed using equation (17). We set the parameters for various distance measures as follows. The distance between MDRGs is computed as specified in equation (18), where E=12E=12 and the weights w0,w1w_{0},w_{1} and w2w_{2} are set equally. The number of slabs for constructing the JCNs (required for computing MDRGs) is 3232 , the parameters ww and qq for the distance between fiber-component distributions are set to 11 and the MRSs are constructed for 66 resolutions.

Table 3: Performance Results for SHREC 20102010 dataset. Comparison of distance measures based on fiber-component distributions 2019-Agarwal-histogram , MRSs 2021-Ramamurthi-MRS , persistence diagrams (PDs) of Reeb graphs 2014-Bauer-DistanceBetweenReebGraphs and PDs of MDRGs using the descriptors HKS, WKS, SIHKS and pairs of eigenfunctions
Descriptors Methods NN 11-Tier 22-Tier e-Measure DCG
HKS Histogram 0.92000.9200 0.50840.5084 0.65370.6537 0.45880.4588 0.82000.8200
MRS 0.92500.9250 0.68210.6821 0.82110.8211 0.59650.5965 0.90010.9001
PD\mathrm{PD} of Reeb graph 0.96000.9600 0.58050.5805 0.72080.7208 0.51120.5112 0.86950.8695
WKS Histogram 0.94000.9400 0.50080.5008 0.64530.6453 0.45410.4541 0.82500.8250
MRS 0.93500.9350 0.51050.5105 0.64680.6468 0.46040.4604 0.82730.8273
PD\mathrm{PD} of Reeb graph 0.89500.8950 0.41240.4124 0.52340.5234 0.37310.3731 0.76650.7665
SIHKS Histogram 0.96000.9600 0.75340.7534 0.90080.9008 0.65220.6522 0.93800.9380
MRS 0.96000.9600 0.75760.7576 0.88080.8808 0.64120.6412 0.93540.9354
PD\mathrm{PD} of Reeb graph 0.97500.9750 0.82580.8258 0.96050.9605 0.69350.6935 0.96180.9618
Pairs of Eigen-functions Histogram 0.89000.8900 0.61530.6153 0.76970.7697 0.54690.5469 0.86920.8692
MRS 0.97000.9700 0.77740.7774 0.88210.8821 0.64310.6431 0.94380.9438
PD\mathrm{PD}s of MDRG 0.96500.9650 0.75870.7587 0.87340.8734 0.63080.6308 0.94100.9410

From the table, it can be seen that MDRG using eigenfunctions produces better results than HKS and WKS (for all the methods). With eigenfunctions as shape descriptors, the proposed method performs better than the histogram of fiber-component distributions and the results are comparable with that of MRS. On the other hand, the bottleneck distance between Reeb graphs outperforms other methods using SIHKS. However, the proposed measure using eigenfunctions performs better than the bottleneck distance between Reeb graphs using SIHKS when we experiment with 77 categories of shapes, which can be observed in the distance matrices in Figure 7.

5.1.3 Results for Seven Categories of Shapes

We experiment with 77 of the 1010 categories of shapes shown in Figure 6 and the results are provided in Table 4. Similar to the results in Table 3, MDRG using eigenfunctions performs better than other techniques using HKS and WKS. Further, the results of the distance between MDRGs are comparable with that of MRSs using eigenfunctions and the bottleneck distance between Reeb graphs using SIHKS. This can be seen from the distance matrices shown in Figure 7. The proposed distance using MDRGs is better in discriminating different classes of shapes as compared to the distances based on Reeb graphs and MRSs.

Refer to caption
Figure 6: Seven categories of shapes from SHREC 20102010
Table 4: Performance Results for seven categories of shapes in SHREC 20102010 dataset. Comparison of distance measures based on fiber-component distributions 2019-Agarwal-histogram , MRSs 2021-Ramamurthi-MRS , PDs of Reeb graphs 2014-Bauer-DistanceBetweenReebGraphs and PDs of MDRGs using HKS, WKS, SIHKS and pairs of eigenfunctions as shape descriptors
Descriptors Methods NN 11-Tier 22-Tier e-Measure DCG
HKS Histogram 0.90000.9000 0.55340.5534 0.74400.7440 0.51740.5174 0.84070.8407
MRS 0.90710.9071 0.76580.7658 0.94140.9414 0.67090.6709 0.93570.9357
PD\mathrm{PD} of Reeb graph 0.94290.9429 0.63350.6335 0.78050.7805 0.56830.5683 0.87870.8787
WKS Histogram 0.92140.9214 0.54700.5470 0.72520.7252 0.50870.5087 0.85170.8517
MRS 0.94290.9429 0.57030.5703 0.74250.7425 0.51990.5199 0.86320.8632
PD\mathrm{PD} of Reeb graph 0.88570.8857 0.45230.4523 0.60040.6004 0.41710.4171 0.79210.7921
SIHKS Histogram 0.96430.9643 0.78680.7868 0.94620.9462 0.68180.6818 0.95260.9526
MRS 0.97140.9714 0.79920.7992 0.94210.9421 0.68010.6801 0.95460.9546
PD\mathrm{PD} of Reeb graph 0.97860.9786 0.81050.8105 0.95600.9560 0.68490.6849 0.95860.9586
Pairs of Eigen-functions Histogram 0.92140.9214 0.68980.6898 0.84920.8492 0.60590.6059 0.90360.9036
MRS 0.97140.9714 0.84770.8477 0.93800.9380 0.68820.6882 0.96230.9623
PD\mathrm{PD}s of MDRG 0.95710.9571 0.85340.8534 0.92780.9278 0.67930.6793 0.96330.9633
Refer to caption
Figure 7: Distance matrices. (a) Bottleneck distance between Reeb graphs using SIHKS. (b), (c) Distances between MDRGs and MRSs, respectively using pairs of eigenfunctions

5.1.4 Classification Results

We measure the effectiveness of the proposed method in the classification of the shapes by performing a 1010-fold cross-validation using a decision tree classifier. The dataset is randomly divided into 1010 folds. The cross-validation is performed for 1010 iterations. In each iteration, one of the folds is taken as the test set and the rest of the folds constitute the training set. For each shape SS, we consider a feature vector, which consists of the distances from SS to every shape in the dataset. The classifier is trained based on the feature vectors of the shapes in the training set. The category of a shape in the test set is determined by the classifier based on its feature vector.

Refer to caption
Figure 8: Classification accuracies plotted for varying numbers of eigenfunctions using the multi-field topology-based distances between (i) fiber-component distributions 2019-Agarwal-histogram , (ii) MRSs 2021-Ramamurthi-MRS , and (iii) MDRGs. The maximum accuracy (.96.96) is achieved by the proposed distance between MDRGs using 1212 eigenfunctions.

We perform the experiments with 77 categories of shapes, described in Section 5.1.3. Figure 8 shows the classification accuracy results using three different distances based on multi-field topology, viz., (i) the distance between histograms of fiber-component distributions 2019-Agarwal-histogram , (ii) the distance between MRSs 2021-Ramamurthi-MRS and (iii) the proposed distance between MDRGs, using pairs of eigenfunctions as the shape descriptor (as in Section 5.1.2). The accuracies are plotted for varying numbers of eigenfunctions. From the plots, we observe that the classification accuracy is higher for more eigenfunctions. Further, the proposed distance gives the best accuracy (96%)(96\%) when we use 1212 eigenfunctions as the shape descriptor.

Refer to caption
Figure 9: Classification accuracies for the multi-field topology-based distances between (i) fiber-component distributions 2019-Agarwal-histogram , (ii) MRSs 2021-Ramamurthi-MRS , and (iii) MDRGs, using the shape descriptors HKS, WKS, SIHKS, and 1212 eigenfunctions (taken in pairs). The MDRG-based method using eigenfunctions achieves the maximum accuracy (0.96)(0.96).

We also evaluate the classification accuracy of the same distances using HKS, WKS, SIHKS, and pairs of eigenfunctions, respectively, as shape descriptors. The results are illustrated in Figure 9. We observe that the eigenfunctions demonstrate a better ability in classifying shapes compared to HKS, WKS, and SIHKS. Further, upon using eigenfunctions as the shape descriptor, the proposed distance between MDRGs shows improved performance over other distances and feature descriptors.

Next, we show the effectiveness of the proposed distance measure in detecting topological features in a time-varying multi-field data from computational chemistry.

5.2 Classification and Analysis of Chemistry Data

Adsorption is a process where the molecules of a gas adhere to a metal surface. This phenomenon has various applications including corrosion, electrochemistry, molecular electronics and hetrogeneous catalysis 2010-kendrick-elucidating ; 2010-somorjai-introduction . In particular, the adsorption of the Carbon Monoxide (CO) molecule on the Platinum (Pt) surface has gained interest due to its industrial applications in the areas of automobile emission, fuel cells, and other catalytic processes 2009-dimakis-attraction ; 2018-patra-surface . Hence, the interaction between the CO molecule and the platinum surface at the atomic level is studied with utmost importance.

In this study, we validate the proposed method by studying the interaction of the CO molecule with a Pt7 cluster. When the CO molecule approaches the Pt7 cluster, the internal CO bond weakens and a bond is formed between one of the Pt atoms and the C atom of the CO molecule 2009-dimakis-attraction . The Pt-CO bond is formed at site 1313, which is validated by the geometry in Figure 10(b). However, the bond length is unstable at this site. The bond stabilizes at site 2121 when the bond length between Pt and C atoms is 1.841.84Å(see the plot in Figure 10(c)). In this experiment, we aim to capture the stable bond formation between the Pt7 cluster and the CO molecule.

The Pt-CO dataset consists of electron density distributions generated by quantum mechanical computations corresponding to the HOMO (Highest Occupied Molecular Orbital), LUMO (Lowest Unoccupied Molecular Orbital) and HOMO-11, defined on a regular 41×41×4141\times 41\times 41 grid. The orbital numbers 69,7069,70 and 7171 correspond to HOMO-11, HOMO and LUMO respectively. At a series of 4040 sites, the electron density distributions are computed for varying distances between the C atom of the CO molecule and the Pt surface 2019-Agarwal-histogram .

5.2.1 Feature Detection Results

We examine the performance of the proposed measure in detecting the formation of a stable Pt-CO bond for various combinations of the orbital densities at HOMO, LUMO and HOMO-11 molecular orbitals. We compare the performance of scalar, bivariate and trivariate fields. In our experiments, we subdivide the range of each of the fields into 88 slabs for constructing the JCNs. Figure 10(a) shows the plots for the trivariate field (HOMO, LUMO, HOMO-11), consisting of all three scalar fields. We observe the most significant peak at site 2121. This corresponds to the stable bond formation between Pt and C atoms, which is indicated by the bond length between Pt and C remaining constant after site 2121 (see Figure 10(b)). Ramamurthi et al.2021-Ramamurthi-MRS identified this site of the stable bond formation using the trivariate field.

Refer to caption
Figure 10: Plots for bond detection in Pt-CO data. (a) Distance between consecutive sites of the data computed using the proposed distance measure for the trivariate field (HOMO, HOMO-11, LUMO). The most significant peak is at site 2121, which corresponds to the stable bond formation. (b) Plot of the Pt-CO bond length at various sites. The bond length between Pt and C atoms stabilizes at site 2121. (c) Geometry of the bond formation between the Pt atom and CO molecule. The bond is formed at site 1313 but the bond length is unstable at this site

We also measure the performance of the proposed method with scalar fields and bivariate fields. Figure 11(a) shows the plots for each of the scalar fields HOMO, HOMO-11 and LUMO. We observe that the site of stable bond formation is not detected using each of the scalar fields. Figure 11(b) shows the plots for the bivariate fields (HOMO, LUMO), (HOMO, HOMO-11) and (LUMO, HOMO-11). We observe that the combinations (HOMO, HOMO-11) and (LUMO, HOMO-11) are unable to detect the correct site in the plots. However, for the combination (HOMO, LUMO), the site corresponding to the stable bond formation is detected, which is also observed in 2019-Agarwal-histogram ; 2021-Ramamurthi-MRS . This illustrates the significance of using multi-fields in detecting the bond formation between Pt and C atoms.

Refer to caption
Figure 11: Distance plots for Pt-CO bond detection data using scalar and bivariate fields. (a) Plots for the scalar fields HOMO-11, HOMO and LUMO. (b). Plots for the bivariate fields (HOMO,LUMO), (HOMO, HOMO-11) and (LUMO, HOMO-11). The stable Pt-CO bond formation is detected using the bivariate field (HOMO, LUMO), which is indicated by the most significant peak at site 2121
Table 5: Computational performance results for SHREC 20102010 and Pt-CO datasets. Here, Shapes/Sites: Shapes or sites compared; Field(s): Field(s) for constructing the MDRGs; Slabs: Number of slabs into which the ranges of the component fields are subdivided for constructing the JCNs; |V1|,|V2|\lvert{V_{1}}\rvert,\lvert{V_{2}}\rvert: Number of nodes in the JCNs; t1t_{1}, t2t_{2} represent the time (in seconds) for computing the distance between MDRGs based on the 0th ordinary persistence diagrams corresponding to sub-level set super-level set filtrations respectively and t3t_{3} is the time (in seconds) for the computing the distance between MDRGs based on the 11st extended persistence diagrams
Dataset Shapes/Sites Field(s) Slabs |V1|,|V2|\lvert{V_{1}}\rvert,\lvert{V_{2}}\rvert t1t_{1} t2t_{2} t3t_{3}
SHREC 20102010 Human, Hand |ϕ^1|\left|\hat{\phi}_{1}\right| 32,3232,32 75,7375,73 0.45560.4556s 0.45820.4582s 0.45230.4523s
SHREC 20102010 Human, Hand |ϕ^2|\left|\hat{\phi}_{2}\right| 32,3232,32 60,6660,66 0.40400.4040s 0.41050.4105s 0.39760.3976s
SHREC 20102010 Human, Hand |ϕ^1|,|ϕ^2|\left|\hat{\phi}_{1}\right|,\left|\hat{\phi}_{2}\right| 32,3232,32 278,526278,526 0.77890.7789s 0.72550.7255s 0.73000.7300s
Pt-CO 21,2221,22 HOMO 8,88,8 34,5434,54 67.165767.1657s 67.057067.0570 67.300467.3004s
Pt-CO 21,2221,22 HOMO, LUMO 8,88,8 724,449724,449 79.631179.6311s 80.233580.2335 80.717280.7172s
Pt-CO 21,2221,22 HOMO, LUMO, HOMO-11 8,88,8 2355,16492355,1649 96.541996.5419s 102.9539102.9539 95.490195.4901s

5.2.2 Classification Results

In this subsection, we evaluate the ability of the proposed distance between MDRGs in classifying a site as occurring before and after the stabilization of the bond between Pt and CO. We divide the dataset into two classes, consisting of the sites before (sites 11-2121) and after the stabilization of the Pt-CO bond (sites 2222-4040), respectively. Similar to Section 5.1.4, we employ a decision tree classifier and analyze the effectiveness of methods by performing 1010-fold cross-validation. We measure the classification accuracies and plot the Receiver Operator Characteristics (ROC) curves. The ROC curve is constructed by plotting the true positive rate of classification (TPR) against the false positive rate of classification (FPR). Here, the negative and positive classes are the sites before and after the bond stabilization, respectively. TPR (FPR) gives the proportion of the correct (wrong) predictions in the positive (negative) class. An ROC curve having a higher area under the curve (AUC) indicates better distinguishing ability between the classes.

Refer to caption
(a) Classification Accuracies
Refer to caption
(b) ROC Curve
Figure 12: Classification accuracies and ROC curve plots for the proposed distance using the field combinations HOMO, LUMO, LUMO, (HOMO, LUMO), (HOMO, HOMO-11), (LUMO, HOMO-11) and (HOMO-11, LUMO, HOMO). The trivariate field (HOMO-11, LUMO, HOMO) has maximum classification accuracy (0.975)(0.975) and area under the curve (0.98)(0.98), which indicates better classification ability upon using all the fields.

Figure 12 shows the results of classification accuracies and plots of the ROC curves for the proposed method using various combinations of HOMO, HOMO-11 and LUMO fields. Here, the ROC curve is computed as the mean of the ROC curves obtained for each of the 1010 folds used as a test set, in the 1010-fold cross-validation. We observe that the distance between MDRGs using the trivariate field (HOMO-11, LUMO, HOMO) obtains a better accuracy and has a higher area under the curve (AUC) compared to other field combinations.

Refer to caption
(a) Classification Accuracies
Refer to caption
(b) ROC Curve
Figure 13: Classification accuracies and ROC curve plots for the distance measures based on fiber-component distributions 2019-Agarwal-histogram , MRSs 2021-Ramamurthi-MRS , and MDRGs. The results are computed for the trivariate field (HOMO-11, LUMO, HOMO). The MDRG-based distance performs better than the MRS-based distance and its classification ability is same as the fiber-component distribution-based distance.

We compare the performance of the proposed method with other multi-field based distances by plotting the classification accuracies and ROC curves for the trivariate field (HOMO-11, LUMO, HOMO). The plots are shown in Figure 13. The proposed method performs better than the distance between MRSs 2021-Ramamurthi-MRS and its performance is on par with the distance between fiber-component distributions 2019-Agarwal-histogram .

5.3 Computational Performance

Table 5 shows the computational performance results of the proposed distance between MDRGs in shape matching (SHREC 20102010) and Pt-CO bond detection datasets using scalar and multi-fields. All timings were performed on a 2.202.20 GHz 1010-Core Intel Xeon(R) with 1616 GB memory, running Ubuntu 16.0416.04. From the table, we observe the time taken for computing the distance between MDRGs based on the 0th ordinary and 11st extended persistence diagrams respectively.

6 Conclusion

In this article, we introduce a novel distance measure between MDRGs by computing the bottleneck distances between their component Reeb graphs. We show that the proposed distance measure is a pseudo-metric and satisfies the stability property. The effectiveness of the proposed distance measure is shown in the problem of shape classification, where the performance of using absolute values of eigenfunctions as shape descriptors is compared with HKS, WKS and SIHKS. Further, we have shown the performance of the proposed distance measure in detecting the formation of a stable bond between Pt and CO molecules. The ability of the distance is demonstrated in the classification of sites based on their occurrence before and after the stabilization of the Pt-CO bond.

The computation of the MDRG of a multi-field requires a JCN, which is computationally expensive to construct. Thus, there is a need for a faster algorithm for constructing the MDRG. We have shown applications of the proposed method in the problem of shape retrieval and in detecting the formation of a stable Pt-CO bond, which is an application in the field of computational chemistry. However, exploring other computational domains will be useful for further analysis on the effectiveness of the proposed distance measure. Further, we note, the multiplicity of the eigenvalues of the Laplace Beltrami operator can be greater than 11, i.e. there can be more than one eigenfunction corresponding to an eigenvalue. In such cases, there is an ambiguity in ordering the eigenfunctions according to eigenvalues. Such issues need to be addressed in the future.

7 Declarations

Funding: The authors would like to thank the Science and Engineering Research Board (SERB), India (SERB/CRG/2018/000702) and International Institute of Information Technology (IIITB), Bangalore for funding this project and for generous travel support.
Conflicts of interest: The authors of this paper have no conflicts of interest.

Data availability statement: The datasets analysed during the current study are available from the corresponding author on reasonable request.

References

  • (1) Agarwal, P., Edelsbrunner, H., Harer, J., Wang, Y.: Extreme elevation on a 2-manifold. Discrete &\& Computational Geometry 36, 553–572 (2006). DOI 10.1007/s00454-006-1265-8
  • (2) Agarwal, T., Chattopadhyay, A., Natarajan, V.: Topological Feature Search in Time-Varying Multifield Data. TopoInVis 2019, Nyköping, Sweden, preprint arXiv:1911.00687 (2019)
  • (3) Aubry, M., Schlickewei, U., Cremers, D.: The wave kernel signature: A quantum mechanical approach to shape analysis. pp. 1626–1633 (2011). DOI 10.1109/ICCVW.2011.6130444
  • (4) Bauer, U., Ge, X., Wang, Y.: Measuring distance between reeb graphs. In: Proceedings of the Thirtieth Annual Symposium on Computational Geometry, SOCG’14, p. 464–473. Association for Computing Machinery, New York, NY, USA (2014)
  • (5) Bronstein, M., Kokkinos, I.: Scale-invariant heat kernel signatures for non-rigid shape recognition. 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition pp. 1704–1711 (2010)
  • (6) Carr, H., Duke, D.: Joint Contour Nets. IEEE Transactions on Visualization and Computer Graphics 20(8), 1100–1113 (2014). DOI 10.1109/TVCG.2013.269
  • (7) Chattopadhyay, A., Carr, H., Duke, D., Geng, Z.: Extracting Jacobi Structures in Reeb Spaces. In: N. Elmqvist, M. Hlawitschka, J. Kennedy (eds.) EuroVis - Short Papers, pp. 1–4. The Eurographics Association (2014). DOI 10.2312/eurovisshort.20141156
  • (8) Chattopadhyay, A., Carr, H., Duke, D., Geng, Z., Saeki, O.: Multivariate topology simplification. Computational Geometry: Theory and Applications 58, 1–24 (2016). DOI 10.1016/j.comgeo.2016.05.006
  • (9) Chavel, I.: Eigenvalues in Riemannian geometry. Academic press (1984)
  • (10) Chazal, F., Cohen-Steiner, D., Glisse, M., Guibas, L.J., Oudot, S.Y.: Proximity of persistence modules and their diagrams. In: Proceedings of the Twenty-Fifth Annual Symposium on Computational Geometry, SCG ’09, p. 237–246. Association for Computing Machinery, New York, NY, USA (2009). DOI 10.1145/1542362.1542407. URL https://doi.org/10.1145/1542362.1542407
  • (11) Cohen-Steiner, D., Edelsbrunner, H., Harer, J.: Stability of persistence diagrams. Discret. Comput. Geom. 37(1), 103–120 (2007)
  • (12) Dey, T.K., Mémoli, F., Wang, Y.: Multiscale mapper: Topological summarization via codomain covers. In: Proceedings of the Twenty-Seventh Annual ACM-SIAM Symposium on Discrete Algorithms, SODA ’16, p. 997–1013. Society for Industrial and Applied Mathematics, USA (2016)
  • (13) Dimakis, N., Cowan, M., Hanson, G., Smotkin, E.S.: Attraction- repulsion mechanism for carbon monoxide adsorption on platinum and platinum- ruthenium alloys. The Journal of Physical Chemistry C 113(43), 18,730–18,739 (2009). DOI 10.1021/jp9036809
  • (14) Duke, D., Carr, H., Knoll, A., Schunck, N., Nam, H.A., Staszczak, A.: Visualizing nuclear scission through a multifield extension of topological analysis. IEEE Transactions on Visualization and Computer Graphics 18(12), 2033–2040 (2012). DOI 10.1109/TVCG.2012.287
  • (15) Edelsbrunner, H., Harer, J.: Jacobi sets of multiple morse functions. Foundations of Computational Mathematics - FoCM (2004)
  • (16) Edelsbrunner, H., Harer, J.: Computational Topology - an Introduction. American Mathematical Society (2010). URL http://www.ams.org/bookstore-getitem/item=MBK-69
  • (17) Edelsbrunner, H., Harer, J., Patel, A.K.: Reeb spaces of piecewise linear mappings. In: Proceedings of the Twenty-Fourth Annual Symposium on Computational Geometry, SCG ’08, p. 242–250. Association for Computing Machinery, New York, NY, USA (2008). DOI 10.1145/1377676.1377720. URL https://doi.org/10.1145/1377676.1377720
  • (18) Edelsbrunner, H., Letscher, D., Zomorodian, A.: Topological persistence and simplification. Discret. Comput. Geom. 28(4), 511–533 (2002). DOI 10.1007/s00454-002-2885-2
  • (19) Hilaga, M., Shinagawa, Y., Kohmura, T., Kunii, T.L.: Topology Matching for Fully Automatic Similarity Estimation of 3D Shapes. In: Proceedings of the 28th annual conference on Computer graphics and interactive techniques, pp. 203–212. ACM (2001)
  • (20) Kendrick, I., Kumari, D., Yakaboski, A., Dimakis, N., Smotkin, E.S.: Elucidating the ionomer-electrified metal interface. Journal of the American Chemical Society 132(49), 17,611–17,616 (2010). DOI 10.1021/ja1081487
  • (21) Kleiman, Y., Ovsjanikov, M.: Robust structure-based shape correspondence. Computer Graphics Forum 38 (2017)
  • (22) Kuhn, H.W.: The hungarian method for the assignment problem. Naval Research Logistics Quarterly 2(1-2), 83–97 (1955). DOI https://doi.org/10.1002/nav.3800020109. URL https://onlinelibrary.wiley.com/doi/abs/10.1002/nav.3800020109
  • (23) Levy, B.: Laplace-beltrami eigenfunctions towards an algorithm that ”understands” geometry. In: IEEE International Conference on Shape Modeling and Applications 2006 (SMI’06), pp. 13–13 (2006). DOI 10.1109/SMI.2006.21
  • (24) Li, C., Ovsjanikov, M., Chazal, F.: Persistence-based structural recognition. In: 2014 IEEE Conference on Computer Vision and Pattern Recognition, pp. 2003–2010 (2014). DOI 10.1109/CVPR.2014.257
  • (25) Lian, Z., Godil, A., Fabry, T., Furuya, T., Hermans, J., Ohbuchi, R., Shu, C., Smeets, D., Suetens, P., Vandermeulen, D., Wuhrer, S.: Shrec’10 track: Non-rigid 3d shape retrieval. pp. 101–108 (2010). DOI 10.2312/3DOR/3DOR10/101-108
  • (26) Marchese, A., Maroulas, V.: Signal classification with a point process distance on the space of persistence diagrams. Advances in Data Analysis and Classification 12 (2017). DOI 10.1007/s11634-017-0294-x
  • (27) Maroulas, V., Micucci, C.P., Spannaus, A.: A stable cardinality distance for topological classification. Adv. Data Anal. Classif. 14(3), 611–628 (2020). DOI 10.1007/s11634-019-00378-3. URL https://doi.org/10.1007/s11634-019-00378-3
  • (28) Mateus, D., Cuzzolin, F., Horaud, R., Boyer, E.: Articulated Shape Matching Using Locally Linear Embedding and Orthogonal Alignment. In: NRTL 2007 - Workshop on Non-rigid Registration and Tracking through Learning, pp. 1–8. IEEE Computer Society Press, Rio de Janeiro, Brazil (2007). DOI 10.1109/ICCV.2007.4409180. URL https://hal.inria.fr/inria-00590237
  • (29) Patra, A.: Surface Properties, Adsorption, and Phase Transitions with a Dispersion-Corrected Density Functional. Temple University (2018)
  • (30) Poulenard, A., Skraba, P., Ovsjanikov, M.: Topological function optimization for continuous shape matching. Computer Graphics Forum 37, 13–25 (2018)
  • (31) Ramamurthi, Y., Agarwal, T., Chattopadhyay, A.: A topological similarity measure between multi-resolution reeb spaces. IEEE Transactions on Visualization and Computer Graphics pp. 1–1 (2021). DOI 10.1109/TVCG.2021.3087273
  • (32) Reuter, M., Wolter, F.E., Peinecke, N.: Laplace-beltrami spectra as ’shape-dna’ of surfaces and solids. Comput. Aided Des. 38(4), 342–366 (2006). DOI 10.1016/j.cad.2005.10.011. URL https://doi.org/10.1016/j.cad.2005.10.011
  • (33) Reuter, M., Wolter, F.E., Shenton, M., Niethammer, M.: Laplace–beltrami eigenvalues and topological features of eigenfunctions for statistical shape analysis. Computer-Aided Design 41(10), 739–755 (2009). DOI https://doi.org/10.1016/j.cad.2009.02.007. URL https://www.sciencedirect.com/science/article/pii/S0010448509000463
  • (34) Rosenberg, S.: The Laplacian on a Riemannian Manifold: An Introduction to Analysis on Manifolds. London Mathematical Society Student Texts. Cambridge University Press (1997). DOI 10.1017/CBO9780511623783
  • (35) Rustamov, R.M.: Laplace-Beltrami Eigenfunctions for Deformation Invariant Shape Representation. In: A. Belyaev, M. Garland (eds.) Geometry Processing. The Eurographics Association (2007). DOI 10.2312/SGP/SGP07/225-233
  • (36) Rustamov, R.M.: Laplace-Beltrami Eigenfunctions for Deformation Invariant Shape Representation. In: A. Belyaev, M. Garland (eds.) Geometry Processing. The Eurographics Association (2007). DOI 10.2312/SGP/SGP07/225-233
  • (37) Saeki, O.: Topology of singular fibers of differentiable maps, Lecture Notes in Mathematics, vol. 1854. Springer Heidelberg, Germany (2004)
  • (38) Saeki, O.: Theory of singular fibers and reeb spaces for visualization. In: H. Carr, C. Garth, T. Weinkauf (eds.) Topological Methods in Data Analysis and Visualization IV, pp. 3–33. Springer International Publishing, Cham (2017)
  • (39) Saeki, O., Takahashi, S., Sakurai, D., Wu, H.Y., Kikuchi, K., Carr, H., Duke, D., Yamamoto, T.: Visualizing Multivariate Data Using Singularity Theory, Mathematics for Industry, vol. 1, chap. The Impact of Applications on Mathematics, pp. 51–65. Springer Japan (2014)
  • (40) Singh, G., Mémoli, F., Carlsson, G.E.: Topological methods for the analysis of high dimensional data sets and 3d object recognition. In: PBG@Eurographics (2007)
  • (41) Somorjai, G.A., Li, Y.: Introduction to surface chemistry and catalysis. John Wiley & Sons (2010)
  • (42) Sridharamurthy, R., Masood, T.B., Kamakshidasan, A., Natarajan, V.: Edit distance between merge trees. IEEE Transactions on Visualization and Computer Graphics 26(3), 1518–1531 (2020). DOI 10.1109/TVCG.2018.2873612
  • (43) Sun, J., Ovsjanikov, M., Guibas, L.: A concise and provably informative multi-scale signature based on heat diffusion. In: Proceedings of the Symposium on Geometry Processing, SGP ’09, p. 1383–1392. Eurographics Association, Goslar, DEU (2009)
  • (44) Tam, G.K.L., Lau, R.W.H.: Deformable model retrieval based on topological and geometric signatures. IEEE Transactions on Visualization and Computer Graphics 13(3), 470–482 (2007)
  • (45) Tarjan, R.E.: Efficiency of a Good But Not Linear Set Union Algorithm. J. ACM 22(2), 215–225 (1975)
  • (46) Thomas, D.M., Natarajan, V.: Multiscale symmetry detection in scalar fields by clustering contours. IEEE Trans. Visualization and Computer Graphics 20(12), 2427–2436 (2014)
  • (47) Tu, J., Hajij, M., Rosen, P.: Propagate and pair: A single-pass approach to critical point pairing in reeb graphs. In: G. Bebis, R. Boyle, B. Parvin, D. Koracin, D. Ushizima, S. Chai, S. Sueda, X. Lin, A. Lu, D. Thalmann, C. Wang, P. Xu (eds.) Advances in Visual Computing, pp. 99–113. Springer International Publishing (2019)
  • (48) Umeyama, S.: An eigendecomposition approach to weighted graph matching problems. IEEE Transactions on Pattern Analysis and Machine Intelligence 10(5), 695–703 (1988). DOI 10.1109/34.6778
  • (49) Wang, Z., Lin, H.: 3d shape retrieval based on laplace operator and joint bayesian model. Visual Informatics 4(3), 69–76 (2020). DOI https://doi.org/10.1016/j.visinf.2020.08.002. URL https://www.sciencedirect.com/science/article/pii/S2468502X20300346
  • (50) Zhang, X., Marcos, Bajaj, C.L., Baker, N.: Fast Matching of Volumetric Functions Using Multi-resolution Dual Contour Trees (2004)
  • (51) Zomorodian, A., Carlsson, G.: Computing persistent homology. Discrete Comput. Geom. 33(2), 249–274 (2005)