Decoding algorithms of monotone codes and azinv codes and their unified view

Hokuto Takahashi Graduate School of Science and Engineering,
Chiba University
1-33 Yayoi-cho, Inage-ku, Chiba-city, Chiba, Japan
263-0022
Email: ayca5495@chiba-u.jp Manabu Hagiwara Graduate School of Science,
Chiba University
1-33 Yayoi-cho, Inage-ku, Chiba-city, Chiba, Japan
263-0022
Email: hagiwara@math.s.chiba-u.ac.jp

Abstract

This paper investigates linear-time decoding algorithms for two classes of error-correcting codes. One of the classes is monotone codes which are known as single deletion codes. The other is azinv codes which are known as single balanced adjacent deletion codes. As results, this paper proposes generalizations of Levenshtein’s decoding algorithm for Levenshtein’s single deletion codes. This paper points out that it is possible to unify our new two decoding algorithms.

I Introduction

Insertion errors and deletion errors are considered to be synchronization errors over communication channels and storage channels such as DNA-data based storages [1, 2], racetrack memories[3, 4], or Bit Patterned Media[5, 6]. The study of deletion error-correcting codes started with Levenshtein’s work[7], where he proved that Varshamov-Tenengolts (VT) codes are capable of correcting single insertions or deletions. In Levenshtein’s proof, he showed beautiful decoding algorithm to correct single deletions. This paper provides two generalizations of Levenshtein’s decoding algorithms. One corrects single deletions and reversals for monotone codes, the other corrects single balanced adjacent deletions and single balanced adjacent reversals for azinv codes. A single Balanced Adjacent Deletion (BAD) error is a double deletion error that deletes two different consecutive binary symbols, i.e., 01 or 10. A single Balanced Adjacent Reversal (BAR) error is a swap error that reverses two different consecutive binary symbols.

For applications, the computational cost of decoding algorithms is preferable to be polynomial. One of remarkable aspects of our algorithms is the computational cost. The costs are only linear to their input, i.e., the length of received word. Another remarkable aspect is the following. While these classes and their correctable error are different, our algorithms can be allowed to be unified.

This paper is organized as follows. In Section II, we first introduce monotone codes and provide a single deletion/reveral error-correcting algorithm of the monotone codes. In Section III, we first introduce azinv codes and provide a single BAD/BAR error-correcting algorithm of the azinv codes. In Section IV, we provide a unified view of the algorithms of the monotone codes and the azinv codes.

II Monotone Code and its Decoding Algorithm

Throughout this paper, $\mathbb{B}$ denotes the binary set $\{0,1\}$ . For a positive integer $n$ , $[n]$ denotes $\left\{1,2,\cdots,n\right\}$ .

In this section, we introduce monotone codes and provide an algorithm to make the monotone codes single deletion/reversal error-correctable (Algorithm 1). Errors treated in this section are deletion errors and reversal errors, which are defined below.

Definition II.1 (Deletion and Reversal).

Let $n$ be a positive integer and $i\in[n]$ . Define a map $\mathrm{del}_{i}:\mathbb{B}^{n}\rightarrow\mathbb{B}^{n-1}$ as

\mathrm{del}_{i}(x_{1}x_{2}\cdots x_{n})\coloneqq x_{1}\cdots x_{i-1}x_{i+1}\cdots x_{n}.

We call the map $\mathrm{del}_{i}$ deletion. Note that $\mathbb{B}^{0}:=\{\varepsilon\}$ , where $\varepsilon$ is the empty word.

Define a map $\mathrm{rev}_{i}:\mathbb{B}^{n}\rightarrow\mathbb{B}^{n}$ as

\mathrm{rev}_{i}(x_{1}x_{2}\cdots x_{n})\coloneqq x_{1}\cdots x_{i-1}\overline{x}_{i}x_{i+1}\cdots x_{n}

with $\overline{0}=1$ and $\overline{1}=0$ . We call the map $\mathrm{rev}_{i}$ reversal.

The following codes, monotone codes, are known as single deletion error-correcting codes [8]. However, no decoding algorithm has been studied.

Definition II.2 (Monotone code[8]).

Let $n$ and $m$ be positive integers, $a$ an integer, and ${\bf k}=(k_{1},k_{2},\cdots,k_{n})$ a positive monotonic increasing integer sequence with $m>k_{n}$ . Define a set $M_{a,m,{\bf k}}(n)$ as

M_{a,m,{\bf k}}(n)\coloneqq\left\{{\bf x}\in\mathbb{B}^{n}\mid\rho_{{\bf k}}({\bf x})\equiv a\pmod{m}\right\},

where

\rho_{{\bf k}}({\bf x})\coloneqq\sum_{i=1}^{|{\bf x}|}k_{i}x_{i}

and $|{\bf x}|$ denotes the length of ${\bf x}$ . $\rho_{{\bf k}}({\bf x})$ is defined only for k with $|{\bf k}|\geq|{\bf x}|$ . We call $M_{a,m,{\bf k}}(n)$ a monotone code.

Remark II.3.

If ${\bf k}=(1,2,\cdots,n)$ , the monotone code is called a Levenshtein code. If ${\bf k}=(1,2,\cdots,n)$ and $m=n+1$ , the monotone code is called a VT code. Levenshtein [7] proved that Levenshtein codes are single deletion error-correcting codes with $m>n$ . He also proved that Levenshtein codes are single deletion/reversal error-correcting codes with $m\geq 2n$ .

The following is one of the main contributions of this paper.

Theorem II.4.

Let $M_{a,m,{\bf k}}(n)$ be a monotone code and ${\bf x}$ be a codeword of $M_{a,m,{\bf k}}(n)$ . Let $\mathrm{Dec}_{M}$ denote Algorithm 1.

1. $\mathrm{Dec}_{M}({\bf x})={\bf x}$ .

2. Assume $k_{n}<m$ . For any single deletion $\mathrm{del}$ , $\mathrm{Dec}_{M}\circ\mathrm{del}({\bf x})={\bf x}$ .

3. If $k_{n}<m$ , any monotone code $M_{a,m,{\bf k}}(n)$ is a single deletion error-correcting code with Algorithm 1 as a decoding algorithm.

4. Assume $2k_{n}\leq m$ . For any single reversal $\mathrm{rev}$ , $\mathrm{Dec}_{M}\circ\mathrm{rev}({\bf x})={\bf x}$ .

5. If $2k_{n}\leq m$ , any monotone code $M_{a,m,{\bf k}}(n)$ is a single deletion/reversal error-correcting code with Algorithm 1 as a decoding algorithm. Here $\circ$ denotes the map composition.

Proof.

1. It is trivial from the steps 3 and 4 of Algorithm 1. 2. A proof is provided in subsection II-B. 3. It is a corollary of 1. and 2. 4. A proof is provided in subsection II-C. 5. It is a corollary of 3. and 4. ∎

II-A Decoding algorithm for single deletion/reversal errors

In this subsection, we provide notation in Algorithm 1 and the details of Algorithm 1.

For an integer $n\geq 1$ , a positive integer $i\in[n]$ , a positive monotonic increasing integer sequence ${\bf k}=(k_{1},k_{2},\cdots,k_{n})$ , and a binary sequence ${\bf y}=y_{1}y_{2}\cdots y_{n-1}\in\mathbb{B}^{n-1}$ , define maps as

	$\displaystyle L^{(0)}_{\bf k}(i,{\bf y})\coloneqq\begin{cases}\displaystyle\sum_{j=1}^{i-1}\overline{y}_{j}(k_{j+1}-k_{j})\quad&(i\neq 1),\\ 0&(i=1).\end{cases}$
	$\displaystyle L^{(1)}_{\bf k}(i,{\bf y})\coloneqq\begin{cases}\displaystyle\sum_{j=1}^{i-1}y_{j}(k_{j+1}-k_{j})\quad&(i\neq 1),\\ 0&(i=1).\end{cases}$
	$\displaystyle R^{(0)}_{\bf k}(i,{\bf y})\coloneqq\begin{cases}\displaystyle\sum_{j=i}^{n-1}\overline{y}_{j}(k_{j+1}-k_{j})\quad&(i\neq n),\\ 0&(i=n).\end{cases}$
	$\displaystyle R^{(1)}_{\bf k}(i,{\bf y})\coloneqq\begin{cases}\displaystyle\sum_{j=i}^{n-1}y_{j}(k_{j+1}-k_{j})\quad&(i\neq n),\\ 0&(i=n).\end{cases}$

We denote $R^{(1)}_{\bf k}(1,{\bf y})$ by $\mathrm{wt}_{\bf k}({\bf y})$ . Note that $\mathrm{wt}_{\bf k}({\bf y})$ is coincided with the Hamming weight of y if ${\bf k}=(1,2,\cdots,n).$ We omit ${\bf k}$ from the notations, if ${\bf k}=(1,2,\cdots,n)$ .

For ${\bf b}\in\mathbb{B}$ , $i\in[n]$ , and $\mathrm{ins}_{i,{\bf b}}({\bf y})$ , $L^{(0)}(i,{\bf y})$ (resp. $L^{(1)}(i,{\bf y})$ ) equals to the number of $0$ (resp. $1$ ) to the left of inserted position $i$ , $R^{(0)}(i,{\bf y})$ (resp. $R^{(1)}(i,{\bf y})$ ) equals to the number of $0$ (resp. $1$ ) to the right of inserted position $i$ .

The map defined below is known as an inverse operation of deletion and is used in Algorithm 1.

Definition II.5.

For a positive integer $i\in[n+1]$ and a non-empty binary sequence ${\bf b}$ , define a map $\mathrm{ins}_{i,{\bf b}}:\mathbb{B}^{n}\rightarrow\mathbb{B}^{n+|{\bf b}|}$ as

\mathrm{ins}_{i,{\bf b}}(x_{1}x_{2}\cdots x_{n})\coloneqq\\ x_{1}\cdots x_{i-1}bx_{i}\cdots x_{n}.

We call the map $\mathrm{ins}_{i,{\bf b}}$ insertion.

Algorithm 1 Decoding algorithm for single deletion/reversal errors

1: Input:

a\in\mathbb{Z}_{\geq 0}

n

and

m\in\mathbb{Z}_{\geq 1}

{\bf y}\in\bigcup_{t\geq 0}\mathbb{B}^{t}

, and

{\bf k}=(k_{1},k_{2},\cdots,k_{n})\in\mathbb{Z}^{n}

. Output:

{\bf z}\in\mathbb{B}^{n}

or a symbol ?.

2: Compute the length of

{\bf y}

, say

|{\bf y}|

3: if

|{\bf y}|=n

then

4: go to 10.

5: else if

|{\bf y}|=n-1

then

6: go to 21.

7: else

{\bf z}\coloneqq?

. Go to 35.

9: end if

10: Compute

r\coloneqq\min\{s\in\mathbb{Z}_{\geq 0}\mid s\equiv a-\rho_{\bf k}({\bf y})\pmod{m}\}

11: if

r=0

then

12:

{\bf z}\coloneqq{\bf y}

. Go to 35.

13: else

14: Compute

p\coloneqq\min\{j\in[n]\mid k_{j}=\min\{r,m-r\}\}

15: if

\mathrm{rev}_{p}({\bf y})\in M_{a,m,{\bf k}}(n)

then

16:

{\bf z}\coloneqq\mathrm{rev}_{p}({\bf y})

. Go to 35.

17: else

18:

{\bf z}\coloneqq?

. Go to 35.

19: end if

20: end if

21: Compute

r\coloneqq\min\{s\in\mathbb{Z}_{\geq 0}\mid s\equiv a-\rho_{\bf k}({\bf y})\pmod{m}\}

22: Compute

w\coloneqq\mathrm{wt}_{\bf k}({\bf y})

23: if

r\leq w

then

24: Compute

p\coloneqq\max\{j\in[n]\mid R^{(1)}_{\bf k}(j,{\bf y})=r\}

25:

b\coloneqq 0

26: else

27: Compute

p\coloneqq\min\{j\in[n]\mid L^{(0)}_{\bf k}(j,{\bf y})=r-w-k_{1}\}.

28:

b\coloneqq 1

29: end if

30: if

\mathrm{ins}_{p,{\bf b}}({\bf y})\in M_{a,m,{\bf k}}(n)

then

31:

{\bf z}\coloneqq\mathrm{ins}_{p,{\bf b}}({\bf y})

32: else

33:

{\bf z}\coloneqq?

34: end if

35: Output

{\bf z}

Note that the steps 21st - 34th of Algorithm 1 are coincided with Levenshtein’s decoding algorithm for single deletion errors if ${\bf k}=(1,2,\cdots,n)$ [7].

Before we move to proofs for Theorem II.4, we provide examples of Algorithm 1 for each error.

Example II.6 (Decoding for a single deletion error).

Let $n=4,a=0,m=9,{\bf k}=(1,3,6,8),$ and ${\bf y}=101$ . Note that the monotone code has four words,

M_{0,9,{\bf k}}(4)=\{0000,1001,0110,1111\}.

•

Since $|{\bf y}|=3$ , we go to step 6 and then to step 21.
•

Algorithm 1 computes $r$ and $w$ . We obtain $r=2$ and $w=4.$
•

Since $r\leq w$ ,

$\displaystyle p$ $\displaystyle=\max\{j\in[4]\mid R^{(1)}(j,{\bf y})=2)\}$

$\displaystyle=3,$

$\displaystyle b$ $\displaystyle=0.$
•

The output $\mathrm{Dec}_{M}({\bf y})=\mathrm{ins}_{p,b}({\bf y})$ is

$\mathrm{ins}_{3,0}(101)=1001.$

Example II.7 (Decoding for a single reversal error).

Let $n=6,a=0,m=20,{\bf k}=(1,2,3,8,9,10),$ and ${\bf y}=111110$ . Note that the monotone code has five words,

M_{0,20,{\bf k}}(6)=\{000000,110110,001110,100011,010101\}.

•

Since $|{\bf y}|=6$ , we go to step 4 and then to step 10.
•

Algorithm 1 computes $r$ , then we obtain $r=17$ .
•

Since $r\neq 0$ ,

$\displaystyle p$ $\displaystyle=\min\{3,17\}$

$\displaystyle=3.$
•

The output $\mathrm{Dec}_{M}({\bf y})=\mathrm{rev}_{p}({\bf y})$ is

$\mathrm{rev}_{3}(111110)=110110.$

II-B Proof for single deletion error-correction

To prove 2 of Theorem II.4, we introduce the following four Lemmas II.9, II.10, II.11, and II.12. From now on, till the end of this subsection, we assume the following.

Hypothesis II.8.

A binary sequence ${\bf x}$ is a codeword of $M_{a,m,{\bf k}}(n)$ . Set ${\bf y}\coloneqq\mathrm{del}_{i}({\bf x})$ for a fixed $i$ . $r$ is the value at the step 21 of Algorithm 1.

Lemma II.9.

k_{i}=k_{1}+L^{(0)}_{\bf k}(i,{\bf y})+L^{(1)}_{\bf k}(i,{\bf y}).

Proof.

It follows from the definitions of $L^{(0)}_{\bf k}(i,{\bf y})$ and $L^{(1)}_{\bf k}(i,{\bf y})$ .

	$\displaystyle(\text{R.H.S.})$	$\displaystyle=k_{1}+\sum_{j=1}^{i-1}\overline{y}_{j}(k_{j+1}-k_{j})+\sum_{j=1}^{i-1}y_{j}(k_{j+1}-k_{j})$
		$\displaystyle=k_{1}+\sum_{j=1}^{i-1}(\overline{y}_{j}+y_{j})(k_{j+1}-k_{j})$
		$\displaystyle=k_{1}+\sum_{j=1}^{i-1}(k_{j+1}-k_{j})$
		$\displaystyle=k_{i}.$

∎

Lemma II.10.

$\mathrm{wt}_{\bf k}({\bf y})=L^{(1)}_{\bf k}(i,{\bf y})+R^{(1)}_{\bf k}(i,{\bf y})$ .

Proof.

It follows from the definitions of $\mathrm{wt}_{\bf k}({\bf y}),L^{(1)}_{\bf k}(i,{\bf y}),$ and $R^{(1)}_{\bf k}(i,{\bf y})$ .

	$\displaystyle(\text{R.H.S.})$	$\displaystyle=\sum_{j=1}^{i-1}y_{j}(k_{j+1}-k_{j})+\sum_{j=i}^{n-1}y_{j}(k_{j+1}-k_{j})$
		$\displaystyle=\sum_{j=1}^{n-1}y_{j}(k_{j+1}-k_{j})$
		$\displaystyle=\mathrm{wt}_{\bf k}({\bf y}).$

∎

Lemma II.11.

The following four inequalities hold.

$\displaystyle 0$	$\displaystyle\leq R^{(1)}_{\bf k}(i,{\bf y})$	(1)
	$\displaystyle\leq\mathrm{wt}_{\bf k}({\bf y})$	(2)
	$\displaystyle<k_{1}+\mathrm{wt}_{\bf k}({\bf y})+L^{(0)}_{\bf k}(i,{\bf y})$	(3)
	$\displaystyle\leq k_{n}.$	(4)

Proof.

The inequality (1) follows from that ${\bf k}$ is a positive monotonic increasing integer sequence. The inequality (2) follows from Lemma II.10 and $L^{(1)}_{\bf k}(i,{\bf y})\geq 0$ . The inequality (3) follows from $k_{1}>0$ and $L^{(0)}_{\bf k}(i,{\bf y})\geq 0$ .

We show the inequality (4). From Lemma II.9 and Lemma II.10, the equation on the first line below follows.

	$\displaystyle k_{1}+\mathrm{wt}_{\bf k}({\bf y})+L^{(0)}_{\bf k}(i,{\bf y})$	$\displaystyle=k_{i}+R^{(1)}_{\bf k}(i,{\bf y})$
		$\displaystyle=k_{i}+\sum_{j=i}^{n-1}y_{j}(k_{j+1}-k_{j})$
		$\displaystyle\leq k_{i}+\sum_{j=i}^{n-1}(k_{j+1}-k_{j})$
		$\displaystyle=k_{n}.$

∎

Lemma II.12.

r=\begin{cases}R^{(1)}_{\bf k}(i,{\bf y})&(x_{i}=0),\\ k_{1}+\mathrm{wt}_{\bf k}({\bf y})+L^{(0)}_{\bf k}(i,{\bf y})&(x_{i}=1).\end{cases}

Proof.

The value of $r$ is obtained by using Lemma II.9, Lemma II.10 and Lemma II.11. It follows from the definitions of $r$ and $a$ .

	$\displaystyle r$	$\displaystyle\equiv a-\rho_{{\bf k}}({\bf y})\pmod{m}$
		$\displaystyle=a-\sum_{i=1}^{n-1}k_{i}y_{i}$
		$\displaystyle\equiv\sum_{i=1}^{n}k_{i}x_{i}-\sum_{i=1}^{n-1}k_{i}y_{i}\pmod{m}.$

By the assumption ${\bf y}=\mathrm{del}_{i}({\bf x})$ , the following holds.

{\bf x}=\mathrm{ins}_{i,x_{i}}({\bf y})=y_{1}\cdots y_{i-1}x_{i}y_{i}\cdots y_{n-1}.

Therefore,

	$\displaystyle\sum_{i=1}^{n}k_{i}x_{i}-\sum_{i=1}^{n-1}k_{i}y_{i}$
	$\displaystyle=k_{1}y_{1}+\cdots+k_{i-1}y_{i-1}+k_{i}x_{i}+k_{i+1}y_{i}\cdots+k_{n}y_{n-1}$
	$\displaystyle-k_{1}y_{1}+\cdots+k_{i-1}y_{i-1}~{}~{}~{}~{}~{}~{}~{}~{}~{}+k_{i}y_{i}\cdots+k_{n-1}y_{n-1}$
	$\displaystyle=k_{i}x_{i}+\sum_{j=i}^{n-1}y_{j}(k_{j+1}-k_{j})$
	$\displaystyle=k_{i}x_{i}+R^{(1)}_{\bf k}(i,{\bf y})$
	$\displaystyle=\begin{cases}R^{(1)}_{\bf k}(i,{\bf y})&(x_{i}=0),\\ k_{i}+R^{(1)}_{\bf k}(i,{\bf y})&(x_{i}=1)\\ \end{cases}$
	$\displaystyle=\begin{cases}R^{(1)}_{\bf k}(i,{\bf y})&(x_{i}=0),\\ k_{1}+\mathrm{wt}_{\bf k}({\bf y})+L^{(0)}_{\bf k}(i,{\bf y})&(x_{i}=1).\end{cases}$

The last follows from Lemma II.9 and Lemma II.10. We show that the equality holds in the cases of $x_{i}=0$ and $x_{i}=1$ .

Case $x_{i}=0$ : we have shown

r\equiv R^{(1)}_{\bf k}(i,{\bf y})\pmod{m}.

On the other hand, the following inequalities hold by Lemma II.11.

0\leq R^{(1)}_{\bf k}(i,{\bf y})<k_{n}<m.

By the definition of $r$ ,

0\leq r<m.

This implies $r=R^{(1)}_{\bf k}(i,{\bf y})$ .

Case $x_{i}=1$ : We have shown

r\equiv k_{1}+\mathrm{wt}_{\bf k}({\bf y})+L^{(0)}_{\bf k}(i,{\bf y})\pmod{m}.

On the other hand, the following inequalities hold by Lemma II.11.

0\leq k_{1}+\mathrm{wt}_{\bf k}({\bf y})+L^{(0)}_{\bf k}(i,{\bf y})\leq k_{n}<m.

By the definition of $r$ ,

0\leq r<m.

This implies $r=k_{1}+\mathrm{wt}_{\bf k}({\bf y})+L^{(0)}_{\bf k}(i,{\bf y})$ .

Therefore, Lemma II.12 holds. ∎

proof of 2 of Theorem II.4.

Let us focus on the step 23 of Algorithm 1. The case of $r\leq w$ and the case of $r>w$ are shown separately.

In the case of $r\leq w$ , we have $\mathrm{Dec}_{M}({\bf y})=\mathrm{ins}_{p,0}({\bf y})$ . We show $\mathrm{ins}_{p,0}({\bf y})={\bf x}$ . First, we show that $r=R^{(1)}_{\bf k}(i,{\bf y})$ holds. This is shown by contradiction. From Lemma II.12, either

r=R^{(1)}_{\bf k}(i,{\bf y})\quad\text{or}

r=k_{1}+\mathrm{wt}_{\bf k}({\bf y})+L^{(0)}_{\bf k}(i,{\bf y})

holds. Assume that $r=k_{1}+\mathrm{wt}_{\bf k}({\bf y})+L^{(0)}_{\bf k}(i,{\bf y})$ holds. Lemma II.11 implies $r>w=\mathrm{wt}_{\bf k}({\bf y})$ , which contradicts to $r\leq w$ . Thus, $r=R^{(1)}_{\bf k}(i,{\bf y})$ holds.

Next, we show that the deleted symbol $x_{i}$ is equal to $0$ . This is also shown by contradiction. Since $x_{i}\in\mathbb{B}$ , either $x_{i}=0$ or $x_{i}=1$ holds. If $x_{i}=1$ holds, Lemma II.12 implies

r=k_{1}+\mathrm{wt}_{\bf k}({\bf y})+L^{(0)}_{\bf k}(i,{\bf y}),

which contradicts to

r=R^{(1)}_{\bf k}(i,{\bf y}).

Therefore, $x_{i}=0$ .

Finally, we show that $\mathrm{ins}_{p,0}({\bf y})={\bf x}$ . We showed the deleted symbol $x_{i}$ is equal to $0$ . Since $x_{i}=0$ and ${\bf y}=\mathrm{del}_{i}({\bf x})$ , ${\bf x}=\mathrm{ins}_{i,0}({\bf y})$ holds. Therefore, it suffices to prove that $\mathrm{ins}_{p,0}({\bf y})=\mathrm{ins}_{i,0}({\bf y})$ . Furthermore, we will show

0=y_{i}=y_{i+1}=\cdots=y_{p-1}.

Since $r=R^{(1)}_{\bf k}(i,{\bf y})$ , then

i\in\{j\in[n]\mid R^{(1)}_{\bf k}(j,{\bf y})=r\}

holds. Since

p\in\{j\in[n]\mid R^{(1)}_{\bf k}(j,{\bf y})=r\},

then

\displaystyle r=R^{(1)}_{\bf k}(i,{\bf y})=R^{(1)}_{\bf k}(p,{\bf y})

holds. Therefore,

\displaystyle\sum_{j=i}^{n-1}y_{j}(k_{j+1}-k_{j})=\sum_{j=p}^{n-1}y_{j}(k_{j+1}-k_{j})

holds. In the case of $r\leq w$ , $i\leq p$ follows from the definition of $p$ . Since $i\leq p$ , we have

	$\displaystyle 0$	$\displaystyle=\sum_{j=i}^{n-1}y_{j}(k_{j+1}-k_{j})-\sum_{j=p}^{n-1}y_{j}(k_{j+1}-k_{j})$
		$\displaystyle=\sum_{j=i}^{p-1}y_{j}(k_{j+1}-k_{j}).$

Since $(k_{j+1}-k_{j})>0$ , we have

0=y_{i}=y_{i+1}=\cdots=y_{p-1}.

By a similar argument, we can prove in the remaining case $r>w$ . ∎

II-C Proof for single reversal error-correction

To prove 4 of Theorem II.4, we introduce the following Lemma II.14. From now on, till the end of this subsection, we assume the following.

Hypothesis II.13.

A binary sequence ${\bf x}$ is a codeword of $M_{a,m,{\bf k}}(n)$ with $m\geq 2k_{n}$ . Set ${\bf y}\coloneqq\mathrm{rev}_{i}({\bf x})$ for a fixed $i$ . $r$ is the value at the step 10 of Algorithm 1.

Lemma II.14.

r=\begin{cases}k_{i}&(y_{i}=0),\\ m-k_{i}&(y_{i}=1)\end{cases}

and $r\neq 0$ .

Proof.

It follows from the definitions of $r$ and $a$ .

	$\displaystyle r$	$\displaystyle\equiv a-\rho_{{\bf k}}({\bf y})\pmod{m}$
		$\displaystyle=a-\sum_{i=1}^{n}k_{i}y_{i}$
		$\displaystyle\equiv\sum_{i=1}^{n}k_{i}x_{i}-\sum_{i=1}^{n}k_{i}y_{i}\pmod{m}.$

By the assumption ${\bf y}=\mathrm{rev}_{i}({\bf x})$ , the following holds.

{\bf x}=\mathrm{rev}_{i}({\bf y})=y_{1}\cdots y_{i-1}\overline{y}_{i}y_{i+1}\cdots y_{n}.

Therefore, we have

	$\displaystyle\sum_{i=1}^{n}k_{i}x_{i}-\sum_{i=1}^{n}k_{i}y_{i}$
	$\displaystyle=k_{1}y_{1}+\cdots+k_{i-1}y_{i-1}+k_{i}\overline{y}_{i}+k_{i+1}y_{i+1}\cdots+k_{n}y_{n}$
	$\displaystyle-k_{1}y_{1}+\cdots+k_{i-1}y_{i-1}+k_{i}y_{i}+k_{i+1}y_{i+1}\cdots+k_{n}y_{n}$
	$\displaystyle=k_{i}(\overline{y}_{i}-y_{i})$
	$\displaystyle=\begin{cases}k_{i}&(y_{i}=0),\\ -k_{i}&(y_{i}=1)\\ \end{cases}$
	$\displaystyle\equiv\begin{cases}k_{i}&(y_{i}=0),\\ m-k_{i}&(y_{i}=1).\\ \end{cases}$

We show that the equality holds in the cases of $y_{i}=0$ and $y_{i}=1$ . Since the sequence k is a positive monotonic increasing integer sequence with $m\geq 2k_{n}$ , we have $0<k_{i}\leq k_{n}<m$ and $0<m-k_{n}\leq m-k_{i}<m$ . Therefore,

r=\begin{cases}k_{i}&(y_{i}=0),\\ m-k_{i}&(y_{i}=1)\\ \end{cases}\\

and $r\neq 0$ . ∎

proof of of 4 of Theorem II.4.

Lemma II.14 implies $r\neq 0$ . Therefore, $\mathrm{Dec}_{M}({\bf y})\neq{\bf y}$ . Let us focus on the step 14 of Algorithm 1. Lemma II.14 implies $r=k_{i}$ or $r=m-k_{i}$ . Whichever $r=k_{i}$ or $r=m-k_{i}$ ,

	$\displaystyle\min\{r,m-r\}$	$\displaystyle=\min\{k_{i},m-k_{i}\}$
		$\displaystyle=k_{i}$

holds, since m $\geq 2k_{i}$ . Furthermore, for distinct indices $j_{1}$ and $j_{2}\in[n]$ , $k_{j_{1}}\neq k_{j_{2}}$ holds, since ${\bf k}$ is a positive monotonic increasing integer sequence. Therefore, we have

	$\displaystyle p$	$\displaystyle=\min\{j\in[n]\mid k_{j}=\min\{r,m-r\}\}$
		$\displaystyle=i$

holds. Thus,

	$\displaystyle\mathrm{rev}_{p}({\bf y})$	$\displaystyle=\mathrm{rev}_{i}({\bf y})$
		$\displaystyle={\bf x}$
		$\displaystyle\in M_{a,m,{\bf k}}(n).$

Therefore, $\mathrm{Dec}_{M}({\bf y})=\mathrm{rev}_{p}({\bf y})={\bf x}$ . ∎

III Azinv Code and its Decoding Algorithm

In this section, we provide an algorithm to make azinv codes single BAD/BAR error-correctable (Algorithm 2). Errors treated in this section are BAD errors and BAR errors, which are defined below.

Definition III.1 (BAD and BAR).

For an integer $n\geq 2$ and $i\in[n-1]$ , define a partial map $\mathrm{BD}_{i}:\mathbb{B}^{n}\rightarrow\mathbb{B}^{n-2}$ as

\mathrm{BD}_{i}(x_{1}x_{2}\cdots x_{n})\coloneqq x_{1}\cdots x_{i-1}x_{i+2}\cdots x_{n}

only for ${\bf x}$ with $x_{i}\neq x_{i+1}$ . We call the partial map $\mathrm{BD}_{i}$ balanced adjacent deletion (BAD).

Define a partial map $\mathrm{BR}_{i}:\mathbb{B}^{n}\rightarrow\mathbb{B}^{n}$ as

\mathrm{BR}_{i}(x_{1}x_{2}\cdots x_{n})\coloneqq x_{1}\cdots x_{i-1}\overline{x}_{i}\overline{x}_{i+1}\cdots x_{n}

only for ${\bf x}$ with $x_{i}\neq x_{i+1}$ . We call the partial map $\mathrm{BR}_{i}$ balanced adjacent reversal (BAR).

The following codes, azinv codes, are known as single BAD error-correcting codes [9]. However, no decoding algorithm has been studied.

Definition III.2 (Azinv code [9]).

For integers $n\geq 2$ and $m\geq 2$ and an integer $a$ , define a set $A_{a,m}(n)$ as

\displaystyle A_{a,m}(n)\coloneqq\{{\bf x}\in\mathbb{B}^{n}\mid\tau({\bf x})\equiv a\pmod{m},{\bf x}\neq{\bf 0},{\bf 1}\}

with $m\geq n$ , where ${\bf 0}$ (resp. ${\bf 1}$ ) is the all zero (resp. one) word, the function $\tau$ is the composition of the function $\mathrm{inv}$ below and the permutation $\sigma^{-1}$ below i.e., $\tau\coloneqq\mathrm{inv}\circ\sigma^{-1}$ .

The function $\mathrm{inv}$ is a map from a binary word to a non-negative integer and is defined as

\mathrm{inv}(x_{1}x_{2}\cdots x_{n})\coloneqq\#\{(i,j)\mid 1\leq i<j\leq n,x_{i}>x_{j}\}.

The value $\mathrm{inv}({\bf x})$ is called the inversion number of ${\bf x}$ .

The permutation $\sigma$ is defined as

$\sigma(x_{1}x_{2}\cdots x_{n})\coloneqq$

\begin{cases}x_{1}x_{n}x_{2}x_{n-1}x_{3}\cdots x_{\frac{n+4}{2}}x_{\frac{n}{2}}x_{\frac{n+2}{2}}&(n:\text{even}),\\ x_{1}x_{n}x_{2}x_{n-1}x_{3}\cdots x_{\frac{n-1}{2}}x_{\frac{n+3}{2}}x_{\frac{n+1}{2}}&(\text{otherwise}).\end{cases}

We call $A_{a,m}(n)$ an azinv code.

The following is one of the main contributions of this paper.

Theorem III.3.

Let $A_{a,m}(n)$ be an azinv code and ${\bf x}$ be a codeword of $A_{a,m}(n)$ . Let $\mathrm{Dec}_{A}$ denote Algorithm 2.

1. $\mathrm{Dec}_{A}({\bf x})={\bf x}$ .

2. Assume $n\leq m$ . For any single BAD $\mathrm{del_{BA}}$ , $\mathrm{Dec}_{A}\circ\mathrm{del_{BA}}({\bf x})={\bf x}$ .

3. If $n\leq m$ , any azinv code $A_{a,m}(n)$ is a single BAD error-correcting code with Algorithm 2 as a decoding algorithm.

4. Assume $2(n-1)\leq m$ . For any single BAR $\mathrm{rev_{BA}}$ , $\mathrm{Dec}_{A}\circ\mathrm{rev_{BA}}({\bf x})={\bf x}$ .

5. If $2(n-1)\leq m$ , any monotone code $A_{a,m}(n)$ is a single BAD/BAR error-correcting code with Algorithm 2 as a decoding algorithm. Here $\circ$ denotes the map composition.

Proof.

1. It is trivial from the steps 3 and 4 of Algorithm 2. 2. A proof is provided in subsection III-B. 3. It is a corollary of 1. and 2. 4. A proof is provided in subsection III-C. 5. It is a corollary of 3. and 4. ∎

III-A Decoding algorithm for single BAD errors

In this subsection, we provide the details of Algorithm 2. Curiously, Algorithm 2 is similar to Algorithm 1 for correcting single deletion/reversal errors for monotone codes.

We introduce the following notation. For a positive integer $n$ and a binary sequence ${\bf y}=y_{1}y_{2}\cdots y_{n}$ , define ${\bf\tilde{y}}$ as

{\bf\tilde{y}}\coloneqq\begin{cases}y_{1}\overline{y}_{2}y_{3}\cdots y_{n-1}\overline{y}_{n}&(n:\text{even}),\\ y_{1}\overline{y}_{2}y_{3}\cdots\overline{y}_{n-1}y_{n}&(\text{otherwise}).\end{cases}

For the $i$ th entry of ${\bf y}$ , say $y_{i}$ , define $\tilde{y}_{i}$ as

\tilde{y}_{i}\coloneqq\begin{cases}\overline{y}_{i}&(i:\text{even}),\\ y_{i}&(\text{otherwise}).\end{cases}

For integers $i$ and $j$ , define $[i,j]$ as $[i,j]\coloneqq\{k\in\mathbb{Z}|i\leq k\leq j\}$ . We denote the subsequence of ${\bf y}$ in the range $[i,j]$ , by ${\bf y}_{[i,j]}$ , i.e., ${\bf y}_{[i,j]}\coloneqq y_{i}y_{i+1}\cdots y_{j}$ . By using the range notation, the permutation $\sigma^{-1}$ can be written in the following form.

Remark III.4.

$\sigma^{-1}(y_{1}y_{2}\cdots y_{n})=y_{1}\sigma^{-1}({\bf y}_{[3,n]})y_{2}$ .

Algorithm 2 Decoding algorithm for single BAD/BAR errors

1: Input:

a\in\mathbb{Z}_{\geq 0},n

and

m\in\mathbb{Z}_{\geq 2}

, and

{\bf y}\in\bigcup_{t\geq 0}\mathbb{B}^{t}

. Output:

{\bf z}\in\mathbb{B}^{n}

or a symbol ?.

2: Compute the length of

{\bf y}

, say

|{\bf y}|

3: if

|{\bf y}|=n

then

4: Go to 10.

5: else if

|{\bf y}|=n-2

then

6: go to 21.

7: else

{\bf z}\coloneqq?

. Go to 43.

9: end if

10: Compute

r\coloneqq\min\{s\in\mathbb{Z}_{\geq 0}\mid s\equiv a-\tau({\bf y})\pmod{m}\}

11: if

r=0

then

12:

{\bf z}\coloneqq{\bf y}

. Go to 43.

13: else

14: Compute

p\coloneqq n-\min\{r,m-r\}

15: if

\mathrm{rev_{BA}}_{p}({\bf y})\in A_{a,m}(n)

then

16:

{\bf z}\coloneqq\mathrm{rev_{BA}}_{p}({\bf y})

. Go to 43.

17: else

18:

{\bf z}\coloneqq?

. Go to 43.

19: end if

20: end if

21: Compute

r\coloneqq\min\{s\in\mathbb{Z}_{\geq 0}\mid s\equiv a-\tau({\bf y})\pmod{m}\}

22: Compute

w\coloneqq\mathrm{wt}({\bf\tilde{y}})

23: if

r\leq w

then

24: Compute

p\coloneqq\max\{j\in[n-1]\mid L^{(1)}(j,{\bf\tilde{y}})=r)\}

25: if

p

: even then

26:

b\coloneqq 10

27: else

28:

b\coloneqq 01

29: end if

30: else

31: Compute

p\coloneqq\min\{j\in[n-1]\mid R^{(0)}(j,{\bf\tilde{y}})=r-w-1)\}

32: if

p

: even then

33:

b\coloneqq 01

34: else

35:

b\coloneqq 10

36: end if

37: end if

38: if

\mathrm{ins}_{p,{\bf b}}({\bf y})\in A_{a,m}(n)

then

39:

{\bf z}\coloneqq\mathrm{ins}_{p,{\bf b}}({\bf y})

40: else

41:

{\bf z}\coloneqq?

42: end if

43: Output z.

Before we move to proofs for Theorem III.3, we provide examples of Algorithm 2 for each error.

Example III.5 (Decoding for a single BAD error).

Let $n=5,a=0,m=5,$ and ${\bf y}=101$ . Note that the azinv code has six words,

C_{0,5}(5)=\{01000,01010,01010,01011,01111,10110,10001\}.

•

Since $|{\bf y}|=3$ , we go to step 6 and then to step 21.
•

Algorithm 2 computes $r$ and $w$ , then we obtain $r=3$ and $w=3.$
•

Since $r\leq w$ , then

$\displaystyle p$ $\displaystyle=\max\{j\in[4]\mid L^{(1)}(j,{\bf\tilde{y}})=3)\}$

$\displaystyle=4.$
•

Since $p=4$ is even, then $b=10$ .
•

The output $\mathrm{Dec}_{A}({\bf y})=\mathrm{ins}_{p,b}({\bf y})$ is

$\mathrm{ins}_{4,10}(101)=10110.$

Example III.6 (Decoding for a single BAR error).

Let $n=6,a=0,m=10,$ and ${\bf y}=100000$ . Note that the azinv code has five words,

C_{0,10}(6)=\{010000,010100,010101,010111,011111\}.

•

Since $|{\bf y}|=6$ , we go to step 4 and then to step 10.
•

Algorithm 2 computes $r$ , then $r=5$ .
•

Since $r\neq 0$ ,

$\displaystyle p$ $\displaystyle=6-\min\{5,5\}$

$\displaystyle=1.$
•

The output $\mathrm{Dec}_{A}({\bf y})=\mathrm{rev_{BA}}_{p}({\bf y})$ is

$\mathrm{BR}_{1}(100000)=010000.$

III-B Proof for single BAD error-correction

To prove 2 of Theorem III.3, we introduce the following three Lemmas III.8, III.9 and III.10 From now on, till the end of this subsection, we assume the following.

Hypothesis III.7.

A binary sequence ${\bf x}$ is a codeword of $A_{a,m}(n)$ . Set ${\bf y}\coloneqq\mathrm{del_{BA}}_{i}({\bf x})$ for a fixed $i$ . $r$ is the value at the step 21 of Algorithm 2.

Lemma III.8.

The following four inequalities hold.

$\displaystyle 0$	$\displaystyle\leq L^{(1)}(i,{\bf y})$	(1)
	$\displaystyle\leq\mathrm{wt}({\bf y})$	(2)
	$\displaystyle<1+\mathrm{wt}({\bf y})+R^{(0)}(i,{\bf y})$	(3)
	$\displaystyle<n.$	(4)

Proof.

The inequality (1) follows from the definition of $L^{(1)}(i,{\bf y})$ . The inequality (2) follows from Lemma II.10 and $R^{(1)}(i,{\bf y})\geq 0$ . The inequality (3) follows from $R^{(0)}(i,{\bf y})\geq 0$ . We show the inequality (4).

	$\displaystyle 1+\mathrm{wt}({\bf y})+R^{(0)}(i,{\bf y})$	$\displaystyle=1+R^{(1)}(1,{\bf y})+R^{(0)}(i,{\bf y})$
		$\displaystyle\leq 1+R^{(1)}(1,{\bf y})+R^{(0)}(1,{\bf y})$
		$\displaystyle=n-1$
		$\displaystyle<n.$

∎

Lemma III.9.

Either $\tilde{x}_{i}\tilde{x}_{i+1}=00$ or $\tilde{x}_{i}\tilde{x}_{i+1}=11$ holds.

Proof.

By the assumption ${\bf y}=\mathrm{del_{BA}}_{i}({\bf x})$ , $x_{i}\neq x_{i+1}$ holds. Thus, either $x_{i}x_{i+1}=01$ or $x_{i}x_{i+1}=10$ holds. Therefofre, whichever $i$ is odd or even, either $\tilde{x}_{i}\tilde{x}_{i+1}=00$ or $\tilde{x}_{i}\tilde{x}_{i+1}=11$ holds. ∎

Lemma III.10.

r=\begin{cases}L^{(1)}(i,{\bf\tilde{y}})&(\tilde{x}_{i}\tilde{x}_{i+1}=00),\\ 1+\mathrm{wt}({\bf\tilde{y}})+R^{(0)}(i,{\bf\tilde{y}})&(\tilde{x}_{i}\tilde{x}_{i+1}=11).\end{cases}

Proof.

It follows from the definitions of $r$ and $a$ .

	$\displaystyle r$	$\displaystyle\equiv a-\tau({\bf y})\pmod{m}$
		$\displaystyle=a-\mathrm{inv}(\sigma^{-1}({\bf y}))$
		$\displaystyle\equiv\mathrm{inv}(\sigma^{-1}({\bf x}))-\mathrm{inv}(\sigma^{-1}({\bf y}))\pmod{m}$

By the assumption ${\bf y}=\mathrm{BAD}_{i}({\bf x})$ , the following holds.

{\bf x}=\mathrm{ins}_{i,x_{i}x_{i+1}}({\bf y})=y_{1}y_{2}\cdots y_{i-1}x_{i}x_{i+1}y_{i}y_{i+1}\cdots y_{n-2}.

Therefore, we have

	$\displaystyle\mathrm{inv}(\sigma^{-1}({\bf x}))-\mathrm{inv}(\sigma^{-1}({\bf y}))$
	$\displaystyle=\mathrm{inv}(\sigma^{-1}(\mathrm{ins}_{i,x_{i}x_{i+1}}({\bf y})))-\mathrm{inv}(\sigma^{-1}({\bf y}))$
	$\displaystyle=\mathrm{inv}(\sigma^{-1}(y_{1}y_{2}\cdots y_{i-1}x_{i}x_{i+1}y_{i}y_{i+1}\cdots y_{n-2}))$
	$\displaystyle-\mathrm{inv}(\sigma^{-1}(y_{1}y_{2}\cdots y_{i-1}\quad\quad\quad y_{i}y_{i+1}\cdots y_{n-2}))$

\displaystyle=\left\{\begin{array}[]{cc}\mathrm{inv}(y_{1}y_{3}\cdots y_{i-2}x_{i}\sigma^{-1}({\bf y}_{[i,n-2]})x_{i+1}y_{i-1}\cdots y_{4}y_{2})\\ \\ -\mathrm{inv}(y_{1}y_{3}\cdots y_{i-2}\quad\sigma^{-1}({\bf y}_{[i,n-2]})\quad y_{i-1}\cdots y_{4}y_{2})\\ (i:\text{odd}),\\ \\ \\ \mathrm{inv}(y_{1}y_{3}\cdots y_{i-1}x_{i+1}\sigma^{-1}({\bf y}_{[i+1,n-2]})y_{i}x_{i}\cdots y_{4}y_{2})\\ \\ -\mathrm{inv}(y_{1}y_{3}\cdots y_{i-1}\quad\sigma^{-1}({\bf y}_{[i+1,n-2]})y_{i}\quad\cdots y_{4}y_{2})\\ (i:\text{even})\end{array}\right.

\displaystyle=\left\{\begin{array}[]{cc}\mathrm{inv}(y_{1}y_{3}\cdots y_{i-2}\quad 0\sigma^{-1}({\bf y}_{[i,n-2]})1\quad y_{i-1}\cdots y_{4}y_{2})\\ \\ -\mathrm{inv}(y_{1}y_{3}\cdots y_{i-2}\quad\sigma^{-1}({\bf y}_{[i,n-2]})\quad y_{i-1}\cdots y_{4}y_{2})\\ (i:\text{odd},x_{i}x_{i+1}=01),\\ \\ \\ \mathrm{inv}(y_{1}y_{3}\cdots y_{i-1}\quad 0\sigma^{-1}({\bf y}_{[i+1,n-2]})y_{i}1\quad\cdots y_{4}y_{2})\\ \\ -\mathrm{inv}(y_{1}y_{3}\cdots y_{i-1}\quad\sigma^{-1}({\bf y}_{[i+1,n-2]})y_{i}\quad\cdots y_{4}y_{2})\\ (i:\text{even},x_{i}x_{i+1}=10),\\ \\ \\ \mathrm{inv}(y_{1}y_{3}\cdots y_{i-2}\quad 1\sigma^{-1}({\bf y}_{[i,n-2]})0\quad y_{i-1}\cdots y_{4}y_{2})\\ \\ -\mathrm{inv}(y_{1}y_{3}\cdots y_{i-2}\quad\sigma^{-1}({\bf y}_{[i,n-2]})\quad y_{i-1}\cdots y_{4}y_{2})\\ (i:\text{odd},x_{i}x_{i+1}=10),\\ \\ \\ \mathrm{inv}(y_{1}y_{3}\cdots y_{i-1}\quad 1\sigma^{-1}({\bf y}_{[i+1,n-2]})y_{i}0\quad\cdots y_{4}y_{2})\\ \\ -\mathrm{inv}(y_{1}y_{3}\cdots y_{i-1}\quad\sigma^{-1}({\bf y}_{[i+1,n-2]})y_{i}\quad\cdots y_{4}y_{2})\\ (i:\text{even},x_{i}x_{i+1}=01)\end{array}\right.

\displaystyle=\left\{\begin{array}[]{ll}\displaystyle\sum_{j=1,j:\text{odd}}^{i-2}y_{j}+\sum_{j=2,j:\text{even}}^{i-1}\overline{y}_{j}\\ (i:\text{odd},x_{i}x_{i+1}=01),\\ \\ \displaystyle\sum_{j=1,j:\text{odd}}^{i-1}y_{j}+\sum_{j=2,j:\text{even}}^{i-2}\overline{y}_{j}\\ (i:\text{even},x_{i}x_{i+1}=10),\\ \\ \displaystyle\sum_{j=1,j:\text{odd}}^{i-2}y_{j}+\sum_{j=2,j:\text{even}}^{i-1}\overline{y}_{j}+(n-2-(i-1))+1\\ (i:\text{odd},x_{i}x_{i+1}=10),\\ \\ \displaystyle\sum_{j=1,j:\text{odd}}^{i-1}y_{j}+\sum_{j=2,j:\text{even}}^{i-2}\overline{y}_{j}+(n-2-(i-1))+1\\ (i:\text{even},x_{i}x_{i+1}=01)\\ \end{array}\right.

	$\displaystyle=\left\{\begin{array}[]{cc}\displaystyle\sum_{j=1}^{i-1}\tilde{y}_{j}\\ (i:\text{odd},x_{i}x_{i+1}=01\quad\text{or}\quad i:\text{even},x_{i}x_{i+1}=10),\\ \\ \displaystyle\sum_{j=1}^{i-1}\tilde{y}_{j}+\sum_{j=i}^{n-2}(\tilde{y}_{j}+\overline{\tilde{y}}_{j})+1\\ (i:\text{odd},x_{i}x_{i+1}=10\quad\text{or}\quad i:\text{even},x_{i}x_{i+1}=01)\end{array}\right.$
	$\displaystyle=\left\{\begin{array}[]{cc}L^{(1)}(i,{\bf\tilde{y}})\\ (\tilde{x}_{i}\tilde{x}_{i+1}=00),\\ \\ L^{(1)}(i,{\bf\tilde{y}})+R^{(1)}(i,{\bf\tilde{y}})+R^{(0)}(i,{\bf\tilde{y}})+1\\ (\tilde{x}_{i}\tilde{x}_{i+1}=11)\end{array}\right.$
	$\displaystyle=\left\{\begin{array}[]{cc}L^{(1)}(i,{\bf\tilde{y}})&(\tilde{x}_{i}\tilde{x}_{i+1}=00),\\ 1+\mathrm{wt}({\bf\tilde{y}})+R^{(0)}(i,{\bf\tilde{y}})&(\tilde{x}_{i}\tilde{x}_{i+1}=11).\end{array}\right.$

The last follows from Lemma II.10. We show that the equality holds in the cases of $\tilde{x}_{i}\tilde{x}_{i+1}=00$ and $\tilde{x}_{i}\tilde{x}_{i+1}=11$ .

Case $\tilde{x}_{i}\tilde{x}_{i+1}=00$ : we have shown

r\equiv L^{(1)}(i,{\bf\tilde{y}})\pmod{m}.

On the other hand, the following inequalities hold by Lemma III.8.

0\leq L^{(1)}(i,{\bf\tilde{y}})<n<m.

By the definition of $r$ ,

0\leq r<m.

This implies $r=L^{(1)}(i,{\bf\tilde{y}})$ .

Case $\tilde{x}_{i}\tilde{x}_{i+1}=11$ : We have shown

r\equiv 1+\mathrm{wt}({\bf\tilde{y}})+R^{(0)}(i,{\bf\tilde{y}})\pmod{m}.

On the other hand, the following inequalities hold by Lemma III.8.

0\leq 1+\mathrm{wt}({\bf\tilde{y}})+R^{(0)}(i,{\bf\tilde{y}})\leq n<m.

By the definition of $r$ ,

0\leq r<m.

This implies $r=1+\mathrm{wt}({\bf\tilde{y}})+R^{(0)}(i,{\bf\tilde{y}})$ . Therefore, Lemma III.10 holds. ∎

proof of 2 of Theorem III.3.

Let us focus on the step 23 of Algorithm 2. The case of $r\leq w$ and the case of $r>w$ are shown separately.

In the case of $r\leq w$ , we have $\mathrm{Dec}_{A}({\bf y})=\mathrm{ins}_{p,b}({\bf y})$ . We show $\mathrm{ins}_{p,b}({\bf y})={\bf x}$ . First, we can show that $r=L^{(1)}(i,{\bf\tilde{y}})$ holds, similarly to the case of Monotone codes.

Next, we show that the deleted symbols $x_{i}x_{i+1}$ satisfy $\tilde{x}_{i}\tilde{x}_{i+1}=00$ . This is shown by contradiction. Since $x_{i}x_{i+1}\in\{01,10\}$ , either $\tilde{x}_{i}\tilde{x}_{i+1}=00$ or $\tilde{x}_{i}\tilde{x}_{i+1}=11$ holds. Assume that $\tilde{x}_{i}\tilde{x}_{i+1}=11$ holds. Lemma III.10 implies

r=1+\mathrm{wt}({\bf\tilde{y}})+R^{(0)}(i,{\bf\tilde{y}}),

which contradicts to

r=L^{(1)}(i,{\bf\tilde{y}}).

Therefore, $\tilde{x}_{i}\tilde{x}_{i+1}=00$ .

Finally, we show that $\mathrm{ins}_{p,b}({\bf y})={\bf x}$ . We showed that the deleted symbols $x_{i}x_{i+1}$ satisfy $\tilde{x}_{i}\tilde{x}_{i+1}=00$ . By the assumption ${\bf y}=\mathrm{del_{BA}}_{i}({\bf x})$ , we have ${\bf x}=\mathrm{ins}_{i,x_{i}x_{i+1}}({\bf y})$ . Therefore, it suffices to prove that $\mathrm{ins}_{p,b}({\bf y})=\mathrm{ins}_{i,x_{i}x_{i+1}}({\bf y})$ . Furthermore we will show

0=\tilde{y}_{i}=\tilde{y}_{i+1}=\cdots=\tilde{y}_{p-1}.

Since $r=L^{(1)}(i,{\bf\tilde{y}})$ , then

i\in\{j\in[n-1]\mid L^{(1)}(j,{\bf\tilde{y}})=r)\}

holds. Since

p\in\{j\in[n-1]\mid L^{(1)}(j,{\bf\tilde{y}})=r)\},

then

\displaystyle r=L^{(1)}(i,{\bf\tilde{y}})=L^{(1)}(p,{\bf\tilde{y}})

holds. Therefore,

\displaystyle\sum_{j=1}^{i-1}\tilde{y}_{j}=\sum_{j=1}^{p-1}\tilde{y}_{j}

holds. In the case of $r\leq w$ , $i\leq p$ follows from the definition of $p$ . Since $i\leq p$ , then

	$\displaystyle 0$	$\displaystyle=\sum_{j=1}^{p-1}\tilde{y}_{j}-\sum_{j=1}^{i-1}\tilde{y}_{j}$
		$\displaystyle=\sum_{j=i}^{p-1}\tilde{y}_{j}$

holds. Therefore, we have

0=\tilde{y}_{i}=\tilde{y}_{i+1}=\cdots=\tilde{y}_{p-1}.

By a similar argument, we can prove in the remaining case $r>w$ . ∎

III-C Proof for single BAR error-correction

To prove 4 of Theorem III.3, we introduce the following two Lemmas III.12 and III.13. From now on, till the end of this subsection, we assume the following.

Hypothesis III.11.

A binary sequence ${\bf x}$ is a codeword of $A_{a,m}(n)$ with $m\geq 2(n-1)$ . Set ${\bf y}\coloneqq\mathrm{rev_{BA}}_{i}({\bf x})$ for a fixed $i$ . $r$ is the value at the step 10 of Algorithm 2.

Lemma III.12.

Either $\tilde{y}_{i}\tilde{y}_{i+1}=00$ or $\tilde{y}_{i}\tilde{y}_{i+1}=11$ holds.

Proof.

By the assumption ${\bf y}=\mathrm{rev_{BA}}_{i}({\bf x})$ , $x_{i}\neq x_{i+1}$ holds. Hence, $y_{i}\neq y_{i+1}$ holds. Thus, either $y_{i}y_{i+1}=01$ or $y_{i}y_{i+1}=10$ holds. Therefofre, whichever $i$ is odd or even, either $\tilde{y}_{i}\tilde{y}_{i+1}=00$ or $\tilde{y}_{i}\tilde{y}_{i+1}=11$ holds. ∎

Lemma III.13.

r=\begin{cases}n-i&(\tilde{y}_{i}\tilde{y}_{i+1}=00),\\ m-(n-i)&(\tilde{y}_{i}\tilde{y}_{i+1}=11)\end{cases}

and $r\neq 0$ .

Proof.

It follows from the definitions of $r$ and $a$ .

	$\displaystyle r$	$\displaystyle\equiv a-\tau({\bf y})\pmod{m}$
		$\displaystyle=a-\mathrm{inv}(\sigma^{-1}({\bf y}))$
		$\displaystyle\equiv\mathrm{inv}(\sigma^{-1}({\bf x}))-\mathrm{inv}(\sigma^{-1}({\bf y}))\pmod{m}$

By the assumption ${\bf y}=\mathrm{rev_{BA}}_{i}({\bf x})$ , the following holds.

{\bf x}=\mathrm{rev_{BA}}_{i}({\bf y})=y_{1}y_{2}\cdots y_{i-1}{y}_{i+1}y_{i}\cdots y_{n}.

Therefore, we have

	$\displaystyle\mathrm{inv}(\sigma^{-1}({\bf x}))-\mathrm{inv}(\sigma^{-1}({\bf y}))$
	$\displaystyle=\mathrm{inv}(\sigma^{-1}(\mathrm{rev_{BA}}_{i}({\bf y})))-\mathrm{inv}(\sigma^{-1}({\bf y}))$
	$\displaystyle=\mathrm{inv}(\sigma^{-1}(y_{1}y_{2}\cdots y_{i-1}y_{i+1}y_{i}\cdots y_{n}))$
	$\displaystyle-\mathrm{inv}(\sigma^{-1}(y_{1}y_{2}\cdots y_{i-1}y_{i}y_{i+1}\cdots y_{n}))$
	$\displaystyle=\left\{\begin{array}[]{cc}\mathrm{inv}(y_{1}y_{3}\cdots y_{i+1}\sigma^{-1}({\bf y}_{[i+2,n]})y_{i}y_{i-1}\cdots y_{4}y_{2})\\ \\ -\mathrm{inv}(y_{1}y_{3}\cdots y_{i}\quad\sigma^{-1}({\bf y}_{[i+2,n]})y_{i+1}y_{i-1}\cdots y_{4}y_{2})\\ (i:\text{odd}),\\ \\ \\ \mathrm{inv}(y_{1}y_{3}\cdots y_{i}\sigma^{-1}({\bf y}_{[i+3,n]})y_{i+2}y_{i+1}\cdots y_{4}y_{2})\\ \\ -\mathrm{inv}(y_{1}y_{3}\cdots y_{i+1}\sigma^{-1}({\bf y}_{[i+3,n]})y_{i+2}y_{i}\cdots y_{4}y_{2})\\ (i:\text{even})\end{array}\right.$
	$\displaystyle=\left\{\begin{array}[]{cc}n-i\\ (i:\text{odd},y_{i}y_{i+1}=01\quad\text{or}\quad i:\text{even},y_{i}y_{i+1}=10),\\ \\ -(n-i)\\ (i:\text{odd},y_{i}y_{i+1}=10\quad\text{or}\quad i:\text{even},y_{i}y_{i+1}=01)\end{array}\right.$
	$\displaystyle\equiv\left\{\begin{array}[]{cc}n-i&(\tilde{y}_{i}\tilde{y}_{i+1}=00),\\ m-(n-i)&(\tilde{y}_{i}\tilde{y}_{i+1}=11).\end{array}\right.$

We show that the equality holds in the case of $\tilde{y}_{i}\tilde{y}_{i+1}=00$ and $\tilde{y}_{i}\tilde{y}_{i+1}=11$ . Since $i\in[n-1]$ , we have $1\leq n-i\leq n-1<m$ . Then, $0<m-(n-i)\leq m-1<m$ . Therefore,

r=\begin{cases}n-i&(\tilde{y}_{i}\tilde{y}_{i+1}=00),\\ m-(n-i)&(\tilde{y}_{i}\tilde{y}_{i+1}=11)\end{cases}

and $r\neq 0$ . ∎

proof 4 of Theorem III.3.

Lemma III.13 implies $r\neq 0$ . Therefore, $\mathrm{Dec}({\bf y})\neq{\bf y}$ . Let us focus on the step 14 of Algorithm 2. Lemma III.13 implies $r=n-i$ or $r=m-(n-i)$ . Whichever $r=n-i$ or $r=m-(n-i)$ ,

	$\displaystyle\min\{r,m-r\}$	$\displaystyle=\min\{n-i,m-(n-i)\}$
		$\displaystyle=n-i$

holds, since $m\geq 2n$ . Then,

	$\displaystyle p$	$\displaystyle=n-\min\{r,m-r\}$
		$\displaystyle=n-(n-i)$
		$\displaystyle=i$

holds. Thus,

	$\displaystyle\mathrm{rev_{BA}}_{p}({\bf y})$	$\displaystyle=\mathrm{rev_{BA}}_{i}({\bf y})$
		$\displaystyle={\bf x}$
		$\displaystyle\in A_{a,m}(n).$

Therefore, $\mathrm{Dec}({\bf y})=\mathrm{rev_{BA}}_{p}({\bf y})={\bf x}$ . ∎

IV Analysis and unification of the algorithms

In this section, we provide a unified view of our proposed algorithms. After that, we prove that the algorithms can be computed in linear time in the code-length.

Flowchart 1 is the unified representation of the parts of Algorithm 1 and Algorithm 2, that is, the steps 21st - 34th of Algorithm 1 and the steps 21st - 42th of Algorithm 2.

Refer to caption — Figure 1: Flowchart 1 for deletion error-correction

The following table summarizes the variables and the functions in Flowchart 1.

	Monotone codes	Azinv codes
the condition of $a$	$\in\mathbb{Z}_{\geq 0}$	$\in\mathbb{Z}_{\geq 0}$
the condition of $m$	$\geq k_{n}+1$	$\geq n$
the condition of ${\bf y}$	$\in\mathbb{B}^{n-1}$	$\in\mathbb{B}^{n-2}$
the condition of ${\bf k}$	$(k_{1},k_{2},\cdots,k_{n})$	$(1,2,\cdots,n)$
$\mathrm{remainder}(a,m,{\bf y},{\bf k})$	$(a-\rho_{\bf k}({\bf y}))\%m$	$(a-\tau({\bf y}))\%m$
$\mathrm{weight}({\bf y},{\bf k})$	$\mathrm{wt}_{\bf k}({\bf y})$	$\mathrm{wt}({\bf\tilde{y}})$
$\mathrm{position1}(r,{\bf y},{\bf k})$	$\max J_{1,1}$	$\max J_{1,2}$
$\mathrm{position2}(r,{\bf y},{\bf k})$	$\min J_{2,1}$	$\min J_{2,2}$
$\mathrm{sequence1}(p)$	$0$	$\begin{cases}10(p:\text{even})\\ 01(p:\text{odd})\end{cases}$
$\mathrm{sequence2}(p)$	$1$	$\begin{cases}01(p:\text{even})\\ 10(p:\text{odd})\end{cases}$
$\mathrm{inserted}(p,b,{\bf y})$	$\mathrm{ins}_{p,{\bf b}}({\bf y})$	$\mathrm{ins}_{p,{\bf b}}({\bf y})$

TABLE I: Variables and Functions in Flowchart 1

Here, $(k_{1},k_{2},\cdots,k_{n})$ is a positive monotonic increasing integer sequence and $J_{1,1}$ , $J_{1,2}$ , $J_{2,1}$ and $J_{2,2}$ are defined as follows.

	$\displaystyle J_{1,1}\coloneqq\{j\in[n]\mid R^{(1)}_{{\bf k}}(j,{\bf y})=r\},$
	$\displaystyle J_{1,2}\coloneqq\{j\in[n-1]\mid L^{(1)}(j,{\bf\tilde{y}})=r\},$
	$\displaystyle J_{2,1}\coloneqq\{j\in[n]\mid L^{(0)}_{{\bf k}}(j,{\bf y})=r-w-k_{1}\},$
	$\displaystyle J_{2,2}\coloneqq\{j\in[n-1]\mid R^{(0)}(j,{\bf\tilde{y}})=r-w-1\}.$

For integers $a$ and $b$ , $a\%b$ denotes the remainder of $a$ divided by $b$ .

Flowchart 2 is the unified representation of the parts of Algorithm 1 and Algorithm 2, that is, the steps 10th - 20th of Algorithm 1 and Algorithm 2.

The following table summarizes the variables and the functions in Flowchart 2, where $(k_{1},k_{2},\cdots,k_{n})$ is a positive monotonic increasing integer sequence and $J$ is defined as follows.

\displaystyle J\coloneqq\{j\in[n]\mid k_{j}=\min\{r,m-r\}\}.

	Monotone codes	Azinv codes
the condition of $a$	$\in\mathbb{Z}_{\geq 0}$	$\in\mathbb{Z}_{\geq 0}$
the condition of $m$	$\geq 2k_{n}$	$\geq 2(n-1)$
the condition of ${\bf y}$	$\in\mathbb{B}^{n}$	$\in\mathbb{B}^{n}$
the condition of ${\bf k}$	$(k_{1},k_{2},\cdots,k_{n})$	$(1,2,\cdots,n)$
$\mathrm{remainder}(a,m,{\bf y},{\bf k})$	$(a-\rho_{\bf k}({\bf y}))\%m$	$(a-\tau({\bf y}))\%m$
$\mathrm{position}(r,{\bf y},{\bf k})$	$\min J$	$n-\min\{r,m-r\}$
$\mathrm{reversed}(p,{\bf y})$	$\mathrm{rev}_{p}({\bf y})$	$\mathrm{rev_{BA}}_{p}({\bf y})$

TABLE II: Variables and Functions in Flowchart 2

We can compute $\mathrm{inv}({\bf y})$ in linear time in the length of ${\bf y}$ by using the following Theorem IV.1.

Theorem IV.1.

Set ${\bf s}\coloneqq(n,n-1,\cdots,2,1)$ . For ${\bf y}\in\mathbb{B}^{n}$ ,

\displaystyle\mathrm{inv}({\bf y})=\rho_{\bf s}({\bf y})-{{\mathrm{wt}({\bf y})+1}\choose 2}.

Proof.

Since ${\bf y}\in\mathbb{B}^{n}$ ,

	$\displaystyle\mathrm{inv}({\bf y})$	$\displaystyle=\#\{(i,j)\mid 1\leq i<j\leq n,y_{i}>y_{j}\}$
		$\displaystyle=\#\{(i,j)\mid 1\leq i<j\leq n,(y_{i},y_{j})=(1,0)\}$

holds. Set $I\coloneqq\{i\in[n]\mid y_{i}=1\}$ . Then, we have

	$\displaystyle\#\{(i,j)\mid 1\leq i<j\leq n,(y_{i},y_{j})=(1,0)\}$
	$\displaystyle=\sum_{i\in I}\#\{j\in[n]\mid i<j,y_{j}=0\}$
	$\displaystyle=\sum_{i\in I}((n-i)-\#\{j\in[n]\mid i<j,y_{j}=1\})$
	$\displaystyle=\sum_{i\in I}((n-i+1)-(1+\#\{j\in[n]\mid i<j,y_{j}=1\}))$
	$\displaystyle=\sum_{i\in I}(n-i+1)-\sum_{i\in I}\#\{j\in[n]\mid i\leq j,y_{j}=1\}$
	$\displaystyle=\sum_{i\in I}y_{i}(n-i+1)-\frac{\#I(\#I+1)}{2}$
	$\displaystyle=\rho_{\bf s}({\bf y})-{{\mathrm{wt}({\bf y})+1}\choose 2}.$

∎

Theorem IV.2.

For each function in Flowcharts 1 or 2, its computational cost is $O(|{\bf y}|)$ , where $|{\bf y}|$ is the length of ${\bf y}$ .

Proof.

Since the definitions of $\mathrm{sequence1}(p)$ , $\mathrm{sequence2}(p)$ , and $\mathrm{reversed}(p,{\bf y})$ , they can be computed in constant time.

Since we only need to use ”For loop” once, $\mathrm{position1}(r,{\bf y},{\bf k})$ , $\mathrm{position2}(r,{\bf y},{\bf k})$ , $\mathrm{position}(r,{\bf y},{\bf k})$ , $\mathrm{inserted}(p,b,{\bf y})$ , and ${\bf\tilde{y}}$ can be computed in linear time in the length of ${\bf y}$ .

Since the inner product, $\mathrm{inv}({\bf y})$ , and ${\bf\tilde{y}}$ can be computed in linear time, $\mathrm{remainder}(a,m,{\bf y},{\bf k})$ and $\mathrm{weight}({\bf y},{\bf k})$ can be computed in linear time in the length of ${\bf y}$ . ∎

The following is a corollary of Theorem IV.2.

Corollary IV.3.

Algorithm 1 and Algorithm 2 are linear time algorithms in the code-length.

V conclusion

In this paper, we provided the single deletion/reversal error-correcting algorithm for monotone codes and the single BAD/BAR error-correcting algorithm for azinv codes. Constructions of these codes are different. However, algorithms for these codes and the proofs of Theorem II.4 and Theorem III.3 correspond to each other.

In Section IV, we provided the unification of these decoding algorithms for monotone codes and azinv codes. The respective deletion error-correcting algorithms for monotone codes and azinv codes are represented by the same flowchart, and the respective reversal error-correcting algorithms for monotone codes and azinv codes are represented by the same flowchart. We also showed that these algorithms are linear-time algorithms.

As a future work, we will consider decoding algorithms for single insertion errors for monotone codes and azinv codes. Furthermore, we will concider decoding algorithms for other deletion/reversal errors. Monotone codes are defined by $\rho_{\bf k}({\bf x})$ and azinv codes are defined by $\tau(\bf x)$ . By replacing one of these functions with the other, we will create new codes that are capable of the other deletion error-correcting and the other reversal error-correcting. These error-correcting algorithms are expected to have the same flowcharts as the ones for monotone codes and azinv codes.

Moreover, since monotone codes can freely take a positive monotonic increasing integer sequence ${\bf k}$ , it is expected to be able to add the other property to monotone codes in addition to the single deletion/reversal error-correctable property. For example, it is known to be able to add properties of being two-deletion error-correctable[10] and easy to encode [8]. Monotone codes are generalized by introducing parameter ${\bf k}$ into Levenshtein codes. In the same way, the generalization with parameter ${\bf k}$ in azinv codes can be considered. The function $\mathrm{inv}({\bf x})$ used to define azinv codes has the property of IV.1. We can generalize azinv codes by taking a positive monotonic decreasing integer sequence as ${\bf s}$ in IV.1. The generalized azinv codes are expected to be able to add properties of being two-BAD error-correctable and easy to encode. In addition to these properties, there are some other similar properties in Levenshtein codes and azinv codes, such as optimality and convergence[9]. We would like to discuss these topics in a future work for further development of our research.

VI acknowledgement

This paper is partially supported by KAKENHI 18H01435.

References

[1] Tilo Buschmann and Leonid V Bystrykh. Levenshtein error-correcting barcodes for multiplexed dna sequencing. BMC bioinformatics, 14(1):272, 2013.
[2] Ryan Gabrys, Eitan Yaakobi, and Olgica Milenkovic. Codes in the damerau distance for deletion and adjacent transposition correction. IEEE Transactions on Information Theory, 64(4):2550–2570, 2017.
[3] Yeow Meng Chee, Han Mao Kiah, Alexander Vardy, Eitan Yaakobi, et al. Coding for racetrack memories. IEEE Transactions on Information Theory, 64(11):7094–7112, 2018.
[4] Jin Sima and Jehoshua Bruck. Correcting deletions in multiple-heads racetrack memories. In 2019 IEEE International Symposium on Information Theory (ISIT), pages 1367–1371. IEEE, 2019.
[5] Masato Inoue and Haruhiko Kaneko. Deletion/insertion/reversal error correcting codes for bit-patterned media recording. In 2011 IEEE International Symposium on Defect and Fault Tolerance in VLSI and Nanotechnology Systems, pages 286–293. IEEE, 2011.
[6] Anantha Raman Krishnan and Bane Vasic. Coding for correcting insertions and deletions in bit-patterned media recording. In 2011 IEEE Global Telecommunications Conference-GLOBECOM 2011, pages 1–5. IEEE, 2011.
[7] V.I. Levenshtein. Binary codes capable of correcting deletions, insertions, and reversals. Soviet physics doklady, 10(8):707–710, 1966.
[8] Manabu Hagiwara. On ordered syndromes for multi insertion/deletion error-correcting codes. In Information Theory (ISIT), 2016 IEEE International Symposium on, pages 625–629. IEEE, 2016.
[9] Manabu Hagiwara. Perfect codes for single balanced adjacent deletions. In Information Theory (ISIT), 2017 IEEE International Symposium on, pages 1938–1942. IEEE, 2017.
[10] Albertus SJ Helberg and Hendrik C Ferreira. On multiple insertion/deletion correcting codes. IEEE Transactions on Information Theory, 48(1):305–308, 2002.

	$\displaystyle p$	$\displaystyle=\max\{j\in[4]\mid R^{(1)}(j,{\bf y})=2)\}$
		$\displaystyle=3,$
	$\displaystyle b$	$\displaystyle=0.$

	$\displaystyle p$	$\displaystyle=\min\{3,17\}$
		$\displaystyle=3.$

	$\displaystyle p$	$\displaystyle=\max\{j\in[4]\mid L^{(1)}(j,{\bf\tilde{y}})=3)\}$
		$\displaystyle=4.$

	$\displaystyle p$	$\displaystyle=6-\min\{5,5\}$
		$\displaystyle=1.$