Robust low-delay Streaming PIR using convolutional codes
Abstract
In this paper we investigate the design of a low-delay robust streaming PIR scheme on coded data that is resilient to unresponsive or slow servers and can privately retrieve streaming data in a sequential fashion subject to a fixed decoding delay. We present a scheme based on convolutional codes and the star product, and assume no collusion between servers. In particular, we propose the use of convolutional codes whose column distances increase as fast as possible, known as Maximum Distance Profile (MDP) codes. We show that the proposed scheme can deal with a large variety of erasure patterns.
Index Terms:
Private Information Retrieval, Private Streaming, Convolutional Codes, Low-Delay, MDP Codes, Erasure Channel.

I Introduction
Video traffic has grown explosively and is expected to keep growing exponentially in the coming years [1]. Service providers for real-time video streaming are typically hosted in a public cloud, with multiple servers in different data centers, e.g. Google Cloud, Amazon CloudFront and Microsoft Azure. These cloud services aim for private and low-latency communications.
The problem of Private Information Retrieval (PIR) has attracted a lot of attention in recent years; it studies how to retrieve a file from a storage system without revealing to the servers which file is desired. It was initially addressed for replicated files [7] and more recently for coded files [6, 8, 13, 16, 17]. In this last setting, the general model of the information theoretic PIR problem is as follows. A coded database is distributed over several servers storing the files, and we assume that the user knows how the content is stored on the servers. Each file is coded and stored independently using the same code, and the user wants to retrieve a particular file from the database with zero information gain by the servers, i.e., the user wants information theoretic privacy [19]. Recently, the literature on PIR has grown considerably, with extensions to more general PIR models with several additional constraints. Most of the efforts in private retrieval have focused on efficient schemes that optimize different metrics, such as communication cost or rate. However, in many cases some of the servers may be busy and fail to respond within a desirable time frame, or network failures may occur. For this reason, new robust schemes were proposed to deal with such scenarios [19, 18], adding redundancy to tolerate missing responses from some of the servers. PIR schemes on coded data for Byzantine or unresponsive servers were presented in [21, 18]. These schemes are suited for retrieving one single file and therefore use block codes. In [10] a scheme for sequential retrieval was proposed, but again for a given set of files of fixed size and assuming that all the responses of the servers are lost at the same time instant. The case of a non-bursty channel is also considered in [10], but only using unit memory convolutional codes. However, to the best of the authors' knowledge, the problem of low-delay private retrieval of a stream of files (of undetermined length) with some slow or unresponsive servers remains unexplored.
In this work we investigate this more general problem and propose a novel robust scheme for low-delay streaming retrieval of files from servers in the presence of possibly unresponsive servers by using Maximum Distance Profile (MDP) convolutional codes. This class of codes is suitable for low-delay streaming applications as these codes possess optimal error-correcting capabilities within a decoding window, see [5, 20]. One of the advantages of using convolutional codes over block codes is the sliding window flexibility, which allows one to select different decoding windows according to the erasure pattern. We show how to take advantage of this property to provide robust PIR in this context. We present a scheme that is able to stream files consisting of many stripes in the presence of erasures without assuming any particular structure in the sequence of erasures. The model in [10] treated burst erasure channels using general convolutional codes, whereas the non-bursty channel case was treated using unit memory convolutional codes. Unit memory codes are restricted to store only what occurred in the previous instant and are therefore far from optimal for low-delay applications when the given delay constraint is larger than one. Note also that when only burst erasure channels are assumed, there exist concrete constructions of convolutional codes that are optimal in such a context [5, 4, 14]. In this work we extend this thread of research and consider a not necessarily bursty channel using convolutional codes with no restriction on the memory, namely, MDP convolutional codes. In contrast to [10], where the response of the servers is built in a convolutional fashion but the storage code is still a block code, we also use a convolutional code to store the files on the servers.
II Preliminaries
In this section we recall basic material and introduce the definitions needed for this work, including the notions of convolutional code and superregular matrix. Let $\mathbb{F}$ be a finite field of size $q$ and $\mathbb{F}[z]$ be the ring of polynomials with coefficients in $\mathbb{F}$.
Definition 1
An $(n,k)$ block code $\mathcal{C}$ is a $k$-dimensional subspace of $\mathbb{F}^n$, i.e., there exists a full row rank matrix $G \in \mathbb{F}^{k \times n}$ such that
$$\mathcal{C} = \left\{ uG \ : \ u \in \mathbb{F}^k \right\}.$$
$G$ is called a generator matrix of the code and is unique up to left multiplication with an invertible matrix. Furthermore, $u \in \mathbb{F}^k$ is called a message vector and the elements $v = uG \in \mathcal{C}$ are called codewords.
Convolutional codes process a continuous sequence of data instead of blocks of fixed vectors as done by block codes. If we introduce a variable $z$, called the delay operator, to indicate the time instant at which each information vector arrived or each codeword was transmitted, then we can represent the message sequence $u_0, u_1, u_2, \dots$ as a polynomial vector $u(z) = \sum_{t} u_t z^t$. Formally, we can define convolutional codes as follows.
A rate $k/n$ convolutional code $\mathcal{C}$ [20] is an $\mathbb{F}[z]$-submodule of $\mathbb{F}^n[z]$ of rank $k$ given by
$$\mathcal{C} = \left\{ u(z)G(z) \ : \ u(z) \in \mathbb{F}^k[z] \right\},$$
where $G(z) \in \mathbb{F}^{k \times n}[z]$ is a matrix, called generator matrix, that is basic, i.e., has a polynomial right inverse.
Note that if $v(z) = u(z)G(z)$, with $G(z) = \sum_{i=0}^{\mu} G_i z^i$, $G_i \in \mathbb{F}^{k \times n}$, then the coefficients of $v(z) = \sum_{t} v_t z^t$ are given by
$$v_t = \sum_{i=0}^{\mu} u_{t-i} G_i, \qquad \text{where } u_j = 0 \text{ for } j < 0.$$
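To make this coefficientwise description concrete, the following minimal sketch encodes a short message sequence over a small prime field; the field size and the coefficient matrices are arbitrary illustrative choices and not parameters used later in the paper.

```python
# Minimal sketch of convolutional encoding over F_p via v_t = sum_i u_{t-i} G_i.
# The prime p and the matrices G_0, G_1 below are illustrative assumptions.
p = 5
G = [[[1, 1]], [[1, 2]]]          # G[i] is the k x n coefficient matrix G_i
k, n, mu = 1, 2, len(G) - 1       # rate-1/2 code with memory 1

def encode(u_seq):
    """Encode a message sequence u_0, u_1, ... (each u_t is a list of k symbols)."""
    v_seq = []
    for t in range(len(u_seq)):
        v_t = [0] * n
        for i in range(min(mu, t) + 1):       # only the last mu inputs still influence v_t
            for a in range(k):
                for b in range(n):
                    v_t[b] = (v_t[b] + u_seq[t - i][a] * G[i][a][b]) % p
        v_seq.append(v_t)
    return v_seq

print(encode([[1], [0], [3]]))    # [[1, 1], [1, 2], [3, 3]]
```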
The maximum degree of all polynomials in the $i$-th row of $G(z)$ is denoted by $\delta_i$. The degree $\delta$ of $\mathcal{C}$ is defined as the maximum degree of the full size minors of $G(z)$. We say that $\mathcal{C}$ is an $(n,k,\delta)$ convolutional code [15]. Important for the performance of a code in terms of error-free decoding is the (Hamming) distance between two codewords. In the case of convolutional codes, the most relevant notion of distance for low-delay decoding is the column distance, which can be defined as follows.
The $j$-th column distance [11] is defined as
$$d_j^c(\mathcal{C}) = \min \left\{ \operatorname{wt}\left(v_{[0,j]}(z)\right) \ : \ v(z) \in \mathcal{C},\ v_0 \neq 0 \right\},$$
where $v_{[0,j]}(z) = v_0 + v_1 z + \cdots + v_j z^j$ represents the $j$-th truncation of the codeword $v(z) \in \mathcal{C}$ and
$$\operatorname{wt}\left(v_{[0,j]}(z)\right) = \sum_{t=0}^{j} \operatorname{wt}(v_t),$$
where $\operatorname{wt}(v_t)$ is the Hamming weight of $v_t$, i.e., the number of nonzero components of $v_t$, for $t = 0, \dots, j$. For simplicity, we write $d_j^c$ instead of $d_j^c(\mathcal{C})$.
The $j$-th column distance is upper bounded [9] by
$$d_j^c \leq (n-k)(j+1) + 1,$$
and the maximality of any of the column distances implies the maximality of all the previous ones, that is, if $d_j^c = (n-k)(j+1)+1$ for some $j$, then $d_i^c = (n-k)(i+1)+1$ for all $i \leq j$. The value
$$L = \left\lfloor \frac{\delta}{k} \right\rfloor + \left\lfloor \frac{\delta}{n-k} \right\rfloor \qquad (1)$$
is the largest value of $j$ for which the bound can be achieved, and an $(n,k,\delta)$ convolutional code with $d_L^c = (n-k)(L+1)+1$ is called a maximum distance profile (MDP) code [9]. Hence, MDP codes have optimal error correcting capabilities within time intervals of length $L+1$ and are therefore ideal for low delay correction. In this work we shall assume that the retrieval must be performed within a given delay constraint, see [2, 5].
Assume that $G(z) = \sum_{i=0}^{\mu} G_i z^i$, with $G_i \in \mathbb{F}^{k \times n}$, and consider the associated sliding matrix
$$G_j^c = \begin{pmatrix} G_0 & G_1 & \cdots & G_j \\ & G_0 & \cdots & G_{j-1} \\ & & \ddots & \vdots \\ & & & G_0 \end{pmatrix} \qquad (2)$$
with $G_i = 0$ when $i > \mu$, for $j \in \mathbb{N}_0$.
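The following sketch brute-forces the first column distances of a small illustrative code directly from the truncated sliding matrices and compares them with the bound $(n-k)(j+1)+1$; the field size and the coefficient matrices are arbitrary assumptions made only for this example.

```python
import itertools

p = 5                     # illustrative prime field size (assumption)
n, k, mu = 2, 1, 1        # small rate-1/2 code with memory 1
G = [[[1, 1]], [[1, 2]]]  # G[i] is the k x n coefficient matrix G_i (example choice)

def sliding_matrix(j):
    """Truncated sliding generator matrix G_j^c of size (j+1)k x (j+1)n, cf. (2)."""
    rows = []
    for r in range(j + 1):                  # block row r corresponds to u_r
        for a in range(k):
            row = [0] * ((j + 1) * n)
            for c in range(r, j + 1):       # block column c receives G_{c-r}
                if c - r <= mu:
                    for b in range(n):
                        row[c * n + b] = G[c - r][a][b] % p
            rows.append(row)
    return rows

def column_distance(j):
    """Brute-force d_j^c: minimum weight of u_[0,j] G_j^c over all u with u_0 != 0."""
    M = sliding_matrix(j)
    best = None
    for u in itertools.product(range(p), repeat=(j + 1) * k):
        if all(x == 0 for x in u[:k]):      # the definition requires v_0 != 0,
            continue                        # i.e. u_0 != 0 since G_0 has full row rank
        v = [sum(u[i] * M[i][c] for i in range(len(u))) % p
             for c in range((j + 1) * n)]
        wt = sum(1 for x in v if x != 0)
        best = wt if best is None else min(best, wt)
    return best

for j in range(3):
    print(f"d_{j}^c = {column_distance(j)}   (bound (n-k)(j+1)+1 = {(n - k) * (j + 1) + 1})")
```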
Theorem 2 ([9])
Let $G_j^c$ be the matrices defined in (2). Then the following statements are equivalent:

1. $d_j^c = (n-k)(j+1)+1$;
2. every full size minor of $G_j^c$ formed from columns with indices $1 \leq t_1 < \cdots < t_{(j+1)k}$, where $t_{sk+1} > sn$ for $s = 1, \dots, j$, is nonzero.

In particular, when $j = L$, $\mathcal{C}$ is an MDP code.
Theorem 3
[20, Theorem 3.1] Let $\mathcal{C}$ be an $(n,k,\delta)$ MDP convolutional code. If in any sliding window of length $(L+1)n$ at most $(L+1)(n-k)$ erasures occur in a transmitted sequence, then complete recovery is possible.
Considering the proof of this theorem, one sees that the recovery is even possible within a delay of windows of size and that the given condition for complete recovery is only sufficient but not necessary.
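The following minimal sketch illustrates the mechanism behind this recovery for a single window: the unerased coordinates of the truncated codeword $v_{[0,2]} = u_{[0,2]} G_2^c$ form a linear system in the message symbols of the window, which can be solved over the field whenever the corresponding submatrix of the sliding matrix is invertible. The rate-1/2, memory-1 code over $\mathbb{F}_5$ and the erasure pattern used here are illustrative assumptions rather than the parameters of the scheme, and the actual decoder of [20] proceeds sequentially and may defer some symbols to later windows.

```python
# Window-based erasure recovery sketch over F_p (p, code and erasure pattern are
# illustrative assumptions).  G2c is the sliding matrix G_2^c of a small rate-1/2,
# memory-1 code with G_0 = [1 1] and G_1 = [1 2], cf. (2).
p = 5
G2c = [
    [1, 1, 1, 2, 0, 0],   # contribution of u_0
    [0, 0, 1, 1, 1, 2],   # contribution of u_1
    [0, 0, 0, 0, 1, 1],   # contribution of u_2
]

def solve_mod_p(A, b, p):
    """Solve the square system A x = b over F_p (A is assumed invertible)."""
    size = len(A)
    M = [row[:] + [b[i]] for i, row in enumerate(A)]
    for col in range(size):
        piv = next(r for r in range(col, size) if M[r][col] % p != 0)
        M[col], M[piv] = M[piv], M[col]
        inv = pow(M[col][col], p - 2, p)
        M[col] = [(x * inv) % p for x in M[col]]
        for r in range(size):
            if r != col and M[r][col] % p != 0:
                f = M[r][col]
                M[r] = [(M[r][c] - f * M[col][c]) % p for c in range(size + 1)]
    return [M[r][size] for r in range(size)]

u = [3, 1, 4]                                   # message symbols u_0, u_1, u_2 of the window
v = [sum(u[i] * G2c[i][c] for i in range(3)) % p for c in range(6)]
erased = {0, 2, 4}                              # three erasures, spread over the window

# Every unerased coordinate v_c yields one linear equation in u_0, u_1, u_2.
cols = [c for c in range(6) if c not in erased]
A = [[G2c[i][c] for i in range(3)] for c in cols]
b = [v[c] for c in cols]
assert solve_mod_p(A, b, p) == u                # the whole window is recovered
```

With this example code, erasing the last three positions of the window instead would leave $u_2$ undetermined within the window, which is exactly the situation in which the sliding-window decoder defers that symbol to a later window.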
We will develop a PIR scheme in which the star product of certain block codes plays an important role.
Definition 4
The star product of two vectors $v, w \in \mathbb{F}^n$ is defined as $v \star w = (v_1 w_1, \dots, v_n w_n) \in \mathbb{F}^n$. The star product of two block codes $\mathcal{C}, \mathcal{D} \subseteq \mathbb{F}^n$ is defined as $\mathcal{C} \star \mathcal{D} = \operatorname{span}\{ c \star d \ : \ c \in \mathcal{C},\ d \in \mathcal{D} \}$.
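As a small illustration of this definition (the field size and generator matrices below are arbitrary assumptions), the following sketch computes componentwise products and the linear code generated by all such products of codewords; for the Reed-Solomon-type codes chosen here, that code is again of Reed-Solomon type with dimension $\dim\mathcal{C} + \dim\mathcal{D} - 1$, which is the kind of MDS behaviour used later in the scheme.

```python
from itertools import product

p = 7   # illustrative prime field size (assumption)

def star(v, w):
    """Componentwise (star) product of two vectors over F_p."""
    return tuple((a * b) % p for a, b in zip(v, w))

def span(gens):
    """All F_p-linear combinations of the generating vectors (tiny examples only)."""
    out = set()
    for coeffs in product(range(p), repeat=len(gens)):
        s = [0] * len(gens[0])
        for c, g in zip(coeffs, gens):
            s = [(x + c * y) % p for x, y in zip(s, g)]
        out.add(tuple(s))
    return out

# Two [4,2] Reed-Solomon-type codes sharing the evaluation points 0, 1, 2, 3
# (an illustrative choice; the scheme only needs suitable MDS codes).
GC = [(1, 1, 1, 1), (0, 1, 2, 3)]
GD = [(1, 1, 1, 1), (0, 1, 2, 3)]
# By bilinearity, the code generated by all star products of codewords is already
# generated by the star products of the generator rows.
star_gens = [star(gi, hj) for gi in GC for hj in GD]
C, D, CD = span(GC), span(GD), span(star_gens)
print(len(C), len(D), len(CD))   # 49 49 343: the generated code has dimension 2 + 2 - 1 = 3
```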
Star product PIR was first introduced in [8]. The main idea of this scheme is to design the queries to the different servers in such a way that, if the responses are formed as inner products of the query and the stored information, then the total response is a codeword of a certain star product code plus an error, where the error contains the information one is interested in. In [10] this scheme was adapted by forming the responses in a convolutional way. In the following section, we present a star product scheme where the responses as well as the storage code are convolutional.
III Streaming PIR scheme
We have sequences of files with for and . These are encoded with an MDP convolutional code with generator matrix to obtain the sequences of files with for and where we set for . Moreover, we have servers and for , we store the -th component of each vector (for , ) on server number . Furthermore, we assume that and that for , is the generator matrix of an MDS block code denoted by . We will present a construction for an MDP convolutional code with these properties later in this paper. It holds for all and for . Thus, we set for .
The user wants to stream the sequence for some without the servers knowing which one, i.e., without the servers knowing which sequence he or she is streaming. For our PIR scheme we assume that there is no collusion between the servers (i.e., the number of colluding servers, usually denoted by $t$ in the literature, is equal to $1$).
Set , let be the block code generated by and be a matrix whose rows are constituted by random codewords of (i.e. multiples of ). For a subset , we denote by the vector with entries and we denote by the -th standard basis vector of .
For , we send the following query to server :
(3)
where denotes the -th column of . We write with for and .
The response of server at time is
(4)
where for .
Hence the total response at time is given by
(5)
where diag(E) denotes the diagonal matrix with diagonal entries equal to the entries of the vector .
By Definition 4 and the definition of the code the star product code is equal to the MDS code . As is a linear code, any sum of codewords is again a codeword. Hence, the response has the form
(6)
for some .
We assume that it is possible that some parts of the response at time get lost during transmission and could not be received. Hence the vector could have some erased components. We denote by the set that consists of the positions of the erased components of the vector .
Lemma 5
If , the user is able to obtain the vector . In particular, this is true if , where is the number of erased components of the vector .
Proof:
Using equation (6) and the definition of the vector , we apply erasure decoding in the MDS code to the vector , where the set of erasures is the union of and . The lemma follows from the fact that an MDS code can correct any set of erasures whose cardinality is smaller than the minimum distance of the code. ∎
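To make the argument concrete, here is a single-time-instant sketch of star-product retrieval with Reed-Solomon codes. All names and parameters (field size, number of servers, code dimensions, file index, query positions, lost response) are illustrative assumptions, the query structure is simplified, and the convolutional structure across time instants of Section III is deliberately omitted; the sketch only mirrors the erasure-decode-and-subtract step used in this proof.

```python
import random

# Single-time-instant star-product retrieval sketch; all parameters below are
# illustrative assumptions, not the values used in the paper.
p = 11                      # prime field size
n = 7                       # number of servers
alphas = list(range(n))     # distinct evaluation points -> Reed-Solomon (MDS) codes
k_C, k_D = 2, 2             # dimensions of the storage code C and the retrieval code D
M = 3                       # number of files (one stripe per file at this time instant)
m = 1                       # index of the desired file
E = {2, 5}                  # positions where a standard basis vector is added to the query
lost = {0}                  # transmission erasure: the response of server 0 never arrives

def rs_encode(coeffs):
    """Evaluate the polynomial with the given coefficients at all points (RS encoding)."""
    return [sum(c * pow(a, i, p) for i, c in enumerate(coeffs)) % p for a in alphas]

def solve_mod_p(A, b):
    """Solve the square system A x = b over F_p (A is assumed invertible)."""
    N = len(A)
    Mx = [row[:] + [b[i]] for i, row in enumerate(A)]
    for col in range(N):
        piv = next(r for r in range(col, N) if Mx[r][col] % p != 0)
        Mx[col], Mx[piv] = Mx[piv], Mx[col]
        inv = pow(Mx[col][col], p - 2, p)
        Mx[col] = [(x * inv) % p for x in Mx[col]]
        for r in range(N):
            if r != col and Mx[r][col] % p != 0:
                f = Mx[r][col]
                Mx[r] = [(Mx[r][c] - f * Mx[col][c]) % p for c in range(N + 1)]
    return [Mx[r][N] for r in range(N)]

# Storage: file f is a stripe in F_p^{k_C}, encoded with C; server i stores column i.
stripes = [[random.randrange(p) for _ in range(k_C)] for _ in range(M)]
Y = [rs_encode(x) for x in stripes]                      # Y[f][i] is held by server i

# Queries: column i of a matrix of random D-codewords, plus the m-th basis vector if i in E.
D_rows = [rs_encode([random.randrange(p) for _ in range(k_D)]) for _ in range(M)]
def query(i):
    q = [D_rows[f][i] for f in range(M)]
    if i in E:
        q[m] = (q[m] + 1) % p
    return q

# Responses: inner product of the query with the stored column (lost servers stay silent).
resp = {}
for i in range(n):
    if i in lost:
        continue
    q = query(i)
    resp[i] = sum(q[f] * Y[f][i] for f in range(M)) % p

# The response vector is a codeword of C * D (an RS code of dimension k_C + k_D - 1)
# plus the desired coded symbols Y[m][i] at the positions i in E.  Erasure-decode that
# codeword from positions outside E and outside the lost responses ...
known = [i for i in range(n) if i not in E and i not in lost][: k_C + k_D - 1]
A = [[pow(alphas[i], j, p) for j in range(k_C + k_D - 1)] for i in known]
codeword = rs_encode(solve_mod_p(A, [resp[i] for i in known]))
# ... and subtract it to obtain the desired coded symbols at the queried positions.
recovered = {i: (resp[i] - codeword[i]) % p for i in E if i not in lost}
assert all(recovered[i] == Y[m][i] for i in recovered)
print("recovered coded symbols of file", m, ":", recovered)
```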
For each for which the condition of the preceding lemma is not fulfilled we are not able to obtain . Therefore, we define
(7)
where denotes the zero matrix.
It remains to show how to obtain the desired sequence of files from the sequence . With the definitions and
(11)
one obtains
(12)
Denote by the identity matrix and set where each block of rows of contains identity matrices. Then, one has
(16)
Therefore, one obtains the following lemma.
Lemma 6
The column distances of the convolutional code with generator matrix where are equal to the column distances of .
Proof:
First note that the matrix defined in (11) is the sliding generator matrix of . Denote by the -th column distance of the code and by the matrix that consists of the first rows and the first columns of the matrix . Then, it holds
(17)
∎
Hence, we can use equation (12) to obtain via erasure decoding with an MDP convolutional code , where the set of positions of the total erasures, denoted by , has the form with
(18)
where for , the set is defined as
and should be defined analogously.
Hence, using also Theorem 3, we get the following theorem.
Theorem 7
Assume that . If the set of erasures given in (18) is such that in every sliding window of the sequence of size there are not more than erasures, then one can obtain the desired sequence of files from the sequence within time delay , i.e. one can privately obtain the sequence of files within time delay .
From this theorem we can deduce which erasure patterns our proposed scheme is guaranteed to correct.
Corollary 8
With the proposed scheme private reception within time delay is possible if for , there are not more than transmission erasures in positions of the sequence of responses and in every sliding window of this sequence of length there are not more than transmission erasures in positions .
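A small helper for testing whether a concrete erasure pattern satisfies the sliding-window condition behind Theorem 7 and Corollary 8 can look as follows; the window length W and erasure budget T are left as parameters because their concrete values depend on the code parameters of the scheme, and the numbers in the example calls are purely illustrative.

```python
def satisfies_window_condition(erased_positions, seq_len, W, T):
    """True if every window of W consecutive positions contains at most T erasures."""
    erased = set(erased_positions)
    return all(sum(1 for i in range(start, start + W) if i in erased) <= T
               for start in range(seq_len - W + 1))

# Illustrative numbers only: three scattered erasures pass, a burst of four does not.
print(satisfies_window_condition([0, 4, 9], seq_len=12, W=6, T=3))     # True
print(satisfies_window_condition([0, 1, 2, 3], seq_len=12, W=6, T=3))  # False
```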
Finally, we have to choose the cardinality of the set . The set itself is then chosen randomly with this fixed cardinality. A larger cardinality leads to more erasures for to correct, whereas a smaller cardinality leads to more erasures for to correct. To balance the two, we want to determine the cardinality such that the number of erasures one can correct in positions is approximately the same as the number of erasures one can correct in positions . We denote this number of erasures by . This approach leads to the following equations:
(19)
(20)
This implies
(21)
and consequently,
(22)
Having equality in this last equation implies . However, we need to be an integer, independent of . Since, depending on the erasure pattern, the MDP convolutional code might be able to correct more erasures than (20) indicates, we propose to choose rather smaller, which finally leads to
(23)
Of course, depending on the erasures that occur during transmission, other choices for could lead to a better performance. However, as we do not know the erasure pattern before transmission and have to choose beforehand, we cannot adapt it to the erasure pattern but have to choose it in such a way that the numbers of channel erasures the codes and are able to tolerate are balanced.
Note that we can correct more erasures in if is small (as the code has a larger minimum distance if is small). This means that we could tolerate slightly more erasures at the beginning of the stream than at the end.
In the following, we illustrate the erasure correcting capability of our scheme with the help of two examples.
Example 9
Let , and . This implies and , i.e. is an MDP convolutional code that can recover all erasure patterns for which in each sliding window of size there are not more than erasures. We assume . Moreover, according to equation (23), we have . We illustrate one window of the response sequence in the following figure, where the squares with content denote the positions of the set :
[Figure: one window of the response sequence; the squares labeled j mark the positions of the corresponding set.]
According to Corollary 8 we are able to recover erasures in the first positions with erasure decoding in . Moreover, and are both able to correct an additional erasure. Finally, the convolutional code is able to correct erasures in the positions in which we have a . To count the total number of erasures as well as the number of erasure patterns that can be corrected (assuming that erasures occur independently of each other), we have to distinguish two cases.
For the first case, we assume that the erasure pattern allows decoding with , and . Hence, we are able to correct up to erasures in positions. Moreover, if we assume that the erasures occur independently of each other, we could correct different erasure patterns.
For the second case, we assume that the erasure pattern is such that there exists such that decoding with is not possible, i.e. the -th window of size has to be considered as completely lost for . In order that recovery is still possible, decoding with for has to be possible and only one additional erasure in the positions in outside the completely erased window can be tolerated. Thus, for the maximal number of erasures that can be corrected is and the number of correctable erasure patterns equals . For , the maximum number of erasures that can be corrected is and the number of correctable erasure patterns equals .
Summing up, considering all cases, one gets that there are erasure patterns that we can correct.
If one were to choose , correction would no longer be possible if one complete window of size were lost. We would still be able to correct erasures but all of these erasures would have to be in positions
whereas no erasures in positions
could be corrected. Counting the number of erasure patterns that we are able to correct under the assumption of independent erasures, we get .
If one were to choose , there are three cases to distinguish. For the first case, assume that no window of size is completely lost for recovery with . Then, we can again correct erasures but only of these erasures can have a position in
The number of erasure patterns that could be corrected is .
For the second case, assume that correction with is not possible for exactly one . For , one could correct up to erasures and erasure patterns, for , up to erasures and erasure patterns.
For the third case, assume that correction with is not possible for exactly two values , denoted by and . If , one could correct up to erasures and erasure patterns, for , up to erasures and erasure patterns.
Hence the total number of erasure patterns that could be corrected is . This illustrates that our choice of is optimal if we assume the erasures to occur independently of each other.
Finally, we want to consider how many erasures we can correct in a larger window and choose a window of size 24, which is illustrated as follows:
[Figure: a window of size 24 of the response sequence; the squares labeled j mark the positions of the corresponding sets.]
According to Corollary 8 we are able to recover erasures in the first positions with erasure decoding in and additional erasures with for . The convolutional code is able to correct up to erasures in the positions with . Under the assumption that decoding with is possible for , we are able to correct up to erasures. If decoding is not possible for exactly one , one can correct up to erasures if and up to erasures if . If decoding is not possible for exactly two of the star product codes, recovery is only possible if this happens for and , in which case up to erasures can be corrected.
If one were to choose , we would only be able to correct erasures and all of these erasures would have to be in positions in
If one were to choose , one has to distinguish four cases. Under the assumption that decoding with is possible for , we are able to correct up to erasures in total but only of these erasures could be in
If decoding is not possible for exactly one , one can correct up to erasures if and up to erasures if . If decoding is not possible for exactly two of the star product codes and is among them, one could correct up to erasures and if is not among them, one could correct up to erasures. If decoding is not possible for exactly three of the star product codes, one could correct up to erasures (but there are only two erasure patterns for this scenario).
Example 10
Let , and . This implies (if we use for the construction presented in the next section, where is full rank) and , i.e. is an MDP convolutional code that can recover all erasure patterns for which in each sliding window of size there are not more than erasures. We assume . Moreover, according to equation (23), we have .
According to Corollary 8 we are able to recover erasures in the first positions of the response sequence with erasure decoding in . Moreover, and are both able to correct an additional erasure. Finally, the convolutional code is able to correct erasures in the positions covered by one of the sets . In total, we are able to correct erasures in positions in the case that correction with all is possible, up to erasures in the case that (only) the first window of size is lost completely and up to erasures in the case that another window of size is erased completely.
If one were to choose , we would be able to correct erasures but all of these erasures would have to be in positions in
whereas no erasures in
could be corrected.
If one were to choose , we could again correct erasures in the case that correction with all is possible, but only of these erasures could have a position in
Moreover, we could correct up to erasures in the case that the first window of size is lost completely and up to erasures in the case that another window of size is erased completely.
Again our choice of is optimal if we assume the erasures to occur independently of each other.
Remark 11
The major advantage of using convolutional codes instead of block codes is that the symbols in different windows of size are dependent on each other, and hence erasures can be recovered not only with the help of the received symbols in the same window but also with the help of received symbols from other windows. This is also illustrated by the previous examples, where recovery is possible even if all symbols with positions in are erased, provided that not too many symbols with positions in and are erased. This is due to the fact that there are erasure patterns where all symbols of the first window of size are erased but recovery with a convolutional code is still possible. Of course, this can never be possible using block codes, since in that case all windows of size have to be decoded independently of each other.
IV Construction of suitable streaming codes
The aim of this section is to provide constructions for MDP convolutional codes , which have the additional property that, for , is an MDS block code, as proposed at the beginning of the previous section. To this end, we will use the following lemma and proposition.
Lemma 12
[12] Let be an block code with generator matrix . Then, is MDS if, and only if, all full size minors of are nonzero.
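As a quick illustration of this criterion (the field size, code dimensions and evaluation points below are arbitrary choices and not the matrices of Theorem 14), the following sketch checks all full size minors of a small Vandermonde-type generator matrix.

```python
from itertools import combinations

p = 7
k, n = 3, 5
alphas = [1, 2, 3, 4, 5]                                  # distinct points (assumption)
G = [[pow(a, i, p) for a in alphas] for i in range(k)]    # k x n Vandermonde-type generator

def det_mod_p(M):
    """Determinant over F_p via Gaussian elimination."""
    M = [row[:] for row in M]
    det, size = 1, len(M)
    for col in range(size):
        piv = next((r for r in range(col, size) if M[r][col] % p != 0), None)
        if piv is None:
            return 0
        if piv != col:
            M[col], M[piv] = M[piv], M[col]
            det = -det
        det = (det * M[col][col]) % p
        inv = pow(M[col][col], p - 2, p)
        for r in range(col + 1, size):
            f = (M[r][col] * inv) % p
            M[r] = [(M[r][c] - f * M[col][c]) % p for c in range(size)]
    return det % p

# Lemma 12: the generated code is MDS iff every k x k minor of G is nonzero.
minors = [det_mod_p([[G[r][c] for c in cols] for r in range(k)])
          for cols in combinations(range(n), k)]
print("all full size minors nonzero (code is MDS):", all(m != 0 for m in minors))
```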
Proposition 13
[3, Theorem 3.3] Let be a primitive element of a finite field and be a matrix over with the following properties
1. if , then for a positive integer ;
2. if , then for any or for any ;
3. if , and , then ;
4. if , and , then .
Suppose is greater than any exponent of appearing as a nontrivial term of any minor of . Then has the property that each of its minors which is not trivially zero is nonzero.
The following theorem gives the desired construction.
Theorem 14
Let be prime, and be a primitive element of . For , set
(27)
Then, the convolutional code with generator matrix is an MDP convolutional code and moreover, for , is the generator matrix of an MDS block code if .
Proof:
V Conclusion
We have studied the problem of private streaming of a sequence of files, taking resilience against unresponsive servers as the primary metric for judging the efficiency of a PIR scheme. We proposed, for the first time, a general scheme for this problem. The scheme is based on MDP convolutional codes and the star product of codes. It is suited to a context where some servers fail to respond, in contrast to other solutions in the literature, where all the servers were assumed to fail at the same time instant. The presented approach can retrieve files in a sequential fashion and is therefore well suited for low-delay streaming applications. Some examples were presented to show how to take advantage of the proposed scheme. We derived a large set of erasure patterns that our codes can recover. Concrete constructions of such codes exist, although large field sizes are required. The construction of optimal codes for PIR over small fields that can deal with both burst and isolated erasures/errors is an interesting open problem that requires further research.
Acknowledgment
The work of the first and third author was supported by the Portuguese Foundation for Science and Technology (FCT-Fundação para a Ciência e a Tecnologia), through CIDMA - Center for Research and Development in Mathematics and Applications, within project UID/MAT/04106/2019. The first author was supported by the German Research Foundation within grant LI 3101/1-1. The second author was partially supported by Spanish grant AICO/2017/128 of the Generalitat Valenciana and the University of Alicante under the project VIGROB-287.
References
- [1] Cisco visual network index: Forecast and methodology, 2016-2021. Tech. Rep., June 2017, 2018.
- [2] N. Adler and Y. Cassuto. Burst-erasure correcting codes with optimal average delay. IEEE Trans. Inform. Theory, 63(5):2848–2865, May 2017.
- [3] P. Almeida, D. Napp, and R. Pinto. Superregular matrices and applications to convolutional codes. Linear Algebra and its Applications, 499:1–25, 2016.
- [4] A. Badr, A. Khisti, W. T. Tan, and J. Apostolopoulos. Robust streaming erasure codes based on deterministic channel approximations. In 2013 IEEE International Symposium on Information Theory, pages 1002–1006, 2013.
- [5] A. Badr, A. Khisti, Wai-Tian. Tan, and J. Apostolopoulos. Layered constructions for low-delay streaming codes. IEEE Trans. Inform. Theory, 63(1):111–141, 2017.
- [6] K. Banawan and S. Ulukus. The capacity of private information retrieval from coded databases. IEEE Transactions on Information Theory, 64(3):1945–1956, 2018.
- [7] Benny Chor, Eyal Kushilevitz, Oded Goldreich, and Madhu Sudan. Private information retrieval. J. ACM, 45(6):965–981, 1998.
- [8] R. Freij-Hollanti, O. Gnilke, C. Hollanti, and D. Karpuk. Private information retrieval from coded databases with colluding servers. SIAM Journal on Applied Algebra and Geometry, 1(1):647–664, 2017.
- [9] H. Gluesing-Luerssen, J. Rosenthal, and R. Smarandache. Strongly MDS convolutional codes. IEEE Trans. Inform. Theory, 52(2):584–598, 2006.
- [10] Lukas Holzbaur, Ragnar Freij-Hollanti, Antonia Wachter-Zeh, and Camilla Hollanti. Private streaming with convolutional codes. In 2018 IEEE Information Theory Workshop, ITW 2018, pages 550–554. Institute of Electrical and Electronics Engineers, 2019.
- [11] R. Johannesson and K. Sh. Zigangirov. Fundamentals of Convolutional Coding. IEEE Press, New York, 2015.
- [12] F. J. MacWilliams and N. J.A. Sloane. The Theory of Error-Correcting Codes. North Holland, Amsterdam, 1977.
- [13] U. Martínez-Peñas. Private information retrieval from locally repairable databases with colluding servers. In 2019 IEEE International Symposium on Information Theory (ISIT), 2019.
- [14] E. Martinian and C. E. W. Sundberg. Burst erasure correction codes with low decoding delay. IEEE Transactions on Information Theory, 50(10):2494–2502, 2004.
- [15] R. J. McEliece. The algebraic theory of convolutional codes. In Handbook of Coding Theory, volume 1, pages 1065–1138. Elsevier Science Publishers, 1998.
- [16] N. B. Shah, K. V. Rashmi, and K. Ramchandran. One extra bit of download ensures perfectly private information retrieval. In 2014 IEEE International Symposium on Information Theory, pages 856–860, 2014.
- [17] R. Tajeddine and S. El Rouayheb. Private information retrieval from MDS coded data in distributed storage systems. In 2016 IEEE International Symposium on Information Theory (ISIT), pages 1411–1415, 2016.
- [18] R. Tajeddine, O. W. Gnilke, D. Karpuk, R. Freij-Hollanti, and C. Hollanti. Private information retrieval from coded storage systems with colluding, byzantine, and unresponsive servers. IEEE Transactions on Information Theory, 65(6):3898–3906, 2019.
- [19] R. Tajeddine and S. E. Rouayheb. Robust private information retrieval on coded data. In 2017 IEEE International Symposium on Information Theory (ISIT), pages 1903–1907, 2017.
- [20] V. Tomas, J. Rosenthal, and R. Smarandache. Decoding of convolutional codes over the erasure channel. IEEE Trans. Inform. Theory, 58(1):90–108, January 2012.
- [21] Yiwei Zhang and Gennian Ge. Private information retrieval from MDS coded databases with colluding servers under several variant models. 2017.