Article

Repair Rates for Multiple Descriptions on Distributed Storage †

1  Department of Electrical Engineering, University of Hawaii Manoa, Honolulu, HI 96822, USA
2  Division of Computer Convergence, Chungnam National University, 99 Daehak-ro, Yuseong-gu, Daejeon 34134, Korea
3  Department of Electrical and Computer Engineering, Seoul National University, Seoul 08826, Korea
*  Author to whom correspondence should be addressed.
This paper is an extended version of our paper published in International Symposium on Information Theory and Its Applications, ISITA2020, Kapolei, Hawai’i, USA, 24–27 October 2020, pp. 245–249, “Repair of Multiple Descriptions on Distributed Storage”; Copyright (c)2020 IEICE.
Submission received: 22 March 2022 / Revised: 22 April 2022 / Accepted: 25 April 2022 / Published: 27 April 2022
(This article belongs to the Special Issue Wireless Networks: Information Theoretic Perspectives II)

Abstract

In a traditional distributed storage system, a source can be restored perfectly when a certain subset of servers is contacted. The coding is independent of the contents of the source. This paper instead considers a lossy source coding version of this problem, where the more servers are contacted, the higher the quality of the restored source; an example could be video stored on distributed storage. In information theory, this is called the multiple description problem, where the distortion depends on the number of descriptions received. The problem considered in this paper is how to restore system operation when one of the servers fails and a new server replaces it, that is, repair. The requirement is that the distortions in the restored system should be no larger than in the original system. The question is how many extra bits are needed for repair. We find an achievable rate and show that it is optimal in certain cases. One conclusion is that it is necessary to design the multiple description codes with repair in mind; simply using an existing multiple description code results in unnecessarily high repair rates.

1. Introduction

In distributed storage systems, data is divided into multiple segments that are then stored on separate servers. In a typical setup [1], data is divided into k segments that are stored on n servers using an $(n, k)$ maximum distance separable (MDS) code. If a user is able to contact any set of k servers, the data can be reconstructed. Notice that in this setup, if the user is able to contact fewer than k servers, it can retrieve no information, while on the other hand, there is no advantage in being able to contact more than k servers. One could instead want the quality of the reconstructed data to depend on how many servers a user is able to contact. An example could be video: it is common that the quality of streamed video depends on the network connection. In the context of distributed storage, the quality would now depend on the number of servers the user can connect to, which could be constrained by network connection, physical location, delay, or cost. In information theory, this is known as multiple description coding [2,3]. Originally, multiple description coding was aimed at packet transmission networks, where some packets may be lost, but it can be directly applied to the distributed storage problem. We will accordingly call the systems we consider multiple description distributed storage.
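As a concrete illustration (our own sketch, not from the paper), the MDS reconstruction property can be demonstrated over the reals with a Vandermonde encoding matrix; all parameters below are arbitrary.

```python
import numpy as np

# A toy illustration (ours, not from the paper) of the (n, k) MDS property
# over the reals: k data symbols are encoded into n coded symbols with a
# Vandermonde matrix; any k rows (distinct evaluation points) are invertible,
# so any k surviving servers suffice for exact reconstruction.
n, k = 5, 3
data = np.array([2.0, -1.0, 0.5])                        # k source segments
G = np.vander(np.arange(1, n + 1), k, increasing=True)   # n x k encoder
stored = G @ data                                        # one symbol per server

survivors = [0, 2, 4]                                    # any k of the n servers
recovered = np.linalg.solve(G[survivors, :], stored[survivors])
assert np.allclose(recovered, data)
```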
A central issue in distributed storage is how to repair the system when one or more of the servers fail or become unavailable and are replaced by new servers [1]. In traditional distributed storage, this is also solved by the MDS code: if one server fails, the repair can be done by contacting k surviving servers, reconstructing the source, and then generating a new coded segment. The problem we consider in this paper is how repair can be done for multiple description distributed storage. The paper [1] and many following papers also consider how much network traffic is required for repair. However, in this paper we only consider the amount of additional data that needs to be stored for repair to be possible. The amount of network traffic is a topic for future research.
In general, the quality of reconstruction could depend not only on the number of servers connected, but on which servers. However, to simplify the problem, we only consider the symmetric scenario where the quality depends only on the number of servers. This is the symmetric multiple description problem considered in [4]. A multiple description coding system with repair is specified as follows: when a subset $J \subset \{1,\dots,n\}$ of servers is contacted, a source X should be restored with a distortion of at most $D_J$. If one (or multiple) of the servers fails, we should be able to set up a replacement server with enough information so that the whole region $D_J$, $J \subset \{1,\dots,n\}$ is restored. We consider two scenarios:
  • There is a special (highly reliable) repair server that does not participate in the usual operation of the system, but only comes into action if another server fails. The repair server can contact all other (non-failed) servers and use their information combined with its own information to restore the failed server (collaborative repair).
  • The repair information is stored in a distributed fashion among the n servers (distributed repair).
For simplicity, in this paper we only consider failure of a single server.
A straightforward solution is to separate the source coding problem (multiple description) and the repair problem. Any existing code for multiple description can then be used, and repair can be done using maximum distance separable (MDS) erasure codes as in traditional distributed storage [1]. We will use this as a baseline. For case 1 above, the repair server can simply store the xor (sum modulo 2) of the bits on the operational servers. When one server fails, the xor together with the bits from the surviving servers can restore the failed server. Thus, if each operational server stores $lR$ bits, the repair server also needs to store $lR$ bits; a sketch is given below. For distributed repair, the xor can be replaced with an $(n, n-1)$ erasure code. Therefore, in addition to the $lR$ bits for operation, each server needs to store $\frac{lR}{n-1}$ bits for repair. It should be clear that these rates are also optimal under separation: even if the system knows in advance which server will fail, it cannot store less information. We can consider this as a separate source–channel coding solution, with multiple description being the source coding and the repair being the channel coding. It is known that in many information theory problems, joint source–channel coding is superior to separation. This is then the question we consider here: can we find a better joint source–channel coding solution that beats the above rates? We will see that for some desired distortions, separation is in fact optimal, while in other cases, joint source–channel coding provides much better rates.
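A minimal sketch (ours) of this XOR baseline for the repair-server scenario, with arbitrary toy sizes:

```python
import secrets
from functools import reduce

# A toy sketch (ours) of the separation baseline with a dedicated repair
# server: it stores the bitwise XOR of all n operational servers' contents,
# so a single failed server equals the parity XORed with the survivors.
n, num_bytes = 4, 8
servers = [secrets.token_bytes(num_bytes) for _ in range(n)]

def xor_bytes(a: bytes, b: bytes) -> bytes:
    return bytes(x ^ y for x, y in zip(a, b))

parity = reduce(xor_bytes, servers)   # what the repair server stores (lR bits)

failed = 2
survivors = [s for i, s in enumerate(servers) if i != failed]
rebuilt = reduce(xor_bytes, survivors, parity)
assert rebuilt == servers[failed]
```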
The problem of repairing multiple descriptions has been considered in some previous papers. In [5], the authors consider a problem like scenario 1 above, but they do not give a single-letter description of rate-distortion regions. In [6], the authors consider practical codes for repairing. In the current paper, we aim to provide single-letter expressions for achievable rate-distortion regions, and in some cases the actual rate-distortion region. This paper is an extended version of our conference paper [7], with a proof of the general achievable rate and a specialization to the two level case, where we can prove optimality in certain cases.

2. Problem Description

In the following, we use the term repair node for the special repair server and operational nodes to denote the other servers. We let $I_k = \{1,\dots,k\}$ and $X_{I_k} = [X_1,\dots,X_k]$, with the definition $I_0 = \emptyset$ and $X_{I_0} = []$ (e.g., $H(Y\mid X_{I_0}) = H(Y)$). For variables with multiple indices, $X_{I_k,I_j}$ denotes a matrix of variables, i.e., the collection $\{X_{11}, X_{12},\dots,X_{1j}, X_{21},\dots,X_{k1}, X_{k2},\dots,X_{kj}\}$, and $X_{kI_j}$ denotes a row.
We consider a symmetric multiple description problem as in [4]. We have an i.i.d. (independent identically distributed) source X that takes values in a finite alphabet $\mathcal{X}$ and needs to be restored in the finite alphabet $\hat{\mathcal{X}}$; this can be generalized to a continuous alphabet Gaussian source through the usual quantization arguments [3]. Let $J \subset I_n$. We are given a collection of distortion measures $\tilde d_{|J|}:\mathcal{X}\times\hat{\mathcal{X}}\to\mathbb{R}^+$, and define
$$d_{|J|}\left(x^l, \hat x^l\right) = \frac{1}{l}\sum_{i=1}^{l}\tilde d_{|J|}(x_i, \hat x_i)$$
The required maximum distortion $D_J$ is then a function of $|J|$ and the distortion measure $d_{|J|}$ only.

2.1. Distributed Repair

We will first define the distributed repair problem. For a source sequence $x^l$ of length l, each node stores $lR_t$ bits. There are n encoding functions $f_i:\mathcal{X}^l\to\{1,\dots,2^{lR_t}\}$, $2^n - 2$ decoding functions $g_J:\{1,\dots,2^{lR_t}\}^{|J|}\to\hat{\mathcal{X}}^l$, $J\subset I_n$, $1\le|J|\le n-1$, and n repair functions $h_i:\{1,\dots,2^{lR_t}\}^{n-1}\to\{1,\dots,2^{lR_t}\}$. We define the error probability of repair as
$$P_r^{(l)} = \max_{i=1,\dots,n}P\left[h_i\left(f_{I_n\setminus\{i\}}(x^l)\right)\ne f_i(x^l)\right]$$
Here, $f_{I_n\setminus\{i\}}(x^l)$ is the length-$(n-1)$ list obtained by removing the ith component from $(f_1(x^l), f_2(x^l),\dots,f_n(x^l))$. We now say that a tuple $(R_t, D_1,\dots,D_{n-1})$ is achievable if there exists a sequence of $(2^{lR_t}, l)$ codes with
$$\forall m < n:\quad\lim_{l\to\infty}\max_{J:|J|=m}E\left[d_{|J|}\left(x^l, g_J\left(f_J(x^l)\right)\right)\right]\le D_m,\qquad\lim_{l\to\infty}P_r^{(l)} = 0$$
We call this exact repair. The repaired node is required to be an exact copy of the failed node, except that we allow a certain vanishing error rate. Notice that the randomness in the system is purely due to the source $x^l$. Thus, for a given sequence $x^l$, either all failures can be repaired exactly, and if they can be repaired once, they can be repaired infinitely many times; or some failures can never be repaired. The probability of the source sequences that are not repairable should be vanishingly small.
An alternative problem formulation, which we call functional repair, is to allow approximate repair, where the only requirement is that after repair the distortion constraint is satisfied. In that case, one would have to carefully consider repeated repair. In this paper, we will only consider exact repair for coding schemes. It should be noted that in the cases where we have tight converses (the two node case [7], Theorem 3 in some scenarios), the converses are actually for functional repair; thus, functional repair might not decrease rates.

2.2. Collaborative Repair

For collaborative repair with a dedicated repair node, each node stores $lR$ bits and the repair node $lR_r$ bits. There are now n encoding functions $f_i:\mathcal{X}^l\to\{1,\dots,2^{lR}\}$ and additionally a repair encoder $f_r:\mathcal{X}^l\to\{1,\dots,2^{lR_r}\}$, $2^n - 1$ decoding functions $g_J:\{1,\dots,2^{lR}\}^{|J|}\to\hat{\mathcal{X}}^l$, $J\subset I_n$, $1\le|J|\le n$, and n repair functions $h_i:\{1,\dots,2^{lR}\}^{n-1}\times\{1,\dots,2^{lR_r}\}\to\{1,\dots,2^{lR}\}$. We define the error probability of repair as
$$P_r^{(l)} = \max_{i=1,\dots,n}P\left[h_i\left(f_{I_n\setminus\{i\}}(x^l), f_r(x^l)\right)\ne f_i(x^l)\right]$$
We now say that a tuple $(R, R_r, D_1,\dots,D_n)$ is achievable if there exists a sequence of $(2^{lR}, 2^{lR_r}, l)$ codes with
$$\forall m\le n:\quad\lim_{l\to\infty}\max_{J:|J|=m}E\left[d_{|J|}\left(x^l, g_J\left(f_J(x^l)\right)\right)\right]\le D_m,\qquad\lim_{l\to\infty}P_r^{(l)} = 0$$

3. Achievable Rate

The rate-distortion region for multiple description coding is only known in a few cases; among those are the two node Gaussian case first studied in [2], and the two level case studied in [8,9]. There are, therefore, many different achievable schemes for multiple description coding, e.g., [4,10,11,12], and we have to design repairs for each specific method. In this paper, we will consider the Puri Pradhan Ramchandran (PPR) scheme [4,13], as this is specifically aimed at the symmetric case and is well-suited for repair. It is optimal in certain cases [8,9], but not always [11].
The coding method in [4] is based on source-channel erasure codes (SCEC) from [13]. An $(n,k)$-SCEC is similar to an $(n,k)$ MDS erasure code: if any k of n packets are received, the transmitted message can be recovered with a certain distortion. However, with an $(n,k)$-SCEC, if $m > k$ packets are received, the message can be recovered with a distortion that decreases with m. Using a concatenation of $(n,1), (n,2),\dots,(n,n)$ SCECs, [4] obtained the following result:
Proposition 1
(PPR [4]). For any symmetric probability distribution $p(y_{I_{n-1},I_n}, y_n\mid x)$, the lower convex closure of $(R, D_1,\dots,D_n)$ is achievable, where $E\left[d_{|J|}\left(X, g_J\left(Y_{I_{|J|},J}\right)\right)\right]\le D_{|J|}$, $|J|\le n$, and
$$R\ge\sum_{k=1}^{n-1}\frac{1}{k}H\left(Y_{kI_k}\mid Y_{I_{k-1},I_k}\right) + \frac{1}{n}I\left(Y_n; X\mid Y_{I_{n-1}I_n}\right) - \frac{1}{n}H\left(Y_{I_{n-1}I_n}\mid X\right)$$
A probability distribution $p(y_{I_{n-1},I_n}, y_n\mid x)$ is symmetric if for all $1\le r_i\le n$, $i\in I_{n-1}$, the joint distribution of $Y_n$ and all $(r_1 + r_2 + \cdots + r_{n-1})$ random variables, where any $r_i$ are chosen from the ith layer, conditioned on X, are the same.
We first notice that for distributed repair, reconstruction from n nodes does not make sense: since we can repair the last node from $n-1$ nodes, there can be no gain for a user in accessing all n nodes. The performance is therefore specified by $(D_1, D_2,\dots,D_{n-1})$. As a baseline, we thus consider the standard PPR scheme where we use at most $n-1$ nodes for the reconstruction. Now, in layer $n-1$, we just need a single common message (in standard PPR this happens at layer n). This message can be encoded using an $(n, n-1)$ MDS erasure code. We then get the following rate, which we state without proof as it is a simple modification of PPR:
Proposition 2.
For any symmetric probability distribution $p(y_{I_{n-2},I_n}, y_{n-1}\mid x)$, the lower convex closure of $(R, D_1,\dots,D_{n-1})$ is achievable with n nodes, using at most $n-1$ nodes for reconstruction, where $E\left[d_{|J|}\left(X, g_J\left(Y_{I_{|J|},J}\right)\right)\right]\le D_{|J|}$, $|J|\le n-1$, and
$$R\ge\sum_{k=1}^{n-2}\frac{1}{k}H\left(Y_{kI_k}\mid Y_{I_{k-1},I_k}\right) + \frac{1}{n-1}I\left(Y_{n-1}; X\mid Y_{I_{n-2}I_{n-1}}\right) - \frac{1}{n}H\left(Y_{I_{n-2}I_n}\mid X\right)$$
Notice that one should not think of this as an 'improved' PPR scheme; rather, it is the PPR scheme adapted to the special case here, where at most $n-1$ nodes are used for reconstruction.
For our repair coding scheme, we amend the PPR scheme, specifically the version in Proposition 2. We still use an $(n,k)$-SCEC at layers $k \le n-2$, but add a common message $U_k$ at each layer $k \le n-2$. At layer 1, this is a true common message that is duplicated on all nodes. At layers $k > 1$ it is a message stored with an $(n,k)$ MDS code. Common messages were shown to be necessary to achieve optimality for the two node case in [7]. We also use binning for repair of the correlated quantizations. A system schematic for a specific case can be seen in Figure 1 below. The addition of common messages strictly decreases the rate for repair in some cases; see Section 5.
The following is the main result of the paper, an achievable repair rate; this rate can be compared to the rate in Proposition 2. As above, we call a probability distribution $p(y_{I_{n-2},I_n}, u_{I_{n-2}}, y_{n-1}\mid x)$ symmetric if for all $1\le r_i\le n-1$, $i\in I_{n-2}$, and all $k\in I_{n-2}$, the joint distribution of $Y_{n-1}$, $U_k$, and all $(r_1 + r_2 + \cdots + r_{n-2})$ random variables, where any $r_i$ are chosen from the ith layer, conditioned on X, are the same.
Theorem 1
(Distributed repair). For any symmetric probability distribution $p(y_{I_{n-2},I_n}, u_{I_{n-2}}, y_{n-1}\mid x)$, the lower convex closure of $(R + R_r, D_1,\dots,D_{n-1})$ is achievable, where $E\left[d_{|J|}\left(X, g_J\left(Y_{I_{|J|},J}, U_{I_{|J|}}\right)\right)\right]\le D_{|J|}$, $|J|\le n-1$, and the information needed to encode operational information is
$$R > \sum_{k=1}^{n-2}\left[\frac{1}{k}H\left(Y_{kI_k}\mid U_{I_k}, Y_{I_{k-1}I_k}\right) - \frac{1}{n}H\left(Y_{kI_n}\mid X, U_{I_k}, Y_{I_{k-1}I_n}\right)\right] + \frac{1}{n-1}I\left(Y_{n-1}; X\mid Y_{I_{n-2}I_{n-1}}, U_{I_{n-2}}\right) + \sum_{k=1}^{n-2}\frac{1}{k}\left(H\left(U_k\mid Y_{I_{k-1}I_k}, U_{I_{k-1}}\right) - H\left(U_k\mid X, Y_{I_{k-1}I_n}, U_{I_{k-1}}\right)\right)$$
with additional information needed to encode repair information
$$R_r > \frac{1}{n-1}\sum_{k=1}^{n-2}\left[H\left(Y_{kn}\mid U_{I_k}, Y_{kI_{n-1}}, Y_{I_{k-1}I_n}\right) - \frac{1}{n}H\left(Y_{kI_n}\mid X, Y_{I_{k-1}I_n}, U_{I_k}\right)\right]^+$$
with $[x]^+ = \max\{0, x\}$.
Proof. 
There is a formal proof in Appendix A; the purpose here is to outline how the coding is done and how the rate expressions are obtained, without requiring deep knowledge of [4].
Consider first layer 1. We generate a codebook $C_{u1}$ by picking $2^{lR_{u1}}$ elements uniformly at random with replacement from the typical set according to the distribution $p_{U_1}(u_1)$. We also generate n independent random codebooks $C_{1I_n}$ drawn from the typical set according to $p_{Y_{11}}(y_{11})$, each with $2^{lR_1}$ codewords. We need to be able to find a codeword in $C_{u1}$ that is jointly typical with $x^l$ with high probability, which, from standard rate-distortion arguments, is the case if
$$R_{u1} = R'_{u1} > H(U_1) - H(U_1\mid X) = I(X; U_1)$$
This codeword is stored in all the nodes. We now need to be able to find n codewords from $C_{1I_n}$ that are jointly typical with $x^l$ and the chosen codeword $u_1^l\in C_{u1}$. There are about $2^{nlH(Y_{11})}$ (marginally) typical codeword combinations, and about $2^{lH(Y_{11},\dots,Y_{1n}\mid U_1, X)}$ sequence tuples that are jointly typical with a given $x^l$ and $u_1^l$ (see, e.g., [14] (Section 15.2)); the probability that a given codeword combination in $C_{1I_n}$ is jointly typical is therefore about $2^{l(H(Y_{11},\dots,Y_{1n}\mid U_1,X) - nH(Y_{11}))}$. The probability that no codeword combination is jointly typical is then about
$$\left(1 - 2^{l(H(Y_{11},\dots,Y_{1n}\mid U_1,X) - nH(Y_{11}))}\right)^{2^{nlR_1}} \le \exp\left(-2^{l\left(nR_1 - \left(nH(Y_{11}) - H(Y_{11},\dots,Y_{1n}\mid U_1,X)\right)\right)}\right)$$
The inequality is standard in rate-distortion theory; see [3,14]. Thus, if
$$nR_1 > nH(Y_{11}) - H(Y_{11},\dots,Y_{1n}\mid U_1, X)\tag{3}$$
there is a high probability that at least one of the $2^{nlR_1}$ codeword combinations is jointly typical.
The codewords in $C_{1j}$ are randomly binned into $2^{lR'_1}$ bins. At the time of decoding, the common codeword $u_1^l\in C_{u1}$ is available, as well as the bin number i of the codeword $y_{1j}^l\in C_{1j}$. The decoder looks for a codeword in bin i that is jointly typical with $u_1^l$. There is always one, the actual codeword, but if there is more than one, the decoding results in an error. The probability that a random codeword in $C_{1j}$ is jointly typical with $u_1^l$ is about $2^{l(H(Y_{11}\mid U_1) - H(Y_{11}))}$, as above, while there are about $2^{l(R_1 - R'_1)}$ codewords in each bin. By the union bound, the probability that at least one random codeword in the bin is jointly typical is approximately upper bounded by $2^{l(R_1 - R'_1)}2^{-l(H(Y_{11}) - H(Y_{11}\mid U_1))}$. Thus, if
$$R_1 - R'_1 < H(Y_{11}) - H(Y_{11}\mid U_1)\tag{4}$$
there is only one such codeword with high probability. Combining (3) and (4), we get
$$R'_1 > H(Y_{11}\mid U_1) - \frac{1}{n}H(Y_{11},\dots,Y_{1n}\mid U_1, X)$$
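In detail (our restatement of the algebra):
$$R'_1 > R_1 - H(Y_{11}) + H(Y_{11}\mid U_1) > H(Y_{11}\mid U_1) - \frac{1}{n}H(Y_{11},\dots,Y_{1n}\mid U_1, X)$$
where the first inequality is (4) rearranged and the second inserts the lower bound on $R_1$ from (3) divided by n.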
At layer $k < n-1$ we similarly generate a random codebook $C_{uk}$ with $2^{lR_{uk}}$ typical elements according to the marginal distribution $p_{U_k}(u_k)$, and n independent random codebooks $C_{kI_n}$ according to the distribution $p_{Y_{k1}}(y_{k1})$, each with $2^{lR_k}$ codewords. We need to be able to find a codeword in $C_{uk}$ that is jointly typical with $x^l$ and all the codewords chosen in the previous layers. This is possible if
$$R_{uk} > H(U_k) - H(U_k\mid X, Y_{I_{k-1}I_n}, U_{I_{k-1}})$$
with the same argument as for (3). We also need to be able to find an n-tuple of codewords from $C_{kI_n}$ that are jointly typical with all prior codewords and $x^l$, which is possible with high probability if (again as in (3))
$$nR_k > nH(Y_{k1}) - H(Y_{kI_n}\mid X, Y_{I_{k-1}I_n}, U_{I_k})$$
For $C_{uk}$, we generate n independent binning partitions, each with $2^{lR'_{uk}}$ bins. The bin number in the ith partition is stored in the ith node. When the decoder has access to k nodes, say nodes $1,\dots,k$, it needs to be able to find a unique codeword in the combined bin (the intersection of the k bins) that is jointly typical with the codewords from previous layers. The probability that a randomly selected codeword is jointly typical is about $2^{l(H(U_k\mid Y_{I_{k-1}I_k}, U_{I_{k-1}}) - H(U_k))}$, as above. There are about $2^{lR_{uk}}2^{-lkR'_{uk}}$ codewords in each combined bin. Therefore, if
$$kR'_{uk} > R_{uk} + H(U_k\mid Y_{I_{k-1}I_k}, U_{I_{k-1}}) - H(U_k)$$
or
$$R'_{uk} > \frac{1}{k}\left(H(U_k\mid Y_{I_{k-1}I_k}, U_{I_{k-1}}) - H(U_k\mid X, Y_{I_{k-1}I_n}, U_{I_{k-1}})\right)\tag{5}$$
with high probability there is only one jointly typical codeword in the combined bin. The decoder also needs to find a single k-tuple of codewords in the k bins for $C_{kI_k}$ that is jointly typical with $(U_{I_k}, Y_{I_{k-1}I_k})$. The probability that a random codeword tuple is jointly typical is about $2^{l(H(Y_{kI_k}\mid U_{I_k}, Y_{I_{k-1}I_k}) - kH(Y_{k1}))}$, while the number of codeword tuples in the k joint bins is about $2^{lkR_k}2^{-lkR'_k}$. With high probability there is only one such tuple if
$$k(R_k - R'_k) < kH(Y_{k1}) - H(Y_{kI_k}\mid U_{I_k}, Y_{I_{k-1}I_k})$$
or
$$R'_k > \frac{1}{k}H(Y_{kI_k}\mid U_{I_k}, Y_{I_{k-1}I_k}) - \frac{1}{n}H(Y_{kI_n}\mid X, U_{I_k}, Y_{I_{k-1}I_n})$$
(as in [13], this can be repeated for any collection of k nodes).
At layer $n-1$ only a single codebook is generated, and this is binned into n independent partitions. Upon receipt, in analogy with (5), the codeword can be found uniquely with high probability if
$$R'_{n-1} > \frac{1}{n-1}H(Y_{n-1}\mid Y_{I_{n-2}I_{n-1}}, U_{I_{n-2}}) - \frac{1}{n-1}H(Y_{n-1}\mid X, Y_{I_{n-2}I_n}, U_{I_{n-2}})$$
For repair, the $2^{nlR_k}$ joint codewords in $C_{k1}\times\cdots\times C_{kn}$ at layer $k \le n-2$ are binned into $2^{lR_{rk}}$ bins. The single bin number of the n chosen codewords is encoded with an $(n, n-1)$ MDS erasure code.
Now, suppose node n is lost and needs to be recovered. The repair node works from the bottom up. So, suppose the previous $k-1$ layers have been recovered, that is, $y_{I_{k-1}I_n}^l, u_{I_{k-1}}^l$ are known without error. First, $u_k^l$ is recovered, which can be done since $n-1\ge k$ nodes are used. The repair node can also decode the codewords in $C_{kI_{n-1}}$. It restores the bin number of the repair codeword from the erasure code. There are approximately $2^{l(nR_k - R_{rk})}$ codewords in the bin, but since it knows the codewords in $C_{kI_{n-1}}$, there are only about $2^{l(R_k - R_{rk})}$ valid ones. It searches the bin for valid codewords jointly typical with $y_{kI_{n-1}}^l, y_{I_{k-1}I_n}^l, u_{I_k}^l$. With high probability, there is only one such codeword if
$$R_k - R_{rk} < H(Y_{kn}) - H(Y_{kn}\mid U_{I_k}, Y_{kI_{n-1}}, Y_{I_{k-1}I_n})$$
(The right-hand side could be negative. This means that the lost codeword can be recovered from the surviving ones without extra repair information; then we just put $R_{rk} = 0$.) Then
$$R_{rk} > H(Y_{kn}\mid U_{I_k}, Y_{kI_{n-1}}, Y_{I_{k-1}I_n}) - \frac{1}{n}H(Y_{kI_n}\mid X, Y_{I_{k-1}I_n}, U_{I_k})\tag{6}$$
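In detail (our restatement): choosing $R_k$ arbitrarily close to its covering bound and inserting it into the preceding inequality gives
$$R_{rk} > H(Y_{k1}) - \frac{1}{n}H\left(Y_{kI_n}\mid X, Y_{I_{k-1}I_n}, U_{I_k}\right) - H(Y_{kn}) + H\left(Y_{kn}\mid U_{I_k}, Y_{kI_{n-1}}, Y_{I_{k-1}I_n}\right)$$
and by the symmetry of the distribution $H(Y_{k1}) = H(Y_{kn})$, which yields (6).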
There is at least one codeword in the bin, namely the correct one. Thus, if there is no error (more than one codeword in the bin), the repair is exact, as required by the exact repairability condition in Section 2. □
The above result can easily be adapted to the case of a repair node that collaborates with the operational nodes. There are only two differences:
  • The repair node can restore the full n-node distortion region. Therefore, the terminal single common codeword is not at layer $n-1$, but at layer n. At the same time, the repair node now has to store repair information for this last codeword.
  • For distributed repair, distributions are chosen to minimize $R + R_r$. For collaborative repair, distributions are chosen to minimize R, and $R_r$ is then as given for those distributions.
With this in mind, we get
Theorem 2
(Collaborative repair). For any symmetric probability distribution $p(y_{I_{n-1},I_n}, u_{I_{n-1}}, y_n\mid x)$, the lower convex closure of $(R, D_1,\dots,D_n)$ is achievable, where $E\left[d_{|J|}\left(X, g_J\left(Y_{I_{|J|},J}, U_{I_{|J|}}\right)\right)\right]\le D_{|J|}$, $|J|\le n$, and
$$R > \sum_{k=1}^{n-1}\left[\frac{1}{k}H\left(Y_{kI_k}\mid U_{I_k}, Y_{I_{k-1}I_k}\right) - \frac{1}{n}H\left(Y_{kI_n}\mid X, U_{I_k}, Y_{I_{k-1}I_n}\right)\right] + \frac{1}{n}I\left(Y_n; X\mid Y_{I_{n-1}I_n}, U_{I_{n-1}}\right) + \sum_{k=1}^{n-1}\frac{1}{k}\left(H\left(U_k\mid Y_{I_{k-1}I_k}, U_{I_{k-1}}\right) - H\left(U_k\mid X, Y_{I_{k-1}I_n}, U_{I_{k-1}}\right)\right)$$
The additional information the repair node has to store is
$$R_r > \sum_{k=1}^{n-1}\left[H\left(Y_{kn}\mid U_{I_k}, Y_{kI_{n-1}}, Y_{I_{k-1}I_n}\right) - \frac{1}{n}H\left(Y_{kI_n}\mid X, Y_{I_{k-1}I_n}, U_{I_k}\right)\right]^+ + \frac{1}{n}H\left(Y_n\mid Y_{I_{n-1}I_n}, U_{I_{n-1}}\right) - \frac{1}{n}H\left(Y_n\mid Y_{I_{n-1}I_n}, U_{I_{n-1}}, X\right)$$
The proof is nearly identical to the proof of Theorem 1, so it will be omitted.

4. The Two Level Case

In [9], the authors considered the situation where there are only two cases of node access: either we have access to all n nodes, or we have access to a given number $k < n$ of nodes; there are thus two levels of distortion, $(D_k, D_n)$. Importantly, they were able to derive the exact rate region for this case for Gaussian sources, one of the few cases where this is known apart from the original EC case [2]. This makes it an interesting case to consider for repair: at least we can upper bound the number of bits needed for repair by the achievable rate in Section 3. The paper [9] considered the vector Gaussian case, but we restrict ourselves to the scalar Gaussian case.
To fit into the framework of [9], we need to consider the case where there is a repair node, that is, Theorem 2. In that case, the scheme is as shown in Figure 1. $U_k$ represents a common codeword that is stored jointly on the operational nodes with an $(n,k)$ MDS code. If one server fails, this can be restored without additional information from the repair node, as $k \le n-1$. $Y_{k1},\dots,Y_{kn}$ represent individual codewords using SCEC (source-channel erasure code) codes from [4,13]; here, the repair is accomplished using correlation and a bin index, similar to the two node case. Finally, $Y_n$ represents refinement information, which can be repaired due to the $(n+1, n)$ MDS code.
The explicit rate constraints from Theorem 2 are
$$R > \frac{1}{k}H\left(Y_{kI_k}\mid U_k\right) + \frac{1}{n}H\left(Y_n\mid Y_{kI_n}, U_k\right) - \frac{1}{n}H\left(Y_{kI_n}, Y_n\mid X, U_k\right) + \frac{1}{k}\left(H(U_k) - H(U_k\mid X)\right)$$
with
$$R_r > \left[H\left(Y_{kn}\mid U_k, Y_{kI_{n-1}}\right) - \frac{1}{n}H\left(Y_{kI_n}\mid X, U_k\right)\right]^+ + \frac{1}{n}H\left(Y_n\mid Y_{kI_n}, U_k\right) - \frac{1}{n}H\left(Y_n\mid Y_{kI_n}, U_k, X\right)$$
We consider an i.i.d. Gaussian source with $x_i\sim\mathcal{N}(0,1)$ and the quadratic distortion measure $\tilde d_{|J|}(x_i,\hat x_i) = (x_i - \hat x_i)^2$. For this situation, we can calculate the achievable repair rate explicitly. We recall that the problem setup is that R is fixed to the optimum rate from [9]. We then obtain:
Theorem 3.
In the Gaussian two level case, we have the following bounds on the repair rate:
1. For $k\left(D_k^{-1}-1\right)^{-1} - n\left(D_n^{-1}-1\right)^{-1} \le 0$, a common message is used and achieves
$$R_r \le \frac{1}{2}\log\frac{D_k(n-k)}{D_k(n-k-1)+D_n}$$
For $k = n-1$ the upper bound is tight.
2. For $0 < k\left(D_k^{-1}-1\right)^{-1} - n\left(D_n^{-1}-1\right)^{-1} \le n-k$, no common message is used and
$$R_r \le \frac{1}{2}\log\frac{(D_k-1)n(n-k)\left(\frac{k(D_k-D_n)}{(D_k-1)D_n(k-n)}\right)^{1/n}}{k\left(-D_kn+D_n+n-1\right)+(D_k-1)(n-1)n}$$
For $k = n-1$ the upper bound is tight.
3. For $k\left(D_k^{-1}-1\right)^{-1} - n\left(D_n^{-1}-1\right)^{-1} > n-k$, no common message is used and the exact repair rate is
$$R_r = R = \frac{1}{2n}\log\frac{1}{D_n}$$
for all k and n.
We will discuss some implications of this result. The converse is provided by the bound (A8), $(n-1)R + R_r \ge \frac{1}{2}\log\frac{1}{D_n}$, which is simply the requirement that the repair node together with the surviving nodes should be able to restore the source with distortion $D_n$. This is clearly also a converse for functional repair, which could indicate that relaxing to functional repair cannot decrease rates. For $k = n-1$, the theorem provides the exact repair rate; without using common messages, we could not have achieved the bound. We can compare with separate repair and multiple description coding, as mentioned in the introduction. For case 3 of the theorem, separation is optimal, but for the other cases $R_r < R$. For example, for $n = 10$, $k = 5$, $D_k = 0.5$, $D_n = 0.48$, we get $R = 0.06$, $R_r = 0.02$ for case 1.
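To make the case distinctions easy to apply, the following small helper (our own sketch based on the expressions in Theorem 3, not code from the paper) classifies $(n, k, D_k, D_n)$ into the three regimes and evaluates the corresponding repair-rate expression; we use natural logarithms, so the outputs are in nats.

```python
import math

def theorem3_repair_rate(n: int, k: int, D_k: float, D_n: float):
    """Classify (n, k, D_k, D_n) into the three cases of Theorem 3 and
    return (case number, repair-rate value). Natural logs, so nats."""
    t = k / (1 / D_k - 1) - n / (1 / D_n - 1)  # quantity in the case conditions
    if t <= 0:
        # Case 1: a common message is used.
        return 1, 0.5 * math.log(D_k * (n - k) / (D_k * (n - k - 1) + D_n))
    if t <= n - k:
        # Case 2: no common message.
        A = (D_k - 1) * D_n * (k - n) / (k * (D_k - D_n))
        num = (D_k - 1) * n * (n - k) * A ** (-1.0 / n)
        den = k * (-D_k * n + D_n + n - 1) + (D_k - 1) * (n - 1) * n
        return 2, 0.5 * math.log(num / den)
    # Case 3: separation is optimal and R_r = R.
    return 3, math.log(1 / D_n) / (2 * n)

print(theorem3_repair_rate(10, 5, 0.5, 0.48))  # falls in the case 1 regime
print(theorem3_repair_rate(3, 2, 0.5, 0.3))    # falls in the case 2 regime
```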

5. Example Gaussian Case

Figure 2 shows typical numerical results. All curves are for two levels of constraints, $(D_1, D_2)$, but a variable number of nodes. First, from the bottom, we have the curve for the optimum region for the two node problem according to EC [2,3]. Notice that this is achieved without any refinement information, using only correlation between the base layer random variables; refinement information is only required for $D_1 > \frac{1}{2}$ and $D_2 < 2D_1 - 1$. Second, we have the curves for the three node problem, where we use at most two nodes for reconstruction, either using [4] (Section V) directly (ignoring the $D_3$ constraint) or using Theorem 1 without repair. It can be noticed that using Proposition 2 gives a slight improvement; this is not due to the common message, but to the fact that PPR uses $n-1$ codewords in the last layer, while the modified PPR uses only one. For the 4 node case, we use a (4,1)-SCEC and a (4,2)-SCEC successively, as well as a (4,1)-MDS and a (4,2)-MDS common message. Therefore, we have two variables $U_1$ and $U_2$ for the common messages, and $Y_{1i}$ and $Y_{2i}$, $i = 1, 2, 3, 4$, for the SCECs. As a result, the overall rate of the 4 node system improves over that of the 3 node system, in which the common message and SCEC were used only once, just as the overall rate of the 2 node system improves over that of the 3 node system. We see that a common message gives a clear improvement.

6. Conclusions

The paper has derived achievable rates for repair of multiple description distributed storage, which in some cases are optimal. Our solution shows that joint repair and multiple description coding beats separate coding in many cases. It also shows that it is suboptimal for repair to simply take a standard multiple description code and add repair information. Rather, the multiple description code has to be designed with repair in mind. In this paper, we do this by adding common messages.
This paper is only a first step in solving repair of multiple description distributed storage. For one thing, we have assumed that the repair bandwidth is unlimited. When the required repair bandwidth is also of concern as in [1], an entirely new set of constraints comes into play. We will consider this in a later paper.

Author Contributions

Conceptualization, A.H.-M. and J.L.; methodology, A.H.-M. and J.L.; software, H.Y. and M.K.; validation, H.Y. and M.K.; formal analysis, A.H.-M.; resources, J.L.; writing—original draft preparation, A.H.-M.; writing—review and editing, A.H.-M. and J.L.; visualization, H.Y. and M.K.; supervision, J.L.; project administration, A.H.-M.; funding acquisition, A.H.-M. All authors have read and agreed to the published version of the manuscript.

Funding

The research was funded in part by the NSF grant CCF-1908957.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript, or in the decision to publish the results.

Abbreviations

The following abbreviations are used in this manuscript:
MDS   Maximum distance separable (code)
EC    El-Gamal Cover (coding scheme)
ZB    Zhang-Berger (coding scheme)
PPR   Puri Pradhan Ramchandran (coding scheme)
SCEC  Source-channel erasure code

Appendix A. Proof of Theorem 1

Unlike the proof outline, which is intended to stand by itself, the formal proof is a modification of the proof of Theorem 2 in [4], and reading it requires a good familiarity with [4]. We will not repeat the proof in [4], but only give the new elements. The proof in this paper adds common messages, which require a separate codebook generation and an analysis of additional error events. It also adds repair codebooks and an analysis of repair errors.
We let $T_\epsilon^l(X)$ denote the strongly $\epsilon$-typical set for X.
The coding scheme for repair uses MDS codes in several places. These can be put in the binning framework of PPR [13]. However, it is easier to think of them as pure channel codes. We can state this as follows:
Remark A1.
A message $M\in\{1,\dots,2^{lR}\}$ is stored on n nodes, any k of which (arbitrarily chosen) are accessed for decoding. With $lR' > \frac{1}{k}lR$ bits on each node, decoding is possible with error probability $P(E)\to 0$ as $l\to\infty$.

Appendix A.1. Codebook Generation

The codebooks $C_{I_{n-2}I_n}$ are generated and binned exactly as in [4]. The difference from [4] is that there is no nth layer, and that at layer $n-1$ there is only one codebook $C_{n-1}$. The codebook $C_{n-1}$, of size $2^{lR_{n-1}}$, is generated like $C_n$ in [4], but then stored on the nodes with an $(n, n-1)$ MDS code.
We also generate $n-2$ common codebooks $C_{uI_{n-2}}$ by drawing $2^{lR_{uk}}$ codewords $u_k^{(l)}(1),\dots,u_k^{(l)}(2^{lR_{uk}})$ independently with replacement from the set $T_\epsilon^l(U_k)$ according to a uniform distribution. The indices for $C_{uk}$, $k = 2,\dots,n-2$, are next binned. Let $\xi_{uk} = 2^{l(R_{uk} - R'_{uk} + \gamma_{uk})}$ for some $\gamma_{uk} > 0$ and make $2^{lR'_{uk}}$ bins. For each bin, select $\xi_{uk}$ numbers from the set $\{1,\dots,2^{lR_{uk}}\}$, uniformly and with replacement. They are finally coded with an $(n,k)$ MDS erasure code.
We finally generate $n-1$ repair codebooks through binning. First, if
$$0 > H\left(Y_{kn}\mid U_{I_k}, Y_{kI_{n-1}}, Y_{I_{k-1}I_n}\right) - \frac{1}{n}H\left(Y_{kI_n}\mid X, Y_{I_{k-1}I_n}, U_{I_k}\right)\tag{A1}$$
it turns out, as will be seen later, that the lost codeword can be recovered from the remaining ones with high probability. In that case, we set $R_{rk} = 0$ and store no extra repair information. For consistency, we think of there being one bin at layer k containing all codewords. Otherwise, we let $\xi_{rk} = 2^{l(nR_k - R_{rk} + \gamma_{rk})}$ for some $\gamma_{rk} > 0$ and make $2^{lR_{rk}}$ bins. For each bin, we select $\xi_{rk}$ vectors from the set $\{1,\dots,2^{lR_k}\}^n$, uniformly and with replacement. The bin indices are further coded with an $(n, n-1)$ MDS erasure code.

Appendix A.2. Encoding

Given a source sequence $x^{(l)}\in\mathcal{X}^l$, we find codewords so that
$$\left(x^{(l)}, u_{I_{n-2}}^{(l)}(V_{I_{n-2}}), y_{I_{n-2}I_n}^{(l)}(Q_{I_{n-2}I_n}), y_{n-1}^{(l)}(Q_{n-1})\right)$$
are jointly typical. The binning of $Q_{I_{n-2}I_n}$ and $Q_{n-1}$ is done exactly as in [4] to obtain bin indices $B_{I_{n-2}I_n}$, $B_{n-1}$. The bin index $B_{n-1}$ is further coded with the $(n, n-1)$ MDS code. For $V_k$, we find the smallest bin index $B_{uk}$ whose bin contains $V_k$ (if $V_k$ is in no bin, $B_{uk} = 0$), and this is further coded with the $(n,k)$ MDS code.
For repair, for those $k\in I_{n-1}$ where repair information is needed, we find the smallest bin index $W_k$ so that $Q_{kI_n}$ is in the corresponding bin; if no bin contains $Q_{kI_n}$, we put $W_k = 0$. These are then coded with the $(n, n-1)$ MDS code.

Appendix A.3. Decoding

We assume nodes $1, 2,\dots,j$ are available. The bin indices $B_{uI_{j'}}$ are decoded from the MDS codes, where $j' = \min\{j, n-2\}$. The decoding now is similar to [4], except that there is also a common codeword. Consider decoding at layer $k\in\{2,\dots,j'\}$. First, we find an index $V_k$ in bin $B_{uk}$ so that
$$\left(y_{I_{k-1}I_j}^{(l)}, u_k^{(l)}(V_k), u_{I_{k-1}}^{(l)}\right)\in T_\epsilon^l\left(Y_{I_{k-1}I_j}, U_{I_k}\right)\tag{A2}$$
Next, for any size-k subset $S\subset I_j$, the decoder looks in the bins $B_{kS}$ for codewords $y_{kS}^{(l)}$ so that
$$\left(y_{kS}^{(l)}, y_{I_{k-1}I_j}^{(l)}, u_{I_k}^{(l)}\right)\in T_\epsilon^l\left(Y_{kI_k}, Y_{I_{k-1}I_j}, U_{I_k}\right)$$
If $j = n-1$, $B_{n-1}$ is first recovered from the MDS code. Then, the above procedure is repeated (there is no $U_{n-1}$).
The reconstructions $\hat x^{(l)}$ are standard, as in [4].

Appendix A.4. Repair

Without loss of generality, and to simplify notation, we can assume that node n fails. The repair is done layer by layer. At layer 1, we copy $V_1$ from any node to the replacement node n. Next, from the $n-1$ surviving nodes we decode the repair bin index $W_1$ from the MDS code; if there is no extra repair information, we put $W_1 = 1$. We know $Q_{1I_{n-1}}$ from the surviving nodes. In bin $W_1$, we look for an index $Q_{1n}$ so that the corresponding codeword satisfies $\left(y^{(l)}(Q_{1I_n}), u_1^{(l)}(V_1)\right)\in T_\epsilon^l(Y_{1I_n}, U_1)$; if there is more than one such index, there is a repair error. We then store the recovered $Q_{1n}$ in the replacement node n.
The following layers proceed in almost the same way. However, now, to recover the common message $V_k$, we arbitrarily choose k of the surviving nodes and decode $V_k$ just as in usual operation. The decoded $V_k$ is then encoded with the exact same MDS code, and we store the corresponding codeword on the replacement node n. We next find an index $Q_{kn}$ in bin $W_k$ so that $\left(y_{I_kI_n}^{(l)}(Q_{I_kI_n}), u_{I_k}^{(l)}(V_{I_k})\right)\in T_\epsilon^l(Y_{I_kI_n}, U_{I_k})$.
On the last layer, we simply decode $Q_{n-1}$ from the surviving nodes as usual, re-encode it with the same MDS code, and store the recovered bin index on the new node n.
We notice that this repair is exact: the information on the restored node is exactly the same as on the failed node, except if a repair error happens.

Appendix A.5. Analysis of Decoding Error

We have some slightly modified error events compared to [4] and some additional ones. We find it necessary to write these down explicitly:
  • $E_0$: $x^{(l)}\notin T_\epsilon^l(X)$.
  • $E_1$: There exist no indices so that
$$\left(x^{(l)}, u_{I_{n-2}}^{(l)}(V_{I_{n-2}}), y_{I_{n-2}I_n}^{(l)}(Q_{I_{n-2}I_n}), y_{n-1}^{(l)}(Q_{n-1})\right)\in T_\epsilon^l\left(X, U_{I_{n-2}}, Y_{I_{n-2}I_n}, Y_{n-1}\right)$$
  • $E_2$: Not all of the indices $(B_{2I_n},\dots,B_{(n-2)I_n}, B_{n-1})$ are greater than zero.
  • $E_3$: For some subset $S\subset I_n$ with $|S| = k\in\{2,\dots,n-2\}$, there exists some other $Q'_{kS}$ in the bins $B_{kS}$ so that (we use a slightly different notation for $E_3$ compared to [4], which we think is clearer)
$$\left(y_{kS}^{(l)}(Q'_{kS}), y_{I_{k-1}I_j}^{(l)}, u_{I_k}^{(l)}\right)\in T_\epsilon^l\left(Y_{kI_k}, Y_{I_{k-1}I_k}, U_{I_k}\right)$$
  • $E_4$: Not all of the indices $B_{uk}$ are greater than zero.
  • $E_5$: For some $2\le k\le n-2$, there exists another index $V'_k\ne V_k$ in bin $B_{uk}$ so that
$$\left(y_{I_{k-1}I_j}^{(l)}, u_k^{(l)}(V'_k), u_{I_{k-1}}^{(l)}\right)\in T_\epsilon^l\left(Y_{I_{k-1}I_k}, U_{I_k}\right)$$
  • $E_6$: There is a decoding error in the $(n,k)$ MDS erasure code for $B_{uk}$.
  • $E_7$: There is a decoding error in the $(n, n-1)$ MDS erasure code for $B_{n-1}$.
First, by Remark A1, $P(E_6), P(E_7)\to 0$ as long as the rates before the MDS codes are scaled appropriately.
As in [4], we have $P(E_0)\to 0$ as $l\to\infty$. For $E_1$, as in [4], we define $E_{1i}$ as an encoding error on layer i given that the previous layers have been encoded correctly and, in addition here, that $u_i^{(l)}$ has been encoded correctly. Then, as in [4], we find that $P(E_{1i})\to 0$ if
$$nR_1 > nH(Y_{11}) - H(Y_{1I_n}\mid X, U_1)$$
$$nR_i > nH(Y_{i1}) - H(Y_{iI_n}\mid X, Y_{I_{i-1}I_n}, U_{I_i})\tag{A3}$$
$$nR_{n-1} > I(Y_{n-1}; X, Y_{I_{n-2}I_n}, U_{I_{n-2}})$$
with the difference being the addition of the U variables. Similarly, we can define $E_{1i}^u$ as an encoding error of $u_i^{(l)}$ given that the previous layers have been encoded correctly, and we similarly have that $P(E_{1i}^u)\to 0$ if
$$R_{u1} > H(U_1) - H(U_1\mid X)$$
$$R_{ui} > H(U_i) - H(U_i\mid X, Y_{I_{i-1}I_n}, U_{I_{i-1}})\tag{A4}$$
The proof that $P(E_2)\to 0$ is unchanged from [4], and the proof that $P(E_4)\to 0$ is similar.
The proof that $P(E_3)\to 0$ is similar to [4], except that at the time of decoding at layer k, the decoder has access to $u_{I_k}^{(l)}$. The relevant probability of decoding error at layer k is therefore $P(E_{3k}\mid E_{3I_{k-1}}^c, E_2^c, E_4^c, E_{5I_k}^c, E_6^c, E_7^c)$, and since we search for codewords in $T_\epsilon^l\left(Y_{kI_k}, Y_{I_{k-1}I_j}, U_{I_k}\right)$, the condition for this error probability converging to zero is
$$R'_k > R_k - H(Y_{k1}) + \frac{1}{k}H\left(Y_{kI_k}\mid U_{I_k}, Y_{I_{k-1}I_k}\right)$$
instead of [4] (A17).
To prove that $P(E_5)\to 0$, we let $E_{5k}$ be the decoding error on layer k, and then bound $P_{5k} = P(E_{5k}\mid E_{3I_{k-1}}^c, E_2^c, E_4^c, E_{5I_{k-1}}^c, E_6^c, E_7^c)$. If we pick a random codeword $u_k^{(l)}\in T_\epsilon^l(U_k)$, the probability that it is jointly typical, i.e., the event (A2), is
$$P\le 2^{-l\left(I(U_k; Y_{I_{k-1}I_k}, U_{I_{k-1}}) - \delta(\epsilon)\right)}$$
There are $\xi_{uk} = 2^{l(R_{uk} - R'_{uk} + \gamma_{uk})}$ elements in each bin, and therefore
$$P_{5k}\le\xi_{uk}P$$
If we let $\gamma_{uk} > \delta(\epsilon)$, we have $P_{5k}\to 0$ if
$$R_{uk} - R'_{uk} < I(U_k; Y_{I_{k-1}I_k}, U_{I_{k-1}})$$
Together with (A4), this gives (5).

Appendix A.6. Analysis of Repair Error

If any of $E_4$–$E_7$ from above happen, there is also a repair error. Notice that at the time of repair, we have access to $n-1$ nodes, and we can therefore use the decoding for $n-1$ nodes; in that case, we have proven that $\sum_{i=4}^{7}P(E_i)\to 0$ as $l\to\infty$. We have the following additional repair error events:
  • $E_{r1}$: Some $W_k = 0$ for $k\in I_{n-2}$.
  • $E_{r2}$: For $k\in I_{n-2}$, there exists another bin index $Q'_{kn}$ in bin $W_k$ so that
$$\left(y_{kI_n}^{(l)}(Q'_{kI_n}), y_{I_{k-1}I_n}^{(l)}(Q_{I_{k-1}I_n}), u_{I_k}^{(l)}(V_{I_k})\right)\in T_\epsilon^l\left(Y_{I_kI_n}, U_{I_k}\right)\tag{A6}$$
  • $E_{r3}$: For $k\in I_{n-2}$, there is a decoding error in the $(n, n-1)$ MDS erasure code for $W_k$.

Appendix A.6.1. Bounding Er1

In total, over all bins, we pick $N = 2^{lR_{rk}}\xi_{rk} = 2^{l(nR_k + \gamma_{rk})}$ elements with replacement from a set of size $2^{nlR_k}$. The probability that a particular element was never picked is then $P(E_{r1}) = \left(1 - 2^{-nlR_k}\right)^N$ and
$$\log P(E_{r1}) = N\log\left(1 - 2^{-nlR_k}\right)\approx -N2^{-nlR_k} = -2^{l\gamma_{rk}}\to -\infty\quad\text{as } l\to\infty$$

Appendix A.6.2. Bounding Er2

First, we will argue that if (A1) is satisfied, we can predict $y_{kn}^{(l)}$ with probability approaching one. We can state this as follows: if we pick a random $y_{kn}^{(l)}\in T_\epsilon^l(Y_{kn})$, what is the probability P that
$$\left(y_{kn}^{(l)}, y_{kI_{n-1}}^{(l)}(Q_{kI_{n-1}}), y_{I_{k-1}I_n}^{(l)}(Q_{I_{k-1}I_n}), u_{I_k}^{(l)}(V_{I_k})\right)\in T_\epsilon^l\left(Y_{I_kI_n}, U_{I_k}\right)?$$
This is actually a standard channel coding problem, so we get
$$P\le 2^{-l\left(I(Y_{kn}; Y_{kI_{n-1}}, Y_{I_{k-1}I_n}, U_{I_k}) - \delta(\epsilon)\right)}\tag{A7}$$
Since the codebook $C_{kn}$ has $2^{lR_k}$ elements, we then have
$$P(E_{r2k})\le 2^{lR_k}P$$
Thus, $P(E_{r2k})\to 0$ as $l\to\infty$ if
$$R_k < H(Y_{kn}) - H\left(Y_{kn}\mid Y_{kI_{n-1}}, Y_{I_{k-1}I_n}, U_{I_k}\right) - \delta(\epsilon)$$
Now, in consideration of (A5), there is no gain from making $R_k$ larger than needed. Thus, $R_k$ is chosen arbitrarily close to the limit given by (A3), and we therefore have $P(E_{r2k})\to 0$ if
$$H(Y_{k1}) - \frac{1}{n}H\left(Y_{kI_n}\mid X, Y_{I_{k-1}I_n}, U_{I_k}\right) < H(Y_{kn}) - H\left(Y_{kn}\mid Y_{kI_{n-1}}, Y_{I_{k-1}I_n}, U_{I_k}\right) - \delta(\epsilon)$$
which is (A1).
Now, turn to the case when (A1) is not satisfied. We look for vectors $(Q'_{k1}, Q'_{k2},\dots,Q'_{kn})\in\{1,\dots,2^{lR_k}\}^n$ that
  1. are in the bin indicated by $W_k$;
  2. have $Q'_{ki} = Q_{ki}$, $i\le n-1$;
  3. are jointly typical, i.e., satisfy (A6).
For condition 3, (A7) is still valid. Each bin contains $\xi_{rk} = 2^{l(nR_k - R_{rk} + \gamma_{rk})}$ vectors. Each of these has probability $P_2 = 2^{-l(n-1)R_k}$ of satisfying condition 2. Therefore,
$$P(E_{r2k})\le\xi_{rk}P_2P = 2^{l(R_k - R_{rk} + \gamma_{rk})}P$$
If we choose $\gamma_{rk} > \delta(\epsilon)$, we have $P(E_{r2k})\to 0$ as $l\to\infty$ if
$$R_k - R_{rk} < H(Y_{kn}) - H\left(Y_{kn}\mid Y_{kI_{n-1}}, Y_{I_{k-1}I_n}, U_{I_k}\right)$$
which together with (A3) and the argument above leads to (6).

Appendix B. Proof of Theorem 3

We use the following simple converse: when one node fails and the remaining $n-1$ nodes collaborate with the repair node, they have to be able to restore X with distortion $D_n$. Therefore,
$$(n-1)R + R_r \ge \frac{1}{2}\log\frac{1}{D_n}\tag{A8}$$
While the calculations in the proof are in principle straightforward, we include some detail to make it simpler for readers to further develop the results. The three different cases in the Theorem are as in [9] (Section VI.A). We put
$$Y_{ki} = X + Q_{ki},\qquad U_k = X + Q_u,\qquad Y_n = X + Q_n$$
with the Q variables zero-mean Gaussian, $E[Q_u^2] = \sigma_u^2$, $E[Q_{ki}^2] = \sigma_k^2$, $E[Q_n^2] = \sigma_n^2$, $E[Q_{ki}Q_{kj}] = \rho\sigma_k^2$ for $i\ne j$, and all other noise variables uncorrelated. Let
$$\mathbf{R}_k = (1-\rho)I + \rho\mathbf{1}\mathbf{1}^T,\qquad \mathbf{R}_k^{-1} = \frac{1}{1-\rho}I - \frac{\rho}{(1-\rho)(1+(k-1)\rho)}\mathbf{1}\mathbf{1}^T$$
Here, 1 is a column vector of all 1s, so 1 1 T is a matrix of all ones.
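Before using these expressions, here is a small numerical sanity check (ours, not from the paper) of the quoted closed-form inverse:

```python
import numpy as np

# A quick numerical check (ours) of the closed-form inverse of
# R_k = (1 - rho) I + rho 11^T used throughout Appendix B.
k, rho = 4, 0.3
J = np.ones((k, k))                      # the all-ones matrix 11^T
R = (1 - rho) * np.eye(k) + rho * J
R_inv = (np.eye(k) - rho / (1 + (k - 1) * rho) * J) / (1 - rho)
assert np.allclose(R @ R_inv, np.eye(k))
```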
We first calculate the distortions. With
$$\mathbf{D}_k = \begin{pmatrix}\sigma_u^2 & 0\\ 0 & \sigma_k^2\mathbf{R}_k\end{pmatrix}$$
the k-node distortion is
$$D_k = \left(1 + \mathbf{1}^T\mathbf{D}_k^{-1}\mathbf{1}\right)^{-1} = \left(1 + \frac{1}{\sigma_u^2} + \frac{k}{\sigma_k^2\left(1+(k-1)\rho\right)}\right)^{-1} = \frac{(1+(k-1)\rho)\sigma_k^2\sigma_u^2}{(1+(k-1)\rho)\sigma_k^2\sigma_u^2 + (1+(k-1)\rho)\sigma_k^2 + k\sigma_u^2}$$
Similarly, with
$$\mathbf{Q}_2 = \begin{pmatrix}\sigma_n^2 & 0 & 0\\ 0 & \sigma_u^2 & 0\\ 0 & 0 & \sigma_k^2\mathbf{R}_n\end{pmatrix}$$
we get
$$D_n = \left(1 + \mathbf{1}^T\mathbf{Q}_2^{-1}\mathbf{1}\right)^{-1} = \left(1 + \frac{1}{\sigma_n^2} + \frac{1}{\sigma_u^2} + \frac{n}{\sigma_k^2\left(1+(n-1)\rho\right)}\right)^{-1}$$
The $D_k$ distortion constraint is always satisfied with equality, and therefore
$$\sigma_k^2 = \frac{kD_k\sigma_u^2}{(1+(k-1)\rho)\left(\sigma_u^2 - D_k\sigma_u^2 - D_k\right)}$$
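As a quick consistency check (ours), this closed form for $\sigma_k^2$ can be verified numerically against the linear-MMSE expression for $D_k$ above; the parameter values are arbitrary.

```python
import numpy as np

# Hypothetical check (ours) that the closed form for sigma_k^2 hits the
# target D_k = (1 + 1/sigma_u^2 + k/(sigma_k^2 (1 + (k-1) rho)))^{-1}.
k, rho, sigma_u2, D_k = 3, 0.2, 1.5, 0.4
sigma_k2 = k * D_k * sigma_u2 / ((1 + (k - 1) * rho) * (sigma_u2 - D_k * sigma_u2 - D_k))

# Linear-MMSE distortion of X from (U_k, Y_{k,1},...,Y_{k,k}), computed directly.
noise_cov = np.zeros((k + 1, k + 1))
noise_cov[0, 0] = sigma_u2
noise_cov[1:, 1:] = sigma_k2 * ((1 - rho) * np.eye(k) + rho * np.ones((k, k)))
ones = np.ones(k + 1)
D = 1.0 / (1.0 + ones @ np.linalg.solve(noise_cov, ones))
assert abs(D - D_k) < 1e-10
```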
In general, we can write
$$h(X\mid\mathbf{Y}) = \frac{1}{2}\log\left((2\pi e)^n\det K_{X|Y}\right) = \frac{1}{2}\log\left((2\pi e)^n\det\left(K_{XX} - K_{YX}^TK_{YY}^{-1}K_{YX}\right)\right)$$
so we just need the various conditional covariances:
$$K_{Y_{kI_k}\mid U_k} = \mathbf{1}\mathbf{1}^T + \sigma_k^2\mathbf{R}_k - \frac{1}{1+\sigma_u^2}\mathbf{1}\mathbf{1}^T = \sigma_k^2\mathbf{R}_k + \frac{\sigma_u^2}{1+\sigma_u^2}\mathbf{1}\mathbf{1}^T$$
$$K_{(Y_{kI_n},U_k)} = \mathbf{1}\mathbf{1}^T + \mathbf{Q},\qquad \mathbf{Q} = \begin{pmatrix}\sigma_u^2 & 0\\ 0 & \sigma_k^2\mathbf{R}_n\end{pmatrix},\qquad K_{(Y_{kI_n},U_k)}^{-1} = \mathbf{Q}^{-1} - \frac{\mathbf{Q}^{-1}\mathbf{1}\mathbf{1}^T\mathbf{Q}^{-1}}{1+\mathbf{1}^T\mathbf{Q}^{-1}\mathbf{1}}$$
$$K_{Y_n\mid Y_{kI_n}U_k} = 1 + \sigma_n^2 - \mathbf{1}^TK_{(Y_{kI_n},U_k)}^{-1}\mathbf{1} = 1 + \sigma_n^2 - \frac{(1+(n-1)\rho)\sigma_k^2 + n\sigma_u^2}{(1+(n-1)\rho)\sigma_k^2(\sigma_u^2+1) + n\sigma_u^2}$$
$$K_{Y_{kI_n}\mid X,U_k} = K_{Q_k} = \sigma_k^2\mathbf{R}_n,\qquad K_{Y_n\mid X,U_k} = \sigma_n^2$$
We need
$$\det K_{Y_{kI_k}\mid U_k} = \left(1 + \frac{\sigma_u^2}{1+\sigma_u^2}\cdot\frac{k}{\sigma_k^2(1+(k-1)\rho)}\right)\det\left(\sigma_k^2\mathbf{R}_k\right) = \left(1 + \frac{\sigma_u^2}{1+\sigma_u^2}\cdot\frac{k}{\sigma_k^2(1+(k-1)\rho)}\right)\left((1-\rho)\sigma_k^2\right)^k\left(1+\frac{k\rho}{1-\rho}\right)$$
$$\det K_{Y_{kI_n}\mid X,U_k} = \det\left(\sigma_k^2\mathbf{R}_n\right) = \left((1-\rho)\sigma_k^2\right)^n\left(1+\frac{n\rho}{1-\rho}\right)$$
Then we get
$$R = \frac{1}{2k}\log\det K_{Y_{kI_k}\mid U_k} + \frac{1}{2n}\log\frac{K_{Y_n\mid Y_{kI_n}U_k}}{\sigma_n^2} - \frac{1}{2n}\log\left(\left((1-\rho)\sigma_k^2\right)^n\left(1+\frac{n\rho}{1-\rho}\right)\right) + \frac{1}{2k}\log\left(1+\frac{1}{\sigma_u^2}\right)$$
For repair we need
$$K_{(U_k,Y_{kI_{n-1}})} = \mathbf{1}\mathbf{1}^T + \begin{pmatrix}\sigma_u^2 & 0\\ 0 & \sigma_k^2\mathbf{R}_{n-1}\end{pmatrix}$$
with its inverse again given by the rank-one update formula above, the cross covariance
$$K_{Y_{kn},(U_k,Y_{kI_{n-1}})} = \begin{pmatrix}1 & (1+\rho\sigma_k^2)\mathbf{1}^T\end{pmatrix}$$
and $K_{Y_{kI_n}\mid X,U_k} = \sigma_k^2\mathbf{R}_n$, so that
$$K_{Y_{kn}\mid U_kY_{kI_{n-1}}} = 1 + \sigma_k^2 - K_{Y_{kn},(U_k,Y_{kI_{n-1}})}K_{(U_k,Y_{kI_{n-1}})}^{-1}K_{Y_{kn},(U_k,Y_{kI_{n-1}})}^T$$
From this,
$$R_r = \frac{1}{2}\log K_{Y_{kn}\mid U_kY_{kI_{n-1}}} - \frac{1}{2n}\log\left(\left((1-\rho)\sigma_k^2\right)^n\left(1+\frac{n\rho}{1-\rho}\right)\right) + \frac{1}{2n}\log\frac{K_{Y_n\mid Y_{kI_n}U_k}}{\sigma_n^2}$$
For case 1, we set $\sigma_n^2 = \infty$ and $\rho = 0$. Then
$$R = \frac{1}{2k}\log\frac{1}{D_k}$$
independent of $\sigma_u^2$. We choose $\sigma_u^2$ so that we get exactly $D_n$. Solving for $\sigma_u^2$ and inserting in (A14) results in
$$R_r = \frac{1}{2}\log\frac{D_k(n-k)}{D_k(n-k-1)+D_n}$$
Then,
$$(n-1)R + R_r = \frac{1}{2}\log\left[\frac{D_k(n-k)}{D_k(n-k-1)+D_n}\left(\frac{1}{D_k}\right)^{(n-1)/k}\right]$$
For $k = n-1$ we get
$$(n-1)R + R_r = \frac{1}{2}\log\left[\frac{D_k}{D_n}\cdot\frac{1}{D_k}\right] = \frac{1}{2}\log\frac{1}{D_n}$$
which achieves (A8).
For case 2, we put $\sigma_u^2 = \sigma_n^2 = \infty$. We solve for $\rho$ so that we exactly achieve $D_n$,
$$\rho = \frac{D_k\left((n-k)D_n + k\right) - D_nn}{D_k\left(D_nn - k(D_n + n - 1)\right) + D_n(k-1)n}$$
giving
$$R = \frac{1}{2}\log\left[\left(\frac{(D_k-1)D_n(k-n)}{k(D_k-D_n)}\right)^{-1/n}\left(\frac{(D_n-1)(k-n)}{n(D_k-D_n)}\right)^{1/k}\right]$$
$$R_r = \frac{1}{2}\log\frac{(D_k-1)n(n-k)\left(\frac{(D_k-1)D_n(k-n)}{k(D_k-D_n)}\right)^{-1/n}}{k\left(-D_kn + D_n + n - 1\right) + (D_k-1)(n-1)n}$$
Inserting $k = n-1$ and simplifying, it is seen that (A8) is achieved.
Now consider region III. We put $\sigma_u^2 = \infty$, and find $\sigma_k^2$ and $\sigma_n^2$ to exactly satisfy $D_k$ and $D_n$. We minimize the resulting (large) expression with respect to $\rho$, giving $\rho = \frac{D_k - 1}{D_k + k - 1}$. This results in
$$R = R_r = \frac{1}{2n}\log\frac{1}{D_n}$$

References

  1. Dimakis, A.G.; Godfrey, P.B.; Wu, Y.; Wainwright, M.J.; Ramchandran, K. Network Coding for Distributed Storage Systems. IEEE Trans. Inf. Theory 2010, 56, 4539–4551.
  2. Gamal, A.E.; Cover, T. Achievable rates for multiple descriptions. IEEE Trans. Inf. Theory 1982, 28, 851–857.
  3. Gamal, A.E.; Kim, Y.H. Network Information Theory; Cambridge University Press: Cambridge, UK, 2011.
  4. Puri, R.; Pradhan, S.S.; Ramchandran, K. n-channel symmetric multiple descriptions—Part II: An achievable rate-distortion region. IEEE Trans. Inf. Theory 2005, 51, 1377–1392.
  5. Chan, T.H.; Ho, S.W. Robust multiple description coding—Joint coding for source and storage. In Proceedings of the 2013 IEEE International Symposium on Information Theory, Istanbul, Turkey, 7–12 July 2013; pp. 1809–1813.
  6. Kapetanovic, D.; Chatzinotas, S.; Ottersten, B. Index assignment for multiple description repair in distributed storage systems. In Proceedings of the 2014 IEEE International Conference on Communications (ICC), Sydney, Australia, 10–14 June 2014; pp. 3896–3901.
  7. Høst-Madsen, A.; Yang, H.; Kim, M.; Lee, J. Repair of Multiple Descriptions on Distributed Storage. In Proceedings of the ISITA 2020, Honolulu, HI, USA, 24–27 October 2020.
  8. Wang, H.; Viswanath, P. Vector Gaussian Multiple Description With Individual and Central Receivers. IEEE Trans. Inf. Theory 2007, 53, 2133–2153.
  9. Wang, H.; Viswanath, P. Vector Gaussian Multiple Description With Two Levels of Receivers. IEEE Trans. Inf. Theory 2009, 55, 401–410.
  10. Venkataramani, R.; Kramer, G.; Goyal, V.K. Multiple description coding with many channels. IEEE Trans. Inf. Theory 2003, 49, 2106–2114.
  11. Tian, C.; Chen, J. New Coding Schemes for the Symmetric K-Description Problem. IEEE Trans. Inf. Theory 2010, 56, 5344–5365.
  12. Viswanatha, K.B.; Akyol, E.; Rose, K. Combinatorial Message Sharing and a New Achievable Region for Multiple Descriptions. IEEE Trans. Inf. Theory 2016, 62, 769–792.
  13. Pradhan, S.S.; Puri, R.; Ramchandran, K. n-channel symmetric multiple descriptions—Part I: (n, k) source-channel erasure codes. IEEE Trans. Inf. Theory 2004, 50, 47–61.
  14. Cover, T.; Thomas, J. Elements of Information Theory, 2nd ed.; John Wiley: Hoboken, NJ, USA, 2006.
Figure 1. Two layer repair. See text for explanation.
Figure 2. Plots of $R$ or $R + R_r$ for two levels of constraints $(D_1, D_2)$ and variable number of nodes.

