Article

Distributed Hypothesis Testing over Noisy Broadcast Channels

by Sadaf Salehkalaibar 1 and Michèle Wigger 2,*,†

1 Department of Electrical and Computer Engineering, College of Engineering, University of Tehran, Tehran 1433957131, Iran
2 LTCI, Telecom Paris, IP Paris, 91120 Paris, France
* Author to whom correspondence should be addressed.
† M. Wigger has received funding from the European Research Council (ERC) under the European Union’s Horizon 2020 programme, grant agreement No 715111.
Submission received: 21 May 2021 / Revised: 8 June 2021 / Accepted: 8 June 2021 / Published: 29 June 2021
(This article belongs to the Special Issue Statistical Communication and Information Theory)

Abstract

This paper studies binary hypothesis testing with a single sensor that communicates with two decision centers over a memoryless broadcast channel. The main focus lies on the tradeoff between the type-II error exponents achievable at the two decision centers. Our proposed scheme can partially mitigate this tradeoff when the transmitter can distinguish, with probability larger than 1/2, the alternate hypotheses at the decision centers, i.e., the hypotheses under which the decision centers wish to maximize their error exponents. In the cases where these hypotheses cannot be distinguished at the transmitter (because both decision centers have the same alternative hypothesis or because the transmitter’s observations have the same marginal distribution under both hypotheses), our scheme exhibits an important tradeoff between the two exponents. The results in this paper thus reinforce the conclusions previously drawn for a setup where communication is over a common noiseless link. Compared to such a noiseless scenario, however, we observe here that even when the transmitter can distinguish the two hypotheses, a small exponent tradeoff can persist, simply because the channel noise prevents the transmitter from perfectly describing its guess of the hypothesis to the two decision centers.

1. Introduction

In Internet of Things (IoT) networks, data are collected at sensors and transmitted over a wireless channel to remote decision centers, which decide on one or multiple hypotheses based on the collected information. In this paper, we study simple binary hypothesis testing with a single sensor but two decision centers. The results can be combined with previous studies focusing on multiple sensors and a single decision center to tackle the practically relevant case of multiple sensors and multiple decision centers. We consider a single sensor for simplicity and because our main focus is on studying the tradeoff between the performances at the two decision centers that can arise because the single sensor has to send information over the channel that can be used by both decision centers. A simple, but highly suboptimal, approach would be to time-share communication and serve each of the two decision centers only during a part of the transmission. As we will see, better schemes are possible, and, in some cases, it is even possible to serve each of the two decision centers as if the other center was not present in the system.
In this paper, we follow the information-theoretic framework introduced in [1,2]. That means each terminal observes a memoryless sequence, and depending on the underlying hypothesis $H \in \{0,1\}$, all sequences follow one of two possible joint distributions, which are known to all involved terminals. A priori, however, the transmitter does not know the correct hypothesis and has to compute its transmit signal as a function of the observed source symbols only. The decision centers observe the outputs of the channel and, combined with their local observations, have to decide whether $H = 0$ or $H = 1$. The performance of a decision center is measured by its type-II error exponent, i.e., the exponential decay, in the length of the observations, of the probability of deciding on $H = 0$ when the true hypothesis is $H = 1$. As a constraint on the decision centers, we impose that the type-I error probability, i.e., the probability of deciding $H = 1$ when the true hypothesis is $H = 0$, vanishes (at any desired speed) with increasing observation lengths. The motivation for studying such asymmetric requirements on the two error probabilities stems, for example, from alert systems, where the miss-detection event is much more harmful than the false-alarm event; as a consequence, in our system we require the miss-detection probability to decay much faster than the false-alarm probability.
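The following toy example, which is not taken from the paper and uses arbitrary illustrative distributions, makes this asymmetric error regime concrete in the simplest possible case: a single observer, no communication, and an i.i.d. binary source. By Stein's lemma, when the type-I error only needs to stay below a fixed ϵ, the best achievable type-II error exponent equals the KL divergence D(P‖Q), and the sketch below verifies this numerically for a simple threshold test on the normalized log-likelihood ratio.

```python
# Toy, non-distributed illustration (not from the paper; P, Q and the threshold shift are
# arbitrary): Stein's lemma says the best type-II exponent under a fixed type-I constraint
# is D(P||Q).
import numpy as np
from scipy.stats import binom

P = np.array([0.7, 0.3])            # source pmf under H = 0 (illustrative values)
Q = np.array([0.4, 0.6])            # source pmf under H = 1
llr = np.log2(P / Q)                # per-symbol log-likelihood ratio
D_PQ = float(P @ llr)               # Stein exponent D(P||Q), in bits

def type2_error(n, shift=0.02):
    """Exact Pr[decide H=0 | H=1] for the test: accept H=0 iff (1/n)*sum_t llr(x_t) >= D(P||Q)-shift.
    With a binary alphabet the statistic only depends on the number k of ones, so the
    probability is a binomial sum under Q."""
    k = np.arange(n + 1)
    stat = ((n - k) * llr[0] + k * llr[1]) / n
    accept_H0 = stat >= D_PQ - shift
    return float(binom.pmf(k[accept_H0], n, Q[1]).sum())

for n in [25, 50, 100, 200, 400]:
    beta = type2_error(n)
    print(f"n={n:4d}  beta={beta:.3e}  -(1/n)log2(beta)={-np.log2(beta)/n:.3f}  D(P||Q)={D_PQ:.3f}")
```

As n grows, the measured exponent −(1/n) log β approaches D(P‖Q) ≈ 0.27 bits (up to the small threshold shift), while the type-I error vanishes by the law of large numbers.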
This problem setting was first considered for the setup with a single sensor and a single decision center when communication is over a noiseless link of given capacity [1,2]. For this canonical problem, the optimal error exponent has been identified in the special cases of testing against independence [1] and testing against conditional independence [3,4].
The scheme proposed by Shimokawa–Han–Amari in [3,4] yields an achievable error exponent for all distributed hypothesis testing problems (not only testing against conditional independence), but it might not be optimal in general [5]. The Shimokawa–Han–Amari (SHA) scheme has been extended to various more involved setups such as noiseless networks with multiple sensors and a single decision center [2,6,7]; networks where the sensor and the decision center can communicate interactively [8,9]; multi-hop networks [10]; and networks with multiple decision centers [10,11,12,13].
The works most closely related to the current paper are [10,12,13,14,15,16]. Specifically, Refs. [10,12,13,14] consider a single-sensor multi-detector system where communication is over a common noiseless link from the sensor to all decision centers. Focusing on two decision centers, two scenarios can be encountered here: (1) the two decision centers have the same null and alternate hypotheses, and as a consequence, both aim at maximizing the error exponent under the same hypothesis H; or (2) the two decision centers have opposite null and alternate hypotheses, and thus one decision center wishes to maximize the error exponent under hypothesis H = 0 and the other under hypothesis H = 1. The second scenario is motivated by applications where the decision centers have different goals. Hypothesis testing for scenario (1) was studied in [10,12,13,14], and the results showed a tradeoff between the exponents achieved at the two decision centers. Intuitively, the tradeoff comes from the fact that the communication from the sensor has to serve both decision centers at the same time. Scenario (2) was considered in [12,13]. In this case, a tradeoff only occurs when the sensor’s observation alone provides no advantage in guessing the hypothesis. Otherwise, a tradeoff-free exponent region can be achieved by the following simple scheme: the sensor takes a tentative guess of the hypothesis based only on its local observations; it communicates this tentative guess to both decision centers using a single bit and then devotes the rest of the communication, by means of a dedicated SHA scheme, only to the decision center that wishes to maximize the error exponent under the hypothesis that does not correspond to its guess. The other decision center simply keeps the transmitter’s tentative guess and ignores the rest of the communication.
In this paper, we extend these previous works to memoryless broadcast channels (BCs). Hypothesis testing over BCs was already considered in [10], however only for the above scenario (1), where both decision centers have the same null and alternate hypotheses, and in the special case of testing against conditional independence, for which the derived error exponents were proved to be optimal. Interestingly, when testing against conditional independence over noisy channels, only the capacity of the channel matters and not any other of its properties; see [15,16]. General hypothesis testing over noisy channels is much more challenging and requires additional tools, such as joint source-channel coding and unequal error protection (UEP) coding [17]. The latter can, in particular, be used to specially protect the communication of the sensor’s tentative guess, which allows one to avoid a degradation of the performance of classical hypothesis testing schemes.
We present general distributed hypothesis testing schemes over memoryless BCs, and we analyze their performances with a special focus on the tradeoff between the exponents they achieve at the two decision centers. We propose two different schemes, depending on whether or not the sensor can distinguish, with error probability smaller than 1/2, the two null hypotheses at the two decision centers. If a distinction is possible (because the decision centers have different null hypotheses and the sensor’s observations follow different marginal distributions under the two hypotheses), then we employ a scheme similar to the one proposed in [12,13] for a common noiseless link, but where the SHA scheme is replaced by the UEP-based scheme for DMCs in [15]. That means the sensor makes a tentative guess about the hypothesis and conveys this guess to both decision centers using a UEP mechanism. Moreover, the joint source-channel coding scheme in [15] with dedicated codebooks is used to communicate to the decision center that aims to maximize the error exponent under the hypothesis that does not correspond to the sensor’s tentative guess. This scheme shows no tradeoff between the exponents achieved at the two decision centers in various interesting cases. Sometimes, however, a tradeoff arises because, even under UEP, the specially protected messages can be in error and because the decision centers can confuse the codewords of the two different sets of codebooks. For the case where the sensor cannot reasonably distinguish the alternate hypotheses at the two decision centers (because both decision centers have the same alternate hypothesis or the sensor’s observations have the same marginal distribution under both hypotheses), we present a scheme similar to [10], but again including UEP. In this scheme, a tradeoff between the exponents achieved at the two decision centers naturally arises and mostly stems from the inherent tradeoff in distributed lossy compression systems with multiple decoders having different side informations.

Notation

We mostly follow the notation in [18]. Random variables are denoted by capital letters, e.g., $X, Y$, and their realizations by lower-case letters, e.g., $x, y$. Script symbols such as $\mathcal{X}$ and $\mathcal{Y}$ stand for the alphabets of random variables, and $\mathcal{X}^n$ and $\mathcal{Y}^n$ for the corresponding $n$-fold Cartesian products. Sequences of random variables $(X_i,\ldots,X_j)$ and realizations $(x_i,\ldots,x_j)$ are abbreviated by $X_i^j$ and $x_i^j$. When $i = 1$, we also use $X^j$ and $x^j$ instead of $X_1^j$ and $x_1^j$.
We write the probability mass function (pmf) of a discrete random variable $X$ as $P_X$; to indicate the pmf under hypothesis $H = 1$, we also use $Q_X$. The conditional pmf of $X$ given $Y$ is written as $P_{X|Y}$, or as $Q_{X|Y}$ when $H = 1$. The term $D(P\|Q)$ stands for the Kullback–Leibler (KL) divergence between two pmfs $P$ and $Q$ over the same alphabet. We use $\operatorname{tp}(a^n, b^n)$ to denote the joint type of the pair of sequences $(a^n, b^n)$, and $\operatorname{cond\_tp}(a^n|b^n)$ for the conditional type of $a^n$ given $b^n$. For a joint type $\pi_{ABC}$ over the alphabet $\mathcal{A}\times\mathcal{B}\times\mathcal{C}$, we denote by $I_{\pi_{ABC}}(A;B|C)$ the conditional mutual information assuming that the random triple $(A,B,C)$ has pmf $\pi_{ABC}$; similarly for the entropy $H_{\pi_{ABC}}(A)$ and the conditional entropy $H_{\pi_{ABC}}(A|B)$. Sometimes we abbreviate $\pi_{ABC}$ by $\pi$. In addition, when $\pi_{ABC}$ has been defined and is clear from the context, we write $\pi_A$ or $\pi_{AB}$ for the corresponding subtypes. When the type $\pi_{ABC}$ coincides with the actual pmf of a triple $(A,B,C)$, we omit the subscript and simply write $H(A)$, $H(A|B)$, and $I(A;B|C)$.
For a given $P_X$ and a constant $\mu > 0$, let $\mathcal{T}_\mu^n(P_X)$ be the set of $\mu$-typical sequences in $\mathcal{X}^n$ as defined in [18] (Section 2.4). Similarly, $\mathcal{T}_\mu^n(P_{XY})$ stands for the set of jointly $\mu$-typical sequences. The expectation operator is written as $\mathbb{E}[\cdot]$. We abbreviate independent and identically distributed by i.i.d. The log function is taken with base 2. Finally, in our justifications, we use (DP) and (CR) for “data processing inequality” and “chain rule”.
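As a small numerical companion to this notation, the following illustrative Python helpers (our own, not part of the paper) compute the joint type of a pair of sequences and the entropy, conditional entropy, and mutual information associated with a joint type $\pi_{AB}$.

```python
# Illustrative helpers (not from the paper): joint types and type-based information
# quantities, with logarithms in base 2 as in the text.
import numpy as np

def joint_type(a, b, A, B):
    """tp(a^n, b^n): empirical joint pmf of the pair of sequences over alphabet sizes A, B."""
    pi = np.zeros((A, B))
    np.add.at(pi, (a, b), 1.0)
    return pi / len(a)

def entropy(p):
    p = p[p > 0]
    return float(-(p * np.log2(p)).sum())

def cond_entropy(pi_AB):
    """H_pi(A|B) = H_pi(A,B) - H_pi(B)."""
    return entropy(pi_AB.ravel()) - entropy(pi_AB.sum(axis=0))

def mutual_info(pi_AB):
    """I_pi(A;B) = H_pi(A) - H_pi(A|B)."""
    return entropy(pi_AB.sum(axis=1)) - cond_entropy(pi_AB)

rng = np.random.default_rng(1)
a = rng.integers(0, 2, size=1000)
b = a ^ (rng.random(1000) < 0.1).astype(a.dtype)     # b = noisy copy of a
pi = joint_type(a, b, 2, 2)
print("joint type:\n", pi)
print("H(A|B) =", round(cond_entropy(pi), 3), "  I(A;B) =", round(mutual_info(pi), 3))
```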

2. System Model

Consider the distributed hypothesis testing problem in Figure 1, where a transmitter observes the sequence $X^n$, Receiver 1 the sequence $Y_1^n$, and Receiver 2 the sequence $Y_2^n$. Under the null hypothesis:
$$H = 0:\quad (X^n, Y_1^n, Y_2^n)\ \text{i.i.d.}\ \sim P_{XY_1Y_2},$$
and under the alternative hypothesis:
$$H = 1:\quad (X^n, Y_1^n, Y_2^n)\ \text{i.i.d.}\ \sim Q_{XY_1Y_2},$$
for two given pmfs $P_{XY_1Y_2}$ and $Q_{XY_1Y_2}$. The transmitter can communicate with the receivers over $n$ uses of a discrete memoryless broadcast channel $(\mathcal{W}, \mathcal{V}_1\times\mathcal{V}_2, \Gamma_{V_1V_2|W})$, where $\mathcal{W}$ denotes the finite channel input alphabet and $\mathcal{V}_1$ and $\mathcal{V}_2$ the finite channel output alphabets. Specifically, the transmitter feeds the inputs
$$W^n = f^{(n)}(X^n)$$
to the channel, where $f^{(n)}$ denotes the chosen (possibly stochastic) encoding function
$$f^{(n)}\colon \mathcal{X}^n \to \mathcal{W}^n.$$
Each Receiver $i \in \{1,2\}$ observes the BC outputs $V_i^n$, where for a given input $W_t = w_t$,
$$(V_{1,t}, V_{2,t}) \sim \Gamma_{V_1V_2|W}(\cdot,\cdot\,|\,w_t), \qquad t \in \{1,\ldots,n\}.$$
Based on the sequence of channel outputs $V_i^n$ and the source sequence $Y_i^n$, Receiver $i$ decides on the hypothesis $H$. That means it produces the guess
$$\hat{H}_i = g_i^{(n)}(V_i^n, Y_i^n)$$
for a chosen decoding function
$$g_i^{(n)}\colon \mathcal{V}_i^n \times \mathcal{Y}_i^n \to \{0,1\}.$$
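A minimal simulation of this model may help fix ideas. The following sketch (not from the paper; the alphabets, the encoder, and the channel parameters are arbitrary placeholders) stores a BC law $\Gamma_{V_1V_2|W}$ as a three-dimensional array and applies it memorylessly to a block of $n$ inputs.

```python
# Toy rendering of the channel model (illustrative only): gamma[w, v1, v2] = Gamma(v1, v2 | w),
# applied independently to each of the n channel uses.
import numpy as np

rng = np.random.default_rng(2)
W_card, V1_card, V2_card = 2, 2, 2

def bsc(eps):
    """Transition matrix of a binary symmetric channel with crossover probability eps."""
    return np.array([[1 - eps, eps], [eps, 1 - eps]])

# Illustrative BC: two binary symmetric channels (crossover 0.1 and 0.2) used in parallel.
gamma = np.einsum('wa,wb->wab', bsc(0.1), bsc(0.2))

def broadcast_channel(w_seq):
    """Pass the input block w_seq through the memoryless BC, returning (V1^n, V2^n)."""
    v1, v2 = np.empty_like(w_seq), np.empty_like(w_seq)
    for t, w in enumerate(w_seq):
        pair = rng.choice(V1_card * V2_card, p=gamma[w].ravel())
        v1[t], v2[t] = divmod(pair, V2_card)
    return v1, v2

w = rng.integers(0, W_card, size=10)   # channel inputs W^n = f^(n)(X^n) produced by some encoder
v1, v2 = broadcast_channel(w)
print("W :", w)
print("V1:", v1)
print("V2:", v2)
```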
There are different possible scenarios regarding the requirements on the error probabilities. We assume that each receiver is interested in only one of the two exponents. For each $i \in \{1,2\}$, let $h_i \in \{0,1\}$ be the hypothesis whose error exponent Receiver $i$ wishes to maximize, and $\bar{h}_i$ the other hypothesis, i.e., $\bar{h}_i \in \{0,1\}$ and $h_i \neq \bar{h}_i$. (The values of $h_1$ and $h_2$ are fixed and part of the problem statement.) We then have:
Definition 1.
An exponent pair $(\theta_1, \theta_2)$ is said to be achievable over a BC if, for each $\epsilon_1, \epsilon_2 \in (0,1)$ and all sufficiently large blocklengths $n$, there exist encoding and decoding functions $(f^{(n)}, g_1^{(n)}, g_2^{(n)})$ such that
$$\alpha_{1,n} \triangleq \Pr\big[\hat{H}_1 = h_1 \,\big|\, H = \bar{h}_1\big], \qquad \alpha_{2,n} \triangleq \Pr\big[\hat{H}_2 = h_2 \,\big|\, H = \bar{h}_2\big],$$
$$\beta_{1,n} \triangleq \Pr\big[\hat{H}_1 = \bar{h}_1 \,\big|\, H = h_1\big], \qquad \beta_{2,n} \triangleq \Pr\big[\hat{H}_2 = \bar{h}_2 \,\big|\, H = h_2\big],$$
satisfy
$$\alpha_{i,n} \le \epsilon_i, \qquad i \in \{1,2\},$$
and
$$\varlimsup_{n\to\infty} \frac{1}{n}\log\beta_{i,n} \le -\theta_i, \qquad i \in \{1,2\}.$$
Definition 2.
The fundamental exponents region E is the set of all exponent pairs ( θ 1 , θ 2 ) that are achievable.
Remark 1.
Notice that both α 1 , n and β 1 , n depend on the BC law Γ V 1 V 2 | W only through the conditional marginal distribution Γ V 1 | W . Similarly, α 2 , n and β 2 , n only depend on Γ V 2 | W . As a consequence, also the fundamental exponents region E depends on the joint laws P X Y 1 Y 2 and Q X Y 1 Y 2 only through their marginal laws P X Y 1 , P X Y 2 , Q X Y 1 , and Q X Y 2 .
Remark 2.
As a consequence to the preceding Remark 1, when P X = Q X , one can restrict attention to a scenario where both receivers aim at maximizing the error exponent under hypothesis H = 1 , i.e., h 1 = h 2 = 1 . In fact, under P X = Q X , the fundamental exponents region E for arbitrary h 1 and h 2 coincides with the fundamental exponents region E for h 1 = 1 and h 2 = 1 if one exchanges pmfs P X Y 1 and Q X Y 1 in case h 1 = 0 and one exchanges pmfs P X Y 2 and Q X Y 2 in case h 2 = 0 .
To simplify the notation in the sequel, we use the following shorthand notations for the pmfs P X Y 1 Y 2 and Q X Y 1 Y 2 .
For each $i \in \{1,2\}$:
$$\text{if } \bar{h}_i = 0:\qquad p^i_{XY_1Y_2} := P_{XY_1Y_2} \quad\text{and}\quad q^i_{XY_1Y_2} := Q_{XY_1Y_2},$$
and
$$\text{if } \bar{h}_i = 1:\qquad p^i_{XY_1Y_2} := Q_{XY_1Y_2} \quad\text{and}\quad q^i_{XY_1Y_2} := P_{XY_1Y_2}.$$
We propose two coding schemes yielding two different exponent regions, depending on whether
$$p^1_X(x) = p^2_X(x) \qquad \forall x \in \mathcal{X},$$
or
$$p^1_X(x) \neq p^2_X(x) \qquad \text{for some } x \in \mathcal{X}.$$
Notice that (13) always holds when $h_1 = h_2$. In contrast, if (14) holds, then obviously $h_1 \neq h_2$.

3. Results on Exponents Region

Before presenting our main results, we recall the achievable error exponent over a discrete memoryless channel reported in [15] (Theorem 1).

3.1. Achievable Exponent for Point-to-Point Channels

Consider a single-receiver setup with only Receiver 1 that wishes to maximize the error exponent under hypothesis h 1 = 1 . For simplicity then, we drop the user index 1 and simply call the receiver’s source observation Y n and its channel outputs V n .
Theorem 1
(Theorem 1 in [15]). Any exponent θ satisfying the following condition is achievable:
$$\theta \le \max\ \min\big\{\theta_{\mathrm{standard}}(P_{S|X}),\ \theta_{\mathrm{dec}}(P_{S|X}, P_T, P_{W|T}),\ \theta_{\mathrm{miss}}(P_{S|X}, P_T, P_{W|T})\big\},$$
where the maximization is over pmfs $P_{S|X}$, $P_T$, and $P_{W|T}$ such that the joint law $P_{STWVXY} := P_{XY}\, P_{S|X}\, P_T\, P_{W|T}\, \Gamma_{V|W}$ satisfies
$$I(S;X|Y) \le I(W;V|T),$$
and where the exponents in (15) are defined as:
$$\theta_{\mathrm{standard}}(P_{S|X}) := \min_{\substack{\tilde{P}_{SXY}:\ \tilde{P}_{SX} = P_{SX}\\ \tilde{P}_{SY} = P_{SY}}} D\big(\tilde{P}_{SXY}\,\big\|\,P_{S|X}\,Q_{XY}\big),$$
$$\theta_{\mathrm{dec}}(P_{S|X}, P_T, P_{W|T}) := \min_{\substack{\tilde{P}_{SXY}:\ \tilde{P}_{SX} = P_{SX},\ \tilde{P}_{Y} = P_{Y}\\ H_P(S|Y) \le H_{\tilde{P}}(S|Y)}} D\big(\tilde{P}_{SXY}\,\big\|\,P_{S|X}\,Q_{XY}\big) - I(S;X|Y) + I(W;V|T),$$
$$\theta_{\mathrm{miss}}(P_{S|X}, P_T, P_{W|T}) := D(P_Y\|Q_Y) + \mathbb{E}_{P_T}\big[D\big(P_{V|T}\,\big\|\,\Gamma_{V|W=T}\big)\big] - I(S;X|Y) + I(W;V|T).$$
Here, all mutual information terms are calculated with respect to the joint pmf P S T W V X Y defined above.
The exponent in Theorem 1 is obtained by the following scheme, which is also depicted in Figure 2.
The transmitter attempts to quantize the source sequence $X^n$ using a random codebook consisting of codewords $\{S^n(m,\ell)\}$. If the quantization fails because no codeword is jointly typical with the source sequence, then the transmitter applies the UEP mechanism in [17] by sending an i.i.d. $P_T$-sequence $T^n$ over the channel. Otherwise, it sends the codeword $W^n(m)$, where $m$ indicates the first index of an $S^n(m,\ell)$ codeword that is jointly typical with its source observation $X^n$. The receiver jointly decodes the channel and source codewords by verifying the existence of indices $(m,\ell)$ such that $W^n = W^n(m)$ is jointly typical with its channel outputs $V^n$ and there is no other codeword $S^n(m,\tilde{\ell})$ with smaller conditional empirical entropy given $Y^n$ than $S^n(m,\ell)$. If the decoded codeword $S^n(m,\ell)$ is jointly typical with the receiver's observation $Y^n$, then it produces $\hat{H} = 0$, and otherwise $\hat{H} = 1$.
The three competing type-II error exponents in Theorem 1 can be understood in view of this coding scheme as follows. Exponent $\theta_{\mathrm{standard}}$ corresponds to the event that, under $H = 1$, a random codeword $S^n(m,\ell)$ is jointly typical with both the transmitter's observation $X^n$ and the receiver's observation $Y^n$. This is also the error exponent in Han's scheme [2] over a noiseless communication link and does not depend on the channel law $\Gamma_{V|W}$. Exponent $\theta_{\mathrm{dec}}$ is related to the joint decoding that checks the joint typicality of the source codeword as well as of the channel codeword and applies a conditional minimum-entropy decoder. A similar error exponent is observed in the SHA scheme [3,4] over a noiseless link if the mutual information $I(W;V|T)$ is replaced by the rate of the link. The third exponent $\theta_{\mathrm{miss}}$ finally corresponds to the event where the transmitter sends $T^n$ (so as to indicate to the receiver that it should decide $\hat{H} = 1$) but the receiver detects a channel codeword $W^n(m)$ and a corresponding source codeword $S^n(m,\ell)$. This exponent is directly related to the channel transition law $\Gamma_{V|W}$, and not only to the mutual information of the channel, and does not occur when transmission is over a noiseless link. Interestingly, it is redundant in view of exponent $\theta_{\mathrm{dec}}$ whenever $Q_{XY} = P_X Q_Y$, because in this case the minimization in (18) evaluates to $D(P_Y\|Q_Y)$. In this special case, the exponent can also be shown to be optimal; see [15].
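To make the exponent $\theta_{\mathrm{standard}}$ more tangible, the sketch below (an illustration of ours, with arbitrary binary distributions) evaluates the minimization defining $\theta_{\mathrm{standard}}$ numerically. Because the feasible set fixes the $(S,X)$- and $(S,Y)$-marginals, the minimizer is an I-projection of the reference pmf $P_{S|X}Q_{XY}$ onto the intersection of two linear families, which iterative proportional fitting computes. For testing against independence, i.e., $Q_{XY} = P_X P_Y$, the value should coincide with $I(S;Y)$, the exponent of [1].

```python
# Numerical evaluation of theta_standard for a toy binary example (illustrative distributions,
# not from the paper). Iterative proportional fitting converges to the I-projection of the
# reference pmf onto the set of pmfs with the prescribed (S,X)- and (S,Y)-marginals.
import numpy as np

P_XY = np.array([[0.4, 0.1],
                 [0.1, 0.4]])                 # source pmf under H = 0
P_S_given_X = np.array([[0.9, 0.1],
                        [0.2, 0.8]])          # quantizer P_{S|X}: rows x, columns s
Q_XY = np.outer(P_XY.sum(axis=1), P_XY.sum(axis=0))    # H = 1: testing against independence

P_SXY = np.einsum('xy,xs->sxy', P_XY, P_S_given_X)     # joint pmf of (S, X, Y) under H = 0
ref   = np.einsum('xs,xy->sxy', P_S_given_X, Q_XY)     # reference pmf P_{S|X} * Q_XY
P_SX, P_SY = P_SXY.sum(axis=2), P_SXY.sum(axis=1)

def kl(p, q):
    m = p > 0
    return float((p[m] * np.log2(p[m] / q[m])).sum())

Pt = ref.copy()
for _ in range(500):                                    # alternately enforce the two marginals
    Pt *= (P_SX / Pt.sum(axis=2))[:, :, None]
    Pt *= (P_SY / Pt.sum(axis=1))[:, None, :]

theta_standard = kl(Pt, ref)
I_SY = kl(P_SY, np.outer(P_SY.sum(axis=1), P_SY.sum(axis=0)))
print(f"theta_standard ~= {theta_standard:.4f} bits,   I(S;Y) = {I_SY:.4f} bits")
```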
We now present our achievable exponents region, where we distinguish the two cases: (1) $h_1 \neq h_2$ and $P_X \neq Q_X$; and (2) $h_1 = h_2$ or $P_X = Q_X$.

3.2. Achievable Exponents Region When $h_1 \neq h_2$ and $P_X \neq Q_X$

Theorem 2.
If $h_1 \neq h_2$ and $P_X \neq Q_X$, i.e., (14) holds, then all error exponent pairs $(\theta_1, \theta_2)$ satisfying the following condition are achievable:
$$\theta_i \le \min\big\{\theta_{\mathrm{standard},i},\ \theta_{\mathrm{dec},i},\ \theta_{\mathrm{cross},i},\ \theta_{\mathrm{miss},i}\big\}, \qquad i \in \{1,2\},$$
where the union is over pmfs $p^i_{S|X}$, $p_T$, $p^i_{T_i|T}$, and $p^i_{W|TT_i}$, for $i \in \{1,2\}$, so that the joint pmfs $p^1, p^2$ defined through (12) and
$$p^i_{SXY_1Y_2TT_iWV_1V_2} := p^i_{S|X}\cdot p^i_{XY_1Y_2}\cdot p_T\cdot p^i_{T_i|T}\cdot p^i_{W|TT_i}\cdot \Gamma_{V_1V_2|W}, \qquad i \in \{1,2\},$$
satisfy the constraints
$$I_{p^i}(S;X|Y_i) < I_{p^i}(W;V_i|T,T_i), \qquad i \in \{1,2\},$$
and where the exponents in (20) are defined as follows, where we set $q^1 = p^2$ and $q^2 = p^1$:
$$\theta_{\mathrm{standard},i} := \min_{\substack{\tilde{P}_{SXY_i}:\ \tilde{P}_{SX} = p^i_{SX}\\ \tilde{P}_{SY_i} = p^i_{SY_i}}} D\big(\tilde{P}_{SXY_i}\,\big\|\,p^i_{S|X}\,q^i_{XY_i}\big),$$
$$\theta_{\mathrm{dec},i} := \min_{\substack{\tilde{P}_{SXY_i}:\ \tilde{P}_{SX} = p^i_{SX},\ \tilde{P}_{Y_i} = p^i_{Y_i}\\ H_{p^i}(S|Y_i) \le H_{\tilde{P}}(S|Y_i)}} D\big(\tilde{P}_{SXY_i}\,\big\|\,p^i_{S|X}\,q^i_{XY_i}\big) - I_{p^i}(S;X|Y_i) + I_{p^i}(W;V_i|T,T_i),$$
$$\theta_{\mathrm{miss},i} := D\big(p^i_{Y_i}\,\big\|\,q^i_{Y_i}\big) + \mathbb{E}_{p_T}\big[D\big(p^i_{V_i|T}\,\big\|\,\Gamma_{V_i|W=T}\big)\big] - I_{p^i}(S;X|Y_i) + I_{p^i}(W;V_i|T,T_i),$$
$$\theta_{\mathrm{cross},i} := \min_{\substack{\tilde{P}_{SXY_i}:\ \tilde{P}_{Y_i} = p^i_{Y_i}\\ H_{p^i}(S|Y_i) \le H_{\tilde{P}}(S|Y_i)}} \mathbb{E}_{\tilde{P}_{SX}}\big[D\big(\tilde{P}_{Y_i|XS}\,\big\|\,q^i_{Y_i|X}\big)\big] + \min_{\substack{\tilde{P}_{TT_iW}:\ \tilde{P}_{TW} = q^i_{TW}\\ \tilde{P}_{TT_i} = p^i_{TT_i}}} \mathbb{E}_{\tilde{P}_{TT_iW}}\big[D\big(p^i_{V_i|TT_i}\,\big\|\,\Gamma_{V_i|W}\big)\big] - I_{p^i}(S;X|Y_i) + I_{p^i}(W;V_i|T,T_i).$$
Proof. 
See Appendix A. □
In Theorem 2, the exponent triple $(\theta_{\mathrm{standard},1}, \theta_{\mathrm{dec},1}, \theta_{\mathrm{miss},1})$ can be optimized over the pmfs $p^1_{S|X}$, $p^1_{T_1|T}$ and $p^1_{W|TT_1}$, and independently thereof the exponent triple $(\theta_{\mathrm{standard},2}, \theta_{\mathrm{dec},2}, \theta_{\mathrm{miss},2})$ can be optimized over the pmfs $p^2_{S|X}$, $p^2_{T_2|T}$ and $p^2_{W|TT_2}$. The pmf $p_T$ is common to both optimizations. However, whenever the exponents $\theta_{\mathrm{cross},1}$ and $\theta_{\mathrm{cross},2}$ are not active, Theorem 2 depends only on $p^i_{S|X}$, $p^i_{T_i}$ and $p^i_{W|T_i}$, for $i = 1, 2$, and there is thus no tradeoff between the two exponents $\theta_1$ and $\theta_2$. In other words, the same exponents $\theta_1$ and $\theta_2$ can be attained as in a system where the transmitter communicates over two individual DMCs $\Gamma_{V_1|W}$ and $\Gamma_{V_2|W}$ to the two receivers, or equivalently, each receiver achieves the same exponent as if the other receiver were not present in the system.
The scheme achieving the exponents region in Theorem 2 is described in detail in Section 4 and analyzed in Appendix A. The main feature is that the sensor makes a tentative decision on $H$ and conveys this decision to both receivers through its choice of the codebooks and a special coded time-sharing sequence indicating this choice. The receiver that wishes to maximize the error exponent corresponding to the hypothesis guessed at the sensor directly decides on this hypothesis. The other receiver should compare its own observation to a quantized version of the source sequence observed at the sensor. The sensor uses the quantization and binning scheme presented in [15] tailored to this latter receiver, using either the coded time-sharing sequence $T_1^n$ and codebooks $\{S^n(1;m,\ell)\}$ and $\{W^n(1;m)\}$, or the coded time-sharing sequence $T_2^n$ and codebooks $\{S^n(2;m,\ell)\}$ and $\{W^n(2;m)\}$, respectively. The overall scheme is illustrated in Figure 3.
Exponents $\theta_{\mathrm{standard},i}$, $\theta_{\mathrm{dec},i}$, and $\theta_{\mathrm{miss},i}$ have similar explanations as in the single-user case. Exponent $\theta_{\mathrm{cross},i}$ corresponds to the event that the transmitter sends a codeword from $\{W^n(j;m)\}$, for $j = 3 - i$, but Receiver $i$ decides that a codeword from $\{W^n(i;m)\}$ was sent and a source codeword $S^n(i;m,\ell)$ satisfies the minimum conditional entropy condition and the typicality check with the observed source sequence $Y_i^n$. Notice that setting $T_i$ as a constant decreases the error exponent $\theta_{\mathrm{cross},i}$.
For the special case where the BC consists of a common noiseless link, Theorem 2 was proved in [12,13]. (More precisely, [12] considers the more general case with $K \ge 2$ receivers and $M \ge K$ hypotheses.) In this case, the exponents $(\theta_{\mathrm{miss},1}, \theta_{\mathrm{cross},1})$ and $(\theta_{\mathrm{miss},2}, \theta_{\mathrm{cross},2})$ are not active, and there is no tradeoff between $\theta_1$ and $\theta_2$.

3.3. Achievable Exponents Region for $h_1 = h_2$ or $P_X = Q_X$

Define, for any pmfs $P_T$, $P_{SU_1U_2|XT}$ and function
$$f\colon \mathcal{S}\times\mathcal{U}_1\times\mathcal{U}_2\times\mathcal{X} \to \mathcal{W},$$
the joint pmfs
$$p^i_{SU_1U_2XY_1Y_2TV_1V_2} := P_{SU_1U_2|XT}\cdot p^i_{XY_1Y_2}\cdot P_T\cdot \Gamma_{V_1V_2|SU_1U_2X}, \qquad i \in \{1,2\},$$
and
$$\Gamma_{V_1V_2|SU_1U_2X}(v_1,v_2|s,u_1,u_2,x) := \Gamma_{V_1V_2|W}\big(v_1, v_2\,\big|\,f(s,u_1,u_2,x)\big), \qquad s\in\mathcal{S},\ u_1\in\mathcal{U}_1,\ u_2\in\mathcal{U}_2,\ x\in\mathcal{X},$$
and, for each $i \in \{1,2\}$, the four exponents
$$\theta_{\mathrm{standard},i} := \min_{\substack{\tilde{P}_{SU_iXY_iTV_i}:\ \tilde{P}_{SU_iXT} = p^i_{SU_iXT}\\ \tilde{P}_{SU_iY_iTV_i} = p^i_{SU_iY_iTV_i}}} D\big(\tilde{P}_{SU_iXY_iTV_i}\,\big\|\,p^i_{SU_i|X}\,q^i_{XY_i}\,P_T\,\Gamma_{V_i|SU_1U_2X}\big),$$
$$\theta^{a}_{\mathrm{dec},i} := \min_{\substack{\tilde{P}_{SU_iXY_iTV_i}:\ \tilde{P}_{SU_iXT} = p^i_{SU_iXT},\ \tilde{P}_{Y_iTV_i} = p^i_{Y_iTV_i}\\ H_{p^i}(S,U_i|Y_i,T,V_i) \le H_{\tilde{P}}(S,U_i|Y_i,T,V_i)}} D\big(\tilde{P}_{SU_iXY_iTV_i}\,\big\|\,p^i_{SU_i|X}\,q^i_{XY_i}\,P_T\,\Gamma_{V_i|SU_1U_2X}\big) - I_{p^i}(S,U_i;X|T) + I_{p^i}(S,U_i;Y_i,V_i|T),$$
$$\theta^{b}_{\mathrm{dec},i} := \min_{\substack{\tilde{P}_{SU_iXY_iTV_i}:\ \tilde{P}_{SU_iXT} = p^i_{SU_iXT},\ \tilde{P}_{SY_iTV_i} = p^i_{SY_iTV_i}\\ H_{p^i}(U_i|S,Y_i,T,V_i) \le H_{\tilde{P}}(U_i|S,Y_i,T,V_i)}} D\big(\tilde{P}_{SU_iXY_iTV_i}\,\big\|\,p^i_{SU_i|X}\,q^i_{XY_i}\,P_T\,\Gamma_{V_i|SU_1U_2X}\big) - I_{p^i}(U_i;X|S,T) + I_{p^i}(U_i;Y_i,V_i|S,T),$$
$$\theta_{\mathrm{miss},i} := \mathbb{E}_{P_T}\big[D\big(p^i_{Y_iV_i|T}\,\big\|\,q^i_{Y_i}\,\Gamma_{V_i|W=T}\big)\big] - I_{p^i}(S,U_i;X|T) + I_{p^i}(S,U_i;Y_i,V_i|T).$$
Theorem 3.
If $h_1 = h_2$ or $P_X = Q_X$, i.e., (13) holds, then the union of all nonnegative error exponent pairs $(\theta_1, \theta_2)$ satisfying the following conditions is achievable:
$$\theta_i \le \min\big\{\theta_{\mathrm{standard},i},\ \theta^{a}_{\mathrm{dec},i},\ \theta^{b}_{\mathrm{dec},i},\ \theta_{\mathrm{miss},i}\big\}, \qquad i \in \{1,2\},$$
$$\theta_1 + \theta_2 \le \min\big\{\theta_{\mathrm{standard},1}+\theta_{\mathrm{standard},2},\ \theta_{\mathrm{standard},1}+\theta^{a}_{\mathrm{dec},2},\ \theta_{\mathrm{standard},1}+\theta^{b}_{\mathrm{dec},2},\ \theta_{\mathrm{standard},2}+\theta^{a}_{\mathrm{dec},1},\ \theta_{\mathrm{standard},2}+\theta^{b}_{\mathrm{dec},1},\ \theta_{\mathrm{miss},1}+\theta_{\mathrm{miss},2}\big\} - I_{p^1}(U_1;U_2|S,T),$$
$$\theta_1 + \theta_2 \le \min\big\{\theta^{a}_{\mathrm{dec},1},\ \theta^{b}_{\mathrm{dec},1}\big\} + \min\big\{\theta^{a}_{\mathrm{dec},2},\ \theta^{b}_{\mathrm{dec},2}\big\} - 2\, I_{p^1}(U_1;U_2|S,T),$$
where the union is over pmfs $P_T$, $P_{SU_1U_2|XT}$ and functions $f$ as in (27) so that the pmfs (28) and (29) satisfy, for $i \in \{1,2\}$:
$$I_{p^i}(S,U_i;X|T) \le I_{p^i}(S,U_i;Y_i,V_i|T),$$
$$I_{p^i}(U_i;X|S,T) \le I_{p^i}(U_i;Y_i,V_i|S,T),$$
$$I_{p^1}(S,U_1;X|T) + I_{p^1}(S,U_2;X|T) + I_{p^1}(U_1;U_2|S,T) \le I_{p^1}(S,U_1;Y_1,V_1|T) + I_{p^2}(S,U_2;Y_2,V_2|T),$$
$$I_{p^1}(U_1;X|S,T) + I_{p^1}(U_2;X|S,T) + I_{p^1}(U_1;U_2|S,T) \le I_{p^1}(U_1;Y_1,V_1|S,T) + I_{p^2}(U_2;Y_2,V_2|S,T),$$
$$I_{p^1}(U_1;X|S,T) + I_{p^1}(S,U_2;X|T) + I_{p^1}(U_1;U_2|S,T) \le I_{p^1}(U_1;Y_1,V_1|S,T) + I_{p^2}(S,U_2;Y_2,V_2|T),$$
$$I_{p^1}(S,U_1;X|T) + I_{p^1}(U_2;X|S,T) + I_{p^1}(U_1;U_2|S,T) \le I_{p^1}(S,U_1;Y_1,V_1|T) + I_{p^2}(U_2;Y_2,V_2|S,T).$$
Proof. 
The coding and testing scheme achieving these exponents is described in Section 5. The analysis of the scheme is similar to the proof of [15] (Theorem 4) and omitted for brevity. In particular, error exponent $\theta_{\mathrm{standard},i}$ corresponds to the event that Receiver $i$ decodes the correct cloud and satellite codewords but wrongly decides on $\hat{H}_i = 0$. In contrast, error exponents $\theta^{a}_{\mathrm{dec},i}$ and $\theta^{b}_{\mathrm{dec},i}$ correspond to the events that Receiver $i$ wrongly decides on $\hat{H}_i = 0$ after wrongly decoding both the cloud center and the satellite, or only the satellite, respectively. Error exponent $\theta_{\mathrm{miss},i}$ corresponds to the miss-detection event. Due to the implicit rate constraints in (46), the final constraints in (31) are obtained by eliminating the rates $R_0, R_1, R_2$ by means of Fourier–Motzkin elimination. Notice that in constraint (31c), the mutual information $I_{p^1}(U_1;U_2|S,T)$ is multiplied by a factor of 2, whereas in (31b), it appears without a factor. The reason is that the error analysis includes union bounds over the codewords in a bin; when the satellite codewords are wrongly decoded (which is the case for exponents $\theta^{a}_{\mathrm{dec},i}$ and $\theta^{b}_{\mathrm{dec},i}$), the union bound is over pairs of codewords, whereas under correct decoding, it is over single codewords. In the former case, the error probability contains the factor $2^{2nR_i}$, and, in the latter case, the factor $2^{nR_i}$. The auxiliary rates $R_1$ and $R_2$ are then eliminated using the Fourier–Motzkin elimination algorithm. □
For each i { 1 , 2 } , exponents θ standard , i , θ dec , i a , θ dec , i b , and θ miss , i have the same form as the three exponents in [15] (Theorem 1) for the DMC. There is, however, a tradeoff between the two exponents θ 1 and θ 2 in the above theorem because they share the same choice of the auxiliary pmfs P T and P S U 1 U 2 | X T and the function f. In [10], the above setup is studied in the special case of testing against conditional independence, and the mentioned tradeoff is illustrated through a Gaussian example.

4. Coding and Testing Scheme When $p^1_X \neq p^2_X$

Fix $\mu > 0$, a sufficiently large blocklength $n$, auxiliary distributions $p_T$, $p^1_{T_1|T}$ and $p^2_{T_2|T}$ over $\mathcal{W}$, conditional channel input distributions $p^1_{W|TT_1}$ and $p^2_{W|TT_2}$, and conditional pmfs $p^1_{S|X}$ and $p^2_{S|X}$ over a finite auxiliary alphabet $\mathcal{S}$ such that for each $i \in \{1,2\}$:
$$I_{p^i}(S;X|Y_i) < I_{p^i}(W;V_i|T,T_i).$$
The mutual informations in (33) are calculated according to the joint distribution:
$$p^i_{SXY_1Y_2TT_iWV_1V_2} = p^i_{S|X}\cdot p^i_{XY_1Y_2}\cdot p_T\cdot p^i_{T_i|T}\cdot p^i_{W|TT_i}\cdot \Gamma_{V_1V_2|W}.$$
For each $i \in \{1,2\}$, if $I_{p^i}(S;X) < I_{p^i}(W;V_i|T,T_i)$, choose the rates
$$R_i := I_{p^i}(S;X) + \mu,$$
$$R_i' := 0.$$
If $I_{p^i}(S;X) \ge I_{p^i}(W;V_i|T,T_i)$, then choose the rates
$$R_i := I_{p^i}(W;V_i|T,T_i) - \mu,$$
$$R_i' := I_{p^i}(S;X) - I_{p^i}(W;V_i|T,T_i) + 2\mu.$$
Again, all mutual informations in (35)–(38) are calculated with respect to the pmf in (34).
Code Construction: Generate a sequence $T^n = (T_1,\ldots,T_n)$ by independently drawing each component $T_k$ according to $p_T$. For each $i \in \{1,2\}$, generate a sequence $T_i^n = (T_{i,1},\ldots,T_{i,n})$ by independently drawing each $T_{i,k}$ according to $p^i_{T_i|T}(\cdot|t)$ when $T_k = t$. In addition, construct a random codebook
$$\mathcal{C}_W^i = \big\{W^n(i;m) : m \in \{1,\ldots,2^{nR_i}\}\big\}$$
superpositioned on $(T^n, T_i^n)$, where the $k$-th symbol $W_k(i;m)$ of codeword $W^n(i;m)$ is drawn independently of all other codeword symbols according to $p^i_{W|TT_i}(\cdot|t,t_i)$ when $T_k = t$ and $T_{i,k} = t_i$. Finally, construct a random codebook
$$\mathcal{C}_S^i = \big\{S^n(i;m,\ell) : m \in \{1,\ldots,2^{nR_i}\},\ \ell \in \{1,\ldots,2^{nR_i'}\}\big\}, \qquad i \in \{1,2\},$$
by independently drawing the $k$-th component $S_k(i;m,\ell)$ of codeword $S^n(i;m,\ell)$ according to the marginal pmf $p^i_S$.
Reveal all codebooks and the realizations $t^n, t_1^n, t_2^n$ of the sequences $T^n, T_1^n, T_2^n$ to all terminals.
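The random code construction can be mirrored almost line by line in code. The sketch below (an illustration of ours; the alphabet sizes, pmfs and rates are placeholders) draws a coded time-sharing pair $(T^n, T_i^n)$ and then a codebook $\mathcal{C}_W^i$ superpositioned on it.

```python
# Sketch of the code construction above (illustrative placeholders throughout): each symbol
# W_k(i;m) is drawn from p_W|T,Ti(.|T_k, Ti_k), independently across symbols and codewords.
import numpy as np

rng = np.random.default_rng(4)
n, W_card = 12, 3
p_T = np.array([0.5, 0.3, 0.2])                         # pmf of T over the channel input alphabet
p_Ti_given_T = np.array([[0.7, 0.2, 0.1],               # rows t, columns t_i (for one fixed i)
                         [0.1, 0.8, 0.1],
                         [0.2, 0.2, 0.6]])
p_W_given_TTi = rng.dirichlet(np.ones(W_card), size=(W_card, W_card))   # indexed [t, t_i, w]

T  = rng.choice(W_card, size=n, p=p_T)                              # coded time-sharing sequence T^n
Ti = np.array([rng.choice(W_card, p=p_Ti_given_T[t]) for t in T])   # T_i^n superpositioned on T^n

def draw_codebook_W(num_codewords):
    """C_W^i with |C_W^i| = num_codewords (2^{nR_i} in the scheme; small here for display)."""
    book = np.empty((num_codewords, n), dtype=int)
    for m in range(num_codewords):
        for k in range(n):
            book[m, k] = rng.choice(W_card, p=p_W_given_TTi[T[k], Ti[k]])
    return book

C_W = draw_codebook_W(num_codewords=4)
print(C_W)
```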
Transmitter: Given the source sequence $X^n = x^n$, the transmitter looks for indices $(i,m,\ell) \in \{1,2\}\times\{1,\ldots,2^{nR_i}\}\times\{1,\ldots,2^{nR_i'}\}$ such that the codeword $s^n(i;m,\ell)$ from codebook $\mathcal{C}_S^i$ satisfies
$$\big(s^n(i;m,\ell),\, x^n\big) \in \mathcal{T}_{\mu/2}^n\big(p^i_{SX}\big),$$
and the corresponding codeword $w^n(i;m)$ from codebook $\mathcal{C}_W^i$ satisfies
$$\big(t^n,\, t_i^n,\, w^n(i;m)\big) \in \mathcal{T}_{\mu/2}^n\big(p^i_{TT_iW}\big).$$
(Notice that when $\mu$ is sufficiently small, Condition (41) can be satisfied for at most one value of $i \in \{1,2\}$, because $p^1_X \neq p^2_X$.) If successful, the transmitter picks uniformly at random one of the triples $(i,m,\ell)$ that satisfy (41), and it sends the sequence $w^n(i;m)$ over the channel. If no triple satisfies Condition (41), then the transmitter sends the sequence $t^n$ over the channel.
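The transmitter's rule can be summarized in a few lines of code. The following sketch (illustrative only: typicality is approximated by a total-variation test on the joint type, and only the source-typicality check of Condition (41) is shown) returns either an admissible triple $(i,m,\ell)$ or the instruction to send the time-sharing sequence $t^n$.

```python
# Schematic transmitter for the scheme above (illustrative). codebook_S[i] is assumed to be an
# integer array of shape (num_m, num_l, n), and p_SX[i] the pmf matrix indexed (s, x).
import numpy as np

def joint_type(a, b, A, B):
    pi = np.zeros((A, B))
    np.add.at(pi, (a, b), 1.0)
    return pi / len(a)

def encode(x, codebook_S, p_SX, mu, rng=np.random.default_rng(0)):
    """Return ('codeword', i, m, l) if some s^n(i;m,l) is 'typical' with x^n, else ('timesharing',)."""
    hits = []
    for i in (0, 1):                               # the two codebooks (i = 1, 2 in the text)
        S_card, X_card = p_SX[i].shape
        num_m, num_l, _ = codebook_S[i].shape
        for m in range(num_m):
            for l in range(num_l):
                pi = joint_type(codebook_S[i][m, l], x, S_card, X_card)
                if 0.5 * np.abs(pi - p_SX[i]).sum() < mu / 2:
                    hits.append((i, m, l))
    if not hits:
        return ('timesharing',)                    # send t^n; this signals the tentative guess
    i, m, l = hits[rng.integers(len(hits))]        # pick one admissible triple uniformly at random
    return ('codeword', i, m, l)                   # the channel codeword w^n(i;m) is then sent
```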
Receiver $i \in \{1,2\}$: Receiver $i$ observes $v_i^n$ and checks whether there exist indices $(m,\ell)$ such that the following three conditions are satisfied:
  • $\big(t^n, t_i^n, w^n(i;m), v_i^n\big) \in \mathcal{T}_\mu^n\big(p^i_{TT_iWV_i}\big)$;
  • $H_{\operatorname{tp}(s^n(i;m,\ell),\,y_i^n)}(S|Y_i) = \min_{\tilde{\ell}} H_{\operatorname{tp}(s^n(i;m,\tilde{\ell}),\,y_i^n)}(S|Y_i)$;
  • $\big(s^n(i;m,\ell), y_i^n\big) \in \mathcal{T}_\mu^n\big(p^i_{SY_i}\big)$.
If successful, it declares $\hat{H}_i = \bar{h}_i$. Otherwise, it declares $\hat{H}_i = h_i$.
Analysis: See Appendix A.

5. Coding and Testing Scheme When $p^1_X = p^2_X$

In this case, the scheme is based on hybrid source-channel coding. Choose a large positive integer $n$, auxiliary alphabets $\mathcal{S}$, $\mathcal{U}_1$, and $\mathcal{U}_2$, and a function $f$ as in (27).
Choose an auxiliary distribution $P_T$ over $\mathcal{W}$ and a conditional distribution $P_{SU_1U_2|XT}$ over $\mathcal{S}\times\mathcal{U}_1\times\mathcal{U}_2$ so that, for $i \in \{1,2\}$, inequalities (32) are satisfied with strict inequality.
Then choose a positive $\mu$ and rates $R_0, R_1, R_2$ so that
$$R_0 = I_{p^1}(S;X|T) + \mu,$$
$$R_i > I_{p^1}(U_i;X|S,T), \qquad i \in \{1,2\},$$
$$R_1 + R_2 > I_{p^1}(U_1;X|S,T) + I_{p^1}(U_2;X|S,T) + I_{p^1}(U_1;U_2|S,T),$$
and
$$R_0 + R_i < I_{p^i}(S,U_i;Y_i,V_i|T),$$
$$R_i < I_{p^i}(U_i;Y_i,V_i|S,T).$$
Generate a sequence $T^n$ i.i.d. according to $P_T$ and construct a random codebook
$$\mathcal{C}_S = \big\{S^n(m_0) : m_0 \in \{1,\ldots,2^{nR_0}\}\big\}$$
superpositioned on $T^n$, where each codeword is drawn independently according to $p^1_{S|T}$ conditioned on $T^n$. Then, for each index $m_0$ and $i \in \{1,2\}$, randomly generate a codebook
$$\mathcal{C}_{U_i}(m_0) = \big\{U_i^n(m_0, m_i) : m_i \in \{1,\ldots,2^{nR_i}\}\big\}$$
superpositioned on $(T^n, S^n(m_0))$ by drawing each entry of the $n$-length codeword $U_i^n(m_0,m_i)$ independently according to the conditional pmf $p^1_{U_i|ST}(\cdot|S_k(m_0), T_k)$, where $S_k(m_0)$ denotes the $k$-th symbol of $S^n(m_0)$. Reveal the realizations of the codebooks and the sequence $T^n$ to all terminals.
Transmitter: Given that it observes the source sequence $X^n = x^n$, the transmitter looks for indices $(m_0, m_1, m_2)$ that satisfy
$$\big(s^n(m_0),\, u_1^n(m_0,m_1),\, u_2^n(m_0,m_2),\, x^n,\, t^n\big) \in \mathcal{T}_{\mu/2}^n\big(p^1_{SU_1U_2XT}\big).$$
If successful, it picks one of these index triples uniformly at random and sends the codeword $w^n$ over the channel, where
$$w_k = f\big(s_k(m_0),\, u_{1,k}(m_0,m_1),\, u_{2,k}(m_0,m_2),\, x_k\big), \qquad k \in \{1,\ldots,n\},$$
and where $(s_k(m_0), u_{1,k}(m_0,m_1), u_{2,k}(m_0,m_2))$ denote the $k$-th components of the codewords $(s^n(m_0), u_1^n(m_0,m_1), u_2^n(m_0,m_2))$. Otherwise, it sends the sequence of inputs $t^n$ over the channel.
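The defining feature of the hybrid scheme is that each channel input symbol is a deterministic function of the selected codeword symbols and of the source symbol itself. The short sketch below (an illustration of ours; $f$ is an arbitrary lookup table and all alphabets are placeholders) computes the channel-input sequence from chosen codewords.

```python
# Hybrid-coding channel input (illustrative): w_k = f(s_k, u1_k, u2_k, x_k) for every k.
import numpy as np

rng = np.random.default_rng(3)
n = 8
S_card, U_card, X_card, W_card = 2, 2, 2, 3

# Hypothetical deterministic map f: S x U1 x U2 x X -> W, stored as a lookup table.
f = rng.integers(0, W_card, size=(S_card, U_card, U_card, X_card))

s  = rng.integers(0, S_card, size=n)    # selected cloud codeword s^n(m0)
u1 = rng.integers(0, U_card, size=n)    # selected satellite codeword u_1^n(m0, m1)
u2 = rng.integers(0, U_card, size=n)    # selected satellite codeword u_2^n(m0, m2)
x  = rng.integers(0, X_card, size=n)    # observed source sequence x^n

w = f[s, u1, u2, x]                     # symbol-wise application of f via fancy indexing
print("channel inputs w^n:", w)
```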
Receiver $i \in \{1,2\}$: After observing $V_i^n = v_i^n$ and $Y_i^n = y_i^n$, Receiver $i$ looks for indices $m_0 \in \{1,\ldots,2^{nR_0}\}$ and $m_i \in \{1,\ldots,2^{nR_i}\}$ that satisfy the following conditions:
  • $\big(s^n(m_0), u_i^n(m_0,m_i), y_i^n, t^n, v_i^n\big) \in \mathcal{T}_\mu^n\big(p^i_{SU_iY_iTV_i}\big)$;
  • $H_{\operatorname{tp}(s^n(m_0),\,u_i^n(m_0,m_i),\,y_i^n,\,t^n,\,v_i^n)}(S,U_i|Y_i,T,V_i) = \min_{\tilde{m}_0, \tilde{m}_i} H_{\operatorname{tp}(s^n(\tilde{m}_0),\,u_i^n(\tilde{m}_0,\tilde{m}_i),\,y_i^n,\,t^n,\,v_i^n)}(S,U_i|Y_i,T,V_i)$.
If successful, Receiver $i$ declares $\hat{H}_i = \bar{h}_i$. Otherwise, it declares $\hat{H}_i = h_i$.
Analysis: Similar to [15] (Appendix D) and omitted.

6. Summary and Conclusions

This paper proposed and analyzed general distributed hypothesis testing schemes, both for the case where the sensor can distinguish the two null hypotheses and for the case where it cannot. Our general schemes recover all previously studied special cases. Moreover, our schemes exhibit a phenomenon similar to the one observed for setups with a common noise-free communication link from the sensor to all decision centers: while a tradeoff arises when the transmitter cannot distinguish the alternate hypotheses at the two decision centers, such a tradeoff can almost completely be mitigated when the transmitter can distinguish the alternate hypotheses. In contrast to the noise-free link scenario, under a noisy broadcast channel model a tradeoff can still arise in this case, because the decision centers can misdetect the decision taken at the transmitter and thus misinterpret to whom the communication is dedicated.
Interesting directions for future research include information-theoretic converse results and extensions to multiple sensors or more than two decision centers.

Author Contributions

Formal analysis, S.S. and M.W.; writing—original draft preparation, S.S.; writing—review and editing, M.W. All authors have read and agreed to the published version of the manuscript.

Funding

M. Wigger has received funding from the European Research Council (ERC) under the European Union’s Horizon 2020 programme, grant agreement No 715111.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A. Proof of Theorem 2

The proof is based on the scheme of Section 4. Fix a choice of the blocklength $n$, the small positive $\mu$, and the (conditional) pmfs $p_T$, $p^1_{T_1|T}$, $p^2_{T_2|T}$, $p^1_{W|TT_1}$, $p^2_{W|TT_2}$, $p^1_{S|X}$ and $p^2_{S|X}$ so that (22) holds. Assume that $I_{p^1}(S;X) \ge I_{p^1}(W;V_1|T,T_1)$ and $I_{p^2}(S;X) \ge I_{p^2}(W;V_2|T,T_2)$, in which case $R_1, R_2, R_1', R_2'$ are given by (37) and (38). Additionally, set for convenience of notation:
p S i ( s ) = p S i ( s ) , s S ,
p W | T T i i ( w | t , t i ) = p W | T T i i ( w | t , t i ) , t , t i , w W .
The analysis of the type-I error probability is similar to that in [15] (Appendix A). The main novelty is that, because $p^1_X(x) \neq p^2_X(x)$ for some $x \in \mathcal{X}$, for sufficiently small values of $\mu > 0$ the source sequence cannot lie in both $\mathcal{T}_{\mu/2}^n(p^1_X)$ and $\mathcal{T}_{\mu/2}^n(p^2_X)$. Details are omitted.
Consider the type-II error probability at Receiver 1, averaged over all random codebooks. Define the following events for $i \in \{1,2\}$:
$$\mathcal{E}_{\mathrm{Tx},i}(m,\ell):\ \big\{(S^n(i;m,\ell), X^n) \in \mathcal{T}_{\mu/2}^n(p^i_{SX}),\ (T^n, T_i^n, W^n(i;m)) \in \mathcal{T}_{\mu/2}^n(p^i_{TT_iW}),\ W^n(i;m)\ \text{is sent}\big\},$$
$$\mathcal{E}_{\mathrm{Rx},i}(m,\ell):\ \big\{(S^n(i;m,\ell), Y_i^n) \in \mathcal{T}_\mu^n(p^i_{SY_i}),\ (T^n, T_i^n, W^n(i;m), V_i^n) \in \mathcal{T}_\mu^n(p^i_{TT_iWV_i}),\ H_{\operatorname{tp}(S^n(i;m,\ell),\,Y_i^n)}(S|Y_i) = \min_{\tilde{\ell}} H_{\operatorname{tp}(S^n(i;m,\tilde{\ell}),\,Y_i^n)}(S|Y_i)\big\}.$$
Notice that
$$\mathbb{E}_{\mathcal{C}}[\beta_{1,n}] = \Pr\big[\hat{H}_1 = \bar{h}_1 \,\big|\, H = h_1\big] \le \Pr\Big[\bigcup_{m,\ell} \mathcal{E}_{\mathrm{Rx},1}(m,\ell)\ \Big|\ H = h_1\Big].$$
The above probability is upper bounded as:
$$\Pr\Big[\bigcup_{m,\ell} \mathcal{E}_{\mathrm{Rx},1}(m,\ell)\ \Big|\ H = h_1\Big] \le \Pr\Big[\bigcup_{m,\ell} \mathcal{E}_{\mathrm{Rx},1}(m,\ell) \cap \bigcup_{m,\ell} \mathcal{E}_{\mathrm{Tx},1}(m,\ell)\ \Big|\ H = h_1\Big] + \Pr\Big[\bigcup_{m,\ell} \mathcal{E}_{\mathrm{Rx},1}(m,\ell) \cap \bigcap_{m,\ell} \mathcal{E}_{\mathrm{Tx},1}^{c}(m,\ell) \cap \bigcup_{m,\ell} \mathcal{E}_{\mathrm{Tx},2}(m,\ell)\ \Big|\ H = h_1\Big] + \Pr\Big[\bigcup_{m,\ell} \mathcal{E}_{\mathrm{Rx},1}(m,\ell) \cap \bigcap_{m,\ell} \mathcal{E}_{\mathrm{Tx},1}^{c}(m,\ell) \cap \bigcap_{m,\ell} \mathcal{E}_{\mathrm{Tx},2}^{c}(m,\ell)\ \Big|\ H = h_1\Big].$$
The sum of the above probabilities can be further upper bounded by the sum of the probabilities of the following events:
$$\mathcal{B}_1:\ \big\{\exists\, (m,\ell)\ \text{s.t.}\ \mathcal{E}_{\mathrm{Tx},1}(m,\ell)\ \text{and}\ \mathcal{E}_{\mathrm{Rx},1}(m,\ell)\big\},$$
$$\mathcal{B}_2:\ \big\{\exists\, (m,\ell,\ell')\ \text{with}\ \ell \neq \ell'\ \text{s.t.}\ \mathcal{E}_{\mathrm{Tx},1}(m,\ell)\ \text{and}\ \mathcal{E}_{\mathrm{Rx},1}(m,\ell')\big\},$$
$$\mathcal{B}_3:\ \big\{\exists\, (m,m',\ell,\ell')\ \text{with}\ m \neq m'\ \text{s.t.}\ \mathcal{E}_{\mathrm{Tx},1}(m,\ell)\ \text{and}\ \mathcal{E}_{\mathrm{Rx},1}(m',\ell')\big\},$$
$$\mathcal{B}_4:\ \big\{\forall\, (m,\ell)\colon \mathcal{E}_{\mathrm{Tx},1}^{c}(m,\ell),\ \text{and}\ \exists\, (m,m',\ell,\ell')\ \text{s.t.}\ \mathcal{E}_{\mathrm{Tx},2}(m,\ell)\ \text{and}\ \mathcal{E}_{\mathrm{Rx},1}(m',\ell')\big\},$$
$$\mathcal{B}_5:\ \big\{\forall\, (m,\ell)\colon \mathcal{E}_{\mathrm{Tx},1}^{c}(m,\ell)\ \text{and}\ \mathcal{E}_{\mathrm{Tx},2}^{c}(m,\ell),\ \text{and}\ \exists\, (m,\ell)\ \text{s.t.}\ \mathcal{E}_{\mathrm{Rx},1}(m,\ell)\big\}.$$
Thus, we have
$$\mathbb{E}_{\mathcal{C}}[\beta_{1,n}] \le \sum_{i=1}^{5} \Pr[\mathcal{B}_i \mid H = h_1].$$
The probabilities of the events $\mathcal{B}_1$, $\mathcal{B}_2$, $\mathcal{B}_3$ and $\mathcal{B}_5$ can be bounded following steps similar to [15] (Appendix A). This yields:
$$\Pr[\mathcal{B}_1 \mid H = h_1] \le 2^{-n(\theta_{\mu,\mathrm{standard},1} - \delta_1(\mu))},$$
$$\Pr[\mathcal{B}_2 \mid H = h_1] \le 2^{-n(\theta_{\mu,\mathrm{dec},1} - \delta_2(\mu))},$$
$$\Pr[\mathcal{B}_3 \mid H = h_1] \le 2^{-n(\theta_{\mu,\mathrm{dec},1} - \delta_3(\mu))},$$
$$\Pr[\mathcal{B}_5 \mid H = h_1] \le 2^{-n(\theta_{\mu,\mathrm{miss},1} - \delta_5(\mu))},$$
for some functions $\delta_1(\mu)$, $\delta_2(\mu)$, $\delta_3(\mu)$ and $\delta_5(\mu)$ that go to zero as $n \to \infty$ and $\mu \to 0$, and where we define:
$$\theta_{\mu,\mathrm{standard},i} := \min_{\substack{\pi_{SXY_i}:\ |\pi_{SX} - p^i_{SX}| < \mu/2\\ |\pi_{SY_i} - p^i_{SY_i}| < \mu}} D\big(\pi_{SXY_i}\,\big\|\,p^i_{S|X}\,q^i_{XY_i}\big),$$
$$\theta_{\mu,\mathrm{dec},i} := \min_{\substack{\pi_{SXY_i}:\ |\pi_{SX} - p^i_{SX}| < \mu/2,\ |\pi_{Y_i} - p^i_{Y_i}| < \mu\\ H_{p^i}(S|Y_i) \le H_{\pi}(S|Y_i)}} D\big(\pi_{SXY_i}\,\big\|\,p^i_{S|X}\,q^i_{XY_i}\big) - I_{p^i}(S;X|Y_i) + I_{p^i}(W;V_i|T,T_i),$$
$$\theta_{\mu,\mathrm{miss},i} := D\big(p^i_{Y_i}\,\big\|\,q^i_{Y_i}\big) + \mathbb{E}_{p_T}\big[D\big(p^i_{V_i|T}\,\big\|\,\Gamma_{V_i|W=T}\big)\big] - I_{p^i}(S;X|Y_i) + I_{p^i}(W;V_i|T,T_i).$$
Consider event B 4 :
Pr B 4 | H = h 1 m , m , Pr [ ( S n ( 2 ; m , ) , X n ) T μ / 2 n ( p S X 2 ) , ( T n , W n ( 2 ; m ) ) T μ / 2 n ( p T W 2 ) , W n ( 2 ; m ) is   sent , ( S n ( 1 ; m , ) , Y 1 n ) T μ n ( p S Y 1 1 ) , ( T n , T 1 n , W n ( 1 ; m ) , V 1 n ) T μ n ( p T T 1 W V 1 1 ) H tp ( S n ( 1 ; m , ) , Y 1 n ) ( S | Y 1 ) = min ˜ H tp ( S n ( 1 ; m , ˜ ) , Y 1 n ) ( S | Y 1 ) | H = h 1 ] ( a ) m , m , Pr [ ( S n ( 2 ; m , ) , X n ) T μ / 2 n ( p S X 2 ) , ( S n ( 1 ; m , ) , Y 1 n ) T μ n ( p S Y 1 1 ) , H tp ( S n ( 1 ; m , ) , Y 1 n ) ( S | Y 1 ) = min ˜ H tp ( S n ( 1 ; m , ˜ ) , Y 1 n ) ( S | Y 1 ) | H = h 1 ] · Pr [ ( T n , T 1 n , W n ( 1 ; m ) , V 1 n ) T μ n ( p T T 1 W V 1 1 ) , ( T n , W n ( 2 ; m ) ) T μ / 2 n ( p T W 2 ) | W n ( 2 ; m ) is   sent , H = h 1 ] ( b ) 2 n ( R 1 + R 1 + R 2 + R 2 ) · max π S S X Y 1 : | π S X p S X 2 | < μ / 2 | π S Y 1 p S Y 1 1 | < μ H π ( S | Y 1 ) H π ( S | Y 1 ) 2 n D π S S X Y 1 p S 2 p S 1 q X Y 1 1 μ · max π T T 1 W W V 1 : | π T W p T W 2 | < μ / 2 | π T T 1 W V 1 p T T 1 W V 1 1 | < μ 2 n D π T T 1 W W V 1 p T p T 1 | T 1 p W | T T 1 1 p W | T 2 Γ V 1 | W μ ,
where ( a ) holds because the channel code is drawn independently of the source code and ( b ) holds by Sanov’s theorem.
Define
θ ˜ μ , cross , 1 : = min π S S X Y 1 : | π S X p S X 2 | < μ / 2 | π S Y 1 p S Y 1 1 | < μ H π ( S | Y 1 ) H π ( S | Y 1 ) D π S S X Y 1 p S 2 p S 1 q X Y 1 1 + min π T T 1 W W V 1 : | π T W p T W 2 | < μ / 2 | π T T 1 W V 1 p T T 1 W V 1 1 | < μ D π T T 1 W W V 1 p T p T 1 | T 1 p W | T T 1 1 p W | T 2 Γ V 1 | W R 1 R 2 R 1 R 2 2 μ ,
and notice that
θ ˜ μ , cross , 1 = ( ( 37 ) & ( 38 ) ) min π S S X Y 1 : | π S X p S X 2 | < μ / 2 | π S Y 1 p S Y 1 1 | < μ H π ( S | Y 1 ) H π ( S | Y 1 ) D π S S X Y 1 p S 2 p S 1 q X Y 1 1 + min π T T 1 W W V 1 : | π T W p T W 2 | < μ | π T T 1 W V 1 p T T 1 W V 1 1 | < μ D π T T 1 W W V 1 p T p T 1 | T 1 p W | T T 1 1 p W | T 2 Γ V 1 | W I p 1 ( S ; X ) I p 2 ( S ; X ) 4 μ = ( c ) min π S S X Y 1 : | π S X q S X 1 | < μ / 2 | π S Y 1 p S Y 1 1 | < μ H π ( S | Y 1 ) H π ( S | Y 1 ) D π S S X Y 1 q S 1 p S 1 q X Y 1 1 + min π T T 1 W W V 1 : | π T W q T W 1 | < μ | π T T 1 W V 1 p T T 1 W V 1 1 | < μ D π T T 1 W W V 1 p T p T 1 | T 1 p W | T T 1 1 q W | T 1 Γ V 1 | W I p 1 ( S ; X ) I q 1 ( S ; X ) 4 μ = ( CR ) min π S S X Y 1 : | π S X q S X 1 | < μ / 2 | π S Y 1 p S Y 1 1 | < μ H π ( S | Y 1 ) H π ( S | Y 1 ) D π S X Y 1 q S | X 1 q X Y 1 1 + E π S X Y 1 D ( π S | S X Y 1 p S 1 ) I p 1 ( S ; X ) + min π T T 1 W W V 1 : | π T W q T W 1 | < μ | π T T 1 W V 1 p T T 1 W V 1 1 | < μ [ D ( π T T 1 W W p T T 1 1 p W | T T 1 1 q W | T 1 ) + E T T 1 W W D ( π V 1 | T T 1 W W π V 1 | T T 1 ) + D ( π V 1 | T T 1 Γ V 1 | W ) ] 4 μ ( DP ) min π S S X Y 1 : | π S X q S X 1 | < μ / 2 | π S Y 1 p S Y 1 1 | < μ H π ( S | Y 1 ) H π ( S | Y 1 ) D π S X Y 1 q S | X 1 q X Y 1 1 + E π Y 1 D ( π S | Y 1 p S 1 ) I p 1 ( S ; X ) + min π T T 1 W W V 1 : | π T W q T W 1 | < μ | π T T 1 W V 1 p T T 1 W V 1 1 | < μ [ E π T T 1 W D ( π V 1 | T T 1 W π V 1 | T T 1 ) + D ( π V 1 | T T 1 Γ V 1 | W ) 4 μ = ( d ) min π S X Y 1 : | π Y 1 p Y 1 1 | < μ H p 1 ( S | Y 1 ) H π ( S | Y 1 ) E q X S 1 D π Y 1 | X S q Y 1 | X 1 + I p 1 ( S ; Y 1 ) I p 1 ( S ; X ) + I p 1 ( V 1 ; W | T , T 1 ) + min π T T 1 W V 1 : | π T W q T W 1 | < μ | π T T 1 V 1 p T T 1 V 1 1 | < μ E π T T 1 W D ( p V 1 | T T 1 1 Γ V 1 | W ) δ 3 ( μ ) = θ μ , cross , 1 δ 4 ( μ )
for a function $\delta_4(\mu)$ that goes to zero as $\mu \to 0$, and
$$\theta_{\mu,\mathrm{cross},1} := \min_{\substack{\pi_{SXY_1}:\ |\pi_{Y_1} - p^1_{Y_1}| < \mu\\ H_{p^1}(S|Y_1) \le H_{\pi}(S|Y_1)}} \mathbb{E}_{q^1_{XS}}\big[D\big(\pi_{Y_1|XS}\,\big\|\,q^1_{Y_1|X}\big)\big] + I_{p^1}(S;Y_1) - I_{p^1}(S;X) + I_{p^1}(V_1;W|T,T_1) + \min_{\substack{\pi_{TT_1WV_1}:\ |\pi_{TW} - q^1_{TW}| < \mu\\ |\pi_{TT_1V_1} - p^1_{TT_1V_1}| < \mu}} \mathbb{E}_{\pi_{TT_1W}}\big[D\big(p^1_{V_1|TT_1}\,\big\|\,\Gamma_{V_1|W}\big)\big].$$
Here, $(c)$ holds because the condition $p^1_X \neq p^2_X$ implies that $h_1 = \bar{h}_2$ and thus $p^2 = q^1$, and $(d)$ holds by the constraints in the minimizations.
Combining (A20), (A21) and (A22) establishes:
$$\Pr[\mathcal{B}_4 \mid H = h_1] \le 2^{-n(\theta_{\mu,\mathrm{cross},1} - \delta_3(\mu))}.$$
Considering (A12)–(A16) and (A24), we get:
$$\mathbb{E}_{\mathcal{C}}[\beta_{1,n}] \le \max\big\{2^{-n(\theta_{\mu,\mathrm{standard},1} - \delta_1(\mu))},\ 2^{-n(\theta_{\mu,\mathrm{dec},1} - \delta_2(\mu))},\ 2^{-n(\theta_{\mu,\mathrm{cross},1} - \delta_3(\mu))},\ 2^{-n(\theta_{\mu,\mathrm{miss},1} - \delta_4(\mu))}\big\}.$$
By standard arguments, successively eliminating the worst half of the codewords with respect to $\alpha_{1,n}$ and the exponents $\theta_{\mu,\mathrm{standard},1}$, $\theta_{\mu,\mathrm{dec},1}$, $\theta_{\mu,\mathrm{cross},1}$ and $\theta_{\mu,\mathrm{miss},1}$, it can be shown that there exists at least one codebook for which
$$\alpha_{1,n} < \epsilon,$$
$$\beta_{1,n} \le 32\cdot\max\big\{2^{-n(\theta_{\mu,\mathrm{standard},1} - \delta_1(\mu))},\ 2^{-n(\theta_{\mu,\mathrm{dec},1} - \delta_2(\mu))},\ 2^{-n(\theta_{\mu,\mathrm{cross},1} - \delta_3(\mu))},\ 2^{-n(\theta_{\mu,\mathrm{miss},1} - \delta_4(\mu))}\big\}.$$
Letting $\mu \to 0$ and $n \to \infty$, we get $\theta_{\mu,\mathrm{standard},1} \to \theta_{\mathrm{standard},1}$, $\theta_{\mu,\mathrm{dec},1} \to \theta_{\mathrm{dec},1}$, $\theta_{\mu,\mathrm{cross},1} \to \theta_{\mathrm{cross},1}$ and $\theta_{\mu,\mathrm{miss},1} \to \theta_{\mathrm{miss},1}$. A similar bound can be found for $\theta_2$. This concludes the proof.

References

  1. Ahlswede, R.; Csiszár, I. Hypothesis testing with communication constraints. IEEE Trans. Inf. Theory 1986, 32, 533–542.
  2. Han, T.S. Hypothesis testing with multiterminal data compression. IEEE Trans. Inf. Theory 1987, 33, 759–772.
  3. Shimokawa, H.; Han, T.; Amari, S.I. Error bound for hypothesis testing with data compression. In Proceedings of the 1994 IEEE International Symposium on Information Theory, Trondheim, Norway, 27 June–1 July 1994; p. 114.
  4. Shimokawa, H. Hypothesis Testing with Multiterminal Data Compression. Master’s Thesis, University of Tokyo, Tokyo, Japan, 1994.
  5. Weinberger, N.; Kochman, Y. On the reliability function of distributed hypothesis testing under optimal detection. IEEE Trans. Inf. Theory 2019, 65, 4940–4965.
  6. Rahman, M.S.; Wagner, A.B. On the optimality of binning for distributed hypothesis testing. IEEE Trans. Inf. Theory 2012, 58, 6282–6303.
  7. Zhao, W.; Lai, L. Distributed testing against independence with multiple terminals. In Proceedings of the 2014 52nd Annual Allerton Conference on Communication, Control, and Computing (Allerton), Monticello, IL, USA, 30 September–3 October 2014; pp. 1246–1251.
  8. Xiang, Y.; Kim, Y.H. Interactive hypothesis testing against independence. In Proceedings of the 2013 IEEE International Symposium on Information Theory, Istanbul, Turkey, 7–12 July 2013; pp. 2840–2844.
  9. Katz, G.; Piantanida, P.; Debbah, M. Collaborative distributed hypothesis testing with general hypotheses. In Proceedings of the 2016 IEEE International Symposium on Information Theory (ISIT), Barcelona, Spain, 10–15 July 2016; pp. 1705–1709.
  10. Salehkalaibar, S.; Wigger, M.; Timo, R. On hypothesis testing against independence with multiple decision centers. IEEE Trans. Commun. 2018, 66, 2409–2420.
  11. Salehkalaibar, S.; Wigger, M.; Wang, L. Hypothesis testing over the two-hop relay network. IEEE Trans. Inf. Theory 2019, 65, 4411–4433.
  12. Escamilla, P.; Wigger, M.; Zaidi, A. Distributed hypothesis testing with concurrent detection. In Proceedings of the 2018 IEEE International Symposium on Information Theory (ISIT), Vail, CO, USA, 17–22 June 2018.
  13. Escamilla, P.; Wigger, M.; Zaidi, A. Distributed hypothesis testing: Cooperation and concurrent detection. IEEE Trans. Inf. Theory 2020, 66, 7550–7564.
  14. Tian, C.; Chen, J. Successive refinement for hypothesis testing and lossless one-helper problem. IEEE Trans. Inf. Theory 2008, 54, 4666–4681.
  15. Salehkalaibar, S.; Wigger, M. Distributed hypothesis testing based on unequal error protection codes. IEEE Trans. Inf. Theory 2020, 66, 4150–4182.
  16. Sreekumar, S.; Gündüz, D. Distributed hypothesis testing over discrete memoryless channels. IEEE Trans. Inf. Theory 2020, 66, 2044–2066.
  17. Borade, S.; Nakiboglu, B.; Zheng, L. Unequal error protection: An information-theoretic perspective. IEEE Trans. Inf. Theory 2009, 55, 5511–5539.
  18. El Gamal, A.; Kim, Y.H. Network Information Theory; Cambridge University Press: Cambridge, UK, 2011.
Figure 1. Hypothesis testing over a noisy BC.
Figure 2. Coding and testing scheme for hypothesis testing over a DMC.
Figure 3. Coding and testing scheme for hypothesis testing over a BC.