Article

On the Rate-Distortion Function of Sampled Cyclostationary Gaussian Processes

1 Department of Electrical and Computer Engineering, Ben-Gurion University, Be’er-Sheva 8410501, Israel
2 Faculty of Mathematics and Computer Science, Weizmann Institute of Science, Rehovot 7610001, Israel
* Author to whom correspondence should be addressed.
Submission received: 14 February 2020 / Revised: 8 March 2020 / Accepted: 10 March 2020 / Published: 17 March 2020
(This article belongs to the Special Issue Wireless Networks: Information Theoretic Perspectives)

Abstract
Man-made communications signals are typically modelled as continuous-time (CT) wide-sense cyclostationary (WSCS) processes. As modern processing is digital, it is applied to discrete-time (DT) processes obtained by sampling the CT processes. When sampling is applied to a CT WSCS process, the statistics of the resulting DT process depend on the relationship between the sampling interval and the period of the statistics of the CT process: when these two parameters are commensurate, i.e., their ratio is a rational number, the DT process is WSCS. This situation is referred to as synchronous sampling. When this is not the case, which is referred to as asynchronous sampling, the resulting DT process is wide-sense almost cyclostationary (WSACS). The sampled CT processes are commonly encoded using a source code to facilitate storage or transmission over wireless networks, e.g., using compress-and-forward relaying. In this work, we study the fundamental tradeoff between rate and distortion for source codes applied to sampled CT WSCS processes, characterized via the rate-distortion function (RDF). We note that while the RDF characterization for the case of synchronous sampling directly follows from classic information-theoretic tools utilizing ergodicity and the law of large numbers, when sampling is asynchronous, the resulting process is not information stable. In such cases, the commonly used information-theoretic tools are inapplicable to RDF analysis, which poses a major challenge. Using the information-spectrum framework, we show that the RDF for asynchronous sampling in the low distortion regime can be expressed as the limit superior of a sequence of RDFs in which each element corresponds to the RDF of a synchronously sampled WSCS process (yet their limit is not guaranteed to exist). The resulting characterization allows us to introduce novel insights on the relationship between sampling synchronization and the RDF. For example, we demonstrate that, differently from stationary processes, small differences in the sampling rate and the sampling time offset can notably affect the RDF of sampled CT WSCS processes.

1. Introduction

Man-made signals are typically generated using a repetitive procedure, which takes place at fixed intervals. The resulting signals are thus commonly modeled as continuous-time (CT) random processes exhibiting periodic statistical properties [1,2,3], which are referred to as wide-sense cyclostationary (WSCS) processes. In digital communications, where the transmitted waveforms commonly obey the WSCS model [3], the received CT signal is first sampled to obtain a discrete-time (DT) received signal. In the event that the sampling interval is commensurate with the period of the statistics of the CT WSCS signal, cyclostationarity is preserved in DT ([3] Section 3.9). In this work, we refer to this situation as synchronous sampling. However, it is practically common to encounter scenarios in which the sampling rate at the receiver and the symbol rate of the received CT WSCS process are incommensurate, which is referred to as asynchronous sampling. The resulting sampled process in such cases is a DT wide-sense almost cyclostationary (WSACS) stochastic process ([3] Section 3.9).
This research aims at investigating lossy source coding for asynchronously sampled CT WSCS processes. In the source coding problem, every sequence of information symbols from the source is mapped into a sequence of code symbols, referred to as codewords, taken from a predefined codebook. In lossy source coding, the source sequence is recovered up to a predefined distortion constraint, within an arbitrary small tolerance of error. The figure-of-merit for lossy source coding is the rate-distortion function (RDF) which characterizes the minimum number of bits per source symbol required to compress the source sequence such that it can be reconstructed at the decoder within the specified maximal distortion [4]. For an independent and identically distributed (IID) random source process, the RDF can be expressed as the minimum mutual information between the source variable and the reconstruction variable, such that with the corresponding conditional distribution of the reconstruction symbol given the source symbol, the distortion constraint is satisfied ([5] Chapter 10). The source coding problem has been further studied in multiple different scenarios, including the reconstruction of a single source at multiple destinations [6] and the reconstruction of multiple correlated stationary Gaussian sources at a single destination [7,8,9].
For sampled stationary source processes, ergodicity theory and the asymptotic equipartition property (AEP) ([5] Chapter 3) were utilized for characterizing the RDF in different scenarios ([10] Chapter 9), ([4] Section I), [11]. However, in a broad range of applications, including digital communications networks, the CT signals are WSCS processes, and the sampling operation results in DT source signals whose statistics depend on the relationship between the sampling rate and the period of the statistics of the source signal. When sampling is synchronous, the resulting DT source signal is WSCS ([3] Section 3.9). The RDF for lossy compression of DT WSCS Gaussian sources with memory was studied in [12]. The work [12] used the fact that any WSCS signal can be transformed into a set of stationary subprocesses [2], thereby facilitating the application of information-theoretic results obtained for multivariate stationary sources to the derivation of the RDF. Nonetheless, in many digital communications scenarios, the sampling rate and the symbol rate of the CT WSCS process are not related in any way, and are possibly incommensurate, resulting in a sampled process which is a DT WSACS stochastic process. Such situations can occur as a result of the a-priori determined values of the sampling interval and the symbol duration of the WSCS source signal, as well as due to sampling clock jitter resulting from hardware impairments. A comprehensive review of trends and applications for almost cyclostationary signals can be found in [13]. Despite their frequent occurrence, the RDF for lossy compression of WSACS sources has not been characterized, which is the motivation for the current research. A major challenge associated with characterizing fundamental limits for asynchronously sampled WSCS processes stems from the fact that the resulting processes are not information stable, in the sense that their conditional distributions are not ergodic ([14] Page X), [15,16]. As a result, the standard information-theoretic tools cannot be employed, making the characterization of the RDF a very challenging problem.
Our recent study in [17] on channel coding reveals that for the case of additive CT WSCS Gaussian noise, capacity varies significantly with sampling rates, whether the Nyquist criterion is satisfied or not. In particular, it was observed that the capacity can change dramatically with minor variations in the sampling rate, causing it to switch from synchronous sampling to asynchronous sampling. This is in direct contrast to the results obtained for wide-sense stationary noise for which the capacity remains unchanged for any sampling rate above the Nyquist rate [18]. A natural fundamental question that arises from this result is how the RDF of a sampled Gaussian source process varies with the sampling rate. As a motivating example, one may consider compress-and-forward (CF) relaying, in which the relay samples at a rate which can be incommensurate with the symbol rate of the incoming communications signal.
In this work, we employ the information-spectrum framework [14] for characterizing the RDF of asynchronously sampled memoryless Gaussian WSCS processes, as this framework is applicable to the information-theoretic analysis of processes which are not information stable ([14] Page VII). We further note that while rate characterizations obtained using information-spectrum tools and the associated quantities may be difficult to evaluate ([14] Remark 1.7.3), here we obtain a numerically computable characterization of the RDF. In particular, we focus on the mean squared error (MSE) distortion measure in the low distortion regime, namely, source codes for which the average square of the difference between the source and the reproduction process is not larger than the minimal source variance. The results of this research facilitate accurate modelling of signal compression in current and future digital communications systems. The derived RDF, which characterizes the fundamental performance limits in encoding sampled CT WSCS Gaussian processes into a digital representation, makes it possible to evaluate source coding schemes associated with different levels of complexity in terms of their gap from optimality, when applied to this important class of signals.
Furthermore, we utilize our characterization of the RDF to examine how the RDF for a sampled CT WSCS Gaussian source varies with different sampling rates and sampling time offsets. We demonstrate that, differently from stationary sources, when applying a lossy source code to a sampled WSCS process, the achievable rate-distortion tradeoff can be significantly affected by minor variations in the sampling time offset and the sampling rate. Our results thus allow identifying the sampling rate and sampling time offsets which minimize the RDF in systems involving sampled WSCS processes.
The rest of this work is organised as follows: Section 2 provides the scientific background on cyclostationary processes and on rate-distortion analysis of DT WSCS Gaussian sources. Section 3 presents the problem formulation and auxiliary results, and Section 4 details the main result, namely, the RDF characterization for sampled WSCS Gaussian processes. Numerical examples and discussions are provided in Section 5, and Section 6 concludes the paper.

2. Preliminaries and Background

In the following we review the main tools and framework used in this work: In Section 2.1 we detail the notations, and in Section 2.2 we review the basics of cyclostationary processes and the statistical properties of a DT process resulting from sampling a CT WSCS process. In Section 2.3 we recall some preliminaries of rate-distortion theory as well as the RDF for DT WSCS Gaussian source processes. This background creates a premise for the statement of the main result provided in Section 4 of this paper.

2.1. Notations

In this paper, random vectors are denoted by boldface uppercase letters, e.g., $\mathbf{X}$; boldface lowercase letters denote deterministic column vectors, e.g., $\mathbf{x}$. Scalar RVs and deterministic values are denoted via standard uppercase and lowercase fonts, respectively, e.g., $X$ and $x$. Scalar random processes are denoted with $X(t)$, $t \in \mathbb{R}$, for CT and with $X[n]$, $n \in \mathbb{Z}$, for DT. Uppercase sans-serif fonts represent matrices, e.g., $\mathsf{A}$, and the element at the $i$th row and the $l$th column of $\mathsf{A}$ is denoted with $(\mathsf{A})_{i,l}$. We use $|\cdot|$ to denote the absolute value, $\lfloor d \rfloor$, $d \in \mathbb{R}$, to denote the floor function, and $[d]^{+}$, $d \in \mathbb{R}$, to denote $\max\{0, d\}$. $\delta[\cdot]$ denotes the Kronecker delta function: $\delta[n] = 1$ for $n = 0$ and $\delta[n] = 0$ otherwise, and $\mathbb{E}\{\cdot\}$ denotes the stochastic expectation. The sets of positive integers, integers, rational numbers, real numbers, positive real numbers, and complex numbers are denoted by $\mathbb{N}$, $\mathbb{Z}$, $\mathbb{Q}$, $\mathbb{R}$, $\mathbb{R}^{++}$, and $\mathbb{C}$, respectively. The cumulative distribution function (CDF) is denoted by $F_X(x) \triangleq \Pr(X \le x)$ and the probability density function (PDF) of a CT random variable (RV) is denoted by $p_X(x)$. We represent a real Gaussian distribution with mean $\mu$ and variance $\sigma^2$ by the notation $\mathcal{N}(\mu, \sigma^2)$. All logarithms are taken to base 2, and $j = \sqrt{-1}$. Lastly, for any sequence $y[i]$, $i \in \mathbb{N}$, and positive integer $k \in \mathbb{N}$, $y^{(k)}$ denotes the column vector $(y[1], \ldots, y[k])^T$.

2.2. Wide-Sense Cyclostationary Random Processes

Here, we review some preliminaries from the theory of cyclostationarity. We begin by recalling the definition of wide-sense cyclostationary processes for CT and for DT:
Definition 1
(CT wide-sense cyclostationary processes ([3] Section 3.2.1)). A scalar stochastic process $\{S(t)\}_{t \in \mathbb{R}}$ is called WSCS if both its first-order and its second-order moments are periodic with respect to $t \in \mathbb{R}$ with some period $T_p \in \mathbb{R}$.
Definition 2
(DT wide-sense cyclostationary processes ([2] Section 17.2)). A scalar stochastic process $\{S[n]\}_{n \in \mathbb{Z}}$ is called WSCS if both its first-order and its second-order moments are periodic with respect to $n \in \mathbb{Z}$ with some period $N_p \in \mathbb{Z}$.
WSCS signals are thus random processes whose first-order and second-order moments are periodic functions with the same period. To define WSACS signals, we first recall the definition of almost-periodic functions:
Definition 3
(Almost-periodic functions ([19] Definition 2.1)). A DT function $x[n]$, $n \in \mathbb{Z}$, is called an almost-periodic function if for every $\epsilon > 0$ there exists a number $l(\epsilon) \in \mathbb{N}$ with the property that for any $n \in \mathbb{Z}$ and any $\alpha \in \mathbb{Z}$ there exists $\Delta \in [\alpha, \alpha + l(\epsilon)]$ such that
$$\left| x[n + \Delta] - x[n] \right| < \epsilon.$$
Definition 4
(DT wide-sense almost-cyclostationary processes ([2] Section 17.2)). A scalar stochastic process $\{S[n]\}_{n \in \mathbb{Z}}$ is called WSACS if its first-order and its second-order moments are almost-periodic functions with respect to $n \in \mathbb{Z}$.
Remark 1.
Note that when the mean and the autocorrelation function are each periodic with periods which are incommensurate, the resulting process is WSACS. We note that in many practical cases the mean is zero, see, e.g., ([2] Section 17.2); hence, the classification of the process is determined by the periodicity of the autocorrelation function.
The DT WSCS model is commonly used in the communications literature, as it facilitates the analysis of many problems of interest, such as fundamental rate limits analysis [20,21,22], channel identification [23], synchronization [24], and noise mitigation [25]. However, in many scenarios, the considered signals are WSACS rather than WSCS. To see how the WSACS model arises in the context of sampled signals, we briefly recall the discussion in [17] on sampled WSCS processes (please refer to ([17] Section II.B) for more details): Consider a CT WSCS random process $S(t)$, which is sampled uniformly with a sampling interval of $T_s$ and a sampling time offset $\phi$, resulting in the DT random process $S[i] = S(i \cdot T_s + \phi)$. It is well known that, contrary to stationary processes, whose statistical characteristics are time-invariant, the statistics of sampled WSCS processes are significantly affected by the values of $T_s$ and $\phi$ ([17] Section II.B). To demonstrate this point, consider a CT WSCS process with variance $\sigma_s^2(t) = \frac{1}{2} \sin(2 \pi t / T_{\mathrm{sym}}) + 2$ for some $T_{\mathrm{sym}} > 0$. The sampled process for $\phi = 0$ (no sampling time offset) and $T_s = \frac{T_{\mathrm{sym}}}{3}$ has a variance function whose period is $N_p = 3$: $\sigma_s^2(i T_s) = \{2, 2.433, 1.567, 2, 2.433, 1.567, \ldots\}$ for $i = 0, 1, 2, 3, 4, 5, \ldots$; the DT process obtained with the same sampling interval and a sampling time offset of $\phi = \frac{T_s}{2\pi}$ also has a periodic variance with $N_p = 3$, but with the values $\sigma_s^2(i T_s + \phi) = \{2.155, 2.335, 1.510, 2.155, 2.335, 1.510, \ldots\}$ for $i = 0, 1, 2, 3, 4, 5, \ldots$, which differ from the values of the DT variance for $\phi = 0$. It follows that both variances are periodic in discrete time with the same period $N_p = 3$, although with different values within the period as a result of the sampling time offset; yet both DT processes correspond to instances of synchronous sampling. Lastly, consider the sampled variance obtained by sampling without a time offset (i.e., $\phi = 0$) at a sampling interval of $T_s = \left(1 + \frac{1}{2\pi}\right) \frac{T_{\mathrm{sym}}}{3}$. In this case, $T_s$ is not an integer divisor of $T_{\mathrm{sym}}$ or of any of its integer multiples (i.e., $\frac{T_{\mathrm{sym}}}{T_s} = \frac{6\pi}{2\pi + 1} = 2 + \frac{2\pi - 2}{2\pi + 1} \triangleq 2 + \epsilon$, where $\epsilon \notin \mathbb{Q}$ and $\epsilon \in [0, 1)$), resulting in the variance values $\sigma_s^2(i T_s) = \{2, 2.335, 1.5027, 2.405, 1.896, 1.75, \ldots\}$ for $i = 0, 1, 2, 3, 4, 5$. In this scenario, the DT variance is not periodic but almost-periodic, corresponding to asynchronous sampling, and the resulting DT process is not WSCS but WSACS ([3] Section 3.2). The example above demonstrates that the statistical properties of sampled WSCS processes depend on the sampling rate and the sampling time offset, implying that the RDF of such processes should also depend on these quantities, as we demonstrate in the sequel.
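The following minimal sketch (our illustration, not code from the paper; the unit period $T_{\mathrm{sym}} = 1$ and the function names are our own choices) reproduces the variance sequences of this example numerically:

```python
# A minimal numerical sketch (ours) of the example above: the CT variance is
# sigma_s^2(t) = 0.5*sin(2*pi*t/T_sym) + 2, and we inspect the DT variance
# sequence sigma_s^2(i*T_s + phi) for different T_s and phi.
import numpy as np

T_sym = 1.0  # period of the CT statistics (arbitrary units)

def ct_variance(t):
    return 0.5 * np.sin(2 * np.pi * t / T_sym) + 2

def dt_variance(T_s, phi, num_samples):
    """Variance of the sampled process S[i] = S_c(i*T_s + phi)."""
    i = np.arange(num_samples)
    return ct_variance(i * T_s + phi)

# Synchronous sampling (T_sym/T_s = 3): the DT variance is periodic with N_p = 3.
print(np.round(dt_variance(T_sym / 3, 0.0, 6), 3))
# -> [2.    2.433 1.567 2.    2.433 1.567]
print(np.round(dt_variance(T_sym / 3, (T_sym / 3) / (2 * np.pi), 6), 3))
# -> same period N_p = 3, but different values within the period

# Asynchronous sampling (T_sym/T_s irrational): the DT variance is only
# almost-periodic, so the sampled process is WSACS rather than WSCS.
print(np.round(dt_variance((1 + 1 / (2 * np.pi)) * T_sym / 3, 0.0, 6), 3))
```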

2.3. The Rate-Distortion Function for DT WSCS Processes

In this subsection we review the source coding problem and the existing results on the RDF of WSCS processes. We begin by recalling the definition of a source coding scheme, see, e.g., ([26] Chapter 30), ([5] Chapter 10):
Definition 5
(Source coding scheme). A source coding scheme with blocklength l consists of (see Figure 1):
1. 
An encoder $f_S$ which maps a block of $l$ source samples $\{S[i]\}_{i=1}^{l}$ into an index from a set of $M = 2^{lR}$ indexes, $f_S : \{S[i]\}_{i=1}^{l} \mapsto \{1, 2, \ldots, M\}$.
2. 
A decoder $g_S$ which maps the received index into a reconstructed sequence of length $l$, $\{\hat{S}[i]\}_{i=1}^{l}$, $g_S : \{1, 2, \ldots, M\} \mapsto \{\hat{S}[i]\}_{i=1}^{l}$.
The encoder-decoder pair is referred to as an $(R, l)$ source code, where $R$ is the rate of the code in bits per source symbol, defined as:
$$R = \frac{1}{l} \log_2 M. \quad (1)$$
The RDF characterizes the minimal average number of bits per source symbol, denoted $R(D)$, that can be used to encode a source process such that it can be reconstructed from its encoded representation with a recovery distortion not larger than $D > 0$ ([5] Section 10.2). In the current work, we use the MSE distortion measure, under which the distortion due to decoding a source symbol $S$ into $\hat{S}$ is $d(S, \hat{S}) = (S - \hat{S})^2$. The distortion for a sequence of source samples $S^{(l)}$ decoded into a reproduction sequence $\hat{S}^{(l)}$ is given by $d(S^{(l)}, \hat{S}^{(l)}) = \frac{1}{l} \sum_{i=1}^{l} (S[i] - \hat{S}[i])^2$, and the average distortion in decoding a random source sequence $S^{(l)}$ into a random reproduction sequence $\hat{S}^{(l)}$ is defined as:
$$\bar{d}\left(S^{(l)}, \hat{S}^{(l)}\right) \triangleq \mathbb{E}\left\{ d\left(S^{(l)}, \hat{S}^{(l)}\right) \right\} = \frac{1}{l} \sum_{i=1}^{l} \mathbb{E}\left\{ \left(S[i] - \hat{S}[i]\right)^2 \right\}, \quad (2)$$
where the expectation in Equation (2) is taken with respect to the joint probability distribution of the source $S[i]$ and its reproduction $\hat{S}[i]$. Using Definition 5, we can now define an achievable rate-distortion pair for a source $S[i]$, as stated in the following definition ([10] Page 471):
Definition 6
(Achievable rate-distortion pair). A rate-distortion pair $(R, D)$ is achievable for a process $\{S[i]\}_{i \in \mathbb{N}}$ if for any $\eta > 0$ and for all sufficiently large $l$ it is possible to construct an $(R_s, l)$ source code such that
$$R_s \le R + \eta, \quad (3)$$
and
$$\bar{d}\left(S^{(l)}, \hat{S}^{(l)}\right) \le D + \eta. \quad (4)$$
Definition 7.
The rate-distortion function $R(D)$ is defined as the infimum of all achievable rates $R$ for a given maximum allowed distortion $D$.
Definition 6 defines a rate-distortion pair to be achievable if the rate and the distortion constraints are satisfied using source codes with any sufficiently large blocklength. In the following lemma, which will be used to characterize the RDF of DT WSCS signals, we state that it is sufficient to consider only source codes whose blocklength is an integer multiple of some fixed positive integer:
Lemma 1.
Consider the process $\{S[i]\}_{i \in \mathbb{N}}$ with a finite and bounded variance. For a given maximum allowed distortion $D$, the optimal reproduction process $\{\hat{S}[i]\}_{i \in \mathbb{N}}$ is also the optimal reproduction process when restricted to using source codes whose blocklengths are integer multiples of some fixed positive integer $r$.
Proof. 
The proof of the lemma is detailed in Appendix A. □
This lemma facilitates switching between multivariate and scalar representations of the source and the reproduction processes.
The RDF obviously depends on the distribution of the source { S [ i ] } i N . Thus, statistically different sources have different RDFs. However, when a source is scaled by some positive constant, the RDF of the scaled process with the MSE distortion can be inferred from that of the original source process, as stated in the following theorem:
Theorem 1.
Let $\{S[i]\}_{i \in \mathbb{N}}$ be a source process for which the rate-distortion pair $(R, D)$ is achievable under the MSE distortion. Then, for every $\alpha \in \mathbb{R}^{++}$, it holds that the rate-distortion pair $(R, \alpha^2 \cdot D)$ is achievable for the scaled source $\{\alpha \cdot S[i]\}_{i \in \mathbb{N}}$.
Proof. 
The proof of the theorem is detailed in Appendix B. □
Lastly, in the proof of our main result, we make use of the RDF for DT WSCS sources derived in ([12] Theorem 1), repeated below for ease of reference. Prior to the statement of the theorem, we recall that for blocklengths which are integer multiples of $N_p$, a WSCS process $S[i]$ with period $N_p > 0$ can be represented as an equivalent $N_p$-dimensional stationary process $\mathbf{S}^{(N_p)}[i]$ via the decimated component decomposition (DCD) ([2] Section 17.2). The power spectral density (PSD) of the process $\mathbf{S}^{(N_p)}[i]$ is defined as ([12] Section II):
$$\left( \rho_S\left(e^{j 2 \pi f}\right) \right)_{u,v} = \sum_{\Delta \in \mathbb{Z}} \left( \mathsf{R}_S[\Delta] \right)_{u,v} e^{-j 2 \pi f \Delta}, \quad -\frac{1}{2} \le f \le \frac{1}{2}, \quad u, v \in \{1, 2, \ldots, N_p\}, \quad (5)$$
where $\mathsf{R}_S[\Delta] \triangleq \mathbb{E}\left\{ \mathbf{S}^{(N_p)}[i] \cdot \left( \mathbf{S}^{(N_p)}[i + \Delta] \right)^T \right\}$ ([2] Section 17.2). We now proceed to the statement of ([12] Theorem 1):
Theorem 2.
([12] Theorem 1) Consider a zero-mean real DT WSCS Gaussian source $S[i]$, $i \in \mathbb{N}$, with memory, and let $N_p \in \mathbb{N}$ denote the period of its statistics. The RDF is expressed as:
$$R(D) = \frac{1}{2 N_p} \sum_{m=1}^{N_p} \int_{-0.5}^{0.5} \left[ \log \frac{\lambda_m\left(e^{j 2 \pi f}\right)}{\theta} \right]^{+} df, \quad (6a)$$
where $\lambda_m\left(e^{j 2 \pi f}\right)$, $m = 1, 2, \ldots, N_p$, denote the eigenvalues of the PSD matrix of the process $\mathbf{S}^{(N_p)}[i]$, which is obtained from $S[i]$ by applying the $N_p$-dimensional DCD, and $\theta$ is selected such that
$$D = \frac{1}{N_p} \sum_{m=1}^{N_p} \int_{-0.5}^{0.5} \min\left\{ \lambda_m\left(e^{j 2 \pi f}\right), \theta \right\} df. \quad (6b)$$
We note that $\mathbf{S}^{(N_p)}[i]$ corresponds to a vector of stationary processes whose elements are not identically distributed; hence, the variance is different for each scalar stationary component process. Using ([12] Theorem 1), we can directly obtain the RDF for the special case of a DT memoryless WSCS Gaussian process. This is stated in the following corollary:
Corollary 1.
Let $\{S[i]\}_{i \in \mathbb{N}}$ be a zero-mean DT memoryless real WSCS Gaussian source with period $N_p \in \mathbb{N}$, and set $\sigma_m^2 = \mathbb{E}\{S^2[m]\}$ for $m = 1, 2, \ldots, N_p$. The RDF for compression of $S[i]$ is expressed as:
$$R(D) = \begin{cases} \frac{1}{2 N_p} \sum_{m=1}^{N_p} \log\left( \frac{\sigma_m^2}{D_m} \right) & D \le \frac{1}{N_p} \sum_{m=1}^{N_p} \sigma_m^2 \\ 0 & D > \frac{1}{N_p} \sum_{m=1}^{N_p} \sigma_m^2 \end{cases}, \quad (7a)$$
where $D_m \triangleq \min\{\sigma_m^2, \theta\}$, and $\theta$ is defined such that
$$D = \frac{1}{N_p} \sum_{m=1}^{N_p} D_m. \quad (7b)$$
Proof. 
Applying Equations (6a) and (6b) to our specific case of a memoryless WSCS source, we obtain Equations (7a) and (7b) as follows: First, note that the DCD components of a zero-mean memoryless WSCS process are also zero-mean and memoryless; hence, the PSD matrix of the multivariate process $\mathbf{S}^{(N_p)}[i]$ is a diagonal matrix whose eigenvalues are the constant diagonal elements, such that the $m$th diagonal element is equal to the variance $\sigma_m^2$: $\lambda_m\left(e^{j 2 \pi f}\right) = \sigma_m^2$. Now, writing Equation (6a) for this case we obtain:
$$R(D) = \frac{1}{2 N_p} \sum_{m=1}^{N_p} \int_{-0.5}^{0.5} \left[ \log \frac{\lambda_m\left(e^{j 2 \pi f}\right)}{\theta} \right]^{+} df = \frac{1}{2 N_p} \sum_{m=1}^{N_p} \left[ \log \frac{\sigma_m^2}{\theta} \right]^{+}. \quad (8)$$
Since $\left[ \log\left( \frac{\sigma_m^2}{\theta} \right) \right]^{+} = \max\left\{ 0, \log\left( \frac{\sigma_m^2}{\theta} \right) \right\} = \log\left( \frac{\sigma_m^2}{D_m} \right)$, it follows that (8) coincides with (7a). Next, expressing Equation (6b) for the memoryless source process, we obtain:
$$D = \frac{1}{N_p} \sum_{m=1}^{N_p} \int_{-0.5}^{0.5} \min\left\{ \lambda_m\left(e^{j 2 \pi f}\right), \theta \right\} df = \frac{1}{N_p} \sum_{m=1}^{N_p} \min\left\{ \sigma_m^2, \theta \right\}, \quad (9)$$
proving Equation (7b). □
Now, from Lemma 1, we conclude that the RDF for compression of source sequences whose blocklength is an integer multiple of $N_p$ is the same as the RDF for compressing source sequences whose blocklength is arbitrary. We recall that from ([5] Chapter 10.3.3) it follows that for the zero-mean memoryless Gaussian DCD vector source process $\mathbf{S}^{(N_p)}[i]$, the optimal reproduction process which achieves the RDF is an $N_p \times 1$ memoryless process whose covariance matrix is diagonal with non-identical diagonal entries. From [2], we can apply the inverse DCD to obtain a WSCS process. Hence, from Lemma 1 we can conclude that the optimal reproduction process for the DT WSCS Gaussian source is a DT WSCS Gaussian process.
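To make Corollary 1 concrete, the following sketch (our implementation, not code from the paper; the function name and the bisection depth are our choices) computes $R(D)$ of (7a) by numerically solving the reverse waterfilling condition (7b) for $\theta$:

```python
# Hedged sketch (ours) of the reverse waterfilling in Corollary 1: given the
# per-cycle variances sigma_m^2 of a memoryless WSCS Gaussian source and a
# distortion D, find theta via bisection so that
# D = (1/N_p) * sum_m min{sigma_m^2, theta}, then evaluate R(D).
import numpy as np

def rdf_memoryless_wscs(variances, D):
    """R(D) in bits per source sample for a zero-mean memoryless WSCS
    Gaussian source with the given periodic variance values (Corollary 1)."""
    variances = np.asarray(variances, dtype=float)
    if D >= variances.mean():          # above the average variance: R(D) = 0
        return 0.0
    lo, hi = 0.0, variances.max()      # theta lies in this interval
    for _ in range(200):               # bisection on the waterfilling level
        theta = 0.5 * (lo + hi)
        if np.minimum(variances, theta).mean() < D:
            lo = theta
        else:
            hi = theta
    D_m = np.minimum(variances, theta)
    return 0.5 * np.mean(np.log2(variances / D_m))

# Example: the synchronously sampled variances {2, 2.433, 1.567} from Section 2.2.
print(rdf_memoryless_wscs([2.0, 2.433, 1.567], D=0.18))
```

Note that in the low distortion regime $D < \min_m \sigma_m^2$, the bisection simply returns $\theta = D$, consistent with Remark 2 below.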

3. Problem Formulation and Auxiliary Results

Our objective is to characterize the RDF for compression of asynchronously sampled CT WSCS Gaussian sources when the sampling interval is larger than the memory of the source. In particular, we focus on the minimal rate required to achieve a high-fidelity reproduction, representing the RDF curve for distortion values not larger than the variance of the source. Such a characterization of the RDF for asynchronous sampling is essential for comprehending the relationship between the minimal required number of bits and the sampling rate at a given distortion. Our analysis constitutes an important step towards constructing joint source-channel coding schemes for scenarios in which the symbol rate of the transmitter is not necessarily synchronized with the sampling rate of the source to be transmitted. Such scenarios arise, for example, when recording a communications signal for storage or processing, or in compress-and-forward relaying (([26] Chapter 16.7), [27]), in which the relay compresses the sampled received signal, which is then forwarded to the assisted receiver. As the relay operates with its own sampling clock, which need not be synchronized with the symbol rate of the assisted transmitter, sampling at the relay may result in a DT WSACS source signal. In the following, we first characterize the sampled source model in Section 3.1. Then, as a preliminary step towards our characterization of the RDF for asynchronously sampled CT WSCS Gaussian processes stated in Section 4, we recall in Section 3.2 the definitions of some information-spectrum quantities used in this study. Finally, in Section 3.3, we recall an auxiliary result relating the information-spectrum quantities of a collection of sequences of RVs to those of its limit sequence of RVs. This result will be applied in the derivation of the RDF with asynchronous sampling.

3.1. Source Model

Consider a real CT, zero-mean WSCS Gaussian random process $S_c(t)$ with period $T_{\mathrm{ps}}$. Let the variance function of $S_c(t)$ be defined as $\sigma_{S_c}^2(t) \triangleq \mathbb{E}\{S_c^2(t)\}$, and assume it is both upper bounded and lower bounded away from zero, and that it is continuous in $t \in \mathbb{R}$. Let $\tau_m > 0$ denote the maximal correlation length of $S_c(t)$, i.e., $r_{S_c}(t, \tau) \triangleq \mathbb{E}\{S_c(t) S_c(t - \tau)\} = 0$ for all $|\tau| > \tau_m$. By the cyclostationarity of $S_c(t)$, we have that $\sigma_{S_c}^2(t) = \sigma_{S_c}^2(t + T_{\mathrm{ps}})$ for all $t \in \mathbb{R}$. Let $S_c(t)$ be sampled uniformly with the sampling interval $T_s > 0$ such that $T_{\mathrm{ps}} = (p + \epsilon) \cdot T_s$ for $p \in \mathbb{N}$ and $\epsilon \in [0, 1)$, yielding $S_\epsilon[i] \triangleq S_c(i \cdot T_s)$, where $i \in \mathbb{Z}$. The variance of $S_\epsilon[i]$ is given by $\sigma_{S_\epsilon}^2[i] \triangleq r_{S_\epsilon}[i, 0] = \sigma_{S_c}^2\left( i \cdot \frac{T_{\mathrm{ps}}}{p + \epsilon} \right)$.
In this work, as in [17], we assume that the duration of temporal correlation of the CT signal is shorter than the sampling interval $T_s$, namely, $\tau_m < T_s$. Consequently, the DT Gaussian process $S_\epsilon[i]$ is a memoryless zero-mean Gaussian process and its autocorrelation function is given by:
$$r_{S_\epsilon}[i, \Delta] = \mathbb{E}\left\{ S_\epsilon[i] S_\epsilon[i + \Delta] \right\} = \mathbb{E}\left\{ S_c\left( i \cdot \tfrac{T_{\mathrm{ps}}}{p + \epsilon} \right) \cdot S_c\left( (i + \Delta) \cdot \tfrac{T_{\mathrm{ps}}}{p + \epsilon} \right) \right\} = \sigma_{S_c}^2\left( i \cdot \tfrac{T_{\mathrm{ps}}}{p + \epsilon} \right) \cdot \delta[\Delta] = \sigma_{S_\epsilon}^2[i] \cdot \delta[\Delta]. \quad (10)$$
While we do not explicitly account for sampling time offsets in our definition of the sampled process $S_\epsilon[i]$, an offset can be incorporated by replacing $\sigma_{S_c}^2(t)$ with a time-shifted version, i.e., $\sigma_{S_c}^2(t - \phi)$, see also ([17] Section II.C).
It can be noted from (10) that if $\epsilon$ is a rational number, i.e., there exist $u, v \in \mathbb{N}$, $u$ and $v$ relatively prime, such that $\epsilon = \frac{u}{v}$, then $\{S_\epsilon[i]\}_{i \in \mathbb{Z}}$ is a DT memoryless WSCS process with period $p_{u,v} = p \cdot v + u \in \mathbb{N}$ ([17] Section II.C). For this class of processes, the RDF can be obtained from ([12] Theorem 1), as stated in Corollary 1. On the other hand, if $\epsilon$ is an irrational number, then sampling is asynchronous and yields a WSACS process, whose RDF has not been characterized to date.
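As a small illustration (ours, under the assumptions of this subsection), the DT period under synchronous sampling follows directly from the rational mismatch $\epsilon = u/v$:

```python
# Hedged sketch (ours): for a rational mismatch eps = u/v in lowest terms,
# the sampled process is WSCS with DT period p_{u,v} = p*v + u; an irrational
# eps yields a WSACS process instead.
from fractions import Fraction

def dt_period(p, eps):
    """DT period p_{u,v} for rational eps = u/v in lowest terms."""
    f = Fraction(eps)
    return p * f.denominator + f.numerator

print(dt_period(2, Fraction(1, 2)))  # T_ps = 2.5*T_s  ->  p_{1,2} = 5
print(dt_period(2, Fraction(0)))     # T_ps = 2*T_s    ->  p_{0,1} = 2
```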

3.2. Definitions of Relevant Information-Spectrum Quantities

Conventional information-theoretic tools for characterizing RDFs rely on an underlying ergodicity of the source. Consequently, these techniques cannot be applied to characterize the RDF of asynchronously sampled WSCS processes. To tackle this challenge, we use the information-spectrum framework [14], which can be utilized to obtain general formulas for rate limits for any arbitrary class of processes. The resulting expressions are not restricted to specific statistical models of the considered processes and, in particular, do not require information stability or stationarity. In the following, we recall the definitions of several information-spectrum quantities used in this study, see also ([14] Definitions 1.3.1 and 1.3.2):
Definition 8.
The limit-inferior in probability of a sequence of real RVs $\{Z_k\}_{k \in \mathbb{N}}$ is defined as
$$\operatorname{p-}\liminf_{k \to \infty} Z_k \triangleq \sup\left\{ \alpha \in \mathbb{R} \,\middle|\, \lim_{k \to \infty} \Pr(Z_k < \alpha) = 0 \right\} \triangleq \alpha_0. \quad (11)$$
Hence, $\alpha_0$ is the largest real number such that for every $\tilde{\alpha} < \alpha_0$ and every $\mu > 0$ there exists $k_0(\mu, \tilde{\alpha}) \in \mathbb{N}$ for which $\Pr(Z_k < \tilde{\alpha}) < \mu$ for all $k > k_0(\mu, \tilde{\alpha})$.
Definition 9.
The limit-superior in probability of a sequence of real RVs $\{Z_k\}_{k \in \mathbb{N}}$ is defined as
$$\operatorname{p-}\limsup_{k \to \infty} Z_k \triangleq \inf\left\{ \beta \in \mathbb{R} \,\middle|\, \lim_{k \to \infty} \Pr(Z_k > \beta) = 0 \right\} \triangleq \beta_0. \quad (12)$$
Hence, $\beta_0$ is the smallest real number such that for every $\tilde{\beta} > \beta_0$ and every $\mu > 0$ there exists $k_0(\mu, \tilde{\beta}) \in \mathbb{N}$ for which $\Pr(Z_k > \tilde{\beta}) < \mu$ for all $k > k_0(\mu, \tilde{\beta})$.
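As a concrete illustration of Definitions 8 and 9, the following toy sketch (our own, not from the paper; the sequence choice and sample sizes are arbitrary) numerically estimates $\Pr(Z_k > \beta)$ for $Z_k$ given by the sample mean of $k$ IID standard Gaussians, for which the p-liminf and p-limsup both equal 0:

```python
# Toy illustration (ours) of Definitions 8 and 9: let Z_k be the sample mean
# of k IID N(0,1) RVs, so Z_k ~ N(0, 1/k). Then Pr(Z_k > beta) -> 0 for every
# beta > 0 and Pr(Z_k < alpha) -> 0 for every alpha < 0, hence
# p-liminf Z_k = p-limsup Z_k = 0.
import numpy as np

rng = np.random.default_rng(seed=0)

def prob_exceeds(k, beta, trials=200_000):
    Z_k = rng.normal(0.0, 1.0 / np.sqrt(k), size=trials)  # Z_k ~ N(0, 1/k)
    return np.mean(Z_k > beta)

for k in (10, 100, 1000):
    print(k, prob_exceeds(k, beta=0.1))  # decays toward 0 as k grows
```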
The notion of uniform integrability of a sequence of RVs is a basic property in probability ([28] Chapter 12), which is not directly related to information spectrum methods. However, since it plays an important role in the information spectrum characterization of RDFs, we include its statement in the following definition:
Definition 10
(Uniform integrability ([28] Definition 12.1), ([14] Equation (5.3.2))). The sequence of real-valued random variables $\{Z_k\}_{k=1}^{\infty}$ is said to be uniformly integrable if
$$\lim_{u \to \infty} \sup_{k \ge 1} \int_{z : |z| > u} p_{Z_k}(z) \, |z| \, dz = 0. \quad (13)$$
The aforementioned quantities facilitate characterizing the RDF of arbitrary sources. Consider a general source process $\{S[i]\}_{i=1}^{\infty}$ (stationary or non-stationary) taking values from the source alphabet $\mathcal{S}$ and a reproduction process $\{\hat{S}[i]\}_{i=1}^{\infty}$ with values from the reproduction alphabet $\hat{\mathcal{S}}$. It follows from ([14] Section 5.5) that for a distortion measure which satisfies the uniform integrability criterion, i.e., for which there exists a deterministic sequence $\{r[i]\}_{i=1}^{\infty}$ such that the sequence of RVs $\left\{ d\left(S^{(k)}, r^{(k)}\right) \right\}_{k=1}^{\infty}$ satisfies Definition 10 ([14] Page 336), the RDF is expressed as ([14] Equation (5.4.2)):
$$R(D) = \inf_{F_{S,\hat{S}} \,:\, \bar{d}_S\left(S^{(k)}, \hat{S}^{(k)}\right) \le D} \bar{I}\left(S^{(k)}; \hat{S}^{(k)}\right), \quad (14)$$
where $\bar{d}_S\left(S^{(k)}, \hat{S}^{(k)}\right) = \limsup_{k \to \infty} \mathbb{E}\left\{ d\left(S^{(k)}, \hat{S}^{(k)}\right) \right\}$, $F_{S,\hat{S}}$ denotes the joint CDF of $\{S[i]\}_{i=1}^{\infty}$ and $\{\hat{S}[i]\}_{i=1}^{\infty}$, and $\bar{I}\left(S^{(k)}; \hat{S}^{(k)}\right)$ represents the limit superior in probability of the mutual information rate of $S^{(k)}$ and $\hat{S}^{(k)}$, given by:
$$\bar{I}\left(S^{(k)}; \hat{S}^{(k)}\right) \triangleq \operatorname{p-}\limsup_{k \to \infty} \frac{1}{k} \log \frac{p_{S^{(k)} | \hat{S}^{(k)}}\left(S^{(k)} \,\middle|\, \hat{S}^{(k)}\right)}{p_{S^{(k)}}\left(S^{(k)}\right)}. \quad (15)$$
In order to use the RDF characterization in (14), the distortion measure must satisfy the uniform integrability criterion. For the considered class of sources detailed in Section 3.1, the MSE distortion satisfies this criterion, as stated in the following lemma:
Lemma 2.
For any real memoryless zero-mean Gaussian source $\{S[i]\}_{i=1}^{\infty}$ with bounded variance, i.e., there exists $\sigma_{\max}^2 < \infty$ such that $\mathbb{E}\{S^2[i]\} \le \sigma_{\max}^2$ for all $i \in \mathbb{N}$, the MSE distortion satisfies the uniform integrability criterion.
Proof. 
Set the deterministic sequence $\{r[i]\}_{i=1}^{\infty}$ to be the all-zero sequence. Under this setting and the MSE distortion, it holds that $d\left(S^{(k)}, r^{(k)}\right) = \frac{1}{k} \sum_{i=1}^{k} S^2[i]$. To prove the lemma, we show that the sequence of RVs $\left\{ d\left(S^{(k)}, r^{(k)}\right) \right\}_{k=1}^{\infty}$ has a bounded $\mathcal{L}_2$ norm, which implies that it is uniformly integrable by ([28] Corollary 12.8). The $\mathcal{L}_2$ norm of $d\left(S^{(k)}, r^{(k)}\right)$ satisfies
$$\mathbb{E}\left\{ d\left(S^{(k)}, r^{(k)}\right)^2 \right\} = \frac{1}{k^2} \mathbb{E}\left\{ \sum_{i=1}^{k} S^2[i] \sum_{j=1}^{k} S^2[j] \right\} = \frac{1}{k^2} \sum_{i=1}^{k} \sum_{j=1}^{k} \mathbb{E}\left\{ S^2[i] S^2[j] \right\} \stackrel{(a)}{\le} \frac{1}{k^2} \sum_{i=1}^{k} \sum_{j=1}^{k} 3 \sigma_{\max}^4 = 3 \sigma_{\max}^4, \quad (16)$$
where $(a)$ follows since $\mathbb{E}\{S^2[i] S^2[j]\} = \mathbb{E}\{S^2[i]\} \mathbb{E}\{S^2[j]\} \le \sigma_{\max}^4$ for $i \ne j$, while $\mathbb{E}\{S^4[i]\} \le 3 \sigma_{\max}^4$ ([29] Chapter 5.4). Equation (16) proves that $d\left(S^{(k)}, r^{(k)}\right)$ is $\mathcal{L}_2$-bounded by $3 \sigma_{\max}^4 < \infty$ for all $k \in \mathbb{N}$, which in turn implies that the MSE distortion is uniformly integrable for the source $\{S[i]\}_{i=1}^{\infty}$. □
Since, as detailed in Section 3.1, we focus in the following on memoryless zero-mean Gaussian sources, Lemma 2 implies that the RDF of the source can be characterized using (14). However, (14) is in general difficult to evaluate, and thus does not lead to a meaningful understanding of how the RDF of sampled WSCS sources behaves, motivating our analysis in Section 4.

3.3. Information Spectrum Limits

The following theorem, originally stated in ([17] Theorem 1), presents a fundamental result which is directly useful for the derivation of the RDF:
Theorem 3.
([17] Theorem 1) Let $\{\tilde{Z}_{k,n}\}_{n,k \in \mathbb{N}}$ be a set of sequences of real scalar RVs satisfying two assumptions:
AS1 
For every fixed $n \in \mathbb{N}$, every convergent subsequence of $\{\tilde{Z}_{k,n}\}_{k \in \mathbb{N}}$ converges in distribution, as $k \to \infty$, to a finite deterministic scalar. Each subsequence may converge to a different scalar.
AS2 
For every fixed $k \in \mathbb{N}$, the sequence $\{\tilde{Z}_{k,n}\}_{n \in \mathbb{N}}$ converges uniformly in distribution, as $n \to \infty$, to a scalar real-valued RV $Z_k$. Specifically, letting $\tilde{F}_{k,n}(\alpha)$ and $F_k(\alpha)$, $\alpha \in \mathbb{R}$, denote the CDFs of $\tilde{Z}_{k,n}$ and of $Z_k$, respectively, it follows from AS2 that for every $\eta > 0$ there exists $n_0(\eta)$ such that for every $n > n_0(\eta)$,
$$\left| \tilde{F}_{k,n}(\alpha) - F_k(\alpha) \right| < \eta,$$
for each $\alpha \in \mathbb{R}$ and $k \in \mathbb{N}$.
Then, for $\{\tilde{Z}_{k,n}\}_{n,k \in \mathbb{N}}$ it holds that
$$\operatorname{p-}\liminf_{k \to \infty} Z_k = \lim_{n \to \infty} \operatorname{p-}\liminf_{k \to \infty} \tilde{Z}_{k,n}, \quad (17a)$$
$$\operatorname{p-}\limsup_{k \to \infty} Z_k = \lim_{n \to \infty} \operatorname{p-}\limsup_{k \to \infty} \tilde{Z}_{k,n}. \quad (17b)$$
Proof. 
In Appendix C we explicitly prove Equation (17b). This complements the proof in ([17] Appendix A) which explicitly considers only (17a). □

4. Rate-Distortion Characterization for Sampled CT WSCS Gaussian Sources

4.1. Main Result

Using the information-spectrum based characterization of the RDF (14), combined with the characterization of the limit of a sequence of information-spectrum quantities in Theorem 3, we now analyze the RDF of asynchronously sampled WSCS processes. Our analysis is based on constructing a sequence of synchronously sampled WSCS processes, whose RDF is given in Corollary 1. Then, we show that the RDF of the asynchronously sampled process can be obtained as the limit superior of the computable RDFs of the sequence of synchronously sampled processes. We begin by letting $\epsilon_n \triangleq \frac{\lfloor n \cdot \epsilon \rfloor}{n}$ for $n \in \mathbb{N}$ and defining a Gaussian source process $S_n[i] = S_c\left( i \cdot \frac{T_{\mathrm{ps}}}{p + \epsilon_n} \right)$. From the discussion in Section 3.1 (see also ([17] Section II.C)), it follows that since $\epsilon_n$ is rational, $S_n[i]$ is a WSCS process and its period is given by $p_n = p \cdot n + \lfloor n \cdot \epsilon \rfloor$. Accordingly, the periodic correlation function of $S_n[i]$ can be obtained similarly to (10) as:
$$r_{S_n}[i, \Delta] = \mathbb{E}\left\{ S_n[i] S_n[i + \Delta] \right\} = \sigma_{S_c}^2\left( i \cdot \frac{T_{\mathrm{ps}}}{p + \epsilon_n} \right) \cdot \delta[\Delta]. \quad (18)$$
Due to the cyclostationarity of $S_n[i]$, we have that $r_{S_n}[i, \Delta] = r_{S_n}[i + p_n, \Delta]$ for all $i, \Delta \in \mathbb{Z}$, and we let $\sigma_{S_n}^2[i] \triangleq r_{S_n}[i, 0]$ denote its periodic variance.
We next restate Corollary 1 in terms of ϵ n as follows:
Proposition 1.
Consider a DT, memoryless, zero-mean WSCS Gaussian random process $S_n[i]$ with variance $\sigma_{S_n}^2[i]$, obtained from $S_c(t)$ by sampling with a sampling interval of $T_s(n) = \frac{T_{\mathrm{ps}}}{p + \epsilon_n}$. Let $\mathbf{S}_n^{(p_n)}[i]$ denote the memoryless stationary multivariate random process obtained by applying the DCD to $S_n[i]$, and let $\sigma_{S_n}^2[m]$, $m = 1, 2, \ldots, p_n$, denote the variance of the $m$th component of $\mathbf{S}_n^{(p_n)}[i]$. The rate-distortion function is given by:
$$R_n(D) = \begin{cases} \frac{1}{2 p_n} \sum_{m=1}^{p_n} \log\left( \frac{\sigma_{S_n}^2[m]}{D_n[m]} \right) & D \le \frac{1}{p_n} \sum_{m=1}^{p_n} \sigma_{S_n}^2[m] \\ 0 & D > \frac{1}{p_n} \sum_{m=1}^{p_n} \sigma_{S_n}^2[m] \end{cases}, \quad (19a)$$
where for $D \le \frac{1}{p_n} \sum_{m=1}^{p_n} \sigma_{S_n}^2[m]$ we let $D_n[m] \triangleq \min\{\sigma_{S_n}^2[m], \theta_n\}$, and $\theta_n$ is selected such that
$$D = \frac{1}{p_n} \sum_{m=1}^{p_n} D_n[m]. \quad (19b)$$
We recall that the RDF of $S_n[i]$ is characterized in Proposition 1 via the RDF of the multivariate stationary process $\mathbf{S}_n^{(p_n)}[i]$, obtained by applying the $p_n$-dimensional DCD to $S_n[i]$. Next, we recall that the relationship between the source process $\mathbf{S}_n^{(p_n)}[i]$ and the optimal reconstruction process, denoted by $\hat{\mathbf{S}}_n^{(p_n)}[i]$, is characterized in ([5] Chapter 10.3.3) via a linear, multivariate, time-invariant backward channel with a $p_n \times 1$ additive vector noise process $\mathbf{W}_n^{(p_n)}[i]$, and is given by:
$$\mathbf{S}_n^{(p_n)}[i] = \hat{\mathbf{S}}_n^{(p_n)}[i] + \mathbf{W}_n^{(p_n)}[i], \quad i \in \mathbb{N}. \quad (20)$$
It also follows from ([5] Section 10.3.3) that for the IID Gaussian multivariate process whose entries are independent and distributed via $\left( \mathbf{S}_n^{(p_n)}[i] \right)_m \sim \mathcal{N}\left(0, \sigma_{S_n}^2[m]\right)$, $m \in \{1, 2, \ldots, p_n\}$, the optimal reconstruction vector process $\hat{\mathbf{S}}_n^{(p_n)}[i]$ and the corresponding noise vector process $\mathbf{W}_n^{(p_n)}[i]$ each follow a multivariate Gaussian distribution:
$$\hat{\mathbf{S}}_n^{(p_n)}[i] \sim \mathcal{N}\left( \mathbf{0}, \operatorname{diag}\left( \sigma_{\hat{S}_n}^2[1], \ldots, \sigma_{\hat{S}_n}^2[p_n] \right) \right) \quad \text{and} \quad \mathbf{W}_n^{(p_n)}[i] \sim \mathcal{N}\left( \mathbf{0}, \operatorname{diag}\left( D_n[1], \ldots, D_n[p_n] \right) \right),$$
where $D_n[m] \triangleq \min\{\sigma_{S_n}^2[m], \theta_n\}$; here $\theta_n$ denotes the reverse waterfilling threshold defined in Proposition 1 for the index $n$, and is selected such that $D = \frac{1}{p_n} \sum_{m=1}^{p_n} D_n[m]$. The optimal reconstruction process $\hat{\mathbf{S}}_n^{(p_n)}[i]$ and the noise process $\mathbf{W}_n^{(p_n)}[i]$ are mutually independent, and for each $m \in \{1, 2, \ldots, p_n\}$ it holds that $\mathbb{E}\left\{ \left( \mathbf{S}_n^{(p_n)}[i] - \hat{\mathbf{S}}_n^{(p_n)}[i] \right)_m^2 \right\} = D_n[m]$, see ([5] Chapters 10.3.2 and 10.3.3). The multivariate relationship between stationary processes in (20) can be transformed into an equivalent linear relationship between cyclostationary Gaussian memoryless processes via the inverse DCD transformation ([2] Section 17.2) applied to each of the processes, resulting in:
$$S_n[i] = \hat{S}_n[i] + W_n[i], \quad i \in \mathbb{N}. \quad (21)$$
We are now ready to state our main result: the RDF of asynchronously sampled DT sources $S_\epsilon[i]$, $\epsilon \notin \mathbb{Q}$, in the low MSE regime, i.e., when the distortion $D$ is smaller than the minimal variance of the source. The RDF is stated in the following theorem, which applies both to synchronous sampling and to asynchronous sampling:
Theorem 4.
Consider a DT source $\{S_\epsilon[i]\}_{i=1}^{\infty}$ obtained by sampling a CT WSCS source whose period of statistics is $T_{\mathrm{ps}}$, at intervals $T_s$. Then, for any distortion constraint $D$ such that $D < \min_{0 \le t \le T_{\mathrm{ps}}} \sigma_{S_c}^2(t)$ and any $\epsilon \in [0, 1)$, the RDF $R_\epsilon(D)$ for compressing $\{S_\epsilon[i]\}_{i=1}^{\infty}$ can be obtained as the limit:
$$R_\epsilon(D) = \limsup_{n \to \infty} R_n(D), \quad (22)$$
where $R_n(D)$ is defined in Proposition 1.
Proof. 
The detailed proof is provided in Appendix D. Here, we give a brief outline: The derivation of the RDF with asynchronous sampling follows three steps: First, we define a sequence of rational numbers $\epsilon_n$ such that $\epsilon_n \to \epsilon$ as $n \to \infty$, and note that sampling at intervals $T_s(n) = \frac{T_{\mathrm{ps}}}{p + \epsilon_n}$ results in a sequence of DT WSCS sources $\{S_n[i]\}_{i \in \mathbb{N}}$, $n \in \mathbb{N}$, whose sampling interval $T_s(n)$ asymptotically approaches, as $n \to \infty$, the sampling interval $T_s = \frac{T_{\mathrm{ps}}}{p + \epsilon}$ corresponding to irrational $\epsilon$. Building upon this insight, we prove that the RDF for sampling with interval $T_s$ can be stated as a double limit, where the outer limit is with respect to the blocklength and the inner limit is with respect to $\epsilon_n$. Lastly, we use Theorem 3 to show that the order of the limits can be exchanged, obtaining a limit of computable expressions. □
Remark 2.
Theorem 4 focuses on the low distortion regime, defined as the values of $D$ satisfying $D < \min_{0 \le t \le T_{\mathrm{ps}}} \sigma_{S_c}^2(t)$. This implies that $\theta_n$ has to be smaller than $\min_{0 \le t \le T_{\mathrm{ps}}} \sigma_{S_c}^2(t)$; hence, from Proposition 1 it follows that for the corresponding stationary noise vector $\mathbf{W}_n^{(p_n)}[i]$ in (20), $D_n[m] = \min\{\sigma_{S_n}^2[m], \theta_n\} = \theta_n$ and $D = \frac{1}{p_n} \sum_{m=1}^{p_n} D_n[m] = \theta_n = D_n[m]$. We note that since every element $\left( \mathbf{W}_n^{(p_n)}[i] \right)_m$ has the same variance $D_n[m] = D$ for all $n \in \mathbb{N}$ and $m = 1, 2, \ldots, p_n$, then by applying the inverse DCD to $\mathbf{W}_n^{(p_n)}[i]$, the resulting scalar DT process $W_n[i]$ is wide-sense stationary, and in fact IID with $\mathbb{E}\{W_n[i]^2\} = D$.
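The following self-contained sketch (ours, not code from the paper) numerically approximates the limit superior in (22): it evaluates $R_n(D)$ of Proposition 1 for the rational approximations $\epsilon_n = \lfloor n \epsilon \rfloor / n$ and scans a tail of large $n$. The sinusoidal variance profile is the running example of Section 2.2, and the cutoff $n \le 500$ is an arbitrary numerical choice.

```python
# Hedged numerical sketch (ours) of Theorem 4: approximate R_eps(D) by
# evaluating R_n(D) from Proposition 1 for eps_n = floor(n*eps)/n and taking
# the supremum over a tail of large n.
import math
import numpy as np

T_ps = 1.0
def var_ct(t):
    return 0.5 * np.sin(2 * np.pi * t / T_ps) + 2  # CT periodic variance

def R_n(D, p, eps, n):
    """R_n(D) of Proposition 1 for the synchronous mismatch eps_n = floor(n*eps)/n."""
    u = math.floor(n * eps)
    p_n = p * n + u                      # DT period for eps_n
    T_s = T_ps / (p + u / n)             # sampling interval T_s(n)
    variances = var_ct(np.arange(p_n) * T_s)
    if D >= variances.mean():            # above the average variance: R_n(D) = 0
        return 0.0
    lo, hi = 0.0, variances.max()        # bisection for the waterfilling level
    for _ in range(200):
        theta = 0.5 * (lo + hi)
        if np.minimum(variances, theta).mean() < D:
            lo = theta
        else:
            hi = theta
    return 0.5 * np.mean(np.log2(variances / np.minimum(variances, theta)))

# R_eps(D) = limsup_n R_n(D); numerically, scan a tail of large n.
D, p, eps = 0.18, 2, math.pi / 7
rates = [R_n(D, p, eps, n) for n in range(200, 501)]
print(max(rates))  # proxy for the limit superior in (22)
```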

4.2. Discussion and Relationship with the Capacity Derivation in [17]

Theorem 4 provides a meaningful and computable characterization of the RDF of sampled WSCS signals. We note that the proof of the main theorem uses some of the steps from our recent study on the capacity of memoryless channels with sampled CT WSCS Gaussian noise [17]. It should be emphasized, however, that there are several fundamental differences between the two studies, which require the introduction of new treatments and derivations original to the current work. First, it is important to note that in the study on capacity, a physical channel model exists, and therefore the conditional PDF of the output signal given the input signal can be characterized explicitly for both synchronous and asynchronous sampling, for every input distribution. For the current study of the RDF, we note that the relationship (21), commonly referred to as the backward channel [30], ([5] Chapter 10.3.2), characterizes the relationship between the source process and the optimal reproduction process, and hence is valid only for synchronous sampling and for the optimal reproduction process. Consequently, in the RDF analysis the limiting relationship (21) as $n \to \infty$ is not even known to exist and, in fact, we can show it exists under a rather strict condition on the distortion (namely, the condition $D < \min_{0 \le t \le T_{\mathrm{ps}}} \sigma_{S_c}^2(t)$ stated in Theorem 4). In particular, to prove the statement in Theorem 4, we had to show that from the backward channel (21) we can define an asymptotic relationship, as $n \to \infty$, which corresponds to the asynchronously sampled source process, denoted by $S_\epsilon[i]$, and relates $S_\epsilon[i]$ with its optimal reconstruction process $\hat{S}_\epsilon[i]$. This is done by showing that the PDFs of the reproduction process $\hat{S}_n[i]$ and of the noise process $W_n[i]$ from (21) each converge uniformly, as $n \to \infty$, to a respective limiting PDF, which has to be defined as well. This enabled us to relate the RDFs for the synchronous sampling and the asynchronous sampling cases using Theorem 3, eventually leading to (22). Accordingly, in our detailed proof of Theorem 4 given in Appendix D, Lemmas A6 and A8, as well as a significant part of Lemma A4, are largely new, addressing the special aspects of the proof arising from the fundamental differences between the current setup and the setup in [17], while the derivations of Lemmas A3 and A7 follow similarly to ([17] Lemma B.1) and ([17] Lemma B.5), respectively, and parts of Lemma A4 coincide with ([17] Lemma B.2).

5. Numerical Examples

In this section we demonstrate the insights arising from our RDF characterization via numerical examples. Recalling that Theorem 4 states the RDF of an asynchronously sampled CT WSCS Gaussian process, $R_\epsilon(D)$, as the limit superior of a sequence of RDFs $\{R_n(D)\}_{n \in \mathbb{N}}$ corresponding to DT memoryless WSCS Gaussian source processes, we first consider the convergence of $\{R_n(D)\}_{n \in \mathbb{N}}$ in Section 5.1. Next, in Section 5.2 we study the variation of the RDF of the sampled CT process due to changes in the sampling rate and in the sampling time offset.
Similarly to ([17] Section IV), define a periodic continuous pulse function, denoted by $\Pi_{t_{\mathrm{dc}}, t_{\mathrm{rf}}}(t)$, with equal rise/fall time $t_{\mathrm{rf}} = 0.01$, duty cycle $t_{\mathrm{dc}} \in [0, 0.98]$, and period 1, i.e., $\Pi_{t_{\mathrm{dc}}, t_{\mathrm{rf}}}(t + 1) = \Pi_{t_{\mathrm{dc}}, t_{\mathrm{rf}}}(t)$ for all $t \in \mathbb{R}$. Specifically, for $t \in [0, 1)$ the function $\Pi_{t_{\mathrm{dc}}, t_{\mathrm{rf}}}(t)$ is given by
$$\Pi_{t_{\mathrm{dc}}, t_{\mathrm{rf}}}(t) = \begin{cases} \frac{t}{t_{\mathrm{rf}}} & t \in [0, t_{\mathrm{rf}}] \\ 1 & t \in (t_{\mathrm{rf}}, t_{\mathrm{dc}} + t_{\mathrm{rf}}) \\ 1 - \frac{t - t_{\mathrm{dc}} - t_{\mathrm{rf}}}{t_{\mathrm{rf}}} & t \in [t_{\mathrm{dc}} + t_{\mathrm{rf}}, t_{\mathrm{dc}} + 2 t_{\mathrm{rf}}] \\ 0 & t \in (t_{\mathrm{dc}} + 2 t_{\mathrm{rf}}, 1) \end{cases}. \quad (23)$$
In the following, we model the time-varying variance of the WSCS source, $\sigma_{S_c}^2(t)$, as a linear periodic function of $\Pi_{t_{\mathrm{dc}}, t_{\mathrm{rf}}}(t)$. To that aim, we define a time offset between the first sample and the rise start time of the periodic continuous pulse function; we denote this offset, normalized to the period $T_{\mathrm{ps}}$, by $\phi \in [0, 1)$. The variance of $S_c(t)$ is a periodic function with period $T_{\mathrm{ps}}$, defined as
$$\sigma_{S_c}^2(t) = 0.2 + 4.8 \cdot \Pi_{t_{\mathrm{dc}}, t_{\mathrm{rf}}}\left( \frac{t}{T_{\mathrm{ps}}} - \phi \right), \quad t \in [0, T_{\mathrm{ps}}), \quad (24)$$
where $T_{\mathrm{ps}} = 5$ μs.
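The following sketch (ours; the function names and the sampling grid are illustrative choices) implements the pulse (23) and the variance profile (24) used throughout this section:

```python
# A sketch (ours) of the evaluation setup: the rise/fall pulse (23) and the
# periodic CT variance profile (24) used in the numerical study.
import numpy as np

def pulse(t, t_dc, t_rf=0.01):
    """Periodic pulse Pi_{t_dc,t_rf}(t) with period 1, as in (23)."""
    t = np.mod(t, 1.0)                       # periodic extension
    return np.select(
        [t <= t_rf,                          # rising edge
         t < t_dc + t_rf,                    # flat top
         t <= t_dc + 2 * t_rf],              # falling edge
        [t / t_rf,
         1.0,
         1.0 - (t - t_dc - t_rf) / t_rf],
        default=0.0)                         # off interval

T_ps = 5e-6  # 5 microseconds

def var_ct(t, t_dc, phi=0.0):
    """CT variance sigma_{S_c}^2(t) from (24)."""
    return 0.2 + 4.8 * pulse(t / T_ps - phi, t_dc)

# Quick check: the variance swings between 0.2 and 5.0 over one period.
t = np.linspace(0, T_ps, 11, endpoint=False)
print(np.round(var_ct(t, t_dc=0.45), 3))
```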

5.1. Convergence of $R_n(D)$ in $n$

From Theorem 4 it follows that if the distortion satisfies $D < \min_{0 \le t \le T_{\mathrm{ps}}} \sigma_{S_c}^2(t)$, the RDF of the asynchronously sampled CT WSCS Gaussian process is given by the limit superior of the sequence $\{R_n(D)\}_{n \in \mathbb{N}}$, where $R_n(D)$ is defined in Proposition 1. In this subsection, we study the sequence of RDFs $\{R_n(D)\}_{n \in \mathbb{N}}$ as $n$ increases. For this evaluation setup, we fixed the distortion constraint at $D = 0.18$ and set $\epsilon = \frac{\pi}{7}$ and $p = 2$. The variance of the CT WSCS Gaussian source process, $\sigma_{S_c}^2(t)$, is modelled by Equation (24) for two sampling time offsets $\phi \in \{0, \frac{1}{16}\}$. For each offset $\phi$, four duty cycle values were considered: $t_{\mathrm{dc}} \in \{20, 45, 75, 98\}\%$. For each $n$ we obtain the synchronous sampling mismatch $\epsilon_n \triangleq \frac{\lfloor n \cdot \epsilon \rfloor}{n}$, which approaches $\epsilon$ as $n \to \infty$. Since $\epsilon_n$ is a rational number, corresponding to a sampling period of $T_s(n) = \frac{T_{\mathrm{ps}}}{p + \epsilon_n}$, for each $n$ the resulting DT process is WSCS with period $p_n = p \cdot n + \lfloor n \cdot \epsilon \rfloor$, and its RDF follows from Proposition 1.
Figure 2 and Figure 3 depict $R_n(D)$ for $n \in [1, 500]$ with the specified duty cycles and sampling time offsets, where in Figure 2 there is no sampling time offset, i.e., $\phi = 0$, and in Figure 3 the sampling time offset is $\phi = \frac{1}{16}$. We observe that in both figures the RDF values are higher for higher $t_{\mathrm{dc}}$. This can be explained by noting that for higher $t_{\mathrm{dc}}$ values, the time-averaged variance of the DT source process increases; hence, a higher number of bits per source sample is required to encode the source process while maintaining the same distortion value. Moreover, in all configurations, $R_n(D)$ varies significantly for smaller values of $n$. Comparing Figure 2 and Figure 3, we see that the pattern of these variations depends on the sampling time offset $\phi$. For example, when $t_{\mathrm{dc}} = 45\%$ and $n \in [4, 15]$, then for $\phi = 0$ the RDF varies in the range $[1.032, 1.143]$ bits per source sample, while for $\phi = \frac{1}{16}$ the RDF varies in the range $[1.071, 1.237]$ bits per source sample. However, as $n$ increases above 230, the variations in $R_n(D)$ become smaller and less dependent on the sampling time offset, and the resulting values of $R_n(D)$ are approximately in the same range for each $t_{\mathrm{dc}}$ in both Figure 2 and Figure 3 for $n \ge 230$. This behaviour can be explained by noting that as $n$ varies, the period $p_n$ also varies, and hence the statistics of the DT variance differ over the respective period. This consequently affects the resulting RDF (especially for small periods). As $n$ increases, $\epsilon_n$ approaches the asynchronous sampling mismatch $\epsilon$ and the period $p_n$ becomes sufficiently large such that the samples of the DT variance over a period cover essentially the entire range of values of the CT variance irrespective of the value of $\phi$, leading to a negligible variation in the RDF, as seen in the above figures.

5.2. The Variation of the RDF with the Sampling Rate

Next, we observe the dependence of the RDF of the sampled memoryless WSCS Gaussian process on the value of the sampling interval $T_s$. For this setup, we fix the distortion constraint to $D = 0.18$ and set the duty cycle in the source process (24) to $t_{\mathrm{dc}} \in \{45, 75\}\%$. Figure 4 and Figure 5 depict the numerically evaluated values of $R_n(D)$ at sampling intervals in the range $2 < \frac{T_{\mathrm{ps}}}{T_s} < 4$ with sampling time offsets $\phi = 0$ and $\phi = \frac{1}{16}$, respectively. We note that while the discussion which follows focuses on this range, as it corresponds to relatively low sampling rates, which are typically preferable in practice, the statements and observations regarding the relationship between the denominator of $\frac{T_{\mathrm{ps}}}{T_s}$ and the value of $R_n(D)$, and regarding the continuity of the RDF in the parameter $\frac{T_{\mathrm{ps}}}{T_s}$, are directly applicable to any range of values of $\frac{T_{\mathrm{ps}}}{T_s}$, e.g., when higher sampling rates are preferable. A very important insight which arises from the figures is that the sequence of RDFs $R_n(D)$ is not convergent; hence, for example, one cannot approach the RDF for $\frac{T_{\mathrm{ps}}}{T_s} = 2.5$ by simply taking rational values of $\frac{T_{\mathrm{ps}}}{T_s}$ which approach 2.5. This verifies that the RDF for asynchronous sampling cannot be obtained by a straightforward application of previous results; indeed, the entire analysis carried out in this paper is necessary for the desired characterization.
We observe in Figure 4 and Figure 5 that when $\frac{T_{\mathrm{ps}}}{T_s}$ has a fractional part with a relatively small integer denominator, the variations in the RDF are significant and depend on the sampling time offset. These variations can either degrade the ability to accurately represent the source, corresponding to the observed peaks in Figure 4 and Figure 5, or, alternatively, allow encoding the signal to within the same distortion with smaller code rates, corresponding to the dips in these figures. However, when $\frac{T_{\mathrm{ps}}}{T_s}$ approaches an irrational number, the period of the sampled variance function becomes very long and, consequently, the RDF is approximately constant and independent of the sampling time offset. As an example, consider $\frac{T_{\mathrm{ps}}}{T_s} = 2.5$ and $t_{\mathrm{dc}} = 75\%$: for sampling time offset $\phi = 0$ the RDF takes a value of 1.469 bits per source sample, as shown in Figure 4, while for the offset $\phi = \frac{1}{16}$ the RDF peaks at 1.934 bits per source sample, as can be seen in Figure 5. On the other hand, when approaching asynchronous sampling, the RDF takes a nearly constant value of 1.85 bits per source sample for all the considered values of $\frac{T_{\mathrm{ps}}}{T_s}$, and this value is invariant to the offset $\phi$. This follows since, when the denominator of the fractional part of $\frac{T_{\mathrm{ps}}}{T_s}$ increases, the DT period of the resulting sampled variance, $p_n$, increases and practically captures the entire set of values of the CT variance, regardless of the sampling time offset. In a similar manner to the study on capacity in [17], we conjecture that since asynchronous sampling captures the entire set of values of the CT variance, the respective RDF represents the RDF of the analog source, which does not depend on the specific sampling rate and offset. Figure 4 and Figure 5 demonstrate how slight variations in the sampling rate can result in significant changes in the RDF. For instance, at $\phi = 0$ we observe in Figure 4 that when the sampling rate switches from $T_{\mathrm{ps}} = 2.25 \cdot T_s$ to $T_{\mathrm{ps}} = 2.26 \cdot T_s$, i.e., from synchronous to nearly asynchronous sampling, the RDF changes from 1.624 bits per source sample to 1.859 bits per source sample for $t_{\mathrm{dc}} = 75\%$; similarly, we observe in Figure 5 for $t_{\mathrm{dc}} = 45\%$ that when the sampling rate switches from $T_{\mathrm{ps}} = 2.5 \cdot T_s$ to $T_{\mathrm{ps}} = 2.51 \cdot T_s$, i.e., again from synchronous to nearly asynchronous sampling, the RDF changes from 1.005 bits per source sample to 1.154 bits per source sample.
Lastly, Figure 6 and Figure 7 numerically evaluate the RDF versus the distortion constraint $D \in [0.05, 0.19]$ for sampling time offsets of $0$ and $\frac{1}{16}$, respectively. For each $\phi$, the result is evaluated at three different values of the synchronization mismatch $\epsilon$. For this setup, we fix $t_{\mathrm{dc}} = 75\%$, $p = 2$, and $\epsilon \in \{0.5, \frac{5\pi}{32}, 0.6\}$. The only mismatch value corresponding to asynchronous sampling is $\epsilon = \frac{5\pi}{32}$; its sampling interval is approximately 2.007 μs, a negligible variation from the sampling intervals corresponding to $\epsilon \in \{0.5, 0.6\}$, which are 2.000 μs and 1.923 μs, respectively. Observing both figures, we see that the RDF may vary significantly for a very slight variation in the sampling rate. For instance, as shown in Figure 6 for $\phi = 0$, at $D = 0.18$ a slight change in the synchronization mismatch from $\epsilon = \frac{5\pi}{32}$ (i.e., $T_s \approx 2.007$ μs) to $\epsilon = 0.5$ (i.e., $T_s = 2.000$ μs) results in an approximately 20% decrease in the RDF. For $\phi = \frac{1}{16}$, the same change in the sampling synchronization mismatch at $D = 0.18$ results in an increase in the RDF by roughly 4%. These results demonstrate the unique and counter-intuitive characteristics of the RDF of sampled WSCS signals which arise from our derivation. It is also interesting to examine how the RDF varies with the sampling time offset $\phi$. To that aim, we plot in Figure 8 the RDF versus $\phi$ for the three sampling rates used in Figure 6 and Figure 7 at $D = 0.18$. The points marked on the plot correspond to $\phi = 0$ and $\phi = \frac{1}{16}$, considered in Figure 6 and Figure 7, respectively. We observe that the RDF is indeed periodic in $\phi$. These variations in the RDF occur since, by changing $\phi$, the number of high-variance samples within a period of the variance of the DT process changes, due to the duty cycle of the CT variance. In particular, at $\phi = 0$ the periodic variance of the DT process corresponding to $T_s = 2.000$ μs has the smallest number of high-variance values within a period, and at $\phi = \frac{1}{16}$ the periodic variance of the DT process corresponding to asynchronous sampling has the smallest number of high-variance values within a period. For the asynchronous sampling rate, the sampling time offset does not matter, as in any case (nearly) all values of the CT variance are reflected in the variance of the DT process.

6. Conclusions

In this work, the RDF of a sampled CT WSCS Gaussian source process was characterized for scenarios in which the resulting DT process is memoryless and the distortion is relatively small. This characterization shows the relationship between the sampling rate and the minimal number of bits per source sample required for compression at a given distortion. For cases in which the sampling rate is synchronized with the period of the statistics of the source process, the resulting DT process is WSCS, and the standard information-theoretic framework can be used for deriving its RDF. For asynchronous sampling, information stability does not hold, and hence we resorted to the information-spectrum framework to obtain a characterization. To that aim, we derived a relationship between relevant information-spectrum quantities for uniformly convergent sequences of RVs. This relationship was then applied to characterize the RDF of an asynchronously sampled CT WSCS Gaussian source process as the limit superior of a sequence of RDFs, each corresponding to a synchronous sampling of the CT WSCS Gaussian process. The results were derived in the low distortion regime, i.e., under the condition that the distortion constraint $D$ is less than the minimal variance of the source, and for sampling intervals which are larger than the correlation length of the CT process. Our numerical examples give rise to non-intuitive insights which follow from the derivations. In particular, the numerical evaluation demonstrates that the RDF of a sampled CT WSCS Gaussian source can change dramatically with minor variations in the sampling rate and the sampling time offset. Notably, when the sampling rate switches from synchronous to asynchronous and vice versa, the RDF may change considerably, as the statistical model of the source switches between WSCS and WSACS. The resulting analysis enables setting the sampling system parameters so as to facilitate accurate and efficient source coding of acquired CT signals.

Author Contributions

Conceptualization, E.A., N.S. and R.D.; Methodology, E.A., N.S. and R.D.; Derivation, E.A., N.S. and R.D.; Writing—original draft preparation, E.A., N.S. and R.D.; writing—review and editing, E.A., N.S. and R.D.; Supervision, N.S. and R.D. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the Israel Science Foundation under Grants 1685/16 and 0100101, and by the Israeli Ministry of Economy through the HERON 5G consortium.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A. Proof of Lemma 1

Proof. 
To prove that the minimum achievable rate at a given maximum distortion for a code with arbitrary blocklength can be achieved by considering only codes whose blocklength is an integer multiple of r, we apply the following approach: We first show that every rate-distortion pair achievable when restricted to using source codes whose blocklength is an integer multiple of r is also achievable when using arbitrary blocklengths; we then prove that every achievable rate-distortion pair is also achievable when restricted to using codes whose blocklength is an integer multiple of r. Combining these two assertions proves that the rate-distortion function of the source $\{S[i]\}_{i\in\mathbb{N}}$ can be obtained when restricting the blocklengths to be an integer multiple of r. Consequently, a reproduction signal $\{\hat{S}[i]\}_{i\in\mathbb{N}}$ which achieves the minimal rate for a given D under the restriction to use only blocklengths which are an integer multiple of r is also the reproduction signal achieving the minimal rate without this restriction, and vice versa, thus proving the lemma.
To prove the first assertion, consider a rate-distortion pair $(R, D)$ which is achievable when using codes whose blocklength is an integer multiple of r. It thus follows directly from Definition 6 that for every $\eta > 0$ there exists $b_0\in\mathbb{N}$ such that for all $b > b_0$ there exists a source code $\big(R^{(b\cdot r)}, b\cdot r\big)$ with rate $R^{(b\cdot r)} \le R + \eta$ satisfying $\bar{d}\big(S^{(b\cdot r)}, \hat{S}^{(b\cdot r)}\big) \le D + \frac{\eta}{2}$. We now show that we can construct a code with an arbitrary blocklength $l = b\cdot r + j$, where $0 < j < r$ (i.e., the blocklength l is not an integer multiple of r), satisfying Definition 6 for all $j\in\{1,\ldots,r-1\}$ as follows: Apply the code $\big(R^{(b\cdot r)}, b\cdot r\big)$ to the first $b\cdot r$ samples of $S[i]$ and then concatenate each codeword with j zeros to obtain a source code having codewords of length $b\cdot r + j$. The average distortion (see (2)) of the resulting $\big(R^{(b\cdot r+j)}, b\cdot r+j\big)$ code is given by:

$$\bar{d}\big(S^{(b\cdot r+j)},\hat{S}^{(b\cdot r+j)}\big) = \frac{1}{b\cdot r+j}\left(\sum_{i=1}^{b\cdot r}\mathbb{E}\Big\{\big(S[i]-\hat{S}[i]\big)^2\Big\} + \sum_{i=b\cdot r+1}^{b\cdot r+j}\mathbb{E}\big\{S[i]^2\big\}\right) = \frac{1}{b\cdot r+j}\left(b\cdot r\cdot\bar{d}\big(S^{(b\cdot r)},\hat{S}^{(b\cdot r)}\big) + \sum_{i=1}^{j}\sigma_S^2[i]\right) = \frac{b\cdot r}{b\cdot r+j}\cdot\bar{d}\big(S^{(b\cdot r)},\hat{S}^{(b\cdot r)}\big) + \frac{1}{b\cdot r+j}\sum_{i=1}^{j}\sigma_S^2[i]. \qquad (A1)$$

Thus, there exists $b_1 > b_0$ such that $\frac{1}{b_1\cdot r+j}\sum_{i=1}^{j}\sigma_S^2[i] < \frac{\eta}{2}$; hence, for all $b > b_1$:

$$\bar{d}\big(S^{(b\cdot r+j)},\hat{S}^{(b\cdot r+j)}\big) = \frac{b\cdot r}{b\cdot r+j}\cdot\bar{d}\big(S^{(b\cdot r)},\hat{S}^{(b\cdot r)}\big) + \frac{1}{b\cdot r+j}\sum_{i=1}^{j}\sigma_S^2[i] \le \frac{b\cdot r}{b\cdot r+j}\cdot\bar{d}\big(S^{(b\cdot r)},\hat{S}^{(b\cdot r)}\big) + \frac{\eta}{2} \le \bar{d}\big(S^{(b\cdot r)},\hat{S}^{(b\cdot r)}\big) + \frac{\eta}{2} \le D + \eta. \qquad (A2)$$

The rate $R^{(b\cdot r+j)}$ satisfies:

$$R^{(b\cdot r+j)} = \frac{1}{b\cdot r+j}\cdot\log_2 M = R^{(b\cdot r)}\cdot\frac{b\cdot r}{b\cdot r+j} \le (R+\eta)\cdot\frac{b\cdot r}{b\cdot r+j} \le R+\eta. \qquad (A3)$$
Consequently, any rate-distortion pair achievable with codes whose blocklength is an integer multiple of r can be achieved by codes with arbitrary blocklengths.
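As an illustrative numerical instance of this padding construction (our example, not part of the proof), take $r = 4$, $b = 100$ and $j = 3$; then, by (A1) and (A3),

```latex
\[
  R^{(403)} = R^{(400)}\cdot\frac{400}{403},
  \qquad
  \bar{d}\big(S^{(403)},\hat{S}^{(403)}\big)
    = \frac{400}{403}\,\bar{d}\big(S^{(400)},\hat{S}^{(400)}\big)
      + \frac{1}{403}\sum_{i=1}^{3}\sigma_S^2[i],
\]
```

so both the rate reduction factor and the added distortion term are $O(1/b)$ and vanish as the blocklength grows.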
Next, we prove that any achievable rate-distortion pair $(R, D)$ can be achieved by codes whose blocklength is an integer multiple of r. To that aim, we fix $\eta > 0$. By Definition 6, there exists a code of blocklength l satisfying (3) and (4). To show that $(R, D)$ is achievable using codes whose blocklength is an integer multiple of r, we assume that l is not an integer multiple of r; hence, there exist positive integers b and j, with $j < r$, such that $l = b\cdot r + j$. We denote this code by $\big(R^{(b\cdot r+j)}, b\cdot r+j\big)$. It follows from Definition 6 that $R^{(b\cdot r+j)} \le R + \eta$ and $\bar{d}\big(S^{(b\cdot r+j)}, \hat{S}^{(b\cdot r+j)}\big) \le D + \frac{\eta}{2}$. Next, we construct a code $\big(R^{((b+1)\cdot r)}, (b+1)\cdot r\big)$ with codewords whose length is $(b+1)\cdot r$, i.e., an integer multiple of r, by adding $r - j$ zeros at the end of each codeword of the code $\big(R^{(b\cdot r+j)}, b\cdot r+j\big)$. The average distortion can now be computed as follows:

$$\bar{d}\big(S^{((b+1)\cdot r)},\hat{S}^{((b+1)\cdot r)}\big) = \frac{1}{(b+1)\cdot r}\left(\sum_{i=1}^{b\cdot r+j}\mathbb{E}\Big\{\big(S[i]-\hat{S}[i]\big)^2\Big\} + \sum_{i=b\cdot r+j+1}^{(b+1)\cdot r}\mathbb{E}\big\{S[i]^2\big\}\right) = \frac{b\cdot r+j}{(b+1)\cdot r}\cdot\bar{d}\big(S^{(b\cdot r+j)},\hat{S}^{(b\cdot r+j)}\big) + \frac{1}{(b+1)\cdot r}\sum_{i=b\cdot r+j+1}^{(b+1)\cdot r}\sigma_S^2[i]. \qquad (A4)$$

Since the variance is finite and bounded, there exists $b_1 > b_0$ such that $\frac{1}{(b_1+1)\cdot r}\sum_{i=b_1\cdot r+j+1}^{(b_1+1)\cdot r}\sigma_S^2[i] < \frac{\eta}{2}$. Hence, for all $b > b_1$:

$$\bar{d}\big(S^{((b+1)\cdot r)},\hat{S}^{((b+1)\cdot r)}\big) \le \frac{b\cdot r+j}{(b+1)\cdot r}\cdot\bar{d}\big(S^{(b\cdot r+j)},\hat{S}^{(b\cdot r+j)}\big) + \frac{\eta}{2} \le \bar{d}\big(S^{(b\cdot r+j)},\hat{S}^{(b\cdot r+j)}\big) + \frac{\eta}{2} \le D + \eta. \qquad (A5)$$

The rate $R^{((b+1)\cdot r)}$ can be expressed as follows:

$$R^{((b+1)\cdot r)} = \frac{1}{(b+1)\cdot r}\cdot\log_2 M = R^{(b\cdot r+j)}\cdot\frac{b\cdot r+j}{(b+1)\cdot r} \le (R+\eta)\cdot\frac{b\cdot r+j}{(b+1)\cdot r} < R+\eta. \qquad (A6)$$

It follows that $R^{((b+1)\cdot r)} \le R + \eta$ for an arbitrarily small $\eta$ by selecting a sufficiently large b. This proves that every rate-distortion pair achievable with arbitrary blocklengths (e.g., $l = b\cdot r + j$, $j < r$) is also achievable when considering source codes whose blocklength is an integer multiple of r (i.e., $l = b\cdot r$). This concludes the proof. □

Appendix B. Proof of Theorem 1

Recall that $\alpha\in\mathbb{R}^{++}$. To prove the theorem, we fix a rate-distortion pair $(R, D)$ that is achievable for the source $\{S[i]\}_{i\in\mathbb{N}}$. By Definition 6, this implies that for all $\eta > 0$ there exists $l_0(\eta)\in\mathbb{N}$ such that for all $l > l_0(\eta)$ there exists a source code $\mathcal{C}_l$ with rate $R^{(l)} \le R + \eta$ and MSE distortion $D^{(l)} = \mathbb{E}\big\{\frac{1}{l}\|S^{(l)} - \hat{S}^{(l)}\|^2\big\} \le D + \eta$, where $\|\cdot\|$ denotes the norm of a vector. Next, we use the code $\mathcal{C}_l$ to define the source code $\mathcal{C}_l(\alpha)$, which operates in the following manner: The encoder first scales its input block by $1/\alpha$. Then, the block is encoded using the source code $\mathcal{C}_l$. Finally, the selected codeword is scaled by $\alpha$. Since $\mathcal{C}_l(\alpha)$ has the same number of codewords and the same blocklength as $\mathcal{C}_l$, it follows that its rate, denoted $R^{(l)}(\alpha)$, satisfies $R^{(l)}(\alpha) = R^{(l)} \le R + \eta$. Furthermore, by the construction of $\mathcal{C}_l(\alpha)$, its reproduction vector when applied to $\alpha\cdot S^{(l)}$ is equal to the output of $\mathcal{C}_l$ applied to $S^{(l)}$, scaled by $\alpha$, i.e., $\alpha\cdot\hat{S}^{(l)}$. Consequently, the MSE of $\mathcal{C}_l(\alpha)$ when applied to the source $\{\alpha\cdot S[i]\}_{i\in\mathbb{N}}$, denoted $D^{(l)}(\alpha)$, satisfies $D^{(l)}(\alpha) = \mathbb{E}\big\{\frac{1}{l}\|\alpha\cdot S^{(l)} - \alpha\cdot\hat{S}^{(l)}\|^2\big\} = \alpha^2\cdot D^{(l)} \le \alpha^2\cdot D + \alpha^2\eta$.
It thus follows that for all $\tilde{\eta} > 0$ there exists $\tilde{l}_0(\tilde{\eta}) = l_0\big(\min(\tilde{\eta}, \alpha^{-2}\tilde{\eta})\big)$ such that for all $l > \tilde{l}_0(\tilde{\eta})$ there exists a code $\mathcal{C}_l(\alpha)$ with rate $R^{(l)}(\alpha) \le R + \tilde{\eta}$ which achieves an MSE distortion of $D^{(l)}(\alpha) \le \alpha^2\cdot D + \tilde{\eta}$ when applied to the compression of $\{\alpha\cdot S[i]\}_{i\in\mathbb{N}}$. Hence, $(R, \alpha^2\cdot D)$ is achievable for the compression of $\{\alpha\cdot S[i]\}_{i\in\mathbb{N}}$ by Definition 6, proving the theorem.
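The construction used in this proof is straightforward to mimic in simulation. In the sketch below (Python), the one-bit scalar quantizer `base_code` is a hypothetical stand-in for an arbitrary source code $\mathcal{C}_l$; wrapping it with the scaling by $1/\alpha$ at the input and by $\alpha$ at the output leaves the number of codewords (and hence the rate) unchanged, while the empirical MSE scales by exactly $\alpha^2$:

```python
import numpy as np

rng = np.random.default_rng(0)

def base_code(block):
    # hypothetical stand-in for C_l: a one-bit-per-sample quantizer with
    # reproduction levels +/- mean(|block|)
    return np.sign(block) * np.mean(np.abs(block))

def scaled_code(block, alpha):
    # C_l(alpha): scale the input by 1/alpha, apply C_l, scale the codeword by alpha
    return alpha * base_code(block / alpha)

s = rng.normal(size=100_000)
alpha = 3.0
d_base = np.mean((s - base_code(s)) ** 2)                              # MSE of C_l on S
d_scaled = np.mean((alpha * s - scaled_code(alpha * s, alpha)) ** 2)   # MSE of C_l(alpha) on alpha*S
print(d_scaled / d_base)   # = alpha**2: same rate, distortion scaled by alpha^2
```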

Appendix C. Proof of Theorem 3

In this appendix, we prove (17b) by applying an approach similar to that used for proving (17a) in ([17] Appendix A). We first note that Definition 9 can equivalently be written as follows:

$$\text{p-}\limsup_{k\to\infty} Z_k \stackrel{(a)}{=} \inf\Big\{\beta\in\mathbb{R}\ \Big|\ \limsup_{k\to\infty}\Pr\big\{Z_k > \beta\big\} = 0\Big\} \stackrel{(b)}{=} \inf\Big\{\beta\in\mathbb{R}\ \Big|\ \liminf_{k\to\infty}F_k(\beta) = 1\Big\}. \qquad (A7)$$
For the equality (a), we note that the probabilities $\{\Pr\{Z_k > \beta\}\}_{k\in\mathbb{N}}$ are bounded in $[0,1]$; hence, for any $\beta\in\mathbb{R}$ for which $\limsup_{k\to\infty}\Pr\{Z_k > \beta\} = 0$, it also holds from ([31] Theorem 3.17) that the limit of every subsequence of $\{\Pr\{Z_k > \beta\}\}_{k\in\mathbb{N}}$ is 0, since non-negativity of the probability implies $\liminf_{k\to\infty}\Pr\{Z_k > \beta\} \ge 0$. Then, combined with the relationship $\liminf_{k\to\infty}\Pr\{Z_k > \beta\} \le \limsup_{k\to\infty}\Pr\{Z_k > \beta\}$, we conclude:

$$0 \le \liminf_{k\to\infty}\Pr\{Z_k > \beta\} \le \limsup_{k\to\infty}\Pr\{Z_k > \beta\} = 0 \ \Longrightarrow\ \liminf_{k\to\infty}\Pr\{Z_k > \beta\} = \limsup_{k\to\infty}\Pr\{Z_k > \beta\} \stackrel{(a)}{=} \lim_{k\to\infty}\Pr\{Z_k > \beta\} = 0,$$

where (a) follows from ([31] Example 3.18(c)). This implies that $\lim_{k\to\infty}\Pr\{Z_k > \beta\}$ exists and is equal to 0.
In the opposite direction, if $\lim_{k\to\infty}\Pr\{Z_k > \beta\} = 0$, then it follows from ([31] Example 3.18(c)) that $\limsup_{k\to\infty}\Pr\{Z_k > \beta\} = 0$. Next, we note that since $F_k(\beta)$ is bounded in $[0,1]$, $\liminf_{k\to\infty}F_k(\beta)$ is finite for every $\beta\in\mathbb{R}$, even if $\lim_{k\to\infty}F_k(\beta)$ does not exist. Equality (b) follows since $\limsup_{k\to\infty}\Pr\{Z_k > \beta\} = \limsup_{k\to\infty}\big(1 - \Pr\{Z_k \le \beta\}\big)$, which according to ([32] Theorem 7.3.7) is equal to $1 + \limsup_{k\to\infty}\big(-\Pr\{Z_k \le \beta\}\big)$. By ([33] Chapter 1, Page 29), this quantity is also equal to $1 - \liminf_{k\to\infty}\Pr\{Z_k \le \beta\} = 1 - \liminf_{k\to\infty}F_k(\beta)$.
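As a simple illustration of (A7) (our example, not part of the proof), consider the deterministic sequence $Z_k = z_k$ with $z_k = 1 + \frac{(-1)^k}{2}$, which alternates between $\frac{1}{2}$ and $\frac{3}{2}$; for $\beta\in[\frac{1}{2},\frac{3}{2})$ the CDFs $F_k(\beta)$ alternate between 0 and 1, so their limit inferior equals 1 only for $\beta\ge\frac{3}{2}$:

```latex
\[
  z_k = 1 + \tfrac{(-1)^k}{2}
  \;\Longrightarrow\;
  F_k(\beta) = \mathbb{1}\{\beta \ge z_k\},
  \qquad
  \liminf_{k\to\infty} F_k(\beta) = \mathbb{1}\big\{\beta \ge \tfrac{3}{2}\big\},
\]
\[
  \text{hence}\quad
  \text{p-}\limsup_{k\to\infty} Z_k
  = \inf\Big\{\beta\in\mathbb{R} \;\Big|\; \liminf_{k\to\infty} F_k(\beta) = 1\Big\}
  = \tfrac{3}{2}
  = \limsup_{k\to\infty} z_k.
\]
```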
Next, we state the following lemma:
Lemma A1.
Given assumption AS2, for all $\beta\in\mathbb{R}$ it holds that

$$\liminf_{k\to\infty}F_k(\beta) = \lim_{n\to\infty}\liminf_{k\to\infty}\tilde{F}_{k,n}(\beta). \qquad (A8)$$
Proof. 
To prove the lemma, we first show that $\liminf_{k\to\infty}F_k(\beta) \ge \lim_{n\to\infty}\liminf_{k\to\infty}\tilde{F}_{k,n}(\beta)$, and then we show that $\liminf_{k\to\infty}F_k(\beta) \le \lim_{n\to\infty}\liminf_{k\to\infty}\tilde{F}_{k,n}(\beta)$. Recall that by AS2, for all $\beta\in\mathbb{R}$ and $k\in\mathbb{N}$, $\tilde{F}_{k,n}(\beta)$ converges as $n\to\infty$ to $F_k(\beta)$, uniformly over k and β, i.e., for all $\eta > 0$ there exist $n_0(\eta)\in\mathbb{N}$ and $k_0\big(n_0(\eta),\eta\big)\in\mathbb{N}$ such that for every $n > n_0(\eta)$, $\beta\in\mathbb{R}$ and $k > k_0\big(n_0(\eta),\eta\big)$, it holds that $\big|\tilde{F}_{k,n}(\beta) - F_k(\beta)\big| < \eta$. Consequently, for every subsequence $0 < k_1 < k_2 < \cdots$ such that $\lim_{l\to\infty}\tilde{F}_{k_l,n}(\beta)$ exists for any $n > n_0(\eta)$, it follows from ([31] Theorem 7.11) that, as the convergence over k is uniform, the limits over n and l are interchangeable:

$$\lim_{n\to\infty}\lim_{l\to\infty}\tilde{F}_{k_l,n}(\beta) = \lim_{l\to\infty}\lim_{n\to\infty}\tilde{F}_{k_l,n}(\beta) = \lim_{l\to\infty}F_{k_l}(\beta). \qquad (A9)$$

The existence of such a convergent subsequence is guaranteed by the Bolzano-Weierstrass theorem ([31] Theorem 2.42), as $\tilde{F}_{k,n}(\beta)\in[0,1]$.
From the properties of the limit inferior ([31] Theorem 3.17), it follows that there exists a subsequence of $\{F_k(\beta)\}_{k\in\mathbb{N}}$, denoted $\{F_{k_m}(\beta)\}_{m\in\mathbb{N}}$, such that $\lim_{m\to\infty}F_{k_m}(\beta) = \liminf_{k\to\infty}F_k(\beta)$. Consequently,

$$\liminf_{k\to\infty}F_k(\beta) = \lim_{m\to\infty}F_{k_m}(\beta) \stackrel{(a)}{=} \lim_{n\to\infty}\lim_{m\to\infty}\tilde{F}_{k_m,n}(\beta) \stackrel{(b)}{\ge} \lim_{n\to\infty}\liminf_{k\to\infty}\tilde{F}_{k,n}(\beta), \qquad (A10)$$
where (a) follows from (A9), and (b) follows from the definition of the limit inferior ([31] Definition 3.16). Similarly, by ([31] Theorem 3.17), for any $n\in\mathbb{N}$ there exists a subsequence of $\{\tilde{F}_{k,n}(\beta)\}_{k\in\mathbb{N}}$, denoted $\{\tilde{F}_{k_l,n}(\beta)\}_{l\in\mathbb{N}}$, where $\{k_l\}_{l\in\mathbb{N}}$ satisfies $0 < k_1 < k_2 < \cdots$, such that $\lim_{l\to\infty}\tilde{F}_{k_l,n}(\beta) = \liminf_{k\to\infty}\tilde{F}_{k,n}(\beta)$. Therefore,

$$\lim_{n\to\infty}\liminf_{k\to\infty}\tilde{F}_{k,n}(\beta) = \lim_{n\to\infty}\lim_{l\to\infty}\tilde{F}_{k_l,n}(\beta) \stackrel{(a)}{=} \lim_{l\to\infty}F_{k_l}(\beta) \stackrel{(b)}{\ge} \liminf_{k\to\infty}F_k(\beta), \qquad (A11)$$

where (a) follows from (A9), and (b) follows from the definition of the limit inferior ([31] Definition 3.16). Therefore, $\liminf_{k\to\infty}F_k(\beta) \le \lim_{n\to\infty}\liminf_{k\to\infty}\tilde{F}_{k,n}(\beta)$. Combining (A10) and (A11) proves (A8) in the statement of the lemma. □
Lemma A2.
Given assumptions AS1–AS2, the sequence of RVs $\{\tilde{Z}_{k,n}\}_{k,n\in\mathbb{N}}$ satisfies

$$\lim_{n\to\infty}\ \text{p-}\limsup_{k\to\infty}\tilde{Z}_{k,n} = \inf\Big\{\beta\in\mathbb{R}\ \Big|\ \lim_{n\to\infty}\liminf_{k\to\infty}\tilde{F}_{k,n}(\beta) = 1\Big\}. \qquad (A12)$$
Proof. 
Since by assumption AS1, for every $n\in\mathbb{N}$, every convergent subsequence of $\{\tilde{Z}_{k,n}\}_{k\in\mathbb{N}}$ converges in distribution as $k\to\infty$ to a deterministic scalar, it follows that every convergent subsequence of $\tilde{F}_{k,n}(\beta)$ converges as $k\to\infty$ to a step function, which is the CDF of the corresponding sublimit of $\tilde{Z}_{k,n}$. In particular, the limit $\liminf_{k\to\infty}\tilde{F}_{k,n}(\beta)$ is a step function representing the CDF of a deterministic scalar $\zeta_n$, i.e.,

$$\liminf_{k\to\infty}\tilde{F}_{k,n}(\beta) = \begin{cases} 0 & \beta < \zeta_n \\ 1 & \beta \ge \zeta_n. \end{cases} \qquad (A13)$$
Since, by Lemma A1, AS2 implies that the limit $\lim_{n\to\infty}\liminf_{k\to\infty}\tilde{F}_{k,n}(\beta)$ exists (convergence to a discontinuous function is in the sense of ([31] Ex. 7.3)), $\lim_{n\to\infty}\zeta_n$ exists. Hence, we obtain that

$$\lim_{n\to\infty}\liminf_{k\to\infty}\tilde{F}_{k,n}(\beta) = \begin{cases} 0 & \beta < \lim_{n\to\infty}\zeta_n \\ 1 & \beta \ge \lim_{n\to\infty}\zeta_n, \end{cases} \qquad (A14)$$
and from the right-hand side of (A12) we have that

$$\inf\Big\{\beta\in\mathbb{R}\ \Big|\ \lim_{n\to\infty}\liminf_{k\to\infty}\tilde{F}_{k,n}(\beta) = 1\Big\} = \lim_{n\to\infty}\zeta_n. \qquad (A15)$$
Next, from (A7) and (A13) we note that

$$\text{p-}\limsup_{k\to\infty}\tilde{Z}_{k,n} = \inf\Big\{\beta\in\mathbb{R}\ \Big|\ \liminf_{k\to\infty}\tilde{F}_{k,n}(\beta) = 1\Big\} = \zeta_n.$$

Consequently, the left-hand side of (A12) is equal to $\lim_{n\to\infty}\zeta_n$. Combining with (A15), we arrive at the equality (A12) in the statement of the lemma. □
Substituting (A8) into (A7) results in
$$\text{p-}\limsup_{k\to\infty}Z_k = \inf\Big\{\beta\in\mathbb{R}\ \Big|\ \lim_{n\to\infty}\liminf_{k\to\infty}\tilde{F}_{k,n}(\beta) = 1\Big\} \stackrel{(a)}{=} \lim_{n\to\infty}\ \text{p-}\limsup_{k\to\infty}\tilde{Z}_{k,n}, \qquad (A16)$$

where (a) follows from (A12). Equation (A16) concludes the proof of (17b).

Appendix D. Proof of Theorem 4

In this appendix we detail the proof of Theorem 4. The outline of the proof is given as follows:
  • We first show in Appendix D.1 that the PDF of the random vector $S_n^{(k)}$, representing the first k samples of the CT WSCS source $S_c(t)$ sampled with the sampling interval $T_s(n) = \frac{T_{\rm ps}}{p+\epsilon_n}$, converges as $n\to\infty$, for any $k\in\mathbb{N}$, to the PDF of $S_\epsilon^{(k)}$, which represents the first k samples of the CT WSCS source $S_c(t)$ sampled with the sampling interval $T_s = \frac{T_{\rm ps}}{p+\epsilon}$. We prove that this convergence is uniform in $k\in\mathbb{N}$ and in the realization vector $s^{(k)}\in\mathbb{R}^k$. This is stated in Lemma A3.
  • Next, in Appendix D.2 we apply Theorem 3 to relate the mutual information density rates for the random source vector $S_n^{(k)}$ and its reproduction $\hat{S}_n^{(k)}$ with those of the random source vector $S_\epsilon^{(k)}$ and its reproduction $\hat{S}_\epsilon^{(k)}$. To that aim, let $F_{S_n,\hat{S}_n}$ and $F_{S_\epsilon,\hat{S}_\epsilon}$ denote the joint distributions of arbitrary-dimensional source and reproduction vectors corresponding to the synchronously sampled and the asynchronously sampled source process, respectively. We define the following mutual information density rates:

$$\tilde{Z}_{k,n}\big(F_{S_n,\hat{S}_n}\big) \triangleq \frac{1}{k}\log\frac{p_{S_n^{(k)}|\hat{S}_n^{(k)}}\big(S_n^{(k)}\,\big|\,\hat{S}_n^{(k)}\big)}{p_{S_n^{(k)}}\big(S_n^{(k)}\big)}$$

    and

$$Z_{k,\epsilon}\big(F_{S_\epsilon,\hat{S}_\epsilon}\big) \triangleq \frac{1}{k}\log\frac{p_{S_\epsilon^{(k)}|\hat{S}_\epsilon^{(k)}}\big(S_\epsilon^{(k)}\,\big|\,\hat{S}_\epsilon^{(k)}\big)}{p_{S_\epsilon^{(k)}}\big(S_\epsilon^{(k)}\big)}, \qquad (A17)$$

    for $k, n\in\mathbb{N}$. The RVs $\tilde{Z}_{k,n}\big(F_{S_n,\hat{S}_n}\big)$ and $Z_{k,\epsilon}\big(F_{S_\epsilon,\hat{S}_\epsilon}\big)$ in (A17) denote the mutual information density rates ([14] Definition 3.2.1) between the DT source process and the corresponding reproduction process for the case of synchronous sampling and for the case of asynchronous sampling, respectively.
    We then show that if the pairs of source process and optimal reproduction process $\big(S_n[i], \hat{S}_n[i]\big)_{i\in\mathbb{N}}$ and $\big(S_\epsilon[i], \hat{S}_\epsilon[i]\big)_{i\in\mathbb{N}}$ satisfy that $p_{\hat{S}_n^{(k)}}\big(\hat{s}^{(k)}\big)\xrightarrow[n\to\infty]{} p_{\hat{S}_\epsilon^{(k)}}\big(\hat{s}^{(k)}\big)$ uniformly with respect to $\hat{s}^{(k)}\in\mathbb{R}^k$ and $k\in\mathbb{N}$, and that $p_{S_n^{(k)}|\hat{S}_n^{(k)}}\big(s^{(k)}|\hat{s}^{(k)}\big)\xrightarrow[n\to\infty]{} p_{S_\epsilon^{(k)}|\hat{S}_\epsilon^{(k)}}\big(s^{(k)}|\hat{s}^{(k)}\big)$ uniformly in $\big[s^{(k)T},\hat{s}^{(k)T}\big]^T\in\mathbb{R}^{2k}$ and $k\in\mathbb{N}$, then $\tilde{Z}_{k,n}\big(F_{S_n,\hat{S}_n}\big)\xrightarrow[n\to\infty]{(dist.)} Z_{k,\epsilon}\big(F_{S_\epsilon,\hat{S}_\epsilon}\big)$ uniformly in $k\in\mathbb{N}$. In addition, Lemma A5 proves that every subsequence of $\big\{\tilde{Z}_{k,n}\big(F_{S_n,\hat{S}_n}\big)\big\}_{k\in\mathbb{N}}$ with respect to k, indexed as $k_l$, converges in distribution as $l\to\infty$ to a deterministic scalar.
  • Lastly, in Appendix D.3 we combine the above results to show in Lemmas A7 and A8 that $R_\epsilon(D) \le \limsup_{n\to\infty}R_n(D)$ and $R_\epsilon(D) \ge \limsup_{n\to\infty}R_n(D)$, respectively; this implies that $R_\epsilon(D) = \limsup_{n\to\infty}R_n(D)$, which proves the theorem.
To facilitate our proof, we will need uniform convergence in $k\in\mathbb{N}$ of $p_{S_n^{(k)}}\big(s^{(k)}\big)$, $p_{\hat{S}_n^{(k)}}\big(\hat{s}^{(k)}\big)$ and $p_{S_n^{(k)}|\hat{S}_n^{(k)}}\big(s^{(k)}|\hat{s}^{(k)}\big)$ to $p_{S_\epsilon^{(k)}}\big(s^{(k)}\big)$, $p_{\hat{S}_\epsilon^{(k)}}\big(\hat{s}^{(k)}\big)$ and $p_{S_\epsilon^{(k)}|\hat{S}_\epsilon^{(k)}}\big(s^{(k)}|\hat{s}^{(k)}\big)$, respectively. To that aim, we make the following scaling assumption w.l.o.g.:
Assumption A1.
The variance of the source and the allowed distortion are scaled by some factor $\alpha^2$ such that

$$\alpha^2\cdot\min\Big\{D,\ \min_{0\le t\le T_{\rm ps}}\sigma_{S_c}^2(t) - D\Big\} > \frac{1}{2\pi}.$$
Note that this assumption has no effect on the generality of the RDF for multivariate stationary processes detailed in ([5] Section 10.3.3), ([34] Section IV). Moreover, by Theorem 1, for every $\alpha > 0$, any rate R achievable when compressing the original source $S_c(t)$ with distortion not larger than D is also achievable when compressing the scaled source $\alpha\cdot S_c(t)$ with distortion not larger than $\alpha^2\cdot D$. Note that if for the source $S_c(t)$ the distortion satisfies $D < \min_{0\le t\le T_{\rm ps}}\sigma_{S_c}^2(t)$, then for the scaled source and distortion we have $\alpha^2\cdot D < \min_{0\le t\le T_{\rm ps}}\alpha^2\cdot\sigma_{S_c}^2(t)$.

Appendix D.1. Convergence in Distribution of $S_n^{(k)}$ to $S_\epsilon^{(k)}$ Uniformly with Respect to $k\in\mathbb{N}$

In order to prove the uniform convergence in distribution $S_n^{(k)}\xrightarrow[n\to\infty]{(dist.)} S_\epsilon^{(k)}$, uniformly with respect to $k\in\mathbb{N}$, we first prove, in Lemma A3, that as $n\to\infty$ the sequence of PDFs of $S_n^{(k)}$, $p_{S_n^{(k)}}\big(s^{(k)}\big)$, converges to the PDF of $S_\epsilon^{(k)}$, $p_{S_\epsilon^{(k)}}\big(s^{(k)}\big)$, uniformly in $s^{(k)}\in\mathbb{R}^k$ and in $k\in\mathbb{N}$. Next, we show in Corollary A1 that $S_n^{(k)}\xrightarrow[n\to\infty]{(dist.)} S_\epsilon^{(k)}$ uniformly in $k\in\mathbb{N}$.
To that aim, let us define the set $\mathcal{K}\triangleq\{1,2,\ldots,k\}$ and consider the k-dimensional zero-mean, memoryless random vectors $S_n^{(k)}$ and $S_\epsilon^{(k)}$ with their respective diagonal correlation matrices:

$$R_n^{(k)} \triangleq \mathbb{E}\big\{S_n^{(k)}\big(S_n^{(k)}\big)^T\big\} = {\rm diag}\big(\sigma_{S_n}^2[1],\ldots,\sigma_{S_n}^2[k]\big),$$

$$R_\epsilon^{(k)} \triangleq \mathbb{E}\big\{S_\epsilon^{(k)}\big(S_\epsilon^{(k)}\big)^T\big\} = {\rm diag}\big(\sigma_{S_\epsilon}^2[1],\ldots,\sigma_{S_\epsilon}^2[k]\big).$$
Since $\epsilon_n \triangleq \frac{\lfloor n\cdot\epsilon\rfloor}{n}$, it holds that $\frac{n\cdot\epsilon - 1}{n} \le \epsilon_n \le \frac{n\cdot\epsilon}{n}$; therefore

$$\lim_{n\to\infty}\epsilon_n = \epsilon. \qquad (A20)$$

Now we note that since $\sigma_{S_c}^2(t)$ is uniformly continuous, then by the definition of a uniformly continuous function, for each $i\in\mathbb{N}$, the limit in (A20) implies that

$$\lim_{n\to\infty}\sigma_{S_n}^2[i] \triangleq \lim_{n\to\infty}\sigma_{S_c}^2\Big(i\cdot\frac{T_{\rm ps}}{p+\epsilon_n}\Big) = \sigma_{S_c}^2\Big(i\cdot\frac{T_{\rm ps}}{p+\epsilon}\Big) \triangleq \sigma_{S_\epsilon}^2[i].$$

From Assumption A1, it follows that $\sigma_{S_n}^2[i] > \frac{1}{2\pi}$; hence, we can state the following lemma:
Lemma A3.
The PDF of $S_n^{(k)}$, $p_{S_n^{(k)}}\big(s^{(k)}\big)$, converges as $n\to\infty$ to the PDF of $S_\epsilon^{(k)}$, $p_{S_\epsilon^{(k)}}\big(s^{(k)}\big)$, uniformly in $s^{(k)}\in\mathbb{R}^k$ and in $k\in\mathbb{N}$:

$$\lim_{n\to\infty}p_{S_n^{(k)}}\big(s^{(k)}\big) = p_{S_\epsilon^{(k)}}\big(s^{(k)}\big), \qquad \forall s^{(k)}\in\mathbb{R}^k,\ k\in\mathbb{N}.$$
Proof. 
The proof of the lemma directly follows the steps in the proof of ([17] Lemma B.1), which applies to Gaussian random vectors with independent entries whose variances are larger than $\frac{1}{2\pi}$. □
Lemma A3 gives rise to the following corollary:
Corollary A1.
For any $k\in\mathbb{N}$ it holds that $S_n^{(k)}\xrightarrow[n\to\infty]{(dist.)} S_\epsilon^{(k)}$, and the convergence is uniform over k.
Proof. 
The corollary holds due to ([35] Theorem 1): Since $p_{S_n^{(k)}}\big(s^{(k)}\big)$ converges to $p_{S_\epsilon^{(k)}}\big(s^{(k)}\big)$, then $S_n^{(k)}\xrightarrow[n\to\infty]{(dist.)} S_\epsilon^{(k)}$. In addition, since the convergence of the PDFs is uniform in $k\in\mathbb{N}$, the convergence of the CDFs is also uniform in $k\in\mathbb{N}$. □
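The convergence statements of this appendix are easy to probe numerically. The sketch below (Python; the smooth raised-cosine profile is a hypothetical stand-in for a uniformly continuous $\sigma^2_{S_c}(t)$, and the parameter values are ours) computes $\epsilon_n = \lfloor n\epsilon\rfloor/n$ and reports the worst-case deviation $\max_{i\le k}\big|\sigma^2_{S_n}[i]-\sigma^2_{S_\epsilon}[i]\big|$ over a fixed block of k samples, which shrinks as $\epsilon_n\to\epsilon$:

```python
import numpy as np

T_PS, p, eps = 5.0, 2, 5 * np.pi / 32      # assumed CT period and asynchronous mismatch

def var_ct(t):
    # hypothetical smooth (hence uniformly continuous) periodic CT variance
    return 0.6 + 0.4 * np.cos(2 * np.pi * t / T_PS)

k = 200
i = np.arange(1, k + 1)
var_eps = var_ct(i * T_PS / (p + eps))               # sigma^2_{S_eps}[i]
for n in (10, 100, 1000, 10_000):
    eps_n = np.floor(n * eps) / n                    # rational approximation of eps
    var_n = var_ct(i * T_PS / (p + eps_n))           # sigma^2_{S_n}[i]
    print(n, np.max(np.abs(var_n - var_eps)))        # -> 0 as eps_n -> eps (fixed k)
```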

Appendix D.2. Showing that $\tilde{Z}_{k,n}\big(F^{opt}_{S_n,\hat{S}_n}\big)$ and $Z_{k,\epsilon}\big(F_{S_\epsilon,\hat{S}_\epsilon}\big)$ Satisfy the Conditions of Theorem 3

Let $F^{opt}_{S_n,\hat{S}_n}$ denote the joint distribution of the source process and the corresponding optimal reproduction process satisfying the distortion constraint D. We next prove that if $F^{opt}_{S_n,\hat{S}_n}\xrightarrow[n\to\infty]{(dist.)} F_{S_\epsilon,\hat{S}_\epsilon}$, then $\tilde{Z}_{k,n}\big(F^{opt}_{S_n,\hat{S}_n}\big)$ and $Z_{k,\epsilon}\big(F_{S_\epsilon,\hat{S}_\epsilon}\big)$ satisfy AS1–AS2. In particular, Lemma A4 proves that $\tilde{Z}_{k,n}\big(F^{opt}_{S_n,\hat{S}_n}\big)\xrightarrow[n\to\infty]{(dist.)} Z_{k,\epsilon}\big(F_{S_\epsilon,\hat{S}_\epsilon}\big)$ uniformly in $k\in\mathbb{N}$ for the optimal zero-mean Gaussian reproduction vectors with independent entries. Lemma A5 proves that for any fixed n, $\tilde{Z}_{k,n}\big(F^{opt}_{S_n,\hat{S}_n}\big)$ converges in distribution to a deterministic scalar as $k\to\infty$.
Lemma A4.
Let $\{\hat{S}_n^{(k)}\}_{n\in\mathbb{N}}$ and $\{W_n^{(k)}\}_{n\in\mathbb{N}}$ be two sequences of mutually independent $k\times 1$ zero-mean Gaussian random vectors related via the backward channel (20), each having independent entries, and denote their PDFs by $p_{\hat{S}_n^{(k)}}\big(\hat{s}^{(k)}\big)$ and $p_{W_n^{(k)}}\big(w^{(k)}\big)$, respectively. Consider two other zero-mean Gaussian random vectors $\hat{S}_\epsilon^{(k)}$ and $W_\epsilon^{(k)}$, each having independent entries, with PDFs $p_{\hat{S}_\epsilon^{(k)}}\big(\hat{s}^{(k)}\big)$ and $p_{W_\epsilon^{(k)}}\big(w^{(k)}\big)$, respectively, such that $\lim_{n\to\infty}p_{\hat{S}_n^{(k)}}\big(\hat{s}^{(k)}\big) = p_{\hat{S}_\epsilon^{(k)}}\big(\hat{s}^{(k)}\big)$ uniformly in $\hat{s}^{(k)}\in\mathbb{R}^k$ and in $k\in\mathbb{N}$, and $\lim_{n\to\infty}p_{W_n^{(k)}}\big(w^{(k)}\big) = p_{W_\epsilon^{(k)}}\big(w^{(k)}\big)$ uniformly in $w^{(k)}\in\mathbb{R}^k$ and in $k\in\mathbb{N}$. Then, the RVs $\tilde{Z}_{k,n}\big(F^{opt}_{S_n,\hat{S}_n}\big)$ and $Z_{k,\epsilon}\big(F_{S_\epsilon,\hat{S}_\epsilon}\big)$ defined via (A17) satisfy $\tilde{Z}_{k,n}\big(F^{opt}_{S_n,\hat{S}_n}\big)\xrightarrow[n\to\infty]{(dist.)} Z_{k,\epsilon}\big(F_{S_\epsilon,\hat{S}_\epsilon}\big)$ uniformly over $k\in\mathbb{N}$.
Proof. 
To begin the proof, for $\big[s^{(k)T},\hat{s}^{(k)T}\big]^T\in\mathbb{R}^{2k}$, define

$$f_{k,n}\big(s^{(k)},\hat{s}^{(k)}\big) \triangleq \frac{p_{S_n^{(k)}|\hat{S}_n^{(k)}}\big(s^{(k)}|\hat{s}^{(k)}\big)}{p_{S_n^{(k)}}\big(s^{(k)}\big)}, \qquad f_{k,\epsilon}\big(s^{(k)},\hat{s}^{(k)}\big) \triangleq \frac{p_{S_\epsilon^{(k)}|\hat{S}_\epsilon^{(k)}}\big(s^{(k)}|\hat{s}^{(k)}\big)}{p_{S_\epsilon^{(k)}}\big(s^{(k)}\big)}.$$
Now, we recall the backward channel relationship (20):
$$S_n^{(k)} = \hat{S}_n^{(k)} + W_n^{(k)}, \qquad (A23)$$
where $\hat{S}_n^{(k)}$ and $W_n^{(k)}$ are mutually independent zero-mean Gaussian random vectors with independent entries, corresponding to the optimal compression process and its respective distortion. From this relationship we obtain

$$p_{S_n^{(k)}|\hat{S}_n^{(k)}}\big(s^{(k)}|\hat{s}^{(k)}\big) \stackrel{(a)}{=} p_{\hat{S}_n^{(k)}+W_n^{(k)}|\hat{S}_n^{(k)}}\big(s^{(k)}|\hat{s}^{(k)}\big) = p_{W_n^{(k)}|\hat{S}_n^{(k)}}\big(s^{(k)}-\hat{s}^{(k)}|\hat{s}^{(k)}\big) \stackrel{(b)}{=} p_{W_n^{(k)}}\big(s^{(k)}-\hat{s}^{(k)}\big), \qquad (A24)$$
where (a) follows since $S_n^{(k)} = \hat{S}_n^{(k)} + W_n^{(k)}$, see (A23), and (b) follows since $W_n^{(k)}$ and $\hat{S}_n^{(k)}$ are mutually independent. The joint PDF of $S_n^{(k)}$ and $\hat{S}_n^{(k)}$ can be expressed via the conditional PDF as:
$$p_{S_n^{(k)},\hat{S}_n^{(k)}}\big(s^{(k)},\hat{s}^{(k)}\big) = p_{S_n^{(k)}|\hat{S}_n^{(k)}}\big(s^{(k)}|\hat{s}^{(k)}\big)\cdot p_{\hat{S}_n^{(k)}}\big(\hat{s}^{(k)}\big) \stackrel{(a)}{=} p_{W_n^{(k)}}\big(s^{(k)}-\hat{s}^{(k)}\big)\cdot p_{\hat{S}_n^{(k)}}\big(\hat{s}^{(k)}\big), \qquad (A25)$$
where (a) follows from (A24). Since $\hat{S}_n^{(k)}$ and $W_n^{(k)}$ are Gaussian and mutually independent, and since the product of two multivariate Gaussian PDFs is also a multivariate Gaussian PDF ([36] Section 3), it follows from (A25) that $S_n^{(k)}$ and $\hat{S}_n^{(k)}$ are jointly Gaussian. By the mutual independence of $W_n^{(k)}$ and $\hat{S}_n^{(k)}$, the right-hand side (RHS) of (A25) is also equal to the joint PDF of $\big[W_n^{(k)T},\hat{S}_n^{(k)T}\big]^T$, denoted by $p_{W_n^{(k)},\hat{S}_n^{(k)}}\big(s^{(k)}-\hat{s}^{(k)},\hat{s}^{(k)}\big)$. Now, from (A24), the assumption $\lim_{n\to\infty}p_{W_n^{(k)}}\big(w^{(k)}\big) = p_{W_\epsilon^{(k)}}\big(w^{(k)}\big)$ implies that a limit exists for the conditional PDF $p_{S_n^{(k)}|\hat{S}_n^{(k)}}\big(s^{(k)}|\hat{s}^{(k)}\big)$, which we denote by $p_{S_\epsilon^{(k)}|\hat{S}_\epsilon^{(k)}}\big(s^{(k)}|\hat{s}^{(k)}\big)$. Combining this with the assumption $\lim_{n\to\infty}p_{\hat{S}_n^{(k)}}\big(\hat{s}^{(k)}\big) = p_{\hat{S}_\epsilon^{(k)}}\big(\hat{s}^{(k)}\big)$, we have that
$$\lim_{n\to\infty}p_{S_n^{(k)},\hat{S}_n^{(k)}}\big(s^{(k)},\hat{s}^{(k)}\big) = \lim_{n\to\infty}p_{S_n^{(k)}|\hat{S}_n^{(k)}}\big(s^{(k)}|\hat{s}^{(k)}\big)\cdot p_{\hat{S}_n^{(k)}}\big(\hat{s}^{(k)}\big) \stackrel{(a)}{=} \lim_{n\to\infty}p_{W_n^{(k)}}\big(s^{(k)}-\hat{s}^{(k)}\big)\cdot p_{\hat{S}_n^{(k)}}\big(\hat{s}^{(k)}\big) \stackrel{(b)}{=} \lim_{n\to\infty}p_{W_n^{(k)}}\big(s^{(k)}-\hat{s}^{(k)}\big)\cdot\lim_{n\to\infty}p_{\hat{S}_n^{(k)}}\big(\hat{s}^{(k)}\big) = p_{S_\epsilon^{(k)}|\hat{S}_\epsilon^{(k)}}\big(s^{(k)}|\hat{s}^{(k)}\big)\cdot p_{\hat{S}_\epsilon^{(k)}}\big(\hat{s}^{(k)}\big) = p_{S_\epsilon^{(k)},\hat{S}_\epsilon^{(k)}}\big(s^{(k)},\hat{s}^{(k)}\big), \qquad (A26)$$
where (a) follows from (A24), and (b) follows since the limit of each sequence in the product exists ([31] Theorem 3.3). Convergence is uniform in $\big[s^{(k)T},\hat{s}^{(k)T}\big]^T\in\mathbb{R}^{2k}$ and in $k\in\mathbb{N}$, as each sequence converges uniformly in $k\in\mathbb{N}$ ([31] Page 165). Observe that the joint PDF of the zero-mean Gaussian random vectors $S_n^{(k)},\hat{S}_n^{(k)}$ is given by the general expression:
$$p_{S_n^{(k)},\hat{S}_n^{(k)}}\big(s^{(k)},\hat{s}^{(k)}\big) = \Big({\rm Det}\big(2\pi\tilde{C}_n^{(2k)}\big)\Big)^{-\frac{1}{2}}\exp\Big(-\frac{1}{2}\big[\hat{s}^{(k)T},s^{(k)T}\big]\big(\tilde{C}_n^{(2k)}\big)^{-1}\big[\hat{s}^{(k)T},s^{(k)T}\big]^T\Big), \qquad (A27)$$
where $\tilde{C}_n^{(2k)}$ denotes the joint covariance matrix of $\big[\hat{S}_n^{(k)T},S_n^{(k)T}\big]^T$. From (A27) we note that $p_{S_n^{(k)},\hat{S}_n^{(k)}}\big(s^{(k)},\hat{s}^{(k)}\big)$ is a continuous mapping of $\tilde{C}_n^{(2k)}$ with respect to the index n, see ([17] Lemma B.1). Hence, the convergence in (A26) of $p_{S_n^{(k)},\hat{S}_n^{(k)}}\big(s^{(k)},\hat{s}^{(k)}\big)$ as $n\to\infty$ directly implies the convergence of $\tilde{C}_n^{(2k)}$ as $n\to\infty$ to a limit which we denote by $\tilde{C}_\epsilon^{(2k)}$. It therefore follows that the limit function $p_{S_\epsilon^{(k)},\hat{S}_\epsilon^{(k)}}\big(s^{(k)},\hat{s}^{(k)}\big)$ corresponds to the PDF of a Gaussian vector with covariance matrix $\tilde{C}_\epsilon^{(2k)}$.
The joint PDF of the zero-mean Gaussian random vectors $W_n^{(k)},\hat{S}_n^{(k)}$ can be obtained using their mutual independence as:

$$p_{W_n^{(k)},\hat{S}_n^{(k)}}\big(s^{(k)}-\hat{s}^{(k)},\hat{s}^{(k)}\big) = \Big({\rm Det}\big(2\pi\Sigma_n^{(2k)}\big)\Big)^{-\frac{1}{2}}\exp\Big(-\frac{1}{2}\big[\big(s^{(k)}-\hat{s}^{(k)}\big)^T,\hat{s}^{(k)T}\big]\big(\Sigma_n^{(2k)}\big)^{-1}\big[\big(s^{(k)}-\hat{s}^{(k)}\big)^T,\hat{s}^{(k)T}\big]^T\Big),$$
where $\Sigma_n^{(2k)}$ denotes the joint covariance matrix of $\big[W_n^{(k)T},\hat{S}_n^{(k)T}\big]^T$. Since the vectors $W_n^{(k)}$ and $\hat{S}_n^{(k)}$ are zero-mean and mutually independent and, by the relationship (20), each vector has independent entries, it follows that $\Sigma_n^{(2k)}$ is a diagonal matrix whose diagonal elements are the corresponding temporal variances at the respective indexes $i\in\{1,2,\ldots,k\}$, i.e.,

$$\Sigma_n^{(2k)} \triangleq \mathbb{E}\Big\{\big[W_n^{(k)T},\hat{S}_n^{(k)T}\big]^T\cdot\big[W_n^{(k)T},\hat{S}_n^{(k)T}\big]\Big\} = {\rm diag}\Big(\mathbb{E}\big\{W_n[1]^2\big\},\mathbb{E}\big\{W_n[2]^2\big\},\ldots,\mathbb{E}\big\{W_n[k]^2\big\},\sigma_{\hat{S}_n}^2[1],\sigma_{\hat{S}_n}^2[2],\ldots,\sigma_{\hat{S}_n}^2[k]\Big). \qquad (A29)$$
The convergence of $p_{W_n^{(k)},\hat{S}_n^{(k)}}\big(s^{(k)}-\hat{s}^{(k)},\hat{s}^{(k)}\big)$, from (A26), implies the convergence of the diagonal elements in (A29) as $n\to\infty$. Hence, $\Sigma_n^{(2k)}$ converges as $n\to\infty$ to a diagonal joint covariance matrix which we denote by $\Sigma_\epsilon^{(2k)}$. This further implies that the limiting vectors $W_\epsilon^{(k)}$ and $\hat{S}_\epsilon^{(k)}$ are zero-mean and mutually independent, and each vector has independent entries for $i\in\{1,2,\ldots,k\}$.
Relationship (A26) implies that the joint limit distribution satisfies $p_{S_\epsilon^{(k)},\hat{S}_\epsilon^{(k)}}\big(s^{(k)},\hat{s}^{(k)}\big) = p_{\hat{S}_\epsilon^{(k)}}\big(\hat{s}^{(k)}\big)\cdot p_{W_\epsilon^{(k)}}\big(s^{(k)}-\hat{s}^{(k)}\big)$. Consequently, we can define an asymptotic backward channel that satisfies (A26) via the expression:

$$S_\epsilon^{(k)}[i] = \hat{S}_\epsilon^{(k)}[i] + W_\epsilon^{(k)}[i]. \qquad (A30)$$
Next, by the convergence of the joint PDF $p_{W_n^{(k)}}\big(s^{(k)}-\hat{s}^{(k)}\big)\cdot p_{\hat{S}_n^{(k)}}\big(\hat{s}^{(k)}\big)$, uniformly in $k\in\mathbb{N}$ and in $\big[s^{(k)T},\hat{s}^{(k)T}\big]^T\in\mathbb{R}^{2k}$, it follows from ([35] Theorem 1) that $\big[\hat{S}_n^{(k)T},W_n^{(k)T}\big]^T\xrightarrow[n\to\infty]{(dist.)}\big[\hat{S}_\epsilon^{(k)T},W_\epsilon^{(k)T}\big]^T$, and the convergence is uniform in $k\in\mathbb{N}$ and in $\big[s^{(k)T},\hat{s}^{(k)T}\big]^T\in\mathbb{R}^{2k}$. Then, by the continuous mapping theorem (CMT) ([37] Theorem 7.7), we have

$$\big[S_n^{(k)T},\hat{S}_n^{(k)T}\big]^T = \big[\big(\hat{S}_n^{(k)}+W_n^{(k)}\big)^T,\hat{S}_n^{(k)T}\big]^T \xrightarrow[n\to\infty]{(dist.)} \big[\big(\hat{S}_\epsilon^{(k)}+W_\epsilon^{(k)}\big)^T,\hat{S}_\epsilon^{(k)T}\big]^T = \big[S_\epsilon^{(k)T},\hat{S}_\epsilon^{(k)T}\big]^T.$$
Now, using the extended CMT ([37] Theorem 7.24), we will show that $f_{k,n}\big(S_n^{(k)},\hat{S}_n^{(k)}\big)\xrightarrow[n\to\infty]{(dist.)} f_{k,\epsilon}\big(S_\epsilon^{(k)},\hat{S}_\epsilon^{(k)}\big)$ for each $k\in\mathbb{N}$, following the same approach as in the proof of ([17] Lemma B.2). Then, since $\tilde{Z}_{k,n}\big(F^{opt}_{S_n,\hat{S}_n}\big) = \frac{1}{k}\log f_{k,n}\big(S_n^{(k)},\hat{S}_n^{(k)}\big)$ and $Z_{k,\epsilon}\big(F_{S_\epsilon,\hat{S}_\epsilon}\big) = \frac{1}{k}\log f_{k,\epsilon}\big(S_\epsilon^{(k)},\hat{S}_\epsilon^{(k)}\big)$, we conclude that $\tilde{Z}_{k,n}\big(F^{opt}_{S_n,\hat{S}_n}\big)\xrightarrow[n\to\infty]{(dist.)} Z_{k,\epsilon}\big(F_{S_\epsilon,\hat{S}_\epsilon}\big)$, where it also follows from the proof of ([17] Lemma B.2) that the convergence is uniform in $k\in\mathbb{N}$. Specifically, to prove that $f_{k,n}\big(S_n^{(k)},\hat{S}_n^{(k)}\big)\xrightarrow[n\to\infty]{(dist.)} f_{k,\epsilon}\big(S_\epsilon^{(k)},\hat{S}_\epsilon^{(k)}\big)$, we show that the following two properties hold:
P1. The distribution of $\big[S_\epsilon^{(k)T},\hat{S}_\epsilon^{(k)T}\big]^T$ is separable (as defined in ([37] Pg. 101)).
P2. For any convergent sequence $\big[s_n^{(k)T},\hat{s}_n^{(k)T}\big]^T\in\mathbb{R}^{2k}$ such that $\lim_{n\to\infty}\big(s_n^{(k)},\hat{s}_n^{(k)}\big) = \big(s_\epsilon^{(k)},\hat{s}_\epsilon^{(k)}\big)$, it holds that $\lim_{n\to\infty}f_{k,n}\big(s_n^{(k)},\hat{s}_n^{(k)}\big) = f_{k,\epsilon}\big(s_\epsilon^{(k)},\hat{s}_\epsilon^{(k)}\big)$.
To prove property P1, we show that $U^{(k)}\triangleq\big[S_\epsilon^{(k)T},\hat{S}_\epsilon^{(k)T}\big]^T$ (here we slightly abuse the dimension notation, as $U^{(k)}$ denotes a 2k-dimensional vector) is separable ([37] Pg. 101), i.e., we show that for every $\eta > 0$ there exists $\beta > 0$ such that $\Pr\big\{\|U^{(k)}\|^2 > \beta\big\} < \eta$. To that aim, recall first that by Markov's inequality ([29] Pg. 114), it follows that $\Pr\big\{\|U^{(k)}\|^2 > \beta\big\} < \frac{1}{\beta}\mathbb{E}\big\{\|U^{(k)}\|^2\big\}$. For the asynchronously sampled source process, we note that $\sigma_{S_\epsilon}^2[i]\triangleq\mathbb{E}\big\{S_\epsilon[i]^2\big\}\in\big[0,\max_{0\le t\le T_{\rm ps}}\sigma_{S_c}^2(t)\big]$. By the independence of $W_\epsilon^{(k)}$ and $\hat{S}_\epsilon^{(k)}$, and by the fact that their means are zero, we have from (A30) that $\mathbb{E}\big\{S_\epsilon[i]^2\big\} = \mathbb{E}\big\{\hat{S}_\epsilon[i]^2\big\} + \mathbb{E}\big\{W_\epsilon[i]^2\big\} \le \max_{0\le t\le T_{\rm ps}}\sigma_{S_c}^2(t)$; hence $\mathbb{E}\big\{\hat{S}_\epsilon[i]^2\big\} \le \max_{0\le t\le T_{\rm ps}}\sigma_{S_c}^2(t)$ and $\mathbb{E}\big\{W_\epsilon[i]^2\big\} \le \max_{0\le t\le T_{\rm ps}}\sigma_{S_c}^2(t)$. This further implies that $\mathbb{E}\big\{\|U^{(k)}\|^2\big\} = \mathbb{E}\big\{\big\|\big[S_\epsilon^{(k)T},\hat{S}_\epsilon^{(k)T}\big]^T\big\|^2\big\} \le 2\cdot k\cdot\max_{0\le t\le T_{\rm ps}}\sigma_{S_c}^2(t)$; therefore, for each $\beta > \frac{1}{\eta}\mathbb{E}\big\{\|U^{(k)}\|^2\big\}$ we have that $\Pr\big\{\|U^{(k)}\|^2 > \beta\big\} < \eta$, and thus $U^{(k)}$ is separable.
By the assumption in this lemma, for every $\eta > 0$ there exists $n_0(\eta) > 0$ such that for all $n > n_0(\eta)$, $w^{(k)}\in\mathbb{R}^k$ and all sufficiently large $k\in\mathbb{N}$, it holds that $\big|p_{W_n^{(k)}}\big(w^{(k)}\big) - p_{W_\epsilon^{(k)}}\big(w^{(k)}\big)\big| < \eta$. Consequently, for all $\big[s^{(k)T},\hat{s}^{(k)T}\big]^T\in\mathbb{R}^{2k}$, $n > n_0(\eta)$ and sufficiently large $k\in\mathbb{N}$, it follows from (A24) that

$$\Big|p_{S_n^{(k)}|\hat{S}_n^{(k)}}\big(s^{(k)}|\hat{s}^{(k)}\big) - p_{S_\epsilon^{(k)}|\hat{S}_\epsilon^{(k)}}\big(s^{(k)}|\hat{s}^{(k)}\big)\Big| = \Big|p_{W_n^{(k)}}\big(s^{(k)}-\hat{s}^{(k)}\big) - p_{W_\epsilon^{(k)}}\big(s^{(k)}-\hat{s}^{(k)}\big)\Big| < \eta.$$
By the continuity of $p_{S_n^{(k)}|\hat{S}_n^{(k)}}\big(s^{(k)}|\hat{s}^{(k)}\big)$ and of $p_{S_n^{(k)}}\big(s^{(k)}\big)$, $f_{k,n}\big(s^{(k)},\hat{s}^{(k)}\big)$ is also continuous ([31] Theorem 4.9); hence, when $\lim_{n\to\infty}\big(s_n^{(k)},\hat{s}_n^{(k)}\big) = \big(s^{(k)},\hat{s}^{(k)}\big)$, then $\lim_{n\to\infty}f_{k,n}\big(s_n^{(k)},\hat{s}_n^{(k)}\big) = f_{k,\epsilon}\big(s^{(k)},\hat{s}^{(k)}\big)$. This satisfies condition P2 of the extended CMT; therefore, by the extended CMT, we have that $f_{k,n}\big(S_n^{(k)},\hat{S}_n^{(k)}\big)\xrightarrow[n\to\infty]{(dist.)} f_{k,\epsilon}\big(S_\epsilon^{(k)},\hat{S}_\epsilon^{(k)}\big)$. Since the RVs $\tilde{Z}_{k,n}\big(F^{opt}_{S_n,\hat{S}_n}\big)$ and $Z_{k,\epsilon}\big(F_{S_\epsilon,\hat{S}_\epsilon}\big)$ defined in (A17) are continuous mappings of $f_{k,n}\big(S_n^{(k)},\hat{S}_n^{(k)}\big)$ and of $f_{k,\epsilon}\big(S_\epsilon^{(k)},\hat{S}_\epsilon^{(k)}\big)$, respectively, it follows from the CMT ([37] Theorem 7.7) that $\tilde{Z}_{k,n}\big(F^{opt}_{S_n,\hat{S}_n}\big)\xrightarrow[n\to\infty]{(dist.)} Z_{k,\epsilon}\big(F_{S_\epsilon,\hat{S}_\epsilon}\big)$.
Finally, to prove that the convergence $\tilde{Z}_{k,n}\big(F^{opt}_{S_n,\hat{S}_n}\big)\xrightarrow[n\to\infty]{(dist.)} Z_{k,\epsilon}\big(F_{S_\epsilon,\hat{S}_\epsilon}\big)$ is uniform in $k\in\mathbb{N}$, we note that $\hat{S}_n^{(k)}$ and $\hat{S}_\epsilon^{(k)}$ have independent entries, and that the backward channels (21) and (A30) are memoryless. Hence, it follows from the proof of ([17] Lemma B.2) that the characteristic function of the RV $k\cdot\tilde{Z}_{k,n}\big(F^{opt}_{S_n,\hat{S}_n}\big)$, denoted by $\Phi_{k\cdot\tilde{Z}_{k,n}}(\alpha)\triangleq\mathbb{E}\big\{e^{j\cdot\alpha\cdot k\cdot\tilde{Z}_{k,n}}\big\}$, converges to the characteristic function of $k\cdot Z_{k,\epsilon}\big(F_{S_\epsilon,\hat{S}_\epsilon}\big)$, denoted by $\Phi_{k\cdot Z_{k,\epsilon}}(\alpha)$, uniformly over $k\in\mathbb{N}$. Thus, for all sufficiently small $\eta > 0$ and $k_0\in\mathbb{N}$, there exists $n_0(\eta,k_0)\in\mathbb{N}$ such that for all $n > n_0(\eta,k_0)$ and $k > k_0$:

$$\big|\Phi_{k\cdot\tilde{Z}_{k,n}}(\alpha) - \Phi_{k\cdot Z_{k,\epsilon}}(\alpha)\big| < \eta, \qquad \forall\alpha\in\mathbb{R}.$$
Hence, following Lévy's convergence theorem ([38] Theorem 18.1), we conclude that $k\cdot\tilde{Z}_{k,n}\big(F^{opt}_{S_n,\hat{S}_n}\big)\xrightarrow[n\to\infty]{(dist.)} k\cdot Z_{k,\epsilon}\big(F_{S_\epsilon,\hat{S}_\epsilon}\big)$, and that this convergence is uniform for sufficiently large k. Finally, since the CDF of $k\cdot\tilde{Z}_{k,n}\big(F^{opt}_{S_n,\hat{S}_n}\big)$ (respectively, of $k\cdot Z_{k,\epsilon}\big(F_{S_\epsilon,\hat{S}_\epsilon}\big)$) evaluated at $\alpha\in\mathbb{R}$ equals the CDF of $\tilde{Z}_{k,n}\big(F^{opt}_{S_n,\hat{S}_n}\big)$ (respectively, of $Z_{k,\epsilon}\big(F_{S_\epsilon,\hat{S}_\epsilon}\big)$) evaluated at $\frac{\alpha}{k}\in\mathbb{R}$, we conclude that $\tilde{Z}_{k,n}\big(F^{opt}_{S_n,\hat{S}_n}\big)\xrightarrow[n\to\infty]{(dist.)} Z_{k,\epsilon}\big(F_{S_\epsilon,\hat{S}_\epsilon}\big)$ uniformly in $k\in\mathbb{N}$. □
The following convergence lemma corresponds to ([17] Lemma B.3):
Lemma A5.
Let $n\in\mathbb{N}$ be given. Every subsequence of $\big\{\tilde{Z}_{k,n}\big(F^{opt}_{\hat{S}_n,S_n}\big)\big\}_{k\in\mathbb{N}}$, indexed by $k_l$, converges in distribution, as $l\to\infty$, to a finite deterministic scalar.
Proof. 
Recall that the RVs $\tilde{Z}_{k,n}\big(F^{opt}_{\hat{S}_n,S_n}\big)$ represent the mutual information density rate between k samples of the source process $S_n[i]$ and the corresponding samples of its reproduction process $\hat{S}_n[i]$, where these processes are jointly distributed via the Gaussian distribution measure $F^{opt}_{\hat{S}_n,S_n}$. Further, recall that the relationship between the source signal and the reproduction process which achieves the RDF can be described via the backward channel (21) for a Gaussian source. The channel (21) is a memoryless additive WSCS Gaussian noise channel with period $p_n$; thus, by [21], it can be equivalently represented as a $p_n\times 1$ multivariate memoryless additive stationary Gaussian noise channel, which is an information stable channel ([39] Section 1.5). For such channels, in which the source and its reproduction obey the RDF-achieving joint distribution $F^{opt}_{S_n,\hat{S}_n}$, the mutual information density rate converges as k increases, almost surely, to the finite and deterministic mutual information rate ([14] Theorem 5.9.1). Since almost sure convergence implies convergence in distribution ([37] Lemma 7.21), this proves the lemma. □
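The WSCS-to-multivariate equivalence invoked in this proof can be sketched numerically: serializing a memoryless WSCS Gaussian process into non-overlapping blocks of one period yields i.i.d. multivariate Gaussian vectors, i.e., a stationary (and information stable) vector source. A minimal illustration (Python, with an arbitrary period-3 variance pattern chosen only for the example):

```python
import numpy as np

rng = np.random.default_rng(1)
p_n = 3                                    # assumed period of the DT variance
var = np.array([1.0, 0.3, 0.7])            # arbitrary per-phase variances
m = 200_000                                # number of periods
blocks = rng.normal(size=(m, p_n)) * np.sqrt(var)   # memoryless WSCS samples, one period per row
# The rows are i.i.d. p_n-variate Gaussian vectors: the blocked (polyphase)
# process is stationary, while the serialized process has periodic variance.
print(blocks.var(axis=0))                  # ~ [1.0, 0.3, 0.7] for every block index
```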

Appendix D.3. Showing that $R_\epsilon(D) = \limsup_{n\to\infty}R_n(D)$

This section completes the proof of Theorem 4. We note from (14) that the RDF of the source process $S_n[i]$ (for fixed-length coding and the MSE distortion measure) is given by:

$$R_n(D) = \inf_{F_{\hat{S}_n,S_n}:\ \bar{d}_S\big(F_{\hat{S}_n,S_n}\big)\le D}\ \text{p-}\limsup_{k\to\infty}\tilde{Z}_{k,n}\big(F_{\hat{S}_n,S_n}\big),$$

where $\bar{d}_S\big(F_{\hat{S}_n,S_n}\big) = \limsup_{k\to\infty}\frac{1}{k}\mathbb{E}\big\{\big\|S_n^{(k)}-\hat{S}_n^{(k)}\big\|^2\big\}$.
We now state the following lemma, characterizing the asymptotic statistics of the optimal reproduction process $\hat{S}_n^{(k)}$ and of the respective noise process $W_n^{(k)}$ used in the backward channel relationship (21):
Lemma A6.
Consider the RDF-achieving distribution with distortion D for the compression of the vector Gaussian source process $S_n^{(k)}$, characterized by the backward channel (21). Then, there exists a subsequence in the index $n\in\mathbb{N}$, denoted $n_1 < n_2 < \cdots$, such that for the RDF-achieving distribution, the sequences of reproduction vectors $\{\hat{S}_{n_l}^{(k)}\}_{l\in\mathbb{N}}$ and backward-channel noise vectors $\{W_{n_l}^{(k)}\}_{l\in\mathbb{N}}$ satisfy $\lim_{l\to\infty}p_{\hat{S}_{n_l}^{(k)}}\big(\hat{s}^{(k)}\big) = p_{\hat{S}_\epsilon^{(k)}}\big(\hat{s}^{(k)}\big)$ uniformly in $\hat{s}^{(k)}\in\mathbb{R}^k$ and in $k\in\mathbb{N}$, as well as $\lim_{l\to\infty}p_{W_{n_l}^{(k)}}\big(w^{(k)}\big) = p_{W_\epsilon^{(k)}}\big(w^{(k)}\big)$ uniformly in $w^{(k)}\in\mathbb{R}^k$ and in $k\in\mathbb{N}$, where $p_{\hat{S}_\epsilon^{(k)}}\big(\hat{s}^{(k)}\big)$ and $p_{W_\epsilon^{(k)}}\big(w^{(k)}\big)$ are Gaussian PDFs.
Proof. 
Recall from the analysis of the RDF for WSCS processes that for each $n\in\mathbb{N}$, the marginal distributions of the RDF-achieving reproduction process $\hat{S}_n[i]$ and of the backward-channel noise $W_n[i]$ are Gaussian, memoryless and zero-mean, with variances $\sigma_{\hat{S}_n}^2[i]\triangleq\mathbb{E}\big\{\hat{S}_n[i]^2\big\}$ and

$$\mathbb{E}\big\{W_n[i]^2\big\} = \sigma_{S_n}^2[i] - \sigma_{\hat{S}_n}^2[i], \qquad (A34)$$
respectively. Consequently, the sequences of reproduction vectors $\{\hat{S}_n^{(k)}\}_{n\in\mathbb{N}}$ and backward-channel noise vectors $\{W_n^{(k)}\}_{n\in\mathbb{N}}$ are zero-mean Gaussian with independent entries for each $k\in\mathbb{N}$. Since $\sigma_{S_n}^2[i] \le \max_{t\in\mathbb{R}}\sigma_{S_c}^2(t)$, it follows from (A34) that $\sigma_{\hat{S}_n}^2[i]$ is also bounded in the interval $\big[0,\max_{t\in\mathbb{R}}\sigma_{S_c}^2(t)\big]$ for all $n\in\mathbb{N}$. Therefore, by the Bolzano-Weierstrass theorem ([31] Theorem 2.42), $\sigma_{\hat{S}_n}^2[i]$ has a convergent subsequence; we let $n_1 < n_2 < \cdots$ denote the indexes of this convergent subsequence and denote the limit of the subsequence by $\sigma_{\hat{S}_\epsilon}^2[i]$. From the CMT, as applied in the proof of ([17] Lemma B.1), the convergence $\sigma_{\hat{S}_{n_l}}^2[i]\xrightarrow[l\to\infty]{}\sigma_{\hat{S}_\epsilon}^2[i]$ for each $i\in\mathbb{N}$ implies that the subsequence of PDFs $p_{\hat{S}_{n_l}^{(k)}}\big(\hat{s}^{(k)}\big)$ corresponding to the memoryless Gaussian random vectors $\{\hat{S}_{n_l}^{(k)}\}_{l\in\mathbb{N}}$ converges as $l\to\infty$ to a Gaussian PDF, which we denote by $p_{\hat{S}_\epsilon^{(k)}}\big(\hat{s}^{(k)}\big)$, and the convergence of $p_{\hat{S}_{n_l}^{(k)}}\big(\hat{s}^{(k)}\big)$ is uniform in $\hat{s}^{(k)}$ for any fixed $k\in\mathbb{N}$. By Remark 2, $W_n[i]$ is a memoryless stationary process with variance $\mathbb{E}\big\{W_n[i]^2\big\} = D$ and, by Equation (A34), $\sigma_{\hat{S}_n}^2[i] = \sigma_{S_n}^2[i] - D$. Hence, by Assumption A1 and by the proof of ([17] Lemma B.1), it follows that for fixed $\eta > 0$ and $k_0\in\mathbb{N}$, there exists $n_0(\eta,k_0)$ such that for all $n > n_0(\eta,k_0)$ and all sufficiently large k, it holds that $\big|p_{\hat{S}_{n_l}^{(k)}}\big(\hat{s}^{(k)}\big) - p_{\hat{S}_\epsilon^{(k)}}\big(\hat{s}^{(k)}\big)\big| < \eta$ for every $\hat{s}^{(k)}\in\mathbb{R}^k$. Since $n_0(\eta,k_0)$ does not depend on k (only on the fixed $k_0$), this implies that the convergence is uniform with respect to $k\in\mathbb{N}$.
The fact that $W_n[i]$ is a zero-mean stationary Gaussian process with variance D for each $n\in\mathbb{N}$ implies that the sequence of PDFs $p_{W_n^{(k)}}\big(w^{(k)}\big)$ converges as $n\to\infty$ to a Gaussian PDF, which we denote by $p_{W_\epsilon^{(k)}}\big(w^{(k)}\big)$; hence its subsequence with indexes $n_1 < n_2 < \cdots$ also converges to $p_{W_\epsilon^{(k)}}\big(w^{(k)}\big)$. Since $D > \frac{1}{2\pi}$ by Assumption A1, combined with the proof of ([17] Lemma B.1), it follows that this convergence is uniform in $w^{(k)}$ and in $k\in\mathbb{N}$.
Following the proof of Corollary A1, it holds that the subsequences of the memoryless Gaussian random vectors $\hat{S}_{n_l}^{(k)}$ and $W_{n_l}^{(k)}$ converge in distribution as $l\to\infty$ to Gaussian limits, and the convergence is uniform in $k\in\mathbb{N}$. Hence, as shown in Lemma A4, the joint distribution satisfies $\big[S_{n_l}^{(k)T},\hat{S}_{n_l}^{(k)T}\big]^T\xrightarrow[l\to\infty]{(dist.)}\big[S_\epsilon^{(k)T},\hat{S}_\epsilon^{(k)T}\big]^T$, and the limit distribution is jointly Gaussian. □
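As a numerical sanity check of the backward-channel representation used in Lemma A6 (a sketch under the stated Gaussian assumptions, with arbitrary example values for the variance and distortion), drawing $\hat{S}_n[i]\sim\mathcal{N}(0,\sigma^2-D)$ independently of $W_n[i]\sim\mathcal{N}(0,D)$ and setting $S_n[i]=\hat{S}_n[i]+W_n[i]$ reproduces a source of variance $\sigma^2$, an MSE of exactly D, and a per-sample mutual information of $\frac{1}{2}\log_2(\sigma^2/D)$:

```python
import numpy as np

rng = np.random.default_rng(2)
var_s, D, m = 1.0, 0.18, 1_000_000                    # example variance and distortion
s_hat = rng.normal(scale=np.sqrt(var_s - D), size=m)  # reproduction, variance sigma^2 - D
w = rng.normal(scale=np.sqrt(D), size=m)              # backward-channel noise, variance D
s = s_hat + w                                         # source samples, variance sigma^2
print(s.var(), np.mean((s - s_hat) ** 2))             # ~ 1.0 and ~ 0.18
rho2 = np.corrcoef(s, s_hat)[0, 1] ** 2               # squared correlation coefficient
print(-0.5 * np.log2(1 - rho2), 0.5 * np.log2(var_s / D))  # empirical vs analytic I(S; S_hat)
```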
Lemma A7.
The RDF of $\{S_\epsilon[i]\}$ satisfies $R_\epsilon(D) \le \limsup_{n\to\infty}R_n(D)$, and the rate $\limsup_{n\to\infty}R_n(D)$ is achievable for the source $\{S_\epsilon[i]\}$ with distortion D when the reproduction process obeys a Gaussian distribution.
Proof. 
According to Lemma A6, the sequence of joint distributions $\{F^{opt}_{S_n,\hat{S}_n}\}_{n\in\mathbb{N}}$ has a convergent subsequence, i.e., there exists a set of indexes $n_1 < n_2 < \cdots$ such that the sequence of distributions with independent entries $\{F^{opt}_{S_{n_l},\hat{S}_{n_l}}\}_{l\in\mathbb{N}}$ converges as $l\to\infty$ to a joint Gaussian distribution $F_{S_\epsilon,\hat{S}_\epsilon}$, and the convergence is uniform in $k\in\mathbb{N}$. Hence, the condition of Lemma A4 is satisfied, implying that $\tilde{Z}_{k,n_l}\big(F^{opt}_{S_{n_l},\hat{S}_{n_l}}\big)\xrightarrow[l\to\infty]{(dist.)} Z_{k,\epsilon}\big(F_{S_\epsilon,\hat{S}_\epsilon}\big)$ uniformly in $k\in\mathbb{N}$. Moreover, by Lemma A5, every subsequence of $\big\{\tilde{Z}_{k,n_l}\big(F^{opt}_{S_{n_l},\hat{S}_{n_l}}\big)\big\}_{l\in\mathbb{N}}$ converges in distribution to a finite deterministic scalar as $k\to\infty$. Therefore, by Theorem 3, it holds that

$$\lim_{l\to\infty}\ \text{p-}\limsup_{k\to\infty}\tilde{Z}_{k,n_l}\big(F^{opt}_{S_{n_l},\hat{S}_{n_l}}\big) = \text{p-}\limsup_{k\to\infty}Z_{k,\epsilon}\big(F_{S_\epsilon,\hat{S}_\epsilon}\big) \ge \inf_{F_{S_\epsilon,\hat{S}_\epsilon}}\ \text{p-}\limsup_{k\to\infty}Z_{k,\epsilon}\big(F_{S_\epsilon,\hat{S}_\epsilon}\big) = R_\epsilon(D). \qquad (A35)$$
From (14) we have that $R_n(D) = \text{p-}\limsup_{k\to\infty}\tilde{Z}_{k,n}\big(F^{opt}_{S_n,\hat{S}_n}\big)$; then, from (A35), it follows that

$$R_\epsilon(D) \le \lim_{l\to\infty}R_{n_l}(D) \stackrel{(a)}{\le} \limsup_{n\to\infty}R_n(D),$$
where (a) follows since, by ([31] Definition 3.16), the limit of every subsequence is not greater than the limit superior. Noting that $F_{S_\epsilon,\hat{S}_\epsilon}$ is Gaussian by Lemma A6 concludes the proof. □
Lemma A8.
The RDF of $\{S_\epsilon[i]\}$ satisfies $R_\epsilon(D) \ge \limsup_{n\to\infty}R_n(D)$.
Proof. 
To prove this lemma, we first show that for a joint distribution $F_{S_\epsilon,\hat{S}_\epsilon}$ which achieves a rate-distortion pair $(R_\epsilon, D)$, it holds that $R_\epsilon \ge \limsup_{k\to\infty}\mathbb{E}\big\{Z_{k,\epsilon}\big(F_{S_\epsilon,\hat{S}_\epsilon}\big)\big\}$: Recall that $(R_\epsilon, D)$ is an achievable rate-distortion pair for the source $\{S_\epsilon[i]\}$, namely, there exists a sequence of codes $\{\mathcal{C}_l\}$ whose rate-distortion pairs approach $(R_\epsilon, D)$ when applied to $\{S_\epsilon[i]\}$. This implies that for any $\eta > 0$ there exists $l_0(\eta)$ such that for all $l > l_0(\eta)$, $\mathcal{C}_l$ has a code rate $R_l = \frac{1}{l}\log_2 M_l$ satisfying $R_l \le R_\epsilon + \eta$ by (3). Recalling Definition 5, the source code maps $S_\epsilon^{(l)}$ into a discrete index $J_l\in\{1,2,\ldots,M_l\}$, which is in turn mapped into $\hat{S}_\epsilon^{(l)}$, i.e., $S_\epsilon^{(l)}\to J_l\to\hat{S}_\epsilon^{(l)}$ form a Markov chain. Since $J_l$ is a discrete random variable taking values in $\{1,2,\ldots,M_l\}$, it holds that

$$\log_2 M_l \ge H(J_l) \stackrel{(a)}{\ge} I\big(S_\epsilon^{(l)};J_l\big) \stackrel{(b)}{\ge} I\big(S_\epsilon^{(l)};\hat{S}_\epsilon^{(l)}\big), \qquad (A37)$$
where (a) follows since $I\big(S_\epsilon^{(l)};J_l\big) = H(J_l) - H\big(J_l|S_\epsilon^{(l)}\big)$, which is not larger than $H(J_l)$ as $J_l$ takes discrete values, while (b) follows from the data processing inequality ([5] Chapter 2.8). Now, (A37) implies that for each $l > l_0(\eta)$, the reproduction obtained using the code $\mathcal{C}_l$ satisfies $\frac{1}{l}I\big(S_\epsilon^{(l)};\hat{S}_\epsilon^{(l)}\big) \le \frac{1}{l}\log_2 M_l \le R_\epsilon + \eta$. Since for every arbitrarily small $\eta \ge 0$ this inequality holds for all $l > l_0(\eta)$, i.e., for all sufficiently large l, it follows that $R_\epsilon \ge \limsup_{l\to\infty}\frac{1}{l}I\big(S_\epsilon^{(l)};\hat{S}_\epsilon^{(l)}\big)$. Hence, replacing the blocklength symbol l with k, as $\frac{1}{k}I\big(S_\epsilon^{(k)};\hat{S}_\epsilon^{(k)}\big) = \mathbb{E}\big\{Z_{k,\epsilon}\big(F_{S_\epsilon,\hat{S}_\epsilon}\big)\big\}$ ([5] Equation (2.3)), we conclude that
$$R_\epsilon(D) \ge \limsup_{k\to\infty}\mathbb{E}\big\{Z_{k,\epsilon}\big(F_{S_\epsilon,\hat{S}_\epsilon}\big)\big\}. \qquad (A38)$$
Next, we consider $\limsup_{k\to\infty}\mathbb{E}\big\{Z_{k,\epsilon}\big(F_{S_\epsilon,\hat{S}_\epsilon}\big)\big\}$: Let $\big\{\mathbb{E}\big\{Z_{k_l,\epsilon}\big(F_{S_\epsilon,\hat{S}_\epsilon}\big)\big\}\big\}_{l\in\mathbb{N}}$ be a subsequence of $\big\{\mathbb{E}\big\{Z_{k,\epsilon}\big(F_{S_\epsilon,\hat{S}_\epsilon}\big)\big\}\big\}_{k\in\mathbb{N}}$ with indexes $k_1 < k_2 < \cdots$ such that its limit equals the limit superior, i.e., $\lim_{l\to\infty}\mathbb{E}\big\{Z_{k_l,\epsilon}\big(F_{S_\epsilon,\hat{S}_\epsilon}\big)\big\} = \limsup_{k\to\infty}\mathbb{E}\big\{Z_{k,\epsilon}\big(F_{S_\epsilon,\hat{S}_\epsilon}\big)\big\}$. Since, by Lemma A4, the sequence of non-negative RVs $\big\{\tilde{Z}_{k_l,n}\big(F^{opt}_{S_n,\hat{S}_n}\big)\big\}_{n\in\mathbb{N}}$ converges in distribution to $Z_{k_l,\epsilon}\big(F_{S_\epsilon,\hat{S}_\epsilon}\big)$ as $n\to\infty$, uniformly in $k\in\mathbb{N}$, it follows from ([40] Theorem 3.5) that $\mathbb{E}\big\{Z_{k_l,\epsilon}\big(F_{S_\epsilon,\hat{S}_\epsilon}\big)\big\} = \lim_{n\to\infty}\mathbb{E}\big\{\tilde{Z}_{k_l,n}\big(F^{opt}_{S_n,\hat{S}_n}\big)\big\}$. Moreover, we define the family of distributions $\mathcal{F}(D) = \big\{F_{S,\hat{S}}:\ \bar{d}_S\big(F_{S,\hat{S}}\big)\le D\big\}$. Consequently, Equation (A38) can now be written as:
$$R_\epsilon(D) \ge \limsup_{k\to\infty}\mathbb{E}\big\{Z_{k,\epsilon}\big(F_{S_\epsilon,\hat{S}_\epsilon}\big)\big\} = \lim_{l\to\infty}\lim_{n\to\infty}\mathbb{E}\big\{\tilde{Z}_{k_l,n}\big(F^{opt}_{S_n,\hat{S}_n}\big)\big\} \stackrel{(a)}{=} \lim_{n\to\infty}\lim_{l\to\infty}\mathbb{E}\big\{\tilde{Z}_{k_l,n}\big(F^{opt}_{S_n,\hat{S}_n}\big)\big\} \stackrel{(b)}{=} \limsup_{n\to\infty}\lim_{l\to\infty}\mathbb{E}\big\{\tilde{Z}_{k_l,n}\big(F^{opt}_{S_n,\hat{S}_n}\big)\big\} \ge \limsup_{n\to\infty}\lim_{l\to\infty}\inf_{F_{S,\hat{S}}\in\mathcal{F}(D)}\mathbb{E}\big\{\tilde{Z}_{k_l,n}\big(F_{S,\hat{S}}\big)\big\} \stackrel{(c)}{=} \limsup_{n\to\infty}\lim_{l\to\infty}\inf_{F_{S,\hat{S}}\in\mathcal{F}(D)}\frac{1}{k_l}I\big(\hat{S}_n^{(k_l)};S_n^{(k_l)}\big), \qquad (A39)$$
where (a) follows since the convergence $\tilde{Z}_{k_l,n}\big(F^{opt}_{S_n,\hat{S}_n}\big)\xrightarrow[n\to\infty]{(dist.)} Z_{k_l,\epsilon}\big(F_{S_\epsilon,\hat{S}_\epsilon}\big)$ is uniform with respect to $k_l$, thus the limits are interchangeable ([31] Theorem 7.11); (b) follows since the limit of the subsequence $\mathbb{E}\big\{\tilde{Z}_{k_l,n}\big(F^{opt}_{S_n,\hat{S}_n}\big)\big\}$ exists in the index n and is therefore equal to the limit superior $\limsup_{n\to\infty}\mathbb{E}\big\{\tilde{Z}_{k_l,n}\big(F^{opt}_{S_n,\hat{S}_n}\big)\big\}$ ([31] Page 57); and (c) holds since mutual information is the expected value of the mutual information density rate ([5] Equation (2.30)). Finally, we recall that in the proof of Lemma A5 it was established that the backward channel for the RDF at the distortion constraint D, defined in (21), is information stable; hence, for such backward channels, we have from ([41] Theorem 1) that the minimum rate is given by $R_n(D) = \lim_{k\to\infty}\inf_{F_{S,\hat{S}}\in\mathcal{F}(D)}\frac{1}{k}I\big(\hat{S}_n^{(k)};S_n^{(k)}\big)$ and the limit exists; hence, $\lim_{k\to\infty}\inf_{F_{S,\hat{S}}\in\mathcal{F}(D)}\frac{1}{k}I\big(\hat{S}_n^{(k)};S_n^{(k)}\big) = \lim_{l\to\infty}\inf_{F_{S,\hat{S}}\in\mathcal{F}(D)}\frac{1}{k_l}I\big(\hat{S}_n^{(k_l)};S_n^{(k_l)}\big)$ in the index k. Substituting this into Equation (A39) yields the result:
$$R_\epsilon(D) \ge \limsup_{n\to\infty}R_n(D).$$
This proves the lemma. □
Combining Lemmas A7 and A8 proves that $R_\epsilon(D) = \limsup_{n\to\infty}R_n(D)$, and that this rate is achievable with Gaussian inputs, completing the proof of the theorem.

References

  1. Gardner, W.; Brown, W.; Chen, C.K. Spectral correlation of modulated signals: Part II—Digital modulation. IEEE Trans. Commun. 1987, 35, 595–601.
  2. Giannakis, G.B. Cyclostationary signal analysis. In Digital Signal Processing Handbook; CRC Press: Boca Raton, FL, USA, 1998; pp. 17–21.
  3. Gardner, W.A.; Napolitano, A.; Paura, L. Cyclostationarity: Half a century of research. Signal Process. 2006, 86, 639–697.
  4. Berger, T.; Gibson, J.D. Lossy source coding. IEEE Trans. Inf. Theory 1998, 44, 2693–2723.
  5. Cover, T.M.; Thomas, J.A. Elements of Information Theory; John Wiley & Sons: New York, NY, USA, 2006.
  6. Wolf, J.K.; Wyner, A.D.; Ziv, J. Source coding for multiple descriptions. Bell Syst. Tech. J. 1980, 59, 1417–1426.
  7. Wyner, A.D.; Ziv, J. The rate-distortion function for source coding with side information at the decoder. IEEE Trans. Inf. Theory 1976, 22, 1–10.
  8. Oohama, Y. Gaussian multiterminal source coding. IEEE Trans. Inf. Theory 1997, 43, 1912–1923.
  9. Pandya, A.; Kansal, A.; Pottie, G.; Srivastava, M. Lossy source coding of multiple Gaussian sources: m-helper problem. In Proceedings of the IEEE Information Theory Workshop, San Antonio, TX, USA, 24–29 October 2004; pp. 34–38.
  10. Gallager, R.G. Information Theory and Reliable Communication; Springer: Berlin, Germany, 1968; Volume 588.
  11. Harrison, M.T. The generalized asymptotic equipartition property: Necessary and sufficient conditions. IEEE Trans. Inf. Theory 2008, 54, 3211–3216.
  12. Kipnis, A.; Goldsmith, A.J.; Eldar, Y.C. The distortion rate function of cyclostationary Gaussian processes. IEEE Trans. Inf. Theory 2018, 64, 3810–3824.
  13. Napolitano, A. Cyclostationarity: New trends and applications. Signal Process. 2016, 120, 385–408.
  14. Han, T.S. Information-Spectrum Methods in Information Theory; Springer: Berlin, Germany, 2003; Volume 50.
  15. Verdú, S.; Han, T.S. A general formula for channel capacity. IEEE Trans. Inf. Theory 1994, 40, 1147–1157.
  16. Zeng, W.; Mitran, P.; Kavcic, A. On the information stability of channels with timing errors. In Proceedings of the IEEE International Symposium on Information Theory (ISIT), Seattle, WA, USA, 9–14 July 2006; pp. 1885–1889.
  17. Shlezinger, N.; Abakasanga, E.; Dabora, R.; Eldar, Y.C. The capacity of memoryless channels with sampled cyclostationary Gaussian noise. IEEE Trans. Commun. 2020, 68, 106–121.
  18. Shannon, C.E. Communication in the presence of noise. Proc. IEEE 1998, 86, 447–457.
  19. Cherif, F. A various types of almost periodic functions on Banach spaces: Part I. Int. Math. Forum 2011, 6, 921–952.
  20. Shlezinger, N.; Dabora, R. On the capacity of narrowband PLC channels. IEEE Trans. Commun. 2015, 63, 1191–1201.
  21. Shlezinger, N.; Dabora, R. The capacity of discrete-time Gaussian MIMO channels with periodic characteristics. In Proceedings of the IEEE International Symposium on Information Theory (ISIT), Barcelona, Spain, 10–16 July 2016; pp. 1058–1062.
  22. Shlezinger, N.; Zahavi, D.; Murin, Y.; Dabora, R. The secrecy capacity of Gaussian MIMO channels with finite memory. IEEE Trans. Inf. Theory 2017, 63, 1874–1897.
  23. Heath, R.W.; Giannakis, G.B. Exploiting input cyclostationarity for blind channel identification in OFDM systems. IEEE Trans. Signal Process. 1999, 47, 848–856.
  24. Shaked, R.; Shlezinger, N.; Dabora, R. Joint estimation of carrier frequency offset and channel impulse response for linear periodic channels. IEEE Trans. Commun. 2017, 66, 302–319.
  25. Shlezinger, N.; Dabora, R. Frequency-shift filtering for OFDM signal recovery in narrowband power line communications. IEEE Trans. Commun. 2014, 62, 1283–1295.
  26. El Gamal, A.; Kim, Y.H. Network Information Theory; Cambridge University Press: Cambridge, UK, 2011.
  27. Wu, X.; Xie, L.L. On the optimal compressions in the compress-and-forward relay schemes. IEEE Trans. Inf. Theory 2013, 59, 2613–2628.
  28. Zitkovic, G. Lecture Notes on the Theory of Probability, Parts I and II. Available online: https://web.ma.utexas.edu/users/gordanz/lecture_notes_page.html (accessed on 12 March 2020).
  29. Papoulis, A. Probability, Random Variables, and Stochastic Processes; McGraw-Hill: New York, NY, USA, 2002.
  30. Zamir, R.; Kochman, Y.; Erez, U. Achieving the Gaussian rate-distortion function by prediction. IEEE Trans. Inf. Theory 2008, 54, 3354–3364.
  31. Rudin, W. Principles of Mathematical Analysis; International Series in Pure and Applied Mathematics; McGraw-Hill: New York, NY, USA, 1976.
  32. Dixmier, J. General Topology; Springer: New York, NY, USA, 1984.
  33. Stein, E.M.; Shakarchi, R. Real Analysis: Measure Theory, Integration, and Hilbert Spaces; Princeton University Press: Princeton, NJ, USA, 2009.
  34. Kolmogorov, A. On the Shannon theory of information transmission in the case of continuous signals. IRE Trans. Inf. Theory 1956, 2, 102–108.
  35. Scheffé, H. A useful convergence theorem for probability distributions. Ann. Math. Stat. 1947, 18, 434–438.
  36. Bromiley, P. Products and convolutions of Gaussian probability density functions. Tina-Vision Memo 2003, 3, 1.
  37. Kosorok, M.R. Introduction to Empirical Processes and Semiparametric Inference; Springer: New York, NY, USA, 2008.
  38. Williams, D. Probability with Martingales; Cambridge University Press: Cambridge, UK, 1991.
  39. Dobrushin, R.L. A general formulation of the fundamental theorem of Shannon in the theory of information. Uspekhi Matematicheskikh Nauk 1959, 14, 3–104.
  40. Billingsley, P. Convergence of Probability Measures; John Wiley & Sons: New York, NY, USA, 2013.
  41. Venkataramanan, R.; Pradhan, S.S. Source coding with feed-forward: Rate-distortion theorems and error exponents for a general source. IEEE Trans. Inf. Theory 2007, 53, 2154–2179.
Figure 1. Source coding block diagram.
Figure 2. $R_n(D)$ versus n; offset $\phi = 0$.
Figure 3. $R_n(D)$ versus n; offset $\phi = \frac{1}{16}$.
Figure 4. $R_n(D)$ versus $\frac{T_{\rm ps}}{T_s}$; offset $\phi = 0$.
Figure 5. $R_n(D)$ versus $\frac{T_{\rm ps}}{T_s}$; offset $\phi = \frac{1}{16}$.
Figure 6. $R_n(D)$ versus D; offset $\phi = 0$.
Figure 7. $R_n(D)$ versus D; offset $\phi = \frac{1}{16}$.
Figure 8. $R_n(D)$ versus $\phi$ at $t_{\rm dc} = 75\%$.
