Next Article in Journal
Clustering Financial Return Distributions Using the Fisher Information Metric
Next Article in Special Issue
Amplitude Constrained MIMO Channels: Properties of Optimal Input Distributions and Bounds on the Capacity
Previous Article in Journal
Doubly Nonnegative and Semidefinite Relaxations for the Densest k-Subgraph Problem
Previous Article in Special Issue
Gaussian Multiple Access Channels with One-Bit Quantizer at the Receiver ,
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Quasi-Concavity for Gaussian Multicast Relay Channels

1
Independent Researcher, Amalienstr. 49A, 80799 Munich, Germany
2
Institute for Communications Engineering, Technical University of Munich, 80333 Munich, Germany
*
Author to whom correspondence should be addressed.
Submission received: 6 January 2019 / Revised: 21 January 2019 / Accepted: 22 January 2019 / Published: 24 January 2019
(This article belongs to the Special Issue Information Theory for Data Communications and Processing)

Abstract

:
Standard upper and lower bounds on the capacity of relay channels are cut-set (CS), decode-forward (DF), and quantize-forward (QF) rates. For real additive white Gaussian noise (AWGN) multicast relay channels with one source node and one relay node, these bounds are shown to be quasi-concave in the receiver signal-to-noise ratios and the squared source-relay correlation coefficient. Furthermore, the CS rates are shown to be quasi-concave in the relay position for a fixed correlation coefficient, and the DF rates are shown to be quasi-concave in the relay position. The latter property characterizes the optimal relay position when using DF. The results extend to complex AWGN channels with random phase variations.

1. Introduction

A multicast relay channel (MRC) is an information network with a source node, a relay node, and two or more destination nodes, and where one message originating at the source should be received reliably at the destinations. We consider additive white Gaussian noise (AWGN) MRCs and show that certain information rate expressions are quasi-concave in the receiver signal-to-noise ratios (SNRs), the squared source-relay correlation coefficient, and the relay position. In particular, we study cut-set (CS), decode-forward (DF), and quantize-forward (QF) rates. Quasi-concavity suggests that efficient algorithms can optimize signaling and the relay position. However, the main motivation of this work is not practicality, but simply to provide better understanding of the problem.
Relay positioning has been studied by many authors, with a focus on rate enhancement (e.g., [1,2]), range extension (e.g., [3,4]), and outage probability (e.g., [1,5,6]). We study the problem of placing a relay to maximize the multicast rate by extending results of [7,8,9,10]. A preliminary version of this paper without proofs appeared in [11]. Our focus is on real alphabet channels. However, our main results also apply to complex alphabet channels if there are random phase variations so that beamforming is not useful.
This paper is organized as follows. Section 2 presents the MRC model and reviews the CS, DF, and QF rates. Section 3 develops quasi-concavity results in the squared source-relay correlation coefficient ρ 2 and the channel SNRs. Section 4 introduces a distance dependence for the channel gains and shows that the CS rate is quasi-concave in the relay position when ρ is fixed. We further show that the DF rate is quasi-concave in the relay position. Section 5 illustrates quasi-concavity for one-, two-, and three-dimensional networks, and compares the performance of two DF strategies. Section 6 discusses complex AWGN channels and a sum (source plus relay) power constraint. Section 7 concludes the paper. Appendix A and Appendix B review useful results on concavity and quasi-concavity, and prove a few new results.

2. Model and Information Rates

2.1. Model

An MRC has three types of nodes:
  • a source node s that generates a message W and transmits the symbols X s n = X s , 1 , X s , 2 , , X s , n ;
  • a relay node r that receives and forwards symbols Y r , k and X r , k , respectively, for k = 1 , 2 , , n ;
  • destination nodes j = 1 , 2 , , N where node j receives Y j n = Y j , 1 , Y j , 2 , , Y j , n and estimates W as W ^ j .
We denote the destination node set as T = { 1 , 2 , , N } . The classic relay channel has N = 1 and Figure 1 shows an MRC with N = 2 .
A memoryless MRC has a function h ( · ) and a noise random variable Z so that for every time instant the N + 1 channel outputs Y = ( Y r Y 1 Y N ) are given by
Y = h ( X s , X r , Z ) .
The noise Z is statistically independent of X s and X r , and the noise variables at different times are statistically independent.
An encoding strategy for M messages has
  • W uniformly distributed over { 1 , 2 , , M } ;
  • an encoding function e s ( · ) such that X s n = e s ( W ) ;
  • relay functions e r , k ( · ) with X r , k = e r , k ( Y r , 1 , , Y r , k 1 ) , where k = 1 , 2 , , n ;
  • decoding functions d j ( · ) such that d j ( Y j n ) = W ^ j , j T .
The error probability at destination j is P e , j = Pr W ^ j W . The multicast rate is R = ( log 2 M ) / n bits/use. The rate R is achievable if, for any ϵ > 0 and sufficiently large n, there is an encoding strategy with P e , j ϵ for all j T . The capacity C is the supremum of the achievable rates.

2.2. Information Rates

The following bounds were given in [12] for the relay channel ( N = 1 ). Their extensions to MRCs are straightforward.
  • CS Rate: C R C S where
    R C S = max min 1 j N min I ( X s X r ; Y j ) , I ( X s ; Y r Y j | X r )
    and where the maximization is over all X s X r .
  • Direct-Transmission (DT) Rate: C R D T where
    R D T = max { min 1 j N I ( X s ; Y j | X r = x ) }
    and where the maximization is over all x and X s .
  • DF Rate: C R D F where
    R D F = max min 1 j N min I ( X s X r ; Y j ) , I ( X s ; Y r | X r )
    and where the maximization is over all X s X r .
  • QF Rate: C R Q F where
    R Q F = max min 1 j N min I ( X s X r ; Y j ) I ( Y r ; Y ^ r | X s X r Y j ) , I ( X s ; Y ^ r Y j | X r )
    where Y ^ r is an auxiliary random variable, and where the maximization is over all X s X r Y ^ r such that X s and X r are independent and X s X r Y r Y ^ r forms a Markov chain.

2.3. Real Alphabet AWGN MRC

The real alphabet AWGN MRC has real channel symbols and
Y r = a s , r X s + Z r
Y j = a s , j X s + a r , j X r + Z j
where j T . The a s , r , a s , j , and a r , j are channel gains between the nodes (see Figure 2). We later relate these gains to distances between the nodes. The Z r and Z j , j = 1 , 2 , , N , are independent and identically distributed Gaussian random variables with zero mean and unit variance. We may alternatively write (5) and (6) in vector form as
Y j = A j X + Z j
where X = ( X s X r ) T , Y j = ( Y r Y j ) T , Z = ( Z r Z j ) T , and
A j = a s , r 0 a s , j a r , j .
We consider individual average block power constraints
E k = 1 n X s , k 2 n P s , E k = 1 n X r , k 2 n P r .
The SNR and the capacity of the link from node u (with transmit power P u ) to node v are the respective
SNR u , v = a u , v 2 P u
C ( SNR u , v ) = 1 2 log 1 + SNR u , v .
We simplify the above rate bounds for the AWGN MRC.
  • CS Rate:
    R C S = max ρ [ min 1 j N min C SNR s , j + SNR r , j + 2 ρ SNR s , j SNR r , j , C ( 1 ρ 2 ) ( SNR s , j + SNR s , r ) ) ]
    where the correlation coefficient ρ satisfies | ρ | 1 . One can restrict attention to non-negative ρ .
  • DT Rate:
    R D T = min 1 j N C ( SNR s , j ) .
  • DF Rate:
    R D F = max ρ min 1 j N min C ( SNR s , j + SNR r , j + 2 ρ SNR s , j SNR r , j ) , C ( ( 1 ρ 2 ) SNR s , r ) .
    One can again restrict attention to non-negative ρ .
  • QF Rate: Optimizing X s X r Y ^ r seems difficult. Instead, we choose X s and X r to be zero-mean Gaussian with variances P s and P r , respectively. We further choose Y ^ r = Y r + Z r where Z r is zero-mean Gaussian with variance N r . Optimizing N r gives (see [13], pp. 336–337)
    R ˜ Q F = min 1 j N C SNR s , j + SNR r , j SNR s , r SNR s , j + SNR r , j + SNR s , r + 1 .

3. Quasi-Concavity in SNRs and ρ 2

3.1. CS Rate

We consider two characterizations of R C S . First, let a j T = ( a s , j a r , j ) be the second row of A j , let Q X be the covariance matrix of X (see Appendix A), and let det M be the determinant of the square matrix M . The CS rate (12) can be expressed as the maximum of
R C S ( Q X ) = min 1 j N min 1 2 log a j T Q X a j + 1 , 1 2 log det Q ( Y j T X r ) T P r
over the convex set of Q X with diagonal entries P s and P r . The first logarithm in (16) is clearly concave in Q X . The second logarithm is concave in Q ( Y j T X r ) T (see Appendix A) and Q ( Y j T X r ) T is linear in Q X . To prove the latter claim, observe that
Q ( Y j T X r ) T = A ˜ j Q X A ˜ j T + I 2 0 0 0
where A ˜ j T = A j T 0 1 T and I 2 is the 2 × 2 identity matrix. Hence R C S ( Q X ) is concave in (the convex set of) Q X because it is the minimum of 2 N concave functions.
Suppose next that we wish to consider ρ and the SNRs individually rather than via Q X . Define the vector
S = ( SNR s , r , SNR s , 1 , , SNR s , N , SNR r , 1 , , SNR r , N )
and the functions
f j ( ρ , S ) = SNR s , j + SNR r , j + 2 ρ SNR s , j SNR r , j
g j ( ρ , S ) = ( 1 ρ 2 ) SNR s , j + SNR s , r
R C S ( ρ , S ) = min 1 j N min C ( f j ( ρ , S ) ) , C ( g j ( ρ , S ) ) .
We establish the following results. We restrict attention to 0 ρ 1 and positive S .
Lemma 1.
f j ( ρ , S ) and g j ( ρ , S ) are concave in ρ, concave in S , and quasi-concave in ( ρ 2 , S ) .
Proof. 
Concavity with respect to ρ is established by observing that f j ( ρ , S ) is linear in ρ , and g j ( ρ , S ) is linear in ρ 2 which is concave in ρ .
Consider next concavity with respect to S . The Hessian of f j ( ρ , S ) with respect to S has only one non-zero eigenvalue
ρ 2 · SNR s , j 2 + SNR r , j 2 SNR s , j 3 / 2 SNR r , j 3 / 2 .
Thus, f j ( ρ , S ) is concave in S for non-negative ρ and positive S . The function g j ( ρ , S ) is linear in S , and thus concave in S .
Now consider quasi-concavity with respect to ( ρ 2 , S ) . Substituting a = SNR s , j , b = SNR r , j , c = ρ 2 into the fifth function of Lemma A6 in Appendix B, we find that f j ( ρ , S ) is quasi-concave in ( ρ 2 , S ) . For the g j ( ρ , S ) , observe that a b is quasi-concave for non-negative ( a , b ) , see the first function of Lemma A6. This implies (see (A7))
( λ a 1 + λ ¯ a 2 ) ( λ b 1 + λ ¯ b 2 ) min a 1 b 1 , a 2 b 2
for 0 λ 1 , and where λ ¯ = 1 λ . Substituting a i = 1 ρ i 2 and b i = SNR s , j , i + SNR s , r , i for i = 1 , 2 , we find that g j ( ρ , S ) is quasi-concave in ( ρ 2 , S ) . ☐
Theorem 1.
R C S ( ρ , S ) is concave in ρ, concave in S , and quasi-concave in ( ρ 2 , S ) .
Proof. 
R C S ( ρ , S ) involves taking logarithms and minima of (quasi-) concave functions. The results thus follow by applying Lemma 1 above and Lemma A5, Parts 2 and 3, in Appendix B. ☐
Corollary 1.
Consider S as a function of P = ( P s , P r ) . Then R C S ( ρ , S ( P ) ) is quasi-concave in ( ρ 2 , P ) .
Proof. 
The proof follows from the proof of Theorem 1 and because S is a linear function of P . ☐
To illustrate the quasi-concavity, consider one relay and the channel gains a s , r = 5 / 2 , a s , 1 = 1 , and a r , 1 = 5 / 3 . This scenario corresponds to the geometry in Section 5.1 with r = 0.4 . Figure 3 shows a contour plot of R C S ( ρ , S ( P ) ) when P s = 1 . Observe that the contour lines form convex regions, as predicted by Corollary 1.

3.2. DF Rate

Consider the functions
g j ( ρ , S ) = ( 1 ρ 2 ) SNR s , r
R D F ( ρ , S ) = min 1 j N min C ( f j ( ρ , S ) ) , C ( g j ( ρ , S ) ) .
As above, we restrict attention to 0 ρ 1 and positive S .
Theorem 2.
R D F ( ρ , S ) is concave in ρ, concave in S , and quasi-concave in ( ρ 2 , S ) .
Proof. 
The proof is similar to that of Theorem 1. ☐
Corollary 2.
R D F ( ρ , S ( P ) ) is quasi-concave in ( ρ 2 , P ) .
Proof. 
See the proof of Corollary 1. ☐

3.3. DT Rate

The DT rate (13) is clearly concave in S and P .

3.4. QF Rate

Consider the functions
h j ( S ) = SNR s , j + SNR r , j SNR s , r SNR s , j + SNR r , j + SNR s , r + 1
R ˜ Q F ( S ) = min 1 j N C ( h j ( S ) ) .
We establish the following results. We restrict attention to non-negative S .
Lemma 2.
h j ( S ) is quasi-concave in ( SNR r , j , SNR s , r ) .
Proof. 
Substitute a = SNR r , j , b = SNR s , r , k = SNR s , j + 1 into the second function of Lemma A6 in Appendix B, and apply Lemma A5, Part 1. ☐
Theorem 3.
R ˜ Q F ( S ) is quasi-concave in S if the SNR s , j , j = 1 , 2 , , n , are held fixed.
Proof. 
Apply Lemma 2 above and Lemma A5, Parts 2 and 3, in Appendix B. ☐

4. Quasi-Concavity in Relay Position

Suppose the channel gain for the node pair ( i , j ) is
a i , j = ξ i , j D i , j α / 2
where ξ i , j is a “fading” gain, D i , j = i j is the Euclidean distance between the positions i and j of nodes i and j, respectively, and α 2 is a path-loss exponent. We thus have
SNR i , j = ξ i , j P i D i , j α = ξ i , j P i i j α .
We establish quasi-concavity results in ρ 2 and r , where r is the position of the relay node.

4.1. CS Rate

Consider the functions (19)–(21) but relabeled as f j ( ρ , r ) , g j ( ρ , r ) , and R C S ( ρ , r ) to emphasize the dependence on the considered parameters. We again consider 0 ρ 1 and positive S .
Lemma 3.
f j ( ρ , r ) and g j ( ρ , r ) are quasi-concave in r for fixed ρ. Furthermore, f j ( ρ , r ) is quasi-concave in ( ρ 2 , r ) .
Proof. 
Consider the functions
f ˜ j ( ρ , D α ) = ξ s , j P s D s , j α + ξ r , j P r D α + 2 ρ ξ s , j P s D s , j α ξ r , j P r D α
g ˜ j ( ρ , D α ) = ( 1 ρ 2 ) ξ s , j P s D s , j α + ξ s , r P s D α
which are quasi-linear in D α for fixed ρ since they are decreasing in D α . However, D r , j α is a convex function of r for α 1 , and thus Lemma A5, Part 5, in Appendix B establishes that f j ( ρ , r ) is quasi-concave in r for fixed ρ . Similarly, D s , r α is a convex function of r for α 1 , and we find that g j ( ρ , r ) is quasi-concave in r for fixed ρ .
Next, substitute a = D α and b = ρ 2 into the third function of Lemma A6, and use Lemma A5, Part 1, to show that f ˜ j ( ρ , D α ) is quasi-concave in ( ρ 2 , D α ) . However, f ˜ j is decreasing in D α and D r , j α is convex in r , so Lemma A5, Part 5, establishes that f j ( ρ , r ) is quasi-concave in ( ρ 2 , r ) . ☐
Unfortunately, g ˜ j is quasi-convex (and not quasi-concave) in ( ρ 2 , D α ) . To see this, substitute a = D α and b = ρ 2 into the fourth function of Lemma A6. Quasi-concavity would have been useful since it would have permitted using Lemma A5, Parts 2 and 4, to establish the quasi-concavity of
R C S ( r ) = max ρ min 1 j N min C ( f j ( ρ , r ) ) , C ( g j ( ρ , r ) ) .
However, we have been unable to prove this, and our numerical results suggest that R C S ( ρ , r ) is not quasi-concave in ( ρ 2 , r ) . Nevertheless, Lemma 3 suffices to establish an intermediate result which is useful in Section 5 when we study ρ = 0 .
Theorem 4.
R C S ( ρ , r ) is quasi-concave in r for fixed ρ, 0 ρ 1 .
Proof. 
R C S ( ρ , r ) is the minimum of functions that are quasi-concave in r . Lemma A5, Part 2, thus establishes the theorem. ☐

4.2. DF Rate

The quasi-convexity of g ˜ j ( ρ , D α ) relaxes for the DF rate (25). Consider the negative of the fourth function of Lemma A6 in Appendix B with k 1 = 0 :
f ( a , b ) = ( 1 b ) k 2 / a .
This function is quasi-linear in ( a , b ) since both its superlevel and sublevel sets are convex. This result implies the following theorem. We again consider the functions (24)–(25) but relabeled as g j ( ρ , r ) and R D F ( ρ , r ) . We further define
g ˜ j ( ρ , D α ) = ( 1 ρ 2 ) ξ s , r P s D α
R D F ( r ) = max ρ min 1 j N min C ( f j ( ρ , r ) ) , C ( g j ( ρ , r ) ) .
As above, we consider 0 ρ 1 and positive S .
Theorem 5.
R D F ( ρ , r ) is quasi-concave in ( ρ 2 , r ) , and R D F ( r ) is quasi-concave in r .
Proof. 
g ˜ j ( ρ , D α ) is quasi-linear in ( ρ 2 , D α ) and decreasing in D α . Furthermore, D s , r α is convex in r , and thus Lemma A5, Part 5, in Appendix B establishes that g j ( ρ , r ) is quasi-concave in ( ρ 2 , r ) . R D F ( ρ , r ) is therefore quasi-concave in r , as it is the minimum of quasi-concave functions (see Lemma A5, Part 2). Furthermore, R D F ( r ) is concave in r by Lemma A5, Part 4. ☐

5. DF Performance

This section presents numerical results for the DF strategy and compares them to results from [7,8,9]. We consider 1-, 2-, and 3-dimensional MRCs with different numbers N of destination nodes. For simplicity, we consider the low SNR or broadband regime where
C ( SNR ) = 1 2 log ( 1 + SNR ) 1 2 SNR .
In other words, we consider the CS and DF rates without the logarithms. This approach is valid not only in the limit of low SNR, but more generally because we proved our quasi-concavity results without taking logarithms. Furthermore, in the low SNR regime the rates of full-duplex and half-duplex transmission are the same under a block power constraint.
We choose P s = P r = P = 1 , α = 2 , and ξ u , v = 1 for all node pairs ( u , v ) . We study both coherent transmission where ρ is optimized and non-coherent transmission with ρ = 0 . The rates are in nats/channel use. Alternatively, suppose we use sync pulses sampled at 2 W samples per second, where W is the (one-sided) signal bandwidth. Suppose further that the (one-sided) noise power spectral density is 1 Watt/Hz. Then at low SNR the rates in nats/channel use are the same as the rates in nats/sec.

5.1. One Dimension

Consider a relay channel ( N = 1 ) where the source is at the origin ( s = 0 ) and the destination is at point 1 ( 1 = 1 ). Figure 4 shows the low SNR CS rates, DF rates, and the routing-based DF (RDF) rates developed in [7], which are given by
R C S 1 2 min ξ s , 1 P s s 1 α + ξ r , 1 P r r 1 α + 2 ρ ξ s , 1 ξ r , 1 P s P r s 1 α / 2 r 1 α / 2 , ( 1 ρ 2 ) ξ s , 1 P s s 1 α + ξ s , r P s s r α
R D F 1 2 min ξ s , 1 P s s 1 α + ξ r , 1 P r r 1 α + 2 ρ ξ s , 1 ξ r , 1 P s P r s 1 α / 2 r 1 α / 2 , ( 1 ρ 2 ) ξ s , r P s s r α
R R D F max 0 β 1 1 2 min ξ r , 1 P r r 1 α , β ξ s , r P s s r α + ( 1 β ) ξ s , 1 P s s 1 α .
Observe that all curves are quasi-concave (but not concave) in r . Theorems 4 and 5 predict the quasi-concavity for all curves except for the coherent CS rates. Observe also that the curves for the coherent and non-coherent rates merge for relay positions exceeding a certain value ( r = 0.5 and r 0.47 for the respective CS and DF rates). The reason for this behavior is that ρ = 0 is optimal for the coherent CS and DF rates beyond these positions, see the ρ curve in [1] (Figure 16). Furthermore, the non-coherent CS rates coincide with the non-coherent DF rates for a large range of r .
The best relay positions for the two strategies are different. For example, r = 0.5 maximizes R R D F while the r maximizing R D F is closer to the source. This is because when the source transmits, the relay and the destination listen, and the destination “collects” information. The relay can thus be positioned closer to the source while maintaining the same information rate from the source to the relay, and from the source-relay pair to the destination. At the optimal positions, we compute R D F 2.26 P nats/sec and R R D F = 2 P nats/sec, so the DF gain is ≈13%.
Finally, we illustrate that R D F ( ρ , r ) is quasi-concave in ( ρ 2 , r ) in Figure 5. The contour lines form convex regions, as predicted by Theorem 5.

5.2. Two Dimensions

Consider N = 5 destinations positioned on a square in the two-dimensional Euclidean plane with the source node at the origin. Figure 6a plots the node positions as circles, and the non-coherent R D F as a function of the relay position. The best relay position is shown by a circle labeled r DF and the corresponding rate is R D F 0.011 P nats/sec. Figure 6c plots the low SNR two-hop rate
R 2 H min 1 j 5 1 2 min ξ s , r P s s r α , ξ r , j P r r j α
as a function of the relay position. The best relay position is shown by a circle labeled r 2 H and the corresponding two-hop rate is R 2 H = 0.01 P nats/sec. The non-coherent DF gain is thus ≈10%.
Figure 6b,d shows contour plots for R D F and R 2 H . The contours form convex regions, as predicted by Theorem 5. Again, the relay position maximizing R D F lies closer to the source than the relay position maximizing R 2 H .

5.3. Three Dimensions

Consider N = 5 destinations positioned in 3-dimensional Euclidean space as in Figure 7. The figure also shows the convex hull (a polyhedron) of the points. The points r DF and r 2 H denote the relay positions that maximize the non-coherent R D F and R 2 H , respectively. We remark that r DF and r 2 H remain unchanged if more destinations are positioned inside the polyhedron. This is because the points in the polyhedron receive at least the same rate as the worst of the five nodes at the corner points.

6. Discussion

6.1. Complex AWGN Channels

For complex-alphabet AWGN channels, we could replace (5) and (6) by adding phases ϕ i , j for i = s , r and j = 1 , 2 , , N as follows:
Y r = a s , r e j ϕ s , r X s + Z r
Y j = a s , j e j ϕ s , j X s + a r , j e j ϕ r , j X r + Z j
where the noise variables Z r and Z j , j = 1 , 2 , , N , are independent, identically distributed, circularly symmetric, complex, Gaussian random variables with zero mean and unit variance. The distance dependence of a i , j can be chosen as in (28) and the phase dependence as
ϕ i , j = 2 π D i , j λ
where λ = c / f o is the wavelength, c is the speed of light, and f o is the carrier frequency.
For example, the DF rate (14), normalized by the number of real dimensions, is
R D F = max ρ min 1 j N min C ( SNR s , j + SNR r , j + 2 ρ e j ( ϕ s , j ϕ r , j ) SNR s , j SNR r , j , C ( ( 1 | ρ | 2 ) SNR s , r )
where the complex correlation coefficient ρ satisfies 0 | ρ | 1 . Observe that for a classic relay channel, with N = 1 destination, one can choose ρ to make ρ e j ( ϕ s , 1 ϕ r , 1 ) real and non-negative, as for real alphabet AWGN channels. However, for N 2 one must choose complex ρ in general. Furthermore, the quasi-concavity in r will not be valid in general because the phases ϕ r , j change with r , and we cannot optimize ρ for each destination node separately. However, we remark that this effect is “local" in the sense that for large carrier frequencies the phase variations are sensitive to changes in r . A pragmatic approach would then be to optimize r for non-coherent transmission ( ρ = 0 ) even if beamforming is permitted. Furthermore, if the channel exhibits random phase variations, then the best approach is to choose ρ = 0 (see [1], Figure 18) in which case we have quasi-concavity for both the CS and DF rates. Finally, we remark that it might be interesting to consider quasi-concavity in the correlation coefficients for problems where the source and relay have sufficiently many antennas to overcome the problem outlined above.

6.2. Sum-Power Constraint

For some applications, it is interesting to consider a sum-power constraint
E k = 1 n | X | s , k 2 + | X | r , k 2 n P T .
As is usually done, we set P r = P T P s and consider P s , 0 P s P T as a new optimization parameter. One might now hope that R C S ( P s , r ) or R D F ( P s , r ) are quasi-concave in ( P s , r ) for fixed ρ , or at least for ρ = 0 . Unfortunately, we have found counterexamples that show this is not the case. The rate functions do seem to have interesting properties, however, and these deserve further exploration.

7. Conclusions

Various quasi-concavity results were established for AWGN MRCs. In particular, the CS rates are quasi-concave in the relay position for a fixed correlation coefficient (Theorem 4) and the DF rates are quasi-concave in the relay position (Theorem 5).

Author Contributions

Both authors conceived the problem and solution and wrote the paper.

Funding

This research was funded by the German Ministry of Education and Research in the framework of an Alexander von Humboldt Professorship.

Acknowledgments

The authors thank the reviewers for their comments that helped to improve the paper.

Conflicts of Interest

The authors declare no conflicts of interest.

Appendix A. Covariance Matrices and Concavity

The covariance matrix of a real-valued random column vector V is
Q V = E ( V E V ) ( V E V ) T .
A useful property of covariance matrices is as follows (see [14], p. 684). If Q V is a principal minor of Q V , then the following function is concave in Q V :
f ( Q V ) = log det Q V det Q V .

Appendix B. Concave and Quasi-Concave Functions

We review results on quasi-concavity, and then establish quasi-concavity for several functions.

Appendix B.1. Definitions

Consider the following sets. The domain D f of a real-valued function f : R n R is the set of arguments for which f is defined. The hypograph and hypergraph of f are the respective
H f = { ( x , y ) y f ( x ) } , H ^ f = { ( x , y ) y f ( x ) } .
The superlevel and sublevel sets of f with respect to β R are the respective
S f , β = { x | f ( x ) β } , S ^ f , β = { x | f ( x ) β } .
Concave and quasi-concave functions can be defined via the convexity of these sets. Recall that a set S , S R n , is convex if for any two points x 1 and x 2 in S and for any λ satisfying 0 λ 1 we have
λ x 1 + λ ¯ x 2 S
where λ ¯ = 1 λ . Suppose that D f is convex. The function f is concave over D f if and only if its hypograph H f is convex. Similarly, f is convex over D f if and only if H ^ f is convex. The function f is quasi-concave over D f if and only if all its superlevel sets are convex, and f is quasi-convex over D f if and only if all its sublevel sets are convex. A function that is quasi-convex and quasi-concave is called quasi-linear. For example, any non-increasing or non-decreasing function is quasi-linear.

Appendix B.2. Basic Properties

Two properties of concave and quasi-concave functions are as follows; these properties are often used as the definitions of such functions. Similar properties exist for convex and quasi-convex functions.
Lemma A1.
The function f is concave if and only if
f ( λ x 1 + λ ¯ x 2 ) λ f ( x 1 ) + λ ¯ f ( x 2 )
for all x 1 and x 2 in D f and for all 0 λ 1 .
Lemma A2.
The function f is quasi-concave if and only if
f ( λ x 1 + λ ¯ x 2 ) min ( f ( x 1 ) , f ( x 2 ) )
for all x 1 and x 2 in D f and for all 0 λ 1 .
The next two properties assume that f is twice differentiable and that D f is convex. Let H f ( x ) and B f ( x ) be the respective Hessian and bordered Hessian of f at x .
Lemma A3.
(see [15], Section 3.1.4) f is concave if and only if H f ( x ) is negative semidefinite for all x D f .
Lemma A4.
(see [16], p. 771) f is quasi-concave on the open and convex set D f if the determinants D 2 , D 3 , , D n of the respective second to nth leading principal minors of B f ( x ) satisfy ( 1 ) k D k < 0 for k = 2 , 3 , , n and for all x D f .

Appendix B.3. Compositions Preserving Quasi-Concavity

The following compositions preserve quasi-concavity.
Lemma A5.
Suppose f and f i , 1 i n , are quasi-concave, then so are the functions
1. 
h = k 1 f + k 2 , where k 1 0 and k 2 R ;
2. 
h = min 1 i n f i ;
3. 
h = g f where f is quasi-concave and g is non-decreasing;
4. 
h ( a ) = sup b B f ( a , b ) where B is a convex set;
5. 
h ( a , b ) = f ( g ( a ) , b ) where g is convex and f ( a ˜ , b ) is non-increasing in a ˜ for fixed b .
Proof. 
Properties 1)–4) are standard (see [15], Section 3.4). For property 5), observe that
h ( λ a 1 + λ ¯ a 2 , λ b 1 + λ ¯ b 2 ) = f ( g ( λ a 1 + λ ¯ a 2 ) , λ b 1 + λ ¯ b 2 ) ( a ) f ( λ g ( a 1 ) + λ ¯ g ( a 2 ) , λ b 1 + λ ¯ b 2 ) ( b ) min f ( g ( a 1 ) , b 1 ) , f ( g ( a 2 ) , b 2 )
where ( a ) follows because g ( λ a 1 + λ ¯ a 2 ) λ g ( a 1 ) + λ ¯ g ( a 2 ) and f ( a ˜ , b ) is non-increasing in a ˜ . Step ( b ) follows because f is quasi-concave. ☐

Appendix B.4. Examples of Quasi-Concave Functions

We establish quasi-concavity for several useful functions.
Lemma A6.
The following functions are quasi-concave for x = ( a b ) with non-negative entries.
1. 
f ( x ) = a b
2. 
f ( x ) = a b a + b + k for a positive constant k
3. 
f ( x ) = k 1 / a + 2 k 2 b / a for positive constants k 1 , k 2
4. 
f ( x ) = ( 1 b ) ( k 1 + k 2 / a ) for positive constants k 1 , k 2 , and b 1
Furthermore, the following function is quasi-concave for x = ( a b c ) with non-negative entries.
5. 
f ( x ) = a + b + 2 a b c
Proof. 
We consider positive x , and we use bordered Hessians B f ( x ) and the derivatives D k of their kth leading principal minors, k = 2 , 3 , , n . The results extend to non-negative x by using continuity at zero values, except for the third and fourth functions where a = 0 makes the functions undefined.
  • We have D 2 < 0 and D 3 > 0 for
    B f ( x ) = 0 b a b 0 1 a 1 0 .
  • We have D 2 < 0 and D 3 > 0 for
    B f ( x ) = 0 b ( b + k ) ( a + b + k ) 2 a ( a + k ) ( a + b + k ) 2 b ( b + k ) ( a + b + k ) 2 2 b ( b + k ) ( a + b + k ) 3 2 a b + ( a + b + k ) k ( a + b + k ) 3 a ( a + k ) ( a + b + k ) 2 2 a b + ( a + b + k ) k ( a + b + k ) 3 2 a ( a + k ) ( a + b + k ) 3 .
  • We have D 2 < 0 and D 3 > 0 for
    B f ( x ) = 0 k 1 + k 2 a b a 2 k 2 a b k 1 + k 2 a b a 2 4 k 1 + 3 k 2 a b 2 a 3 k 2 2 a 3 / 2 b k 2 a b k 2 2 a 3 / 2 b k 2 2 b 3 / 2 a .
  • If b 1 , we have D 2 < 0 and D 3 > 0 for
    B f ( x ) = 0 ( 1 b ) k 2 a 2 k 1 + k 2 a ( 1 b ) k 2 a 2 2 ( 1 b ) k 2 a 3 k 2 a 2 k 1 + k 2 a k 2 a 2 0 .
  • We have D 2 < 0 , D 3 > 0 and D 4 < 0 for
    B f ( x ) = 0 1 + b c a 1 + a c b a b c 1 + b c a b c 2 a 3 / 2 1 2 c a b 1 2 b a c 1 + a c b 1 2 c a b a c 2 b 3 / 2 1 2 a b c a b c 1 2 b a c 1 2 a b c a b 2 c 3 / 2 .
 ☐

References

  1. Kramer, G.; Gastpar, M.; Gupta, P. Cooperative strategies and capacity theorems for relay networks. IEEE Trans. Inf. Theory 2005, 51, 3037–3063. [Google Scholar] [CrossRef]
  2. Lin, B.; Ho, P.-H.; Xie, L.-L.; Shen, X.; Tapolcai, J. Optimal relay station placement in broadband wireless access networks. IEEE Trans. Mob. Comput. 2010, 9, 259–269. [Google Scholar] [CrossRef]
  3. Aggarwal, V.; Bennatan, A.; Calderbank, A.R. Calderbank. On maximizing coverage in Gaussian relay channels. IEEE Trans. Inf. Theory 2009, 55, 2518–2536. [Google Scholar] [CrossRef]
  4. Joshi, G.; Karandikar, A. Optimal relay placement for cellular coverage extension. In Proceedings of the Seventeenth National Conference on Communications, Bangalore, India, 28–30 January 2011; pp. 1196–1200. [Google Scholar]
  5. Lee, J.; Wang, H.; Andrews, J.G.; Hong, D. Outage probability of cognitive relay networks with interference constraints. IEEE Trans. Wirel. Commun. 2011, 10, 390–395. [Google Scholar] [CrossRef]
  6. Chen, X.; Song, S.H.; Letaief, K.B. Relay position optimization improves finite-SNR diversity gain of decode-and-forward MIMO relay systems. IEEE Trans. Commun. 2012, 60, 3311–3321. [Google Scholar] [CrossRef]
  7. Thakur, M.; Fawaz, N.; Médard, M. Optimal relay location and power allocation for low-SNR broadcast relay channels. In Proceedings of the 30th IEEE International Conference on Computer Communications (INFOCOM 2011), Shanghai, China, 10–15 April 2011; pp. 2822–2830. [Google Scholar]
  8. Thakur, M.; Fawaz, N.; Médard, M. On the geometry of wireless network multicast in 2-D. In Proceedings of the 2011 IEEE International Symposium on Information Theory, Saint Petersburg, Russian, 31 July–5 August 2011; pp. 1628–1632. [Google Scholar]
  9. Thakur, M.; Fawaz, N.; Médard, M. Reducibility of joint relay positioning and flow optimization problem. In Proceedings of the 2012 IEEE International Symposium on Information Theory, Boston, MA, USA, 1–6 July 2012; pp. 1117–1121. [Google Scholar]
  10. Thakur, M.; Kramer, G. Relay Positioning for Multicast Relay Networks. In Proceedings of the 2013 IEEE International Symposium on Information Theory, Istanbul, Turkey, 7–12 July 2013; pp. 1954–1958. [Google Scholar]
  11. Thakur, M.; Kramer, G. Quasi-concavity for Gaussian multicast relay channels. In Proceedings of the 2015 IEEE International Symposium on Information Theory, Hong Kong, China, 14–19 June 2015; pp. 2867–2869. [Google Scholar]
  12. Cover, T.M.; El Gamal, A. Capacity theorems for the relay channel. IEEE Trans. Inf. Theory 1979, 25, 572–584. [Google Scholar] [CrossRef]
  13. Kramer, G.; Maric, I.; Yates, R.D. Cooperative Communications. Found. Trends Netw. 2006, 1, 271–425. [Google Scholar] [CrossRef] [Green Version]
  14. Cover, T.M.; Thomas, J.A. Elements of Information Theory, 2nd ed.; John Wiley & Sons: New York, NY, USA, 2006. [Google Scholar]
  15. Boyd, S.; Vandenberghe, L. Convex Optimization; Cambridge University Press: New York, NY, USA, 2004. [Google Scholar]
  16. Bazaraa, M.S.; Sherali, H.D.; Shetty, C.M. Nonlinear Programming; Wiley: Hoboken, NJ, USA, 2006. [Google Scholar]
Figure 1. Multicast relay channel (MRC) with two destinations.
Figure 1. Multicast relay channel (MRC) with two destinations.
Entropy 21 00109 g001
Figure 2. AWGN MRC with two destinations.
Figure 2. AWGN MRC with two destinations.
Entropy 21 00109 g002
Figure 3. Contour plot of R C S ( ρ , S ( P ) ) when P s = 1 .
Figure 3. Contour plot of R C S ( ρ , S ( P ) ) when P s = 1 .
Entropy 21 00109 g003
Figure 4. Relay channel rates for low signal-to-noise ratio (SNR) and P = 1 .
Figure 4. Relay channel rates for low signal-to-noise ratio (SNR) and P = 1 .
Entropy 21 00109 g004
Figure 5. Contour plot of R D F ( ρ , r ) in (37).
Figure 5. Contour plot of R D F ( ρ , r ) in (37).
Entropy 21 00109 g005
Figure 6. (a) R D F for N = 5 ; (b) R D F contour plot; (c) R 2 H for the same network; (d) R 2 H contour plot.
Figure 6. (a) R D F for N = 5 ; (b) R D F contour plot; (c) R 2 H for the same network; (d) R 2 H contour plot.
Entropy 21 00109 g006
Figure 7. N = 5 destination geometry in three dimensions.
Figure 7. N = 5 destination geometry in three dimensions.
Entropy 21 00109 g007

Share and Cite

MDPI and ACS Style

Thakur, M.; Kramer, G. Quasi-Concavity for Gaussian Multicast Relay Channels. Entropy 2019, 21, 109. https://0-doi-org.brum.beds.ac.uk/10.3390/e21020109

AMA Style

Thakur M, Kramer G. Quasi-Concavity for Gaussian Multicast Relay Channels. Entropy. 2019; 21(2):109. https://0-doi-org.brum.beds.ac.uk/10.3390/e21020109

Chicago/Turabian Style

Thakur, Mohit, and Gerhard Kramer. 2019. "Quasi-Concavity for Gaussian Multicast Relay Channels" Entropy 21, no. 2: 109. https://0-doi-org.brum.beds.ac.uk/10.3390/e21020109

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop