This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Performance of Hybrid-ARQ in Block-Fading Channels: A Fixed Outage Probability Analysis

Peng Wu and Nihar Jindal The authors are with the Department of Electrical and Computer Engineering, University of Minnesota, Minneapolis, MN 55455, USA (email: pengwu@umn.edu; nihar@umn.edu).
Abstract

This paper studies the performance of hybrid-ARQ (automatic repeat request) in Rayleigh block-fading channels. The long-term average transmitted rate is analyzed in a fast-fading scenario where the transmitter only has knowledge of channel statistics, and, consistent with contemporary wireless systems, rate adaptation is performed such that a target outage probability (after a maximum number of H-ARQ rounds) is maintained. H-ARQ allows for early termination once decoding is possible, and thus is a coarse, and implicit, mechanism for rate adaptation to the instantaneous channel quality. Although the rate with H-ARQ is not as large as the ergodic capacity, which is achievable with rate adaptation to the instantaneous channel conditions, even a few rounds of H-ARQ make the gap to ergodic capacity reasonably small for operating points of interest. Furthermore, the rate with H-ARQ provides a significant advantage compared to systems that do not use H-ARQ and only adapt rate based on the channel statistics.

I Introduction

ARQ (automatic repeat request) is an extremely powerful type of feedback-based communication that is extensively used at different layers of the network stack. The basic ARQ strategy adheres to the pattern of transmission followed by feedback of an ACK/NACK to indicate successful/unsuccessful decoding. If simple ARQ or hybrid-ARQ (H-ARQ) with Chase combining (CC) [1] is used, a NACK leads to retransmission of the same packet in the second ARQ round. If H-ARQ with incremental redundancy (IR) is used, the second transmission is not the same as the first and instead contains some “new” information regarding the message (e.g., additional parity bits). After the second round the receiver again attempts to decode, based upon the second ARQ round alone (simple ARQ) or upon both ARQ rounds (H-ARQ, either CC or IR). The transmitter moves on to the next message when the receiver correctly decodes and sends back an ACK, or a maximum number of ARQ rounds (per message) is reached.

ARQ provides an advantage by allowing for early termination once sufficient information has been received. As a result, it is most useful when there is considerably uncertainty in the amount/quality of information received. At the network layer, this might correspond to a setting where the network congestion is unknown to the transmitter. At the physical layer, which is the focus of this paper, this corresponds to a fading channel whose instantaneous quality is unknown to the transmitter.

Although H-ARQ is widely used in contemporary wireless systems such as HSPA [2], WiMax [3] (IEEE 802.16e) and 3GPP LTE [4], the majority of research on this topic has focused on code design, e.g., [5], [6], [7], while relatively little research has focused on performance analysis of H-ARQ [8]. Most relevant to the present work, in [9] Caire and Tuninetti established a relationship between H-ARQ throughput and mutual information in the limit of infinite block length. For multiple antenna systems, the diversity-multiplexing-delay tradeoff of H-ARQ was studied by El Gamal et al. [10], and the coding scheme achieving the optimal tradeoff was introduced; Chuang et al. [11] considered the optimal SNR exponent in the block-fading MIMO (multiple-input multiple-output) H-ARQ channel with discrete input signal constellation satisfying a short-term power constraint. H-ARQ has also been recently studied in quasi-static channels (i.e., the channel is fixed over all H-ARQ rounds) [12], [13] and shown to bring benefits to secrecy [14].

In this paper we build upon the results of [9] and perform a mutual information-based analysis of H-ARQ in block-fading channels. We consider a scenario where the fading is too fast to allow instantaneous channel quality feedback to the transmitter, and thus the transmitter only has knowledge of the channel statistics, but nonetheless each transmission experiences only a limited degree of channel selectivity. In this setting, rate adaptation can only be performed based on channel statistics and achieving a reasonable error/outage probability generally requires a conservative choice of rate if H-ARQ is not used. On the other hand, H-ARQ allows for implicit rate adaptation to the instantaneous channel quality because the receiver terminates transmission once the channel conditions experienced by a codeword are good enough to allow for decoding.

We analyze the long-term average transmitted rate achieved with H-ARQ, assuming that there is a maximum number of H-ARQ rounds and that a target outage probability at H-ARQ termination cannot be exceeded. We compare this rate to that achieved without H-ARQ in the same setting as well as to the ergodic capacity, which is the achievable rate in the idealized setting where instantaneous channel information is available to the transmitter. The main findings of the paper are that (a) H-ARQ generally provides a significant advantage over systems that do not use H-ARQ but have an equivalent level of channel selectivity, (b) the H-ARQ rate is reasonably close to the ergodic capacity in many practical settings, and (c) the rate with H-ARQ is much less sensitive to the desired outage probability than an equivalent system that does not use H-ARQ.

The present work differs from prior literature in a number of important aspects. One key distinction is that we consider systems in which the rate is adapted to the average SNR such that a constant target outage probability is maintained at all SNR’s, whereas most prior work has considered either fixed rate (and thus decreasing outage) [11] or increasing rate and decreasing outage as in the diversity-multiplexing tradeoff framework [10][15]. The fixed outage paradigm is consistent with contemporary wireless systems where an outage level near 1% is typical (see [16] for discussion), and certain conclusions depend heavily on the outage assumption. With respect to [9], note that the focus of [9] is on multi-user issues, e.g., whether or not a system becomes interference-limited at high SNR in the regime of very large delay, whereas we consider single-user systems and generally focus on performance with short delay constraints (i.e., maximum number of H-ARQ rounds). In addition, we use the attempted transmission rate, rather than the successful rate (which is used in [9]), as our performance metric. This is motivated by applications such as Voice-over-IP (VoIP), where a packet is dropped (and never retransmitted) if it cannot be decoded after the maximum number of H-ARQ rounds and quality of service is maintained by achieving the target (post-H-ARQ) outage probability. On the other hand, reliable data communication requires the use of higher-layer retransmissions whenever H-ARQ outages occur; in such a setting, the relevant metric is the successful rate, which is the product of the attempted transmission rate and the success probability (i.e., one minus the post-H-ARQ outage probability). If the post-H-ARQ outage is fixed to some target value (e.g., 1%1\%), then studying the attempted rate is effectively equivalent to studying the successful rate.111If the post-H-ARQ outage probability can be optimized, then a careful balancing between the attempted rate and higher-layer retransmissions should be conducted in order to maximize the successful rate. Although this is beyond the scope of the present paper, note that some results in this direction can be found in [17] [18].

II System Model

We consider a block-fading channel where the channel remains constant over a block but varies independently from one block to another. The tt-th received symbol in the i-th block is given by:

yt,i=SNRhixt,i+zt,i,\displaystyle y_{t,i}=\sqrt{\mbox{\scriptsize\sf SNR}}~h_{i}x_{t,i}+z_{t,i}, (1)

where the index i=1,2,i=1,2,\cdots indicates the block number, t=1,2,,Tt=1,2,\cdots,T indexes channel uses within a block, SNR is the average received SNR, hih_{i} is the fading channel coefficient in the ii-th block, and xt,i,yt,i,x_{t,i},y_{t,i}, and zt,iz_{t,i} are the transmitted symbol, received symbol, and additive noise, respectively. It is assumed that hkh_{k} is complex Gaussian (circularly symmetric) with unit variance and zero mean, and that h1,h2,h_{1},h_{2},\ldots are i.i.d.. The noise zt,iz_{t,i} has the same distribution as hkh_{k} and is independent across channel uses and blocks. The transmitted symbol xt,ix_{t,i} is constrained to have unit average power; we consider Gaussian inputs, and thus xt,ix_{t,i} has the same distribution as the fading and the noise. Although we focus only on Rayleigh fading and single antenna systems, our basic insights can be extended to incorporate other fading distributions and MIMO as discussed in Section IV (Remark 1).

We consider the setting where the receiver has perfect channel state information (CSI), while the transmitter is aware of the channel distribution but does not know the instantaneous channel quality. This models a system in which the fading is too fast to allow for feedback of the instantaneous channel conditions from the receiver back to the transmitter, i.e., the channel coherence time is not much larger than the delay in the feedback loop. In cellular systems this is the case for moderate-to-high velocity users. This setting is often referred to as open-loop because of the lack of instantaneous channel tracking at the transmitter, although other forms of feedback, such as H-ARQ, are permitted. The relevant performance metrics, notably what we refer to as outage probability and fixed outage transmitted rate, are specified at the beginning of the relevant sections.

If H-ARQ is not used, we assume each codeword spans LL fading blocks; LL is therefore the channel selectivity experienced by each codeword. When H-ARQ is used, we make the following assumptions:

  • The channel is constant within each H-ARQ round (TT symbols), but is independent across H-ARQ rounds.222An intuitive but somewhat misleading extension of the quasi-static fading model to the H-ARQ setting is to assume that the channel is constant for the duration of the H-ARQ rounds corresponding to a particular message/codeword, but is drawn independently across different messages. Because more H-ARQ rounds are needed to decode when the channel quality is poor, such a model actually changes the underlying fading distribution by increasing the probability of poor states and reducing the probability of good channel states. In this light, it is more accurate model the channel across H-ARQ rounds according to a stationary and ergodic random process with a high degree of correlation.

  • A maximum of MM H-ARQ rounds are allowed. An outage is declared if decoding is not possible after MM rounds, and this outage probability can be no larger than the constraint ϵ\epsilon.

Because the channel is assumed to be independent across H-ARQ rounds, MM is the maximum amount of channel selectivity experienced by a codeword. When comparing H-ARQ and no H-ARQ, we set L=ML=M such that maximum selectivity is equalized.

It is worth noting that these assumptions on the channel variation are quite reasonable for the fast-fading/open-loop scenarios. Transmission slots in modern systems are typically around one millisecond, during which the channel is roughly constant even for fast fading. 333Frequency-domain channel variation within each H-ARQ round is briefly discussed in Section IV-B. An H-ARQ round generally corresponds to a single transmission slot, but subsequent ARQ rounds are separated in time by at least a few slots to allow for decoding and ACK/NACK feedback; thus the assumption of independent channels across H-ARQ rounds is reasonable. Moreover, a constraint on the number of H-ARQ rounds limits complexity (the decoder must retain information received in prior H-ARQ rounds in memory) and delay.

Throughout the paper we use the notation Fϵ1(X)F_{\epsilon}^{-1}(X) to denote the solution yy to the equation [Xy]=ϵ\mathbb{P}[X\leq y]=\epsilon, where XX is a random variable; this quantity is well defined wherever it is used.

III Performance Without H-ARQ: Fixed-Length Coding

We begin by studying the baseline scenario where H-ARQ is not used and every codeword spans LL fading blocks. In this setting the outage probability is the probability that mutual information received over the LL fading blocks is smaller than the transmitted rate RR [19, eq (5.83)]:

Pout(R,SNR)=[1Li=1Llog2(1+SNR|hi|2)R].\displaystyle P_{\textrm{out}}(R,\mbox{\scriptsize\sf SNR})={\mathbb{P}}\left[\frac{1}{L}\sum_{i=1}^{L}\log_{2}(1+\mbox{\scriptsize\sf SNR}|h_{i}|^{2})\leq R\right]. (2)

where hih_{i} is the channel in the ii-th fading block. The outage probability reasonably approximates the decoding error probability for a system with strong coding [20] [21], and the achievability of this error probability has been rigorously shown in the limit of infinite block length (TT\rightarrow\infty) [22] [23].

Because the outage probability is a non-decreasing function of RR, by setting the outage probability to ϵ\epsilon and solving for RR we get the following straightforward definition of ϵ\epsilon-outage capacity [24]:

Definition 1

The ϵ\epsilon-outage capacity with outage constraint ϵ\epsilon and diversity order LL, denoted by CϵL(SNR)C_{\epsilon}^{L}(\mbox{\scriptsize\sf SNR}), is the largest rate such that the outage probability in (2) is no larger than ϵ\epsilon:

CϵL(SNR)\displaystyle C_{\epsilon}^{L}(\mbox{\scriptsize\sf SNR}) \displaystyle\triangleq maxPout(R,SNR)ϵR\displaystyle\max_{P_{\textrm{out}}(R,\mbox{\scriptsize\sf SNR})\leq\epsilon}R (3)

Using notation introduced earlier, the ϵ\epsilon-outage capacity can be rewritten as

CϵL(SNR)\displaystyle C_{\epsilon}^{L}(\mbox{\scriptsize\sf SNR}) =\displaystyle= Fϵ1(1Li=1Llog2(1+SNR|hi|2))\displaystyle F_{\epsilon}^{-1}\left(\frac{1}{L}\sum_{i=1}^{L}\log_{2}(1+\mbox{\scriptsize\sf SNR}|h_{i}|^{2})\right) (4)
=\displaystyle= log2SNR+Fϵ1(1Li=1Llog2(1SNR+|hi|2)).\displaystyle\log_{2}\mbox{\scriptsize\sf SNR}+F_{\epsilon}^{-1}\left(\frac{1}{L}\sum_{i=1}^{L}\log_{2}\left(\frac{1}{\mbox{\scriptsize\sf SNR}}+|h_{i}|^{2}\right)\right). (5)

For L=1L=1, Pout(R,SNR)P_{\textrm{out}}(R,\mbox{\scriptsize\sf SNR}) can be written in closed form and inverted to yield Cϵ1(SNR)=log2(1+loge(11ϵ)SNR)C_{\epsilon}^{1}(\mbox{\scriptsize\sf SNR})=\log_{2}\left(1+\log_{e}\left(\frac{1}{1-\epsilon}\right)\mbox{\scriptsize\sf SNR}\right) [19]. For L>1L>1 the outage probability cannot be written in closed form nor inverted, and therefore CϵL(SNR)C_{\epsilon}^{L}(\mbox{\scriptsize\sf SNR}) must be numerically computed. There are, however, two useful approximations to ϵ\epsilon-outage capacity. The first one is the high-SNR affine approximation [25], which adds a constant rate offset term to the standard multiplexing gain characterization.

Theorem 1

The high-SNR affine approximation to ϵ\epsilon-outage capacity is given by

CϵL(SNR)=log2SNR+Fϵ1(1Li=1Llog2(|hi|2))+o(1),\displaystyle C_{\epsilon}^{L}(\mbox{\scriptsize\sf SNR})=\log_{2}\mbox{\scriptsize\sf SNR}+F_{\epsilon}^{-1}\left(\frac{1}{L}\sum_{i=1}^{L}\log_{2}\left(|h_{i}|^{2}\right)\right)+o(1), (6)

where the notation implies that the o(1)o(1) term vanishes as SNR\mbox{\scriptsize\sf SNR}\rightarrow\infty.

Proof:

The proof is identical to that of the high-SNR offset characterization of MIMO channels in [26, Theorem 1], noting that single antenna block fading is equivalent to a MIMO channel with a diagonal channel matrix. ∎

In terms of standard high SNR notation where C(SNR)=𝒮(log2SNR)+o(1)C(\mbox{\scriptsize\sf SNR})=\mathcal{S}_{\infty}(\log_{2}\mbox{\scriptsize\sf SNR}-\mathcal{L}_{\infty})+o(1) [25][27], the multiplexing gain 𝒮=1\mathcal{S}_{\infty}=1 and the rate offset =Fϵ1(1Li=1Llog2(|hi|2))\mathcal{L}_{\infty}=-F_{\epsilon}^{-1}\left(\frac{1}{L}\sum_{i=1}^{L}\log_{2}\left(|h_{i}|^{2}\right)\right). The rate offset is the difference between the ϵ\epsilon-outage capacity and the capacity of an AWGN channel with signal-to-noise ratio SNR. Although a closed form expression for \mathcal{L}_{\infty} cannot be found for L>1L>1, from [28],

[1Li=1Llog2(|hi|2)y]=2yLG1,L+1L,1(2yL|0,0,,0,10),\displaystyle\mathbb{P}\left[\frac{1}{L}\sum_{i=1}^{L}\log_{2}\left(|h_{i}|^{2}\right)\leq y\right]=2^{yL}G_{1,L+1}^{L,1}\left(2^{yL}|_{0,0,\ldots,0,-1}^{0}\right), (7)

where Gp,qm,n(x|b1,,bqa1,,ap)G_{p,q}^{m,n}\left(x|_{b_{1},\ldots,b_{q}}^{a_{1},\ldots,a_{p}}\right) is the Meijer G-function [29, eq. (9.301)]. Based on (6), \mathcal{L}_{\infty} therefore is the solution to 2LG1,L+1L,1(2L|0,0,,0,10)=ϵ2^{-\mathcal{L}_{\infty}L}G_{1,L+1}^{L,1}\left(2^{-\mathcal{L}_{\infty}L}|_{0,0,\ldots,0,-1}^{0}\right)=\epsilon. The rate offset \mathcal{L}_{\infty} is plotted versus LL in Fig. 1 for ϵ=0.01\epsilon=0.01. As LL\rightarrow\infty the offset converges to 𝔼[log2(|h|2)]0.83-{\mathbb{E}}[\log_{2}\left(|h|^{2}\right)]\approx 0.83, the offset of the ergodic Rayleigh channel [27].

While the affine approximation is accurate at high SNR’s, motivated by the Central Limit Theorem (CLT), an approximation that is more accurate for moderate and low SNR’s is reached by approximating random variable 1Li=1Llog2(1+SNR|hi|2)\frac{1}{L}\sum_{i=1}^{L}\log_{2}(1+\mbox{\scriptsize\sf SNR}|h_{i}|^{2}) by a Gaussian random variable with the same mean and variance [30][31]. The mean μ\mu and variance σ2\sigma^{2} of log2(1+SNR|h|2)\log_{2}(1+\mbox{\scriptsize\sf SNR}|h|^{2}) are given by [32][33]:

μ(SNR)\displaystyle\mu(\mbox{\scriptsize\sf SNR}) =\displaystyle= 𝔼[log2(1+SNR|h|2)]=log2(e)e1/SNRE1(1/SNR),\displaystyle{\mathbb{E}}[\log_{2}(1+\mbox{\scriptsize\sf SNR}|h|^{2})]=\log_{2}(e)e^{1/\mbox{\scriptsize\sf SNR}}E_{1}(1/\mbox{\scriptsize\sf SNR}), (8)
σ2(SNR)\displaystyle\sigma^{2}(\mbox{\scriptsize\sf SNR}) =\displaystyle= 2SNRlog22(e)e1/SNRG3,44,0(1/SNR|0,1,1,10,0,0)μ2(SNR),\displaystyle\frac{2}{\mbox{\scriptsize\sf SNR}}\log_{2}^{2}(e)e^{1/\mbox{\scriptsize\sf SNR}}G_{3,4}^{4,0}\left(1/\mbox{\scriptsize\sf SNR}|_{0,-1,-1,-1}^{0,0,0}\right)-\mu^{2}(\mbox{\scriptsize\sf SNR}), (9)

where E1(x)=1t1ext𝑑tE_{1}(x)=\int_{1}^{\infty}t^{-1}e^{-xt}dt, and at high SNR the standard deviation σ(SNR)\sigma(\mbox{\scriptsize\sf SNR}) converges to πlog2e6\frac{\pi\log_{2}e}{\sqrt{6}} [33]. The mutual information is thus approximated by a 𝒩(μ(SNR),σ2(SNR)L)\mathcal{N}(\mu(\mbox{\scriptsize\sf SNR}),\frac{\sigma^{2}(\mbox{\scriptsize\sf SNR})}{L}), and therefore

Pout(R,SNR)Q(Lσ(SNR)(μ(SNR)R)),\displaystyle P_{\textrm{out}}(R,\mbox{\scriptsize\sf SNR})\approx Q\left(\frac{\sqrt{L}}{\sigma(\mbox{\scriptsize\sf SNR})}(\mu(\mbox{\scriptsize\sf SNR})-R)\right), (10)

where Q()Q(\cdot) is the tail probability of a unit variance normal. Setting this quantity to ϵ\epsilon and then solving for RR yields an ϵ\epsilon-outage capacity approximation [30, eq. (26)]:

CϵL(SNR)\displaystyle C_{\epsilon}^{L}(\mbox{\scriptsize\sf SNR}) \displaystyle\approx μ(SNR)σ(SNR)LQ1(ϵ).\displaystyle\mu(\mbox{\scriptsize\sf SNR})-\frac{\sigma(\mbox{\scriptsize\sf SNR})}{\sqrt{L}}Q^{-1}(\epsilon). (11)

The accuracy of this approximation depends on how accurately the CDF of a Gaussian matches the CDF (i.e., outage probability) of random variable 1Li=1Llog2(1+SNR|hi|2)\frac{1}{L}\sum_{i=1}^{L}\log_{2}(1+\mbox{\scriptsize\sf SNR}|h_{i}|^{2}). In Fig. 2 both CDF’s are plotted for L=2L=2 and L=10L=10, and SNR=0\mbox{\scriptsize\sf SNR}=0, 1010, and 2020 dB. As expected by the CLT, as LL increases the approximation becomes more accurate. Furthermore, the match is less accurate for very small values of ϵ\epsilon because the tails of the Gaussian and the actual random variable do not precisely match. Finally, note that the match is not as accurate at low SNR’s: this is because the mutual information random variable has a density close to a chi-square in this regime, and is thus not well approximated by a Gaussian. Although not accurate in all regimes, numerical results confirm that the Gaussian approximation is reasonably accurate for the range of interest for parameters (e.g., 0.01ϵ0.20.01\leq\epsilon\leq 0.2 and 0SNR200\leq\mbox{\scriptsize\sf SNR}\leq 20 dB). More importantly, this approximation yields important insights.

In Fig. 3 the true ϵ\epsilon-outage capacity CϵL(SNR)C_{\epsilon}^{L}(\mbox{\scriptsize\sf SNR}) and the affine and Gaussian approximations are plotted versus SNR for ϵ=0.01\epsilon=0.01 and L=3,10L=3,10. The Gaussian approximation is reasonably accurate at moderate SNR’s, and is more accurate for larger values of LL. On the other hand, the affine approximation, which provides a correct high SNR offset, is asymptotically tight at high SNR.

III-A Ergodic Capacity Gap

When evaluating the effect of the diversity order LL, it is useful to compare the ergodic capacity μ(SNR)\mu(\mbox{\scriptsize\sf SNR}) and CϵL(SNR)C_{\epsilon}^{L}(\mbox{\scriptsize\sf SNR}). By Chebyshev’s inequality, for any 0<w<μ0<w<\mu,

[|μ(SNR)1Li=1Llog2(1+SNR|hi|2)|w]σ2(SNR)Lw2\displaystyle\mathbb{P}\left[\left|\mu(\mbox{\scriptsize\sf SNR})-\frac{1}{L}\sum_{i=1}^{L}\log_{2}(1+\mbox{\scriptsize\sf SNR}|h_{i}|^{2})\right|\geq w\right]\leq\frac{\sigma^{2}(\mbox{\scriptsize\sf SNR})}{Lw^{2}} (12)

By replacing ww with μ(SNR)R\mu(\mbox{\scriptsize\sf SNR})-R and equating the right hand side (RHS) with ϵ\epsilon, we get

μ(SNR)σ(SNR)LϵCϵL(SNR)μ(SNR)+σ(SNR)Lϵ.\displaystyle\mu(\mbox{\scriptsize\sf SNR})-\frac{\sigma(\mbox{\scriptsize\sf SNR})}{\sqrt{L\epsilon}}\leq C_{\epsilon}^{L}(\mbox{\scriptsize\sf SNR})\leq\mu(\mbox{\scriptsize\sf SNR})+\frac{\sigma(\mbox{\scriptsize\sf SNR})}{\sqrt{L\epsilon}}. (13)

This implies CϵL(SNR)μ(SNR)C_{\epsilon}^{L}(\mbox{\scriptsize\sf SNR})\rightarrow\mu(\mbox{\scriptsize\sf SNR}) as LL\rightarrow\infty, as intuitively expected; reasonable values of ϵ\epsilon are smaller than 0.50.5, and thus we expect convergence to occur from below.

In order to capture the speed at which this convergence occurs, we define the quantity ΔECFD\Delta_{\rm EC-FD} as the difference between the ergodic and ϵ\epsilon-outage capacities. Based on (13) we can upper bound ΔECFD\Delta_{\rm EC-FD} as:

ΔECFD(SNR)=μ(SNR)CϵL(SNR)σ(SNR)Lϵ.\displaystyle\Delta_{\rm EC-FD}(\mbox{\scriptsize\sf SNR})=\mu(\mbox{\scriptsize\sf SNR})-C_{\epsilon}^{L}(\mbox{\scriptsize\sf SNR})\leq\frac{\sigma(\mbox{\scriptsize\sf SNR})}{\sqrt{L\epsilon}}. (14)

This bound shows that the rate gap goes to zero at least as fast as O(1/L)O\left(1/\sqrt{L}\right). Although we cannot rigorously claim that ΔECFD\Delta_{\rm EC-FD} is of order 1/L1/\sqrt{L}, by (11) the Gaussian approximation to this quantity is:

ΔECFD(SNR)σ(SNR)LQ1(ϵ),\displaystyle\Delta_{\rm EC-FD}(\mbox{\scriptsize\sf SNR})\approx\frac{\sigma(\mbox{\scriptsize\sf SNR})}{\sqrt{L}}Q^{-1}(\epsilon), (15)

which is also O(1/L)O\left(1/\sqrt{L}\right). This approximation becomes more accurate as LL\rightarrow\infty, by the CLT, and thus is expected to correctly capture the scaling with LL. Note that (15) has the interpretation that the rate must be Q1(ϵ)L\frac{Q^{-1}(\epsilon)}{\sqrt{L}} deviations below the ergodic capacity μ(SNR)\mu(\mbox{\scriptsize\sf SNR}) in order to ensure 1ϵ1-\epsilon reliability. In Fig. 4 the actual capacity gap and the approximation in (15) are plotted for ϵ=0.01\epsilon=0.01 and ϵ=0.05\epsilon=0.05 with SNR=20\mbox{\scriptsize\sf SNR}=20 dB, and a reasonable match between the approximation and the exact gap is seen.

IV Performance with Hybrid-ARQ

We now move on to the analysis of hybrid-ARQ, which will be shown to provide a significant performance advantage relative to the baseline of non-H-ARQ performance. H-ARQ is clearly a variable-length code, in which case the average transmission rate must be suitably defined. If each message contains bb information bits and each ARQ round corresponds to TT channel symbols, then the initial transmission rate is RinitbTR_{\textrm{init}}\triangleq\frac{b}{T} bits/symbol. If random variable XiX_{i} denotes the number of H-ARQ rounds used for the ii-th message, then a total of i=1NXi\sum_{i=1}^{N}X_{i} H-ARQ rounds are used and the average transmission rate (in bits/symbol or bps/Hz) across those NN messages is:

NbTi=1NXi=Rinit1Ni=1NXi.\frac{Nb}{T\sum_{i=1}^{N}X_{i}}=\frac{R_{\textrm{init}}}{\frac{1}{N}\sum_{i=1}^{N}X_{i}}. (16)

We are interested in the long-term average transmission rate, i.e., the case where NN\rightarrow\infty. By the law of large numbers (note that the XiX_{i}’s are i.i.d. in our model), 1Ni=1NXi𝔼[X]\frac{1}{N}\sum_{i=1}^{N}X_{i}\rightarrow{\mathbb{E}}[X] and thus the rate converges to

Rinit𝔼[X]bits/symbol\frac{R_{\textrm{init}}}{{\mathbb{E}}[X]}~~\textrm{bits/symbol} (17)

Here XX is the random variable representing the number of H-ARQ rounds per message; this random variable is determined by the specifics of the H-ARQ protocol.

In the remainder of the paper we focus on incremental redundancy (IR) H-ARQ because it is the most powerful type of H-ARQ, although we compare IR to Chase combining in Section IV-E. In [9] it is shown that mutual information is accumulated over H-ARQ rounds when IR is used, and that decoding is possible once the accumulated mutual information is larger than the number of information bits in the message. Therefore, the number of H-ARQ rounds XX is the smallest number mm such that:

i=1mlog2(1+SNR|hi|2)>Rinit.\sum_{i=1}^{m}\log_{2}(1+\mbox{\scriptsize\sf SNR}|h_{i}|^{2})>R_{\textrm{init}}. (18)

The number of rounds is upper bounded by MM, and an outage occurs whenever the mutual information after MM rounds is smaller than RinitR_{\textrm{init}}:

PoutIR,M(Rinit)=[i=1Mlog2(1+SNR|hi|2)Rinit].\displaystyle P_{\textrm{out}}^{\textrm{IR},M}(R_{\textrm{init}})={\mathbb{P}}\left[\sum_{i=1}^{M}\log_{2}(1+\mbox{\scriptsize\sf SNR}|h_{i}|^{2})\leq R_{\textrm{init}}\right]. (19)

This is the same as the expression for outage probability of MM-order diversity without H-ARQ in (2), except that mutual information is summed rather than averaged over the MM rounds. This difference is a consequence of the fact that RinitR_{\textrm{init}} is defined for transmission over one round rather than all MM rounds; dividing by 𝔼[X]{\mathbb{E}}[X] in (17) to obtain the average transmitted rate makes the expressions consistent. Due to this relationship, if the initial rate is set as Rinit=MCϵMR_{\textrm{init}}=M\cdot C_{\epsilon}^{M}, where CϵMC_{\epsilon}^{M} is the ϵ\epsilon-outage capacity for MM-order diversity without H-ARQ, then the outage at H-ARQ termination is ϵ\epsilon.

In order to simplify expressions, it is useful to define Ak(Rinit)A_{k}(R_{\textrm{init}}) as the probability that the accumulated mutual information after kk rounds is smaller than RinitR_{\textrm{init}}:

Ak(Rinit)[i=1klog2(1+SNR|hi|2)Rinit].A_{k}(R_{\textrm{init}})\triangleq{\mathbb{P}}\left[\sum_{i=1}^{k}\log_{2}(1+\mbox{\scriptsize\sf SNR}|h_{i}|^{2})\leq R_{\textrm{init}}\right]. (20)

The expected number of H-ARQ rounds per message is therefore given by:

𝔼[X]=1+k=1M1[X>k]=1+k=1M1Ak(Rinit).\displaystyle{\mathbb{E}}[X]=1+\sum_{k=1}^{M-1}\mathbb{P}[X>k]=1+\sum_{k=1}^{M-1}A_{k}(R_{\textrm{init}}). (21)

The long-term average transmitted rate, which is denoted as CϵIR,MC_{\epsilon}^{\textrm{IR},M}, is defined by (17). With initial rate Rinit=MCϵMR_{\textrm{init}}=M\cdot C_{\epsilon}^{M} we have:444All quantities in this expression except MM are actually functions of SNR. For the sake of compactness, however, dependence upon SNR is suppressed in this and subsequent expressions, except where explicit notation is necessary.

CϵIR,MRinit𝔼[X]=(M𝔼[X])CϵM.\displaystyle C_{\epsilon}^{\textrm{IR},M}\triangleq\frac{R_{\textrm{init}}}{\mathbb{E}[X]}=\left(\frac{M}{{\mathbb{E}}[X]}\right)C_{\epsilon}^{M}. (22)

Note that CϵIR,MC_{\epsilon}^{\textrm{IR},M} is the attempted long-term average transmission rate, as discussed in Section I. For the sake of brevity this quantity is referred to as the H-ARQ rate; this is not to be confused with the initial rate RinitR_{\textrm{init}}. Similarly, we refer to ϵ\epsilon-outage capacity CϵMC_{\epsilon}^{M} as the non-H-ARQ rate in the rest of the paper.

Because 𝔼[X]M{\mathbb{E}}[X]\leq M, the H-ARQ rate is at least as large as the non-H-ARQ rate, i.e., CϵIR,MCϵMC_{\epsilon}^{\textrm{IR},M}\geq C_{\epsilon}^{M}, and the advantage with respect to the non-H-ARQ benchmark is precisely the multiplicative factor M𝔼[X]\frac{M}{{\mathbb{E}}[X]}. This difference is explained as follows. Because Rinit=MCϵMR_{\textrm{init}}=M\cdot C_{\epsilon}^{M}, each message/packet contains CϵMMTC_{\epsilon}^{M}MT information bits regardless of whether H-ARQ is used. Without H-ARQ these bits are always transmitted over MTMT symbols, whereas with H-ARQ an average of only 𝔼[X]T{\mathbb{E}}[X]T symbols are required.

In Fig. 5 the average rates with (CϵIR,MC_{\epsilon}^{\textrm{IR},M}) and without H-ARQ (CϵMC_{\epsilon}^{M}) are plotted versus SNR for ϵ=0.01\epsilon=0.01 and M=1,2M=1,2 and 66 (M=1M=1 does not allow for H-ARQ in our model). Ergodic capacity is also plotted as a reference. Based on the figure, we immediately notice:

  • H-ARQ with 6 rounds outperforms H-ARQ with 2 rounds.

  • H-ARQ provides a significant advantage relative to non-H-ARQ for the same value of MM for a wide range of SNR’s, but this advantage vanishes at high SNR.

Increasing rate with MM is to be expected, because larger MM corresponds to more time diversity and more early termination opportunities. The behavior with respect to SNR is perhaps less intuitive. The remainder of this section is devoted to quantifying and explaining the behavior seen in Fig. 5. We begin by extending the Gaussian approximation to H-ARQ, then examine performance scaling with respect to MM, SNR, and ϵ\epsilon, and finally compare IR to Chase Combining.

IV-A Gaussian Approximation

By the definition of Ak()A_{k}(\cdot) and (21)-(22), the H-ARQ rate can be written as:

CϵIR,M\displaystyle C_{\epsilon}^{\textrm{IR},M} =\displaystyle= AM1(ϵ)1+k=1M1Ak(AM1(ϵ)),\displaystyle\frac{A_{M}^{-1}(\epsilon)}{1+\sum_{k=1}^{M-1}A_{k}\left(A_{M}^{-1}(\epsilon)\right)}, (23)

where AM1()A_{M}^{-1}(\cdot) refers to the inverse of function AM()A_{M}(\cdot). If we use the approach of Section III and approximate the mutual information accumulated in kk rounds by a Gaussian with mean μk\mu k and variance σ2k\sigma^{2}k, where μ\mu and σ2\sigma^{2} are defined in (8) and (9), we have:

Ak(Rinit)Q(μkRinitσk).\displaystyle A_{k}(R_{\textrm{init}})\approx Q\left(\frac{\mu k-R_{\textrm{init}}}{\sigma\sqrt{k}}\right). (24)

Similar to (11), the initial rate Rinit=AM1(ϵ)R_{\textrm{init}}=A_{M}^{-1}(\epsilon) can be approximated as M[μσMQ1(ϵ)]M\left[\mu-\frac{\sigma}{\sqrt{M}}Q^{-1}(\epsilon)\right]. Applying the approximation of Ak(Rinit)A_{k}(R_{\textrm{init}}) to each term in (23) and using the property 1Q(x)=Q(x)1-Q(x)=Q(-x) yields:

CϵIR,MM[μσMQ1(ϵ)]Mk=1M1Q(MkkμσMkQ1(ϵ)).\displaystyle C_{\epsilon}^{\textrm{IR},M}\approx\frac{M\left[\mu-\frac{\sigma}{\sqrt{M}}Q^{-1}(\epsilon)\right]}{M-\sum_{k=1}^{M-1}Q\left(\frac{M-k}{\sqrt{k}}\frac{\mu}{\sigma}-\sqrt{\frac{M}{k}}Q^{-1}(\epsilon)\right)}. (25)

This approximation is easier to compute than the actual H-ARQ rate and is reasonably accurate. Furthermore, it is useful for the insights it can provide.

IV-B Scaling with H-ARQ Rounds MM

In this section we study the dependence of the H-ARQ rate on MM. We first show convergence to the ergodic capacity as MM\rightarrow\infty:

Theorem 2

For any SNR, the H-ARQ rate converges to the ergodic capacity as MM\rightarrow\infty:

limMCϵIR,M(SNR)=μ(SNR)\displaystyle\lim_{M\rightarrow\infty}C_{\epsilon}^{\textrm{IR},M}(\mbox{\scriptsize\sf SNR})=\mu(\mbox{\scriptsize\sf SNR}) (26)
Proof:

See Appendix A. ∎

To quantify how fast this convergence is, similar to Section III-A we investigate the difference between the ergodic capacity and the H-ARQ rate. Defining ΔECIRμ(SNR)CϵIR,M(SNR)\Delta_{\rm EC-IR}\triangleq\mu(\mbox{\scriptsize\sf SNR})-C_{\epsilon}^{\textrm{IR},M}(\mbox{\scriptsize\sf SNR}) we have

ΔECIR\displaystyle\Delta_{\rm EC-IR} =\displaystyle= μ𝔼[X](𝔼[X]MCϵMμ)μ𝔼[X](𝔼[X](MσμQ1(ϵ)M))\displaystyle\frac{\mu}{{\mathbb{E}}[X]}\left({\mathbb{E}}[X]-\frac{M\cdot C_{\epsilon}^{M}}{\mu}\right)\approx\frac{\mu}{{\mathbb{E}}[X]}\left({\mathbb{E}}[X]-\left(M-\frac{\sigma}{\mu}Q^{-1}(\epsilon)\sqrt{M}\right)\right) (27)

where the approximation follows from CϵMμσQ1(ϵ)MC_{\epsilon}^{M}\approx\mu-\frac{\sigma Q^{-1}(\epsilon)}{\sqrt{M}} in (11). Because 𝔼[X]{\mathbb{E}}[X] is on the order of MM (as established in the proof of Theorem 2), the key is the behavior of the term 𝔼[X](MσμQ1(ϵ)M){\mathbb{E}}[X]-\left(M-\frac{\sigma}{\mu}Q^{-1}(\epsilon)\sqrt{M}\right).

To better understand 𝔼[X]{\mathbb{E}}[X] we again return to the Gaussian approximation. While the CDF of XX is defined by (Xk)=1Ak(Rinit)\mathbb{P}\left(X\leq k\right)=1-A_{k}(R_{\textrm{init}}) (for k=1,,M1k=1,\ldots,M-1), we use X~\tilde{X} to denote the random variable using the Gaussian approximation and thus define its CDF (for integers kk) as:

(X~k)=Q(μ(Mk)MσQ1(ϵ)σk)\displaystyle\mathbb{P}\left(\tilde{X}\leq k\right)=Q\left(\frac{\mu(M-k)-\sqrt{M}\sigma Q^{-1}(\epsilon)}{\sigma\sqrt{k}}\right) (28)

where we have used Ak(Rinit)Q(μkRinitσk)A_{k}(R_{\textrm{init}})\approx Q\left(\frac{\mu k-R_{\textrm{init}}}{\sigma\sqrt{k}}\right) evaluated with Rinit=MCϵMMμMσQ1(ϵ)R_{\textrm{init}}=MC_{\epsilon}^{M}\approx M\mu-\sqrt{M}\sigma Q^{-1}(\epsilon). From this expression, we can immediately see that the median of X~\tilde{X} is MσμQ1(ϵ)M\left\lceil M-\frac{\sigma}{\mu}Q^{-1}(\epsilon)\sqrt{M}\right\rceil [34]. If this was equal to the mean of XX, then by (27) the rate difference would be well approximated by βμM\frac{\beta\mu}{M}, where β\beta is the difference between MσμQ1(ϵ)M\left\lceil M-\frac{\sigma}{\mu}Q^{-1}(\epsilon)\sqrt{M}\right\rceil and (MβμQ1(ϵ)M)\left(M-\frac{\beta}{\mu}Q^{-1}(\epsilon)\sqrt{M}\right) and thus is no larger than one. By studying the characteristics of X~\tilde{X} (and of XX) we can see that the median is in fact quite close to the mean. A tedious calculation in Appendix B gives the following approximation to 𝔼[X]\mathbb{E}[X]:

𝔼[X]\displaystyle{\mathbb{E}}[X] \displaystyle\approx MσμQ1(ϵ)M+0.5(1ϵ)σμMQ1(ϵ)Q(x)𝑑x,\displaystyle M-\frac{\sigma}{\mu}Q^{-1}(\epsilon)\sqrt{M}+0.5(1-\epsilon)-\frac{\sigma}{\mu}\sqrt{M}\int_{Q^{-1}(\epsilon)}^{\infty}Q(x)dx, (29)

which is reasonably accurate for large MM. The most important factor is the term 0.5(1ϵ)0.5(1-\epsilon), which is due to the fact that only an integer number of H-ARQ rounds can be used. The factor σμMQ1(ϵ)Q(x)𝑑x-\frac{\sigma}{\mu}\sqrt{M}\int_{Q^{-1}(\epsilon)}^{\infty}Q(x)dx exists because the random variable is truncated at the point where its CDF is 1ϵ1-\epsilon.

Applying this into (27), the rate difference can be approximated as:

ΔECIR\displaystyle\Delta_{\rm EC-IR} \displaystyle\approx μ(0.5(1ϵ)σμMQ1(ϵ)Q(x)𝑑x)MσμQ1(ϵ)MσμMQ1(ϵ)Q(x)𝑑x+0.5(1ϵ).\displaystyle\frac{\mu\left(0.5(1-\epsilon)-\frac{\sigma}{\mu}\sqrt{M}\int_{Q^{-1}(\epsilon)}^{\infty}Q(x)dx\right)}{M-\frac{\sigma}{\mu}Q^{-1}(\epsilon)\sqrt{M}-\frac{\sigma}{\mu}\sqrt{M}\int_{Q^{-1}(\epsilon)}^{\infty}Q(x)dx+0.5(1-\epsilon)}. (30)

The denominator increases with MM at the order of MM (more precisely as MMM-\sqrt{M}), while the numerator actually decreases with MM and can even become negative if MM is extremely large. For reasonable values of MM, however, the negative term in the numerator is essentially inconsequential (for example, if ϵ=0.01\epsilon=0.01 and SNR=10\mbox{\scriptsize\sf SNR}=10 dB, the negative term is much smaller than 0.5(1ϵ)0.5(1-\epsilon) for M<5000M<5000) and thus can be reasonably neglected. By ignoring this negative term and replacing the denominator with the leading order MM term, we get a further approximation of the rate gap:

ΔECIR\displaystyle\Delta_{\rm EC-IR} \displaystyle\approx 0.5(1ϵ)μM\displaystyle\frac{0.5(1-\epsilon)\mu}{M} (31)

Based on this approximation, we see that the rate gap decreases roughly on the order O(1/M)O\left(1/M\right), rather than the O(1/M)O\left(1/\sqrt{M}\right) decrease without H-ARQ. In Fig. 6 we plot the exact capacity gap with and without H-ARQ, as well as the Gaussian approximation to the H-ARQ gap (30) and its simplified form in (31) for ϵ=0.01\epsilon=0.01 at SNR=10\mbox{\scriptsize\sf SNR}=10 dB. Both approximations are seen to be reasonably accurate especially for large MM. In the inset plot, which is in log-log scale, we see that the exact capacity gap goes to zero at order 1/M1/M, consistent with the result obtained from our approximation.

The fast convergence with H-ARQ can be intuitively explained as follows. If transmission could be stopped precisely when enough mutual information has been received, the transmitted rate would be exactly matched to the instantaneous mutual information and thus ergodic capacity would be achieved. When H-ARQ is used, however, transmission can only be terminated at the end of a round as, opposed to within a round, and thus a small amount of the transmission can be wasted. This ”rounding error”, which is reflected in the 0.5(1ϵ)0.5(1-\epsilon) term in (30) and (31), is essentially the only penalty incurred by using H-ARQ rather than explicit rate adaptation.

Remark 1

Because the value of H-ARQ depends primarily on the mean and variance of the mutual information in each H-ARQ round, our basic insights can be extended to multiple-antenna channels and to channels with frequency (or time) diversity within each ARQ round if the change in mean and variance is accounted for. For example, with order FF frequency diversity the mutual information in the ii-th H-ARQ round becomes 1Fl=1Flog(1+SNR|hi,l|2)\frac{1}{F}\sum_{l=1}^{F}\log(1+\mbox{\scriptsize\sf SNR}|h_{i,l}|^{2}), where hi,lh_{i,l} is the channel in the ii-th round on the ll-th frequency channel. The mean mutual information is unaffected, while the variance is decreased by a factor of 1F\frac{1}{F}. \lozenge

IV-C Scaling with SNR

In this section we quantify the behavior of H-ARQ as a function of the average SNR.555 Because constant outage corresponds to the full-multiplexing point, the results of [10] imply that CϵIR,MC_{\epsilon}^{\textrm{IR},M} cannot have a multiplexing gain/pre-log larger than one (in [10] it is shown that H-ARQ does not increase the full multiplexing point). However, the DMT-based results of [10] do not provide rate-offset characterization as in Theorems 3 and 4. Fig. 5 indicated that the benefit of H-ARQ vanishes at high SNR, and the following theorem makes this precise:

Theorem 3

If SNR is taken to infinity while keeping MM fixed, the expected number of H-ARQ rounds converges to MM and the H-ARQ rate converges to CϵM(SNR)C_{\epsilon}^{M}(\mbox{\scriptsize\sf SNR}), the non-H-ARQ rate with the same selectivity:

limSNR𝔼[X]=M\displaystyle\lim_{\mbox{\scriptsize\sf SNR}\rightarrow\infty}{\mathbb{E}}[X]=M (32)
limSNR[CϵIR,M(SNR)CϵM(SNR)]=0\displaystyle\lim_{\mbox{\scriptsize\sf SNR}\rightarrow\infty}\left[C_{\epsilon}^{\textrm{IR},M}(\mbox{\scriptsize\sf SNR})-C_{\epsilon}^{M}(\mbox{\scriptsize\sf SNR})\right]=0 (33)
Proof:

See Appendix C. ∎

The intuition behind this result can be gathered from Fig. 7, where the CDF’s of the accumulated mutual information after 22 and 33 rounds are plotted for for SNR=10\mbox{\scriptsize\sf SNR}=10 and 4040 dB. If M=3M=3 the initial rate is set at the ϵ\epsilon-point of the CDF of i=13log(1+SNR|hi|2)\sum_{i=1}^{3}\log(1+\mbox{\scriptsize\sf SNR}|h_{i}|^{2}). Because the CDF’s overlap for M=2M=2 and M=3M=3 considerably when SNR=10\mbox{\scriptsize\sf SNR}=10 dB, there is a large probability that sufficient mutual information is accumulated after 22 rounds and thus early termination occurs. However, the overlap between these CDF’s disappears as SNR increases, because i=1klog(1+SNR|hi|2)klogSNR+i=1klog(|hi|2)\sum_{i=1}^{k}\log(1+\mbox{\scriptsize\sf SNR}|h_{i}|^{2})\approx k\log\mbox{\scriptsize\sf SNR}+\sum_{i=1}^{k}\log(|h_{i}|^{2}), and thus the early termination probability vanishes.

Although the H-ARQ advantage eventually vanishes, the advantage persists throughout a large SNR range and the Gaussian approximation (Section IV-A) can be used to quantify this. The probability of terminating in strictly less than MM rounds is approximated by:

[XM1]\displaystyle\mathbb{P}[X\leq M-1] \displaystyle\approx Q(μ(SNR)Mσ(SNR)Q1(ϵ)σM1)\displaystyle Q\left(\frac{\mu(\mbox{\scriptsize\sf SNR})-\sqrt{M}\sigma(\mbox{\scriptsize\sf SNR})Q^{-1}(\epsilon)}{\sigma\sqrt{M-1}}\right) (34)

In order for this approximation to be greater than one-half we require the numerator inside the QQ-function to be less than zero, which corresponds to

μ(SNR)σ(SNR)MQ1(ϵ)orMμ(SNR)σ(SNR)Q1(ϵ)\displaystyle\frac{\mu(\mbox{\scriptsize\sf SNR})}{\sigma(\mbox{\scriptsize\sf SNR})}\leq\sqrt{M}Q^{-1}(\epsilon)~~~\textrm{or}~~~\sqrt{M}\geq\frac{\mu(\mbox{\scriptsize\sf SNR})}{\sigma(\mbox{\scriptsize\sf SNR})Q^{-1}(\epsilon)} (35)

As SNR increases μ(SNR)\mu(\mbox{\scriptsize\sf SNR}) increases without bound whereas σ(SNR)\sigma(\mbox{\scriptsize\sf SNR}) converges to a constant. Thus μ(SNR)/σ(SNR)\mu(\mbox{\scriptsize\sf SNR})/\sigma(\mbox{\scriptsize\sf SNR}) increases quickly with SNR, which makes the probability of early termination vanish. From this we see that the H-ARQ advantage lasts longer (in terms of SNR) when MM is larger. The second inequality in (35) captures an alternative viewpoint, which is roughly the minimum value of MM required for H-ARQ to provide a significant advantage.

Motivated by naive intuition that the H-ARQ rate is monotonically increasing in the initial rate RinitR_{\textrm{init}}, up to this point we have chosen Rinit=MCϵM=AM1(ϵ)R_{\textrm{init}}=MC_{\epsilon}^{M}=A_{M}^{-1}(\epsilon) such that outage at H-ARQ termination is exactly ϵ\epsilon. However, it turns out that the H-ARQ rate is not always monotonic in RinitR_{\textrm{init}}. In Fig. 8, the H-ARQ rate Rinit𝔼[X]\frac{R_{\textrm{init}}}{{\mathbb{E}}[X]} is plotted versus initial rate RinitR_{\textrm{init}} for M=2,3,M=2,~3, and 44 for SNR=10\mbox{\scriptsize\sf SNR}=10 dB (left) and 3030 dB (right). At 1010 dB, Rinit𝔼[X]\frac{R_{\textrm{init}}}{{\mathbb{E}}[X]} monotonically increases with RinitR_{\textrm{init}} and thus there is no advantage to optimizing the initial rate. At 3030 dB, however, Rinit𝔼[X]\frac{R_{\textrm{init}}}{{\mathbb{E}}[X]} behaves non-monotonically with RinitR_{\textrm{init}}. We therefore define C~ϵIR,M(SNR)\tilde{C}_{\epsilon}^{\textrm{IR},M}(\mbox{\scriptsize\sf SNR}) as the maximized H-ARQ rate, where the maximization is performed over all values of initial rate RinitR_{\textrm{init}} such that the outage constraint ϵ\epsilon is not violated:

C~ϵIR,M(SNR)maxRinitAM1(ϵ)\displaystyle\tilde{C}_{\epsilon}^{\textrm{IR},M}(\mbox{\scriptsize\sf SNR})\triangleq\max_{R_{\textrm{init}}\leq A_{M}^{-1}(\epsilon)} Rinit𝔼[X]\displaystyle\frac{R_{\textrm{init}}}{{\mathbb{E}}[X]} (36)

The local maxima seen in Fig. 8 appear to preclude a closed form solution to this maximization. Although optimization of the initial rate provides an advantage over a certain SNR range, the following theorem shows that it does not provide an improvement in the high-SNR offset:

Theorem 4

H-ARQ with an optimized initial rate, i.e., C~ϵIR,M(SNR)\tilde{C}_{\epsilon}^{\textrm{IR},M}(\mbox{\scriptsize\sf SNR}), achieves the same high-SNR offset as unoptimized H-ARQ CϵIR,M(SNR)C_{\epsilon}^{\textrm{IR},M}(\mbox{\scriptsize\sf SNR})

limSNR[C~ϵIR,M(SNR)CϵIR,M(SNR)]=0\displaystyle\lim_{\mbox{\scriptsize\sf SNR}\rightarrow\infty}\left[\tilde{C}_{\epsilon}^{\textrm{IR},M}(\mbox{\scriptsize\sf SNR})-C_{\epsilon}^{\textrm{IR},M}(\mbox{\scriptsize\sf SNR})\right]=0 (37)

Furthermore, the only initial rate (ignoring o(1)o(1) terms) that achieves the correct offset is the unoptimized value Rinit=MCϵMR_{\textrm{init}}=MC_{\epsilon}^{M}.

Proof:

See Appendix D. ∎

In Fig. 9, rates with and without optimization of the initial rate are plotted for ϵ=0.01\epsilon=0.01 and M=2,6M=2,6. For M=2M=2 optimization begins to make a difference at the point where the unoptimized curve abruptly decreases towards CϵMC_{\epsilon}^{M} around 2525 dB, but this advantage vanishes around 5555 dB. For M=6M=6 the advantage of initial rate optimization comes about at a much higher SNR, consistent with (35). Convergence of C~ϵIR,6(SNR)\tilde{C}_{\epsilon}^{\textrm{IR},6}(\mbox{\scriptsize\sf SNR}) to CϵIR,6(SNR)C_{\epsilon}^{\textrm{IR},6}(\mbox{\scriptsize\sf SNR}) does eventually occur, but is not visible in the figure.

IV-D Scaling with Outage Constraint ϵ\epsilon

Another advantage of H-ARQ is that the H-ARQ rate is generally less sensitive to the desired outage probability ϵ\epsilon than an equivalent non-H-ARQ system. This advantage is clearly seen in Fig. 10, where the H-ARQ and non-H-ARQ rates are plotted versus ϵ\epsilon for M=5M=5 at SNR=0\mbox{\scriptsize\sf SNR}=0, 1010 and 2020 dB. When ϵ\epsilon is large (e.g., roughly around 0.50.5) H-ARQ provides almost no advantage: a large outage corresponds to a large initial rate, which in turn means early termination rarely occurs. However, for more reasonable values of ϵ\epsilon, the H-ARQ rate is roughly constant with respect to ϵ\epsilon whereas the non-H-ARQ rate decreases sharply as ϵ0\epsilon\rightarrow 0. The transmitted rate must be decreased in order to achieve a smaller ϵ\epsilon (with or without H-ARQ), but with H-ARQ this decrease is partially compensated by the accompanying decreasing in the number of rounds 𝔼[X]\mathbb{E}[X].

IV-E Chase Combining

If Chase combining is used, a packet is retransmitted whenever a NACK is received and the receiver performs maximal-ratio-combining (MRC) on all received packets. As a result, SNR rather the mutual information is accumulated over H-ARQ rounds and the outage probability is given by:

PoutCC,M(Rinit)=[log2(1+SNRi=1M|hi|2)Rinit].\displaystyle P_{\textrm{out}}^{\textrm{CC},M}(R_{\textrm{init}})={\mathbb{P}}\left[\log_{2}\left(1+\mbox{\scriptsize\sf SNR}\sum_{i=1}^{M}|h_{i}|^{2}\right)\leq R_{\textrm{init}}\right]. (38)

For outage ϵ\epsilon, the initial rate is Rinit=log2(1+Fϵ1(i=1M|hi|2)SNR)R_{\textrm{init}}=\log_{2}\left(1+F_{\epsilon}^{-1}\left(\sum_{i=1}^{M}|h_{i}|^{2}\right)\mbox{\scriptsize\sf SNR}\right).

Different from IR, the expected number of H-ARQ rounds in CC is not dependent on SNR and thus the average rate for outage ϵ\epsilon can be written in closed form:

CϵCC,M(SNR)=Rinit𝔼[X]=log2(1+Fϵ1(i=1M|hi|2)SNR)MeFϵ1(i=1M|hi|2)k=1M1(Mk)(Fϵ1(i=1M|hi|2))k1(k1)!,C_{\epsilon}^{\textrm{CC},M}(\mbox{\scriptsize\sf SNR})=\frac{R_{\textrm{init}}}{{\mathbb{E}}[X]}=\frac{\log_{2}\left(1+F_{\epsilon}^{-1}\left(\sum_{i=1}^{M}|h_{i}|^{2}\right)\mbox{\scriptsize\sf SNR}\right)}{M-e^{-F_{\epsilon}^{-1}\left(\sum_{i=1}^{M}|h_{i}|^{2}\right)}\sum_{k=1}^{M-1}(M-k)\frac{\left(F_{\epsilon}^{-1}\left(\sum_{i=1}^{M}|h_{i}|^{2}\right)\right)^{k-1}}{(k-1)!}}, (39)

where the denominator is 𝔼[X]{\mathbb{E}}[X]. According to (39), we can get the high SNR affine approximation as:

CϵCC,M(SNR)=1𝔼[X]log2SNR+1𝔼[X]log2(Fϵ1(i=1M|hi|2))+o(1).\displaystyle C_{\epsilon}^{\textrm{CC},M}(\mbox{\scriptsize\sf SNR})=\frac{1}{{\mathbb{E}}[X]}\log_{2}\mbox{\scriptsize\sf SNR}+\frac{1}{{\mathbb{E}}[X]}\log_{2}\left(F_{\epsilon}^{-1}\left(\sum_{i=1}^{M}|h_{i}|^{2}\right)\right)+o(1). (40)

Because 𝔼[X]>1{\mathbb{E}}[X]>1 for any positive outage value, the pre-log factor (i.e., multiplexing gain) is 1𝔼[X]\frac{1}{{\mathbb{E}}[X]} and thus is less than one. This implies that CC performs poorly at high SNR. This is to be expected because CC is essentially a repetition code, which is spectrally inefficient at high SNR. As with IR, the performance of CC at high SNR can be improved through rate optimization. At high SNR, the pre-log is critical and thus the initial rate should be selected so that 𝔼[X]{\mathbb{E}}[X] is close to one and thereby avoiding H-ARQ altogether. Even with optimization, CC is far inferior to IR at moderate and high SNR’s. On the other hand, CC performs reasonably well at low SNR. This is because log(1+x)x\log(1+x)\approx x for small values of xx, and thus SNR-accumulation is nearly equivalent to mutual information-accumulation. In Fig. 11 rate-optimized IR and CC are plotted for M=2,4M=2,4 and ϵ=0.01\epsilon=0.01, and the results are consistent with the above intuitions.

V Conclusion

In this paper we have studied the performance of hybrid-ARQ in the context of an open-loop/fast-fading system in which the transmission rate is adjusted as a function of the average SNR such that a target outage probability is not exceeded. The general findings are that H-ARQ provides a significant rate advantage relative to a system not using H-ARQ at reasonable SNR levels, and that H-ARQ provides a rate quite close to the ergodic capacity even when the channel selectivity is limited.

There appear to be some potentially interesting extensions of this work. Contemporary cellular systems utilize simple ARQ on top of H-ARQ, and it is not fully understood how to balance these reliability mechanisms; some results in this direction are presented in [17]. Although we have assumed error-free ACK/NACK feedback, such errors can be quite important (c.f., [35]) and merit further consideration. Finally, while we have considered only the mutual information of Gaussian inputs, it is of interest to extend the results to discrete constellations and possibly compare to the performance of actual codes.

VI Acknowledgments

The authors gratefully acknowledge Prof. Robert Heath for suggesting use of the Gaussian approximation, and Prof. Gerhard Wunder for suggesting use of Chebyshev’s inequality in Section III-A.

Appendix A PROOF OF THEOREM 22

Because CϵIR,M=M𝔼[X]CϵMC_{\epsilon}^{\textrm{IR},M}=\frac{M}{\mathbb{E}[X]}C_{\epsilon}^{M} and limMCϵM=μ\lim_{M\rightarrow\infty}C_{\epsilon}^{M}=\mu (Section III-A), we can prove limMCϵIR,M=μ\lim_{M\rightarrow\infty}C_{\epsilon}^{\textrm{IR},M}=\mu by showing limMM𝔼[X]=1\lim_{M\rightarrow\infty}\frac{M}{\mathbb{E}[X]}=1. Because 𝔼[X]M\mathbb{E}[X]\leq M, we can show this simply by showing 𝔼[X]\mathbb{E}[X] is of order MM. For notational convenience we define Yilog2(1+SNR|hi|2)Y_{i}\triangleq\log_{2}(1+\mbox{\scriptsize\sf SNR}|h_{i}|^{2}), and then have:

𝔼[X]\displaystyle\mathbb{E}[X] \displaystyle\geq 1+k=1M1[i=1kYiM(μσMϵ)]\displaystyle 1+\sum_{k=1}^{M-1}\mathbb{P}\left[\sum_{i=1}^{k}Y_{i}\leq M\left(\mu-\frac{\sigma}{\sqrt{M\epsilon}}\right)\right] (41)
\displaystyle\geq 1+k=1MM34[i=1kYiMμσMϵ]\displaystyle 1+\sum_{k=1}^{\lceil M-M^{\frac{3}{4}}\rceil}\mathbb{P}\left[\sum_{i=1}^{k}Y_{i}\leq M\mu-\sigma\sqrt{\frac{M}{\epsilon}}\right] (42)
\displaystyle\geq 1+MM34[i=1MM34YiMμσMϵ],\displaystyle 1+\lceil M-M^{\frac{3}{4}}\rceil\cdot\mathbb{P}\left[\sum_{i=1}^{\lceil M-M^{\frac{3}{4}}\rceil}Y_{i}\leq M\mu-\sigma\sqrt{\frac{M}{\epsilon}}\right], (43)

where the first line holds because 𝔼[X]\mathbb{E}[X] is increasing in RinitR_{\textrm{init}} and Rinit=MCϵMM(μσMϵ)R_{\text{init}}=MC_{\epsilon}^{M}\geq M\left(\mu-\frac{\sigma}{\sqrt{M\epsilon}}\right) from (13), the second holds because the summands are non-negative, and the last line because the summands are decreasing in kk. A direct application of the CLT shows [i=1MM34YiMμσMϵ]1\mathbb{P}\left[\sum_{i=1}^{\lceil M-M^{\frac{3}{4}}\rceil}Y_{i}\leq M\mu-\sigma\sqrt{\frac{M}{\epsilon}}\right]\rightarrow 1 as MM\rightarrow\infty, and thus, with some straightforward algebra, we have limMM𝔼[X]1\lim_{M\rightarrow\infty}\frac{M}{\mathbb{E}[X]}\rightarrow 1.

Appendix B PROOF OF (29)

Firstly, we relax the constraint on X~\tilde{X} (discreteness and finiteness) to define a new continuous random variable X^\hat{X}, which is distributed along the whole real line. The CDF of X^\hat{X} (for all real xx) is

(X^x)=Q(μ(Mx)MσQ1(ϵ)σx)\displaystyle\mathbb{P}\left(\hat{X}\leq x\right)=Q\left(\frac{\mu(M-x)-\sqrt{M}\sigma Q^{-1}(\epsilon)}{\sigma\sqrt{x}}\right) (44)

Now if we consider the distribution of X^(MσμQ1(ϵ)M)\hat{X}-\left(M-\frac{\sigma}{\mu}Q^{-1}(\epsilon)\sqrt{M}\right), we have

[X^(MσμQ1(ϵ)M)x]\displaystyle\mathbb{P}\left[\hat{X}-\left(M-\frac{\sigma}{\mu}Q^{-1}(\epsilon)\sqrt{M}\right)\leq x\right] =\displaystyle= Q(μxσMσμQ1(ϵ)M+x)\displaystyle Q\left(\frac{-\mu x}{\sigma\sqrt{M-\frac{\sigma}{\mu}Q^{-1}(\epsilon)\sqrt{M}+x}}\right) (45)

where the equality follows from (44). Notice as MM\rightarrow\infty, MσμQ1(ϵ)M+xM\sqrt{M-\frac{\sigma}{\mu}Q^{-1}(\epsilon)\sqrt{M}+x}\rightarrow\sqrt{M}, so

Q(μxσMσμQ1(ϵ)M+x)\displaystyle Q\left(\frac{-\mu x}{\sigma\sqrt{M-\frac{\sigma}{\mu}Q^{-1}(\epsilon)\sqrt{M}+x}}\right) \displaystyle\rightarrow Q(μxσM)=Φ(xσμM),\displaystyle Q\left(\frac{-\mu x}{\sigma\sqrt{M}}\right)=\Phi\left(\frac{x}{\frac{\sigma}{\mu}\sqrt{M}}\right), (46)

where Φ()\Phi(\cdot) is the standard normal CDF with zero mean and unit variance. So as MM\rightarrow\infty, the limiting distribution of X^\hat{X}, denoted by Φ^()\hat{\Phi}(\cdot), goes to 𝒩(MσμQ1(ϵ)M,(σμ)2M)\mathcal{N}\left(M-\frac{\sigma}{\mu}Q^{-1}(\epsilon)\sqrt{M},\left(\frac{\sigma}{\mu}\right)^{2}M\right), which is

Φ^(x)=Φ(x(MσμQ1(ϵ)M)σμM),for x\displaystyle\hat{\Phi}(x)=\Phi\left(\frac{x-\left(M-\frac{\sigma}{\mu}Q^{-1}(\epsilon)\sqrt{M}\right)}{\frac{\sigma}{\mu}\sqrt{M}}\right),\quad\quad\quad\quad\text{for $x\in\mathbb{R}$} (47)

Since 𝔼[X~]{\mathbb{E}}[\tilde{X}] is an approximation to 𝔼[X]{\mathbb{E}}[X], then we focus on evaluating 𝔼[X~]{\mathbb{E}}[\tilde{X}] for large MM:

𝔼[X~]\displaystyle{\mathbb{E}}[\tilde{X}] =\displaystyle= k=0M1(1[X~k])=Mk=1M1Φ~(k)=(a)Mk=1M1Φ^(k)\displaystyle\sum_{k=0}^{M-1}\left(1-\mathbb{P}[\tilde{X}\leq k]\right)=M-\sum_{k=1}^{M-1}\tilde{\Phi}(k)\overset{(a)}{=}M-\sum_{k=1}^{M-1}\hat{\Phi}(k) (48)
\displaystyle\approx M(1MΦ^(x)𝑑xk=1M1Φ^(k+1)Φ^(k)2)\displaystyle M-\left(\int_{1}^{M}\hat{\Phi}(x)dx-\sum_{k=1}^{M-1}\frac{\hat{\Phi}(k+1)-\hat{\Phi}(k)}{2}\right)
(b)\displaystyle\overset{(b)}{\approx} M(1MΦ(x(MσμQ1(ϵ)M)σμM)𝑑x0.5(1ϵ))\displaystyle M-\left(\int_{1}^{M}\Phi\left(\frac{x-\left(M-\frac{\sigma}{\mu}Q^{-1}(\epsilon)\sqrt{M}\right)}{\frac{\sigma}{\mu}\sqrt{M}}\right)dx-0.5(1-\epsilon)\right)
=\displaystyle= MM2σμQ1(ϵ)MMΦ(x(MσμQ1(ϵ)M)σμM)𝑑x\displaystyle M-\int_{M-2\frac{\sigma}{\mu}Q^{-1}(\epsilon)\sqrt{M}}^{M}\Phi\left(\frac{x-\left(M-\frac{\sigma}{\mu}Q^{-1}(\epsilon)\sqrt{M}\right)}{\frac{\sigma}{\mu}\sqrt{M}}\right)dx-
1M2σμQ1(ϵ)MΦ(x(MσμQ1(ϵ)M)σμM)𝑑x+0.5(1ϵ)\displaystyle\quad\ \int_{1}^{M-2\frac{\sigma}{\mu}Q^{-1}(\epsilon)\sqrt{M}}\Phi\left(\frac{x-\left(M-\frac{\sigma}{\mu}Q^{-1}(\epsilon)\sqrt{M}\right)}{\frac{\sigma}{\mu}\sqrt{M}}\right)dx+0.5(1-\epsilon)

where (a) holds since Φ~(k)=Φ^(k)\tilde{\Phi}(k)=\hat{\Phi}(k) when k=1,2,,M1k=1,2,\cdots,M-1 and (b) follows from Φ^(M)=1ϵ\hat{\Phi}(M)=1-\epsilon and Φ^(1)\hat{\Phi}(1) is negligible when MM is large enough. Actually, the first integral in (48) can be evaluated as σμQ1(ϵ)M\frac{\sigma}{\mu}Q^{-1}(\epsilon)\sqrt{M} because the expression inside the integral is symmetric with respect to MσμQ1(ϵ)MM-\frac{\sigma}{\mu}Q^{-1}(\epsilon)\sqrt{M} and Φ(x)+Φ(x)=1\Phi(x)+\Phi(-x)=1 for any xx\in\mathbb{R}. For large MM, the second integral in (48) can be approximated as:

σμMQ1(ϵ)MμσQ(x)𝑑x(a)σμMQ1(ϵ)Q(x)𝑑xσ2.5μMMμσex22x𝑑x\displaystyle\frac{\sigma}{\mu}\sqrt{M}\int_{Q^{-1}(\epsilon)}^{\frac{\sqrt{M}\mu}{\sigma}}Q\left(x\right)dx\overset{(a)}{\approx}\frac{\sigma}{\mu}\sqrt{M}\int_{Q^{-1}(\epsilon)}^{\infty}Q(x)dx-\frac{\sigma}{2.5\mu}\sqrt{M}\int_{\frac{\sqrt{M}\mu}{\sigma}}^{\infty}\frac{e^{-\frac{x^{2}}{2}}}{x}dx (49)
=\displaystyle= σμMQ1(ϵ)Q(x)𝑑xσ5μME1(Mμ22σ2)σμMQ1(ϵ)Q(x)𝑑x\displaystyle\frac{\sigma}{\mu}\sqrt{M}\int_{Q^{-1}(\epsilon)}^{\infty}Q(x)dx-\frac{\sigma}{5\mu}\sqrt{M}E_{1}\left(\frac{M\mu^{2}}{2\sigma^{2}}\right)\approx\frac{\sigma}{\mu}\sqrt{M}\int_{Q^{-1}(\epsilon)}^{\infty}Q(x)dx

where (a) follows from [36]: when xx is positive and large enough, Q(x)ex222.5xQ(x)\approx\frac{e^{-\frac{x^{2}}{2}}}{2.5x}. The last line holds because σ5μME1(Mμ22σ2)0\frac{\sigma}{5\mu}\sqrt{M}E_{1}\left(\frac{M\mu^{2}}{2\sigma^{2}}\right)\approx 0 when M is sufficiently large [37]. This finally yields:

𝔼[X]𝔼[X~]MσμQ1(ϵ)MσμMQ1(ϵ)Q(x)𝑑x+0.5(1ϵ).\displaystyle{\mathbb{E}}[X]\approx{\mathbb{E}}[\tilde{X}]\approx M-\frac{\sigma}{\mu}Q^{-1}(\epsilon)\sqrt{M}-\frac{\sigma}{\mu}\sqrt{M}\int_{Q^{-1}(\epsilon)}^{\infty}Q(x)dx+0.5(1-\epsilon). (50)

Appendix C PROOF OF THEOREM 33

In order to prove the theorem, we first establish the following lemma:

Lemma 1

If the initial rate RinitR_{\textrm{init}} has a pre-log of rr, i.e., limSNRRinitlog2SNR=r\lim_{\mbox{\scriptsize\sf SNR}\rightarrow\infty}\frac{R_{\textrm{init}}}{\log_{2}\mbox{\scriptsize\sf SNR}}=r, then

limSNR[i=1klog2(1+SNR|hi|2)Rinit]=\displaystyle\lim_{\mbox{\scriptsize\sf SNR}\rightarrow\infty}\mathbb{P}\left[\sum_{i=1}^{k}\log_{2}(1+\mbox{\scriptsize\sf SNR}|h_{i}|^{2})\leq R_{\textrm{init}}\right]= 1,\displaystyle 1, for k<rk<r and k+k\in\mathbb{Z}^{+} (51a)
limSNR[i=1klog2(1+SNR|hi|2)Rinit]=\displaystyle\lim_{\mbox{\scriptsize\sf SNR}\rightarrow\infty}\mathbb{P}\left[\sum_{i=1}^{k}\log_{2}(1+\mbox{\scriptsize\sf SNR}|h_{i}|^{2})\leq R_{\textrm{init}}\right]= 0,\displaystyle 0, for k>rk>r and k+k\in\mathbb{Z}^{+} (51b)
Proof:

For notational convenience we use γi\gamma_{i} to denote the quantity SNR|hi|2\mbox{\scriptsize\sf SNR}|h_{i}|^{2}. We prove the first result by using the fact that i=1klog2(1+γi)klog2(1+maxi=1,,kγi)\sum_{i=1}^{k}\log_{2}(1+\gamma_{i})\leq k\log_{2}(1+\max_{i=1,\ldots,k}\gamma_{i}) which yields:

[i=1klog2(1+γi)Rinit]\displaystyle\mathbb{P}\left[\sum_{i=1}^{k}\log_{2}(1+\gamma_{i})\leq R_{\textrm{init}}\right] \displaystyle\geq [klog2(1+maxi=1,,kγi)Rinit]\displaystyle{\mathbb{P}}\left[k\log_{2}\left(1+\max_{i=1,\ldots,k}\gamma_{i}\right)\leq R_{\textrm{init}}\right] (52)
=\displaystyle= (1e2Rinitk1SNR)k=(1e2Rinitklog2(SNR)+1SNR)k,\displaystyle\left(1-e^{-\frac{2^{\frac{R_{\textrm{init}}}{k}}-1}{\mbox{\scriptsize\sf SNR}}}\right)^{k}=\left(1-e^{-2^{\frac{R_{\textrm{init}}}{k}-\log_{2}(\mbox{\scriptsize\sf SNR})}+\frac{1}{\mbox{\scriptsize\sf SNR}}}\right)^{k},

where the first equality follows because the γi\gamma_{i}’s are i.i.d. exponential with mean SNR. The exponent Rinitklog2(SNR)\frac{R_{\textrm{init}}}{k}-\log_{2}(\mbox{\scriptsize\sf SNR}) behaves as rkklog2(SNR)\frac{r-k}{k}\log_{2}(\mbox{\scriptsize\sf SNR}). If k<rk<r this exponent goes to infinity. Because the 1SNR\frac{1}{\mbox{\scriptsize\sf SNR}} term vanishes, ee is raised to a power converging to -\infty, and thus (52) converges to 11. This yields the result in (51b). To prove (51b) we combine the property i=1klog2(1+γi)klog2(1+mini=1,,kγi)\sum_{i=1}^{k}\log_{2}(1+\gamma_{i})\geq k\log_{2}(1+\min_{i=1,\ldots,k}\gamma_{i}) with the same argument as above:

[i=1klog2(1+γi)Rinit]\displaystyle\mathbb{P}\left[\sum_{i=1}^{k}\log_{2}(1+\gamma_{i})\leq R_{\textrm{init}}\right] \displaystyle\leq [klog2(1+mini=1,,kγi)Rinit]=1ek(2Rinitklog2(SNR)1SNR)\displaystyle{\mathbb{P}}\left[k\log_{2}\left(1+\min_{i=1,\ldots,k}\gamma_{i}\right)\leq R_{\textrm{init}}\right]=1-e^{-k\left(2^{\frac{R_{\textrm{init}}}{k}-\log_{2}(\mbox{\scriptsize\sf SNR})}-\frac{1}{\mbox{\scriptsize\sf SNR}}\right)}

If k>rk>r, ee is raised to a power that converges to 0 and thus we get (51b). ∎

We now move on to the proof of the theorem. Using the expression for 𝔼[X]{\mathbb{E}}[X] in (21) we have:

limSNR𝔼[X]\displaystyle\lim_{\mbox{\scriptsize\sf SNR}\rightarrow\infty}{\mathbb{E}}[X] =\displaystyle= limSNR1+[log2(1+γ1)Rinit]++[i=1L1log2(1+γi)Rinit].\displaystyle\lim_{\mbox{\scriptsize\sf SNR}\rightarrow\infty}1+{\mathbb{P}}\left[\log_{2}(1+\gamma_{1})\leq R_{\textrm{init}}\right]+\ldots+{\mathbb{P}}\left[\sum_{i=1}^{L-1}\log_{2}(1+\gamma_{i})\leq R_{\textrm{init}}\right]. (53)

Because Rinit=MCϵMR_{\textrm{init}}=MC_{\epsilon}^{M} has a pre-log of MM, the lemma implies that each of the terms converge to one and thus limSNR𝔼[X]=M\lim_{\mbox{\scriptsize\sf SNR}\rightarrow\infty}{\mathbb{E}}[X]=M.

In terms of the high-SNR offset we have:

limSNR[CϵIR,M(SNR)CϵM(SNR)]\displaystyle\lim_{\mbox{\scriptsize\sf SNR}\rightarrow\infty}[C_{\epsilon}^{\textrm{IR},M}(\mbox{\scriptsize\sf SNR})-C_{\epsilon}^{M}(\mbox{\scriptsize\sf SNR})] =\displaystyle= limSNR[RinitCϵM(SNR)𝔼[X]]CϵM(SNR)𝔼[X]\displaystyle\lim_{\mbox{\scriptsize\sf SNR}\rightarrow\infty}\left[\frac{R_{\textrm{init}}}{C_{\epsilon}^{M}(\mbox{\scriptsize\sf SNR})}-{\mathbb{E}}[X]\right]\frac{C_{\epsilon}^{M}(\mbox{\scriptsize\sf SNR})}{{\mathbb{E}}[X]} (54)
(a)\displaystyle\stackrel{{\scriptstyle(a)}}{{\leq}} limSNR[M𝔼[X]]CϵM(SNR)\displaystyle\lim_{\mbox{\scriptsize\sf SNR}\rightarrow\infty}\left[M-{\mathbb{E}}[X]\right]C_{\epsilon}^{M}(\mbox{\scriptsize\sf SNR})
=\displaystyle= limSNR[M𝔼[X]](log2(SNR)+O(1))\displaystyle\lim_{\mbox{\scriptsize\sf SNR}\rightarrow\infty}\left[M-{\mathbb{E}}[X]\right](\log_{2}(\mbox{\scriptsize\sf SNR})+O(1))
=(b)\displaystyle\stackrel{{\scriptstyle(b)}}{{=}} limSNR[M𝔼[X]]log2(SNR),\displaystyle\lim_{\mbox{\scriptsize\sf SNR}\rightarrow\infty}\left[M-{\mathbb{E}}[X]\right]\log_{2}(\mbox{\scriptsize\sf SNR}),

where (a) holds because 𝔼[X]1{\mathbb{E}}[X]\geq 1 and Rinit=MCϵM(SNR)R_{\textrm{init}}=MC_{\epsilon}^{M}(\mbox{\scriptsize\sf SNR}), and (b) holds because 𝔼[X]M{\mathbb{E}}[X]\rightarrow M and therefore the O(1)O(1) term does not effect the limit.

Because the additive terms defining 𝔼[X]{\mathbb{E}}[X] in (21) are decreasing, we lower bound 𝔼[X]{\mathbb{E}}[X] as

𝔼[X]\displaystyle{\mathbb{E}}[X] \displaystyle\geq M[i=1M1log2(1+γi)Rinit]M(1e2RinitM1log2(SNR)+1SNR)M1,\displaystyle M\mathbb{P}\left[\sum_{i=1}^{M-1}\log_{2}(1+\gamma_{i})\leq R_{\textrm{init}}\right]\geq M\left(1-e^{-2^{\frac{R_{\textrm{init}}}{M-1}-\log_{2}(\mbox{\scriptsize\sf SNR})}+\frac{1}{\mbox{\scriptsize\sf SNR}}}\right)^{M-1}, (55)

where the last inequality follows from (52). Plugging this bound into (54) yields:

limSNR[M𝔼[X]]log2(SNR)\displaystyle\lim_{\mbox{\scriptsize\sf SNR}\rightarrow\infty}\left[M-{\mathbb{E}}[X]\right]\log_{2}(\mbox{\scriptsize\sf SNR}) \displaystyle\leq limSNRM(1(1e2RinitM1log2SNR+1SNR)M1)log2(SNR)\displaystyle\lim_{\mbox{\scriptsize\sf SNR}\rightarrow\infty}M\left(1-\left(1-e^{-2^{\frac{R_{\textrm{init}}}{M-1}-\log_{2}\mbox{\scriptsize\sf SNR}}+\frac{1}{\mbox{\scriptsize\sf SNR}}}\right)^{M-1}\right)\log_{2}(\mbox{\scriptsize\sf SNR})
=\displaystyle= limSNRMlog2(SNR)j=1M1(M1j)(e2RinitM1log2SNR+1SNR)j\displaystyle\lim_{\mbox{\scriptsize\sf SNR}\rightarrow\infty}-M\log_{2}(\mbox{\scriptsize\sf SNR})\sum_{j=1}^{M-1}{M-1\choose j}\left(-e^{-2^{\frac{R_{\textrm{init}}}{M-1}-\log_{2}\mbox{\scriptsize\sf SNR}}+\frac{1}{\mbox{\scriptsize\sf SNR}}}\right)^{j}

where the last line follows from the binomial expansion. Because RinitR_{\textrm{init}} has a pre-log of MM, each of the terms is of the form αlog2(SNR)eSNRβ\alpha\log_{2}(\mbox{\scriptsize\sf SNR})e^{-\mbox{\scriptsize\sf SNR}^{\beta}} for some β>0\beta>0 and some constant α\alpha, and thus the RHS of the last line is zero. Because CϵIR,M(SNR)CϵM(SNR)C_{\epsilon}^{\textrm{IR},M}(\mbox{\scriptsize\sf SNR})\geq C_{\epsilon}^{M}(\mbox{\scriptsize\sf SNR}), this shows limSNR[CϵIR,M(SNR)CϵM(SNR)]=0\lim_{\mbox{\scriptsize\sf SNR}\rightarrow\infty}[C_{\epsilon}^{\textrm{IR},M}(\mbox{\scriptsize\sf SNR})-C_{\epsilon}^{M}(\mbox{\scriptsize\sf SNR})]=0.

Appendix D PROOF OF THEOREM 44

In order to prove that rate optimization does not increase the high-SNR offset, we need to consider all possible choices of the initial rate RinitR_{\textrm{init}}. We begin by considering all choices of RinitR_{\textrm{init}} with a pre-log of MM, i.e., satisfying limSNRRinitlog2SNR=M\lim_{\mbox{\scriptsize\sf SNR}\rightarrow\infty}\frac{R_{\textrm{init}}}{\log_{2}\mbox{\scriptsize\sf SNR}}=M. Because the proof of convergence of 𝔼[X]{\mathbb{E}}[X] in the proof of Theorem 33 only requires RinitR_{\textrm{init}} to have a pre-log of MM, we have 𝔼[X]M{\mathbb{E}}[X]\rightarrow M. To bound the offset, we write the rate as Rinit=MCϵM(SNR)f(SNR)R_{\textrm{init}}=MC_{\epsilon}^{M}(\mbox{\scriptsize\sf SNR})-f(\mbox{\scriptsize\sf SNR}) where f(SNR)f(\mbox{\scriptsize\sf SNR}) is strictly positive and sub-logarithmic (because the pre-log is MM), and thus the rate offset is:

MCϵM(SNR)f(SNR)𝔼[X]CϵM(SNR)\displaystyle\frac{MC_{\epsilon}^{M}(\mbox{\scriptsize\sf SNR})-f(\mbox{\scriptsize\sf SNR})}{{\mathbb{E}}[X]}-C_{\epsilon}^{M}(\mbox{\scriptsize\sf SNR}) =\displaystyle= (M𝔼[X])CϵM(SNR)𝔼[X]f(SNR)𝔼[X].\displaystyle\left(M-{\mathbb{E}}[X]\right)\frac{C_{\epsilon}^{M}(\mbox{\scriptsize\sf SNR})}{{\mathbb{E}}[X]}-\frac{f(\mbox{\scriptsize\sf SNR})}{{\mathbb{E}}[X]}. (56)

By the same argument as in Appendix C, the first term is upper bounded by zero in the limit. Therefore:

limSNRRinit𝔼[X]CϵM(SNR)limSNRf(SNR)𝔼[X].\displaystyle\lim_{\mbox{\scriptsize\sf SNR}\rightarrow\infty}\frac{R_{\textrm{init}}}{{\mathbb{E}}[X]}-C_{\epsilon}^{M}(\mbox{\scriptsize\sf SNR})\leq\lim_{\mbox{\scriptsize\sf SNR}\rightarrow\infty}-\frac{f(\mbox{\scriptsize\sf SNR})}{{\mathbb{E}}[X]}. (57)

Relative to CϵM(SNR)C_{\epsilon}^{M}(\mbox{\scriptsize\sf SNR}), the offset is either strictly negative (if f(SNR)f(\mbox{\scriptsize\sf SNR}) is bounded) or goes to negative infinity. In either case a strictly worse offset is achieved.

Let us now consider pre-log factors, denoted by rr, strictly smaller than MM (i.e., r<Mr<M). We first consider non-integer values of rr. By Lemma 1 the first 1+r1+\lfloor r\rfloor terms in the expression for 𝔼[X]{\mathbb{E}}[X] converge to one while the other terms go to zero. Therefore 𝔼[X]1+r=r{\mathbb{E}}[X]\rightarrow 1+\lfloor r\rfloor=\lceil r\rceil. The long-term transmitted rate, given by Rinit𝔼[X]\frac{R_{\textrm{init}}}{{\mathbb{E}}[X]}, therefore has pre-log equal to rr\frac{r}{\lceil r\rceil}. This quantity is strictly smaller than one, and therefore a non-integer rr yields average rate with a strictly suboptimal pre-log factor.

We finally consider integer values of rr satisfying r<Mr<M. In this case we must separately consider rates of the form Rinit=rlogSNR±O(1)R_{\textrm{init}}=r\log\mbox{\scriptsize\sf SNR}\pm O(1) versus those of the form Rinit=rlogSNR±o(logSNR)R_{\textrm{init}}=r\log\mbox{\scriptsize\sf SNR}\pm o(\log\mbox{\scriptsize\sf SNR}). Here we use o(logSNR)o(\log\mbox{\scriptsize\sf SNR}) to denote terms that are sub-logarithmic and that go to positive infinity; note that we also explicitly denote the sign of the O(1)O(1) or o(logSNR)o(\log\mbox{\scriptsize\sf SNR}) terms. We first consider Rinit=rlogSNR±O(1)R_{\textrm{init}}=r\log\mbox{\scriptsize\sf SNR}\pm O(1). By Lemma 1 the terms corresponding to k=0,,r1k=0,\ldots,r-1 in the expression for 𝔼[X]{\mathbb{E}}[X] converge to one, while the terms corresponding to k=r+1,,M1k=r+1,\ldots,M-1 go to one. Furthermore, the term corresponding to k=rk=r converges to a strictly positive constant denoted δ\delta:

δ\displaystyle\delta =\displaystyle= limSNR[i=1rlog2(1+SNR|hi|2)rlog2(SNR)±O(1)]\displaystyle\lim_{\mbox{\scriptsize\sf SNR}\rightarrow\infty}\mathbb{P}\left[\sum_{i=1}^{r}\log_{2}(1+\mbox{\scriptsize\sf SNR}|h_{i}|^{2})\leq r\log_{2}(\mbox{\scriptsize\sf SNR})\pm O(1)\right] (58)
=\displaystyle= limSNR[i=1rlog2(SNR|hi|2)rlog2(SNR)±O(1)]=[i=1rlog2(|hi|2)±O(1)]\displaystyle\lim_{\mbox{\scriptsize\sf SNR}\rightarrow\infty}\mathbb{P}\left[\sum_{i=1}^{r}\log_{2}(\mbox{\scriptsize\sf SNR}|h_{i}|^{2})\leq r\log_{2}(\mbox{\scriptsize\sf SNR})\pm O(1)\right]=\mathbb{P}\left[\sum_{i=1}^{r}\log_{2}(|h_{i}|^{2})\leq\pm O(1)\right]

where the second equality follows from [26]. δ\delta is strictly positive because the support of log2(|hi|2)\log_{2}(|h_{i}|^{2}), and thus of the sum, is the entire real line. As a result, 𝔼[X]r+δ{\mathbb{E}}[X]\rightarrow r+\delta, which is strictly larger than rr. The pre-log of the average rate is then rr+δ<1\frac{r}{r+\delta}<1, and so this choice of initial rate is also sub-optimal.

If Rinit=rlogSNR+o(logSNR)R_{\textrm{init}}=r\log\mbox{\scriptsize\sf SNR}+o(\log\mbox{\scriptsize\sf SNR}) the terms in the 𝔼[X]{\mathbb{E}}[X] expression behave largely the same as above except that the k=rk=r term converges to one because the O(1)O(1) term in (58) is replaced with a quantity tending to positive infinity. Therefore 𝔼[X]r+1{\mathbb{E}}[X]\rightarrow r+1, which also yields a sub-optimal pre-log of rr+1<1\frac{r}{r+1}<1.

We are thus finally left with the choice Rinit=rlogSNRo(logSNR)R_{\textrm{init}}=r\log\mbox{\scriptsize\sf SNR}-o(\log\mbox{\scriptsize\sf SNR}). This is the same as the above case except that the k=rk=r term converges to zero. Therefore 𝔼[X]r{\mathbb{E}}[X]\rightarrow r, and thus the achieved pre-log is one. In this case we must explicitly consider the rate offset, which is written as:

Rinit𝔼[X]CϵM\displaystyle\frac{R_{\textrm{init}}}{{\mathbb{E}}[X]}-C_{\epsilon}^{M} =\displaystyle= rlogSNRo(logSNR)𝔼[X]CϵM=rlogSNR𝔼[X]CϵMo(logSNR)𝔼[X].\displaystyle\frac{r\log\mbox{\scriptsize\sf SNR}-o(\log\mbox{\scriptsize\sf SNR})}{{\mathbb{E}}[X]}-C_{\epsilon}^{M}=\frac{r\log\mbox{\scriptsize\sf SNR}}{{\mathbb{E}}[X]}-C_{\epsilon}^{M}-\frac{o(\log\mbox{\scriptsize\sf SNR})}{{\mathbb{E}}[X]}. (59)

Using essentially the same proof as for Theorem 33, the difference between the first two terms is upper bounded by zero in the limit of SNR\mbox{\scriptsize\sf SNR}\rightarrow\infty. Thus, the rate offset goes to negative infinity.

Because we have shown each choice of RinitR_{\textrm{init}} (except Rinit=MCϵMR_{\textrm{init}}=MC_{\epsilon}^{M}) achieves either a strictly sub-optimal pre-log or the correct pre-log but a strictly negative offset, this proves both parts of the theorem.

References

  • [1] D. Chase, “Class of algorithms for decoding block codes with channel measurement information,” IEEE Trans. Inf. Theory, vol. 18, no. 1, pp. 170–182, 1972.
  • [2] M. Sauter, Communications Systems for the Mobile Information Society. John Wiley, 2006.
  • [3] J. G. Andrews, A. Ghosh, and R. Muhamed, Fundamentals of Wimax: Understanding Broadband Wireless Networking, 1st ed. Prentice Hall PTR, 2007.
  • [4] L. Favalli, M. Lanati, and P. Savazzi, Wireless Communications 2007 CNIT Thyrrenian Symposium, 1st ed. Springer, 2007.
  • [5] S. Sesia, G. Caire, and G. Vivier, “Incremental redundancy hybrid ARQ schemes based on low-density parity-check codes,” IEEE Trans. Commun., vol. 52, no. 8, pp. 1311–1321, Aug. 2004.
  • [6] D. Costello, J. Hagenauer, H. Imai, and S. Wicker, “Applications of error-control coding,” IEEE Trans. Inf. Theory, vol. 44, no. 6, pp. 2531–2560, Oct. 1998.
  • [7] E. Soljanin, N. Varnica, and P. Whiting, “Incremental redundancy hybrid ARQ with LDPC and raptor codes,” submitted to IEEE Trans. Inf. Theory, 2005.
  • [8] C. Lott, O. Milenkovic, and E. Soljanin, “Hybrid ARQ: theory, state of the art and future directions,” IEEE Inf. Theory Workshop, pp. 1–5, Jul. 2007.
  • [9] G. Caire and D. Tuninetti, “The throughput of hybrid-ARQ protocols for the Gaussian collision channel,” IEEE Trans. Inf. Theory, vol. 47, no. 4, pp. 1971–1988, Jul. 2001.
  • [10] H. E. Gamal, G. Caire, and M. E. Damen, “The MIMO ARQ channel: diversity-multiplexing-delay tradeoff,” IEEE Trans. Inf. Theory, pp. 3601–3621, 2006.
  • [11] A. Chuang, A. Guillén i Fàbregas, L. K. Rasmussen, and I. B. Collings, “Optimal throughput-diversity-delay tradeoff in MIMO ARQ block-fading channels,” IEEE Trans. Inf. Theory, vol. 54, no. 9, pp. 3968–3986, Sep. 2008.
  • [12] C. Shen, T. Liu, and M. P. Fitz, “Aggressive transmission with ARQ in quasi-static fading channels,” Proc. of IEEE Int’l Conf. in Commun. (ICC’08), pp. 1092–1097, May 2008.
  • [13] R. Narasimhan, “Throughput-delay performance of half-duplex hybrid-ARQ relay channels,” Proc. of IEEE Int’l Conf. in Commun. (ICC’08), pp. 986–990, May 2008.
  • [14] X. Tang, R. Liu, P. Spasojevic, and H. V. Poor, “On the Throughput of Secure Hybrid-ARQ Protocols for Gaussian Block-Fading Channels,” IEEE Trans. Inf. Theory, vol. 55, no. 4, pp. 1575–1591, Apr. 2009.
  • [15] L. Zheng and D. Tse, “Diversity and multiplexing: a fundamental tradeoff in multiple-antenna channels,” IEEE Trans. Inf. Theory, vol. 49, no. 5, pp. 1073–1096, May 2003.
  • [16] A. Lozano and N. Jindal, “Transmit diversity v. spatial multiplexing in modern MIMO systems,” submitted to IEEE Trans. Wireless Commun., 2008.
  • [17] P. Wu and N. Jindal, “Coding Versus ARQ in Fading Channels: How reliable should the PHY be?” submitted to Proc. of IEEE Globe Commum. Conf. (Globecom’09), 2009.
  • [18] T. A. Courtade and R. D. Wesel, “A cross-layer perspective on rateless coding for wirelss channels,” to appear at Proc. of IEEE Int’l Conf. in Commun. (ICC’09), Jun. 2009.
  • [19] D. Tse and P. Viswanath, Fundamentals of Wireless Communications. Cambridge University, 2005.
  • [20] G. Carie, G. Taricco, and E. Biglieri, “Optimum power control over fading channels,” IEEE Trans. Inf. Theory, vol. 45, no. 5, pp. 1468–1489, Jul. 1999.
  • [21] A. Guillén i Fàbregas and G. Caire, “Coded modulation in the block-fading channel: coding theorems and code construction,” IEEE Trans. Inf. Theory, vol. 52, no. 1, pp. 91–114, Jan. 2006.
  • [22] N. Prasad and M. K. Varanasi, “Outage theorems for MIMO block-fading channels,” IEEE Trans. Inf. Theory, vol. 52, no. 12, pp. 5284–5296, Dec. 2006.
  • [23] E. Malkamäki and H. Leib, “Coded diversity on block-fading channels,” IEEE Trans. Inf. Theory, vol. 45, no. 2, pp. 771–781, Mar. 1999.
  • [24] S. Verdú and T. S. Han, “A general formula for channel capacity,” IEEE Trans. Inf. Theory, vol. 40, no. 4, pp. 1147–1157, Jul. 1994.
  • [25] S. Shamai and S. Verdú, “The impact of frequency-flat fading on the spectral efficiency of CDMA,” IEEE Trans. Inf. Theory, vol. 47, no. 5, pp. 1302–1327, May 2001.
  • [26] N. Prasad and M. Varanasi, “MIMO outage capacity in the high SNR regime,” Proc. of IEEE Int’l Symp. on Inform. Theory (ISIT’05), pp. 656–660, Sep. 2005.
  • [27] A. Lozano, A. M. Tulino, and S. Verdú, “High-SNR power offset in multiantenna communication,” IEEE Trans. Inf. Theory, vol. 51, no. 12, pp. 4134–4151, Dec. 2005.
  • [28] J. Salo, H. E. Sallabi, and P. Vainikainen, “The distribution of the product of independent Rayleigh random variables,” IEEE Trans. Antennas Propag., vol. 54, no. 2, pp. 639–643, Feb. 2006.
  • [29] I. S. Gradshteyn and I. M. Ryzhik, Table of Integrals, Series, and Products, 5th ed. Academic, 1994.
  • [30] G. Barriac and U. Madhow, “Characterizing outage rates for space-time communication over wideband channels,” IEEE Trans. Commun., vol. 52, no. 4, pp. 2198–2207, Dec. 2004.
  • [31] P. J. Smith and M. Shafi, “On a Gaussian approximation to the capacity of wireless MIMO systems,” Proc. of IEEE Int’l Conf. in Commun. (ICC’02), pp. 406–410, Apr. 2002.
  • [32] M. S. Alouini and A. J. Goldsmith, “Capacity of Rayleigh fading channels under different adaptive transmission and diversity-combining techniques,” IEEE Trans. Veh. Technol., vol. 48, no. 4, pp. 1165–1181, Jul. 1999.
  • [33] M. R. McKay, P. J. Smith, H. A. Suraweera, and I. B. Collings, “On the mutual information distribution of OFDM-based spatial multiplexing: exact variance and outage approximation,” IEEE Trans. Inf. Theory, vol. 54, no. 7, pp. 3260–3278, 2008.
  • [34] B. Fristedt and L. Gray, A Modern Approach to Probability Theory. Birkhäuser Boston, 1997.
  • [35] M. Meyer, H. Wiemann, M. Sagfors, J. Torsner, and J.-F. Cheng, “ARQ concept for the UMTS long-term evolution,” Proc. of IEEE Vehic. Technol. Conf. (VTC’2006 Fall), Sep. 2006.
  • [36] N. Kingsbury, “Approximation formula for the Gaussian error integral, Q(x),” http://cnx.org/content/m11067/latest/.
  • [37] M. Abramowitz and I. A. Stegun, Handbook of Mathematical Functions with Formulas, Graphs, and Mathematical Tables. Dover Publications, 1964.
Refer to caption
Figure 1: High SNR rate offset \mathcal{L}_{\infty} (bps/Hz) versus diversity order LL for ϵ=0.01\epsilon=0.01
Refer to caption
Figure 2: CDF’s of mutual information (12i=12log2(1+SNR|hi|2)\frac{1}{2}\sum_{i=1}^{2}\log_{2}(1+\mbox{\scriptsize\sf SNR}|h_{i}|^{2}) and 110i=110log2(1+SNR|hi|2)\frac{1}{10}\sum_{i=1}^{10}\log_{2}(1+\mbox{\scriptsize\sf SNR}|h_{i}|^{2}) respectively) for L=2L=2 and L=10L=10 at SNR=0\mbox{\scriptsize\sf SNR}=0, 1010, and 2020 dB
Refer to caption
Figure 3: ϵ\epsilon-outage capacity CϵLC_{\epsilon}^{L} (bps/Hz) versus SNR (dB) for ϵ=0.01\epsilon=0.01
Refer to caption
Figure 4: Ergodic capacity - ϵ\epsilon-outage capacity difference ΔEC - FD\Delta_{\textrm{EC - FD}} (bps/Hz) versus diversity order L, at SNR=20\mbox{\scriptsize\sf SNR}=20 dB
Refer to caption
Figure 5: Ergodic capacity, H-ARQ rate, and non-H-ARQ rate (bps/Hz) versus SNR (dB) for ϵ=0.01\epsilon=0.01
Refer to caption
Figure 6: Ergodic capacity - H-ARQ rate difference ΔEC-IR\Delta_{\textrm{EC-IR}} (bps/Hz) and ergodic capacity - non-H-ARQ rate difference ΔEC-FD\Delta_{\textrm{EC-FD}} (bps/Hz) versus MM for ϵ=0.01\epsilon=0.01 at SNR=10\mbox{\scriptsize\sf SNR}=10 dB
Refer to caption
Figure 7: CDF’s of accumulated mutual information over 22 and 33 rounds (random variables i=12log2(1+SNR|hi|2)\sum_{i=1}^{2}\log_{2}(1+\mbox{\scriptsize\sf SNR}|h_{i}|^{2}) and i=13log2(1+SNR|hi|2)\sum_{i=1}^{3}\log_{2}(1+\mbox{\scriptsize\sf SNR}|h_{i}|^{2}), respectively) at SNR=10\mbox{\scriptsize\sf SNR}=10 and 4040 dB
Refer to caption
(a) SNR=10\mbox{\scriptsize\sf SNR}=10 dB
Refer to caption
(b) SNR=30\mbox{\scriptsize\sf SNR}=30 dB
Figure 8: H-ARQ rate Rinit𝔼[X]\frac{R_{\textrm{init}}}{\mathbb{E}[X]} (bps/Hz) versus initial rate RinitR_{\textrm{init}} (bps/Hz)
Refer to caption
Figure 9: H-ARQ rate with/without an optimized initial rate (bps/Hz) and non-H-ARQ rate (bps/Hz) versus SNR (dB) for ϵ=0.01\epsilon=0.01
Refer to caption
Figure 10: H-ARQ rate (bps/Hz) and non-H-ARQ rate (bps/Hz) versus outage probability ϵ\epsilon for M=5M=5 at SNR=0\mbox{\scriptsize\sf SNR}=0 and 1010 and 2020 dB
Refer to caption
Figure 11: Optimized H-ARQ rate C~ϵIR,M\tilde{C}_{\epsilon}^{\textrm{IR},M} (bps/Hz) and C~ϵCC,M\tilde{C}_{\epsilon}^{\textrm{CC},M} (bps/Hz) versus SNR (dB) for ϵ=0.01\epsilon=0.01