9
1160 IEEE TRANSACTIONS ON INFORMATION THEORY, VOL. 53, NO. 3, MARCH 2007 [15] M. Pohst and H. Zassenhaus, Algorithmic algebraic number theory, En- cyclopedia of Mathematics and its Applications. Cambridge, U.K.: Cambridge University Press, 1989. [16] J. Proakis, Digital Communications, 4 ed. New York: McGrawHill, 2000. [17] P. Sebastian, H. Sampath, and A. Paulraj, “Adaptive modulation for multiple antenna systems,” in Proc. Asilomar Conf. on Signals, Syst. Comput., Oct. 2000, vol. 1, pp. 506–510. [18] G. Taricco and E. Biglieri, “Exact pairwise error probability of space- time codes,” IEEE Trans. Inf. Theory, vol. 48, no. 2, pp. 510–513, Feb. 2002. [19] ——, “Correction to ëxact pairwise error probability of space-time codes,” IEEE Trans. Inf. Theory, vol. 49, no. 3, pp. 766–766, Mar. 2003. [20] V. Tarokh, N. Seshadri, and A. R. Calderbank, “Space-time codes for high data rate wireless communication: Performance criterion and code construction,” IEEE Trans. Inf. Theory, vol. 44, no. 2, pp. 744–765, Mar. 1998. [21] V. Tarokh, A. Vardy, and K. Zeger, “Universal bound on the perfor- mance of lattice codes,” IEEE Trans. Inf. Theory, vol. 45, no. 2, pp. 670–681, Mar. 1999. [22] I. E. Telatar, “Capacity of multi-antenna Gaussian channels,” European Trans. Telecommun., vol. 10, no. 6, pp. 585–595, Nov.-Dec. 1999. [23] E. Viterbo and J. Boutros, “A universal lattice code decoder for fading channels,” IEEE Trans. Inf. Theory, vol. 45, no. 5, pp. 1639–1642, Jul. 1999. [24] E. Viterbo and E. Biglieri, “Computing the Voronoi cell of a lattice: The diamond-cutting algorithm,” IEEE Trans. Inf. Theory, vol. 42, no. 1, pp. 161–171, Jan. 1996. Information-Bearing Noncoherently Modulated Pilots for MIMO Training Yingqun Yu, Student Member, IEEE, Georgios B. Giannakis, Fellow, IEEE, and Nihar Jindal, Member, IEEE Abstract—In this correspondence, we deal with noncoherent commu- nications over multiple-input–multiple-output (MIMO) wireless links. For a Rayleigh flat block-fading channel with transmit- and re- ceive-antennas and a channel coherence interval of length , it is well known that for , or, at high signal-to-noise-ratio (SNR) and , unitary space–time modulation (USTM) is capacity-achieving, but incurs exponential demodulation complexity in . On the other hand, conventional training-based schemes that rely on known pilot symbols for channel estimation simplify the receiver design, but they induce certain SNR loss. To achieve desirable tradeoffs between performance and complexity, we propose a novel training approach where USTM symbols over a short length are used as pilots which carry information to the receiver, and coherent communication is used for the remainder of the block. This new approach considerably reduces the receiver complexity when is a small fraction of , and recovers part of the SNR loss experienced by the conventional training-based schemes. When and , but the ratios , are fixed, we obtain analytical expressions of the asymptotic Manuscript received May 11, 2005; revised September 13, 2006. This work was supported in part by the ARL/CTA under Grant DAAD19-01-2-0011. This work was prepared through collaborative participation in the Communications and Networks Consortium sponsored by the U. S. Army Research Laboratory under the Collaborative Technology Alliance Program, Cooperative Agreement DAAD19-01-2-0011. The U.S. Government is authorized to reproduce and dis- tribute reprints for Government purposes notwithstanding any copyright nota- tion thereon. The authors are with the Department of Electrical and Computer Engi- neering, University of Minnesota, Minneapolis, MN 55455 USA (e-mail: [email protected]; [email protected]; [email protected]). Color versions of Figs 1–7 are available online at http://ieeexplore.ieee.org. Communicated by B. Hassibi, Associate Editor for Communications. SNR loss for both the conventional and new training-based approaches, serving as a guideline for practical designs. Index Terms—Capacity, channel estimation, coherent detection, multiple antennas, noncoherent detection, unitary space–time modulation. I. INTRODUCTION Due to fading of the channel strength caused by constructive and destructive interference of the multiple signal paths between the transmitter and the receiver, a major challenge in wireless communi- cations is coping with channel uncertainties. 1 Pilot symbol-assisted modulation (PSAM) is a standard training-based approach when com- municating over time-varying channels [4], [23], [26]. In PSAM, pilot symbols known to both the transmitter and the receiver are multiplexed with data symbols and used as training for channel acquisition. Since known pilot symbols carry no data information, they reduce power and bandwidth resources during data transmission. Clearly, there is a tradeoff in allocating these resources between pilot symbols and data symbols. Sending more pilots with increased power improves the quality of channel estimation as well as the reliability of commu- nication. However, overincreasing the overhead for training reduces the amount of channel uses and power for information-carrying data symbols, which decreases data throughput. A basic information-theoretic question for PSAM is how much training is necessary when using Shannon’s capacity as the per- formance metric. For a given channel estimation accuracy, lower bounds on channel capacity are available for a general setting [19], for Rayleigh flat block-fading multiple-input–multiple-output (MIMO) wireless channels [29], [22], [9], and for perfectly interleaved MIMO channels [2]. The optimal power allocation between pilot and data symbols as well as the number (equal to ) of training symbols optimizing a lower bound on capacity were obtained in [9]. Similar lower bounds are also available for single-antenna and multiple-an- tenna frequency-selective fading channels, based on which the optimal training design has been derived [16], [21], [27], [1]. Although training-based schemes like PSAM simplify trans- ceiver design for noncoherent multiple-antenna systems, informa- tion-theoretic studies have revealed that in general they are not capacity-achieving. Marzetta and Hochwald [17] investigated the capacity of a Rayleigh flat block-fading channel with transmit, receive antennas and a channel coherence interval of length , and found that the noncoherent channel capacity is achieved when the transmitted signal matrix is expressible as a product of two statistically independent matrices: a isotropically distributed (i.d.) unitary matrix times a real, diagonal and nonnegative random matrix. The asymptotic capacity at high SNR can be achieved using only antennas, and increases linearly with , where [29]. In comparison, if the receiver knows the channel coefficients perfectly, it is well-known that the ergodic coherent channel capacity increases linearly with under the same channel model [6], [25]. Motivated by results in [17], a class of isotropic unitary space-time modulation (USTM) signals was proposed in [11], [12], [15] et. al to encode the transmitted signals using isotropic unitary ma- trices. For [17], and for high SNR with [29], the optimal input has indeed a USTM form. Digital Object Identifier 10.1109/TIT.2006.890693 1 The views and conclusions contained in this document are those of the au- thors and should not be interpreted as representing the official policies, either expressed or implied, of the Army Research Laboratory or the U.S. Govern- ment. 0018-9448/$25.00 © 2007 IEEE

Information-Bearing Noncoherently Modulated Pilots for ... · Nihar Jindal, Member, IEEE Abstract—In this correspondence, we deal with noncoherent commu-nications over multiple-input–multiple-output

  • Upload
    others

  • View
    0

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Information-Bearing Noncoherently Modulated Pilots for ... · Nihar Jindal, Member, IEEE Abstract—In this correspondence, we deal with noncoherent commu-nications over multiple-input–multiple-output

1160 IEEE TRANSACTIONS ON INFORMATION THEORY, VOL. 53, NO. 3, MARCH 2007

[15] M. Pohst and H. Zassenhaus, Algorithmic algebraic number theory, En-cyclopedia of Mathematics and its Applications. Cambridge, U.K.:Cambridge University Press, 1989.

[16] J. Proakis, Digital Communications, 4 ed. New York: McGrawHill,2000.

[17] P. Sebastian, H. Sampath, and A. Paulraj, “Adaptive modulation formultiple antenna systems,” in Proc. Asilomar Conf. on Signals, Syst.Comput., Oct. 2000, vol. 1, pp. 506–510.

[18] G. Taricco and E. Biglieri, “Exact pairwise error probability of space-time codes,” IEEE Trans. Inf. Theory, vol. 48, no. 2, pp. 510–513, Feb.2002.

[19] ——, “Correction to ëxact pairwise error probability of space-timecodes,” IEEE Trans. Inf. Theory, vol. 49, no. 3, pp. 766–766, Mar. 2003.

[20] V. Tarokh, N. Seshadri, and A. R. Calderbank, “Space-time codes forhigh data rate wireless communication: Performance criterion and codeconstruction,” IEEE Trans. Inf. Theory, vol. 44, no. 2, pp. 744–765,Mar. 1998.

[21] V. Tarokh, A. Vardy, and K. Zeger, “Universal bound on the perfor-mance of lattice codes,” IEEE Trans. Inf. Theory, vol. 45, no. 2, pp.670–681, Mar. 1999.

[22] I. E. Telatar, “Capacity of multi-antenna Gaussian channels,” EuropeanTrans. Telecommun., vol. 10, no. 6, pp. 585–595, Nov.-Dec. 1999.

[23] E. Viterbo and J. Boutros, “A universal lattice code decoder for fadingchannels,” IEEE Trans. Inf. Theory, vol. 45, no. 5, pp. 1639–1642, Jul.1999.

[24] E. Viterbo and E. Biglieri, “Computing the Voronoi cell of a lattice:The diamond-cutting algorithm,” IEEE Trans. Inf. Theory, vol. 42, no.1, pp. 161–171, Jan. 1996.

Information-Bearing Noncoherently Modulated Pilots forMIMO Training

Yingqun Yu, Student Member, IEEE,Georgios B. Giannakis, Fellow, IEEE, and

Nihar Jindal, Member, IEEE

Abstract—In this correspondence, we deal with noncoherent commu-nications over multiple-input–multiple-output (MIMO) wireless links.For a Rayleigh flat block-fading channel with M transmit- and N re-ceive-antennas and a channel coherence interval of length T , it is wellknown that for T � M , or, at high signal-to-noise-ratio (SNR) � � 1and M � minfN; bT=2cg, unitary space–time modulation (USTM) iscapacity-achieving, but incurs exponential demodulation complexity inT . On the other hand, conventional training-based schemes that rely onknown pilot symbols for channel estimation simplify the receiver design,but they induce certain SNR loss. To achieve desirable tradeoffs betweenperformance and complexity, we propose a novel training approach whereUSTM symbols over a short length T (< T ) are used as pilots whichcarry information to the receiver, and coherent communication is used forthe remainder of the block. This new approach considerably reduces thereceiver complexity when T is a small fraction of T , and recovers partof the SNR loss experienced by the conventional training-based schemes.When �!1 and T � T � 2M = 2N !1, but the ratios � =M=T ,� = T =T are fixed, we obtain analytical expressions of the asymptotic

Manuscript received May 11, 2005; revised September 13, 2006. This workwas supported in part by the ARL/CTA under Grant DAAD19-01-2-0011. Thiswork was prepared through collaborative participation in the Communicationsand Networks Consortium sponsored by the U. S. Army Research Laboratoryunder the Collaborative Technology Alliance Program, Cooperative AgreementDAAD19-01-2-0011. The U.S. Government is authorized to reproduce and dis-tribute reprints for Government purposes notwithstanding any copyright nota-tion thereon.

The authors are with the Department of Electrical and Computer Engi-neering, University of Minnesota, Minneapolis, MN 55455 USA (e-mail:[email protected]; [email protected]; [email protected]).

Color versions of Figs 1–7 are available online at http://ieeexplore.ieee.org.Communicated by B. Hassibi, Associate Editor for Communications.

SNR loss for both the conventional and new training-based approaches,serving as a guideline for practical designs.

Index Terms—Capacity, channel estimation, coherent detection, multipleantennas, noncoherent detection, unitary space–time modulation.

I. INTRODUCTION

Due to fading of the channel strength caused by constructive anddestructive interference of the multiple signal paths between thetransmitter and the receiver, a major challenge in wireless communi-cations is coping with channel uncertainties.1 Pilot symbol-assistedmodulation (PSAM) is a standard training-based approach when com-municating over time-varying channels [4], [23], [26]. In PSAM, pilotsymbols known to both the transmitter and the receiver are multiplexedwith data symbols and used as training for channel acquisition. Sinceknown pilot symbols carry no data information, they reduce powerand bandwidth resources during data transmission. Clearly, there isa tradeoff in allocating these resources between pilot symbols anddata symbols. Sending more pilots with increased power improvesthe quality of channel estimation as well as the reliability of commu-nication. However, overincreasing the overhead for training reducesthe amount of channel uses and power for information-carrying datasymbols, which decreases data throughput.

A basic information-theoretic question for PSAM is how muchtraining is necessary when using Shannon’s capacity as the per-formance metric. For a given channel estimation accuracy, lowerbounds on channel capacity are available for a general setting [19], forRayleigh flat block-fading multiple-input–multiple-output (MIMO)wireless channels [29], [22], [9], and for perfectly interleaved MIMOchannels [2]. The optimal power allocation between pilot and datasymbols as well as the number (equal to M ) of training symbolsoptimizing a lower bound on capacity were obtained in [9]. Similarlower bounds are also available for single-antenna and multiple-an-tenna frequency-selective fading channels, based on which the optimaltraining design has been derived [16], [21], [27], [1].

Although training-based schemes like PSAM simplify trans-ceiver design for noncoherent multiple-antenna systems, informa-tion-theoretic studies have revealed that in general they are notcapacity-achieving. Marzetta and Hochwald [17] investigated thecapacity of a Rayleigh flat block-fading channel with M transmit, Nreceive antennas and a channel coherence interval of length T , andfound that the noncoherent channel capacity is achieved when theT � M transmitted signal matrix is expressible as a product of twostatistically independent matrices: a T � M isotropically distributed(i.d.) unitary matrix times a real, diagonal and nonnegative M � Mrandom matrix. The asymptotic capacity at high SNR can be achievedusing onlyM� antennas, and increases linearly withM�(1�M�=T ),where M� = minfM;N; bT=2cg [29]. In comparison, if the receiverknows the channel coefficients perfectly, it is well-known that theergodic coherent channel capacity increases linearly with minfM;Ngunder the same channel model [6], [25].

Motivated by results in [17], a class of isotropic unitary space-timemodulation (USTM) signals was proposed in [11], [12], [15] et. al toencode the transmitted signals using T � M isotropic unitary ma-trices. For T � M [17], and for high SNR � � 1 with M �minfN; bT=2cg [29], the optimal input has indeed a USTM form.

Digital Object Identifier 10.1109/TIT.2006.890693

1The views and conclusions contained in this document are those of the au-thors and should not be interpreted as representing the official policies, eitherexpressed or implied, of the Army Research Laboratory or the U.S. Govern-ment.

0018-9448/$25.00 © 2007 IEEE

Page 2: Information-Bearing Noncoherently Modulated Pilots for ... · Nihar Jindal, Member, IEEE Abstract—In this correspondence, we deal with noncoherent commu-nications over multiple-input–multiple-output

IEEE TRANSACTIONS ON INFORMATION THEORY, VOL. 53, NO. 3, MARCH 2007 1161

The main drawback of USTM transmissions is that very often theirdesign requires numerical optimization [12], [18], [3], and becausethey possess no particular algebraic structure, they incur relatively highcomplexity. Furthermore, their demodulation is exponentially complexsince the constellation size grows exponentially with the block lengthT (the number of signal points is 2RT for a given rate of R bits persymbol). For this reason, USTM is practically applicable only for smallblock lengths or low rates. Differential USTM’s [14], [13], [8] et. al andalternative training-like constellations [28], [24], [5] enjoy polynomialcomplexity in T ; however, they are generally not capacity-achievingfor the block-fading channel.

Compared with the case when USTM is optimal, a training-basedscheme suffers SNR degradation due to imperfect channel state infor-mation (CSI), but gains the benefit of simplified receiver design. If T�out of T symbols in a fading block are used to send known trainingsymbols for channel estimation, it has been shown that at high SNRonly M� antennas should be used for transmission, and the achiev-able rate also increases linearly with M�(1 �M�=T ) similar to thenoncoherent case; however, due to channel estimation errors there isan SNR loss compared with the optimal noncoherent scheme [29], [9].A training-based scheme can be capacity-achieving at high SNR onlywhen T is sufficiently large, but the rate at which it attains this opti-mality as T grows has not been quantified yet.

In this correspondence, we first analytically compute the asymptoticSNR loss for the conventional training-based methods when � ! 1and T � 2M = 2N !1, but the ratio � =M=T is fixed. We showthat as � decreases, the asymptotic SNR loss drops monotonically butalso slowly from 2.17 dB (� = 0:5) to zero (�! 0). Further, we intro-duce a novel scheme that combines noncoherent and coherent detectionfor the block fading channel, and offers flexibility in trading off perfor-mance for complexity. A channel coherence interval T is divided intotwo parts: the noncoherent part with T� symbols and the coherent partwith Td(= T � T� ) symbols. The noncoherent symbols carry infor-mation unknown to the receiver and are encoded over multiple fadingblocks. A key observation is that after those T� noncoherent symbolsare correctly decoded without CSI, they can be further used to estimatethe channel coefficients in their own block, thus enabling coherent de-tection of the remaining Td coherent symbols. There are three advan-tages of the proposed scheme. First, unlike conventional training wherethe pilots are known sequences used only for channel estimation, herethose noncoherent symbols do carry information. Second, since T� isonly a small fraction of T , the cardinality of the noncoherent constel-lation is reduced considerably, leading to low decoding complexity.Finally, one is flexible to control the tradeoff between complexity andSNR loss by selecting a suitable T� .

The rest of the correspondence is organized as follows. In Section II,we introduce the system model and provide some preliminary results.In Section III, we dwell on the training-based scheme and compare itwith USTM. In Section IV, we introduce and analyze the novel non-coherent–coherent hybrid scheme. Numerical examples are given inSection V, and conclusions are drawn in Section VI.

II. SYSTEM MODEL AND PRELIMINARIES

A. System Model

We consider a single-user transmission with M transmit- and Nreceive-antennas over a frequency-nonselective (flat) Rayleigh block-fading channel, as in [17]. The channel coefficients, which are unknownto both the transmitter and the receiver, are assumed to remain con-stant over a block of T symbols, but are allowed to change indepen-dently from block to block. Within a block of T symbols, given that

a signal matrix ��� 2 CT�M is transmitted,2 the received signal matrixXXX 2 CT�N can be written as

XXX =�T

M���HHH +WWW (1)

where HHH 2 CM�N is the channel matrix, and WWW 2 CT�N denotesthe additive white noise matrix. BothHHH andWWW are complex Gaussianmatrices with independent and identically distributed (i.i.d.) CN (0; 1)entries. The power constraint on the transmitted signal is assumed tobe fTr(���y���)g = M , and thus � is the average received SNR ateach receive antenna since ���,HHH andWWW are independent. Because thereceiver does not know the channel matrixHHH , the model in (1) is oftenreferred to as a noncoherent channel; otherwise, it is called a coherentchannel.

B. Known Results on Coherent Capacity, Noncoherent Capacity andMutual Information of USTM

When perfect knowledge of the channel coefficients is available atthe receiver (but not at the transmitter), the channel capacity, oftencalled coherent capacity, is computed in [7], [25] and is summarizedin the following.

Lemma 1: If HHH is known to the receiver but not the transmitter, thecoherent capacity in bits per symbol is given by

Ccoherent(�) = log2det IIIM +

MHHHHHHy

= log2det IIIN +

MHHHyHHH : (2)

When M = N , the normalized asymptotic capacity for high SNR andlarge M satisfies

limM!1

lim�!1

Ccoherent(�)

M� log

2

e= 0: (3)

For the noncoherent channel model described by (1), it hasbeen shown that at high SNR the degrees of freedom persymbol for each noncoherent block is M�(1 � M�=T ), whereM� = minfM;N; bT=2cg [29]. This result indicates that at highSNR, the optimal strategy is to use only M� out of M availableantennas. The capacity-achieving input matrix can be written as��� = ���DDD, where ��� is a T �M isotropically distributed (i.d.) unitarymatrix, i.e.,���y��� = IIIM , andDDD is anM�M random real nonnegativediagonal matrix with fTr(DDD)g = 1 [17]. The distribution of DDD isgenerally unknown, except for the asymptotic case T � M [17] andfor high SNR with M � minfN; T=2g [29], where DDD becomes ascaled deterministic identity matrix DDD = 1=TIIIM , suggesting theso-called USTM inputs for noncoherent channels [11]. The result issummarized in the following lemma for the case T � 2M = 2N [29,Th. 9, Corollary 11].

Lemma 2: Assume T � 2M = 2N . If T � M and/or �� 1, theunitary space-time modulation with ���y��� = IIIM achieves the nonco-herent channel capacity of (1). In particular, for � � 1 the capacity isgiven by

CM;M(�) = 1�M

TCcoherent(�) + c(T;M) + o(1); (4)

where c(T;M) is a constant that depends only on M and T , and goesto zero as T ! 1, and o(1) is a term that goes to zero as � ! 1. If

2Here C denotes the complex field.

Page 3: Information-Bearing Noncoherently Modulated Pilots for ... · Nihar Jindal, Member, IEEE Abstract—In this correspondence, we deal with noncoherent commu-nications over multiple-input–multiple-output

1162 IEEE TRANSACTIONS ON INFORMATION THEORY, VOL. 53, NO. 3, MARCH 2007

we let both T and M go to infinity but keep the ratio � =M=T fixed,then we have3

limM!1

lim�!1

CM;M(�)

M

� k(�)

ln 2+ (1� �) log2

e= 0 (5)

where

limM!1

lim�!1

c(T;M)

M� k(�)

ln 2= 0; (6)

and

k(�) =(1� �)2

2�ln(1� �) +

2ln� +

1� �

2< 0 (7)

for all 0 < � � 1=2.

III. TRAINING WITH KNOWN PILOT SYMBOLS VERSUS USTM

In this section, we introduce the conventional training-based schemewith known pilot symbols and compare it with USTM. Based on alower bound CL

known of the training-based schemes, we compare theasymptotic behavior of the two options at both high and low SNR. Itis shown that in terms of achievable rates USTM outperforms trainingunder both situations.

A. Training-Based Schemes

In a typical training-based system, the transmitted signal matrix ���

is partitioned into a training submatrix ���� and a data submatrix ���d asfollows [9]

��� =

� T

M����

� T

M���d

(8)

where ���� 2 CT �M with T� � M;Tr(���y����� ) = M is thetraining matrix known to both the transmitter and the receiver, and���d 2 CT �M with fTr(���yd���d)g = M is the data matrix carryinginformation from the transmitter to the receiver. The parameters ��and �d are the SNR values during the training phase and the datatransmission phase, respectively. In addition, we have the equations oftime and energy conservation: T� +Td = T , and �T = ��T� +�dTd.

Similarly, the received signal matrix XXX and the noise matrix WWW arealso partitioned into two submatrices

XXX =XXX�

XXXd

; and WWW =WWW �

WWW d

(9)

where XXX� 2 CT �N , XXXd 2 CT �N(WWW � 2 CT �N ;WWW d 2 CT �N)are the received signal (noise) matrices during the training phase andthe data transmission phase, respectively. We can thus write the signalmodel for the training phase as

XXX� =��T�M

����HHH +WWW � (10)

and for the data phase as

XXXd =�dTdM

���dHHH +WWW d: (11)

3We believe that in [29, eq. (22)] the term log e should be ln 2.

The capacity in bits per symbol for the training-based scheme isgiven by [9]

Cknown = sup��� ;p(��� )

1

TI(���� ;���d;XXX� ;���d)

= sup��� ;p(��� )

1

TI(���d;XXXdj���� ;XXX� ) (12)

since ���� is known to both the transmitter and the receiver and ���d isindependent of ���� and XXX� . The optimization in (12) is performedover all choices of the deterministic training matrix ���� and the inputdistributions p(���d) of the data matrix ���� , under the constraintsTr(���y����� ) = M and fTr(���yd���d)g = M . However, such anoptimization problem is very difficult to solve.

One option is to form an explicit channel estimate HHH first and use itas if it were correct. In this process, information may be thrown away,which results in a suboptimal scheme. Nevertheless, this method en-ables us to compute a tight lower bound on channel capacity Cknown.We first compute minimum mean square error (MMSE) estimate ofthe channel matrix, and then absorb the estimation error in the addi-tive noise to obtain an equivalent noise term. Further, this new noiseterm is replaced by a worst case noise, yielding a lower bound on mu-tual information. It has been shown in [9] that the training matrix withorthonormal columns ���y����� = IIIM simultaneously maximizes thislower bound and minimizes MSE. If optimal power allocation betweentraining and information-bearing symbols can be afforded, the optimaltraining interval should be T opt

� = M , and the corresponding lowerbound is

CLknown(�) = 1� M

TCcoherent(�e�) (13)

where

�eff =

�T

T�2M(p �p

� 1)2; for T > 2M�

1+2�; for T = 2M

�T

2M�T(p� �p� + 1)2; for T < 2M .

=(M + �T )(T �M)

�T (T � 2M): (14)

Compared with the noncoherent capacity in (4) at high SNR for thecase T � 2M = 2N , the training-based scheme can achieve the samedegrees of freedom M(1�M=T ); however, it incurs an SNR degra-dation which depends on �,M and T . In Section III-B1, an asymptoticexpression for SNR loss will be found.

If equal training and data power � = �� = �d is mandatory, e.g., inorder to ensure constant-modulus transmissions, then

CLknown(�) = 1� T�

TCcoherent(�e� ) (15)

where

�e� =�2T�=M

1 + (1 + T�=M)�: (16)

In this case, the optimal training interval can be found numerically.

B. Comparison Between Conventional Training-Based Schemes andUSTM

In this section, we compare the asymptotic behavior of the mutualinformation of USTM and training-based schemes with optimal powerallocation, which motivates our training-based scheme with unknownsymbols in Section IV.

Page 4: Information-Bearing Noncoherently Modulated Pilots for ... · Nihar Jindal, Member, IEEE Abstract—In this correspondence, we deal with noncoherent commu-nications over multiple-input–multiple-output

IEEE TRANSACTIONS ON INFORMATION THEORY, VOL. 53, NO. 3, MARCH 2007 1163

Fig. 1. Noncoherent training.

Fig. 2. Asymptotic SNR loss � (�).

1) High SNR: Consider for simplicity that T � 2M = 2N .From Lemma 2, we know that at high SNR, USTM inputs are ca-pacity-achieving. Compared with the first term in (4), the SNR lossin CL

known can be expressed as

� :=�

�eff=

T � 2M

T(p � � 1)�2

=T � 2M

T(p + � 1)2; for T > 2M: (17)

Since lim�!1 = (T �M)=(T � 2M), substitution of the latterinto � and simplification leads to

�1 := lim�!1

� = 1 + 2M

T1� M

T(18)

for T � 2M , since this expression also applies when T = 2M .Note that � is not the real SNR loss at high SNR, since there is

another term c(T;M) in (4) that does not depend on �. To accountfor that, let us consider the case when both T and M go to infinity, butthe ratio � = M=T is fixed. From Lemma 2, incorporating the termk(�) into log

2(�) yields

limM!1

lim�!1

CM;M(�)

M� (1� �) log

2

e� 2 = 0:

(19)

Also from Lemma 1, we have a similar asymptotic result forCLknown(�)

limM!1

lim�!1

CLknown(�)

M� (1� �) log

2

e� ��11

= 0: (20)

Upon comparing (19) with(20), we obtain the following result.

Proposition 1: When �!1, M = N !1 and T !1, but theratio � = M=T � 1=2 is fixed, CL

known suffers an asymptotic SNRloss relative to the noncoherent capacity

�loss(�) := �1 � 2= 1 + 2 �(1� �) � 2 (21)

where k(�) is given in (7).

Corollary 1: It holds that �loss(0:5) = 2:1715 dB, and �loss(0) =lim�!0 �(�) = 0 dB.

We plot �loss(�) in Fig. 2 with � in logarithmic scale. We observethat the SNR loss decreases monotonically to zero as � ! 0, whichis consistent with the intuition that for large T training can be optimal;however, the slope of decrease is very small. For example, the SNRloss is 1.5980 dB at � = 10�1(T = 10 M), and drops to 0.6981 dBat � = 10�2(T = 100 M). Even when � = 10�3(T = 1000 M),there is still 0.2523 dB SNR loss. For the region T = 2 � 20 M,there is always an SNR loss less than 2.17 dB, but more than 1.5 dB.Fig. 3 compares the asymptotic SNR loss with the case � = 25 dB,

Page 5: Information-Bearing Noncoherently Modulated Pilots for ... · Nihar Jindal, Member, IEEE Abstract—In this correspondence, we deal with noncoherent commu-nications over multiple-input–multiple-output

1164 IEEE TRANSACTIONS ON INFORMATION THEORY, VOL. 53, NO. 3, MARCH 2007

Fig. 3. SNR loss for finite �, M and T .

M = 1; 4; 8 and T = 1 � 100. It can be seen that �loss(�) is actuallytight except when M = 1 and T < 10; therefore, �loss(�) is a goodapproximation for practical scenarios.

Note that the interest in noncoherent channel models is often limitedto the case when the channel varies quickly; that is, when T is small. IfT � M , then we can allocate minimal overhead for channel estima-tion and feed channel estimates back to the transmitter, which leads toadaptive signaling designs with improved performance. When T is notvery large or comparable to M , which is the scenario we focus on, atraining-based scheme is not capacity-optimal, although it can achievethe same degrees of freedom as the optimal noncoherent scheme.

2) Low SNR: We assume that T > M . We know that the optimumtraining length is T opt

� = M , but at low SNR, the lower bound of thetraining-based scheme is not sensitive to the length of training symbols[9], and the lower bound is given by (up to o(�2))

CLknown(�)

NT log e

4M�2: (22)

At low SNR, the mutual information of USTM inputs can be approx-imated as (up to o(�2)) [20]

IUSTM(�)N(T �M) log e

2M�2: (23)

From (22) and(23) we obtain

IUSTM(�)

CLknown(�)

2(T �M)

T: (24)

Proposition 2: For fixed M and T with T > M , the asymptoticratio of the mutual information of USTM to CL

known at low SNR is

�(�) := lim�!0

IUSTM(�)

CLknown(�)

= 2(1� �); (25)

where � = M=T .We can see that at low SNR, USTM is also better than training. The

reason that training-based schemes become worse is that at low SNR

the channel estimation has very low quality and thus becomes unreli-able. Since USTM bypasses channel estimation, it achieves higher ratesthan training. The complexity of USTM at low SNR can be acceptablesince the achievable rate for small � is quite low.

IV. TRAINING VIA INFORMATION-BEARING NONCOHERENT

SPACE–TIME MODULATION

In this section, we develop a novel training-based scheme where“pilot” symbols, just like data symbols, can also carry information andthus are unknown to the receiver. It appears impossible to estimate thechannel when the receiver does not know the transmitted pilot sym-bols, unless some kind of blind estimation scheme is used. However,it is certainly possible to do so after the receiver successfully decodesthem. The decoding of unknown pilot symbols, though, does not re-quire CSI knowledge, which can be enabled by using any noncoherentcommunication scheme.

A. Training Via Noncoherent Communication

The proposed system architecture is shown in Fig. 1. Informationdata are first encoded and then sent to the coherent and noncoherentmodulators, respectively. The modulator outputs ���� and ���d are mul-tiplexed for transmission. The receiver first demultiplexes the receivedsignal to obtain XXX� and XXXd. The received matrix XXX� carries data andis decoded first. Since���� is noncoherently modulated, the receiver candecode it without knowing the fading matrix HHH . Once the transmittedsignal matrix is recovered as ���� after decoding, the receiver can esti-mate the channel using ���� . The estimated channel HHH is subsequentlysent to the coherent detector to decode the information carried byXXXd.In practice, the data bits carried by ���� should be encoded across mul-tiple blocks with relatively strong codes so that the channel estimationerror caused by the incorrectly decoded���� is negligible. However, herewe only focus on the information-theoretic analysis and the design ofpractical coding schemes is beyond the scope of this correspondence.

The receiver structure depicted in Fig. 1 is suboptimal in general,since information may be lost, because: 1) ���� and ���d are decoded notjointly but separately; and 2) an explicit HHH is formed and used as if it

Page 6: Information-Bearing Noncoherently Modulated Pilots for ... · Nihar Jindal, Member, IEEE Abstract—In this correspondence, we deal with noncoherent commu-nications over multiple-input–multiple-output

IEEE TRANSACTIONS ON INFORMATION THEORY, VOL. 53, NO. 3, MARCH 2007 1165

were correct. In the following, we will develop a lower bound on thechannel capacity that favors this suboptimal receiver.

The capacity for the new scheme is the maximum over the distri-bution of the transmit signals of the mutual information between thetransmitted signals ���� , ���d and the received signals XXX� , XXXd, i.e.

Cunknown = supp(��� ;��� )

1

TI(���� ;���d;XXX� ;XXXd): (26)

Compared with the conventional training-based system, it is evenharder to compute the capacity for this new one. Similarly, we are onlyable to calculate a lower bound on capacity. Using the chain rule ofmutual information, we have

I(���� ;���d;XXX� ;XXXd) = I(���� ;XXX� ;XXXd) + I(���d;XXX� ;XXXdj���� )

= I(���� ;XXX�) + I(���� ;XXXdjXXX� )

+ I(���d;XXX� j���� ) + I(���d;XXXdj���� ;XXX� )

� I(���� ;XXX�) + I(���d;XXXdj���� ;XXX� ) (27)

where the inequality cannot be reduced to equality, since���� is randomand may depend on ���d. Even when ���� and ���d are independent, theterm I(���� ;XXXdjXXX� ) can still be nonzero. Nevertheless, supposing that���� and ���d are independent, we find that Cunknown satisfies

Cunknown � supp(��� );p(��� )

1

T[(I(���� ;XXX� ) + I(���d;XXXdj���� ;XXX� )]:

(28)

The optimization in (28) is taken over all input distributions p(���� ),p(���d) adhering to the power constraints that fTr(���y

����� )g = Mand fTr(���y

d���d)g = M . Note that the right-hand side (RHS) of (28)is consistent with the receiver structure shown in Fig. 1, in which thedata stream ���� is decoded first and ���d is decoded later based on XXX�

and the reconstructed ���� (with or without using an explicit estimate ofthe channel).

B. Training Via Unitary Space–Time Modulation

We do not know what inputs maximize I(���� ;XXX� ) +I(���d;XXXdj���� ;XXX�) in the RHS of (28). Instead, we choose ���� tobe USTM for which I(���� ;XXX�) can be calculated at least by MonteCarlo simulations [10], and then compute an analytical lower boundon I(���d;XXXdj���� ;XXX� ) as in [29] and [9]. The reason for choosingunitary ���� is twofold. First, the USTM inputs maximize I(���� ;XXX� )for large T� (� M), and for large �� with M � minfN; bT�=2cg.Second, the unitary ���� , which is used as training symbols after beingsuccessfully decoded, minimizes channel minimum square error(mse) and maximizes a lower bound on capacity of training-basedschemes simultaneously.

Since optimizing power between the training and nontraining partsis difficult, we assume that equal power is used: � = �� = �d. For theUSTM part, we have

I(���� ;XXX� ) = T� � IUSTM(�) (29)

where IUSTM(�) is the mutual information in bits per symbol ofUSTM inputs with block length T� .

For the part with channel estimation, due to equal transmissionpower, we obtain from (15) that

I(���d;XXXdj���� ;XXX� ) � 1�T�T

Ccoherent(�e�) � T

=(T � T� )Ccoherent(�e� ) (30)

where �eff = � T =M1+(1+T =M)�

:Combining (29) and (30) leads to a lowerbound on channel capacity

CLunknown(�) :=

1

T[T� IUSTM(�)

+ (T � T� )Ccoherent(�e� )]

=�1IUSTM(�) + (1� �1)Ccoherent(�e� )

(31)

if we define �1 = T�=T � 1.

C. High SNR

With T � T� � 2M = 2N , we have 2� � �1. We are interestedin the asymptotic behavior when �, T , T� and M go to infinity, but theratios �, �1 are fixed. Note that M=T� = �=�1.

Since T� � 2M , at high SNR, the first term inCLunknown(�) satisfies

[cf. Lemma 2]IUSTM(�)

M!

k( ��

)

ln 2+ 1�

�1log2

e; as

�;M !1: (32)

Note that for large �, we have �eff = �1+�=�

. Similar to (3), weobtain

Ccoherent(�e� )

M! log2

e1 +

�1

�1

; as

�;M !1: (33)

ThereforeCLunknown(�)

M!

�1k(��

)

ln 2+ (�1 � �) log2

e

+ (1� �1) log2�

e1 +

�1

�1

=(1� �) log2�

e� 1 +

�1

( )

� 2 : (34)

Compared with (19), we can identify the asymptotic SNR loss, as sum-marized in the following theorem.

Theorem 1: When �!1,M = N !1, T� !1 and T !1,but the ratios 2� = 2M=T � �1 = T�=T � 1 are fixed,CL

unknown(�)suffers an asymptotic SNR loss relative to the noncoherent capacity

�0loss(�;�1) = 1 +�

�1� 2 (35)

where k(�) is given in (7).Fig. 4 depicts rate versus �, for fixed �1(� 2�), while Fig. 5 de-

picts rate versus �1(� 2�), for fixed �. For fixed �1, a large � leadsto a high SNR loss, while for fixed �, a large �1 yields a small SNRloss. An interesting fact revealed by Figs. 4 and 5 is that for sufficientlysmall � and �1, we find that the conventional training-based schemewith power control is better, since �0loss(�;�1) > �loss(�). For ex-ample, �0loss(0:05; 0:1) = 1:425 [dB] while �loss(0:05) = 1:284 dB.The reason is that equal transmission power is used for both trainingand nontraining parts in computing �0loss(�;�1). For very small � and�1, the advantage of power control outweighs the benefits of nonco-herent training. If optimal power allocation is used, which is a difficultoptimization problem, we conjecture that �0loss(�;�1) will be alwayssmaller than �loss(�). Even without power optimization though, for

Page 7: Information-Bearing Noncoherently Modulated Pilots for ... · Nihar Jindal, Member, IEEE Abstract—In this correspondence, we deal with noncoherent commu-nications over multiple-input–multiple-output

1166 IEEE TRANSACTIONS ON INFORMATION THEORY, VOL. 53, NO. 3, MARCH 2007

Fig. 4. Asymptotic SNR loss � (�; � ) for 2� � � � 1.

Fig. 5. Asymptotic SNR loss � (�;� ) for 2� � � � 1.

most interesting (�; �1) combinations (� > 0:05; � > 0:1), our non-coherent training-based approach outperforms the conventional one.

V. NUMERICAL RESULTS

In this section, we present simulations for the three approaches overnoncoherent channels: USTM, training with known pilot symbols,and training via USTM symbols. We obtain the mutual informa-tion IUSTM(�), CL

known(�) and CL

unknown(�) through Monte Carlosimulations for finite M = N , T and �. The results validate theasymptotic SNR loss �loss(�) and �0

loss(�;�1). For the method used

to numerically evaluate the mutual information of USTM inputs, werefer the reader to [10].

Fig. 6 shows the result for M = N = 1, T = 10 and T� =4. We observe that compared with USTM, training with known pilotsymbols suffers about 1.5dB SNR loss at � = 25 dB, while trainingwith USTM incurs only about 0.4 dB penalty. These results are veryclose to the asymptotic SNR loss given by Proposition 1 and Theorem1: �loss(0:1) = 1:598 dB, �0

loss(0:1;0:4) = 0:438 dB.Fig. 7 depicts the result for M = N = 2, T = 10 and T� =

5. Compared with USTM, training with known pilot symbols suffers

Page 8: Information-Bearing Noncoherently Modulated Pilots for ... · Nihar Jindal, Member, IEEE Abstract—In this correspondence, we deal with noncoherent commu-nications over multiple-input–multiple-output

IEEE TRANSACTIONS ON INFORMATION THEORY, VOL. 53, NO. 3, MARCH 2007 1167

Fig. 6. M = N = 1, T = 10, T = 4.

Fig. 7. M = N = 2, T = 10, T = 5.

about 1.8 dB SNR loss at � = 25 dB, while training with USTMincurs only about 0.55 dB loss. Similarly, these results are very closeto the asymptotic SNR loss given by Proposition 1 and Theorem 1:�loss(0:2) = 1:912 dB, �0

loss(0:2; 0:5) = 0:580 dB.

VI. CONCLUSION

We developed a new training scheme that uses information-bearingUSTM symbols as “pilots” instead of known symbols utilized by theconventional training-based approaches. The receiver first decodes

these USTM pilot symbols without channel state information, and thenuses the decoded symbols as training to estimate the channel. Whilethis new method decreases complexity of the capacity-achieving ap-proach through a short USTM block T� < T , it can also recover someSNR loss that is inherent to conventional training-based strategies.When T � T� � 2M = 2N ! 1 and � ! 1, but the ratios� = M=T , �1 = T�=T are fixed, the asymptotic expressions of theSNR loss were obtained analytically for both conventional and theproposed schemes, and are useful as a guideline for practical MIMO

Page 9: Information-Bearing Noncoherently Modulated Pilots for ... · Nihar Jindal, Member, IEEE Abstract—In this correspondence, we deal with noncoherent commu-nications over multiple-input–multiple-output

1168 IEEE TRANSACTIONS ON INFORMATION THEORY, VOL. 53, NO. 3, MARCH 2007

designs. While the current work is only focused on information-theo-retic aspects, our future work will pursue practical coding schemes forthe proposed approach.

REFERENCES

[1] S. Adireddy, L. Tong, and H. Viswanathan, “Optimal placement oftraining for frequency selective block-fading channels,” IEEE Trans.Inf. Theory, vol. 48, pp. 2338–2353, Aug. 2002.

[2] J. Baltersee, G. Fock, and H. Meyr, “Achievable rate of MIMO chan-nels with data-aided channel estimation and perfect interleaving,” IEEEJ. Sel. Areas Commun., vol. 19, pp. 2358–2368, Dec. 2001.

[3] M. J. Borran, A. Sabharwal, and B. Aazhang, “On design criteria andconstruction of noncoherent space-time constellations,” IEEE Trans.Inf. Theory, vol. 49, pp. 2332–2351, Oct. 2003.

[4] J. K. Cavers, “An analysis of pilot symbol assisted modulation forRayleigh fading channels,” IEEE Trans. Veh. Technol., vol. 40, pp.686–693, Nov. 1991.

[5] P. Dayal, M. Brehler, and M. K. Varanasi, “Leveraging coherent space-time codes for noncoherent communication via training,” IEEE Trans.Inf. Theory, vol. 50, pp. 2058–2080, Sep. 2004.

[6] G. Foschini and M. Gans, “On limits of wireless communications infading environment when using multiple antennas,” Wireless Pers.Commun., vol. 6, pp. 331–335, Nov./Dec. 1998.

[7] G. J. Foschini, “Layered space-time architecture for wireless commu-nication in a fading environment when using multi-element antennas,”AT&T Bell Labs. Tech. J., vol. 1, no. 2, pp. 41–59, 1996.

[8] B. Hassibi and B. Hochwald, “Cayley differential unitary space-timecodes,” IEEE Trans. Inf. Theory, vol. 48, pp. 1485–1503, Jun. 2002.

[9] ——, “How much training is needed in a multiple-antenna wirelesslink,” IEEE Trans. Inf. Theory, vol. 49, pp. 951–964, Apr. 2003.

[10] B. Hassibi and T. L. Marzetta, “Multiple-antennas and isotropicallyrandom unitary inputs: The received signal density in closed form,”IEEE Trans. Inf. Theory, vol. 48, pp. 1473–1484, Jun. 2002.

[11] B. M. Hochwald and T. L. Marzetta, “Unitary space-time modulationfor multiple-antenna communications in Rayleigh flat fading,” IEEETrans. Inf. Theory, vol. 46, pp. 543–564, Mar. 2000.

[12] B. M. Hochwald, T. L. Marzetta, T. J. Richardson, W. Sweldens, andR. Urbanke, “Systematic design of unitary space-time constellations,”IEEE Trans. Inf. Theory, vol. 46, pp. 1962–1973, Sep. 2000.

[13] B. M. Hochwald and W. Sweldens, “Differential unitary space-timemodulation,” IEEE Trans. Inf. Theory, vol. 48, pp. 2041–2052, Dec.2000.

[14] B. L. Hughes, “Differential space-time modulation,” IEEE Trans. Inf.Theory, vol. 46, pp. 2567–2578, Nov. 2000.

[15] Y. Jing and B. Hassibi, “Cayley codes for unitary space-time modula-tion,” IEEE Trans. Signal Process., vol. 51, pp. 2891–2904, Nov. 2003.

[16] X. Ma, L. Yang, and G. B. Giannakis, “Optimal training for MIMOfrequency-selective fading channels,” IEEE Trans. Wireless Commun.,vol. 4, no. 2, pp. 453–466, Mar. 2005.

[17] T. L. Marzetta and B. M. Hochwald, “Capacity of a mobile multiple-antenna communication link in a Rayleigh flat-fading environment,”IEEE Trans. Inf. Theory, vol. 45, pp. 139–157, Jan. 1999.

[18] M. L. McCloud, M. Brehler, and M. K. Varanasi, “Signal design andconvolutional coding for space-time communication on the Rayleighfading channel,” IEEE Trans. Inf. Theory, vol. 48, pp. 1186–1194, May2002.

[19] M. Médard, “The effect upon channel capacity in wireless communica-tions of perfect and imperfect knowledge of the channel,” IEEE Trans.Inf. Theory, vol. 46, pp. 933–946, May 2000.

[20] A. L. Moustakas, S. H. Simon, and T. L. Marzetta, “Capacity of differ-ential versus nondifferential unitary space-time modulation for mimochannels,” IEEE Trans. Inf. Theory, 2004, submitted for publication.

[21] S. Ohno and G. B. Giannakis, “Capacity maximizing mmse-optimalpilots and precoders for wireless OFDM over rapidly fading channels,”IEEE Trans. Inf. Theory, vol. 50, pp. 2138–2145, Sep. 2004.

[22] A. Sabharwal, E. Erkip, and B. Aazhang, “On channel state informationin multiple antenna block fading channels,” in IEEE Symp. Inf. Theory,Honolulu, HI, Nov. 2000, pp. 116–119.

[23] S. Sampei and T. Sunaga, “Rayleigh fading compensation for QAM inland mobile radio communications,” IEEE Trans. Veh. Technol., vol.42, pp. 137–147, May 1993.

[24] V. Tarokh and I. Kim, “Existence and construction of noncoherentunitary space-time codes,” IEEE Trans. Inf. Theory, vol. 48, pp.3112–3117, Dec. 2002.

[25] I. E. Telatar, “Capacity of multi-antenna Gaussian channels,” Europ.Trans. Telecommun., vol. 10, pp. 585–595, Nov./Dec. 1999.

[26] L. Tong, B. Sadler, and M. Dong, “Pilot assisted wireless transmis-sions: General model, design criteria, and signal processing,” IEEESignal Process. Mag., vol. 21, no. 6, pp. 12–25, Nov. 2004.

[27] H. Vikalo, B. Hassibi, B. Hochwald, and T. Kailath, “On the capacity offrequency-selective channels in training-based transmission schemes,”IEEE Trans. Signal Process., vol. 52, pp. 2572–2583, Sep. 2004.

[28] W. Zhao, G. Leus, and G. B. Giannakis, “Orthogonal design of unitaryconstellations for uncoded and trellis coded noncoherent space-timesystems,” IEEE Trans. Inf. Theory, vol. 50, pp. 1319–1327, Jun. 2004.

[29] L. Zheng and D. Tse, “Communicating on the Grassmann manifold:A geometric approach to the noncoherent multiple antenna channel,”IEEE Trans. Inf. Theory, vol. 48, pp. 359–383, Feb. 2002.

On the Elementwise Convergence of Continuous Functionsof Hermitian Banded Toeplitz Matrices

Pedro M. Crespo, Senior Member, IEEE, andJesús Gutiérrez-Gutiérrez

Abstract—Toeplitz matrices and functions of Toeplitz matrices (such asthe inverse of a Toeplitz matrix, powers of a Toeplitz matrix or the expo-nential of a Toeplitz matrix) arise in many different theoretical and ap-plied fields. They can be found in the mathematical modeling of problemswhere some kind of shift invariance occurs in terms of space or time. R. M.Gray’s excellent tutorial monograph on Toeplitz and circulant matrices hasbeen, and remains, the best elementary introduction to the Szegö distribu-tion theory on the asymptotic behavior of continuous functions of Toeplitzmatrices. His asymptotic results, widely used in engineering due to the sim-plicity of its mathematical proofs, do not concern individual entries of thesematrices but rather, they describe an “average” behavior. However, thereare important applications where the asymptotic expressions of interest aredirectly related to the convergence of a single entry of a continuous functionof a Toeplitz matrix. Using similar mathematical tools and to gain insightinto the solutions of this sort of problems, the present correspondence de-rives new theoretical results regarding the convergence of these entries.

Index Terms—Circulant matrices, covariance matrices, elementwise con-vergence, functions of matrices, minimum mean square error (MMSE), sta-tionary stochastic time series, Szegö’s theorem, Toeplitz matrices.

I. INTRODUCTION

A square Toeplitz matrix is an n � n matrix Tn = t(n)i;j where

t(n)i;j = ti�j are complex numbers. Toeplitz matrices arise in many

theoretical and applied fields. In particular, there are many signal pro-cessing applications, where the Toeplitz matrix Tn is often Hermitian.Generally speaking, what is really relevant in many of the associatedproblems is a continuous function of the Toeplitz matrix

g(Tn) := Undiag(g(�1(Tn)); . . . ; g(�n(Tn)))U�1n

Manuscript received May 13, 2006. This work was supported in part by theSpanish Ministry of Education and Science, by the European Regional Devel-opment Fund and by the European Social Fund through the MIMESIS project(no. TEC2004-06451-C05-04/TCM) and the Torres-Quevedo program.

The authors are with CEIT and Tecnun (University of Navarra), 20018 SanSebastián, Spain (e-mail: [email protected]; [email protected]).

Communicated by A. Høst-Madsen, Associate Editor for Detection and Es-timation.

Digital Object Identifier 10.1109/TIT.2006.890697

0018-9448/$25.00 © 2007 IEEE