1
Channel estimation and hybrid precoding in frequency selective mmWave channels Wideband channel model Wideband channel estimation Nuria González-Prelcic, Robert W. Heath Jr. Freq. selective hybrid precoding Finding the precoders with perfect or imperfect CSI HYBRID PRECODING AND COMBINING CHANNEL ESTIMATION IN HYBRID ARCHITECTURES CHANNEL ESTIMATION WITH LOW RESOLUTION ADCS CHANNEL ESTIMATION & HYBRID PRECODING FOR BROADBAND NARROWBAND # of papers on SP for mmWave Signal processing challenges @mmWave Most of prior work on SP for mmWave is narrowband high-resolution multi-view video in real-time (>6Gbps) without HDMI cables … High data rate MmmWave applications High bandwith mmWave channels are frequency selective Virtual reality mmWave V2X Downloading high- definition 3D map data (~Gbyte) Sharing local sensors information ~ 100x Mbps Download multimedia data 100x Mbps - Gbps Strong interference happens much less often Sidelobe interference is weaker Rate gain when only accounting for stronger signal Additional gain from less interference in the narrower beams pointy beam fat beam 20 x gain 2 x gain rate multiplier mmWave cellular TX and RX array response vectors Complex gains for the L clusters Pulse shaping filter evaluated at the delays of each cluster USUALLY NEGLECTED H d = r N r N t L L X `=1 ` p rc (dT s - ` )a R (φ ` )a T (` ), H [k ]= r N r N t L L X `=1 ` β k,` a R (φ ` )a T (` ) P N c -1 d=0 p rc (dT s - ` )e -j 2kd K r N c is the delay tap length Time domain Frequency domain sampled response for the dth delay tap Baseband Precoding 1-bit ADC DAC 1-bit ADC DAC RF Chain RF Precoding 1-bit ADC ADC 1-bit ADC ADC Baseband Combining N t N r L t L r N s N s RF Combining F BB F RF W BB W RF RF Chain RF Chain RF Chain y[k ]= H[k ]F RF F[k ]s[k ]+ n[k ] Received signal at subcarrier k RF precoding in the time domain – common for all subcarriers Baseband precoding can be designed per subcarrier n F ? RF , {F ? [k ]} K k=1 o = arg max F RF ,{F[k]} K k=1 1 K K X k=1 log 2 I N MS + N S H[k ]F RF F[k ]F[k ] F RF H[k ] s.t. [F RF ] :,r 2 F RF , r =1, ..., N RF F RF F[k ] 2 U N BS N RF , k =1, 2, ..., K. ( RF precoders are taken from quantized codebooks (hardware constraints) Unitary power constraint Assumes optimal combining at RX Total power constraint also possible Perfect CSI Design the hybrid precoders to maximize mutual information [1] Imperfect CSI Design the hybrid precoders from the covariance [2] Formulation in the SU scenario Formulation in a MU scenario y j [k ]= H j [k ] X U i=1 P RF P i BB [k ]s i [k ]+ n j . ume a power constraint for each user and s 1. Design downlink hybrid combiners using covariance estimates and MSE 2. Use reciprocity to find the precoders during the uplink phase 3. Decompose the precoders combiners using an iterative method that alternates between RF and BB updates MIMO -FDM system BB precoder for user i subband k ˆ s j [k ]= W j,H BB [k ]W j,H RF y j [k ], postcombining rx signal user for user j [1] A. Alkhateeb and R. W. Heath Jr., “Frequency selective hybrid precoding for limited feedback millimeter wave systems,” IEEE Transactions on Communications, 2016 [2] José P. González-Coma, Nuria González-Prelcic, Luis Castedo and Robert W. Heath Jr., “Frequency selective multiuser hybrid precoding for mmWave systems with imperfect channel knowledge”, Asilomar 2016 [3] K. Venugopal, A. Alkhateeb, N. González Prelcic, and R. W. Heath, Jr, “Channel Estimation for Hybrid Architecture Based Wideband Millimeter Wave Systems”, submitted to IEEE JSAC, 2016 [4] J. Rodríguez-Fernández, K. Venugopal, N. González Prelcic, and R. W. Heath, Jr, “A Frequency-Domain Approach to Wideband Channel Estimation in Millimeter Wave Systems”, submitted to ICC 2017. References to on going work Dictionary with columns ... Random beamforming matrices a c T ( ˜ φ x ) a R ( ˜ y ) Measurement 1 Measurement M Quantized grid of AoA/AoD ... y (1) = p S (1) f T 1 w 1 ( I N c ¯ A tx A rx ) Γx + e (1) y (M ) = p S (M ) f T M w M ( I N c ¯ A tx A rx ) Γx + e (M ) Stack M measurements Γ = 2 6 6 6 4 I G r G t ˜ p T 1 I G r G t ˜ p T 2 . . . I G r G t ˜ p T N c 3 7 7 7 5 , has entries p n ()= p rc (n - ) ˜ p n 1, 2, n =1, 2, ··· ,N c ng the quantization e ˜ p n (k ) en, ne k =1, 2, ··· ,G c due to sampling in t Sampled version Compressive time domain solution [3] Frequency domain solution [4] Solve a sparse recovery problem Sparse vector containing the channel coefficients 2 6 6 6 4 r (1) [k ] r (2) [k ] . . . r (M ) [k ] 3 7 7 7 5 = p 2 6 6 6 4 Φ (1) Φ (2) . . . Φ (M ) 3 7 7 7 5 h v [k ]+ n[k ], Find effective compressed estimation strategies with very low Clear overhead comparison with beam training strategies Challenges measurements Φ (m) =(f (m) tr T W (m) tr ) 2 C L r N t N r =( ¯ ˜ A T ˜ A R ) 2 C N t N r G r G t Sparse vector containing the channel coefficients Exploit common support between subcarriers

Abstract Optimality of Frequency Flat Precoding in ...mmwrcn.ece.wisc.edu/wp-uploads/2017/01/channel-estimation-and... · video in real-time (>6Gbps) without HDMI cables ... please

  • Upload
    ngohanh

  • View
    213

  • Download
    0

Embed Size (px)

Citation preview

Channel estimation and hybrid precoding in frequency selective mmWave channels

Wideband channel model

Wideband channel estimation

Nuria González-Prelcic, Robert W. Heath Jr.

Freq. selective hybrid precoding

Finding the precoders with perfect or imperfect CSI

HYBRID PRECODING AND COMBINING

CHANNEL ESTIMATION IN HYBRID ARCHITECTURES

CHANNEL ESTIMATION WITH LOW RESOLUTION ADCS

CHANNEL ESTIMATION & HYBRID PRECODING FOR BROADBAND

NA

RR

OW

BA

ND

# of papers on SP for mmWave

Signal processing challenges @mmWave

Most of prior work on SP for mmWave is

narrowband

high-resolution multi-view video in real-time

(>6Gbps)

without HDMI cables …

High data rate MmmWave applications

High bandwith mmWave channels are

frequency selective Virtual reality

mmWave V2X

Downloading high-definition

3D map data (~Gbyte)

Sharing local sensors information ~ 100x Mbps

Download multimedia data

100x Mbps - Gbps

Strong interference happens much less often

Sidelobe interference is weaker

Rate gain when only accounting for stronger signal

Additional gain from less interference in the narrower beams

pointy beam fat beam

20 x gain

2 x gain

rate

mul

tiplie

r

mmWave cellular

TX and RX array response vectors

Complex gains for the L clusters

Pulse shaping filter evaluated at the delays of each cluster USUALLY NEGLECTED

Optimality of Frequency Flat Precoding inFrequency Selective Millimeter Wave Channels

Kiran Venugopal, Nuria González Prelcic, and Robert W. Heath, Jr.,

Abstract—less than 1/4 page mostly usually very short for a

letter, around 4 sentences

Index Terms—Millimeter wave communications, hybrid archi-

tecture, subspace estimation.

I. INTRODUCTION

1/2 page• MmWave wideband communication, large antenna sys-

tems, hybrid channel estimation, precoding and combin-ing.

• Prior work on hybrid precoding, but this is for frequencyflat channels. Some work on frequency selective case,but this requires more complicated precoding. Some workuses frequency flat precoders designed based on spatialcorrelation, which is invariant to frequency. but this is notnecessarily optimum in frequency selective channels.

• In this paper, we show that frequency flat precoding isoptimum in frequency selective MIMO channels with fewpaths, as found in mmWave systems. This means thatfrequency selective precoding is not necessarily requiredin mmWave MIMO systems. Further, this motivates theuse of compressed subspace estimators, as an alternativeto compressive channel estimation.

no organization needed for letters

II. SYSTEM MODEL

3/4 pages with figure• Frequency selective mmWave channel model with L

clusters• Received Signal model with SC-FDE or OFDM based

transmission-reception.please use standard OS notation like H[d], etc. please use theabbreviations in the input.tex file I added to this directory, youshouldn’t have things like mathbb etc

Consider a wideband mmWave system with Nt transmitantennas and Nr receive antennas. The geometric channelmodel [1], [2] consisting of L scattering clusters is assumed forrepresenting the frequency selective channel. The `th clusterhas a complex gain ↵` 2 C, delay ⌧` 2 R, angles of arrivaland departure (AoA/AoD) �` 2 [0, 2⇡) and ✓` 2 [0, 2⇡),respectively. The delay spread in the channel is capturedusing raised cosine pulse shaping filter response [1] and is

Kiran Venugopal and Robert W. Heath, Jr. are with the University of Texas,Austin, TX, USA. Email: {kiranv, rheath}@utexas.edu

Nuria González Prelcic is with University of Vigo, Spain. Email:[email protected]

This work was supported by XXX.

denoted as prc(⌧). Using the bandlimited nature of the pulseshaping filter, the discrete-time, frequency selective channelwith Nc delay taps can be represented in terms of the antennaarray response vectors of the receiver aR(�`) 2 CNr⇥1 andtransmitter aT(✓`) 2 CNt⇥1, so that the dth delay tap of theMIMO channel

Hd =

rNrNt

L

LX

`=1

↵`prc(dTs � ⌧`)aR(�`)a⇤T(✓`), (1)

for d = 0, 1, · · · , Nc and with Ts denoting the sampling inter-val. Using the geometric channel model in (1), the complexchannel matrix in the frequency domain can be written as

H [k] =Nc�1X

d=0

Hde�j 2⇡kd

K , (2)

which can be compactly written, with �k,` =PNc�1d=0 prc(dTs � ⌧`)e�j 2⇡kd

K , as

H [k] =

rNrNt

L

LX

`=1

↵`�k,`aR(�`)a⇤T(✓`). (3)

Consider a hybrid precoder-receiver architecture [3] for thefrequency selective mmWave system. We assume block trans-mission of length N with zero padding (ZP) or cyclic prefix(CP) of length at least Nc � 1 appended to each transmittedframe as shown in Fig. XXX. The precoder-combiner pairare assumed to be fixed during the transmission of a frame.Appropriate signal processing can be used at the transmitterand the receiver [4], [5] to remove inter-symbol interferenceoccurring during the data transmission, to obtain K parallelnarrowband channels in the frequency domain. With F(m)

and W(m), respectively denoting the precoder and combinerused during the transmission-reception of the mth frame, thereceived symbol post combining in the kth subcarrier can bewritten as

y(m)k = W⇤

(m)H [k]F(m)xk + nk. (4)

III. OPTIMALITY OF FREQUENCY-FLAT PRECODING

1/2 - 1 page• All the matrices corresponding to different MIMO chan-

nel taps belong to the same subspace• Subspace does not change when viewed in the frequency

domain, and across subcarriers• paragraph on why subspace estimation is useful here and

how it can be used for precoding

Optimality of Frequency Flat Precoding inFrequency Selective Millimeter Wave Channels

Kiran Venugopal, Nuria González Prelcic, and Robert W. Heath, Jr.,

Abstract—less than 1/4 page mostly usually very short for a

letter, around 4 sentences

Index Terms—Millimeter wave communications, hybrid archi-

tecture, subspace estimation.

I. INTRODUCTION

1/2 page• MmWave wideband communication, large antenna sys-

tems, hybrid channel estimation, precoding and combin-ing.

• Prior work on hybrid precoding, but this is for frequencyflat channels. Some work on frequency selective case,but this requires more complicated precoding. Some workuses frequency flat precoders designed based on spatialcorrelation, which is invariant to frequency. but this is notnecessarily optimum in frequency selective channels.

• In this paper, we show that frequency flat precoding isoptimum in frequency selective MIMO channels with fewpaths, as found in mmWave systems. This means thatfrequency selective precoding is not necessarily requiredin mmWave MIMO systems. Further, this motivates theuse of compressed subspace estimators, as an alternativeto compressive channel estimation.

no organization needed for letters

II. SYSTEM MODEL

3/4 pages with figure• Frequency selective mmWave channel model with L

clusters• Received Signal model with SC-FDE or OFDM based

transmission-reception.please use standard OS notation like H[d], etc. please use theabbreviations in the input.tex file I added to this directory, youshouldn’t have things like mathbb etc

Consider a wideband mmWave system with Nt transmitantennas and Nr receive antennas. The geometric channelmodel [1], [2] consisting of L scattering clusters is assumed forrepresenting the frequency selective channel. The `th clusterhas a complex gain ↵` 2 C, delay ⌧` 2 R, angles of arrivaland departure (AoA/AoD) �` 2 [0, 2⇡) and ✓` 2 [0, 2⇡),respectively. The delay spread in the channel is capturedusing raised cosine pulse shaping filter response [1] and is

Kiran Venugopal and Robert W. Heath, Jr. are with the University of Texas,Austin, TX, USA. Email: {kiranv, rheath}@utexas.edu

Nuria González Prelcic is with University of Vigo, Spain. Email:[email protected]

This work was supported by XXX.

denoted as prc(⌧). Using the bandlimited nature of the pulseshaping filter, the discrete-time, frequency selective channelwith Nc delay taps can be represented in terms of the antennaarray response vectors of the receiver aR(�`) 2 CNr⇥1 andtransmitter aT(✓`) 2 CNt⇥1, so that the dth delay tap of theMIMO channel

Hd =

rNrNt

L

LX

`=1

↵`prc(dTs � ⌧`)aR(�`)a⇤T(✓`), (1)

for d = 0, 1, · · · , Nc and with Ts denoting the sampling inter-val. Using the geometric channel model in (1), the complexchannel matrix in the frequency domain can be written as

H [k] =Nc�1X

d=0

Hde�j 2⇡kd

K , (2)

which can be compactly written, with �k,` =PNc�1d=0 prc(dTs � ⌧`)e�j 2⇡kd

K , as

H [k] =

rNrNt

L

LX

`=1

↵`�k,`aR(�`)a⇤T(✓`). (3)

Consider a hybrid precoder-receiver architecture [3] for thefrequency selective mmWave system. We assume block trans-mission of length N with zero padding (ZP) or cyclic prefix(CP) of length at least Nc � 1 appended to each transmittedframe as shown in Fig. XXX. The precoder-combiner pairare assumed to be fixed during the transmission of a frame.Appropriate signal processing can be used at the transmitterand the receiver [4], [5] to remove inter-symbol interferenceoccurring during the data transmission, to obtain K parallelnarrowband channels in the frequency domain. With F(m)

and W(m), respectively denoting the precoder and combinerused during the transmission-reception of the mth frame, thereceived symbol post combining in the kth subcarrier can bewritten as

y(m)k = W⇤

(m)H [k]F(m)xk + nk. (4)

III. OPTIMALITY OF FREQUENCY-FLAT PRECODING

1/2 - 1 page• All the matrices corresponding to different MIMO chan-

nel taps belong to the same subspace• Subspace does not change when viewed in the frequency

domain, and across subcarriers• paragraph on why subspace estimation is useful here and

how it can be used for precoding

Optimality of Frequency Flat Precoding inFrequency Selective Millimeter Wave Channels

Kiran Venugopal, Nuria González Prelcic, and Robert W. Heath, Jr.,

Abstract—less than 1/4 page mostly usually very short for a

letter, around 4 sentences

Index Terms—Millimeter wave communications, hybrid archi-

tecture, subspace estimation.

I. INTRODUCTION

1/2 page• MmWave wideband communication, large antenna sys-

tems, hybrid channel estimation, precoding and combin-ing.

• Prior work on hybrid precoding, but this is for frequencyflat channels. Some work on frequency selective case,but this requires more complicated precoding. Some workuses frequency flat precoders designed based on spatialcorrelation, which is invariant to frequency. but this is notnecessarily optimum in frequency selective channels.

• In this paper, we show that frequency flat precoding isoptimum in frequency selective MIMO channels with fewpaths, as found in mmWave systems. This means thatfrequency selective precoding is not necessarily requiredin mmWave MIMO systems. Further, this motivates theuse of compressed subspace estimators, as an alternativeto compressive channel estimation.

no organization needed for letters

II. SYSTEM MODEL

3/4 pages with figure• Frequency selective mmWave channel model with L

clusters• Received Signal model with SC-FDE or OFDM based

transmission-reception.please use standard OS notation like H[d], etc. please use theabbreviations in the input.tex file I added to this directory, youshouldn’t have things like mathbb etc

Consider a wideband mmWave system with Nt transmitantennas and Nr receive antennas. The geometric channelmodel [1], [2] consisting of L scattering clusters is assumed forrepresenting the frequency selective channel. The `th clusterhas a complex gain ↵` 2 C, delay ⌧` 2 R, angles of arrivaland departure (AoA/AoD) �` 2 [0, 2⇡) and ✓` 2 [0, 2⇡),respectively. The delay spread in the channel is capturedusing raised cosine pulse shaping filter response [1] and is

Kiran Venugopal and Robert W. Heath, Jr. are with the University of Texas,Austin, TX, USA. Email: {kiranv, rheath}@utexas.edu

Nuria González Prelcic is with University of Vigo, Spain. Email:[email protected]

This work was supported by XXX.

denoted as prc(⌧). Using the bandlimited nature of the pulseshaping filter, the discrete-time, frequency selective channelwith Nc delay taps can be represented in terms of the antennaarray response vectors of the receiver aR(�`) 2 CNr⇥1 andtransmitter aT(✓`) 2 CNt⇥1, so that the dth delay tap of theMIMO channel

Hd =

rNrNt

L

LX

`=1

↵`prc(dTs � ⌧`)aR(�`)a⇤T(✓`), (1)

for d = 0, 1, · · · , Nc and with Ts denoting the sampling inter-val. Using the geometric channel model in (1), the complexchannel matrix in the frequency domain can be written as

H [k] =Nc�1X

d=0

Hde�j 2⇡kd

K , (2)

which can be compactly written, with �k,` =PNc�1d=0 prc(dTs � ⌧`)e�j 2⇡kd

K , as

H [k] =

rNrNt

L

LX

`=1

↵`�k,`aR(�`)a⇤T(✓`). (3)

Consider a hybrid precoder-receiver architecture [3] for thefrequency selective mmWave system. We assume block trans-mission of length N with zero padding (ZP) or cyclic prefix(CP) of length at least Nc � 1 appended to each transmittedframe as shown in Fig. XXX. The precoder-combiner pairare assumed to be fixed during the transmission of a frame.Appropriate signal processing can be used at the transmitterand the receiver [4], [5] to remove inter-symbol interferenceoccurring during the data transmission, to obtain K parallelnarrowband channels in the frequency domain. With F(m)

and W(m), respectively denoting the precoder and combinerused during the transmission-reception of the mth frame, thereceived symbol post combining in the kth subcarrier can bewritten as

y(m)k = W⇤

(m)H [k]F(m)xk + nk. (4)

III. OPTIMALITY OF FREQUENCY-FLAT PRECODING

1/2 - 1 page• All the matrices corresponding to different MIMO chan-

nel taps belong to the same subspace• Subspace does not change when viewed in the frequency

domain, and across subcarriers• paragraph on why subspace estimation is useful here and

how it can be used for precoding

Nc is the delay tap length

Time domain

Frequency domain

sampled response for the dth delay tap

Baseband Precoding

1-bit ADC DAC

1-bit ADC DAC

RF Chain

RF Precoding

1-bit ADC ADC

1-bit ADC ADC

Baseband Combining

Nt Nr Lt Lr Ns Ns

RF Combining

FBB FRF WBB WRF

RF Chain

RF Chain

RF Chain

3

+

+

+

FRF

RFPrecoder

RF Chain

NRF

BasebandPrecoderNS

K-point IFFT

K-point IFFT

Digital Precoding

F{ }k

NRF

AddCP

RF Chain

NBS

+

+

+

BasebandPrecoderNMS NRF NRF NS

RF Chain

RF Chain

Delete CP

Delete CP

K-point FFT

K-point FFT

RFCombinerWRF

Digital Combining

{ }kW

AddCP

Fig. 1. A block diagram of the OFDM based BS-MS transceiver that employs hybrid analog/digital precoding.

of length D is then added to the symbol blocks before applying the NBS ⇥NRF RF precoding

FRF. It is important to emphasize here that the RF precoding matrix FRF is the same for all

subcarriers. This means that the RF precoder is assumed to be frequency flat while the baseband

precoders can be different for each subcarrier. The discrete-time transmitted complex baseband

signal at subcarrier k can therefore be written as

x[k] = FRFF[k]s[k], (1)

y[k] = H[k]FRFF[k]s[k] + n[k] (2)

where s[k] is the NS⇥1 transmitted vector at subcarrier k, such that E [s[k]s[k]⇤] = PKNS

INS , and

P is the average total transmit power. Since FRF is implemented using analog phase shifters,

its entries are of constant modulus. To reflect that, we normalize the entries�

[FRF]m,n

2

= 1.

Further, we assume that the angles of the analog phase shifters are quantized and have a finite set

of possible values. With these assumptions, [FRF]m,n = ej�m,n , where �m.n is a quantized angle.

The hybrid precoders are assumed to have a unitary power constraint,i.e., they meet FRFF[k] 2

UNBS⇥NS , with the set of semi-unitary matrices UNBS⇥NS =

U 2 CNBS⇥NS |U⇤U = I

.

At the MS, assuming perfect carrier and frequency offset synchronization, the received signal

is first combined in the RF domain using the NMS ⇥ NRF combining matrix WRF. Then, the

cyclic prefix is removed, and the symbols are returned back to the frequency domain where the

symbols at each subcarrier k are combined using the NRF⇥NS digital combining matrix W[k].

Denoting the NMS⇥NBS channel matrix at subcarrier k as H[k], the received signal at subcarrier

k after processing can be then expressed as

y[k] = W[k]⇤W⇤RFH[k]FRFF[k]s[k] +W[k]⇤W⇤

RFn[k], (3)

where n[k] ⇠ N (0, �2NI) is a Gaussian noise vector.

March 10, 2016 DRAFT

Received signal at subcarrier k

RF precoding in the time domain – common for all

subcarriers

Baseband precoding can be designed per

subcarrier

4

III. PROBLEM FORMULATION

The paper objective is to develop a low-complexity hybrid precoding design to maximize the

achievable system spectral efficiency. Given the system model in Section II. For simplicity of

exposition, we will assume that the receiver can perform optimal nearest neighbor decoding based

on the NMS-dimensional received signal with fully digital hardware. This allows decoupling the

transceiver design problem, and focusing on the hybrid precoders design to maximize the mutual

information of the system [10], defined as

I⇣

FRF, {F[k]}Kk=1

=

1

K

KX

k=1

log2

INMS +

NS

H[k]FRFF[k]F[k]⇤F⇤

RFH[k]⇤�

, (4)

where ⇢ =

PK�2 is the SNR. As combining with fully digital hardware is not a practical mmWave

solution, the hybrid combining design problem needs also to be considered. The design ideas

that will be given in this paper for the hybrid precoders, however, provide direct tools for

constructing the hybrid combining matrices, WRF, {W[k]}Kk=1, and is therefore omitted due to

space limitations.

If the RF beamforming vectors are taken from a codebook FRF that captures the RF hardware

constraints, then the maximum mutual information under the given hybrid precoding model isn

F?RF, {F?

[k]}Kk=1

o

=arg max

FRF,{F[k]}Kk=1

1

K

KX

k=1

log2

INMS +

NS

H[k]FRFF[k]F[k]⇤F⇤

RFH[k]⇤�

s.t. [FRF]:,r 2 FRF, r = 1, ..., NRF

FRFF[k] 2 UNBS⇥NRF , k = 1, 2, ..., K.(5)

One challenge of the hybrid precoding design to solve the optimization problem in (5) is the

coupling between baseband and RF precoders that arises in the power constraint (the second

constraint of (5)). In the following proposition, we show that the baseband precoders can be

written optimally as a function of the RF precoders.

Proposition 1: Define the SVD decompositions of the matrices H[k] = U[k]⌃[k]V[k]⇤ and

⌃[k]V[k]⇤FRF (F⇤RFFRF)

� 12= U[k]⌃[k]V[k]⇤, then the baseband precoders {F[k]}Kk=1 that solve

(5) are given by

F[k]? = (F⇤RFFRF)

� 12⇥

V[k]⇤

:,1:NS, k = 1, 2, ..., K. (6)

March 10, 2016 DRAFT

RF precoders are taken from quantized codebooks (hardware

constraints) Unitary power constraint

Assumes optimal combining at RX

Total power constraint also possible

Perfect CSI

Design the hybrid

precoders to maximize

mutual information

[1]

Imperfect CSI

Design the hybrid

precoders from the

covariance [2]

Formulation in the SU scenario

Formulation in a MU scenario

Asilomar 2016Jose P. Gonzalez-Coma⇤, Nuria Gonzalez-Prelcic†, Luis Castedo⇤ and Robert W. Heath‡

⇤Universidade da Coruna, Spain †Universidade de Vigo, Spain ‡The University of Texas at AustinEmail:{jose.gcoma⇤,luis⇤}@udc.es, [email protected], [email protected]

Abstract—

I. INTRODUCTION

II. SYSTEM MODEL

Let us consider the downlink of a cellular system where aBase Station (BS) equipped with NT antennas communicateswith U non-cooperative Mobile Users (MU) deploying NR,jantennas each. Moreover, the number of RF chains at the BSand the MU’s are LBS and LR,j , respectively. Several datastreams Ns,j LR,j for each user are independently andsimultaneously transmitted. Additionally, we assume the totalnumber of data streams smaller than the number of RF chainsat the BS, Ns =

PUj=1 Ns,j LBS.

We denote sj the data for user j, with E[sj ] =

0, E[sjsHj ] = IMj and E[sksHj ] = 0 for j 6= k. The data arelinearly processed in two stages with the baseband precoderP j

BB 2 CLBS⇥Ns,j and the analog precoder PRF 2 CNT⇥LBS . Atthe MU side, the received signal is linearly filtered with theanalog and baseband combiners, i.e., W j

RF 2 CNR,j⇥LR,j andW j

BB 2 CLR,j⇥Ns,j . Since the RF filters are implemented usinganalog phase shifters, its entries are restricted to a constantmodulus |[PRF]i,j |2 = 1, |[W j

RF]m,n|2 = 1.The d-delay channel model Hj,d 2 CNR,j⇥NT is considered

for each user [?]

Hj,d =

sNTNR

Np

NpX

`=1

↵j,`prc,j(dTs�⌧j,`)aMU,j(✓j,`)aHBS(�j,`),

(1)where Np is the number of channel paths, NR =

PUj=1 NR,j ,

aMU,j(✓j,`) 2 CNR,j and aHBS(�j,`) 2 CNT are the array

response vectors of the receiver and the transmitter, Ts is thesampling interval ⌧j,` is the delay, and ↵j,` is the gain. In thefrequency domain, it can be rewritten as

Hj [k] =DX

d=0

Hj,de�j 2⇡kdL (2)

Along this work, we assume an OFDM modulation with Lsubcarriers and cyclic prefix large enough to avoid intercarrierinterference. Then, we get L equivalent narrow-band channels,and the received signal at user j and subcarrier k is given by

yj [k] = Hj [k]XU

i=1PRFP

iBB[k]si[k] + nj . (3)

We assume a power constraint for each user and subcarrierkPRFP

kBBk2F = Ptx/(KMk), and define the SNR as SNR =

Ptx/�2n. The corresponding estimated data read as

ˆsk[l] = W k,HBB [l]W k,H

RF xk[l], (4)

with the AWGN nk ⇠ NC(0,�2nINR,k). Let us now intro-

duce the system model for the uplink considering channelreciprocity, i.e., HH

k is the channel for the user k in theuplink. The precoders and combiners in the uplink are thenT k

BB 2 CLR,k⇥Ns,k , T kRF 2 CNR,k⇥Ns,k and FRF 2 CNT⇥LBS ,

F kBB 2 CLBS⇥Ns,k , and the noise is n ⇠ NC(0,�2

nINT).Accordingly, the received signal in the uplink reads as

x[l] =KX

i=1

HHi [l]T i

RFTiBB[l]si[l] + n, (5)

where the RF precoders and combiners are, again, frequencyflat. The received signal at subcarrier l is filtered with thetwo-stage combiners to obtain the estimated data ˆsUL

k [l] =

F k,HBB [l]FH

RFx[l].The system model in the downlink yields the following

achievable sum-rateKX

k=1

Rk =

1

L

KX

k=1

LX

l=1

log2 det�INs,k +X�1

k [l]WHk [l]Hk[l]Pk[l]

⇥PHk [l]HH

k [l]Wk[l]�, (6)

with Xk[l] =

Pi 6=k W

Hk [l]Hk[l]Pi[l]PH

i [l]HHk [l]Wk[l] +

WHk [l]C

n

Wk[l] containing the noise and the interference, andthe hybrid precoders and combiners Pk[l] = PRFP

kBB[l] and

Wk[l] = W kRF[l], respectively.

III. PROBLEM FORMULATION

In this work we study the design of the precoders andcombiners to maximize (6). It is important to note thataddressing this problem is a very complicated task even fordigital solutions. Additionally, we need to take the hardwareconstraints of mmWave systems into account.

To consider perfect channel state information is ratherunrealistic. Channel estimation is also challenging because ofthe large number of antennas. Moreover, due to the particularcharacteristics of the mmWave system, it is not possible toaccess directly to the received signal, and only a versionaffected by the RF filtering is available.

A different approach is to consider that the autocorrelationof the received signal, and the crosscorrelation of the trans-mitted and received signal are estimated.

Under the former considerations, a different metric presentsitself as an adequate alternative to design the filters. TheMSE metric depends on the above mentioned correlations,and knowledge of the true channel realization is not needed.

1.  Design downlink hybrid combiners using covariance estimates and MSE 2.  Use reciprocity to find the precoders during the uplink phase 3.  Decompose the precoders combiners using an iterative method that

alternates between RF and BB updates

MIMO -FDM system

BB precoder for user i subband k

Asilomar 2016Jose P. Gonzalez-Coma⇤, Nuria Gonzalez-Prelcic†, Luis Castedo⇤ and Robert W. Heath‡

⇤Universidade da Coruna, Spain †Universidade de Vigo, Spain ‡The University of Texas at AustinEmail:{jose.gcoma⇤,luis⇤}@udc.es, [email protected], [email protected]

Abstract—

I. INTRODUCTION

II. SYSTEM MODEL

Let us consider the downlink of a cellular system where aBase Station (BS) equipped with NT antennas communicateswith U non-cooperative Mobile Users (MU) deploying NR,jantennas each. Moreover, the number of RF chains at the BSand the MU’s are LBS and LR,j , respectively. Several datastreams Ns,j LR,j for each user are independently andsimultaneously transmitted. Additionally, we assume the totalnumber of data streams smaller than the number of RF chainsat the BS, Ns =

PUj=1 Ns,j LBS.

We denote sj the data for user j, with E[sj ] =

0, E[sjsHj ] = IMj and E[sksHj ] = 0 for j 6= k. The data arelinearly processed in two stages with the baseband precoderP j

BB 2 CLBS⇥Ns,j and the analog precoder PRF 2 CNT⇥LBS . Atthe MU side, the received signal is linearly filtered with theanalog and baseband combiners, i.e., W j

RF 2 CNR,j⇥LR,j andW j

BB 2 CLR,j⇥Ns,j . Since the RF filters are implemented usinganalog phase shifters, its entries are restricted to a constantmodulus |[PRF]i,j |2 = 1, |[W j

RF]m,n|2 = 1.The d-delay channel model Hj,d 2 CNR,j⇥NT is considered

for each user [?]

Hj,d =

sNTNR

Np

NpX

`=1

↵j,`prc,j(dTs�⌧j,`)aMU,j(✓j,`)aHBS(�j,`),

(1)where Np is the number of channel paths, NR =

PUj=1 NR,j ,

aMU,j(✓j,`) 2 CNR,j and aHBS(�j,`) 2 CNT are the array

response vectors of the receiver and the transmitter, Ts is thesampling interval ⌧j,` is the delay, and ↵j,` is the gain. In thefrequency domain, it can be rewritten as

Hj [k] =DX

d=0

Hj,de�j 2⇡kdL (2)

Along this work, we assume an OFDM modulation with Lsubcarriers and cyclic prefix large enough to avoid intercarrierinterference. Then, we get L equivalent narrow-band channels,and the received signal at user j and subcarrier k is given by

yj [k] = Hj [k]XU

i=1PRFP

iBB[k]si[k] + nj . (3)

We assume a power constraint for each user and subcarrierkPRFP

iBBk2F = Ptx/(UL), and define the SNR as SNR =

Ptx/�2n. The corresponding estimated data read as

ˆsj [k] = W j,HBB [k]W j,H

RF yj [k], (4)

with the AWGN nk ⇠ NC(0,�2nINR,k). Let us now intro-

duce the system model for the uplink considering channelreciprocity, i.e., HH

k is the channel for the user k in theuplink. The precoders and combiners in the uplink are thenT k

BB 2 CLR,k⇥Ns,k , T kRF 2 CNR,k⇥Ns,k and FRF 2 CNT⇥LBS ,

F kBB 2 CLBS⇥Ns,k , and the noise is n ⇠ NC(0,�2

nINT).Accordingly, the received signal in the uplink reads as

x[l] =KX

i=1

HHi [l]T i

RFTiBB[l]si[l] + n, (5)

where the RF precoders and combiners are, again, frequencyflat. The received signal at subcarrier l is filtered with thetwo-stage combiners to obtain the estimated data ˆsUL

k [l] =

F k,HBB [l]FH

RFx[l].The system model in the downlink yields the following

achievable sum-rateKX

k=1

Rk =

1

L

KX

k=1

LX

l=1

log2 det�INs,k +X�1

k [l]WHk [l]Hk[l]Pk[l]

⇥PHk [l]HH

k [l]Wk[l]�, (6)

with Xk[l] =

Pi 6=k W

Hk [l]Hk[l]Pi[l]PH

i [l]HHk [l]Wk[l] +

WHk [l]C

n

Wk[l] containing the noise and the interference, andthe hybrid precoders and combiners Pk[l] = PRFP

kBB[l] and

Wk[l] = W kRF[l], respectively.

III. PROBLEM FORMULATION

In this work we study the design of the precoders andcombiners to maximize (6). It is important to note thataddressing this problem is a very complicated task even fordigital solutions. Additionally, we need to take the hardwareconstraints of mmWave systems into account.

To consider perfect channel state information is ratherunrealistic. Channel estimation is also challenging because ofthe large number of antennas. Moreover, due to the particularcharacteristics of the mmWave system, it is not possible toaccess directly to the received signal, and only a versionaffected by the RF filtering is available.

A different approach is to consider that the autocorrelationof the received signal, and the crosscorrelation of the trans-mitted and received signal are estimated.

Under the former considerations, a different metric presentsitself as an adequate alternative to design the filters. TheMSE metric depends on the above mentioned correlations,and knowledge of the true channel realization is not needed.

postcombining rx signal user for user j

[1] A. Alkhateeb and R. W. Heath Jr., “Frequency selective hybrid precoding for limited feedback millimeter wave systems,” IEEE Transactions on Communications, 2016 [2] José P. González-Coma, Nuria González-Prelcic, Luis Castedo and Robert W. Heath Jr., “Frequency selective multiuser hybrid precoding for mmWave systems with imperfect channel knowledge”, Asilomar 2016 [3] K. Venugopal, A. Alkhateeb, N. González Prelcic, and R. W. Heath, Jr, “Channel Estimation for Hybrid Architecture Based Wideband Millimeter Wave Systems”, submitted to IEEE JSAC, 2016 [4] J. Rodríguez-Fernández, K. Venugopal, N. González Prelcic, and R. W. Heath, Jr, “A Frequency-Domain Approach to Wideband Channel Estimation in Millimeter Wave Systems”, submitted to ICC 2017.

References to on going work

Dictionary with columns

...

Random beamforming matrices

p⇢(S(1) ⌦ fT

1

⌦w⇤1

)(INc ⌦Ac

tx

⌦Arx

)x+ v(1)

p⇢(S(M) ⌦ fT

M

⌦w⇤M

)(INc ⌦Ac

tx

⌦Arx

)x+ v(M)

{�`

, ✓`

, ↵`

, ⌧`

}

Hd

2 Nr⇥Nt

d = 0, 1, ... Nc

� 1

acT

(

˜�x

)⌦ aR

(

˜✓y

)

2

Measurement 1

Measurement M Quantized grid of AoA/AoD

... training phase, so that the post combining signal is2

66664

y(m)

1

y(m)

2

...y(m)

N

3

77775

T

=p⇢w⇤

m

⇥H

0

· · · HNc�1

⇤S(m)T⌦ f

m

+ e(m)T , (9)

where S(m) =

2

66664

s(m)

1

0 · · · 0

s(m)

2

s(m)

1

· · · ....

.... . .

...s(m)

N

· · · · · · s(m)

N�Nc+1

3

77775. (10)

The use of block transmission with Nc

� 1 zero padding isimportant here, since it would allow for reconfiguring the RFcircuits from one frame to the other and avoids loss of trainingdata during this reconfiguration. This would also avoid interframe interference. Also note that for symboling rate of morethan 1 Gbps (the chip rate used in IEEE 802.11ad preamble,for example, is 1760 MHZ), it is impractical to use differentprecoders and combiners for different symbols. It is morefeasible, however, to change the RF circuitry for differentframes with N ⇠ 64� 512 denoting the frame length in (10).Vectorizing (9) gives

y(m) =p⇢S(m) ⌦ fT

m

⌦w⇤m

2

6664

vec(H0

)vec(H

1

)...

vec(HNc�1

)

3

7775+ e(m). (11)

To formulate the compressive sensing problem we first exploitthe sparse nature of the channel in the angular domain.Accordingly, we assume that the AoAs and AoDs are drawnfrom an angle grid on G

r

and Gt

, respectively. Neglecting thegrid quantization error, we can then express (11) as

y(m)=p⇢⇣S(m)⌦fT

m

⌦w⇤m

⌘�INc⌦A

tx

⌦Arx

�x+ e(m), (12)

where Atx

and Arx

are the dictionary matrices used for sparserecovery. The N

t

⇥Gt

matrix Atx

consists of columns aT

(✓x

),with ✓

x

drawn from a quantized angle grid of size Gt

, andthe N

r

⇥Gr

matrix Arx

consists of columns aR

(�x

), with �x

drawn from a quantized angle grid of size Gr

. The signal xconsists of the channel gains and pulse shaping filter response,and is of size N

c

Gr

Gt

⇥ 1.Next the band-limited nature of the sampled pulse shaping

filter is used to make the measurement vector more sparse.Define

pn

(⌧) = prc

(n� ⌧) (13)and �

ps

(n) = diag ([pn

(⌧1

) pn

(⌧2

) · · · pn

(⌧L

)]) . (14)

Using (13) and (14), (6) can be written as

vec(Hd

) =

rN

t

Nt

L

�A

T

�AR

��

ps

(dTs

)

2

6664

↵1

↵2

...↵L

3

7775. (15)

Next, we look at the sampled version of the pulse-shapingfilter p

n

having entries pn

(k), for n = 1, 2, · · · , Nc

andk = 1, 2, · · · , G

c

. Then, neglecting the quantization errordue to sampling in the delay domain, we can write (12) as

y(m)=p⇢⇣S(m)⌦fT

m

⌦w⇤m

⌘�INc⌦A

tx

⌦Arx

��x+ e(m),

where � =

2

6664

IGrGt ⌦ pT

1

IGrGt ⌦ pT

2

...IGrGt ⌦ pT

Nc

3

7775,

and x is Gc

Gr

Gt

⇥ 1 sparse vector containing the complexchannel gains.

y(1)=p⇢⇣S(1)⌦fT

1

⌦w⇤1

⌘�INc⌦A

tx

⌦Arx

��x+ e(1)

Stacking M such measurements obtained from sending Mtraining frames and using different RF precoder and combinerfor each frame, we have

y =p⇢� x+ e, (16)

where y =

2

6664

y(1)

y(2)

...y(M)

3

77752 CNM⇥1 (17)

is the measured signal,

� =

2

6664

S(1)⌦fT1

⌦w⇤1

S(2)⌦fT2

⌦w⇤2

...S(M)⌦fT

M

⌦w⇤M

3

77752 CNM⇥NcNrNt (18)

is the measurement matrix, and

=�INc ⌦ A

tx

⌦Arx

�� (19)

=

2

6664

�A

tx

⌦Arx

�⌦ pT

1�A

tx

⌦Arx

�⌦ pT

2

...�A

tx

⌦Arx

�⌦ pT

Nc

3

77752 CNcNrNt⇥GcGrGt (20)

is the dictionary. The beamforming and combining vectorsfm

, wm

, m = 1, 2, · · · , M used for training have thephase angles chosen uniformly at random from the set A in(3).

AoA/AoD estimation With the sparse formulation of themmWave channel estimation problem in (16), compressedsensing tools can be first used to estimate the AoA and AoD.Note that we can increase or decrease G

r

, Gt

and Gc

to meetthe required level of sparsity. As the sensing matrix is knownat the receiver, sparse recovery algorithms can be used toestimate the AoA and AoD. Following this, the channel gainscan be estimated to minimize the minimum mean squarederror or via least squares by plugging in the columns of thedictionary matrices corresponding to the estimated AoA andAoD.

training phase, so that the post combining signal is2

66664

y(m)

1

y(m)

2

...y(m)

N

3

77775

T

=p⇢w⇤

m

⇥H

0

· · · HNc�1

⇤S(m)T⌦ f

m

+ e(m)T , (9)

where S(m) =

2

66664

s(m)

1

0 · · · 0

s(m)

2

s(m)

1

· · · ....

.... . .

...s(m)

N

· · · · · · s(m)

N�Nc+1

3

77775. (10)

The use of block transmission with Nc

� 1 zero padding isimportant here, since it would allow for reconfiguring the RFcircuits from one frame to the other and avoids loss of trainingdata during this reconfiguration. This would also avoid interframe interference. Also note that for symboling rate of morethan 1 Gbps (the chip rate used in IEEE 802.11ad preamble,for example, is 1760 MHZ), it is impractical to use differentprecoders and combiners for different symbols. It is morefeasible, however, to change the RF circuitry for differentframes with N ⇠ 64� 512 denoting the frame length in (10).Vectorizing (9) gives

y(m) =p⇢S(m) ⌦ fT

m

⌦w⇤m

2

6664

vec(H0

)vec(H

1

)...

vec(HNc�1

)

3

7775+ e(m). (11)

To formulate the compressive sensing problem we first exploitthe sparse nature of the channel in the angular domain.Accordingly, we assume that the AoAs and AoDs are drawnfrom an angle grid on G

r

and Gt

, respectively. Neglecting thegrid quantization error, we can then express (11) as

y(m)=p⇢⇣S(m)⌦fT

m

⌦w⇤m

⌘�INc⌦A

tx

⌦Arx

�x+ e(m), (12)

where Atx

and Arx

are the dictionary matrices used for sparserecovery. The N

t

⇥Gt

matrix Atx

consists of columns aT

(✓x

),with ✓

x

drawn from a quantized angle grid of size Gt

, andthe N

r

⇥Gr

matrix Arx

consists of columns aR

(�x

), with �x

drawn from a quantized angle grid of size Gr

. The signal xconsists of the channel gains and pulse shaping filter response,and is of size N

c

Gr

Gt

⇥ 1.Next the band-limited nature of the sampled pulse shaping

filter is used to make the measurement vector more sparse.Define

pn

(⌧) = prc

(n� ⌧) (13)and �

ps

(n) = diag ([pn

(⌧1

) pn

(⌧2

) · · · pn

(⌧L

)]) . (14)

Using (13) and (14), (6) can be written as

vec(Hd

) =

rN

t

Nt

L

�A

T

�AR

��

ps

(dTs

)

2

6664

↵1

↵2

...↵L

3

7775. (15)

Next, we look at the sampled version of the pulse-shapingfilter p

n

having entries pn

(k), for n = 1, 2, · · · , Nc

andk = 1, 2, · · · , G

c

. Then, neglecting the quantization errordue to sampling in the delay domain, we can write (12) as

y(m)=p⇢⇣S(m)⌦fT

m

⌦w⇤m

⌘�INc⌦A

tx

⌦Arx

��x+ e(m),

where � =

2

6664

IGrGt ⌦ pT

1

IGrGt ⌦ pT

2

...IGrGt ⌦ pT

Nc

3

7775,

and x is Gc

Gr

Gt

⇥ 1 sparse vector containing the complexchannel gains.

y(M)=p⇢⇣S(M)⌦fT

M

⌦w⇤M

⌘�INc⌦A

tx

⌦Arx

��x+ e(M)

Stacking M such measurements obtained from sending Mtraining frames and using different RF precoder and combinerfor each frame, we have

y =p⇢� x+ e, (16)

where y =

2

6664

y(1)

y(2)

...y(M)

3

77752 CNM⇥1 (17)

is the measured signal,

� =

2

6664

S(1)⌦fT1

⌦w⇤1

S(2)⌦fT2

⌦w⇤2

...S(M)⌦fT

M

⌦w⇤M

3

77752 CNM⇥NcNrNt (18)

is the measurement matrix, and

=�INc ⌦ A

tx

⌦Arx

�� (19)

=

2

6664

�A

tx

⌦Arx

�⌦ pT

1�A

tx

⌦Arx

�⌦ pT

2

...�A

tx

⌦Arx

�⌦ pT

Nc

3

77752 CNcNrNt⇥GcGrGt (20)

is the dictionary. The beamforming and combining vectorsfm

, wm

, m = 1, 2, · · · , M used for training have thephase angles chosen uniformly at random from the set A in(3).

AoA/AoD estimation With the sparse formulation of themmWave channel estimation problem in (16), compressedsensing tools can be first used to estimate the AoA and AoD.Note that we can increase or decrease G

r

, Gt

and Gc

to meetthe required level of sparsity. As the sensing matrix is knownat the receiver, sparse recovery algorithms can be used toestimate the AoA and AoD. Following this, the channel gainscan be estimated to minimize the minimum mean squarederror or via least squares by plugging in the columns of thedictionary matrices corresponding to the estimated AoA andAoD.

Stack M measurements

training phase, so that the post combining signal is2

66664

y(m)

1

y(m)

2

...y(m)

N

3

77775

T

=p⇢w⇤

m

⇥H

0

· · · HNc�1

⇤S(m)T⌦ f

m

+ e(m)T , (9)

where S(m) =

2

66664

s(m)

1

0 · · · 0

s(m)

2

s(m)

1

· · · ....

.... . .

...s(m)

N

· · · · · · s(m)

N�Nc+1

3

77775. (10)

The use of block transmission with Nc

� 1 zero padding isimportant here, since it would allow for reconfiguring the RFcircuits from one frame to the other and avoids loss of trainingdata during this reconfiguration. This would also avoid interframe interference. Also note that for symboling rate of morethan 1 Gbps (the chip rate used in IEEE 802.11ad preamble,for example, is 1760 MHZ), it is impractical to use differentprecoders and combiners for different symbols. It is morefeasible, however, to change the RF circuitry for differentframes with N ⇠ 64� 512 denoting the frame length in (10).Vectorizing (9) gives

y(m) =p⇢S(m) ⌦ fT

m

⌦w⇤m

2

6664

vec(H0

)vec(H

1

)...

vec(HNc�1

)

3

7775+ e(m). (11)

To formulate the compressive sensing problem we first exploitthe sparse nature of the channel in the angular domain.Accordingly, we assume that the AoAs and AoDs are drawnfrom an angle grid on G

r

and Gt

, respectively. Neglecting thegrid quantization error, we can then express (11) as

y(m)=p⇢⇣S(m)⌦fT

m

⌦w⇤m

⌘�INc⌦A

tx

⌦Arx

�x+ e(m), (12)

where Atx

and Arx

are the dictionary matrices used for sparserecovery. The N

t

⇥Gt

matrix Atx

consists of columns aT

(✓x

),with ✓

x

drawn from a quantized angle grid of size Gt

, andthe N

r

⇥Gr

matrix Arx

consists of columns aR

(�x

), with �x

drawn from a quantized angle grid of size Gr

. The signal xconsists of the channel gains and pulse shaping filter response,and is of size N

c

Gr

Gt

⇥ 1.Next the band-limited nature of the sampled pulse shaping

filter is used to make the measurement vector more sparse.Define

pn

(⌧) = prc

(n� ⌧) (13)and �

ps

(n) = diag ([pn

(⌧1

) pn

(⌧2

) · · · pn

(⌧L

)]) . (14)

Using (13) and (14), (6) can be written as

vec(Hd

) =

rN

t

Nt

L

�A

T

�AR

��

ps

(dTs

)

2

6664

↵1

↵2

...↵L

3

7775. (15)

Next, we look at the sampled version of the pulse-shapingfilter p

i

having entries pi

(n), for i = 1, 2, · · · , Nc

andn = 1, 2, · · · , G

c

. Then, neglecting the quantization errordue to sampling in the delay domain, we can write (12) as

y(m)=p⇢⇣S(m)⌦fT

m

⌦w⇤m

⌘�INc⌦A

tx

⌦Arx

��x+ e(m),

where � =

2

6664

IGrGt ⌦ pT

1

IGrGt ⌦ pT

2

...IGrGt ⌦ pT

Nc

3

7775,

and x is Gc

Gr

Gt

⇥ 1 sparse vector containing the complexchannel gains.

Stacking M such measurements obtained from sending Mtraining frames and using different RF precoder and combinerfor each frame, we have

y = � x+ e, (16)

where y =

2

6664

y(1)

y(2)

...y(M)

3

77752 CNM⇥1 (17)

is the measured signal,

� =

2

6664

S(1)⌦fT1

⌦w⇤1

S(2)⌦fT2

⌦w⇤2

...S(M)⌦fT

M

⌦w⇤M

3

77752 CNM⇥NcNrNt (18)

is the measurement matrix, and

=�INc ⌦ A

tx

⌦Arx

�� (19)

=

2

6664

�A

tx

⌦Arx

�⌦ pT

1�A

tx

⌦Arx

�⌦ pT

2

...�A

tx

⌦Arx

�⌦ pT

Nc

3

77752 CNcNrNt⇥GcGrGt (20)

is the dictionary. The beamforming and combining vectorsfm

, wm

, m = 1, 2, · · · , M used for training have thephase angles chosen uniformly at random from the set A in(3).

AoA/AoD estimation With the sparse formulation of themmWave channel estimation problem in (16), compressedsensing tools can be first used to estimate the AoA and AoD.Note that we can increase or decrease G

r

, Gt

and Gc

to meetthe required level of sparsity. As the sensing matrix is knownat the receiver, sparse recovery algorithms can be used toestimate the AoA and AoD. Following this, the channel gainscan be estimated to minimize the minimum mean squarederror or via least squares by plugging in the columns of thedictionary matrices corresponding to the estimated AoA andAoD.

has entries

training phase, so that the post combining signal is2

66664

y(m)

1

y(m)

2

...y(m)

N

3

77775

T

=p⇢w⇤

m

⇥H

0

· · · HNc�1

⇤S(m)T⌦ f

m

+ e(m)T , (9)

where S(m) =

2

66664

s(m)

1

0 · · · 0

s(m)

2

s(m)

1

· · · ....

.... . .

...s(m)

N

· · · · · · s(m)

N�Nc+1

3

77775. (10)

The use of block transmission with Nc

� 1 zero padding isimportant here, since it would allow for reconfiguring the RFcircuits from one frame to the other and avoids loss of trainingdata during this reconfiguration. This would also avoid interframe interference. Also note that for symboling rate of morethan 1 Gbps (the chip rate used in IEEE 802.11ad preamble,for example, is 1760 MHZ), it is impractical to use differentprecoders and combiners for different symbols. It is morefeasible, however, to change the RF circuitry for differentframes with N ⇠ 64� 512 denoting the frame length in (10).Vectorizing (9) gives

y(m) =p⇢S(m) ⌦ fT

m

⌦w⇤m

2

6664

vec(H0

)vec(H

1

)...

vec(HNc�1

)

3

7775+ e(m). (11)

To formulate the compressive sensing problem we first exploitthe sparse nature of the channel in the angular domain.Accordingly, we assume that the AoAs and AoDs are drawnfrom an angle grid on G

r

and Gt

, respectively. Neglecting thegrid quantization error, we can then express (11) as

y(m)=p⇢⇣S(m)⌦fT

m

⌦w⇤m

⌘�INc⌦A

tx

⌦Arx

�x+ e(m), (12)

where Atx

and Arx

are the dictionary matrices used for sparserecovery. The N

t

⇥Gt

matrix Atx

consists of columns aT

(✓x

),with ✓

x

drawn from a quantized angle grid of size Gt

, andthe N

r

⇥Gr

matrix Arx

consists of columns aR

(�x

), with �x

drawn from a quantized angle grid of size Gr

. The signal xconsists of the channel gains and pulse shaping filter response,and is of size N

c

Gr

Gt

⇥ 1.Next the band-limited nature of the sampled pulse shaping

filter is used to make the measurement vector more sparse.Define

pn

(⌧) = prc

(n� ⌧) (13)and �

ps

(n) = diag ([pn

(⌧1

) pn

(⌧2

) · · · pn

(⌧L

)]) . (14)

Using (13) and (14), (6) can be written as

vec(Hd

) =

rN

t

Nt

L

�A

T

�AR

��

ps

(dTs

)

2

6664

↵1

↵2

...↵L

3

7775. (15)

Next, we look at the sampled version of the pulse-shapingfilter p

i

having entries pi

(n), for i = 1, 2, · · · , Nc

andn = 1, 2, · · · , G

c

. Then, neglecting the quantization errordue to sampling in the delay domain, we can write (12) as

y(m)=p⇢⇣S(m)⌦fT

m

⌦w⇤m

⌘�INc⌦A

tx

⌦Arx

��x+ e(m),

where � =

2

6664

IGrGt ⌦ pT

1

IGrGt ⌦ pT

2

...IGrGt ⌦ pT

Nc

3

7775,

and x is Gc

Gr

Gt

⇥ 1 sparse vector containing the complexchannel gains.

Stacking M such measurements obtained from sending Mtraining frames and using different RF precoder and combinerfor each frame, we have

y = � x+ e, (16)

where y =

2

6664

y(1)

y(2)

...y(M)

3

77752 CNM⇥1 (17)

is the measured signal,

� =

2

6664

S(1)⌦fT1

⌦w⇤1

S(2)⌦fT2

⌦w⇤2

...S(M)⌦fT

M

⌦w⇤M

3

77752 CNM⇥NcNrNt (18)

is the measurement matrix, and

=�INc ⌦ A

tx

⌦Arx

�� (19)

=

2

6664

�A

tx

⌦Arx

�⌦ pT

1�A

tx

⌦Arx

�⌦ pT

2

...�A

tx

⌦Arx

�⌦ pT

Nc

3

77752 CNcNrNt⇥GcGrGt (20)

is the dictionary. The beamforming and combining vectorsfm

, wm

, m = 1, 2, · · · , M used for training have thephase angles chosen uniformly at random from the set A in(3).

AoA/AoD estimation With the sparse formulation of themmWave channel estimation problem in (16), compressedsensing tools can be first used to estimate the AoA and AoD.Note that we can increase or decrease G

r

, Gt

and Gc

to meetthe required level of sparsity. As the sensing matrix is knownat the receiver, sparse recovery algorithms can be used toestimate the AoA and AoD. Following this, the channel gainscan be estimated to minimize the minimum mean squarederror or via least squares by plugging in the columns of thedictionary matrices corresponding to the estimated AoA andAoD.

training phase, so that the post combining signal is2

66664

y(m)

1

y(m)

2

...y(m)

N

3

77775

T

=p⇢w⇤

m

⇥H

0

· · · HNc�1

⇤S(m)T⌦ f

m

+ e(m)T , (9)

where S(m) =

2

66664

s(m)

1

0 · · · 0

s(m)

2

s(m)

1

· · · ....

.... . .

...s(m)

N

· · · · · · s(m)

N�Nc+1

3

77775. (10)

The use of block transmission with Nc

� 1 zero padding isimportant here, since it would allow for reconfiguring the RFcircuits from one frame to the other and avoids loss of trainingdata during this reconfiguration. This would also avoid interframe interference. Also note that for symboling rate of morethan 1 Gbps (the chip rate used in IEEE 802.11ad preamble,for example, is 1760 MHZ), it is impractical to use differentprecoders and combiners for different symbols. It is morefeasible, however, to change the RF circuitry for differentframes with N ⇠ 64� 512 denoting the frame length in (10).Vectorizing (9) gives

y(m) =p⇢S(m) ⌦ fT

m

⌦w⇤m

2

6664

vec(H0

)vec(H

1

)...

vec(HNc�1

)

3

7775+ e(m). (11)

To formulate the compressive sensing problem we first exploitthe sparse nature of the channel in the angular domain.Accordingly, we assume that the AoAs and AoDs are drawnfrom an angle grid on G

r

and Gt

, respectively. Neglecting thegrid quantization error, we can then express (11) as

y(m)=p⇢⇣S(m)⌦fT

m

⌦w⇤m

⌘�INc⌦A

tx

⌦Arx

�x+ e(m), (12)

where Atx

and Arx

are the dictionary matrices used for sparserecovery. The N

t

⇥Gt

matrix Atx

consists of columns aT

(✓x

),with ✓

x

drawn from a quantized angle grid of size Gt

, andthe N

r

⇥Gr

matrix Arx

consists of columns aR

(�x

), with �x

drawn from a quantized angle grid of size Gr

. The signal xconsists of the channel gains and pulse shaping filter response,and is of size N

c

Gr

Gt

⇥ 1.Next the band-limited nature of the sampled pulse shaping

filter is used to make the measurement vector more sparse.Define

pn

(⌧) = prc

(n� ⌧) (13)and �

ps

(n) = diag ([pn

(⌧1

) pn

(⌧2

) · · · pn

(⌧L

)]) . (14)

Using (13) and (14), (6) can be written as

vec(Hd

) =

rN

t

Nt

L

�A

T

�AR

��

ps

(dTs

)

2

6664

↵1

↵2

...↵L

3

7775. (15)

Next, we look at the sampled version of the pulse-shapingfilter p

n

having entries pn

(k), for n = 1, 2, · · · , Nc

andk = 1, 2, · · · , G

c

. Then, neglecting the quantization errordue to sampling in the delay domain, we can write (12) as

y(m)=p⇢⇣S(m)⌦fT

m

⌦w⇤m

⌘�INc⌦A

tx

⌦Arx

��x+ e(m),

where � =

2

6664

IGrGt ⌦ pT

1

IGrGt ⌦ pT

2

...IGrGt ⌦ pT

Nc

3

7775,

and x is Gc

Gr

Gt

⇥ 1 sparse vector containing the complexchannel gains.

Stacking M such measurements obtained from sending Mtraining frames and using different RF precoder and combinerfor each frame, we have

y =p⇢� x+ e, (16)

where y =

2

6664

y(1)

y(2)

...y(M)

3

77752 CNM⇥1 (17)

is the measured signal,

� =

2

6664

S(1)⌦fT1

⌦w⇤1

S(2)⌦fT2

⌦w⇤2

...S(M)⌦fT

M

⌦w⇤M

3

77752 CNM⇥NcNrNt (18)

is the measurement matrix, and

=�INc ⌦ A

tx

⌦Arx

�� (19)

=

2

6664

�A

tx

⌦Arx

�⌦ pT

1�A

tx

⌦Arx

�⌦ pT

2

...�A

tx

⌦Arx

�⌦ pT

Nc

3

77752 CNcNrNt⇥GcGrGt (20)

is the dictionary. The beamforming and combining vectorsfm

, wm

, m = 1, 2, · · · , M used for training have thephase angles chosen uniformly at random from the set A in(3).

AoA/AoD estimation With the sparse formulation of themmWave channel estimation problem in (16), compressedsensing tools can be first used to estimate the AoA and AoD.Note that we can increase or decrease G

r

, Gt

and Gc

to meetthe required level of sparsity. As the sensing matrix is knownat the receiver, sparse recovery algorithms can be used toestimate the AoA and AoD. Following this, the channel gainscan be estimated to minimize the minimum mean squarederror or via least squares by plugging in the columns of thedictionary matrices corresponding to the estimated AoA andAoD.

training phase, so that the post combining signal is2

66664

y(m)

1

y(m)

2

...y(m)

N

3

77775

T

=p⇢w⇤

m

⇥H

0

· · · HNc�1

⇤S(m)T⌦ f

m

+ e(m)T , (9)

where S(m) =

2

66664

s(m)

1

0 · · · 0

s(m)

2

s(m)

1

· · · ....

.... . .

...s(m)

N

· · · · · · s(m)

N�Nc+1

3

77775. (10)

The use of block transmission with Nc

� 1 zero padding isimportant here, since it would allow for reconfiguring the RFcircuits from one frame to the other and avoids loss of trainingdata during this reconfiguration. This would also avoid interframe interference. Also note that for symboling rate of morethan 1 Gbps (the chip rate used in IEEE 802.11ad preamble,for example, is 1760 MHZ), it is impractical to use differentprecoders and combiners for different symbols. It is morefeasible, however, to change the RF circuitry for differentframes with N ⇠ 64� 512 denoting the frame length in (10).Vectorizing (9) gives

y(m) =p⇢S(m) ⌦ fT

m

⌦w⇤m

2

6664

vec(H0

)vec(H

1

)...

vec(HNc�1

)

3

7775+ e(m). (11)

To formulate the compressive sensing problem we first exploitthe sparse nature of the channel in the angular domain.Accordingly, we assume that the AoAs and AoDs are drawnfrom an angle grid on G

r

and Gt

, respectively. Neglecting thegrid quantization error, we can then express (11) as

y(m)=p⇢⇣S(m)⌦fT

m

⌦w⇤m

⌘�INc⌦A

tx

⌦Arx

�x+ e(m), (12)

where Atx

and Arx

are the dictionary matrices used for sparserecovery. The N

t

⇥Gt

matrix Atx

consists of columns aT

(✓x

),with ✓

x

drawn from a quantized angle grid of size Gt

, andthe N

r

⇥Gr

matrix Arx

consists of columns aR

(�x

), with �x

drawn from a quantized angle grid of size Gr

. The signal xconsists of the channel gains and pulse shaping filter response,and is of size N

c

Gr

Gt

⇥ 1.Next the band-limited nature of the sampled pulse shaping

filter is used to make the measurement vector more sparse.Define

pn

(⌧) = prc

(n� ⌧) (13)and �

ps

(n) = diag ([pn

(⌧1

) pn

(⌧2

) · · · pn

(⌧L

)]) . (14)

Using (13) and (14), (6) can be written as

vec(Hd

) =

rN

t

Nt

L

�A

T

�AR

��

ps

(dTs

)

2

6664

↵1

↵2

...↵L

3

7775. (15)

Next, we look at the sampled version of the pulse-shapingfilter p

n

having entries pn

(k), for n = 1, 2, · · · , Nc

andk = 1, 2, · · · , G

c

. Then, neglecting the quantization errordue to sampling in the delay domain, we can write (12) as

y(m)=p⇢⇣S(m)⌦fT

m

⌦w⇤m

⌘�INc⌦A

tx

⌦Arx

��x+ e(m),

where � =

2

6664

IGrGt ⌦ pT

1

IGrGt ⌦ pT

2

...IGrGt ⌦ pT

Nc

3

7775,

and x is Gc

Gr

Gt

⇥ 1 sparse vector containing the complexchannel gains.

Stacking M such measurements obtained from sending Mtraining frames and using different RF precoder and combinerfor each frame, we have

y =p⇢� x+ e, (16)

where y =

2

6664

y(1)

y(2)

...y(M)

3

77752 CNM⇥1 (17)

is the measured signal,

� =

2

6664

S(1)⌦fT1

⌦w⇤1

S(2)⌦fT2

⌦w⇤2

...S(M)⌦fT

M

⌦w⇤M

3

77752 CNM⇥NcNrNt (18)

is the measurement matrix, and

=�INc ⌦ A

tx

⌦Arx

�� (19)

=

2

6664

�A

tx

⌦Arx

�⌦ pT

1�A

tx

⌦Arx

�⌦ pT

2

...�A

tx

⌦Arx

�⌦ pT

Nc

3

77752 CNcNrNt⇥GcGrGt (20)

is the dictionary. The beamforming and combining vectorsfm

, wm

, m = 1, 2, · · · , M used for training have thephase angles chosen uniformly at random from the set A in(3).

AoA/AoD estimation With the sparse formulation of themmWave channel estimation problem in (16), compressedsensing tools can be first used to estimate the AoA and AoD.Note that we can increase or decrease G

r

, Gt

and Gc

to meetthe required level of sparsity. As the sensing matrix is knownat the receiver, sparse recovery algorithms can be used toestimate the AoA and AoD. Following this, the channel gainscan be estimated to minimize the minimum mean squarederror or via least squares by plugging in the columns of thedictionary matrices corresponding to the estimated AoA andAoD.

training phase, so that the post combining signal is2

66664

y(m)

1

y(m)

2

...y(m)

N

3

77775

T

=p⇢w⇤

m

⇥H

0

· · · HNc�1

⇤S(m)T⌦ f

m

+ e(m)T , (9)

where S(m) =

2

66664

s(m)

1

0 · · · 0

s(m)

2

s(m)

1

· · · ....

.... . .

...s(m)

N

· · · · · · s(m)

N�Nc+1

3

77775. (10)

The use of block transmission with Nc

� 1 zero padding isimportant here, since it would allow for reconfiguring the RFcircuits from one frame to the other and avoids loss of trainingdata during this reconfiguration. This would also avoid interframe interference. Also note that for symboling rate of morethan 1 Gbps (the chip rate used in IEEE 802.11ad preamble,for example, is 1760 MHZ), it is impractical to use differentprecoders and combiners for different symbols. It is morefeasible, however, to change the RF circuitry for differentframes with N ⇠ 64� 512 denoting the frame length in (10).Vectorizing (9) gives

y(m) =p⇢S(m) ⌦ fT

m

⌦w⇤m

2

6664

vec(H0

)vec(H

1

)...

vec(HNc�1

)

3

7775+ e(m). (11)

To formulate the compressive sensing problem we first exploitthe sparse nature of the channel in the angular domain.Accordingly, we assume that the AoAs and AoDs are drawnfrom an angle grid on G

r

and Gt

, respectively. Neglecting thegrid quantization error, we can then express (11) as

y(m)=p⇢⇣S(m)⌦fT

m

⌦w⇤m

⌘�INc⌦A

tx

⌦Arx

�x+ e(m), (12)

where Atx

and Arx

are the dictionary matrices used for sparserecovery. The N

t

⇥Gt

matrix Atx

consists of columns aT

(✓x

),with ✓

x

drawn from a quantized angle grid of size Gt

, andthe N

r

⇥Gr

matrix Arx

consists of columns aR

(�x

), with �x

drawn from a quantized angle grid of size Gr

. The signal xconsists of the channel gains and pulse shaping filter response,and is of size N

c

Gr

Gt

⇥ 1.Next the band-limited nature of the sampled pulse shaping

filter is used to make the measurement vector more sparse.Define

pn

(⌧) = prc

(n� ⌧) (13)and �

ps

(n) = diag ([pn

(⌧1

) pn

(⌧2

) · · · pn

(⌧L

)]) . (14)

Using (13) and (14), (6) can be written as

vec(Hd

) =

rN

t

Nt

L

�A

T

�AR

��

ps

(dTs

)

2

6664

↵1

↵2

...↵L

3

7775. (15)

Next, we look at the sampled version of the pulse-shapingfilter p

n

having entries pn

(k), for n = 1, 2, · · · , Nc

andk = 1, 2, · · · , G

c

. Then, neglecting the quantization errordue to sampling in the delay domain, we can write (12) as

y(m)=p⇢⇣S(m)⌦fT

m

⌦w⇤m

⌘�INc⌦A

tx

⌦Arx

��x+ e(m),

where � =

2

6664

IGrGt ⌦ pT

1

IGrGt ⌦ pT

2

...IGrGt ⌦ pT

Nc

3

7775,

and x is Gc

Gr

Gt

⇥ 1 sparse vector containing the complexchannel gains.

Stacking M such measurements obtained from sending Mtraining frames and using different RF precoder and combinerfor each frame, we have

y =p⇢� x+ e, (16)

where y =

2

6664

y(1)

y(2)

...y(M)

3

77752 CNM⇥1 (17)

is the measured signal,

� =

2

6664

S(1)⌦fT1

⌦w⇤1

S(2)⌦fT2

⌦w⇤2

...S(M)⌦fT

M

⌦w⇤M

3

77752 CNM⇥NcNrNt (18)

is the measurement matrix, and

=�INc ⌦ A

tx

⌦Arx

�� (19)

=

2

6664

�A

tx

⌦Arx

�⌦ pT

1�A

tx

⌦Arx

�⌦ pT

2

...�A

tx

⌦Arx

�⌦ pT

Nc

3

77752 CNcNrNt⇥GcGrGt (20)

is the dictionary. The beamforming and combining vectorsfm

, wm

, m = 1, 2, · · · , M used for training have thephase angles chosen uniformly at random from the set A in(3).

AoA/AoD estimation With the sparse formulation of themmWave channel estimation problem in (16), compressedsensing tools can be first used to estimate the AoA and AoD.Note that we can increase or decrease G

r

, Gt

and Gc

to meetthe required level of sparsity. As the sensing matrix is knownat the receiver, sparse recovery algorithms can be used toestimate the AoA and AoD. Following this, the channel gainscan be estimated to minimize the minimum mean squarederror or via least squares by plugging in the columns of thedictionary matrices corresponding to the estimated AoA andAoD.

training phase, so that the post combining signal is2

66664

y(m)

1

y(m)

2

...y(m)

N

3

77775

T

=p⇢w⇤

m

⇥H

0

· · · HNc�1

⇤S(m)T⌦ f

m

+ e(m)T , (9)

where S(m) =

2

66664

s(m)

1

0 · · · 0

s(m)

2

s(m)

1

· · · ....

.... . .

...s(m)

N

· · · · · · s(m)

N�Nc+1

3

77775. (10)

The use of block transmission with Nc

� 1 zero padding isimportant here, since it would allow for reconfiguring the RFcircuits from one frame to the other and avoids loss of trainingdata during this reconfiguration. This would also avoid interframe interference. Also note that for symboling rate of morethan 1 Gbps (the chip rate used in IEEE 802.11ad preamble,for example, is 1760 MHZ), it is impractical to use differentprecoders and combiners for different symbols. It is morefeasible, however, to change the RF circuitry for differentframes with N ⇠ 64� 512 denoting the frame length in (10).Vectorizing (9) gives

y(m) =p⇢S(m) ⌦ fT

m

⌦w⇤m

2

6664

vec(H0

)vec(H

1

)...

vec(HNc�1

)

3

7775+ e(m). (11)

To formulate the compressive sensing problem we first exploitthe sparse nature of the channel in the angular domain.Accordingly, we assume that the AoAs and AoDs are drawnfrom an angle grid on G

r

and Gt

, respectively. Neglecting thegrid quantization error, we can then express (11) as

y(m)=p⇢⇣S(m)⌦fT

m

⌦w⇤m

⌘�INc⌦A

tx

⌦Arx

�x+ e(m), (12)

where Atx

and Arx

are the dictionary matrices used for sparserecovery. The N

t

⇥Gt

matrix Atx

consists of columns aT

(✓x

),with ✓

x

drawn from a quantized angle grid of size Gt

, andthe N

r

⇥Gr

matrix Arx

consists of columns aR

(�x

), with �x

drawn from a quantized angle grid of size Gr

. The signal xconsists of the channel gains and pulse shaping filter response,and is of size N

c

Gr

Gt

⇥ 1.Next the band-limited nature of the sampled pulse shaping

filter is used to make the measurement vector more sparse.Define

pn

(⌧) = prc

(n� ⌧) (13)and �

ps

(n) = diag ([pn

(⌧1

) pn

(⌧2

) · · · pn

(⌧L

)]) . (14)

Using (13) and (14), (6) can be written as

vec(Hd

) =

rN

t

Nt

L

�A

T

�AR

��

ps

(dTs

)

2

6664

↵1

↵2

...↵L

3

7775. (15)

Next, we look at the sampled version of the pulse-shapingfilter p

n

having entries pn

(k), for n = 1, 2, · · · , Nc

andk = 1, 2, · · · , G

c

. Then, neglecting the quantization errordue to sampling in the delay domain, we can write (12) as

y(m)=p⇢⇣S(m)⌦fT

m

⌦w⇤m

⌘�INc⌦A

tx

⌦Arx

��x+ e(m),

where � =

2

6664

IGrGt ⌦ pT

1

IGrGt ⌦ pT

2

...IGrGt ⌦ pT

Nc

3

7775,

and x is Gc

Gr

Gt

⇥ 1 sparse vector containing the complexchannel gains.

Stacking M such measurements obtained from sending Mtraining frames and using different RF precoder and combinerfor each frame, we have

y =p⇢� x+ e, (16)

where y =

2

6664

y(1)

y(2)

...y(M)

3

77752 CNM⇥1 (17)

is the measured signal,

� =

2

6664

S(1)⌦fT1

⌦w⇤1

S(2)⌦fT2

⌦w⇤2

...S(M)⌦fT

M

⌦w⇤M

3

77752 CNM⇥NcNrNt (18)

is the measurement matrix, and

=�INc ⌦ A

tx

⌦Arx

�� (19)

=

2

6664

�A

tx

⌦Arx

�⌦ pT

1�A

tx

⌦Arx

�⌦ pT

2

...�A

tx

⌦Arx

�⌦ pT

Nc

3

77752 CNcNrNt⇥GcGrGt (20)

is the dictionary. The beamforming and combining vectorsfm

, wm

, m = 1, 2, · · · , M used for training have thephase angles chosen uniformly at random from the set A in(3).

AoA/AoD estimation With the sparse formulation of themmWave channel estimation problem in (16), compressedsensing tools can be first used to estimate the AoA and AoD.Note that we can increase or decrease G

r

, Gt

and Gc

to meetthe required level of sparsity. As the sensing matrix is knownat the receiver, sparse recovery algorithms can be used toestimate the AoA and AoD. Following this, the channel gainscan be estimated to minimize the minimum mean squarederror or via least squares by plugging in the columns of thedictionary matrices corresponding to the estimated AoA andAoD.

Sampled version

Compressive time domain solution [3]

Frequency domain solution [4]

Solve a sparse recovery problem

Sparse vector containing the channel coefficients

3

matrices �vd and the dictionaries

H[k] ⇡ AR(Nc�1X

d=0

�vde

�j 2⇡kN

d)AT⇤= AR�[k]AT

⇤. (6)

We also assume that due to the sparsity of�vd ,�[k] are

sparse matrices which share the same support for all thesubcarriers (the AoA/AoD are assumed invariant alongthe different subcarriers).

Assuming that the receiver applies a hybrid combinerW = WRFWBB 2 CNr⇥NRF , the received signal atsubcarrier k can be written asr(m)[k] = W⇤

BB[k]W⇤RFH[k]FRFFBB[k]s

(m)[k]

+W⇤BB[k]W

⇤RFn[k],

(7)

where n[k] ⇠ N�0,�2I

�is the circularly symmetric

complex Gaussian distributed additive noise vector.

III. COMPRESSIVE CHANNEL ESTIMATION IN THEFREQUENCY DOMAIN

In this section, we formulate a compressed sensingproblem to estimate the vectorized sparse channel vector.We assume a single RF chain is used at the transmitterfor the ease of exposition. During the training phase, forthe mth frame we use a training precoder f

(m)tr and a

training combining matrix W(m)tr . Assuming the trans-

mitted symbols are s(m)[k] = 1, the received samples inthe frequency domain for the mth frame can be writtenas

r(m)[k] =p⇢W(m)

tr⇤H[k]f (m)

tr + n(m)[k], (8)

where H[k] 2 CNr⇥Nt is the frequency-domain MIMOchannel response at the k-th subcarrier and n(m)[k] 2CLr⇥1 is the frequency-domain combined noise vectorreceived at the k-th subcarrier. We assume that thechannel coherence time is larger than the frame durationand that the same channel can be considered for severalconsecutive frames.

Using the result vec{AXC} = (CT ⌦ A) vec{X},the vectorized received signal can be written as

vec{r(m)[k]} =p⇢(f (m)

trT⌦W

(m)tr

⇤) vec{H[k]}+n(m)[k].

(9)Taking into account the expression in (6), the vectorizedchannel matrix can be written as vec{H[k]} = ( ¯AT ⌦AR) vec{�[k]}. Therefore, if we define

�(m) = (f (m)tr

T⌦W

(m)tr

⇤) 2 CLr⇥NtNr (10)

and = ( ¯AT ⌦ AR) 2 CNtNr⇥GrGt (11)

(9) can be rewritten as

vec{r(m)[k]} =p⇢�(m) hv[k] + n(m)[k], (12)

where hv[k] = vec{�[k]} 2 CGrGt⇥1 is the sparsevector containing the complex channel gains. To haveenough measurements and accurately reconstruct thesparse vector hv[k] it is necessary to use several train-ing frames, specially in very-low SNR regime. If thetransmitter and receiver communicate during M trainingsteps using different pseudorandomly built precoders andcombiners, (12) can be extended to

2

6664

r(1)[k]r(2)[k]

...r(M)[k]

3

7775=

p⇢

2

6664

�(1)

�(2)

...�(M)

3

7775 hv[k] + n[k], (13)

where

n[k] =⇥n(1)[k]T n(2)[k]T . . . n(M)[k]T

⇤(14)

is the vectorized noise vector after combining during Mtraining steps. Finally, the vector hv[k] can be found bysolving the sparse reconstruction problem

min ||hv[k]||1 subject to ||r[k]�� hv[k]||22 < ✏,(15)

where ✏ is a tunable parameter defining the maximumerror between the measurement and the received signalassuming the reconstructed channel between the trans-mitter and the receiver. The sensing matrix is definedas

� =

2

6664

�(1)

�(2)

...�(M)

3

77752 CMLr⇥NrNt , (16)

and the measurement is

r[k] =

2

6664

r(1)[k]r(2)[k]

...r(M)[k]

3

77752 CMLr⇥1. (17)

There is a great variety of algorithms to solve thisproblem. We could use for example Orthogonal Match-ing Pursuit (OMP) to find the sparsest approximationof the vector containing the channel gains. Since thevector hv[k] depends on the frequency bin, it wouldbe necessary to run the algorithm as many times asthe number of subcarriers (at most) at which we seekthe MIMO channel response. In the next subsections weconsider however an additional assumption to solve thisproblem without running N OMP algorithms.

Find effective compressed estimation strategies with very

low

Clear overhead comparison with beam training strategies

Challenges

measurements

3

matrices �vd and the dictionaries

H[k] ⇡ AR(Nc�1X

d=0

�vde

�j 2⇡kN

d)AT⇤= AR�[k]AT

⇤. (6)

We also assume that due to the sparsity of�vd ,�[k] are

sparse matrices which share the same support for all thesubcarriers (the AoA/AoD are assumed invariant alongthe different subcarriers).

Assuming that the receiver applies a hybrid combinerW = WRFWBB 2 CNr⇥NRF , the received signal atsubcarrier k can be written asr(m)[k] = W⇤

BB[k]W⇤RFH[k]FRFFBB[k]s

(m)[k]

+W⇤BB[k]W

⇤RFn[k],

(7)

where n[k] ⇠ N�0,�2I

�is the circularly symmetric

complex Gaussian distributed additive noise vector.

III. COMPRESSIVE CHANNEL ESTIMATION IN THEFREQUENCY DOMAIN

In this section, we formulate a compressed sensingproblem to estimate the vectorized sparse channel vector.We assume a single RF chain is used at the transmitterfor the ease of exposition. During the training phase, forthe mth frame we use a training precoder f

(m)tr and a

training combining matrix W(m)tr . Assuming the trans-

mitted symbols are s(m)[k] = 1, the received samples inthe frequency domain for the mth frame can be writtenas

r(m)[k] =p⇢W(m)

tr⇤H[k]f (m)

tr + n(m)[k], (8)

where H[k] 2 CNr⇥Nt is the frequency-domain MIMOchannel response at the k-th subcarrier and n(m)[k] 2CLr⇥1 is the frequency-domain combined noise vectorreceived at the k-th subcarrier. We assume that thechannel coherence time is larger than the frame durationand that the same channel can be considered for severalconsecutive frames.

Using the result vec{AXC} = (CT ⌦ A) vec{X},the vectorized received signal can be written as

vec{r(m)[k]} =p⇢(f (m)

trT⌦W

(m)tr

⇤) vec{H[k]}+n(m)[k].

(9)Taking into account the expression in (6), the vectorizedchannel matrix can be written as vec{H[k]} = ( ¯AT ⌦AR) vec{�[k]}. Therefore, if we define

�(m) = (f (m)tr

T⌦W

(m)tr

⇤) 2 CLr⇥NtNr (10)

and = ( ¯AT ⌦ AR) 2 CNtNr⇥GrGt (11)

(9) can be rewritten as

vec{r(m)[k]} =p⇢�(m) hv[k] + n(m)[k], (12)

where hv[k] = vec{�[k]} 2 CGrGt⇥1 is the sparsevector containing the complex channel gains. To haveenough measurements and accurately reconstruct thesparse vector hv[k] it is necessary to use several train-ing frames, specially in very-low SNR regime. If thetransmitter and receiver communicate during M trainingsteps using different pseudorandomly built precoders andcombiners, (12) can be extended to

2

6664

r(1)[k]r(2)[k]

...r(M)[k]

3

7775=

p⇢

2

6664

�(1)

�(2)

...�(M)

3

7775 hv[k] + n[k], (13)

where

n[k] =⇥n(1)[k]T n(2)[k]T . . . n(M)[k]T

⇤(14)

is the vectorized noise vector after combining during Mtraining steps. Finally, the vector hv[k] can be found bysolving the sparse reconstruction problem

min ||hv[k]||1 subject to ||r[k]�� hv[k]||22 < ✏,(15)

where ✏ is a tunable parameter defining the maximumerror between the measurement and the received signalassuming the reconstructed channel between the trans-mitter and the receiver. The sensing matrix is definedas

� =

2

6664

�(1)

�(2)

...�(M)

3

77752 CMLr⇥NrNt , (16)

and the measurement is

r[k] =

2

6664

r(1)[k]r(2)[k]

...r(M)[k]

3

77752 CMLr⇥1. (17)

There is a great variety of algorithms to solve thisproblem. We could use for example Orthogonal Match-ing Pursuit (OMP) to find the sparsest approximationof the vector containing the channel gains. Since thevector hv[k] depends on the frequency bin, it wouldbe necessary to run the algorithm as many times asthe number of subcarriers (at most) at which we seekthe MIMO channel response. In the next subsections weconsider however an additional assumption to solve thisproblem without running N OMP algorithms.

3

matrices �vd and the dictionaries

H[k] ⇡ AR(Nc�1X

d=0

�vde

�j 2⇡kN

d)AT⇤= AR�[k]AT

⇤. (6)

We also assume that due to the sparsity of�vd ,�[k] are

sparse matrices which share the same support for all thesubcarriers (the AoA/AoD are assumed invariant alongthe different subcarriers).

Assuming that the receiver applies a hybrid combinerW = WRFWBB 2 CNr⇥NRF , the received signal atsubcarrier k can be written asr(m)[k] = W⇤

BB[k]W⇤RFH[k]FRFFBB[k]s

(m)[k]

+W⇤BB[k]W

⇤RFn[k],

(7)

where n[k] ⇠ N�0,�2I

�is the circularly symmetric

complex Gaussian distributed additive noise vector.

III. COMPRESSIVE CHANNEL ESTIMATION IN THEFREQUENCY DOMAIN

In this section, we formulate a compressed sensingproblem to estimate the vectorized sparse channel vector.We assume a single RF chain is used at the transmitterfor the ease of exposition. During the training phase, forthe mth frame we use a training precoder f

(m)tr and a

training combining matrix W(m)tr . Assuming the trans-

mitted symbols are s(m)[k] = 1, the received samples inthe frequency domain for the mth frame can be writtenas

r(m)[k] =p⇢W(m)

tr⇤H[k]f (m)

tr + n(m)[k], (8)

where H[k] 2 CNr⇥Nt is the frequency-domain MIMOchannel response at the k-th subcarrier and n(m)[k] 2CLr⇥1 is the frequency-domain combined noise vectorreceived at the k-th subcarrier. We assume that thechannel coherence time is larger than the frame durationand that the same channel can be considered for severalconsecutive frames.

Using the result vec{AXC} = (CT ⌦ A) vec{X},the vectorized received signal can be written as

vec{r(m)[k]} =p⇢(f (m)

trT⌦W

(m)tr

⇤) vec{H[k]}+n(m)[k].

(9)Taking into account the expression in (6), the vectorizedchannel matrix can be written as vec{H[k]} = ( ¯AT ⌦AR) vec{�[k]}. Therefore, if we define

�(m) = (f (m)tr

T⌦W

(m)tr

⇤) 2 CLr⇥NtNr (10)

and = ( ¯AT ⌦ AR) 2 CNtNr⇥GrGt (11)

(9) can be rewritten as

vec{r(m)[k]} =p⇢�(m) hv[k] + n(m)[k], (12)

where hv[k] = vec{�[k]} 2 CGrGt⇥1 is the sparsevector containing the complex channel gains. To haveenough measurements and accurately reconstruct thesparse vector hv[k] it is necessary to use several train-ing frames, specially in very-low SNR regime. If thetransmitter and receiver communicate during M trainingsteps using different pseudorandomly built precoders andcombiners, (12) can be extended to

2

6664

r(1)[k]r(2)[k]

...r(M)[k]

3

7775=

p⇢

2

6664

�(1)

�(2)

...�(M)

3

7775 hv[k] + n[k], (13)

where

n[k] =⇥n(1)[k]T n(2)[k]T . . . n(M)[k]T

⇤(14)

is the vectorized noise vector after combining during Mtraining steps. Finally, the vector hv[k] can be found bysolving the sparse reconstruction problem

min ||hv[k]||1 subject to ||r[k]�� hv[k]||22 < ✏,(15)

where ✏ is a tunable parameter defining the maximumerror between the measurement and the received signalassuming the reconstructed channel between the trans-mitter and the receiver. The sensing matrix is definedas

� =

2

6664

�(1)

�(2)

...�(M)

3

77752 CMLr⇥NrNt , (16)

and the measurement is

r[k] =

2

6664

r(1)[k]r(2)[k]

...r(M)[k]

3

77752 CMLr⇥1. (17)

There is a great variety of algorithms to solve thisproblem. We could use for example Orthogonal Match-ing Pursuit (OMP) to find the sparsest approximationof the vector containing the channel gains. Since thevector hv[k] depends on the frequency bin, it wouldbe necessary to run the algorithm as many times asthe number of subcarriers (at most) at which we seekthe MIMO channel response. In the next subsections weconsider however an additional assumption to solve thisproblem without running N OMP algorithms.

Sparse vector containing the channel coefficients

Exploit common support between subcarriers