5
Tensor decomposition of polarized seismic waves Francesca Raimondi, Pierre Comon To cite this version: Francesca Raimondi, Pierre Comon. Tensor decomposition of polarized seismic waves. XXV` eme colloque GRETSI (GRETSI 2015), Sep 2015, Lyon, France. Lavoisier, pp.4, 2015, XXVeme Colloque Gretsi. <http://www.gretsi.fr/colloque2015/bienvenue.html>. <hal-01164363> HAL Id: hal-01164363 https://hal.archives-ouvertes.fr/hal-01164363 Submitted on 16 Jun 2015 HAL is a multi-disciplinary open access archive for the deposit and dissemination of sci- entific research documents, whether they are pub- lished or not. The documents may come from teaching and research institutions in France or abroad, or from public or private research centers. L’archive ouverte pluridisciplinaire HAL, est destin´ ee au d´ epˆ ot et ` a la diffusion de documents scientifiques de niveau recherche, publi´ es ou non, ´ emanant des ´ etablissements d’enseignement et de recherche fran¸cais ou ´ etrangers, des laboratoires publics ou priv´ es. Distributed under a Creative Commons Attribution - NonCommercial 4.0 International License

Tensor decomposition of polarized seismic waves · V TGR=[u, v 2]= 2 4 cos cos cos sin sin cos sin sin sin cos 3 5 If the complex envelope of the source signal is denoted by s(t),

  • Upload
    others

  • View
    13

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Tensor decomposition of polarized seismic waves · V TGR=[u, v 2]= 2 4 cos cos cos sin sin cos sin sin sin cos 3 5 If the complex envelope of the source signal is denoted by s(t),

Tensor decomposition of polarized seismic waves

Francesca Raimondi, Pierre Comon

To cite this version:

Francesca Raimondi, Pierre Comon. Tensor decomposition of polarized seismic waves. XXVemecolloque GRETSI (GRETSI 2015), Sep 2015, Lyon, France. Lavoisier, pp.4, 2015, XXVemeColloque Gretsi. <http://www.gretsi.fr/colloque2015/bienvenue.html>. <hal-01164363>

HAL Id: hal-01164363

https://hal.archives-ouvertes.fr/hal-01164363

Submitted on 16 Jun 2015

HAL is a multi-disciplinary open accessarchive for the deposit and dissemination of sci-entific research documents, whether they are pub-lished or not. The documents may come fromteaching and research institutions in France orabroad, or from public or private research centers.

L’archive ouverte pluridisciplinaire HAL, estdestinee au depot et a la diffusion de documentsscientifiques de niveau recherche, publies ou non,emanant des etablissements d’enseignement et derecherche francais ou etrangers, des laboratoirespublics ou prives.

Distributed under a Creative Commons Attribution - NonCommercial 4.0 International License

Page 2: Tensor decomposition of polarized seismic waves · V TGR=[u, v 2]= 2 4 cos cos cos sin sin cos sin sin sin cos 3 5 If the complex envelope of the source signal is denoted by s(t),

Tensor decomposition

of polarized seismic waves

Francesca Raimondi, Pierre Comon

GIPSA-Lab, UMR5216,11 rue des Mathematiques, Grenoble Campus, BP.46, F-38402 St Martin d’Heres cedex, France

[email protected]

Résumé – En traitement d’antenne, les décompositions tensorielles permettent d’estimer conjointement les sources et deles localiser. Pour que ces dernières puissent être utilisées, il faut que les données présentent au moins trois diversités, quisont habituellement le temps, l’espace, et la translation dans l’espace. L’approche présentée ici est basée sur la diversité depolarisation, une alternative très attractive lorsque l’antenne ne jouit pas d’invariance spatiale. Nous dérivons ensuite les bornesde Cramér-Rao dans ce contexte, en nous appuyant sur des conventions de différentiation de variables mixtes réelles et complexes.

Abstract – In antenna array processing, tensor decompositions allow to jointly estimate sources and their location. But thesetechniques can be used only if data are recorded as a function of at least three diversities, which are usually time, space andspace translation. The approach presented therein is based on polarization diversity, a very attractive alternative when theantenna array does not enjoy space invariance. Then we derive Cramér-Rao bounds in this context, by resorting to differentiationconventions for real-complex mixed variables.

1 Introduction

Starting from the ideas on vector sensor array developedfor seismic waves in [1] from the more general model forpolarized waves in [2] and [3], we state the observationmodel in tensor form. Next we compute the Cramér-RaoBounds (CRB) for the joint estimation of the four param-eters of polarized seismic waves. The ultimate estimationperformances are then compared to the CRB as a func-tion of the Signal to Noise Ratio (SNR). A deterministicapproach based on tensor decomposition has been intro-duced in [4]. The advantage of tensor decompositions liesin the need for shorter data records, since the estima-tion of statistical quantities from available samples is nota requirement (as opposed to traditional high resolutionalgorithms such as MUSIC [5] and ESPRIT [6]). CRBfor the low-rank decompositions of multidimensional ar-ray was derived in [7] and extended in [8]. Polarizationof waves has been first introduced in [9] as a multidimen-sional diversity in the tensor approach. The same authorsin [10] explore the concept of polarization separation andits influence on performances.

Notation We shall assume throughout the following no-tations: matrices, column vectors and scalars are denoted re-spectively in bold uppercase, e.g. A, bold lowercase, e.g. v,and plain lowercase; in particular array entries are written e.g.

vj

or Aij

. Transposition, complex conjugation and Hermitian

⇤This work has been supported by the ERC project “DECODA”no. 320594, in the frame of the European program FP7/2007-2013.

transposition are denoted by (

T), (⇤) and (

H), respectively. Ar-

rays with more than two indices are referred to as tensors, withsome abuse of terminology [11], and are denoted in bold calli-graphic, as T . The outer (tensor) product between two vectorsis denoted by u⌦v. Finally, k·k

F

refers to the Frobenius norm;⇥ denote Kronecker product. For the sake of conciseness, a

i

will represent the i-th column of matrix A.

2 Physical model

2.1 A 4-parameter far-field model

The physical quantity measured is the particle displace-

ment vector recorded by a three-component particle dis-placement sensor (or geophone), located at a given pointin space, in the direction of the x-, y- and z-axes of its ref-erence system. The z-axis is required to be perpendicularto the earth’s surface. The following parameterization isbased on the definition of four angular parameters. First,the unit vector pointing to the source is given by

u =

2

4cos ✓ cos sin ✓ cos

sin

3

5

where ✓ 2 (�⇡,⇡] refers to the azimuth and 2[�⇡/2,⇡/2] to the elevation of the source. Second, thepolarization ellipse is described by the orientation angle↵ 2 (�⇡/2,⇡/2] and the ellipticity angle � 2 [�⇡/4,⇡/4].Two models can be drawn respectively for transverse (TR)

Page 3: Tensor decomposition of polarized seismic waves · V TGR=[u, v 2]= 2 4 cos cos cos sin sin cos sin sin sin cos 3 5 If the complex envelope of the source signal is denoted by s(t),

waves and tilted generalized Rayleigh (TGR) waves [1].For a transverse wave, the polarization ellipse lies in thespace orthogonal to the direction of propagation u, and isspanned by the columns {v1 v2} of the matrix:

V

TR

= [v1 , v2] =

2

4� sin ✓ � cos ✓ sin cos ✓ � sin ✓ sin 0 cos

3

5 (1)

In the TGR model, the polarization ellipse is confined inthe plane spanned by the columns of matrix:

V

TGR

= [u , v2] =

2

4cos ✓ cos � cos ✓ sin sin ✓ cos � sin ✓ sin

sin cos

3

5

If the complex envelope of the source signal is denoted bys(t), the general data model can be written as

y(t) = V (✓, )Q(↵)w(�) s(t) 2 C3⇥1 (2)where V (✓, ) is one of the above matrices,

Q =

cos↵ sin↵� sin↵ cos↵

�, w =

cos�ı sin�

in the absence of noise, and ı =p�1.

2.2 Seismic waves and polarization

There exist several types of elastic waves associated withseismic activities [1]. Primary waves (or P-Waves) arecompressional elastic waves whose particle displacementvector is parallel to the direction of propagation. Forthese waves, ↵ = � = 0, which leads to a linearly polar-ized wavefront with particle motion along the direction ofpropagation u. Rayleigh Waves are elliptically polarizedsurface waves. Therefore, = 0, provided that the xy-plane corresponds to the earth surface. For these waves,it is obvious that ↵ = 0 and then Q = I. Secondary orShear waves (or S-waves) are transverse elliptically polar-ized in general. P-Waves and Rayleigh Waves are partic-ular cases of the TGR model described in Section 2.1 forelliptically polarized waveforms: the direction of propaga-tion is located in the plane spanned by the ellipse majorand minor axes. For reasons of space, we shall concentrateon TR waves in the remainder.

2.3 Tensor model

Now suppose that data are recorded on K polarized sen-sors located at points in space defined by vectors g(k) :=[gx

k

; gyk

; gzk

] 2 R3, 1 k K. Also suppose that R far-field narrow-band sources impinge on this vector sensorarray from direction u(r), 1 r R, and denote ! theircommon angular pulsation. We make the assumption thatimpinging waves have elliptical polarization (neither linearnor circular). Then from (2) we can assume the followingobservation model in baseband about pulsation !:

T = Z + E , Z =

RX

r=1

a(r)⌦ b(r)⌦ s(r) (3)

where ak

(r) =

1K

exp

�ı!v

g(k)Tu(r)

is the k-th en-try of the steering vector, v the wave celerity, b(r) =

V (✓r

, r

)Q(↵r

)w(�r

) 2 CL⇥1(L = 3) characterizes the

propagation type, (✓r

, r

) refers to the Direction of Arrival(DoA) of the r-th source and (↵

r

,�r

) its polarization, andsm

(r) is the signal propagating from the r-th source andreceived at time t

m

, 1 m M . The additive noise E isassumed to be i.i.d. circular Gaussian and independent ofthe sources. In terms of arrays of coordinates, model (3)rewrites:

Zk`m

(✓, ,↵,�,S) =

RX

r=1

ak

(r)b`

(r)sm

(r) (4)

or in column vector format:

z := vecZ =

RX

r=1

a(r)⇥ b(r)⇥ s(r) (5)

3 Parameter identification

3.1 Model identification

It is always possible to decompose the data tensor intoa sum of decomposable tensors [11, 4] of the form D(r) =a(r)⌦b(r)⌦s(r), that is, in terms of array of coordinates:

Dklm

(r) = ak

(r)bl

(r)sm

(r)

Hence tensor Z takes the form:

Z =

RX

r=1

&r

D(r) (6)

where coefficients &r

can always be chosen to be real pos-itive, and decomposable tensors D(r) to have unit norm,i.e. for Lp norms, kDk = kak kbk kck = 1. The minimalvalue of R such that this decomposition holds is calledrank of Z. If R is not too large, the corresponding de-composition is unique [4, 12, 11, 13] and deserves to bereferred to as Canonical Polyadic (CP); other terminolo-gies include rank decomposition or Candecomp/Parafac.Note that decomposable tensors have a rank equal to 1.Because of the uniqueness of the CP decomposition, de-composable tensors of (4) and (6) coincide in the absenceof noise. This means that vectors {a(r), b(r), s(r)} coin-cide up to some scaling factors [4, 8, 11].

3.2 Model identifiability

There exist sufficient conditions ensuring uniqueness ofthe exact CP, e.g. the Kruskal condition [4]:

A

+ B

+ C

� 2R+ 2 (7)where the notation

A

refers to the Kruskal -rank⇤ of ma-trix A. However, less stringent conditions guaranteeingalmost surely a unique solution can be found [12, 11, 13]:

R(K + L+M � 2) < KLM

⇤The Kruskal rank of a matrix A is the largest number A

suchthat any subset of

A

columns are lineraly independent.

Page 4: Tensor decomposition of polarized seismic waves · V TGR=[u, v 2]= 2 4 cos cos cos sin sin cos sin sin sin cos 3 5 If the complex envelope of the source signal is denoted by s(t),

This hold true when data are not corrupted by noise. How-ever, if noise is present, we have to solve a best rank-R

approximation problem:

min

ar,br,sr

����

����T �RX

r=1

a

r

⌦ b

r

⌦ s

r

����

����2

F

(8)

For d � 3, the best approximation of a d-partite func-tion of a sum of R product of d separable functions doesnot exist in general [12], as a sequence of rank-r functionscan converge to a limit which is not rank-r. A sufficientcondition ensuring existence of a solution to (8) via thedefinition of a physical constraint, the coherence, is de-rived in [12]. Unlike the Kuskal rank, coherences are easyto compute and present the advantage of having a physicalmeaning, i.e. the best rank-R approximation exists and isunique if either impinging signals are not too correlated,or their directions of arrivals and polarization states arenot too close.

4 Performances

4.1 Mixed real-complex gradients

Since the parameter of the array processing model arecomplex, a definition of the derivative of a real func-tion h(z) 2 Rp with respect to the complex variablez 2 Cn, z = x + ıy, x,y 2 R, needs to be introduced.[7] presents a derivation of Cramér-Rao bounds related tothe CP decomposition of multidimensional arrays, usingthe same definition of complex derivative as in [8, 14]:

@h@z

=

1

2

@h@x

� ı2

@h@y

(9)

For clarity, our notation of the derivative of a scalar func-tion ⌥(z) with respect to a column vector z 2 Cn is:

(@⌥@z is a column vector@⌥@zT is a line vector

Given an holomorphic column vector function f(z) 2 Cm,we define [

@f@zT ]ij =

@fi

@zjso that

(@fT

@z is an n⇥m matrix@f@zT is an m⇥ n matrix

In the sequel, we shall need a complex derivative chainrule. Given a scalar function ⌥(z) 2 R, a complex func-tion z(✓) = x + ıy 2 Cp, and a real variable ✓ 2 Rq, wehave from the real derivative chaine rule:

@⌥

@✓T=

@⌥

@xT

@x

@✓T+

@⌥

@yT

@y

@✓T

which, using (9), yields the chaine rule:

@⌥

@✓T= 2<

⇢@⌥

@zT

@z

@✓T

�(10)

4.2 Gradient calculation

In order to compute Cramér-Rao bounds, we shall needthe gradients of the log-likelihood, which turn out to bethe same as those of the cost function f defined below,deduced from (5), if noise is i.i.d. circular Gaussian:

f(#) =1

�2n

kt� z(#)k22 , # := [✓; ;↵;�; vecS; vecS

⇤]

(11)where �2

n

denotes its variance. The gradient expressionswill also be subsequently useful to implement descent al-gorithms. According to the chain rule (10) and definition(9), the partial derivatives of the cost function with re-spect to DoA parameters are given by

@f@✓

r

= 2<⇢@f@aT

r

@ar

@✓r

+

@f

@bTr

@br

@✓r

�,@f@↵

r

= 2<⇢@f

@bTr

@br

@↵r

@f@

r

= 2<⇢@f@aT

r

@ar

@ r

+

@f

@bTr

@br

@ r

�,@f@�

r

= 2<⇢@f

@bTr

@br

@�r

with

@f@aT

r

=

z �

RX

r=1

a

r

⇥ b

r

⇥ s

r

!H

(�I

K

⇥ b

r

⇥ s

r

)

@f

@bTr

=

z �

RX

r=1

a

r

⇥ b

r

⇥ s

r

!H

(�a

r

⇥ I

L

⇥ s

r

)

@f@sT

r

=

z �

RX

r=1

a

r

⇥ b

r

⇥ s

r

!H

(�a

r

⇥ b

r

⇥ I

M

)

@ar

@✓r

=

hı!v(�gx

k

sin ✓r

cos r

+ gyk

cos ✓r

cos r

)Akr

iK

k=1

Akr

=

1K

exp

�ı!v

[gxk

cos ✓r

cos r

+ gyk

sin ✓r

cos r

+ gzk

sin r

]

@br

@✓r

=

@Vr

@✓r

Q

r

w

r

,@b

r

@ r

=

@Vr

@ r

Q

r

w

r

@V TR

r

@✓r

=

2

4� cos ✓

r

sin ✓r

sin r

� sin ✓r

� cos ✓r

sin r

0 0

3

5

@ar

@ r

=

hı!v(�gx

k

cos ✓r

sin r

� gyk

sin ✓r

sin r

+ gzk

cos r

)Akr

iK

k=1

@V TR

r

@ r

=

2

40 � cos ✓

r

cos r

0 � sin ✓r

cos r

0 � sin r

3

5

@br

@↵r

= V

r

dQr

d↵r

w

r

,@b

r

@�r

= V

r

Q

r

dwr

d�r

dQr

d↵r

=

� sin↵

r

cos↵r

� cos↵r

� sin↵r

�,

dwr

d�r

=

� sin�

r

ı cos�r

4.3 Cramér-Rao Bounds

Cramér-Rao Bounds (CRB) represent the lower bound onthe variance of any unbiased estimator of a deterministicparameter. Define the Signal-to-Noise ratio (SNR) as [7]:

SNR = 10 log10kZk2

F

KLM�2n

Page 5: Tensor decomposition of polarized seismic waves · V TGR=[u, v 2]= 2 4 cos cos cos sin sin cos sin sin sin cos 3 5 If the complex envelope of the source signal is denoted by s(t),

where operator k · k2F

indicates Frobenius norm. For azero-mean, circularly complex Gaussian noise with covari-ance �2

n

I the log-likelihood takes the form (11) up to anadditive constant. Then, the mixed real-complex FisherInformation Matrix (FIM) can be shown to be given by[7, 8]:

�(#) = E(✓

@f(#)

@#

◆H✓@f(#)

@#

◆)

The CRB of any unbiased estimator of a vector param-eter # is is given by the inverse of the FIM. It is usefulto separate parameters to be estimated in three vectors:one real, [✓; ;↵;�], and two complex, vecS and vecS

⇤.With this organization, the FIM has 9 blocks [8]:

� =

1

�2n

0

@2<{G11} G12 G

⇤12

G

H12 G22 0

G

T12 0 G

⇤22

1

A

where G

ij

=

⇣@z@#i

⌘H⇣@z@#j

⌘.

5 Computer experiments

Signals were simulated according to realistic sampling con-ditions (1kHz sampling frequency, M = 42 time samples,K = 9 sensors, L = 3 polarization components). Ultimateperformances have been evaluated by running a gradientdescent initialized with the true values slightly corruptedby noise. A comparison with deterministic CRB is shownin Figure 1. The performance criterion is the total mean

square error (total MSE) of each DoA and polarizationparameter #: 1

N

PN

n=1

PR

r=1(ˆ#rn

�#r

)

2, where ˆ#rn

is theestimated parameter of the r-th source at the n-th Monte-Carlo trial, N = 99 is the number of trials. The numberof simultaneous sources was chosen to be R = 2, with thefollowing parameters:

(✓1 = �⇡/3, ✓2 = ⇡/6, 1 = �⇡/4, 2 = ⇡/7

↵1 = ⇡/5, ↵2 = ⇡/7, �1 = �⇡/7, �2 = ⇡/5

-10 0 10 20 3010�6

10�5

10�4

10�3

10�2

10�1

100

SNR [dB]

MS

E

CRB qCRB yMSE qMSE y

-10 0 10 20 30SNR [dB]

CRB aCRB bMSE aMSE b

Fig. 1: MSE vs SNR

CRBs are obtained by summing the diagonal entries inthe inverse of the first block, 2<{G11}, in the FIM. This

means source signals were considered as known and not asnuisances (which implies the obtention of slightly smallerbounds).

References

[1] S. Anderson and A. Nehorai, “Analysis of a polarizedseismic wave model,” IEEE Trans. Sig. Proc., vol. 44, no.2, pp. 379–386, 1996.

[2] A. Nehorai and E. Paldi, “Vector-sensor array processingfor electromagnetic source localization,” IEEE Trans. Sig.

Proc., vol. 42, no. 2, pp. 376–398, 1994.[3] A. Nehorai and E. Paldi, “Acoustic vector-sensor array

processing,” IEEE Trans. Sig. Proc., vol. 42, no. 9, pp.2481–2491, 1994.

[4] N. D. Sidiropoulos, R. Bro, and G. B. Giannakis, “Parallelfactor analysis in sensor array processing,” IEEE Trans.

Sig. Proc., vol. 48, no. 8, pp. 2377–2388, Aug. 2000.[5] R. O. Schmidt, “Multiple emitter location and signal pa-

rameter estimation,” IEEE Trans. Antenna Propagation,vol. 34, no. 3, pp. 276–280, Mar. 1986.

[6] R. Roy and T. Kailath, “ESPRIT-estimation of signalparameters via rotational invariance techniques,” IEEE

Trans. Acoust. Speech Sig. Proc., vol. 37, no. 7, pp. 984–995, 1989.

[7] X. Liu and N. D. Sidiropoulos, “Cramér-Rao lower boundsfor low-rank decomposition of multidimensional arrays,”IEEE Trans. Sig. Proc,, vol. 49, no. 9, pp. 2074–2086,2001.

[8] S. Sahnoun and P. Comon, “Tensor polyadic decompo-sition for antenna array processing,” in 21st Int. Conf.

Comput. Stat (CompStat), Geneva, Aug. 19-22 2014, pp.233–240, hal-00986973.

[9] X. Guo, S. Miron, D. Brie, S. Zhu, and X. Liao, “A Can-decomp/Parafac perspective on uniqueness of doa estima-tion using a vector sensor array,” IEEE Trans. Sig. Proc.,vol. 59, no. 7, pp. 3475–3481, 2011.

[10] X. Guo, S. Miron, and D. Brie, “The effect of polarizationseparation on the performance of Candecomp/Parafac-based vector sensor array processing,” Physical Commu-

nication, vol. 5, no. 4, pp. 289–295, 2012.[11] P. Comon, “Tensors: a brief introduction,” in IEEE Sig.

Proc. Magazine, 2014, vol. 31, pp. 44–53.[12] L.-H. Lim and P. Comon, “Blind multilinear identifica-

tion,” IEEE Trans. Inf. Theory, vol. 60, no. 2, pp. 1260–1280, Feb. 2014, open access.

[13] L. Chiantini, G. Ottaviani, and N. Vannieuwenhoven, “Analgorithm for generic and low-rank specific identifiabilityof complex tensors,” SIAM J. Matrix Ana. Appl., vol. 35,no. 4, pp. 1265–1287, 2014.

[14] A. Hjorungnes and D. Gesbert, “Complex-valued matrixdifferentiation: Techniques and key results,” IEEE Trans.

Sig. Proc., vol. 55, no. 6, pp. 2740–2746, 2007.