Lecture 14: Covariance estimation and matrix completion
DAMC, May 27-29, 2020
1 Covariance estimation
Suppose we have a sample of data points $X_1, \ldots, X_N$ in $\mathbb{R}^n$. It is often reasonable to assume that these points are independently sampled from the same probability distribution (or "population"), which is unknown. We would like to learn something useful about this distribution.
Denote by $X$ a random vector with this (unknown) distribution. The most basic parameter of the distribution is the mean $\mathbb{E}X$. One can estimate $\mathbb{E}X$ from the sample by computing the sample mean $\frac{1}{N}\sum_{i=1}^{N} X_i$. The law of large numbers guarantees that the estimate becomes tight as the sample size $N$ grows to infinity. In other words,
$$\frac{1}{N}\sum_{i=1}^{N} X_i \to \mathbb{E}X \quad \text{as } N \to \infty.$$
The next most basic parameter of the distribution is the covariance matrix
$$\Sigma = \mathbb{E}(X - \mathbb{E}X)(X - \mathbb{E}X)^T.$$
The eigenvectors of the covariance matrix $\Sigma$ are called the principal components. Principal components that correspond to large eigenvalues of $\Sigma$ are the directions in which the distribution of $X$ is most extended; see the figure. This method is called Principal Component Analysis (PCA).
One can estimate the covariance matrix $\Sigma$ from the sample by computing the sample covariance
$$\Sigma_N = \frac{1}{N}\sum_{i=1}^{N}(X_i - \mathbb{E}X_i)(X_i - \mathbb{E}X_i)^T.$$
Again, the law of large numbers guarantees that the estimate becomes tight as the sample size $N$ grows to infinity, i.e.
$$\Sigma_N \to \Sigma \quad \text{as } N \to \infty.$$
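This convergence is easy to see numerically. The following is a minimal sketch (the dimension, distribution, and sample sizes are illustrative choices, and the mean-zero form $\frac{1}{N}\sum_i X_iX_i^T$ of the sample covariance is used):

```python
import numpy as np

# Sketch: estimate a known covariance matrix from samples (mean-zero case),
# illustrating the law-of-large-numbers convergence Sigma_N -> Sigma.
rng = np.random.default_rng(0)
n = 5
A = rng.standard_normal((n, n))
Sigma = A @ A.T / n                      # a fixed ground-truth covariance

def sample_covariance(N):
    # Draw N mean-zero samples; rows are X_1, ..., X_N
    X = rng.multivariate_normal(np.zeros(n), Sigma, size=N)
    return X.T @ X / N                   # (1/N) * sum of X_i X_i^T

def op_norm_error(N):
    # Operator-norm error of the sample covariance
    return np.linalg.norm(sample_covariance(N) - Sigma, 2)

e1 = op_norm_error(200)
e2 = op_norm_error(20000)
print(e1, e2)                            # error shrinks as N grows
```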
But how large should the sample size $N$ be for covariance estimation? Generally, one cannot have $N < n$, for dimension reasons. (Why?) We are going to show that
$$N \sim n\log n$$
is enough. In other words, covariance estimation is possible with just logarithmic oversampling.
For simplicity, we shall state the covariance estimation bound for mean zero distributions. (If the mean is not zero, we can estimate it from the sample and subtract. The mean can be accurately estimated from a sample of size $N = O(n)$.)
Theorem 1 (Covariance estimation)
Let $X$ be a random vector in $\mathbb{R}^n$ with covariance matrix $\Sigma$. Suppose that
$$\|X\|_2^2 \lesssim \mathbb{E}\|X\|_2^2 = \operatorname{tr}\Sigma \quad \text{almost surely.}$$
Then, for every $N \ge 1$, we have
$$\mathbb{E}\|\Sigma_N - \Sigma\| \lesssim \|\Sigma\|\left(\sqrt{\frac{n\log n}{N}} + \frac{n\log n}{N}\right).$$
Proof. Apply matrix Bernstein's inequality (Corollary 3) for the sum of independent random matrices $X_iX_i^T - \Sigma$ and get
$$\mathbb{E}\|\Sigma_N - \Sigma\| = \frac{1}{N}\,\mathbb{E}\left\|\sum_{i=1}^{N}\left(X_iX_i^T - \Sigma\right)\right\| \lesssim \frac{1}{N}\left(\sigma\sqrt{\log n} + K\log n\right),$$
where
$$\sigma^2 = \left\|\sum_{i=1}^{N}\mathbb{E}\left(X_iX_i^T - \Sigma\right)^2\right\| = N\left\|\mathbb{E}\left(XX^T - \Sigma\right)^2\right\|$$
and $K$ is chosen so that
$$\|XX^T - \Sigma\| \le K \quad \text{almost surely.}$$
It remains to bound $\sigma$ and $K$. Let us start with $\sigma$. We have
$$\mathbb{E}(XX^T - \Sigma)^2 = \mathbb{E}\|X\|_2^2\,XX^T - \Sigma^2 \preceq \operatorname{tr}(\Sigma)\cdot\mathbb{E}XX^T = \operatorname{tr}(\Sigma)\cdot\Sigma.$$
Thus $\sigma^2 \lesssim N\operatorname{tr}(\Sigma)\|\Sigma\|$. Next, to bound $K$, we have
$$\|XX^T - \Sigma\| \le \|X\|_2^2 + \|\Sigma\| \lesssim \operatorname{tr}(\Sigma) + \|\Sigma\| \le 2\operatorname{tr}(\Sigma) = K.$$
Therefore
$$\mathbb{E}\|\Sigma_N - \Sigma\| \lesssim \frac{1}{N}\left(\sqrt{N\operatorname{tr}(\Sigma)\|\Sigma\|\log n} + \operatorname{tr}(\Sigma)\log n\right).$$
The proof is completed by using $\operatorname{tr}(\Sigma) \le n\|\Sigma\|$.
1.1 Low-dimensional distributions
Far fewer samples are needed for covariance estimation for low-dimensional, or approximately low-dimensional, distributions. To measure approximate low-dimensionality, we can use the notion of the stable rank of $\Sigma^{1/2}$. The stable rank of a matrix $A$ is defined as the square of the ratio of the Frobenius to operator norms:
$$r(A) = \frac{\|A\|_F^2}{\|A\|_2^2} \le \operatorname{rank}(A).$$
The proof of Theorem 1 yields
$$\mathbb{E}\|\Sigma_N - \Sigma\| \lesssim \|\Sigma\|\left(\sqrt{\frac{r\log n}{N}} + \frac{r\log n}{N}\right),$$
where $r = r(\Sigma^{1/2}) = \operatorname{tr}(\Sigma)/\|\Sigma\|$. Therefore, covariance estimation is possible with $N \sim r\log n$ samples.
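The identity $r(\Sigma^{1/2}) = \operatorname{tr}(\Sigma)/\|\Sigma\|$ can be checked directly: $\|\Sigma^{1/2}\|_F^2 = \operatorname{tr}(\Sigma)$ and $\|\Sigma^{1/2}\|_2^2 = \|\Sigma\|$. A minimal numerical sketch (the test matrix is an illustrative choice):

```python
import numpy as np

# Sketch: stable rank r(A) = ||A||_F^2 / ||A||_2^2, and the identity
# r(Sigma^{1/2}) = tr(Sigma) / ||Sigma|| used in the text.
def stable_rank(A):
    """Square of the Frobenius-to-operator norm ratio; always <= rank(A)."""
    return np.linalg.norm(A, "fro") ** 2 / np.linalg.norm(A, 2) ** 2

rng = np.random.default_rng(1)
G = rng.standard_normal((6, 6))
Sigma = G @ G.T                          # a positive semidefinite matrix

# Sigma^{1/2} via eigendecomposition of the symmetric PSD matrix
w, V = np.linalg.eigh(Sigma)
Sigma_half = V @ np.diag(np.sqrt(np.clip(w, 0, None))) @ V.T

r1 = stable_rank(Sigma_half)
r2 = np.trace(Sigma) / np.linalg.norm(Sigma, 2)
print(r1, r2)                            # these two quantities coincide
```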
2 Norms of random matrices
Let $A_i$ denote the $i$-th row of $A$; we have (exercise)
$$\max_i \|A_i\|_2 \le \|A\| \le \sqrt{n}\,\max_i \|A_i\|_2.$$
For random matrices with independent entries, the bound can be improved to the point where the upper and lower bounds almost match.
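Both sides of this deterministic two-sided bound are easy to verify numerically; a minimal sketch (the matrix size is an illustrative choice):

```python
import numpy as np

# Sketch: check max_i ||A_i||_2 <= ||A|| <= sqrt(n) * max_i ||A_i||_2
# (operator norm vs. row norms) on a random matrix.
rng = np.random.default_rng(2)
n = 8
A = rng.standard_normal((n, n))

row_norms = np.linalg.norm(A, axis=1)    # ||A_i||_2 for each row i
op_norm = np.linalg.norm(A, 2)           # operator (spectral) norm

print(row_norms.max(), op_norm, np.sqrt(n) * row_norms.max())
```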
Theorem 2 (Norms of random matrices without boundedness assumptions)
Let $A$ be an $n\times n$ symmetric random matrix whose entries on and above the diagonal are independent, mean zero random variables. Then
$$\mathbb{E}\max_i\|A_i\|_2 \le \mathbb{E}\|A\| \le C\log n\cdot\mathbb{E}\max_i\|A_i\|_2,$$
where $A_i$ denote the rows of $A$.
Lemma 3 (Symmetrization)
Let $X_1, \ldots, X_N$ be independent, mean zero random vectors in a normed space and $\varepsilon_1, \ldots, \varepsilon_N$ be independent Rademacher random variables. Then
$$\frac{1}{2}\,\mathbb{E}\left\|\sum_{i=1}^{N}\varepsilon_iX_i\right\| \le \mathbb{E}\left\|\sum_{i=1}^{N}X_i\right\| \le 2\,\mathbb{E}\left\|\sum_{i=1}^{N}\varepsilon_iX_i\right\|.$$
Proof. To prove the upper bound, let $(X_i')$ be an independent copy of the random vectors $(X_i)$, i.e. just different random vectors with the same joint distribution as $(X_i)$ and independent from $(X_i)$. Then
$$\mathbb{E}\left\|\sum_i X_i\right\| = \mathbb{E}\left\|\sum_i X_i - \mathbb{E}\left(\sum_i X_i'\right)\right\| \le \mathbb{E}\left\|\sum_i X_i - \sum_i X_i'\right\| = \mathbb{E}\left\|\sum_i\left(X_i - X_i'\right)\right\|.$$
The distribution of the random vectors $Y_i = X_i - X_i'$ is symmetric, which means that the distributions of $Y_i$ and $-Y_i$ are the same. (Why?) Thus the distribution of the random vectors $Y_i$ and $\varepsilon_iY_i$ is also the same, for all we do is change the signs of these vectors at random and independently of the values of the vectors. Summarizing, we can replace $X_i - X_i'$ in the sum above with $\varepsilon_i(X_i - X_i')$. Thus
$$\mathbb{E}\left\|\sum_i X_i\right\| \le \mathbb{E}\left\|\sum_i \varepsilon_i(X_i - X_i')\right\| \le \mathbb{E}\left\|\sum_i \varepsilon_iX_i\right\| + \mathbb{E}\left\|\sum_i \varepsilon_iX_i'\right\| = 2\,\mathbb{E}\left\|\sum_i \varepsilon_iX_i\right\|.$$
This proves the upper bound in the symmetrization inequality. The lower bound can be proved by a similar argument. (Do this!)
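A quick Monte Carlo sanity check of the symmetrization inequality for the Euclidean norm (the distribution of the $X_i$, the dimensions, and the number of trials are illustrative choices):

```python
import numpy as np

# Sketch: estimate E||sum X_i|| and E||sum eps_i X_i|| by Monte Carlo and
# verify (1/2) E||sum eps_i X_i|| <= E||sum X_i|| <= 2 E||sum eps_i X_i||.
rng = np.random.default_rng(3)
N, n, trials = 10, 4, 2000

sym_norms, sum_norms = [], []
for _ in range(trials):
    X = rng.standard_normal((N, n))          # mean-zero vectors X_1..X_N
    eps = rng.choice([-1.0, 1.0], size=N)    # Rademacher signs
    sum_norms.append(np.linalg.norm(X.sum(axis=0)))
    sym_norms.append(np.linalg.norm((eps[:, None] * X).sum(axis=0)))

E_sym = float(np.mean(sym_norms))
E_sum = float(np.mean(sum_norms))
print(0.5 * E_sym, E_sum, 2 * E_sym)     # middle value sits between the ends
```

For symmetric distributions such as the Gaussian here, $\sum_i X_i$ and $\sum_i \varepsilon_iX_i$ are actually equal in distribution, so the two estimates nearly coincide and the inequality holds with room to spare.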
Proof of Theorem 2. The lower bound is trivial. The proof of the upper bound will be based on matrix Bernstein's inequality.
We represent $A$ as a sum of independent, mean zero, symmetric random matrices $Z_{ij}$, each of which contains a pair of symmetric entries of $A$ (or one diagonal entry):
$$A = \sum_{i\le j} Z_{ij}.$$
By the symmetrization inequality (Lemma 3) for the random matrices $Z_{ij}$, we get
$$\mathbb{E}\|A\| = \mathbb{E}\left\|\sum_{i\le j}Z_{ij}\right\| \le 2\,\mathbb{E}\left\|\sum_{i\le j}X_{ij}\right\|,$$
where we set $X_{ij} = \varepsilon_{ij}Z_{ij}$ and $\varepsilon_{ij}$ are independent Rademacher random variables. Now we condition on $A$. The random variables $Z_{ij}$ become fixed values and all randomness remains in the Rademacher random variables $\varepsilon_{ij}$.
Note that $X_{ij}$ are (conditionally) bounded almost surely, and this is exactly what we have lacked to apply matrix Bernstein's inequality. Now we can do it. The corollary of matrix Bernstein's inequality gives
$$\mathbb{E}_\varepsilon\left\|\sum_{i\le j}X_{ij}\right\| \lesssim \sigma\sqrt{\log n} + K\log n,$$
where $\sigma^2 = \left\|\sum_{i\le j}\mathbb{E}_\varepsilon X_{ij}^2\right\|$ and $K = \max_{i\le j}\|X_{ij}\|$. A good exercise is to check that
$$\sigma \lesssim \max_i\|A_i\|_2 \quad \text{and} \quad K \lesssim \max_i\|A_i\|_2.$$
Then we have
$$\mathbb{E}_\varepsilon\left\|\sum_{i\le j}X_{ij}\right\| \lesssim \log n\cdot\max_i\|A_i\|_2.$$
Finally, we unfix $A$ by taking expectation of both sides of this inequality with respect to $A$ and using the law of total expectation.
We state Theorem 2 for symmetric matrices, but it is simple to extend it to general $m\times n$ random matrices $A$. The bound in this case becomes
$$\mathbb{E}\|A\| \le C\log(m+n)\cdot\left(\mathbb{E}\max_i\|A_i\|_2 + \mathbb{E}\max_j\|A^j\|_2\right),$$
where $A_i$ and $A^j$ denote the rows and columns of $A$. To see this, apply Theorem 2 to the $(m+n)\times(m+n)$ symmetric random matrix
$$\begin{bmatrix} 0 & A \\ A^T & 0 \end{bmatrix}.$$
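The dilation trick works because the symmetric matrix $B = \begin{bmatrix} 0 & A \\ A^T & 0 \end{bmatrix}$ has the same operator norm as $A$ (its eigenvalues are $\pm$ the singular values of $A$). A minimal numerical check (the matrix shape is an illustrative choice):

```python
import numpy as np

# Sketch: the symmetric dilation B = [[0, A], [A^T, 0]] of an m x n matrix A
# satisfies ||B|| = ||A|| in operator norm, so results for symmetric
# matrices transfer to rectangular ones.
rng = np.random.default_rng(4)
m, n = 3, 5
A = rng.standard_normal((m, n))

B = np.block([[np.zeros((m, m)), A],
              [A.T, np.zeros((n, n))]])

na = np.linalg.norm(A, 2)
nb = np.linalg.norm(B, 2)
print(na, nb)                            # the two norms agree
```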
3 Matrix completion
Consider a fixed, unknown $n\times n$ matrix $X$. Suppose we are shown $m$ randomly chosen entries of $X$. Can we guess all the missing entries? This important problem is called matrix completion. We will analyze it using the bounds on the norms of random matrices we just obtained.
Obviously, there is no way to guess the missing entries unless we know something extra about the matrix $X$. So let us assume that $X$ has low rank:
$$\operatorname{rank}(X) = r \ll n.$$
The number of degrees of freedom of an $n\times n$ matrix with rank $r$ is $O(rn)$. (Why?) So we may hope that $m \sim rn$ observed entries of $X$ will be enough to determine $X$ completely. But how?

Here we will analyze what is probably the simplest method for matrix completion. Take the matrix $Y$ that consists of the observed entries of $X$, while all unobserved entries are set to zero. Unlike $X$, the matrix $Y$ may not have small rank. Compute the best rank $r$ approximation of $Y$. The result, as we will show, will be a good approximation to $X$.
But before we show this, let us define sampling of entries more rigorously. Assume each entry of $X$ is shown or hidden independently of the others with fixed probability $p$. Which entries are shown is decided by independent Bernoulli random variables
$$\delta_{ij} \sim \mathrm{Ber}(p) \quad \text{with} \quad p = \frac{m}{n^2},$$
which are often called selectors in this context. The value of $p$ is chosen so that, among the $n^2$ entries of $X$, the expected number of selected (known) entries is $m$.

Define the $n\times n$ matrix $Y$ with entries $Y_{ij} = \delta_{ij}X_{ij}$. We can assume that we are shown $Y$, for it is a matrix that contains the observed entries of $X$ while all unobserved entries are replaced with zeros. The following result shows how to estimate $X$ based on $Y$.
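The method described above (zero-fill, rescale by $p^{-1}$, truncate to rank $r$) can be sketched as follows. The dimension, rank, and sample size are illustrative choices, and a truncated SVD plays the role of the best rank-$r$ approximation:

```python
import numpy as np

# Sketch: observe entries of a low-rank X through Bernoulli selectors,
# rescale by 1/p, and take the best rank-r approximation via truncated SVD.
rng = np.random.default_rng(5)
n, r = 60, 2
# A rank-r ground-truth matrix X
X = rng.standard_normal((n, r)) @ rng.standard_normal((r, n))

m = 10 * r * n                     # sample size, somewhat above the rn ideal
p = m / n**2                       # selector probability p = m / n^2
delta = rng.random((n, n)) < p     # delta_ij ~ Ber(p)
Y = delta * X                      # observed entries; zeros elsewhere

# Best rank-r approximation of p^{-1} Y via truncated SVD
U, s, Vt = np.linalg.svd(Y / p)
X_hat = (U[:, :r] * s[:r]) @ Vt[:r]

err_per_entry = np.linalg.norm(X_hat - X, "fro") / n
print(err_per_entry, np.abs(X).max())  # average error vs. max entry magnitude
```

Even at this moderate sample size, the average error per entry comes out below the maximum entry magnitude, in the spirit of Theorem 4 below.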
Theorem 4 (Matrix completion)
Let $\hat{X}$ be a best rank $r$ approximation to $p^{-1}Y$. Then
$$\mathbb{E}\,\frac{1}{n}\|\hat{X} - X\|_F \le C\log n\,\sqrt{\frac{rn}{m}}\,\|X\|_{\max}.$$
Here $\|X\|_{\max} = \max_{ij}|X_{ij}|$ denotes the maximum magnitude of the entries of $X$.
Remark. This theorem controls the average error per entry in the mean-squared sense. To make the error small, let us assume that we have a sample of size $m \gg rn\log^2 n$, which is slightly larger than the ideal size $m \sim rn$. This makes $C\log n\sqrt{rn/m} = o(1)$ and forces the recovery error to be bounded by $o(1)\|X\|_{\max}$. Summarizing, Theorem 4 says that the expected average error per entry is much smaller than the maximal magnitude of the entries of $X$. This is true for a sample of almost optimal size $m$. The smaller the rank $r$ of the matrix $X$, the fewer entries of $X$ we need to see in order to do matrix completion.
Proof of Theorem 4.
Step 1: The error in the operator norm. Let us first bound the recovery error in the operator norm. Decompose the error into two parts using the triangle inequality:
$$\|\hat{X} - X\| \le \|\hat{X} - p^{-1}Y\| + \|p^{-1}Y - X\|.$$
Recall that $\hat{X}$ is a best approximation to $p^{-1}Y$. Then the first part of the error is smaller than the second part, and we have
$$\|\hat{X} - X\| \le 2\|p^{-1}Y - X\| = \frac{2}{p}\|Y - pX\|.$$
The entries of the matrix $Y - pX$,
$$(Y - pX)_{ij} = (\delta_{ij} - p)X_{ij},$$
are independent, mean zero random variables.
We have
$$\mathbb{E}\|Y - pX\| \le C\log n\cdot\left(\mathbb{E}\max_i\|(Y - pX)_i\|_2 + \mathbb{E}\max_j\|(Y - pX)^j\|_2\right).$$
All that remains is to bound the norms of the rows and columns of $Y - pX$. This is not difficult if we note that they can be expressed as sums of independent random variables:
$$\|(Y - pX)_i\|_2^2 = \sum_{j=1}^{n}(\delta_{ij} - p)^2X_{ij}^2 \le \sum_{j=1}^{n}(\delta_{ij} - p)^2\cdot\|X\|_{\max}^2,$$
and similarly for columns. Taking expectation and noting that
$$\mathbb{E}(\delta_{ij} - p)^2 = \operatorname{Var}(\delta_{ij}) = p(1 - p),$$
we get
$$\mathbb{E}\|(Y - pX)_i\|_2 \le \left[\mathbb{E}\|(Y - pX)_i\|_2^2\right]^{1/2} \le \sqrt{pn}\,\|X\|_{\max}.$$
This is a good bound, but we need something stronger. Since the maximum appears inside the expectation, we need a uniform bound, which will say that all rows are bounded simultaneously with high probability. Such uniform bounds are usually proved by applying concentration inequalities followed by a union bound. Bernstein's inequality (145) yields (check!)
$$\mathbb{P}\left\{\sum_{j=1}^{n}(\delta_{ij} - p)^2 > tpn\right\} \le \exp(-ctpn) \quad \text{for } t \ge 3.$$
This probability can be further bounded by $n^{-ct}$ using the assumption that $m = pn^2 \ge n\log n$. A union bound over $n$ rows leads to
$$\mathbb{P}\left\{\max_{i\in[n]}\sum_{j=1}^{n}(\delta_{ij} - p)^2 > tpn\right\} \le n\cdot n^{-ct} \quad \text{for } t \ge 3.$$
Integrating this tail, we have
$$\mathbb{E}\max_{i\in[n]}\sum_{j=1}^{n}(\delta_{ij} - p)^2 \lesssim pn.$$
And this yields the desired bound on the rows:
$$\mathbb{E}\max_{i\in[n]}\|(Y - pX)_i\|_2 \lesssim \sqrt{pn}\,\|X\|_{\max}.$$
We can do similarly for the columns. Then
$$\mathbb{E}\|Y - pX\| \lesssim \log n\,\sqrt{pn}\,\|X\|_{\max}.$$
Therefore, we get
$$\mathbb{E}\|\hat{X} - X\| \lesssim \log n\,\sqrt{\frac{n}{p}}\,\|X\|_{\max}.$$
Step 2: Passing to the Frobenius norm. We know that $\operatorname{rank}(X) \le r$ by assumption and $\operatorname{rank}(\hat{X}) \le r$ by construction, so $\operatorname{rank}(\hat{X} - X) \le 2r$. There is a simple relationship between the operator and Frobenius norms:
$$\|\hat{X} - X\|_F \le \sqrt{2r}\,\|\hat{X} - X\|.$$
Taking expectation of both sides, we get
$$\mathbb{E}\|\hat{X} - X\|_F \le \sqrt{2r}\,\mathbb{E}\|\hat{X} - X\| \lesssim \log n\,\sqrt{\frac{rn}{p}}\,\|X\|_{\max}.$$
Dividing both sides by $n$, we can rewrite this bound as
$$\mathbb{E}\,\frac{1}{n}\|\hat{X} - X\|_F \lesssim \log n\,\sqrt{\frac{rn}{pn^2}}\,\|X\|_{\max}.$$
The proof is completed by noting the definition of the sampling probability, $p = m/n^2$.
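The norm comparison used in Step 2 is the general fact $\|M\|_F \le \sqrt{\operatorname{rank}(M)}\,\|M\|$, since $\|M\|_F^2$ is a sum of at most $\operatorname{rank}(M)$ squared singular values, each at most $\|M\|^2$. A minimal numerical check (the matrix size and rank are illustrative choices):

```python
import numpy as np

# Sketch: check ||M||_F <= sqrt(rank(M)) * ||M|| on a random low-rank matrix.
rng = np.random.default_rng(6)
n, k = 30, 4
M = rng.standard_normal((n, k)) @ rng.standard_normal((k, n))  # rank <= k

fro = np.linalg.norm(M, "fro")
op = np.linalg.norm(M, 2)
rank = np.linalg.matrix_rank(M)

print(fro, np.sqrt(rank) * op)   # Frobenius norm dominated by sqrt(rank)*op
```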
1 Covariance estimation
Suppose we have a sample of data points X1 XN in Rn It isoften reasonable to assume that these points are independentlysampled from the same probability distribution (or ldquopopulationrdquo)which is unknown We would like to learn something useful aboutthis distribution
Denote by X a random vector with this (unknown) distributionThe most basic parameter of the distribution is the mean EXOne can estimate EX from the sample by computing the samplemean
983123Ni=1XiN The law of large numbers guarantees that the
estimate becomes tight as the sample size N grows to infinity Inother words
1
N
N983131
i=1
Xi rarr EX as N rarr infin
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 2 22
The next most basic parameter of the distribution is thecovariance matrix
Σ = E(X minus EX)(X minus EX)T
The eigenvectors of the covariance matrix Σ are called theprincipal components Principal components that correspond tolarge eigenvalues of Σ are the directions in which the distributionof X is most extended see the figure
This method is called Principal Component Analysis (PCA)
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 3 22
One can estimate the covariance matrix Σ from the sample bycomputing the sample covariance
ΣN =1
N
N983131
i=1
(Xi minus EXi)(Xi minus EXi)T
Again the law of large numbers guarantees that the estimatebecomes tight as the sample size N grows to infinity ie
ΣN rarr Σ as N rarr infin
But how large should the sample size N be for covarianceestimation Generally one can not have N lt n for dimensionreasons (Why) We are going to show that
N sim n log n
is enough In other words covariance estimation is possible withjust logarithmic oversampling
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 4 22
For simplicity we shall state the covariance estimation bound formean zero distributions (If the mean is not zero we can estimateit from the sample and subtract The mean can be accuratelyestimated from a sample of size N = O(n))
Theorem 1 (Covariance estimation)
Let X be a random vector in Rn with covariance matrix Σ Supposethat
983042X98304222 ≲ E983042X98304222 = trΣ almost surely
Then for every N ge 1 we have
E 983042ΣN minus Σ983042 ≲ 983042Σ983042983075983157
n log n
N+
n log n
N
983076
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 5 22
Proof Apply matrix Bernsteinrsquos inequality corollary 3 for the sum ofindependent random matrices XiX
Ti minus Σ and get
E 983042ΣN minus Σ983042 =1
NE
983056983056983056983056983056
N983131
i=1
983043XiX
Ti minus Σ
983044983056983056983056983056983056
≲ 1
N(σ983155
log n+K log n)
where
σ2 =
983056983056983056983056983056
N983131
i=1
E983043XiX
Ti minus Σ
9830442983056983056983056983056983056 = N
983056983056983056E983043XXT minus Σ
9830442983056983056983056
and K is chosen so that
983042XXT minus Σ983042 le K almost surely
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 6 22
It remains to bound σ and K Let us start with σ We have
E(XXT minus Σ)2 = E983042X98304222XXT minus Σ2
≾ tr(Σ) middot EXXT
= tr(Σ) middot Σ
Thus σ2 ≲ Ntr(Σ)983042Σ983042 Next to bound K we have
983042XXT minus Σ983042 le 983042X98304222 + 983042Σ983042≲ tr(Σ) + 983042Σ983042le 2tr(Σ) = K
Therefore
E 983042ΣN minus Σ983042 ≲ 1
N(983155
Ntr(Σ)983042Σ983042 log n+ tr(Σ) log n)
The proof is completed by using tr(Σ) le n983042Σ983042
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 7 22
11 Low-dimensional distributions
Far fewer samples are needed for covariance estimation forlow-dimensional or approximately low-dimensional distributionsTo measure approximate low-dimensionality we can use the notionof the stable rank of Σ2 The stable rank of a matrix A is definedas the square of the ratio of the Frobenius to operator norms
r(A) =983042A9830422F983042A98304222
le rank(A)
The proof of Theorem 1 yields
E 983042ΣN minus Σ983042 983249 983042Σ983042983075983157
r log n
N+
r log n
N
983076
where r = r(Σ12) = tr(Σ)983042Σ983042 Therefore covariance estimationis possible with N sim r log n samples
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 8 22
2 Norms of random matrices
Let Ai denote the ith row of A we have (exercise)
maxi
983042Ai9830422 le 983042A9830422 leradicnmax
i983042Ai9830422
For random matrices with independent entries the bound can beimproved to the point where the upper and lower bounds almostmatch
Theorem 2 (Norms of random matrices without boundednessassumptions)
Let A be an ntimes n symmetric random matrix whose entries on andabove the diagonal are independent mean zero random variables Then
Emaxi
983042Ai9830422 le E983042A9830422 le C log n middot Emaxi
983042Ai9830422
where Ai denote the rows of A
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 9 22
Lemma 3 (Symmetrization)
Let X1 XN be independent mean zero random vectors in a normedspace and ε1 εN be independent Rademacher random variablesThen
1
2E
983056983056983056983056983056
N983131
i=1
εiXi
983056983056983056983056983056 983249 E
983056983056983056983056983056
N983131
i=1
Xi
983056983056983056983056983056 983249 2E
983056983056983056983056983056
N983131
i=1
εiXi
983056983056983056983056983056
Proof To prove the upper bound let (X primei) be an independent copy of
the random vectors (Xi) ie just different random vectors with thesame joint distribution as (Xi) and independent from (Xi) Then
E
983056983056983056983056983056983131
i
Xi
983056983056983056983056983056 = E
983056983056983056983056983056983131
i
Xi minus E
983075983131
i
X primei
983076983056983056983056983056983056
983249 E
983056983056983056983056983056983131
i
Xi minus983131
i
X primei
983056983056983056983056983056 = E
983056983056983056983056983056983131
i
983043Xi minusX prime
i
983044983056983056983056983056983056
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 10 22
The distribution of the random vectors Yi = Xi minusX primei is symmetric
which means that the distributions of Yi and minusYi are the same(Why) Thus the distribution of the random vectors Yi and εiYi is alsothe same for all we do is change the signs of these vectors at randomand independently of the values of the vectors Summarizing we canreplace Xi minusX prime
i in the sum above with εi(Xi minusX primei) Thus
E
983056983056983056983056983056983131
i
Xi
983056983056983056983056983056 le E
983056983056983056983056983056983131
i
εi(Xi minusX primei)
983056983056983056983056983056
le E
983056983056983056983056983056983131
i
εiXi
983056983056983056983056983056+ E
983056983056983056983056983056983131
i
εiXprimei
983056983056983056983056983056
= 2E
983056983056983056983056983056983131
i
εiXi
983056983056983056983056983056
This proves the upper bound in the symmetrization inequality Thelower bound can be proved by a similar argument (Do this)
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 11 22
Proof of Theorem 2 The lower bound is trivial The proof of the upperbound will be based on matrix Bernsteinrsquos inequalityWe represent A as a sum of independent mean zero symmetricrandom matrices Zij each of which contains a pair of symmetric entriesof A (or one diagonal entry)
A =983131
ilejZij
By the symmetrization inequality (Lemma 3) for the random matricesZij we get
E983042A983042 = E983056983056983056983131
i983249jZij
983056983056983056 983249 2E983056983056983056983131
i983249jXij
983056983056983056
where we set Xij = εijZij and εij are independent Rademacherrandom variables Now we condition on A The random variables Zij
become fixed values and all randomness remains in the Rademacherrandom variables εij
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 12 22
Note that Xij are (conditionally) bounded almost surely and this isexactly what we have lacked to apply matrix Bernsteinrsquos inequalityNow we can do it The corollary of matrix Bernsteinrsquos inequality gives
Eε
983056983056983056983131
ilejXij
983056983056983056 ≲ σ983155
log n+K log n
where σ2 = 983042983123
ilej EεX2ij983042 and K = maxilej 983042Xij983042 A good exercise is
to check that
σ ≲ maxi
983042Ai9830422 and K ≲ maxi
983042Ai9830422
Then we have
Eε
983056983056983056983131
i983249jXij
983056983056983056 ≲ log n middotmaxi
983042Ai9830422
Finally we unfix A by taking expectation of both sides of thisinequality with respect to A and using the law of total expectation
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 13 22
We state Theorem 2 for symmetric matrices but it is simple toextend it to general mtimes n random matrices A The bound in thiscase becomes
E983042A9830422 le C log(m+ n) middot (Emaxi
983042Ai9830422 + Emaxj
983042Aj9830422)
To see this apply Theorem 2 to the (m+ n)times (m+ n) symmetricrandom matrix 983063
0 AAT 0
983064
3 Matrix completion
Consider a fixed unknown ntimes n matrix X Suppose we are shownm randomly chosen entries of X Can we guess all the missingentries This important problem is called matrix completion Wewill analyze it using the bounds on the norms on random matriceswe just obtained
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 14 22
Obviously there is no way to guess the missing entries unless weknow something extra about the matrix X So let us assume thatX has low rank
rank(X) = r ≪ n
The number of degrees of freedom of an ntimes n matrix with rank ris O(rn) (Why) So we may hope that m sim rn observed entriesof X will be enough to determine X completely But how
Here we will analyze what is probably the simplest method formatrix completion Take the matrix Y that consists of theobserved entries of X while all unobserved entries are set to zeroUnlike X the matrix Y may not have small rank Compute thebest rank r approximation of Y The result as we will show willbe a good approximation to X
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 15 22
But before we show this let us define sampling of entries morerigorously Assume each entry of X is shown or hiddenindependently of others with fixed probability p Which entries areshown is decided by independent Bernoulli random variables
δij sim Ber(p) with p =m
n2
which are often called selectors in this context The value of p ischosen so that among n2 entries of X the expected number ofselected (known) entries is m
Define the ntimes n matrix Y with entries Yij = δijXij We canassume that we are shown Y for it is a matrix that contains theobserved entries of X while all unobserved entries are replacedwith zeros The following result shows how to estimate X basedon Y
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 16 22
Theorem 4 (Matrix completion)
Let 983142X be a best rank r approximation to pminus1Y Then
E1
n983042983142X minusX983042F 983249 C logn
983157rn
m983042X983042max
Here 983042X983042max = maxij |Xij | denotes the maximum magnitude of theentries of X
Remark This theorem controls the average error per entry in themean-squared sense To make the error small let us assume that wehave a sample of size m ≫ rn log2 n which is slightly larger than theideal size m sim rn This makes C log n
983155rnm = o(1) and forces the
recovery error to be bounded by o(1)983042X983042max Summarizing Theorem4 says that the expected average error per entry is much smaller thanthe maximal magnitude of the entries of X This is true for a sampleof almost optimal size m The smaller the rank r of the matrix X thefewer entries of X we need to see in order to do matrix completion
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 17 22
Proof of Theorem 4Step 1 The error in the operator norm Let us first bound therecovery error in the operator norm Decompose the error into twoparts using triangle inequality
983042983142X minusX983042 le 983042983142X minus pminus1Y 983042+ 983042pminus1Y minusX983042
Recall that 983142X is a best approximation to pminus1Y Then the first part ofthe error is smaller than the second part and we have
983042983142X minusX983042 le 2983042pminus1Y minusX983042 =2
p983042Y minus pX983042
The entries of the matrix Y minus pX
(Y minus pX)ij = (δij minus p)Xij
are independent and mean zero random variables
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 18 22
We have
E983042Y minus pX983042
983249 C log n middot983061Emax
i983042(Y minus pX)i9830422 + Emax
j983042(Y minus pX)j9830422
983062
All that remains is to bound the norms of the rows and columns ofY minus pX This is not difficult if we note that they can be expressed assums of independent random variables
983042(Y minus pX)i98304222 =n983131
j=1
(δij minus p)2X2ij 983249
n983131
j=1
(δij minus p)2 middot 983042X9830422max
and similarly for columns Taking expectation and noting that
E (δij minus p)2 = Var (δij) = p(1minus p)
we get
E 983042(Y minus pX)i9830422 983249983059E 983042(Y minus pX)i98304222
98306012983249 radic
pn983042X983042max
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 19 22
This is a good bound but we need something stronger Since themaximum appears inside the expectation we need a uniform boundwhich will say that all rows are bounded simultaneously with highprobability Such uniform bounds are usually proved by applyingconcentration inequalities followed by a union bound Bernsteinsinequality (145) yields (check)
P
983099983103
983101
n983131
j=1
(δij minus p)2 gt tpn
983100983104
983102 983249 exp(minusctpn) for t 983245 3
This probability can be further bounded by nminusct using the assumptionthat m = pn2 ge n log n A union bound over n rows leads to
P
983099983103
983101maxiisin[n]
n983131
j=1
(δij minus p)2 gt tpn
983100983104
983102 983249 n middot nminusct for t 983245 3
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 20 22
Integrating this tail we have
Emaxiisin[n]
n983131
j=1
(δij minus p)2 ≲ pn
And this yields the desired bound on the rows
Emaxiisin[n]
983042(Y minus pX)i9830422 ≲radicpn983042X983042max
We can do similarly for the columns Then
E983042Y minus pX983042 ≲ log nradicpn983042X983042max
Therefore we get
E983042983142X minusX983042 ≲ log n
983157n
p983042X983042max
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 21 22
Step 2 Passing to Frobenius normWe know that rank(X) le r by assumption and rank(983141X) le r by
construction so rank(983142X minusX) le 2r There is a simple relationshipbetween the operator and Frobenius norms
983042983142X minusX983042F leradic2r983042983142X minusX983042
Taking expectation of both sides we get
E983042983142X minusX983042F 983249radic2rE983042983142X minusX983042 ≲ log n
983157rn
p983042X983042max
Dividing both sides by n we can rewrite this bound as
E1
n983042983142X minusX983042F ≲ log n
983157rn
pn2983042X983042max
The proof is completed by noting the definition of the samplingprobability p = mn2
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 22 22
The next most basic parameter of the distribution is thecovariance matrix
Σ = E(X minus EX)(X minus EX)T
The eigenvectors of the covariance matrix Σ are called theprincipal components Principal components that correspond tolarge eigenvalues of Σ are the directions in which the distributionof X is most extended see the figure
This method is called Principal Component Analysis (PCA)
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 3 22
One can estimate the covariance matrix Σ from the sample bycomputing the sample covariance
ΣN =1
N
N983131
i=1
(Xi minus EXi)(Xi minus EXi)T
Again the law of large numbers guarantees that the estimatebecomes tight as the sample size N grows to infinity ie
ΣN rarr Σ as N rarr infin
But how large should the sample size N be for covarianceestimation Generally one can not have N lt n for dimensionreasons (Why) We are going to show that
N sim n log n
is enough In other words covariance estimation is possible withjust logarithmic oversampling
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 4 22
For simplicity we shall state the covariance estimation bound formean zero distributions (If the mean is not zero we can estimateit from the sample and subtract The mean can be accuratelyestimated from a sample of size N = O(n))
Theorem 1 (Covariance estimation)
Let X be a random vector in Rn with covariance matrix Σ Supposethat
983042X98304222 ≲ E983042X98304222 = trΣ almost surely
Then for every N ge 1 we have
E 983042ΣN minus Σ983042 ≲ 983042Σ983042983075983157
n log n
N+
n log n
N
983076
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 5 22
Proof Apply matrix Bernsteinrsquos inequality corollary 3 for the sum ofindependent random matrices XiX
Ti minus Σ and get
E 983042ΣN minus Σ983042 =1
NE
983056983056983056983056983056
N983131
i=1
983043XiX
Ti minus Σ
983044983056983056983056983056983056
≲ 1
N(σ983155
log n+K log n)
where
σ2 =
983056983056983056983056983056
N983131
i=1
E983043XiX
Ti minus Σ
9830442983056983056983056983056983056 = N
983056983056983056E983043XXT minus Σ
9830442983056983056983056
and K is chosen so that
983042XXT minus Σ983042 le K almost surely
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 6 22
It remains to bound σ and K. Let us start with σ. We have

    E(X X^T − Σ)^2 = E‖X‖_2^2 X X^T − Σ^2 ≼ tr(Σ) · E X X^T = tr(Σ) · Σ.

Thus σ^2 ≲ N tr(Σ) ‖Σ‖. Next, to bound K, we have

    ‖X X^T − Σ‖ ≤ ‖X‖_2^2 + ‖Σ‖ ≲ tr(Σ) + ‖Σ‖ ≤ 2 tr(Σ) =: K.
Therefore

    E‖Σ_N − Σ‖ ≲ (1/N)(√(N tr(Σ) ‖Σ‖ log n) + tr(Σ) log n).

The proof is completed by using tr(Σ) ≤ n ‖Σ‖.
1.1 Low-dimensional distributions
Far fewer samples are needed for covariance estimation of low-dimensional, or approximately low-dimensional, distributions. To measure approximate low-dimensionality we can use the notion of the stable rank of Σ^{1/2}. The stable rank of a matrix A is defined as the square of the ratio of the Frobenius to operator norms:

    r(A) = ‖A‖_F^2 / ‖A‖^2 ≤ rank(A).
The proof of Theorem 1 yields

    E‖Σ_N − Σ‖ ≲ ‖Σ‖ (√(r log n / N) + (r log n) / N),

where r = r(Σ^{1/2}) = tr(Σ)/‖Σ‖. Therefore covariance estimation is possible with N ∼ r log n samples.
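The stable rank takes one line to compute. A small sketch (assumes NumPy; the 2×2 matrix is an illustrative example) showing that r(A) ≤ rank(A), and that a dominant direction makes the stable rank much smaller than the rank:

```python
import numpy as np

def stable_rank(A):
    # r(A) = ||A||_F^2 / ||A||^2 : squared ratio of Frobenius to operator norm
    return np.linalg.norm(A, 'fro') ** 2 / np.linalg.norm(A, 2) ** 2

# A matrix with one dominant direction: rank 2, but stable rank barely above 1.
A = np.diag([10.0, 0.1])
r = stable_rank(A)
# r = (100 + 0.01) / 100 = 1.0001, while rank(A) = 2
assert r <= np.linalg.matrix_rank(A)
```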
2 Norms of random matrices
Let A_i denote the i-th row of A; we have (exercise)

    max_i ‖A_i‖_2 ≤ ‖A‖ ≤ √n · max_i ‖A_i‖_2.

For random matrices with independent entries the bound can be improved to the point where the upper and lower bounds almost match.
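The deterministic sandwich above can be checked directly; a sketch (assumes NumPy, arbitrary fixed seed):

```python
import numpy as np

rng = np.random.default_rng(1)
n = 8
A = rng.standard_normal((n, n))

row_norms = np.linalg.norm(A, axis=1)   # ||A_i||_2 for each row i
op_norm = np.linalg.norm(A, 2)          # operator (spectral) norm ||A||

# max_i ||A_i||_2  <=  ||A||  <=  sqrt(n) * max_i ||A_i||_2
assert row_norms.max() <= op_norm <= np.sqrt(n) * row_norms.max()
```

The lower bound holds because ‖A‖ ≥ ‖e_i^T A‖_2 = ‖A_i‖_2 for every i; the upper bound follows from ‖A‖^2 ≤ ‖A‖_F^2 = ∑_i ‖A_i‖_2^2 ≤ n · max_i ‖A_i‖_2^2.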
Theorem 2 (Norms of random matrices without boundedness assumptions)

Let A be an n × n symmetric random matrix whose entries on and above the diagonal are independent, mean zero random variables. Then

    E max_i ‖A_i‖_2 ≤ E‖A‖ ≤ C log n · E max_i ‖A_i‖_2,

where A_i denote the rows of A.
Lemma 3 (Symmetrization)
Let X_1, ..., X_N be independent, mean zero random vectors in a normed space, and let ε_1, ..., ε_N be independent Rademacher random variables. Then

    (1/2) E‖∑_{i=1}^N ε_i X_i‖ ≤ E‖∑_{i=1}^N X_i‖ ≤ 2 E‖∑_{i=1}^N ε_i X_i‖.
Proof: To prove the upper bound, let (X′_i) be an independent copy of the random vectors (X_i), i.e. just different random vectors with the same joint distribution as (X_i) and independent of (X_i). Then

    E‖∑_i X_i‖ = E‖∑_i X_i − E(∑_i X′_i)‖ ≤ E‖∑_i X_i − ∑_i X′_i‖ = E‖∑_i (X_i − X′_i)‖.
The distribution of the random vectors Y_i = X_i − X′_i is symmetric, which means that the distributions of Y_i and −Y_i are the same. (Why?) Thus the distributions of the random vectors Y_i and ε_i Y_i are also the same, for all we do is change the signs of these vectors at random, independently of the values of the vectors. Summarizing, we can replace X_i − X′_i in the sum above with ε_i(X_i − X′_i). Thus

    E‖∑_i X_i‖ ≤ E‖∑_i ε_i(X_i − X′_i)‖
               ≤ E‖∑_i ε_i X_i‖ + E‖∑_i ε_i X′_i‖
               = 2 E‖∑_i ε_i X_i‖.
This proves the upper bound in the symmetrization inequality. The lower bound can be proved by a similar argument. (Do this!)
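A Monte Carlo illustration of the lemma (a sketch, assumes NumPy; the lemma is about true expectations, but with a fixed seed and many trials the empirical averages reflect it):

```python
import numpy as np

rng = np.random.default_rng(2)
N, n, trials = 10, 5, 2000

norm_plain, norm_sym = 0.0, 0.0
for _ in range(trials):
    X = rng.standard_normal((N, n))          # independent mean-zero vectors X_1, ..., X_N
    eps = rng.choice([-1.0, 1.0], size=N)    # independent Rademacher signs
    norm_plain += np.linalg.norm(X.sum(axis=0))
    norm_sym += np.linalg.norm((eps[:, None] * X).sum(axis=0))
norm_plain /= trials                          # estimates E||sum X_i||
norm_sym /= trials                            # estimates E||sum eps_i X_i||

# Empirically:  (1/2) E||sum eps_i X_i||  <=  E||sum X_i||  <=  2 E||sum eps_i X_i||
assert 0.5 * norm_sym <= norm_plain <= 2.0 * norm_sym
```

For symmetric X_i (as here, Gaussian), ε_i X_i has the same distribution as X_i, so the two averages come out nearly equal, well inside the factor-2 window.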
Proof of Theorem 2: The lower bound is trivial. The proof of the upper bound will be based on matrix Bernstein's inequality.
We represent A as a sum of independent, mean zero, symmetric random matrices Z_ij, each of which contains a pair of symmetric entries of A (or one diagonal entry):

    A = ∑_{i≤j} Z_ij.

By the symmetrization inequality (Lemma 3) for the random matrices Z_ij, we get

    E‖A‖ = E‖∑_{i≤j} Z_ij‖ ≤ 2 E‖∑_{i≤j} X_ij‖,

where we set X_ij = ε_ij Z_ij and the ε_ij are independent Rademacher random variables. Now we condition on A. The random variables Z_ij become fixed values, and all randomness remains in the Rademacher random variables ε_ij.
Note that the X_ij are (conditionally) bounded almost surely, and this is exactly what we lacked in order to apply matrix Bernstein's inequality. Now we can do it. The corollary of matrix Bernstein's inequality gives

    E_ε‖∑_{i≤j} X_ij‖ ≲ σ √(log n) + K log n,

where σ^2 = ‖∑_{i≤j} E_ε X_ij^2‖ and K = max_{i≤j} ‖X_ij‖. A good exercise is to check that

    σ ≲ max_i ‖A_i‖_2 and K ≲ max_i ‖A_i‖_2.

Then we have

    E_ε‖∑_{i≤j} X_ij‖ ≲ log n · max_i ‖A_i‖_2.

Finally, we unfix A by taking the expectation of both sides of this inequality with respect to A and using the law of total expectation.
We stated Theorem 2 for symmetric matrices, but it is simple to extend it to general m × n random matrices A. The bound in this case becomes

    E‖A‖ ≤ C log(m + n) · (E max_i ‖A_i‖_2 + E max_j ‖A^j‖_2),

where A_i and A^j denote the rows and columns of A. To see this, apply Theorem 2 to the (m + n) × (m + n) symmetric random matrix

    [ 0    A ]
    [ A^T  0 ].
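This dilation trick works because the symmetric dilation has the same operator norm as A (its eigenvalues are ± the singular values of A). A quick numerical check (sketch, assumes NumPy):

```python
import numpy as np

rng = np.random.default_rng(3)
m, n = 3, 5
A = rng.standard_normal((m, n))

# Symmetric (m+n) x (m+n) dilation  [[0, A], [A^T, 0]]
D = np.block([[np.zeros((m, m)), A],
              [A.T, np.zeros((n, n))]])

assert np.allclose(D, D.T)                                     # dilation is symmetric
assert np.isclose(np.linalg.norm(D, 2), np.linalg.norm(A, 2))  # same operator norm
```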
3 Matrix completion
Consider a fixed, unknown n × n matrix X. Suppose we are shown m randomly chosen entries of X. Can we guess all the missing entries? This important problem is called matrix completion. We will analyze it using the bounds on the norms of random matrices we just obtained.
Obviously there is no way to guess the missing entries unless we know something extra about the matrix X. So let us assume that X has low rank:

    rank(X) = r ≪ n.

The number of degrees of freedom of an n × n matrix with rank r is O(rn). (Why?) So we may hope that m ∼ rn observed entries of X will be enough to determine X completely. But how?
Here we will analyze what is probably the simplest method for matrix completion. Take the matrix Y that consists of the observed entries of X, with all unobserved entries set to zero. Unlike X, the matrix Y may not have small rank. Compute the best rank-r approximation of Y. The result, as we will show, will be a good approximation to X.
But before we show this, let us define the sampling of entries more rigorously. Assume each entry of X is shown or hidden independently of the others with fixed probability p. Which entries are shown is decided by independent Bernoulli random variables

    δ_ij ∼ Ber(p) with p = m / n^2,

which are often called selectors in this context. The value of p is chosen so that, among the n^2 entries of X, the expected number of selected (known) entries is m.
Define the n × n matrix Y with entries Y_ij = δ_ij X_ij. We can assume that we are shown Y, for it is a matrix that contains the observed entries of X while all unobserved entries are replaced with zeros. The following result shows how to estimate X based on Y.
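The whole procedure fits in a few lines. A sketch (assumes NumPy; the sizes n, r and the sampling probability p below are illustrative) of the estimator analyzed next, applied to a random rank-r matrix:

```python
import numpy as np

rng = np.random.default_rng(4)
n, r = 50, 2
X = rng.standard_normal((n, r)) @ rng.standard_normal((r, n))   # rank-r target matrix

p = 0.5                                    # sampling probability, p = m / n^2
delta = rng.random((n, n)) < p             # Bernoulli selectors delta_ij ~ Ber(p)
Y = np.where(delta, X, 0.0)                # observed entries of X, zeros elsewhere

# Best rank-r approximation of p^{-1} Y via truncated SVD.
U, s, Vt = np.linalg.svd(Y / p)
X_hat = (U[:, :r] * s[:r]) @ Vt[:r]

per_entry_err = np.linalg.norm(X_hat - X, 'fro') / n
# With about half the entries observed, the average per-entry error comes out
# well below the maximum entry magnitude ||X||_max, as Theorem 4 predicts.
```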
Theorem 4 (Matrix completion)
Let X̂ be a best rank-r approximation of p^{-1} Y. Then

    E (1/n) ‖X̂ − X‖_F ≤ C log n · √(rn/m) · ‖X‖_max.

Here ‖X‖_max = max_{i,j} |X_ij| denotes the maximum magnitude of the entries of X.
Remark: This theorem controls the average error per entry in the mean-squared sense. To make the error small, let us assume that we have a sample of size m ≫ rn log^2 n, which is slightly larger than the ideal size m ∼ rn. This makes C log n · √(rn/m) = o(1) and forces the recovery error to be bounded by o(1) ‖X‖_max. Summarizing, Theorem 4 says that the expected average error per entry is much smaller than the maximal magnitude of the entries of X, and this holds for a sample of almost optimal size m. The smaller the rank r of the matrix X, the fewer entries of X we need to see in order to do matrix completion.
Proof of Theorem 4.

Step 1: the error in the operator norm. Let us first bound the recovery error in the operator norm. Decompose the error into two parts using the triangle inequality:

    ‖X̂ − X‖ ≤ ‖X̂ − p^{-1} Y‖ + ‖p^{-1} Y − X‖.

Recall that X̂ is a best rank-r approximation of p^{-1} Y. Then the first part of the error is smaller than the second part, and we have

    ‖X̂ − X‖ ≤ 2 ‖p^{-1} Y − X‖ = (2/p) ‖Y − pX‖.

The entries of the matrix Y − pX,

    (Y − pX)_ij = (δ_ij − p) X_ij,

are independent, mean zero random variables.
We have

    E‖Y − pX‖ ≤ C log n · (E max_i ‖(Y − pX)_i‖_2 + E max_j ‖(Y − pX)^j‖_2).

All that remains is to bound the norms of the rows and columns of Y − pX. This is not difficult if we note that they can be expressed as sums of independent random variables:

    ‖(Y − pX)_i‖_2^2 = ∑_{j=1}^n (δ_ij − p)^2 X_ij^2 ≤ ∑_{j=1}^n (δ_ij − p)^2 · ‖X‖_max^2,

and similarly for columns. Taking the expectation and noting that

    E(δ_ij − p)^2 = Var(δ_ij) = p(1 − p),

we get

    E‖(Y − pX)_i‖_2 ≤ (E‖(Y − pX)_i‖_2^2)^{1/2} ≤ √(pn) · ‖X‖_max.
This is a good bound, but we need something stronger. Since the maximum appears inside the expectation, we need a uniform bound, one that says all rows are bounded simultaneously with high probability. Such uniform bounds are usually proved by applying concentration inequalities followed by a union bound. Bernstein's inequality (145) yields (check!)

    P{ ∑_{j=1}^n (δ_ij − p)^2 > tpn } ≤ exp(−ctpn) for t ≥ 3.

This probability can be further bounded by n^{−ct} using the assumption that m = pn^2 ≥ n log n. A union bound over the n rows leads to

    P{ max_{i∈[n]} ∑_{j=1}^n (δ_ij − p)^2 > tpn } ≤ n · n^{−ct} for t ≥ 3.
Integrating this tail, we have

    E max_{i∈[n]} ∑_{j=1}^n (δ_ij − p)^2 ≲ pn,

and this yields the desired bound on the rows:

    E max_{i∈[n]} ‖(Y − pX)_i‖_2 ≲ √(pn) · ‖X‖_max.

We can argue similarly for the columns. Then

    E‖Y − pX‖ ≲ log n · √(pn) · ‖X‖_max.

Therefore we get

    E‖X̂ − X‖ ≲ log n · √(n/p) · ‖X‖_max.
Step 2: passing to the Frobenius norm. We know that rank(X) ≤ r by assumption and rank(X̂) ≤ r by construction, so rank(X̂ − X) ≤ 2r. There is a simple relationship between the operator and Frobenius norms:

    ‖X̂ − X‖_F ≤ √(2r) · ‖X̂ − X‖.

Taking the expectation of both sides, we get

    E‖X̂ − X‖_F ≤ √(2r) · E‖X̂ − X‖ ≲ log n · √(rn/p) · ‖X‖_max.

Dividing both sides by n, we can rewrite this bound as

    E (1/n) ‖X̂ − X‖_F ≲ log n · √(rn/(pn^2)) · ‖X‖_max.

The proof is completed by recalling the definition of the sampling probability, p = m/n^2.
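The norm comparison used in Step 2 holds for any matrix M with rank(M) ≤ k: since ‖M‖_F^2 is the sum of at most k squared singular values, each at most ‖M‖^2, we get ‖M‖_F ≤ √k · ‖M‖. A quick numerical check (sketch, assumes NumPy):

```python
import numpy as np

rng = np.random.default_rng(5)
n, k = 30, 4
M = rng.standard_normal((n, k)) @ rng.standard_normal((k, n))   # rank(M) <= k

fro = np.linalg.norm(M, 'fro')   # sqrt of sum of squared singular values
op = np.linalg.norm(M, 2)        # largest singular value

# ||M||_F <= sqrt(k) * ||M||  for any matrix of rank at most k
assert fro <= np.sqrt(k) * op + 1e-9
```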
One can estimate the covariance matrix Σ from the sample bycomputing the sample covariance
ΣN =1
N
N983131
i=1
(Xi minus EXi)(Xi minus EXi)T
Again the law of large numbers guarantees that the estimatebecomes tight as the sample size N grows to infinity ie
ΣN rarr Σ as N rarr infin
But how large should the sample size N be for covarianceestimation Generally one can not have N lt n for dimensionreasons (Why) We are going to show that
N sim n log n
is enough In other words covariance estimation is possible withjust logarithmic oversampling
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 4 22
For simplicity we shall state the covariance estimation bound formean zero distributions (If the mean is not zero we can estimateit from the sample and subtract The mean can be accuratelyestimated from a sample of size N = O(n))
Theorem 1 (Covariance estimation)
Let X be a random vector in Rn with covariance matrix Σ Supposethat
983042X98304222 ≲ E983042X98304222 = trΣ almost surely
Then for every N ge 1 we have
E 983042ΣN minus Σ983042 ≲ 983042Σ983042983075983157
n log n
N+
n log n
N
983076
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 5 22
Proof Apply matrix Bernsteinrsquos inequality corollary 3 for the sum ofindependent random matrices XiX
Ti minus Σ and get
E 983042ΣN minus Σ983042 =1
NE
983056983056983056983056983056
N983131
i=1
983043XiX
Ti minus Σ
983044983056983056983056983056983056
≲ 1
N(σ983155
log n+K log n)
where
σ2 =
983056983056983056983056983056
N983131
i=1
E983043XiX
Ti minus Σ
9830442983056983056983056983056983056 = N
983056983056983056E983043XXT minus Σ
9830442983056983056983056
and K is chosen so that
983042XXT minus Σ983042 le K almost surely
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 6 22
It remains to bound σ and K Let us start with σ We have
E(XXT minus Σ)2 = E983042X98304222XXT minus Σ2
≾ tr(Σ) middot EXXT
= tr(Σ) middot Σ
Thus σ2 ≲ Ntr(Σ)983042Σ983042 Next to bound K we have
983042XXT minus Σ983042 le 983042X98304222 + 983042Σ983042≲ tr(Σ) + 983042Σ983042le 2tr(Σ) = K
Therefore
E 983042ΣN minus Σ983042 ≲ 1
N(983155
Ntr(Σ)983042Σ983042 log n+ tr(Σ) log n)
The proof is completed by using tr(Σ) le n983042Σ983042
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 7 22
11 Low-dimensional distributions
Far fewer samples are needed for covariance estimation forlow-dimensional or approximately low-dimensional distributionsTo measure approximate low-dimensionality we can use the notionof the stable rank of Σ2 The stable rank of a matrix A is definedas the square of the ratio of the Frobenius to operator norms
r(A) =983042A9830422F983042A98304222
le rank(A)
The proof of Theorem 1 yields
E 983042ΣN minus Σ983042 983249 983042Σ983042983075983157
r log n
N+
r log n
N
983076
where r = r(Σ12) = tr(Σ)983042Σ983042 Therefore covariance estimationis possible with N sim r log n samples
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 8 22
2 Norms of random matrices
Let Ai denote the ith row of A we have (exercise)
maxi
983042Ai9830422 le 983042A9830422 leradicnmax
i983042Ai9830422
For random matrices with independent entries the bound can beimproved to the point where the upper and lower bounds almostmatch
Theorem 2 (Norms of random matrices without boundednessassumptions)
Let A be an ntimes n symmetric random matrix whose entries on andabove the diagonal are independent mean zero random variables Then
Emaxi
983042Ai9830422 le E983042A9830422 le C log n middot Emaxi
983042Ai9830422
where Ai denote the rows of A
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 9 22
Lemma 3 (Symmetrization)
Let X1 XN be independent mean zero random vectors in a normedspace and ε1 εN be independent Rademacher random variablesThen
1
2E
983056983056983056983056983056
N983131
i=1
εiXi
983056983056983056983056983056 983249 E
983056983056983056983056983056
N983131
i=1
Xi
983056983056983056983056983056 983249 2E
983056983056983056983056983056
N983131
i=1
εiXi
983056983056983056983056983056
Proof To prove the upper bound let (X primei) be an independent copy of
the random vectors (Xi) ie just different random vectors with thesame joint distribution as (Xi) and independent from (Xi) Then
E
983056983056983056983056983056983131
i
Xi
983056983056983056983056983056 = E
983056983056983056983056983056983131
i
Xi minus E
983075983131
i
X primei
983076983056983056983056983056983056
983249 E
983056983056983056983056983056983131
i
Xi minus983131
i
X primei
983056983056983056983056983056 = E
983056983056983056983056983056983131
i
983043Xi minusX prime
i
983044983056983056983056983056983056
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 10 22
The distribution of the random vectors Yi = Xi minusX primei is symmetric
which means that the distributions of Yi and minusYi are the same(Why) Thus the distribution of the random vectors Yi and εiYi is alsothe same for all we do is change the signs of these vectors at randomand independently of the values of the vectors Summarizing we canreplace Xi minusX prime
i in the sum above with εi(Xi minusX primei) Thus
E
983056983056983056983056983056983131
i
Xi
983056983056983056983056983056 le E
983056983056983056983056983056983131
i
εi(Xi minusX primei)
983056983056983056983056983056
le E
983056983056983056983056983056983131
i
εiXi
983056983056983056983056983056+ E
983056983056983056983056983056983131
i
εiXprimei
983056983056983056983056983056
= 2E
983056983056983056983056983056983131
i
εiXi
983056983056983056983056983056
This proves the upper bound in the symmetrization inequality Thelower bound can be proved by a similar argument (Do this)
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 11 22
Proof of Theorem 2 The lower bound is trivial The proof of the upperbound will be based on matrix Bernsteinrsquos inequalityWe represent A as a sum of independent mean zero symmetricrandom matrices Zij each of which contains a pair of symmetric entriesof A (or one diagonal entry)
A =983131
ilejZij
By the symmetrization inequality (Lemma 3) for the random matricesZij we get
E983042A983042 = E983056983056983056983131
i983249jZij
983056983056983056 983249 2E983056983056983056983131
i983249jXij
983056983056983056
where we set Xij = εijZij and εij are independent Rademacherrandom variables Now we condition on A The random variables Zij
become fixed values and all randomness remains in the Rademacherrandom variables εij
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 12 22
Note that Xij are (conditionally) bounded almost surely and this isexactly what we have lacked to apply matrix Bernsteinrsquos inequalityNow we can do it The corollary of matrix Bernsteinrsquos inequality gives
Eε
983056983056983056983131
ilejXij
983056983056983056 ≲ σ983155
log n+K log n
where σ2 = 983042983123
ilej EεX2ij983042 and K = maxilej 983042Xij983042 A good exercise is
to check that
σ ≲ maxi
983042Ai9830422 and K ≲ maxi
983042Ai9830422
Then we have
Eε
983056983056983056983131
i983249jXij
983056983056983056 ≲ log n middotmaxi
983042Ai9830422
Finally we unfix A by taking expectation of both sides of thisinequality with respect to A and using the law of total expectation
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 13 22
We state Theorem 2 for symmetric matrices but it is simple toextend it to general mtimes n random matrices A The bound in thiscase becomes
E983042A9830422 le C log(m+ n) middot (Emaxi
983042Ai9830422 + Emaxj
983042Aj9830422)
To see this apply Theorem 2 to the (m+ n)times (m+ n) symmetricrandom matrix 983063
0 AAT 0
983064
3 Matrix completion
Consider a fixed unknown ntimes n matrix X Suppose we are shownm randomly chosen entries of X Can we guess all the missingentries This important problem is called matrix completion Wewill analyze it using the bounds on the norms on random matriceswe just obtained
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 14 22
Obviously there is no way to guess the missing entries unless weknow something extra about the matrix X So let us assume thatX has low rank
rank(X) = r ≪ n
The number of degrees of freedom of an ntimes n matrix with rank ris O(rn) (Why) So we may hope that m sim rn observed entriesof X will be enough to determine X completely But how
Here we will analyze what is probably the simplest method formatrix completion Take the matrix Y that consists of theobserved entries of X while all unobserved entries are set to zeroUnlike X the matrix Y may not have small rank Compute thebest rank r approximation of Y The result as we will show willbe a good approximation to X
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 15 22
But before we show this let us define sampling of entries morerigorously Assume each entry of X is shown or hiddenindependently of others with fixed probability p Which entries areshown is decided by independent Bernoulli random variables
δij sim Ber(p) with p =m
n2
which are often called selectors in this context The value of p ischosen so that among n2 entries of X the expected number ofselected (known) entries is m
Define the ntimes n matrix Y with entries Yij = δijXij We canassume that we are shown Y for it is a matrix that contains theobserved entries of X while all unobserved entries are replacedwith zeros The following result shows how to estimate X basedon Y
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 16 22
Theorem 4 (Matrix completion)
Let 983142X be a best rank r approximation to pminus1Y Then
E1
n983042983142X minusX983042F 983249 C logn
983157rn
m983042X983042max
Here 983042X983042max = maxij |Xij | denotes the maximum magnitude of theentries of X
Remark This theorem controls the average error per entry in themean-squared sense To make the error small let us assume that wehave a sample of size m ≫ rn log2 n which is slightly larger than theideal size m sim rn This makes C log n
983155rnm = o(1) and forces the
recovery error to be bounded by o(1)983042X983042max Summarizing Theorem4 says that the expected average error per entry is much smaller thanthe maximal magnitude of the entries of X This is true for a sampleof almost optimal size m The smaller the rank r of the matrix X thefewer entries of X we need to see in order to do matrix completion
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 17 22
Proof of Theorem 4Step 1 The error in the operator norm Let us first bound therecovery error in the operator norm Decompose the error into twoparts using triangle inequality
983042983142X minusX983042 le 983042983142X minus pminus1Y 983042+ 983042pminus1Y minusX983042
Recall that 983142X is a best approximation to pminus1Y Then the first part ofthe error is smaller than the second part and we have
983042983142X minusX983042 le 2983042pminus1Y minusX983042 =2
p983042Y minus pX983042
The entries of the matrix Y minus pX
(Y minus pX)ij = (δij minus p)Xij
are independent and mean zero random variables
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 18 22
We have
E983042Y minus pX983042
983249 C log n middot983061Emax
i983042(Y minus pX)i9830422 + Emax
j983042(Y minus pX)j9830422
983062
All that remains is to bound the norms of the rows and columns ofY minus pX This is not difficult if we note that they can be expressed assums of independent random variables
983042(Y minus pX)i98304222 =n983131
j=1
(δij minus p)2X2ij 983249
n983131
j=1
(δij minus p)2 middot 983042X9830422max
and similarly for columns Taking expectation and noting that
E (δij minus p)2 = Var (δij) = p(1minus p)
we get
E 983042(Y minus pX)i9830422 983249983059E 983042(Y minus pX)i98304222
98306012983249 radic
pn983042X983042max
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 19 22
This is a good bound but we need something stronger Since themaximum appears inside the expectation we need a uniform boundwhich will say that all rows are bounded simultaneously with highprobability Such uniform bounds are usually proved by applyingconcentration inequalities followed by a union bound Bernsteinsinequality (145) yields (check)
P
983099983103
983101
n983131
j=1
(δij minus p)2 gt tpn
983100983104
983102 983249 exp(minusctpn) for t 983245 3
This probability can be further bounded by nminusct using the assumptionthat m = pn2 ge n log n A union bound over n rows leads to
P
983099983103
983101maxiisin[n]
n983131
j=1
(δij minus p)2 gt tpn
983100983104
983102 983249 n middot nminusct for t 983245 3
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 20 22
Integrating this tail we have
Emaxiisin[n]
n983131
j=1
(δij minus p)2 ≲ pn
And this yields the desired bound on the rows
Emaxiisin[n]
983042(Y minus pX)i9830422 ≲radicpn983042X983042max
We can do similarly for the columns Then
E983042Y minus pX983042 ≲ log nradicpn983042X983042max
Therefore we get
E983042983142X minusX983042 ≲ log n
983157n
p983042X983042max
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 21 22
Step 2 Passing to Frobenius normWe know that rank(X) le r by assumption and rank(983141X) le r by
construction so rank(983142X minusX) le 2r There is a simple relationshipbetween the operator and Frobenius norms
983042983142X minusX983042F leradic2r983042983142X minusX983042
Taking expectation of both sides we get
E983042983142X minusX983042F 983249radic2rE983042983142X minusX983042 ≲ log n
983157rn
p983042X983042max
Dividing both sides by n we can rewrite this bound as
E1
n983042983142X minusX983042F ≲ log n
983157rn
pn2983042X983042max
The proof is completed by noting the definition of the samplingprobability p = mn2
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 22 22
For simplicity we shall state the covariance estimation bound formean zero distributions (If the mean is not zero we can estimateit from the sample and subtract The mean can be accuratelyestimated from a sample of size N = O(n))
Theorem 1 (Covariance estimation)
Let X be a random vector in Rn with covariance matrix Σ Supposethat
983042X98304222 ≲ E983042X98304222 = trΣ almost surely
Then for every N ge 1 we have
E 983042ΣN minus Σ983042 ≲ 983042Σ983042983075983157
n log n
N+
n log n
N
983076
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 5 22
Proof Apply matrix Bernsteinrsquos inequality corollary 3 for the sum ofindependent random matrices XiX
Ti minus Σ and get
E 983042ΣN minus Σ983042 =1
NE
983056983056983056983056983056
N983131
i=1
983043XiX
Ti minus Σ
983044983056983056983056983056983056
≲ 1
N(σ983155
log n+K log n)
where
σ2 =
983056983056983056983056983056
N983131
i=1
E983043XiX
Ti minus Σ
9830442983056983056983056983056983056 = N
983056983056983056E983043XXT minus Σ
9830442983056983056983056
and K is chosen so that
983042XXT minus Σ983042 le K almost surely
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 6 22
It remains to bound σ and K Let us start with σ We have
E(XXT minus Σ)2 = E983042X98304222XXT minus Σ2
≾ tr(Σ) middot EXXT
= tr(Σ) middot Σ
Thus σ2 ≲ Ntr(Σ)983042Σ983042 Next to bound K we have
983042XXT minus Σ983042 le 983042X98304222 + 983042Σ983042≲ tr(Σ) + 983042Σ983042le 2tr(Σ) = K
Therefore
E 983042ΣN minus Σ983042 ≲ 1
N(983155
Ntr(Σ)983042Σ983042 log n+ tr(Σ) log n)
The proof is completed by using tr(Σ) le n983042Σ983042
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 7 22
11 Low-dimensional distributions
Far fewer samples are needed for covariance estimation forlow-dimensional or approximately low-dimensional distributionsTo measure approximate low-dimensionality we can use the notionof the stable rank of Σ2 The stable rank of a matrix A is definedas the square of the ratio of the Frobenius to operator norms
r(A) =983042A9830422F983042A98304222
le rank(A)
The proof of Theorem 1 yields
E 983042ΣN minus Σ983042 983249 983042Σ983042983075983157
r log n
N+
r log n
N
983076
where r = r(Σ12) = tr(Σ)983042Σ983042 Therefore covariance estimationis possible with N sim r log n samples
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 8 22
2 Norms of random matrices
Let Ai denote the ith row of A we have (exercise)
maxi
983042Ai9830422 le 983042A9830422 leradicnmax
i983042Ai9830422
For random matrices with independent entries the bound can beimproved to the point where the upper and lower bounds almostmatch
Theorem 2 (Norms of random matrices without boundednessassumptions)
Let A be an ntimes n symmetric random matrix whose entries on andabove the diagonal are independent mean zero random variables Then
Emaxi
983042Ai9830422 le E983042A9830422 le C log n middot Emaxi
983042Ai9830422
where Ai denote the rows of A
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 9 22
Lemma 3 (Symmetrization)
Let X1 XN be independent mean zero random vectors in a normedspace and ε1 εN be independent Rademacher random variablesThen
1
2E
983056983056983056983056983056
N983131
i=1
εiXi
983056983056983056983056983056 983249 E
983056983056983056983056983056
N983131
i=1
Xi
983056983056983056983056983056 983249 2E
983056983056983056983056983056
N983131
i=1
εiXi
983056983056983056983056983056
Proof To prove the upper bound let (X primei) be an independent copy of
the random vectors (Xi) ie just different random vectors with thesame joint distribution as (Xi) and independent from (Xi) Then
E
983056983056983056983056983056983131
i
Xi
983056983056983056983056983056 = E
983056983056983056983056983056983131
i
Xi minus E
983075983131
i
X primei
983076983056983056983056983056983056
983249 E
983056983056983056983056983056983131
i
Xi minus983131
i
X primei
983056983056983056983056983056 = E
983056983056983056983056983056983131
i
983043Xi minusX prime
i
983044983056983056983056983056983056
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 10 22
The distribution of the random vectors Yi = Xi minusX primei is symmetric
which means that the distributions of Yi and minusYi are the same(Why) Thus the distribution of the random vectors Yi and εiYi is alsothe same for all we do is change the signs of these vectors at randomand independently of the values of the vectors Summarizing we canreplace Xi minusX prime
i in the sum above with εi(Xi minusX primei) Thus
E
983056983056983056983056983056983131
i
Xi
983056983056983056983056983056 le E
983056983056983056983056983056983131
i
εi(Xi minusX primei)
983056983056983056983056983056
le E
983056983056983056983056983056983131
i
εiXi
983056983056983056983056983056+ E
983056983056983056983056983056983131
i
εiXprimei
983056983056983056983056983056
= 2E
983056983056983056983056983056983131
i
εiXi
983056983056983056983056983056
This proves the upper bound in the symmetrization inequality Thelower bound can be proved by a similar argument (Do this)
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 11 22
Proof of Theorem 2 The lower bound is trivial The proof of the upperbound will be based on matrix Bernsteinrsquos inequalityWe represent A as a sum of independent mean zero symmetricrandom matrices Zij each of which contains a pair of symmetric entriesof A (or one diagonal entry)
A =983131
ilejZij
By the symmetrization inequality (Lemma 3) for the random matricesZij we get
E983042A983042 = E983056983056983056983131
i983249jZij
983056983056983056 983249 2E983056983056983056983131
i983249jXij
983056983056983056
where we set Xij = εijZij and εij are independent Rademacherrandom variables Now we condition on A The random variables Zij
become fixed values and all randomness remains in the Rademacherrandom variables εij
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 12 22
Note that Xij are (conditionally) bounded almost surely and this isexactly what we have lacked to apply matrix Bernsteinrsquos inequalityNow we can do it The corollary of matrix Bernsteinrsquos inequality gives
Eε
983056983056983056983131
ilejXij
983056983056983056 ≲ σ983155
log n+K log n
where σ2 = 983042983123
ilej EεX2ij983042 and K = maxilej 983042Xij983042 A good exercise is
to check that
σ ≲ maxi
983042Ai9830422 and K ≲ maxi
983042Ai9830422
Then we have
Eε
983056983056983056983131
i983249jXij
983056983056983056 ≲ log n middotmaxi
983042Ai9830422
Finally we unfix A by taking expectation of both sides of thisinequality with respect to A and using the law of total expectation
We stated Theorem 2 for symmetric matrices, but it is simple to extend it to general $m \times n$ random matrices $A$. The bound in this case becomes
\[ E\|A\|_2 \le C \log(m+n) \cdot \Big( E \max_i \|A_i\|_2 + E \max_j \|A^j\|_2 \Big), \]
where $A_i$ denote the rows and $A^j$ the columns of $A$. To see this, apply Theorem 2 to the $(m+n) \times (m+n)$ symmetric random matrix
\[ \begin{bmatrix} 0 & A \\ A^{\mathsf{T}} & 0 \end{bmatrix}. \]
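To get a feel for Theorem 2 (an illustrative sketch with arbitrary toy sizes, not part of the lecture), one can compare the operator norm of a symmetric Gaussian matrix with its largest row norm:

```python
import numpy as np

rng = np.random.default_rng(1)
n = 300

# Symmetric matrix with independent mean-zero entries on and above the diagonal
G = rng.standard_normal((n, n))
A = np.triu(G) + np.triu(G, 1).T

op_norm = np.linalg.norm(A, 2)             # operator (spectral) norm ||A||
max_row = np.linalg.norm(A, axis=1).max()  # max_i ||A_i||_2

# Deterministic two-sided bound: max_i ||A_i||_2 <= ||A|| <= sqrt(n) max_i ||A_i||_2.
# Theorem 2 says that, in expectation, the sqrt(n) factor improves to C log n.
print(op_norm / max_row, np.log(n))
```

For this ensemble the ratio $\|A\| / \max_i \|A_i\|_2$ stays far below the deterministic factor $\sqrt{n}$, consistent with the logarithmic factor in Theorem 2.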
3 Matrix completion
Consider a fixed, unknown $n \times n$ matrix $X$. Suppose we are shown $m$ randomly chosen entries of $X$. Can we guess all the missing entries? This important problem is called matrix completion. We will analyze it using the bounds on the norms of random matrices we just obtained.
Obviously, there is no way to guess the missing entries unless we know something extra about the matrix $X$. So let us assume that $X$ has low rank:
\[ \operatorname{rank}(X) = r \ll n. \]
The number of degrees of freedom of an $n \times n$ matrix with rank $r$ is $O(rn)$. (Why?) So we may hope that $m \sim rn$ observed entries of $X$ will be enough to determine $X$ completely. But how?

Here we will analyze what is probably the simplest method for matrix completion. Take the matrix $Y$ that consists of the observed entries of $X$, with all unobserved entries set to zero. Unlike $X$, the matrix $Y$ may not have small rank. Compute the best rank-$r$ approximation of $Y$. The result, as we will show, will be a good approximation to $X$.
But before we show this, let us define the sampling of entries more rigorously. Assume each entry of $X$ is shown or hidden independently of the others with fixed probability $p$. Which entries are shown is decided by independent Bernoulli random variables
\[ \delta_{ij} \sim \mathrm{Ber}(p) \quad \text{with} \quad p = \frac{m}{n^2}, \]
which are often called selectors in this context. The value of $p$ is chosen so that, among the $n^2$ entries of $X$, the expected number of selected (known) entries is $m$.

Define the $n \times n$ matrix $Y$ with entries $Y_{ij} = \delta_{ij} X_{ij}$. We can assume that we are shown $Y$, for it is a matrix that contains the observed entries of $X$ while all unobserved entries are replaced with zeros. The following result shows how to estimate $X$ based on $Y$.
Theorem 4 (Matrix completion)

Let $\widehat{X}$ be a best rank-$r$ approximation to $p^{-1}Y$. Then
\[ E\, \frac{1}{n} \|\widehat{X} - X\|_F \le C \log n \, \sqrt{\frac{rn}{m}}\, \|X\|_{\max}, \]
where $\|X\|_{\max} = \max_{i,j} |X_{ij}|$ denotes the maximum magnitude of the entries of $X$.

Remark. This theorem controls the average error per entry in the mean-squared sense. To make the error small, let us assume that we have a sample of size $m \gg rn \log^2 n$, which is only slightly larger than the ideal size $m \sim rn$. This makes $C \log n \sqrt{rn/m} = o(1)$ and forces the recovery error to be bounded by $o(1)\|X\|_{\max}$. Summarizing, Theorem 4 says that the expected average error per entry is much smaller than the maximal magnitude of the entries of $X$, and this is true for a sample of almost optimal size $m$. The smaller the rank $r$ of the matrix $X$, the fewer entries of $X$ we need to see in order to do matrix completion.
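The estimator of Theorem 4 is short enough to sketch in NumPy (a toy illustration with arbitrary sizes; the rank $r$ is assumed known, as in the theorem):

```python
import numpy as np

rng = np.random.default_rng(0)
n, r = 200, 3
m = 12000           # number of entries we expect to observe
p = m / n**2        # sampling probability, p = m / n^2 = 0.3

# Unknown rank-r matrix X
X = rng.standard_normal((n, r)) @ rng.standard_normal((r, n))

# Bernoulli selectors: reveal each entry independently with probability p
delta = rng.random((n, n)) < p
Y = np.where(delta, X, 0.0)   # observed entries, zeros elsewhere

# Estimator of Theorem 4: best rank-r approximation of p^{-1} Y, via truncated SVD
U, s, Vt = np.linalg.svd(Y / p, full_matrices=False)
X_hat = (U[:, :r] * s[:r]) @ Vt[:r]

# Average per-entry error (the quantity bounded in Theorem 4) vs. ||X||_max
err_per_entry = np.linalg.norm(X_hat - X) / n
print(err_per_entry, np.abs(X).max())
```

With these toy sizes the average per-entry error comes out well below $\|X\|_{\max}$, in line with the theorem.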
Proof of Theorem 4. Step 1: the error in the operator norm. Let us first bound the recovery error in the operator norm. Decompose the error into two parts using the triangle inequality:
\[ \|\widehat{X} - X\| \le \|\widehat{X} - p^{-1}Y\| + \|p^{-1}Y - X\|. \]
Recall that $\widehat{X}$ is a best rank-$r$ approximation to $p^{-1}Y$, while $X$ itself has rank at most $r$; hence the first part of the error is at most the second part, and we have
\[ \|\widehat{X} - X\| \le 2\|p^{-1}Y - X\| = \frac{2}{p} \|Y - pX\|. \]
The entries of the matrix $Y - pX$,
\[ (Y - pX)_{ij} = (\delta_{ij} - p) X_{ij}, \]
are independent, mean zero random variables.
Applying the extension of Theorem 2 to the matrix $Y - pX$, we have
\[ E\|Y - pX\| \le C \log n \cdot \Big( E \max_i \|(Y - pX)_i\|_2 + E \max_j \|(Y - pX)^j\|_2 \Big). \]
All that remains is to bound the norms of the rows and columns of $Y - pX$. This is not difficult if we note that they can be expressed as sums of independent random variables:
\[ \|(Y - pX)_i\|_2^2 = \sum_{j=1}^n (\delta_{ij} - p)^2 X_{ij}^2 \le \sum_{j=1}^n (\delta_{ij} - p)^2 \cdot \|X\|_{\max}^2, \]
and similarly for the columns. Taking expectation and noting that
\[ E(\delta_{ij} - p)^2 = \operatorname{Var}(\delta_{ij}) = p(1 - p), \]
we get
\[ E\|(Y - pX)_i\|_2 \le \big( E\|(Y - pX)_i\|_2^2 \big)^{1/2} \le \sqrt{pn}\, \|X\|_{\max}. \]
This is a good bound, but we need something stronger: since the maximum appears inside the expectation, we need a uniform bound, one which says that all rows are bounded simultaneously with high probability. Such uniform bounds are usually proved by applying concentration inequalities followed by a union bound. Bernstein's inequality (14.5) yields (check!)
\[ P\Big\{ \sum_{j=1}^n (\delta_{ij} - p)^2 > tpn \Big\} \le \exp(-ctpn) \quad \text{for } t \ge 3. \]
This probability can be further bounded by $n^{-ct}$ using the assumption that $m = pn^2 \ge n \log n$. A union bound over the $n$ rows leads to
\[ P\Big\{ \max_{i \in [n]} \sum_{j=1}^n (\delta_{ij} - p)^2 > tpn \Big\} \le n \cdot n^{-ct} \quad \text{for } t \ge 3. \]
Integrating this tail, we have
\[ E \max_{i \in [n]} \sum_{j=1}^n (\delta_{ij} - p)^2 \lesssim pn. \]
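A small simulation (my own illustration, with arbitrary toy sizes) suggesting that $\max_{i \in [n]} \sum_j (\delta_{ij} - p)^2$ is indeed of order $pn$:

```python
import numpy as np

rng = np.random.default_rng(2)
n, p = 500, 0.1                                   # toy dimension and sampling probability

delta = (rng.random((n, n)) < p).astype(float)    # Bernoulli(p) selectors
row_sums = ((delta - p) ** 2).sum(axis=1)         # sum_j (delta_ij - p)^2, one value per row

# Each row sum has mean n*p*(1-p), just below pn; the maximum over all n rows
# stays within a small constant factor of pn, matching the tail bound with t = 3.
print(row_sums.mean() / (p * n), row_sums.max() / (p * n))
```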
And this yields the desired bound on the rows:
\[ E \max_{i \in [n]} \|(Y - pX)_i\|_2 \lesssim \sqrt{pn}\, \|X\|_{\max}. \]
Arguing similarly for the columns, we obtain
\[ E\|Y - pX\| \lesssim \log n \sqrt{pn}\, \|X\|_{\max}. \]
Therefore, we get
\[ E\|\widehat{X} - X\| \lesssim \log n \sqrt{\frac{n}{p}}\, \|X\|_{\max}. \]
Step 2: passing to the Frobenius norm. We know that $\operatorname{rank}(X) \le r$ by assumption and $\operatorname{rank}(\widehat{X}) \le r$ by construction, so $\operatorname{rank}(\widehat{X} - X) \le 2r$. There is a simple relationship between the operator and Frobenius norms:
\[ \|\widehat{X} - X\|_F \le \sqrt{2r}\, \|\widehat{X} - X\|. \]
Taking expectation of both sides, we get
\[ E\|\widehat{X} - X\|_F \le \sqrt{2r}\, E\|\widehat{X} - X\| \lesssim \log n \sqrt{\frac{rn}{p}}\, \|X\|_{\max}. \]
Dividing both sides by $n$, we can rewrite this bound as
\[ E\, \frac{1}{n} \|\widehat{X} - X\|_F \lesssim \log n \sqrt{\frac{rn}{pn^2}}\, \|X\|_{\max}. \]
The proof is completed by recalling the definition of the sampling probability, $p = m/n^2$.
It remains to bound σ and K Let us start with σ We have
E(XXT minus Σ)2 = E983042X98304222XXT minus Σ2
≾ tr(Σ) middot EXXT
= tr(Σ) middot Σ
Thus σ2 ≲ Ntr(Σ)983042Σ983042 Next to bound K we have
983042XXT minus Σ983042 le 983042X98304222 + 983042Σ983042≲ tr(Σ) + 983042Σ983042le 2tr(Σ) = K
Therefore
E 983042ΣN minus Σ983042 ≲ 1
N(983155
Ntr(Σ)983042Σ983042 log n+ tr(Σ) log n)
The proof is completed by using tr(Σ) le n983042Σ983042
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 7 22
11 Low-dimensional distributions
Far fewer samples are needed for covariance estimation forlow-dimensional or approximately low-dimensional distributionsTo measure approximate low-dimensionality we can use the notionof the stable rank of Σ2 The stable rank of a matrix A is definedas the square of the ratio of the Frobenius to operator norms
r(A) =983042A9830422F983042A98304222
le rank(A)
The proof of Theorem 1 yields
E 983042ΣN minus Σ983042 983249 983042Σ983042983075983157
r log n
N+
r log n
N
983076
where r = r(Σ12) = tr(Σ)983042Σ983042 Therefore covariance estimationis possible with N sim r log n samples
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 8 22
2 Norms of random matrices
Let Ai denote the ith row of A we have (exercise)
maxi
983042Ai9830422 le 983042A9830422 leradicnmax
i983042Ai9830422
For random matrices with independent entries the bound can beimproved to the point where the upper and lower bounds almostmatch
Theorem 2 (Norms of random matrices without boundednessassumptions)
Let A be an ntimes n symmetric random matrix whose entries on andabove the diagonal are independent mean zero random variables Then
Emaxi
983042Ai9830422 le E983042A9830422 le C log n middot Emaxi
983042Ai9830422
where Ai denote the rows of A
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 9 22
Lemma 3 (Symmetrization)
Let X1 XN be independent mean zero random vectors in a normedspace and ε1 εN be independent Rademacher random variablesThen
1
2E
983056983056983056983056983056
N983131
i=1
εiXi
983056983056983056983056983056 983249 E
983056983056983056983056983056
N983131
i=1
Xi
983056983056983056983056983056 983249 2E
983056983056983056983056983056
N983131
i=1
εiXi
983056983056983056983056983056
Proof To prove the upper bound let (X primei) be an independent copy of
the random vectors (Xi) ie just different random vectors with thesame joint distribution as (Xi) and independent from (Xi) Then
E
983056983056983056983056983056983131
i
Xi
983056983056983056983056983056 = E
983056983056983056983056983056983131
i
Xi minus E
983075983131
i
X primei
983076983056983056983056983056983056
983249 E
983056983056983056983056983056983131
i
Xi minus983131
i
X primei
983056983056983056983056983056 = E
983056983056983056983056983056983131
i
983043Xi minusX prime
i
983044983056983056983056983056983056
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 10 22
The distribution of the random vectors Yi = Xi minusX primei is symmetric
which means that the distributions of Yi and minusYi are the same(Why) Thus the distribution of the random vectors Yi and εiYi is alsothe same for all we do is change the signs of these vectors at randomand independently of the values of the vectors Summarizing we canreplace Xi minusX prime
i in the sum above with εi(Xi minusX primei) Thus
E
983056983056983056983056983056983131
i
Xi
983056983056983056983056983056 le E
983056983056983056983056983056983131
i
εi(Xi minusX primei)
983056983056983056983056983056
le E
983056983056983056983056983056983131
i
εiXi
983056983056983056983056983056+ E
983056983056983056983056983056983131
i
εiXprimei
983056983056983056983056983056
= 2E
983056983056983056983056983056983131
i
εiXi
983056983056983056983056983056
This proves the upper bound in the symmetrization inequality Thelower bound can be proved by a similar argument (Do this)
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 11 22
Proof of Theorem 2 The lower bound is trivial The proof of the upperbound will be based on matrix Bernsteinrsquos inequalityWe represent A as a sum of independent mean zero symmetricrandom matrices Zij each of which contains a pair of symmetric entriesof A (or one diagonal entry)
A =983131
ilejZij
By the symmetrization inequality (Lemma 3) for the random matricesZij we get
E983042A983042 = E983056983056983056983131
i983249jZij
983056983056983056 983249 2E983056983056983056983131
i983249jXij
983056983056983056
where we set Xij = εijZij and εij are independent Rademacherrandom variables Now we condition on A The random variables Zij
become fixed values and all randomness remains in the Rademacherrandom variables εij
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 12 22
Note that Xij are (conditionally) bounded almost surely and this isexactly what we have lacked to apply matrix Bernsteinrsquos inequalityNow we can do it The corollary of matrix Bernsteinrsquos inequality gives
Eε
983056983056983056983131
ilejXij
983056983056983056 ≲ σ983155
log n+K log n
where σ2 = 983042983123
ilej EεX2ij983042 and K = maxilej 983042Xij983042 A good exercise is
to check that
σ ≲ maxi
983042Ai9830422 and K ≲ maxi
983042Ai9830422
Then we have
Eε
983056983056983056983131
i983249jXij
983056983056983056 ≲ log n middotmaxi
983042Ai9830422
Finally we unfix A by taking expectation of both sides of thisinequality with respect to A and using the law of total expectation
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 13 22
We state Theorem 2 for symmetric matrices but it is simple toextend it to general mtimes n random matrices A The bound in thiscase becomes
E983042A9830422 le C log(m+ n) middot (Emaxi
983042Ai9830422 + Emaxj
983042Aj9830422)
To see this apply Theorem 2 to the (m+ n)times (m+ n) symmetricrandom matrix 983063
0 AAT 0
983064
3 Matrix completion
Consider a fixed unknown ntimes n matrix X Suppose we are shownm randomly chosen entries of X Can we guess all the missingentries This important problem is called matrix completion Wewill analyze it using the bounds on the norms on random matriceswe just obtained
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 14 22
Obviously there is no way to guess the missing entries unless weknow something extra about the matrix X So let us assume thatX has low rank
rank(X) = r ≪ n
The number of degrees of freedom of an ntimes n matrix with rank ris O(rn) (Why) So we may hope that m sim rn observed entriesof X will be enough to determine X completely But how
Here we will analyze what is probably the simplest method formatrix completion Take the matrix Y that consists of theobserved entries of X while all unobserved entries are set to zeroUnlike X the matrix Y may not have small rank Compute thebest rank r approximation of Y The result as we will show willbe a good approximation to X
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 15 22
But before we show this let us define sampling of entries morerigorously Assume each entry of X is shown or hiddenindependently of others with fixed probability p Which entries areshown is decided by independent Bernoulli random variables
δij sim Ber(p) with p =m
n2
which are often called selectors in this context The value of p ischosen so that among n2 entries of X the expected number ofselected (known) entries is m
Define the ntimes n matrix Y with entries Yij = δijXij We canassume that we are shown Y for it is a matrix that contains theobserved entries of X while all unobserved entries are replacedwith zeros The following result shows how to estimate X basedon Y
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 16 22
Theorem 4 (Matrix completion)
Let 983142X be a best rank r approximation to pminus1Y Then
E1
n983042983142X minusX983042F 983249 C logn
983157rn
m983042X983042max
Here 983042X983042max = maxij |Xij | denotes the maximum magnitude of theentries of X
Remark This theorem controls the average error per entry in themean-squared sense To make the error small let us assume that wehave a sample of size m ≫ rn log2 n which is slightly larger than theideal size m sim rn This makes C log n
983155rnm = o(1) and forces the
recovery error to be bounded by o(1)983042X983042max Summarizing Theorem4 says that the expected average error per entry is much smaller thanthe maximal magnitude of the entries of X This is true for a sampleof almost optimal size m The smaller the rank r of the matrix X thefewer entries of X we need to see in order to do matrix completion
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 17 22
Proof of Theorem 4Step 1 The error in the operator norm Let us first bound therecovery error in the operator norm Decompose the error into twoparts using triangle inequality
983042983142X minusX983042 le 983042983142X minus pminus1Y 983042+ 983042pminus1Y minusX983042
Recall that 983142X is a best approximation to pminus1Y Then the first part ofthe error is smaller than the second part and we have
983042983142X minusX983042 le 2983042pminus1Y minusX983042 =2
p983042Y minus pX983042
The entries of the matrix Y minus pX
(Y minus pX)ij = (δij minus p)Xij
are independent and mean zero random variables
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 18 22
We have
E983042Y minus pX983042
983249 C log n middot983061Emax
i983042(Y minus pX)i9830422 + Emax
j983042(Y minus pX)j9830422
983062
All that remains is to bound the norms of the rows and columns ofY minus pX This is not difficult if we note that they can be expressed assums of independent random variables
983042(Y minus pX)i98304222 =n983131
j=1
(δij minus p)2X2ij 983249
n983131
j=1
(δij minus p)2 middot 983042X9830422max
and similarly for columns Taking expectation and noting that
E (δij minus p)2 = Var (δij) = p(1minus p)
we get
E 983042(Y minus pX)i9830422 983249983059E 983042(Y minus pX)i98304222
98306012983249 radic
pn983042X983042max
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 19 22
This is a good bound but we need something stronger Since themaximum appears inside the expectation we need a uniform boundwhich will say that all rows are bounded simultaneously with highprobability Such uniform bounds are usually proved by applyingconcentration inequalities followed by a union bound Bernsteinsinequality (145) yields (check)
P
983099983103
983101
n983131
j=1
(δij minus p)2 gt tpn
983100983104
983102 983249 exp(minusctpn) for t 983245 3
This probability can be further bounded by nminusct using the assumptionthat m = pn2 ge n log n A union bound over n rows leads to
P
983099983103
983101maxiisin[n]
n983131
j=1
(δij minus p)2 gt tpn
983100983104
983102 983249 n middot nminusct for t 983245 3
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 20 22
Integrating this tail we have
Emaxiisin[n]
n983131
j=1
(δij minus p)2 ≲ pn
And this yields the desired bound on the rows
Emaxiisin[n]
983042(Y minus pX)i9830422 ≲radicpn983042X983042max
We can do similarly for the columns Then
E983042Y minus pX983042 ≲ log nradicpn983042X983042max
Therefore we get
E983042983142X minusX983042 ≲ log n
983157n
p983042X983042max
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 21 22
Step 2 Passing to Frobenius normWe know that rank(X) le r by assumption and rank(983141X) le r by
construction so rank(983142X minusX) le 2r There is a simple relationshipbetween the operator and Frobenius norms
983042983142X minusX983042F leradic2r983042983142X minusX983042
Taking expectation of both sides we get
E983042983142X minusX983042F 983249radic2rE983042983142X minusX983042 ≲ log n
983157rn
p983042X983042max
Dividing both sides by n we can rewrite this bound as
E1
n983042983142X minusX983042F ≲ log n
983157rn
pn2983042X983042max
The proof is completed by noting the definition of the samplingprobability p = mn2
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 22 22
11 Low-dimensional distributions
Far fewer samples are needed for covariance estimation forlow-dimensional or approximately low-dimensional distributionsTo measure approximate low-dimensionality we can use the notionof the stable rank of Σ2 The stable rank of a matrix A is definedas the square of the ratio of the Frobenius to operator norms
r(A) =983042A9830422F983042A98304222
le rank(A)
The proof of Theorem 1 yields
E 983042ΣN minus Σ983042 983249 983042Σ983042983075983157
r log n
N+
r log n
N
983076
where r = r(Σ12) = tr(Σ)983042Σ983042 Therefore covariance estimationis possible with N sim r log n samples
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 8 22
2 Norms of random matrices
Let Ai denote the ith row of A we have (exercise)
maxi
983042Ai9830422 le 983042A9830422 leradicnmax
i983042Ai9830422
For random matrices with independent entries the bound can beimproved to the point where the upper and lower bounds almostmatch
Theorem 2 (Norms of random matrices without boundednessassumptions)
Let A be an ntimes n symmetric random matrix whose entries on andabove the diagonal are independent mean zero random variables Then
Emaxi
983042Ai9830422 le E983042A9830422 le C log n middot Emaxi
983042Ai9830422
where Ai denote the rows of A
Lemma 3 (Symmetrization)
Let $X_1, \dots, X_N$ be independent, mean zero random vectors in a normed space, and $\varepsilon_1, \dots, \varepsilon_N$ be independent Rademacher random variables. Then
$$\frac{1}{2}\, \mathbb{E}\Big\| \sum_{i=1}^N \varepsilon_i X_i \Big\| \le \mathbb{E}\Big\| \sum_{i=1}^N X_i \Big\| \le 2\, \mathbb{E}\Big\| \sum_{i=1}^N \varepsilon_i X_i \Big\|.$$
Proof. To prove the upper bound, let $(X'_i)$ be an independent copy of the random vectors $(X_i)$, i.e. just different random vectors with the same joint distribution as $(X_i)$ and independent from $(X_i)$. Then
$$\mathbb{E}\Big\|\sum_i X_i\Big\| = \mathbb{E}\Big\|\sum_i X_i - \mathbb{E}\Big(\sum_i X'_i\Big)\Big\| \le \mathbb{E}\Big\|\sum_i X_i - \sum_i X'_i\Big\| = \mathbb{E}\Big\|\sum_i \big(X_i - X'_i\big)\Big\|,$$

where the first equality holds since $\mathbb{E}\sum_i X'_i = 0$, and the inequality follows from Jensen's inequality applied to the expectation over $(X'_i)$.
The distribution of the random vectors $Y_i = X_i - X'_i$ is symmetric, which means that the distributions of $Y_i$ and $-Y_i$ are the same. (Why?) Thus the distributions of the random vectors $Y_i$ and $\varepsilon_i Y_i$ are also the same, for all we do is change the signs of these vectors at random and independently of the values of the vectors. Summarizing, we can replace $X_i - X'_i$ in the sum above with $\varepsilon_i(X_i - X'_i)$. Thus
$$\mathbb{E}\Big\|\sum_i X_i\Big\| \le \mathbb{E}\Big\|\sum_i \varepsilon_i (X_i - X'_i)\Big\| \le \mathbb{E}\Big\|\sum_i \varepsilon_i X_i\Big\| + \mathbb{E}\Big\|\sum_i \varepsilon_i X'_i\Big\| = 2\, \mathbb{E}\Big\|\sum_i \varepsilon_i X_i\Big\|.$$
This proves the upper bound in the symmetrization inequality. The lower bound can be proved by a similar argument. (Do this!)
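The two-sided symmetrization inequality of Lemma 3 can be checked by simulation. A sketch assuming NumPy; the asymmetric mean-zero distribution $\mathrm{Exp}(1) - 1$ is an arbitrary choice that makes the comparison non-trivial (for a symmetric distribution both sides would be identical in law).

```python
import numpy as np

rng = np.random.default_rng(2)

N, d, trials = 30, 10, 2000
norm_plain = np.empty(trials)
norm_sym = np.empty(trials)
for t in range(trials):
    # Independent mean-zero vectors with an asymmetric distribution.
    X = rng.exponential(1.0, size=(N, d)) - 1.0
    eps = rng.choice([-1.0, 1.0], size=(N, 1))  # Rademacher signs
    norm_plain[t] = np.linalg.norm(X.sum(axis=0))
    norm_sym[t] = np.linalg.norm((eps * X).sum(axis=0))

E_plain = norm_plain.mean()  # estimates E || sum_i X_i ||
E_sym = norm_sym.mean()      # estimates E || sum_i eps_i X_i ||
# Lemma 3: 0.5 * E_sym <= E_plain <= 2 * E_sym (up to Monte Carlo error).
```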
Proof of Theorem 2. The lower bound is trivial. The proof of the upper bound will be based on matrix Bernstein's inequality. We represent $A$ as a sum of independent, mean zero, symmetric random matrices $Z_{ij}$, each of which contains a pair of symmetric entries of $A$ (or one diagonal entry):

$$A = \sum_{i \le j} Z_{ij}.$$
By the symmetrization inequality (Lemma 3), for the random matrices $Z_{ij}$ we get

$$\mathbb{E}\|A\| = \mathbb{E}\Big\|\sum_{i \le j} Z_{ij}\Big\| \le 2\, \mathbb{E}\Big\|\sum_{i \le j} X_{ij}\Big\|,$$

where we set $X_{ij} = \varepsilon_{ij} Z_{ij}$, and $\varepsilon_{ij}$ are independent Rademacher random variables. Now we condition on $A$. The random variables $Z_{ij}$ become fixed values, and all randomness remains in the Rademacher random variables $\varepsilon_{ij}$.
Note that the $X_{ij}$ are (conditionally) bounded almost surely, and this is exactly what we have lacked to apply matrix Bernstein's inequality. Now we can do it. The corollary of matrix Bernstein's inequality gives

$$\mathbb{E}_\varepsilon \Big\|\sum_{i \le j} X_{ij}\Big\| \lesssim \sigma \sqrt{\log n} + K \log n,$$

where $\sigma^2 = \big\|\sum_{i \le j} \mathbb{E}_\varepsilon X_{ij}^2\big\|$ and $K = \max_{i \le j} \|X_{ij}\|$. A good exercise is to check that

$$\sigma \lesssim \max_i \|A_i\|_2 \quad \text{and} \quad K \lesssim \max_i \|A_i\|_2.$$
Then we have

$$\mathbb{E}_\varepsilon \Big\|\sum_{i \le j} X_{ij}\Big\| \lesssim \log n \cdot \max_i \|A_i\|_2.$$
Finally, we unfix $A$ by taking expectation of both sides of this inequality with respect to $A$ and using the law of total expectation.
We state Theorem 2 for symmetric matrices, but it is simple to extend it to general $m \times n$ random matrices $A$. The bound in this case becomes

$$\mathbb{E}\|A\| \le C \log(m+n) \cdot \Big( \mathbb{E}\max_i \|A_i\|_2 + \mathbb{E}\max_j \|A^j\|_2 \Big),$$

where $A_i$ and $A^j$ denote the rows and columns of $A$, respectively. To see this, apply Theorem 2 to the $(m+n) \times (m+n)$ symmetric random matrix

$$\begin{bmatrix} 0 & A \\ A^T & 0 \end{bmatrix}.$$
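The reduction to the symmetric case can be verified directly (a sketch assuming NumPy, with arbitrary dimensions): the symmetric dilation $\begin{bmatrix} 0 & A \\ A^T & 0 \end{bmatrix}$ has operator norm exactly $\|A\|$, since its eigenvalues are plus and minus the singular values of $A$.

```python
import numpy as np

rng = np.random.default_rng(3)

m, n = 40, 60
A = rng.standard_normal((m, n))

# The (m+n) x (m+n) symmetric dilation [[0, A], [A^T, 0]].
H = np.block([[np.zeros((m, m)), A], [A.T, np.zeros((n, n))]])

# Its operator norm equals ||A||, which is how Theorem 2 extends to rectangular A.
same_norm = np.isclose(np.linalg.norm(H, 2), np.linalg.norm(A, 2))
```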
3 Matrix completion
Consider a fixed, unknown $n \times n$ matrix $X$. Suppose we are shown $m$ randomly chosen entries of $X$. Can we guess all the missing entries? This important problem is called matrix completion. We will analyze it using the bounds on the norms of random matrices we just obtained.
Obviously, there is no way to guess the missing entries unless we know something extra about the matrix $X$. So let us assume that $X$ has low rank:

$$\operatorname{rank}(X) = r \ll n.$$

The number of degrees of freedom of an $n \times n$ matrix with rank $r$ is $O(rn)$. (Why?) So we may hope that $m \sim rn$ observed entries of $X$ will be enough to determine $X$ completely. But how?
Here we will analyze what is probably the simplest method for matrix completion. Take the matrix $Y$ that consists of the observed entries of $X$, while all unobserved entries are set to zero. Unlike $X$, the matrix $Y$ may not have small rank. Compute the best rank-$r$ approximation of $Y$. The result, as we will show, will be a good approximation to $X$.
But before we show this, let us define sampling of entries more rigorously. Assume each entry of $X$ is shown or hidden independently of the others, with fixed probability $p$. Which entries are shown is decided by independent Bernoulli random variables

$$\delta_{ij} \sim \mathrm{Ber}(p) \quad \text{with} \quad p = \frac{m}{n^2},$$

which are often called selectors in this context. The value of $p$ is chosen so that, among the $n^2$ entries of $X$, the expected number of selected (known) entries is $m$.

Define the $n \times n$ matrix $Y$ with entries $Y_{ij} = \delta_{ij} X_{ij}$. We can assume that we are shown $Y$, for it is a matrix that contains the observed entries of $X$ while all unobserved entries are replaced with zeros. The following result shows how to estimate $X$ based on $Y$.
Theorem 4 (Matrix completion)
Let $\widehat{X}$ be a best rank-$r$ approximation to $p^{-1} Y$. Then

$$\mathbb{E}\, \frac{1}{n} \|\widehat{X} - X\|_F \le C \log n \sqrt{\frac{rn}{m}}\, \|X\|_{\max}.$$

Here $\|X\|_{\max} = \max_{i,j} |X_{ij}|$ denotes the maximum magnitude of the entries of $X$.
Remark. This theorem controls the average error per entry in the mean-squared sense. To make the error small, let us assume that we have a sample of size $m \gg rn \log^2 n$, which is slightly larger than the ideal size $m \sim rn$. This makes $C \log n \sqrt{rn/m} = o(1)$ and forces the recovery error to be bounded by $o(1) \|X\|_{\max}$. Summarizing, Theorem 4 says that the expected average error per entry is much smaller than the maximal magnitude of the entries of $X$. This is true for a sample of almost optimal size $m$. The smaller the rank $r$ of the matrix $X$, the fewer entries of $X$ we need to see in order to do matrix completion.
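The whole procedure fits in a few lines. The following is an illustrative sketch, not the lecture's code: it assumes NumPy, and the choices $n = 200$, $r = 5$, $p = 0.3$ are arbitrary.

```python
import numpy as np

rng = np.random.default_rng(4)

n, r = 200, 5
# Unknown rank-r matrix X = U V^T.
U = rng.standard_normal((n, r))
V = rng.standard_normal((n, r))
X = U @ V.T

# Reveal each entry independently with probability p, so m = p n^2 on average:
# here ~12000 observed entries versus ~2rn = 2000 degrees of freedom.
p = 0.3
delta = rng.random((n, n)) < p
Y = np.where(delta, X, 0.0)

# Estimator: best rank-r approximation of p^{-1} Y, via truncated SVD.
Uy, s, Vty = np.linalg.svd(Y / p)
X_hat = (Uy[:, :r] * s[:r]) @ Vty[:r, :]

# Average per-entry error, compared with the maximal entry magnitude.
per_entry_err = np.linalg.norm(X_hat - X, "fro") / n
rel = per_entry_err / np.abs(X).max()
```

As Theorem 4 predicts, the average per-entry error comes out well below $\|X\|_{\max}$ even though most entries were never observed.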
Proof of Theorem 4. Step 1: the error in the operator norm. Let us first bound the recovery error in the operator norm. Decompose the error into two parts using the triangle inequality:

$$\|\widehat{X} - X\| \le \|\widehat{X} - p^{-1} Y\| + \|p^{-1} Y - X\|.$$

Recall that $\widehat{X}$ is a best rank-$r$ approximation to $p^{-1} Y$. Since $X$ itself has rank at most $r$, the first part of the error is smaller than the second part, and we have

$$\|\widehat{X} - X\| \le 2 \|p^{-1} Y - X\| = \frac{2}{p} \|Y - pX\|.$$
The entries of the matrix $Y - pX$,

$$(Y - pX)_{ij} = (\delta_{ij} - p) X_{ij},$$

are independent, mean zero random variables.
By the version of Theorem 2 for general (non-symmetric) matrices, we have

$$\mathbb{E}\|Y - pX\| \le C \log n \cdot \Big( \mathbb{E}\max_i \|(Y - pX)_i\|_2 + \mathbb{E}\max_j \|(Y - pX)^j\|_2 \Big).$$
All that remains is to bound the norms of the rows and columns of $Y - pX$. This is not difficult if we note that they can be expressed as sums of independent random variables:

$$\|(Y - pX)_i\|_2^2 = \sum_{j=1}^n (\delta_{ij} - p)^2 X_{ij}^2 \le \sum_{j=1}^n (\delta_{ij} - p)^2 \cdot \|X\|_{\max}^2,$$
and similarly for columns. Taking expectation and noting that

$$\mathbb{E}(\delta_{ij} - p)^2 = \operatorname{Var}(\delta_{ij}) = p(1 - p),$$

we get

$$\mathbb{E}\|(Y - pX)_i\|_2 \le \Big( \mathbb{E}\|(Y - pX)_i\|_2^2 \Big)^{1/2} \le \sqrt{pn}\, \|X\|_{\max}.$$
This is a good bound, but we need something stronger. Since the maximum appears inside the expectation, we need a uniform bound, which will say that all rows are bounded simultaneously with high probability. Such uniform bounds are usually proved by applying concentration inequalities followed by a union bound. Bernstein's inequality (14.5) yields (check):

$$\mathbb{P}\Big\{ \sum_{j=1}^n (\delta_{ij} - p)^2 > tpn \Big\} \le \exp(-ctpn) \quad \text{for } t \ge 3.$$
This probability can be further bounded by $n^{-ct}$ using the assumption that $m = pn^2 \ge n \log n$. A union bound over the $n$ rows leads to

$$\mathbb{P}\Big\{ \max_{i \in [n]} \sum_{j=1}^n (\delta_{ij} - p)^2 > tpn \Big\} \le n \cdot n^{-ct} \quad \text{for } t \ge 3.$$
Integrating this tail, we have

$$\mathbb{E} \max_{i \in [n]} \sum_{j=1}^n (\delta_{ij} - p)^2 \lesssim pn.$$
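The uniform bound on the row sums is easy to see numerically. A sketch assuming NumPy, with arbitrary $n$ and $p$:

```python
import numpy as np

rng = np.random.default_rng(5)

n, p = 500, 0.05
delta = (rng.random((n, n)) < p).astype(float)  # Bernoulli(p) selectors

# Row sums sum_j (delta_ij - p)^2; each has mean n p (1 - p) <= p n.
row_sums = ((delta - p) ** 2).sum(axis=1)

# Even the worst of the n rows stays within a small constant factor of p n,
# exactly as the tail bound and union bound predict.
ratio = row_sums.max() / (p * n)
```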
And this yields the desired bound on the rows:

$$\mathbb{E} \max_{i \in [n]} \|(Y - pX)_i\|_2 \lesssim \sqrt{pn}\, \|X\|_{\max}.$$

We can do similarly for the columns. Then

$$\mathbb{E}\|Y - pX\| \lesssim \log n \sqrt{pn}\, \|X\|_{\max}.$$
Therefore we get

$$\mathbb{E}\|\widehat{X} - X\| \lesssim \log n \sqrt{\frac{n}{p}}\, \|X\|_{\max}.$$
Step 2: passing to the Frobenius norm. We know that $\operatorname{rank}(X) \le r$ by assumption and $\operatorname{rank}(\widehat{X}) \le r$ by construction, so $\operatorname{rank}(\widehat{X} - X) \le 2r$. There is a simple relationship between the operator and Frobenius norms:

$$\|\widehat{X} - X\|_F \le \sqrt{2r}\, \|\widehat{X} - X\|.$$
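This norm comparison, $\|M\|_F \le \sqrt{\operatorname{rank}(M)}\, \|M\|$, holds because $\|M\|_F^2$ is the sum of at most $\operatorname{rank}(M)$ squared singular values, each at most $\|M\|^2$. A one-line check (sketch, assumes NumPy, arbitrary sizes):

```python
import numpy as np

rng = np.random.default_rng(6)

n, k = 100, 8
# A matrix of rank at most k, as a product of thin factors.
M = rng.standard_normal((n, k)) @ rng.standard_normal((k, n))

fro = np.linalg.norm(M, "fro")  # ||M||_F
op = np.linalg.norm(M, 2)       # ||M||
holds = fro <= np.sqrt(k) * op + 1e-8
```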
Taking expectation of both sides, we get

$$\mathbb{E}\|\widehat{X} - X\|_F \le \sqrt{2r}\, \mathbb{E}\|\widehat{X} - X\| \lesssim \log n \sqrt{\frac{rn}{p}}\, \|X\|_{\max}.$$
Dividing both sides by $n$, we can rewrite this bound as

$$\mathbb{E}\, \frac{1}{n} \|\widehat{X} - X\|_F \lesssim \log n \sqrt{\frac{rn}{pn^2}}\, \|X\|_{\max}.$$

The proof is completed by noting the definition of the sampling probability, $p = m/n^2$.
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 22 22
2 Norms of random matrices
Let Ai denote the ith row of A we have (exercise)
maxi
983042Ai9830422 le 983042A9830422 leradicnmax
i983042Ai9830422
For random matrices with independent entries the bound can beimproved to the point where the upper and lower bounds almostmatch
Theorem 2 (Norms of random matrices without boundednessassumptions)
Let A be an ntimes n symmetric random matrix whose entries on andabove the diagonal are independent mean zero random variables Then
Emaxi
983042Ai9830422 le E983042A9830422 le C log n middot Emaxi
983042Ai9830422
where Ai denote the rows of A
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 9 22
Lemma 3 (Symmetrization)
Let X1 XN be independent mean zero random vectors in a normedspace and ε1 εN be independent Rademacher random variablesThen
1
2E
983056983056983056983056983056
N983131
i=1
εiXi
983056983056983056983056983056 983249 E
983056983056983056983056983056
N983131
i=1
Xi
983056983056983056983056983056 983249 2E
983056983056983056983056983056
N983131
i=1
εiXi
983056983056983056983056983056
Proof To prove the upper bound let (X primei) be an independent copy of
the random vectors (Xi) ie just different random vectors with thesame joint distribution as (Xi) and independent from (Xi) Then
E
983056983056983056983056983056983131
i
Xi
983056983056983056983056983056 = E
983056983056983056983056983056983131
i
Xi minus E
983075983131
i
X primei
983076983056983056983056983056983056
983249 E
983056983056983056983056983056983131
i
Xi minus983131
i
X primei
983056983056983056983056983056 = E
983056983056983056983056983056983131
i
983043Xi minusX prime
i
983044983056983056983056983056983056
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 10 22
The distribution of the random vectors Yi = Xi minusX primei is symmetric
which means that the distributions of Yi and minusYi are the same(Why) Thus the distribution of the random vectors Yi and εiYi is alsothe same for all we do is change the signs of these vectors at randomand independently of the values of the vectors Summarizing we canreplace Xi minusX prime
i in the sum above with εi(Xi minusX primei) Thus
E
983056983056983056983056983056983131
i
Xi
983056983056983056983056983056 le E
983056983056983056983056983056983131
i
εi(Xi minusX primei)
983056983056983056983056983056
le E
983056983056983056983056983056983131
i
εiXi
983056983056983056983056983056+ E
983056983056983056983056983056983131
i
εiXprimei
983056983056983056983056983056
= 2E
983056983056983056983056983056983131
i
εiXi
983056983056983056983056983056
This proves the upper bound in the symmetrization inequality Thelower bound can be proved by a similar argument (Do this)
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 11 22
Proof of Theorem 2 The lower bound is trivial The proof of the upperbound will be based on matrix Bernsteinrsquos inequalityWe represent A as a sum of independent mean zero symmetricrandom matrices Zij each of which contains a pair of symmetric entriesof A (or one diagonal entry)
A =983131
ilejZij
By the symmetrization inequality (Lemma 3) for the random matricesZij we get
E983042A983042 = E983056983056983056983131
i983249jZij
983056983056983056 983249 2E983056983056983056983131
i983249jXij
983056983056983056
where we set Xij = εijZij and εij are independent Rademacherrandom variables Now we condition on A The random variables Zij
become fixed values and all randomness remains in the Rademacherrandom variables εij
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 12 22
Note that Xij are (conditionally) bounded almost surely and this isexactly what we have lacked to apply matrix Bernsteinrsquos inequalityNow we can do it The corollary of matrix Bernsteinrsquos inequality gives
Eε
983056983056983056983131
ilejXij
983056983056983056 ≲ σ983155
log n+K log n
where σ2 = 983042983123
ilej EεX2ij983042 and K = maxilej 983042Xij983042 A good exercise is
to check that
σ ≲ maxi
983042Ai9830422 and K ≲ maxi
983042Ai9830422
Then we have
Eε
983056983056983056983131
i983249jXij
983056983056983056 ≲ log n middotmaxi
983042Ai9830422
Finally we unfix A by taking expectation of both sides of thisinequality with respect to A and using the law of total expectation
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 13 22
We state Theorem 2 for symmetric matrices but it is simple toextend it to general mtimes n random matrices A The bound in thiscase becomes
E983042A9830422 le C log(m+ n) middot (Emaxi
983042Ai9830422 + Emaxj
983042Aj9830422)
To see this apply Theorem 2 to the (m+ n)times (m+ n) symmetricrandom matrix 983063
0 AAT 0
983064
3 Matrix completion
Consider a fixed unknown ntimes n matrix X Suppose we are shownm randomly chosen entries of X Can we guess all the missingentries This important problem is called matrix completion Wewill analyze it using the bounds on the norms on random matriceswe just obtained
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 14 22
Obviously there is no way to guess the missing entries unless weknow something extra about the matrix X So let us assume thatX has low rank
rank(X) = r ≪ n
The number of degrees of freedom of an ntimes n matrix with rank ris O(rn) (Why) So we may hope that m sim rn observed entriesof X will be enough to determine X completely But how
Here we will analyze what is probably the simplest method formatrix completion Take the matrix Y that consists of theobserved entries of X while all unobserved entries are set to zeroUnlike X the matrix Y may not have small rank Compute thebest rank r approximation of Y The result as we will show willbe a good approximation to X
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 15 22
But before we show this let us define sampling of entries morerigorously Assume each entry of X is shown or hiddenindependently of others with fixed probability p Which entries areshown is decided by independent Bernoulli random variables
δij sim Ber(p) with p =m
n2
which are often called selectors in this context The value of p ischosen so that among n2 entries of X the expected number ofselected (known) entries is m
Define the ntimes n matrix Y with entries Yij = δijXij We canassume that we are shown Y for it is a matrix that contains theobserved entries of X while all unobserved entries are replacedwith zeros The following result shows how to estimate X basedon Y
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 16 22
Theorem 4 (Matrix completion)
Let 983142X be a best rank r approximation to pminus1Y Then
E1
n983042983142X minusX983042F 983249 C logn
983157rn
m983042X983042max
Here 983042X983042max = maxij |Xij | denotes the maximum magnitude of theentries of X
Remark This theorem controls the average error per entry in themean-squared sense To make the error small let us assume that wehave a sample of size m ≫ rn log2 n which is slightly larger than theideal size m sim rn This makes C log n
983155rnm = o(1) and forces the
recovery error to be bounded by o(1)983042X983042max Summarizing Theorem4 says that the expected average error per entry is much smaller thanthe maximal magnitude of the entries of X This is true for a sampleof almost optimal size m The smaller the rank r of the matrix X thefewer entries of X we need to see in order to do matrix completion
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 17 22
Proof of Theorem 4Step 1 The error in the operator norm Let us first bound therecovery error in the operator norm Decompose the error into twoparts using triangle inequality
983042983142X minusX983042 le 983042983142X minus pminus1Y 983042+ 983042pminus1Y minusX983042
Recall that 983142X is a best approximation to pminus1Y Then the first part ofthe error is smaller than the second part and we have
983042983142X minusX983042 le 2983042pminus1Y minusX983042 =2
p983042Y minus pX983042
The entries of the matrix Y minus pX
(Y minus pX)ij = (δij minus p)Xij
are independent and mean zero random variables
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 18 22
We have
E983042Y minus pX983042
983249 C log n middot983061Emax
i983042(Y minus pX)i9830422 + Emax
j983042(Y minus pX)j9830422
983062
All that remains is to bound the norms of the rows and columns ofY minus pX This is not difficult if we note that they can be expressed assums of independent random variables
983042(Y minus pX)i98304222 =n983131
j=1
(δij minus p)2X2ij 983249
n983131
j=1
(δij minus p)2 middot 983042X9830422max
and similarly for columns Taking expectation and noting that
E (δij minus p)2 = Var (δij) = p(1minus p)
we get
E 983042(Y minus pX)i9830422 983249983059E 983042(Y minus pX)i98304222
98306012983249 radic
pn983042X983042max
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 19 22
This is a good bound but we need something stronger Since themaximum appears inside the expectation we need a uniform boundwhich will say that all rows are bounded simultaneously with highprobability Such uniform bounds are usually proved by applyingconcentration inequalities followed by a union bound Bernsteinsinequality (145) yields (check)
P
983099983103
983101
n983131
j=1
(δij minus p)2 gt tpn
983100983104
983102 983249 exp(minusctpn) for t 983245 3
This probability can be further bounded by nminusct using the assumptionthat m = pn2 ge n log n A union bound over n rows leads to
P
983099983103
983101maxiisin[n]
n983131
j=1
(δij minus p)2 gt tpn
983100983104
983102 983249 n middot nminusct for t 983245 3
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 20 22
Integrating this tail we have
Emaxiisin[n]
n983131
j=1
(δij minus p)2 ≲ pn
And this yields the desired bound on the rows
Emaxiisin[n]
983042(Y minus pX)i9830422 ≲radicpn983042X983042max
We can do similarly for the columns Then
E983042Y minus pX983042 ≲ log nradicpn983042X983042max
Therefore we get
E983042983142X minusX983042 ≲ log n
983157n
p983042X983042max
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 21 22
Step 2 Passing to Frobenius normWe know that rank(X) le r by assumption and rank(983141X) le r by
construction so rank(983142X minusX) le 2r There is a simple relationshipbetween the operator and Frobenius norms
983042983142X minusX983042F leradic2r983042983142X minusX983042
Taking expectation of both sides we get
E983042983142X minusX983042F 983249radic2rE983042983142X minusX983042 ≲ log n
983157rn
p983042X983042max
Dividing both sides by n we can rewrite this bound as
E1
n983042983142X minusX983042F ≲ log n
983157rn
pn2983042X983042max
The proof is completed by noting the definition of the samplingprobability p = mn2
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 22 22
Lemma 3 (Symmetrization)
Let X1 XN be independent mean zero random vectors in a normedspace and ε1 εN be independent Rademacher random variablesThen
1
2E
983056983056983056983056983056
N983131
i=1
εiXi
983056983056983056983056983056 983249 E
983056983056983056983056983056
N983131
i=1
Xi
983056983056983056983056983056 983249 2E
983056983056983056983056983056
N983131
i=1
εiXi
983056983056983056983056983056
Proof To prove the upper bound let (X primei) be an independent copy of
the random vectors (Xi) ie just different random vectors with thesame joint distribution as (Xi) and independent from (Xi) Then
E
983056983056983056983056983056983131
i
Xi
983056983056983056983056983056 = E
983056983056983056983056983056983131
i
Xi minus E
983075983131
i
X primei
983076983056983056983056983056983056
983249 E
983056983056983056983056983056983131
i
Xi minus983131
i
X primei
983056983056983056983056983056 = E
983056983056983056983056983056983131
i
983043Xi minusX prime
i
983044983056983056983056983056983056
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 10 22
The distribution of the random vectors Yi = Xi minusX primei is symmetric
which means that the distributions of Yi and minusYi are the same(Why) Thus the distribution of the random vectors Yi and εiYi is alsothe same for all we do is change the signs of these vectors at randomand independently of the values of the vectors Summarizing we canreplace Xi minusX prime
i in the sum above with εi(Xi minusX primei) Thus
E
983056983056983056983056983056983131
i
Xi
983056983056983056983056983056 le E
983056983056983056983056983056983131
i
εi(Xi minusX primei)
983056983056983056983056983056
le E
983056983056983056983056983056983131
i
εiXi
983056983056983056983056983056+ E
983056983056983056983056983056983131
i
εiXprimei
983056983056983056983056983056
= 2E
983056983056983056983056983056983131
i
εiXi
983056983056983056983056983056
This proves the upper bound in the symmetrization inequality Thelower bound can be proved by a similar argument (Do this)
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 11 22
Proof of Theorem 2 The lower bound is trivial The proof of the upperbound will be based on matrix Bernsteinrsquos inequalityWe represent A as a sum of independent mean zero symmetricrandom matrices Zij each of which contains a pair of symmetric entriesof A (or one diagonal entry)
A =983131
ilejZij
By the symmetrization inequality (Lemma 3) for the random matricesZij we get
E983042A983042 = E983056983056983056983131
i983249jZij
983056983056983056 983249 2E983056983056983056983131
i983249jXij
983056983056983056
where we set Xij = εijZij and εij are independent Rademacherrandom variables Now we condition on A The random variables Zij
become fixed values and all randomness remains in the Rademacherrandom variables εij
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 12 22
Note that Xij are (conditionally) bounded almost surely and this isexactly what we have lacked to apply matrix Bernsteinrsquos inequalityNow we can do it The corollary of matrix Bernsteinrsquos inequality gives
Eε
983056983056983056983131
ilejXij
983056983056983056 ≲ σ983155
log n+K log n
where σ2 = 983042983123
ilej EεX2ij983042 and K = maxilej 983042Xij983042 A good exercise is
to check that
σ ≲ maxi
983042Ai9830422 and K ≲ maxi
983042Ai9830422
Then we have
Eε
983056983056983056983131
i983249jXij
983056983056983056 ≲ log n middotmaxi
983042Ai9830422
Finally we unfix A by taking expectation of both sides of thisinequality with respect to A and using the law of total expectation
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 13 22
We state Theorem 2 for symmetric matrices but it is simple toextend it to general mtimes n random matrices A The bound in thiscase becomes
E983042A9830422 le C log(m+ n) middot (Emaxi
983042Ai9830422 + Emaxj
983042Aj9830422)
To see this apply Theorem 2 to the (m+ n)times (m+ n) symmetricrandom matrix 983063
0 AAT 0
983064
3 Matrix completion
Consider a fixed unknown ntimes n matrix X Suppose we are shownm randomly chosen entries of X Can we guess all the missingentries This important problem is called matrix completion Wewill analyze it using the bounds on the norms on random matriceswe just obtained
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 14 22
Obviously there is no way to guess the missing entries unless weknow something extra about the matrix X So let us assume thatX has low rank
rank(X) = r ≪ n
The number of degrees of freedom of an ntimes n matrix with rank ris O(rn) (Why) So we may hope that m sim rn observed entriesof X will be enough to determine X completely But how
Here we will analyze what is probably the simplest method formatrix completion Take the matrix Y that consists of theobserved entries of X while all unobserved entries are set to zeroUnlike X the matrix Y may not have small rank Compute thebest rank r approximation of Y The result as we will show willbe a good approximation to X
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 15 22
But before we show this let us define sampling of entries morerigorously Assume each entry of X is shown or hiddenindependently of others with fixed probability p Which entries areshown is decided by independent Bernoulli random variables
δij sim Ber(p) with p =m
n2
which are often called selectors in this context The value of p ischosen so that among n2 entries of X the expected number ofselected (known) entries is m
Define the ntimes n matrix Y with entries Yij = δijXij We canassume that we are shown Y for it is a matrix that contains theobserved entries of X while all unobserved entries are replacedwith zeros The following result shows how to estimate X basedon Y
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 16 22
Theorem 4 (Matrix completion)
Let 983142X be a best rank r approximation to pminus1Y Then
E1
n983042983142X minusX983042F 983249 C logn
983157rn
m983042X983042max
Here 983042X983042max = maxij |Xij | denotes the maximum magnitude of theentries of X
Remark This theorem controls the average error per entry in themean-squared sense To make the error small let us assume that wehave a sample of size m ≫ rn log2 n which is slightly larger than theideal size m sim rn This makes C log n
983155rnm = o(1) and forces the
recovery error to be bounded by o(1)983042X983042max Summarizing Theorem4 says that the expected average error per entry is much smaller thanthe maximal magnitude of the entries of X This is true for a sampleof almost optimal size m The smaller the rank r of the matrix X thefewer entries of X we need to see in order to do matrix completion
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 17 22
Proof of Theorem 4Step 1 The error in the operator norm Let us first bound therecovery error in the operator norm Decompose the error into twoparts using triangle inequality
983042983142X minusX983042 le 983042983142X minus pminus1Y 983042+ 983042pminus1Y minusX983042
Recall that 983142X is a best approximation to pminus1Y Then the first part ofthe error is smaller than the second part and we have
983042983142X minusX983042 le 2983042pminus1Y minusX983042 =2
p983042Y minus pX983042
The entries of the matrix Y minus pX
(Y minus pX)ij = (δij minus p)Xij
are independent and mean zero random variables
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 18 22
We have
E983042Y minus pX983042
983249 C log n middot983061Emax
i983042(Y minus pX)i9830422 + Emax
j983042(Y minus pX)j9830422
983062
All that remains is to bound the norms of the rows and columns ofY minus pX This is not difficult if we note that they can be expressed assums of independent random variables
983042(Y minus pX)i98304222 =n983131
j=1
(δij minus p)2X2ij 983249
n983131
j=1
(δij minus p)2 middot 983042X9830422max
and similarly for columns Taking expectation and noting that
E (δij minus p)2 = Var (δij) = p(1minus p)
we get
E 983042(Y minus pX)i9830422 983249983059E 983042(Y minus pX)i98304222
98306012983249 radic
pn983042X983042max
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 19 22
This is a good bound but we need something stronger Since themaximum appears inside the expectation we need a uniform boundwhich will say that all rows are bounded simultaneously with highprobability Such uniform bounds are usually proved by applyingconcentration inequalities followed by a union bound Bernsteinsinequality (145) yields (check)
P
983099983103
983101
n983131
j=1
(δij minus p)2 gt tpn
983100983104
983102 983249 exp(minusctpn) for t 983245 3
This probability can be further bounded by nminusct using the assumptionthat m = pn2 ge n log n A union bound over n rows leads to
P
983099983103
983101maxiisin[n]
n983131
j=1
(δij minus p)2 gt tpn
983100983104
983102 983249 n middot nminusct for t 983245 3
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 20 22
Integrating this tail we have
Emaxiisin[n]
n983131
j=1
(δij minus p)2 ≲ pn
And this yields the desired bound on the rows
Emaxiisin[n]
983042(Y minus pX)i9830422 ≲radicpn983042X983042max
We can do similarly for the columns Then
E983042Y minus pX983042 ≲ log nradicpn983042X983042max
Therefore we get
E983042983142X minusX983042 ≲ log n
983157n
p983042X983042max
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 21 22
Step 2 Passing to Frobenius normWe know that rank(X) le r by assumption and rank(983141X) le r by
construction so rank(983142X minusX) le 2r There is a simple relationshipbetween the operator and Frobenius norms
983042983142X minusX983042F leradic2r983042983142X minusX983042
Taking expectation of both sides we get
E983042983142X minusX983042F 983249radic2rE983042983142X minusX983042 ≲ log n
983157rn
p983042X983042max
Dividing both sides by n we can rewrite this bound as
E1
n983042983142X minusX983042F ≲ log n
983157rn
pn2983042X983042max
The proof is completed by noting the definition of the samplingprobability p = mn2
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 22 22
The distribution of the random vectors Yi = Xi minusX primei is symmetric
which means that the distributions of Yi and minusYi are the same(Why) Thus the distribution of the random vectors Yi and εiYi is alsothe same for all we do is change the signs of these vectors at randomand independently of the values of the vectors Summarizing we canreplace Xi minusX prime
i in the sum above with εi(Xi minusX primei) Thus
E
983056983056983056983056983056983131
i
Xi
983056983056983056983056983056 le E
983056983056983056983056983056983131
i
εi(Xi minusX primei)
983056983056983056983056983056
le E
983056983056983056983056983056983131
i
εiXi
983056983056983056983056983056+ E
983056983056983056983056983056983131
i
εiXprimei
983056983056983056983056983056
= 2E
983056983056983056983056983056983131
i
εiXi
983056983056983056983056983056
This proves the upper bound in the symmetrization inequality Thelower bound can be proved by a similar argument (Do this)
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 11 22
Proof of Theorem 2. The lower bound is trivial. The proof of the upper bound will be based on matrix Bernstein's inequality.

We represent A as a sum of independent, mean zero, symmetric random matrices Z_ij, each of which contains a pair of symmetric entries of A (or one diagonal entry):

A = ∑_{i≤j} Z_ij.

By the symmetrization inequality (Lemma 3) for the random matrices Z_ij, we get

E ‖A‖ = E ‖∑_{i≤j} Z_ij‖ ≤ 2 E ‖∑_{i≤j} X_ij‖,

where we set X_ij = ε_ij Z_ij and the ε_ij are independent Rademacher random variables. Now we condition on A. The random variables Z_ij become fixed values, and all randomness remains in the Rademacher random variables ε_ij.
Note that the X_ij are (conditionally) bounded almost surely, and this is exactly what we lacked in order to apply matrix Bernstein's inequality. Now we can do it. The corollary of matrix Bernstein's inequality gives

E_ε ‖∑_{i≤j} X_ij‖ ≲ σ √(log n) + K log n,

where σ² = ‖∑_{i≤j} E_ε X_ij²‖ and K = max_{i≤j} ‖X_ij‖. A good exercise is to check that

σ ≲ max_i ‖A_i‖₂ and K ≲ max_i ‖A_i‖₂.

Then we have

E_ε ‖∑_{i≤j} X_ij‖ ≲ log n · max_i ‖A_i‖₂.

Finally, we unfix A by taking expectation of both sides of this inequality with respect to A and using the law of total expectation.
We state Theorem 2 for symmetric matrices, but it is simple to extend it to general m×n random matrices A. The bound in this case becomes

E ‖A‖ ≤ C log(m+n) · (E max_i ‖A_i‖₂ + E max_j ‖A^j‖₂),

where A_i and A^j denote the rows and columns of A. To see this, apply Theorem 2 to the (m+n)×(m+n) symmetric random matrix

[ 0    A ]
[ Aᵀ   0 ].
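The reason this dilation trick works is that the eigenvalues of the symmetric dilation are ± the singular values of A, so its spectral norm equals ‖A‖. A minimal numerical check in Python (the dimensions and seed are arbitrary illustrative choices):

```python
import numpy as np

rng = np.random.default_rng(0)
m, n = 5, 7
A = rng.standard_normal((m, n))

# Symmetric (m+n) x (m+n) dilation of A
D = np.block([[np.zeros((m, m)), A],
              [A.T, np.zeros((n, n))]])

# Spectral norms agree: eigenvalues of D are +/- the singular values of A
assert np.isclose(np.linalg.norm(D, 2), np.linalg.norm(A, 2))
```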
3 Matrix completion
Consider a fixed, unknown n×n matrix X. Suppose we are shown m randomly chosen entries of X. Can we guess all the missing entries? This important problem is called matrix completion. We will analyze it using the bounds on the norms of random matrices we just obtained.
Obviously, there is no way to guess the missing entries unless we know something extra about the matrix X. So let us assume that X has low rank:

rank(X) = r ≪ n.

The number of degrees of freedom of an n×n matrix with rank r is O(rn). (Why?) So we may hope that m ∼ rn observed entries of X will be enough to determine X completely. But how?

Here we will analyze what is probably the simplest method for matrix completion. Take the matrix Y that consists of the observed entries of X, while all unobserved entries are set to zero. Unlike X, the matrix Y may not have small rank. Compute the best rank-r approximation of Y. The result, as we will show, will be a good approximation to X.
But before we show this, let us define sampling of entries more rigorously. Assume each entry of X is shown or hidden independently of the others with fixed probability p. Which entries are shown is decided by independent Bernoulli random variables

δ_ij ∼ Ber(p) with p = m/n²,

which are often called selectors in this context. The value of p is chosen so that, among the n² entries of X, the expected number of selected (known) entries is m.

Define the n×n matrix Y with entries Y_ij = δ_ij X_ij. We can assume that we are shown Y, for it is a matrix that contains the observed entries of X while all unobserved entries are replaced with zeros. The following result shows how to estimate X based on Y.
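The whole procedure is short enough to sketch in code. The following Python simulation is illustrative only (the rank-r matrix X, the sample size m, and the seed are assumptions made for the demo): it draws Bernoulli selectors, forms Y, and computes the best rank-r approximation of p⁻¹Y by truncated SVD.

```python
import numpy as np

rng = np.random.default_rng(1)
n, r = 200, 3
m = 20 * r * n                      # sample size, somewhat above the ideal m ~ rn
p = m / n**2                        # selector probability p = m / n^2

# A rank-r matrix X to be recovered (illustrative choice)
X = rng.standard_normal((n, r)) @ rng.standard_normal((r, n))

# Bernoulli selectors: each entry of X is shown independently with probability p
delta = rng.random((n, n)) < p
Y = delta * X                       # observed entries; unobserved entries set to zero

# Estimator: best rank-r approximation of p^{-1} Y, via truncated SVD
U, s, Vt = np.linalg.svd(Y / p, full_matrices=False)
X_hat = (U[:, :r] * s[:r]) @ Vt[:r, :]

# Average per-entry error (Frobenius norm over n); the rank-r truncation
# should beat the raw rescaled observations p^{-1} Y
err_hat = np.linalg.norm(X_hat - X, "fro") / n
err_raw = np.linalg.norm(Y / p - X, "fro") / n
assert err_hat < err_raw
```

The truncation step is what removes most of the noise: p⁻¹Y − X has full rank, while keeping only r singular directions retains X and discards the bulk of the perturbation's Frobenius mass.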
Theorem 4 (Matrix completion)
Let X̂ be a best rank-r approximation to p⁻¹Y. Then

E (1/n) ‖X̂ − X‖_F ≤ C log n · √(rn/m) · ‖X‖_max.

Here ‖X‖_max = max_{i,j} |X_ij| denotes the maximum magnitude of the entries of X.

Remark. This theorem controls the average error per entry in the mean-squared sense. To make the error small, let us assume that we have a sample of size m ≫ rn log² n, which is slightly larger than the ideal size m ∼ rn. This makes C log n · √(rn/m) = o(1) and forces the recovery error to be bounded by o(1) · ‖X‖_max. Summarizing, Theorem 4 says that the expected average error per entry is much smaller than the maximal magnitude of the entries of X. This is true for a sample of almost optimal size m. The smaller the rank r of the matrix X, the fewer entries of X we need to see in order to do matrix completion.
Proof of Theorem 4.
Step 1: the error in the operator norm. Let us first bound the recovery error in the operator norm. Decompose the error into two parts using the triangle inequality:

‖X̂ − X‖ ≤ ‖X̂ − p⁻¹Y‖ + ‖p⁻¹Y − X‖.

Recall that X̂ is a best rank-r approximation to p⁻¹Y, while X itself has rank at most r. Then the first part of the error is smaller than the second part, and we have

‖X̂ − X‖ ≤ 2 ‖p⁻¹Y − X‖ = (2/p) ‖Y − pX‖.

The entries of the matrix Y − pX,

(Y − pX)_ij = (δ_ij − p) X_ij,

are independent, mean zero random variables.
We have

E ‖Y − pX‖ ≤ C log n · (E max_i ‖(Y − pX)_i‖₂ + E max_j ‖(Y − pX)^j‖₂).

All that remains is to bound the norms of the rows and columns of Y − pX. This is not difficult if we note that they can be expressed as sums of independent random variables:

‖(Y − pX)_i‖₂² = ∑_{j=1}^n (δ_ij − p)² X_ij² ≤ ∑_{j=1}^n (δ_ij − p)² · ‖X‖²_max,

and similarly for columns. Taking expectation and noting that

E (δ_ij − p)² = Var(δ_ij) = p(1 − p),

we get

E ‖(Y − pX)_i‖₂ ≤ (E ‖(Y − pX)_i‖₂²)^{1/2} ≤ √(pn) · ‖X‖_max.
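The variance computation behind this row bound, E ∑_j (δ_ij − p)² = n p(1 − p), is easy to confirm by simulation. A minimal Python sketch (the parameters n, p, and the number of trials are illustrative assumptions):

```python
import numpy as np

rng = np.random.default_rng(2)
n, p, trials = 1000, 0.05, 2000

# Each trial draws one row of selectors delta_ij ~ Ber(p) and computes
# sum_j (delta_ij - p)^2, whose mean should be n * p * (1 - p)
delta = rng.random((trials, n)) < p
row_sums = ((delta - p) ** 2).sum(axis=1)

assert np.isclose(row_sums.mean(), n * p * (1 - p), rtol=0.05)
```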
This is a good bound, but we need something stronger. Since the maximum appears inside the expectation, we need a uniform bound, which will say that all rows are bounded simultaneously with high probability. Such uniform bounds are usually proved by applying concentration inequalities followed by a union bound. Bernstein's inequality (14.5) yields (check!)

P{ ∑_{j=1}^n (δ_ij − p)² > tpn } ≤ exp(−ctpn) for t ≥ 3.

This probability can be further bounded by n^{−ct} using the assumption that m = pn² ≥ n log n. A union bound over the n rows leads to

P{ max_{i∈[n]} ∑_{j=1}^n (δ_ij − p)² > tpn } ≤ n · n^{−ct} for t ≥ 3.
Integrating this tail, we have

E max_{i∈[n]} ∑_{j=1}^n (δ_ij − p)² ≲ pn.

And this yields the desired bound on the rows:

E max_{i∈[n]} ‖(Y − pX)_i‖₂ ≲ √(pn) · ‖X‖_max.

We can do similarly for the columns. Then

E ‖Y − pX‖ ≲ log n · √(pn) · ‖X‖_max.

Therefore, we get

E ‖X̂ − X‖ ≲ log n · √(n/p) · ‖X‖_max.
Step 2: passing to the Frobenius norm. We know that rank(X) ≤ r by assumption and rank(X̂) ≤ r by construction, so rank(X̂ − X) ≤ 2r. There is a simple relationship between the operator and Frobenius norms:

‖X̂ − X‖_F ≤ √(2r) · ‖X̂ − X‖.

Taking expectation of both sides, we get

E ‖X̂ − X‖_F ≤ √(2r) · E ‖X̂ − X‖ ≲ log n · √(rn/p) · ‖X‖_max.

Dividing both sides by n, we can rewrite this bound as

E (1/n) ‖X̂ − X‖_F ≲ log n · √(rn/(pn²)) · ‖X‖_max.

The proof is completed by noting the definition of the sampling probability, p = m/n².
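The norm comparison used in this step follows from ‖M‖_F² = ∑_i σ_i² ≤ rank(M) · σ_max². A quick numerical check in Python (the dimensions, rank, and seed are illustrative assumptions):

```python
import numpy as np

rng = np.random.default_rng(3)
n, r = 100, 4

# A rank-2r matrix, standing in for the difference X_hat - X
M = rng.standard_normal((n, 2 * r)) @ rng.standard_normal((2 * r, n))

fro = np.linalg.norm(M, "fro")
op = np.linalg.norm(M, 2)

# ||M||_F <= sqrt(rank(M)) * ||M|| = sqrt(2r) * ||M||
assert fro <= np.sqrt(2 * r) * op
```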
Proof of Theorem 2 The lower bound is trivial The proof of the upperbound will be based on matrix Bernsteinrsquos inequalityWe represent A as a sum of independent mean zero symmetricrandom matrices Zij each of which contains a pair of symmetric entriesof A (or one diagonal entry)
A =983131
ilejZij
By the symmetrization inequality (Lemma 3) for the random matricesZij we get
E983042A983042 = E983056983056983056983131
i983249jZij
983056983056983056 983249 2E983056983056983056983131
i983249jXij
983056983056983056
where we set Xij = εijZij and εij are independent Rademacherrandom variables Now we condition on A The random variables Zij
become fixed values and all randomness remains in the Rademacherrandom variables εij
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 12 22
Note that Xij are (conditionally) bounded almost surely and this isexactly what we have lacked to apply matrix Bernsteinrsquos inequalityNow we can do it The corollary of matrix Bernsteinrsquos inequality gives
Eε
983056983056983056983131
ilejXij
983056983056983056 ≲ σ983155
log n+K log n
where σ2 = 983042983123
ilej EεX2ij983042 and K = maxilej 983042Xij983042 A good exercise is
to check that
σ ≲ maxi
983042Ai9830422 and K ≲ maxi
983042Ai9830422
Then we have
Eε
983056983056983056983131
i983249jXij
983056983056983056 ≲ log n middotmaxi
983042Ai9830422
Finally we unfix A by taking expectation of both sides of thisinequality with respect to A and using the law of total expectation
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 13 22
We state Theorem 2 for symmetric matrices but it is simple toextend it to general mtimes n random matrices A The bound in thiscase becomes
E983042A9830422 le C log(m+ n) middot (Emaxi
983042Ai9830422 + Emaxj
983042Aj9830422)
To see this apply Theorem 2 to the (m+ n)times (m+ n) symmetricrandom matrix 983063
0 AAT 0
983064
3 Matrix completion
Consider a fixed unknown ntimes n matrix X Suppose we are shownm randomly chosen entries of X Can we guess all the missingentries This important problem is called matrix completion Wewill analyze it using the bounds on the norms on random matriceswe just obtained
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 14 22
Obviously there is no way to guess the missing entries unless weknow something extra about the matrix X So let us assume thatX has low rank
rank(X) = r ≪ n
The number of degrees of freedom of an ntimes n matrix with rank ris O(rn) (Why) So we may hope that m sim rn observed entriesof X will be enough to determine X completely But how
Here we will analyze what is probably the simplest method formatrix completion Take the matrix Y that consists of theobserved entries of X while all unobserved entries are set to zeroUnlike X the matrix Y may not have small rank Compute thebest rank r approximation of Y The result as we will show willbe a good approximation to X
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 15 22
But before we show this let us define sampling of entries morerigorously Assume each entry of X is shown or hiddenindependently of others with fixed probability p Which entries areshown is decided by independent Bernoulli random variables
δij sim Ber(p) with p =m
n2
which are often called selectors in this context The value of p ischosen so that among n2 entries of X the expected number ofselected (known) entries is m
Define the ntimes n matrix Y with entries Yij = δijXij We canassume that we are shown Y for it is a matrix that contains theobserved entries of X while all unobserved entries are replacedwith zeros The following result shows how to estimate X basedon Y
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 16 22
Theorem 4 (Matrix completion)
Let 983142X be a best rank r approximation to pminus1Y Then
E1
n983042983142X minusX983042F 983249 C logn
983157rn
m983042X983042max
Here 983042X983042max = maxij |Xij | denotes the maximum magnitude of theentries of X
Remark This theorem controls the average error per entry in themean-squared sense To make the error small let us assume that wehave a sample of size m ≫ rn log2 n which is slightly larger than theideal size m sim rn This makes C log n
983155rnm = o(1) and forces the
recovery error to be bounded by o(1)983042X983042max Summarizing Theorem4 says that the expected average error per entry is much smaller thanthe maximal magnitude of the entries of X This is true for a sampleof almost optimal size m The smaller the rank r of the matrix X thefewer entries of X we need to see in order to do matrix completion
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 17 22
Proof of Theorem 4Step 1 The error in the operator norm Let us first bound therecovery error in the operator norm Decompose the error into twoparts using triangle inequality
983042983142X minusX983042 le 983042983142X minus pminus1Y 983042+ 983042pminus1Y minusX983042
Recall that 983142X is a best approximation to pminus1Y Then the first part ofthe error is smaller than the second part and we have
983042983142X minusX983042 le 2983042pminus1Y minusX983042 =2
p983042Y minus pX983042
The entries of the matrix Y minus pX
(Y minus pX)ij = (δij minus p)Xij
are independent and mean zero random variables
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 18 22
We have
E983042Y minus pX983042
983249 C log n middot983061Emax
i983042(Y minus pX)i9830422 + Emax
j983042(Y minus pX)j9830422
983062
All that remains is to bound the norms of the rows and columns ofY minus pX This is not difficult if we note that they can be expressed assums of independent random variables
983042(Y minus pX)i98304222 =n983131
j=1
(δij minus p)2X2ij 983249
n983131
j=1
(δij minus p)2 middot 983042X9830422max
and similarly for columns Taking expectation and noting that
E (δij minus p)2 = Var (δij) = p(1minus p)
we get
E 983042(Y minus pX)i9830422 983249983059E 983042(Y minus pX)i98304222
98306012983249 radic
pn983042X983042max
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 19 22
This is a good bound but we need something stronger Since themaximum appears inside the expectation we need a uniform boundwhich will say that all rows are bounded simultaneously with highprobability Such uniform bounds are usually proved by applyingconcentration inequalities followed by a union bound Bernsteinsinequality (145) yields (check)
P
983099983103
983101
n983131
j=1
(δij minus p)2 gt tpn
983100983104
983102 983249 exp(minusctpn) for t 983245 3
This probability can be further bounded by nminusct using the assumptionthat m = pn2 ge n log n A union bound over n rows leads to
P
983099983103
983101maxiisin[n]
n983131
j=1
(δij minus p)2 gt tpn
983100983104
983102 983249 n middot nminusct for t 983245 3
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 20 22
Integrating this tail we have
Emaxiisin[n]
n983131
j=1
(δij minus p)2 ≲ pn
And this yields the desired bound on the rows
Emaxiisin[n]
983042(Y minus pX)i9830422 ≲radicpn983042X983042max
We can do similarly for the columns Then
E983042Y minus pX983042 ≲ log nradicpn983042X983042max
Therefore we get
E983042983142X minusX983042 ≲ log n
983157n
p983042X983042max
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 21 22
Step 2 Passing to Frobenius normWe know that rank(X) le r by assumption and rank(983141X) le r by
construction so rank(983142X minusX) le 2r There is a simple relationshipbetween the operator and Frobenius norms
983042983142X minusX983042F leradic2r983042983142X minusX983042
Taking expectation of both sides we get
E983042983142X minusX983042F 983249radic2rE983042983142X minusX983042 ≲ log n
983157rn
p983042X983042max
Dividing both sides by n we can rewrite this bound as
E1
n983042983142X minusX983042F ≲ log n
983157rn
pn2983042X983042max
The proof is completed by noting the definition of the samplingprobability p = mn2
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 22 22
Note that Xij are (conditionally) bounded almost surely and this isexactly what we have lacked to apply matrix Bernsteinrsquos inequalityNow we can do it The corollary of matrix Bernsteinrsquos inequality gives
Eε
983056983056983056983131
ilejXij
983056983056983056 ≲ σ983155
log n+K log n
where σ2 = 983042983123
ilej EεX2ij983042 and K = maxilej 983042Xij983042 A good exercise is
to check that
σ ≲ maxi
983042Ai9830422 and K ≲ maxi
983042Ai9830422
Then we have
Eε
983056983056983056983131
i983249jXij
983056983056983056 ≲ log n middotmaxi
983042Ai9830422
Finally we unfix A by taking expectation of both sides of thisinequality with respect to A and using the law of total expectation
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 13 22
We state Theorem 2 for symmetric matrices but it is simple toextend it to general mtimes n random matrices A The bound in thiscase becomes
E983042A9830422 le C log(m+ n) middot (Emaxi
983042Ai9830422 + Emaxj
983042Aj9830422)
To see this apply Theorem 2 to the (m+ n)times (m+ n) symmetricrandom matrix 983063
0 AAT 0
983064
3 Matrix completion
Consider a fixed unknown ntimes n matrix X Suppose we are shownm randomly chosen entries of X Can we guess all the missingentries This important problem is called matrix completion Wewill analyze it using the bounds on the norms on random matriceswe just obtained
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 14 22
Obviously there is no way to guess the missing entries unless weknow something extra about the matrix X So let us assume thatX has low rank
rank(X) = r ≪ n
The number of degrees of freedom of an ntimes n matrix with rank ris O(rn) (Why) So we may hope that m sim rn observed entriesof X will be enough to determine X completely But how
Here we will analyze what is probably the simplest method formatrix completion Take the matrix Y that consists of theobserved entries of X while all unobserved entries are set to zeroUnlike X the matrix Y may not have small rank Compute thebest rank r approximation of Y The result as we will show willbe a good approximation to X
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 15 22
But before we show this let us define sampling of entries morerigorously Assume each entry of X is shown or hiddenindependently of others with fixed probability p Which entries areshown is decided by independent Bernoulli random variables
δij sim Ber(p) with p =m
n2
which are often called selectors in this context The value of p ischosen so that among n2 entries of X the expected number ofselected (known) entries is m
Define the ntimes n matrix Y with entries Yij = δijXij We canassume that we are shown Y for it is a matrix that contains theobserved entries of X while all unobserved entries are replacedwith zeros The following result shows how to estimate X basedon Y
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 16 22
Theorem 4 (Matrix completion)
Let 983142X be a best rank r approximation to pminus1Y Then
E1
n983042983142X minusX983042F 983249 C logn
983157rn
m983042X983042max
Here 983042X983042max = maxij |Xij | denotes the maximum magnitude of theentries of X
Remark This theorem controls the average error per entry in themean-squared sense To make the error small let us assume that wehave a sample of size m ≫ rn log2 n which is slightly larger than theideal size m sim rn This makes C log n
983155rnm = o(1) and forces the
recovery error to be bounded by o(1)983042X983042max Summarizing Theorem4 says that the expected average error per entry is much smaller thanthe maximal magnitude of the entries of X This is true for a sampleof almost optimal size m The smaller the rank r of the matrix X thefewer entries of X we need to see in order to do matrix completion
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 17 22
Proof of Theorem 4Step 1 The error in the operator norm Let us first bound therecovery error in the operator norm Decompose the error into twoparts using triangle inequality
983042983142X minusX983042 le 983042983142X minus pminus1Y 983042+ 983042pminus1Y minusX983042
Recall that 983142X is a best approximation to pminus1Y Then the first part ofthe error is smaller than the second part and we have
983042983142X minusX983042 le 2983042pminus1Y minusX983042 =2
p983042Y minus pX983042
The entries of the matrix Y minus pX
(Y minus pX)ij = (δij minus p)Xij
are independent and mean zero random variables
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 18 22
We have
E983042Y minus pX983042
983249 C log n middot983061Emax
i983042(Y minus pX)i9830422 + Emax
j983042(Y minus pX)j9830422
983062
All that remains is to bound the norms of the rows and columns ofY minus pX This is not difficult if we note that they can be expressed assums of independent random variables
983042(Y minus pX)i98304222 =n983131
j=1
(δij minus p)2X2ij 983249
n983131
j=1
(δij minus p)2 middot 983042X9830422max
and similarly for columns Taking expectation and noting that
E (δij minus p)2 = Var (δij) = p(1minus p)
we get
E 983042(Y minus pX)i9830422 983249983059E 983042(Y minus pX)i98304222
98306012983249 radic
pn983042X983042max
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 19 22
This is a good bound but we need something stronger Since themaximum appears inside the expectation we need a uniform boundwhich will say that all rows are bounded simultaneously with highprobability Such uniform bounds are usually proved by applyingconcentration inequalities followed by a union bound Bernsteinsinequality (145) yields (check)
P
983099983103
983101
n983131
j=1
(δij minus p)2 gt tpn
983100983104
983102 983249 exp(minusctpn) for t 983245 3
This probability can be further bounded by nminusct using the assumptionthat m = pn2 ge n log n A union bound over n rows leads to
P
983099983103
983101maxiisin[n]
n983131
j=1
(δij minus p)2 gt tpn
983100983104
983102 983249 n middot nminusct for t 983245 3
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 20 22
Integrating this tail we have
Emaxiisin[n]
n983131
j=1
(δij minus p)2 ≲ pn
And this yields the desired bound on the rows
Emaxiisin[n]
983042(Y minus pX)i9830422 ≲radicpn983042X983042max
We can do similarly for the columns Then
E983042Y minus pX983042 ≲ log nradicpn983042X983042max
Therefore we get
E983042983142X minusX983042 ≲ log n
983157n
p983042X983042max
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 21 22
Step 2 Passing to Frobenius normWe know that rank(X) le r by assumption and rank(983141X) le r by
construction so rank(983142X minusX) le 2r There is a simple relationshipbetween the operator and Frobenius norms
983042983142X minusX983042F leradic2r983042983142X minusX983042
Taking expectation of both sides we get
E983042983142X minusX983042F 983249radic2rE983042983142X minusX983042 ≲ log n
983157rn
p983042X983042max
Dividing both sides by n we can rewrite this bound as
E1
n983042983142X minusX983042F ≲ log n
983157rn
pn2983042X983042max
The proof is completed by noting the definition of the samplingprobability p = mn2
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 22 22
We state Theorem 2 for symmetric matrices but it is simple toextend it to general mtimes n random matrices A The bound in thiscase becomes
E983042A9830422 le C log(m+ n) middot (Emaxi
983042Ai9830422 + Emaxj
983042Aj9830422)
To see this apply Theorem 2 to the (m+ n)times (m+ n) symmetricrandom matrix 983063
0 AAT 0
983064
3 Matrix completion
Consider a fixed unknown ntimes n matrix X Suppose we are shownm randomly chosen entries of X Can we guess all the missingentries This important problem is called matrix completion Wewill analyze it using the bounds on the norms on random matriceswe just obtained
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 14 22
Obviously there is no way to guess the missing entries unless weknow something extra about the matrix X So let us assume thatX has low rank
rank(X) = r ≪ n
The number of degrees of freedom of an ntimes n matrix with rank ris O(rn) (Why) So we may hope that m sim rn observed entriesof X will be enough to determine X completely But how
Here we will analyze what is probably the simplest method formatrix completion Take the matrix Y that consists of theobserved entries of X while all unobserved entries are set to zeroUnlike X the matrix Y may not have small rank Compute thebest rank r approximation of Y The result as we will show willbe a good approximation to X
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 15 22
But before we show this let us define sampling of entries morerigorously Assume each entry of X is shown or hiddenindependently of others with fixed probability p Which entries areshown is decided by independent Bernoulli random variables
δij sim Ber(p) with p =m
n2
which are often called selectors in this context The value of p ischosen so that among n2 entries of X the expected number ofselected (known) entries is m
Define the ntimes n matrix Y with entries Yij = δijXij We canassume that we are shown Y for it is a matrix that contains theobserved entries of X while all unobserved entries are replacedwith zeros The following result shows how to estimate X basedon Y
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 16 22
Theorem 4 (Matrix completion)
Let 983142X be a best rank r approximation to pminus1Y Then
E1
n983042983142X minusX983042F 983249 C logn
983157rn
m983042X983042max
Here 983042X983042max = maxij |Xij | denotes the maximum magnitude of theentries of X
Remark This theorem controls the average error per entry in themean-squared sense To make the error small let us assume that wehave a sample of size m ≫ rn log2 n which is slightly larger than theideal size m sim rn This makes C log n
983155rnm = o(1) and forces the
recovery error to be bounded by o(1)983042X983042max Summarizing Theorem4 says that the expected average error per entry is much smaller thanthe maximal magnitude of the entries of X This is true for a sampleof almost optimal size m The smaller the rank r of the matrix X thefewer entries of X we need to see in order to do matrix completion
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 17 22
Proof of Theorem 4Step 1 The error in the operator norm Let us first bound therecovery error in the operator norm Decompose the error into twoparts using triangle inequality
983042983142X minusX983042 le 983042983142X minus pminus1Y 983042+ 983042pminus1Y minusX983042
Recall that 983142X is a best approximation to pminus1Y Then the first part ofthe error is smaller than the second part and we have
983042983142X minusX983042 le 2983042pminus1Y minusX983042 =2
p983042Y minus pX983042
The entries of the matrix Y minus pX
(Y minus pX)ij = (δij minus p)Xij
are independent and mean zero random variables
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 18 22
We have
E983042Y minus pX983042
983249 C log n middot983061Emax
i983042(Y minus pX)i9830422 + Emax
j983042(Y minus pX)j9830422
983062
All that remains is to bound the norms of the rows and columns ofY minus pX This is not difficult if we note that they can be expressed assums of independent random variables
983042(Y minus pX)i98304222 =n983131
j=1
(δij minus p)2X2ij 983249
n983131
j=1
(δij minus p)2 middot 983042X9830422max
and similarly for columns Taking expectation and noting that
E (δij minus p)2 = Var (δij) = p(1minus p)
we get
E 983042(Y minus pX)i9830422 983249983059E 983042(Y minus pX)i98304222
98306012983249 radic
pn983042X983042max
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 19 22
This is a good bound but we need something stronger Since themaximum appears inside the expectation we need a uniform boundwhich will say that all rows are bounded simultaneously with highprobability Such uniform bounds are usually proved by applyingconcentration inequalities followed by a union bound Bernsteinsinequality (145) yields (check)
P
983099983103
983101
n983131
j=1
(δij minus p)2 gt tpn
983100983104
983102 983249 exp(minusctpn) for t 983245 3
This probability can be further bounded by nminusct using the assumptionthat m = pn2 ge n log n A union bound over n rows leads to
P
983099983103
983101maxiisin[n]
n983131
j=1
(δij minus p)2 gt tpn
983100983104
983102 983249 n middot nminusct for t 983245 3
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 20 22
Integrating this tail we have
Emaxiisin[n]
n983131
j=1
(δij minus p)2 ≲ pn
And this yields the desired bound on the rows
Emaxiisin[n]
983042(Y minus pX)i9830422 ≲radicpn983042X983042max
We can do similarly for the columns Then
E983042Y minus pX983042 ≲ log nradicpn983042X983042max
Therefore we get
E983042983142X minusX983042 ≲ log n
983157n
p983042X983042max
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 21 22
Step 2 Passing to Frobenius normWe know that rank(X) le r by assumption and rank(983141X) le r by
construction so rank(983142X minusX) le 2r There is a simple relationshipbetween the operator and Frobenius norms
983042983142X minusX983042F leradic2r983042983142X minusX983042
Taking expectation of both sides we get
E983042983142X minusX983042F 983249radic2rE983042983142X minusX983042 ≲ log n
983157rn
p983042X983042max
Dividing both sides by n we can rewrite this bound as
E1
n983042983142X minusX983042F ≲ log n
983157rn
pn2983042X983042max
The proof is completed by noting the definition of the samplingprobability p = mn2
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 22 22
Obviously there is no way to guess the missing entries unless weknow something extra about the matrix X So let us assume thatX has low rank
rank(X) = r ≪ n
The number of degrees of freedom of an ntimes n matrix with rank ris O(rn) (Why) So we may hope that m sim rn observed entriesof X will be enough to determine X completely But how
Here we will analyze what is probably the simplest method formatrix completion Take the matrix Y that consists of theobserved entries of X while all unobserved entries are set to zeroUnlike X the matrix Y may not have small rank Compute thebest rank r approximation of Y The result as we will show willbe a good approximation to X
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 15 22
But before we show this let us define sampling of entries morerigorously Assume each entry of X is shown or hiddenindependently of others with fixed probability p Which entries areshown is decided by independent Bernoulli random variables
δij sim Ber(p) with p =m
n2
which are often called selectors in this context The value of p ischosen so that among n2 entries of X the expected number ofselected (known) entries is m
Define the ntimes n matrix Y with entries Yij = δijXij We canassume that we are shown Y for it is a matrix that contains theobserved entries of X while all unobserved entries are replacedwith zeros The following result shows how to estimate X basedon Y
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 16 22
Theorem 4 (Matrix completion)
Let 983142X be a best rank r approximation to pminus1Y Then
E1
n983042983142X minusX983042F 983249 C logn
983157rn
m983042X983042max
Here 983042X983042max = maxij |Xij | denotes the maximum magnitude of theentries of X
Remark This theorem controls the average error per entry in themean-squared sense To make the error small let us assume that wehave a sample of size m ≫ rn log2 n which is slightly larger than theideal size m sim rn This makes C log n
983155rnm = o(1) and forces the
recovery error to be bounded by o(1)983042X983042max Summarizing Theorem4 says that the expected average error per entry is much smaller thanthe maximal magnitude of the entries of X This is true for a sampleof almost optimal size m The smaller the rank r of the matrix X thefewer entries of X we need to see in order to do matrix completion
Covariance estimation and MC DAMC Lecture 14 May 27 - 29 2020 17 22
Proof of Theorem 4.
Step 1: The error in the operator norm. Let us first bound the recovery error in the operator norm. Decompose the error into two parts using the triangle inequality:
$$\|\widehat{X} - X\| \le \|\widehat{X} - p^{-1}Y\| + \|p^{-1}Y - X\|.$$
Recall that $\widehat{X}$ is a best rank-$r$ approximation to $p^{-1}Y$. Then the first part of the error is no larger than the second part, and we have
$$\|\widehat{X} - X\| \le 2\,\|p^{-1}Y - X\| = \frac{2}{p}\,\|Y - pX\|.$$
The entries of the matrix $Y - pX$,
$$(Y - pX)_{ij} = (\delta_{ij} - p)X_{ij},$$
are independent, mean zero random variables.
We have
$$\mathbb{E}\|Y - pX\| \le C \log n \cdot \Big( \mathbb{E}\max_i \|(Y - pX)_{i:}\|_2 + \mathbb{E}\max_j \|(Y - pX)_{:j}\|_2 \Big),$$
where $(Y - pX)_{i:}$ and $(Y - pX)_{:j}$ denote the rows and columns of $Y - pX$. All that remains is to bound the norms of these rows and columns. This is not difficult if we note that they can be expressed as sums of independent random variables:
$$\|(Y - pX)_{i:}\|_2^2 = \sum_{j=1}^n (\delta_{ij} - p)^2 X_{ij}^2 \le \sum_{j=1}^n (\delta_{ij} - p)^2 \cdot \|X\|_{\max}^2,$$
and similarly for columns. Taking expectation and noting that
$$\mathbb{E}(\delta_{ij} - p)^2 = \mathrm{Var}(\delta_{ij}) = p(1 - p) \le p,$$
we get
$$\mathbb{E}\,\|(Y - pX)_{i:}\|_2 \le \big(\mathbb{E}\,\|(Y - pX)_{i:}\|_2^2\big)^{1/2} \le \sqrt{pn}\,\|X\|_{\max}.$$
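This row-norm bound is easy to verify empirically. The Monte Carlo sketch below uses hypothetical sizes and draws $X$ with entries in $[-1,1]$, so $\|X\|_{\max} \le 1$ by construction.

```python
import numpy as np

rng = np.random.default_rng(2)
n, p = 500, 0.1
X = rng.uniform(-1.0, 1.0, (n, n))   # |X_ij| <= 1, so ||X||_max <= 1

# Monte Carlo estimate of E ||(Y - pX)_i||_2 for one fixed row i = 0
trials = 200
norms = []
for _ in range(trials):
    delta_row = rng.random(n) < p            # fresh selectors for the row
    row = (delta_row - p) * X[0]             # ((delta_ij - p) X_ij)_j
    norms.append(np.linalg.norm(row))

bound = np.sqrt(p * n)                       # sqrt(pn) ||X||_max, with ||X||_max <= 1
print(np.mean(norms), bound)                 # empirical mean vs the theoretical bound
```

The empirical mean lands visibly below $\sqrt{pn}$, with slack coming from the factors $1-p$ and $\mathbb{E}X_{ij}^2 < \|X\|_{\max}^2$ that the bound discards.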
This is a good bound, but we need something stronger. Since the maximum appears inside the expectation, we need a uniform bound, which will say that all rows are bounded simultaneously with high probability. Such uniform bounds are usually proved by applying concentration inequalities followed by a union bound. Bernstein's inequality (14.5) yields (check!)
$$\mathbb{P}\Big\{ \sum_{j=1}^n (\delta_{ij} - p)^2 > tpn \Big\} \le \exp(-ctpn) \quad \text{for } t \ge 3.$$
This probability can be further bounded by $n^{-ct}$, using the assumption that $m = pn^2 \ge n \log n$. A union bound over the $n$ rows leads to
$$\mathbb{P}\Big\{ \max_{i \in [n]} \sum_{j=1}^n (\delta_{ij} - p)^2 > tpn \Big\} \le n \cdot n^{-ct} \quad \text{for } t \ge 3.$$
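The rarity of this event can be observed directly. The simulation below (illustrative sizes; $p$ is chosen so that $pn$ is at least of order $\log n$, mirroring the assumption $m \ge n \log n$) counts how often the maximal row sum exceeds $3pn$.

```python
import numpy as np

rng = np.random.default_rng(3)
n = 300
p = max(np.log(n) / n, 0.05)   # keep pn at least on the order of log n

# empirical frequency of the event  max_i sum_j (delta_ij - p)^2 > 3 p n
trials = 100
exceed = 0
for _ in range(trials):
    delta = rng.random((n, n)) < p
    row_sums = ((delta - p) ** 2).sum(axis=1)   # sum_j (delta_ij - p)^2, one value per row
    exceed += row_sums.max() > 3 * p * n
print(exceed / trials)   # the event is rare: frequency should be 0 or very small
```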
Integrating this tail, we have
$$\mathbb{E}\max_{i \in [n]} \sum_{j=1}^n (\delta_{ij} - p)^2 \lesssim pn,$$
and this yields the desired bound on the rows:
$$\mathbb{E}\max_{i \in [n]} \|(Y - pX)_{i:}\|_2 \lesssim \sqrt{pn}\,\|X\|_{\max}.$$
We can argue similarly for the columns. Then
$$\mathbb{E}\|Y - pX\| \lesssim \log n \,\sqrt{pn}\,\|X\|_{\max}.$$
Therefore, we get
$$\mathbb{E}\|\widehat{X} - X\| \lesssim \log n \,\sqrt{\frac{n}{p}}\,\|X\|_{\max}.$$
Step 2: Passing to the Frobenius norm. We know that $\mathrm{rank}(X) \le r$ by assumption and $\mathrm{rank}(\widehat{X}) \le r$ by construction, so $\mathrm{rank}(\widehat{X} - X) \le 2r$. There is a simple relationship between the operator and Frobenius norms:
$$\|\widehat{X} - X\|_F \le \sqrt{2r}\,\|\widehat{X} - X\|.$$
Taking the expectation of both sides, we get
$$\mathbb{E}\|\widehat{X} - X\|_F \le \sqrt{2r}\,\mathbb{E}\|\widehat{X} - X\| \lesssim \log n \,\sqrt{\frac{rn}{p}}\,\|X\|_{\max}.$$
Dividing both sides by $n$, we can rewrite this bound as
$$\mathbb{E}\,\frac{1}{n}\,\|\widehat{X} - X\|_F \lesssim \log n \,\sqrt{\frac{rn}{pn^2}}\,\|X\|_{\max}.$$
The proof is completed by noting the definition of the sampling probability, $p = m/n^2$. $\square$
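The norm comparison used in Step 2, $\|A\|_F \le \sqrt{\operatorname{rank} A}\,\|A\|$, can be sanity-checked numerically for a matrix of rank at most $2r$, playing the role of $\widehat{X} - X$ (an illustrative sketch with arbitrary sizes):

```python
import numpy as np

rng = np.random.default_rng(4)
n, r = 100, 4
# a random matrix of rank at most 2r, like Xhat - X in the proof
A = rng.standard_normal((n, 2 * r)) @ rng.standard_normal((2 * r, n))

fro = np.linalg.norm(A, "fro")
op = np.linalg.norm(A, 2)          # operator (spectral) norm
print(fro <= np.sqrt(2 * r) * op)  # prints True: ||A||_F <= sqrt(2r) ||A||
```

The inequality holds for every matrix of rank at most $2r$, since $\|A\|_F^2$ is the sum of at most $2r$ squared singular values, each at most $\|A\|^2$.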