Controlling false discovery rate via knockoffs
Rina Foygel Barber & Emmanuel Candès
Code & demos available at http://web.stanford.edu/~candes/Knockoffs/
Paper available at http://arxiv.org/abs/1404.5609
Jan 21 2015
Jan 21 2015 Controlling false discovery rate via knockoffs 1/36
Setting
An example: which mutations in the reverse transcriptase (RT) of HIV-1 determine susceptibility to reverse transcriptase inhibitors (RTIs)?

• y_i ∈ ℝ = resistance of the virus in sample i to an RTI-type drug
• X_ij ∈ {0, 1} indicates whether mutation j is present in virus sample i

How can we select mutations that determine drug resistance, in such a way that our answer will replicate in further trials?
Setting
Sparse linear model:

y = Xβ + z, where z_i ~iid N(0, σ²)

• n observations, p features
• β is sparse
Setting
Goal: select a set of features X_j that are likely to be relevant to the response y, without too many false positives.

One way to measure performance is the false discovery rate (FDR), the expectation of the false discovery proportion (FDP):

FDR = E[ # false positives / total # of features selected ] = E[ |S ∩ H0| / |S| ]

where S = set of selected features and H0 = "null hypotheses" = {j : β*_j = 0}.
Sparse regression
Lasso:

β̂^λ = argmin_{β ∈ ℝ^p} { ½ ‖y − Xβ‖²₂ + λ ‖β‖₁ }

Asymptotically, the Lasso will select the correct model (at a well-chosen λ). In practice, for a finite sample:

• True positives & false positives are intermixed along the Lasso path
• How do we pick λ to balance FDR vs. power?
• We need to account for correlations between the X_j & for weak signals that may have been missed on the Lasso path.
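The Lasso fit above can be reproduced with standard tools. A minimal sketch (not the authors' code) using scikit-learn, whose objective is scaled by 1/(2n) so that α = λ/n; the sparsity k = 30 and signal amplitude are my own choices, mirroring only the n and p of the talk's simulation:

```python
import numpy as np
from sklearn.linear_model import Lasso

rng = np.random.default_rng(0)
n, p, k = 1500, 500, 30                          # n, p as in the talk; k assumed
X = rng.standard_normal((n, p)) / np.sqrt(n)     # columns with norm ~ 1
beta = np.zeros(p)
beta[:k] = 3.5                                   # assumed signal amplitude
y = X @ beta + rng.standard_normal(n)

# sklearn minimizes (1/(2n))||y - Xb||^2 + alpha*||b||_1,
# so alpha = lambda / n matches the talk's objective at lambda = 1.75
lam = 1.75
model = Lasso(alpha=lam / n, fit_intercept=False).fit(X, y)
selected = np.flatnonzero(model.coef_)           # features with nonzero fitted coefficient
```

As the next slides illustrate, the selected set mixes true and false positives, and nothing in the fit itself tells us which is which.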
Sparse regression

Simulated data with n = 1500, p = 500. Lasso fitted model for λ = 1.75:

[Figure: fitted coefficients β̂_j plotted against the index j = 1, …, 500.]
[The same figure, with individual selected coefficients annotated "False positive?" / "True positive?" — the plot alone cannot distinguish them.]
In this simulation the truth is known: FDP = 26/55 ≈ 47% at λ = 1.75.

To estimate FDP in general, we would need to calculate the distribution of β̂^λ_j for null j (which would require knowing σ², β*, …). (Donoho et al 2009)
Construct knockoffs
Main idea: for each feature X_j, construct a knockoff version X̃_j. The knockoffs serve as a "control group" ⇒ we can estimate FDP.

Setting:
• Require n > p (ongoing work for the high-dimensional setting)
• No need to know σ²
• No need for any information about β*
• Yields an exact, finite-sample guarantee for FDR
Construct knockoffs
Construction:

• The knockoffs replicate the correlation structure of X:

  X̃_j^T X̃_k = X_j^T X_k for all j, k

• They also preserve the correlations between knockoffs & originals:

  X̃_j^T X_k = X_j^T X_k for all j ≠ k

Augmented design matrix:

[X X̃] = (X_1 X_2 … X_p X̃_1 X̃_2 … X̃_p) ∈ ℝ^{n×2p}
Construct knockoffs
How?

Define X̃ = X (I_p − 2ξΣ⁻¹) + U C, where:

• Σ = X^T X ⪰ ξ I_p
• U = an n × p orthonormal matrix orthogonal to the column span of X
• C^T C = 4(ξ I_p − ξ² Σ⁻¹) (Cholesky decomposition)

⟹ [X X̃]^T [X X̃] = ( Σ          Σ − 2ξI_p
                      Σ − 2ξI_p   Σ        )
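The construction above can be written out directly. A numerical sketch (my own, under the assumptions that X has full column rank, n ≥ 2p, and Σ ≻ ξI_p so the Cholesky factor exists):

```python
import numpy as np

def construct_knockoffs(X, xi):
    """Knockoffs via X_tilde = X (I - 2 xi Sigma^{-1}) + U C."""
    n, p = X.shape
    Sigma = X.T @ X
    Sigma_inv = np.linalg.inv(Sigma)
    # U: n x p matrix with orthonormal columns, orthogonal to the column span of X
    Q, _ = np.linalg.qr(np.hstack([X, np.random.randn(n, p)]))
    U = Q[:, p:2 * p]
    # C^T C = 4 (xi I - xi^2 Sigma^{-1}); requires Sigma > xi I for positive definiteness
    CtC = 4 * (xi * np.eye(p) - xi**2 * Sigma_inv)
    C = np.linalg.cholesky(CtC).T     # upper-triangular factor, so C.T @ C = CtC
    return X @ (np.eye(p) - 2 * xi * Sigma_inv) + U @ C
```

One can verify the Gram-matrix identities of the slide numerically: X̃^T X̃ = Σ and X̃^T X = Σ − 2ξI_p.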
Construct knockoffs
Why?

For a null feature X_j,

X_j^T y = X_j^T Xβ* + X_j^T z =ᵈ X̃_j^T Xβ* + X̃_j^T z = X̃_j^T y

(=ᵈ denotes equality in distribution).
Construct knockoffs
Lemma 1 (pairwise exchangeability property): for any subset N ⊂ H0,

([X X̃]_swap(N))^T y =ᵈ [X X̃]^T y,

where [X X̃]_swap(N) denotes the augmented matrix with the columns X_j and X̃_j swapped for each j ∈ N.

⟹ the knockoffs are a "control group" for the nulls.
Knockoff method
Steps:

1. Construct knockoffs
2. Compute the Lasso with the augmented matrix:

   β̂^λ = argmin_{β ∈ ℝ^{2p}} { ½ ‖y − [X X̃]β‖²₂ + λ ‖β‖₁ }

3. Use X̃_j as a "control group" for X_j
Knockoff method
Fitted model for λ = 1.75 on the simulated dataset:

[Figure: fitted coefficients β̂_j for the 500 original features and the 500 knockoff features.]

• Lasso selects 49 original features & 24 knockoff features
• Pairwise exchangeability of the nulls ⟹ probably ≈ 24 false positives among the 49 original features
Knockoff method
Compute the Lasso on the entire path λ ∈ [0, ∞).

λ_j = sup{λ : β̂^λ_j ≠ 0} = first time X_j enters the Lasso path
λ̃_j = sup{λ : β̃^λ_j ≠ 0} = first time X̃_j enters the Lasso path

Then define the statistics

W_j = max{λ_j, λ̃_j} · sign(λ_j − λ̃_j)
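The entry times and the statistics W_j can be computed with a standard Lasso path solver. A sketch (my own, using scikit-learn, whose path is parametrized by α = λ/n; the signs and ordering of the W_j are unaffected by that rescaling):

```python
import numpy as np
from sklearn.linear_model import lasso_path

def knockoff_stats(X, X_tilde, y):
    """W_j = max(lambda_j, lambda_tilde_j) * sign(lambda_j - lambda_tilde_j)."""
    n, p = X.shape
    # alphas are returned in decreasing order; coefs has shape (2p, n_alphas)
    alphas, coefs, _ = lasso_path(np.hstack([X, X_tilde]), y)
    entry = np.zeros(2 * p)                 # alpha at which each feature first enters
    for j in range(2 * p):
        nz = np.flatnonzero(coefs[j])
        entry[j] = alphas[nz[0]] if nz.size else 0.0
    lam, lam_tilde = entry[:p], entry[p:]
    return np.maximum(lam, lam_tilde) * np.sign(lam - lam_tilde)
```

A feature with W_j > 0 entered the path before its knockoff; W_j = 0 means neither entered on the computed grid.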
Knockoff method
[Figure: the statistics laid out on a |W| axis with their signs + / − and null vs. signal marked. Near λ = 0 are variables that enter late (probably not significant); toward λ → ∞ are variables that enter early (likely significant).]
Knockoff method
[Scatter plot: the value of λ when X_j enters the Lasso path (x-axis) vs. the value of λ when X̃_j enters (y-axis), nulls vs. signals marked. Nulls scatter symmetrically about the diagonal; signals tend to have the original entering well before its knockoff.]
Knockoff method
Lemma 2: Pairwise exchangeability of the nulls ⟹

(W_1, W_2, …, W_p) =ᵈ (|W_1|·ε_1, |W_2|·ε_2, …, |W_p|·ε_p)

where ε_j = sign(W_j) for non-nulls and ε_j ~iid uniform{±1} for nulls.

[Figure: the |W| axis with random + / − signs attached to the null statistics.]
Knockoff method

Selected variables: S_λ = {j : W_j ≥ +λ}
Control group: S̃_λ = {j : W_j ≤ −λ}

F̂DP(S_λ) := |S̃_λ| / |S_λ|

[Scatter plot of entry times as before, nulls vs. signals marked, with the region of selected variables S_λ and the control group S̃_λ shaded.]

FDP(S_λ) = |S_λ ∩ H0| / |S_λ| ≈ |S̃_λ ∩ H0| / |S_λ| ≤ F̂DP(S_λ)
Knockoff method
The knockoff filter: define

F̂DP(S_λ) := |S̃_λ| / |S_λ| = #{j : W_j ≤ −λ} / #{j : W_j ≥ +λ},

then choose

Λ = min{λ : F̂DP(S_λ) ≤ q}  (or Λ = ∞ if this set is empty)

and select the variable set

S_Λ = {j : W_j ≥ Λ}.
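In practice the data-dependent threshold Λ can be found by scanning the magnitudes |W_j|, since F̂DP only changes at those values. A sketch of the selection rule (my own; `plus=True` adds the +1 correction of the knockoff+ variant defined on a later slide):

```python
import numpy as np

def knockoff_select(W, q, plus=False):
    """Return the threshold Lambda and the selected set {j : W_j >= Lambda}."""
    candidates = np.sort(np.unique(np.abs(W[W != 0])))  # enough to scan the |W_j| values
    offset = 1 if plus else 0
    for t in candidates:
        fdp_hat = (offset + np.sum(W <= -t)) / max(1, np.sum(W >= t))
        if fdp_hat <= q:
            return t, np.flatnonzero(W >= t)
    return np.inf, np.array([], dtype=int)

# toy statistics: negatives play the role of knockoffs beating originals
W = np.array([3.0, 2.5, 2.0, -1.0, 1.5, 0.5, -0.2])
Lam, S = knockoff_select(W, q=0.4)
```

With these toy values, the smallest threshold at which F̂DP ≤ 0.4 is Λ = 0.2 (two negatives against five selections, F̂DP = 2/5).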
Theoretical guarantees
Theorem 1: For S_Λ chosen by the knockoff filter,

E[mFDP(S_Λ)] ≤ q

where the modified FDP is given by

mFDP(S) = |S ∩ H0| / (|S| + q⁻¹).
Theoretical guarantees
The knockoff+ filter: define

F̂DP₊(S_λ) := (|S̃_λ| + 1) / |S_λ| = (#{j : W_j ≤ −λ} + 1) / #{j : W_j ≥ +λ},

then choose

Λ₊ = min{λ : F̂DP₊(S_λ) ≤ q}  (or Λ₊ = ∞ if this set is empty)

and select the variable set

S_Λ₊ = {j : W_j ≥ Λ₊}.
Theoretical guarantees
Theorem 2: For S_Λ₊ chosen by the knockoff+ filter,

E[FDP(S_Λ₊)] ≤ q.

Proof sketch:

FDP(S_Λ₊) = |S_Λ₊ ∩ H0| / |S_Λ₊|
          = [ (|S̃_Λ₊ ∩ H0| + 1) / |S_Λ₊| ] · [ |S_Λ₊ ∩ H0| / (|S̃_Λ₊ ∩ H0| + 1) ]

where the first factor is ≤ F̂DP₊(S_Λ₊) ≤ q and the second is controlled by a martingale argument.
Theoretical guarantees

Proof sketch cont'd:

M(λ) = |S_λ ∩ H0| / (|S̃_λ ∩ H0| + 1)

is a supermartingale with respect to increasing λ, and Λ₊ is a stopping time. By optional stopping,

E[M(Λ₊)] ≤ E[M(0)] = E[ C / (|H0| − C + 1) ] ≤ 1,

for C = # of + coin flips among the nulls, C ~ Bin(|H0|, ½).
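The final inequality can be checked directly: for C ~ Bin(m, ½), the expectation E[C/(m − C + 1)] works out to exactly 1 − 2⁻ᵐ (using c·C(m, c)/(m − c + 1) = C(m, c − 1)), hence is ≤ 1. A quick numerical confirmation (my own):

```python
from math import comb

def expected_M0(m):
    # E[C / (m - C + 1)] for C ~ Binomial(m, 1/2), computed exactly
    return sum(comb(m, c) * 0.5**m * c / (m - c + 1) for c in range(m + 1))

# e.g. expected_M0(3) = 1/8 * (0 + 1 + 3 + 3) = 7/8 = 1 - 2**-3
```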
Simulations
Setup:
• n = 3000, p = 1000, sparsity level k
• Features X_j are random unit vectors with correlation level ρ
• For signals j, β*_j ~iid {±A} for an amplitude level A
• y = Xβ + z, z ~ N(0, I_n)

Compare knockoff, knockoff+, & Benjamini-Hochberg (BHq).
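For reference, the BHq baseline in this comparison is the standard Benjamini-Hochberg step-up procedure on p-values. A minimal implementation (my own, not the authors' code):

```python
import numpy as np

def benjamini_hochberg(pvals, q):
    """BH step-up: reject the k smallest p-values, k = max{i : p_(i) <= q*i/m}."""
    pvals = np.asarray(pvals)
    m = len(pvals)
    order = np.argsort(pvals)
    sorted_p = pvals[order]
    below = sorted_p <= q * np.arange(1, m + 1) / m
    if not below.any():
        return np.array([], dtype=int)
    k = np.max(np.flatnonzero(below)) + 1   # largest index passing the step-up rule
    return np.sort(order[:k])               # indices of rejected hypotheses
```

Note the contrast with the knockoff filter: BH needs valid per-feature p-values, which are hard to obtain for the Lasso, while knockoffs work directly from the W_j statistics.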
Simulations
• Fix the amplitude A = 3.5 & the sparsity level k = 30
• Vary the feature correlation ρ from 0 to 0.9 (set E[X_j^T X_k] = ρ^|j−k|)

[Two plots against feature correlation ρ ∈ [0, 0.9]: FDR (%) with the nominal level marked, and power (%), for Knockoff, Knockoff+, and BHq.]
HIV data
Which mutations in the RT or protease of HIV-1 determine susceptibility to RT inhibitors or protease inhibitors?

Data: Genotypic predictors of HIV type 1 drug resistance, Rhee et al (2006). Available at hivdb.stanford.edu (Stanford HIV Drug Resistance Database).

• Each drug is analysed separately
• Response y = resistance to the drug
• Features X = which mutations are present in the RT or in the protease
HIV data
The data set:

Drug type | # drugs | Sample size | # protease or RT positions genotyped | # mutations appearing ≥ 3 times in sample
PI        | 6       | 848         | 99                                   | 209
NRTI      | 6       | 639         | 240                                  | 294
NNRTI     | 3       | 747         | 240                                  | 319

To validate results — treatment-selected mutation (TSM) panel: a separate study identifies mutations frequently present in patients who have been treated with each type of drug.
HIV data
Results for PI-type drugs:

[Bar charts: # HIV-1 protease positions selected by Knockoff vs. BHq, split into positions appearing in the TSM list vs. not, one panel per drug — APV (n = 768, p = 201), ATV (n = 329, p = 147), IDV (n = 826, p = 208), LPV (n = 516, p = 184), NFV (n = 843, p = 209), SQV (n = 825, p = 208).]
HIV data
Results for NRTI- and NNRTI-type drugs:

[Bar charts: # HIV-1 RT positions selected by Knockoff vs. BHq, split by TSM-list membership, one panel per drug.
NRTI: 3TC (n = 633, p = 292), ABC (n = 628, p = 294), AZT (n = 630, p = 292), D4T (n = 630, p = 293), DDI (n = 632, p = 292), TDF (n = 353, p = 218).
NNRTI: DLV (n = 732, p = 311), EFV (n = 734, p = 318), NVP (n = 746, p = 319).]
Can knockoffs be replaced by permutations?
Let X_π = X with its rows randomly permuted. Then

[X X_π]^T [X X_π] ≈ ( Σ  0
                      0  Σ )

[Plot: value of λ when each variable enters the model vs. index of the column in the augmented matrix [X X_π]; the original non-null features (signals) enter early, then the original null features, then the permuted features.]

FDR (target level q = 20%): Knockoff method 12.29%; Permutation method 45.61%.
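The failure mode is visible numerically: permuting rows destroys the correlation between X_j and the permuted columns, so the cross block of the Gram matrix is ≈ 0 rather than Σ − 2ξI_p, and the permuted columns are a poor control group when the design is correlated. A small check (my own, with an equi-correlated design; ρ = 0.6 is my choice):

```python
import numpy as np

rng = np.random.default_rng(1)
n, p, rho = 2000, 3, 0.6
# rows with covariance rho*J + (1 - rho)*I (equi-correlated features)
L = np.linalg.cholesky(rho * np.ones((p, p)) + (1 - rho) * np.eye(p))
X = rng.standard_normal((n, p)) @ L.T
X_pi = X[rng.permutation(n)]          # same columns, rows randomly permuted

G_within = X.T @ X / n                # off-diagonals ~ rho = 0.6
G_cross = X.T @ X_pi / n              # all entries ~ 0: correlations destroyed
```

So a null X_j and its permuted copy do not behave exchangeably with respect to the other features, unlike a knockoff X̃_j.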
Summary
The knockoff filter for inference in a sparse linear model:

• Creates a "control group" for any type of statistic
• Handles any type of feature correlation
• Works with unknown noise level & sparsity level
• Gives finite-sample FDR guarantees
Summary
Future work:
1. How to move to high-dimensional setting?
2. Extend to GLMs or other regression models?
3. Similar principles for other problems, e.g. graphical models?
Summary
Thank you!
• Code & demos available at http://web.stanford.edu/~candes/Knockoffs/
• Paper available at http://arxiv.org/abs/1404.5609
• Joint work with Emmanuel Candès @ Stanford
• R. F. B. was partially supported by NSF award DMS-1203762. E. C. is partially supported by AFOSR under grant FA9550-09-1-0643, by NSF via grant CCF-0963835, and by the Math + X Award from the Simons Foundation.