52
Estimating Symmetric Distribution Properties Maximum Likelihood Strikes Back! Jayadev Acharya Cornell Hirakendu Das Yahoo Alon Orlitsky UCSD Ananda Suresh Google Shannon Channel March 5, 2018

Estimating Symmetric Distribution Properties Maximum ......Jayadev Acharya Cornell Hirakendu Das Yahoo Alon Orlitsky UCSD Ananda Suresh Google Shannon Channel March 5, 2018 Based on:

  • Upload
    others

  • View
    0

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Estimating Symmetric Distribution Properties Maximum ......Jayadev Acharya Cornell Hirakendu Das Yahoo Alon Orlitsky UCSD Ananda Suresh Google Shannon Channel March 5, 2018 Based on:

EstimatingSymmetricDistributionPropertiesMaximumLikelihoodStrikesBack!

Jayadev AcharyaCornell

Hirakendu DasYahoo

Alon OrlitskyUCSD

Ananda SureshGoogle

ShannonChannelMarch5,2018

Page 2: Estimating Symmetric Distribution Properties Maximum ......Jayadev Acharya Cornell Hirakendu Das Yahoo Alon Orlitsky UCSD Ananda Suresh Google Shannon Channel March 5, 2018 Based on:

Basedon:

AUnifiedMaximumLikelihoodApproachforEstimatingSymmetricPropertiesofDiscreteDistributions,InternationalConferenceonMachineLearning(ICML),2017

http://proceedings.mlr.press/v70/acharya17a.html

Slidesavailableat:https://people.ece.cornell.edu/acharya/talks/shannon-18acharya.pdf

Page 3: Estimating Symmetric Distribution Properties Maximum ......Jayadev Acharya Cornell Hirakendu Das Yahoo Alon Orlitsky UCSD Ananda Suresh Google Shannon Channel March 5, 2018 Based on:

Outline1. Motivation2. Methods3. Proofs4. Futuredirections

Page 4: Estimating Symmetric Distribution Properties Maximum ......Jayadev Acharya Cornell Hirakendu Das Yahoo Alon Orlitsky UCSD Ananda Suresh Google Shannon Channel March 5, 2018 Based on:

Chapter1:Motivation

Page 5: Estimating Symmetric Distribution Properties Maximum ......Jayadev Acharya Cornell Hirakendu Das Yahoo Alon Orlitsky UCSD Ananda Suresh Google Shannon Channel March 5, 2018 Based on:

Distributionproperties𝒫:acollectionofdiscretedistributions• 𝒫 = Δ$ :alldistributionsover 𝑘 = {1,… , 𝑘}• Δ+:distributionsover[6]

Property𝑓:𝒫 → ℝ

• 𝑝 3 =?• Isitfair?Is𝑝 𝑖 = 1/6 forall𝑖?

Page 6: Estimating Symmetric Distribution Properties Maximum ......Jayadev Acharya Cornell Hirakendu Das Yahoo Alon Orlitsky UCSD Ananda Suresh Google Shannon Channel March 5, 2018 Based on:

Propertyestimation

𝑝 unknown distributionin𝒫

Givenindependentsamples𝑋:; = 𝑋:, 𝑋<,… , 𝑋; ∼ 𝑝

Estimate𝑓 𝑝

Samplecomplexity 𝑆 𝑓,𝒫, 𝜀, 𝛿

Minimum𝑛 necessaryto

Estimate𝑓 𝑝 ± 𝜀

Witherrorprobability< 𝛿 (usuallyconstant)

Page 7: Estimating Symmetric Distribution Properties Maximum ......Jayadev Acharya Cornell Hirakendu Das Yahoo Alon Orlitsky UCSD Ananda Suresh Google Shannon Channel March 5, 2018 Based on:

Symmetricproperties𝑓 symmetric ifunchangedunderinputpermutations

Entropy𝐻 𝑝 ≜ ∑ 𝑝(𝑥) log :M(N)N

SupportsizeS 𝑝 ≜ ∑ 𝕀 M N QRS

Manyothers:Renyi entropy,supportcoverage,…

Coinswithbias0.4,andwithbias0.6havesameentropy!

Page 8: Estimating Symmetric Distribution Properties Maximum ......Jayadev Acharya Cornell Hirakendu Das Yahoo Alon Orlitsky UCSD Ananda Suresh Google Shannon Channel March 5, 2018 Based on:

Entropy

𝐻 𝑝 ≜ −U𝑝(𝑥) log 𝑝(𝑥)𝒙

• Mostpopularmeasureofrandomness• Centralquantityininformationtheory[Shannon’48]

Howmanysamplestoestimate𝐻(𝑝) to±𝜀?

Longlineofwork:[Empirical,Miller-Maddow,Jackknifed,Coverageadjusted,BUB(paninski’03),…]

Page 9: Estimating Symmetric Distribution Properties Maximum ......Jayadev Acharya Cornell Hirakendu Das Yahoo Alon Orlitsky UCSD Ananda Suresh Google Shannon Channel March 5, 2018 Based on:

Estimatingentropy• Randomnessofneuralspiketrains• Featureselectionindecisiontrees• Graphicalmodels(Chow-Liu)

Traditionalsettingforpropertyestimation:𝒫 = Δ$:distributionson[𝑘] (small𝑘)Obtainmanysamples(large𝑛)

Genetics,neuralspikes,text,computervision,ecology:𝒌 large,possiblyinfinite,perhapsunknown

Page 10: Estimating Symmetric Distribution Properties Maximum ......Jayadev Acharya Cornell Hirakendu Das Yahoo Alon Orlitsky UCSD Ananda Suresh Google Shannon Channel March 5, 2018 Based on:

Estimatingtheunseen:Corbet’sbutterflies

2yearstrappingbutterfliesinMalaypeninsula:

AskedFisher:

howmanynewspeciesifhegoesfortwomoreyears?

Frequency 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15

Species 118 74 44 24 29 22 20 19 20 15 12 14 6 12 6

Page 11: Estimating Symmetric Distribution Properties Maximum ......Jayadev Acharya Cornell Hirakendu Das Yahoo Alon Orlitsky UCSD Ananda Suresh Google Shannon Channel March 5, 2018 Based on:

Estimatingtheunseen: formulation𝑝: unknown discretedistribution

𝑆X 𝑝 ≜ 𝔼[#𝐝𝐢𝐬𝐭𝐢𝐧𝐜𝐭𝐬𝐲𝐦𝐛𝐨𝐥𝐬in𝑚ind. samples ∼ 𝑝]

𝑆X 𝑝 =U 1 − 1 − 𝑝(𝑥) X

N

Normalizedcoverage:pq MX

Howmanysamplestoestimatepq MX

to±𝜀?

Page 12: Estimating Symmetric Distribution Properties Maximum ......Jayadev Acharya Cornell Hirakendu Das Yahoo Alon Orlitsky UCSD Ananda Suresh Google Shannon Channel March 5, 2018 Based on:

Estimatingtheunseen:applications• Estimatingvocabularysize• Ecologicaldiversity• Microbialdiversityonskin

Wellstudied[GoodToulmin,Efron Thisted]Forconstant𝜀:Requires 𝒎

𝟐samples

Morerecently,[Zou ValiantValiantChan…’16,OrlitskySureshWu’16]:

Requires 𝒎𝐥𝐨𝐠𝒎

samples

Page 13: Estimating Symmetric Distribution Properties Maximum ......Jayadev Acharya Cornell Hirakendu Das Yahoo Alon Orlitsky UCSD Ananda Suresh Google Shannon Channel March 5, 2018 Based on:

Chapter2:Methods

Page 14: Estimating Symmetric Distribution Properties Maximum ......Jayadev Acharya Cornell Hirakendu Das Yahoo Alon Orlitsky UCSD Ananda Suresh Google Shannon Channel March 5, 2018 Based on:

Plug-inestimation

Using𝑋:;,findanestimate�̂� of𝑝

Estimate𝑓 𝑝 by𝑓 �̂�

Howtoestimate𝑝?

Page 15: Estimating Symmetric Distribution Properties Maximum ......Jayadev Acharya Cornell Hirakendu Das Yahoo Alon Orlitsky UCSD Ananda Suresh Google Shannon Channel March 5, 2018 Based on:

Sequencemaximumlikelihood (SML)

𝑝Nvwxy ≜ argmax

M 𝑝(𝑥;)= arg max

M∏ 𝑝(𝑥})}

𝑋:~ = ℎ, ℎ, 𝑡𝑝�X� = argmax𝑝< ℎ · 𝑝 𝑡

𝑝�X� ℎ = <~, 𝑝�X� 𝑡 = :

~

Sameastheempirical-frequency distribution

Multiplicity𝑁N - #times𝑥 appearsin𝑋:;

𝑝wxy 𝑥 =𝑁N𝑛

Page 16: Estimating Symmetric Distribution Properties Maximum ......Jayadev Acharya Cornell Hirakendu Das Yahoo Alon Orlitsky UCSD Ananda Suresh Google Shannon Channel March 5, 2018 Based on:

SMLforentropy

SamplecomplexityofSMLtoestimate𝐻 𝑝 over𝚫$ :

𝑆�X� 𝐻, 𝚫$, 𝜀 = Θ𝑘𝜀

Intheasymptotic𝑛 → ∞,SMLisoptimal

Page 17: Estimating Symmetric Distribution Properties Maximum ......Jayadev Acharya Cornell Hirakendu Das Yahoo Alon Orlitsky UCSD Ananda Suresh Google Shannon Channel March 5, 2018 Based on:

Samplecomplexityofentropy

Samplecomplexityof𝐻(𝑝):

𝑆 𝐻, 𝚫$, 𝜀 = Θ𝑘

𝜀 ⋅ log 𝑘

[…,Paninski’03,ValiantValiant’11,HanJiaoVenkatWeissman’15,WuYang’15]

[ValiantValiant’11]:plug-in,sub-optimal in𝜀

[HanJiaoWeissman ’18]:optimalin𝜀 bytweakingVV’11

Page 18: Estimating Symmetric Distribution Properties Maximum ......Jayadev Acharya Cornell Hirakendu Das Yahoo Alon Orlitsky UCSD Ananda Suresh Google Shannon Channel March 5, 2018 Based on:

Optimalestimators

Generalrecipe:

1. Approximate𝐻(𝑝) withapolynomialin𝑝2. Estimatethepolynomial

Differentnon-plug-in estimatorforeachproperty

Sophisticatedapproximationtheoryresults

Page 19: Estimating Symmetric Distribution Properties Maximum ......Jayadev Acharya Cornell Hirakendu Das Yahoo Alon Orlitsky UCSD Ananda Suresh Google Shannon Channel March 5, 2018 Based on:

PriorworkForseveralimportantproperties

Optimalisalogarithmic factor betterthanempirical

Property 𝒫 SML Optimal References

𝐻 𝒑 𝚫$𝑘𝜀

𝑘𝜀 ⋅ log 𝑘

ValiantValiant ‘11,HanJiaoVenkatWeissman ‘15,WuYang‘15

Page 20: Estimating Symmetric Distribution Properties Maximum ......Jayadev Acharya Cornell Hirakendu Das Yahoo Alon Orlitsky UCSD Ananda Suresh Google Shannon Channel March 5, 2018 Based on:

PriorworkForseveralimportantproperties

Optimalisalogarithmic factor betterthanempirical

Property 𝒫 SML Optimal References

𝐻 𝒑 𝚫$𝑘𝜀

𝑘𝜀 ⋅ log 𝑘

ValiantValiant ‘11,HanJiaoVenkatWeissman ‘15,WuYang‘15

𝑆X 𝒑𝑚

𝚫� 𝑚𝑚

log 𝑚log

1𝜀

Orlitsky SureshWu’16

Page 21: Estimating Symmetric Distribution Properties Maximum ......Jayadev Acharya Cornell Hirakendu Das Yahoo Alon Orlitsky UCSD Ananda Suresh Google Shannon Channel March 5, 2018 Based on:

PriorworkForseveralimportantproperties

Optimalisalogarithmic factor betterthanempirical

Property 𝒫 SML Optimal References

𝐻 𝒑 𝚫$𝑘𝜀

𝑘𝜀 ⋅ log 𝑘

ValiantValiant ‘11,HanJiaoVenkatWeissman ‘15,WuYang‘15

𝑆X 𝒑𝑚

𝚫� 𝑚𝑚

log 𝑚log

1𝜀

Orlitsky SureshWu’16

𝑆 𝒑𝑘

𝚫$ 𝑘 log1𝜀

𝑘log 𝑘

log<1𝜀

WuYang’16

∥ 𝒑 − 𝑢 ∥: 𝚫$𝑘𝜀<

𝑘𝜀< ⋅ log 𝑘

Han JiaoWeissman ‘16

Page 22: Estimating Symmetric Distribution Properties Maximum ......Jayadev Acharya Cornell Hirakendu Das Yahoo Alon Orlitsky UCSD Ananda Suresh Google Shannon Channel March 5, 2018 Based on:

Ourresults [AcharyaDasOrlitsky Suresh’17]Unified,simple,sample-optimalapproachforallaboveproblems

• Plug-inestimator• Maximumlikelihoodprinciple:

sequencemaximumlikelihood(SML)profilemaximumlikelihood(PML)

Page 23: Estimating Symmetric Distribution Properties Maximum ......Jayadev Acharya Cornell Hirakendu Das Yahoo Alon Orlitsky UCSD Ananda Suresh Google Shannon Channel March 5, 2018 Based on:

Chapter3:PML

Page 24: Estimating Symmetric Distribution Properties Maximum ......Jayadev Acharya Cornell Hirakendu Das Yahoo Alon Orlitsky UCSD Ananda Suresh Google Shannon Channel March 5, 2018 Based on:

ProfilesProfileisthe multi-setofmultiplicities

Φ 𝑋:; ≜ {𝑁N: 𝑥 ∈ 𝑋:;}

Φ(ℎ, ℎ, 𝑡) = Φ(𝑡, ℎ, 𝑡) = {1,2}

Φ(𝛼, 𝛾, 𝛽, 𝛾) = {1,1,2}

Page 25: Estimating Symmetric Distribution Properties Maximum ......Jayadev Acharya Cornell Hirakendu Das Yahoo Alon Orlitsky UCSD Ananda Suresh Google Shannon Channel March 5, 2018 Based on:

ProbabilitymultisetSymmetricpropertiesdeterminedby

• Probabilitymultiset:{𝑝(1), 𝑝(2),… }

𝑝 ℎ = 0.4 ⟹ {0.4,0.6}

𝑝 ℎ = 0.6 ⟹ {0.4,0.6}

Profilesaresufficientstatisticforsymmetricproperties

ℎ, ℎ, 𝑡,OR𝑡, ℎ, 𝑡 ⟹ sameestimate

Page 26: Estimating Symmetric Distribution Properties Maximum ......Jayadev Acharya Cornell Hirakendu Das Yahoo Alon Orlitsky UCSD Ananda Suresh Google Shannon Channel March 5, 2018 Based on:

EstimatingprobabilitymultisetOrlitsky Santhanam Viswanathan Zhang ‘04:

“Onmodelingprofilesinsteadofvalues”,UAI

Moreextensively:

OSVZ:“Onestimatingaprobabilitymultiset”,online

Page 27: Estimating Symmetric Distribution Properties Maximum ......Jayadev Acharya Cornell Hirakendu Das Yahoo Alon Orlitsky UCSD Ananda Suresh Google Shannon Channel March 5, 2018 Based on:

Profilemaximumlikelihood(PML)Profile probability

𝑝 Φ = U 𝑝(𝑥;)Nv:� Nv ��

Distributionmaximizing theprofileprobability

𝑝�MX� = argmax

M∈𝒫𝑝(Φ)

Page 28: Estimating Symmetric Distribution Properties Maximum ......Jayadev Acharya Cornell Hirakendu Das Yahoo Alon Orlitsky UCSD Ananda Suresh Google Shannon Channel March 5, 2018 Based on:

PMLexample𝑋:~ = ℎ, ℎ, 𝑡Φ ℎ, ℎ, 𝑡 = 1,2

𝑝 Φ = 1,2

= 𝑝 𝑠, 𝑠, 𝑑 + 𝑝 𝑠, 𝑑, 𝑠 + 𝑝 𝑑, 𝑠, 𝑠

= 3 ⋅ 𝑝 𝑠, 𝑠, 𝑑

= 3 ⋅ ∑ 𝑝< 𝑥 𝑝(𝑦)N��

Symmetricpolynomial

Page 29: Estimating Symmetric Distribution Properties Maximum ......Jayadev Acharya Cornell Hirakendu Das Yahoo Alon Orlitsky UCSD Ananda Suresh Google Shannon Channel March 5, 2018 Based on:

SMLof{1,2}

𝑝 Φ = 1,2 = 3 U 𝑝< 𝑥 𝑝(𝑦)N��

𝑝�X� ℎ = <~, 𝑝�X� 𝑡 = :

~

𝑝�X� 1, 2 = 323

< 13 +

13

< 23 =

1827 =

23

Page 30: Estimating Symmetric Distribution Properties Maximum ......Jayadev Acharya Cornell Hirakendu Das Yahoo Alon Orlitsky UCSD Ananda Suresh Google Shannon Channel March 5, 2018 Based on:

PMLof{1,2}

𝑝 Φ = 1,2 = 3 U 𝑝< 𝑥 𝑝(𝑦)N��

U 𝑝< 𝑥 𝑝(𝑦)N��

=U𝑝< 𝑥 (1 − 𝑝 𝑥 )N

=U𝑝 𝑥 ⋅ 𝑝(𝑥)(1 − 𝑝 𝑥 )N

≤14

𝑝MX� 1, 2 =34

Page 31: Estimating Symmetric Distribution Properties Maximum ......Jayadev Acharya Cornell Hirakendu Das Yahoo Alon Orlitsky UCSD Ananda Suresh Google Shannon Channel March 5, 2018 Based on:

PMLof{1,1,2}Φ(𝛼, 𝛾, 𝛽, 𝛾) = {1,1, 2}

𝑝MX� 1,1,2 = 𝑈[5]

PMLcanpredictexistenceofunseensymbols

Maximize:

U 𝑝 𝑥 <𝑝 𝑦 𝑝 𝑧N���¤

,

subjectto:

U𝑝(𝑥)N

= 1,𝑝 𝑥 ≥ 0

Page 32: Estimating Symmetric Distribution Properties Maximum ......Jayadev Acharya Cornell Hirakendu Das Yahoo Alon Orlitsky UCSD Ananda Suresh Google Shannon Channel March 5, 2018 Based on:

Uniform[500],700samples

700samples

Page 33: Estimating Symmetric Distribution Properties Maximum ......Jayadev Acharya Cornell Hirakendu Das Yahoo Alon Orlitsky UCSD Ananda Suresh Google Shannon Channel March 5, 2018 Based on:

U[500],700x,12experiments

Page 34: Estimating Symmetric Distribution Properties Maximum ......Jayadev Acharya Cornell Hirakendu Das Yahoo Alon Orlitsky UCSD Ananda Suresh Google Shannon Channel March 5, 2018 Based on:

PMLplug-inToestimatesymmetric𝑓(𝑝):• Find𝑝MX� Φ(𝑋:;)• Output𝑓(𝑝MX�)

Advantages:• Notuningparameters• Notfunctionspecific

Rootedinthemaximumlikelihoodprinciple

Page 35: Estimating Symmetric Distribution Properties Maximum ......Jayadev Acharya Cornell Hirakendu Das Yahoo Alon Orlitsky UCSD Ananda Suresh Google Shannon Channel March 5, 2018 Based on:

Ingredient1:GoodnessofML

Page 36: Estimating Symmetric Distribution Properties Maximum ......Jayadev Acharya Cornell Hirakendu Das Yahoo Alon Orlitsky UCSD Ananda Suresh Google Shannon Channel March 5, 2018 Based on:

GeneralMLpluginestimation

𝒫: collectionofdistributionsoveranabstract domain𝒵

𝑓:𝒫 → ℝ anyproperty

Given𝑧 ∈ 𝒵 estimate𝑓

MLestimator:

Determine𝑝¤§¨ ≜ arg maxM∈𝒫

𝑝 𝑧

Output𝑓(𝑝¤§¨)HowgoodisMLE?

Page 37: Estimating Symmetric Distribution Properties Maximum ......Jayadev Acharya Cornell Hirakendu Das Yahoo Alon Orlitsky UCSD Ananda Suresh Google Shannon Channel March 5, 2018 Based on:

CompetitivenessofMLplugin

Theorem:Suppose𝑓©: 𝒵 → ℝ issuchthat∀𝑝 ∈ 𝒫,

Pr¬∼M

𝑓© 𝑍 − 𝑓 𝑝 > 𝜀 < 𝛿,

thenMLEpluginerrorboundedby

Pr¬∼M

𝑓 𝑝¤§¨ − 𝑓 𝑝 > 2 ⋅ 𝜀 < 𝛿 ⋅ |𝒵|.

Competitive withthebest𝑓©

Page 38: Estimating Symmetric Distribution Properties Maximum ......Jayadev Acharya Cornell Hirakendu Das Yahoo Alon Orlitsky UCSD Ananda Suresh Google Shannon Channel March 5, 2018 Based on:

CompetitivenessofMLEplugin- proofConsiderany𝑝 ∈ 𝒫

𝒵°± ≜ {𝑧 ∈ 𝒵: 𝑝 𝑧 ≥ 𝛿}

• 𝑧 ∈ 𝒵°±:• 𝑓© 𝑧 − 𝑓(𝑝) ≤ 𝜀 (byconditioninTheorem)

• 𝑝¤§¨ 𝑧 ≥ 𝑝 𝑧 ≥ 𝛿,hence 𝑓© 𝑧 − 𝑓(𝑝¤§¨) ≤ 𝜀• Triangleinequality: 𝑓(𝑝¤§¨) − 𝑓 𝑝 ≤ 2𝜀

• 𝑧 ∈ 𝒵²± ≜ {𝑧: 𝑝 𝑧 < 𝛿}

• Pr 𝑓 𝑝¤§¨³ − 𝑓 𝑝 > 2𝜀 ≤ Pr 𝒵²± < 𝛿 ⋅ 𝒵

Page 39: Estimating Symmetric Distribution Properties Maximum ......Jayadev Acharya Cornell Hirakendu Das Yahoo Alon Orlitsky UCSD Ananda Suresh Google Shannon Channel March 5, 2018 Based on:

Ingredient2:Errorprobabilities

Page 40: Estimating Symmetric Distribution Properties Maximum ......Jayadev Acharya Cornell Hirakendu Das Yahoo Alon Orlitsky UCSD Ananda Suresh Google Shannon Channel March 5, 2018 Based on:

PMLperformancebound

Theorem:If𝑛 = 𝑆 𝑓, 𝒫, 𝜀, 𝛿 ,then

𝑆MX� 𝑓, 𝒫, 2 ⋅ 𝜀, Φ; ⋅ 𝛿 ≤ 𝑛

|Φ;|:numberofprofilesoflength𝑛

Profileoflengthn:partitionofn

{3},{1,2}, {1,1,1}➞ 3,2+1,1+1+1

|Φ;| = partition#of𝑛

Hardy-Ramanujan: |Φ;| < 𝑒~ ;

Page 41: Estimating Symmetric Distribution Properties Maximum ......Jayadev Acharya Cornell Hirakendu Das Yahoo Alon Orlitsky UCSD Ananda Suresh Google Shannon Channel March 5, 2018 Based on:

PMLperformance:Try1Theorem:𝑛 = 𝑆 𝑓, 𝒫, 𝜀, 1/3 ⇒ 𝑆MX� 𝑓, 𝒫, 2𝜀, 1/3 ≤ 𝑂(𝑛<).

Proof:• Boosterrorprobability:• Take𝑛 ⋅ ℓ independentsamples,anddividetheminℓ parts• Estimate𝑓 foreachofthesamples• Takethemedianoftheestimates⟹ 𝛿 < exp(−ℓ)

• Φ;ℓ < exp(3 𝑛ℓ .º)

ErrorprobabilityofPMLmostexp −ℓ + 3 𝑛ℓ .º

Whenℓ > 𝑛,theerrordominates#profiles

Page 42: Estimating Symmetric Distribution Properties Maximum ......Jayadev Acharya Cornell Hirakendu Das Yahoo Alon Orlitsky UCSD Ananda Suresh Google Shannon Channel March 5, 2018 Based on:

PMLperformance:Try2Recall

𝑆 𝐻, Δ$, 𝜀, 1/3 = Θ𝑘

𝜀 ⋅ log 𝑘Withtwicethesampleserrordropsexponentially

𝑆 𝐻, Δ$, 𝜀, 𝑒»;¼.½ = Θ $

¾⋅y¿À $

• Modified estimatorswithsmallboundeddifferences• StrongerguaranteesfromMcDiarmid’s inequalityMosttechnicalpartofthepaperSimilarresultsforotherproperties

Page 43: Estimating Symmetric Distribution Properties Maximum ......Jayadev Acharya Cornell Hirakendu Das Yahoo Alon Orlitsky UCSD Ananda Suresh Google Shannon Channel March 5, 2018 Based on:

Combiningeverything

Fasterrorforpropertieswestudy:

If 𝒏 = 𝑺 𝒇,𝒫, 𝜺, 𝟏/𝟑 ,then 𝑺 𝒇,𝒫, 𝜺, 𝒆»𝟒 𝒏 ≤ 𝟒𝒏

MLplug-inresult:

If 𝑺 𝒇,𝒫, 𝜺, 𝒆»𝟒 𝒏 ≤ 𝟒𝒏,then 𝑺𝒑𝒎𝒍 𝒇,𝒫, 𝟐𝜺, 𝒆» 𝒏 ≤ 𝟒𝒏

Combining,wearedone!

Page 44: Estimating Symmetric Distribution Properties Maximum ......Jayadev Acharya Cornell Hirakendu Das Yahoo Alon Orlitsky UCSD Ananda Suresh Google Shannon Channel March 5, 2018 Based on:

ComputingPMLdistribution• EMalgorithm[OrlitskyPanSajamaSanthanamViswanathanZhang‘04,’13]

• ApproximatePMLviaBethePermanents[Vontobel’14]• ExtensionsofMarkovChains[Vatedka Vontobel’16]• Approximationviarelaxation[JiaoPavlichinWeissman ‘17]:

Page 45: Estimating Symmetric Distribution Properties Maximum ......Jayadev Acharya Cornell Hirakendu Das Yahoo Alon Orlitsky UCSD Ananda Suresh Google Shannon Channel March 5, 2018 Based on:

Chapter5:Directions

Page 46: Estimating Symmetric Distribution Properties Maximum ......Jayadev Acharya Cornell Hirakendu Das Yahoo Alon Orlitsky UCSD Ananda Suresh Google Shannon Channel March 5, 2018 Based on:

ApproximatePML• Perhaps findingexactPMLishard• CanshowthatapproximatingPMLenough

Question:Computeexp(−𝑛Ê) approximatePMLforany𝛽 < 1

Eventhisisoptimal(forlarge𝑘)

Page 47: Estimating Symmetric Distribution Properties Maximum ......Jayadev Acharya Cornell Hirakendu Das Yahoo Alon Orlitsky UCSD Ananda Suresh Google Shannon Channel March 5, 2018 Based on:

Higherdimensions• EstimateKLdivergencebetweendiscretedistributionsgivensamples(underassumptionsofcourse)

[BuZou LiangVeeravalli ‘16,HanJiaoWeissman ’16]

[Acharya’18]:PMLisoptimalforKLdivergenceestimation- Higherorderpartitions

Page 48: Estimating Symmetric Distribution Properties Maximum ......Jayadev Acharya Cornell Hirakendu Das Yahoo Alon Orlitsky UCSD Ananda Suresh Google Shannon Channel March 5, 2018 Based on:

IndependentProofTechniquesMLEgoodwhensomethingelseisgood

• MLperformanceindependentofotherresults?

Page 49: Estimating Symmetric Distribution Properties Maximum ......Jayadev Acharya Cornell Hirakendu Das Yahoo Alon Orlitsky UCSD Ananda Suresh Google Shannon Channel March 5, 2018 Based on:

OtherIsPMLoptimalforevery symmetricproperty?

Canwedosomethingforcontinuousdistributions?

Page 50: Estimating Symmetric Distribution Properties Maximum ......Jayadev Acharya Cornell Hirakendu Das Yahoo Alon Orlitsky UCSD Ananda Suresh Google Shannon Channel March 5, 2018 Based on:

Summary• Symmetricpropertyestimation• PMLplug-inapproach• Universal,simpletostate• Independentofparticularproperties

Page 51: Estimating Symmetric Distribution Properties Maximum ......Jayadev Acharya Cornell Hirakendu Das Yahoo Alon Orlitsky UCSD Ananda Suresh Google Shannon Channel March 5, 2018 Based on:

InFisher’swords…

OfcoursenobodyhasbeenabletoprovethatMLEisbestunderallcircumstances.MLEcomputedwithalltheinformationavailablemayturnouttobeinconsistent.Throwingawayasubstantialpartoftheinformationmayrenderthemconsistent.

R.A.Fisher

Page 52: Estimating Symmetric Distribution Properties Maximum ......Jayadev Acharya Cornell Hirakendu Das Yahoo Alon Orlitsky UCSD Ananda Suresh Google Shannon Channel March 5, 2018 Based on:

ThankYou!