Gibbs Sampling from π‘˜-Determinantal Point Processes Β· 2019-06-13


Gibbs Sampling from π‘˜-Determinantal Point Processes

Based on joint work with

Shayan Oveis Gharan

Alireza Rezaei

University of Washington

Point Process: A distribution on subsets of [𝑁] = {1, 2, … , 𝑁}.

Determinantal Point Process (DPP): There is a PSD kernel 𝐿 ∈ ℝ^(𝑁×𝑁) such that

βˆ€ 𝑆 βŠ† [𝑁]: β„™[𝑆] ∝ det(𝐿_𝑆)



π’Œ-DPP: Conditioning of a DPP on picking subsets of size π‘˜

if 𝑆 = π‘˜: β„™ 𝑆 ∝ det 𝐿𝑆

otherwise : β„™ 𝑆 = 0

Focus of the talk:Sampling from π‘˜-

DPPs
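For a small ground set, the definition above can be applied directly: enumerate every size-π‘˜ subset and weight it by det(𝐿_𝑆). The sketch below is purely illustrative (it is not the talk's algorithm, and the function names are mine); it is only feasible for tiny 𝑁:

```python
# Brute-force k-DPP sampling by enumerating all size-k subsets (only feasible
# for tiny N; shown purely to illustrate the definition P[S] ∝ det(L_S)).
import itertools
import numpy as np

def kdpp_distribution(L, k):
    """All size-k subsets of [N] with their normalized k-DPP probabilities."""
    N = L.shape[0]
    subsets = list(itertools.combinations(range(N), k))
    weights = np.array([np.linalg.det(L[np.ix_(S, S)]) for S in subsets])
    return subsets, weights / weights.sum()

def sample_kdpp(L, k, rng):
    subsets, probs = kdpp_distribution(L, k)
    return set(subsets[rng.choice(len(subsets), p=probs)])

rng = np.random.default_rng(0)
A = rng.normal(size=(5, 5))
L = A @ A.T                      # any PSD kernel works here
S = sample_kdpp(L, 2, rng)       # a random size-2 subset of {0,...,4}
```

Since every principal minor of a PSD matrix is nonnegative, the weights form a valid (unnormalized) distribution.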


DPPs are very popular probabilistic models in machine learning, used to capture diversity.

π’Œ-DPP: Conditioning of a DPP on picking subsets of size π‘˜

if 𝑆 = π‘˜: β„™ 𝑆 ∝ det 𝐿𝑆

otherwise : β„™ 𝑆 = 0

Focus of the talk:Sampling from π‘˜-

DPPs

Applications [Kulesza-Taskar'11, Dang'05, Nenkova-Vanderwende-McKeown'06, Mirzasoleiman-Jegelka-Krause'17]:

β€” Image search, document and video summarization, tweet timeline generation, pose estimation, feature selection


Continuous Domain

Input: a PSD operator 𝐿: π’ž Γ— π’ž β†’ ℝ and an integer π‘˜.

Goal: select a subset 𝑆 βŠ‚ π’ž of π‘˜ points from the distribution with density

𝑝(𝑆) ∝ det[𝐿(π‘₯, 𝑦)]_(π‘₯,π‘¦βˆˆπ‘†)


Applications:

β€” Hyper-parameter tuning [Dodge-Jamieson-Smith'17]

β€” Learning mixtures of Gaussians [Affandi-Fox-Taskar'13]

Example (Gaussian kernel): 𝐿(π‘₯, 𝑦) = exp(βˆ’(π‘₯ βˆ’ 𝑦)^⊀ Ξ£^(βˆ’1) (π‘₯ βˆ’ 𝑦))
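For a concrete point set, the continuous density is easy to evaluate. A minimal sketch with a Gaussian kernel (function names and the choice Ξ£ = 𝐼 are mine, for illustration):

```python
# Unnormalized continuous k-DPP density p(S) ∝ det[L(x, y)]_{x,y in S} for the
# Gaussian kernel L(x, y) = exp(-(x - y)^T Sigma^{-1} (x - y)).
import numpy as np

def gaussian_kernel_matrix(X, Sigma_inv):
    diffs = X[:, None, :] - X[None, :, :]     # pairwise x - y, shape (k, k, d)
    quad = np.einsum('ijd,de,ije->ij', diffs, Sigma_inv, diffs)
    return np.exp(-quad)

def unnormalized_density(X, Sigma_inv):
    return np.linalg.det(gaussian_kernel_matrix(X, Sigma_inv))

Sigma_inv = np.eye(2)
# Spread-out points get higher density than clumped ones: diversity.
spread = unnormalized_density(np.array([[0.0, 0.0], [3.0, 0.0]]), Sigma_inv)
clumped = unnormalized_density(np.array([[0.0, 0.0], [0.1, 0.0]]), Sigma_inv)
```

Two coincident points make the kernel matrix singular, so the density vanishes: the model never repeats a point.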

Random Scan Gibbs Sampler for π’Œ-DPP

1. With probability 1/2, stay at the current state 𝑆 = {π‘₯_1, … , π‘₯_π‘˜}.

2. Otherwise, choose π‘₯_𝑖 ∈ 𝑆 u.a.r.

3. Choose 𝑦 βˆ‰ 𝑆 from the conditional distribution πœ‹(Β· | 𝑆 βˆ’ π‘₯_𝑖 is chosen).

Continuous case: PDF(𝑦) ∝ πœ‹(π‘₯_1, … , π‘₯_(π‘–βˆ’1), 𝑦, π‘₯_(𝑖+1), … , π‘₯_π‘˜). In the discrete case the state space is the set of size-π‘˜ subsets of [𝑁].

Main Result

Given a π‘˜-DPP πœ‹, an β€œapproximate” sample from πœ‹ can be generated by running the Gibbs sampler for 𝝉 = Õ(π’Œβ΄ Β· log(Var_𝝅(𝒑_𝝁 / 𝒑_𝝅))) steps, where πœ‡ is the starting distribution.


Discrete: A simple greedy initialization gives 𝜏 = 𝑂(π‘˜β΅ log π‘˜). The total running time is 𝑂(𝑁 Β· poly(π‘˜)).

This does not improve upon previous MCMC methods [Anari-Oveis Gharan-Rezaei'16].

The mixing time is independent of 𝑁, so the running time in distributed settings is sublinear.
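The greedy initialization for the discrete case can be sketched as growing the starting set one element at a time, always adding the item that maximizes the resulting determinant. This is an illustrative version; the paper's initialization may differ in details:

```python
# Greedy initialization for a discrete k-DPP: grow S one element at a time,
# at each step adding the item j that maximizes det(L_{S + j}).
import numpy as np

def greedy_init(L, k):
    S = []
    for _ in range(k):
        remaining = [j for j in range(L.shape[0]) if j not in S]
        best = max(remaining,
                   key=lambda j: np.linalg.det(L[np.ix_(S + [j], S + [j])]))
        S.append(best)
    return S

rng = np.random.default_rng(2)
A = rng.normal(size=(6, 6))
L = A @ A.T
S0 = greedy_init(L, 3)           # warm start for the Gibbs chain
```

The point of a warm start is to make the log(Var) term in the mixing-time bound small, yielding the stated 𝜏 = 𝑂(π‘˜β΅ log π‘˜).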


Continuous: Given access to conditional oracles, πœ‡ can be found so that 𝜏 = 𝑂(π‘˜β΅ log π‘˜).

This is the first algorithm with a theoretical guarantee for sampling from continuous π‘˜-DPPs.

The remaining challenge is being able to run the chain, i.e., to implement the conditional oracles.


Using a rejection sampler as the conditional oracle for Gaussian kernels 𝐿(π‘₯, 𝑦) = exp(βˆ’β€–π‘₯ βˆ’ 𝑦‖²/𝜎²) defined on a unit sphere in ℝ^𝑑, the total running time is:

β€’ If π‘˜ = poly(𝑑): poly(𝑑, 𝜎)

β€’ If π‘˜ ≀ 𝑒^(𝑑^(1βˆ’π›Ώ)) and 𝜎 = 𝑂(1): poly(𝑑) Β· π‘˜^(𝑂(1/𝛿))

