Boolean tensor factorizations - mpi-inf.mpg.depmiettin/papers/BTF_ICDM_slides.pdfFACTORIZATIONS: THE...

BOOLEAN TENSOR FACTORIZATIONS

Pauli Miettinen 14 December 2011

BACKGROUND: TENSORS AND TENSOR FACTORIZATIONS

BACKGROUND: BOOLEAN MATRIX FACTORIZATIONS

• Given a binary matrix X and a positive integer R, find two binary matrices A and B such that A has R columns, B has R rows, and X ≈ A ◦ B.

• A ◦ B is the Boolean matrix product of A and B,

$(A \circ B)_{ij} = \bigvee_{l=1}^{R} a_{il} b_{lj}$
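The Boolean matrix product can be sketched in a few lines of NumPy. This is a minimal illustration, not code from the talk: the function name `boolean_product` and the example matrices are assumptions made here. It computes the ordinary product and thresholds it, which is the same as taking the OR over l of (a_il AND b_lj).

```python
import numpy as np

def boolean_product(A, B):
    """Boolean matrix product: entry (i, j) is the OR over l of (A[i, l] AND B[l, j])."""
    # The integer product counts the matching l; thresholding applies 1 + 1 = 1.
    return (A.astype(int) @ B.astype(int) > 0).astype(int)

A = np.array([[1, 1],
              [0, 1],
              [1, 0]])
B = np.array([[1, 1, 0],
              [0, 1, 1]])

print(A @ B)                  # ordinary product: entry (0, 1) is 2
print(boolean_product(A, B))  # Boolean product: the same entry is 1
```

The two products differ only where more than one of the R components covers the same entry.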

BOOLEAN TENSOR FACTORIZATIONS: THE IDEA

1. Take existing (normal) tensor factorization

2. Make everything binary and define summation as 1 + 1 = 1

3. Try to understand what you just did.

Research problem. What can we say about Boolean tensor factorizations and how do they relate to normal tensor factorizations and Boolean matrix factorizations?

RANK-1 (BOOLEAN) TENSORS

X = a × b (two-way case: a rank-1 matrix)

X = a ×₁ b ×₂ c (three-way case), that is, x_ijk = a_i b_j c_k
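As a small illustration, a rank-1 matrix and a rank-1 three-way tensor can be built as outer products of vectors. The vectors below are made-up examples. For binary vectors the Boolean and the ordinary outer product coincide, because AND on {0, 1} is the same as multiplication.

```python
import numpy as np

a = np.array([1, 0, 1])
b = np.array([1, 1, 0, 0])
c = np.array([0, 1])

X2 = np.outer(a, b)                    # x_ij  = a_i * b_j
X3 = np.einsum("i,j,k->ijk", a, b, c)  # x_ijk = a_i * b_j * c_k

print(X2.shape, X3.shape)  # (3, 4) (3, 4, 2)
print(X3.sum())            # 2 * 2 * 1 = 4 ones
```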

THE CP TENSOR DECOMPOSITION

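$x_{ijk} \approx \sum_{r=1}^{R} a_{ir} b_{jr} c_{kr}$

[Figure: X approximated by a sum of R rank-1 tensors built from (a_1, b_1, c_1), (a_2, b_2, c_2), …, (a_R, b_R, c_R)]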


THE BOOLEAN CP TENSOR DECOMPOSITION

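$x_{ijk} \approx \bigvee_{r=1}^{R} a_{ir} b_{jr} c_{kr}$

[Figure: X approximated by the Boolean OR (∨) of R rank-1 tensors built from (a_1, b_1, c_1), (a_2, b_2, c_2), …, (a_R, b_R, c_R)]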
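The difference between the normal and the Boolean CP reconstruction can be sketched as follows. The function names and the factor matrices are illustrative assumptions, not taken from the talk. The Boolean version replaces the sum over r with an OR, which for binary factors amounts to thresholding the ordinary sum.

```python
import numpy as np

def cp_reconstruct(A, B, C):
    """Normal CP: x_ijk = sum over r of a_ir * b_jr * c_kr."""
    return np.einsum("ir,jr,kr->ijk", A, B, C)

def boolean_cp_reconstruct(A, B, C):
    """Boolean CP: x_ijk = OR over r of (a_ir AND b_jr AND c_kr)."""
    return (cp_reconstruct(A, B, C) > 0).astype(int)

A = np.array([[1, 1], [1, 0], [0, 1]])  # 3-by-2 factor matrix
B = np.array([[1, 1], [0, 1]])          # 2-by-2 factor matrix
C = np.array([[1, 1], [1, 0]])          # 2-by-2 factor matrix

X_sum = cp_reconstruct(A, B, C)
X_or = boolean_cp_reconstruct(A, B, C)
print(X_sum[0, 0, 0], X_or[0, 0, 0])  # 2 1: both components cover element (0, 0, 0)
```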


DIGRESSION: FREQUENT TRI-ITEMSET MINING

• Rank-1 N-way binary tensors define an N-way itemset

• Particularly, rank-1 binary matrices define an itemset

• In itemset mining the induced sub-tensor must be full of 1s

• Here, the itemsets can have holes (the induced sub-tensor may contain 0s)

• Boolean CP decomposition = lossy N-way tiling
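To make the contrast with itemset mining concrete, the sketch below measures how full the sub-tensor induced by a rank-1 component is. The helper name `block_density` and the toy data are assumptions for illustration. A density of 1 corresponds to an exact tri-itemset, while anything lower is a block with holes.

```python
import numpy as np

def block_density(X, a, b, c):
    """Fraction of 1s in the sub-tensor of X selected by binary indicator vectors a, b, c."""
    sub = X[np.ix_(a.astype(bool), b.astype(bool), c.astype(bool))]
    return sub.mean()

X = np.zeros((3, 3, 2), dtype=int)
X[:2, :2, :] = 1   # a full 2-by-2-by-2 block of 1s ...
X[0, 0, 0] = 0     # ... with one hole

a = np.array([1, 1, 0])
b = np.array([1, 1, 0])
c = np.array([1, 1])

print(block_density(X, a, b, c))  # 0.875: 7 of the 8 selected elements are 1
```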

TENSOR RANK

The rank of a tensor is the minimum number of rank-1 tensors needed to represent the tensor exactly.

[Figure: X equal to a sum of R rank-1 tensors built from (a_1, b_1, c_1), (a_2, b_2, c_2), …, (a_R, b_R, c_R)]

BOOLEAN TENSOR RANK

The Boolean rank of a binary tensor is the minimum number of binary rank-1 tensors needed to represent the tensor exactly using Boolean arithmetic.

[Figure: X equal to the Boolean OR (∨) of R rank-1 tensors built from (a_1, b_1, c_1), (a_2, b_2, c_2), …, (a_R, b_R, c_R)]

SOME RESULTS ON RANKS

• Normal tensor rank is NP-hard to compute

• Normal tensor rank of n-by-m-by-k tensor can be more than min{n, m, k}

• But no more than min{nm, nk, mk}

• Boolean tensor rank is NP-hard to compute

• Boolean tensor rank of n-by-m-by-k tensor can be more than min{n, m, k}

• But no more than min{nm, nk, mk}
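One way to see the upper bound is a trivial construction, sketched here under the assumption that mk is the smallest of the three products (the other cases follow by permuting the modes). Each of the mk mode-1 fibers becomes its own rank-1 component. The function name `trivial_cp` is made up for this illustration. Since the components never overlap, the same factors are exact under both normal and Boolean arithmetic.

```python
import numpy as np

def trivial_cp(X):
    """Exact CP decomposition with m*k components, one per mode-1 fiber X[:, j, l]."""
    n, m, k = X.shape
    A = np.zeros((n, m * k), dtype=X.dtype)
    B = np.zeros((m, m * k), dtype=X.dtype)
    C = np.zeros((k, m * k), dtype=X.dtype)
    r = 0
    for j in range(m):
        for l in range(k):
            A[:, r] = X[:, j, l]  # the fiber itself
            B[j, r] = 1           # unit vector selecting column j
            C[l, r] = 1           # unit vector selecting slice l
            r += 1
    return A, B, C

rng = np.random.default_rng(0)
X = (rng.random((4, 3, 2)) < 0.5).astype(int)
A, B, C = trivial_cp(X)
X_hat = np.einsum("ir,jr,kr->ijk", A, B, C)
print(A.shape[1], np.array_equal(X, X_hat))  # 6 True
```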

SPARSITY

• A binary matrix X of Boolean rank R with |X| 1s has a Boolean rank-R decomposition A ◦ B such that |A| + |B| ≤ 2|X| [M., ICDM ’10]

• A binary N-way tensor X of Boolean tensor rank R has a Boolean rank-R CP-decomposition with factor matrices A1, A2, …, AN such that ∑i |Ai| ≤ N|X|

• Both results are existential only and extend to approximate decompositions

THE TUCKER TENSOR DECOMPOSITION

$x_{ijk} \approx \sum_{p=1}^{P} \sum_{q=1}^{Q} \sum_{r=1}^{R} g_{pqr} a_{ip} b_{jq} c_{kr}$

THE BOOLEAN TUCKER TENSOR DECOMPOSITION

$x_{ijk} \approx \bigvee_{p=1}^{P} \bigvee_{q=1}^{Q} \bigvee_{r=1}^{R} g_{pqr} a_{ip} b_{jq} c_{kr}$
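A minimal sketch of the Boolean Tucker reconstruction follows. The function name and the random binary factors are assumptions made for illustration. As a sanity check it uses the standard fact that a CP decomposition is a Tucker decomposition whose core is superdiagonal.

```python
import numpy as np

def boolean_tucker_reconstruct(G, A, B, C):
    """x_ijk = OR over (p, q, r) of (g_pqr AND a_ip AND b_jq AND c_kr)."""
    counts = np.einsum("pqr,ip,jq,kr->ijk", G, A, B, C)
    return (counts > 0).astype(int)

rng = np.random.default_rng(0)
R = 3
A = (rng.random((4, R)) < 0.4).astype(int)
B = (rng.random((5, R)) < 0.4).astype(int)
C = (rng.random((2, R)) < 0.4).astype(int)

# Superdiagonal core: g_rrr = 1 and every other element is 0.
G = np.zeros((R, R, R), dtype=int)
G[np.arange(R), np.arange(R), np.arange(R)] = 1

X_tucker = boolean_tucker_reconstruct(G, A, B, C)
X_cp = (np.einsum("ir,jr,kr->ijk", A, B, C) > 0).astype(int)
print(np.array_equal(X_tucker, X_cp))  # True
```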

THE ALGORITHMS

• The normal CP-decomposition can be computed using matricization and alternating least squares (ALS)

• ⊙ is the Khatri–Rao matrix product

• (C ⊙ B)^T is R-by-mk

• For normal matrices, we can use standard least-squares projections

• One projection per mode

• Similar algorithms for the Tucker decomposition

$X_{(1)} = A (C \odot B)^T$

$X_{(2)} = B (C \odot A)^T$

$X_{(3)} = C (B \odot A)^T$
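The matricized equations suggest the following sketch of ALS for the normal CP decomposition. It is an illustration under stated assumptions, not the implementation used in the talk. The helper names (`unfold`, `khatri_rao`, `cp_als`), the random initialization, and the fixed iteration count are choices made here. The unfolding lets earlier modes vary fastest along the columns, which is the ordering that makes the three equations hold with this Khatri–Rao product.

```python
import numpy as np

def unfold(X, mode):
    """Matricize X along `mode`; remaining modes are ordered with the earliest varying fastest."""
    return np.reshape(np.moveaxis(X, mode, 0), (X.shape[mode], -1), order="F")

def khatri_rao(U, V):
    """Column-wise Kronecker product of U (p-by-R) and V (q-by-R), giving a pq-by-R matrix."""
    return (U[:, None, :] * V[None, :, :]).reshape(-1, U.shape[1])

def cp_reconstruct(A, B, C):
    return np.einsum("ir,jr,kr->ijk", A, B, C)

def cp_als(X, R, n_iter=50, seed=0):
    """Alternating least squares: one least-squares projection per mode per sweep."""
    rng = np.random.default_rng(seed)
    n, m, k = X.shape
    A = rng.standard_normal((n, R))
    B = rng.standard_normal((m, R))
    C = rng.standard_normal((k, R))
    for _ in range(n_iter):
        # Each line solves X_(mode) = F @ KR.T for the factor F in the least-squares sense.
        A = np.linalg.lstsq(khatri_rao(C, B), unfold(X, 0).T, rcond=None)[0].T
        B = np.linalg.lstsq(khatri_rao(C, A), unfold(X, 1).T, rcond=None)[0].T
        C = np.linalg.lstsq(khatri_rao(B, A), unfold(X, 2).T, rcond=None)[0].T
    return A, B, C

rng = np.random.default_rng(42)
n, m, k, R = 5, 4, 3, 2
A0, B0, C0 = rng.random((n, R)), rng.random((m, R)), rng.random((k, R))
X = cp_reconstruct(A0, B0, C0)

print(khatri_rao(C0, B0).T.shape)  # (2, 12), i.e. R-by-mk
A, B, C = cp_als(X, R)
print(np.linalg.norm(X - cp_reconstruct(A, B, C)) / np.linalg.norm(X))
```

The last line prints the relative reconstruction error, which depends on the initialization, so no fixed value is claimed here.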

THE ALGORITHMS

• For the Boolean case, the matrix product must be changed to the Boolean matrix product

• The Khatri–Rao product stays as it is

• Finding the optimal projection is NP-hard even to approximate

• Good initial values are needed due to multiple local minima

• They are obtained by applying Boolean matrix factorization to the matricizations

$X_{(1)} = A \circ (C \odot B)^T$

$X_{(2)} = B \circ (C \odot A)^T$

$X_{(3)} = C \circ (B \odot A)^T$
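The Boolean matricized equations can be checked numerically. The sketch below only verifies the three identities on a random Boolean CP tensor; it does not attempt the NP-hard projection step. The helper names and the random factors are assumptions for illustration, and the unfolding lets earlier modes vary fastest along the columns.

```python
import numpy as np

def unfold(X, mode):
    """Matricize X along `mode`; remaining modes are ordered with the earliest varying fastest."""
    return np.reshape(np.moveaxis(X, mode, 0), (X.shape[mode], -1), order="F")

def khatri_rao(U, V):
    """Column-wise Kronecker product; unchanged in the Boolean setting."""
    return (U[:, None, :] * V[None, :, :]).reshape(-1, U.shape[1])

def boolean_product(A, B):
    """Boolean matrix product: ordinary product followed by thresholding."""
    return (A.astype(int) @ B.astype(int) > 0).astype(int)

rng = np.random.default_rng(1)
n, m, k, R = 4, 3, 2, 2
A = (rng.random((n, R)) < 0.5).astype(int)
B = (rng.random((m, R)) < 0.5).astype(int)
C = (rng.random((k, R)) < 0.5).astype(int)

# Boolean CP tensor: OR over r of the rank-1 components.
X = (np.einsum("ir,jr,kr->ijk", A, B, C) > 0).astype(int)

checks = [
    np.array_equal(unfold(X, 0), boolean_product(A, khatri_rao(C, B).T)),
    np.array_equal(unfold(X, 1), boolean_product(B, khatri_rao(C, A).T)),
    np.array_equal(unfold(X, 2), boolean_product(C, khatri_rao(B, A).T)),
]
print(checks)  # [True, True, True]
```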

THE TUCKER CASE

• The core tensor has global effects

• Updates are hard

• Core tensor is usually small

• We can afford more time per element

• In Boolean case many changes make no difference

$x_{ijk} \approx \bigvee_{p=1}^{P} \bigvee_{q=1}^{Q} \bigvee_{r=1}^{R} g_{pqr} a_{ip} b_{jq} c_{kr}$
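A toy example, constructed here for illustration, shows why many core changes make no difference in the Boolean case. If the block that a core element switches on is already covered by another component, the Boolean reconstruction is unchanged, whereas the normal reconstruction changes.

```python
import numpy as np

def tucker(G, A, B, C):
    """Normal Tucker reconstruction; threshold with > 0 for the Boolean version."""
    return np.einsum("pqr,ip,jq,kr->ijk", G, A, B, C)

# Column 0 of each factor is contained in column 1.
A = B = C = np.array([[1, 1],
                      [1, 1],
                      [0, 1]])

G = np.zeros((2, 2, 2), dtype=int)
G[1, 1, 1] = 1     # covers the whole 3-by-3-by-3 tensor
G2 = G.copy()
G2[0, 0, 0] = 1    # switches on a block that is already covered

same_boolean = np.array_equal(tucker(G, A, B, C) > 0, tucker(G2, A, B, C) > 0)
same_normal = np.array_equal(tucker(G, A, B, C), tucker(G2, A, B, C))
print(same_boolean, same_normal)  # True False
```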

SYNTHETIC EXPERIMENTS

[Figure: reconstruction error plots. CP methods (BCP_ALS, CP_ALS, CP_ALSF, CP_OPTB, CP_OPTF) at x-axis values 4, 8, 16, 32, 64; Tucker methods (BTucker_ALS, Tucker_ALS, Tucker_ALSF) at (4, 4, 4), (8, 4, 4), (8, 8, 8), (16, 8, 4).]

REAL-WORLD EXPERIMENTS

[Figure: reconstruction error plots. CP methods (BCP_ALS, CP_ALS, CP_ALSF, CP_OPTB, CP_OPTF) at ranks R = 5, 10, 15, 30; Tucker methods (including Tucker_ALSF) at ranks (5, 5, 5), (10, 10, 10), (15, 15, 15), (30, 30, 15).]

CONCLUSIONS

• Boolean tensor decompositions are a bit like normal tensor decompositions

• And a bit like Boolean matrix factorizations

• They generalize other data mining techniques in many ways

• The playing field between Boolean and normal tensor factorizations is more level than in the matrix case


Thank You!
