23
Karthik Gurumoorthy Ajit Rajwade Arunava Banerjee Anand Rangarajan Department of CISE University of Florida 1

Image Compression by Learning Matrix Ortho-normal Bases

Embed Size (px)

DESCRIPTION

Image Compression by Learning Matrix Ortho-normal Bases. Karthik Gurumoorthy Ajit Rajwade Arunava Banerjee Anand Rangarajan Department of CISE University of Florida. Overview. A new approach to lossy image compression based on machine learning . - PowerPoint PPT Presentation

Citation preview

Page 1: Image Compression by Learning Matrix Ortho-normal Bases

Karthik GurumoorthyAjit Rajwade

Arunava BanerjeeAnand Rangarajan

Department of CISEUniversity of Florida

1

Page 2: Image Compression by Learning Matrix Ortho-normal Bases

A new approach to lossy image compression based on machine learning.

Key idea: Learning of Matrix Ortho-normal Bases from training data to efficiently code images.

Applied to compression of well-known face databases like ORL, Yale.

Competitive with JPEG.2

Page 3: Image Compression by Learning Matrix Ortho-normal Bases

Vector

Conventional learning methods in vision like PCA,

ICA, etc.

21 MM

Image 211 MM

3

Page 4: Image Compression by Learning Matrix Ortho-normal Bases

Our approach following Rangarajan [EMMCVPR-2001] &

Ye [JMLR-2004]

Treated as a21 MM

Image 21 MM

Matrix

4

Page 5: Image Compression by Learning Matrix Ortho-normal Bases

Image of size divided into N patches of size

each treated as a Matrix.

21 MM

Image

21 qq

5

21 MM

2211 , MqMq

Page 6: Image Compression by Learning Matrix Ortho-normal Bases

6

=P U S V

TUSVP

U and V:Ortho-normal matrices

S: Diagonal Matrix of singular values

Page 7: Image Compression by Learning Matrix Ortho-normal Bases

7

useful for compression (e.g.: SSVD [Ranade et al-IVC 2007]).

Page 8: Image Compression by Learning Matrix Ortho-normal Bases

8

Consider a set of N image patches:

SVD of each patch gives:

Costly in terms of storage as we need to store N ortho-normal basis pairs.

Tiiii VSUP

Page 9: Image Compression by Learning Matrix Ortho-normal Bases

Produce ortho-normal basis-pairs, common for all N patches.

Since storing the basis pairs is not expensive.

9

NK

NK

Page 10: Image Compression by Learning Matrix Ortho-normal Bases

10

iP aU iaS aV

Taiaai VSUP

iaS•Non-diagonal•Non-sparse

aiTaia VPUS

Page 11: Image Compression by Learning Matrix Ortho-normal Bases

What sparse matrix will optimally reconstruct from ?

Optimally = least error:

Sparse = matrix has at most some non-zero elements.

2|||| FTaiaa VSUP

11

),( aa VUiPiaS

iaS T

Page 12: Image Compression by Learning Matrix Ortho-normal Bases

We have a simple, provably optimal greedy method to compute such a

1. Compute the matrix . 2. In matrix , nullify all except the largest

elements to produce .

aiTaia VPUW

12

iaS

iaW T

iaS

Page 13: Image Compression by Learning Matrix Ortho-normal Bases

A set of N image patches .

Learning K << N ortho-normal basis

pairs )},{( aa VU

13

)1(, NiPi

2

1 1

||||}),),,({( FTaiaai

N

i

K

aiaiaiaaa VSUPMMSVUE

aIVVUU aTaa

Ta , aiTSia ,,|||| 0

MembershipsProjection Matrices

Page 14: Image Compression by Learning Matrix Ortho-normal Bases

Input: N image patches of size .

Output: K pairs of ortho-normal bases

called as dictionary.

14

21 qq

)},{( ii VU

Page 15: Image Compression by Learning Matrix Ortho-normal Bases

Divide each test image into patches of size

Fix per-pixel average error (say e), similar to the “quality” user-parameter in JPEG.

21 qq

15

Page 16: Image Compression by Learning Matrix Ortho-normal Bases

16

.

.

.

.

.

.

1U 1V

2U 2V

KU KV

iP

.

.

.

111 VPUS iT

i

222 VPUS iT

i

KiTKiK VPUS

eqq

VSUP FTaiaai

21

2||||

222 VPUS iT

i

Page 17: Image Compression by Learning Matrix Ortho-normal Bases

)(log10 10 ePSNR

RPP = number of bits per pixel

17

Page 18: Image Compression by Learning Matrix Ortho-normal Bases

18

0.5 bits 0.92 bits 1.36 bits

1.78 bits 3.023 bits

Page 19: Image Compression by Learning Matrix Ortho-normal Bases

Size of original database is 3.46 MB.Size of dictionary of 50 ortho-normal

basis pairs is 56 KB=0.05MB.Size of database after compression

and coding with our method with e = 0.0001 is 1.3 MB.

Total compression rate achieved is 61%.

19

Page 20: Image Compression by Learning Matrix Ortho-normal Bases

)(log10 10 ePSNR

RPP = number of bits per pixel

20

Page 21: Image Compression by Learning Matrix Ortho-normal Bases

New lossy image compression method using machine learning.

Key idea 1: matrix based image representation.

Key idea2: Learning small set of matrix ortho-normal basis pairs tuned to a database.

Results competitive with JPEG standard.

Future extensions: video compression.21

Page 22: Image Compression by Learning Matrix Ortho-normal Bases

A. Rangarajan, Learning matrix space image representations, Energy Minimizing Methods in Computer Vision and Pattern Recognition, 2001.

J. Ye, Generalized low rank approximation of matrices, Journal of Machine Learning Research ,2004.

M. Aharon, M. Elad and A. Bruckstein, The K-SVD: An algorithm for designing of overcomplete dictionaries for sparse representation. IEEE Transactions on Signal Processing, 2006.

A. Ranade, S. Mahabalarao and S. Kale. A variation on SVD based image compression. Image and Vision Computing, 2007.

22

Page 23: Image Compression by Learning Matrix Ortho-normal Bases

23