19
Oct. 18, 2004 Univers ity of Modelling Motion Patterns with Video Epitomes Machine Learning Group Meeting University of Toronto Oct. 18, 2004 Vincent Cheung Probabilistic and Statistical Inference Group Electrical & Computer Engineering University of Toronto Toronto, Ontario, Canada Advisor: Dr. Brendan J. Frey

University of Toronto Oct. 18, 2004 Modelling Motion Patterns with Video Epitomes Machine Learning Group Meeting University of Toronto Oct. 18, 2004 Vincent

Embed Size (px)

Citation preview

Page 1: University of Toronto Oct. 18, 2004 Modelling Motion Patterns with Video Epitomes Machine Learning Group Meeting University of Toronto Oct. 18, 2004 Vincent

Oct. 18, 2004

Universityof Toronto

Modelling Motion Patterns with Video Epitomes

Machine Learning Group MeetingUniversity of Toronto

Oct. 18, 2004

Vincent Cheung

Probabilistic and Statistical Inference GroupElectrical & Computer Engineering

University of TorontoToronto, Ontario, Canada

Advisor: Dr. Brendan J. Frey

Page 2: University of Toronto Oct. 18, 2004 Modelling Motion Patterns with Video Epitomes Machine Learning Group Meeting University of Toronto Oct. 18, 2004 Vincent

Cheung2 / 18

ML Group Meeting, Oct. 18, 2004

Outline

● Image epitome► What?► Why?► How?

● Implementation computation issues► Efficiently implementing the learning algorithm

● Video epitome► Extension to videos► Filling-in missing information

Page 3: University of Toronto Oct. 18, 2004 Modelling Motion Patterns with Video Epitomes Machine Learning Group Meeting University of Toronto Oct. 18, 2004 Vincent

Cheung3 / 18

ML Group Meeting, Oct. 18, 2004Im

age

Lea

rnin

gV

ide

o

Image Epitome

● Jojic, N., Frey, B., & Kannan, A. (2003). Epitomic analysis of appearance and shape. In Proc. IEEE ICCV.

● Miniature, condensed version of the image

● Models the image’s shape and textural components.

● Applications► object detection► texture segmentation► image retrieval► compression

Page 4: University of Toronto Oct. 18, 2004 Modelling Motion Patterns with Video Epitomes Machine Learning Group Meeting University of Toronto Oct. 18, 2004 Vincent

Cheung4 / 18

ML Group Meeting, Oct. 18, 2004Im

age

Lea

rnin

gV

ide

o

Image Epitome Examples

Page 5: University of Toronto Oct. 18, 2004 Modelling Motion Patterns with Video Epitomes Machine Learning Group Meeting University of Toronto Oct. 18, 2004 Vincent

Cheung5 / 18

Imag

eL

earn

ing

Vid

eo

ML Group Meeting, Oct. 18, 2004

Learning the Image Epitome (1)

Page 6: University of Toronto Oct. 18, 2004 Modelling Motion Patterns with Video Epitomes Machine Learning Group Meeting University of Toronto Oct. 18, 2004 Vincent

Cheung6 / 18

Imag

eL

earn

ing

Vid

eo

ML Group Meeting, Oct. 18, 2004

Learning the Image Epitome (2)

Epitome

Input image

Training Set

SamplePatches

UnsupervisedLearning

e

Z1 Z2 ZM

TMT2T1Bayesiannetwork

e – epitomeTk – mappingZk – image patch

E:

M:

Page 7: University of Toronto Oct. 18, 2004 Modelling Motion Patterns with Video Epitomes Machine Learning Group Meeting University of Toronto Oct. 18, 2004 Vincent

Cheung7 / 18

Imag

eL

earn

ing

Vid

eo

ML Group Meeting, Oct. 18, 2004

Xe(i,j)

KK

N

N

Cumsum

2

N

N

Xe(i,j)

KK

N

N

Cumsum

2

N

N

Shifted Cumulative Sum Algorithm

(2, 2), (i+1, j+1)

(1, 2), (i, j+1)

(2, 1), (i+1, j)

– col 1+ col (P+1)

– row 1+ row (P+1)

+

+ + pixel (1,1)+ pixel (P+1, P+1)

(1, 1), (i, j)

(2, 2), (i+1, j+1)(2, 2), (i+1, j+1)

(1, 2), (i, j+1)(1, 2), (i, j+1)

(2, 1), (i+1, j)(2, 1), (i+1, j)

– col 1+ col (P+1)

– row 1+ row (P+1)

+

+ + pixel (1,1)+ pixel (P+1, P+1)

(1, 1), (i, j)(1, 1), (i, j)

+-

-+

Page 8: University of Toronto Oct. 18, 2004 Modelling Motion Patterns with Video Epitomes Machine Learning Group Meeting University of Toronto Oct. 18, 2004 Vincent

Cheung8 / 18

Imag

eL

earn

ing

Vid

eo

ML Group Meeting, Oct. 18, 2004

X e

P

P

P

P

Ta

X e

P

P

P

P

Ta

Collecting Sufficient Statistics

Page 9: University of Toronto Oct. 18, 2004 Modelling Motion Patterns with Video Epitomes Machine Learning Group Meeting University of Toronto Oct. 18, 2004 Vincent

Cheung9 / 18

Imag

eL

earn

ing

Vid

eo

ML Group Meeting, Oct. 18, 2004

Extending Epitomes to Videos

● Desire a miniature, condensed version of a video sequence

● Want the epitome to model the basic shape, textural, and motion patterns of the video

● Applications► optic flow► segmentation► texture transfer► layer separation► compression► noise reduction► fill-in / inpainting

Page 10: University of Toronto Oct. 18, 2004 Modelling Motion Patterns with Video Epitomes Machine Learning Group Meeting University of Toronto Oct. 18, 2004 Vincent

Cheung10 / 18

Imag

eL

earn

ing

Vid

eo

ML Group Meeting, Oct. 18, 2004

Input Video

Frame 1Frame 2

Frame 3

Training Set

SamplePatches

Video Epitome

UnsupervisedLearning

Video Epitome

Page 11: University of Toronto Oct. 18, 2004 Modelling Motion Patterns with Video Epitomes Machine Learning Group Meeting University of Toronto Oct. 18, 2004 Vincent

Cheung11 / 18

Imag

eL

earn

ing

Vid

eo

ML Group Meeting, Oct. 18, 2004

Video Epitome Example

Temporally Compressed

Spatially Compressed

Page 12: University of Toronto Oct. 18, 2004 Modelling Motion Patterns with Video Epitomes Machine Learning Group Meeting University of Toronto Oct. 18, 2004 Vincent

Cheung12 / 18

Imag

eL

earn

ing

Vid

eo

ML Group Meeting, Oct. 18, 2004

Video Inpainting (1)

● Fill in missing portions of a video► damaged films► occluding objects

● Reconstruct the missing pixels from the video epitome

Page 13: University of Toronto Oct. 18, 2004 Modelling Motion Patterns with Video Epitomes Machine Learning Group Meeting University of Toronto Oct. 18, 2004 Vincent

Cheung13 / 18

Imag

eL

earn

ing

Vid

eo

ML Group Meeting, Oct. 18, 2004

Video Inpainting (2)

Page 14: University of Toronto Oct. 18, 2004 Modelling Motion Patterns with Video Epitomes Machine Learning Group Meeting University of Toronto Oct. 18, 2004 Vincent

Cheung14 / 18

Imag

eL

earn

ing

Vid

eo

ML Group Meeting, Oct. 18, 2004

Filling-in Missing Data

Likelihood

Joint

VariationalApprox

E-Step

M-Step

Page 15: University of Toronto Oct. 18, 2004 Modelling Motion Patterns with Video Epitomes Machine Learning Group Meeting University of Toronto Oct. 18, 2004 Vincent

Cheung15 / 18

Imag

eL

earn

ing

Vid

eo

ML Group Meeting, Oct. 18, 2004

Missing Channels

● Generalization of the video inpainting problem

● Inpainting► Missing entire pixels

● Missing Channels► Missing one or more of the red, green, or blue (RGB)

components of a given pixel

● Epitome must consolidate multiple patches together to piece together the missing channel information

► No training patch contains all the channel information► Use the epitome to fill-in the missing data

Page 16: University of Toronto Oct. 18, 2004 Modelling Motion Patterns with Video Epitomes Machine Learning Group Meeting University of Toronto Oct. 18, 2004 Vincent

Cheung16 / 18

Imag

eL

earn

ing

Vid

eo

ML Group Meeting, Oct. 18, 2004

Image Missing Channels Fill-in

Page 17: University of Toronto Oct. 18, 2004 Modelling Motion Patterns with Video Epitomes Machine Learning Group Meeting University of Toronto Oct. 18, 2004 Vincent

Cheung17 / 18

Imag

eL

earn

ing

Vid

eo

ML Group Meeting, Oct. 18, 2004

Video Missing Channels Fill-in

Page 18: University of Toronto Oct. 18, 2004 Modelling Motion Patterns with Video Epitomes Machine Learning Group Meeting University of Toronto Oct. 18, 2004 Vincent

Cheung18 / 18

ML Group Meeting, Oct. 18, 2004

Conclusion

● Improved the efficiency of learning image epitomes

● Extended the concept of epitomes to video sequences

● Demonstrated the ability of video epitomes to model motion patterns through video inpainting

Page 19: University of Toronto Oct. 18, 2004 Modelling Motion Patterns with Video Epitomes Machine Learning Group Meeting University of Toronto Oct. 18, 2004 Vincent

Cheung19 / 18

ML Group Meeting, Oct. 18, 2004

Future Work

● Determining the size of the epitome► Dependent on the complexity of the image / video

■ Minimum description length■ Variational Bayesian

● Optimal patch size(s)► Problem specific

● Additional transformations into the epitome► Rotation► Scale

● Additional video epitome applications► Super-resolution► Layer separation► Object recognition