23
TIM @ SBU 2008

TIM @ SBU 2008

  • Upload
    winona

  • View
    30

  • Download
    2

Embed Size (px)

DESCRIPTION

TIM @ SBU 2008. Outline. Results and User Study. Implementation Steps. AutoCollage Framework. TIM @ SBU 2008. About AutoCollage. An automatic procedure for constructing a visually appealing collage from a collection of input images. TIM @ SBU 2008. - PowerPoint PPT Presentation

Citation preview

Page 1: TIM @ SBU   2008

TIM @ SBU 2008

Page 2: TIM @ SBU   2008

AutoCollage FrameworkImplementation StepsResults and User Study

TIM @ SBU 2008

Outline

Page 3: TIM @ SBU   2008

TIM @ SBU 2008

About AutoCollage

An automatic procedure for constructing a visually appealing collage from a collection of input images.

Page 4: TIM @ SBU   2008

TIM @ SBU 2008

AutoCollage : A Labelling System

Collage : I cInput Images : I = {I1, . . . , IN}Domain : PEach pixel-location : p P∈Label : L(p) = (n, s) in which In I and s S ∈ ∈Compact form : I(p) = S(p,L(p)) = In(p−s)

The labelling L = {L(p), p P} completely specifies the collage.∈

Goal : To find the labelling L which minimises the energy/cost E(L) E(L) = Erep(L) + Wimp Eimp(L) + Wtrans Etrans(L) + Wobj Eobj(L)

Page 5: TIM @ SBU   2008

AutoCollage FrameworkImplementation StepsResults and User Study

TIM @ SBU 2008

Page 6: TIM @ SBU   2008

TIM @ SBU 2008

Representative Images

Page 7: TIM @ SBU   2008

TIM @ SBU 2008

Representative Images

The cost associated with the set Is of chosen images is of the form Erep = ∑n Erep(n) where

Erep(n) = −an Dr(n) − min an am Vr(n,m) m:Im I∈ s

an = 1 if p P with L(p) = (n, s)∃ ∈Dr(n) = Entropy(In) + Wfaced δ({Image n contains a face})δ(π) = 1 if predicate π is trueWface weights the influence of an image containing a faceVr(m,n) shows pairwise distances between images

E(L) = Erep(L) + Wimp Eimp(L) + Wtrans Etrans(L) + Wobj Eobj(L)

Page 8: TIM @ SBU   2008

TIM @ SBU 2008

Region ofInterest

Page 9: TIM @ SBU   2008

TIM @ SBU 2008

Region of Interest

E(L) = Erep(L) + Wimp Eimp(L) + Wtrans Etrans(L) + Wobj Eobj(L)

The Eimp term ensures that a substantial and interesting region of interest (ROI) is selected from each image in Is.

Eimp(L) = − ∑p G(p, L(p)) T(p, L(p))

G(p, L(p)) is the Gaussian weighting function that favors the center of the input image from which p is drawn.

T(p, L(p)) measures the local entropy of a 32×32-pixel region around the pixel p, and is normalized so that local entropy sums to 1 over a given input image.

Page 10: TIM @ SBU   2008

TIM @ SBU 2008

Packing ROIs

Page 11: TIM @ SBU   2008

TIM @ SBU 2008

Packing ROIs : Constraints

1. Faces should be regard as a preferred material

2. Sky should be constrained to appear at the top

3. No two ROIs should intersect

4. Every pixel is couvered by an image

Page 12: TIM @ SBU   2008

TIM @ SBU 2008

Packing ROIs : The Energy

E(L) = Erep(L) + Wimp Eimp(L) + Wtrans Etrans(L) + Wobj Eobj(L)

Eobj incorporates information on object recognition, and favors placement of objects in reasonable configurations.

For faces,

Eobj = ∑p, q N∈ f (p, q, L(p), L(q))

f (p,q,L(p),L(q)) = ∞ whenever L(p) ≠ L(q) and p,q are pixels from the same face in either the images of L(p) or L(q), 0 otherwise.

For sky, rather than defining an explicit energy, we simply label images containing sky and pass this information to the constraint satisfaction engine which attempts to position such images only at the top of the collage.

Page 13: TIM @ SBU   2008

TIM @ SBU 2008

Transition Between Images

Page 14: TIM @ SBU   2008

TIM @ SBU 2008

Transition between Images

E(L) = Erep(L) + Wimp Eimp(L) + Wtrans Etrans(L) + Wobj Eobj(L)

Etrans is a pairwise term which penalises any transition between images that is not visually appealing.

Etrans = ∑p,q N∈ VT (p, q, L(p), L(q))

N is the set of all pairs of neighboring (8-neighborhood) pixels.

Compact form: I(p) = S(p,L(p)) = In(p−s)ε = 0.001 prevents underflow||.|| defines the Euclidean norm

L(q))L(p),q,(p, VT

)||L(q))S(q,-L(q))S(p,||

||L(q))S(p,-L(p))S(p,||,

||L(p))S(q,-L(p))S(p,||

||L(q))S(q,-L(p))S(q,||(min

Page 15: TIM @ SBU   2008

TIM @ SBU 2008

Transition between Images

E(L) = Erep(L) + Wimp Eimp(L) + Wtrans Etrans(L) + Wobj Eobj(L)

Etrans measures mismatch across the boundary betweentwo input images.

1. VT (p, q, L(p), L(q)) = 0 unless L(p) ≠ L(q). 2. VT (p, q, L(p), L(q)) is small if there is a strong gradient in one of the input images, since the relevant denominator will then be large.

L(q))L(p),q,(p, VT

)||L(q))S(q,-L(q))S(p,||

||L(q))S(p,-L(p))S(p,||,

||L(p))S(q,-L(p))S(p,||

||L(q))S(q,-L(p))S(q,||(min

Page 16: TIM @ SBU   2008

TIM @ SBU 2008

AutoCollage : Constraints

1. Information bound Any image In that is present in the labelling must satisfy Eimp(L,n) > T.Eimp(L,n) [0,1]∈ is the proportion of local image information that is captured in the ROI.

2. Uniform shift A given input image In may appear in the collage with one unique shift s. Given two distinct pixels p, q P : p ≠ q, ∈ with labels L(p) = (n, s), L(q) = (n, s), ′ it is required that s = s .′

3. Connectivity Each set Sn {p P: L(p)=(n, s), for some s} ∈ ∈ of collage pixels drawn from image n, should form a connected region.

Page 17: TIM @ SBU   2008

TIM @ SBU 2008

Implementation Summary

Erep tends to select the images from the input image set that are most representative.Eimp term ensures that a substantial and interesting region of interest (ROI) is selected from each image in I.Etrans is a pairwise term which penalises any transition between images that is not visually appealing. Eobj incorporates information on object recognition, and favors placement of objects in reasonable configurations.

Wimp, Wtrans, Wobj and Wfaced are weighting parameters and have been adjusted by informal testing over 50 sets of images.

E(L) = Erep(L) + Wimp Eimp(L) + Wtrans Etrans(L) + Wobj Eobj(L)

Page 18: TIM @ SBU   2008

TIM @ SBU 2008

Demo

Page 19: TIM @ SBU   2008

AutoCollage FrameworkImplementation StepsResults and User Study

TIM @ SBU 2008

Page 20: TIM @ SBU   2008

TIM @ SBU 2008

Results : Comparison

AutoCollage

Tapestry

Page 21: TIM @ SBU   2008

TIM @ SBU 2008

Results : Limitations

1. Occasional inclusion of sky fragments in the interior (83% Accuracy)

2. Occasionally the face detection fails, allowing inappropriate cut 3. Sometimes texture edges trigger inappropriately sharp transitions

4. Lake of user interaction

Page 22: TIM @ SBU   2008

TIM @ SBU 2008

User Study

Page 23: TIM @ SBU   2008

TIM @ SBU 2008