59
CS 691 Computational Photography Instructor: Gianfranco Doretto Cutting Images

CS 691 Computational Photography Instructor: Gianfranco Doretto Cutting Images

Embed Size (px)

Citation preview

Page 1: CS 691 Computational Photography Instructor: Gianfranco Doretto Cutting Images

CS 691 Computational Photography

Instructor: Gianfranco DorettoCutting Images

Page 2: CS 691 Computational Photography Instructor: Gianfranco Doretto Cutting Images

This Lecture: Finding Seams and Boundaries

Segmentation

Page 3: CS 691 Computational Photography Instructor: Gianfranco Doretto Cutting Images

This Lecture: Finding Seams and Boundaries

Retargetting

http://swieskowski.net/carve/

Page 4: CS 691 Computational Photography Instructor: Gianfranco Doretto Cutting Images

This Lecture: Finding Seams and Boundaries

Stitching

Page 5: CS 691 Computational Photography Instructor: Gianfranco Doretto Cutting Images

This Lecture: Finding seams and boundaries

Fundamental Concept: The Image as a Graph

• Intelligent Scissors: Good boundary = short path

• Graph cuts: Good region has low cutting cost

Page 6: CS 691 Computational Photography Instructor: Gianfranco Doretto Cutting Images

Semi-automated segmentation

User provides imprecise and incomplete specification of region – your algorithm has to read his/her mind.

Key problems1. What groups of pixels form cohesive regions?2. What pixels are likely to be on the boundary of regions?3. Which region is the user trying to select?

Page 7: CS 691 Computational Photography Instructor: Gianfranco Doretto Cutting Images

What makes a good region?

• Contains small range of color/texture• Looks different than background• Compact

Page 8: CS 691 Computational Photography Instructor: Gianfranco Doretto Cutting Images

What makes a good boundary?

• High gradient along boundary• Gradient in right direction• Smooth

Page 9: CS 691 Computational Photography Instructor: Gianfranco Doretto Cutting Images

The Image as a Graph

Node: pixel

Edge: cost of path or cut between two pixels

Page 10: CS 691 Computational Photography Instructor: Gianfranco Doretto Cutting Images

Intelligent ScissorsMortenson and Barrett (SIGGRAPH 1995)

Page 11: CS 691 Computational Photography Instructor: Gianfranco Doretto Cutting Images

Intelligent Scissors

• Formulation: find good boundary between seed points

• Challenges– Minimize interaction time– Define what makes a good boundary– Efficiently find it

Page 12: CS 691 Computational Photography Instructor: Gianfranco Doretto Cutting Images

Intelligent Scissors

A good image boundary has a short path through the graph.

Mortenson and Barrett (SIGGRAPH 1995)

1 2 1

4

1

6

9

1

3

1

4

113

2

3

5

Start

End

Page 13: CS 691 Computational Photography Instructor: Gianfranco Doretto Cutting Images

Intelligent Scissors: method

1. Define boundary cost between neighboring pixels

2. User specifies a starting point (seed)3. Compute lowest cost from seed to each

other pixel4. Get new seed, get path between seeds,

repeat

Page 14: CS 691 Computational Photography Instructor: Gianfranco Doretto Cutting Images

Intelligent Scissors: method

1. Define boundary cost between neighboring pixels

a) Lower if edge is present (e.g., with edge(im, ‘canny’))

b) Lower if gradient is strongc) Lower if gradient is in direction of boundary

Page 15: CS 691 Computational Photography Instructor: Gianfranco Doretto Cutting Images

Gradients, Edges, and Path Cost

Gradient Magnitude

Edge Image

Path Cost

Page 16: CS 691 Computational Photography Instructor: Gianfranco Doretto Cutting Images

Intelligent Scissors: method

1. Define boundary cost between neighboring pixels

2. User specifies a starting point (seed)– Snapping

Page 17: CS 691 Computational Photography Instructor: Gianfranco Doretto Cutting Images

Intelligent Scissors: method

1. Define boundary cost between neighboring pixels

2. User specifies a starting point (seed)3. Compute lowest cost from seed to each

other pixel– Djikstra’s shortest path algorithm

Page 18: CS 691 Computational Photography Instructor: Gianfranco Doretto Cutting Images

Djikstra’s shortest path algorithm

Initialize, given seed s:• Compute cost2(q, r) % cost for boundary from pixel q to

neighboring pixel r• cost(s) = 0 % total cost from seed to this point • A = {s} % set to be expanded• E = { } % set of expanded pixels• P(q) % pointer to pixel that leads to q

Loop while A is not empty1. q = pixel in A with lowest cost2. for each pixel r in neighborhood of q that is not in E

a) cost_tmp = cost(q) + cost2(q,r)

b) if (r is not in A) OR (cost_tmp < cost(r))i. cost(r) = cost_tmp ii. P(r) = qiii. Add r to A

Page 19: CS 691 Computational Photography Instructor: Gianfranco Doretto Cutting Images

Intelligent Scissors: method

1. Define boundary cost between neighboring pixels

2. User specifies a starting point (seed)3. Compute lowest cost from seed to each

other pixel4. Get new seed, get path between seeds,

repeat

Page 20: CS 691 Computational Photography Instructor: Gianfranco Doretto Cutting Images

Intelligent Scissors: improving interaction

1. Snap when placing first seed2. Automatically adjust as user drags3. Freeze stable boundary points to make

new seeds

Page 21: CS 691 Computational Photography Instructor: Gianfranco Doretto Cutting Images

Where will intelligent scissors work well, or have problems?

Page 22: CS 691 Computational Photography Instructor: Gianfranco Doretto Cutting Images

Grab cuts and graph cuts

User Input

Result

Magic Wand (198?)

Intelligent ScissorsMortensen and Barrett (1995)

GrabCut

Regions Boundary Regions & Boundary

Source: Rother

Page 23: CS 691 Computational Photography Instructor: Gianfranco Doretto Cutting Images

Segmentation with graph cuts

Energy(y;θ,data) = ψ 1(yi;θ,data)i∑ + ψ 2 (yi ,yj;θ,data)

i, j∈edges∑

Source (Label 0)

Sink (Label 1)

Cost to assign to 0

Cost to assign to 1

Cost to split nodes

Page 24: CS 691 Computational Photography Instructor: Gianfranco Doretto Cutting Images

Segmentation with graph cuts

Energy(y;θ,data) = ψ 1(yi;θ,data)i∑ + ψ 2 (yi ,yj;θ,data)

i, j∈edges∑

Source (Label 0)

Sink (Label 1)

Cost to assign to 0

Cost to assign to 1

Cost to split nodes

Page 25: CS 691 Computational Photography Instructor: Gianfranco Doretto Cutting Images

Interactive Graph Cuts [Boykov, Jolly ICCV’01]

Image Min Cut

Cut: separating source and sink; Energy: collection of edges

Min Cut: Global minimal enegry in polynomial time

Foreground (source)

Background(sink)

constraints

Page 26: CS 691 Computational Photography Instructor: Gianfranco Doretto Cutting Images

GrabCut Colour Model

Gaussian Mixture Model (typically 5-8 components)

Foreground &Background

Background

Foreground

BackgroundG

R

G

RIterated graph cut

Source: Rother

Page 27: CS 691 Computational Photography Instructor: Gianfranco Doretto Cutting Images

Graph cuts segmentation1. Define graph

– usually 4-connected or 8-connected

2. Set weights to foreground/background– Color histogram or mixture of Gaussians for

background and foreground

3. Set weights for edges between pixels

4. Apply min-cut/max-flow algorithm5. Return to 2, using current labels to

compute foreground, background models€

edge _ potential(i, j) ∝ [y i ≠ y j ]exp− c(i) − c( j)

2

2σ 2

⎧ ⎨ ⎪

⎩ ⎪

⎫ ⎬ ⎪

⎭ ⎪€

unary _ potential(i) = −log(P(c(i);y i))

Page 28: CS 691 Computational Photography Instructor: Gianfranco Doretto Cutting Images

What is easy or hard about these cases for graphcut-based segmentation?

Page 29: CS 691 Computational Photography Instructor: Gianfranco Doretto Cutting Images

Easier examples

GrabCut – Interactive Foreground Extraction 10

Page 30: CS 691 Computational Photography Instructor: Gianfranco Doretto Cutting Images

More difficult Examples

Camouflage & Low Contrast Harder CaseFine structure

Initial Rectangle

InitialResult

GrabCut – Interactive Foreground Extraction 11

Page 31: CS 691 Computational Photography Instructor: Gianfranco Doretto Cutting Images

Lazy Snapping (Li et al. SG 2004)

Page 32: CS 691 Computational Photography Instructor: Gianfranco Doretto Cutting Images

Limitations of Graph Cuts

• Requires associative graphs– Connected nodes should prefer to have

the same label

• Is optimal only for binary problems

Page 35: CS 691 Computational Photography Instructor: Gianfranco Doretto Cutting Images
Page 36: CS 691 Computational Photography Instructor: Gianfranco Doretto Cutting Images

Dynamic Programming

• Well known algorithm design techniques:.– Divide-and-conquer algorithms

• Another strategy for designing algorithms is dynamic programming.– Used when problem breaks down into recurring small

subproblems

• Dynamic programming is typically applied to optimization problems. In such problem there can be many solutions. Each solution has a value, and we wish to find a solution with the optimal value.

Page 37: CS 691 Computational Photography Instructor: Gianfranco Doretto Cutting Images

Dynamic Programming• Dynamic programming is a way of improving on

inefficient divide-and-conquer algorithms.

• By “inefficient”, we mean that the same recursive call is made over and over.

• If same subproblem is solved several times, we can use table to store result of a subproblem the first time it is computed and thus never have to recompute it again.

• Dynamic programming is applicable when the subproblems are dependent, that is, when subproblems share subsubproblems.

• “Programming” refers to a tabular method

Page 38: CS 691 Computational Photography Instructor: Gianfranco Doretto Cutting Images

Elements of Dynamic Programming (DP)

DP is used to solve problems with the following characteristics:

• Simple subproblems – We should be able to break the original problem to smaller

subproblems that have the same structure

• Optimal substructure of the problems – The optimal solution to the problem contains within

optimal solutions to its subproblems.

• Overlapping sub-problems – there exist some places where we solve the same

subproblem more than once.

Page 39: CS 691 Computational Photography Instructor: Gianfranco Doretto Cutting Images

Steps to Designing a Dynamic Programming Algorithm

1. Characterize optimal substructure

2. Recursively define the value of an optimal solution

3. Compute the value bottom up

4. (if needed) Construct an optimal solution

Page 40: CS 691 Computational Photography Instructor: Gianfranco Doretto Cutting Images

11-40

Example: Matrix-chain Multiplication

• Suppose we have a sequence or chain A1, A2, …, An of n matrices to be multiplied

– That is, we want to compute the product A1A2…An

• There are many possible ways (parenthesizations) to compute the product

Page 41: CS 691 Computational Photography Instructor: Gianfranco Doretto Cutting Images

11-41

Matrix-chain Multiplication …

contd

• Example: consider the chain A1, A2, A3, A4 of 4 matrices

– Let us compute the product A1A2A3A4

• There are 5 possible ways:1. (A1(A2(A3A4)))

2. (A1((A2A3)A4))

3. ((A1A2)(A3A4))

4. ((A1(A2A3))A4)

5. (((A1A2)A3)A4)

Page 42: CS 691 Computational Photography Instructor: Gianfranco Doretto Cutting Images

11-42

Matrix-chain Multiplication …

contd

• To compute the number of scalar multiplications necessary, we must know:

– Algorithm to multiply two matrices– Matrix dimensions

• Can you write the algorithm to multiply two matrices?

Page 43: CS 691 Computational Photography Instructor: Gianfranco Doretto Cutting Images

11-43

Algorithm to Multiply 2 Matrices

Input: Matrices Ap×q and Bq×r (with dimensions p×q and q×r)

Result: Matrix Cp×r resulting from the product A·B

MATRIX-MULTIPLY(Ap×q , Bq×r)1. for i ← 1 to p2. for j ← 1 to r3. C[i, j] ← 04. for k ← 1 to q5. C[i, j] ← C[i, j] + A[i, k] · B[k,

j] 6. return C

Scalar multiplication in line 5 dominates time to compute CNumber of scalar multiplications = pqr

Page 44: CS 691 Computational Photography Instructor: Gianfranco Doretto Cutting Images

11-44

Matrix-chain Multiplication …

contd

• Example: Consider three matrices A10x100, B100x5, and C5x50

• There are 2 ways to parenthesize – ((AB)C) = D10x5 · C5x50

• AB 10·100·5=5,000 scalar multiplications• DC 10·5·50 =2,500 scalar multiplications

– (A(BC)) = A10100 · E10050

• BC 100·5·50=25,000 scalar multiplications

• AE 10·100·50 =50,000 scalar multiplications

Total: 7,500

Total: 75,000

Page 45: CS 691 Computational Photography Instructor: Gianfranco Doretto Cutting Images

11-45

Matrix-chain Multiplication …

contd

• Matrix-chain multiplication problem– Given a chain A1, A2, …, An of n

matrices, where for i=1, 2, …, n, matrix Ai has dimension pi-1xpi

– Parenthesize the product A1A2…An such that the total number of scalar multiplications is minimized

• Brute force method of exhaustive search takes time exponential in n

Page 46: CS 691 Computational Photography Instructor: Gianfranco Doretto Cutting Images

11-46

Dynamic Programming Approach

• The structure of an optimal solution– Let us use the notation Ai..j for the

matrix that results from the product Ai Ai+1 … Aj

– An optimal parenthesization of the product A1A2…An splits the product between Ak and Ak+1 for some integer k where1 ≤ k < n

– First compute matrices A1..k and Ak+1..n ; then multiply them to get the final matrix A1..n

Page 47: CS 691 Computational Photography Instructor: Gianfranco Doretto Cutting Images

11-47

Dynamic Programming Approach …contd

– Key observation: parenthesizations of the subchains A1A2…Ak and Ak+1Ak+2…An must also be optimal if the parenthesization of the chain A1A2…An is optimal (why?)

– That is, the optimal solution to the problem contains within it the optimal solution to subproblems

Page 48: CS 691 Computational Photography Instructor: Gianfranco Doretto Cutting Images

11-48

Dynamic Programming Approach …contd

• Recursive definition of the value of an optimal solution

– Let m[i, j] be the minimum number of scalar multiplications necessary to compute Ai..j

– Minimum cost to compute A1..n is m[1, n]

– Suppose the optimal parenthesization of Ai..j splits the product between Ak and Ak+1 for some integer k where i ≤ k < j

Page 49: CS 691 Computational Photography Instructor: Gianfranco Doretto Cutting Images

11-49

Dynamic Programming Approach …contd

– Ai..j = (Ai Ai+1…Ak)·(Ak+1Ak+2…Aj)= Ai..k · Ak+1..j

– Cost of computing Ai..j = cost of computing Ai..k + cost of computing Ak+1..j + cost of multiplying Ai..k and Ak+1..j

– Cost of multiplying Ai..k and Ak+1..j is pi-

1pk pj

– m[i, j ] = m[i, k] + m[k+1, j ] + pi-1pk pj for i ≤ k < j

– m[i, i ] = 0 for i=1,2,…,n

Page 50: CS 691 Computational Photography Instructor: Gianfranco Doretto Cutting Images

11-50

Dynamic Programming Approach …contd

– But… optimal parenthesization occurs at one value of k among all possible i ≤ k < j

– Check all these and select the best one

m[i, j ] =0 if i=j

min {m[i, k] + m[k+1, j ] + pi-1pk pj } if i<ji ≤ k< j

Page 51: CS 691 Computational Photography Instructor: Gianfranco Doretto Cutting Images

11-51

Dynamic Programming Approach …contd

• To keep track of how to construct an optimal solution, we use a table s

• s[i, j ] = value of k at which Ai Ai+1 … Aj is split for optimal parenthesization

• Algorithm: next slide– First computes costs for chains of

length l=1– Then for chains of length l=2,3, … and

so on– Computes the optimal cost bottom-up

Page 52: CS 691 Computational Photography Instructor: Gianfranco Doretto Cutting Images

11-52

Algorithm to Compute Optimal Cost

Input: Array p[0…n] containing matrix dimensions and nResult: Minimum-cost table m and split table s

MATRIX-CHAIN-ORDER(p[ ], n)

for i ← 1 to nm[i, i] ← 0

for l ← 2 to nfor i ← 1 to n-l+1

j ← i+l-1m[i, j] ← ∞for k ← i to j-1

q ← m[i, k] + m[k+1, j] + p[i-1] p[k] p[j]if q < m[i, j]

m[i, j] ← qs[i, j] ← k

return m and s

Takes O(n3) time

Requires O(n2) space

Page 53: CS 691 Computational Photography Instructor: Gianfranco Doretto Cutting Images

11-53

Constructing Optimal Solution

• Our algorithm computes the minimum-cost table m and the split table s

• The optimal solution can be constructed from the split table s

– Each entry s[i, j ]=k shows where to split the product Ai Ai+1 … Aj for the minimum cost

Page 54: CS 691 Computational Photography Instructor: Gianfranco Doretto Cutting Images

11-54

Example

• Show how to multiply this matrix chain optimally

• Solution on the board– Minimum cost 15,125– Optimal

parenthesization ((A1(A2A3))((A4 A5)A6))

Matrix Dimension

A1 30×35

A2 35×15

A3 15×5

A4 5×10

A5 10×20

A6 20×25

Page 55: CS 691 Computational Photography Instructor: Gianfranco Doretto Cutting Images

min. error boundary

Minimal error boundaryoverlapping blocks vertical boundary

_ =2

overlap error

Page 58: CS 691 Computational Photography Instructor: Gianfranco Doretto Cutting Images

Summary of big ideas

• Treat image as a graph– Pixels are nodes– Between-pixel edge weights based on gradients– Sometimes per-pixel weights for affinity to

foreground/background

• Good boundaries are a short path through the graph (Intelligent Scissors, Seam Carving)

• Good regions are produced by a low-cost cut (GrabCuts, Graph Cut Stitching)

Page 59: CS 691 Computational Photography Instructor: Gianfranco Doretto Cutting Images

Slide Credits

• This set of sides also contains contributions kindly made available by the following authors– Alexei Efros– Carsten Rother– Derek Hoiem