Upload
junior
View
24
Download
0
Embed Size (px)
DESCRIPTION
David Harwin Adviser: Petros Faloutsos. Background Removal. The State of the Art. This field has been one of great interest in the past decade There have been many recent papers on the topic with intended applications ranging from visual effects processing to ... - PowerPoint PPT Presentation
Citation preview
Background Removal
David HarwinAdviser: Petros Faloutsos
The State of the Art
This field has been one of great interest in the past decade
There have been many recent papers on the topic with intended applications ranging from visual effects processing to ...
In order to determine how to proceed, I have spent the quarter reading and analyzing existing papers.
Identifying motion
pixel differencing vs optical flow differencing vulnerable to signal noise, illumination
changes, and BG motion optical flow is computationally expensive
most opted for hybrid approach only Criminisi (2006) avoids optical flow
altogether, and instead uses spatio-temporal continuity of labels
Segmentation
2-pass approach
low-level then high-level (Calderara, Tsai, C alic) Calderara - 2nd pass checks for consistency of changes in color and intensity over
time. Objects with insufficiently coherent motion are ignored as BG
Tsai – 1st pass initial labelling (constantly vs never changing), 2nd only works on uncertain pixels
Calic - creates low-resolution representation, corrects BG for camera motion through matrix transformations, determines most ”salient” frames, then calculation of regions of interest occurs only on these keyframes
high-level then low-level (Park) Park - object tracking, followed by pixel differencing inside the object
window
Single pass Criminisi -
Key FeaturesPark
BG differencing
Runtime real-time
none
no no no
channels all colors unclear color, intensity, orientationdate of publication 2006 2006 (?) 2004 2005 2006
no no no
surveillance, distant objects
storage required
moving BG/ FG yes / yes yes / yes yes / yes yes / yes
Criminisi Calderara Tsai Calic
Feature detection method
motion/non-motion classifier
2D spatial scene graph, pixel difference-based
extract corner points with Gandalf library
multi-scale center-surround differences, as in human visual system
unknown, also performs other calculations
requires all data stored beforehand (not causal)
about ½ real-time without BG completion (assuming 24 fps)
real-time layout, keyframe extraction requires preprocessing
optical flow modeling/pixel velocity calculation
no optical flow, calculates spatio-temporal derivatives
determined to be foreground, compares spatio-temporal derivative to surroundings to find coherence
tracks “worms” of uncertain pixels between frames
moves tracking window around each frame to track single object
yes, used to determine camera work
required initial information/ user interaction
none, has “burn-in” period and learns BG features
none, bootstraps from “dirty” frames by classifying regions as stable or unstable (stable -> BG, unstable -> FG)
gets user input on pixels with the Uncertain label
user initializes tracking window
Has background model
color histogram for FG and BG averages (constant over time)
yes, constantly-updated BG model is median of past k pixel values and BG model at last iteration
segmentation discriminant
hidden Markov model – likelihood of motion, color (BG vs FG likelihoods), temporal and spatial continuity of labels
gray-level intensity difference of image compared to BG model, filter out low and high thresholds to get FG. Has object-validating second pass
Maximum a Posteriori (MAP) method
binary difference image from last frame if in tracking window of moving object inside a defined search window
is FG candidate if it has motion vector significantly different from camera-work induced transformations
yellow (spatio-temporal derivative), YUV color data
grayscale intensity, color and intensity
a priori knowledge/ assumptions
Gaussian BG color distribution, distinct BG/FG color difference, BG set with same tilt angle is static, narrowly defined content
intelligent cameraman following interesting motion
intended subject/ use
teleconferencing, small studios or offices
object movies (constant BG, single object in foreground rotated regularly)
film post-processing and VFX, user-directed removal of single object
viewing videos summaries on a small-screen device
FG/BG color histograms, past 2 frames + current frame
past k values for each pixel, BG model of size = size of image
2D spatial graph, per-pixel labeling,
all BG and FG features, 2 comparisons of object windows keyframes only
no / yes, but strictly controlled