View
216
Download
0
Tags:
Embed Size (px)
Citation preview
A neural approach to extract foreground from human movement imagesS.Conforto, M.Schmid, A.Neri, T.D’Alessio
Compute Method and Programs in Biomedicine 82(2006) 73-80
Che-Wei Sung 2007/12/24
Outline
Introduction Materials and methods
Subtraction techniques Neural approach
Qualitative evaluation of results Objective evaluation of results Conclusions
Introduction
The capture of human movement is a hot topic for surveillance, control and analysis.
In the framework of human movement analysis often consists of separating the moving subject (i.e. foreground) from the background by techniques based on temporal or spatial.
Introduction
Temporal data can be used in two different ways, subtraction and flow, while spatial techniques is applying markers on foreground.
Mixed approaches have been presented, but none can be considered as outperforming in general terms.
The work in this paper is to development of a markerless capture system for movement analysis application by making the ANN “learn” the background.
Materials and methods
The moving subject is detected by analyzing the differences between the background scene.
, corresponding to the background image
,represents the generic s-th image frame extracted from the video sequence gathering the moving subject over the background scene.
( )smI P
( )bI P
Materials and methods - Subtraction1. Compute the image difference
2. For each row of , calculate the vectors of mean value , and standard deviation
3. Determine the 3D-classification interval , if a pixel lies inside the
domain , it is classified as background,
vice versa as foreground.4. Detect the largest connected area that considered
as actual foreground.
( ) { ( ), ( ), ( )} ( ) ( )s R G B sdiff diff diff diff b mI P I P I P I P I P I P
( )sdiffI P
{ , , }R G Brow row row rowμ μ μ μ{ , , }R G B
row row row rowσ σ σ σ
{ , , }R R G G B Brow row row row row row μ σ μ σ μ σ
{ , ,R R R R R G G G G Grow row row row row row row rowP Pμ -σ < <μ +σ μ -σ < <μ +σ
}B B B B Brow row row rowPμ -σ < <μ +σ
Materials and methods - ANN
Neural network makes use of a Kohonen map, composed of (8×8) neurons
Materials and methods - ANN
In this work, background image is partitioned into blocks of (8×8) pixels, and arranged in a mono-dimensional vector composed of
H = (64×3) =192 components for training data.
Materials and methods - ANN
Assume the image is subdivided into B blocks of size (8×8), the training input vector Vb={b=1,2…B} and the size of each synaptic weight vector is randomly initialized in [0,1], where h=1,2…H
hWij
Materials and methods - Training1. One input vector Vb is randomly extracted from the
training set, and feeds the network.
2. In each neuron nij, the distance dij,b(k) between Vb
and (k) is calculated:
3. The best match neuron nBM(k) is defined as the nij whose corresponding vector (k) is at the minimum distance from Vb.
Wij
Wij
2,
1
( ) ( ( ) )H
h hij b ij b
h
d k W k V
Materials and methods - Training4. The weight vectors are updated by using typical Ko
honen neighborhood procedure.
where
( ) ( )( ( ))( 1) {
( )
h h hij ij b ijh
ij hij j
W k k V W kW k
W k n
λ ij BMn N
ij BMn N
ij BMn n
ij BMn N
ij BMn N
Materials and methods - Training
The training has been considered as complete when, for the 98% of training samples, the association between each Vb and the corresponding best match neuron is not altered
Materials and methods - Testing1. undergoes Data Shaping, creating a set of
vectors .
2. For each block, the best match neuron is identified by considering the minimum Euclidean distance criterion.
( )smI P
bS
2,
1
( )H
h hBM b BM b
h
d W S
Materials and methods - Testing3. is used to build up a distance matrix, whose el
ements are rearranged respecting the spatial of
, where each element occupies the position of block.
4. For each row of distance matrix, the mean value
and the standard deviation are calculated.
Blocks with corresponding distance values outside the range are considered as foreground.
5. A segmentation mask is built up by marking pixels with 0 for background, 1 for foreground.
,BM bd
( )smI P
rowμ rowσ
row rowμ σ
Qualitative evaluation of results The proposed algorithm have been applied to
analyze human body movement during three motor tasks: gait, pitching a ball and standing up from a chair.
The training of Kohonen’s map has met convergence after around 90000 presentations of background blocks.
Qualitative evaluation of results
Qualitative evaluation of results
Objective evaluation of results quality_indexs = 0.3shape_regs +
0.33temp_stabs + 0.37contrasts
shape_regs: the regularity of segmented object shape.
temp_stabs: the stability along the video sequence of extracted object.
contrasts: the contrast between the inside and the outside of the object evaluated along the border.
Objective evaluation of results
Conclusions
The work proposes a new unsupervised approach for foreground extraction in human movement images based on ANN and the presented results demonstrate it is suitable.