Eyes detection in compressed domain using classification

Eyes detection in compressed domain using Eyes detection in compressed domain using classificationclassification

Eng. Alexandru POPA

alexandru_popa@autenticmedia.com

Technical University of Cluj-Napoca

Faculty of Electronics, Telecommunications and Information Technology

Object detection in digital imagesThe principle of image processing in the compressed domainThe Discrete Cosine Transform (DCT)The spatial relationship of DCT coefficients between a block and its sub-blocks Object recognition using classificationThe linear discriminant classifier (LDA, Fisher classifier)DemoResultsConclusions

the approached method consists in feature extraction using image transformations, creation of a new space of features followed by objects classification in that space

feature extraction methods: DCT, Wavelet, Gabor

DCT gives in general good features for object description. Is the base of the JPEG standard, and the properties of the DCT coefficients blocks, makes them very good for generating features spaces

the idea is to make the classification of the objects direct in JPEG compressed domain

DCT = Discrete Cosine Transform

almost all image processing algorithms are defined in pixel level; rewriting them in the compressed domain is not directstandard implementation schemes decompress the image, apply the algorithm and them recompress the image. The disadvantage is that these schemes are time consumingit is wished to rewrite these algorithms directly in the compressed domain for optimizing the processing chain

E n tr o p yd ec o d e

D ec o m p r es s io n( R L E , Z ig - zag ,

Q u an tizer , D C T )

P ix e l lev e lp r o c es s in g

C o m p r es s ed d o m ainp r o c es s in g

C o m p r es s io n( R L E , Z ig - zag ,

Q u an tizer , D C T )R L E

v ec to r

E n tr o p yen c o d e

J P E Gb its tr eam

R L Ev ec to r

J P E Gb its tr eam

The formula for DCT applied on a image:

Properties:

Decorelation – the principal advantage of transformed images is the low redundancy between neighbours pixels. From this fact results uncorrelated coefficients which can be coded independently Energy compactness – the capacity of the transformation to pack the input datas in as few coefficients as possible Separability – the 2D DCT can be calculated in two steps by applying the 1D formula successively on the lines and the columns of an image

a new problem could occur from the fact that various DCT block sizes have to be used in order to ensure optimized performances8x8 blocks used in JPEG, 4x4 blocks used in image indexing, and 16x16 macro-blocks in MPEGto deal with inter-transfer of DCT coefficients from different blocks with various sizes, the existing approach would have to decompress the pixel data in the spatial domain via the IDCT, redivide the pixels into new blocks with the required size and then apply the DCT again to produce the DCT coefficientsit is obvious that the approach is inefficient

Bibliography: The Spatial Relationship of DCT Coefficients Between a Block and Its Sub-blocks, Jianmin Jiang and Guocan Feng

4x4 block

Transformation from 4 blocks of 2x2 pixels in one of 4x4 pixels:

106 97 83 85

106 95 84 85

105 84 74 69

77 60 57 89

The block with the pixelsluminance

DCT202 10 168 -1

1 -1 84 85

163 19 144 -13

26 2 -1 19

The DCT coefficients of4 block of 2x2 pixels

Matricea A*

339 23 22 -3

34 13 -13 0

-12 -16 8 -5

-1 13 -4 4

1 0 1 0

0.9239 0.3827 -0.9239 0.3827

0 1 0 -1

-0.3827 0.9239 0.3827 0.9239

Ecuation:

Original image

Transformation form a 4x4 block to 4 block of 2x2 pixels:

106 97 83 85

106 95 84 85

105 84 74 69

77 60 57 89

202 10 168 -1

1 -1 84 85

163 19 144 -13

26 2 -1 19

The inverse matrix of A*

339 23 22 -3

34 13 -13 0

-12 -16 8 -5

-1 13 -4 4

0.5 0.4619 0.5 -0.1913

0 0.1913 0 0.4619

0.5 -0.4619 0.5 0.1913

0 0.1913 0 0.4619

Ecuation :

The block with the pixelsluminance

The DCT coefficients ofThe 4x4 block

Original image 4x4 block

geometric classifiers are those classifiers which implies the deduction of some decision borders in the features spacea classifier demands a set of training datas (datas + labels)the number of datas must be big enough for a correct learning with generalization capacity for unknown datas

Data classification:

means that an unknown sample is presented to the classifier, his position regarding the decision boundaries is calculated and depending on it a label is associated

LDA (Linear Discriminant Analysis) using Fisher’s classifier implies finding a line in the features space and projecting the datas from the training set on this line. Describes the datas by their projectionsConsidering a bi-dimensional space we have:

Fisher’s criteria for selecting w and w0 parameters:

The optimal direction w is the line direction for which: 1) the distance between the projections of the classes centers on w is maximum2) the variance of the projections from each class is minimumThe optimum value w0 is the scalar value which minimize the classification error in the training data set

is the label assigned to the i data by the Fisher classifier

The image form which the training set was taken

it was proved that the implementation of Fisher`s classifier in compressed domain was a wise choice because it has good results in eyes regions detectionit`s a novelty in the image processing field because this algorithm wasn`t written in compressed domainusing the spatial relationship of DCT coefficients between a block and its sub-blocks facilitates the computation of coefficients for big blocks starting from small blocks in the way of speed and computation complexity

Others applications that can derive:

gaze tracking/focusing

automatic system for detecting the vigilance of driversbiometrics applications: person identification using iris recognition

, contează foarte mult structura acesteia precum şi setul de antrenare

Thank you for your Thank you for your attention!attention!Questions?Questions?

Eyes detection in compressed domain using classification

Documents

Learning in the Frequency DomainLearning in the frequency domain: Compressed repre-sentations in the frequency domain contain rich patterns for image understanding tasks. [14, 15,

Compressed-domain-based Transmission Distortion Modeling for Precoded H.264/AVC Video

Moving Object Detection in the HEVC Compressed · Moving Object Detection in the HEVC Compressed Domain for Ultra-High-Resolution Interactive Video Johan De Praeter, Jan Van de Vyver,

A Comparison of BWT Approaches to Compressed-Domain Pattern Matching

Fourier-Domain Beamforming: The Path to …webee.technion.ac.il/.../journals/157_Fourier-Domain.pdfchernyakova and eldar: fourier-domain beamforming: the path to compressed ultrasound

A CMOS approach to compressed-domain image acquisitionece-research.unm.edu/hayat/OE_2017.pdf · A CMOS approach to compressed-domain image acquisition Javad Ghasemi,1,* Manish Bhattarai,1

An Educational Domain with Google Internal email … Enochs... · An Educational Domain with Google Internal email system ... "NVC derives from various sources: eyes (eye contact

Discernible Compressed Images via Deep Perception Consistency · generating compressed images with perception consistency in the feature domain is urgently required. To address the

money must be acceptable, divisible, portable, and stable ...schoolwires.henry.k12.ga.us/cms/lib08/GA01000549/Centricity/Domain...Tea leaves compressed into “bricks” comprised

OPTIMIZED RATE ALLOCATION OF HYPERSPECTRAL IMAGES IN COMPRESSED DOMAIN USING JPEG2000 Part 2

Dual-Domain Compressed Beamforming for Medical Ultrasound

Survey of Compressed-Domain Features used in Audio- · PDF fileSurvey of Compressed-Domain Features used in Audio-Visual Indexing and Analysis ... Traditional TV broadcast is moving

Dual-camera snapshot spectral imaging with a pupil-domain ...amir1/PS/CS-SSI-dual.pdf · Dual-camera snapshot spectral imaging with a pupil-domain optical diffuser and compressed

A Hybrid, Dual Domain, Cascade of Convolutional Neural ...proceedings.mlr.press/v102/souza19a/souza19a.pdf · Keywords: Magnetic resonance imaging, image reconstruction, compressed

Lecture 12 - FAU · 5 Introduction to compressed sensing Compressed sensing theory tells us that this is possible, because the images are highly compressible in the wavelet domain

Compressed-Domain Techniques for Error-Resilient Video

INTERPRETABLE HUMAN ACTION RECOGNITION IN COMPRESSED DOMAINiphome.hhi.de/samek/pdf/SriICASSP17.pdf · Compressed domain human action recognition algorithms are ex-tremely efﬁcient,

Image processing in the compressed domain

Pressure reducers Compressed air filter units Compressed air … · 2017. 6. 30. · Compressed air filter units Compressed air regulators Lubricators Professional intelligence compressed

Methods of Video Object Segmentation in Compressed Domain